Tag Content
SG ID
SG00004413 
UniProt Accession
Theoretical PI
8.42  
Molecular Weight
21059 Da  
Genbank Nucleotide ID
Genbank Protein ID
Gene Name
Cd164 
Gene Synonyms/Alias
 
Protein Name
Sialomucin core protein 24 
Protein Synonyms/Alias
MUC-24 Endolyn; Multi-glycosylated core protein 24;MGC-24MGC-24v CD_antigen=CD164;Flags: Precursor 
Organism
Mus musculus (Mouse) 
NCBI Taxonomy ID
10090 
Chromosome Location
chr:10;41239306-41250848;1
View in Ensembl genome browser  
Function in Stage
Uncertain 
Function in Cell Type
Uncertain 
Probability (GAS) of Function in Spermatogenesis
0.573312808 
The probability was calculated by GAS algorithm, ranging from 0 to 1. The closer it is to 1, the more possibly it functions in spermatogenesis.
Description
Temporarily unavailable 
Abstract of related literatures
1. MGC-24 is a sialomucin originally found in human gastric carcinoma cells, and in human hematopoietic progenitor cells. In the human, soluble and transmembrane forms of MGC-24 are present, and the transmembrane form has been implicated in adhesion of hematopoietic progenitor cells to marrow stroma cells. In the mouse, we found that only the transmembrane form was expressed in many organs. Northern blotting and in situ hybridization analysis showed that MGC-24 mRNA was widely expressed in various adult and embryonic tissues. The mouse MGC-24 gene, which we isolated, spanned about 12 kb and was comprised of six exons. The transmembrane domain and the cytoplasmic domain were encoded by a single exon; the finding agrees with the absence of an alternatively spliced product of mouse MGC-24. The minimal promoter of mouse MGC-24 was embedded in GC-rich sequences, in which two Sp1 binding motifs were found, but it lacked TATA and CAAT boxes. That the promoter resembles that of house-keeping genes is consistent with the broad expression of mouse MGC-24 mRNA. PMID: [10491205] 

2. Functional analyses have indicated that the human CD164 sialomucin may play a key role in hematopoiesis by facilitating the adhesion of human CD34(+) cells to the stroma and by negatively regulating CD34(+)CD38(lo/-) cell proliferation. We have identified three novel human CD164 variants derived by alternative splicing of bona fide exons from a single genomic transcription unit. The predominant CD164(E1-6) isoform, encoded by six exons, is a type I transmembrane protein containing two extracellular mucin domains (I and II) interrupted by a cysteine-rich non-mucin domain. The 103B2/9E10 and 105A5 epitopes, which specify ligand binding characteristics, are located on the exon 1-encoded mucin domain I. Three human CD164(E1-6) mRNA species, exhibiting differential polyadenylation site usage, are differentially expressed in hematopoietic and non-hematopoietic tissues. This study provides additional evidence that human CD164(E1-6) represents the ortholog of murine MGC-24v and rat endolyn. Comparative analysis of murine MGC-24v/CD164(E1-6) with human CD164(E1-6) revealed two potential splice variants and a similar genomic structure. Whereas the human CD164 gene is located on chromosome 6q21, the mouse gene occurs in a syntenic region on chromosome 10B1-B2. By confocal microscopy, human CD164 in CD34(+)CD38(+) hematopoietic progenitor (KG1B) and epithelial cell lines appears to be localized primarily in endosomes and lysosomes, with low concentrations at the cell surface. However, in a minority of KG1B cells, CD164 is more prominently expressed at the plasma membrane and in the recycling endosomes, suggesting that its distribution is regulated in cells of hematopoietic origin. PMID: [11027692] 

3. This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development. PMID: [16141072] 

4. The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline. PMID: [15489334] 

5. Myoblast fusion is fundamental to the development and regeneration of skeletal muscle. To fuse, myoblasts undergo cell-cell recognition and adhesion and merger of membranes between apposing cells. Cell migration must occur in advance of these events to bring myoblasts into proximity, but the factors that regulate myoblast motility are not fully understood. CD164 is a cell surface sialomucin that is targeted to endosomes and lysosomes via its intracellular region. In hematopoietic progenitor cells, CD164 forms complexes with the motility-stimulating chemokine receptor, CXCR4, in response to the CXCR4 ligand, CXCL12/SDF-1 (Forde, S., Tye, B. J., Newey, S. E., Roubelakis, M., Smythe, J., McGuckin, C. P., Pettengell, R., and Watt, S. M. (2007) Blood 109, 1825-1833). We have previously shown that CD164 stimulates myotube formation in vitro. We report here that CD164 is associated with CXCR4 in C2C12 myoblasts. Cells in which CD164 levels are increased or decreased via overexpression or RNA interference-mediated knockdown, respectively, show enhanced or reduced myotube formation and cell migration, the latter both basally and in response to CXCL12/SDF-1. Furthermore, expression of CD164 cytoplasmic tail mutants that alter the endosome/lysosome targeting sequence and, consequently, the subcellular localization in myoblasts, reveals a similar correlation between cell motility and myotube formation. Finally, Cd164 mRNA is expressed in the dorsal somite (the early myogenic compartment of the mouse embryo) and in premuscle masses. Taken together, these results suggest that CD164 is a regulator of myoblast motility and that this property contributes to its ability to promote myoblast fusion into myotubes. PMID: [18227060] 

Back to Top
Function
Sialomucin that may play a key role in hematopoiesis.May be involved in cell adhesion (By similarity). Promotesmyogenesis by enhancing CXCR4-dependent cell motility. Positivelyregulates myoblast migration and promotes myoblast fusion intomyotubes. 
Back to Top
Subcellular Location
Lysosome membrane; Single-pass type Imembrane protein (Potential). Endosome membrane; Single-pass typeI membrane protein (By similarity). Cell membrane; Single-passtype I membrane protein (By similarity). 
Tissue Specificity
Expressed at high levels in the submaxillarygland and kidney, at moderate levels in the brain, heart, lung,liver, intestine, testis, muscle and bone marrow, and at lowlevels in the pancreas, spleen and thymus. 
Gene Ontology
GO IDGO termEvidence
GO:0010008 C:endosome membrane IEA:UniProtKB-SubCell.
GO:0016021 C:integral to membrane IEA:UniProtKB-KW.
GO:0005765 C:lysosomal membrane IEA:UniProtKB-SubCell.
GO:0005886 C:plasma membrane ISO:MGI.
GO:0007157 P:heterophilic cell-cell adhesion ISO:MGI.
GO:0007517 P:muscle organ development IEA:UniProtKB-KW.
Back to Top
Interpro
IPR007947;    CD164_MGC24.
Back to Top
Pfam
PF05283;    MGC-24;    1.
Back to Top
SMART
PROSITE
PRINTS
PR01701;    CD164ANTIGEN.;   
Back to Top
Created Date
18-Oct-2012 
Record Type
GAS predicted 
Sequence Annotation
SIGNAL        1     23       Potential.
CHAIN        24    197       Sialomucin core protein 24.
                             /FTId=PRO_0000383341.
TOPO_DOM     24    162       Extracellular (Potential).
TRANSMEM    163    183       Helical; (Potential).
TOPO_DOM    184    197       Cytoplasmic (Potential).
REGION      191    197       Required for endosomal and lysosomal
                             localization (By similarity).
COMPBIAS     28    152       Thr-rich.
CARBOHYD     26     26       N-linked (GlcNAc...) (Potential).
CARBOHYD     33     33       N-linked (GlcNAc...) (Potential).
CARBOHYD     69     69       N-linked (GlcNAc...) (Potential).
CARBOHYD     75     75       N-linked (GlcNAc...) (Potential).
CARBOHYD     87     87       N-linked (GlcNAc...) (Potential).
CARBOHYD     98     98       N-linked (GlcNAc...) (Potential).
CARBOHYD    103    103       N-linked (GlcNAc...) (Potential).
CARBOHYD    120    120       N-linked (GlcNAc...) (Potential).
CARBOHYD    146    146       N-linked (GlcNAc...) (Potential).
MUTAGEN     193    193       N->G: Increased targeting to the
                             endosomal/lysosomal compartment and
                             reduced myotube formation.
MUTAGEN     194    194       Y->A: Ineffcient targeting to the
                             endosomal/lysosomal compartment and
                             enhanced myotube formation.
CONFLICT      2      2       S -> L (in Ref. 4; BAE26251).
CONFLICT     24     25       QP -> HA (in Ref. 3; BAA34547).
Back to Top
Nucleotide Sequence
Length: 2841 bp   Go to nucleotide: FASTA
Protein Sequence
Length: 197 bp   Go to amino acid: FASTA
The verified Protein-Protein interaction information
UniProt
Gene Symbol Ref Databases
ARRB1MINT 
Arrb1IntAct 
Other Protein-Protein interaction resources
String database  
View Microarray data
Comments