Tag Content
SG ID
SG00002341 
UniProt Accession
Theoretical PI
5.15  
Molecular Weight
108489 Da  
Genbank Nucleotide ID
Genbank Protein ID
Gene Name
Col6a1 
Gene Synonyms/Alias
 
Protein Name
Collagen alpha-1(VI) chain 
Protein Synonyms/Alias
Flags: Precursor 
Organism
Mus musculus (Mouse) 
NCBI Taxonomy ID
10090 
Chromosome Location
chr:10;76171537-76188913;-1
View in Ensembl genome browser  
Function in Stage
Uncertain 
Function in Cell Type
Uncertain 
Probability (GAS) of Function in Spermatogenesis
0.14397895 
The probability was calculated by GAS algorithm, ranging from 0 to 1. The closer it is to 1, the more possibly it functions in spermatogenesis.
Description
Temporarily unavailable 
Abstract of related literatures
1. The entire primary structure of the murine alpha 1(VI) collagen chain was deduced from cloned cDNA. The predicted polypeptide consists of 1025 amino acids and shows extensive homology with the corresponding human and chicken chains. A genomic clone isolated with a cDNA probe was found to contain about 13 kilobases of the 5'-flanking region and the first and second exon, coding for the 5'-untranslated sequence and signal peptide and part of the N-terminal portion of the mature protein, respectively. Polymerase chain reaction and primer extension analyses revealed two major and several minor transcription start sites distributed over 76 base pairs (bp). The region just upstream of the transcription initiation sites lacks canonical TATA and CAAT boxes and Sp1 binding sites, but contains putative binding sites for other transcription factors and a 90-bp polypyrimidine tract with elements of dyad symmetry. Chimeric constructs were derived from different fragments of the 5'-flanking genomic region and the chloramphenicol acetyltransferase (CAT) gene and expression of the reporter gene was assayed following transfection of various cell types. A construct containing sequences extending from -215 to +41 directed high levels of CAT expression. The data indicate that this region harbours a functional promoter. PMID: [8326912] 

2. cDNA clones encoding the alpha 1, alpha 2 and alpha 3 chains of mouse collagen VI have been isolated by screening cDNA libraries with the corresponding human probes. The composite cDNAs for the alpha 1, alpha 2, and alpha 3 chains are 2.5, 1.6 and 2.9 kb in size respectively. The alpha 1 and alpha 2 cDNAs encode the C-terminal portions of the chains as well as the entire 3'-untranslated regions, while the alpha 3 cDNAs encode a central segment of 959 amino acids flanking the triple-helical domain. The deduced amino acid sequences share 86-88% identity with the human counterparts and 67-73% identity with the chicken equivalents. Alignment of the deduced amino acid sequences of mouse, human and chicken collagens reveal that the key features of the protein, including the cysteine residues, imperfections in the Gly-Xaa-Xaa regions, Arg-Gly-Asp sequences and potential N-glycosylation sites, are mostly conserved. PMID: [8489506] 

Back to Top
Function
Collagen VI acts as a cell-binding protein. 
Back to Top
Subcellular Location
Secreted, extracellular space, extracellularmatrix (By similarity). 
Tissue Specificity
 
Gene Ontology
GO IDGO termEvidence
GO:0005581 C:collagen IEA:UniProtKB-KW.
GO:0031012 C:extracellular matrix IDA:MGI.
GO:0005576 C:extracellular region ISO:MGI.
GO:0043234 C:protein complex ISO:MGI.
GO:0042383 C:sarcolemma IDA:MGI.
GO:0007155 P:cell adhesion IEA:UniProtKB-KW.
GO:0071230 P:cellular response to amino acid stimulus IDA:MGI.
GO:0070208 P:protein heterotrimerization ISO:MGI.
Back to Top
Interpro
IPR008160;    Collagen.
IPR002035;    VWF_A.
Back to Top
Pfam
PF01391;    Collagen;    6.
PF00092;    VWA;    3.
Back to Top
SMART
SM00327;    VWA;    3.
Back to Top
PROSITE
PS50234;    VWFA;    3.
Back to Top
PRINTS
Created Date
18-Oct-2012 
Record Type
GAS predicted 
Sequence Annotation
SIGNAL        1     19
CHAIN        20   1025       Collagen alpha-1(VI) chain.
                             /FTId=PRO_0000005759.
DOMAIN       36    234       VWFA 1.
DOMAIN      614    802       VWFA 2.
DOMAIN      826   1018       VWFA 3.
REGION       20    255       N-terminal globular domain.
REGION      256    591       Triple-helical region.
REGION      592   1025       C-terminal globular domain.
MOTIF       261    263       Cell attachment site.
MOTIF       441    443       Cell attachment site.
MOTIF       477    479       Cell attachment site.
CARBOHYD    211    211       N-linked (GlcNAc...) (Potential).
CARBOHYD    515    515       N-linked (GlcNAc...) (Potential).
CARBOHYD    536    536       N-linked (GlcNAc...) (Potential).
CARBOHYD    801    801       N-linked (GlcNAc...) (Potential).
CARBOHYD    893    893       N-linked (GlcNAc...) (Potential).
CONFLICT    674    675       DM -> TL (in Ref. 2; CAA79152).
CONFLICT    709    709       T -> A (in Ref. 2; CAA79152).
CONFLICT    943    943       Missing (in Ref. 2; CAA79152).
CONFLICT    960    960       Q -> R (in Ref. 2; CAA79152).
Back to Top
Nucleotide Sequence
Length: 3991 bp   Go to nucleotide: FASTA
Protein Sequence
Length: 1025 bp   Go to amino acid: FASTA
The verified Protein-Protein interaction information
UniProt
Gene Symbol Ref Databases
Other Protein-Protein interaction resources
String database  
View Microarray data
Comments