Tag Content
UniProt Accession
Theoretical PI
Molecular Weight
19696 Da  
Genbank Nucleotide ID
Genbank Protein ID
Gene Name
Gene Synonyms/Alias
Protein Name
Protein Synonyms/Alias
Mus musculus (Mouse) 
NCBI Taxonomy ID
Chromosome Location
View in Ensembl genome browser  
Function in Stage
Function in Cell Type
Probability (GAS) of Function in Spermatogenesis
The probability was calculated by GAS algorithm, ranging from 0 to 1. The closer it is to 1, the more possibly it functions in spermatogenesis.
Temporarily unavailable 
Abstract of related literatures
1. Caltractin, a contractile Ca(2+)-binding protein, has not been found in any species other than algae. We now report the first isolation and sequencing of cDNA encoding mouse caltractin from a testis cDNA library by immunoscreening. Mouse caltractin has a molecular mass of 20 kDa, which is the same as that from algae. PMID: [8218402] 

2. Centrin is a centrosome component in species from yeast to humans. Here, the mouse centrin 1 gene (Cetn1) is analyzed with respect to its genomic structure, chromosome localization, tissue-specific expression, and phylogenetic relationship to the other mouse centrin genes and their human orthologs. Cetn1 is an intronless gene located on chromosome 18A2 that encodes a 172-amino-acid protein with a predicted molecular mass of 19,696 Da (pI 4.61) and all of the structural features common to centrin. Cetn1 possesses the sequence features of an expressed retroposon: the gene lacks introns, the open reading frame is not interrupted by stop codons, and the coding region is flanked by a pair of direct repeats. Reverse transcriptase-polymerase chain reaction and Northern blot analysis demonstrate that Cetn1 expression is limited exclusively to the testis in adult male mice. Cetn1 expression is first seen in the neonatal testis at 14 days postpartum, reaching adult levels by day 17. These observations provide new insight into the regulation, function, and evolutionary history of centrin in higher eukaryotes. PMID: [10486202] 

3. This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development. PMID: [16141072] 

4. The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline. PMID: [15489334] 

5. A variety of developmental disorders have been associated with ciliary defects, yet the controls that govern cilia disassembly are largely unknown. Here we report a mouse embryonic node gene, which we named Pitchfork (Pifo). Pifo associates with ciliary targeting complexes and accumulates at the basal body during cilia disassembly. Haploinsufficiency causes a unique node cilia duplication phenotype, left-right asymmetry defects, and heart failure. This phenotype is likely relevant in humans, because we identified a heterozygous R80K PIFO mutation in a fetus with situs inversus and cystic liver and kidneys, and in patient with double-outflow right ventricle. We show that PIFO, but not R80K PIFO, is sufficient to activate Aurora A, a protooncogenic kinase that induces cilia retraction, and that Pifo/PIFO mutation causes cilia retraction, basal body liberation, and overreplication defects. Thus, the observation of a disassembly phenotype in vivo provides an entry point to understand and categorize ciliary disease. AUTHOR AUDIO: PMID: [20643351] 

Back to Top
Plays a fundamental role in microtubule-organizingcenter structure and function. 
Back to Top
Subcellular Location
Cytoplasm, cytoskeleton, centrosome.Note=Centrosome of interphase and mitotic cells. 
Tissue Specificity
Expressed exclusively in testis. Localizes tothe caudal portion of spermatozoa in seminiferous tubule andepididymis. 
Gene Ontology
GO IDGO termEvidence
GO:0005814 C:centriole IEA:Compara.
GO:0005813 C:centrosome IDA:MGI.
GO:0032391 C:photoreceptor connecting cilium IDA:MGI.
GO:0000922 C:spindle pole IEA:Compara.
GO:0005524 F:ATP binding IEA:InterPro.
GO:0008026 F:ATP-dependent helicase activity IEA:InterPro.
GO:0005509 F:calcium ion binding IEA:InterPro.
GO:0003676 F:nucleic acid binding IEA:InterPro.
GO:0051301 P:cell division IEA:UniProtKB-KW.
GO:0034605 P:cellular response to heat IDA:MGI.
GO:0007067 P:mitosis IEA:UniProtKB-KW.
Back to Top
IPR011992;    EF-hand-like_dom.
IPR018247;    EF_Hand_1_Ca_BS.
IPR018249;    EF_HAND_2.
IPR002048;    EF_hand_Ca-bd.
IPR000629;    RNA-helicase_DEAD-box_CS.
Back to Top
SM00054;    EFh;    4.
Back to Top
PS00018;    EF_HAND_1;    2.
PS50222;    EF_HAND_2;    4.
Back to Top
Created Date
Record Type
GAS predicted 
Sequence Annotation
CHAIN         1    172       Centrin-1.
DOMAIN       28     63       EF-hand 1.
DOMAIN       64     99       EF-hand 2.
DOMAIN      101    136       EF-hand 3.
DOMAIN      137    172       EF-hand 4.
CA_BIND      41     52       1 (Probable).
CA_BIND     150    161       2 (Probable).
CONFLICT     95     95       Q -> R (in Ref. 3; BAB24213).
CONFLICT    145    145       M -> K (in Ref. 3; BAB24798).
Back to Top
Nucleotide Sequence
Length: 754 bp   Go to nucleotide: FASTA
Protein Sequence
Length: 172 bp   Go to amino acid: FASTA
The verified Protein-Protein interaction information
Gene Symbol Ref Databases
Other Protein-Protein interaction resources
String database  
View Microarray data