Tag Content
SG ID
SG00020151 
UniProt Accession
Theoretical PI
5.44  
Molecular Weight
46003 Da  
Genbank Nucleotide ID
Genbank Protein ID
Gene Name
Serpina1a 
Gene Synonyms/Alias
Dom1, Spi1-1 
Protein Name
Alpha-1-antitrypsin 1-1 
Protein Synonyms/Alias
AAT Alpha-1 protease inhibitor 1; Alpha-1-antiproteinase; Serine protease inhibitor 1-1; Serine protease inhibitor A1a;Serpin A1aFlags: Precursor 
Organism
Mus musculus (Mouse) 
NCBI Taxonomy ID
10090 
Chromosome Location
chr:12;105091799-105104137;-1
View in Ensembl genome browser  
Function in Stage
Uncertain 
Function in Cell Type
Uncertain 
Probability (GAS) of Function in Spermatogenesis
0.17222676 
The probability was calculated by GAS algorithm, ranging from 0 to 1. The closer it is to 1, the more possibly it functions in spermatogenesis.
Description
Temporarily unavailable 
Abstract of related literatures
1. The murine alpha 1-protease inhibitors (alpha 1-PI) are encoded by a small gene family on chromosome 12. Studies of alpha 1-PI and other serine protease inhibitor genes have revealed an unusually high rate of mutation of the reactive centers of the inhibitors. Using a modification of the PCR technique, we have previously identified five distinct alpha 1 PI reactive site sequences present in the genome of C57BL/6 mice. In this report, we use cDNA cloning techniques to demonstrate that all five genes are expressed in the adult mouse liver. DNA sequence analysis shows that three of the five mRNAs expressed have a substitution for methionine-353, which is essential for normal activity of the homologous human protein, alpha 1-antitrypsin (alpha 1-AT). Comparison of the DNA sequences of the five cDNAs indicates a higher degree of polymorphism in the carboxyl-terminal half of the protein and an extraordinarily replacement/silent ratio of nucleotide changes in a narrow region surrounding the reactive site. The clustering of polymorphisms near the reactive site combined with the high replacement/silent ratio suggest an evolutionary mechanism that apparently selects for functional diversity of the alpha 1-PI genes. Finally, modeling of the three-dimensional positions of the alpha 1-PI polymorphic residues into the homologous positions of the crystallographic structure of ovalbumin, a member of the alpha 1-PI supergene family, predicts that many of these amino acids are on the surfaces, which are likely to interact with the protease targets. PMID: [1946354] 

2. alpha(1)-Proteinase inhibitor (alpha(1)-PI) is a member of the serpin superfamily of serine proteinase inhibitors, which function in maintaining homeostasis through regulation of numerous proteolytic processes. In laboratory mice (Mus musculus domesticus), alpha(1)-PI occurs in multiple isoforms encoded by a family of three to five genes that are polymorphic among inbred strains and that are located at the Serpina1 locus on chromosome 12. In the present study, we have characterized the alpha(1)-PI gene family of inbred mice in more detail. We show that mice express seven isoforms, all of which are encoded by genes that map to the Serpina1 locus. In addition, polymorphism at the locus is defined by three haplotypes (Serpina1(b), Serpina1(c), and Serpina1(l)) that differ with regard to both the number and identity of alpha(1)-PI genes. Finally, we present the complete sequence of an 84-kb region of Serpina1 containing a tandem repeat of two alpha(1)-PI genes. PMID: [12408969] 

3. This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development. PMID: [16141072] 

4. The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline. PMID: [15489334] 

5. We have characterized a genomic clone of the mouse equivalent of alpha1-antitrypsin (alpha1-antiprotease) a gene that is expressed in liver. There are at least three other genomic sequences similar to the gene expressed in mouse liver and all appear to be located on chromosome 12. The duplicated regions might include genes that are also expressed in the liver and/or in other tissues. Comparison of the sequences of the mouse alpha1-antitrypsin sequence expressed in liver with sequences of alpha1-antitrypsins from other mammals reveals great homology in the coding regions, particularly in the amino acids at the active site, including the neighboring methionine and serine thought to play a key role in the protease inhibition. PMID: [3007061] 

6. The Mus musculus alpha 1-protease inhibitor gene cluster encodes five highly related proteins. The most significant amino acid polymorphisms lie within the reactive-site loop which is important in determining serpin substrate specificity. All five genes are transcribed in M. musculus adult liver and presumably secreted into plasma. In an attempt to characterize their protein products all five cDNAs were expressed in recombinant mammalian cells and the protease inhibition activity of each determined. Only two of the proteins were efficient inhibitors of neutrophil elastase, the major physiological target of the sole human alpha 1-protease inhibitor (antitrypsin). Four of the proteins were active against chymotrypsin, while no substrate could be identified for the fifth. PMID: [8619829] 

7. Alpha(1)-proteinase inhibitor (alpha(1)-PI) is a member of the serpin superfamily of serine proteinase inhibitors that are involved in the regulation of a number of proteolytic processes. Alpha(1)-PI, like most serpins, functions by covalent binding to, and inhibition of, target proteinases. The interaction between alpha(1)-PI and its target is directed by the so-called reactive center loop (RCL), an approximately 20 residue domain that extends out from the body of the alpha(1)-PI polypeptide and determines the inhibitor's specificity. Mice express at least seven closely related alpha(1)-PI isoforms, encoded by a family of genes clustered at the Spi1 locus on chromosome 12. The amino acid sequence of the RCL region is hypervariable among alpha(1)-PIs, a phenomenon that has been attributed to high rates of evolution driven by positive Darwinian selection. This suggests that the various isoforms are functionally diverse. To test this notion, we have compared the proteinase specificities of individual alpha(1)-PIs from each of the two mouse species. As predicted from the positive Darwinian selection hypothesis, the various alpha(1)-PIs differ in their ability to form covalent complexes with serine proteinases, such as elastase, trypsin, chymotrypsin, and cathepsin G. In addition, they differ in their binding ability to proteinases in crude snake venoms. Importantly, the RCL region of the alpha(1)-PI polypeptide is the primary determinant of isoform-specific differences in proteinase recognition, indicating that hypervariability within this region drives the functional diversification of alpha(1)-PIs during evolution. The possible physiological benefits of alpha(1)-PI diversity are discussed. PMID: [11961105] 

8. The major human plasma protease inhibitors, alpha(1)-antitrypsin and alpha(1)-antichymotrypsin, are each encoded by a single gene, whereas in the mouse they are represented by clusters of 5 and 14 genes, respectively. Although there is a high degree of overall sequence similarity within these groupings, the reactive-center loop (RCL) domain, which determines target protease specificity, is markedly divergent. The literature dealing with members of these mouse serine protease inhibitor (serpin) clusters has been complicated by inconsistent nomenclature. Furthermore, some investigators, unaware of the complexity of the family, have failed to distinguish between closely related genes when measuring expression levels or functional activity. We have reviewed the literature dealing with the mouse equivalents of human alpha(1)-antitrypsin and alpha(1)-antichymotrypsin and made use of the recently completed mouse genome sequence to propose a systematic nomenclature. We have also examined the extended mouse clade "a" serpin cluster at chromosome 12F1 and compared it with the syntenic region at human chromosome 14q32. In summarizing the literature and suggesting a standardized nomenclature, we aim to provide a logical structure on which future research may be based. PMID: [12659817] 

9. A comprehensive understanding of the mouse plasma proteome is important for studies using mouse models to identify protein markers of human disease. To enhance our analysis of the mouse plasma proteome, we have developed a method for isolating low-abundance proteins using a cysteine-containing glycopeptide strategy. This method involves two orthogonal affinity capture steps. First, glycoproteins are coupled to an azlactone copolymer gel using hydrazide chemistry and cysteine residues are then biotinylated. After trypsinization and extensive washing, tethered N-glycosylated tryptic peptides are released from the gel using PNGase F. Biotinylated cysteinyl-containing glycopeptides are then affinity selected using a monomeric avidin gel and analyzed by LC-MS/MS. We have applied the method to a proteome analysis of mouse plasma. In two independent analyses using 200 muL each of C57BL mouse plasma, 51 proteins were detected. Only 42 proteins were seen when the same plasma sample was analyzed by glycopeptides only. A total of 104 N-glycosylation sites were identified. Of these, 17 sites have hitherto not been annotated in the Swiss-Prot database whereas 48 were considered probable, potential, or by similarity - i.e., based on little or no experimental evidence. We show that analysis by cysteine-containing glycopeptides allows detection of low-abundance proteins such as the epidermal growth factor receptor, the Vitamin K-dependent protein Z, the hepatocyte growth factor activator, and the lymphatic endothelium-specific hyaluronan receptor as these proteins were not detected in the glycopeptide control analysis. PMID: [17330941] 

Back to Top
Function
Inhibitor of serine proteases. Its primary target iselastase, but it also has a moderate affinity for plasmin andthrombin. 
Back to Top
Subcellular Location
Secreted. 
Tissue Specificity
 
Gene Ontology
GO IDGO termEvidence
GO:0005576 C:extracellular region IDA:MGI.
GO:0004867 F:serine-type endopeptidase inhibitor activity IBA:RefGenome.
GO:0006953 P:acute-phase response IEA:UniProtKB-KW.
GO:0006487 P:protein N-linked glycosylation IDA:MGI.
GO:0030162 P:regulation of proteolysis IBA:RefGenome.
GO:0034097 P:response to cytokine stimulus IDA:MGI.
GO:0043434 P:response to peptide hormone stimulus IDA:MGI.
Back to Top
Interpro
IPR023795;    Protease_inhib_I4_serpin_CS.
IPR023796;    Sepin_dom.
IPR000215;    Serpin_fam.
Back to Top
Pfam
PF00079;    Serpin;    1.
Back to Top
SMART
SM00093;    SERPIN;    1.
Back to Top
PROSITE
PS00284;    SERPIN;    1.
Back to Top
PRINTS
Created Date
18-Oct-2012 
Record Type
GAS predicted 
Sequence Annotation
SIGNAL        1     24       By similarity.
CHAIN        25    413       Alpha-1-antitrypsin 1-1.
                             /FTId=PRO_0000032388.
REGION      368    387       RCL.
SITE        377    378       Reactive bond (By similarity).
CARBOHYD     64     64       N-linked (GlcNAc...).
CARBOHYD    101    101       N-linked (GlcNAc...).
CARBOHYD    265    265       N-linked (GlcNAc...) (Potential).
CONFLICT    246    246       H -> D (in Ref. 2; AAM47488, 4; AAH11040/
                             AAH37007/AAH49970/AAH57982/AAH57984/
                             AAH57989 and 5; AAA51624).
CONFLICT    323    323       P -> L (in Ref. 5; AAA51624).
CONFLICT    404    404       L -> V (in Ref. 2; AAM47488, 4; AAH11040/
                             AAH37007/AAH49970/AAH57982/AAH57984/
                             AAH57989 and 5; AAA51624).
Back to Top
Nucleotide Sequence
Length: 1366 bp   Go to nucleotide: FASTA
Protein Sequence
Length: 413 bp   Go to amino acid: FASTA
The verified Protein-Protein interaction information
UniProt
Gene Symbol Ref Databases
Other Protein-Protein interaction resources
String database  
View Microarray data
Comments