Tag Content
SG ID
SG00020043 
UniProt Accession
Theoretical PI
10.9  
Molecular Weight
14095 Da  
Genbank Nucleotide ID
Genbank Protein ID
Gene Name
Hist2h2aa2 
Gene Synonyms/Alias
 
Protein Name
Histone H2A type 2-A 
Protein Synonyms/Alias
H2a-614; H2a-615; Histone H2A.2; 
Organism
Mus musculus (Mouse) 
NCBI Taxonomy ID
10090 
Chromosome Location
chr:3;96043639-96044216;-1
View in Ensembl genome browser  
Function in Stage
Uncertain 
Function in Cell Type
Uncertain 
Probability (GAS) of Function in Spermatogenesis
0.596426691 
The probability was calculated by GAS algorithm, ranging from 0 to 1. The closer it is to 1, the more possibly it functions in spermatogenesis.
Description
Temporarily unavailable 
Abstract of related literatures

2. The histone gene cluster on mouse chromosome 3 has been isolated as a series of overlapping P1 clones, covering 110-120 kb, by probing with the histone H3-614 gene that had been mapped previously to mouse chromosome 3. There are genes for 10 core histone proteins present in a 55-kb cluster within this contig. There are three histone H3 genes, two of which are identical; four histone H2a genes, two of which are identical, one histone H4 gene; and two histone H2b genes. These histone H3 and H2a genes encode approximately 40% of the total H3 and H2a mRNA, whereas the histone H4 and histone H2b genes encode < 10% of the total H4 and H2b mRNA. There are no histone H1 genes present in this cluster. All of the histone H2a genes encode histone H2a.2 proteins (or variants of H2a.2), and account for all the H2a.2 genes in the mouse genome. All three histone H3 genes encode the histone H3.2 protein. A 21-kb region containing the adjacent H3-614 and H2a-614 genes has been duplicated and is present in an inverted repeat separated by 4.5 kb. The other two H2a genes are adjacent, with the 3' ends of their mRNAs separated by only 49 nucleotides in the DNA and the U7 snRNP binding sites separated by only 20 nucleotides. One of the histone H2b genes has lost the stem-loop sequence characteristic of the replication-dependent histone mRNAs and encodes only polyadenylated mRNAs. PMID: [8858345] 

3. The multigene family encoding the five classes of replication-dependent histones has been identified from the human and mouse genome sequence. The large cluster of histone genes, HIST1, on human chromosome 6 (6p21-p22) contains 55 histone genes, and Hist1 on mouse chromosome 13 contains 51 histone genes. There are two smaller clusters on human chromosome 1: HIST2 (at 1q21), which contains six genes, and HIST3 (at 1q42), which contains three histone genes. Orthologous Hist2 and Hist3 clusters are present on mouse chromosomes 3 and 11, respectively. The organization of the human and mouse histone genes in the HIST1 cluster is essentially identical. All of the histone H1 genes are in HIST1, which is spread over about 2 Mb. There are two large gaps (>250 kb each) within this cluster where there are no histone genes, but many other genes. Each of the histone genes encodes an mRNA that ends in a stemloop followed by a purine-rich region that is complementary to the 5' end of U7 snRNA. In addition to the histone genes on these clusters, only two other genes containing the stem-loop sequence were identified, a histone H4 gene on human chromosome 12 (mouse chromosome 6) and the previously described H2a.X gene located on human chromosome 11. Each of the 14 histone H4 genes encodes the same protein, and there are only three histone H3 proteins encoded by the 12 histone H3 genes in each species. In contrast, both the mouse and human H2a and H2b proteins consist of at least 10 non-allelic variants, making the complexity of the histone protein complement significantly greater than previously thought. PMID: [12408966] 

4. This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development. PMID: [16141072] 

5. The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline. PMID: [15489334] 

6. Each variant of histone 2A from mouse L1210 cells separates into at lest three bands on acid-urea gels, an unmodified band b0, and modified bands b1, b2, and, in the case of H2A.Z, b3. By analyzing tryptic peptides from these proteins on 50% acrylamide peptide gels, one can quantitate the fraction of H2A modified in various ways. With this methodology, the b1 form of H2A.1 was found to be a mixture of molecules, two-thirds of which were acetylated at lysine5 and one-third of which were phosphorylated at serine1. The b2 band of H2A.1 contained molecules with phosphorylated serine1, acetylated lysine5, and one or two other sites of modification not found in b1 H2A.1 b2 H2A.1 was not composed solely of molecules each with both a phosphorylated serine1 and an acetylated lysine5. The heteromorphous H2A variant H2A.X can be phosphorylated and acetylated. The phosphorylation site was the same as in H2A.1, but the acetylation site was slightly different. The heteromorphous variant H2A.Z was different from the other H2As in that it was not phosphorylated, and it did not contain the NH2-terminal peptide, which is the phosphorylation site in the .X, .1, and .2 variants. Its acetylation pattern was also different from that of H2A.1 in that two sites, neither of which was found in H2A.1, seem to be involved at the first level of modification. PMID: [7217105] 

7. In many higher organisms, 5%-15% of histone H2A is ubiquitylated at lysine 119 (uH2A). The function of this modification and the factors involved in its establishment, however, are unknown. Here we demonstrate that uH2A occurs on the inactive X chromosome in female mammals and that this correlates with recruitment of Polycomb group (PcG) proteins belonging to Polycomb repressor complex 1 (PRC1). Based on our observations, we tested the role of the PRC1 protein Ring1B and its closely related homolog Ring1A in H2A ubiquitylation. Analysis of Ring1B null embryonic stem (ES) cells revealed extensive depletion of global uH2A levels. On the inactive X chromosome, uH2A was maintained in Ring1A or Ring1B null cells, but not in double knockout cells, demonstrating an overlapping function for these proteins in development. These observations link H2A ubiquitylation, X inactivation, and PRC1 PcG function, suggesting an unanticipated and novel mechanism for chromatin-mediated heritable gene silencing. PMID: [15525528] 

8. Histone modifications are thought to serve as epigenetic markers that mediate dynamic changes in chromatin structure and regulation of gene expression. As a model system for understanding epigenetic silencing, X chromosome inactivation has been previously linked to a number of histone modifications including methylation and hypoacetylation. In this study, we provide evidence that supports H2A ubiquitination as a novel epigenetic marker for the inactive X chromosome (Xi) and links H2A ubiquitination to initiation of X inactivation. We found that the H2A-K119 ubiquitin E3 ligase Ring1b, a Polycomb group protein, is enriched on Xi in female trophoblast stem (TS) cells as well as differentiating embryonic stem (ES) cells. Consistent with Ring1b mediating H2A ubiquitination, ubiquitinated H2A (ubH2A) is also enriched on the Xi of both TS and ES cells. We demonstrate that the enrichment of Ring1b and ubH2A on Xi is transient during TS and ES cell differentiation, suggesting that the Ring1b and ubH2A are involved in the initiation of both imprinted and random X inactivation. Furthermore, we showed that the association of Ring1b and ubH2A with Xi is mitotically stable in non-differentiated TS cells. PMID: [15509584] 

9. Blimp1, a transcriptional repressor, has a crucial role in the specification of primordial germ cells (PGCs) in mice at embryonic day 7.5 (E7.5). This SET-PR domain protein can form complexes with various chromatin modifiers in a context-dependent manner. Here, we show that Blimp1 has a novel interaction with Prmt5, an arginine-specific histone methyltransferase, which mediates symmetrical dimethylation of arginine 3 on histone H2A and/or H4 tails (H2A/H4R3me2s). Prmt5 has been shown to associate with Tudor, a component of germ plasm in Drosophila melanogaster. Blimp1-Prmt5 colocalization results in high levels of H2A/H4 R3 methylation in PGCs at E8.5. However, at E11.5, Blimp1-Prmt5 translocates from the nucleus to the cytoplasm, resulting in the loss of H2A/H4 R3 methylation at the time of extensive epigenetic reprogramming of germ cells. Subsequently, Dhx38, a putative target of the Blimp1-Prmt5 complex, is upregulated. Interestingly, expression of Dhx38 is also seen in pluripotent embryonic germ cells that are derived from PGCs when Blimp1 expression is lost. Our study demonstrates that Blimp1 is involved in a novel transcriptional regulatory complex in the mouse germ-cell lineage. PMID: [16699504] 

Back to Top
Function
Core component of nucleosome. Nucleosomes wrap andcompact DNA into chromatin, limiting DNA accessibility to thecellular machineries which require DNA as a template. Histonesthereby play a central role in transcription regulation, DNArepair, DNA replication and chromosomal stability. DNAaccessibility is regulated via a complex set of post-translationalmodifications of histones, also called histone code, andnucleosome remodeling. 
Back to Top
Subcellular Location
Nucleus. Chromosome. 
Tissue Specificity
 
Gene Ontology
GO IDGO termEvidence
GO:0000786 C:nucleosome IEA:UniProtKB-KW.
GO:0005634 C:nucleus IEA:UniProtKB-SubCell.
GO:0003677 F:DNA binding NAS:UniProtKB.
GO:0006334 P:nucleosome assembly NAS:UniProtKB.
Back to Top
Interpro
IPR009072;    Histone-fold.
IPR007125;    Histone_core_D.
IPR002119;    Histone_H2A.
Back to Top
Pfam
PF00125;    Histone;    1.
Back to Top
SMART
SM00414;    H2A;    1.
Back to Top
PROSITE
PS00046;    HISTONE_H2A;    1.
Back to Top
PRINTS
PR00620;    HISTONEH2A.;   
Back to Top
Created Date
18-Oct-2012 
Record Type
GAS predicted 
Sequence Annotation
INIT_MET      1      1       Removed (By similarity).
CHAIN         2    130       Histone H2A type 2-A.
                             /FTId=PRO_0000055233.
MOD_RES       2      2       N-acetylserine.
MOD_RES       2      2       Phosphoserine; by RPS6KA5.
MOD_RES       4      4       Citrulline; alternate (By similarity).
MOD_RES       4      4       Symmetric dimethylarginine; by PRMT5;
                             alternate (Probable).
MOD_RES       6      6       N6-acetyllysine.
MOD_RES     121    121       Phosphothreonine (By similarity).
CROSSLNK     14     14       Glycyl lysine isopeptide (Lys-Gly)
                             (interchain with G-Cter in ubiquitin) (By
                             similarity).
CROSSLNK     16     16       Glycyl lysine isopeptide (Lys-Gly)
                             (interchain with G-Cter in ubiquitin) (By
                             similarity).
CROSSLNK    120    120       Glycyl lysine isopeptide (Lys-Gly)
                             (interchain with G-Cter in ubiquitin).
Back to Top
Nucleotide Sequence
Length: 3098 bp   Go to nucleotide: FASTA
Protein Sequence
Length: 130 bp   Go to amino acid: FASTA
The verified Protein-Protein interaction information
UniProt
Gene Symbol Ref Databases
Other Protein-Protein interaction resources
String database  
View Microarray data
Comments