Tag Content
SG ID
SG00019642 
UniProt Accession
Theoretical PI
8.02  
Molecular Weight
44550 Da  
Genbank Nucleotide ID
Genbank Protein ID
Gene Name
Pgk1 
Gene Synonyms/Alias
Pgk-1 
Protein Name
Phosphoglycerate kinase 1 
Protein Synonyms/Alias
EC=2.7.2.3 
Organism
Mus musculus (Mouse) 
NCBI Taxonomy ID
10090 
Chromosome Location
chr:X;103382439-103399038;1
View in Ensembl genome browser  
Function in Stage
Uncertain 
Function in Cell Type
Uncertain 
Probability (GAS) of Function in Spermatogenesis
0.143235625 
The probability was calculated by GAS algorithm, ranging from 0 to 1. The closer it is to 1, the more possibly it functions in spermatogenesis.
Description
Temporarily unavailable 
Abstract of related literatures
1. A clone containing cDNA for X chromosome-linked phosphoglycerate kinase (PGK-1) was isolated from a mouse myeloma cDNA library. The nucleotide (nt) sequence of the cDNA has been determined, and the amino acid (aa) sequence of the enzyme thereby deduced. At the nt level, the coding region of mouse PGK cDNA has 93% homology with human X-linked cDNA and 60% homology with the yeast gene. Mouse PGK-1 protein contains 416 aa and is 98%, 96% and 64% homologous with human, horse, and yeast enzyme sequences, respectively. PMID: [3542714] 

2. This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development. PMID: [16141072] 

3. The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non-protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID: [19468303] 

4. The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline. PMID: [15489334] 

5. We report the cloning of the mouse pgk-1 gene encoding the somatic cell isoform of the enzyme phosphoglycerate kinase. The gene is contained within a 16-kb region of the X chromosome and is interrupted by at least ten introns. The promoter region of the pgk-1 gene is rich in G and C nucleotides and contains five copies of the hexadeoxynucleotide, GGGCGG, the potential binding site for the Sp 1 transcription factor, a CCAAT sequence, but no TATA box. This promoter functions following DNA-mediated transfection into mammalian cells. The promoter of the mouse pgk-1 gene is homologous to the human pgk-1 promoter. A number of conserved motifs in the promoter may indicate a significant role for these sequences in expression of the pgk-1 gene. PMID: [3440520] 

6. During mammalian spermatogenesis the isozyme pattern of a glycolytic enzyme, phosphoglycerate kinase (PGK; ATP: 3-phospho-D-glycerate 1-phosphotransferase, EC 2.7.2.3), changes from the somatic-type PGK-1 to the testis-specific PGK-2, and this change has been suggested to involve transcription switch. We have isolated genomic DNA fragments which code for the mouse PGK isozymes and determined the transcription start site of each gene. The results demonstrate that transcriptions of the two PGK genes are initiated at multiple sites under the control of TATA box-lacking promoters. The putative promoter regions of the two genes contain several distinct sequences known as the CCAAT box and the GC box which possibly bind CCAAT-binding proteins and Sp1, respectively. We next developed a culture system in which spermatogenic gene expression is partly reproduced. When spermatogenic cells of 20-day-old rats were cultured, transcripts from PGK-2 and another spermatogenic gene PRPS3 became detectable, while expression of other non-spermatogenic genes did not significantly change during culture. These results suggest that two spermatogenic genes PGK-2 and PRPS3 were activated in culture according to a developmental program of spermatogenesis. Thus, this culture system may be useful for studying the molecular mechanism underlying mammalian spermatogenic gene expression. PMID: [2166582] 

7. The mouse X-linked Pgk-1 gene encodes phosphoglycerate kinase. When transfected into human cells, the Pgk-1b allele causes the appearance of mouse PGK-1b enzyme activity. We describe here cloning of mouse Pgk-1a, an allele of Pgk-1 which encodes an enzyme, PGK-1a, with distinct electrophoretic mobility. We constructed recombinants between the DNA encoding Pgk-1b and Pgk-1a and transfected these constructs into human cells to assess the electrophoretic characteristics of each recombinant. In this way the charge variation between the two proteins was localized to exons 4 or 5. Sequencing of these exons revealed a single base-pair difference between the two alleles at codon 155, which predicts the amino acids lysine and threonine in PGK-1b and PGK-1a, respectively. A number of other DNA sequence polymorphisms exist between Pgk-1b and Pgk-1a including part of an L1 repeated element unique to Pgk-1a. PMID: [1975492] 

8. The primary structures of six phosphoglycerate kinases (PGKs) are known: three from mammals, one from yeast, and two from trypanosomes. Comparison of the amino acid sequence of these enzymes reveals 154 invariant positions out of 392 positions in the aligned sequences. Most of the conserved positions fall into the twelve beta-sheets and adjacent peptide regions that form the inner loops surrounding the ATP and 3-phosphoglycerate-binding cleft. The homology between mammalian and yeast PGKs is greater than 94% for the inner-loop region, even though the overall homology is less than 65%. Trypanosome PGK has only 44% overall homology with the mammalian enzyme, but shows 74% homology in the inner-loop region. Trypanosome PGK contains a polypeptide segment in its N-terminal domain that is transposed in comparison with the other species. PMID: [3525226] 

9. While phosphorylation and O-GlcNAc (cytoplasmic and nuclear glycosylation) are linked to normal and pathological changes in cell states, these post-translational modifications have been difficult to analyze in proteomic studies. We describe advances in beta-elimination / Michael addition-based approaches which allow for mass spectrometry-based identification and comparative quantification of O-phosphate or O-GlcNAc-modified peptides, as well as cysteine-containing peptides for expression analysis. The method (BEMAD) involves differential isotopic labeling through Michael addition with normal dithiothreitol (DTT) (d0) or deuterated DTT (d6), and enrichment of these peptides by thiol chromatography. BEMAD was comparable to isotope-coded affinity tags (ICAT; a commercially available differential isotopic quantification technique) in protein expression analysis, but also provided the identity and relative amounts of both O-phosphorylation and O-GlcNAc modification sites. Specificity of O-phosphate vs. O-GlcNAc mapping is achieved through coupling enzymatic dephosphorylation or O-GlcNAc hydrolysis with differential isotopic labeling. Blocking of cysteine labeling by prior oxidation of a cytosolic lysate from mouse brain allowed specific targeting of serine / threonine post-translational modifications as demonstrated through identification of 21 phosphorylation sites (5 previously reported) in a single mass spectrometry analysis. These results demonstate BEMAD is suitable for large-scale quantitative analysis of both protein expression and serine / threonine post-translational modifications. PMID: [15648052] 

10. Metazoans employ reversible tyrosine phosphorylation to regulate innumerable biological processes. Thus, the large-scale identification of tyrosine phosphorylation sites from primary tissues is an essential step toward a molecular systems understanding of dynamic regulation in vivo. The relative paucity of phosphotyrosine has greatly limited its identification in large-scale phosphoproteomic experiments. However, using antiphosphotyrosine peptide immunoprecipitations, we report the largest study to date of tyrosine phosphorylation sites from primary tissue, identifying 414 unique tyrosine phosphorylation sites from murine brain. To measure the conservation of phosphorylated tyrosines and their surrounding residues, we constructed a computational pipeline and identified patterns of conservation within the signature of phosphotyrosine. PMID: [18034455] 

11. Activity-dependent protein phosphorylation is a highly dynamic yet tightly regulated process essential for cellular signaling. Although recognized as critical for neuronal functions, the extent and stoichiometry of phosphorylation in brain cells remain undetermined. In this study, we resolved activity-dependent changes in phosphorylation stoichiometry at specific sites in distinct subcellular compartments of brain cells. Following highly sensitive phosphopeptide enrichment using immobilized metal affinity chromatography and mass spectrometry, we isolated and identified 974 unique phosphorylation sites on 499 proteins, many of which are novel. To further explore the significance of specific phosphorylation sites, we used isobaric peptide labels and determined the absolute quantity of both phosphorylated and non-phosphorylated peptides of candidate phosphoproteins and estimated phosphorylation stoichiometry. The analyses of phosphorylation dynamics using differentially stimulated synaptic terminal preparations revealed activity-dependent changes in phosphorylation stoichiometry of target proteins. Using this method, we were able to differentiate between distinct isoforms of Ca2+/calmodulin-dependent protein kinase (CaMKII) and identify a novel activity-regulated phosphorylation site on the glutamate receptor subunit GluR1. Together these data illustrate that mass spectrometry-based methods can be used to determine activity-dependent changes in phosphorylation stoichiometry on candidate phosphopeptides following large scale phosphoproteome analysis of brain tissue. PMID: [17114649] 

12. Kinases play a prominent role in tumor development, pointing to the presence of specific phosphorylation patterns in tumor tissues. Here, we investigate whether recently developed high resolution mass spectrometric (MS) methods for proteome and phosphoproteome analysis can also be applied to solid tumors. As tumor model, we used TG3 mutant mice carrying skin melanomas. At total of 100 microg of solid tumor lysate yielded a melanoma proteome of 4443 identified proteins, including at least 88 putative melanoma markers previously found by cDNA microarray technology. Analysis of 2 mg of lysate from dissected melanoma with titansphere chromatography and 8 mg with strong cation exchange together resulted in the identification of more than 5600 phosphorylation sites on 2250 proteins. The phosphoproteome included many hits from pathways important in melanoma. One-month storage at -80 degrees C did not significantly decrease the number of identified phosphorylation sites. Thus, solid tumor can be analyzed by MS-based proteomics with similar efficiency as cell culture models and in amounts compatible with biopsies. PMID: [19367708] 

13. We used on-line electron capture dissociation (ECD) for the large scale identification and localization of sites of phosphorylation. Each FT-ICR ECD event was paired with a linear ion trap collision-induced dissociation (CID) event, allowing a direct comparison of the relative merits of ECD and CID for phosphopeptide identification and site localization. Linear ion trap CID was shown to be most efficient for phosphopeptide identification, whereas FT-ICR ECD was superior for localization of sites of phosphorylation. The combination of confident CID and ECD identification and confident CID and ECD localization is particularly valuable in cases where a phosphopeptide is identified just once within a phosphoproteomics experiment. PMID: [19131326] 

Back to Top
Function
 
Back to Top
Subcellular Location
Cytoplasm (By similarity). 
Tissue Specificity
 
Gene Ontology
GO IDGO termEvidence
GO:0005737 C:cytoplasm IEA:UniProtKB-SubCell.
GO:0005524 F:ATP binding ISS:UniProtKB.
GO:0004618 F:phosphoglycerate kinase activity ISS:UniProtKB.
GO:0006096 P:glycolysis IEA:UniProtKB-KW.
Back to Top
Interpro
IPR001576;    Phosphoglycerate_kinase.
IPR015901;    Phosphoglycerate_kinase_C.
IPR015911;    Phosphoglycerate_kinase_CS.
IPR015824;    Phosphoglycerate_kinase_N.
Back to Top
Pfam
PF00162;    PGK;    1.
Back to Top
SMART
PROSITE
PS00111;    PGLYCERATE_KINASE;    1.
Back to Top
PRINTS
PR00477;    PHGLYCKINASE.;   
Back to Top
Created Date
18-Oct-2012 
Record Type
GAS predicted 
Sequence Annotation
INIT_MET      1      1       Removed (By similarity).
CHAIN         2    417       Phosphoglycerate kinase 1.
                             /FTId=PRO_0000145835.
NP_BIND     373    376       ATP (By similarity).
REGION       24     26       Substrate binding (By similarity).
REGION       63     66       Substrate binding (By similarity).
BINDING      39     39       Substrate (By similarity).
BINDING     123    123       Substrate (By similarity).
BINDING     171    171       Substrate (By similarity).
BINDING     220    220       ATP (By similarity).
BINDING     313    313       ATP; via carbonyl oxygen (By similarity).
BINDING     344    344       ATP (By similarity).
MOD_RES      11     11       N6-acetyllysine (By similarity).
MOD_RES      30     30       N6-acetyllysine (By similarity).
MOD_RES      48     48       N6-acetyllysine (By similarity).
MOD_RES      75     75       N6-acetyllysine (By similarity).
MOD_RES      76     76       Phosphotyrosine.
MOD_RES      86     86       N6-acetyllysine (By similarity).
MOD_RES      97     97       N6-acetyllysine (By similarity).
MOD_RES     131    131       N6-acetyllysine; alternate (By
                             similarity).
MOD_RES     131    131       N6-malonyllysine; alternate (By
                             similarity).
MOD_RES     136    136       Phosphoserine (By similarity).
MOD_RES     146    146       N6-acetyllysine (By similarity).
MOD_RES     196    196       Phosphotyrosine.
MOD_RES     199    199       N6-acetyllysine (By similarity).
MOD_RES     203    203       Phosphoserine.
MOD_RES     243    243       Phosphothreonine (By similarity).
MOD_RES     267    267       N6-acetyllysine (By similarity).
MOD_RES     291    291       N6-acetyllysine (By similarity).
MOD_RES     323    323       N6-acetyllysine (By similarity).
MOD_RES     390    390       Phosphoserine.
MOD_RES     406    406       N6-acetyllysine (By similarity).
CONFLICT     56     56       K -> N (in Ref. 1; AAA70267).
CONFLICT    184    184       K -> R (in Ref. 2; BAE26693).
CONFLICT    265    265       I -> V (in Ref. 2; BAE37790).
Back to Top
Nucleotide Sequence
Length: 1653 bp   Go to nucleotide: FASTA
Protein Sequence
Length: 417 bp   Go to amino acid: FASTA
The verified Protein-Protein interaction information
UniProt
Gene Symbol Ref Databases
Dlg4IntAct 
HSPH1BioGRID 
Kcnma1IntAct 
Other Protein-Protein interaction resources
String database  
View Microarray data
Comments