dbacp05467
General Description
Peptide name : Peptidoglycan recognition protein 1
Source/Organism : Mouse
Linear/Cyclic : Not found
Chirality : Not found
Sequence Information
Sequence : MLFACALLALLGLATSCSFIVPRSEWRALPSECSSRLGHPVRYVVISHTAGSFCNSPDSCEQQARNVQHYHKNELGWCDVAYNFLIGEDGHVYEGRGWNIKGDHTGPIWNPMSIGITFMGNFMDRVPAKRALRAALNLLECGVSRGFLRSNYEVKGHRDVQSTLSPGDQLYQVIQSWEHYRE
Peptide length: 182
C-terminal modification: Not found
N-terminal modification : Not found
Non-natural peptide information: None
Activity Information
Assay type : Cytotoxicity assay
Assay time : 24h
Activity : Not found
Cell line : VMR-0
Cancer type : Mammary adenocarcinoma
Other activity : Anti-bacterial activity
Physicochemical Properties
Amino acid composition bar chart :
Molecular mass : 20489.0209 Dalton
Aliphatic index : 0.792
Instability index : 58.3346
Hydrophobicity (GRAVY) : -0.258
Isoelectric point : 7.071
Charge (pH 7) : 0.1585
Aromaticity : 0.104
Molar extinction coefficient (cysteine, cystine): (37930, 38305)
Hydrophobic/hydrophilic ratio : 1.11627907
hydrophobic moment : -0.078
Missing amino acid : None
Most occurring amino acid : L
Most occurring amino acid frequency : 17
Least occurring amino acid : M
Least occurring amino acid frequency : 4
Structural Information
3D structure :
Secondary structure fraction (Helix, Turn, Sheet): (0.2, 0.3, 0.3)
SMILES Notation: CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)CNC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1c[nH]cn1)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@H](CS)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)O
Secondary Structure :
| Method | Prediction |
|---|---|
| GOR | HHHHHHHHHHETCEEEEEEECCTTTHTTCTTHTTTTTTCCEEEEEEEEETTCECCCTTCHHHHHHHHHHETTTTTTEHHHHHHHEETTTTEEETTTTEEETTCCCCCCCCCTCEEEEEECCHHHHCHHHHHHHHHHTHHHTTEETTEETTTTTETEEEEEEEECCCCCCEEEEEEHHHHHHH |
| Chou-Fasman (CF) | HHHHHHHHHHHEEEEEEEEEECHHHHHCCCCCCCCCCCEEEEEEEECCCCCCCCCCCHHHHHHHEEECCCHHHHCCCCEECEEECCCCEEEECCCCEECCCCCCEEEECCEEEEEEECCCCCCCCHHHHHHHHHHHHHHEEEECEEECCCCCCCCCCEEEEEECCCCCEEEEEEHHHHHCCC |
| Neural Network (NN) | HHHHHHHHHHHHHCCCCCCECCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHCCCCCCCCCCCCECCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCC |
| Joint/Consensus | HHHHHHHHHHHCCEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCCEEEEEEHHHHCCC |
Molecular Descriptors and ADMET Properties
Molecular Descriptors: Not available.
ADMET Properties: Not available.
Cross Referencing databases
Pubmed Id : 9707603 9660837 12649138 16141072 15489334 14585845 21183079
Uniprot : Click here
PDB : Not available
CancerPPD : Not available
ApIAPDB : Not available
CancerPPD2 ID : Not available
Reference
1 : Carninci P, et al. The transcriptional landscape of the mammalian genome. Science. 2005; 309:1559-63. doi: 10.1126/science.1112014
2 : Dziarski R, et al. Defect in neutrophil killing and increased susceptibility to infection with nonpathogenic gram-positive bacteria in peptidoglycan recognition protein-S (PGRP-S)-deficient mice. Blood. 2003; 102:689-97. doi: 10.1182/blood-2002-12-3853
3 : Huttlin EL, et al. A tissue-specific atlas of mouse protein phosphorylation and expression. Cell. 2010; 143:1174-89. doi: 10.1016/j.cell.2010.12.001
4 : Gerhard DS, et al. The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). Genome Res. 2004; 14:2121-7. doi: 10.1101/gr.2596504
5 : Sashchenko LP, et al. Peptidoglycan recognition protein tag7 forms a cytotoxic complex with heat shock protein 70 in solution and in lymphocytes. J Biol Chem. 2004; 279:2117-24. doi: 10.1074/jbc.M307513200
6 : Kang D, et al. A peptidoglycan recognition protein in innate immunity conserved from insects to humans. Proc Natl Acad Sci U S A. 1998; 95:10078-82. doi: 10.1073/pnas.95.17.10078
7 : Kiselev SL, et al. Molecular cloning and characterization of the mouse tag7 gene encoding a novel cytokine. J Biol Chem. 1998; 273:18633-9. doi: 10.1074/jbc.273.29.18633
Literature
Paper title : The transcriptional landscape of the mammalian genome.
Doi : https://doi.org/10.1126/science.1112014
Abstract : This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
Paper title : Defect in neutrophil killing and increased susceptibility to infection with nonpathogenic gram-positive bacteria in peptidoglycan recognition protein-S (PGRP-S)-deficient mice.
Doi : https://doi.org/10.1182/blood-2002-12-3853
Abstract : Insect peptidoglycan recognition protein-S (PGRP-S), a member of a family of innate immunity pattern recognition molecules conserved from insects to mammals, recognizes bacterial cell wall peptidoglycan and activates 2 antimicrobial defense systems, prophenoloxidase cascade and antimicrobial peptides through Toll receptor. We show that mouse PGRP-S is present in neutrophil tertiary granules and that PGRP-S-deficient (PGRP-S-/-) mice have increased susceptibility to intraperitoneal infection with gram-positive bacteria of low pathogenicity but not with more pathogenic gram-positive or gram-negative bacteria. PGRP-S-/- mice have normal inflammatory responses and production of tumor necrosis factor alpha (TNF-alpha) and interleukin 6 (IL-6). Neutrophils from PGRP-S-/- mice have normal phagocytic uptake of bacteria but are defective in intracellular killing and digestion of relatively nonpathogenic gram-positive bacteria. Therefore, mammalian PGRP-S functions in intracellular killing of bacteria. Thus, only bacterial recognition by PGRP-S, but not its effector function, is conserved from insects to mammals.
Paper title : A tissue-specific atlas of mouse protein phosphorylation and expression.
Doi : https://doi.org/10.1016/j.cell.2010.12.001
Abstract : Although most tissues in an organism are genetically identical, the biochemistry of each is optimized to fulfill its unique physiological roles, with important consequences for human health and disease. Each tissue's unique physiology requires tightly regulated gene and protein expression coordinated by specialized, phosphorylation-dependent intracellular signaling. To better understand the role of phosphorylation in maintenance of physiological differences among tissues, we performed proteomic and phosphoproteomic characterizations of nine mouse tissues. We identified 12,039 proteins, including 6296 phosphoproteins harboring nearly 36,000 phosphorylation sites. Comparing protein abundances and phosphorylation levels revealed specialized, interconnected phosphorylation networks within each tissue while suggesting that many proteins are regulated by phosphorylation independently of their expression. Our data suggest that the "typical" phosphoprotein is widely expressed yet displays variable, often tissue-specific phosphorylation that tunes protein activity to the specific needs of each tissue. We offer this dataset as an online resource for the biological research community.
Paper title : The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).
Doi : https://doi.org/10.1101/gr.2596504
Abstract : The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline.
Paper title : Peptidoglycan recognition protein tag7 forms a cytotoxic complex with heat shock protein 70 in solution and in lymphocytes.
Doi : https://doi.org/10.1074/jbc.M307513200
Abstract : The peptidoglycan recognition protein Tag7 is shown to form a stable 1:1 complex with the major stress protein Hsp70. Neither protein is cytotoxic by itself, but their complex induces apoptotic death in several tumor-derived cell lines even at subnanomolar concentrations. The minimal part of Hsp70 needed to evoke cytotoxicity is residues 450-463 of its peptide-binding domain, but full cytotoxicity requires its ATPase activity; remarkably, Tag7 liberated from the complex at high ATP is not cytotoxic. The Tag7-Hsp70 complex is produced by tag7-transfected cells and by lymphokine-activated killers, being assembled within the cell and released into the medium through the Golgi apparatus by a mechanism different from the commonly known granule exocytosis. Thus, we demonstrate how a heat shock protein may perform functions clearly distinct from chaperoning or cell rescue and how peptidoglycan recognition proteins may be involved in innate immunity and anti-cancer defense.
Paper title : A peptidoglycan recognition protein in innate immunity conserved from insects to humans.
Doi : https://doi.org/10.1073/pnas.95.17.10078
Abstract : Innate nonself recognition must rely on common structures of invading microbes. In a differential display screen for up-regulated immune genes in the moth Trichoplusia ni we have found mechanisms for recognition of bacterial cell wall fragments. One bacteria-induced gene encodes a protein that, after expression in the baculovirus system, was shown to be a peptidoglycan recognition protein (PGRP). It binds strongly to Gram-positive bacteria. We have also cloned the corresponding cDNA from mouse and human and shown this gene to be expressed in a variety of organs, notably organs of the immune system-i.e., bone marrow and spleen. In addition, purified recombinant murine PGRP was shown to possess peptidoglycan affinity. From our results and the sequence homology, we conclude that PGRP is a ubiquitous protein involved in innate immunity, conserved from insects to humans.
Paper title : Molecular cloning and characterization of the mouse tag7 gene encoding a novel cytokine.
Doi : https://doi.org/10.1074/jbc.273.29.18633
Abstract : Cloning of the mouse tag7 gene encoding a novel cytokine is described. The Tag7 protein consists of 182 amino acids. Genomic organization of the tag7 gene and its promoter region remind those of the genes of the tumor necrosis factor locus, although the tag7 gene is not linked to this locus. The gene is located on chromosome 7 at the area that corresponds to band 7A3, which has genetic linkage with lupus-like disease in mouse models. tag7 transcription is essential for lymphoid organs. It is also detected in certain areas of lungs, brain, and intestine and in some tumors. Tag7 protein is detectable in both cell-associated and soluble forms. The soluble form of Tag7 triggers apoptosis in mouse L929 cells in vitro and does not involve NF-kappaB activation. The relationship between Tag7 and tumor necrosis factor family of ligands is discussed.