dbacp02591
General Description
Peptide name : CTP synthase 1
Source/Organism : Not found
Linear/Cyclic : Not found
Chirality : Not found
Sequence Information
Sequence : MKYILVTGGVISGIGKGIIASSVGTILKSCGLHVTSIKIDPYINIDAGTFSPYEHGEVFVLDDGGEVDLDLGNYERFLDIRLTKDNNLTTGKIYQYVINKERKGDYLGKTVQVVPHITDAIQEWVMRQALIPVDEDGLEPQVCVIELGGTVGDIESMPFIEAFRQFQFKVKRENFCNIHVSLVPQPSSTGEQKTKPTQNSVRELRGLGLSPDLVVCRCSNPLDTSVKEKISMFCHVEPEQVICVHDVSSIYRVPLLLEEQGVVDYFLRRLDLPIERQPRKMLMKWKEMADRYDRLLETCSIALVGKYTKFSDSYASVIKALEHSALAINHKLEIKYIDSADLEPITSQEEPVRYHEAWQKLCSAHGVLVPGGFGVRGTEGKIQAIAWARNQKKPFLGVCLGMQLAVVEFSRNVLGWQDANSTEFDPTTSHPVVVDMPEHNPGQMGGTMRLGKRRTLFQTKNSVMRKLYGDADYLEERHRHRFEVNPVWKKCLEEQGLKFVGQDVEGERMEIVELEDHPFFVGVQYHPEFLSRPIKPSPPYFGLLLASVGRLSHYLQKGCRLSPRDTYSDRSGSSSPDSEITELKFPSINHD
Peptide length: 591
C-terminal modification: Not found
N-terminal modification : Not found
Non-natural peptide information: None
Activity Information
Assay type : Antibody-based assay
Assay time : 48h
Activity : Not found
Cell line : U937
Cancer type : Not specified
Other activity : Not found
Physicochemical Properties
Amino acid composition bar chart :
Molecular mass : 66689.577 Dalton
Aliphatic index : 0.884
Instability index : 40.8254
Hydrophobicity (GRAVY) : -0.306
Isoelectric point : 6.0232
Charge (pH 7) : -8.9378
Aromaticity : 0.082
Molar extinction coefficient (cysteine, cystine): (62800, 63550)
Hydrophobic/hydrophilic ratio : 1.00338983
hydrophobic moment : -0.029
Missing amino acid : None
Most occurring amino acid : L
Most occurring amino acid frequency : 54
Least occurring amino acid : W
Least occurring amino acid frequency : 6
Structural Information
3D structure : Not Available
Secondary structure fraction (Helix, Turn, Sheet): (0.2, 0.2, 0.3)
SMILES Notation: CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCSC)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CC(=O)O)C(=O)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC
Secondary Structure :
| Method | Prediction |
|---|---|
| GOR | HEEEEEECEEEEEEEEEEEEEEECEEEEETTEEEEEEECCCCEECCTTCCCCTTTTHEEEETTTCCEEETTTCHHHHHHHHHTTTTTCCTCEEEEEEHHHHTTTCCTTEEEEEEEEEEHHHHHHHHHHHCCCCCCTTCCTHEEEEETTCCECCCTTCHHHHHHHHHHHHHHHTTHHEEEEEECCCCCCCCCEECCCCCTTHEEEEETTCCCTEEEEETTCCCCCHHHHHHHEEHHCCTTHEEEEEETTEEEECTEHHHTTTEHHHHHHTTTCHHHHHHHHHHHHHHHHHHHHHTTHHHHEHHHHTCCEEECCTHHHHHHHHHHHHHHHHHHHHHHEECTTTCCCCCCTTCHHHHHHHHHHHHHTTTEEEETCEEEECCTTHHHHHHHHHHTTCCCEEEEHTHHHHHHHHHEEEEECCTTTTCEECCCCCCCEEEECCTCCTTCCTCEEHHTHHHHEEHHTTTHHEEETTCTHHHHHHHHHHHHCCCHHHHHHHHHHHHEETCCHHHHHHHHHHHHHCHHEEECTCCCTTEECTCCCCCCCEEEEEEEEEEEEEEETTTCETCTTCCEEEETTCCCCTTTHHHHHCTTCCCT |
| Chou-Fasman (CF) | EEEEEEEEEEEEEEEEEEEEEEEEEECCCCEEEEEEECEEEEECCCEEEECCCCCEEEECCCCCHHHHHCCCCCCCCCCCCCCCCCEEEEEEEEEEEHHHHCCEEEEEEEEEEEEEEEHHHHHEEHHHHEEHHHHHCCCEEEEECCEEEEEHHHHCCHHHHHCCCCCHHHHHHCEEEEEEEECCCCCCCCCCCCCCCEEHHHHHCEECCCEEEEEECCCCCEEEHHHHHEECCHHHHHEEEEEEEEEEEEEEEHHHHHHEEEEEEHHHHHCCCCCCCHHHHHHHHHHHHCCHHHHHCEEEEEEEEEEECCCCCEEEEHHHHHHHHHHHHHHHHEEEHHHHHHEEEHHHHCEEHHHHHHHHHHCCEEEEECCEEEECCCCCCCHHHHHHHHHHHCEEEEHHHHHCCEECCEEEEECCCCCCCCCCCEECEEEEECCCCCCCCCCEEEECCCCCEEEEECCCEECCCEECHHHHHHHHHHHHHHCEEEHHHHHHHHHHEEEEEHHHHHHHHHCHHHHCCEEEEEECCCCCCCCCCCCCCCEEECCCEEEECCEEECCCCEECCCCEEECCCCCCCCCCCCCCCCCCCCCCCCC |
| Neural Network (NN) | CEEEEEECCCEECCCCEEEEECCCEEEEECCCCCCCCCCCCCCECCCCCCCCCCCCCEEEECCCCCCHCCCCCCCHHHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCCHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCCHCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCEEEECCCCCCCCCHCCHHHEECCCCCCCEEEEECCCCCCCEEHHHHCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCEECCCCCHHHHCCCCHHEECCCCCHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCECCCCCCCCCCCCCCCCCCCCEEEHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCC |
| Joint/Consensus | CEEEEEEEEEEEEEEEEEEEEEEEEEEEECCEEEEEECCCCCEECCCCCCCCCCCCCEEEECCCCCCCCCCCCCCHHHHHHCCCCCCCCCCEEEEEEHHHHCCCCCCCEEEEEEEEEEHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCHHHHCCCCCCCCEEEECCCCCCCCHHHHHHHEECCCCCCCEEEEEECCEEEEECHHHHCCCCHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEECCCEEECCCCCCHHHHHHHHHCCCCCEEEHHHHHHHHHHHHEEEECCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCEECCCCCCCEECCCCCCCEEECCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCEECCCHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCEEEEEEEECCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC |
Molecular Descriptors and ADMET Properties
Molecular Descriptors: Not available.
ADMET Properties: Not available.
Cross Referencing databases
Reference
1 : Burkard TR, et al. Initial characterization of the human central proteome. BMC Syst Biol. 2011; 5:17. doi: 10.1186/1752-0509-5-17
2 : Mayya V, et al. Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions. Sci Signal. 2009; 2:ra46. doi: 10.1126/scisignal.2000007
3 : Gregory SG, et al. The DNA sequence and biological annotation of human chromosome 1. Nature. 2006; 441:315-21. doi: 10.1038/nature04727
4 : Rigbolt KT, et al. System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation. Sci Signal. 2011; 4:rs3. doi: 10.1126/scisignal.2001570
5 : Zahedi RP, et al. Phosphoproteome of resting human platelets. J Proteome Res. 2008; 7:526-34. doi: 10.1021/pr0704130
6 : Martin E, et al. CTP synthase 1 deficiency in humans reveals its central role in lymphocyte proliferation. Nature. 2014; 510:288-92. doi: 10.1038/nature13386
7 : Schoepe J, et al. Cloning, expression, purification and preliminary crystallographic characterization of a shikimate dehydrogenase from Corynebacterium glutamicum. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2006; 62:635-7. doi: 10.1107/S1744309106017805
8 : Beausoleil SA, et al. A probability-based approach for high-throughput protein phosphorylation analysis and site localization. Nat Biotechnol. 2006; 24:1285-92. doi: 10.1038/nbt1240
9 : Gerhard DS, et al. The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). Genome Res. 2004; 14:2121-7. doi: 10.1101/gr.2596504
10 : Daub H, et al. Kinase-selective enrichment enables quantitative phosphoproteomics of the kinome across the cell cycle. Mol Cell. 2008; 31:438-48. doi: 10.1016/j.molcel.2008.07.007
11 : Choudhary C, et al. Lysine acetylation targets protein complexes and co-regulates major cellular functions. Science. 2009; 325:834-40. doi: 10.1126/science.1175371
12 : Zhou H, et al. Toward a comprehensive characterization of a human cancer cell phosphoproteome. J Proteome Res. 2013; 12:260-71. doi: 10.1021/pr300630k
13 : Strochlic TI, et al. Ack kinase regulates CTP synthase filaments during Drosophila oogenesis. EMBO Rep. 2014; 15:1184-91. doi: 10.15252/embr.201438688
14 : Yamauchi M, et al. Molecular cloning of the human CTP synthetase gene by functional complementation with purified human metaphase chromosomes. EMBO J. 1990; 9:2095-9. doi: 10.1002/j.1460-2075.1990.tb07377.x
15 : Olsen JV, et al. Global, in vivo, and site-specific phosphorylation dynamics in signaling networks. Cell. 2006; 127:635-48. doi: 10.1016/j.cell.2006.09.026
16 : Cantin GT, et al. Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient phosphoproteomic analysis. J Proteome Res. 2008; 7:1346-51. doi: 10.1021/pr0705441
17 : Gauci S, et al. Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach. Anal Chem. 2009; 81:4493-501. doi: 10.1021/ac9004309
18 : Giorgianni F, et al. Toward a global characterization of the phosphoproteome in prostate cancer cells: identification of phosphoproteins in the LNCaP cell line. Electrophoresis. 2007; 28:2027-34. doi: 10.1002/elps.200600782
19 : Ota T, et al. Complete sequencing and characterization of 21,243 full-length human cDNAs. Nat Genet. 2004; 36:40-5. doi: 10.1038/ng1285
20 : Dephoure N, et al. A quantitative atlas of mitotic phosphorylation. Proc Natl Acad Sci U S A. 2008; 105:10762-7. doi: 10.1073/pnas.0805139105
21 : Olsen JV, et al. Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis. Sci Signal. 2010; 3:ra3. doi: 10.1126/scisignal.2000475
22 : Han GS, et al. Expression of Human CTP synthetase in Saccharomyces cerevisiae reveals phosphorylation by protein kinase A. J Biol Chem. 2005; 280:38328-36. doi: 10.1074/jbc.M509622200
Literature
Paper title : Initial characterization of the human central proteome.
Doi : https://doi.org/10.1186/1752-0509-5-17
Abstract : BACKGROUND: On the basis of large proteomics datasets measured from seven human cell lines we consider their intersection as an approximation of the human central proteome, which is the set of proteins ubiquitously expressed in all human cells. Composition and properties of the central proteome are investigated through bioinformatics analyses. RESULTS: We experimentally identify a central proteome comprising 1,124 proteins that are ubiquitously and abundantly expressed in human cells using state of the art mass spectrometry and protein identification bioinformatics. The main represented functions are proteostasis, primary metabolism and proliferation. We further characterize the central proteome considering gene structures, conservation, interaction networks, pathways, drug targets, and coordination of biological processes. Among other new findings, we show that the central proteome is encoded by exon-rich genes, indicating an increased regulatory flexibility through alternative splicing to adapt to multiple environments, and that the protein interaction network linking the central proteome is very efficient for synchronizing translation with other biological processes. Surprisingly, at least 10% of the central proteome has no or very limited functional annotation. CONCLUSIONS: Our data and analysis provide a new and deeper description of the human central proteome compared to previous results thereby extending and complementing our knowledge of commonly expressed human proteins. All the data are made publicly available to help other researchers who, for instance, need to compare or link focused datasets to a common background.
Paper title : Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions.
Doi : https://doi.org/10.1126/scisignal.2000007
Abstract : Protein phosphorylation events during T cell receptor (TCR) signaling control the formation of complexes among proteins proximal to the TCR, the activation of kinase cascades, and the activation of transcription factors; however, the mode and extent of the influence of phosphorylation in coordinating the diverse phenomena associated with T cell activation are unclear. Therefore, we used the human Jurkat T cell leukemia cell line as a model system and performed large-scale quantitative phosphoproteomic analyses of TCR signaling. We identified 10,665 unique phosphorylation sites, of which 696 showed TCR-responsive changes. In addition, we analyzed broad trends in phosphorylation data sets to uncover underlying mechanisms associated with T cell activation. We found that, upon stimulation of the TCR, phosphorylation events extensively targeted protein modules involved in all of the salient phenomena associated with T cell activation: patterning of surface proteins, endocytosis of the TCR, formation of the F-actin cup, inside-out activation of integrins, polarization of microtubules, production of cytokines, and alternative splicing of messenger RNA. Further, case-by-case analysis of TCR-responsive phosphorylation sites on proteins belonging to relevant functional modules together with network analysis allowed us to deduce that serine-threonine (S-T) phosphorylation modulated protein-protein interactions (PPIs) in a system-wide fashion. We also provide experimental support for this inference by showing that phosphorylation of tubulin on six distinct serine residues abrogated PPIs during the assembly of microtubules. We propose that modulation of PPIs by stimulus-dependent changes in S-T phosphorylation state is a widespread phenomenon applicable to many other signaling systems.
Paper title : The DNA sequence and biological annotation of human chromosome 1.
Doi : https://doi.org/10.1038/nature04727
Abstract : The reference sequence for each human chromosome provides the framework for understanding genome function, variation and evolution. Here we report the finished sequence and biological annotation of human chromosome 1. Chromosome 1 is gene-dense, with 3,141 genes and 991 pseudogenes, and many coding sequences overlap. Rearrangements and mutations of chromosome 1 are prevalent in cancer and many other diseases. Patterns of sequence variation reveal signals of recent selection in specific genes that may contribute to human fitness, and also in regions where no function is evident. Fine-scale recombination occurs in hotspots of varying intensity along the sequence, and is enriched near genes. These and other studies of human biology and disease encoded within chromosome 1 are made possible with the highly accurate annotated sequence, as part of the completed set of chromosome sequences that comprise the reference human genome.
Paper title : System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation.
Doi : https://doi.org/10.1126/scisignal.2001570
Abstract : To elucidate cellular events underlying the pluripotency of human embryonic stem cells (hESCs), we performed parallel quantitative proteomic and phosphoproteomic analyses of hESCs during differentiation initiated by a diacylglycerol analog or transfer to media that had not been conditioned by feeder cells. We profiled 6521 proteins and 23,522 phosphorylation sites, of which almost 50% displayed dynamic changes in phosphorylation status during 24 hours of differentiation. These data are a resource for studies of the events associated with the maintenance of hESC pluripotency and those accompanying their differentiation. From these data, we identified a core hESC phosphoproteome of sites with similar robust changes in response to the two distinct treatments. These sites exhibited distinct dynamic phosphorylation patterns, which were linked to known or predicted kinases on the basis of the matching sequence motif. In addition to identifying previously unknown phosphorylation sites on factors associated with differentiation, such as kinases and transcription factors, we observed dynamic phosphorylation of DNA methyltransferases (DNMTs). We found a specific interaction of DNMTs during early differentiation with the PAF1 (polymerase-associated factor 1) transcriptional elongation complex, which binds to promoters of the pluripotency and known DNMT target genes encoding OCT4 and NANOG, thereby providing a possible molecular link for the silencing of these genes during differentiation.
Paper title : Phosphoproteome of resting human platelets.
Doi : https://doi.org/10.1021/pr0704130
Abstract : Beside their main physiological function in hemostasis, platelets are also highly involved in pathological processes, such as atherothrombosis and inflammation. During hemostasis, binding of adhesive substrates to tyrosine-kinase-linked adhesion receptors and/or soluble agonists to G-protein coupled receptors leads to a cascade of intracellular signaling processes based on substrate (de)phosphorylation. The same mechanisms are involved in platelet activation at sites of atherosclerotic plaque rupture, contributing to vessel occlusion and consequently to pathologic states, such as myocardial infarction, stroke, or peripheral artery disease. To gain a deeper insight into platelet function, we analyzed the phosphoproteome of resting platelets and identified 564 phosphorylation sites from more than 270 proteins, of which many have not been described in platelets before. Among those were several unknown potential protein kinase A (PKA) and protein kinase G (PKG) substrates. Because platelet inhibition is tightly regulated especially by PKA and PKG activity, these proteins may represent important new targets for cardiovascular research. Thus, our finding that GPIbalpha is phosphorylated at Ser603 in resting platelets may represent a novel mechanism for the regulation of one of the most important platelet receptor (GPIb-IX-V) mediated signaling pathways by PKA/PKG.
Paper title : CTP synthase 1 deficiency in humans reveals its central role in lymphocyte proliferation.
Doi : https://doi.org/10.1038/nature13386
Abstract : Lymphocyte functions triggered by antigen recognition and co-stimulation signals are associated with a rapid and intense cell division, and hence with metabolism adaptation. The nucleotide cytidine 5' triphosphate (CTP) is a precursor required for the metabolism of DNA, RNA and phospholipids. CTP originates from two sources: a salvage pathway and a de novo synthesis pathway that depends on two enzymes, the CTP synthases (or synthetases) 1 and 2 (CTPS1 with CTPS2); the respective roles of these two enzymes are not known. CTP synthase activity is a potentially important step for DNA synthesis in lymphocytes. Here we report the identification of a loss-of-function homozygous mutation (rs145092287) in CTPS1 in humans that causes a novel and life-threatening immunodeficiency, characterized by an impaired capacity of activated T and B cells to proliferate in response to antigen receptor-mediated activation. In contrast, proximal and distal T-cell receptor (TCR) signalling events and responses were only weakly affected by the absence of CTPS1. Activated CTPS1-deficient cells had decreased levels of CTP. Normal T-cell proliferation was restored in CTPS1-deficient cells by expressing wild-type CTPS1 or by addition of exogenous CTP or its nucleoside precursor, cytidine. CTPS1 expression was found to be low in resting T cells, but rapidly upregulated following TCR activation. These results highlight a key and specific role of CTPS1 in the immune system by its capacity to sustain the proliferation of activated lymphocytes during the immune response. CTPS1 may therefore represent a therapeutic target of immunosuppressive drugs that could specifically dampen lymphocyte activation.
Paper title : Cloning, expression, purification and preliminary crystallographic characterization of a shikimate dehydrogenase from Corynebacterium glutamicum.
Doi : https://doi.org/10.1107/S1744309106017805
Abstract : The shikimate dehydrogenase from Corynebacterium glutamicum has been cloned into an Escherichia coli expression vector, overexpressed and purified. Native crystals were obtained by the vapour-diffusion technique using 2-methyl-2,4-pentanediol as a precipitant. The crystals belong to the centred monoclinic space group C2, with unit-cell parameters a = 118.77, b = 63.17, c = 35.67 angstroms, beta = 92.26 degrees (at 100 K), and diffract to 1.64 angstroms on a synchrotron X-ray source. The asymmetric unit is likely to contain one molecule, corresponding to a packing density of 2.08 angstroms3 Da(-1) and a solvent content of about 41%.
Paper title : A probability-based approach for high-throughput protein phosphorylation analysis and site localization.
Doi : https://doi.org/10.1038/nbt1240
Abstract : Data analysis and interpretation remain major logistical challenges when attempting to identify large numbers of protein phosphorylation sites by nanoscale reverse-phase liquid chromatography/tandem mass spectrometry (LC-MS/MS) (Supplementary Figure 1 online). In this report we address challenges that are often only addressable by laborious manual validation, including data set error, data set sensitivity and phosphorylation site localization. We provide a large-scale phosphorylation data set with a measured error rate as determined by the target-decoy approach, we demonstrate an approach to maximize data set sensitivity by efficiently distracting incorrect peptide spectral matches (PSMs), and we present a probability-based score, the Ascore, that measures the probability of correct phosphorylation site localization based on the presence and intensity of site-determining ions in MS/MS spectra. We applied our methods in a fully automated fashion to nocodazole-arrested HeLa cell lysate where we identified 1,761 nonredundant phosphorylation sites from 491 proteins with a peptide false-positive rate of 1.3%.
Paper title : The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).
Doi : https://doi.org/10.1101/gr.2596504
Abstract : The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline.
Paper title : Kinase-selective enrichment enables quantitative phosphoproteomics of the kinome across the cell cycle.
Doi : https://doi.org/10.1016/j.molcel.2008.07.007
Abstract : Protein kinases are pivotal regulators of cell signaling that modulate each other's functions and activities through site-specific phosphorylation events. These key regulatory modifications have not been studied comprehensively, because low cellular abundance of kinases has resulted in their underrepresentation in previous phosphoproteome studies. Here, we combine kinase-selective affinity purification with quantitative mass spectrometry to analyze the cell-cycle regulation of protein kinases. This proteomics approach enabled us to quantify 219 protein kinases from S and M phase-arrested human cancer cells. We identified more than 1000 phosphorylation sites on protein kinases. Intriguingly, half of all kinase phosphopeptides were upregulated in mitosis. Our data reveal numerous unknown M phase-induced phosphorylation sites on kinases with established mitotic functions. We also find potential phosphorylation networks involving many protein kinases not previously implicated in mitotic progression. These results provide a vastly extended knowledge base for functional studies on kinases and their regulation through site-specific phosphorylation.
Paper title : Lysine acetylation targets protein complexes and co-regulates major cellular functions.
Doi : https://doi.org/10.1126/science.1175371
Abstract : Lysine acetylation is a reversible posttranslational modification of proteins and plays a key role in regulating gene expression. Technological limitations have so far prevented a global analysis of lysine acetylation's cellular roles. We used high-resolution mass spectrometry to identify 3600 lysine acetylation sites on 1750 proteins and quantified acetylation changes in response to the deacetylase inhibitors suberoylanilide hydroxamic acid and MS-275. Lysine acetylation preferentially targets large macromolecular complexes involved in diverse cellular processes, such as chromatin remodeling, cell cycle, splicing, nuclear transport, and actin nucleation. Acetylation impaired phosphorylation-dependent interactions of 14-3-3 and regulated the yeast cyclin-dependent kinase Cdc28. Our data demonstrate that the regulatory scope of lysine acetylation is broad and comparable with that of other major posttranslational modifications.
Paper title : Toward a comprehensive characterization of a human cancer cell phosphoproteome.
Doi : https://doi.org/10.1021/pr300630k
Abstract : Mass spectrometry (MS)-based phosphoproteomics has achieved extraordinary success in qualitative and quantitative analysis of cellular protein phosphorylation. Considering that an estimated level of phosphorylation in a cell is placed at well above 100,000 sites, there is still much room for improvement. Here, we attempt to extend the depth of phosphoproteome coverage while maintaining realistic aspirations in terms of available material, robustness, and instrument running time. We developed three strategies, where each provided a different balance between these three key parameters. The first strategy simply used enrichment by Ti(4+)-IMAC followed by reversed chromatography LC-MS (termed 1D). The second strategy incorporated an additional fractionation step through the use of HILIC (2D). Finally, a third strategy was designed employing first an SCX fractionation, followed by Ti(4+)-IMAC enrichment and additional fractionation by HILIC (3D). A preliminary evaluation was performed on the HeLa cell line. Detecting 3700 phosphopeptides in about 2 h, the 1D strategy was found to be the most sensitive but limited in comprehensivity, mainly due to issues with complexity and dynamic range. Overall, the best balance was achieved using the 2D based strategy, identifying close to 17,000 phosphopeptides with less than 1 mg of material in about 48 h. Subsequently, we confirmed the findings with the K562 cell sample. When sufficient material was available, the 3D strategy increased phosphoproteome allowing over 22,000 unique phosphopeptides to be identified. Unfortunately, the 3D strategy required more time and over 1 mg of material before it started to outperform 2D. Ultimately, combining all strategies, we were able to identify over 16,000 and nearly 24,000 unique phosphorylation sites from the cancer cell lines HeLa and K562, respectively. In summary, we demonstrate the need to carry out extensive fractionation for deep mining of the phosphoproteome and provide a guide for appropriate strategies depending on sample amount and/or analysis time.
Paper title : Ack kinase regulates CTP synthase filaments during Drosophila oogenesis.
Doi : https://doi.org/10.15252/embr.201438688
Abstract : The enzyme CTP synthase (CTPS) dynamically assembles into macromolecular filaments in bacteria, yeast, Drosophila, and mammalian cells, but the role of this morphological reorganization in regulating CTPS activity is controversial. During Drosophila oogenesis, CTPS filaments are transiently apparent in ovarian germline cells during a period of intense genomic endoreplication and stockpiling of ribosomal RNA. Here, we demonstrate that CTPS filaments are catalytically active and that their assembly is regulated by the non-receptor tyrosine kinase DAck, the Drosophila homologue of mammalian Ack1 (activated cdc42-associated kinase 1), which we find also localizes to CTPS filaments. Egg chambers from flies deficient in DAck or lacking DAck catalytic activity exhibit disrupted CTPS filament architecture and morphological defects that correlate with reduced fertility. Furthermore, ovaries from these flies exhibit reduced levels of total RNA, suggesting that DAck may regulate CTP synthase activity. These findings highlight an unexpected function for DAck and provide insight into a novel pathway for the developmental control of an essential metabolic pathway governing nucleotide biosynthesis.
Paper title : Molecular cloning of the human CTP synthetase gene by functional complementation with purified human metaphase chromosomes.
Doi : https://doi.org/10.1002/j.1460-2075.1990.tb07377.x
Abstract : Successive rounds of chromosome-mediated gene transfer were used to complement a hamster cytidine auxotroph deficient in CTP synthetase activity and eventually to clone human genomic and cDNA fragments coding for the structural gene. Our approach was to isolate human Alu+ fragments from a tertiary transfectant and to utilize these fragments to screen a panel of primary transfectants. In this manner two DNA fragments, both mapping within the structural gene, were identified and used to clone a partial length cDNA. The remaining portion of the open reading frame was obtained through the RACE polymerase chain reaction technique. The open reading frame encodes 591 amino acids having a striking degree of similarity to the Escherichia coli structural gene (48% identical amino acids with 76% overall similarity including conservative substitutions) with the glutamine amide transfer domain being particularly conserved. As regulatory mutations of CTP synthetase confer both multi-drug resistance to agents widely used in cancer chemotherapy and a mutator phenotype, the cloning of the structural gene will be important in assessing the relevance of such phenotypes to the development of cellular drug resistance.
Paper title : Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.
Doi : https://doi.org/10.1016/j.cell.2006.09.026
Abstract : Cell signaling mechanisms often transmit information via posttranslational protein modifications, most importantly reversible protein phosphorylation. Here we develop and apply a general mass spectrometric technology for identification and quantitation of phosphorylation sites as a function of stimulus, time, and subcellular location. We have detected 6,600 phosphorylation sites on 2,244 proteins and have determined their temporal dynamics after stimulating HeLa cells with epidermal growth factor (EGF) and recorded them in the Phosida database. Fourteen percent of phosphorylation sites are modulated at least 2-fold by EGF, and these were classified by their temporal profiles. Surprisingly, a majority of proteins contain multiple phosphorylation sites showing different kinetics, suggesting that they serve as platforms for integrating signals. In addition to protein kinase cascades, the targets of reversible phosphorylation include ubiquitin ligases, guanine nucleotide exchange factors, and at least 46 different transcriptional regulators. The dynamic phosphoproteome provides a missing link in a global, integrative view of cellular regulation.
Paper title : Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient phosphoproteomic analysis.
Doi : https://doi.org/10.1021/pr0705441
Abstract : Immobilized metal affinity chromatography (IMAC) is a common strategy used for the enrichment of phosphopeptides from digested protein mixtures. However, this strategy by itself is inefficient when analyzing complex protein mixtures. Here, we assess the effectiveness of using protein-based IMAC as a pre-enrichment step prior to peptide-based IMAC. Ultimately, we couple the two IMAC-based enrichments and MudPIT in a quantitative phosphoproteomic analysis of the epidermal growth factor pathway in mammalian cells identifying 4470 unique phosphopeptides containing 4729 phosphorylation sites.
Paper title : Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach.
Doi : https://doi.org/10.1021/ac9004309
Abstract : The analysis of proteome-wide phosphorylation events is still a major analytical challenge because of the enormous complexity of protein phosphorylation networks. In this work, we evaluate the complementarity of Lys-N, Lys-C, and trypsin with regard to their ability to contribute to the global analysis of the phosphoproteome. A refined version of low-pH strong cation exchange was used to efficiently separate N-terminally acetylated, phosphorylated, and nonmodified peptides. A total of 5036 nonredundant phosphopeptides could be identified with a false discovery rate of <1% from 1 mg of protein using a combination of the three enzymes. Our data revealed that the overlap between the phosphopeptide data sets generated with different proteases was marginal, whereas the overlap between two similarly generated tryptic data sets was found to be at least 4 times higher. In this way, the parallel use of Lys-N and trypsin enabled a 72% increase in the number of detected phosphopeptides as compared to trypsin alone, whereas a trypsin replicate experiment only led to a 25% increase. Thus, when focusing solely on the trypsin and Lys-N data, we identified 4671 nonredundant phosphopeptides. Further analysis of the detected sites showed that the Lys-N and trypsin data sets were enriched in significantly different phosphorylation motifs, further evidencing that multiprotease approaches are very valuable in phosphoproteome analyses.
Paper title : Toward a global characterization of the phosphoproteome in prostate cancer cells: identification of phosphoproteins in the LNCaP cell line.
Doi : https://doi.org/10.1002/elps.200600782
Abstract : Protein phosphorylation plays a major role in most cell-signaling pathways in all eukaryotic cells. Disruptions in phosphorylation-mediated cell-signaling events are associated with various diseases, including cancer. Here, we applied a fully non-gel-based methodology to obtain an initial panel of phosphoproteins from the LNCaP human prostate cancer cell line. The analytical strategy involved enrichment of phosphopeptides by immobilized metal ion affinity chromatography, the use of POROS Oligo R3 to capture phosphopeptides that were not retained with a C18 packing, and gas-phase fractionation in the m/z dimension to extend the dynamic range of the LC-MS/MS analysis. In this pilot investigation, 137 phosphorylation sites in 81 phosphoproteins were identified. The characterized phosphoproteins include kinases, co-regulators of steroid receptors, and a number of cancer-related proteins.
Paper title : Complete sequencing and characterization of 21,243 full-length human cDNAs.
Doi : https://doi.org/10.1038/ng1285
Abstract : As a base for human transcriptome and functional genomics, we created the "full-length long Japan" (FLJ) collection of sequenced human cDNAs. We determined the entire sequence of 21,243 selected clones and found that 14,490 cDNAs (10,897 clusters) were unique to the FLJ collection. About half of them (5,416) seemed to be protein-coding. Of those, 1,999 clusters had not been predicted by computational methods. The distribution of GC content of nonpredicted cDNAs had a peak at approximately 58% compared with a peak at approximately 42%for predicted cDNAs. Thus, there seems to be a slight bias against GC-rich transcripts in current gene prediction procedures. The rest of the cDNAs unique to the FLJ collection (5,481) contained no obvious open reading frames (ORFs) and thus are candidate noncoding RNAs. About one-fourth of them (1,378) showed a clear pattern of splicing. The distribution of GC content of noncoding cDNAs was narrow and had a peak at approximately 42%, relatively low compared with that of protein-coding cDNAs.
Paper title : A quantitative atlas of mitotic phosphorylation.
Doi : https://doi.org/10.1073/pnas.0805139105
Abstract : The eukaryotic cell division cycle is characterized by a sequence of orderly and highly regulated events resulting in the duplication and separation of all cellular material into two newly formed daughter cells. Protein phosphorylation by cyclin-dependent kinases (CDKs) drives this cycle. To gain further insight into how phosphorylation regulates the cell cycle, we sought to identify proteins whose phosphorylation is cell cycle regulated. Using stable isotope labeling along with a two-step strategy for phosphopeptide enrichment and high mass accuracy mass spectrometry, we examined protein phosphorylation in a human cell line arrested in the G(1) and mitotic phases of the cell cycle. We report the identification of >14,000 different phosphorylation events, more than half of which, to our knowledge, have not been described in the literature, along with relative quantitative data for the majority of these sites. We observed >1,000 proteins with increased phosphorylation in mitosis including many known cell cycle regulators. The majority of sites on regulated phosphopeptides lie in [S/T]P motifs, the minimum required sequence for CDKs, suggesting that many of the proteins may be CDK substrates. Analysis of non-proline site-containing phosphopeptides identified two unique motifs that suggest there are at least two undiscovered mitotic kinases.
Paper title : Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis.
Doi : https://doi.org/10.1126/scisignal.2000475
Abstract : Eukaryotic cells replicate by a complex series of evolutionarily conserved events that are tightly regulated at defined stages of the cell division cycle. Progression through this cycle involves a large number of dedicated protein complexes and signaling pathways, and deregulation of this process is implicated in tumorigenesis. We applied high-resolution mass spectrometry-based proteomics to investigate the proteome and phosphoproteome of the human cell cycle on a global scale and quantified 6027 proteins and 20,443 unique phosphorylation sites and their dynamics. Co-regulated proteins and phosphorylation sites were grouped according to their cell cycle kinetics and compared to publicly available messenger RNA microarray data. Most detected phosphorylation sites and more than 20% of all quantified proteins showed substantial regulation, mainly in mitotic cells. Kinase-motif analysis revealed global activation during S phase of the DNA damage response network, which was mediated by phosphorylation by ATM or ATR or DNA-dependent protein kinases. We determined site-specific stoichiometry of more than 5000 sites and found that most of the up-regulated sites phosphorylated by cyclin-dependent kinase 1 (CDK1) or CDK2 were almost fully phosphorylated in mitotic cells. In particular, nuclear proteins and proteins involved in regulating metabolic processes have high phosphorylation site occupancy in mitosis. This suggests that these proteins may be inactivated by phosphorylation in mitotic cells.
Paper title : Expression of Human CTP synthetase in Saccharomyces cerevisiae reveals phosphorylation by protein kinase A.
Doi : https://doi.org/10.1074/jbc.M509622200
Abstract : CTP synthetase (EC 6.3.4.2, UTP:ammonia ligase (ADP-forming)) is an essential enzyme in all organisms; it generates the CTP required for the synthesis of nucleic acids and membrane phospholipids. In this work we showed that the human CTP synthetase genes, CTPS1 and CTPS2, were functional in Saccharomyces cerevisiae and complemented the lethal phenotype of the ura7Delta ura8Delta mutant lacking CTP synthetase activity. The expression of the CTPS1- and CTPS2-encoded human CTP synthetase enzymes in the ura7Delta ura8Delta mutant was shown by immunoblot analysis of CTP synthetase proteins, the measurement of CTP synthetase activity, and the synthesis of CTP in vivo. Phosphoamino acid and phosphopeptide mapping analyses of human CTP synthetase 1 isolated from (32)P(i)-labeled cells revealed that the enzyme was phosphorylated on multiple serine residues in vivo. Activation of protein kinase A activity in yeast resulted in transient increases (2-fold) in the phosphorylation of human CTP synthetase 1 and the cellular level of CTP. Human CTP synthetase 1 was also phosphorylated by mammalian protein kinase A in vitro. Using human CTP synthetase 1 purified from Escherichia coli as a substrate, protein kinase A activity was dose- and time-dependent, and dependent on the concentrations of CTP synthetase 1 and ATP. These studies showed that S. cerevisiae was useful for the analysis of human CTP synthetase phosphorylation.