dbACP: A Comprehensive Database of Anti-Cancer Peptides

dbacp02787

General Description

Peptide name : DNAtopoisomerase 2-alpha

Source/Organism : Human

Linear/Cyclic : Not found

Chirality : Not found

Sequence Information

Sequence : MEVSPLQPVNEnMQVNKIKKNEDAKKRLSVERIYQKKTQLEHILLRPDTYIGSVELVTQQMWVYDEDVGINYREVTFVPGLYKIFDEILVNAADNKQRDPKMSCIRVTIDPENNLISIWNNGKGIPVVEHKVEKMYVPALIFGQLLTSSNYDDDEKKVTGGRNGYGAKLCNIFSTKFTVETASREYKKMFKQTWMDnMGRAGEMELKPFNGEDYTCITFQPDLSKFKMQSLDKDIVALMVRRAYDIAGSTKDVKVFLNGNKLPVKGFRSYVDMYLKDKLDETGNSLKVIHEQVNHRWEVCLTMSEKGFQQISFVNSIATSKGGRHVDYVADQIVTKLVDVVKKKNKGGVAVKAHQVKNHMWIFVNALIENPTFDSQTKEnMTLQPKSFGSTCQLSEKFIKAAIGCGIVESILNWVKFKAQVQLNKKCSAVKHNRIKGIPKLDDANDAGGRNSTECTLILTEGDSAKTLAVSGLGVVGRDKYGVFPLRGKILNVREASHKQIMENAEINNIIKIVGLQYKKNYEDEDSLKTLRYGKIMIMTDQDQDGSHIKGLLINFIHHNWPSLLRHRFLEEFITPIVKVSKNKQEMAFYSLPEFEEWKSSTPNHKKWKVKYYKGLGTSTSKEAKEYFADMKRHRIQFKYSGPEDDAAISLAFSKKQIDDRKEWLTNFMEDRRQRKLLGLPEDYLYGQTTTYLTYNDFINKELILFSNSDNERSIPSMVDGLKPGQRKVLFTCFKRNDKREVKVAQLAGSVAEMSSYHHGEMSLMMTIINLAQNFVGSNNLNLLQPIGQFGTRLHGGKDSASPRYIFTMLSSLARLLFPPKDDHTLKFLYDDNQRVEPEWYIPIIPMVLINGAEGIGTGWSCKIPNFDVREIVNNIRRLMDGEEPLPMLPSYKNFKGTIEELAPNQYVISGEVAILNSTTIEISELPVRTWTQTYKEQVLEPMLNGTEKTPPLITDYREYHTDTTVKFVVKMTEEKLAEAERVGLHKVFKLQTSLTCNSMVLFDHVGCLKKYDTVLDILRDFFELRLKYYGLRKEWLLGMLGAESAKLNNQARFILEKIDGKIIIENKPKKELIKVLIQRGYDSDPVKAWKEAQQKVPDEEENEESDNEKETEKSDSVTDSGPTFNYLLDMPLWYLTKEKKDELCRLRNEKEQELDTLKRKSPSDLWKEDLATFIEELEAVEAKEKQDEQVGLPGKGGKAKGKKTQMAEVLPSPRGQRVIPRITIEMKAEAEKKNKKKIKNENTEGSPQEDGVELEGLKQRLEKKQKREPGTKTKKQTTLAFKPIKKGKKRNPWSDSESDRSSDESNFDVPPRETEPRRAATKTKFTMDLDSDEDFSDFDEKTDDEDFVPSDASPPKTKTSPKLSNKELKPQKSVVSDLEADDVKGSVPLSSSPPATHFPDETEITNPVPKKNVTVKKTAAKSQSSTSTTGAKKRAAPKGTKRDPALNSGVSQKPDPAKTKNRRKRKPSTSDDSDSNFEKIVSKAVTSKKSKGESDDFHMDFDSAVAPRAKSVRAKKPIKYLEESDEDDLF

Peptide length: 1531

C-terminal modification: Not found

N-terminal modification : Not found

Non-natural peptide information: None

Activity Information

Assay type : Antibody-based assay

Assay time : 48h

Activity : Not found

Cell line : HaCat

Cancer type : Not specified

Other activity : Not found

Physicochemical Properties

Amino acid composition bar chart :

Molecular mass : 174383.028 Dalton

Aliphatic index : 0.756

Instability index : 40.2918

Hydrophobicity (GRAVY) : -0.695

Isoelectric point : 8.8222

Charge (pH 7) : 21.7025

Aromaticity : 0.077

Molar extinction coefficient (cysteine, cystine): (163070, 163820)

Hydrophobic/hydrophilic ratio : 0.76239907

hydrophobic moment : 0.0177

Missing amino acid : None

Most occurring amino acid : K

Most occurring amino acid frequency : 179

Least occurring amino acid : n

Least occurring amino acid frequency : 3

Structural Information

3D structure : Not Available

Secondary structure fraction (Helix, Turn, Sheet): (0.3, 0.2, 0.3)

SMILES Notation: CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)CNC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CS)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCSC)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1ccccc1)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CS)NC(=O)[C@H](CO)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](N)CCSC)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(=O)NCC(=O)N[C@H](C(=O)NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)NCC(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)NCC(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccccc1)C(=O)O)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O

Secondary Structure :

Method Prediction
GOR EEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHTHHHHEEEECTTCEEEEEEEEHHEEEEETTTHTCCCEEEEECTTCCEEEHHHHHHHHHHTCCCTTTEEEEEECCCTTCHEEEETTTTCCEHHHHHHHHHHCHHHHEEEEETTTCCCTTHTEEETCCCCCTTEEEHHHHCEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHCTTTTTTTEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHTTCCCHHEEEETTTCCCETTEEEEEHHHHTTTHHHTTTTHEEEEHHHTHHHHHHHHHHHHHCHHEEEEEEEEHTTTCEEEEEEHHHHHHHEHHHHHHTTTTCHHHHHHHHHHHHHHEHHHHHTCCCCHHTHTTHHHCCCCCTCCCHHHHHHHHHHHHTTEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHTCCCCCCCTTTTTCCTTTTEEEEEEHCCHHHEEEEEEEEEEEETTTTCCEETTTEHHHHHHHHHHHHHHHHHHHHHHEEEHHHTTTCCHHHHHHHHHTTEEEEEEHHTTTTCEETTEEEEEEETTCTTHHHHHHHHHHHCHHHHHHTTHHHHHHHTCTHHHHHHTTCHHHHHHHHHEETTCCCCCTHHHHHHHHHHHHHHHHEETTCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTCCTTEETTTEEEEEEECCHHHHHHEEETTTTTTTCCCEEEETCCTTCCTEEEHHHHTTTHHHHHHHHHHTHHHHHHHTTTTHHHHHHHHHHHHHEEETCCTCTTECCCCEEEEEEETCCCTTCCTEEEEEHHTHHHHECCTTTTTTHEEECCTTTTCCTTTECCCCEEEEETTCTEEEEEEETTCCCCHHHHHHHHHHHEETTCCCCCCCCTTTTTTTCEHHHCTTCEEEEEEEEEETTCEEEEEECEEHEEEHHCTHHHHCHHHTTCTCCCCCEEEEETTTCCCHHHHHHHHHHHHHHHHHHHHHHHHEEEETTETTTTEEEEHHHTTTTTTCCEEHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHEEEEECCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHTTTCEETTCCCCEEEETCHHHHHHHTHHHHHHHHHHHHHHHHHHHHTTCCTTHHHHHHHHHHHHHHHHHHHHHHHHHHCCTTTTCHHHHHHHHHHEECCCCTTCEECEEHHHHHHHHHHHHHHHHHHTTTCTTCCCTTHHHHHHHHHHHHHHHHHCTTCHHTTCHHHHHHHHHTTTTCCCTTTTTTCTTTTTTTCCCCCTTCCHHHHHHHHHEEECCTTTHHHHHHHTTCCCTTCCCCTTCCTCCCCCCTTTTHHHCHTTEEEEHHHHHHHTTCCEETTCCCCCCCCTTTHECCCCTTTCHHHHHHHHHTTTTEEEEHHHHHHCTTTCCCCCTEECCCCCCCCCTHHHHHHTTCTCCCCCTTTHHHHHHHHHHHHTTTTTTTHHHHHHHHHHHCHHHHHHHHTCCHHHHTTTHHHHHH
Chou-Fasman (CF) CCCCCCCCCCCEECCHHHHHHHHHHHCCCEEEEHHHHHHHHEECCCCEEEECCEEEECCEEEHHHHEEEECEEEEEEECEEEHHHHEEHHHHHHHHCCCCCEEEEEEECCCCEEEEEECCCCCEEEEHHHHHHEEECCCEEECEEECCCCCHHHHEEEECCCCCCCCCEEEEEEEEEECCCCCHHHHHHEEEECCCCCCHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHEEECCEECCCEECCCCCCCEEEECCCCCEECCCCEEEEEHHHHHHHHCCCCCEEEHHHHEECCCCCEEEHHHHHEEEEEEEECEECCCCCCCEEEECCEEEEECEEEECCCCCEEEEHHHHHHHHHEEEEEHHHHCCCCCCCCHHHHHHCCCCCCEEEEEHHHHHHHHHEEEEEEECEEEEEHHHHHEEHHHHHCHHHHCCEECCCHHHHHHCCCCCCCCCCEEEECCCCHHHHEEEECEEEEECCCEEEECCCEECEEHHHHHHHHHHHHHHCEEEEEEEEECCCCHHHHHHHCCCEECCCEEEEHHHHCCCCCEECEEEEEECCCCCCHHHHHHHHHEEEEEEECCCHHHHHHEEEHHHHHHHCCCCCCHHHHHEEEEECEEEECHHHHHHHHHHHHHEEEEEECCCCHHHHHEEHHHHHHHHHHHHHHEEEHHHHHHHHHHHCCCCCEEEEEEEEEEECCEEEHHHHEEECCCCCCCEEEEEEECCCCCCCCEEEEEHHHHHHHHHCHHHHHEEHHHHEECCHHHHHHHEEEEHHHHEEEECCCHHHHEEEECEEEECCCCCCCCCCEEEEEECCHHHHHHCCCCHHHHCCEECCCCCCCCCCEEEEEEEEEEECCCCCEEEEECCCCCCEECEEEEECEEHHHHHHHCCCCCCCCCCCEEEHHHHHCCEEEECCCCCCEEEEEEEHHHHEEEEEEEEHHHHHHHHCCCCCCCCCCEEEECCCCEEEEEEEEEEEHHHHHHHHHHHEEHHHHCCCEEEEEECEEEECEEEEHHHHEEEEEHHHHHHHHHHHEEEEHHHHHCCCCHHHHHHHHCCCCEEHHHHCCCEEEECCCHHHHHEEEEEEECCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHCEEEECCCCCEEEHHHHHEEEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHEEHHHHHHHHHHHHHHHHEECCCCCCHHHHCHHHHHHHCCCCCCEEEEEEEEHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEECCCEEECCCCCCHHHHHCCCCCCCCCCCCCCCCCEECCCCCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHEECCCCCCCCEECCCCHHHHHHHHCEEEEEHHHHHHCEEEEECCCCCCCCCCCCCCEEEECCCCCEEEEEHHHHHCCEEEEEEHHHHHHHCCCCCCHHHHCEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHEECCEEECCCCCCCCCHHHHHCHHHHCHHHHEECCCCCEEHHHHHHHHHCCC
Neural Network (NN) CCCCCCCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEHHHHHEEECCCCCCCCCCCEECCCCCCCCCCHHHHHCCCCCCCCCCCCEEEECCCCCCCCEEECCCCCCCEEHHHHHHHHHHCHHHHHEECCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCEEEHHHHHHHHHHHCHHHCCCCCCCCCCCCCCCCCCCCEECCCCCCCHHHHHCCCCHHHHHHHHHHCCCCCCCCCHEEHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCCHHHHHHHCCCHHHHHHCCCCCCCEEEEEEECECCCCCCEECCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHEHCCCCECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCCCCCCCHHHHHHHCCCEEEECCCCCCCCCCCCEEEHHHCCCCCCHHHHHHHCCCCCHHHHCCCCCCHHHHCCCCCCCCCCCCCCCCCCHHHEEEECCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCCEEEEEECCCCHHHHEECCCCCCCCCCCCCCCCCCCCCCEEEEEHCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCHHCCCCCCCCCCCCCCCCCEECCCCCCCCCCCEEEEEHHHHHHCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCEEEEECCCCCEEECCCCCCCCCCCCCCHCCHCCCCCCCCCCCEEECCECCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEHCCCCCCHHHHHHEECCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCECHCCCHHHHHCCCCCHHHHHHHCCCHHHHHHHCCCCCCCCHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCECCEEHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCHH
Joint/Consensus CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCHHHHEECCCCCCEECCEEEECCEEEECCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHCCCCCCCEEEEEECCCCCCCEEECCCCCCCEEHHHHHHHHHCCCCCEEEEECCCCCCCCCCEEECCCCCCCCEEEECCCCEEECCHHHHHHHHHHHCCCCCCCCHHHHHHHCCCCCCCCEEEECCCCHHHHHHHHHHCHHHHHHHHHHCCCCCCCCCCEECCCCCCCCCCCEEEEEHHHHCCCCCCCCCCCCHHHHHHCCCHHHHHHHHHHCCCEEEEEEEEECCCCCCEEEECHHHHHHHCHHHHHHCCCCCHHHHHHHHHHHHHHCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHCHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCHHHHHHHHHCCEEEEECCCCCCCCEECCEEEEECCCCCCCHHHHHHHHCCCCHHHHCCCCHHHHHHCCCCHHHHCCCCCCHHHHHHHEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCEEEEEEECCCHHHHHEECCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCHHHHHHHHHHCHHHHHHHCCCCHHHHHHHHHHHHHEEECCCCCCCCCCCCEEEEECCCCCCCCCCCEEEEEHHHHHHHCCCCCCCCCCEECCCCCCCCCCCCCCCCCEEEEECCCCEEEEECCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCEEEEECCCEEEEECCEEEEEECCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHCCEECCCCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCHHHHHHHEEEECCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCEEECHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHCHHHHCCHHHHHHHCCCCCCCCCCCCCCCCC

Molecular Descriptors and ADMET Properties

Molecular Descriptors: Not available.

ADMET Properties: Not available.

Cross Referencing databases

CancerPPD : Not available

ApIAPDB : Not available

CancerPPD2 ID : Not available

Reference

1 : Cantin GT, et al. Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient phosphoproteomic analysis. J Proteome Res. 2008; 7:1346-51. doi: 10.1021/pr0705441

2 : Hendriks IA, et al. Uncovering global SUMOylation signaling networks in a site-specific manner. Nat Struct Mol Biol. 2014; 21:927-36. doi: 10.1038/nsmb.2890

3 : Mayya V, et al. Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions. Sci Signal. 2009; 2:ra46. doi: 10.1126/scisignal.2000007

4 : Impens F, et al. Mapping of SUMO sites and analysis of SUMOylation changes induced by external stimuli. Proc Natl Acad Sci U S A. 2014; 111:12432-7. doi: 10.1073/pnas.1413825111

5 : Escargueil AE, et al. Mitotic phosphorylation of DNA topoisomerase II alpha by protein kinase CK2 creates the MPM-2 phosphoepitope on Ser-1469. J Biol Chem. 2000; 275:34710-8. doi: 10.1074/jbc.M005179200

6 : Schmidt BH, et al. Structure of a topoisomerase II-DNA-nucleotide complex reveals a new control mechanism for ATPase activity. Nat Struct Mol Biol. 2012; 19:1147-54. doi: 10.1038/nsmb.2388

7 : Olsen JV, et al. Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis. Sci Signal. 2010; 3:ra3. doi: 10.1126/scisignal.2000475

8 : Zody MC, et al. DNA sequence of human chromosome 17 and analysis of rearrangement in the human lineage. Nature. 2006; 440:1045-9. doi: 10.1038/nature04689

9 : Williamson EA, et al. The SET and transposase domain protein Metnase enhances chromosome decatenation: regulation by automethylation. Nucleic Acids Res. 2008; 36:5822-31. doi: 10.1093/nar/gkn560

10 : Grozav AG, et al. Casein kinase I delta/epsilon phosphorylates topoisomerase IIalpha at serine-1106 and modulates DNA cleavage activity. Nucleic Acids Res. 2009; 37:382-92. doi: 10.1093/nar/gkn934

11 : Daiß JL, et al. The human RNA polymerase I structure reveals an HMG-like docking domain specific to metazoans. Life Sci Alliance. 2022; 5:(unknown pages). doi: 10.26508/lsa.202201568

12 : Mirski SE, et al. Identification of functional nuclear export sequences in human topoisomerase IIalpha and beta. Biochem Biophys Res Commun. 2003; 306:905-11. doi: 10.1016/s0006-291x(03)01077-5

13 : Hendriks IA, et al. Site-specific mapping of the human SUMO proteome reveals co-modification with phosphorylation. Nat Struct Mol Biol. 2017; 24:325-336. doi: 10.1038/nsmb.3366

14 : Lee S, et al. DNA cleavage and opening reactions of human topoisomerase IIα are regulated via Mg2+-mediated dynamic bending of gate-DNA. Proc Natl Acad Sci U S A. 2012; 109:2925-30. doi: 10.1073/pnas.1115704109

15 : Deweese JE, et al. Metal ion interactions in the DNA cleavage/ligation active site of human topoisomerase IIalpha. Biochemistry. 2009; 48:8940-7. doi: 10.1021/bi900875c

16 : Daub H, et al. Kinase-selective enrichment enables quantitative phosphoproteomics of the kinome across the cell cycle. Mol Cell. 2008; 31:438-48. doi: 10.1016/j.molcel.2008.07.007

17 : Van Damme P, et al. N-terminal acetylome analyses and functional insights of the N-terminal acetyltransferase NatB. Proc Natl Acad Sci U S A. 2012; 109:12449-54. doi: 10.1073/pnas.1210303109

18 : Hinds M, et al. Identification of a point mutation in the topoisomerase II gene from a human leukemia cell line containing an amsacrine-resistant form of topoisomerase II. Cancer Res. 1991; 51:4729-31.

19 : Gerhard DS, et al. The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). Genome Res. 2004; 14:2121-7. doi: 10.1101/gr.2596504

20 : Stanger FV, et al. Structure of the N-terminal Gyrase B fragment in complex with ADP⋅Pi reveals rigid-body motion induced by ATP hydrolysis. PLoS One. 2014; 9:e107289. doi: 10.1371/journal.pone.0107289

21 : Wei H, et al. Nucleotide-dependent domain movement in the ATPase domain of a human type IIA DNA topoisomerase. J Biol Chem. 2005; 280:37041-7. doi: 10.1074/jbc.M506520200

22 : De Haro LP, et al. Metnase promotes restart and repair of stalled and collapsed replication forks. Nucleic Acids Res. 2010; 38:5681-91. doi: 10.1093/nar/gkq339

23 : Wendorff TJ, et al. The structure of DNA-bound human topoisomerase II alpha: conformational mechanisms for coordinating inter-subunit interactions with DNA cleavage. J Mol Biol. 2012; 424:109-24. doi: 10.1016/j.jmb.2012.07.014

24 : Yun J, et al. Interaction between glucose-regulated destruction domain of DNA topoisomerase IIalpha and MPN domain of Jab1/CSN5. J Biol Chem. 2004; 279:31296-303. doi: 10.1074/jbc.M401411200

25 : Lang AJ, et al. Structural organization of the human TOP2A and TOP2B genes. Gene. 1998; 221:255-66. doi: 10.1016/s0378-1119(98)00468-5

26 : Turley H, et al. The distribution and expression of the two isoforms of DNA topoisomerase II in normal and neoplastic human tissues. Br J Cancer. 1997; 75:1340-6. doi: 10.1038/bjc.1997.227

27 : Olsen JV, et al. Global, in vivo, and site-specific phosphorylation dynamics in signaling networks. Cell. 2006; 127:635-48. doi: 10.1016/j.cell.2006.09.026

28 : Dephoure N, et al. A quantitative atlas of mitotic phosphorylation. Proc Natl Acad Sci U S A. 2008; 105:10762-7. doi: 10.1073/pnas.0805139105

29 : Xiao Z, et al. System-wide Analysis of SUMOylation Dynamics in Response to Replication Stress Reveals Novel Small Ubiquitin-like Modified Target Proteins and Acceptor Lysines Relevant for Genome Stability. Mol Cell Proteomics. 2015; 14:1419-34. doi: 10.1074/mcp.O114.044792

30 : Ferrar T, et al. Taperin (c9orf75), a mutated gene in nonsyndromic deafness, encodes a vertebrate specific, nuclear localized protein phosphatase one alpha (PP1α) docking protein. Biol Open. 2012; 1:128-39. doi: 10.1242/bio.2011049

31 : Rigbolt KT, et al. System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation. Sci Signal. 2011; 4:rs3. doi: 10.1126/scisignal.2001570

32 : Wyles JP, et al. Nuclear interactions of topoisomerase II alpha and beta with phospholipid scramblase 1. Nucleic Acids Res. 2007; 35:4076-85. doi: 10.1093/nar/gkm434

33 : Hendriks IA, et al. SUMO-2 Orchestrates Chromatin Modifiers in Response to DNA Damage. Cell Rep. 2015; 10:1778-1791. doi: 10.1016/j.celrep.2015.02.033

34 : Burkard TR, et al. Initial characterization of the human central proteome. BMC Syst Biol. 2011; 5:17. doi: 10.1186/1752-0509-5-17

35 : Singh SK, et al. GANP regulates recruitment of AID to immunoglobulin variable regions by modulating transcription and nucleosome occupancy. Nat Commun. 2013; 4:1830. doi: 10.1038/ncomms2823

36 : Bugg BY, et al. Expression of a mutant DNA topoisomerase II in CCRF-CEM human leukemic cells selected for resistance to teniposide. Proc Natl Acad Sci U S A. 1991; 88:7654-8. doi: 10.1073/pnas.88.17.7654

37 : Tsai-Pflugfelder M, et al. Cloning and sequencing of cDNA encoding human DNA topoisomerase II and localization of the gene to chromosome region 17q21-22. Proc Natl Acad Sci U S A. 1988; 85:7177-81. doi: 10.1073/pnas.85.19.7177

38 : Sng JH, et al. Molecular cloning and characterization of the human topoisomerase IIalpha and IIbeta genes: evidence for isoform evolution through gene duplication. Biochim Biophys Acta. 1999; 1444:395-406. doi: 10.1016/s0167-4781(99)00020-2

39 : Ramamoorthy M, et al. RECQL5 cooperates with Topoisomerase II alpha in DNA decatenation and cell cycle progression. Nucleic Acids Res. 2012; 40:1621-35. doi: 10.1093/nar/gkr844

40 : Beausoleil SA, et al. A probability-based approach for high-throughput protein phosphorylation analysis and site localization. Nat Biotechnol. 2006; 24:1285-92. doi: 10.1038/nbt1240

41 : Wasserman RA, et al. Use of yeast in the study of anticancer drugs targeting DNA topoisomerases: expression of a functional recombinant human DNA topoisomerase II alpha in yeast. Cancer Res. 1993; 53:3591-6.

42 : Deweese JE, et al. Use of divalent metal ions in the dna cleavage reaction of human type II topoisomerases. Biochemistry. 2009; 48:1862-9. doi: 10.1021/bi8023256

43 : Zhou K, et al. RNA helicase A interacts with dsDNA and topoisomerase IIalpha. Nucleic Acids Res. 2003; 31:2253-60. doi: 10.1093/nar/gkg328

44 : Iida M, et al. Plk3 phosphorylates topoisomerase IIalpha at Thr(1342), a site that is not recognized by Plk1. Biochem J. 2008; 411:27-32. doi: 10.1042/BJ20071394

45 : Yu LR, et al. Improved titanium dioxide enrichment of phosphopeptides from HeLa cells and high confident phosphopeptide identification by cross-validation of MS/MS and MS/MS/MS spectra. J Proteome Res. 2007; 6:4150-62. doi: 10.1021/pr070152u

46 : Zhou H, et al. Toward a comprehensive characterization of a human cancer cell phosphoproteome. J Proteome Res. 2013; 12:260-71. doi: 10.1021/pr300630k

47 : Gauci S, et al. Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach. Anal Chem. 2009; 81:4493-501. doi: 10.1021/ac9004309

48 : Zini N, et al. Discrete localization of different DNA topoisomerases in HeLa and K562 cell nuclei and subnuclear fractions. Exp Cell Res. 1994; 210:336-48. doi: 10.1006/excr.1994.1046

49 : Li J, et al. The Epstein-Barr virus deubiquitinating enzyme BPLF1 regulates the activity of topoisomerase II during productive infection. PLoS Pathog. 2021; 17:e1009954. doi: 10.1371/journal.ppat.1009954

50 : Nicolai S, et al. Identification of Novel Proteins Co-Purifying with Cockayne Syndrome Group B (CSB) Reveals Potential Roles for CSB in RNA Metabolism and Chromatin Dynamics. PLoS One. 2015; 10:e0128558. doi: 10.1371/journal.pone.0128558

Literature

Paper title : Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient phosphoproteomic analysis.

Doi : https://doi.org/10.1021/pr0705441

Abstract : Immobilized metal affinity chromatography (IMAC) is a common strategy used for the enrichment of phosphopeptides from digested protein mixtures. However, this strategy by itself is inefficient when analyzing complex protein mixtures. Here, we assess the effectiveness of using protein-based IMAC as a pre-enrichment step prior to peptide-based IMAC. Ultimately, we couple the two IMAC-based enrichments and MudPIT in a quantitative phosphoproteomic analysis of the epidermal growth factor pathway in mammalian cells identifying 4470 unique phosphopeptides containing 4729 phosphorylation sites.

Paper title : Uncovering global SUMOylation signaling networks in a site-specific manner.

Doi : https://doi.org/10.1038/nsmb.2890

Abstract : SUMOylation is a reversible post-translational modification essential for genome stability. Using high-resolution MS, we have studied global SUMOylation in human cells in a site-specific manner, identifying a total of >4,300 SUMOylation sites in >1,600 proteins. To our knowledge, this is the first time that >1,000 SUMOylation sites have been identified under standard growth conditions. We quantitatively studied SUMOylation dynamics in response to SUMO protease inhibition, proteasome inhibition and heat shock. Many SUMOylated lysines have previously been reported to be ubiquitinated, acetylated or methylated, thus indicating cross-talk between SUMO and other post-translational modifications. We identified 70 phosphorylation and four acetylation events in proximity to SUMOylation sites, and we provide evidence for acetylation-dependent SUMOylation of endogenous histone H3. SUMOylation regulates target proteins involved in all nuclear processes including transcription, DNA repair, chromatin remodeling, precursor-mRNA splicing and ribosome assembly.

Paper title : Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions.

Doi : https://doi.org/10.1126/scisignal.2000007

Abstract : Protein phosphorylation events during T cell receptor (TCR) signaling control the formation of complexes among proteins proximal to the TCR, the activation of kinase cascades, and the activation of transcription factors; however, the mode and extent of the influence of phosphorylation in coordinating the diverse phenomena associated with T cell activation are unclear. Therefore, we used the human Jurkat T cell leukemia cell line as a model system and performed large-scale quantitative phosphoproteomic analyses of TCR signaling. We identified 10,665 unique phosphorylation sites, of which 696 showed TCR-responsive changes. In addition, we analyzed broad trends in phosphorylation data sets to uncover underlying mechanisms associated with T cell activation. We found that, upon stimulation of the TCR, phosphorylation events extensively targeted protein modules involved in all of the salient phenomena associated with T cell activation: patterning of surface proteins, endocytosis of the TCR, formation of the F-actin cup, inside-out activation of integrins, polarization of microtubules, production of cytokines, and alternative splicing of messenger RNA. Further, case-by-case analysis of TCR-responsive phosphorylation sites on proteins belonging to relevant functional modules together with network analysis allowed us to deduce that serine-threonine (S-T) phosphorylation modulated protein-protein interactions (PPIs) in a system-wide fashion. We also provide experimental support for this inference by showing that phosphorylation of tubulin on six distinct serine residues abrogated PPIs during the assembly of microtubules. We propose that modulation of PPIs by stimulus-dependent changes in S-T phosphorylation state is a widespread phenomenon applicable to many other signaling systems.

Paper title : Mapping of SUMO sites and analysis of SUMOylation changes induced by external stimuli.

Doi : https://doi.org/10.1073/pnas.1413825111

Abstract : SUMOylation is an essential ubiquitin-like modification involved in important biological processes in eukaryotic cells. Identification of small ubiquitin-related modifier (SUMO)-conjugated residues in proteins is critical for understanding the role of SUMOylation but remains experimentally challenging. We have set up a powerful and high-throughput method combining quantitative proteomics and peptide immunocapture to map SUMOylation sites and have analyzed changes in SUMOylation in response to stimuli. With this technique we identified 295 SUMO1 and 167 SUMO2 sites on endogenous substrates of human cells. We further used this strategy to characterize changes in SUMOylation induced by listeriolysin O, a bacterial toxin that impairs the host cell SUMOylation machinery, and identified several classes of host proteins specifically deSUMOylated in response to this toxin. Our approach constitutes an unprecedented tool, broadly applicable to various SUMO-regulated cellular processes in health and disease.

Paper title : Mitotic phosphorylation of DNA topoisomerase II alpha by protein kinase CK2 creates the MPM-2 phosphoepitope on Ser-1469.

Doi : https://doi.org/10.1074/jbc.M005179200

Abstract : DNA topoisomerase II alpha is required for chromatin condensation during prophase. This process is temporally linked with the appearance of mitosis-specific phosphorylation sites on topoisomerase IIalpha including one recognized by the MPM-2 monoclonal antibody. We now report that the ability of mitotic extracts to create the MPM-2 epitope on human topoisomerase II alpha is abolished by immunodepletion of protein kinase CK2. Furthermore, the MPM-2 phosphoepitope on topoisomerase II alpha can be generated by purified CK2. Phosphorylation of C-truncated topoisomerase II alpha mutant proteins conclusively shows, that the MPM-2 epitope is present in the last 163 amino acids. Use of peptides containing all conserved CK2 consensus sites in this region indicates that only the peptide containing Arg-1466 to Ala-1485 is able to compete with topoisomerase II alpha for binding of the MPM-2 antibody. Replacement of Ser-1469 with Ala abolishes the ability of the phosphorylated peptide to bind to the MPM-2 antibody while a peptide containing phosphorylated Ser-1469 binds tightly. Surprisingly, the MPM-2 phosphoepitope influences neither the catalytic activity of topoisomerase II alpha nor its ability to form molecular complexes with CK2 in vitro. In conclusion, we have identified protein kinase CK2 as a new MPM-2 kinase able to phosphorylate an important mitotic protein, topoisomerase II alpha, on Ser-1469.

Paper title : Structure of a topoisomerase II-DNA-nucleotide complex reveals a new control mechanism for ATPase activity.

Doi : https://doi.org/10.1038/nsmb.2388

Abstract : Type IIA topoisomerases control DNA supercoiling and disentangle chromosomes through a complex ATP-dependent strand-passage mechanism. Although a general framework exists for type IIA topoisomerase function, the architecture of the full-length enzyme has remained undefined. Here we present the structure of a fully catalytic Saccharomyces cerevisiae topoisomerase II homodimer complexed with DNA and a nonhydrolyzable ATP analog. The enzyme adopts a domain-swapped configuration wherein the ATPase domain of one protomer sits atop the nucleolytic region of its partner subunit. This organization produces an unexpected interaction between bound DNA and a conformational transducing element in the ATPase domain, which we show is critical for both DNA-stimulated ATP hydrolysis and global topoisomerase activity. Our data indicate that the ATPase domains pivot about each other to ensure unidirectional strand passage and that this state senses bound DNA to promote ATP turnover and enzyme reset.

Paper title : Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis.

Doi : https://doi.org/10.1126/scisignal.2000475

Abstract : Eukaryotic cells replicate by a complex series of evolutionarily conserved events that are tightly regulated at defined stages of the cell division cycle. Progression through this cycle involves a large number of dedicated protein complexes and signaling pathways, and deregulation of this process is implicated in tumorigenesis. We applied high-resolution mass spectrometry-based proteomics to investigate the proteome and phosphoproteome of the human cell cycle on a global scale and quantified 6027 proteins and 20,443 unique phosphorylation sites and their dynamics. Co-regulated proteins and phosphorylation sites were grouped according to their cell cycle kinetics and compared to publicly available messenger RNA microarray data. Most detected phosphorylation sites and more than 20% of all quantified proteins showed substantial regulation, mainly in mitotic cells. Kinase-motif analysis revealed global activation during S phase of the DNA damage response network, which was mediated by phosphorylation by ATM or ATR or DNA-dependent protein kinases. We determined site-specific stoichiometry of more than 5000 sites and found that most of the up-regulated sites phosphorylated by cyclin-dependent kinase 1 (CDK1) or CDK2 were almost fully phosphorylated in mitotic cells. In particular, nuclear proteins and proteins involved in regulating metabolic processes have high phosphorylation site occupancy in mitosis. This suggests that these proteins may be inactivated by phosphorylation in mitotic cells.

Paper title : DNA sequence of human chromosome 17 and analysis of rearrangement in the human lineage.

Doi : https://doi.org/10.1038/nature04689

Abstract : Chromosome 17 is unusual among the human chromosomes in many respects. It is the largest human autosome with orthology to only a single mouse chromosome, mapping entirely to the distal half of mouse chromosome 11. Chromosome 17 is rich in protein-coding genes, having the second highest gene density in the genome. It is also enriched in segmental duplications, ranking third in density among the autosomes. Here we report a finished sequence for human chromosome 17, as well as a structural comparison with the finished sequence for mouse chromosome 11, the first finished mouse chromosome. Comparison of the orthologous regions reveals striking differences. In contrast to the typical pattern seen in mammalian evolution, the human sequence has undergone extensive intrachromosomal rearrangement, whereas the mouse sequence has been remarkably stable. Moreover, although the human sequence has a high density of segmental duplication, the mouse sequence has a very low density. Notably, these segmental duplications correspond closely to the sites of structural rearrangement, demonstrating a link between duplication and rearrangement. Examination of the main classes of duplicated segments provides insight into the dynamics underlying expansion of chromosome-specific, low-copy repeats in the human genome.

Paper title : The SET and transposase domain protein Metnase enhances chromosome decatenation: regulation by automethylation.

Doi : https://doi.org/10.1093/nar/gkn560

Abstract : Metnase is a human SET and transposase domain protein that methylates histone H3 and promotes DNA double-strand break repair. We now show that Metnase physically interacts and co-localizes with Topoisomerase IIalpha (Topo IIalpha), the key chromosome decatenating enzyme. Metnase promotes progression through decatenation and increases resistance to the Topo IIalpha inhibitors ICRF-193 and VP-16. Purified Metnase greatly enhanced Topo IIalpha decatenation of kinetoplast DNA to relaxed circular forms. Nuclear extracts containing Metnase decatenated kDNA more rapidly than those without Metnase, and neutralizing anti-sera against Metnase reversed that enhancement of decatenation. Metnase automethylates at K485, and the presence of a methyl donor blocked the enhancement of Topo IIalpha decatenation by Metnase, implying an internal regulatory inhibition. Thus, Metnase enhances Topo IIalpha decatenation, and this activity is repressed by automethylation. These results suggest that cancer cells could subvert Metnase to mediate clinically relevant resistance to Topo IIalpha inhibitors.

Paper title : Casein kinase I delta/epsilon phosphorylates topoisomerase IIalpha at serine-1106 and modulates DNA cleavage activity.

Doi : https://doi.org/10.1093/nar/gkn934

Abstract : We previously reported that phosphorylation of topoisomerase (topo) IIalpha at serine-1106 (Ser-1106) regulates enzyme activity and sensitivity to topo II-targeted drugs. In this study we demonstrate that phosphorylation of Ser-1106, which is flanked by acidic amino acids, is regulated in vivo by casein kinase (CK) Idelta and/or CKIepsilon, but not by CKII. The CKI inhibitors, CKI-7 and IC261, reduced Ser-1106 phosphorylation and decreased formation of etoposide-stabilized topo II-DNA cleavable complex. In contrast, the CKII inhibitor, 5,6-dichlorobenzimidazole riboside, did not affect etoposide-stabilized topo II-DNA cleavable complex formation. Since, IC261 specifically targets the Ca(2+)-regulated isozymes, CKIdelta and CKIepsilon, we examined the effect of down-regulating these enzymes on Ser-1106 phosphorylation. Down-regulation of these isozymes with targeted si-RNAs led to hypophosphorylation of the Ser-1106 containing peptide. However, si-RNA-mediated down-regulation of CKIIalpha and alpha' did not alter Ser-1106 phosphorylation. Furthermore, reduced phosphorylation of Ser-1106, observed in HRR25 (CKIdelta/epsilon homologous gene)-deleted Saccharomyces cerevisiae cells transformed with human topo IIalpha, was enhanced following expression of human CKIepsilon. Down-regulation of CKIdelta and CKIepsilon also led to reduced formation of etoposide stabilized topo II-DNA cleavable complex. These results provide strong support for an essential role of CKIdelta/epsilon in phosphorylating Ser-1106 in human topo IIalpha and in regulating enzyme function.

Paper title : The human RNA polymerase I structure reveals an HMG-like docking domain specific to metazoans.

Doi : https://doi.org/10.26508/lsa.202201568

Abstract : Transcription of the ribosomal RNA precursor by RNA polymerase (Pol) I is a major determinant of cellular growth, and dysregulation is observed in many cancer types. Here, we present the purification of human Pol I from cells carrying a genomic GFP fusion on the largest subunit allowing the structural and functional analysis of the enzyme across species. In contrast to yeast, human Pol I carries a single-subunit stalk, and in vitro transcription indicates a reduced proofreading activity. Determination of the human Pol I cryo-EM reconstruction in a close-to-native state rationalizes the effects of disease-associated mutations and uncovers an additional domain that is built into the sequence of Pol I subunit RPA1. This "dock II" domain resembles a truncated HMG box incapable of DNA binding which may serve as a downstream transcription factor-binding platform in metazoans. Biochemical analysis, in situ modelling, and ChIP data indicate that Topoisomerase 2a can be recruited to Pol I via the domain and cooperates with the HMG box domain-containing factor UBF. These adaptations of the metazoan Pol I transcription system may allow efficient release of positive DNA supercoils accumulating downstream of the transcription bubble.

Paper title : Identification of functional nuclear export sequences in human topoisomerase IIalpha and beta.

Doi : https://doi.org/10.1016/s0006-291x(03)01077-5

Abstract : Nuclear localization of topoisomerase IIalpha and beta is important for normal cell function as well as being a determinant of tumour cell sensitivity to topoisomerase II-targeting chemotherapeutic agents. However, topoisomerase II is cytoplasmic under certain circumstances, indicating that it may undergo active nuclear export. We have examined the ability of Leu-rich potential nuclear export signal (NES) sequences present in human topoisomerase IIalpha and beta to direct the export of a green fluorescent protein-glutathione-S-transferase fusion protein following microinjection into HeLa cell nuclei. Of 12 sequences tested, only one potential NES sequence from the comparable location in each isoform (alphaNES(1018-1028) and betaNES(1034-1044)) was active. Mutation of hydrophobic residues in alphaNES(1018-1028) and betaNES(1034-1044) substantially reduced their nuclear export activity as did leptomycin B treatment of microinjected cells. Our results provide the first evidence of active nuclear export of topoisomerase II and suggest it is mediated by a CRM1-dependent pathway.

Paper title : Site-specific mapping of the human SUMO proteome reveals co-modification with phosphorylation.

Doi : https://doi.org/10.1038/nsmb.3366

Abstract : Small ubiquitin-like modifiers (SUMOs) are post-translational modifications (PTMs) that regulate nuclear cellular processes. Here we used an augmented K0-SUMO proteomics strategy to identify 40,765 SUMO acceptor sites and quantify their fractional contribution for 6,747 human proteins. Structural-predictive analyses revealed that lysines residing in disordered regions are preferentially targeted by SUMO, in notable contrast to other widespread lysine modifications. In our data set, we identified 807 SUMOylated peptides that were co-modified by phosphorylation, along with dozens of SUMOylated peptides that were co-modified by ubiquitylation, acetylation and methylation. Notably, 9% of the identified SUMOylome occurred proximal to phosphorylation, and numerous SUMOylation sites were found to be fully dependent on prior phosphorylation events. SUMO-proximal phosphorylation occurred primarily in a proline-directed manner, and inhibition of cyclin-dependent kinases dynamically affected co-modification. Collectively, we present a comprehensive analysis of the SUMOylated proteome, uncovering the structural preferences for SUMO and providing system-wide evidence for a remarkable degree of cross-talk between SUMOylation and other major PTMs.

Paper title : DNA cleavage and opening reactions of human topoisomerase IIα are regulated via Mg2+-mediated dynamic bending of gate-DNA.

Doi : https://doi.org/10.1073/pnas.1115704109

Abstract : Topoisomerase II resolves intrinsic topological problems of double-stranded DNA. As part of its essential cellular functions, the enzyme generates DNA breaks, but the regulation of this potentially dangerous process is not well understood. Here we report single-molecule fluorescence experiments that reveal a previously uncharacterized sequence of events during DNA cleavage by topoisomerase II: nonspecific DNA binding, sequence-specific DNA bending, and stochastic cleavage of DNA. We have identified unexpected structural roles of Mg(2+) ions coordinated in the TOPRIM (topoisomerase-primase) domain in inducing cleavage-competent DNA bending. A break at one scissile bond dramatically stabilized DNA bending, explaining how two scission events in opposing strands can be coordinated to achieve a high probability of double-stranded cleavage. Clamping of the protein N-gate greatly enhanced the rate and degree of DNA bending, resulting in a significant stimulation of the DNA cleavage and opening reactions. Our data strongly suggest that the accurate cleavage of DNA by topoisomerase II is regulated through a tight coordination with DNA bending.

Paper title : Metal ion interactions in the DNA cleavage/ligation active site of human topoisomerase IIalpha.

Doi : https://doi.org/10.1021/bi900875c

Abstract : Human topoisomerase IIalpha utilizes a two-metal-ion mechanism for DNA cleavage. One of the metal ions (M(1)(2+)) is believed to make a critical interaction with the 3'-bridging atom of the scissile phosphate, while the other (M(2)(2+)) is believed to interact with a nonbridging oxygen of the scissile phosphate. Based on structural and mutagenesis studies of prokaryotic nucleic acid enzymes, it has been proposed that the active site divalent metal ions interact with type II topoisomerases through a series of conserved acidic amino acid residues. The homologous residues in human topoisomerase IIalpha are E461, D541, D543, and D545. To address the validity of these assignments and to delineate interactions between individual amino acids and M(1)(2+) and M(2)(2+), we individually mutated each of these acidic amino acid residues in topoisomerase IIalpha to either cysteine or alanine. Mutant enzymes displayed a marked loss of catalytic and DNA cleavage activity as well as a reduced affinity for divalent metal ions. Additional experiments determined the ability of wild-type and mutant topoisomerase IIalpha enzymes to cleave an oligonucleotide substrate that contained a sulfur atom in place of the 3'-bridging oxygen of the scissile phosphate in the presence of Mg2+, Mn2+, or Ca2+. On the basis of the results of these studies, we conclude that the four acidic amino acid residues interact with metal ions in the DNA cleavage/ligation active site of topoisomerase IIalpha. Furthermore, we propose that M(1)(2+) interacts with E461, D543, and D545 and M(2)(2+) interacts with E461 and D541.

Paper title : Kinase-selective enrichment enables quantitative phosphoproteomics of the kinome across the cell cycle.

Doi : https://doi.org/10.1016/j.molcel.2008.07.007

Abstract : Protein kinases are pivotal regulators of cell signaling that modulate each other's functions and activities through site-specific phosphorylation events. These key regulatory modifications have not been studied comprehensively, because low cellular abundance of kinases has resulted in their underrepresentation in previous phosphoproteome studies. Here, we combine kinase-selective affinity purification with quantitative mass spectrometry to analyze the cell-cycle regulation of protein kinases. This proteomics approach enabled us to quantify 219 protein kinases from S and M phase-arrested human cancer cells. We identified more than 1000 phosphorylation sites on protein kinases. Intriguingly, half of all kinase phosphopeptides were upregulated in mitosis. Our data reveal numerous unknown M phase-induced phosphorylation sites on kinases with established mitotic functions. We also find potential phosphorylation networks involving many protein kinases not previously implicated in mitotic progression. These results provide a vastly extended knowledge base for functional studies on kinases and their regulation through site-specific phosphorylation.

Paper title : N-terminal acetylome analyses and functional insights of the N-terminal acetyltransferase NatB.

Doi : https://doi.org/10.1073/pnas.1210303109

Abstract : Protein N-terminal acetylation (Nt-acetylation) is an important mediator of protein function, stability, sorting, and localization. Although the responsible enzymes are thought to be fairly well characterized, the lack of identified in vivo substrates, the occurrence of Nt-acetylation substrates displaying yet uncharacterized N-terminal acetyltransferase (NAT) specificities, and emerging evidence of posttranslational Nt-acetylation, necessitate the use of genetic models and quantitative proteomics. NatB, which targets Met-Glu-, Met-Asp-, and Met-Asn-starting protein N termini, is presumed to Nt-acetylate 15% of all yeast and 18% of all human proteins. We here report on the evolutionary traits of NatB from yeast to human and demonstrate that ectopically expressed hNatB in a yNatB-Δ yeast strain partially complements the natB-Δ phenotypes and partially restores the yNatB Nt-acetylome. Overall, combining quantitative N-terminomics with yeast studies and knockdown of hNatB in human cell lines, led to the unambiguous identification of 180 human and 110 yeast NatB substrates. Interestingly, these substrates included Met-Gln- N-termini, which are thus now classified as in vivo NatB substrates. We also demonstrate the requirement of hNatB activity for maintaining the structure and function of actomyosin fibers and for proper cellular migration. In addition, expression of tropomyosin-1 restored the altered focal adhesions and cellular migration defects observed in hNatB-depleted HeLa cells, indicative for the conserved link between NatB, tropomyosin, and actin cable function from yeast to human.

Paper title : Identification of a point mutation in the topoisomerase II gene from a human leukemia cell line containing an amsacrine-resistant form of topoisomerase II.

Doi : https://doi.org/Not available

Abstract : HL-60/AMSA is a human leukemia cell line that is 50- to 100-fold more resistant to the cytotoxic actions of the topoisomerase II-reactive intercalator amsacrine than is its drug-sensitive HL-60 parent line. Previously, we have shown that the topoisomerase II from HL-60/AMSA is also resistant to inhibition by amsacrine and other intercalating agents. We therefore sought the molecular basis for the resistance of the topoisomerase II of HL-60/AMSA and, by inference, of the HL-60/AMSA line itself. We report the cloning and sequencing of the topoisomerase II genes from both the sensitive and resistant leukemia cell lines using polymerase chain reaction technology. We have identified a single base change associated with the drug-resistant form of topoisomerase II. This mutation is present in both cloned HL-60/AMSA complementary DNA and extracted HL-60/AMSA genomic DNA. A rapid assay for this mutation in clinical samples has been developed and applied to the DNA of cells from both normal volunteers and leukemia patients. Thus far, the HL-60/AMSA genotype has not been identified in the cells from any individual, suggesting that this genotype is indeed a mutation and not an allelic form of topoisomerase II. The novel assay developed will allow a rapid search for the prevalence of this mutation in clinical samples from patients with leukemia who have relapsed following intercalator therapy.

Paper title : The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).

Doi : https://doi.org/10.1101/gr.2596504

Abstract : The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline.

Paper title : Structure of the N-terminal Gyrase B fragment in complex with ADP⋅Pi reveals rigid-body motion induced by ATP hydrolysis.

Doi : https://doi.org/10.1371/journal.pone.0107289

Abstract : Type II DNA topoisomerases are essential enzymes that catalyze topological rearrangement of double-stranded DNA using the free energy generated by ATP hydrolysis. Bacterial DNA gyrase is a prototype of this family and is composed of two subunits (GyrA, GyrB) that form a GyrA2GyrB2 heterotetramer. The N-terminal 43-kDa fragment of GyrB (GyrB43) from E. coli comprising the ATPase and the transducer domains has been studied extensively. The dimeric fragment is competent for ATP hydrolysis and its structure in complex with the substrate analog AMPPNP is known. Here, we have determined the remaining conformational states of the enzyme along the ATP hydrolysis reaction path by solving crystal structures of GyrB43 in complex with ADP⋅BeF3, ADP⋅Pi, and ADP. Upon hydrolysis, the enzyme undergoes an obligatory 12° domain rearrangement to accommodate the 1.5 Å increase in distance between the γ- and β-phosphate of the nucleotide within the sealed binding site at the domain interface. Conserved residues from the QTK loop of the transducer domain (also part of the domain interface) couple the small structural change within the binding site with the rigid body motion. The domain reorientation is reflected in a significant 7 Å increase in the separation of the two transducer domains of the dimer that would embrace one of the DNA segments in full-length gyrase. The observed conformational change is likely to be relevant for the allosteric coordination of ATP hydrolysis with DNA binding, cleavage/re-ligation and/or strand passage.

Paper title : Nucleotide-dependent domain movement in the ATPase domain of a human type IIA DNA topoisomerase.

Doi : https://doi.org/10.1074/jbc.M506520200

Abstract : Type IIA DNA topoisomerases play multiple essential roles in the management of higher-order DNA structure, including modulation of topological state, chromosome segregation, and chromatin condensation. These diverse physiologic functions are all accomplished through a common molecular mechanism, wherein the protein catalyzes transient cleavage of a DNA duplex (the G-segment) to yield a double-stranded gap through which another duplex (the T-segment) is passed. The overall process is orchestrated by the opening and closing of molecular "gates" in the topoisomerase structure, which is regulated by ATP binding, hydrolysis, and release of ADP and inorganic phosphate. Here we present two crystal structures of the ATPase domain of human DNA topoisomerase IIalpha in different nucleotide-bound states. Comparison of these structures revealed rigid-body movement of the structural modules within the ATPase domain, suggestive of the motions of a molecular gate.

Paper title : Metnase promotes restart and repair of stalled and collapsed replication forks.

Doi : https://doi.org/10.1093/nar/gkq339

Abstract : Metnase is a human protein with methylase (SET) and nuclease domains that is widely expressed, especially in proliferating tissues. Metnase promotes non-homologous end-joining (NHEJ), and knockdown causes mild hypersensitivity to ionizing radiation. Metnase also promotes plasmid and viral DNA integration, and topoisomerase IIα (TopoIIα)-dependent chromosome decatenation. NHEJ factors have been implicated in the replication stress response, and TopoIIα has been proposed to relax positive supercoils in front of replication forks. Here we show that Metnase promotes cell proliferation, but it does not alter cell cycle distributions, or replication fork progression. However, Metnase knockdown sensitizes cells to replication stress and confers a marked defect in restart of stalled replication forks. Metnase promotes resolution of phosphorylated histone H2AX, a marker of DNA double-strand breaks at collapsed forks, and it co-immunoprecipitates with PCNA and RAD9, a member of the PCNA-like RAD9-HUS1-RAD1 intra-S checkpoint complex. Metnase also promotes TopoIIα-mediated relaxation of positively supercoiled DNA. Metnase is not required for RAD51 focus formation after replication stress, but Metnase knockdown cells show increased RAD51 foci in the presence or absence of replication stress. These results establish Metnase as a key factor that promotes restart of stalled replication forks, and implicate Metnase in the repair of collapsed forks.

Paper title : The structure of DNA-bound human topoisomerase II alpha: conformational mechanisms for coordinating inter-subunit interactions with DNA cleavage.

Doi : https://doi.org/10.1016/j.jmb.2012.07.014

Abstract : Type II topoisomerases are required for the management of DNA superhelicity and chromosome segregation, and serve as frontline targets for a variety of small-molecule therapeutics. To better understand how these enzymes act in both contexts, we determined the 2.9-Å-resolution structure of the DNA cleavage core of human topoisomerase IIα (TOP2A) bound to a doubly nicked, 30-bp duplex oligonucleotide. In accord with prior biochemical and structural studies, TOP2A significantly bends its DNA substrate using a bipartite, nucleolytic center formed at an N-terminal dimerization interface of the cleavage core. However, the protein also adopts a global conformation in which the second of its two inter-protomer contact points, one at the C-terminus, has separated. This finding, together with comparative structural analyses, reveals that the principal site of DNA engagement undergoes highly quantized conformational transitions between distinct binding, cleavage, and drug-inhibited states that correlate with the control of subunit-subunit interactions. Additional consideration of our TOP2A model in light of an etoposide-inhibited complex of human topoisomerase IIβ (TOP2B) suggests possible modification points for developing paralog-specific inhibitors to overcome the tendency of topoisomerase II-targeting chemotherapeutics to generate secondary malignancies.

Paper title : Interaction between glucose-regulated destruction domain of DNA topoisomerase IIalpha and MPN domain of Jab1/CSN5.

Doi : https://doi.org/10.1074/jbc.M401411200

Abstract : DNA topoisomerase (topo) IIalpha, an essential enzyme for cell proliferation, is targeted to a proteasome-dependent degradation pathway when human tumor cells are glucose-starved. Here we show that the topo IIalpha destabilization depends on the newly identified domain, GRDD (glucose-regulated destruction domain), which was mapped to the N-terminal 70-170 amino acid sequence. Indeed, the deletion of GRDD conferred a stable feature on topo IIalpha, whereas the fusion of GRDD rendered green fluorescent protein unstable under glucose starvation conditions. Nuclear localization was a prerequisite for GRDD function, because the inhibition of nuclear translocation resulted in the suppression of GRDD-mediated topo IIalpha degradation. Further, GRDD was identified as an interactive domain for Jab1/CSN5, which promoted the degradation of topo IIalpha in a manner dependent on the MPN (Mpr1p/Prd1p N terminus) domain. Depleting Jab1/CSN5 by antisense oligonucleotide and treating cells with the CSN-associated kinase inhibitor, curcumin, inhibited topo IIalpha degradation induced by glucose starvation. These findings demonstrate that GRDD can act as a stress-activated degron for regulating topo IIalpha stability, possibly through interaction with the MPN domain of Jab1/CSN5.

Paper title : Structural organization of the human TOP2A and TOP2B genes.

Doi : https://doi.org/10.1016/s0378-1119(98)00468-5

Abstract : Eucaryotic topoisomerase II is an essential nuclear enzyme involved in processes such as chromosome condensation, chromatid separation, and in the relief of torsional stress that occurs during DNA transcription and replication. In cells from vertebrate species, there are two forms of the enzyme, designated alpha and beta. Human topoisomerase IIalpha (TOP2A) is encoded by the TOP2A gene on chromosome 17q21-22, and human topoisomerase IIbeta (TOP2B) is encoded by the TOP2B gene on chromosome 3p24. The protein products of these two genes are important cellular targets of several drugs widely used in the treatment of many human cancers, and a variety of mutations in TOP2A have been associated with the development of drug resistance. In the present study, we have defined the intron-exon structures of TOP2A and TOP2B. TOP2A is approx. 30kb whereas TOP2B is at least 49kb. TOP2A and TOP2B contain 35 and 36 exons, respectively, and both genes contain a high proportion of class 0 introns. Alignment of the amino-acid sequences of the two proteins indicates that the intron-exon organization of the two genes is highly conserved, except for the regions encoding the extreme NH2 and COOH termini of the proteins. These findings suggest strongly that the vertebrate isoforms evolved by duplication of an ancestral gene. Mutations in TOP2A associated with drug resistance show clustering in exons 12, 13, 19-21 and 34-35. Knowledge of the genomic organization of TOP2A and TOP2B will be useful for detection of mutations in clinical samples from patients with drug-resistant malignant disease.

Paper title : The distribution and expression of the two isoforms of DNA topoisomerase II in normal and neoplastic human tissues.

Doi : https://doi.org/10.1038/bjc.1997.227

Abstract : In mammalian cells, there are two isoforms of DNA topoisomerase II, designated alpha (170-kDa form) and beta (180-kDa form). Previous studies using cell lines have shown that the topoisomerase IIalpha and beta isoforms are differentially regulated during the cell cycle and in response to changes in growth state. Moreover, both isoforms can act as targets for a range of anti-tumour drugs. Here, we have analysed the normal tissue distribution in humans of topoisomerase IIalpha and beta using isoform-specific antibodies. In addition, we have studied expression of these isoforms in 69 primary tumour biopsies, representative either of tumours that are responsive to topoisomerase II-targeting drugs (breast, lung, lymphoma and seminoma) or of those that show de novo drug resistance (colon). Topoisomerase IIalpha was expressed exclusively in the proliferating compartments of all normal tissues, and was detectable in both the cell nucleus and cytoplasm. In biologically aggressive or rapidly proliferating tumours (e.g. high-grade lymphomas and seminomas), there was a high level of topoisomerase IIalpha, although expression was still detectable in colon tumours, indicating that expression of this isoform is not sufficient to explain the intrinsic drug resistance of colon tumours. Topoisomerase IIbeta was expressed ubiquitously in vivo and was localized in both the nucleoli and the nucleoplasm. This isoform was present in quiescent cell populations, but was expressed at a generally higher level in all tumours and proliferating cells than in normal quiescent tissues. We conclude that topoisomerase IIalpha is a strict proliferation marker in normal and neoplastic cells in vivo, but that topoisomerase IIbeta has a much more general cell and tissue distribution than has topoisomerase IIalpha. The apparent up-regulation of topoisomerase IIbeta in neoplastic cells has implications for the response of patients to anti-tumour therapies that include topoisomerase II-targeting drugs.

Paper title : Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.

Doi : https://doi.org/10.1016/j.cell.2006.09.026

Abstract : Cell signaling mechanisms often transmit information via posttranslational protein modifications, most importantly reversible protein phosphorylation. Here we develop and apply a general mass spectrometric technology for identification and quantitation of phosphorylation sites as a function of stimulus, time, and subcellular location. We have detected 6,600 phosphorylation sites on 2,244 proteins and have determined their temporal dynamics after stimulating HeLa cells with epidermal growth factor (EGF) and recorded them in the Phosida database. Fourteen percent of phosphorylation sites are modulated at least 2-fold by EGF, and these were classified by their temporal profiles. Surprisingly, a majority of proteins contain multiple phosphorylation sites showing different kinetics, suggesting that they serve as platforms for integrating signals. In addition to protein kinase cascades, the targets of reversible phosphorylation include ubiquitin ligases, guanine nucleotide exchange factors, and at least 46 different transcriptional regulators. The dynamic phosphoproteome provides a missing link in a global, integrative view of cellular regulation.

Paper title : A quantitative atlas of mitotic phosphorylation.

Doi : https://doi.org/10.1073/pnas.0805139105

Abstract : The eukaryotic cell division cycle is characterized by a sequence of orderly and highly regulated events resulting in the duplication and separation of all cellular material into two newly formed daughter cells. Protein phosphorylation by cyclin-dependent kinases (CDKs) drives this cycle. To gain further insight into how phosphorylation regulates the cell cycle, we sought to identify proteins whose phosphorylation is cell cycle regulated. Using stable isotope labeling along with a two-step strategy for phosphopeptide enrichment and high mass accuracy mass spectrometry, we examined protein phosphorylation in a human cell line arrested in the G(1) and mitotic phases of the cell cycle. We report the identification of >14,000 different phosphorylation events, more than half of which, to our knowledge, have not been described in the literature, along with relative quantitative data for the majority of these sites. We observed >1,000 proteins with increased phosphorylation in mitosis including many known cell cycle regulators. The majority of sites on regulated phosphopeptides lie in [S/T]P motifs, the minimum required sequence for CDKs, suggesting that many of the proteins may be CDK substrates. Analysis of non-proline site-containing phosphopeptides identified two unique motifs that suggest there are at least two undiscovered mitotic kinases.

Paper title : System-wide Analysis of SUMOylation Dynamics in Response to Replication Stress Reveals Novel Small Ubiquitin-like Modified Target Proteins and Acceptor Lysines Relevant for Genome Stability.

Doi : https://doi.org/10.1074/mcp.O114.044792

Abstract : Genotoxic agents can cause replication fork stalling in dividing cells because of DNA lesions, eventually leading to replication fork collapse when the damage is not repaired. Small Ubiquitin-like Modifiers (SUMOs) are known to counteract replication stress, nevertheless, only a small number of relevant SUMO target proteins are known. To address this, we have purified and identified SUMO-2 target proteins regulated by replication stress in human cells. The developed methodology enabled single step purification of His10-SUMO-2 conjugates under denaturing conditions with high yield and high purity. Following statistical analysis on five biological replicates, a total of 566 SUMO-2 targets were identified. After 2 h of hydroxyurea treatment, 10 proteins were up-regulated for SUMOylation and two proteins were down-regulated for SUMOylation, whereas after 24 h, 35 proteins were up-regulated for SUMOylation, and 13 proteins were down-regulated for SUMOylation. A site-specific approach was used to map over 1000 SUMO-2 acceptor lysines in target proteins. The methodology is generic and is widely applicable in the ubiquitin field. A large subset of these identified proteins function in one network that consists of interacting replication factors, transcriptional regulators, DNA damage response factors including MDC1, ATR-interacting protein ATRIP, the Bloom syndrome protein and the BLM-binding partner RMI1, the crossover junction endonuclease EME1, BRCA1, and CHAF1A. Furthermore, centromeric proteins and signal transducers were dynamically regulated by SUMOylation upon replication stress. Our results uncover a comprehensive network of SUMO target proteins dealing with replication damage and provide a framework for detailed understanding of the role of SUMOylation to counteract replication stress. Ultimately, our study reveals how a post-translational modification is able to orchestrate a large variety of different proteins to integrate different nuclear processes with the aim of dealing with the induced DNA damage.

Paper title : Taperin (c9orf75), a mutated gene in nonsyndromic deafness, encodes a vertebrate specific, nuclear localized protein phosphatase one alpha (PP1α) docking protein.

Doi : https://doi.org/10.1242/bio.2011049

Abstract : The promiscuous activity of protein phosphatase one (PP1) is controlled in the cell by associated proteins termed regulatory or targeting subunits. Using biochemical and proteomic approaches we demonstrate that the autosomal recessive nonsyndromic hearing loss gene, taperin (C9orf75), encodes a protein that preferentially docks the alpha isoform of PP1. Taperin associates with PP1 through a classic 'RVxF' motif and suppresses the general phosphatase activity of the enzyme. The steady-state localization of taperin is predominantly nuclear, however we demonstrate here that the protein can shuttle between the nucleus and cytoplasm and that it is found complexed to PP1 in both of these cellular compartments. Although originally identified as a hearing loss gene, Western blot analyses with taperin-specific antibodies revealed that the protein is widely expressed across mammalian tissues as multiple splice variants. Taperin is a recent proteome addition appearing during the vertebrate lineage with the PP1 binding site embedded within the most conserved region of the protein. Taperin also shares an ancestral relationship with the cytosolic actin binding protein phostensin, another PP1 interacting partner. Quantitative Stable Isotope Labeling by Amino acids in Culture (SILAC)-based mass spectrometry was employed to uncover additional taperin binding partners, and revealed an interaction with the DNA damage response proteins Ku70, Ku80, PARP and topoisomerases I and IIα. Consistent with this, we demonstrate the active recruitment of taperin to sites of DNA damage. This makes taperin a new addition to the family of PP1 targeting subunits involved in the DNA damage repair pathway.

Paper title : System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation.

Doi : https://doi.org/10.1126/scisignal.2001570

Abstract : To elucidate cellular events underlying the pluripotency of human embryonic stem cells (hESCs), we performed parallel quantitative proteomic and phosphoproteomic analyses of hESCs during differentiation initiated by a diacylglycerol analog or transfer to media that had not been conditioned by feeder cells. We profiled 6521 proteins and 23,522 phosphorylation sites, of which almost 50% displayed dynamic changes in phosphorylation status during 24 hours of differentiation. These data are a resource for studies of the events associated with the maintenance of hESC pluripotency and those accompanying their differentiation. From these data, we identified a core hESC phosphoproteome of sites with similar robust changes in response to the two distinct treatments. These sites exhibited distinct dynamic phosphorylation patterns, which were linked to known or predicted kinases on the basis of the matching sequence motif. In addition to identifying previously unknown phosphorylation sites on factors associated with differentiation, such as kinases and transcription factors, we observed dynamic phosphorylation of DNA methyltransferases (DNMTs). We found a specific interaction of DNMTs during early differentiation with the PAF1 (polymerase-associated factor 1) transcriptional elongation complex, which binds to promoters of the pluripotency and known DNMT target genes encoding OCT4 and NANOG, thereby providing a possible molecular link for the silencing of these genes during differentiation.

Paper title : Nuclear interactions of topoisomerase II alpha and beta with phospholipid scramblase 1.

Doi : https://doi.org/10.1093/nar/gkm434

Abstract : DNA topoisomerase (topo) II modulates DNA topology and is essential for cell division. There are two isoforms of topo II (alpha and beta) that have limited functional redundancy, although their catalytic mechanisms appear the same. Using their COOH-terminal domains (CTDs) in yeast two-hybrid analysis, we have identified phospholipid scramblase 1 (PLSCR1) as a binding partner of both topo II alpha and beta. Although predominantly a plasma membrane protein involved in phosphatidylserine externalization, PLSCR1 can also be imported into the nucleus where it may have a tumour suppressor function. The interactions of PLSCR1 and topo II were confirmed by pull-down assays with topo II alpha and beta CTD fusion proteins and endogenous PLSCR1, and by co-immunoprecipitation of endogenous PLSCR1 and topo II alpha and beta from HeLa cell nuclear extracts. PLSCR1 also increased the decatenation activity of human topo IIalpha. A conserved basic sequence in the CTD of topo IIalpha was identified as being essential for binding to PLSCR1 and binding of the two proteins could be inhibited by a synthetic peptide corresponding to topo IIalpha amino acids 1430-1441. These studies reveal for the first time a physical and functional interaction between topo II and PLSCR1.

Paper title : SUMO-2 Orchestrates Chromatin Modifiers in Response to DNA Damage.

Doi : https://doi.org/10.1016/j.celrep.2015.02.033

Abstract : Small ubiquitin-like modifiers play critical roles in the DNA damage response (DDR). To increase our understanding of SUMOylation in the mammalian DDR, we employed a quantitative proteomics approach in order to identify dynamically regulated SUMO-2 conjugates and modification sites upon treatment with the DNA damaging agent methyl methanesulfonate (MMS). We have uncovered a dynamic set of 20 upregulated and 33 downregulated SUMO-2 conjugates, and 755 SUMO-2 sites, of which 362 were dynamic in response to MMS. In contrast to yeast, where a response is centered on homologous recombination, we identified dynamically SUMOylated interaction networks of chromatin modifiers, transcription factors, DNA repair factors, and nuclear body components. SUMOylated chromatin modifiers include JARID1B/KDM5B, JARID1C/KDM5C, p300, CBP, PARP1, SetDB1, and MBD1. Whereas SUMOylated JARID1B was ubiquitylated by the SUMO-targeted ubiquitin ligase RNF4 and degraded by the proteasome in response to DNA damage, JARID1C was SUMOylated and recruited to the chromatin to demethylate histone H3K4.

Paper title : Initial characterization of the human central proteome.

Doi : https://doi.org/10.1186/1752-0509-5-17

Abstract : BACKGROUND: On the basis of large proteomics datasets measured from seven human cell lines we consider their intersection as an approximation of the human central proteome, which is the set of proteins ubiquitously expressed in all human cells. Composition and properties of the central proteome are investigated through bioinformatics analyses. RESULTS: We experimentally identify a central proteome comprising 1,124 proteins that are ubiquitously and abundantly expressed in human cells using state of the art mass spectrometry and protein identification bioinformatics. The main represented functions are proteostasis, primary metabolism and proliferation. We further characterize the central proteome considering gene structures, conservation, interaction networks, pathways, drug targets, and coordination of biological processes. Among other new findings, we show that the central proteome is encoded by exon-rich genes, indicating an increased regulatory flexibility through alternative splicing to adapt to multiple environments, and that the protein interaction network linking the central proteome is very efficient for synchronizing translation with other biological processes. Surprisingly, at least 10% of the central proteome has no or very limited functional annotation. CONCLUSIONS: Our data and analysis provide a new and deeper description of the human central proteome compared to previous results thereby extending and complementing our knowledge of commonly expressed human proteins. All the data are made publicly available to help other researchers who, for instance, need to compare or link focused datasets to a common background.

Paper title : GANP regulates recruitment of AID to immunoglobulin variable regions by modulating transcription and nucleosome occupancy.

Doi : https://doi.org/10.1038/ncomms2823

Abstract : Somatic hypermutation in B cells is initiated by activation-induced cytidine deaminase-catalyzed C→U deamination at immunoglobulin variable regions. Here we investigate the role of the germinal centre-associated nuclear protein (GANP) in enhancing the access of activation-induced cytidine deaminase (AID) to immunoglobulin variable regions. We show that the nuclear export factor GANP is involved in chromatin modification at rearranged immunoglobulin variable loci, and its activity requires a histone acetyltransferase domain. GANP interacts with the transcription stalling protein Spt5 and facilitates RNA Pol-II recruitment to immunoglobulin variable regions. Germinal centre B cells from ganp-transgenic mice showed a higher AID occupancy at the immunoglobulin variable region, whereas B cells from conditional ganp-knockout mice exhibit a lower AID accessibility. These findings suggest that GANP-mediated chromatin modification promotes transcription complex recruitment and positioning at immunoglobulin variable loci to favour AID targeting.

Paper title : Expression of a mutant DNA topoisomerase II in CCRF-CEM human leukemic cells selected for resistance to teniposide.

Doi : https://doi.org/10.1073/pnas.88.17.7654

Abstract : Nuclear extracts from teniposide (VM-26)-resistant sublines of the human leukemic cell line CCRF-CEM have decreased levels of DNA topoisomerase II catalytic activity and decreased capacity to form drug-stabilized covalent protein-DNA complexes. The ATP concentration required for equivalent activity in a DNA-unknotting assay is 2- to 8-fold higher in nuclear extracts from drug-resistant cell lines as compared with the parental line. When adenosine 5'-[beta,gamma-imido]triphosphate is substituted for ATP in complex-formation assays, no significant change is seen with drug-sensitive cells, but a 50-65% reduction is seen with VM-26-resistant cells. Collectively, these results indicate that an alteration in ATP binding may be involved in the resistance phenotype. Therefore, we identified regions of the topoisomerase II sequence that conform to previously identified nucleotide-binding sites. Starting with cDNA as the template we determined the sequence of the topoisomerase II mRNA surrounding these sites by sequencing DNA fragments produced by the polymerase chain reaction. In the region corresponding to the consensus B ATP-binding sequence described by Walker et al. [Walker, J. E., Saraste, M., Runswick, M. J. & Gay, N. J. (1982) EMBO J. 1, 945-951], the cDNA from the two VM-26-resistant sublines contained an altered sequence having a G----A base change. This base substitution results in the replacement of the conserved arginine at position 449 with a glutamine. Hybridization with allele-specific oligonucleotides confirmed the presence of both the normal and the altered sequence in the resistant cell lines, whereas only the normal sequence was found in the sensitive CEM cells. A chemical mismatch cleavage procedure for the detection of mispaired bases in DNA duplexes identified no other alterations in the 5' third of the mRNA coding sequence, which contains the complete ATP-binding domain of topoisomerase II. The presence of mRNA encoding topoisomerase II with Gln449 correlates both with the presence of a topoisomerase II protein whose interaction with ATP is altered and with increased resistance to the cytotoxicity of VM-26.

Paper title : Cloning and sequencing of cDNA encoding human DNA topoisomerase II and localization of the gene to chromosome region 17q21-22.

Doi : https://doi.org/10.1073/pnas.85.19.7177

Abstract : Two overlapping cDNA clones encoding human DNA topoisomerase II were identified by two independent methods. In one, a human cDNA library in phage lambda was screened by hybridization with a mixed oligonucleotide probe encoding a stretch of seven amino acids found in yeast and Drosophila DNA topoisomerase II; in the other, a different human cDNA library in a lambda gt11 expression vector was screened for the expression of antigenic determinants that are recognized by rabbit antibodies specific to human DNA topoisomerase II. The entire coding sequences of the human DNA topoisomerase II gene were determined from these and several additional clones, identified through the use of the cloned human TOP2 gene sequences as probes. Hybridization between the cloned sequences and mRNA and genomic DNA indicates that the human enzyme is encoded by a single-copy gene. The location of the gene was mapped to chromosome 17q21-22 by in situ hybridization of a cloned fragment to metaphase chromosomes and by hybridization analysis with a panel of mouse-human hybrid cell lines, each retaining a subset of human chromosomes.

Paper title : Molecular cloning and characterization of the human topoisomerase IIalpha and IIbeta genes: evidence for isoform evolution through gene duplication.

Doi : https://doi.org/10.1016/s0167-4781(99)00020-2

Abstract : Human DNA topoisomerase II is essential for chromosome segregation and is the target for several clinically important anticancer agents. It is expressed as genetically distinct alpha and beta isoforms encoded by the TOP2alpha and TOP2beta genes that map to chromosomes 17q21-22 and 3p24, respectively. The genes display different patterns of cell cycle- and tissue-specific expression, with the alpha isoform markedly upregulated in proliferating cells. In addition to the fundamental role of TOP2alpha and TOP2beta genes in cell growth and development, altered expression and rearrangement of both genes are implicated in anticancer drug resistance. Here, we report the complete structure of the human topoisomerase IIalpha gene, which consists of 35 exons spanning 27.5 kb. Sequence data for the exon-intron boundaries were determined and examined in the context of topoisomerase IIalpha protein structure comprising three functional domains associated with energy transduction, DNA breakage-reunion activity and nuclear localization. The organization of the 3' half of human TOP2beta, including sequence specifying the C-terminal nuclear localization domain, was also elucidated. Of the 15 introns identified in this 20 kb region of TOP2beta, the first nine and the last intron align in identical positions and display the same phases as introns in TOP2alpha. Though their extreme 3' ends differ, the striking conservation suggests the two genes diverged recently in evolutionary terms consistent with a gene duplication event. Access to TOP2alpha and TOP2beta gene structures should aid studies of mutations and gene rearrangements associated with anticancer drug resistance.

Paper title : RECQL5 cooperates with Topoisomerase II alpha in DNA decatenation and cell cycle progression.

Doi : https://doi.org/10.1093/nar/gkr844

Abstract : DNA decatenation mediated by Topoisomerase II is required to separate the interlinked sister chromatids post-replication. SGS1, a yeast homolog of the human RecQ family of helicases interacts with Topoisomerase II and plays a role in chromosome segregation, but this functional interaction has yet to be identified in higher organisms. Here, we report a physical and functional interaction of Topoisomerase IIα with RECQL5, one of five mammalian RecQ helicases, during DNA replication. Direct interaction of RECQL5 with Topoisomerase IIα stimulates the decatenation activity of Topoisomerase IIα. Consistent with these observations, RECQL5 co-localizes with Topoisomerase IIα during S-phase of the cell cycle. Moreover, cells with stable depletions of RECQL5 display a slow proliferation rate, a G2/M cell cycle arrest and late S-phase cycling defects. Metaphase spreads generated from RECQL5-depleted cells exhibit undercondensed and entangled chromosomes. Further, RECQL5-depleted cells activate a G2/M checkpoint and undergo apoptosis. These phenotypes are similar to those observed when Topoisomerase II catalytic activity is inhibited. These results reveal an important role for RECQL5 in the maintenance of genomic stability and a new insight into the decatenation process.

Paper title : A probability-based approach for high-throughput protein phosphorylation analysis and site localization.

Doi : https://doi.org/10.1038/nbt1240

Abstract : Data analysis and interpretation remain major logistical challenges when attempting to identify large numbers of protein phosphorylation sites by nanoscale reverse-phase liquid chromatography/tandem mass spectrometry (LC-MS/MS) (Supplementary Figure 1 online). In this report we address challenges that are often only addressable by laborious manual validation, including data set error, data set sensitivity and phosphorylation site localization. We provide a large-scale phosphorylation data set with a measured error rate as determined by the target-decoy approach, we demonstrate an approach to maximize data set sensitivity by efficiently distracting incorrect peptide spectral matches (PSMs), and we present a probability-based score, the Ascore, that measures the probability of correct phosphorylation site localization based on the presence and intensity of site-determining ions in MS/MS spectra. We applied our methods in a fully automated fashion to nocodazole-arrested HeLa cell lysate where we identified 1,761 nonredundant phosphorylation sites from 491 proteins with a peptide false-positive rate of 1.3%.

Paper title : Use of yeast in the study of anticancer drugs targeting DNA topoisomerases: expression of a functional recombinant human DNA topoisomerase II alpha in yeast.

Doi : https://doi.org/Not available

Abstract : A plasmid was constructed for the expression of human DNA topoisomerase II alpha in yeast from a galactose-inducible promoter of the yeast GAL1 gene. Expression of a recombinant human enzyme, in which the first 28 of the 1531 codons of human DNA topoisomerase II alpha were replaced by the first five codons of yeast DNA topoisomerase II, was shown to rescue the lethal phenotype of thermal sensitive yeast DNA topoisomerase II mutants at 35 degrees C. Purification of the human enzyme overexpressed in yeast yielded a single polypeptide with an apparent mass of 170 kDa, and the properties of the purified recombinant enzyme were found to be the same as those reported for human DNA topoisomerase II alpha purified from HeLa cells. Studies with the anticancer drug amsacrine indicated that the human enzyme, either inside yeast cells or in its purified form, is a target of the drug; inhibition of the purified enzyme by teniposide (VM-26) and merbarone was also demonstrated. These studies demonstrate that yeast strains expressing human DNA topoisomerase II alpha provide a convenient system for studying drugs targeting the enzyme; unlike mammalian systems, potential complications due to the presence of human DNA topoisomerase II beta can be eliminated in this system. Overexpression of human DNA topoisomerase II alpha in yeast also provides a convenient source of the enzyme for in vitro studies.

Paper title : Use of divalent metal ions in the dna cleavage reaction of human type II topoisomerases.

Doi : https://doi.org/10.1021/bi8023256

Abstract : All type II topoisomerases require divalent metal ions to cleave and ligate DNA. To further elucidate the mechanistic basis for these critical enzyme-mediated events, the role of the metal ion in the DNA cleavage reaction of human topoisomerase IIbeta was characterized and compared to that of topoisomerase IIalpha. This study utilized divalent metal ions with varying thiophilicities in conjunction with DNA cleavage substrates that substituted a sulfur atom for the 3'-bridging oxygen or the nonbridging oxygens of the scissile phosphate. On the basis of time courses of DNA cleavage, cation titrations, and metal ion mixing experiments, we propose the following model for the use of divalent metal ions by human type II topoisomerases. First, both enzymes employ a two-metal ion mechanism to support DNA cleavage. Second, an interaction between one divalent metal ion and the 3'-bridging atom of the scissile phosphate greatly enhances enzyme-mediated DNA cleavage, most likely by stabilizing the leaving 3'-oxygen. Third, there is an important interaction between a divalent second metal ion and a nonbridging atom of the scissile phosphate that stimulates DNA cleavage mediated by topoisomerase IIbeta. If this interaction exists in topoisomerase IIalpha, its effects on DNA cleavage are equivocal. This last aspect of the model highlights a difference in metal ion utilization during DNA cleavage mediated by human topoisomerase IIalpha and IIbeta.

Paper title : RNA helicase A interacts with dsDNA and topoisomerase IIalpha.

Doi : https://doi.org/10.1093/nar/gkg328

Abstract : RNA helicase A (RHA) is a multifunctional protein involved in various nuclear processes such as transcription and RNA export. It is believed that the interacting factors play important roles in determining the functional specificity of RHA. Here we show that RHA directly interacts with double-stranded (ds) nucleic acids (NAs) and assembles complexes with topoisomerase IIalpha. First, electrophoresis mobility shift assays demonstrate that RHA interacts with dsDNAs of different lengths ranging from 15 to 104 bp. Secondly, the binding of RHA to closed circular dsDNA stimulates the relaxation reaction catalyzed by either calf thymus topoisomerase I or HeLa topoisomerase IIalpha. Thirdly, immunoprecipitation, coupled with western blot analysis using anti-RHA and anti-topoisomerase IIalpha antibodies, shows that RHA and topoisomerase IIalpha assemble a complex in the presence of as yet unknown RNA molecules and additional protein factors such as Ubc9. Our observation suggests physical and functional interaction between RHA and topoisomerase IIalpha, which, perhaps, play important roles in regulating chromatin structure. The putative role of RHA-topoisomerase IIalpha complex in RNA polymerase II-mediated transcription is discussed.

Paper title : Plk3 phosphorylates topoisomerase IIalpha at Thr(1342), a site that is not recognized by Plk1.

Doi : https://doi.org/10.1042/BJ20071394

Abstract : The Plk (polo-like kinase) family is involved in cell-cycle machinery. Despite the possible overlapping involvement of Plk1 and Plk3 in cell-cycle distribution, the precise role of each Plk might be different. To investigate mechanisms that may differentiate their physiological roles, we compared the substrate specificities of Plk1 and Plk3 using synthetic peptides. Among these substrate peptides, topoisomerase IIalpha EKT(1342)DDE-containing synthetic peptide was strongly phosphorylated by Plk3 but not by Plk1. By modulating the topoisomerase IIalpha peptide, we identified residues at positions +1, +2 and +4 as determinants of differential substrate recognition between Plk1 and Plk3. Acidic residues at positions +2 and +4 appear to be a positive determinant for Plk3 but not Plk1. Variation at position +1 appears to be tolerated by Plk3, while a hydrophobic residue at +1 is critical for Plk1 activity. The direct phosphorylation of Thr(1342) of topoisomerase IIalpha by Plk3 was demonstrated with an in vitro kinase assay, and overexpression of Plk3 induced the phosphorylation of Thr(1342) in cellular topoisomerase IIalpha. Furthermore, the physical interaction between Plk3 and topoisomerase IIalpha was also demonstrated in cells in addition to phosphorylation. These data suggest that topoisomerase IIalpha is a novel physiological substrate for Plk3 and that Plk1 and Plk3 play different roles in cell-cycle regulation.

Paper title : Improved titanium dioxide enrichment of phosphopeptides from HeLa cells and high confident phosphopeptide identification by cross-validation of MS/MS and MS/MS/MS spectra.

Doi : https://doi.org/10.1021/pr070152u

Abstract : Enrichment is essential for phosphoproteome analysis because phosphorylated proteins are usually present in cells in low abundance. Recently, titanium dioxide (TiO2) has been demonstrated to enrich phosphopeptides from simple peptide mixtures with high specificity; however, the technology has not been optimized. In the present study, significant non-specific bindings were observed when proteome samples were applied to TiO2 columns. Column wash with an NH4Glu solution after loading peptide mixtures significantly increased the efficiency of TiO2 phosphopeptide enrichment with a recovery of up to 84%. Also, for proteome samples, more than a 2-fold increase in unique phosphopeptide identifications has been achieved. The use of NH4Glu for a TiO2 column wash does not significantly reduce the phosphopeptide recovery. A total of 858 phosphopeptides corresponding to 1034 distinct phosphosites has been identified from HeLa cells using the improved TiO2 enrichment procedure in combination with data-dependent neutral loss nano-RPLC-MS2-MS3 analysis. While 41 and 35% of the phosphopeptides were identified only by MS2 and MS3, respectively, 24% was identified by both MS2 and MS3. Cross-validation of the phosphopeptide assignment by MS2 and MS3 scans resulted in the highest confidence in identification (99.5%). Many phosphosites identified in this study appear to be novel, including sites from antigen Ki-67, nucleolar phosphoprotein p130, and Treacle protein. The study also indicates that evaluation of confidence levels for phosphopeptide identification via the reversed sequence database searching strategy might underestimate the false positive rate.

Paper title : Toward a comprehensive characterization of a human cancer cell phosphoproteome.

Doi : https://doi.org/10.1021/pr300630k

Abstract : Mass spectrometry (MS)-based phosphoproteomics has achieved extraordinary success in qualitative and quantitative analysis of cellular protein phosphorylation. Considering that an estimated level of phosphorylation in a cell is placed at well above 100,000 sites, there is still much room for improvement. Here, we attempt to extend the depth of phosphoproteome coverage while maintaining realistic aspirations in terms of available material, robustness, and instrument running time. We developed three strategies, where each provided a different balance between these three key parameters. The first strategy simply used enrichment by Ti(4+)-IMAC followed by reversed chromatography LC-MS (termed 1D). The second strategy incorporated an additional fractionation step through the use of HILIC (2D). Finally, a third strategy was designed employing first an SCX fractionation, followed by Ti(4+)-IMAC enrichment and additional fractionation by HILIC (3D). A preliminary evaluation was performed on the HeLa cell line. Detecting 3700 phosphopeptides in about 2 h, the 1D strategy was found to be the most sensitive but limited in comprehensivity, mainly due to issues with complexity and dynamic range. Overall, the best balance was achieved using the 2D based strategy, identifying close to 17,000 phosphopeptides with less than 1 mg of material in about 48 h. Subsequently, we confirmed the findings with the K562 cell sample. When sufficient material was available, the 3D strategy increased phosphoproteome allowing over 22,000 unique phosphopeptides to be identified. Unfortunately, the 3D strategy required more time and over 1 mg of material before it started to outperform 2D. Ultimately, combining all strategies, we were able to identify over 16,000 and nearly 24,000 unique phosphorylation sites from the cancer cell lines HeLa and K562, respectively. In summary, we demonstrate the need to carry out extensive fractionation for deep mining of the phosphoproteome and provide a guide for appropriate strategies depending on sample amount and/or analysis time.

Paper title : Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach.

Doi : https://doi.org/10.1021/ac9004309

Abstract : The analysis of proteome-wide phosphorylation events is still a major analytical challenge because of the enormous complexity of protein phosphorylation networks. In this work, we evaluate the complementarity of Lys-N, Lys-C, and trypsin with regard to their ability to contribute to the global analysis of the phosphoproteome. A refined version of low-pH strong cation exchange was used to efficiently separate N-terminally acetylated, phosphorylated, and nonmodified peptides. A total of 5036 nonredundant phosphopeptides could be identified with a false discovery rate of <1% from 1 mg of protein using a combination of the three enzymes. Our data revealed that the overlap between the phosphopeptide data sets generated with different proteases was marginal, whereas the overlap between two similarly generated tryptic data sets was found to be at least 4 times higher. In this way, the parallel use of Lys-N and trypsin enabled a 72% increase in the number of detected phosphopeptides as compared to trypsin alone, whereas a trypsin replicate experiment only led to a 25% increase. Thus, when focusing solely on the trypsin and Lys-N data, we identified 4671 nonredundant phosphopeptides. Further analysis of the detected sites showed that the Lys-N and trypsin data sets were enriched in significantly different phosphorylation motifs, further evidencing that multiprotease approaches are very valuable in phosphoproteome analyses.

Paper title : Discrete localization of different DNA topoisomerases in HeLa and K562 cell nuclei and subnuclear fractions.

Doi : https://doi.org/10.1006/excr.1994.1046

Abstract : Monoclonal antibodies raised against DNA topoisomerase I and against topoisomerase II alpha and beta isoforms, which have been previously demonstrated to be highly specific and capable of detecting cell cycle-related variations of the topoisomerase II isoforms (Negri et al., 1992, Exp. Cell Res. 200, 452-459), have been utilized for a fine subcellular localization. Immunocytochemistry by confocal and electron microscopy have been used for a topological and quantitative evaluation of the fine distribution of the different topoisomerases in HeLa and K562 cells. Topoisomerase I and topoisomerase II alpha are present both in the nucleoplasm and in the nucleolus, though at different relative ratios, while topoisomerase II beta is exclusively present at the nucleolar level. This is further confirmed by immunoblotting and immunocytochemical quantitative evaluations performed on purified nuclear matrix fractions obtained from K562 cells. In fact, the amount of topoisomerase I and topoisomerase II alpha present in the whole cell nuclei is partly lost in isolated nuclei but, while topoisomerase I is further significantly reduced in nuclear matrix preparations, the topoisomerase II alpha content is only slightly decreased. On the other hand, the great majority of topoisomerase II beta is retained in the nuclear matrix and can be detected exclusively in association with the nucleolar remnant. These results are consistent with specific functional roles hypothesized for the different topoisomerase types.

Paper title : The Epstein-Barr virus deubiquitinating enzyme BPLF1 regulates the activity of topoisomerase II during productive infection.

Doi : https://doi.org/10.1371/journal.ppat.1009954

Abstract : Topoisomerases are essential for the replication of herpesviruses but the mechanisms by which the viruses hijack the cellular enzymes are largely unknown. We found that topoisomerase-II (TOP2) is a substrate of the Epstein-Barr virus (EBV) ubiquitin deconjugase BPLF1. BPLF1 co-immunoprecipitated and deubiquitinated TOP2, and stabilized SUMOylated TOP2 trapped in cleavage complexes (TOP2ccs), which halted the DNA damage response to TOP2-induced double strand DNA breaks and promoted cell survival. Induction of the productive virus cycle in epithelial and lymphoid cell lines carrying recombinant EBV encoding the active enzyme was accompanied by TOP2 deubiquitination, accumulation of TOP2ccs and resistance to Etoposide toxicity. The protective effect of BPLF1 was dependent on the expression of tyrosyl-DNA phosphodiesterase 2 (TDP2) that releases DNA-trapped TOP2 and promotes error-free DNA repair. These findings highlight a previously unrecognized function of BPLF1 in supporting a non-proteolytic pathway for TOP2ccs debulking that favors cell survival and virus production.

Paper title : Identification of Novel Proteins Co-Purifying with Cockayne Syndrome Group B (CSB) Reveals Potential Roles for CSB in RNA Metabolism and Chromatin Dynamics.

Doi : https://doi.org/10.1371/journal.pone.0128558

Abstract : The CSB protein, a member of the SWI/SNF ATP dependent chromatin remodeling family of proteins, plays a role in a sub-pathway of nucleotide excision repair (NER) known as transcription coupled repair (TCR). CSB is frequently mutated in Cockayne syndrome group B, a segmental progeroid human autosomal recessive disease characterized by growth failure and degeneration of multiple organs. Though initially classified as a DNA repair protein, recent studies have demonstrated that the loss of CSB results in pleiotropic effects. Identification of novel proteins belonging to the CSB interactome may be useful not only for predicting the molecular basis for diverse pathological symptoms of CS-B patients but also for unraveling the functions of CSB in addition to its authentic role in DNA repair. In this study, we performed tandem affinity purification (TAP) technology coupled with mass spectrometry and co-immunoprecipitation studies to identify and characterize the proteins that potentially interact with CSB-TAP. Our approach revealed 33 proteins that were not previously known to interact with CSB. These newly identified proteins indicate potential roles for CSB in RNA metabolism involving repression and activation of transcription process and in the maintenance of chromatin dynamics and integrity.