dbACP: A Comprehensive Database of Anti-Cancer Peptides

dbacp02797

General Description

Peptide name : DNAtopoisomerase 2-beta

Source/Organism : Human

Linear/Cyclic : Not found

Chirality : Not found

Sequence Information

Sequence : MAKSGGCGAGAGVGGGNGALTWVTLFDQNNAAKKEESETANKNDSSKKLSVERVYQKKTQLEHILLRPDTYIGSVEPLTQFMWVYDEDVGMNCREVTFVPGLYKIFDEILVNAADNKQRDKnMTCIKVSIDPESNIISIWNNGKGIPVVEHKVEKVYVPALIFGQLLTSSNYDDDEKKVTGGRNGYGAKLCNIFSTKFTVETACKEYKHSFKQTWMNnMMKTSEAKIKHFDGEDYTCITFQPDLSKFKMEKLDKDIVALMTRRAYDLAGSCRGVKVMFNGKKLPVNGFRSYVDLYVKDKLDETGVALKVIHELANERWDVCLTLSEKGFQQISFVNSIATTKGGRHVDYVVDQVVGKLIEVVKKKNKAGVSVKPFQVKNHIWVFINCLIENPTFDSQTKEnMTLQPKSFGSKCQLSEKFFKAASNCGIVESILNWVKFKAQTQLNKKCSSVKYSKIKGIPKLDDANDAGGKHSLECTLILTEGDSAKSLAVSGLGVIGRDRYGVFPLRGKILNVREASHKQIMENAEINNIIKIVGLQYKKSYDDAESLKTLRYGKIMIMTDQDQDGSHIKGLLINFIHHNWPSLLKHGFLEEFITPIVKASKNKQELSFYSIPEFDEWKKHIENQKAWKIKYYKGLGTSTAKEAKEYFADMERHRILFRYAGPEDDAAITLAFSKKKIDDRKEWLTNFMEDRRQRRLHGLPEQFLYGTATKHLTYNDFINKELILFSNSDNERSIPSLVDGFKPGQRKVLFTCFKRNDKREVKVAQLAGSVAEMSAYHHGEQALMMTIVNLAQNFVGSNNINLLQPIGQFGTRLHGGKDAASPRYIFTMLSTLARLLFPAVDDNLLKFLYDDNQRVEPEWYIPIIPMVLINGAEGIGTGWACKLPNYDAREIVNNVRRMLDGLDPHPMLPNYKNFKGTIQELGQNQYAVSGEIFVVDRNTVEITELPVRTWTQVYKEQVLEPMLNGTDKTPALISDYKEYHTDTTVKFVVKMTEEKLAQAEAAGLHKVFKLQTTLTCNSMVLFDHMGCLKKYETVQDILKEFFDLRLSYYGLRKEWLVGMLGAESTKLNNQARFILEKIQGKITIENRSKKDLIQMLVQRGYESDPVKAWKEAQEKAAEEDETQNQHDDSSSDSGTPSGPDFNYILnMSLWSLTKEKVEELIKQRDAKGREVNDLKRKSPSDLWKEDLAAFVEELDKVESQEREDVLAGMSGKAIKGKVGKPKVKKLQLEETMPSPYGRRIIPEITAMKADASKKLLKKKKGDLDTAAVKVEFDEEFSGAPVEGAGEEALTPSVPINKGPKPKREKKEPGTRVRKTPTSSGKPSAKKVKKRNPWSDDESKSESDLEETEPVVIPRDSLLRRAAAERPKYTFDFSEEEDDDADDDDDDNNDLEELKVKASPITNDGEDEFVPSDGLDKDEYTFSPGKSKATPEKSLHDKKSQDFGNLFSFPSYSQKSEDDSAKFDSNEEDSASVFSPSFGLKQTDKVPSKTVAAKKGKPSSDTVPKPKRAPKQKKVVEAVNSDSDSEFGIPKKTTTPKGKGRGAKKRKASGSENEGDYNPGRKTSKTTSKKPKKTSFDQDSDVDIFPSDFPTEPPSLPRTGRARKEVKYFAESDEEEDDVDFAMFN

Peptide length: 1626

C-terminal modification: Not found

N-terminal modification : Not found

Non-natural peptide information: None

Activity Information

Assay type : Antibody-based assay

Assay time : 48h

Activity : Not found

Cell line : HEK293

Cancer type : Not specified

Other activity : Not found

Physicochemical Properties

Amino acid composition bar chart :

Molecular mass : 183264.957 Dalton

Aliphatic index : 0.732

Instability index : 36.154

Hydrophobicity (GRAVY) : -0.653

Isoelectric point : 8.1431

Charge (pH 7) : 7.8515

Aromaticity : 0.084

Molar extinction coefficient (cysteine, cystine): (173040, 174040)

Hydrophobic/hydrophilic ratio : 0.80422691

hydrophobic moment : 0.0554

Missing amino acid : None

Most occurring amino acid : K

Most occurring amino acid frequency : 187

Least occurring amino acid : n

Least occurring amino acid frequency : 4

Structural Information

3D structure : Not Available

Secondary structure fraction (Helix, Turn, Sheet): (0.3, 0.3, 0.3)

SMILES Notation: CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)CNC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H](C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)CNC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CS)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1ccccc1)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CS)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CS)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCSC)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCSC)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)CNC(=O)CNC(=O)[C@@H](NC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CS)NC(=O)CNC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(=O)NCC(=O)N[C@H](C(=O)NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1ccccc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(N)=O)C(=O)O)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O

Secondary Structure :

Method Prediction
GOR HHTTTCCCEEEEEECCCCEEEEEEEHHHHHHHHHHHHHHHHHHHTTTTHHHHHHHHTTHHHHEEEECTTCEEECCCCHHEEEEEETTTHTHTHHTEECCTTCCEEEHHHHHHHHHHHHHTTTHEEEEECCCCTTEEEEEETTTTCCEHEHHHHHHHECCHEEEEEEETTTCCCTTHTEEETCCCCCTTEEEHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHECTTTTTEEEECCCCHHHHHHHHHHHHHHHHHHHHHCTTTTTHTTHEEEETTTTCCTTTCEEEEEHHHTTTHHHHHHHHHHHHHHHHHHHHHHHHHHHTTCEEEEEEEEEEEETTCEEEEEEEEHEHHHHHHHHHHHHHTTEEECHHHHTTCEEEEETTHCTCCCCCHTHTTHHHHCCCCTTCTTHHHHHHHHHTTTTCEEEHHHHHHHHHHHHHHTTTTHHHTHTEETCCCCCCHHHHTTTCCTTTEEEEEEHCCHHHHEEEEEEEEEEEETTTCCCETTTEHHHHHHHHHHHHHHHHHHHHHHEEEHHHTTCCCHHHHHHHHHTTEEEEEEHHTTTTCEETTEEEEEEETTCTTHHTTTHHHHHHCHHHHHHTTTTHHEETCCHHHHHHHHHHHHHHHHHHHEETTTCCCHHHHHHHHHHHHHHHHHHHETTCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTCCTHHEETTETTEEECCTHHHHHHEEETTTTTTTCCCEEEETCCTTCTTEEEHHHHTTTHHHHHHHHHHTHHHHHHHHHHTHHHHHEEEHHHHHEEETCCCCCEECCCCEEEEEETTCCCTTCTHEEEEEHHHHHHHECCCCCTTHHEEEETTTTTCCTTTECCCCEEEEETTCTEEEEEEETTCTCCTHHHHHHHHHHEETTCCCCCCCCTCTTTCCEEEHHTCCCCEETTEEEEEHTTTCEEEECEEHEEEHHHHHHHHCHHETTCCCCCCEEHTTTTTTCTHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEETTTTEEEHHHHTHHTTHHHHHHHHHHHHHHHHTTTTHHHHHHEHHHHHHHHHHTHHHHHHHHHHHTHHHHHHHHTHHHHHEEEETCCTTCHHHHHHHHHHHHHHHHHHHTCCCTTTETTCCCCCCCCEEEEEHHHHHHHHHHHHHHHHHHHHTHHHHHHHHTTCCTTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHTTCCCCTCCEECHHHHHHHHHHHHHHHHHTTTCHHHHHHHHHHHHHHTTCCHTTTHHHHHCCCCCCCTCCCCHHTTTCTTCEEEECCCCTCCCHHHHHTTTCTCCCHTTHTTTHHHTHCHEECCCTHHHHHHHHHCTTCCCHHHHHHHHHHCCCCTTTTHHHHHHHHTCCCCCTCCCHECCCTTCTTTTEEECTTCTTCCHHHHHHHTTTTTTTCEEECCCTTTTCCHHHHHHTTTTTTHHEEECCCTTCCECCCCTTHHHHHHTCCCCCCCCCCCCHCHHHHHEEEEETCTTTHTTECCTTCCCCTTCCHHHHHHHTTTTTTTTCCCTTCCETCCCTTCTTCCEECCTCCCECCCCCCCCCCTTCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Chou-Fasman (CF) CCCCCCCCCEEEECCCCEEEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCEEEEHHHHHHHHEECCCCEEEECCCCEEECEEEHHHHEECCCCEEEEEECEEEHHHHEEHHHHHHHHHHHHEEEEEEECCCCCEEEEEECCCCCEEEEHHHHHHEEECCCEEECEEECCCCCHHHHEEEECCCCCCCCCEEEEEEEEEEHHHHHHHCCCCEEEECHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHEEECCEEHHHHHHHCCCEEEECCCCCCCEECCCCEEEEEEEEHHHHHCEEECCEEHHHHHHHHEEEEEEHHHHHEEEEEEEECEECCCCCCCEEEEEEEEEEECEECCHHHHHHEEECCCEECCCEEEEEEEECCCCCCCCCCHHHHHHCCCCCCCCCCCHHHHHHHHHCCEEEEECEEEEEHHHHHEEHHHHCEEEECCCCCCCCHHHHHHCCCCCHHHHHEEEECCCCHHHHEEEECEEEEECCCEEEECCCEECEEHHHHHHHHHHHHHHCEEEEEEEEECCCCCCHHHHHCCCEECCCEEEEHHHHCCCCCEECEEEEEECCCCCCHHHHHHHHHEEEEEECCCCHHHHHEEEECHHHHHHHHHHHHHHHHHEEEEECEEEEHHHHHHHHHHHHHHEEEEEECCCCHHHHHEEHHHHHHHHHHHHHHEEEHHHHHHHHHHHCCCCCEEEEEHHHHEECCEEEHHHHEEECCCCCCCEEEEEEECCCCCCCCEEEEEHHHHHHHHHCHHHHHEEHHHHHCCCHHHHHHHEEEEHHHHEEEECCCEECCEEEECEEEECCCCHHHHCCEEEEEEEEEHHHHHHCCCCHHHHHEECCCCCCCCCCEEEEEEEEEEECCCCCEEEEHHHHCCCCCCCEEEECEEHHHHHCCCCCCCCCCCCCEEEECCCCCCCEEECCCEEEECCEEEECCCCEEEEEEEEHHHHHHHHCCCCCCCCCCEEECHHHHEEEEEEEEEEEHHHHHHHHHHHHHHHHHCCCEEEEEECEEEEHHHHHHHHHHEEEEHHHHHHHHHHEEEEEHHHHHEEECHHHHHCCCCCCCEEHHHHEEEEEEECCCHHHHHHHEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEECCCEEEEHHHHHHHHHHHHHHHHHHHCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCCCHHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCCCHHHHHHHHEECEECCCCCCCHHHHHCCCEEEECEECCCCCCCHHHHHHCCCCCHHHHHHCHHHHHHEEEEECCHHHHHHHHHHHCCEEEEHHHHHHHHHHHHHCCCCHHHHHHHHCCEEECCHHHHHCCCCHHHHHHEEECCCCCCCCCCHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHCHHHHHCEEEECCCCHHHHHCCCCCEEHHHHHCCCCCEEECCCHHHHHHHCHHHHCCCCCCCCCCCCCEEEECCCCCCHHHHHHCCCCCCCCCCCCCCCCEEEECCCCCCEECCCCCCEEEECCCCCCCCCCCCCCCHHHHHHCHHHHHHHHHHHHHHHHCCC
Neural Network (NN) CCCCCCCCCCCCCCCCCCCEEEEEHHHCCCCCCHHHCHHHCCCCCCCCHHHHHHHHHCCHHHHHCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCEEEECCCCCCCCEEEEECCCCCCEEEHHHCCHHCCCHHHHHEECCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHCCHHHCCCCCCCCECCCCCCCHHHHHHHCCHHHHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCCCCCEEEEHHCCCCCHHHHHHHHHHHHHCCHHHHHHHCCCCCCCEEEEEEEEEECCCCCCEEEEEHHHHHHHHHHHHHCCCCCCCCCCCHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCHHHHCCCCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHEHHHHHCCCCCCCHHHHHHHHCCCEEEECCCCCCCCCCCCEEEHHHCCCCCCHHCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHEECCCCCCCCCHHHHHHHHHHHHHHHECCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCHEHHCCCCCCCCCCCCCHHHHEECCCCCCCCCCCCCCCCCCCCCCEEEEEHCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCHCCCCCCCCCCCCCCCCCCCECCCCCCCCCCCEEEEEHHHHHHHCCCCCCHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCEEEEECCCCCCECCCCCCCCHHCCCHHCCHCCCCCCCCCCCEECCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCHCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHCCHHHHHHHHCCCCEEHCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCHHCCCCCCCCCCCCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCEECCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCHEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCHHHHHHHH
Joint/Consensus CCCCCCCCCEEEECCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCHHHHEECCCCCCEECCCCCCCEEEEECCCCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCEEEHHHHHHHCCCCEEEEEEECCCCCCCCCCEEECCCCCCCCEEEECCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCEEEEEEEEEHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCHHHHEEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHCCEEEEECCCCCCCCEECCEEEEECCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCEECCCHHHHHHHHHHHHHHHHHHHEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCEECCCCCHHHHHEECCCCCCCCCCCEEECCCCCCCCEEEEECCCCCCHHHHHHHHHHCHHHHHHHHHHCHHHHHEEEHHHHHEEECCCCCCCCCCCCEEEEECCCCCCCCCCCEEEEEHHHHHHHCCCCCCHHHHEECCCCCCCCCCCCCCCCCEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCEEEEECCCCCCCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHCEEEEECCCCEEHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHCHHHHHHCCCCCCHHHHHHHHCCCCEECCCCCCHHHHHEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHCCCCCCHHHHHHHHCCCCCCCCCEEECCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCEECCCCHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCHHHHHHCCCCCCCCEECCCCCCCCCCHHHHHCCCCCCCCCEECCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH

Molecular Descriptors and ADMET Properties

Molecular Descriptors: Not available.

ADMET Properties: Not available.

Cross Referencing databases

CancerPPD : Not available

ApIAPDB : Not available

CancerPPD2 ID : Not available

Reference

1 : Rigbolt KT, et al. System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation. Sci Signal. 2011; 4:rs3. doi: 10.1126/scisignal.2001570

2 : Mayya V, et al. Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions. Sci Signal. 2009; 2:ra46. doi: 10.1126/scisignal.2000007

3 : Beausoleil SA, et al. A probability-based approach for high-throughput protein phosphorylation analysis and site localization. Nat Biotechnol. 2006; 24:1285-92. doi: 10.1038/nbt1240

4 : Zini N, et al. Discrete localization of different DNA topoisomerases in HeLa and K562 cell nuclei and subnuclear fractions. Exp Cell Res. 1994; 210:336-48. doi: 10.1006/excr.1994.1046

5 : Gauci S, et al. Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach. Anal Chem. 2009; 81:4493-501. doi: 10.1021/ac9004309

6 : Wyles JP, et al. Nuclear interactions of topoisomerase II alpha and beta with phospholipid scramblase 1. Nucleic Acids Res. 2007; 35:4076-85. doi: 10.1093/nar/gkm434

7 : Bian Y, et al. An enzyme assisted RP-RPLC approach for in-depth analysis of human liver phosphoproteome. J Proteomics. 2014; 96:253-62. doi: 10.1016/j.jprot.2013.11.014

8 : Turley H, et al. The distribution and expression of the two isoforms of DNA topoisomerase II in normal and neoplastic human tissues. Br J Cancer. 1997; 75:1340-6. doi: 10.1038/bjc.1997.227

9 : Carrascal M, et al. Phosphorylation analysis of primary human T lymphocytes using sequential IMAC and titanium oxide enrichment. J Proteome Res. 2008; 7:5167-76. doi: 10.1021/pr800500r

10 : Olsen JV, et al. Global, in vivo, and site-specific phosphorylation dynamics in signaling networks. Cell. 2006; 127:635-48. doi: 10.1016/j.cell.2006.09.026

11 : Zhou H, et al. Toward a comprehensive characterization of a human cancer cell phosphoproteome. J Proteome Res. 2013; 12:260-71. doi: 10.1021/pr300630k

12 : Austin CA and Fisher LM. Isolation and characterization of a human cDNA clone encoding a novel DNA topoisomerase II homologue from HeLa cells. FEBS Lett. 1990; 266:115-7. doi: 10.1016/0014-5793(90)81520-x

13 : Davies SL, et al. Human cells express two differentially spliced forms of topoisomerase II beta mRNA. Nucleic Acids Res. 1993; 21:3719-23. doi: 10.1093/nar/21.16.3719

14 : Dephoure N, et al. A quantitative atlas of mitotic phosphorylation. Proc Natl Acad Sci U S A. 2008; 105:10762-7. doi: 10.1073/pnas.0805139105

15 : Li J, et al. The Epstein-Barr virus deubiquitinating enzyme BPLF1 regulates the activity of topoisomerase II during productive infection. PLoS Pathog. 2021; 17:e1009954. doi: 10.1371/journal.ppat.1009954

16 : Jenkins JR, et al. Isolation of cDNA clones encoding the beta isozyme of human DNA topoisomerase II and localisation of the gene to chromosome 3p24. Nucleic Acids Res. 1992; 20:5587-92. doi: 10.1093/nar/20.21.5587

17 : Mirski SE, et al. Identification of functional nuclear export sequences in human topoisomerase IIalpha and beta. Biochem Biophys Res Commun. 2003; 306:905-11. doi: 10.1016/s0006-291x(03)01077-5

18 : Hendriks IA, et al. Uncovering global SUMOylation signaling networks in a site-specific manner. Nat Struct Mol Biol. 2014; 21:927-36. doi: 10.1038/nsmb.2890

19 : Wu CC, et al. Structural basis of type II topoisomerase inhibition by the anticancer drug etoposide. Science. 2011; 333:459-62. doi: 10.1126/science.1204117

20 : Hiraide T, et al. A de novo TOP2B variant associated with global developmental delay and autism spectrum disorder. Mol Genet Genomic Med. 2020; 8:e1145. doi: 10.1002/mgg3.1145

21 : Yuwen H, et al. Binding of wild-type p53 by topoisomerase II and overexpression of topoisomerase II in human hepatocellular carcinoma. Biochem Biophys Res Commun. 1997; 234:194-7. doi: 10.1006/bbrc.1997.6539

22 : Xiao Z, et al. System-wide Analysis of SUMOylation Dynamics in Response to Replication Stress Reveals Novel Small Ubiquitin-like Modified Target Proteins and Acceptor Lysines Relevant for Genome Stability. Mol Cell Proteomics. 2015; 14:1419-34. doi: 10.1074/mcp.O114.044792

23 : Burkard TR, et al. Initial characterization of the human central proteome. BMC Syst Biol. 2011; 5:17. doi: 10.1186/1752-0509-5-17

24 : Hendriks IA, et al. Site-specific mapping of the human SUMO proteome reveals co-modification with phosphorylation. Nat Struct Mol Biol. 2017; 24:325-336. doi: 10.1038/nsmb.3366

25 : Matsuoka S, et al. ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage. Science. 2007; 316:1160-6. doi: 10.1126/science.1140321

26 : Chung TD, et al. Characterization and immunological identification of cDNA clones encoding two human DNA topoisomerase II isozymes. Proc Natl Acad Sci U S A. 1989; 86:9431-5. doi: 10.1073/pnas.86.23.9431

27 : Austin CA, et al. Novel HeLa topoisomerase II is the II beta isoform: complete coding sequence and homology with other type II topoisomerases. Biochim Biophys Acta. 1993; 1172:283-91. doi: 10.1016/0167-4781(93)90215-y

28 : West KL, et al. Mutagenesis of E477 or K505 in the B' domain of human topoisomerase II beta increases the requirement for magnesium ions during strand passage. Biochemistry. 2000; 39:1223-33. doi: 10.1021/bi991328b

29 : Cantin GT, et al. Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient phosphoproteomic analysis. J Proteome Res. 2008; 7:1346-51. doi: 10.1021/pr0705441

30 : Hendriks IA, et al. SUMO-2 Orchestrates Chromatin Modifiers in Response to DNA Damage. Cell Rep. 2015; 10:1778-1791. doi: 10.1016/j.celrep.2015.02.033

31 : Lam CW, et al. Global developmental delay and intellectual disability associated with a de novo TOP2B mutation. Clin Chim Acta. 2017; 469:63-68. doi: 10.1016/j.cca.2017.03.022

32 : Papapietro O, et al. Topoisomerase 2β mutation impairs early B-cell development. Blood. 2020; 135:1497-1501. doi: 10.1182/blood.2019003299

33 : Wang SM, et al. Genomic Action of Sigma-1 Receptor Chaperone Relates to Neuropathic Pain. Mol Neurobiol. 2021; 58:2523-2541. doi: 10.1007/s12035-020-02276-8

34 : Olsen JV, et al. Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis. Sci Signal. 2010; 3:ra3. doi: 10.1126/scisignal.2000475

35 : Broderick L, et al. Mutations in topoisomerase IIβ result in a B cell immunodeficiency. Nat Commun. 2019; 10:3644. doi: 10.1038/s41467-019-11570-6

36 : Giorgianni F, et al. Toward a global characterization of the phosphoproteome in prostate cancer cells: identification of phosphoproteins in the LNCaP cell line. Electrophoresis. 2007; 28:2027-34. doi: 10.1002/elps.200600782

37 : Sng JH, et al. Molecular cloning and characterization of the human topoisomerase IIalpha and IIbeta genes: evidence for isoform evolution through gene duplication. Biochim Biophys Acta. 1999; 1444:395-406. doi: 10.1016/s0167-4781(99)00020-2

Literature

Paper title : System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation.

Doi : https://doi.org/10.1126/scisignal.2001570

Abstract : To elucidate cellular events underlying the pluripotency of human embryonic stem cells (hESCs), we performed parallel quantitative proteomic and phosphoproteomic analyses of hESCs during differentiation initiated by a diacylglycerol analog or transfer to media that had not been conditioned by feeder cells. We profiled 6521 proteins and 23,522 phosphorylation sites, of which almost 50% displayed dynamic changes in phosphorylation status during 24 hours of differentiation. These data are a resource for studies of the events associated with the maintenance of hESC pluripotency and those accompanying their differentiation. From these data, we identified a core hESC phosphoproteome of sites with similar robust changes in response to the two distinct treatments. These sites exhibited distinct dynamic phosphorylation patterns, which were linked to known or predicted kinases on the basis of the matching sequence motif. In addition to identifying previously unknown phosphorylation sites on factors associated with differentiation, such as kinases and transcription factors, we observed dynamic phosphorylation of DNA methyltransferases (DNMTs). We found a specific interaction of DNMTs during early differentiation with the PAF1 (polymerase-associated factor 1) transcriptional elongation complex, which binds to promoters of the pluripotency and known DNMT target genes encoding OCT4 and NANOG, thereby providing a possible molecular link for the silencing of these genes during differentiation.

Paper title : Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions.

Doi : https://doi.org/10.1126/scisignal.2000007

Abstract : Protein phosphorylation events during T cell receptor (TCR) signaling control the formation of complexes among proteins proximal to the TCR, the activation of kinase cascades, and the activation of transcription factors; however, the mode and extent of the influence of phosphorylation in coordinating the diverse phenomena associated with T cell activation are unclear. Therefore, we used the human Jurkat T cell leukemia cell line as a model system and performed large-scale quantitative phosphoproteomic analyses of TCR signaling. We identified 10,665 unique phosphorylation sites, of which 696 showed TCR-responsive changes. In addition, we analyzed broad trends in phosphorylation data sets to uncover underlying mechanisms associated with T cell activation. We found that, upon stimulation of the TCR, phosphorylation events extensively targeted protein modules involved in all of the salient phenomena associated with T cell activation: patterning of surface proteins, endocytosis of the TCR, formation of the F-actin cup, inside-out activation of integrins, polarization of microtubules, production of cytokines, and alternative splicing of messenger RNA. Further, case-by-case analysis of TCR-responsive phosphorylation sites on proteins belonging to relevant functional modules together with network analysis allowed us to deduce that serine-threonine (S-T) phosphorylation modulated protein-protein interactions (PPIs) in a system-wide fashion. We also provide experimental support for this inference by showing that phosphorylation of tubulin on six distinct serine residues abrogated PPIs during the assembly of microtubules. We propose that modulation of PPIs by stimulus-dependent changes in S-T phosphorylation state is a widespread phenomenon applicable to many other signaling systems.

Paper title : A probability-based approach for high-throughput protein phosphorylation analysis and site localization.

Doi : https://doi.org/10.1038/nbt1240

Abstract : Data analysis and interpretation remain major logistical challenges when attempting to identify large numbers of protein phosphorylation sites by nanoscale reverse-phase liquid chromatography/tandem mass spectrometry (LC-MS/MS) (Supplementary Figure 1 online). In this report we address challenges that are often only addressable by laborious manual validation, including data set error, data set sensitivity and phosphorylation site localization. We provide a large-scale phosphorylation data set with a measured error rate as determined by the target-decoy approach, we demonstrate an approach to maximize data set sensitivity by efficiently distracting incorrect peptide spectral matches (PSMs), and we present a probability-based score, the Ascore, that measures the probability of correct phosphorylation site localization based on the presence and intensity of site-determining ions in MS/MS spectra. We applied our methods in a fully automated fashion to nocodazole-arrested HeLa cell lysate where we identified 1,761 nonredundant phosphorylation sites from 491 proteins with a peptide false-positive rate of 1.3%.

Paper title : Discrete localization of different DNA topoisomerases in HeLa and K562 cell nuclei and subnuclear fractions.

Doi : https://doi.org/10.1006/excr.1994.1046

Abstract : Monoclonal antibodies raised against DNA topoisomerase I and against topoisomerase II alpha and beta isoforms, which have been previously demonstrated to be highly specific and capable of detecting cell cycle-related variations of the topoisomerase II isoforms (Negri et al., 1992, Exp. Cell Res. 200, 452-459), have been utilized for a fine subcellular localization. Immunocytochemistry by confocal and electron microscopy have been used for a topological and quantitative evaluation of the fine distribution of the different topoisomerases in HeLa and K562 cells. Topoisomerase I and topoisomerase II alpha are present both in the nucleoplasm and in the nucleolus, though at different relative ratios, while topoisomerase II beta is exclusively present at the nucleolar level. This is further confirmed by immunoblotting and immunocytochemical quantitative evaluations performed on purified nuclear matrix fractions obtained from K562 cells. In fact, the amount of topoisomerase I and topoisomerase II alpha present in the whole cell nuclei is partly lost in isolated nuclei but, while topoisomerase I is further significantly reduced in nuclear matrix preparations, the topoisomerase II alpha content is only slightly decreased. On the other hand, the great majority of topoisomerase II beta is retained in the nuclear matrix and can be detected exclusively in association with the nucleolar remnant. These results are consistent with specific functional roles hypothesized for the different topoisomerase types.

Paper title : Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach.

Doi : https://doi.org/10.1021/ac9004309

Abstract : The analysis of proteome-wide phosphorylation events is still a major analytical challenge because of the enormous complexity of protein phosphorylation networks. In this work, we evaluate the complementarity of Lys-N, Lys-C, and trypsin with regard to their ability to contribute to the global analysis of the phosphoproteome. A refined version of low-pH strong cation exchange was used to efficiently separate N-terminally acetylated, phosphorylated, and nonmodified peptides. A total of 5036 nonredundant phosphopeptides could be identified with a false discovery rate of <1% from 1 mg of protein using a combination of the three enzymes. Our data revealed that the overlap between the phosphopeptide data sets generated with different proteases was marginal, whereas the overlap between two similarly generated tryptic data sets was found to be at least 4 times higher. In this way, the parallel use of Lys-N and trypsin enabled a 72% increase in the number of detected phosphopeptides as compared to trypsin alone, whereas a trypsin replicate experiment only led to a 25% increase. Thus, when focusing solely on the trypsin and Lys-N data, we identified 4671 nonredundant phosphopeptides. Further analysis of the detected sites showed that the Lys-N and trypsin data sets were enriched in significantly different phosphorylation motifs, further evidencing that multiprotease approaches are very valuable in phosphoproteome analyses.

Paper title : Nuclear interactions of topoisomerase II alpha and beta with phospholipid scramblase 1.

Doi : https://doi.org/10.1093/nar/gkm434

Abstract : DNA topoisomerase (topo) II modulates DNA topology and is essential for cell division. There are two isoforms of topo II (alpha and beta) that have limited functional redundancy, although their catalytic mechanisms appear the same. Using their COOH-terminal domains (CTDs) in yeast two-hybrid analysis, we have identified phospholipid scramblase 1 (PLSCR1) as a binding partner of both topo II alpha and beta. Although predominantly a plasma membrane protein involved in phosphatidylserine externalization, PLSCR1 can also be imported into the nucleus where it may have a tumour suppressor function. The interactions of PLSCR1 and topo II were confirmed by pull-down assays with topo II alpha and beta CTD fusion proteins and endogenous PLSCR1, and by co-immunoprecipitation of endogenous PLSCR1 and topo II alpha and beta from HeLa cell nuclear extracts. PLSCR1 also increased the decatenation activity of human topo IIalpha. A conserved basic sequence in the CTD of topo IIalpha was identified as being essential for binding to PLSCR1 and binding of the two proteins could be inhibited by a synthetic peptide corresponding to topo IIalpha amino acids 1430-1441. These studies reveal for the first time a physical and functional interaction between topo II and PLSCR1.

Paper title : An enzyme assisted RP-RPLC approach for in-depth analysis of human liver phosphoproteome.

Doi : https://doi.org/10.1016/j.jprot.2013.11.014

Abstract : UNLABELLED: Protein phosphorylation is one of the most common post-translational modifications. It plays key roles in regulating diverse biological processes of liver tissues. To better understand the role of protein phosphorylation in liver functions, it is essential to perform in-depth phosphoproteome analysis of human liver. Here, an enzyme assisted reversed-phase-reversed-phase liquid chromatography (RP-RPLC) approach with both RPLC separations operated with optimized acidic mobile phase was developed. High orthogonal separation was achieved by trypsin digestion of the Glu-C generated peptides in the fractions collected from the first RPLC separation. The phosphoproteome coverage was further improved by using two types of instruments, i.e. TripleTOF 5600 and LTQ Orbitrap Velos. A total of 22,446 phosphorylation sites, corresponding to 6526 nonredundant phosphoproteins were finally identified from normal human liver tissues. Of these sites, 15,229 sites were confidently localized with Ascore≥13. This dataset was the largest phosphoproteome dataset of human liver. It can be a public resource for the liver research community and holds promise for further biology studies. BIOLOGICAL SIGNIFICANCE: The enzyme assisted approach enabled the two RPLC separations operated both with optimized acidic mobile phases. The identifications from TripleTOF 5600 and Orbitrap Velos are highly complementary. The largest phosphoproteome dataset of human liver was generated.

Paper title : The distribution and expression of the two isoforms of DNA topoisomerase II in normal and neoplastic human tissues.

Doi : https://doi.org/10.1038/bjc.1997.227

Abstract : In mammalian cells, there are two isoforms of DNA topoisomerase II, designated alpha (170-kDa form) and beta (180-kDa form). Previous studies using cell lines have shown that the topoisomerase IIalpha and beta isoforms are differentially regulated during the cell cycle and in response to changes in growth state. Moreover, both isoforms can act as targets for a range of anti-tumour drugs. Here, we have analysed the normal tissue distribution in humans of topoisomerase IIalpha and beta using isoform-specific antibodies. In addition, we have studied expression of these isoforms in 69 primary tumour biopsies, representative either of tumours that are responsive to topoisomerase II-targeting drugs (breast, lung, lymphoma and seminoma) or of those that show de novo drug resistance (colon). Topoisomerase IIalpha was expressed exclusively in the proliferating compartments of all normal tissues, and was detectable in both the cell nucleus and cytoplasm. In biologically aggressive or rapidly proliferating tumours (e.g. high-grade lymphomas and seminomas), there was a high level of topoisomerase IIalpha, although expression was still detectable in colon tumours, indicating that expression of this isoform is not sufficient to explain the intrinsic drug resistance of colon tumours. Topoisomerase IIbeta was expressed ubiquitously in vivo and was localized in both the nucleoli and the nucleoplasm. This isoform was present in quiescent cell populations, but was expressed at a generally higher level in all tumours and proliferating cells than in normal quiescent tissues. We conclude that topoisomerase IIalpha is a strict proliferation marker in normal and neoplastic cells in vivo, but that topoisomerase IIbeta has a much more general cell and tissue distribution than has topoisomerase IIalpha. The apparent up-regulation of topoisomerase IIbeta in neoplastic cells has implications for the response of patients to anti-tumour therapies that include topoisomerase II-targeting drugs.

Paper title : Phosphorylation analysis of primary human T lymphocytes using sequential IMAC and titanium oxide enrichment.

Doi : https://doi.org/10.1021/pr800500r

Abstract : T lymphocytes mediate cellular and humoral defense against foreign bodies or autoantigens. An understanding of T-cell information processing furthers studies of the immunological response. We describe a large-scale phosphorylation analysis of primary T cells using a multidimensional separation strategy, involving preparative SDS-PAGE for prefractionation, in-gel digestion and sequential phosphopeptide enrichment using IMAC and TiO2. A total of 281 phosphorylation sites (197 of high confidence, Ascore > 15), mapping to 204 human gene sequences, were identified by LC-MS(n) analysis in an LTQ linear ion trap. Subsequently, we created the LymPHOS database (http://lymphos.org), which links mass spectrometric peptide information to phosphorylation sites and phosphoprotein sequences.

Paper title : Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.

Doi : https://doi.org/10.1016/j.cell.2006.09.026

Abstract : Cell signaling mechanisms often transmit information via posttranslational protein modifications, most importantly reversible protein phosphorylation. Here we develop and apply a general mass spectrometric technology for identification and quantitation of phosphorylation sites as a function of stimulus, time, and subcellular location. We have detected 6,600 phosphorylation sites on 2,244 proteins and have determined their temporal dynamics after stimulating HeLa cells with epidermal growth factor (EGF) and recorded them in the Phosida database. Fourteen percent of phosphorylation sites are modulated at least 2-fold by EGF, and these were classified by their temporal profiles. Surprisingly, a majority of proteins contain multiple phosphorylation sites showing different kinetics, suggesting that they serve as platforms for integrating signals. In addition to protein kinase cascades, the targets of reversible phosphorylation include ubiquitin ligases, guanine nucleotide exchange factors, and at least 46 different transcriptional regulators. The dynamic phosphoproteome provides a missing link in a global, integrative view of cellular regulation.

Paper title : Toward a comprehensive characterization of a human cancer cell phosphoproteome.

Doi : https://doi.org/10.1021/pr300630k

Abstract : Mass spectrometry (MS)-based phosphoproteomics has achieved extraordinary success in qualitative and quantitative analysis of cellular protein phosphorylation. Considering that an estimated level of phosphorylation in a cell is placed at well above 100,000 sites, there is still much room for improvement. Here, we attempt to extend the depth of phosphoproteome coverage while maintaining realistic aspirations in terms of available material, robustness, and instrument running time. We developed three strategies, where each provided a different balance between these three key parameters. The first strategy simply used enrichment by Ti(4+)-IMAC followed by reversed chromatography LC-MS (termed 1D). The second strategy incorporated an additional fractionation step through the use of HILIC (2D). Finally, a third strategy was designed employing first an SCX fractionation, followed by Ti(4+)-IMAC enrichment and additional fractionation by HILIC (3D). A preliminary evaluation was performed on the HeLa cell line. Detecting 3700 phosphopeptides in about 2 h, the 1D strategy was found to be the most sensitive but limited in comprehensivity, mainly due to issues with complexity and dynamic range. Overall, the best balance was achieved using the 2D based strategy, identifying close to 17,000 phosphopeptides with less than 1 mg of material in about 48 h. Subsequently, we confirmed the findings with the K562 cell sample. When sufficient material was available, the 3D strategy increased phosphoproteome allowing over 22,000 unique phosphopeptides to be identified. Unfortunately, the 3D strategy required more time and over 1 mg of material before it started to outperform 2D. Ultimately, combining all strategies, we were able to identify over 16,000 and nearly 24,000 unique phosphorylation sites from the cancer cell lines HeLa and K562, respectively. In summary, we demonstrate the need to carry out extensive fractionation for deep mining of the phosphoproteome and provide a guide for appropriate strategies depending on sample amount and/or analysis time.

Paper title : Isolation and characterization of a human cDNA clone encoding a novel DNA topoisomerase II homologue from HeLa cells.

Doi : https://doi.org/10.1016/0014-5793(90)81520-x

Abstract : We have isolated and sequenced 3 human DNA topoisomerase II (topo II) partial cDNA clones from a HeLa carcinoma cell cDNA library. Two clones were identical to an internal fragment of HeLa topo II cDNA. The third clone, CAA5, had a different and novel sequence which shared significant nucleotide (62%) and predicted peptide (70%) homologies with a region of the HeLa topo II cDNA. Our results suggest that HeLa cells express at least two homologous forms of DNA topoisomerase II. The new HeLa topo II homologue is discussed in relation to topo II isoenzymes recently described in a Burkitt lymphoma and other cell lines.

Paper title : Human cells express two differentially spliced forms of topoisomerase II beta mRNA.

Doi : https://doi.org/10.1093/nar/21.16.3719

Abstract : Screening of a human B-cell cDNA library with a topoisomerase II beta gene-specific probe revealed the presence of two distinct forms of topoisomerase II beta cDNA. One form (designated topoisomerase II beta-1), representing the majority of the clones, would encode the topoisomerase II beta amino acid sequence reported recently [Jenkins, J.R. et al. (1992) Nucleic Acids Res., 20, 5587-5592]. The second form (designated topoisomerase II beta-2) would encode a protein containing an additional 5 amino acids inserted after Valine-23 of the topoisomerase II beta-1 protein sequence. The topoisomerase II beta-1 and beta-2 mRNAs were both widely expressed in human cell lines and tissues. Topoisomerase II beta-2 mRNA was expressed at a lower level than that of the beta-1 form, but the relative expression of the two forms varied in different cell types. Analysis of genomic DNA clones revealed that the two forms of topoisomerase II beta mRNA arose via differential splicing. These data indicate that in addition to the closely related topoisomerase II alpha and beta isozymes, there are two forms of topoisomerase II beta mRNA widely expressed in human cells.

Paper title : A quantitative atlas of mitotic phosphorylation.

Doi : https://doi.org/10.1073/pnas.0805139105

Abstract : The eukaryotic cell division cycle is characterized by a sequence of orderly and highly regulated events resulting in the duplication and separation of all cellular material into two newly formed daughter cells. Protein phosphorylation by cyclin-dependent kinases (CDKs) drives this cycle. To gain further insight into how phosphorylation regulates the cell cycle, we sought to identify proteins whose phosphorylation is cell cycle regulated. Using stable isotope labeling along with a two-step strategy for phosphopeptide enrichment and high mass accuracy mass spectrometry, we examined protein phosphorylation in a human cell line arrested in the G(1) and mitotic phases of the cell cycle. We report the identification of >14,000 different phosphorylation events, more than half of which, to our knowledge, have not been described in the literature, along with relative quantitative data for the majority of these sites. We observed >1,000 proteins with increased phosphorylation in mitosis including many known cell cycle regulators. The majority of sites on regulated phosphopeptides lie in [S/T]P motifs, the minimum required sequence for CDKs, suggesting that many of the proteins may be CDK substrates. Analysis of non-proline site-containing phosphopeptides identified two unique motifs that suggest there are at least two undiscovered mitotic kinases.

Paper title : The Epstein-Barr virus deubiquitinating enzyme BPLF1 regulates the activity of topoisomerase II during productive infection.

Doi : https://doi.org/10.1371/journal.ppat.1009954

Abstract : Topoisomerases are essential for the replication of herpesviruses but the mechanisms by which the viruses hijack the cellular enzymes are largely unknown. We found that topoisomerase-II (TOP2) is a substrate of the Epstein-Barr virus (EBV) ubiquitin deconjugase BPLF1. BPLF1 co-immunoprecipitated and deubiquitinated TOP2, and stabilized SUMOylated TOP2 trapped in cleavage complexes (TOP2ccs), which halted the DNA damage response to TOP2-induced double strand DNA breaks and promoted cell survival. Induction of the productive virus cycle in epithelial and lymphoid cell lines carrying recombinant EBV encoding the active enzyme was accompanied by TOP2 deubiquitination, accumulation of TOP2ccs and resistance to Etoposide toxicity. The protective effect of BPLF1 was dependent on the expression of tyrosyl-DNA phosphodiesterase 2 (TDP2) that releases DNA-trapped TOP2 and promotes error-free DNA repair. These findings highlight a previously unrecognized function of BPLF1 in supporting a non-proteolytic pathway for TOP2ccs debulking that favors cell survival and virus production.

Paper title : Isolation of cDNA clones encoding the beta isozyme of human DNA topoisomerase II and localisation of the gene to chromosome 3p24.

Doi : https://doi.org/10.1093/nar/20.21.5587

Abstract : Topoisomerases catalyse the interconversion of topological isomers of DNA and have key roles in nucleic acid metabolism. Human cells express two distinct type II topoisomerase isozymes, designated topoisomerase II alpha (170 kDa form) and topoisomerase II beta (180 kDa form). We have isolated cDNA clones encoding the beta isozyme from a human B-cell library. The proposed coding region for the topoisomerase II beta protein is 4,863 nucleotides long and would encode a polypeptide with a calculated M(r) of 182,705. The predicted topoisomerase II beta protein sequence shows striking similarity (72% identical residues) to that of the human alpha isozyme, and homology to topoisomerase II proteins from Drosophila, yeast and bacteria. Regions of greatest amino acid sequence divergence lie at the extreme N-terminus and over a C-terminal domain comprising approximately 25% of the total protein. We have quantified the level of topoisomerase II beta mRNA in a panel of human tumour cell lines of different origin using an RNase protection assay, and compared the level to that of topoisomerase II alpha mRNA. Topoisomerase II beta mRNA was expressed in haemopoietic, epithelial and fibroblast cell lines, although to different extents, with U937 cells (promonocytic leukaemia) showing a particularly high level. There was no obvious relationship in terms of level of expression between the topoisomerase II alpha and beta genes. We have localised the gene encoding topoisomerase II beta protein to chromosome 3p24 in the human genome.

Paper title : Identification of functional nuclear export sequences in human topoisomerase IIalpha and beta.

Doi : https://doi.org/10.1016/s0006-291x(03)01077-5

Abstract : Nuclear localization of topoisomerase IIalpha and beta is important for normal cell function as well as being a determinant of tumour cell sensitivity to topoisomerase II-targeting chemotherapeutic agents. However, topoisomerase II is cytoplasmic under certain circumstances, indicating that it may undergo active nuclear export. We have examined the ability of Leu-rich potential nuclear export signal (NES) sequences present in human topoisomerase IIalpha and beta to direct the export of a green fluorescent protein-glutathione-S-transferase fusion protein following microinjection into HeLa cell nuclei. Of 12 sequences tested, only one potential NES sequence from the comparable location in each isoform (alphaNES(1018-1028) and betaNES(1034-1044)) was active. Mutation of hydrophobic residues in alphaNES(1018-1028) and betaNES(1034-1044) substantially reduced their nuclear export activity as did leptomycin B treatment of microinjected cells. Our results provide the first evidence of active nuclear export of topoisomerase II and suggest it is mediated by a CRM1-dependent pathway.

Paper title : Uncovering global SUMOylation signaling networks in a site-specific manner.

Doi : https://doi.org/10.1038/nsmb.2890

Abstract : SUMOylation is a reversible post-translational modification essential for genome stability. Using high-resolution MS, we have studied global SUMOylation in human cells in a site-specific manner, identifying a total of >4,300 SUMOylation sites in >1,600 proteins. To our knowledge, this is the first time that >1,000 SUMOylation sites have been identified under standard growth conditions. We quantitatively studied SUMOylation dynamics in response to SUMO protease inhibition, proteasome inhibition and heat shock. Many SUMOylated lysines have previously been reported to be ubiquitinated, acetylated or methylated, thus indicating cross-talk between SUMO and other post-translational modifications. We identified 70 phosphorylation and four acetylation events in proximity to SUMOylation sites, and we provide evidence for acetylation-dependent SUMOylation of endogenous histone H3. SUMOylation regulates target proteins involved in all nuclear processes including transcription, DNA repair, chromatin remodeling, precursor-mRNA splicing and ribosome assembly.

Paper title : Structural basis of type II topoisomerase inhibition by the anticancer drug etoposide.

Doi : https://doi.org/10.1126/science.1204117

Abstract : Type II topoisomerases (TOP2s) resolve the topological problems of DNA by transiently cleaving both strands of a DNA duplex to form a cleavage complex through which another DNA segment can be transported. Several widely prescribed anticancer drugs increase the population of TOP2 cleavage complex, which leads to TOP2-mediated chromosome DNA breakage and death of cancer cells. We present the crystal structure of a large fragment of human TOP2β complexed to DNA and to the anticancer drug etoposide to reveal structural details of drug-induced stabilization of a cleavage complex. The interplay between the protein, the DNA, and the drug explains the structure-activity relations of etoposide derivatives and the molecular basis of drug-resistant mutations. The analysis of protein-drug interactions provides information applicable for developing an isoform-specific TOP2-targeting strategy.

Paper title : A de novo TOP2B variant associated with global developmental delay and autism spectrum disorder.

Doi : https://doi.org/10.1002/mgg3.1145

Abstract : BACKGROUND: TOP2B encodes type II topoisomerase beta, which controls topological changes during DNA transcription. TOP2B is expressed in the developing nervous system and is involved in brain development and neural differentiation. Recently, a de novo missense TOP2B variant (c.187C>T) has been identified in an individual with neurodevelopmental disorder (NDD). However, the association between TOP2B variants and NDDs remains uncertain. METHODS: Trio-based whole-exome sequencing was performed on a 7-year-old girl, presenting muscle hypotonia, stereotypic hand movements, epilepsy, global developmental delay, and autism spectrum disorder. Brain magnetic resonance images were normal. She was unable to walk independently and spoke no meaningful words. RESULTS: We found a de novo variant in TOP2B (NM_001330700.1:c.187C>T, p.(His63Tyr)), which is identical to the previous case. The clinical features of the two individuals with the c.187C>T variant overlapped. CONCLUSION: Our study supports the finding that TOP2B variants may cause NDDs.

Paper title : Binding of wild-type p53 by topoisomerase II and overexpression of topoisomerase II in human hepatocellular carcinoma.

Doi : https://doi.org/10.1006/bbrc.1997.6539

Abstract : In order to study the mechanisms by which p53 function is regulated, human wild-type p53 cDNA was cloned into a vaccinia virus vector and the expressed p53 protein was used to investigate binding of the p53 by cellular proteins from a cDNA expression library from human liver. One protein that bound wild-type p53 had > 99% homology with DNA topoisomerase IIb. p53 protein was coimmunoprecipitated from topoisomerase II-rich cell lysates (but not from topoisomerase II-deficient cell lysates) by an antibody to topoisomerase IIa and IIb. This binding was shown to occur without a dsDNA intermediary. Hepatocellular carcinomas (HCCs) and adjacent nontumorous liver tissues from ten patients were studied to determine the level of expression of topoisomerase II and p53. Overexpressed topoisomerase II proteins were detected by western blot in six of ten HCCs (60%), including several in which presumed wild-type p53 was detected by immunohistochemistry. No topoisomerase II expression was detectable in the ten nontumorous liver tissues from the same patients or in a sample of normal human liver.

Paper title : System-wide Analysis of SUMOylation Dynamics in Response to Replication Stress Reveals Novel Small Ubiquitin-like Modified Target Proteins and Acceptor Lysines Relevant for Genome Stability.

Doi : https://doi.org/10.1074/mcp.O114.044792

Abstract : Genotoxic agents can cause replication fork stalling in dividing cells because of DNA lesions, eventually leading to replication fork collapse when the damage is not repaired. Small Ubiquitin-like Modifiers (SUMOs) are known to counteract replication stress, nevertheless, only a small number of relevant SUMO target proteins are known. To address this, we have purified and identified SUMO-2 target proteins regulated by replication stress in human cells. The developed methodology enabled single step purification of His10-SUMO-2 conjugates under denaturing conditions with high yield and high purity. Following statistical analysis on five biological replicates, a total of 566 SUMO-2 targets were identified. After 2 h of hydroxyurea treatment, 10 proteins were up-regulated for SUMOylation and two proteins were down-regulated for SUMOylation, whereas after 24 h, 35 proteins were up-regulated for SUMOylation, and 13 proteins were down-regulated for SUMOylation. A site-specific approach was used to map over 1000 SUMO-2 acceptor lysines in target proteins. The methodology is generic and is widely applicable in the ubiquitin field. A large subset of these identified proteins function in one network that consists of interacting replication factors, transcriptional regulators, DNA damage response factors including MDC1, ATR-interacting protein ATRIP, the Bloom syndrome protein and the BLM-binding partner RMI1, the crossover junction endonuclease EME1, BRCA1, and CHAF1A. Furthermore, centromeric proteins and signal transducers were dynamically regulated by SUMOylation upon replication stress. Our results uncover a comprehensive network of SUMO target proteins dealing with replication damage and provide a framework for detailed understanding of the role of SUMOylation to counteract replication stress. Ultimately, our study reveals how a post-translational modification is able to orchestrate a large variety of different proteins to integrate different nuclear processes with the aim of dealing with the induced DNA damage.

Paper title : Initial characterization of the human central proteome.

Doi : https://doi.org/10.1186/1752-0509-5-17

Abstract : BACKGROUND: On the basis of large proteomics datasets measured from seven human cell lines we consider their intersection as an approximation of the human central proteome, which is the set of proteins ubiquitously expressed in all human cells. Composition and properties of the central proteome are investigated through bioinformatics analyses. RESULTS: We experimentally identify a central proteome comprising 1,124 proteins that are ubiquitously and abundantly expressed in human cells using state of the art mass spectrometry and protein identification bioinformatics. The main represented functions are proteostasis, primary metabolism and proliferation. We further characterize the central proteome considering gene structures, conservation, interaction networks, pathways, drug targets, and coordination of biological processes. Among other new findings, we show that the central proteome is encoded by exon-rich genes, indicating an increased regulatory flexibility through alternative splicing to adapt to multiple environments, and that the protein interaction network linking the central proteome is very efficient for synchronizing translation with other biological processes. Surprisingly, at least 10% of the central proteome has no or very limited functional annotation. CONCLUSIONS: Our data and analysis provide a new and deeper description of the human central proteome compared to previous results thereby extending and complementing our knowledge of commonly expressed human proteins. All the data are made publicly available to help other researchers who, for instance, need to compare or link focused datasets to a common background.

Paper title : Site-specific mapping of the human SUMO proteome reveals co-modification with phosphorylation.

Doi : https://doi.org/10.1038/nsmb.3366

Abstract : Small ubiquitin-like modifiers (SUMOs) are post-translational modifications (PTMs) that regulate nuclear cellular processes. Here we used an augmented K0-SUMO proteomics strategy to identify 40,765 SUMO acceptor sites and quantify their fractional contribution for 6,747 human proteins. Structural-predictive analyses revealed that lysines residing in disordered regions are preferentially targeted by SUMO, in notable contrast to other widespread lysine modifications. In our data set, we identified 807 SUMOylated peptides that were co-modified by phosphorylation, along with dozens of SUMOylated peptides that were co-modified by ubiquitylation, acetylation and methylation. Notably, 9% of the identified SUMOylome occurred proximal to phosphorylation, and numerous SUMOylation sites were found to be fully dependent on prior phosphorylation events. SUMO-proximal phosphorylation occurred primarily in a proline-directed manner, and inhibition of cyclin-dependent kinases dynamically affected co-modification. Collectively, we present a comprehensive analysis of the SUMOylated proteome, uncovering the structural preferences for SUMO and providing system-wide evidence for a remarkable degree of cross-talk between SUMOylation and other major PTMs.

Paper title : ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage.

Doi : https://doi.org/10.1126/science.1140321

Abstract : Cellular responses to DNA damage are mediated by a number of protein kinases, including ATM (ataxia telangiectasia mutated) and ATR (ATM and Rad3-related). The outlines of the signal transduction portion of this pathway are known, but little is known about the physiological scope of the DNA damage response (DDR). We performed a large-scale proteomic analysis of proteins phosphorylated in response to DNA damage on consensus sites recognized by ATM and ATR and identified more than 900 regulated phosphorylation sites encompassing over 700 proteins. Functional analysis of a subset of this data set indicated that this list is highly enriched for proteins involved in the DDR. This set of proteins is highly interconnected, and we identified a large number of protein modules and networks not previously linked to the DDR. This database paints a much broader landscape for the DDR than was previously appreciated and opens new avenues of investigation into the responses to DNA damage in mammals.

Paper title : Characterization and immunological identification of cDNA clones encoding two human DNA topoisomerase II isozymes.

Doi : https://doi.org/10.1073/pnas.86.23.9431

Abstract : Several DNA topoisomerase II (Topo II; EC 5.99.1.3) partial cDNA clones obtained from a human Raji-HN2 cDNA library were sequenced and two classes of nucleotide sequences were found. One member of the first class, SP1, was identical to an internal fragment of human HeLa cell Topo II cDNA described earlier. A member of the second class, SP11, shared extensive nucleotide (75%) and predicted peptide (92%) sequence similarities with the first two-thirds of HeLa Topo II. Each class of cDNAs hybridized to unique, nonoverlapping restriction enzyme fragments of genomic DNA from several human cell lines. Synthetic 24-mer oligonucleotide probes specific for each cDNA class hybridized to 6.5-kilobase mRNAs; furthermore, hybridization of probe specific for one class was not blocked by probe specific for the other. Antibodies raised against a synthetic SP1-encoded dodecapeptide specifically recognized the 170-kDa form of Topo II, while antibodies raised against the corresponding SP11-encoded dodecapeptide, or a second unique SP11-encoded tridecapeptide, selectively recognized the 180-kDa form of Topo II. These data provide genetic and immunochemical evidence for two Topo II isozymes.

Paper title : Novel HeLa topoisomerase II is the II beta isoform: complete coding sequence and homology with other type II topoisomerases.

Doi : https://doi.org/10.1016/0167-4781(93)90215-y

Abstract : DNA topoisomerase (topo) II mediates DNA strand passage in an ATP-dependent reaction. Human cell lines express at least two genetically distinct forms of the enzyme, topo II alpha (p170) and II beta (p180). Previously, we isolated a novel HeLa cDNA clone (CAA5) that partially encodes a protein homologous to topo II alpha (Austin, C.A. and Fisher, L.M. (1990) FEBS Lett. 266, 115-117). In this paper we show that CAA5 encodes a C-terminal segment of human topo II beta. We report here for the first time cDNA clones spanning the entire coding sequence. Overlapping clones specifying the 3' end of the cDNA have been isolated, mapped and sequenced. The missing 5' coding sequence was obtained by an inverse PCR protocol and from a specifically primed cDNA library. Human topo II beta is a 1621 amino acid protein which is closely homologous to topo II alpha in the N-terminal three quarters of its sequence. In contrast, the C-terminal segments of the alpha and beta sequences show considerable divergence suggesting these regions may mediate different cellular functions of the two isoforms. Southern blot analysis of yeast and Drosophila DNA using human alpha and beta specific probes detected a single topo II homologue in these lower eukaryotes. Comparison of the protein sequence for human topo II beta with other type II topoisomerases revealed several conserved motifs and has allowed identification of the likely ATPase- and DNA breakage-reunion domains.

Paper title : Mutagenesis of E477 or K505 in the B' domain of human topoisomerase II beta increases the requirement for magnesium ions during strand passage.

Doi : https://doi.org/10.1021/bi991328b

Abstract : A type II topoisomerase is essential for decatenating DNA replication products, and it accomplishes this task by passing one DNA duplex through a transient break in a second duplex. The B' domain of topoisomerase II contains three highly conserved motifs, EGDSA, PL(R/K)GK(I/L/M)LNVR, and IMTD(Q/A)DXD. We have investigated these motifs in topoisomerase II beta by mutagenesis, and report that they play a critical role in establishing the DNA cleavage-religation equilibrium. In addition, the mutations E477Q (EGDSA) and K505E (PLRGKILNVR) increase the optimal magnesium ion concentration for strand passage, without affecting the Mg(2+) dependence of ATP hydrolysis. It is likely that the binding affinity of the magnesium ion(s) specifically required for DNA cleavage has been reduced by these mutations. The crystal structure of yeast topo II indicates that residues E477 and K505 may help to position the three aspartate residues of the IMTD(Q/A)DXD motif for magnesium ion coordination, and we propose two possible locations for the magnesium ion binding site(s). These observations are consistent with a previous model in which the B' domain is positioned such that these acidic residues lie next to the active site tyrosine residue. A magnesium ion bound by these aspartate residues could therefore mediate the DNA cleavage-religation reaction.

Paper title : Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient phosphoproteomic analysis.

Doi : https://doi.org/10.1021/pr0705441

Abstract : Immobilized metal affinity chromatography (IMAC) is a common strategy used for the enrichment of phosphopeptides from digested protein mixtures. However, this strategy by itself is inefficient when analyzing complex protein mixtures. Here, we assess the effectiveness of using protein-based IMAC as a pre-enrichment step prior to peptide-based IMAC. Ultimately, we couple the two IMAC-based enrichments and MudPIT in a quantitative phosphoproteomic analysis of the epidermal growth factor pathway in mammalian cells identifying 4470 unique phosphopeptides containing 4729 phosphorylation sites.

Paper title : SUMO-2 Orchestrates Chromatin Modifiers in Response to DNA Damage.

Doi : https://doi.org/10.1016/j.celrep.2015.02.033

Abstract : Small ubiquitin-like modifiers play critical roles in the DNA damage response (DDR). To increase our understanding of SUMOylation in the mammalian DDR, we employed a quantitative proteomics approach in order to identify dynamically regulated SUMO-2 conjugates and modification sites upon treatment with the DNA damaging agent methyl methanesulfonate (MMS). We have uncovered a dynamic set of 20 upregulated and 33 downregulated SUMO-2 conjugates, and 755 SUMO-2 sites, of which 362 were dynamic in response to MMS. In contrast to yeast, where a response is centered on homologous recombination, we identified dynamically SUMOylated interaction networks of chromatin modifiers, transcription factors, DNA repair factors, and nuclear body components. SUMOylated chromatin modifiers include JARID1B/KDM5B, JARID1C/KDM5C, p300, CBP, PARP1, SetDB1, and MBD1. Whereas SUMOylated JARID1B was ubiquitylated by the SUMO-targeted ubiquitin ligase RNF4 and degraded by the proteasome in response to DNA damage, JARID1C was SUMOylated and recruited to the chromatin to demethylate histone H3K4.

Paper title : Global developmental delay and intellectual disability associated with a de novo TOP2B mutation.

Doi : https://doi.org/10.1016/j.cca.2017.03.022

Abstract : BACKGROUND: More than 100 genes had been identified for autism spectrum disorder (ASD). With the advancement of whole-exome/genome sequencing (WES/WGS), disease-causing gene in ASD can be identified in a holistic and unbiased approach. The identification of new ASD genes can further explore the molecular basis of ASD. METHODS: We report a 15yo girl with developmental delay, intellectual disability, hypotonia, microcephaly and autistic feature. She first presented at 6months old with primitive response to noise. Physical examination showed the patient was hypotonic despite normal muscle power and reflexes. She also had progressive microcephaly. Developmental assessment at 6y showed the patient had a corresponding functional age of 1y. The patient also had autistic feature. RESULTS: The patient had no abnormal biochemical or radiological findings. To investigate the molecular basis of the clinical presentation, we applied clinical whole-exome sequencing (WES) for the proband and the family, and we identified a novel de novo heterozygous missense pathogenic variant, TOP2B: NM_001068.2:c.172C>T; NP_001059.2:p.His58Tyr. TOP2B encodes for the enzyme, topoisomerase II isoenzyme beta which is abundant in both developing and adult brain. Defect of topoisomerase is also known to cause ASD. CONCLUSIONS: Using clinical WES, we were able to identify the disease-causing gene for this patient in a holistic approach and end the diagnostic odyssey with a therapeutic impact.

Paper title : Topoisomerase 2β mutation impairs early B-cell development.

Doi : https://doi.org/10.1182/blood.2019003299

Abstract : Not available

Paper title : Genomic Action of Sigma-1 Receptor Chaperone Relates to Neuropathic Pain.

Doi : https://doi.org/10.1007/s12035-020-02276-8

Abstract : Sigma-1 receptors (Sig-1Rs) are endoplasmic reticulum (ER) chaperones implicated in neuropathic pain. Here we examine if the Sig-1R may relate to neuropathic pain at the level of dorsal root ganglia (DRG). We focus on the neuronal excitability of DRG in a "spare nerve injury" (SNI) model of neuropathic pain in rats and find that Sig-1Rs likely contribute to the genesis of DRG neuronal excitability by decreasing the protein level of voltage-gated Cav2.2 as a translational inhibitor of mRNA. Specifically, during SNI, Sig-1Rs translocate from ER to the nuclear envelope via a trafficking protein Sec61β. At the nucleus, the Sig-1R interacts with cFos and binds to the promoter of 4E-BP1, leading to an upregulation of 4E-BP1 that binds and prevents eIF4E from initiating the mRNA translation for Cav2.2. Interestingly, in Sig-1R knockout HEK cells, Cav2.2 is upregulated. In accordance with those findings, we find that intra-DRG injection of Sig-1R agonist (+)pentazocine increases frequency of action potentials via regulation of voltage-gated Ca2+ channels. Conversely, intra-DRG injection of Sig-1R antagonist BD1047 attenuates neuropathic pain. Hence, we discover that the Sig-1R chaperone causes neuropathic pain indirectly as a translational inhibitor.

Paper title : Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis.

Doi : https://doi.org/10.1126/scisignal.2000475

Abstract : Eukaryotic cells replicate by a complex series of evolutionarily conserved events that are tightly regulated at defined stages of the cell division cycle. Progression through this cycle involves a large number of dedicated protein complexes and signaling pathways, and deregulation of this process is implicated in tumorigenesis. We applied high-resolution mass spectrometry-based proteomics to investigate the proteome and phosphoproteome of the human cell cycle on a global scale and quantified 6027 proteins and 20,443 unique phosphorylation sites and their dynamics. Co-regulated proteins and phosphorylation sites were grouped according to their cell cycle kinetics and compared to publicly available messenger RNA microarray data. Most detected phosphorylation sites and more than 20% of all quantified proteins showed substantial regulation, mainly in mitotic cells. Kinase-motif analysis revealed global activation during S phase of the DNA damage response network, which was mediated by phosphorylation by ATM or ATR or DNA-dependent protein kinases. We determined site-specific stoichiometry of more than 5000 sites and found that most of the up-regulated sites phosphorylated by cyclin-dependent kinase 1 (CDK1) or CDK2 were almost fully phosphorylated in mitotic cells. In particular, nuclear proteins and proteins involved in regulating metabolic processes have high phosphorylation site occupancy in mitosis. This suggests that these proteins may be inactivated by phosphorylation in mitotic cells.

Paper title : Mutations in topoisomerase IIβ result in a B cell immunodeficiency.

Doi : https://doi.org/10.1038/s41467-019-11570-6

Abstract : B cell development is a highly regulated process involving multiple differentiation steps, yet many details regarding this pathway remain unknown. Sequencing of patients with B cell-restricted immunodeficiency reveals autosomal dominant mutations in TOP2B. TOP2B encodes a type II topoisomerase, an essential gene required to alleviate topological stress during DNA replication and gene transcription, with no previously known role in B cell development. We use Saccharomyces cerevisiae, and knockin and knockout murine models, to demonstrate that patient mutations in TOP2B have a dominant negative effect on enzyme function, resulting in defective proliferation, survival of B-2 cells, causing a block in B cell development, and impair humoral function in response to immunization.

Paper title : Toward a global characterization of the phosphoproteome in prostate cancer cells: identification of phosphoproteins in the LNCaP cell line.

Doi : https://doi.org/10.1002/elps.200600782

Abstract : Protein phosphorylation plays a major role in most cell-signaling pathways in all eukaryotic cells. Disruptions in phosphorylation-mediated cell-signaling events are associated with various diseases, including cancer. Here, we applied a fully non-gel-based methodology to obtain an initial panel of phosphoproteins from the LNCaP human prostate cancer cell line. The analytical strategy involved enrichment of phosphopeptides by immobilized metal ion affinity chromatography, the use of POROS Oligo R3 to capture phosphopeptides that were not retained with a C18 packing, and gas-phase fractionation in the m/z dimension to extend the dynamic range of the LC-MS/MS analysis. In this pilot investigation, 137 phosphorylation sites in 81 phosphoproteins were identified. The characterized phosphoproteins include kinases, co-regulators of steroid receptors, and a number of cancer-related proteins.

Paper title : Molecular cloning and characterization of the human topoisomerase IIalpha and IIbeta genes: evidence for isoform evolution through gene duplication.

Doi : https://doi.org/10.1016/s0167-4781(99)00020-2

Abstract : Human DNA topoisomerase II is essential for chromosome segregation and is the target for several clinically important anticancer agents. It is expressed as genetically distinct alpha and beta isoforms encoded by the TOP2alpha and TOP2beta genes that map to chromosomes 17q21-22 and 3p24, respectively. The genes display different patterns of cell cycle- and tissue-specific expression, with the alpha isoform markedly upregulated in proliferating cells. In addition to the fundamental role of TOP2alpha and TOP2beta genes in cell growth and development, altered expression and rearrangement of both genes are implicated in anticancer drug resistance. Here, we report the complete structure of the human topoisomerase IIalpha gene, which consists of 35 exons spanning 27.5 kb. Sequence data for the exon-intron boundaries were determined and examined in the context of topoisomerase IIalpha protein structure comprising three functional domains associated with energy transduction, DNA breakage-reunion activity and nuclear localization. The organization of the 3' half of human TOP2beta, including sequence specifying the C-terminal nuclear localization domain, was also elucidated. Of the 15 introns identified in this 20 kb region of TOP2beta, the first nine and the last intron align in identical positions and display the same phases as introns in TOP2alpha. Though their extreme 3' ends differ, the striking conservation suggests the two genes diverged recently in evolutionary terms consistent with a gene duplication event. Access to TOP2alpha and TOP2beta gene structures should aid studies of mutations and gene rearrangements associated with anticancer drug resistance.