dbACP: A Comprehensive Database of Anti-Cancer Peptides

dbacp02514

General Description

Peptide name : Class I SAM-dependent methyltransferase

Source/Organism : Streptomyces lasalocidi

Linear/Cyclic : Linear

Chirality : Not found

Sequence Information

Sequence : MTEVFDAVYRGESPFGKRPPWDIGAPQPAYVALEKAGLIQGAVLDAGCGTGEDALHLAGLGYAVTGLDLSPTAISVARDKADARGLGAVFEVADALDLTGWEERFDTVIDSGLAHTFEGDRLRAYATALHRACRPGAVAHILSISDRGSAEMQARLAEAIDEIPAPLPDDDESPTLKRSADHLRDGFAEGWTIESIDESLMRGVIPTTSELLDVHAWLGRFRRD

Peptide length: 224

C-terminal modification: Linear

N-terminal modification : Not found

Non-natural peptide information: None

Activity Information

Assay type : Not specified

Assay time : Not found

Activity : Not found

Cell line : Not found

Cancer type : Not found

Other activity : Not found

Physicochemical Properties

Amino acid composition bar chart :

Molecular mass : 24008.563 Dalton

Aliphatic index : 0.890

Instability index : 41.8263

Hydrophobicity (GRAVY) : -0.153

Isoelectric point : 4.6377

Charge (pH 7) : -16.9318

Aromaticity : 0.067

Molar extinction coefficient (cysteine, cystine): (27960, 28085)

Hydrophobic/hydrophilic ratio : 1.33333333

hydrophobic moment : 0.0325

Missing amino acid : N

Most occurring amino acid : A

Most occurring amino acid frequency : 32

Least occurring amino acid : C

Least occurring amino acid frequency : 2

Structural Information

3D structure :

Secondary structure fraction (Helix, Turn, Sheet): (0.3, 0.3, 0.3)

SMILES Notation: CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CS)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CS)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC

Secondary Structure :

Method Prediction
GOR HHHHHHHHHTTTCCTTCCCCCCCCCCCCHHHHHHHHHHHEEEEEETTTCCCCHHHHHTTEEEEEEEECCCCCEEEHHHHHHHHTTTTHHHHHHHHHHHTTHHHHHHHEEHTTHHHHETTHHHHHHHHHHHHHHCTTCHEEEEEEECTTHHHHHHHHHHHHHHCCCCCCCCTTCHHHHHHTTHHHTTHHTTHHHHHHHHHHEEEECCCCCEEEHHHHHHHHHHHT
Chou-Fasman (CF) CCCCCEEECCCCCCCCCCCEECCCCCCEEEHHHHHEEEEEEHHHHCEECCHHHHHHHCEEEEEEECCCCCEEEEEHHHHHHHHCCEECCHHHHHHHEECHHHHEEEEEECHHHHHCHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCHHHHCEEEEHHHHHHHEEEEEEECCCCHHHHHEECCCCCC
Neural Network (NN) CCCHCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCECCCCCCCCHHHHHHCCCEEECCCCCCCCCEEHHCCCCCHCCCHHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHCCCCCCHEEEEEECCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCECCCCCCCCCECCCCCCCHHHHHHHHCCCCCC
Joint/Consensus CCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHEECCCCCCCCCCCHHHHHCCEEEEEECCCCCCCEEEHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCHHHHHHHEEEECCCCCCCHHHHHHHCCCCCC

Molecular Descriptors and ADMET Properties

Molecular Descriptors: Not available.

ADMET Properties: Not available.

Cross Referencing databases

Pubmed Id : 16799553 24302672

Uniprot : Not available

PDB : Not available

CancerPPD : Not available

ApIAPDB : Not available

CancerPPD2 ID : Not available

Reference

1 : Watanabe K, et al. Total biosynthesis of antitumor nonribosomal peptides in Escherichia coli. Nat Chem Biol. 2006; 2:423-8. doi: 10.1038/nchembio803

2 : Hotta K, et al. Conversion of a disulfide bond into a thioacetal group during echinomycin biosynthesis. Angew Chem Int Ed Engl. 2014; 53:824-8. doi: 10.1002/anie.201307404

Literature

Paper title : Total biosynthesis of antitumor nonribosomal peptides in Escherichia coli.

Doi : https://doi.org/10.1038/nchembio803

Abstract : Nonribosomal peptides (NRPs) are a class of microbial secondary metabolites that have a wide variety of medicinally important biological activities, such as antibiotic (vancomycin), immunosuppressive (cyclosporin A), antiviral (luzopeptin A) and antitumor (echinomycin and triostin A) activities. However, many microbes are not amenable to cultivation and require time-consuming empirical optimization of incubation conditions for mass production of desired secondary metabolites for clinical and commercial use. Therefore, a fast, simple system for heterologous production of natural products is much desired. Here we show the first example of the de novo total biosynthesis of biologically active forms of heterologous NRPs in Escherichia coli. Our system can serve not only as an effective and flexible platform for large-scale preparation of natural products from simple carbon and nitrogen sources, but also as a general tool for detailed characterizations and rapid engineering of biosynthetic pathways for microbial syntheses of novel compounds and their analogs.

Paper title : Conversion of a disulfide bond into a thioacetal group during echinomycin biosynthesis.

Doi : https://doi.org/10.1002/anie.201307404

Abstract : Echinomycin is a nonribosomal depsipeptide natural product with a range of interesting bioactivities that make it an important target for drug discovery and development. It contains a thioacetal bridge, a unique chemical motif derived from the disulfide bond of its precursor antibiotic triostin A by the action of an S-adenosyl-L-methionine-dependent methyltransferase, Ecm18. The crystal structure of Ecm18 in complex with its reaction products S-adenosyl-L-homocysteine and echinomycin was determined at 1.50 Å resolution. Phasing was achieved using a new molecular replacement package called AMPLE, which automatically derives search models from structure predictions based on ab initio protein modelling. Structural analysis indicates that a combination of proximity effects, medium effects, and catalysis by strain drives the unique transformation of the disulfide bond into the thioacetal linkage.