dbacp05790
General Description
Peptide name : Putative tryptophan 2,3-dioxygenase
Source/Organism : Streptomyces lasalocidi
Linear/Cyclic : Linear
Chirality : Not found
Sequence Information
Sequence : METLLSLQEPRVPHTAGGAAVLAEQFFIITHQACELWLKQLGADLDAAAEALLPPCDPHDLELGLEFLLRSVDVLRVLQQQLLVLERLPLRYFAEFRAYLDTASGAQSAQFRRLTALFGNSHDQGSLYEAFAAAAEYHGRSVDEVCRRGVETGVLHRLAEGLVDLGNGYWHWKVAHLALVSKMIGEQSGTGGSSGVDFLARRVTRPFPELRRLRGKVHHA
Peptide length: 220
C-terminal modification: Linear
N-terminal modification : Not found
Non-natural peptide information: None
Activity Information
Assay type : Not specified
Assay time : Not found
Activity : Not found
Cell line : Not found
Cancer type : Not found
Other activity : Not found
Physicochemical Properties
Amino acid composition bar chart :
Molecular mass : 24347.5069 Dalton
Aliphatic index : 1.002
Instability index : 43.3673
Hydrophobicity (GRAVY) : -0.042
Isoelectric point : 6.1952
Charge (pH 7) : -3.6106
Aromaticity : 0.081
Molar extinction coefficient (cysteine, cystine): (23950, 24075)
Hydrophobic/hydrophilic ratio : 1.31578947
hydrophobic moment : 0.2203
Missing amino acid : None
Most occurring amino acid : L
Most occurring amino acid frequency : 35
Least occurring amino acid : M
Least occurring amino acid frequency : 2
Structural Information
3D structure :
Secondary structure fraction (Helix, Turn, Sheet): (0.3, 0.2, 0.3)
SMILES Notation: CC[C@H](C)[C@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CNC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](N)CCSC)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@H](C(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](C)C(=O)O)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)O
Secondary Structure :
| Method | Prediction |
|---|---|
| GOR | HHHHHHHTCTTCEEETTCHHHHHHHHHHHHHHHHHHHHHHTTHHHHHHHHHTCCTCCCTTTTTHHHHHHHHHHHEEHHHHHHHEHHTCHHHHHHHHHHHHHHHCCHHHHHHHHHEEEETCCCTTTCHHHHHHHHHHHTTTTEHHHEETTTTEEEEHHHHTTEEETTTTEEHHHHHHHHHHHEEEEEETCCTCCCEEEEHHHTTCCCCCHHHHHTTHEEEH |
| Chou-Fasman (CF) | CCCHHHHHCCEECCCHHHHHHHHHEEEEEHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHEEEEEEEEECCCEEHHHHHHEEEHHHHCCCCCCCCCCHHHHHHCHHHHCCCCCCCCCHHHHHHHHHHCCCEEHHHHCCCCEEEEEEHHHHHHEEECCCEEECCHHHHHHEEEHHHHCCEECCCCEEEEHHHHEEEECCHHHHHCCEECCCCC |
| Neural Network (NN) | HHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCEEECCCCCHHHHHHHCHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHCCC |
| Joint/Consensus | HHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHEEHHHHHHHHHHHCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCEECCCCEEEHHHHHCCEECCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCCCCCCHHHHHCCCCCCC |
Molecular Descriptors and ADMET Properties
Molecular Descriptors: Not available.
ADMET Properties: Not available.
Cross Referencing databases
Reference
1 : Watanabe K, et al. Total biosynthesis of antitumor nonribosomal peptides in Escherichia coli. Nat Chem Biol. 2006; 2:423-8. doi: 10.1038/nchembio803
Literature
Paper title : Total biosynthesis of antitumor nonribosomal peptides in Escherichia coli.
Doi : https://doi.org/10.1038/nchembio803
Abstract : Nonribosomal peptides (NRPs) are a class of microbial secondary metabolites that have a wide variety of medicinally important biological activities, such as antibiotic (vancomycin), immunosuppressive (cyclosporin A), antiviral (luzopeptin A) and antitumor (echinomycin and triostin A) activities. However, many microbes are not amenable to cultivation and require time-consuming empirical optimization of incubation conditions for mass production of desired secondary metabolites for clinical and commercial use. Therefore, a fast, simple system for heterologous production of natural products is much desired. Here we show the first example of the de novo total biosynthesis of biologically active forms of heterologous NRPs in Escherichia coli. Our system can serve not only as an effective and flexible platform for large-scale preparation of natural products from simple carbon and nitrogen sources, but also as a general tool for detailed characterizations and rapid engineering of biosynthetic pathways for microbial syntheses of novel compounds and their analogs.