dbacp01226
General Description
Peptide name : ATP-dependent Clp protease proteolytic subunit
Source/Organism : Soil-dwelling bacterium
Linear/Cyclic : Linear
Chirality : Not found
Sequence Information
Sequence : MIRPAARYVLPEFTERTATGTRTQDPYSKLLSERIVFLGSPIDDTAATDLIAQLMYLEHADPDRPLSLYINSPGGSFQAMAAVYDTMQFLTCEVETFCLGQAGSYAAALLAAGAKGRRHALPGARVVIQQPALEEPMRGQPSDLEIHARELVRTREMFAAMLVRHTGRTAEQITADIERDTILDAKAALAHGLVDHVVENR
Peptide length : 201
C-terminal modification : Linear
N-terminal modification : Not found
Non-natural peptide information : None
Activity Information
Assay type : Not specified
Assay time : Not found
Activity : Not found
Cell line : Not found
Cancer type : Not found
Other activity : Not found
Physicochemical Properties
Molecular mass : 22062.9229 Dalton
Aliphatic index : 0.904
Instability index : 34.297
Hydrophobicity (GRAVY) : -0.127
Isoelectric point : 5.5073
Charge (pH 7) : -5.9528
Aromaticity : 0.059
Molar extinction coefficient (cysteine, cystine): (8940, 9065)
Hydrophobic/hydrophilic ratio : 1.18478260
Hydrophobic moment : 0.013
Missing amino acid : W
Most occurring amino acid : A
Most occurring amino acid frequency : 29
Least occurring amino acid : N
Least occurring amino acid frequency : 2
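Several of the values above can be recomputed directly from the sequence. The record does not say which tool produced them, so the sketch below assumes the common ProtParam-style conventions: the Kyte-Doolittle scale for GRAVY, the Ikai formula for the aliphatic index (expressed here as a fraction rather than a percentage, which appears to be how this record reports it), and 1490 / 5500 / 125 M^-1 cm^-1 per Tyr / Trp / cystine for the extinction coefficient at 280 nm. All variable names are illustrative.

```python
from collections import Counter

# Sequence copied from the "Sequence" field of this record.
SEQUENCE = (
    "MIRPAARYVLPEFTERTATGTRTQDPYSKLLSERIVFLGSPIDDTAATDL"
    "IAQLMYLEHADPDRPLSLYINSPGGSFQAMAAVYDTMQFLTCEVETFCLG"
    "QAGSYAAALLAAGAKGRRHALPGARVVIQQPALEEPMRGQPSDLEIHARE"
    "LVRTREMFAAMLVRHTGRTAEQITADIERDTILDAKAALAHGLVDHVVEN"
    "R"
)

# Kyte-Doolittle hydropathy values (assumed scale for GRAVY).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

n = len(SEQUENCE)
counts = Counter(SEQUENCE)

# Composition summary: absent residues, most and least frequent residues.
missing = sorted(set(KD) - set(counts))
most_common, most_freq = counts.most_common(1)[0]
least_freq = min(counts.values())
least_common = sorted(aa for aa, c in counts.items() if c == least_freq)

# GRAVY: mean Kyte-Doolittle hydropathy over all residues.
gravy = sum(KD[aa] for aa in SEQUENCE) / n

# Aromaticity: relative frequency of Phe + Trp + Tyr.
aromaticity = (counts["F"] + counts["W"] + counts["Y"]) / n

# Aliphatic index (Ikai), as a fraction instead of a mole percentage.
aliphatic = (counts["A"] + 2.9 * counts["V"]
             + 3.9 * (counts["I"] + counts["L"])) / n

# Extinction coefficient at 280 nm: all Cys reduced, then all Cys paired.
ext_reduced = 1490 * counts["Y"] + 5500 * counts["W"]
ext_cystine = ext_reduced + 125 * (counts["C"] // 2)

print("length:", n)
print("missing:", missing)
print("most occurring:", most_common, most_freq)
print("least occurring:", least_common, least_freq)
print("GRAVY:", round(gravy, 4))
print("aromaticity:", round(aromaticity, 4))
print("aliphatic index (fraction):", round(aliphatic, 4))
print("extinction coefficient:", (ext_reduced, ext_cystine))
```

Under these assumptions the script gives a length of 201, A as the most frequent residue (29), W as the only missing residue, and an extinction coefficient pair of (8940, 9065), all matching the record. GRAVY, aromaticity and the aliphatic index come out at about -0.1279, 0.0597 and 0.9045, which agree with the listed -0.127, 0.059 and 0.904 if the record truncates rather than rounds. By this count C and N both occur twice, so the record's "N" for the least occurring amino acid looks like a tie-break.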
Structural Information
Secondary structure fraction (Helix, Turn, Sheet): (0.3, 0.2, 0.3)
SMILES Notation: CC[C@H](C)[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](C)
C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)CC
Secondary Structure :
| Method | Prediction |
|---|---|
| GOR | EHCHHHHTTCTTHHHHHTTCCCCCCCTTTTHHHEEEEECCCCCCCHHHHHHHHHHHHTTCCTCCTTEEEECCCCCCHHHHHHHHCHHHECHTHHHHEEETTTCCHHHHHHHHHHHHHHETCTTCEEEECCCCHHCHHTTCCTTHHHHHHHHHHHHHHHHHHHHHHTTCHHHHHHHHHHHHHHHHHHHHHHHTEEEEEHHHH |
| Chou-Fasman (CF) | CCCCCEEECCCCCCCEEEEEECCCCCCHHHHHEEEEECCCCCHHHHHCHHHHCHHHHHHHCCCCCEEEECCCCCCHHHHHEEEECCCEECCCCCCEEECCCCCHHHHHHHHHCCCCCCCCCCEEEEEEHHHHHHHCCCCCCHHHHHHHHHEEHHHHHHHHEEECCCCCCCEEEEHHHHEEEHHHHHHHHHEECEEEECCCC |
| Neural Network (NN) | CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCECCCCHHHHHHHHHHHHHHHHHHHCCC |
| Joint/Consensus | CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCHHHHHHHHCCCCCCCCCCCCEECCCCCCHHHHHHHHHHCCCCCCCCCCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCHHHHHHHHHHHHCCEEEECCCC |
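Each row of the table above is a per-residue string with one letter per position. A minimal sketch for summarizing such a string follows; it assumes the usual reading of the letters (H helix, E strand, T turn, C coil), which the record itself does not define. The helper names `state_fractions` and `agreement` are hypothetical, and the short input strings are toy examples, not data from this record.

```python
from collections import Counter


def state_fractions(prediction: str) -> dict[str, float]:
    """Return the fraction of residues assigned to each state letter."""
    if not prediction:
        raise ValueError("prediction string is empty")
    counts = Counter(prediction)
    return {state: counts[state] / len(prediction) for state in sorted(counts)}


def agreement(a: str, b: str) -> float:
    """Return the fraction of positions where two predictions agree."""
    if not a or len(a) != len(b):
        raise ValueError("predictions must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


# Toy strings for illustration only.
toy = "CCHHHHEECC"
print(state_fractions(toy))       # {'C': 0.4, 'E': 0.2, 'H': 0.4}
print(agreement("CCHH", "CHHH"))  # 0.75
```

Passing any full row of the table to `state_fractions` gives that method's helix, strand, turn and coil fractions, and `agreement` can compare two rows position by position, since each string is meant to have one letter per residue of the sequence.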
Molecular Descriptors and ADMET Properties
Molecular Descriptors: Not available.
ADMET Properties: Not available.
Cross Referencing databases
CancerPPD : Not available
ApIAPDB : Not available
CancerPPD2 ID : Not available
Reference
1 : Wang L, et al. Draft genome sequence of Streptomyces globisporus C-1027, which produces an antitumor antibiotic consisting of a nine-membered enediyne with a chromoprotein. J Bacteriol. 2012; 194:4144. doi: 10.1128/JB.00797-12
Literature
Paper title : Draft genome sequence of Streptomyces globisporus C-1027, which produces an antitumor antibiotic consisting of a nine-membered enediyne with a chromoprotein.
Doi : https://doi.org/10.1128/JB.00797-12
Abstract : Streptomyces globisporus C-1027 is the producer of antitumor antibiotic C-1027, a nine-membered enediyne-containing compound. Here we present a draft genome sequence of S. globisporus C-1027 containing the intact biosynthetic gene cluster for this antibiotic. The genome also carries numerous sets of genes for the biosynthesis of diverse secondary metabolites.