dbACP: A Comprehensive Database of Anti-Cancer Peptides

dbacp01227

General Description

Peptide name : ATP-dependent Clp protease proteolytic subunit

Source/Organism : Soil-dwelling bacterium (Streptomyces globisporus C-1027, per the cited reference)

Linear/Cyclic : Linear

Chirality : Not found

Sequence Information

Sequence : MPYAAGEPSLGGGLGDQVYSRLLGERIIFLGQQVDDDIANKITAQLLLLAADPDKDIYLYINSPGGSVTAGMAVYDTMQYIPNDVVTIGMGMAASMGQFLLTGGASGKRFALPNTDILMHQGSAGIGGTASDIKIQAEYLLRTKTRMAEITAHHSGQTVETIIRDGDRDRWYTAEEAKDYGLIDEIITFASGIPGGGGTGA

Peptide length : 201

C-terminal modification : Linear

N-terminal modification : Not found

Non-natural peptide information: None
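
The sequence and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the dbACP record: the names `SEQUENCE`, `STANDARD` and `check_sequence` are illustrative assumptions, and the sequence is copied from the Sequence field in 50-residue chunks.

```python
# Sequence copied from the record's "Sequence" field, split into 50-residue chunks.
SEQUENCE = (
    "MPYAAGEPSLGGGLGDQVYSRLLGERIIFLGQQVDDDIANKITAQLLLLA"
    "ADPDKDIYLYINSPGGSVTAGMAVYDTMQYIPNDVVTIGMGMAASMGQFL"
    "LTGGASGKRFALPNTDILMHQGSAGIGGTASDIKIQAEYLLRTKTRMAEI"
    "TAHHSGQTVETIIRDGDRDRWYTAEEAKDYGLIDEIITFASGIPGGGGTG"
    "A"
)

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def check_sequence(seq):
    """Return (length, set of letters that are not standard amino acids)."""
    unexpected = set(seq) - STANDARD
    return len(seq), unexpected


length, unexpected = check_sequence(SEQUENCE)
print("length:", length)
print("non-standard letters:", sorted(unexpected))
```

An empty set of non-standard letters is consistent with "Non-natural peptide information: None", and the computed length can be compared directly with the Peptide length field.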

Activity Information

Assay type : Not specified

Assay time : Not found

Activity : Not found

Cell line : Not found

Cancer type : Not found

Other activity : Not found

Physicochemical Properties

Molecular mass : 21206.7202 Dalton

Aliphatic index : 0.904

Instability index : 28.0597

Hydrophobicity (GRAVY) : -0.043

Isoelectric point : 4.5816

Charge (pH 7) : -10.2128

Aromaticity : 0.069

Molar extinction coefficient (cysteine, cystine): (18910, 18910)

Hydrophobic/hydrophilic ratio : 1.28409090

Hydrophobic moment : -0.208

Missing amino acid : C

Most occurring amino acid : G

Most occurring amino acid frequency : 29

Least occurring amino acid : W

Least occurring amino acid frequency : 1
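
Several of the values in this section follow directly from the amino acid composition, so they can be recomputed as a sanity check. The sketch below is an assumption about how such values are commonly derived (Kyte-Doolittle hydropathy for GRAVY, the Pace coefficients used by ProtParam for the extinction coefficient, Ikai's formula for the aliphatic index); the record does not state which tool dbACP used, and all variable names are illustrative.

```python
from collections import Counter

# Sequence copied from the record's "Sequence" field.
SEQUENCE = (
    "MPYAAGEPSLGGGLGDQVYSRLLGERIIFLGQQVDDDIANKITAQLLLLA"
    "ADPDKDIYLYINSPGGSVTAGMAVYDTMQYIPNDVVTIGMGMAASMGQFL"
    "LTGGASGKRFALPNTDILMHQGSAGIGGTASDIKIQAEYLLRTKTRMAEI"
    "TAHHSGQTVETIIRDGDRDRWYTAEEAKDYGLIDEIITFASGIPGGGGTG"
    "A"
)

# Kyte-Doolittle hydropathy scale.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

counts = Counter(SEQUENCE)
n = len(SEQUENCE)

# Composition summary: absent residues, most and least frequent residues.
missing = sorted(set(KD) - set(counts))
most_aa, most_n = counts.most_common(1)[0]
least_aa, least_n = min(counts.items(), key=lambda kv: kv[1])

# GRAVY: mean Kyte-Doolittle hydropathy over all residues.
gravy = sum(KD[aa] for aa in SEQUENCE) / n

# Aromaticity: relative frequency of Phe + Trp + Tyr.
aromaticity = (counts["F"] + counts["W"] + counts["Y"]) / n

# Molar extinction coefficient at 280 nm (Tyr 1490, Trp 5500, cystine 125).
ext_reduced = 1490 * counts["Y"] + 5500 * counts["W"]
ext_cystine = ext_reduced + 125 * (counts["C"] // 2)

# Aliphatic index (Ikai), using mole percent of Ala, Val, Ile and Leu.
aliphatic_index = 100 * (
    counts["A"] + 2.9 * counts["V"] + 3.9 * (counts["I"] + counts["L"])
) / n

print("missing:", missing)
print("most frequent:", most_aa, most_n)
print("least frequent:", least_aa, least_n)
print("GRAVY: %.3f" % gravy)
print("aromaticity: %.4f" % aromaticity)
print("extinction coefficients:", (ext_reduced, ext_cystine))
print("aliphatic index: %.2f" % aliphatic_index)
```

With Ikai's formula in mole percent the aliphatic index comes out on a 0 to 100+ scale, so the listed value of 0.904 appears to be the same quantity expressed as a fraction. Because the sequence contains no cysteine, the reduced and cystine extinction coefficients coincide, which matches the identical pair in the record.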

Structural Information

Secondary structure fraction (Helix, Turn, Sheet): (0.3, 0.3, 0.3)

SMILES Notation: CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)CNC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCSC)NC(=O)CNC(=O)[C@H](CCSC)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCSC)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)CNC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccc(O)
cc1)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCSC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)C(=O)NCC(=O)NCC(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@H](C(=O)NCC(=O)N[C@@H](C)C(=O)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O

Secondary Structure :

Method Prediction
GOR CCCCTCCCCETTCCEEEEEETTTTHEEEEETCCCCCHHHHHHHHHHHHHHCCTTTCEEEEEECCTCCEEEEEEEEECCECCCCCCEEEEEHHHHHHHHEEEETCCTTEEETCTTCHHEEEETCTCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHTTCEEEEEEETTCTTHHHHHHHHHHTTCHHHEEEEEECCCTCCCCEE
Chou-Fasman (CF) CHHHHCCCCCCCCCEEEEEHHHHHEEEEECEEHHHHHHHEEEHHHHHHHHHCCCEEEEEECCCCEEEECCCEEEECEEEECCEEEEEECHHHHHCCCCEEECCCCCCHHHHCCEEHHHHCCCEEEECCCCCEEHHHHHEECEEHHHHHHCCCCCEEEEEEEEECCCCCEEEHHHHHHCEEECCCEEEECEECCCCCCCCCC
Neural Network (NN) CCCCCCCCCCCCCCCCCEEEHHCCCCEEECCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCEEEEEEEECCCCCCCCCCEEEECCHHHHCHHEEECCCCCCCCECCCCCCHHHHCCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHCCCCCCEEEEEECCCCCCCHHHCCCCCCCCCEEEEEECCCCCCCCCCCC
Joint/Consensus CCCCCCCCCCCCCCEEEEEECCCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCEEEEEEEECCCCCCCCCCEEEECHHHHHCCCEEECCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCHHHHHHCCCCCCCEEEEEEECCCCCCCCCC
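
Each prediction above is a per-residue string of states (H for helix, E for strand, C for coil, and T for turn in the GOR output). A string like this can be summarised as state fractions with the sketch below. The function name `state_fractions` and the short example string are hypothetical, and this is not necessarily how the "Secondary structure fraction" field of the record was derived.

```python
from collections import Counter


def state_fractions(prediction):
    """Return the fraction of residues assigned to each state in a prediction string."""
    if not prediction:
        raise ValueError("prediction string is empty")
    counts = Counter(prediction)
    total = len(prediction)
    return {state: counts[state] / total for state in sorted(counts)}


# Hypothetical 10-residue prediction, used only to illustrate the call.
fractions = state_fractions("CCHHHHEETC")
for state, value in fractions.items():
    print(state, round(value, 2))
```

To apply it to this entry, pass one of the 201-character prediction strings from the table in place of the toy example.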

Molecular Descriptors and ADMET Properties

Molecular Descriptors: Not available.

ADMET Properties: Not available.

Cross Referencing databases

Pubmed Id : 22815456

Uniprot : Not available

PDB : Not available

CancerPPD : Not available

ApIAPDB : Not available

CancerPPD2 ID : Not available

Reference

1 : Wang L, et al. Draft genome sequence of Streptomyces globisporus C-1027, which produces an antitumor antibiotic consisting of a nine-membered enediyne with a chromoprotein. J Bacteriol. 2012; 194:4144. doi: 10.1128/JB.00797-12

Literature

Paper title : Draft genome sequence of Streptomyces globisporus C-1027, which produces an antitumor antibiotic consisting of a nine-membered enediyne with a chromoprotein.

Doi : https://doi.org/10.1128/JB.00797-12

Abstract : Streptomyces globisporus C-1027 is the producer of antitumor antibiotic C-1027, a nine-membered enediyne-containing compound. Here we present a draft genome sequence of S. globisporus C-1027 containing the intact biosynthetic gene cluster for this antibiotic. The genome also carries numerous sets of genes for the biosynthesis of diverse secondary metabolites.