dbACP: A Comprehensive Database of Anti-Cancer Peptides

dbacp03455

General Description

Peptide name : Interferon gamma (IFN-gamma)

Source/Organism : Chinese hamster

Linear/Cyclic : Not found

Chirality : Not found

Sequence Information

Sequence : MSGCYCQRTLIEEIENLKKYFNSSSEDVGNGGDLVFNTLMNWQKDGDTKIIQSQIVSFYFKLFEALKDNQAIQRSIDTIKADLFANFFNSSMEKLNDFVRITKIPVNDVQVQRKAVNELISVMPLLSPKLSLRKRKRSRCCFGGGNRPNKNNLASTI

Peptide length: 157

C-terminal modification: Not found

N-terminal modification : Not found

Non-natural peptide information: None

Activity Information

Assay type : Not specified

Assay time : Not found

Activity : Not found

Cell line : Not found

Cancer type : Not found

Other activity : Not found

Physicochemical Properties

Amino acid composition bar chart :

Molecular mass : 17910.3999 Dalton

Aliphatic index : 0.850

Instability index : 42.8701

Hydrophobicity (GRAVY) : -0.382

Isoelectric point : 9.3639

Charge (pH 7) : 6.4734

Aromaticity : 0.089

Molar extinction coefficient (cysteine, cystine): (9970, 10220)

Hydrophobic/hydrophilic ratio : 0.84705882

hydrophobic moment : -0.018

Missing amino acid : H

Most occurring amino acid : N

Most occurring amino acid frequency : 15

Least occurring amino acid : W

Least occurring amino acid frequency : 1

Structural Information

3D structure :

Secondary structure fraction (Helix, Turn, Sheet): (0.2, 0.3, 0.3)

SMILES Notation: CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)CNC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CS)NC(=O)[C@H](CS)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CS)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H](N)CCSC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(=O)O

Secondary Structure :

Method Prediction
GOR ETTCCTTTTHHHHHHHHHHHETTTTTETTTTCCEEEEHHHHHHTTTCCEEEEHEHHHHHHHHHHHHHHHHHHHHEEHHHHHHHHHHHHHTHHHHHHHHHEEEECCCCCHHHHHHHHHHHEEECCCCCTTHHHHHTTTTEEEETTCCCTTTCTTTEEE
Chou-Fasman (CF) CEEEEEEEEHHHHHHHHHCCCCCCCCCCCCCEEEEEHHHHHHHCCCEEEEEEEEEEEEEHHHHHHHHHHHHEEEEEEEHHHHHHHCCCCHHHHHHEEEEEEEEECCCEEEHHHHHHHHEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCCEECCC
Neural Network (NN) CCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEHHHHCCCCCCCCCCEEEEEEEHHHHHHHHHHHCCCCHHCCCCCCHHHHHHHHHHCCCCHHCCCCEECCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCEEECCCCCCCCCCCCCCEE
Joint/Consensus CCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEHHHHHHHCCCCCEEEEEEEEHHHHHHHHHHHHHHHHHCEECCHHHHHHHHHHHCHHHHHCCCCEEEECCCCCHHHHHHHHHHHEEECCCCCCCHHHHHCCCCEEECCCCCCCCCCCCCCEE

Molecular Descriptors and ADMET Properties

Molecular Descriptors: Not available.

ADMET Properties: Not available.

Cross Referencing databases

Pubmed Id : 21804562

Uniprot : Not available

PDB : Not available

CancerPPD : Not available

ApIAPDB : Not available

CancerPPD2 ID : Not available

Reference

1 : Xu X, et al. The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line. Nat Biotechnol. 2011; 29:735-41. doi: 10.1038/nbt.1932

Literature

Paper title : The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line.

Doi : https://doi.org/10.1038/nbt.1932

Abstract : Chinese hamster ovary (CHO)-derived cell lines are the preferred host cells for the production of therapeutic proteins. Here we present a draft genomic sequence of the CHO-K1 ancestral cell line. The assembly comprises 2.45 Gb of genomic sequence, with 24,383 predicted genes. We associate most of the assembled scaffolds with 21 chromosomes isolated by microfluidics to identify chromosomal locations of genes. Furthermore, we investigate genes involved in glycosylation, which affect therapeutic protein quality, and viral susceptibility genes, which are relevant to cell engineering and regulatory concerns. Homologs of most human glycosylation-associated genes are present in the CHO-K1 genome, although 141 of these homologs are not expressed under exponential growth conditions. Many important viral entry genes are also present in the genome but not expressed, which may explain the unusual viral resistance property of CHO cell lines. We discuss how the availability of this genome sequence may facilitate genome-scale science for the optimization of biopharmaceutical protein production.