dbacp03455
General Description
Peptide name : Interferon gamma (IFN-gamma)
Source/Organism : Chinese hamster
Linear/Cyclic : Not found
Chirality : Not found
Sequence Information
Sequence : MSGCYCQRTLIEEIENLKKYFNSSSEDVGNGGDLVFNTLMNWQKDGDTKIIQSQIVSFYFKLFEALKDNQAIQRSIDTIKADLFANFFNSSMEKLNDFVRITKIPVNDVQVQRKAVNELISVMPLLSPKLSLRKRKRSRCCFGGGNRPNKNNLASTI
Peptide length: 157
C-terminal modification: Not found
N-terminal modification : Not found
Non-natural peptide information: None
Activity Information
Assay type : Not specified
Assay time : Not found
Activity : Not found
Cell line : Not found
Cancer type : Not found
Other activity : Not found
Physicochemical Properties
Amino acid composition bar chart :
Molecular mass : 17910.3999 Dalton
Aliphatic index : 0.850
Instability index : 42.8701
Hydrophobicity (GRAVY) : -0.382
Isoelectric point : 9.3639
Charge (pH 7) : 6.4734
Aromaticity : 0.089
Molar extinction coefficient (cysteine, cystine): (9970, 10220)
Hydrophobic/hydrophilic ratio : 0.84705882
hydrophobic moment : -0.018
Missing amino acid : H
Most occurring amino acid : N
Most occurring amino acid frequency : 15
Least occurring amino acid : W
Least occurring amino acid frequency : 1
Structural Information
3D structure :
Secondary structure fraction (Helix, Turn, Sheet): (0.2, 0.3, 0.3)
SMILES Notation: CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)CNC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CS)NC(=O)[C@H](CS)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CS)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H](N)CCSC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(=O)O
Secondary Structure :
| Method | Prediction |
|---|---|
| GOR | ETTCCTTTTHHHHHHHHHHHETTTTTETTTTCCEEEEHHHHHHTTTCCEEEEHEHHHHHHHHHHHHHHHHHHHHEEHHHHHHHHHHHHHTHHHHHHHHHEEEECCCCCHHHHHHHHHHHEEECCCCCTTHHHHHTTTTEEEETTCCCTTTCTTTEEE |
| Chou-Fasman (CF) | CEEEEEEEEHHHHHHHHHCCCCCCCCCCCCCEEEEEHHHHHHHCCCEEEEEEEEEEEEEHHHHHHHHHHHHEEEEEEEHHHHHHHCCCCHHHHHHEEEEEEEEECCCEEEHHHHHHHHEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCCEECCC |
| Neural Network (NN) | CCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEHHHHCCCCCCCCCCEEEEEEEHHHHHHHHHHHCCCCHHCCCCCCHHHHHHHHHHCCCCHHCCCCEECCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCEEECCCCCCCCCCCCCCEE |
| Joint/Consensus | CCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEHHHHHHHCCCCCEEEEEEEEHHHHHHHHHHHHHHHHHCEECCHHHHHHHHHHHCHHHHHCCCCEEEECCCCCHHHHHHHHHHHEEECCCCCCCHHHHHCCCCEEECCCCCCCCCCCCCCEE |
Molecular Descriptors and ADMET Properties
Molecular Descriptors: Not available.
ADMET Properties: Not available.
Cross Referencing databases
CancerPPD : Not available
ApIAPDB : Not available
CancerPPD2 ID : Not available
Reference
1 : Xu X, et al. The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line. Nat Biotechnol. 2011; 29:735-41. doi: 10.1038/nbt.1932
Literature
Paper title : The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line.
Doi : https://doi.org/10.1038/nbt.1932
Abstract : Chinese hamster ovary (CHO)-derived cell lines are the preferred host cells for the production of therapeutic proteins. Here we present a draft genomic sequence of the CHO-K1 ancestral cell line. The assembly comprises 2.45 Gb of genomic sequence, with 24,383 predicted genes. We associate most of the assembled scaffolds with 21 chromosomes isolated by microfluidics to identify chromosomal locations of genes. Furthermore, we investigate genes involved in glycosylation, which affect therapeutic protein quality, and viral susceptibility genes, which are relevant to cell engineering and regulatory concerns. Homologs of most human glycosylation-associated genes are present in the CHO-K1 genome, although 141 of these homologs are not expressed under exponential growth conditions. Many important viral entry genes are also present in the genome but not expressed, which may explain the unusual viral resistance property of CHO cell lines. We discuss how the availability of this genome sequence may facilitate genome-scale science for the optimization of biopharmaceutical protein production.