dbacp03389
General Description
Peptide name : Interferon gamma (IFN-gamma)
Source/Organism : Arabian camel
Linear/Cyclic : Not found
Chirality : Not found
Sequence Information
Sequence : MNYTSYILAFQLCVILGSSGCYCQAPFFDEIENLKKYFNASNPDVADGGPLFLEILKNWKEESDKKIIQSQIVSFYFKLFENLKDNQIIQRSMDIIKQDMFQKFLNGSSEKLEDFKKLIQIPVDNLKVQRKAISELIKVMNDLSPKSNLRKRKRSQNLFRGRRASK
Peptide length: 166
C-terminal modification: Not found
N-terminal modification : Not found
Non-natural peptide information: None
Activity Information
Assay type : Not specified
Assay time : Not found
Activity : Not found
Cell line : Not found
Cancer type : Not found
Other activity : Not found
Physicochemical Properties
Amino acid composition bar chart :
Molecular mass : 19402.3173 Dalton
Aliphatic index : 0.892
Instability index : 42.7169
Hydrophobicity (GRAVY) : -0.45
Isoelectric point : 9.4737
Charge (pH 7) : 8.4821
Aromaticity : 0.108
Molar extinction coefficient (cysteine, cystine): (12950, 13075)
Hydrophobic/hydrophilic ratio : 0.82417582
hydrophobic moment : -0.226
Missing amino acid : H
Most occurring amino acid : K
Most occurring amino acid frequency : 20
Least occurring amino acid : T
Least occurring amino acid frequency : 1
Structural Information
3D structure :
Secondary structure fraction (Helix, Turn, Sheet): (0.3, 0.2, 0.3)
SMILES Notation: CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)CNC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CS)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCSC)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)C(C)C)[C@@H](C)CC
Secondary Structure :
| Method | Prediction |
|---|---|
| GOR | CCCCCEEHHEEEEEEETCTTCCTTCCCHHHHHHHHHHETTTCCCCCTTCCHHHHHHHHHHHHHHHHHHHHEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTHHHHHHHHHHECCCHHHHHHHHHHHHHHHEECCCCTTTHHHHHHTTTTHHHHHHTTT |
| Chou-Fasman (CF) | EEEEEEHHHHEEEEECCCEEEHHHHCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHEEEEEEEEEEEEHHHHHHHHCEEEECCEEEEHHHHHHHHCCCCHHHHHHHHHHEEEECCHHHHHHHHHHHHEECCCCCCCCCCHHHHCCCCCCCCCCCCCCC |
| Neural Network (NN) | CCCCCCHHHHHEEEECCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCHEEEEHHEHHHHHHCCCCCCHHHCCCCHHHHCHHHHHCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCHHHHCCCCCC |
| Joint/Consensus | CCCCCCHHHHEEEEECCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCCCC |
Molecular Descriptors and ADMET Properties
Molecular Descriptors: Not available.
ADMET Properties: Not available.
Cross Referencing databases
CancerPPD : Not available
ApIAPDB : Not available
CancerPPD2 ID : Not available
Reference
1 : Elbers JP, et al. Improving Illumina assemblies with Hi-C and long reads: An example with the North African dromedary. Mol Ecol Resour. 2019; 19:1015-1026. doi: 10.1111/1755-0998.13020
2 : Nagarajan G, et al. Cloning and sequence analysis of IL-2, IL-4 and IFN-γ from Indian Dromedary camels (Camelus dromedarius). Res Vet Sci. 2012; 92:420-6. doi: 10.1016/j.rvsc.2011.03.028
Literature
Paper title : Improving Illumina assemblies with Hi-C and long reads: An example with the North African dromedary.
Doi : https://doi.org/10.1111/1755-0998.13020
Abstract : Researchers have assembled thousands of eukaryotic genomes using Illumina reads, but traditional mate-pair libraries cannot span all repetitive elements, resulting in highly fragmented assemblies. However, both chromosome conformation capture techniques, such as Hi-C and Dovetail Genomics Chicago libraries and long-read sequencing, such as Pacific Biosciences and Oxford Nanopore, help span and resolve repetitive regions and therefore improve genome assemblies. One important livestock species of arid regions that does not have a high-quality contiguous reference genome is the dromedary (Camelus dromedarius). Draft genomes exist but are highly fragmented, and a high-quality reference genome is needed to understand adaptation to desert environments and artificial selection during domestication. Dromedaries are among the last livestock species to have been domesticated, and together with wild and domestic Bactrian camels, they are the only representatives of the Camelini tribe, which highlights their evolutionary significance. Here we describe our efforts to improve the North African dromedary genome. We used Chicago and Hi-C sequencing libraries from Dovetail Genomics to resolve the order of previously assembled contigs, producing almost chromosome-level scaffolds. Remaining gaps were filled with Pacific Biosciences long reads, and then scaffolds were comparatively mapped to chromosomes. Long reads added 99.32 Mbp to the total length of the new assembly. Dovetail Chicago and Hi-C libraries increased the longest scaffold over 12-fold, from 9.71 Mbp to 124.99 Mbp and the scaffold N50 over 50-fold, from 1.48 Mbp to 75.02 Mbp. We demonstrate that Illumina de novo assemblies can be substantially upgraded by combining chromosome conformation capture and long-read sequencing.
Paper title : Cloning and sequence analysis of IL-2, IL-4 and IFN-γ from Indian Dromedary camels (Camelus dromedarius).
Doi : https://doi.org/10.1016/j.rvsc.2011.03.028
Abstract : The cDNAs of three cytokines, viz., IL-2, IL-4 and IFN-γ from Dromedary camels were amplified by PCR using Bactrian camel sequences and subsequently cloned for sequence analysis. Relationship based on amino acid sequences revealed that Dromedary camel IL-2 shared 99.5% and 99.3% identity at the nucleotide and amino acid levels with Bactrian camel IL-2. In the case of IL-4, the identity of Dromedary camel was 99.7% and 99.2% at the nucleotide and amino acid levels, respectively with that of Bactrian camel. The Dromedary camel IFN-γ shared 100% identity both at nucleotide and amino acid levels with Bactrian camel IFN-γ. Phylogenetic analysis based on amino acid sequences indicated the close relationship in these cytokine genes between the Dromedary camel and other camelids.