dbacp01593
General Description
Peptide name : Beta-galactoside-specific lectin 1
Source/Organism : European mistletoe
Linear/Cyclic : Cyclic
Chirality : L
Sequence Information
Sequence : MNGHLASRRAWVWYFLMLGQVFGATVKAETKFSYERLRLRVTHQTTGEEYFRFITLLRDYVSSGSFSNEIPLLRQSTIPVSDAQRFVLVELTNEGGDSITAAIDVTNLYVVAYQAGDQSYFLRDAPRGAETHLFTGTTRSSLPFNGSYPDLERYAGHRDQIPLGIDQLIQSVTALRFPGGSTRTQARSILILIQMISEAARFNPILWRARQYINSGASFLPDVYMLELETSWGQQSTQVQQSTDGVFNNPIRLAIPPGNFVTLTNVRDVIASLAIMLFVCGERPSSSDVRYWPLVIRPVIADDVTCSASEPTVRIVGRNGMCVDVRDDDFHDGNQIQLWPSKSNNDPNQLWTIKRDGTIRSNGSCLTTYGYTAGVYVMIFDCNTAVREATLWEIWGNGTIINPRSNLVLAASSGIKGTTLTVQTLDYTLGQGWLAGNDTAPREVTIYGFRDLCMESNGGSVWVETCVISQQNQRWALYGDGSIRPKQNQDQCLTCGRDSVSTVINIVSCSAGSSGQRWVFTNEGAILNLKNGLAMDVAQANPKLRRIIIYPATGKPNQMWLPVP
Peptide length: 564
C-terminal modification: Cyclic
N-terminal modification : Not found
Non-natural peptide information: None
Activity Information
Assay type : Not specified
Assay time : Not found
Activity : Not found
Cell line : Not found
Cancer type : Not found
Other activity : Not found
Physicochemical Properties
Amino acid composition bar chart :
Molecular mass : 62627.1644 Dalton
Aliphatic index : 0.876
Instability index : 32.802
Hydrophobicity (GRAVY) : -0.160
Isoelectric point : 6.552
Charge (pH 7) : -1.1013
Aromaticity : 0.094
Molar extinction coefficient (cysteine, cystine): (105310, 105935)
Hydrophobic/hydrophilic ratio : 1.05839416
hydrophobic moment : 0.0541
Missing amino acid : None
Most occurring amino acid : L
Most occurring amino acid frequency : 48
Least occurring amino acid : H
Least occurring amino acid frequency : 5
Structural Information
3D structure : Not Available
Secondary structure fraction (Helix, Turn, Sheet): (0.2, 0.3, 0.4)
SMILES Notation: CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CNC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CS)NC(=O)[C@H](CCSC)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)CNC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CO)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCSC)C(C)C)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@H](C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)CC)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O
Secondary Structure :
| Method | Prediction |
|---|---|
| GOR | ETTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEETTTCCHHHHHHEEEEEEEETTCCCTCCHHEEEEECCCCHHHHEEEEEEETTTCCCEEEEEEEEEEEEEEEETTCCCEEEETCTTTHHEEEEECCCETTCCTTTCCCTHHHHTTCCTCCCCCEEEEEEEEEEEEETTCCCCEHHHEEEEEEHHHHHHHHTCCHHHHHHHEECTTCCCCCTHHEEHHHHHHTEEEEEEEEEEECEECCCCEEECCTCCEEEEEEEEHHHHHHHHEEEEETCCTTTTCEEECEEEECEEEECCEEETTTCCEEEEEETTTEEEECTTTTCCTTCCEEECCTTTTCCTTTEEEEEETCCEEETTTEEEEECCETCEEEEEEHTTHHHHHHHEHHHETTCCEECCTTTEEEEEETTEEEEEEEEEEEEEEEEEEEETTCCCCCTTEEEEEEEHHHETTTTCCEEEEEEEEHHTCCHEEEETTCCECCCCCCTTEEEECCCTEEEEEEEEEEETTTTTEEEEEEHTHHHHHHHTTHEHHHHHTCTTHEEEEEECCTCCCCCEECCCC |
| Chou-Fasman (CF) | CCHHHHHHEEEEEECCCEEEEEEEEHHHHEECCHHHHEEEEEEEEHHHHEEEEEECCEEEECCCCCCCCCCCCEEEEECHHHHEEEECCCCCCCCEEECCCEEEEEEEEEECCCCCCEEEHHHHCCHHHHEEEEEEEECCCCCCCCCCHHHHCCCCCCCEEEEECCEEEEEECCCCCCEEEECCCEEEEEECEEHHHHHHCEEEEHHHHEEEECCCCCCEEEEHHHHHEEEECEEEEEEECCEEEECCCCCCCCCCCEEEEEEECEEEEECCCCEEEEECCCCCCEEEEEEEEEEEEEECCEEEECCCCEEEEEEECCCEEEEHHHHHHCCCEEEECCCCCCCCCCCCEEEECCCEEEECCCEEEEEEEEEEEEEEECCCCCEEHHHHCCCEEECCEEEECCCCEEHHHHCEEEEEEEEEEECEEEEECCCCCCCCCCCCCEEEEEECHHHHHHCCCEEEEEEEEEEECCCCCCEEECCEEECCCHHHHHEEEECCEEEEEEEEEEECCCCCCCEEEEECCCCCCCCCCCHHHHHHHHCCCCCEEEEEEECCCCCCCCCEECCC |
| Neural Network (NN) | HCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHEEECCCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHEEEECCCCCCCCCEEEEEHHHHHEEEECCCCCCECCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCHCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCHHHEEEEHHHHHCCCCCCHHHHHCHECCCCCCCCCCHHHHHHCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHEECCCCCCCCCCCECCCCCCCCCCCCCCCCCCCCCEEEEECCCCEECCCCCCCCCCCCCECCCCCCCCCCCCCEEECCCCCEECCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHECCCCCCCCCCCCHEEHHCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCEEEECCEEECECCCCCCEEEEEEEHCCCCCCEEECCCCCCCCCCCCCCEEECCCCCCCEEEEEEEECCCCCCCEEEECCCCHHHHHHCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCCC |
| Joint/Consensus | CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHEEEEECCCCCCEEEEEEEEEEEECCCCCCCCCCCCEEEECCCCCCCCEEEEECCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCEEEEEEHHHHHHHCCCCHHHHHCCEECCCCCCCCCHHHHHHCCCCCEEEEEEECCEEEECCCCCCCCCCCCCEEEEEEEEHHHHHHHHEEEECCCCCCCCCEEEEEEEEEEECCCCEECCCCCCEEEEEECCCEEECCCCCCCCCCCCEECCCCCCCCCCCCEEEECCCCCEECCCCEEEEECCEEEEEEEEECCCCHHHHHHCCCCCCCCCCCCCCCCCEECCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCEEEEEEEECCCCCCEEECCCCCCCCCCCCCCEEECCCCCEEEEEEEEEECCCCCCEEEEECCCCHHHHHHCCHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCC |
Molecular Descriptors and ADMET Properties
Molecular Descriptors: Not available.
ADMET Properties: Not available.
Cross Referencing databases
Reference
1 : Mikeska R, et al. Mistletoe lectin I in complex with galactose and lactose reveals distinct sugar-binding properties. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2005; 61:17-25. doi: 10.1107/S1744309104031501
2 : Dietrich JB, et al. Identity of the N-terminal sequences of the three A chains of mistletoe (Viscum album L.) lectins: homology with ricin-like plant toxins and single-chain ribosome-inhibiting proteins. Anticancer Drugs. 1992; 3:507-11. doi: 10.1097/00001813-199210000-00010
3 : Soler MH, et al. Complete amino acid sequence of the B chain of mistletoe lectin I. Biochem Biophys Res Commun. 1998; 246:596-601. doi: 10.1006/bbrc.1998.8670
4 : Krauspenhaar R, et al. Crystallisation under microgravity of mistletoe lectin I from Viscum album with adenine monophosphate and the crystal structure at 1.9 A resolution. Acta Crystallogr D Biol Crystallogr. 2002; 58:1704-7. doi: 10.1107/s0907444902014270
5 : Niwa H, et al. Crystal structure at 3 A of mistletoe lectin I, a dimeric type-II ribosome-inactivating protein, complexed with galactose. Eur J Biochem. 2003; 270:2739-49. doi: 10.1046/j.1432-1033.2003.03646.x
6 : Mishra V, et al. Purification and characterization of four isoforms of Himalayan mistletoe ribosome-inactivating protein from Viscum album having unique sugar affinity. Arch Biochem Biophys. 2004; 423:288-301. doi: 10.1016/j.abb.2003.12.033
7 : Kourmanova AG, et al. Cloning and characterization of the genes encoding toxic lectins in mistletoe (Viscum album L). Eur J Biochem. 2004; 271:2350-60. doi: 10.1111/j.1432-1033.2004.04153.x
8 : Huguet Soler M, et al. Complete amino acid sequence of the A chain of mistletoe lectin I. FEBS Lett. 1996; 399:153-7. doi: 10.1016/s0014-5793(96)01309-9
9 : Eschenburg S, et al. Primary structure and molecular modeling of mistletoe lectin I from Viscum album. Biochem Biophys Res Commun. 1998; 247:367-72. doi: 10.1006/bbrc.1998.8760
Literature
Paper title : Mistletoe lectin I in complex with galactose and lactose reveals distinct sugar-binding properties.
Doi : https://doi.org/10.1107/S1744309104031501
Abstract : The structures of mistletoe lectin I (ML-I) from Viscum album complexed with lactose and galactose have been determined at 2.3 A resolution and refined to R factors of 20.9% (Rfree = 23.6%) and 20.9 (Rfree = 24.6%), respectively. ML-I is a heterodimer and belongs to the class of ribosome-inactivating proteins of type II, which consist of two chains. The A-chain has rRNA N-glycosidase activity and irreversibly inhibits eukaryotic ribosomes. The B-chain is a lectin and preferentially binds to galactose-terminated glycolipids and glycoproteins on cell membranes. Saccharide binding is performed by two binding sites in subdomains alpha1 and gamma2 of the ML-I B-chain separated by approximately 62 A from each other. The favoured binding of galactose in subdomain alpha1 is achieved via hydrogen bonds connecting the 4-hydroxyl and 3-hydroxyl groups of the sugar moiety with the side chains of Asp23B, Gln36B and Lys41B and the main chain of 26B. The aromatic ring of Trp38B on top of the preferred binding pocket supports van der Waals packing of the apolar face of galactose and stabilizes the sugar-lectin complex. In the galactose-binding site II of subdomain gamma2, Tyr249B provides the hydrophobic stacking and the side chains of Asp235B, Gln238B and Asn256B are hydrogen-bonding partners for galactose. In the case of the galactose-binding site I, the 2-hydroxyl group also stabilizes the sugar-protein complex, an interaction thus far rarely detected in galactose-specific lectins. Finally, a potential third low-affinity galactose-binding site in subunit beta1 was identified in the present ML-I structures, in which a glycerol molecule from the cryoprotectant buffer has bound, mimicking the sugar compound.
Paper title : Identity of the N-terminal sequences of the three A chains of mistletoe (Viscum album L.) lectins: homology with ricin-like plant toxins and single-chain ribosome-inhibiting proteins.
Doi : https://doi.org/10.1097/00001813-199210000-00010
Abstract : Mistletoe lectin (ML) I increases the production of cytokines by mononuclear cells and has been proposed as a useful biological response modifier in the treatment of cancer. Two other lectins, ML II and ML III, have been identified in mistletoe. We report that the N-terminal sequences of the three A chains of ML I, ML II and ML III are identical, and have interesting homology with the N-terminal sequences of the A chain of ricin-like toxins and of single-chain ribosome-inhibiting proteins. In addition, the three mistletoe lectins inhibit the growth of the human tumor cell line Molt 4, ML III being the most potent. followed by ML II and ML I. This inhibition is suppressed by addition of rabbit anti-ML I antibodies to the cultured cells. The data obtained suggest that the three lectins have amino acid sequences which show extensive homology and exert very similar biological effects. They may be derived from the same precursor.
Paper title : Complete amino acid sequence of the B chain of mistletoe lectin I.
Doi : https://doi.org/10.1006/bbrc.1998.8670
Abstract : The primary structure of the B chain of mistletoe lectin I, the component of a commercially available extract from Viscum album exhibiting immunomodulatory capacity, was established based on amino acid sequence analysis of the protein and peptides derived from its enzymatic digestion. It is composed of 264 residues, including seven cysteine residues and three N-linked carbohydrate chains. The amino acid sequence of MLB shows a high homology with those from other structurally related galactoside-specific lectins such as ricin and abrin with 169 and 146 identities, respectively. These results are of crucial importance in order to understand the biological activity of ML-1.
Paper title : Crystallisation under microgravity of mistletoe lectin I from Viscum album with adenine monophosphate and the crystal structure at 1.9 A resolution.
Doi : https://doi.org/10.1107/s0907444902014270
Abstract : The crystal structure of the ribosome-inactivating protein (RIP) mistletoe lectin I (ML-I) from Viscum album in complex with adenine has been refined to 1.9 A resolution. High quality crystals of the ML-I complex were obtained by the method of vapour diffusion using the high density protein crystal growth system (HDPCG) on the international space station, mission ISS 6A. Hexagonal crystals were grown during three months under microgravity conditions. Diffraction data to 1.9A were collected applying synchrotron radiation and cryo- techniques. The structure was refined subsequently to analyse the structure of ML-I and particularly the active site conformation, complexed by adenine that mimics the RNA substrate binding.
Paper title : Crystal structure at 3 A of mistletoe lectin I, a dimeric type-II ribosome-inactivating protein, complexed with galactose.
Doi : https://doi.org/10.1046/j.1432-1033.2003.03646.x
Abstract : The X-ray structure of mistletoe lectin I (MLI), a type-II ribosome-inactivating protein (RIP), cocrystallized with galactose is described. The model was refined at 3.0 A resolution to an R-factor of 19.9% using 21 899 reflections, with Rfree 24.0%. MLI forms a homodimer (A-B)2 in the crystal, as it does in solution at high concentration. The dimer is formed through contacts between the N-terminal domains of two B-chains involving weak polar and non-polar interactions. Consequently, the overall arrangement of sugar-binding sites in MLI differs from those in monomeric type-II RIPs: two N-terminal sugar-binding sites are 15 A apart on one side of the dimer, and two C-terminal sugar-binding sites are 87 A apart on the other side. Galactose binding is achieved by common hydrogen bonds for the two binding sites via hydroxy groups 3-OH and 4-OH and hydrophobic contact by an aromatic ring. In addition, at the N-terminal site 2-OH forms hydrogen bonds with Asp27 and Lys41, and at the C-terminal site 3-OH and 6-OH undergo water-mediated interactions and C5 has a hydrophobic contact. MLI is a galactose-specific lectin and shows little affinity for N-acetylgalactosamine. The reason for this is discussed. Structural differences among the RIPs investigated in this study (their quaternary structures, location of sugar-binding sites, and fine sugar specificities of their B-chains, which could have diverged through evolution from a two-domain protein) may affect the binding sites, and consequently the cellular transport processes and biological responses of these toxins.
Paper title : Purification and characterization of four isoforms of Himalayan mistletoe ribosome-inactivating protein from Viscum album having unique sugar affinity.
Doi : https://doi.org/10.1016/j.abb.2003.12.033
Abstract : Ribosome-inactivating proteins having antitumor and immunomodulatory properties constitute the active principle of widely used mistletoe therapy in Europe. This is the first report of the four isoforms of Himalayan mistletoe ribosome-inactivating proteins (HmRips) from Viscum album parasitized on wild apple inhabiting NW Himalayas. HmRips were purified by affinity chromatography and four isoforms were separated by ion-exchange chromatography. HmRip 1, 2, 3, and 4 have isoelectric points of 6.6, 6.1, 5.2, and 4.7, respectively. Disulfide linked toxin and lectin subunits of HmRip 1 and 2 isoforms have molecular weights of 28 and 34kDa while those of HmRip 3 and 4 have 28 and 32kDa. The isoforms lacked blood group specificity and showed positive activity with seven mammalian erythrocyte types but did not show any activity with avian erythrocyte type. Lectin activity of HmRips remained unchanged for a wide range of temperatures (0-65 degrees C) and pH (3-9). Unlike other type II Rips, the HmRip 1, 2, and 4 showed unique affinity towards l-rhamnose, meso-inositol, and l-arabinose while HmRip 3 has specificity to gal/galNAc. Sugar binding studies with 22 sugars also suggested that the C-4 hydroxyl of galactose might be the critical site involved in sugar binding of HmRips. Type II Rips are known to be galactoside specific and do not have affinity for l-rhamnose and meso-inositol. However, HmRip 1, 2, and 4 having equal affinity for galactose and l-rhamnose do not strictly fit into any of the four structural classes of the lectins and represent a new class of type II Rips and plant lectins.
Paper title : Cloning and characterization of the genes encoding toxic lectins in mistletoe (Viscum album L).
Doi : https://doi.org/10.1111/j.1432-1033.2004.04153.x
Abstract : Leaves of mistletoe (Viscum album L) contain three toxic lectins (type 2 ribosome-inactivating proteins) MLI, MLII, and MLIII, differing in molecular mass and carbohydrate specificity. Clones, containing sequences of three gene variants designated ml1p, ml2p, and ml3p, were obtained using PCR amplification from cDNA and from mistletoe genomic DNA. The quantitative ratio of the ml1p, ml2p, and ml3p genes in genomic DNA was found to be 1.5 : 1 : 4, respectively, whereas the ratio of their mRNA was 50 : 10 : 1. The quantitative prevalence of the ml1p transcript correlates well with the observation that MLI is quantitatively dominant over MLII and MLIII in the mistletoe extract. The sequences of the proteins encoded by the ml1p, ml2p, and ml3p genes are identical to MLI by 98, 88, and 77%, respectively. The similarity to MLI of the amino acid sequence encoded by the gene ml1p, the quantitative prevalent of its mRNA, as well as structural properties of the B-chain indicate that the gene, ml1p, corresponds to MLI. Western blot analysis of recombinant A-chains encoded by the three variants of mlp genes with the monoclonal antibody MNA4 having differential affinity to MLI, MLII and MLIII A-chains suggests that the ml2p and ml3p genes correspond to MLII and MLIII, respectively. Structural differences in the carbohydrate-binding sites of the B-subunits of ML1p, ML2p, and ML3p probably explain the difference in sugar specificity of MLI, MLII and MLIII.
Paper title : Complete amino acid sequence of the A chain of mistletoe lectin I.
Doi : https://doi.org/10.1016/s0014-5793(96)01309-9
Abstract : The complete amino acid sequence of the A chain of mistletoe lectin I was determined via Edman degradation sequencing of the N-terminus and tryptic and endoproteinase Asp-N overlapping fragments, amino acid analysis and MALDI-MS. The data obtained show a great homology with the chains of ribosome-inactivating proteins such as ricin and abrin with 111 (abrin-a) and 103 (ricin-D) amino acid residues conserved, respectively. The knowledge of the primary structure of MLA will have a fundamental impact on elucidating the biological function of medically applied mistletoe lectins on a molecular basis.
Paper title : Primary structure and molecular modeling of mistletoe lectin I from Viscum album.
Doi : https://doi.org/10.1006/bbrc.1998.8760
Abstract : The first three-dimensional structure of the ribosome inactivating protein mistletoe lectin I (ML-I) from Viscum album has been modeled on the basis of the X-ray structure of castor bean ricin from Ricinus communis. The relative high sequence homology and conserved secondary structure enabled accurate modeling. The 196 sequence changes between ML-I and ricin could be accomodated with only little pertubation in the main chain folding. A close comparison of the primary structures of ML-I and ricin is given and the effects of the sequence changes are elucidated on the basis of the modeled three-dimensional structure. Differences have been identified in the vicinity of the active site, in the high affinity galactose binding site and in the interface between the A and B chains, which might account for the reduced cytotoxicity of ML-I.