dbacp03515
General Description
Peptide name : Kallikrein-5
Source/Organism : Human
Linear/Cyclic : Linear
Chirality : Not found
Sequence Information
Sequence : MATARPPWMWVLCALITALLLGVTEHVLANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDDSSSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLSPVYESGQQMFQGVKSIPHPGYSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCLVSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGPVVCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS
Peptide length: 293
C-terminal modification: Linear
N-terminal modification : Not found
Non-natural peptide information: None
Activity Information
Assay type : Not specified
Assay time : Not found
Activity : Not found
Cell line : Not found
Cancer type : Not found
Other activity : Not found
Physicochemical Properties
Amino acid composition bar chart :
Molecular mass : 32020.1275 Dalton
Aliphatic index : 0.765
Instability index : 43.7696
Hydrophobicity (GRAVY) : -0.323
Isoelectric point : 8.639
Charge (pH 7) : 6.3346
Aromaticity : 0.064
Molar extinction coefficient (cysteine, cystine): (48930, 49805)
Hydrophobic/hydrophilic ratio : 1.10791366
hydrophobic moment : 0.0111
Missing amino acid : None
Most occurring amino acid : S
Most occurring amino acid frequency : 27
Least occurring amino acid : E
Least occurring amino acid frequency : 5
Structural Information
3D structure :
Secondary structure fraction (Helix, Turn, Sheet): (0.2, 0.3, 0.3)
SMILES Notation: CC[C@H](C)[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CS)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CS)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1C(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CCSC)NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(=N)N)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(=N)N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1ccccc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCCNC(=N)N)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)NCC(=O)NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(=N)N)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@H](C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)CC)[C@@H](C)O)[C@@H](C)O)C(C)C)C(C)C)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)CC)C(C)C)[C@@H](C)CC)C(C)C)C(C)C)[C@@H](C)O)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)CC)C(C)C)[C@@H](C)O)[C@@H](C)CC
Secondary Structure :
| Method | Prediction |
|---|---|
| GOR | HHCCCCCCEEHHHHHHHHHEETEEEEEETTTTTEECCTTCCCCCCCCCETTCCCCHHHHCTTTTEEEETTCECECCCCCCHHHHHHCTTTEEEEEEEECCTHHHHHHHHHHHHHEEEETCETECCEEETTHHEHEEEECCCCTTEECTTCCTTHHHHHHHHHHCCCCCCCEEEEEECCCTTTCEEEEETCCCETCCCECCCTEEEEEEEEEETTTTHTTTCCTCCCCHHHHHTCCTCCETEECCTTCEEEEETTEEEEEEETCCTTCCTTCTTCEEEHHHHHHHHHHHHHHTT |
| Chou-Fasman (CF) | CCCCCCEEEEECCEEECCCEEEHHHHHHHCCEECCCCCEEEECCCCCCCCHHHHHHHHCCCCCEEEEECCCHHHHCCCHHHHHHHCCCEEEEEEEEEECCCCHHHHHHHHHEEEEECCCEECEEEECCCHHHHEEEECCCCCCCCCCCCCCCCCHHHHHEEEECCCCCCCEEEECCCCCCCEEEEEECEEEECCEEEECEECCCCEEEEEEHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCCEEEEECCCCCEEEECCCCCCCCCCCEEEEEECCCCEEECEEEECCCCC |
| Neural Network (NN) | CCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHHCCCCCCCCHHHHCHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCEECCCCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCHCCCCCCCCCCCCCCCCCCEEECCCCCEEECCCCCCCCCCCCCCCEEECCCCCCCCHHHHCCCC |
| Joint/Consensus | CCCCCCCCEEHHHHHHHHHEECHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCHHHHHHCCCCEEEEEEEECCCHHHHHHHHHHHHHEEEECCEEECCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCEEEECCCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCEEEECCCCEEEECCCCCCCCCCCCCCCEEECCCCCCCCHHHHCCCC |
Molecular Descriptors and ADMET Properties
Molecular Descriptors: Not available.
ADMET Properties: Not available.
Cross Referencing databases
Pubmed Id : 10514489 10652563 11054574 15361712 28224083 12975309 15057824 15489334 19194479 19190773 17881000
Uniprot : Click here
PDB : Not available
CancerPPD : Not available
ApIAPDB : Not available
CancerPPD2 ID : Not available
Reference
1 : Kurlender L, et al. Differential expression of a human kallikrein 5 (KLK5) splice variant in ovarian and prostate cancer. Tumour Biol. 2004; 25:149-56. doi: 10.1159/000079147
2 : Grimwood J, et al. The DNA sequence and biology of human chromosome 19. Nature. 2004; 428:529-35. doi: 10.1038/nature02399
3 : Debela M, et al. Structural basis of the zinc inhibition of human tissue kallikrein 5. J Mol Biol. 2007; 373:1017-31. doi: 10.1016/j.jmb.2007.08.042
4 : Meyer-Hoffert U, et al. Identification of lympho-epithelial Kazal-type inhibitor 2 in human skin as a kallikrein-related peptidase 5-specific protease inhibitor. PLoS One. 2009; 4:e4372. doi: 10.1371/journal.pone.0004372
5 : Brattsand M and Egelrud T. Purification, molecular cloning, and expression of a human stratum corneum trypsin-like serine protease with possible function in desquamation. J Biol Chem. 1999; 274:30033-40. doi: 10.1074/jbc.274.42.30033
6 : Brattsand M, et al. SPINK9: a selective, skin-specific Kazal-type serine protease inhibitor. J Invest Dermatol. 2009; 129:1656-65. doi: 10.1038/jid.2008.448
7 : Yousef GM, et al. Identification of novel human kallikrein-like genes on chromosome 19q13.3-q13.4. Anticancer Res. 1999; 19:2843-52.
8 : Gerhard DS, et al. The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC). Genome Res. 2004; 14:2121-7. doi: 10.1101/gr.2596504
9 : Gan L, et al. Sequencing and expression analysis of the serine protease gene cluster located in chromosome 19q13 region. Gene. 2000; 257:119-30. doi: 10.1016/s0378-1119(00)00382-6
10 : Premzl M. Comparative genomic analysis of eutherian kallikrein genes. Mol Genet Metab Rep. 2017; 10:96-99. doi: 10.1016/j.ymgmr.2017.01.009
11 : Clark HF, et al. The secreted protein discovery initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: a bioinformatics assessment. Genome Res. 2003; 13:2265-70. doi: 10.1101/gr.1293003
Literature
Paper title : Differential expression of a human kallikrein 5 (KLK5) splice variant in ovarian and prostate cancer.
Doi : https://doi.org/10.1159/000079147
Abstract : The presence of more than one mRNA form is common among kallikrein genes. We identified an mRNA transcript of the human kallikrein gene 5 (KLK5), denoted KLK5 splice variant 1 (KLK5-SV1). This variant has a different 5'-splice site, but encodes the same protein as the classical KLK5 transcript. RT-PCR analysis of this variant transcript expression in 29 human tissues indicated highest expression in the cervix, salivary gland, kidney, mammary gland, and skin. Comparative analysis of the expression levels of KLK5-SV1, another splice variant named KLK5 splice variant 2 (KLK5-SV2), and the classical KLK5 form showed that out of all three mRNA transcripts, the classical form is predominantly expressed (found in more tissues and at higher expression levels) followed by KLK5-SV1. KLK5-SV1 is expressed at high levels in ovarian, pancreatic, breast and prostate cancer cell lines. KLK5-SV1 was also found to be expressed in 9/10 ovarian cancer tissues, but it was not found in one normal ovarian tissue tested. Hormonal regulation experiments suggest that KLK5-SV1 is regulated by steroid hormones in the BT-474 breast cancer cell line. Furthermore, this variant had significantly higher expression in normal prostate tissues compared to their matched cancer tissue counterparts. KLK5-SV1 may have clinical utility in various malignancies and should be further explored as a potential new biomarker for prostate and ovarian cancer.
Paper title : The DNA sequence and biology of human chromosome 19.
Doi : https://doi.org/10.1038/nature02399
Abstract : Chromosome 19 has the highest gene density of all human chromosomes, more than double the genome-wide average. The large clustered gene families, corresponding high G + C content, CpG islands and density of repetitive DNA indicate a chromosome rich in biological and evolutionary significance. Here we describe 55.8 million base pairs of highly accurate finished sequence representing 99.9% of the euchromatin portion of the chromosome. Manual curation of gene loci reveals 1,461 protein-coding genes and 321 pseudogenes. Among these are genes directly implicated in mendelian disorders, including familial hypercholesterolaemia and insulin-resistant diabetes. Nearly one-quarter of these genes belong to tandemly arranged families, encompassing more than 25% of the chromosome. Comparative analyses show a fascinating picture of conservation and divergence, revealing large blocks of gene orthology with rodents, scattered regions with more recent gene family expansions and deletions, and segments of coding and non-coding conservation with the distant fish species Takifugu.
Paper title : Structural basis of the zinc inhibition of human tissue kallikrein 5.
Doi : https://doi.org/10.1016/j.jmb.2007.08.042
Abstract : Human kallikrein 5 (hK5) is a member of the tissue kallikrein family of serine peptidases. It has trypsin-like substrate specificity, is inhibited by metal ions, and is abundantly expressed in human skin, where it is believed to play a central role in desquamation. To further understand the interaction of hK5 with substrates and metal ions, active recombinant hK5 was crystallized in complex with the tripeptidyl aldehyde inhibitor leupeptin, and structures at 2.3 A resolution were obtained with and without Zn2+. While the overall structure and the specificity of S1 pocket for basic side-chains were similar to that of hK4, a closely related family member, both differed in their interaction with Zn2+. Unlike hK4, the 75-loop of hK5 is not structured to bind a Zn2+. Instead, Zn2+ binds adjacent to the active site, becoming coordinated by the imidazole rings of His99 and His96 not present in hK4. This zinc binding is accompanied by a large shift in the backbone conformation of the 99-loop and by large movements of both His side-chains. Modeling studies show that in the absence of bound leupeptin, Zn2+ is likely further coordinated by the imidazolyl side-chain of the catalytic His57 which can, similar to equivalent His57 imidazole groups in the related rat kallikrein proteinase tonin and in an engineered metal-binding rat trypsin, rotate out of its triad position to provide the third co-ordination site of the bound Zn2+, rendering Zn2+-bound hK5 inactive. In solution, this mode of binding likely occurs in the presence of free and substrate saturated hK5, as kinetic analyses of Zn2+ inhibition indicate a non-competitive mechanism. Supporting the His57 re-orientation, Zn2+ does not fully inhibit hK5 hydrolysis of tripeptidyl substrates containing a P2-His residue. The P2 and His57 imidazole groups would lie next to each other in the enzyme-substrate complex, indicating that incomplete inhibition is due to competition between both imidazole groups for Zn2+. The His96-99-57 triad is thus suggested to be responsible for the Zn2+-mediated inhibition of hK5 catalysis.
Paper title : Identification of lympho-epithelial Kazal-type inhibitor 2 in human skin as a kallikrein-related peptidase 5-specific protease inhibitor.
Doi : https://doi.org/10.1371/journal.pone.0004372
Abstract : Kallikreins-related peptidases (KLKs) are serine proteases and have been implicated in the desquamation process of the skin. Their activity is tightly controlled by epidermal protease inhibitors like the lympho-epithelial Kazal-type inhibitor (LEKTI). Defects of the LEKTI-encoding gene serine protease inhibitor Kazal type (Spink)5 lead to the absence of LEKTI and result in the genodermatose Netherton syndrome, which mimics the common skin disease atopic dermatitis. Since many KLKs are expressed in human skin with KLK5 being considered as one of the most important KLKs in skin desquamation, we proposed that more inhibitors are present in human skin. Herein, we purified from human stratum corneum by HPLC techniques a new KLK5-inhibiting peptide encoded by a member of the Spink family, designated as Spink9 located on chromosome 5p33.1. This peptide is highly homologous to LEKTI and was termed LEKTI-2. Recombinant LEKTI-2 inhibited KLK5 but not KLK7, 14 or other serine proteases tested including trypsin, plasmin and thrombin. Spink9 mRNA expression was detected in human skin samples and in cultured keratinocytes. LEKTI-2 immune-expression was focally localized at the stratum granulosum and stratum corneum at palmar and plantar sites in close localization to KLK5. At sites of plantar hyperkeratosis, LEKTI-2 expression was increased. We suggest that LEKTI-2 contributes to the regulation of the desquamation process in human skin by specifically inhibiting KLK5.
Paper title : Purification, molecular cloning, and expression of a human stratum corneum trypsin-like serine protease with possible function in desquamation.
Doi : https://doi.org/10.1074/jbc.274.42.30033
Abstract : A new human 33-kDa serine protease was purified from human epidermis, and its cDNA was cloned from a keratinocyte library, from mRNA from a human keratinocyte line (HaCat) and from mRNA from human skin. Polyclonal antibodies specific for the new protein detected three groups of proteins in partially purified extracts of cornified eptihelium of human plantar skin. The three components are proposed to correspond to proenzyme, active enzyme, and proteolytically modified active enzyme. After N-deglycosylation, there was a decrease in apparent molecular mass of all detected components. Expression of the cloned cDNA in a eukaryotic virus-derived system yielded a recombinant protein that could be converted to an active protease by treatment with trypsin. Polymerase chain reaction analyses of cDNA from a number of human tissues showed high expression of the new enzyme in the skin and low expression in brain, placenta, and kidney. Homology searches yielded the highest score for porcine enamel matrix protease (55% amino acid sequence homology). High scores were also obtained for human and mouse neuropsin and for human stratum corneum chymotryptic enzyme. The function of this new protease, tentatively named stratum corneum tryptic enzyme, may be related to stratum corneum turnover and desquamation in the epidermis.
Paper title : SPINK9: a selective, skin-specific Kazal-type serine protease inhibitor.
Doi : https://doi.org/10.1038/jid.2008.448
Abstract : A previously unreported Kazal-type serine protease inhibitor, serine protease inhibitor Kazal type 9 (SPINK9), was identified in human skin. SPINK9 expression was strong in palmar epidermis, but not detectable or very low in non palmoplantar skin. Analysis of a human cDNA panel showed intermediate expression in thymus, pancreas, liver, and brain, and low or undetectable expression in other tissues. Using kallikrein-related peptidases (KLKs) 5, 7, 8, and 14, thrombin, trypsin, and chymotrypsin, inhibition with recombinant SPINK9 was seen only for KLK5 using low molecular weight substrates, with an apparent K(i) of 65 nM. Also KLK5 degradation of fibrinogen was totally inhibited by SPINK9. Slight inhibition of KLK8 using fibrinogen substrate could be observed using high concentrations of SPINK9. Analyses by surface plasmon resonance showed heterogeneous binding to SPINK9 of KLK5 and KLK8, but no binding of KLK7 or KLK14. KLK5 has been suggested to play a central role in skin desquamation as an initiating activating enzyme in proteolytic cascades formed by KLKs. An apparently KLK5-specific inhibitor, such as SPINK9, may play a significant regulatory role in such cascades. We suggest a possible role for SPINK9 in the site-specific epidermal differentiation of palms and soles.
Paper title : Identification of novel human kallikrein-like genes on chromosome 19q13.3-q13.4.
Doi : https://doi.org/Not available
Abstract : The human kallikrein gene family is localized on chromosome 19q13.3-q13.4 and currently includes three members: KLK1 or pancreatic/renal kallikrein, KLK2 or human glandular kallikrein and KLK3 or prostate-specific antigen (PSA). The latter two genes are almost prostate-specific and they are used for diagnosis and monitoring of prostate cancer and more recently, in breast cancer applications. In this paper, we analyzed a 300Kb genomic DNA region around chromosome 19q13.3-q13.4 in an effort to map known kallikrein or kallikrein-like genes and identify new kallikrein-like genes. Using the known kallikrein or kallikrein-like genes PSA, KLK2, enzyme and normal epithelial cell-specific 1 gene (NES1) as landmarks, we have identified another six novel genes of which, five have protein homologies and gene structure similarities with other kallikreins or kallikrein-like genes. We conclude, contrary to the current belief, that the human kallikrein gene locus contains a large number of kallikrein-like genes (at least thirteen). In this paper, we present a detailed description of the human kallikrein gene locus, encompassing the already known and newly identified genes. These new genes, like the already known kallikreins, may have utility for diagnosis, monitoring and therapeutics of various cancers including those of the breast, prostate and testis.
Paper title : The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).
Doi : https://doi.org/10.1101/gr.2596504
Abstract : The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline.
Paper title : Sequencing and expression analysis of the serine protease gene cluster located in chromosome 19q13 region.
Doi : https://doi.org/10.1016/s0378-1119(00)00382-6
Abstract : The human kallikrein gene cluster, located in the chromosome band 19q13, contains several tissue-specific serine protease genes including the prostate-specific KLK2, KLK3 and prostase genes. To further characterize the gene cluster, we have mapped, sequenced, and analyzed the genomic sequence from the region. The results of EST database searches and GENSCAN gene prediction analysis reveal 13 serine protease genes and several pseudogenes in the region. Expression analysis by RT-PCR indicates that most of these protease genes are expressed only in a subset of the 35 different normal tissues that have been examined. Several protease genes expressed in skin show higher expression levels in psoriatic lesion samples than in non-lesional skin samples from the same patient. This suggests that the imbalance of a complex protease cascade in skin may contribute to the pathology of disease. The proteases, excluding the kallikrein genes, share approximately 40% of their sequences suggesting that the serine protease gene cluster on chromosome 19q13 arose from ancient gene duplications.
Paper title : Comparative genomic analysis of eutherian kallikrein genes.
Doi : https://doi.org/10.1016/j.ymgmr.2017.01.009
Abstract : The present study made attempts to update and revise eutherian kallikrein genes implicated in major physiological and pathological processes and in medical molecular diagnostics. Using eutherian comparative genomic analysis protocol and free available genomic sequence assemblies, the tests of reliability of eutherian public genomic sequences annotated most comprehensive curated third party data gene data set of eutherian kallikrein genes including 121 complete coding sequences among 335 potential coding sequences. The present analysis first described 13 major gene clusters of eutherian kallikrein genes, and explained their differential gene expansion patterns. One updated classification and nomenclature of eutherian kallikrein genes was proposed, as new framework of future experiments.
Paper title : The secreted protein discovery initiative (SPDI), a large-scale effort to identify novel human secreted and transmembrane proteins: a bioinformatics assessment.
Doi : https://doi.org/10.1101/gr.1293003
Abstract : A large-scale effort, termed the Secreted Protein Discovery Initiative (SPDI), was undertaken to identify novel secreted and transmembrane proteins. In the first of several approaches, a biological signal sequence trap in yeast cells was utilized to identify cDNA clones encoding putative secreted proteins. A second strategy utilized various algorithms that recognize features such as the hydrophobic properties of signal sequences to identify putative proteins encoded by expressed sequence tags (ESTs) from human cDNA libraries. A third approach surveyed ESTs for protein sequence similarity to a set of known receptors and their ligands with the BLAST algorithm. Finally, both signal-sequence prediction algorithms and BLAST were used to identify single exons of potential genes from within human genomic sequence. The isolation of full-length cDNA clones for each of these candidate genes resulted in the identification of >1000 novel proteins. A total of 256 of these cDNAs are still novel, including variants and novel genes, per the most recent GenBank release version. The success of this large-scale effort was assessed by a bioinformatics analysis of the proteins through predictions of protein domains, subcellular localizations, and possible functional roles. The SPDI collection should facilitate efforts to better understand intercellular communication, may lead to new understandings of human diseases, and provides potential opportunities for the development of therapeutics.