FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6695, 335 aa 1>>>pF1KE6695 335 - 335 aa - 335 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.9547+/-0.000376; mu= 18.1972+/- 0.023 mean_var=68.9603+/-13.737, 0's: 0 Z-trim(112.9): 58 B-trim: 62 in 1/52 Lambda= 0.154445 statistics sampled from 21919 (21977) to 21919 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.632), E-opt: 0.2 (0.258), width: 16 Scan time: 4.490 The best scores are: opt bits E(85289) NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335) 2349 532.5 4.8e-151 XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 2081 472.7 4.2e-133 XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 2081 472.7 4.2e-133 XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317) 2068 469.9 3.3e-132 NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201) 1207 277.9 1.3e-74 XP_011519578 (OMIM: 116820) PREDICTED: pro-catheps ( 169) 1143 263.5 2.2e-70 NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329) 804 188.2 2e-47 NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 799 187.1 4.5e-47 NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 799 187.1 4.5e-47 NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 799 187.1 4.5e-47 XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333) 799 187.1 4.5e-47 NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 799 187.1 4.5e-47 NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334) 792 185.6 1.3e-46 NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334) 792 185.6 1.3e-46 NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331) 725 170.6 4.1e-42 XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424) 669 158.2 2.8e-38 NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484) 669 158.3 3.1e-38 NP_001186668 (OMIM: 116845) cathepsin S isoform 2 ( 281) 584 139.2 1e-32 XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 575 137.1 4e-32 XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 575 137.1 4e-32 NP_001325 (OMIM: 600550) cathepsin O preproprotein ( 321) 535 128.3 2.2e-29 NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463) 383 94.5 4.6e-19 NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151) 375 92.4 6.7e-19 NP_001304166 (OMIM: 116810) cathepsin B isoform 2 ( 215) 277 70.7 3.3e-12 XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245) 277 70.7 3.6e-12 NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 277 70.8 4.7e-12 NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 277 70.8 4.7e-12 NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 277 70.8 4.7e-12 XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 277 70.8 4.7e-12 XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 277 70.8 4.7e-12 NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 277 70.8 4.7e-12 XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 277 70.8 4.7e-12 XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 277 70.8 4.7e-12 XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 277 70.8 4.7e-12 XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 277 70.8 4.7e-12 XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 277 70.8 4.7e-12 NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 277 70.8 4.7e-12 NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303) 185 50.3 6.3e-06 NP_001326 (OMIM: 602364) cathepsin W preproprotein ( 376) 174 47.9 4.1e-05 XP_005271163 (OMIM: 616064) PREDICTED: tubulointer ( 440) 167 46.4 0.00014 NP_001191344 (OMIM: 616064) tubulointerstitial nep ( 362) 151 42.8 0.0014 XP_005271164 (OMIM: 616064) PREDICTED: tubulointer ( 362) 151 42.8 0.0014 XP_011540248 (OMIM: 616064) PREDICTED: tubulointer ( 408) 151 42.8 0.0015 NP_001191343 (OMIM: 616064) tubulointerstitial nep ( 436) 151 42.8 0.0016 NP_071447 (OMIM: 616064) tubulointerstitial nephri ( 467) 151 42.8 0.0017 XP_016866237 (OMIM: 606749) PREDICTED: tubulointer ( 309) 143 40.9 0.0042 XP_016866236 (OMIM: 606749) PREDICTED: tubulointer ( 351) 143 41.0 0.0047 XP_016866235 (OMIM: 606749) PREDICTED: tubulointer ( 401) 143 41.0 0.0052 XP_011512799 (OMIM: 606749) PREDICTED: tubulointer ( 426) 143 41.0 0.0054 XP_006715125 (OMIM: 606749) PREDICTED: tubulointer ( 458) 143 41.1 0.0057 >>NP_004381 (OMIM: 116820) pro-cathepsin H isoform a pre (335 aa) initn: 2349 init1: 2349 opt: 2349 Z-score: 2832.6 bits: 532.5 E(85289): 4.8e-151 Smith-Waterman score: 2349; 99.7% identity (99.7% similar) in 335 aa overlap (1-335:1-335) 10 20 30 40 50 60 pF1KE6 MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFAS ::::::::::::::::::::::::: :::::::::::::::::::::::::::::::::: NP_004 MWATLPLLCAGAWLLGVPVCGAAELCVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFAS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 NWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 NWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 DWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 DWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 CQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 CQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 EAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_004 EAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVK 250 260 270 280 290 300 310 320 330 pF1KE6 NSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLV ::::::::::::::::::::::::::::::::::: NP_004 NSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLV 310 320 330 >>XP_016877441 (OMIM: 116820) PREDICTED: pro-cathepsin H (297 aa) initn: 2081 init1: 2081 opt: 2081 Z-score: 2510.6 bits: 472.7 E(85289): 4.2e-133 Smith-Waterman score: 2081; 100.0% identity (100.0% similar) in 297 aa overlap (39-335:1-297) 10 20 30 40 50 60 pF1KE6 CAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAH :::::::::::::::::::::::::::::: XP_016 MSKHRKTYSTEEYHHRLQTFASNWRKINAH 10 20 30 70 80 90 100 110 120 pF1KE6 NNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 NNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNF 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE6 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE6 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG 220 230 240 250 260 270 310 320 330 pF1KE6 MNGYFLIERGKNMCGLAACASYPIPLV ::::::::::::::::::::::::::: XP_016 MNGYFLIERGKNMCGLAACASYPIPLV 280 290 >>XP_005254238 (OMIM: 116820) PREDICTED: pro-cathepsin H (297 aa) initn: 2081 init1: 2081 opt: 2081 Z-score: 2510.6 bits: 472.7 E(85289): 4.2e-133 Smith-Waterman score: 2081; 100.0% identity (100.0% similar) in 297 aa overlap (39-335:1-297) 10 20 30 40 50 60 pF1KE6 CAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAH :::::::::::::::::::::::::::::: XP_005 MSKHRKTYSTEEYHHRLQTFASNWRKINAH 10 20 30 70 80 90 100 110 120 pF1KE6 NNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 NNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNF 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE6 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE6 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG 220 230 240 250 260 270 310 320 330 pF1KE6 MNGYFLIERGKNMCGLAACASYPIPLV ::::::::::::::::::::::::::: XP_005 MNGYFLIERGKNMCGLAACASYPIPLV 280 290 >>XP_016877440 (OMIM: 116820) PREDICTED: pro-cathepsin H (317 aa) initn: 2068 init1: 2068 opt: 2068 Z-score: 2494.6 bits: 469.9 E(85289): 3.3e-132 Smith-Waterman score: 2068; 99.3% identity (99.7% similar) in 297 aa overlap (39-335:21-317) 10 20 30 40 50 60 pF1KE6 CAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAH : .::::::::::::::::::::::::::: XP_016 MVIFAVNLTQHQQWRPGVPHMWQHRKTYSTEEYHHRLQTFASNWRKINAH 10 20 30 40 50 70 80 90 100 110 120 pF1KE6 NNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 NNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNF 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE6 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 AFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNP 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE6 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 VSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWG 240 250 260 270 280 290 310 320 330 pF1KE6 MNGYFLIERGKNMCGLAACASYPIPLV ::::::::::::::::::::::::::: XP_016 MNGYFLIERGKNMCGLAACASYPIPLV 300 310 >>NP_001306066 (OMIM: 116820) pro-cathepsin H isoform c (201 aa) initn: 1207 init1: 1207 opt: 1207 Z-score: 1460.5 bits: 277.9 E(85289): 1.3e-74 Smith-Waterman score: 1207; 100.0% identity (100.0% similar) in 171 aa overlap (165-335:31-201) 140 150 160 170 180 190 pF1KE6 QGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIL :::::::::::::::::::::::::::::: NP_001 MWLSQGGLAPHAKALTASQLAGHWVVSFFQAEQQLVDCAQDFNNHGCQGGLPSQAFEYIL 10 20 30 40 50 60 200 210 220 230 240 250 pF1KE6 YNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFE 70 80 90 100 110 120 260 270 280 290 300 310 pF1KE6 VTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFL 130 140 150 160 170 180 320 330 pF1KE6 IERGKNMCGLAACASYPIPLV ::::::::::::::::::::: NP_001 IERGKNMCGLAACASYPIPLV 190 200 >>XP_011519578 (OMIM: 116820) PREDICTED: pro-cathepsin H (169 aa) initn: 1159 init1: 1143 opt: 1143 Z-score: 1384.5 bits: 263.5 E(85289): 2.2e-70 Smith-Waterman score: 1143; 98.2% identity (98.8% similar) in 166 aa overlap (1-166:1-166) 10 20 30 40 50 60 pF1KE6 MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFAS ::::::::::::::::::::::::: :::::::::::::::::::::::::::::::::: XP_011 MWATLPLLCAGAWLLGVPVCGAAELCVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFAS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 NWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 NWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPYPPSV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 DWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHG :::::::::::::::::::::::::::::::::::::::::::: . XP_011 DWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLPQLSS 130 140 150 160 190 200 210 220 230 240 pF1KE6 CQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMV >>NP_000387 (OMIM: 265800,601105) cathepsin K preproprot (329 aa) initn: 422 init1: 225 opt: 804 Z-score: 972.3 bits: 188.2 E(85289): 2e-47 Smith-Waterman score: 804; 41.9% identity (68.6% similar) in 334 aa overlap (11-331:3-327) 10 20 30 40 50 pF1KE6 MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYST--EEYHHRLQTF : .: .:: . : : . . :.. : . ::: :.. .: .:: . NP_000 MWGLKVLLLPVVSFA-LYPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IW 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 ASNWRKINAHNN----GNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSN---YL- .: . :. :: : ::...:.:...::. :. .: . . : ..:: :. NP_000 EKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQK-MTGLKVPLSHSRSNDTLYIP 60 70 80 90 100 120 130 140 150 160 170 pF1KE6 RGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLV . : : :::.:::: .:.:::::: :::::.::..::::. . :::.:.:. :.:: NP_000 EWEGRAPDSVDYRKKG-YVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLV 110 120 130 140 150 160 180 190 200 210 220 pF1KE6 DCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQP-GKAIGFVKDVA ::... : :: :: ..::.:. :.:: .::.::: :.. : ..: ::: . NP_000 DCVSE--NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAK-CRGYR 170 180 190 200 210 220 230 240 250 260 270 280 pF1KE6 NITIYDEEAMVEAVALYNPVSFAFEVT-QDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGY .: .:.:. .::: .::: :.... .:..: :.: . ::.. :..::::::::: NP_000 EIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNS--DNLNHAVLAVGY 230 240 250 260 270 280 290 300 310 320 330 pF1KE6 GEKNGIPYWIVKNSWGPQWGMNGYFLIERGKN-MCGLAACASYPIPLV : ..: .::.::::: .:: .::.:. :.:: ::.: ::.: NP_000 GIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 290 300 310 320 >>NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 576 init1: 379 opt: 799 Z-score: 966.2 bits: 187.1 E(85289): 4.5e-47 Smith-Waterman score: 799; 38.3% identity (69.0% similar) in 339 aa overlap (6-331:3-331) 10 20 30 40 50 60 pF1KE6 MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFAS : : .:. ::. ..: :. . . .. .: . : . :. .: : .. . NP_001 MNPTLILAAFCLGI---ASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEK 10 20 30 40 50 70 80 90 100 110 pF1KE6 NWRKINAHNN----GNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY : . :. ::. :.:.: ::.: :.::. :... . . :: . :.. .. : NP_001 NMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQ--VMNGFQNRKPRKGKVFQEPLFY 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 --PPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQ : :::::.:: .:.:::::: :::::.::.:::::. . ::...::.::.::::. NP_001 EAPRSVDWREKG-YVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSG 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 DFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITI- .:.::.::: . ::.:. : :. .:..:::.. . ::..: ... .:.. . : NP_001 PQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVA--NDTGFVDIP 180 190 200 210 220 240 250 260 270 280 pF1KE6 YDEEAMVEAVALYNPVSFAFEVTQD-FMMYRTGIYSSTSCHKTPDKVNHAVLAVGYG--- .:.:...::: .:.: :... .. :..:. ::: .: . . ..:.::.:::: NP_001 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFES 230 240 250 260 270 280 290 300 310 320 330 pF1KE6 -EKNGIPYWIVKNSWGPQWGMNGYFLIERGK-NMCGLAACASYPIPLV :... ::.:::::: .:::.:: . . . : ::.:. :::: NP_001 TESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 290 300 310 320 330 >>NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 576 init1: 379 opt: 799 Z-score: 966.2 bits: 187.1 E(85289): 4.5e-47 Smith-Waterman score: 799; 38.3% identity (69.0% similar) in 339 aa overlap (6-331:3-331) 10 20 30 40 50 60 pF1KE6 MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFAS : : .:. ::. ..: :. . . .. .: . : . :. .: : .. . NP_666 MNPTLILAAFCLGI---ASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEK 10 20 30 40 50 70 80 90 100 110 pF1KE6 NWRKINAHNN----GNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY : . :. ::. :.:.: ::.: :.::. :... . . :: . :.. .. : NP_666 NMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQ--VMNGFQNRKPRKGKVFQEPLFY 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 --PPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQ : :::::.:: .:.:::::: :::::.::.:::::. . ::...::.::.::::. NP_666 EAPRSVDWREKG-YVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSG 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 DFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITI- .:.::.::: . ::.:. : :. .:..:::.. . ::..: ... .:.. . : NP_666 PQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVA--NDTGFVDIP 180 190 200 210 220 240 250 260 270 280 pF1KE6 YDEEAMVEAVALYNPVSFAFEVTQD-FMMYRTGIYSSTSCHKTPDKVNHAVLAVGYG--- .:.:...::: .:.: :... .. :..:. ::: .: . . ..:.::.:::: NP_666 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFES 230 240 250 260 270 280 290 300 310 320 330 pF1KE6 -EKNGIPYWIVKNSWGPQWGMNGYFLIERGK-NMCGLAACASYPIPLV :... ::.:::::: .:::.:: . . . : ::.:. :::: NP_666 TESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 290 300 310 320 330 >>NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa) initn: 576 init1: 379 opt: 799 Z-score: 966.2 bits: 187.1 E(85289): 4.5e-47 Smith-Waterman score: 799; 38.3% identity (69.0% similar) in 339 aa overlap (6-331:3-331) 10 20 30 40 50 60 pF1KE6 MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFAS : : .:. ::. ..: :. . . .. .: . : . :. .: : .. . NP_001 MNPTLILAAFCLGI---ASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEK 10 20 30 40 50 70 80 90 100 110 pF1KE6 NWRKINAHNN----GNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY : . :. ::. :.:.: ::.: :.::. :... . . :: . :.. .. : NP_001 NMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQ--VMNGFQNRKPRKGKVFQEPLFY 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 --PPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQ : :::::.:: .:.:::::: :::::.::.:::::. . ::...::.::.::::. NP_001 EAPRSVDWREKG-YVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSG 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 DFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITI- .:.::.::: . ::.:. : :. .:..:::.. . ::..: ... .:.. . : NP_001 PQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVA--NDTGFVDIP 180 190 200 210 220 240 250 260 270 280 pF1KE6 YDEEAMVEAVALYNPVSFAFEVTQD-FMMYRTGIYSSTSCHKTPDKVNHAVLAVGYG--- .:.:...::: .:.: :... .. :..:. ::: .: . . ..:.::.:::: NP_001 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFES 230 240 250 260 270 280 290 300 310 320 330 pF1KE6 -EKNGIPYWIVKNSWGPQWGMNGYFLIERGK-NMCGLAACASYPIPLV :... ::.:::::: .:::.:: . . . : ::.:. :::: NP_001 TESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 290 300 310 320 330 335 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 15:29:11 2016 done: Tue Nov 8 15:29:12 2016 Total Scan time: 4.490 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]