FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5424, 321 aa 1>>>pF1KE5424 321 - 321 aa - 321 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2413+/-0.000352; mu= 15.8188+/- 0.022 mean_var=65.0103+/-13.451, 0's: 0 Z-trim(113.8): 52 B-trim: 0 in 0/55 Lambda= 0.159068 statistics sampled from 23209 (23261) to 23209 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.645), E-opt: 0.2 (0.273), width: 16 Scan time: 7.050 The best scores are: opt bits E(85289) NP_001325 (OMIM: 600550) cathepsin O preproprotein ( 321) 2241 523.1 3.1e-148 XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424) 568 139.2 1.4e-32 NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484) 568 139.2 1.6e-32 XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 535 131.5 2e-30 XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 535 131.5 2e-30 XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317) 535 131.6 2.2e-30 NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335) 535 131.6 2.3e-30 NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331) 502 124.0 4.3e-28 NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334) 499 123.3 6.9e-28 NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334) 499 123.3 6.9e-28 NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 484 119.9 7.5e-27 XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333) 484 119.9 7.5e-27 NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 484 119.9 7.5e-27 NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 484 119.9 7.5e-27 NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 484 119.9 7.5e-27 NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329) 466 115.7 1.3e-25 NP_001186668 (OMIM: 116845) cathepsin S isoform 2 ( 281) 407 102.2 1.4e-21 NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201) 355 90.1 4e-18 NP_001326 (OMIM: 602364) cathepsin W preproprotein ( 376) 357 90.8 4.9e-18 XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 324 83.1 7.2e-16 XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 324 83.1 7.2e-16 NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151) 262 68.7 8.4e-12 NP_001304166 (OMIM: 116810) cathepsin B isoform 2 ( 215) 213 57.6 2.7e-08 XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245) 213 57.6 3.1e-08 NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 213 57.7 4e-08 NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 213 57.7 4e-08 XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 213 57.7 4e-08 XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 213 57.7 4e-08 NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 213 57.7 4e-08 XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 213 57.7 4e-08 XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 213 57.7 4e-08 XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 213 57.7 4e-08 XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 213 57.7 4e-08 NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 213 57.7 4e-08 NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 213 57.7 4e-08 XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 213 57.7 4e-08 NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303) 198 54.2 4e-07 XP_011519578 (OMIM: 116820) PREDICTED: pro-catheps ( 169) 188 51.8 1.2e-06 NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463) 178 49.7 1.4e-05 XP_016866234 (OMIM: 606749) PREDICTED: tubulointer ( 438) 161 45.8 0.00019 XP_005271163 (OMIM: 616064) PREDICTED: tubulointer ( 440) 150 43.3 0.0011 >>NP_001325 (OMIM: 600550) cathepsin O preproprotein [Ho (321 aa) initn: 2241 init1: 2241 opt: 2241 Z-score: 2782.4 bits: 523.1 E(85289): 3.1e-148 Smith-Waterman score: 2241; 100.0% identity (100.0% similar) in 321 aa overlap (1-321:1-321) 10 20 30 40 50 60 pF1KE5 MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSREREAAAFRESLNRHRYLNSLFPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSREREAAAFRESLNRHRYLNSLFPS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 ENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 TQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALN 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 LVVIVDAVSWQDYLGGIIQHHCSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LVVIVDAVSWQDYLGGIIQHHCSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGY 250 260 270 280 290 300 310 320 pF1KE5 AHVKMGSNVCGIADSVSSIFV ::::::::::::::::::::: NP_001 AHVKMGSNVCGIADSVSSIFV 310 320 >>XP_011543630 (OMIM: 603539,615362) PREDICTED: cathepsi (424 aa) initn: 510 init1: 232 opt: 568 Z-score: 705.6 bits: 139.2 E(85289): 1.4e-32 Smith-Waterman score: 568; 33.1% identity (64.9% similar) in 299 aa overlap (29-321:130-423) 10 20 30 40 50 pF1KE5 MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSRE-REAAAFRESL--NRHRYLN :. :. :. : .: : .: :. : . XP_011 NETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ 100 110 120 130 140 150 60 70 80 90 100 110 pF1KE5 SLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWR .. . .:: ::...:: : :::..::: . : : . . :. ... : ..::: XP_011 KIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLA-PPEWDWR 160 170 180 190 200 210 120 130 140 150 160 170 pF1KE5 DKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGST .: .::.:..: :::.::::::.: ::. . .. : .:: :...::. . .: :: XP_011 SKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLP 220 230 240 250 260 270 180 190 200 210 220 230 pF1KE5 LNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKAL :: . .... : ...: .... :. ::. .. :. .. ..:..:...: : XP_011 SNAYSAIKNLG-GLETEDDYSYQGHMQSCN-FSAEKAKVYIN--DSVELSQNEQKLAAWL 280 290 300 310 320 330 240 250 260 270 280 290 pF1KE5 LTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEANHAVLITGFDKTGSTPYWIVRNSWG ::. : ..: . : : :: .. :: .::::..:. . ...:.: ..:::: XP_011 AKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWG 340 350 360 370 380 390 300 310 320 pF1KE5 SSWGVDGYAHVKMGSNVCGIADSVSSIFV ..:: :: ... ::..::. .:: : XP_011 TDWGEKGYYYLHRGSGACGVNTMASSAVVD 400 410 420 >>NP_003784 (OMIM: 603539,615362) cathepsin F precursor (484 aa) initn: 488 init1: 232 opt: 568 Z-score: 704.8 bits: 139.2 E(85289): 1.6e-32 Smith-Waterman score: 568; 33.1% identity (64.9% similar) in 299 aa overlap (29-321:190-483) 10 20 30 40 50 pF1KE5 MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSRE-REAAAFRESL--NRHRYLN :. :. :. : .: : .: :. : . NP_003 NETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ 160 170 180 190 200 210 60 70 80 90 100 110 pF1KE5 SLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWR .. . .:: ::...:: : :::..::: . : : . . :. ... : ..::: NP_003 KIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLA-PPEWDWR 220 230 240 250 260 270 120 130 140 150 160 170 pF1KE5 DKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGST .: .::.:..: :::.::::::.: ::. . .. : .:: :...::. . .: :: NP_003 SKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLP 280 290 300 310 320 330 180 190 200 210 220 230 pF1KE5 LNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKAL :: . .... : ...: .... :. ::. .. :. .. ..:..:...: : NP_003 SNAYSAIKNLG-GLETEDDYSYQGHMQSCN-FSAEKAKVYIN--DSVELSQNEQKLAAWL 340 350 360 370 380 390 240 250 260 270 280 290 pF1KE5 LTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEANHAVLITGFDKTGSTPYWIVRNSWG ::. : ..: . : : :: .. :: .::::..:. . ...:.: ..:::: NP_003 AKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWG 400 410 420 430 440 450 300 310 320 pF1KE5 SSWGVDGYAHVKMGSNVCGIADSVSSIFV ..:: :: ... ::..::. .:: : NP_003 TDWGEKGYYYLHRGSGACGVNTMASSAVVD 460 470 480 >>XP_016877441 (OMIM: 116820) PREDICTED: pro-cathepsin H (297 aa) initn: 472 init1: 206 opt: 535 Z-score: 667.0 bits: 131.5 E(85289): 2e-30 Smith-Waterman score: 535; 34.6% identity (64.3% similar) in 266 aa overlap (62-317:34-291) 40 50 60 70 80 90 pF1KE5 TWPRSREREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK : : ...:::: . :.: :: :.:.. XP_016 HRKTYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQN 10 20 30 40 50 60 100 110 120 130 140 pF1KE5 FP-RYSAEVHMSIPNVSLPLRFDWRDK-QVVTQVRNQQMCGGCWAFSVVGAVESAYAIKG : .. . : : ::: : . :. :.:: ::.::.::..::.::: :: XP_016 CSATKSNYLRGTGP---YPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIAT 70 80 90 100 110 120 150 160 170 180 190 200 pF1KE5 KPLEDLSVQQVIDCS--YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYF . .:. ::..::. .::.::.:: .:.... . . .:. ::.....: :.. XP_016 GKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDT-YPYQGKDGYCKFQ 130 140 150 160 170 210 220 230 240 250 260 pF1KE5 SGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYL---GGIIQH---H :. :: .: . . : :. :..:. ..:. ... ::.. :: . : XP_016 PGKAIGF-VKDVANITIYD-EEAMVEAVALYNPVSFAFEVT--QDFMMYRTGIYSSTSCH 180 190 200 210 220 230 270 280 290 300 310 320 pF1KE5 CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV . ..::::: .:. . .. :::::.:::: .::..:: .. :.:.::.: .: XP_016 KTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP 240 250 260 270 280 290 XP_016 LV >>XP_005254238 (OMIM: 116820) PREDICTED: pro-cathepsin H (297 aa) initn: 472 init1: 206 opt: 535 Z-score: 667.0 bits: 131.5 E(85289): 2e-30 Smith-Waterman score: 535; 34.6% identity (64.3% similar) in 266 aa overlap (62-317:34-291) 40 50 60 70 80 90 pF1KE5 TWPRSREREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK : : ...:::: . :.: :: :.:.. XP_005 HRKTYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQN 10 20 30 40 50 60 100 110 120 130 140 pF1KE5 FP-RYSAEVHMSIPNVSLPLRFDWRDK-QVVTQVRNQQMCGGCWAFSVVGAVESAYAIKG : .. . : : ::: : . :. :.:: ::.::.::..::.::: :: XP_005 CSATKSNYLRGTGP---YPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIAT 70 80 90 100 110 120 150 160 170 180 190 200 pF1KE5 KPLEDLSVQQVIDCS--YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYF . .:. ::..::. .::.::.:: .:.... . . .:. ::.....: :.. XP_005 GKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDT-YPYQGKDGYCKFQ 130 140 150 160 170 210 220 230 240 250 260 pF1KE5 SGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYL---GGIIQH---H :. :: .: . . : :. :..:. ..:. ... ::.. :: . : XP_005 PGKAIGF-VKDVANITIYD-EEAMVEAVALYNPVSFAFEVT--QDFMMYRTGIYSSTSCH 180 190 200 210 220 230 270 280 290 300 310 320 pF1KE5 CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV . ..::::: .:. . .. :::::.:::: .::..:: .. :.:.::.: .: XP_005 KTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP 240 250 260 270 280 290 XP_005 LV >>XP_016877440 (OMIM: 116820) PREDICTED: pro-cathepsin H (317 aa) initn: 472 init1: 206 opt: 535 Z-score: 666.6 bits: 131.6 E(85289): 2.2e-30 Smith-Waterman score: 535; 34.6% identity (64.3% similar) in 266 aa overlap (62-317:54-311) 40 50 60 70 80 90 pF1KE5 TWPRSREREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK : : ...:::: . :.: :: :.:.. XP_016 HRKTYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQN 30 40 50 60 70 80 100 110 120 130 140 pF1KE5 FP-RYSAEVHMSIPNVSLPLRFDWRDK-QVVTQVRNQQMCGGCWAFSVVGAVESAYAIKG : .. . : : ::: : . :. :.:: ::.::.::..::.::: :: XP_016 CSATKSNYLRGTGP---YPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIAT 90 100 110 120 130 140 150 160 170 180 190 200 pF1KE5 KPLEDLSVQQVIDCS--YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYF . .:. ::..::. .::.::.:: .:.... . . .:. ::.....: :.. XP_016 GKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDT-YPYQGKDGYCKFQ 150 160 170 180 190 210 220 230 240 250 260 pF1KE5 SGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYL---GGIIQH---H :. :: .: . . : :. :..:. ..:. ... ::.. :: . : XP_016 PGKAIGF-VKDVANITIYD-EEAMVEAVALYNPVSFAFEVT--QDFMMYRTGIYSSTSCH 200 210 220 230 240 250 270 280 290 300 310 320 pF1KE5 CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV . ..::::: .:. . .. :::::.:::: .::..:: .. :.:.::.: .: XP_016 KTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP 260 270 280 290 300 310 XP_016 LV >>NP_004381 (OMIM: 116820) pro-cathepsin H isoform a pre (335 aa) initn: 418 init1: 206 opt: 535 Z-score: 666.2 bits: 131.6 E(85289): 2.3e-30 Smith-Waterman score: 538; 32.0% identity (61.1% similar) in 334 aa overlap (6-317:5-329) 10 20 30 40 50 pF1KE5 MDVRALPWL---PWLLWLLCRGGGD--ADSRAPFT-PTWPRSREREAAAFRESLNRHRYL :: : ::: . :... ..: : .: :..:.. . .: .: . . NP_004 MWATLPLLCAGAWLLGVPVCGAAELCVNSLEKFHFKSW-MSKHRKTYSTEEYHHRLQTF 10 20 30 40 50 60 70 80 90 100 pF1KE5 NSLF------PSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFP-RYSAEVHMSIPNVS : . . : : ...:::: . :.: :: :.:.. : .. . : NP_004 ASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGP--- 60 70 80 90 100 110 110 120 130 140 150 160 pF1KE5 LPLRFDWRDK-QVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCS-- : ::: : . :. :.:: ::.::.::..::.::: :: . .:. ::..::. NP_004 YPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQD 120 130 140 150 160 170 170 180 190 200 210 220 pF1KE5 YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDF .::.::.:: .:.... . . .:. ::.....: :.. :. :: .: . . NP_004 FNNHGCQGGLPSQAFEYILYNKGIMGEDT-YPYQGKDGYCKFQPGKAIGF-VKDVANITI 180 190 200 210 220 230 230 240 250 260 270 pF1KE5 SDQEDEMAKALLTFGPLVVIVDAVSWQDYL---GGIIQH---HCSSGEANHAVLITGFDK : :. :..:. ..:. ... ::.. :: . : . ..::::: .:. . NP_004 YD-EEAMVEAVALYNPVSFAFEVT--QDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGE 240 250 260 270 280 290 280 290 300 310 320 pF1KE5 TGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV .. :::::.:::: .::..:: .. :.:.::.: .: NP_004 KNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLV 300 310 320 330 >>NP_004070 (OMIM: 116845) cathepsin S isoform 1 preprop (331 aa) initn: 398 init1: 241 opt: 502 Z-score: 625.4 bits: 124.0 E(85289): 4.3e-28 Smith-Waterman score: 502; 35.2% identity (64.4% similar) in 253 aa overlap (68-313:76-323) 40 50 60 70 80 90 pF1KE5 EREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSK-PSKFPRYS :.:... . :: ... . ::.. : NP_004 AVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNI 50 60 70 80 90 100 100 110 120 130 140 150 pF1KE5 AEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLS . . : :: :: :::.: ::.:. : ::.:::::.:::.:. .: : .:: NP_004 T--YKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLS 110 120 130 140 150 160 160 170 180 190 200 210 pF1KE5 VQQVIDCS---YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSG .:...::: :.: ::::: .:.... . . .:. ::.::.. :.: : ... NP_004 AQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK-GIDSDASYPYKAMDQKCQYDSKYRAA 170 180 190 200 210 220 220 230 240 250 260 270 pF1KE5 FSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAV--SWQDYLGGIIQHHCSSGEANHAV : :. .. .:: . .:. . ::. : ::: :. : .:. . . ..::.: NP_004 TCSK-YTELPYG-REDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGV 230 240 250 260 270 280 280 290 300 310 320 pF1KE5 LITGFDKTGSTPYWIVRNSWGSSWGVDGYAHV-KMGSNVCGIADSVSSIFV :..:. .. ::.:.:::: ..: .:: .. . .: :::: NP_004 LVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI 290 300 310 320 330 >>NP_001188504 (OMIM: 603308) cathepsin L2 preproprotein (334 aa) initn: 444 init1: 203 opt: 499 Z-score: 621.6 bits: 123.3 E(85289): 6.9e-28 Smith-Waterman score: 499; 35.3% identity (63.2% similar) in 269 aa overlap (60-317:67-330) 30 40 50 60 70 80 pF1KE5 TPTWPRSREREAAAFRESLNRHRYLNSLFPSENSTAF-YGINQFSYLFPEEFKAIYLRSK :... .: ...: :. . :::. .. . NP_001 RRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR 40 50 60 70 80 90 90 100 110 120 130 140 pF1KE5 PSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIK .:: . ..: ..:: ::: : :: :.::..::.:::::..::.:. . : NP_001 NQKFRK--GKVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRK 100 110 120 130 140 150 150 160 170 180 190 200 pF1KE5 GKPLEDLSVQQVIDCSY--NNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY : .:: :...::: .: ::::: :.... : . : .. ::. : . .:.: NP_001 TGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYV-KENGGLDSEESYPYVAVDEICKY 160 170 180 190 200 210 210 220 230 240 250 260 pF1KE5 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDA--VSWQDYLGGI-IQHHCS .: . :... .: . ::. : ::. : .:: :.: : .:: .. :: NP_001 -RPENSVANDTGFTVVA-PGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCS 220 230 240 250 260 270 270 280 290 300 310 pF1KE5 SGEANHAVLITGFDKTGS----TPYWIVRNSWGSSWGVDGYAHV-KMGSNVCGIADSVSS : . .:.::..:. :. . ::.:.:::: :: .::... : .: :::: ..: NP_001 SKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 280 290 300 310 320 330 320 pF1KE5 IFV NP_001 PNV >>NP_001324 (OMIM: 603308) cathepsin L2 preproprotein [H (334 aa) initn: 444 init1: 203 opt: 499 Z-score: 621.6 bits: 123.3 E(85289): 6.9e-28 Smith-Waterman score: 499; 35.3% identity (63.2% similar) in 269 aa overlap (60-317:67-330) 30 40 50 60 70 80 pF1KE5 TPTWPRSREREAAAFRESLNRHRYLNSLFPSENSTAF-YGINQFSYLFPEEFKAIYLRSK :... .: ...: :. . :::. .. . NP_001 RRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR 40 50 60 70 80 90 90 100 110 120 130 140 pF1KE5 PSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIK .:: . ..: ..:: ::: : :: :.::..::.:::::..::.:. . : NP_001 NQKFRK--GKVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRK 100 110 120 130 140 150 150 160 170 180 190 200 pF1KE5 GKPLEDLSVQQVIDCSY--NNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY : .:: :...::: .: ::::: :.... : . : .. ::. : . .:.: NP_001 TGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYV-KENGGLDSEESYPYVAVDEICKY 160 170 180 190 200 210 210 220 230 240 250 260 pF1KE5 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDA--VSWQDYLGGI-IQHHCS .: . :... .: . ::. : ::. : .:: :.: : .:: .. :: NP_001 -RPENSVANDTGFTVVA-PGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCS 220 230 240 250 260 270 270 280 290 300 310 pF1KE5 SGEANHAVLITGFDKTGS----TPYWIVRNSWGSSWGVDGYAHV-KMGSNVCGIADSVSS : . .:.::..:. :. . ::.:.:::: :: .::... : .: :::: ..: NP_001 SKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY 280 290 300 310 320 330 320 pF1KE5 IFV NP_001 PNV 321 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 00:37:42 2016 done: Tue Nov 8 00:37:43 2016 Total Scan time: 7.050 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]