FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5424, 321 aa
1>>>pF1KE5424 321 - 321 aa - 321 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2413+/-0.000352; mu= 15.8188+/- 0.022
mean_var=65.0103+/-13.451, 0's: 0 Z-trim(113.8): 52 B-trim: 0 in 0/55
Lambda= 0.159068
statistics sampled from 23209 (23261) to 23209 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.645), E-opt: 0.2 (0.273), width: 16
Scan time: 7.050
The best scores are:                                       opt  bits E(85289)
NP_001325 (OMIM: 600550) cathepsin O preproprotein ( 321) 2241 523.1 3.1e-148
XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424)  568 139.2  1.4e-32
NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484)  568 139.2  1.6e-32
XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  535 131.5    2e-30
XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  535 131.5    2e-30
XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317)  535 131.6  2.2e-30
NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335)  535 131.6  2.3e-30
NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331)  502 124.0  4.3e-28
NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334)  499 123.3  6.9e-28
NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334)  499 123.3  6.9e-28
NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333)  484 119.9  7.5e-27
XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333)  484 119.9  7.5e-27
NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333)  484 119.9  7.5e-27
NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333)  484 119.9  7.5e-27
NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333)  484 119.9  7.5e-27
NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329)  466 115.7  1.3e-25
NP_001186668 (OMIM: 116845) cathepsin S isoform 2  ( 281)  407 102.2  1.4e-21
NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201)  355  90.1    4e-18
NP_001326 (OMIM: 602364) cathepsin W preproprotein ( 376)  357  90.8  4.9e-18
XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  324  83.1  7.2e-16
XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  324  83.1  7.2e-16
NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151)  262  68.7  8.4e-12
NP_001304166 (OMIM: 116810) cathepsin B isoform 2  ( 215)  213  57.6  2.7e-08
XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245)  213  57.6  3.1e-08
NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  213  57.7    4e-08
NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  213  57.7    4e-08
XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  213  57.7    4e-08
XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  213  57.7    4e-08
NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  213  57.7    4e-08
XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  213  57.7    4e-08
XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  213  57.7    4e-08
XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  213  57.7    4e-08
XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  213  57.7    4e-08
NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  213  57.7    4e-08
NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  213  57.7    4e-08
XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  213  57.7    4e-08
NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303)  198  54.2    4e-07
XP_011519578 (OMIM: 116820) PREDICTED: pro-catheps ( 169)  188  51.8  1.2e-06
NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463)  178  49.7  1.4e-05
XP_016866234 (OMIM: 606749) PREDICTED: tubulointer ( 438)  161  45.8  0.00019
XP_005271163 (OMIM: 616064) PREDICTED: tubulointer ( 440)  150  43.3   0.0011
>>NP_001325 (OMIM: 600550) cathepsin O preproprotein [Ho (321 aa)
initn: 2241 init1: 2241 opt: 2241 Z-score: 2782.4 bits: 523.1 E(85289): 3.1e-148
Smith-Waterman score: 2241; 100.0% identity (100.0% similar) in 321 aa overlap (1-321:1-321)
10 20 30 40 50 60
pF1KE5 MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSREREAAAFRESLNRHRYLNSLFPS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSREREAAAFRESLNRHRYLNSLFPS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 ENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 ENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 TQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALN
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGP
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 LVVIVDAVSWQDYLGGIIQHHCSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LVVIVDAVSWQDYLGGIIQHHCSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGY
250 260 270 280 290 300
310 320
pF1KE5 AHVKMGSNVCGIADSVSSIFV
:::::::::::::::::::::
NP_001 AHVKMGSNVCGIADSVSSIFV
310 320
>>XP_011543630 (OMIM: 603539,615362) PREDICTED: cathepsi (424 aa)
initn: 510 init1: 232 opt: 568 Z-score: 705.6 bits: 139.2 E(85289): 1.4e-32
Smith-Waterman score: 568; 33.1% identity (64.9% similar) in 299 aa overlap (29-321:130-423)
10 20 30 40 50
pF1KE5 MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSRE-REAAAFRESL--NRHRYLN
:. :. :. : .: : .: :. : .
XP_011 NETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ
100 110 120 130 140 150
60 70 80 90 100 110
pF1KE5 SLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWR
.. . .:: ::...:: : :::..::: . : : . . :. ... : ..:::
XP_011 KIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLA-PPEWDWR
160 170 180 190 200 210
120 130 140 150 160 170
pF1KE5 DKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGST
.: .::.:..: :::.::::::.: ::. . .. : .:: :...::. . .: ::
XP_011 SKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLP
220 230 240 250 260 270
180 190 200 210 220 230
pF1KE5 LNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKAL
:: . .... : ...: .... :. ::. .. :. .. ..:..:...: :
XP_011 SNAYSAIKNLG-GLETEDDYSYQGHMQSCN-FSAEKAKVYIN--DSVELSQNEQKLAAWL
280 290 300 310 320 330
240 250 260 270 280 290
pF1KE5 LTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEANHAVLITGFDKTGSTPYWIVRNSWG
::. : ..: . : : :: .. :: .::::..:. . ...:.: ..::::
XP_011 AKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWG
340 350 360 370 380 390
300 310 320
pF1KE5 SSWGVDGYAHVKMGSNVCGIADSVSSIFV
..:: :: ... ::..::. .:: :
XP_011 TDWGEKGYYYLHRGSGACGVNTMASSAVVD
400 410 420
>>NP_003784 (OMIM: 603539,615362) cathepsin F precursor (484 aa)
initn: 488 init1: 232 opt: 568 Z-score: 704.8 bits: 139.2 E(85289): 1.6e-32
Smith-Waterman score: 568; 33.1% identity (64.9% similar) in 299 aa overlap (29-321:190-483)
10 20 30 40 50
pF1KE5 MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSRE-REAAAFRESL--NRHRYLN
:. :. :. : .: : .: :. : .
NP_003 NETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ
160 170 180 190 200 210
60 70 80 90 100 110
pF1KE5 SLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWR
.. . .:: ::...:: : :::..::: . : : . . :. ... : ..:::
NP_003 KIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLA-PPEWDWR
220 230 240 250 260 270
120 130 140 150 160 170
pF1KE5 DKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGST
.: .::.:..: :::.::::::.: ::. . .. : .:: :...::. . .: ::
NP_003 SKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLP
280 290 300 310 320 330
180 190 200 210 220 230
pF1KE5 LNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKAL
:: . .... : ...: .... :. ::. .. :. .. ..:..:...: :
NP_003 SNAYSAIKNLG-GLETEDDYSYQGHMQSCN-FSAEKAKVYIN--DSVELSQNEQKLAAWL
340 350 360 370 380 390
240 250 260 270 280 290
pF1KE5 LTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEANHAVLITGFDKTGSTPYWIVRNSWG
::. : ..: . : : :: .. :: .::::..:. . ...:.: ..::::
NP_003 AKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWG
400 410 420 430 440 450
300 310 320
pF1KE5 SSWGVDGYAHVKMGSNVCGIADSVSSIFV
..:: :: ... ::..::. .:: :
NP_003 TDWGEKGYYYLHRGSGACGVNTMASSAVVD
460 470 480
>>XP_016877441 (OMIM: 116820) PREDICTED: pro-cathepsin H (297 aa)
initn: 472 init1: 206 opt: 535 Z-score: 667.0 bits: 131.5 E(85289): 2e-30
Smith-Waterman score: 535; 34.6% identity (64.3% similar) in 266 aa overlap (62-317:34-291)
40 50 60 70 80 90
pF1KE5 TWPRSREREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK
: : ...:::: . :.: :: :.:..
XP_016 HRKTYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQN
10 20 30 40 50 60
100 110 120 130 140
pF1KE5 FP-RYSAEVHMSIPNVSLPLRFDWRDK-QVVTQVRNQQMCGGCWAFSVVGAVESAYAIKG
: .. . : : ::: : . :. :.:: ::.::.::..::.::: ::
XP_016 CSATKSNYLRGTGP---YPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIAT
70 80 90 100 110 120
150 160 170 180 190 200
pF1KE5 KPLEDLSVQQVIDCS--YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYF
. .:. ::..::. .::.::.:: .:.... . . .:. ::.....: :..
XP_016 GKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDT-YPYQGKDGYCKFQ
130 140 150 160 170
210 220 230 240 250 260
pF1KE5 SGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYL---GGIIQH---H
:. :: .: . . : :. :..:. ..:. ... ::.. :: . :
XP_016 PGKAIGF-VKDVANITIYD-EEAMVEAVALYNPVSFAFEVT--QDFMMYRTGIYSSTSCH
180 190 200 210 220 230
270 280 290 300 310 320
pF1KE5 CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV
. ..::::: .:. . .. :::::.:::: .::..:: .. :.:.::.: .:
XP_016 KTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP
240 250 260 270 280 290
XP_016 LV
>>XP_005254238 (OMIM: 116820) PREDICTED: pro-cathepsin H (297 aa)
initn: 472 init1: 206 opt: 535 Z-score: 667.0 bits: 131.5 E(85289): 2e-30
Smith-Waterman score: 535; 34.6% identity (64.3% similar) in 266 aa overlap (62-317:34-291)
40 50 60 70 80 90
pF1KE5 TWPRSREREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK
: : ...:::: . :.: :: :.:..
XP_005 HRKTYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQN
10 20 30 40 50 60
100 110 120 130 140
pF1KE5 FP-RYSAEVHMSIPNVSLPLRFDWRDK-QVVTQVRNQQMCGGCWAFSVVGAVESAYAIKG
: .. . : : ::: : . :. :.:: ::.::.::..::.::: ::
XP_005 CSATKSNYLRGTGP---YPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIAT
70 80 90 100 110 120
150 160 170 180 190 200
pF1KE5 KPLEDLSVQQVIDCS--YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYF
. .:. ::..::. .::.::.:: .:.... . . .:. ::.....: :..
XP_005 GKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDT-YPYQGKDGYCKFQ
130 140 150 160 170
210 220 230 240 250 260
pF1KE5 SGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYL---GGIIQH---H
:. :: .: . . : :. :..:. ..:. ... ::.. :: . :
XP_005 PGKAIGF-VKDVANITIYD-EEAMVEAVALYNPVSFAFEVT--QDFMMYRTGIYSSTSCH
180 190 200 210 220 230
270 280 290 300 310 320
pF1KE5 CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV
. ..::::: .:. . .. :::::.:::: .::..:: .. :.:.::.: .:
XP_005 KTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP
240 250 260 270 280 290
XP_005 LV
>>XP_016877440 (OMIM: 116820) PREDICTED: pro-cathepsin H (317 aa)
initn: 472 init1: 206 opt: 535 Z-score: 666.6 bits: 131.6 E(85289): 2.2e-30
Smith-Waterman score: 535; 34.6% identity (64.3% similar) in 266 aa overlap (62-317:54-311)
40 50 60 70 80 90
pF1KE5 TWPRSREREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSK
: : ...:::: . :.: :: :.:..
XP_016 HRKTYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQN
30 40 50 60 70 80
100 110 120 130 140
pF1KE5 FP-RYSAEVHMSIPNVSLPLRFDWRDK-QVVTQVRNQQMCGGCWAFSVVGAVESAYAIKG
: .. . : : ::: : . :. :.:: ::.::.::..::.::: ::
XP_016 CSATKSNYLRGTGP---YPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIAT
90 100 110 120 130 140
150 160 170 180 190 200
pF1KE5 KPLEDLSVQQVIDCS--YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYF
. .:. ::..::. .::.::.:: .:.... . . .:. ::.....: :..
XP_016 GKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDT-YPYQGKDGYCKFQ
150 160 170 180 190
210 220 230 240 250 260
pF1KE5 SGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYL---GGIIQH---H
:. :: .: . . : :. :..:. ..:. ... ::.. :: . :
XP_016 PGKAIGF-VKDVANITIYD-EEAMVEAVALYNPVSFAFEVT--QDFMMYRTGIYSSTSCH
200 210 220 230 240 250
270 280 290 300 310 320
pF1KE5 CSSGEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV
. ..::::: .:. . .. :::::.:::: .::..:: .. :.:.::.: .:
XP_016 KTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIP
260 270 280 290 300 310
XP_016 LV
>>NP_004381 (OMIM: 116820) pro-cathepsin H isoform a pre (335 aa)
initn: 418 init1: 206 opt: 535 Z-score: 666.2 bits: 131.6 E(85289): 2.3e-30
Smith-Waterman score: 538; 32.0% identity (61.1% similar) in 334 aa overlap (6-317:5-329)
10 20 30 40 50
pF1KE5 MDVRALPWL---PWLLWLLCRGGGD--ADSRAPFT-PTWPRSREREAAAFRESLNRHRYL
:: : ::: . :... ..: : .: :..:.. . .: .: . .
NP_004 MWATLPLLCAGAWLLGVPVCGAAELCVNSLEKFHFKSW-MSKHRKTYSTEEYHHRLQTF
10 20 30 40 50
60 70 80 90 100
pF1KE5 NSLF------PSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFP-RYSAEVHMSIPNVS
: . . : : ...:::: . :.: :: :.:.. : .. . :
NP_004 ASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGP---
60 70 80 90 100 110
110 120 130 140 150 160
pF1KE5 LPLRFDWRDK-QVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCS--
: ::: : . :. :.:: ::.::.::..::.::: :: . .:. ::..::.
NP_004 YPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQD
120 130 140 150 160 170
170 180 190 200 210 220
pF1KE5 YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDF
.::.::.:: .:.... . . .:. ::.....: :.. :. :: .: . .
NP_004 FNNHGCQGGLPSQAFEYILYNKGIMGEDT-YPYQGKDGYCKFQPGKAIGF-VKDVANITI
180 190 200 210 220 230
230 240 250 260 270
pF1KE5 SDQEDEMAKALLTFGPLVVIVDAVSWQDYL---GGIIQH---HCSSGEANHAVLITGFDK
: :. :..:. ..:. ... ::.. :: . : . ..::::: .:. .
NP_004 YD-EEAMVEAVALYNPVSFAFEVT--QDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGE
240 250 260 270 280 290
280 290 300 310 320
pF1KE5 TGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV
.. :::::.:::: .::..:: .. :.:.::.: .:
NP_004 KNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLV
300 310 320 330
>>NP_004070 (OMIM: 116845) cathepsin S isoform 1 preprop (331 aa)
initn: 398 init1: 241 opt: 502 Z-score: 625.4 bits: 124.0 E(85289): 4.3e-28
Smith-Waterman score: 502; 35.2% identity (64.4% similar) in 253 aa overlap (68-313:76-323)
40 50 60 70 80 90
pF1KE5 EREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSK-PSKFPRYS
:.:... . :: ... . ::.. :
NP_004 AVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNI
50 60 70 80 90 100
100 110 120 130 140 150
pF1KE5 AEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLS
. . : :: :: :::.: ::.:. : ::.:::::.:::.:. .: : .::
NP_004 T--YKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLS
110 120 130 140 150 160
160 170 180 190 200 210
pF1KE5 VQQVIDCS---YNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSG
.:...::: :.: ::::: .:.... . . .:. ::.::.. :.: : ...
NP_004 AQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK-GIDSDASYPYKAMDQKCQYDSKYRAA
170 180 190 200 210 220
220 230 240 250 260 270
pF1KE5 FSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAV--SWQDYLGGIIQHHCSSGEANHAV
: :. .. .:: . .:. . ::. : ::: :. : .:. . . ..::.:
NP_004 TCSK-YTELPYG-REDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGV
230 240 250 260 270 280
280 290 300 310 320
pF1KE5 LITGFDKTGSTPYWIVRNSWGSSWGVDGYAHV-KMGSNVCGIADSVSSIFV
:..:. .. ::.:.:::: ..: .:: .. . .: ::::
NP_004 LVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
290 300 310 320 330
>>NP_001188504 (OMIM: 603308) cathepsin L2 preproprotein (334 aa)
initn: 444 init1: 203 opt: 499 Z-score: 621.6 bits: 123.3 E(85289): 6.9e-28
Smith-Waterman score: 499; 35.3% identity (63.2% similar) in 269 aa overlap (60-317:67-330)
30 40 50 60 70 80
pF1KE5 TPTWPRSREREAAAFRESLNRHRYLNSLFPSENSTAF-YGINQFSYLFPEEFKAIYLRSK
:... .: ...: :. . :::. .. .
NP_001 RRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR
40 50 60 70 80 90
90 100 110 120 130 140
pF1KE5 PSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIK
.:: . ..: ..:: ::: : :: :.::..::.:::::..::.:. . :
NP_001 NQKFRK--GKVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRK
100 110 120 130 140 150
150 160 170 180 190 200
pF1KE5 GKPLEDLSVQQVIDCSY--NNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY
: .:: :...::: .: ::::: :.... : . : .. ::. : . .:.:
NP_001 TGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYV-KENGGLDSEESYPYVAVDEICKY
160 170 180 190 200 210
210 220 230 240 250 260
pF1KE5 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDA--VSWQDYLGGI-IQHHCS
.: . :... .: . ::. : ::. : .:: :.: : .:: .. ::
NP_001 -RPENSVANDTGFTVVA-PGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCS
220 230 240 250 260 270
270 280 290 300 310
pF1KE5 SGEANHAVLITGFDKTGS----TPYWIVRNSWGSSWGVDGYAHV-KMGSNVCGIADSVSS
: . .:.::..:. :. . ::.:.:::: :: .::... : .: :::: ..:
NP_001 SKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY
280 290 300 310 320 330
320
pF1KE5 IFV
NP_001 PNV
>>NP_001324 (OMIM: 603308) cathepsin L2 preproprotein [H (334 aa)
initn: 444 init1: 203 opt: 499 Z-score: 621.6 bits: 123.3 E(85289): 6.9e-28
Smith-Waterman score: 499; 35.3% identity (63.2% similar) in 269 aa overlap (60-317:67-330)
30 40 50 60 70 80
pF1KE5 TPTWPRSREREAAAFRESLNRHRYLNSLFPSENSTAF-YGINQFSYLFPEEFKAIYLRSK
:... .: ...: :. . :::. .. .
NP_001 RRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR
40 50 60 70 80 90
90 100 110 120 130 140
pF1KE5 PSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIK
.:: . ..: ..:: ::: : :: :.::..::.:::::..::.:. . :
NP_001 NQKFRK--GKVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRK
100 110 120 130 140 150
150 160 170 180 190 200
pF1KE5 GKPLEDLSVQQVIDCSY--NNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY
: .:: :...::: .: ::::: :.... : . : .. ::. : . .:.:
NP_001 TGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYV-KENGGLDSEESYPYVAVDEICKY
160 170 180 190 200 210
210 220 230 240 250 260
pF1KE5 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDA--VSWQDYLGGI-IQHHCS
.: . :... .: . ::. : ::. : .:: :.: : .:: .. ::
NP_001 -RPENSVANDTGFTVVA-PGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCS
220 230 240 250 260 270
270 280 290 300 310
pF1KE5 SGEANHAVLITGFDKTGS----TPYWIVRNSWGSSWGVDGYAHV-KMGSNVCGIADSVSS
: . .:.::..:. :. . ::.:.:::: :: .::... : .: :::: ..:
NP_001 SKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASY
280 290 300 310 320 330
320
pF1KE5 IFV
NP_001 PNV
321 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 00:37:42 2016 done: Tue Nov 8 00:37:43 2016
Total Scan time: 7.050 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]