FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5601, 484 aa
1>>>pF1KE5601 484 - 484 aa - 484 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1617+/-0.000323; mu= 19.6769+/- 0.020
mean_var=77.4924+/-15.558, 0's: 0 Z-trim(116.4): 68 B-trim: 131 in 1/57
Lambda= 0.145695
statistics sampled from 27517 (27585) to 27517 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.689), E-opt: 0.2 (0.323), width: 16
Scan time: 7.570
The best scores are: opt bits E(85289)
NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484) 3292 701.4 1.4e-201
XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424) 2816 601.3 1.7e-171
NP_001326 (OMIM: 602364) cathepsin W preproprotein ( 376) 697 155.9 1.9e-37
NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335) 669 149.9 1e-35
NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 664 148.9 2.1e-35
XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333) 664 148.9 2.1e-35
NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 664 148.9 2.1e-35
NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 664 148.9 2.1e-35
NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 664 148.9 2.1e-35
XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 656 147.2 6.1e-35
XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 656 147.2 6.1e-35
XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317) 656 147.2 6.4e-35
NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334) 641 144.1 5.9e-34
NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334) 641 144.1 5.9e-34
NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329) 587 132.7 1.5e-30
NP_001325 (OMIM: 600550) cathepsin O preproprotein ( 321) 568 128.7 2.4e-29
NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331) 564 127.9 4.4e-29
XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 489 112.0 2.1e-24
XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 489 112.0 2.1e-24
NP_001186668 (OMIM: 116845) cathepsin S isoform 2 ( 281) 453 104.5 4.1e-22
NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201) 408 94.9 2.2e-19
NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151) 312 74.6 2.1e-13
XP_011519578 (OMIM: 116820) PREDICTED: pro-catheps ( 169) 266 65.0 1.9e-10
NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463) 257 63.5 1.5e-09
NP_001304166 (OMIM: 116810) cathepsin B isoform 2 ( 215) 228 57.1 5.8e-08
XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245) 228 57.1 6.4e-08
XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 228 57.3 8.2e-08
XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 228 57.3 8.2e-08
NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 228 57.3 8.2e-08
XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 228 57.3 8.2e-08
XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 228 57.3 8.2e-08
NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 228 57.3 8.2e-08
XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 228 57.3 8.2e-08
NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 228 57.3 8.2e-08
NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 228 57.3 8.2e-08
XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 228 57.3 8.2e-08
XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 228 57.3 8.2e-08
NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 228 57.3 8.2e-08
NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303) 160 42.9 0.0015
>>NP_003784 (OMIM: 603539,615362) cathepsin F precursor (484 aa)
initn: 3292 init1: 3292 opt: 3292 Z-score: 3739.8 bits: 701.4 E(85289): 1.4e-201
Smith-Waterman score: 3292; 100.0% identity (100.0% similar) in 484 aa overlap (1-484:1-484)
10 20 30 40 50 60
pF1KE5 MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTRFALEMFNRGRAAGTRA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTRFALEMFNRGRAAGTRA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE5 VLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKKTLLCSFQVLDELGRHVLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 VLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKKTLLCSFQVLDELGRHVLL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE5 RKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRNETFSSVISLLNEDPLSQDLP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 RKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRNETFSSVISLLNEDPLSQDLP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE5 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE5 EEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 EEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSV
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE5 TGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 TGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE5 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE5 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASS
430 440 450 460 470 480
pF1KE5 AVVD
::::
NP_003 AVVD
>>XP_011543630 (OMIM: 603539,615362) PREDICTED: cathepsi (424 aa)
initn: 2815 init1: 2815 opt: 2816 Z-score: 3199.9 bits: 601.3 E(85289): 1.7e-171
Smith-Waterman score: 2816; 98.1% identity (98.6% similar) in 424 aa overlap (62-484:1-424)
40 50 60 70 80 90
pF1KE5 WGPPSPELLAPTRFALEMFNRGRAAGTRAVLGLVRGRVRR-AGQGSLYSLEATLEEPPCN
.: .: : :::::::::::::::::::
XP_011 MGPARWTNRSLAGQGSLYSLEATLEEPPCN
10 20 30
100 110 120 130 140 150
pF1KE5 DPMVCRLPVSKKTLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 DPMVCRLPVSKKTLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISS
40 50 60 70 80 90
160 170 180 190 200 210
pF1KE5 LSQNHPDNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LSQNHPDNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSV
100 110 120 130 140 150
220 230 240 250 260 270
pF1KE5 FVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 FVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDL
160 170 180 190 200 210
280 290 300 310 320 330
pF1KE5 APPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 APPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMD
220 230 240 250 260 270
340 350 360 370 380 390
pF1KE5 KACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 KACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKL
280 290 300 310 320 330
400 410 420 430 440 450
pF1KE5 AAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 AAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIK
340 350 360 370 380 390
460 470 480
pF1KE5 NSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD
::::::::::::::::::::::::::::::::::
XP_011 NSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD
400 410 420
>>NP_001326 (OMIM: 602364) cathepsin W preproprotein [Ho (376 aa)
initn: 793 init1: 250 opt: 697 Z-score: 793.5 bits: 155.9 E(85289): 1.9e-37
Smith-Waterman score: 823; 38.6% identity (66.4% similar) in 342 aa overlap (174-483:25-363)
150 160 170 180 190
pF1KE5 GSAMISSLSQNHPDNRNETFSSVISLLNEDPL-SQDL---PVKMASIFKNFVITYNRTYE
:: .::: :... :: : : .::.:
NP_001 MALTAHPSCLLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYL
10 20 30 40 50
200 210 220 230 240 250
pF1KE5 SKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGN
: :: ::..:..:...::..: : :::..::: ::::::::: .: :. :.
NP_001 SPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYG---YRRAAGG
60 70 80 90 100 110
260 270 280 290 300 310
pF1KE5 KMKQAKSVGDLAPPEW-----DWRS-KGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQG
.... . . : : :::. ::.. .::: :. :::....::.: : ..
NP_001 VPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAGNIETLWRISFW
120 130 140 150 160 170
320 330 340 350 360 370
pF1KE5 TLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS--CNFSAE
....: :::::: . .: ::. .:. .. : .:: .: :: .::.... :. .
NP_001 DFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKY
180 190 200 210 220 230
380 390 400 410 420 430
pF1KE5 KAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDH
. ..:.: . :..::...: .:: :::.:.:: .:.::.:. . :.: :.::
NP_001 QKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDH
240 250 260 270 280 290
440 450 460 470
pF1KE5 AVLLVGYGN-RSD-------------------VPFWAIKNSWGTDWGEKGYYYLHRGSGA
.:::::.:. .:. .:.: .:::::..::::::. :::::..
NP_001 SVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEKGYFRLHRGSNT
300 310 320 330 340 350
480
pF1KE5 CGVNTMASSAVVD
::.. . .: :
NP_001 CGITKFPLTARVQKPDMKPRVSCPP
360 370
>>NP_004381 (OMIM: 116820) pro-cathepsin H isoform a pre (335 aa)
initn: 543 init1: 237 opt: 669 Z-score: 762.3 bits: 149.9 E(85289): 1e-35
Smith-Waterman score: 669; 36.6% identity (67.3% similar) in 303 aa overlap (187-482:35-332)
160 170 180 190 200 210
pF1KE5 DNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMV
::... . .:: : :: . ::..:..:
NP_004 LPLLCAGAWLLGVPVCGAAELCVNSLEKFHFKSWMSKHRKTY-STEEYHHRLQTFASNW-
10 20 30 40 50 60
220 230 240 250 260 270
pF1KE5 RAQKIQALDRG--TAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPE
.::.: . : : ......:::.. :.. :: . .. ..: . ...: ::
NP_004 --RKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY-PPS
70 80 90 100 110
280 290 300 310 320 330
pF1KE5 WDWRSKGA-VTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDC--DKMDK
:::.:: :. ::.:: :::::.::.:: .:. . : .:::.::.:.:: : ..
NP_004 VDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNH
120 130 140 150 160 170
340 350 360 370 380 390
pF1KE5 ACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELS-QNEQKL
.:.:::::.:. : :. :: : :::. :.:. :: ...: .... .:. .
NP_004 GCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAM
180 190 200 210 220 230
400 410 420 430 440
pF1KE5 AAWLAKRGPISVAINAF-GMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAI
. .: .:.: :... ...:: :: .: ..:::: ::::... .:.: .
NP_004 VEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIV
240 250 260 270 280 290
450 460 470 480
pF1KE5 KNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD
::::: .:: .::. ..::.. ::. . :: .
NP_004 KNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLV
300 310 320 330
>>NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa)
initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)
170 180 190 200 210 220
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
.:: : .::. :: .:. .:: . . :
NP_001 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
10 20 30 40 50 60
230 240 250 260 270 280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
.: .. ... :.:.: :::: .. . :: .:. : . :: ::: :
NP_001 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
70 80 90 100 110 120
290 300 310 320 330
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
: :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . ...: :::
NP_001 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
130 140 150 160 170 180
340 350 360 370 380 390
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
. :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .:
NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
190 200 210 220 230 240
400 410 420 430 440
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
:::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: .
NP_001 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
250 260 270 280 290
450 460 470 480
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
::::: .:: :: . . . ::. . :: .:
NP_001 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L1 is (333 aa)
initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)
170 180 190 200 210 220
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
.:: : .::. :: .:. .:: . . :
XP_005 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
10 20 30 40 50 60
230 240 250 260 270 280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
.: .. ... :.:.: :::: .. . :: .:. : . :: ::: :
XP_005 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
70 80 90 100 110 120
290 300 310 320 330
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
: :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . ...: :::
XP_005 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
130 140 150 160 170 180
340 350 360 370 380 390
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
. :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .:
XP_005 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
190 200 210 220 230 240
400 410 420 430 440
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
:::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: .
XP_005 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
250 260 270 280 290
450 460 470 480
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
::::: .:: :: . . . ::. . :: .:
XP_005 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa)
initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)
170 180 190 200 210 220
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
.:: : .::. :: .:. .:: . . :
NP_001 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
10 20 30 40 50 60
230 240 250 260 270 280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
.: .. ... :.:.: :::: .. . :: .:. : . :: ::: :
NP_001 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
70 80 90 100 110 120
290 300 310 320 330
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
: :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . ...: :::
NP_001 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
130 140 150 160 170 180
340 350 360 370 380 390
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
. :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .:
NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
190 200 210 220 230 240
400 410 420 430 440
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
:::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: .
NP_001 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
250 260 270 280 290
450 460 470 480
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
::::: .:: :: . . . ::. . :: .:
NP_001 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa)
initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)
170 180 190 200 210 220
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
.:: : .::. :: .:. .:: . . :
NP_666 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
10 20 30 40 50 60
230 240 250 260 270 280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
.: .. ... :.:.: :::: .. . :: .:. : . :: ::: :
NP_666 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
70 80 90 100 110 120
290 300 310 320 330
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
: :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . ...: :::
NP_666 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
130 140 150 160 170 180
340 350 360 370 380 390
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
. :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .:
NP_666 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
190 200 210 220 230 240
400 410 420 430 440
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
:::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: .
NP_666 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
250 260 270 280 290
450 460 470 480
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
::::: .:: :: . . . ::. . :: .:
NP_666 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa)
initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)
170 180 190 200 210 220
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
.:: : .::. :: .:. .:: . . :
NP_001 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
10 20 30 40 50 60
230 240 250 260 270 280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
.: .. ... :.:.: :::: .. . :: .:. : . :: ::: :
NP_001 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
70 80 90 100 110 120
290 300 310 320 330
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
: :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . ...: :::
NP_001 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
130 140 150 160 170 180
340 350 360 370 380 390
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
. :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .:
NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
190 200 210 220 230 240
400 410 420 430 440
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
:::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: .
NP_001 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
250 260 270 280 290
450 460 470 480
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
::::: .:: :: . . . ::. . :: .:
NP_001 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>XP_016877441 (OMIM: 116820) PREDICTED: pro-cathepsin H (297 aa)
initn: 543 init1: 237 opt: 656 Z-score: 748.3 bits: 147.2 E(85289): 6.1e-35
Smith-Waterman score: 656; 36.8% identity (67.2% similar) in 296 aa overlap (194-482:4-294)
170 180 190 200 210 220
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQA
. .:: : :: . ::..:..: .::.:
XP_016 MSKHRKTY-STEEYHHRLQTFASNW---RKINA
10 20
230 240 250 260 270 280
pF1KE5 LDRG--TAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKG
. : : ......:::.. :.. :: . .. ..: . ...: :: :::.::
XP_016 HNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY-PPSVDWRKKG
30 40 50 60 70 80
290 300 310 320 330
pF1KE5 A-VTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDC--DKMDKACMGGLP
:. ::.:: :::::.::.:: .:. . : .:::.::.:.:: : ...:.::::
XP_016 NFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLP
90 100 110 120 130 140
340 350 360 370 380 390
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELS-QNEQKLAAWLAKR
:.:. : :. :: : :::. :.:. :: ...: .... .:. .. .:
XP_016 SQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALY
150 160 170 180 190 200
400 410 420 430 440 450
pF1KE5 GPISVAINAF-GMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTD
.:.: :... ...:: :: .: ..:::: ::::... .:.: .::::: .
XP_016 NPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQ
210 220 230 240 250 260
460 470 480
pF1KE5 WGEKGYYYLHRGSGACGVNTMASSAVVD
:: .::. ..::.. ::. . :: .
XP_016 WGMNGYFLIERGKNMCGLAACASYPIPLV
270 280 290
484 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 05:01:47 2016 done: Tue Nov 8 05:01:49 2016
Total Scan time: 7.570 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]