FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5601, 484 aa
  1>>>pF1KE5601 484 - 484 aa - 484 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1617+/-0.000323; mu= 19.6769+/- 0.020
 mean_var=77.4924+/-15.558, 0's: 0 Z-trim(116.4): 68 B-trim: 131 in 1/57
 Lambda= 0.145695
 statistics sampled from 27517 (27585) to 27517 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.689), E-opt: 0.2 (0.323), width: 16
 Scan time: 7.570

The best scores are:                                       opt  bits E(85289)
NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484) 3292 701.4 1.4e-201
XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424) 2816 601.3 1.7e-171
NP_001326 (OMIM: 602364) cathepsin W preproprotein ( 376)  697 155.9 1.9e-37
NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335)  669 149.9 1e-35
NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333)  664 148.9 2.1e-35
XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333)  664 148.9 2.1e-35
NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333)  664 148.9 2.1e-35
NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333)  664 148.9 2.1e-35
NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333)  664 148.9 2.1e-35
XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  656 147.2 6.1e-35
XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  656 147.2 6.1e-35
XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317)  656 147.2 6.4e-35
NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334)  641 144.1 5.9e-34
NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334)  641 144.1 5.9e-34
NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329)  587 132.7 1.5e-30
NP_001325 (OMIM: 600550) cathepsin O preproprotein ( 321)  568 128.7 2.4e-29
NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331)  564 127.9 4.4e-29
XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  489 112.0 2.1e-24
XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  489 112.0 2.1e-24
NP_001186668 (OMIM: 116845) cathepsin S isoform 2  ( 281)  453 104.5 4.1e-22
NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201)  408  94.9 2.2e-19
NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151)  312  74.6 2.1e-13
XP_011519578 (OMIM: 116820) PREDICTED: pro-catheps ( 169)  266  65.0 1.9e-10
NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463)  257  63.5 1.5e-09
NP_001304166 (OMIM: 116810) cathepsin B isoform 2  ( 215)  228  57.1 5.8e-08
XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245)  228  57.1 6.4e-08
XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  57.3 8.2e-08
XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  57.3 8.2e-08
NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  57.3 8.2e-08
XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  57.3 8.2e-08
XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  57.3 8.2e-08
NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  57.3 8.2e-08
XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  57.3 8.2e-08
NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  57.3 8.2e-08
NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  57.3 8.2e-08
XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  57.3 8.2e-08
XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  57.3 8.2e-08
NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  57.3 8.2e-08
NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303)  160  42.9 0.0015

>>NP_003784 (OMIM: 603539,615362) cathepsin F precursor  (484 aa)
 initn: 3292 init1: 3292 opt: 3292  Z-score: 3739.8  bits: 701.4 E(85289): 1.4e-201
Smith-Waterman score: 3292; 100.0% identity (100.0% similar) in 484 aa overlap (1-484:1-484)

               10        20        30        40        50        60
pF1KE5 MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTRFALEMFNRGRAAGTRA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTRFALEMFNRGRAAGTRA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 VLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKKTLLCSFQVLDELGRHVLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 VLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKKTLLCSFQVLDELGRHVLL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 RKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRNETFSSVISLLNEDPLSQDLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 RKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRNETFSSVISLLNEDPLSQDLP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 EEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 EEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSV
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE5 TGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 TGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE5 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE5 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASS
              430       440       450       460       470       480

pF1KE5 AVVD
       ::::
NP_003 AVVD

>>XP_011543630 (OMIM: 603539,615362) PREDICTED: cathepsi  (424 aa)
 initn: 2815 init1: 2815 opt: 2816  Z-score: 
3199.9  bits: 601.3 E(85289): 1.7e-171
Smith-Waterman score: 2816; 98.1% identity (98.6% similar) in 424 aa overlap (62-484:1-424)

              40        50        60        70         80        90
pF1KE5 WGPPSPELLAPTRFALEMFNRGRAAGTRAVLGLVRGRVRR-AGQGSLYSLEATLEEPPCN
                                     .: .:   :  :::::::::::::::::::
XP_011                               MGPARWTNRSLAGQGSLYSLEATLEEPPCN
                                             10        20        30

              100       110       120       130       140       150
pF1KE5 DPMVCRLPVSKKTLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 DPMVCRLPVSKKTLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISS
               40        50        60        70        80        90

              160       170       180       190       200       210
pF1KE5 LSQNHPDNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LSQNHPDNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSV
              100       110       120       130       140       150

              220       230       240       250       260       270
pF1KE5 FVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 FVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDL
              160       170       180       190       200       210

              280       290       300       310       320       330
pF1KE5 APPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 APPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMD
              220       230       240       250       260       270

              340       350       360       370       380       390
pF1KE5 KACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 KACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKL
              280       290       300       310       320       330

              400       410       420       430       440       450
pF1KE5 AAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 AAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIK
              340       350       360       370       380       390

              460       470       480
pF1KE5 NSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD
       ::::::::::::::::::::::::::::::::::
XP_011 NSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD
              400       410       420

>>NP_001326 (OMIM: 602364) cathepsin W preproprotein [Ho  (376 aa)
 initn: 793 init1: 250 opt: 697 
Z-score: 793.5 bits: 155.9 E(85289): 1.9e-37 Smith-Waterman score: 823; 38.6% identity (66.4% similar) in 342 aa overlap (174-483:25-363) 150 160 170 180 190 pF1KE5 GSAMISSLSQNHPDNRNETFSSVISLLNEDPL-SQDL---PVKMASIFKNFVITYNRTYE :: .::: :... :: : : .::.: NP_001 MALTAHPSCLLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYL 10 20 30 40 50 200 210 220 230 240 250 pF1KE5 SKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGN : :: ::..:..:...::..: : :::..::: ::::::::: .: :. :. NP_001 SPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYG---YRRAAGG 60 70 80 90 100 110 260 270 280 290 300 310 pF1KE5 KMKQAKSVGDLAPPEW-----DWRS-KGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQG .... . . : : :::. ::.. .::: :. :::....::.: : .. NP_001 VPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAGNIETLWRISFW 120 130 140 150 160 170 320 330 340 350 360 370 pF1KE5 TLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS--CNFSAE ....: :::::: . .: ::. .:. .. : .:: .: :: .::.... :. . NP_001 DFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKY 180 190 200 210 220 230 380 390 400 410 420 430 pF1KE5 KAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDH . ..:.: . :..::...: .:: :::.:.:: .:.::.:. . :.: :.:: NP_001 QKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDH 240 250 260 270 280 290 440 450 460 470 pF1KE5 AVLLVGYGN-RSD-------------------VPFWAIKNSWGTDWGEKGYYYLHRGSGA .:::::.:. .:. .:.: .:::::..::::::. :::::.. NP_001 SVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEKGYFRLHRGSNT 300 310 320 330 340 350 480 pF1KE5 CGVNTMASSAVVD ::.. . .: : NP_001 CGITKFPLTARVQKPDMKPRVSCPP 360 370 >>NP_004381 (OMIM: 116820) pro-cathepsin H isoform a pre (335 aa) initn: 543 init1: 237 opt: 669 Z-score: 762.3 bits: 149.9 E(85289): 1e-35 Smith-Waterman score: 669; 36.6% identity (67.3% similar) in 303 aa overlap (187-482:35-332) 160 170 180 190 200 210 pF1KE5 DNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMV ::... . .:: : :: . 
::..:..: NP_004 LPLLCAGAWLLGVPVCGAAELCVNSLEKFHFKSWMSKHRKTY-STEEYHHRLQTFASNW- 10 20 30 40 50 60 220 230 240 250 260 270 pF1KE5 RAQKIQALDRG--TAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPE .::.: . : : ......:::.. :.. :: . .. ..: . ...: :: NP_004 --RKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY-PPS 70 80 90 100 110 280 290 300 310 320 330 pF1KE5 WDWRSKGA-VTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDC--DKMDK :::.:: :. ::.:: :::::.::.:: .:. . : .:::.::.:.:: : .. NP_004 VDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNH 120 130 140 150 160 170 340 350 360 370 380 390 pF1KE5 ACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELS-QNEQKL .:.:::::.:. : :. :: : :::. :.:. :: ...: .... .:. . NP_004 GCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAM 180 190 200 210 220 230 400 410 420 430 440 pF1KE5 AAWLAKRGPISVAINAF-GMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAI . .: .:.: :... ...:: :: .: ..:::: ::::... .:.: . NP_004 VEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIV 240 250 260 270 280 290 450 460 470 480 pF1KE5 KNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD ::::: .:: .::. ..::.. ::. . :: . NP_004 KNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLV 300 310 320 330 >>NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35 Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333) 170 180 190 200 210 220 pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ .:: : .::. :: .:. .:: . . : NP_001 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ 10 20 30 40 50 60 230 240 250 260 270 280 pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK .: .. ... :.:.: :::: .. . :: .:. : . :: ::: : NP_001 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK 70 80 90 100 110 120 290 300 310 320 330 pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . 
...: ::: NP_001 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM 130 140 150 160 170 180 340 350 360 370 380 390 pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK . :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .: NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT 190 200 210 220 230 240 400 410 420 430 440 pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI :::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: . NP_001 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV 250 260 270 280 290 450 460 470 480 pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD ::::: .:: :: . . . ::. . :: .: NP_001 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L1 is (333 aa) initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35 Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333) 170 180 190 200 210 220 pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ .:: : .::. :: .:. .:: . . : XP_005 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ 10 20 30 40 50 60 230 240 250 260 270 280 pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK .: .. ... :.:.: :::: .. . :: .:. : . :: ::: : XP_005 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK 70 80 90 100 110 120 290 300 310 320 330 pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . ...: ::: XP_005 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM 130 140 150 160 170 180 340 350 360 370 380 390 pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK . :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .: XP_005 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT 190 200 210 220 230 240 400 410 420 430 440 pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI :::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: . 
XP_005 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV 250 260 270 280 290 450 460 470 480 pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD ::::: .:: :: . . . ::. . :: .: XP_005 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa) initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35 Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333) 170 180 190 200 210 220 pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ .:: : .::. :: .:. .:: . . : NP_001 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ 10 20 30 40 50 60 230 240 250 260 270 280 pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK .: .. ... :.:.: :::: .. . :: .:. : . :: ::: : NP_001 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK 70 80 90 100 110 120 290 300 310 320 330 pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . ...: ::: NP_001 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM 130 140 150 160 170 180 340 350 360 370 380 390 pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK . :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .: NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT 190 200 210 220 230 240 400 410 420 430 440 pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI :::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: . NP_001 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV 250 260 270 280 290 450 460 470 480 pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD ::::: .:: :: . . . ::. . 
:: .: NP_001 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35 Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333) 170 180 190 200 210 220 pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ .:: : .::. :: .:. .:: . . : NP_666 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ 10 20 30 40 50 60 230 240 250 260 270 280 pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK .: .. ... :.:.: :::: .. . :: .:. : . :: ::: : NP_666 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK 70 80 90 100 110 120 290 300 310 320 330 pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . ...: ::: NP_666 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM 130 140 150 160 170 180 340 350 360 370 380 390 pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK . :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .: NP_666 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT 190 200 210 220 230 240 400 410 420 430 440 pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI :::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: . NP_666 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV 250 260 270 280 290 450 460 470 480 pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD ::::: .:: :: . . . ::. . :: .: NP_666 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa) initn: 632 init1: 264 opt: 664 Z-score: 756.7 bits: 148.9 E(85289): 2.1e-35 Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333) 170 180 190 200 210 220 pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ .:: : .::. :: .:. .:: . . 
: NP_001 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ 10 20 30 40 50 60 230 240 250 260 270 280 pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK .: .. ... :.:.: :::: .. . :: .:. : . :: ::: : NP_001 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK 70 80 90 100 110 120 290 300 310 320 330 pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::. . ...: ::: NP_001 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM 130 140 150 160 170 180 340 350 360 370 380 390 pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK . :.. ... :::..:..: :.. .::... : .: ::. :.. ..:. : .: NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT 190 200 210 220 230 240 400 410 420 430 440 pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI :::::::.: : . ::..:: ..: :: .::.::.:::: .: . .: . NP_001 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV 250 260 270 280 290 450 460 470 480 pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD ::::: .:: :: . . . ::. . :: .: NP_001 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>XP_016877441 (OMIM: 116820) PREDICTED: pro-cathepsin H (297 aa) initn: 543 init1: 237 opt: 656 Z-score: 748.3 bits: 147.2 E(85289): 6.1e-35 Smith-Waterman score: 656; 36.8% identity (67.2% similar) in 296 aa overlap (194-482:4-294) 170 180 190 200 210 220 pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQA . .:: : :: . ::..:..: .::.: XP_016 MSKHRKTY-STEEYHHRLQTFASNW---RKINA 10 20 230 240 250 260 270 280 pF1KE5 LDRG--TAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKG . : : ......:::.. :.. :: . .. ..: . ...: :: :::.:: XP_016 HNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY-PPSVDWRKKG 30 40 50 60 70 80 290 300 310 320 330 pF1KE5 A-VTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDC--DKMDKACMGGLP :. ::.:: :::::.::.:: .:. . 
: .:::.::.:.:: : ...:.:::: XP_016 NFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLP 90 100 110 120 130 140 340 350 360 370 380 390 pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELS-QNEQKLAAWLAKR :.:. : :. :: : :::. :.:. :: ...: .... .:. .. .: XP_016 SQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALY 150 160 170 180 190 200 400 410 420 430 440 450 pF1KE5 GPISVAINAF-GMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTD .:.: :... ...:: :: .: ..:::: ::::... .:.: .::::: . XP_016 NPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQ 210 220 230 240 250 260 460 470 480 pF1KE5 WGEKGYYYLHRGSGACGVNTMASSAVVD :: .::. ..::.. ::. . :: . XP_016 WGMNGYFLIERGKNMCGLAACASYPIPLV 270 280 290 484 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 05:01:47 2016 done: Tue Nov 8 05:01:49 2016 Total Scan time: 7.570 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]