Result of FASTA (omim) for pFN21AE5601
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5601, 484 aa
  1>>>pF1KE5601 484 - 484 aa - 484 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.1617+/-0.000323; mu= 19.6769+/- 0.020
 mean_var=77.4924+/-15.558, 0's: 0 Z-trim(116.4): 68  B-trim: 131 in 1/57
 Lambda= 0.145695
 statistics sampled from 27517 (27585) to 27517 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.689), E-opt: 0.2 (0.323), width:  16
 Scan time:  7.570

The best scores are:                                      opt bits E(85289)
NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484) 3292 701.4 1.4e-201
XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424) 2816 601.3 1.7e-171
NP_001326 (OMIM: 602364) cathepsin W preproprotein ( 376)  697 155.9 1.9e-37
NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335)  669 149.9   1e-35
NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333)  664 148.9 2.1e-35
XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333)  664 148.9 2.1e-35
NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333)  664 148.9 2.1e-35
NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333)  664 148.9 2.1e-35
NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333)  664 148.9 2.1e-35
XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  656 147.2 6.1e-35
XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  656 147.2 6.1e-35
XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317)  656 147.2 6.4e-35
NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334)  641 144.1 5.9e-34
NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334)  641 144.1 5.9e-34
NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329)  587 132.7 1.5e-30
NP_001325 (OMIM: 600550) cathepsin O preproprotein ( 321)  568 128.7 2.4e-29
NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331)  564 127.9 4.4e-29
XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  489 112.0 2.1e-24
XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  489 112.0 2.1e-24
NP_001186668 (OMIM: 116845) cathepsin S isoform 2  ( 281)  453 104.5 4.1e-22
NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201)  408 94.9 2.2e-19
NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151)  312 74.6 2.1e-13
XP_011519578 (OMIM: 116820) PREDICTED: pro-catheps ( 169)  266 65.0 1.9e-10
NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463)  257 63.5 1.5e-09
NP_001304166 (OMIM: 116810) cathepsin B isoform 2  ( 215)  228 57.1 5.8e-08
XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245)  228 57.1 6.4e-08
XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228 57.3 8.2e-08
XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228 57.3 8.2e-08
NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228 57.3 8.2e-08
XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228 57.3 8.2e-08
XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228 57.3 8.2e-08
NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228 57.3 8.2e-08
XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228 57.3 8.2e-08
NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228 57.3 8.2e-08
NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228 57.3 8.2e-08
XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228 57.3 8.2e-08
XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228 57.3 8.2e-08
NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228 57.3 8.2e-08
NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303)  160 42.9  0.0015


>>NP_003784 (OMIM: 603539,615362) cathepsin F precursor   (484 aa)
 initn: 3292 init1: 3292 opt: 3292  Z-score: 3739.8  bits: 701.4 E(85289): 1.4e-201
Smith-Waterman score: 3292; 100.0% identity (100.0% similar) in 484 aa overlap (1-484:1-484)

               10        20        30        40        50        60
pF1KE5 MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTRFALEMFNRGRAAGTRA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 MAPWLQLLSLLGLLPGAVAAPAQPRAASFQAWGPPSPELLAPTRFALEMFNRGRAAGTRA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 VLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKKTLLCSFQVLDELGRHVLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 VLGLVRGRVRRAGQGSLYSLEATLEEPPCNDPMVCRLPVSKKTLLCSFQVLDELGRHVLL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 RKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRNETFSSVISLLNEDPLSQDLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 RKDCGPVDTKVPGAGEPKSAFTQGSAMISSLSQNHPDNRNETFSSVISLLNEDPLSQDLP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 VKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLT
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 EEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 EEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSV
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE5 TGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 TGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE5 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 GHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE5 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_003 RPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASS
              430       440       450       460       470       480

           
pF1KE5 AVVD
       ::::
NP_003 AVVD
           

>>XP_011543630 (OMIM: 603539,615362) PREDICTED: cathepsi  (424 aa)
 initn: 2815 init1: 2815 opt: 2816  Z-score: 3199.9  bits: 601.3 E(85289): 1.7e-171
Smith-Waterman score: 2816; 98.1% identity (98.6% similar) in 424 aa overlap (62-484:1-424)

              40        50        60        70         80        90
pF1KE5 WGPPSPELLAPTRFALEMFNRGRAAGTRAVLGLVRGRVRR-AGQGSLYSLEATLEEPPCN
                                     .: .:   :  :::::::::::::::::::
XP_011                               MGPARWTNRSLAGQGSLYSLEATLEEPPCN
                                             10        20        30

              100       110       120       130       140       150
pF1KE5 DPMVCRLPVSKKTLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 DPMVCRLPVSKKTLLCSFQVLDELGRHVLLRKDCGPVDTKVPGAGEPKSAFTQGSAMISS
               40        50        60        70        80        90

              160       170       180       190       200       210
pF1KE5 LSQNHPDNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LSQNHPDNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSV
              100       110       120       130       140       150

              220       230       240       250       260       270
pF1KE5 FVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 FVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDL
              160       170       180       190       200       210

              280       290       300       310       320       330
pF1KE5 APPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 APPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMD
              220       230       240       250       260       270

              340       350       360       370       380       390
pF1KE5 KACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 KACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKL
              280       290       300       310       320       330

              400       410       420       430       440       450
pF1KE5 AAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 AAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIK
              340       350       360       370       380       390

              460       470       480    
pF1KE5 NSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD
       ::::::::::::::::::::::::::::::::::
XP_011 NSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD
              400       410       420    

>>NP_001326 (OMIM: 602364) cathepsin W preproprotein [Ho  (376 aa)
 initn: 793 init1: 250 opt: 697  Z-score: 793.5  bits: 155.9 E(85289): 1.9e-37
Smith-Waterman score: 823; 38.6% identity (66.4% similar) in 342 aa overlap (174-483:25-363)

           150       160       170           180       190         
pF1KE5 GSAMISSLSQNHPDNRNETFSSVISLLNEDPL-SQDL---PVKMASIFKNFVITYNRTYE
                                     :: .:::   :...   :: : : .::.: 
NP_001       MALTAHPSCLLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYL
                     10        20        30        40        50    

     200       210       220       230       240       250         
pF1KE5 SKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGN
       : ::   ::..:..:...::..:  : :::..::: :::::::::  .:     :.  :.
NP_001 SPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYG---YRRAAGG
           60        70        80        90       100          110 

     260       270             280       290       300       310   
pF1KE5 KMKQAKSVGDLAPPEW-----DWRS-KGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQG
         .... . .  : :      :::.  ::.. .:::  :. :::....::.:  : ..  
NP_001 VPSMGREIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAAGNIETLWRISFW
             120       130       140       150       160       170 

           320       330       340       350       360         370 
pF1KE5 TLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS--CNFSAE
        ....: :::::: .   .: ::.  .:. .. : .:: .: :: .::....  :. .  
NP_001 DFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKY
             180       190       200       210       220       230 

             380       390       400       410       420       430 
pF1KE5 KAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDH
       .  ..:.: . :..::...: .::  :::.:.::   .:.::.:. .     :.: :.::
NP_001 QKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDH
             240       250       260       270       280       290 

             440                           450       460       470 
pF1KE5 AVLLVGYGN-RSD-------------------VPFWAIKNSWGTDWGEKGYYYLHRGSGA
       .:::::.:. .:.                   .:.: .:::::..::::::. :::::..
NP_001 SVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEKGYFRLHRGSNT
             300       310       320       330       340       350 

             480                
pF1KE5 CGVNTMASSAVVD            
       ::.. .  .: :             
NP_001 CGITKFPLTARVQKPDMKPRVSCPP
             360       370      

>>NP_004381 (OMIM: 116820) pro-cathepsin H isoform a pre  (335 aa)
 initn: 543 init1: 237 opt: 669  Z-score: 762.3  bits: 149.9 E(85289): 1e-35
Smith-Waterman score: 669; 36.6% identity (67.3% similar) in 303 aa overlap (187-482:35-332)

        160       170       180       190       200       210      
pF1KE5 DNRNETFSSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMV
                                     ::...  . .:: : :: . ::..:..:  
NP_004 LPLLCAGAWLLGVPVCGAAELCVNSLEKFHFKSWMSKHRKTY-STEEYHHRLQTFASNW-
           10        20        30        40         50        60   

        220         230       240       250       260       270    
pF1KE5 RAQKIQALDRG--TAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPE
         .::.: . :  : ......:::..  :..  :: .  ..  ..: .  ...:   :: 
NP_004 --RKINAHNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY-PPS
               70        80        90       100       110          

          280        290       300       310       320         330 
pF1KE5 WDWRSKGA-VTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDC--DKMDK
        :::.::  :. ::.:: :::::.::.:: .:.   .  : .:::.::.:.::  :  ..
NP_004 VDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNH
     120       130       140       150       160       170         

             340       350       360       370       380        390
pF1KE5 ACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELS-QNEQKL
       .:.:::::.:.  :    :.  :: : :::.   :.:.  ::  ...: ....  .:. .
NP_004 GCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAM
     180       190       200       210       220       230         

              400        410       420       430       440         
pF1KE5 AAWLAKRGPISVAINAF-GMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAI
       .  .:  .:.: :...   ...:: ::        .:  ..:::: ::::... .:.: .
NP_004 VEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIV
     240       250       260       270       280       290         

     450       460       470       480     
pF1KE5 KNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVVD 
       ::::: .:: .::. ..::.. ::. . ::  .   
NP_004 KNSWGPQWGMNGYFLIERGKNMCGLAACASYPIPLV
     300       310       320       330     

>>NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 prepro  (333 aa)
 initn: 632 init1: 264 opt: 664  Z-score: 756.7  bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)

           170       180       190       200       210        220  
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
                                     .:: :  .::. :: .:. .::   . . :
NP_001 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
          10        20        30        40         50        60    

            230         240       250       260       270       280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
          .:  ..  ... :.:.: :::: .. .   ::   .:. :     . ::   ::: :
NP_001 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
           70        80        90       100       110        120   

              290       300       310       320         330        
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
       : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::.  . ...: ::: 
NP_001 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
           130       140       150       160       170       180   

      340       350       360       370       380         390      
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
       . :.. ... :::..:..: :..  .::...  : .:  ::.  :.. ..:. :   .: 
NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
           190       200       210         220       230       240 

        400       410          420       430       440             
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
        :::::::.: : .   ::..::   ..: ::   .::.::.:::: .:    .  .: .
NP_001 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
             250        260         270       280       290        

     450       460        470       480    
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
       ::::: .::  ::  . .   . ::. . ::  .: 
NP_001 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 
      300       310       320       330    

>>XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L1 is  (333 aa)
 initn: 632 init1: 264 opt: 664  Z-score: 756.7  bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)

           170       180       190       200       210        220  
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
                                     .:: :  .::. :: .:. .::   . . :
XP_005 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
          10        20        30        40         50        60    

            230         240       250       260       270       280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
          .:  ..  ... :.:.: :::: .. .   ::   .:. :     . ::   ::: :
XP_005 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
           70        80        90       100       110        120   

              290       300       310       320         330        
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
       : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::.  . ...: ::: 
XP_005 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
           130       140       150       160       170       180   

      340       350       360       370       380         390      
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
       . :.. ... :::..:..: :..  .::...  : .:  ::.  :.. ..:. :   .: 
XP_005 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
           190       200       210         220       230       240 

        400       410          420       430       440             
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
        :::::::.: : .   ::..::   ..: ::   .::.::.:::: .:    .  .: .
XP_005 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
             250        260         270       280       290        

     450       460        470       480    
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
       ::::: .::  ::  . .   . ::. . ::  .: 
XP_005 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 
      300       310       320       330    

>>NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 pre  (333 aa)
 initn: 632 init1: 264 opt: 664  Z-score: 756.7  bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)

           170       180       190       200       210        220  
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
                                     .:: :  .::. :: .:. .::   . . :
NP_001 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
          10        20        30        40         50        60    

            230         240       250       260       270       280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
          .:  ..  ... :.:.: :::: .. .   ::   .:. :     . ::   ::: :
NP_001 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
           70        80        90       100       110        120   

              290       300       310       320         330        
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
       : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::.  . ...: ::: 
NP_001 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
           130       140       150       160       170       180   

      340       350       360       370       380         390      
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
       . :.. ... :::..:..: :..  .::...  : .:  ::.  :.. ..:. :   .: 
NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
           190       200       210         220       230       240 

        400       410          420       430       440             
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
        :::::::.: : .   ::..::   ..: ::   .::.::.:::: .:    .  .: .
NP_001 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
             250        260         270       280       290        

     450       460        470       480    
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
       ::::: .::  ::  . .   . ::. . ::  .: 
NP_001 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 
      300       310       320       330    

>>NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 prepro  (333 aa)
 initn: 632 init1: 264 opt: 664  Z-score: 756.7  bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)

           170       180       190       200       210        220  
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
                                     .:: :  .::. :: .:. .::   . . :
NP_666 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
          10        20        30        40         50        60    

            230         240       250       260       270       280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
          .:  ..  ... :.:.: :::: .. .   ::   .:. :     . ::   ::: :
NP_666 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
           70        80        90       100       110        120   

              290       300       310       320         330        
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
       : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::.  . ...: ::: 
NP_666 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
           130       140       150       160       170       180   

      340       350       360       370       380         390      
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
       . :.. ... :::..:..: :..  .::...  : .:  ::.  :.. ..:. :   .: 
NP_666 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
           190       200       210         220       230       240 

        400       410          420       430       440             
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
        :::::::.: : .   ::..::   ..: ::   .::.::.:::: .:    .  .: .
NP_666 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
             250        260         270       280       290        

     450       460        470       480    
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
       ::::: .::  ::  . .   . ::. . ::  .: 
NP_666 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 
      300       310       320       330    

>>NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 pre  (333 aa)
 initn: 632 init1: 264 opt: 664  Z-score: 756.7  bits: 148.9 E(85289): 2.1e-35
Smith-Waterman score: 664; 39.7% identity (65.6% similar) in 305 aa overlap (194-483:36-333)

           170       180       190       200       210        220  
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQ-KIQ
                                     .:: :  .::. :: .:. .::   . . :
NP_001 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEG-WRRAVWEKNMKMIELHNQ
          10        20        30        40         50        60    

            230         240       250       260       270       280
pF1KE5 ALDRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSK
          .:  ..  ... :.:.: :::: .. .   ::   .:. :     . ::   ::: :
NP_001 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK
           70        80        90       100       110        120   

              290       300       310       320         330        
pF1KE5 GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLP
       : :: ::.::.::::::::.:: .::: : . : :.:::::.:.::.  . ...: ::: 
NP_001 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM
           130       140       150       160       170       180   

      340       350       360       370       380         390      
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDS--VELSQNEQKLAAWLAK
       . :.. ... :::..:..: :..  .::...  : .:  ::.  :.. ..:. :   .: 
NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSV-ANDTGFVDIPKQEKALMKAVAT
           190       200       210         220       230       240 

        400       410          420       430       440             
pF1KE5 RGPISVAINAFGMQ---FYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS----DVPFWAI
        :::::::.: : .   ::..::   ..: ::   .::.::.:::: .:    .  .: .
NP_001 VGPISVAIDA-GHESFLFYKEGIY--FEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV
             250        260         270       280       290        

     450       460        470       480    
pF1KE5 KNSWGTDWGEKGYYYLHRGS-GACGVNTMASSAVVD
       ::::: .::  ::  . .   . ::. . ::  .: 
NP_001 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 
      300       310       320       330    

>>XP_016877441 (OMIM: 116820) PREDICTED: pro-cathepsin H  (297 aa)
 initn: 543 init1: 237 opt: 656  Z-score: 748.3  bits: 147.2 E(85289): 6.1e-35
Smith-Waterman score: 656; 36.8% identity (67.2% similar) in 296 aa overlap (194-482:4-294)

           170       180       190       200       210       220   
pF1KE5 SSVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQA
                                     . .:: : :: . ::..:..:    .::.:
XP_016                            MSKHRKTY-STEEYHHRLQTFASNW---RKINA
                                           10        20            

             230       240       250       260       270       280 
pF1KE5 LDRG--TAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKG
        . :  : ......:::..  :..  :: .  ..  ..: .  ...:   ::  :::.::
XP_016 HNNGNHTFKMALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY-PPSVDWRKKG
      30        40        50        60        70         80        

              290       300       310       320         330        
pF1KE5 A-VTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDC--DKMDKACMGGLP
         :. ::.:: :::::.::.:: .:.   .  : .:::.::.:.::  :  ...:.::::
XP_016 NFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLP
       90       100       110       120       130       140        

      340       350       360       370       380        390       
pF1KE5 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELS-QNEQKLAAWLAKR
       :.:.  :    :.  :: : :::.   :.:.  ::  ...: ....  .:. ..  .:  
XP_016 SQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALY
      150       160       170       180       190       200        

       400        410       420       430       440       450      
pF1KE5 GPISVAINAF-GMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTD
       .:.: :...   ...:: ::        .:  ..:::: ::::... .:.: .::::: .
XP_016 NPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQ
      210       220       230       240       250       260        

        460       470       480     
pF1KE5 WGEKGYYYLHRGSGACGVNTMASSAVVD 
       :: .::. ..::.. ::. . ::  .   
XP_016 WGMNGYFLIERGKNMCGLAACASYPIPLV
      270       280       290       




484 residues in 1 query   sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov  8 05:01:47 2016 done: Tue Nov  8 05:01:49 2016
 Total Scan time:  7.570 Total Display time:  0.020

Function used was FASTA [36.3.4 Apr, 2011]