FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1813, 329 aa 1>>>pF1KE1813 329 - 329 aa - 329 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.6184+/-0.000426; mu= 13.9177+/- 0.026 mean_var=66.2223+/-13.454, 0's: 0 Z-trim(110.7): 57 B-trim: 443 in 1/52 Lambda= 0.157606 statistics sampled from 19012 (19069) to 19012 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.595), E-opt: 0.2 (0.224), width: 16 Scan time: 5.530 The best scores are: opt bits E(85289) NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329) 2258 522.6 4.5e-148 NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331) 1226 287.9 2e-77 NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 1131 266.3 6.3e-71 NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 1131 266.3 6.3e-71 NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 1131 266.3 6.3e-71 XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333) 1131 266.3 6.3e-71 NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 1131 266.3 6.3e-71 NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334) 1130 266.1 7.4e-71 NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334) 1130 266.1 7.4e-71 XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 849 202.2 1.1e-51 XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 849 202.2 1.1e-51 NP_001186668 (OMIM: 116845) cathepsin S isoform 2 ( 281) 804 191.9 1.3e-48 NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335) 803 191.7 1.8e-48 XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 778 186.0 8.3e-47 XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 778 186.0 8.3e-47 XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317) 778 186.0 8.8e-47 XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424) 587 142.7 1.3e-33 NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484) 587 142.7 1.5e-33 NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151) 569 138.4 9.2e-33 NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201) 481 118.4 1.3e-26 NP_001325 (OMIM: 600550) cathepsin O preproprotein ( 321) 466 115.1 2e-25 NP_001326 (OMIM: 602364) cathepsin W preproprotein ( 376) 380 95.6 1.8e-19 XP_011519578 (OMIM: 116820) PREDICTED: pro-catheps ( 169) 327 83.4 3.7e-16 NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463) 286 74.2 5.8e-13 NP_001304166 (OMIM: 116810) cathepsin B isoform 2 ( 215) 226 60.5 3.8e-09 XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245) 226 60.5 4.3e-09 NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 226 60.5 5.7e-09 XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 226 60.5 5.7e-09 XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 226 60.5 5.7e-09 XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 226 60.5 5.7e-09 XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 226 60.5 5.7e-09 XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 226 60.5 5.7e-09 NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 226 60.5 5.7e-09 NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 226 60.5 5.7e-09 NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 226 60.5 5.7e-09 XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 226 60.5 5.7e-09 XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 226 60.5 5.7e-09 NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 226 60.5 5.7e-09 XP_005271164 (OMIM: 616064) PREDICTED: tubulointer ( 362) 203 55.3 2.3e-07 NP_001191344 (OMIM: 616064) tubulointerstitial nep ( 362) 203 55.3 2.3e-07 NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303) 202 55.1 2.3e-07 XP_005271163 (OMIM: 616064) PREDICTED: tubulointer ( 440) 203 55.4 2.7e-07 NP_071447 (OMIM: 616064) tubulointerstitial nephri ( 467) 203 55.4 2.8e-07 XP_011540248 (OMIM: 616064) PREDICTED: tubulointer ( 408) 200 54.7 4e-07 NP_001191343 (OMIM: 616064) tubulointerstitial nep ( 436) 200 54.7 4.3e-07 XP_016866234 (OMIM: 606749) PREDICTED: tubulointer ( 438) 185 51.3 4.6e-06 >>NP_000387 (OMIM: 265800,601105) cathepsin K preproprot (329 aa) initn: 2258 init1: 2258 opt: 2258 Z-score: 2779.3 bits: 522.6 E(85289): 4.5e-148 Smith-Waterman score: 2258; 100.0% identity (100.0% similar) in 329 aa overlap (1-329:1-329) 10 20 30 40 50 60 pF1KE1 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 NLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 NLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 YRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 YRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 YMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 YMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 RVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 RVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGE 250 260 270 280 290 300 310 320 pF1KE1 NWGNKGYILMARNKNNACGIANLASFPKM ::::::::::::::::::::::::::::: NP_000 NWGNKGYILMARNKNNACGIANLASFPKM 310 320 >>NP_004070 (OMIM: 116845) cathepsin S isoform 1 preprop (331 aa) initn: 1079 init1: 449 opt: 1226 Z-score: 1511.1 bits: 287.9 E(85289): 2e-77 Smith-Waterman score: 1226; 55.5% identity (79.9% similar) in 328 aa overlap (6-329:7-331) 10 20 30 40 50 pF1KE1 MWGLKVLLLPVVSFA-LYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYIS :::. . : :. . :: ::.:::::. :::..: .: ::::::::::.. NP_004 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 IHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDS .:::: :.:.:.:.:.::::::::::::.. :..:.:: . .: . : . . ::: NP_004 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQR--NITYKSNPNRILPDS 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE---ND ::.:.:: :: :: ::.::.:::::.:::::.::: :::::..:: :::::: .: : NP_004 VDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNK 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL ::.::.::.::::. :.::::. .::: .....:.:. .:: : : :.: : : .: NP_004 GCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIK :.::: :::::..:: :: .: .::::. :: ..:.::.::.:::: .:...:..: NP_004 KEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSC-TQNVNHGVLVVGYGDLNGKEYWLVK 240 250 260 270 280 290 300 310 320 pF1KE1 NSWGENWGNKGYILMARNKNNACGIANLASFPKM ::::.:.:..::: :::::.: ::::.. :.:.. NP_004 NSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI 300 310 320 330 >>NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa) initn: 887 init1: 321 opt: 1131 Z-score: 1394.3 bits: 266.3 E(85289): 6.3e-71 Smith-Waterman score: 1131; 50.9% identity (76.7% similar) in 330 aa overlap (4-327:7-331) 10 20 30 40 50 pF1KE1 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYI : .. : ..: .: .. :...: :: : . :. . .: :: .::::.:.: NP_001 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMI 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPD .:: : : :.. .::: .::::::: : :.:.. . :.. .. : . .:: NP_001 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQN--RKPRKGKVFQEPLFY-EAPR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--ND :::.:.::::::::::::::::::::..::::::. .:::.:..:: :::::: . :. NP_001 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL ::.:: : :::::: : :.:::..::: . :::: ::: ..:. :. .::. .:::: NP_001 GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKAL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQK----GNKH .::: :::.::::::. :: ::..:.:.. .:.:....:.::.::::... .::. NP_001 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 300 310 320 pF1KE1 WIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM :..::::::.:: ::. ::... : ::::. ::.: NP_001 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 887 init1: 321 opt: 1131 Z-score: 1394.3 bits: 266.3 E(85289): 6.3e-71 Smith-Waterman score: 1131; 50.9% identity (76.7% similar) in 330 aa overlap (4-327:7-331) 10 20 30 40 50 pF1KE1 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYI : .. : ..: .: .. :...: :: : . :. . .: :: .::::.:.: NP_001 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMI 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPD .:: : : :.. .::: .::::::: : :.:.. . :.. .. : . .:: NP_001 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQN--RKPRKGKVFQEPLFY-EAPR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--ND :::.:.::::::::::::::::::::..::::::. .:::.:..:: :::::: . :. NP_001 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL ::.:: : :::::: : :.:::..::: . :::: ::: ..:. :. .::. .:::: NP_001 GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKAL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQK----GNKH .::: :::.::::::. :: ::..:.:.. .:.:....:.::.::::... .::. NP_001 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 300 310 320 pF1KE1 WIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM :..::::::.:: ::. ::... : ::::. ::.: NP_001 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 887 init1: 321 opt: 1131 Z-score: 1394.3 bits: 266.3 E(85289): 6.3e-71 Smith-Waterman score: 1131; 50.9% identity (76.7% similar) in 330 aa overlap (4-327:7-331) 10 20 30 40 50 pF1KE1 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYI : .. : ..: .: .. :...: :: : . :. . .: :: .::::.:.: NP_666 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMI 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPD .:: : : :.. .::: .::::::: : :.:.. . :.. .. : . .:: NP_666 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQN--RKPRKGKVFQEPLFY-EAPR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--ND :::.:.::::::::::::::::::::..::::::. .:::.:..:: :::::: . :. NP_666 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL ::.:: : :::::: : :.:::..::: . :::: ::: ..:. :. .::. .:::: NP_666 GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKAL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQK----GNKH .::: :::.::::::. :: ::..:.:.. .:.:....:.::.::::... .::. NP_666 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 300 310 320 pF1KE1 WIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM :..::::::.:: ::. ::... : ::::. ::.: NP_666 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L1 is (333 aa) initn: 887 init1: 321 opt: 1131 Z-score: 1394.3 bits: 266.3 E(85289): 6.3e-71 Smith-Waterman score: 1131; 50.9% identity (76.7% similar) in 330 aa overlap (4-327:7-331) 10 20 30 40 50 pF1KE1 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYI : .. : ..: .: .. :...: :: : . :. . .: :: .::::.:.: XP_005 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMI 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPD .:: : : :.. .::: .::::::: : :.:.. . :.. .. : . .:: XP_005 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQN--RKPRKGKVFQEPLFY-EAPR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--ND :::.:.::::::::::::::::::::..::::::. .:::.:..:: :::::: . :. XP_005 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL ::.:: : :::::: : :.:::..::: . :::: ::: ..:. :. .::. .:::: XP_005 GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKAL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQK----GNKH .::: :::.::::::. :: ::..:.:.. .:.:....:.::.::::... .::. XP_005 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 300 310 320 pF1KE1 WIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM :..::::::.:: ::. ::... : ::::. ::.: XP_005 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa) initn: 887 init1: 321 opt: 1131 Z-score: 1394.3 bits: 266.3 E(85289): 6.3e-71 Smith-Waterman score: 1131; 50.9% identity (76.7% similar) in 330 aa overlap (4-327:7-331) 10 20 30 40 50 pF1KE1 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYI : .. : ..: .: .. :...: :: : . :. . .: :: .::::.:.: NP_001 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMI 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPD .:: : : :.. .::: .::::::: : :.:.. . :.. .. : . .:: NP_001 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQN--RKPRKGKVFQEPLFY-EAPR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--ND :::.:.::::::::::::::::::::..::::::. .:::.:..:: :::::: . :. NP_001 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL ::.:: : :::::: : :.:::..::: . :::: ::: ..:. :. .::. .:::: NP_001 GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKAL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQK----GNKH .::: :::.::::::. :: ::..:.:.. .:.:....:.::.::::... .::. NP_001 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 300 310 320 pF1KE1 WIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM :..::::::.:: ::. ::... : ::::. ::.: NP_001 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_001324 (OMIM: 603308) cathepsin L2 preproprotein [H (334 aa) initn: 1121 init1: 447 opt: 1130 Z-score: 1393.1 bits: 266.1 E(85289): 7.4e-71 Smith-Waterman score: 1130; 50.6% identity (76.5% similar) in 332 aa overlap (4-329:7-334) 10 20 30 40 50 pF1KE1 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYI : .. : ..: . .. :::.: :: :::. :. . .: :: .::::.:.: NP_001 MNLSLVLAAFCLGIASAVPKFDQNLDTKWYQWKATHRRLYGAN-EEGWRRAVWEKNMKMI 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPD .:: : : : : . .::: .::::.:: : : .. .. :.. .. : . : NP_001 ELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRN--QKFRKGKVFREPLFLD-LPK 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--ND :::.:::::::::::: :::::::::..::::::. .:::::..:: :::::: :. NP_001 SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQ 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL ::.::.:. :::::..: :.:::..::::. .: : : : ...:. :. . :.:::: NP_001 GCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKAL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGN----KH .::: :::.:::.::. .:::::..:.:.. .:.: ::.:.::.::::.. .: :. NP_001 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKY 240 250 260 270 280 290 300 310 320 pF1KE1 WIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM :..::::: .::..::. .:..::: ::::. ::.:.. NP_001 WLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 300 310 320 330 >>NP_001188504 (OMIM: 603308) cathepsin L2 preproprotein (334 aa) initn: 1121 init1: 447 opt: 1130 Z-score: 1393.1 bits: 266.1 E(85289): 7.4e-71 Smith-Waterman score: 1130; 50.6% identity (76.5% similar) in 332 aa overlap (4-329:7-334) 10 20 30 40 50 pF1KE1 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYI : .. : ..: . .. :::.: :: :::. :. . .: :: .::::.:.: NP_001 MNLSLVLAAFCLGIASAVPKFDQNLDTKWYQWKATHRRLYGAN-EEGWRRAVWEKNMKMI 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPD .:: : : : : . .::: .::::.:: : : .. .. :.. .. : . : NP_001 ELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRN--QKFRKGKVFREPLFLD-LPK 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--ND :::.:::::::::::: :::::::::..::::::. .:::::..:: :::::: :. NP_001 SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQ 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL ::.::.:. :::::..: :.:::..::::. .: : : : ...:. :. . :.:::: NP_001 GCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKAL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGN----KH .::: :::.:::.::. .:::::..:.:.. .:.: ::.:.::.::::.. .: :. NP_001 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKY 240 250 260 270 280 290 300 310 320 pF1KE1 WIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM :..::::: .::..::. .:..::: ::::. ::.:.. NP_001 WLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 300 310 320 330 >>XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L1 is (272 aa) initn: 811 init1: 321 opt: 849 Z-score: 1049.2 bits: 202.2 E(85289): 1.1e-51 Smith-Waterman score: 849; 51.7% identity (74.5% similar) in 263 aa overlap (4-263:7-264) 10 20 30 40 50 pF1KE1 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYI : .. : ..: .: .. :...: :: : . :. . .: :: .::::.:.: XP_016 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMI 10 20 30 40 50 60 70 80 90 100 110 pF1KE1 SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPD .:: : : :.. .::: .::::::: : :.:.. . :.. .. : . .:: XP_016 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQN--RKPRKGKVFQEPLFY-EAPR 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE1 SVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--ND :::.:.::::::::::::::::::::..::::::. .:::.:..:: :::::: . :. XP_016 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE1 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL ::.:: : :::::: : :.:::..::: . :::: ::: ..:. :. .::. .:::: XP_016 GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKAL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE1 KRAVARVGPVSVAIDASLTSFQFYSK-GVYYDESCNSDNLNHAVLAVGYGIQKGNKHWII .::: :::.::::::. :: ::.. :: XP_016 MKAVATVGPISVAIDAGHESFLFYKEAGVKNGAWVAT 240 250 260 270 329 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 18:55:16 2016 done: Sun Nov 6 18:55:17 2016 Total Scan time: 5.530 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]