FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5142, 104 aa
  1>>>pF1KE5142 104 - 104 aa - 104 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.7097+/-0.00026; mu= 12.4197+/- 0.016
 mean_var=50.4980+/-10.091, 0's: 0 Z-trim(118.7): 23 B-trim: 0 in 0/57
 Lambda= 0.180483
 statistics sampled from 31819 (31842) to 31819 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.373), width: 16
 Scan time: 3.190

The best scores are:                                       opt bits E(85289)
NP_001159896 (OMIM: 169740) gastricsin isoform 2 p ( 315)  636 172.6 3.1e-43
NP_002621 (OMIM: 169740) gastricsin isoform 1 prep ( 388)  636 172.6 3.8e-43
NP_001073275 (OMIM: 169710) pepsin A-3 preproprote ( 388)  269  77.1 2.2e-14
XP_016854970 (OMIM: 169720) PREDICTED: pepsin A-4  ( 388)  268  76.8 2.6e-14
NP_055039 (OMIM: 169730) pepsin A-5 preproprotein  ( 388)  268  76.8 2.6e-14
NP_001073276 (OMIM: 169720) pepsin A-4 preproprote ( 388)  268  76.8 2.6e-14
NP_683865 (OMIM: 116890) cathepsin E isoform b pre ( 363)  231  67.1   2e-11
XP_011507547 (OMIM: 116890) PREDICTED: cathepsin E ( 368)  231  67.1   2e-11
NP_001901 (OMIM: 116890) cathepsin E isoform a pre ( 396)  231  67.2 2.1e-11
XP_011507546 (OMIM: 116890) PREDICTED: cathepsin E ( 401)  231  67.2 2.1e-11
NP_001900 (OMIM: 116840,610127) cathepsin D prepro ( 412)  229  66.7 3.2e-11
NP_000528 (OMIM: 179820,267430,613092) renin prepr ( 406)  209  61.4 1.2e-09
XP_016883001 (OMIM: 605631) PREDICTED: napsin-A is ( 411)  167  50.5 2.3e-06
NP_004842 (OMIM: 605631) napsin-A preproprotein [H ( 420)  167  50.5 2.3e-06
XP_011525842 (OMIM: 605631) PREDICTED: napsin-A is ( 420)  167  50.5 2.3e-06
NP_001304260 (OMIM: 116890) cathepsin E isoform c  ( 288)  138  42.9 0.00032

>>NP_001159896 (OMIM: 169740) gastricsin isoform 2 prepr (315 aa)
 initn: 636 init1: 636 opt: 636 Z-score: 897.1 bits: 172.6 E(85289): 3.1e-43
Smith-Waterman score: 636; 99.0% identity (99.0% similar) in 97 aa overlap (1-97:1-97)

               10        20        30        40        50        60
pF1KE5 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
               10        20        30        40        50        60

               70        80        90       100
pF1KE5 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ::::::::::::::::::::::::::::::::::: :
NP_001 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSES
               70        80        90       100       110       120

NP_001 STYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIM
              130       140       150       160       170       180

>>NP_002621 (OMIM: 169740) gastricsin isoform 1 prepropr (388 aa)
 initn: 651 init1: 636 opt: 636 Z-score: 895.7 bits: 172.6 E(85289): 3.8e-43
Smith-Waterman score: 636; 99.0% identity (99.0% similar) in 97 aa overlap (1-97:1-97)

               10        20        30        40        50        60
pF1KE5 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
               10        20        30        40        50        60

               70        80        90       100
pF1KE5 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ::::::::::::::::::::::::::::::::::: :
NP_002 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSES
               70        80        90       100       110       120

NP_002 STYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIM
              130       140       150       160       170       180

>>NP_001073275 (OMIM: 169710) pepsin A-3 preproprotein [ (388 aa)
 initn: 275 init1: 135 opt: 269 Z-score: 379.2 bits: 77.1 E(85289): 2.2e-14
Smith-Waterman score: 269; 47.6% identity (70.9% similar) in 103 aa overlap (1-97:1-100)

                10        20        30        40        50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
       :::.... :: :.  :  . :::: . ::.:.:..:.::: .::. :. .:: :: : .
NP_001 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
               10          20        30        40        50

      60             70        80        90       100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       .    :  .:.  :.:  ::: :.:::: :.: :.:::::: :
NP_001 KAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
        60        70        80        90       100       110

NP_001 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
       120       130       140       150       160       170

>>XP_016854970 (OMIM: 169720) PREDICTED: pepsin A-4 isof (388 aa)
 initn: 275 init1: 135 opt: 268 Z-score: 377.8 bits: 76.8 E(85289): 2.6e-14
Smith-Waterman score: 268; 47.6% identity (69.9% similar) in 103 aa overlap (1-97:1-100)

                10        20        30        40        50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
       :::.... :: :.  :  . :::: . ::.:.:..:.::: .::. :. .:: :: : .
XP_016 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
               10          20        30        40        50

      60             70        80        90       100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
            :  .:.  :.:  ::: :.:::: :.: :.:::::: :
XP_016 EAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
        60        70        80        90       100       110

XP_016 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
       120       130       140       150       160       170

>>NP_055039 (OMIM: 169730) pepsin A-5 preproprotein [Hom (388 aa)
 initn: 275 init1: 135 opt: 268 Z-score: 377.8 bits: 76.8 E(85289): 2.6e-14
Smith-Waterman score: 268; 47.6% identity (69.9% similar) in 103 aa overlap (1-97:1-100)

                10        20        30        40        50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
       :::.... :: :.  :  . :::: . ::.:.:..:.::: .::. :. .:: :: : .
NP_055 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
               10          20        30        40        50

      60             70        80        90       100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
            :  .:.  :.:  ::: :.:::: :.: :.:::::: :
NP_055 EAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
        60        70        80        90       100       110

NP_055 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
       120       130       140       150       160       170

>>NP_001073276 (OMIM: 169720) pepsin A-4 preproprotein [ (388 aa)
 initn: 275 init1: 135 opt: 268 Z-score: 377.8 bits: 76.8 E(85289): 2.6e-14
Smith-Waterman score: 268; 47.6% identity (69.9% similar) in 103 aa overlap (1-97:1-100)

                10        20        30        40        50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
       :::.... :: :.  :  . :::: . ::.:.:..:.::: .::. :. .:: :: : .
NP_001 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
               10          20        30        40        50

      60             70        80        90       100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
            :  .:.  :.:  ::: :.:::: :.: :.:::::: :
NP_001 EAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
        60        70        80        90       100       110

NP_001 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
       120       130       140       150       160       170

>>NP_683865 (OMIM: 116890) cathepsin E isoform b precurs (363 aa)
 initn: 234 init1: 138 opt: 231 Z-score: 326.2 bits: 67.1 E(85289): 2e-11
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)

                10          20        30        40        50
pF1KE5  MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
           ....:: :.: ::  .. .:::..  :... .. .. :.:: ..:. :     .
NP_683 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
               10        20        30        40        50        60

        60          70        80        90       100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ... .  ::.  :.:  ::: ::::.::::: :.:::::: :
NP_683 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
               70        80        90       100       110       120

NP_683 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAE
              130       140       150       160       170       180

>>XP_011507547 (OMIM: 116890) PREDICTED: cathepsin E iso (368 aa)
 initn: 234 init1: 138 opt: 231 Z-score: 326.1 bits: 67.1 E(85289): 2e-11
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)

                10          20        30        40        50
pF1KE5  MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
           ....:: :.: ::  .. .:::..  :... .. .. :.:: ..:. :     .
XP_011 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
               10        20        30        40        50        60

        60          70        80        90       100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ... .  ::.  :.:  ::: ::::.::::: :.:::::: :
XP_011 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
               70        80        90       100       110       120

XP_011 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSAFSYQVEGLTVVGQQFGESVTEPGQT
              130       140       150       160       170       180

>>NP_001901 (OMIM: 116890) cathepsin E isoform a preprop (396 aa)
 initn: 212 init1: 138 opt: 231 Z-score: 325.6 bits: 67.2 E(85289): 2.1e-11
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)

                10          20        30        40        50
pF1KE5  MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
           ....:: :.: ::  .. .:::..  :... .. .. :.:: ..:. :     .
NP_001 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
               10        20        30        40        50        60

        60          70        80        90       100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ... .  ::.  :.:  ::: ::::.::::: :.:::::: :
NP_001 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
               70        80        90       100       110       120

NP_001 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAE
              130       140       150       160       170       180

>>XP_011507546 (OMIM: 116890) PREDICTED: cathepsin E iso (401 aa)
 initn: 212 init1: 138 opt: 231 Z-score: 325.5 bits: 67.2 E(85289): 2.1e-11
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)

                10          20        30        40        50
pF1KE5  MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
           ....:: :.: ::  .. .:::..  :... .. .. :.:: ..:. :     .
XP_011 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
               10        20        30        40        50        60

        60          70        80        90       100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
       ... .  ::.  :.:  ::: ::::.::::: :.:::::: :
XP_011 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
               70        80        90       100       110       120

XP_011 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSAFSYQVEGLTVVGQQFGESVTEPGQT
              130       140       150       160       170       180

104 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 21:47:31 2016 done: Mon Nov 7 21:47:32 2016
 Total Scan time: 3.190 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]