FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE5142, 104 aa
1>>>pF1KE5142 104 - 104 aa - 104 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.7097+/-0.00026; mu= 12.4197+/- 0.016
mean_var=50.4980+/-10.091, 0's: 0 Z-trim(118.7): 23 B-trim: 0 in 0/57
Lambda= 0.180483
statistics sampled from 31819 (31842) to 31819 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.373), width: 16
Scan time: 3.190
The best scores are: opt bits E(85289)
NP_001159896 (OMIM: 169740) gastricsin isoform 2 p ( 315) 636 172.6 3.1e-43
NP_002621 (OMIM: 169740) gastricsin isoform 1 prep ( 388) 636 172.6 3.8e-43
NP_001073275 (OMIM: 169710) pepsin A-3 preproprote ( 388) 269 77.1 2.2e-14
XP_016854970 (OMIM: 169720) PREDICTED: pepsin A-4 ( 388) 268 76.8 2.6e-14
NP_055039 (OMIM: 169730) pepsin A-5 preproprotein ( 388) 268 76.8 2.6e-14
NP_001073276 (OMIM: 169720) pepsin A-4 preproprote ( 388) 268 76.8 2.6e-14
NP_683865 (OMIM: 116890) cathepsin E isoform b pre ( 363) 231 67.1 2e-11
XP_011507547 (OMIM: 116890) PREDICTED: cathepsin E ( 368) 231 67.1 2e-11
NP_001901 (OMIM: 116890) cathepsin E isoform a pre ( 396) 231 67.2 2.1e-11
XP_011507546 (OMIM: 116890) PREDICTED: cathepsin E ( 401) 231 67.2 2.1e-11
NP_001900 (OMIM: 116840,610127) cathepsin D prepro ( 412) 229 66.7 3.2e-11
NP_000528 (OMIM: 179820,267430,613092) renin prepr ( 406) 209 61.4 1.2e-09
XP_016883001 (OMIM: 605631) PREDICTED: napsin-A is ( 411) 167 50.5 2.3e-06
NP_004842 (OMIM: 605631) napsin-A preproprotein [H ( 420) 167 50.5 2.3e-06
XP_011525842 (OMIM: 605631) PREDICTED: napsin-A is ( 420) 167 50.5 2.3e-06
NP_001304260 (OMIM: 116890) cathepsin E isoform c ( 288) 138 42.9 0.00032
>>NP_001159896 (OMIM: 169740) gastricsin isoform 2 prepr (315 aa)
initn: 636 init1: 636 opt: 636 Z-score: 897.1 bits: 172.6 E(85289): 3.1e-43
Smith-Waterman score: 636; 99.0% identity (99.0% similar) in 97 aa overlap (1-97:1-97)
10 20 30 40 50 60
pF1KE5 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
10 20 30 40 50 60
70 80 90 100
pF1KE5 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
::::::::::::::::::::::::::::::::::: :
NP_001 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSES
70 80 90 100 110 120
NP_001 STYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIM
130 140 150 160 170 180
>>NP_002621 (OMIM: 169740) gastricsin isoform 1 prepropr (388 aa)
initn: 651 init1: 636 opt: 636 Z-score: 895.7 bits: 172.6 E(85289): 3.8e-43
Smith-Waterman score: 636; 99.0% identity (99.0% similar) in 97 aa overlap (1-97:1-97)
10 20 30 40 50 60
pF1KE5 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_002 MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDLS
10 20 30 40 50 60
70 80 90 100
pF1KE5 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
::::::::::::::::::::::::::::::::::: :
NP_002 VTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQSQACTSHSRFNPSES
70 80 90 100 110 120
NP_002 STYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEFGLSENEPGTNFVYAQFDGIM
130 140 150 160 170 180
>>NP_001073275 (OMIM: 169710) pepsin A-3 preproprotein [ (388 aa)
initn: 275 init1: 135 opt: 269 Z-score: 379.2 bits: 77.1 E(85289): 2.2e-14
Smith-Waterman score: 269; 47.6% identity (70.9% similar) in 103 aa overlap (1-97:1-100)
10 20 30 40 50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
:::.... :: :. : . :::: . ::.:.:..:.::: .::. :. .:: :: : .
NP_001 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
10 20 30 40 50
60 70 80 90 100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
. : .:. :.: ::: :.:::: :.: :.:::::: :
NP_001 KAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
60 70 80 90 100 110
NP_001 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
120 130 140 150 160 170
>>XP_016854970 (OMIM: 169720) PREDICTED: pepsin A-4 isof (388 aa)
initn: 275 init1: 135 opt: 268 Z-score: 377.8 bits: 76.8 E(85289): 2.6e-14
Smith-Waterman score: 268; 47.6% identity (69.9% similar) in 103 aa overlap (1-97:1-100)
10 20 30 40 50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
:::.... :: :. : . :::: . ::.:.:..:.::: .::. :. .:: :: : .
XP_016 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
10 20 30 40 50
60 70 80 90 100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
: .:. :.: ::: :.:::: :.: :.:::::: :
XP_016 EAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
60 70 80 90 100 110
XP_016 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
120 130 140 150 160 170
>>NP_055039 (OMIM: 169730) pepsin A-5 preproprotein [Hom (388 aa)
initn: 275 init1: 135 opt: 268 Z-score: 377.8 bits: 76.8 E(85289): 2.6e-14
Smith-Waterman score: 268; 47.6% identity (69.9% similar) in 103 aa overlap (1-97:1-100)
10 20 30 40 50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
:::.... :: :. : . :::: . ::.:.:..:.::: .::. :. .:: :: : .
NP_055 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
10 20 30 40 50
60 70 80 90 100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
: .:. :.: ::: :.:::: :.: :.:::::: :
NP_055 EAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
60 70 80 90 100 110
NP_055 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
120 130 140 150 160 170
>>NP_001073276 (OMIM: 169720) pepsin A-4 preproprotein [ (388 aa)
initn: 275 init1: 135 opt: 268 Z-score: 377.8 bits: 76.8 E(85289): 2.6e-14
Smith-Waterman score: 268; 47.6% identity (69.9% similar) in 103 aa overlap (1-97:1-100)
10 20 30 40 50
pF1KE5 MKWMVVV-LVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFGDL
:::.... :: :. : . :::: . ::.:.:..:.::: .::. :. .:: :: : .
NP_001 MKWLLLLGLVALS--ECIMYKVPLIRKKSLRRTLSERGLLKDFLKKHNLNPARKY-FPQW
10 20 30 40 50
60 70 80 90 100
pF1KE5 S----VTYEPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
: .:. :.: ::: :.:::: :.: :.:::::: :
NP_001 EAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCSSLACTNHNR
60 70 80 90 100 110
NP_001 FNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTNQIFGLSETEPGSFLYYA
120 130 140 150 160 170
>>NP_683865 (OMIM: 116890) cathepsin E isoform b precurs (363 aa)
initn: 234 init1: 138 opt: 231 Z-score: 326.2 bits: 67.1 E(85289): 2e-11
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)
10 20 30 40 50
pF1KE5 MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
....:: :.: :: .. .:::.. :... .. .. :.:: ..:. : .
NP_683 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
10 20 30 40 50 60
60 70 80 90 100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
... . ::. :.: ::: ::::.::::: :.:::::: :
NP_683 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
70 80 90 100 110 120
NP_683 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAE
130 140 150 160 170 180
>>XP_011507547 (OMIM: 116890) PREDICTED: cathepsin E iso (368 aa)
initn: 234 init1: 138 opt: 231 Z-score: 326.1 bits: 67.1 E(85289): 2e-11
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)
10 20 30 40 50
pF1KE5 MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
....:: :.: :: .. .:::.. :... .. .. :.:: ..:. : .
XP_011 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
10 20 30 40 50 60
60 70 80 90 100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
... . ::. :.: ::: ::::.::::: :.:::::: :
XP_011 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
70 80 90 100 110 120
XP_011 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSAFSYQVEGLTVVGQQFGESVTEPGQT
130 140 150 160 170 180
>>NP_001901 (OMIM: 116890) cathepsin E isoform a preprop (396 aa)
initn: 212 init1: 138 opt: 231 Z-score: 325.6 bits: 67.2 E(85289): 2.1e-11
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)
10 20 30 40 50
pF1KE5 MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
....:: :.: :: .. .:::.. :... .. .. :.:: ..:. : .
NP_001 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
10 20 30 40 50 60
60 70 80 90 100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
... . ::. :.: ::: ::::.::::: :.:::::: :
NP_001 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
70 80 90 100 110 120
NP_001 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAE
130 140 150 160 170 180
>>XP_011507546 (OMIM: 116890) PREDICTED: cathepsin E iso (401 aa)
initn: 212 init1: 138 opt: 231 Z-score: 325.5 bits: 67.2 E(85289): 2.1e-11
Smith-Waterman score: 231; 39.8% identity (70.4% similar) in 98 aa overlap (4-97:5-102)
10 20 30 40 50
pF1KE5 MKWMVVVLVCLQLLEA--AVVKVPLKKFKSIRETMKEKGLLGEFLRTHKYDPAWKYRFG
....:: :.: :: .. .:::.. :... .. .. :.:: ..:. : .
XP_011 MKTLLLLLLVLLELGEAQGSLHRVPLRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESC
10 20 30 40 50 60
60 70 80 90 100
pF1KE5 DLSVTY-EPMA-YMDAAYFGEISIGTPPQNFLVLFDTGSSPLPLDPAPS
... . ::. :.: ::: ::::.::::: :.:::::: :
XP_011 SMDQSAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRF
70 80 90 100 110 120
XP_011 QPSQSSTYSQPGQSFSIQYGTGSLSGIIGADQVSAFSYQVEGLTVVGQQFGESVTEPGQT
130 140 150 160 170 180
104 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Mon Nov 7 21:47:31 2016 done: Mon Nov 7 21:47:32 2016
Total Scan time: 3.190 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]