FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6168, 141 aa
1>>>pF1KE6168 141 - 141 aa - 141 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1103+/-0.000279; mu= 12.2638+/- 0.017
mean_var=58.7165+/-11.871, 0's: 0 Z-trim(117.7): 28 B-trim: 0 in 0/54
Lambda= 0.167376
statistics sampled from 29916 (29944) to 29916 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.746), E-opt: 0.2 (0.351), width: 16
Scan time: 4.470
The best scores are: opt bits E(85289)
XP_006722990 (OMIM: 611340) PREDICTED: cysteine pr ( 360) 974 242.9 3.2e-64
NP_001268433 (OMIM: 611340) cysteine protease ATG4 ( 411) 974 243.0 3.6e-64
NP_116274 (OMIM: 611340) cysteine protease ATG4D i ( 474) 974 243.0 4e-64
NP_835739 (OMIM: 611339) cysteine protease ATG4C [ ( 458) 475 122.5 7.3e-28
XP_005271345 (OMIM: 611339) PREDICTED: cysteine pr ( 458) 475 122.5 7.3e-28
NP_116241 (OMIM: 611339) cysteine protease ATG4C [ ( 458) 475 122.5 7.3e-28
XP_011540615 (OMIM: 611339) PREDICTED: cysteine pr ( 423) 387 101.2 1.7e-21
XP_005247053 (OMIM: 611338) PREDICTED: cysteine pr ( 319) 273 73.6 2.6e-13
XP_016859129 (OMIM: 611338) PREDICTED: cysteine pr ( 319) 273 73.6 2.6e-13
NP_037457 (OMIM: 611338) cysteine protease ATG4B i ( 393) 273 73.7 3.1e-13
XP_016859127 (OMIM: 611338) PREDICTED: cysteine pr ( 457) 273 73.7 3.5e-13
XP_016859128 (OMIM: 611338) PREDICTED: cysteine pr ( 341) 264 71.5 1.2e-12
XP_005247052 (OMIM: 611338) PREDICTED: cysteine pr ( 341) 264 71.5 1.2e-12
NP_847896 (OMIM: 611338) cysteine protease ATG4B i ( 380) 264 71.5 1.4e-12
XP_005247050 (OMIM: 611338) PREDICTED: cysteine pr ( 405) 264 71.5 1.4e-12
XP_005247049 (OMIM: 611338) PREDICTED: cysteine pr ( 415) 264 71.5 1.5e-12
XP_016859126 (OMIM: 611338) PREDICTED: cysteine pr ( 479) 264 71.6 1.7e-12
NP_001308219 (OMIM: 300663) cysteine protease ATG4 ( 226) 241 65.8 4.1e-11
XP_011529144 (OMIM: 300663) PREDICTED: cysteine pr ( 321) 241 65.9 5.5e-11
NP_001308217 (OMIM: 300663) cysteine protease ATG4 ( 321) 241 65.9 5.5e-11
NP_001308216 (OMIM: 300663) cysteine protease ATG4 ( 321) 241 65.9 5.5e-11
NP_840055 (OMIM: 300663) cysteine protease ATG4A i ( 321) 241 65.9 5.5e-11
NP_443168 (OMIM: 300663) cysteine protease ATG4A i ( 398) 241 66.0 6.6e-11
XP_016858089 (OMIM: 611339) PREDICTED: cysteine pr ( 418) 225 62.1 1e-09
XP_016858090 (OMIM: 611339) PREDICTED: cysteine pr ( 418) 225 62.1 1e-09
NP_001308218 (OMIM: 300663) cysteine protease ATG4 ( 259) 192 54.0 1.7e-07
NP_840054 (OMIM: 300663) cysteine protease ATG4A i ( 336) 192 54.1 2.1e-07
XP_011529143 (OMIM: 300663) PREDICTED: cysteine pr ( 340) 192 54.1 2.1e-07
>>XP_006722990 (OMIM: 611340) PREDICTED: cysteine protea (360 aa)
initn: 974 init1: 974 opt: 974 Z-score: 1274.0 bits: 242.9 E(85289): 3.2e-64
Smith-Waterman score: 974; 100.0% identity (100.0% similar) in 141 aa overlap (1-141:220-360)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
::::::::::::::::::::::::::::::
XP_006 VPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
190 200 210 220 230 240
40 50 60 70 80 90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
250 260 270 280 290 300
100 110 120 130 140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
:::::::::::::::::::::::::::::::::::::::::::::::::::
XP_006 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
310 320 330 340 350 360
>>NP_001268433 (OMIM: 611340) cysteine protease ATG4D is (411 aa)
initn: 974 init1: 974 opt: 974 Z-score: 1273.1 bits: 243.0 E(85289): 3.6e-64
Smith-Waterman score: 974; 100.0% identity (100.0% similar) in 141 aa overlap (1-141:271-411)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
::::::::::::::::::::::::::::::
NP_001 VPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
250 260 270 280 290 300
40 50 60 70 80 90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
310 320 330 340 350 360
100 110 120 130 140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
:::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
370 380 390 400 410
>>NP_116274 (OMIM: 611340) cysteine protease ATG4D isofo (474 aa)
initn: 974 init1: 974 opt: 974 Z-score: 1272.2 bits: 243.0 E(85289): 4e-64
Smith-Waterman score: 974; 100.0% identity (100.0% similar) in 141 aa overlap (1-141:334-474)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
::::::::::::::::::::::::::::::
NP_116 VPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
310 320 330 340 350 360
40 50 60 70 80 90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_116 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
370 380 390 400 410 420
100 110 120 130 140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
:::::::::::::::::::::::::::::::::::::::::::::::::::
NP_116 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
430 440 450 460 470
>>NP_835739 (OMIM: 611339) cysteine protease ATG4C [Homo (458 aa)
initn: 495 init1: 400 opt: 475 Z-score: 621.2 bits: 122.5 E(85289): 7.3e-28
Smith-Waterman score: 475; 49.6% identity (72.3% similar) in 141 aa overlap (1-141:323-458)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.::::..: :: :.::: :.:.::::::
NP_835 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF
300 310 320 330 340 350
40 50 60 70 80 90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
:::: :::::.::: ::.::.: :::::::.::: . ..:. :.:..:. :: :
NP_835 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKFSSK-E
360 370 380 390 400 410
100 110 120 130 140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
.::.::...::..:. : . . : . : :: :.:.::.:
NP_835 KYPLFTFVNGHSRDY--DFTSTTTNEEDLFSEDEKKQL--KRFSTEEFVLL
420 430 440 450
>>XP_005271345 (OMIM: 611339) PREDICTED: cysteine protea (458 aa)
initn: 495 init1: 400 opt: 475 Z-score: 621.2 bits: 122.5 E(85289): 7.3e-28
Smith-Waterman score: 475; 49.6% identity (72.3% similar) in 141 aa overlap (1-141:323-458)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.::::..: :: :.::: :.:.::::::
XP_005 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF
300 310 320 330 340 350
40 50 60 70 80 90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
:::: :::::.::: ::.::.: :::::::.::: . ..:. :.:..:. :: :
XP_005 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKFSSK-E
360 370 380 390 400 410
100 110 120 130 140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
.::.::...::..:. : . . : . : :: :.:.::.:
XP_005 KYPLFTFVNGHSRDY--DFTSTTTNEEDLFSEDEKKQL--KRFSTEEFVLL
420 430 440 450
>>NP_116241 (OMIM: 611339) cysteine protease ATG4C [Homo (458 aa)
initn: 495 init1: 400 opt: 475 Z-score: 621.2 bits: 122.5 E(85289): 7.3e-28
Smith-Waterman score: 475; 49.6% identity (72.3% similar) in 141 aa overlap (1-141:323-458)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.::::..: :: :.::: :.:.::::::
NP_116 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF
300 310 320 330 340 350
40 50 60 70 80 90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
:::: :::::.::: ::.::.: :::::::.::: . ..:. :.:..:. :: :
NP_116 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKFSSK-E
360 370 380 390 400 410
100 110 120 130 140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
.::.::...::..:. : . . : . : :: :.:.::.:
NP_116 KYPLFTFVNGHSRDY--DFTSTTTNEEDLFSEDEKKQL--KRFSTEEFVLL
420 430 440 450
>>XP_011540615 (OMIM: 611339) PREDICTED: cysteine protea (423 aa)
initn: 368 init1: 368 opt: 387 Z-score: 506.9 bits: 101.2 E(85289): 1.7e-21
Smith-Waterman score: 387; 60.5% identity (80.2% similar) in 81 aa overlap (1-81:323-403)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.::::..: :: :.::: :.:.::::::
XP_011 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF
300 310 320 330 340 350
40 50 60 70 80 90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
:::: :::::.::: ::.::.: :::::::.::: . ..:. :.:.
XP_011 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKNQIMIKAAI
360 370 380 390 400 410
100 110 120 130 140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
XP_011 INIHVQDFMWT
420
>>XP_005247053 (OMIM: 611338) PREDICTED: cysteine protea (319 aa)
initn: 239 init1: 129 opt: 273 Z-score: 360.0 bits: 73.6 E(85289): 2.6e-13
Smith-Waterman score: 273; 37.1% identity (60.8% similar) in 143 aa overlap (1-141:182-317)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.:::: . ::::: . :.::::: ::.
XP_005 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA
160 170 180 190 200 210
40 50 60 70 80
pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA
:. ... : : ::::: : .:..:..::: .:::. . .:. :... .. ..:
XP_005 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA
220 230 240 250 260 270
90 100 110 120 130 140
pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
::: :.: : : : .. . .: . :: : .::: .:
XP_005 ---LPMFELVE--LQPSHL--ACPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
280 290 300 310
>>XP_016859129 (OMIM: 611338) PREDICTED: cysteine protea (319 aa)
initn: 239 init1: 129 opt: 273 Z-score: 360.0 bits: 73.6 E(85289): 2.6e-13
Smith-Waterman score: 273; 37.1% identity (60.8% similar) in 143 aa overlap (1-141:182-317)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.:::: . ::::: . :.::::: ::.
XP_016 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA
160 170 180 190 200 210
40 50 60 70 80
pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA
:. ... : : ::::: : .:..:..::: .:::. . .:. :... .. ..:
XP_016 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA
220 230 240 250 260 270
90 100 110 120 130 140
pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
::: :.: : : : .. . .: . :: : .::: .:
XP_016 ---LPMFELVE--LQPSHL--ACPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
280 290 300 310
>>NP_037457 (OMIM: 611338) cysteine protease ATG4B isofo (393 aa)
initn: 224 init1: 129 opt: 273 Z-score: 358.6 bits: 73.7 E(85289): 3.1e-13
Smith-Waterman score: 273; 37.1% identity (60.8% similar) in 143 aa overlap (1-141:256-391)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.:::: . ::::: . :.::::: ::.
NP_037 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA
230 240 250 260 270 280
40 50 60 70 80
pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA
:. ... : : ::::: : .:..:..::: .:::. . .:. :... .. ..:
NP_037 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA
290 300 310 320 330 340
90 100 110 120 130 140
pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
::: :.: : : : .. . .: . :: : .::: .:
NP_037 ---LPMFELVE--LQPSHL--ACPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
350 360 370 380 390
141 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 10:01:38 2016 done: Tue Nov 8 10:01:39 2016
Total Scan time: 4.470 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]