FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE6168, 141 aa
1>>>pF1KE6168 141 - 141 aa - 141 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1778+/-0.000612; mu= 11.6694+/- 0.037
mean_var=56.4089+/-11.609, 0's: 0 Z-trim(110.7): 9 B-trim: 481 in 1/50
Lambda= 0.170766
statistics sampled from 11766 (11772) to 11766 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.758), E-opt: 0.2 (0.362), width: 16
Scan time: 1.210
The best scores are: opt bits E(32554)
CCDS12241.1 ATG4D gene_id:84971|Hs108|chr19 ( 474) 974 247.6 6.6e-66
CCDS623.1 ATG4C gene_id:84938|Hs108|chr1 ( 458) 475 124.6 6.5e-29
CCDS46564.1 ATG4B gene_id:23192|Hs108|chr2 ( 393) 273 74.8 5.4e-14
CCDS46565.1 ATG4B gene_id:23192|Hs108|chr2 ( 380) 264 72.6 2.4e-13
CCDS14538.1 ATG4A gene_id:115201|Hs108|chrX ( 398) 241 66.9 1.3e-11
>>CCDS12241.1 ATG4D gene_id:84971|Hs108|chr19 (474 aa)
initn: 974 init1: 974 opt: 974 Z-score: 1296.8 bits: 247.6 E(32554): 6.6e-66
Smith-Waterman score: 974; 100.0% identity (100.0% similar) in 141 aa overlap (1-141:334-474)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
::::::::::::::::::::::::::::::
CCDS12 VPVRLGGETLNPVYVPCVKELLRCELCLGIMGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
310 320 330 340 350 360
40 50 60 70 80 90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
370 380 390 400 410 420
100 110 120 130 140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
:::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS12 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
430 440 450 460 470
>>CCDS623.1 ATG4C gene_id:84938|Hs108|chr1 (458 aa)
initn: 495 init1: 400 opt: 475 Z-score: 632.6 bits: 124.6 E(32554): 6.5e-29
Smith-Waterman score: 475; 49.6% identity (72.3% similar) in 141 aa overlap (1-141:323-458)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.::::..: :: :.::: :.:.::::::
CCDS62 VPVRLGGERTNTDYLEFVKGILSLEYCVGIIGGKPKQSYYFAGFQDDSLIYMDPHYCQSF
300 310 320 330 340 350
40 50 60 70 80 90
pF1KE6 VDVSQADFPLESFHCTSPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSATE
:::: :::::.::: ::.::.: :::::::.::: . ..:. :.:..:. :: :
CCDS62 VDVSIKDFPLETFHCPSPKKMSFRKMDPSCTIGFYCRNVQDFKRASEEITKMLKFSSK-E
360 370 380 390 400 410
100 110 120 130 140
pF1KE6 RYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
.::.::...::..:. : . . : . : :: :.:.::.:
CCDS62 KYPLFTFVNGHSRDY--DFTSTTTNEEDLFSEDEKKQL--KRFSTEEFVLL
420 430 440 450
>>CCDS46564.1 ATG4B gene_id:23192|Hs108|chr2 (393 aa)
initn: 224 init1: 129 opt: 273 Z-score: 364.7 bits: 74.8 E(32554): 5.4e-14
Smith-Waterman score: 273; 37.1% identity (60.8% similar) in 143 aa overlap (1-141:256-391)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.:::: . ::::: . :.::::: ::.
CCDS46 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA
230 240 250 260 270 280
40 50 60 70 80
pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA
:. ... : : ::::: : .:..:..::: .:::. . .:. :... .. ..:
CCDS46 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA
290 300 310 320 330 340
90 100 110 120 130 140
pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
::: :.: : : : .. . .: . :: : .::: .:
CCDS46 ---LPMFELVE--LQPSHL--ACPDVLNLSLDSSDVERLERFFDSEDEDFEILSL
350 360 370 380 390
>>CCDS46565.1 ATG4B gene_id:23192|Hs108|chr2 (380 aa)
initn: 224 init1: 129 opt: 264 Z-score: 352.9 bits: 72.6 E(32554): 2.4e-13
Smith-Waterman score: 264; 41.6% identity (68.3% similar) in 101 aa overlap (1-99:256-353)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.:::: . ::::: . :.::::: ::.
CCDS46 IPLRLGLTDINEAYVETLKHCFMMPQSLGVIGGKPNSAHYFIGYVGEELIYLDPHTTQPA
230 240 250 260 270 280
40 50 60 70 80
pF1KE6 VDVSQADF-PLESFHCTSPR-KMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA
:. ... : : ::::: : .:..:..::: .:::. . .:. :... .. ..:
CCDS46 VEPTDGCFIPDESFHCQHPPCRMSIAELDPSIAVGFFCKTEDDFNDWCQQVKKLSLLGGA
290 300 310 320 330 340
90 100 110 120 130 140
pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
::: :.:
CCDS46 L---PMFELVELQPSHLACPDVLNLSLGESCQVQILLM
350 360 370 380
>>CCDS14538.1 ATG4A gene_id:115201|Hs108|chrX (398 aa)
initn: 235 init1: 126 opt: 241 Z-score: 322.0 bits: 66.9 E(32554): 1.3e-11
Smith-Waterman score: 241; 35.9% identity (64.1% similar) in 103 aa overlap (1-101:257-355)
10 20 30
pF1KE6 MGGKPRHSLYFIGYQDDFLLYLDPHYCQPT
.:::: .. ::::. : :..:::: :
CCDS14 VPLRLGINQINPVYVDAFKECFKMPQSLGALGGKPNNAYYFIGFLGDELIFLDPHTTQTF
230 240 250 260 270 280
40 50 60 70 80
pF1KE6 VDVSQ-ADFPLESFHCT-SPRKMAFAKMDPSCTVGFYAGDRKEFETLCSELTRVLSSSSA
::. . . ..::: ::..: . ..::: ..::. ..:.:.. :: . . .
CCDS14 VDTEENGTVNDQTFHCLQSPQRMNILNLDPSVALGFFCKEEKDFDNWCSLVQKEI----L
290 300 310 320 330 340
90 100 110 120 130 140
pF1KE6 TERYPMFTLAEGHAQDHSLDDLCSQLAQPTLRLPRTGRLLRAKRPSSEDFVFL
: :: :.. :
CCDS14 KENLRMFELVQKHPSHWPPFVPPAKPEVTTTGAEFIDSTEQLEEFDLEEDFEILSV
350 360 370 380 390
141 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 10:01:38 2016 done: Tue Nov 8 10:01:38 2016
Total Scan time: 1.210 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]