FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4383, 404 aa 1>>>pF1KE4383 404 - 404 aa - 404 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5945+/-0.00105; mu= 14.9209+/- 0.063 mean_var=59.0168+/-11.624, 0's: 0 Z-trim(102.0): 28 B-trim: 22 in 2/47 Lambda= 0.166950 statistics sampled from 6767 (6774) to 6767 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.571), E-opt: 0.2 (0.208), width: 16 Scan time: 2.770 The best scores are: opt bits E(32554) CCDS75313.1 ETF1 gene_id:2107|Hs108|chr5 ( 404) 2623 640.5 8.2e-184 CCDS75314.1 ETF1 gene_id:2107|Hs108|chr5 ( 423) 2623 640.5 8.6e-184 CCDS4207.1 ETF1 gene_id:2107|Hs108|chr5 ( 437) 2623 640.5 8.8e-184 >>CCDS75313.1 ETF1 gene_id:2107|Hs108|chr5 (404 aa) initn: 2623 init1: 2623 opt: 2623 Z-score: 3413.5 bits: 640.5 E(32554): 8.2e-184 Smith-Waterman score: 2623; 100.0% identity (100.0% similar) in 404 aa overlap (1-404:1-404) 10 20 30 40 50 60 pF1KE4 MISLIIPPKDQISRVAKMLADEFGTASNIKSRVNRLSVLGAITSVQQRLKLYNKVPPNGL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MISLIIPPKDQISRVAKMLADEFGTASNIKSRVNRLSVLGAITSVQQRLKLYNKVPPNGL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 VVYCGTIVTEEGKEKKVNIDFEPFKPINTSLYLCDNKFHTEALTALLSDDSKFGFIVIDG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 VVYCGTIVTEEGKEKKVNIDFEPFKPINTSLYLCDNKFHTEALTALLSDDSKFGFIVIDG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 SGALFGTLQGNTREVLHKFTVDLPKKHGRGGQSALRFARLRMEKRHNYVRKVAETAVQLF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 SGALFGTLQGNTREVLHKFTVDLPKKHGRGGQSALRFARLRMEKRHNYVRKVAETAVQLF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 ISGDKVNVAGLVLAGSADFKTELSQSDMFDQRLQSKVLKLVDISYGGENGFNQAIELSTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 ISGDKVNVAGLVLAGSADFKTELSQSDMFDQRLQSKVLKLVDISYGGENGFNQAIELSTE 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 VLSNVKFIQEKKLIGRYFDEISQDTGKYCFGVEDTLKALEMGAVEILIVYENLDIMRYVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 VLSNVKFIQEKKLIGRYFDEISQDTGKYCFGVEDTLKALEMGAVEILIVYENLDIMRYVL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 HCQGTEEEKILYLTPEQEKDKSHFTDKETGQEHELIESMPLLEWFANNYKKFGATLEIVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 HCQGTEEEKILYLTPEQEKDKSHFTDKETGQEHELIESMPLLEWFANNYKKFGATLEIVT 310 320 330 340 350 360 370 380 390 400 pF1KE4 DKSQEGSQFVKGFGGIGGILRYRVDFQGMEYQGGDDEFFDLDDY :::::::::::::::::::::::::::::::::::::::::::: CCDS75 DKSQEGSQFVKGFGGIGGILRYRVDFQGMEYQGGDDEFFDLDDY 370 380 390 400 >>CCDS75314.1 ETF1 gene_id:2107|Hs108|chr5 (423 aa) initn: 2623 init1: 2623 opt: 2623 Z-score: 3413.1 bits: 640.5 E(32554): 8.6e-184 Smith-Waterman score: 2623; 100.0% identity (100.0% similar) in 404 aa overlap (1-404:20-423) 10 20 30 40 pF1KE4 MISLIIPPKDQISRVAKMLADEFGTASNIKSRVNRLSVLGA ::::::::::::::::::::::::::::::::::::::::: CCDS75 MKQDVLNCTEGPIHSNGTSMISLIIPPKDQISRVAKMLADEFGTASNIKSRVNRLSVLGA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 ITSVQQRLKLYNKVPPNGLVVYCGTIVTEEGKEKKVNIDFEPFKPINTSLYLCDNKFHTE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 ITSVQQRLKLYNKVPPNGLVVYCGTIVTEEGKEKKVNIDFEPFKPINTSLYLCDNKFHTE 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 ALTALLSDDSKFGFIVIDGSGALFGTLQGNTREVLHKFTVDLPKKHGRGGQSALRFARLR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 ALTALLSDDSKFGFIVIDGSGALFGTLQGNTREVLHKFTVDLPKKHGRGGQSALRFARLR 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE4 MEKRHNYVRKVAETAVQLFISGDKVNVAGLVLAGSADFKTELSQSDMFDQRLQSKVLKLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MEKRHNYVRKVAETAVQLFISGDKVNVAGLVLAGSADFKTELSQSDMFDQRLQSKVLKLV 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE4 DISYGGENGFNQAIELSTEVLSNVKFIQEKKLIGRYFDEISQDTGKYCFGVEDTLKALEM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 DISYGGENGFNQAIELSTEVLSNVKFIQEKKLIGRYFDEISQDTGKYCFGVEDTLKALEM 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE4 GAVEILIVYENLDIMRYVLHCQGTEEEKILYLTPEQEKDKSHFTDKETGQEHELIESMPL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 GAVEILIVYENLDIMRYVLHCQGTEEEKILYLTPEQEKDKSHFTDKETGQEHELIESMPL 310 320 330 340 350 360 350 360 370 380 390 400 pF1KE4 LEWFANNYKKFGATLEIVTDKSQEGSQFVKGFGGIGGILRYRVDFQGMEYQGGDDEFFDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 LEWFANNYKKFGATLEIVTDKSQEGSQFVKGFGGIGGILRYRVDFQGMEYQGGDDEFFDL 370 380 390 400 410 420 pF1KE4 DDY ::: CCDS75 DDY >>CCDS4207.1 ETF1 gene_id:2107|Hs108|chr5 (437 aa) initn: 2623 init1: 2623 opt: 2623 Z-score: 3412.9 bits: 640.5 E(32554): 8.8e-184 Smith-Waterman score: 2623; 100.0% identity (100.0% similar) in 404 aa overlap (1-404:34-437) 10 20 30 pF1KE4 MISLIIPPKDQISRVAKMLADEFGTASNIK :::::::::::::::::::::::::::::: CCDS42 DPSAADRNVEIWKIKKLIKSLEAARGNGTSMISLIIPPKDQISRVAKMLADEFGTASNIK 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE4 SRVNRLSVLGAITSVQQRLKLYNKVPPNGLVVYCGTIVTEEGKEKKVNIDFEPFKPINTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SRVNRLSVLGAITSVQQRLKLYNKVPPNGLVVYCGTIVTEEGKEKKVNIDFEPFKPINTS 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE4 LYLCDNKFHTEALTALLSDDSKFGFIVIDGSGALFGTLQGNTREVLHKFTVDLPKKHGRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LYLCDNKFHTEALTALLSDDSKFGFIVIDGSGALFGTLQGNTREVLHKFTVDLPKKHGRG 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE4 GQSALRFARLRMEKRHNYVRKVAETAVQLFISGDKVNVAGLVLAGSADFKTELSQSDMFD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GQSALRFARLRMEKRHNYVRKVAETAVQLFISGDKVNVAGLVLAGSADFKTELSQSDMFD 190 200 210 220 230 240 220 230 240 250 260 270 pF1KE4 QRLQSKVLKLVDISYGGENGFNQAIELSTEVLSNVKFIQEKKLIGRYFDEISQDTGKYCF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 QRLQSKVLKLVDISYGGENGFNQAIELSTEVLSNVKFIQEKKLIGRYFDEISQDTGKYCF 250 260 270 280 290 300 280 290 300 310 320 330 pF1KE4 GVEDTLKALEMGAVEILIVYENLDIMRYVLHCQGTEEEKILYLTPEQEKDKSHFTDKETG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 GVEDTLKALEMGAVEILIVYENLDIMRYVLHCQGTEEEKILYLTPEQEKDKSHFTDKETG 310 320 330 340 350 360 340 350 360 370 380 390 pF1KE4 QEHELIESMPLLEWFANNYKKFGATLEIVTDKSQEGSQFVKGFGGIGGILRYRVDFQGME :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 QEHELIESMPLLEWFANNYKKFGATLEIVTDKSQEGSQFVKGFGGIGGILRYRVDFQGME 370 380 390 400 410 420 400 pF1KE4 YQGGDDEFFDLDDY :::::::::::::: CCDS42 YQGGDDEFFDLDDY 430 404 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 06:49:33 2016 done: Thu Nov 3 06:49:34 2016 Total Scan time: 2.770 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]