FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE4415, 429 aa
1>>>pF1KE4415 429 - 429 aa - 429 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.5098+/-0.000967; mu= 15.9448+/- 0.057
mean_var=64.2801+/-13.406, 0's: 0 Z-trim(104.0): 31 B-trim: 0 in 0/48
Lambda= 0.159969
statistics sampled from 7670 (7695) to 7670 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.603), E-opt: 0.2 (0.236), width: 16
Scan time: 2.810
The best scores are:                                   opt bits E(32554)
CCDS44757.1 SLC37A2 gene_id:219855|Hs108|chr11 ( 501)  247 66.0 9.2e-11
CCDS31714.1 SLC37A2 gene_id:219855|Hs108|chr11 ( 505)  247 66.0 9.2e-11
>>CCDS44757.1 SLC37A2 gene_id:219855|Hs108|chr11 (501 aa)
initn: 263 init1: 117 opt: 247 Z-score: 306.7 bits: 66.0 E(32554): 9.2e-11
Smith-Waterman score: 301; 23.6% identity (54.7% similar) in 424 aa overlap (43-421:77-485)
20 30 40 50 60
pF1KE4 IFSAMFGGYSLYYFNRKTFSFVMPSLVEEIPLDKDD----LGFITSSQSAAYAISKFVSG
:.:::. :: . .. ::::. :.::
CCDS44 KSRLHQNCSEQIKPINDTHSLNDTMWCSWAPFDKDNYKELLGGVDNAFLIAYAIGMFISG
50 60 70 80 90 100
70 80 90 100 110 120
pF1KE4 VLSDQMSARWLFSSGLLLVGLVNIFFA----WS-STVPVFAALWFLNGLAQGLGWPPCGK
:..... :. .:.:.:: :: . .:. :. . :... :::.: :::
CCDS44 VFGERLPLRYYLSAGMLLSGLFTSLFGLGYFWNIHELWYFVVIQVCNGLVQTTGWPSVVT
110 120 130 140 150 160
130 140 150 160 170 180
pF1KE4 VLRKWFEPSQFGTWWAILSTSMNLAGGLGPILATILAQSYSWRSTLALSGALCVVVSFLC
. .:: .. : .: .. .... :: ..: : ... .: .. . : . .:.. .
CCDS44 CVGNWFGKGKRGFIMGIWNSHTSVGNILGSLIAGIWVNG-QWGLSFIVPGIITAVMGVIT
170 180 190 200 210 220
190 200 210
pF1KE4 LLLI--------------HNEPADVGLRNLDPMPS-------------EGKKGSLKEEST
.:.. :.:::. :: : . .:: .: ..
CCDS44 FLFLIEHPEDVDCAPPQHHGEPAENQDNPEDPGNSPCSIRESGLETVAKCSKGPCEEPAA
230 240 250 260 270 280
220 230 240 250 260 270
pF1KE4 LQEL--LLSPYLWVLSTGYLVVFGVKTCCTDWGQFFLIQEKGQSALVGSSYMSALEVGGL
.. . : : . .: : . :. : ... . :: ... . ..:::.
CCDS44 ISFFGALRIPGVVEFSLCLLFAKLVSYTFLYWLPLYIANVAHFSAKEAGDLSTLFDVGGI
290 300 310 320 330 340
280 290 300 310 320 330
pF1KE4 VGSIAAGYLSDRAMAKAGLSNYGNPRHGLLLFMMAGMTVSMYLFRVTVTSDSPKLWILVL
.:.:.:: .:: . ..: ..:.. : : :.:. . .:. :..:
CCDS44 IGGIVAGLVSDYTNGRATTCC-------VMLILAAPM---MFLYNY-IGQDGIASSIVML
350 360 370 380 390
340 350 360 370 380
pF1KE4 GAVFGFSSYGPIALF--GVIANESAPPNLCGTSHA---IVGLMANVGGFLAGL-PF-STI
. : :: ::. .: :. .. .: :...: ..... ..:.. :.: :. . .
CCDS44 -IICGGLVNGPYALITTAVSADLGTHKSLKGNAKALSTVTAIIDGTGSIGAALGPLLAGL
400 410 420 430 440 450
390 400 410 420
pF1KE4 AKHYSWSTAFWVAEVICAASTAAFFLLRNIRTKMGRVSKKAE
. .:...:.. .: : : ..: : . ..
CCDS44 ISPTGWNNVFYM--LISADVLACLLLCRLVYKEILAWKVSLSRGSGYKEI
460 470 480 490 500
>>CCDS31714.1 SLC37A2 gene_id:219855|Hs108|chr11 (505 aa)
initn: 263 init1: 117 opt: 247 Z-score: 306.7 bits: 66.0 E(32554): 9.2e-11
Smith-Waterman score: 301; 23.6% identity (54.7% similar) in 424 aa overlap (43-421:77-485)
20 30 40 50 60
pF1KE4 IFSAMFGGYSLYYFNRKTFSFVMPSLVEEIPLDKDD----LGFITSSQSAAYAISKFVSG
:.:::. :: . .. ::::. :.::
CCDS31 KSRLHQNCSEQIKPINDTHSLNDTMWCSWAPFDKDNYKELLGGVDNAFLIAYAIGMFISG
50 60 70 80 90 100
70 80 90 100 110 120
pF1KE4 VLSDQMSARWLFSSGLLLVGLVNIFFA----WS-STVPVFAALWFLNGLAQGLGWPPCGK
:..... :. .:.:.:: :: . .:. :. . :... :::.: :::
CCDS31 VFGERLPLRYYLSAGMLLSGLFTSLFGLGYFWNIHELWYFVVIQVCNGLVQTTGWPSVVT
110 120 130 140 150 160
130 140 150 160 170 180
pF1KE4 VLRKWFEPSQFGTWWAILSTSMNLAGGLGPILATILAQSYSWRSTLALSGALCVVVSFLC
. .:: .. : .: .. .... :: ..: : ... .: .. . : . .:.. .
CCDS31 CVGNWFGKGKRGFIMGIWNSHTSVGNILGSLIAGIWVNG-QWGLSFIVPGIITAVMGVIT
170 180 190 200 210 220
190 200 210
pF1KE4 LLLI--------------HNEPADVGLRNLDPMPS-------------EGKKGSLKEEST
.:.. :.:::. :: : . .:: .: ..
CCDS31 FLFLIEHPEDVDCAPPQHHGEPAENQDNPEDPGNSPCSIRESGLETVAKCSKGPCEEPAA
230 240 250 260 270 280
220 230 240 250 260 270
pF1KE4 LQEL--LLSPYLWVLSTGYLVVFGVKTCCTDWGQFFLIQEKGQSALVGSSYMSALEVGGL
.. . : : . .: : . :. : ... . :: ... . ..:::.
CCDS31 ISFFGALRIPGVVEFSLCLLFAKLVSYTFLYWLPLYIANVAHFSAKEAGDLSTLFDVGGI
290 300 310 320 330 340
280 290 300 310 320 330
pF1KE4 VGSIAAGYLSDRAMAKAGLSNYGNPRHGLLLFMMAGMTVSMYLFRVTVTSDSPKLWILVL
.:.:.:: .:: . ..: ..:.. : : :.:. . .:. :..:
CCDS31 IGGIVAGLVSDYTNGRATTCC-------VMLILAAPM---MFLYNY-IGQDGIASSIVML
350 360 370 380 390
340 350 360 370 380
pF1KE4 GAVFGFSSYGPIALF--GVIANESAPPNLCGTSHA---IVGLMANVGGFLAGL-PF-STI
. : :: ::. .: :. .. .: :...: ..... ..:.. :.: :. . .
CCDS31 -IICGGLVNGPYALITTAVSADLGTHKSLKGNAKALSTVTAIIDGTGSIGAALGPLLAGL
400 410 420 430 440 450
390 400 410 420
pF1KE4 AKHYSWSTAFWVAEVICAASTAAFFLLRNIRTKMGRVSKKAE
. .:...:.. .: : : ..: : . ..
CCDS31 ISPTGWNNVFYM--LISADVLACLLLCRLVYKEILAWKVSLSRGSGSSMVLTHQ
460 470 480 490 500
429 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 01:15:38 2016 done: Sun Nov 6 01:15:38 2016
Total Scan time: 2.810 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]