FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4415, 429 aa 1>>>pF1KE4415 429 - 429 aa - 429 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5098+/-0.000967; mu= 15.9448+/- 0.057 mean_var=64.2801+/-13.406, 0's: 0 Z-trim(104.0): 31 B-trim: 0 in 0/48 Lambda= 0.159969 statistics sampled from 7670 (7695) to 7670 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.603), E-opt: 0.2 (0.236), width: 16 Scan time: 2.810 The best scores are: opt bits E(32554) CCDS44757.1 SLC37A2 gene_id:219855|Hs108|chr11 ( 501) 247 66.0 9.2e-11 CCDS31714.1 SLC37A2 gene_id:219855|Hs108|chr11 ( 505) 247 66.0 9.2e-11 >>CCDS44757.1 SLC37A2 gene_id:219855|Hs108|chr11 (501 aa) initn: 263 init1: 117 opt: 247 Z-score: 306.7 bits: 66.0 E(32554): 9.2e-11 Smith-Waterman score: 301; 23.6% identity (54.7% similar) in 424 aa overlap (43-421:77-485) 20 30 40 50 60 pF1KE4 IFSAMFGGYSLYYFNRKTFSFVMPSLVEEIPLDKDD----LGFITSSQSAAYAISKFVSG :.:::. :: . .. ::::. :.:: CCDS44 KSRLHQNCSEQIKPINDTHSLNDTMWCSWAPFDKDNYKELLGGVDNAFLIAYAIGMFISG 50 60 70 80 90 100 70 80 90 100 110 120 pF1KE4 VLSDQMSARWLFSSGLLLVGLVNIFFA----WS-STVPVFAALWFLNGLAQGLGWPPCGK :..... :. .:.:.:: :: . .:. :. . :... :::.: ::: CCDS44 VFGERLPLRYYLSAGMLLSGLFTSLFGLGYFWNIHELWYFVVIQVCNGLVQTTGWPSVVT 110 120 130 140 150 160 130 140 150 160 170 180 pF1KE4 VLRKWFEPSQFGTWWAILSTSMNLAGGLGPILATILAQSYSWRSTLALSGALCVVVSFLC . .:: .. : .: .. .... :: ..: : ... .: .. . : . .:.. . CCDS44 CVGNWFGKGKRGFIMGIWNSHTSVGNILGSLIAGIWVNG-QWGLSFIVPGIITAVMGVIT 170 180 190 200 210 220 190 200 210 pF1KE4 LLLI--------------HNEPADVGLRNLDPMPS-------------EGKKGSLKEEST .:.. :.:::. :: : . .:: .: .. CCDS44 FLFLIEHPEDVDCAPPQHHGEPAENQDNPEDPGNSPCSIRESGLETVAKCSKGPCEEPAA 230 240 250 260 270 280 220 230 240 250 260 270 pF1KE4 LQEL--LLSPYLWVLSTGYLVVFGVKTCCTDWGQFFLIQEKGQSALVGSSYMSALEVGGL .. . : : . .: : . :. : ... . :: ... . ..:::. CCDS44 ISFFGALRIPGVVEFSLCLLFAKLVSYTFLYWLPLYIANVAHFSAKEAGDLSTLFDVGGI 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE4 VGSIAAGYLSDRAMAKAGLSNYGNPRHGLLLFMMAGMTVSMYLFRVTVTSDSPKLWILVL .:.:.:: .:: . ..: ..:.. : : :.:. . .:. :..: CCDS44 IGGIVAGLVSDYTNGRATTCC-------VMLILAAPM---MFLYNY-IGQDGIASSIVML 350 360 370 380 390 340 350 360 370 380 pF1KE4 GAVFGFSSYGPIALF--GVIANESAPPNLCGTSHA---IVGLMANVGGFLAGL-PF-STI . : :: ::. .: :. .. .: :...: ..... ..:.. :.: :. . . CCDS44 -IICGGLVNGPYALITTAVSADLGTHKSLKGNAKALSTVTAIIDGTGSIGAALGPLLAGL 400 410 420 430 440 450 390 400 410 420 pF1KE4 AKHYSWSTAFWVAEVICAASTAAFFLLRNIRTKMGRVSKKAE . .:...:.. .: : : ..: : . .. CCDS44 ISPTGWNNVFYM--LISADVLACLLLCRLVYKEILAWKVSLSRGSGYKEI 460 470 480 490 500 >>CCDS31714.1 SLC37A2 gene_id:219855|Hs108|chr11 (505 aa) initn: 263 init1: 117 opt: 247 Z-score: 306.7 bits: 66.0 E(32554): 9.2e-11 Smith-Waterman score: 301; 23.6% identity (54.7% similar) in 424 aa overlap (43-421:77-485) 20 30 40 50 60 pF1KE4 IFSAMFGGYSLYYFNRKTFSFVMPSLVEEIPLDKDD----LGFITSSQSAAYAISKFVSG :.:::. :: . .. ::::. :.:: CCDS31 KSRLHQNCSEQIKPINDTHSLNDTMWCSWAPFDKDNYKELLGGVDNAFLIAYAIGMFISG 50 60 70 80 90 100 70 80 90 100 110 120 pF1KE4 VLSDQMSARWLFSSGLLLVGLVNIFFA----WS-STVPVFAALWFLNGLAQGLGWPPCGK :..... :. .:.:.:: :: . .:. :. . :... :::.: ::: CCDS31 VFGERLPLRYYLSAGMLLSGLFTSLFGLGYFWNIHELWYFVVIQVCNGLVQTTGWPSVVT 110 120 130 140 150 160 130 140 150 160 170 180 pF1KE4 VLRKWFEPSQFGTWWAILSTSMNLAGGLGPILATILAQSYSWRSTLALSGALCVVVSFLC . .:: .. : .: .. .... :: ..: : ... .: .. . : . .:.. . CCDS31 CVGNWFGKGKRGFIMGIWNSHTSVGNILGSLIAGIWVNG-QWGLSFIVPGIITAVMGVIT 170 180 190 200 210 220 190 200 210 pF1KE4 LLLI--------------HNEPADVGLRNLDPMPS-------------EGKKGSLKEEST .:.. :.:::. :: : . .:: .: .. CCDS31 FLFLIEHPEDVDCAPPQHHGEPAENQDNPEDPGNSPCSIRESGLETVAKCSKGPCEEPAA 230 240 250 260 270 280 220 230 240 250 260 270 pF1KE4 LQEL--LLSPYLWVLSTGYLVVFGVKTCCTDWGQFFLIQEKGQSALVGSSYMSALEVGGL .. . : : . .: : . :. : ... . :: ... . ..:::. CCDS31 ISFFGALRIPGVVEFSLCLLFAKLVSYTFLYWLPLYIANVAHFSAKEAGDLSTLFDVGGI 290 300 310 320 330 340 280 290 300 310 320 330 pF1KE4 VGSIAAGYLSDRAMAKAGLSNYGNPRHGLLLFMMAGMTVSMYLFRVTVTSDSPKLWILVL .:.:.:: .:: . ..: ..:.. : : :.:. . .:. :..: CCDS31 IGGIVAGLVSDYTNGRATTCC-------VMLILAAPM---MFLYNY-IGQDGIASSIVML 350 360 370 380 390 340 350 360 370 380 pF1KE4 GAVFGFSSYGPIALF--GVIANESAPPNLCGTSHA---IVGLMANVGGFLAGL-PF-STI . : :: ::. .: :. .. .: :...: ..... ..:.. :.: :. . . CCDS31 -IICGGLVNGPYALITTAVSADLGTHKSLKGNAKALSTVTAIIDGTGSIGAALGPLLAGL 400 410 420 430 440 450 390 400 410 420 pF1KE4 AKHYSWSTAFWVAEVICAASTAAFFLLRNIRTKMGRVSKKAE . .:...:.. .: : : ..: : . .. CCDS31 ISPTGWNNVFYM--LISADVLACLLLCRLVYKEILAWKVSLSRGSGSSMVLTHQ 460 470 480 490 500 429 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 01:15:38 2016 done: Sun Nov 6 01:15:38 2016 Total Scan time: 2.810 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]