FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6363, 424 aa 1>>>pF1KE6363 424 - 424 aa - 424 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2949+/-0.00103; mu= 16.7876+/- 0.062 mean_var=56.8326+/-11.558, 0's: 0 Z-trim(102.0): 33 B-trim: 401 in 1/47 Lambda= 0.170128 statistics sampled from 6745 (6766) to 6745 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.564), E-opt: 0.2 (0.208), width: 16 Scan time: 2.340 The best scores are: opt bits E(32554) CCDS2967.1 SLC35A5 gene_id:55032|Hs108|chr3 ( 424) 2796 694.8 4e-200 CCDS5010.1 SLC35A1 gene_id:10559|Hs108|chr6 ( 337) 252 70.4 3e-12 >>CCDS2967.1 SLC35A5 gene_id:55032|Hs108|chr3 (424 aa) initn: 2796 init1: 2796 opt: 2796 Z-score: 3706.3 bits: 694.8 E(32554): 4e-200 Smith-Waterman score: 2796; 100.0% identity (100.0% similar) in 424 aa overlap (1-424:1-424) 10 20 30 40 50 60 pF1KE6 MEKQCCSHPVICSLSTMYTFLLGAIFIALSSSRILLVKYSANEENKYDYLPTTVNVCSEL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 MEKQCCSHPVICSLSTMYTFLLGAIFIALSSSRILLVKYSANEENKYDYLPTTVNVCSEL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 VKLVFCVLVSFCVIKKDHQSRNLKYASWKEFSDFMKWSIPAFLYFLDNLIVFYVLSYLQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 VKLVFCVLVSFCVIKKDHQSRNLKYASWKEFSDFMKWSIPAFLYFLDNLIVFYVLSYLQP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 AMAVIFSNFSIITTALLFRIVLKRRLNWIQWASLLTLFLSIVALTAGTKTLQHNLAGRGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 AMAVIFSNFSIITTALLFRIVLKRRLNWIQWASLLTLFLSIVALTAGTKTLQHNLAGRGF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 HHDAFFSPSNSCLLFRSECPRKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVLIIVQC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 HHDAFFSPSNSCLLFRSECPRKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVLIIVQC 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 FISSMANIYNEKILKEGNQLTESIFIQNSKLYFFGILFNGLTLGLQRSNRDQIKNCGFFY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 FISSMANIYNEKILKEGNQLTESIFIQNSKLYFFGILFNGLTLGLQRSNRDQIKNCGFFY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 GHSAFSVALIFVTAFQGLSVAFILKFLDNMFHVLMAQVTTVIITTVSVLVFDFRPSLEFF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 GHSAFSVALIFVTAFQGLSVAFILKFLDNMFHVLMAQVTTVIITTVSVLVFDFRPSLEFF 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 LEAPSVLLSIFIYNASKPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTKPKSDESD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS29 LEAPSVLLSIFIYNASKPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTKPKSDESD 370 380 390 400 410 420 pF1KE6 EDTF :::: CCDS29 EDTF >>CCDS5010.1 SLC35A1 gene_id:10559|Hs108|chr6 (337 aa) initn: 259 init1: 161 opt: 252 Z-score: 333.4 bits: 70.4 E(32554): 3e-12 Smith-Waterman score: 318; 25.8% identity (56.0% similar) in 368 aa overlap (13-373:7-313) 10 20 30 40 50 pF1KE6 MEKQCCSHPVICSLSTMYTFLLGAIFIALSSSRILLVKYSANEENKYDYLPTTVNVC-SE ... .. . :.. ... . ..:. . . : :. ::. :: .: CCDS50 MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSD-KELYFSTTA-VCITE 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 LVKLVFCVLVSFCVIKKDHQSRNLKYASWKEF-----SDFMKWSIPAFLYFLDNLIVFYV ..:: :.: .. :. : . :: .: ....: :.:...: ..: ..: . CCDS50 VIKL----LLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLA 60 70 80 90 100 120 130 140 150 160 170 pF1KE6 LSYLQPAMAVIFSNFSIITTALLFRIVLKRRLNWIQWASLLTLFLSIVALTAGTKTLQHN :: :. :. . ...: ::: ..:.: :. .:: .:. : ::. .: CCDS50 LSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQW-------VSVFMLCAGVTLVQ-- 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 LAGRGFHHDAFFSPSNSCLLFRSECPRKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHV : .: : . : .. ::.: . CCDS50 -----------------------------------WKPAQA---TKVVVEQNPLLGFGAI 160 170 180 240 250 260 270 280 290 pF1KE6 LIIVQCFISSMANIYNEKILKEGNQLTESIFIQNSKLYFFGILFNGLTL-GLQRSNRDQI : : : :..:..: ::.:: .. :....: ..:. ::. .:: :. :. .: CCDS50 AIAVLC--SGFAGVYFEKVLKSSDT---SLWVRNIQMYLSGII---VTLAGVYLSDGAEI 190 200 210 220 230 300 310 320 330 340 350 pF1KE6 KNCGFFYGHSAFSVALIFVTAFQGLSVAFILKFLDNMFHVLMAQVTTVIITTVSVLVFDF :. :::::.. . .::... :: .. ..:. ::... . : .. :. : .::..: . CCDS50 KEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGL 240 250 260 270 280 290 360 370 380 390 400 410 pF1KE6 RPSLEFFLEAPSVLLSIFIYNASKPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTK . .: : : . : .::..: CCDS50 QITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGV 300 310 320 330 424 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:29:03 2016 done: Tue Nov 8 12:29:03 2016 Total Scan time: 2.340 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]