FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6373, 337 aa 1>>>pF1KE6373 337 - 337 aa - 337 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8021+/-0.00097; mu= 18.7165+/- 0.058 mean_var=63.6828+/-13.371, 0's: 0 Z-trim(104.7): 24 B-trim: 526 in 1/48 Lambda= 0.160718 statistics sampled from 7993 (8014) to 7993 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.6), E-opt: 0.2 (0.246), width: 16 Scan time: 1.960 The best scores are: opt bits E(32554) CCDS6717.1 SLC35D2 gene_id:11046|Hs108|chr9 ( 337) 2155 508.5 3.1e-144 CCDS636.1 SLC35D1 gene_id:23169|Hs108|chr1 ( 355) 1236 295.4 4.5e-80 CCDS69625.1 SLC35D2 gene_id:11046|Hs108|chr9 ( 249) 1029 247.3 9.7e-66 CCDS34544.1 SLC35D3 gene_id:340146|Hs108|chr6 ( 416) 286 75.2 1e-13 >>CCDS6717.1 SLC35D2 gene_id:11046|Hs108|chr9 (337 aa) initn: 2155 init1: 2155 opt: 2155 Z-score: 2702.9 bits: 508.5 E(32554): 3.1e-144 Smith-Waterman score: 2155; 100.0% identity (100.0% similar) in 337 aa overlap (1-337:1-337) 10 20 30 40 50 60 pF1KE6 MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPSPIFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPSPIFL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GIGQMAATIMILYVSKLNKIIHFPDFDKKIPVKLFPLPLLYVGNHISGLSSTSKLSLPMF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 GIGQMAATIMILYVSKLNKIIHFPDFDKKIPVKLFPLPLLYVGNHISGLSSTSKLSLPMF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 TVLRKFTIPLTLLLETIILGKQYSLNIILSVFAIILGAFIAAGSDLAFNLEGYIFVFLND :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 TVLRKFTIPLTLLLETIILGKQYSLNIILSVFAIILGAFIAAGSDLAFNLEGYIFVFLND 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 IFTAANGVYTKQKMDPKELGKYGVLFYNACFMIIPTLIISVSTGDLQQATEFNQWKNVVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 IFTAANGVYTKQKMDPKELGKYGVLFYNACFMIIPTLIISVSTGDLQQATEFNQWKNVVF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 ILQFLLSCFLGFLLMYSTVLCSYYNSALTTAVVGAIKNVSVAYIGILIGGDYIFSLLNFV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS67 ILQFLLSCFLGFLLMYSTVLCSYYNSALTTAVVGAIKNVSVAYIGILIGGDYIFSLLNFV 250 260 270 280 290 300 310 320 330 pF1KE6 GLNICMAGGLRYSFLTLSSQLKPKPVGEENICLDLKS ::::::::::::::::::::::::::::::::::::: CCDS67 GLNICMAGGLRYSFLTLSSQLKPKPVGEENICLDLKS 310 320 330 >>CCDS636.1 SLC35D1 gene_id:23169|Hs108|chr1 (355 aa) initn: 1219 init1: 1219 opt: 1236 Z-score: 1551.0 bits: 295.4 E(32554): 4.5e-80 Smith-Waterman score: 1236; 57.7% identity (85.2% similar) in 310 aa overlap (27-336:42-350) 10 20 30 40 50 pF1KE6 MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPS .::.: :::. :::::.:::..::.: ::: CCDS63 VKGEAPAKSSTLRDEEELGMASAETLTVFLKLLAAGFYGVSSFLIVVVNKSVLTNYRFPS 20 30 40 50 60 70 60 70 80 90 100 110 pF1KE6 PIFLGIGQMAATIMILYVSKLNKIIHFPDFDKKIPVKLFPLPLLYVGNHISGLSSTSKLS . .:.:::.::. .:.:.: ....:::.:...: : ::::::: ::.:.:: ::.::. CCDS63 SLCVGLGQMVATVAVLWVGKALRVVKFPDLDRNVPRKTFPLPLLYFGNQITGLFSTKKLN 80 90 100 110 120 130 120 130 140 150 160 170 pF1KE6 LPMFTVLRKFTIPLTLLLETIILGKQYSLNIILSVFAIILGAFIAAGSDLAFNLEGYIFV ::::::::.:.: .:.. : ..: : .: .: ..:::.:.:::.::.:::::.:::: :. CCDS63 LPMFTVLRRFSILFTMFAEGVLLKKTFSWGIKMTVFAMIIGAFVAASSDLAFDLEGYAFI 140 150 160 170 180 190 180 190 200 210 220 230 pF1KE6 FLNDIFTAANGVYTKQKMDPKELGKYGVLFYNACFMIIPTLIISVSTGDLQQATEFNQWK ..::..:::::.:.:::.: :::::::.:.::: :::.::: :. ::: :.:.::. : CCDS63 LINDVLTAANGAYVKQKLDSKELGKYGLLYYNALFMILPTLAIAYFTGDAQKAVEFEGWA 200 210 220 230 240 250 240 250 260 270 280 290 pF1KE6 NVVFILQFLLSCFLGFLLMYSTVLCSYYNSALTTAVVGAIKNVSVAYIGILIGGDYIFSL ...:.::: ::: .::.:::.::::. :::::::..:: :::. ..:::...::::::. CCDS63 DTLFLLQFTLSCVMGFILMYATVLCTQYNSALTTTIVGCIKNILITYIGMVFGGDYIFTW 260 270 280 290 300 310 300 310 320 330 pF1KE6 LNFVGLNICMAGGLRYSFLTLSSQLKPKPVGEENICLDLKS ::.:::: .::.: ::..:.. . : .: : ::.: CCDS63 TNFIGLNISIAGSLVYSYITFTEEQLSKQ-SEANNKLDIKGKGAV 320 330 340 350 >>CCDS69625.1 SLC35D2 gene_id:11046|Hs108|chr9 (249 aa) initn: 1043 init1: 1025 opt: 1029 Z-score: 1293.7 bits: 247.3 E(32554): 9.7e-66 Smith-Waterman score: 1387; 73.9% identity (73.9% similar) in 337 aa overlap (1-337:1-249) 10 20 30 40 50 60 pF1KE6 MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPSPIFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPSPIFL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 GIGQMAATIMILYVSKLNKIIHFPDFDKKIPVKLFPLPLLYVGNHISGLSSTSKLSLPMF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 GIGQMAATIMILYVSKLNKIIHFPDFDKKIPVKLFPLPLLYVGNHISGLSSTSKLSLPMF 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 TVLRKFTIPLTLLLETIILGKQYSLNIILSVFAIILGAFIAAGSDLAFNLEGYIFVFLND ::::::::::::::::::::::::::::::::::::::::::: CCDS69 TVLRKFTIPLTLLLETIILGKQYSLNIILSVFAIILGAFIAAG----------------- 130 140 150 160 190 200 210 220 230 240 pF1KE6 IFTAANGVYTKQKMDPKELGKYGVLFYNACFMIIPTLIISVSTGDLQQATEFNQWKNVVF CCDS69 ------------------------------------------------------------ 250 260 270 280 290 300 pF1KE6 ILQFLLSCFLGFLLMYSTVLCSYYNSALTTAVVGAIKNVSVAYIGILIGGDYIFSLLNFV ::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 -----------FLLMYSTVLCSYYNSALTTAVVGAIKNVSVAYIGILIGGDYIFSLLNFV 170 180 190 200 210 310 320 330 pF1KE6 GLNICMAGGLRYSFLTLSSQLKPKPVGEENICLDLKS ::::::::::::::::::::::::::::::::::::: CCDS69 GLNICMAGGLRYSFLTLSSQLKPKPVGEENICLDLKS 220 230 240 >>CCDS34544.1 SLC35D3 gene_id:340146|Hs108|chr6 (416 aa) initn: 230 init1: 180 opt: 286 Z-score: 359.6 bits: 75.2 E(32554): 1e-13 Smith-Waterman score: 286; 25.3% identity (54.7% similar) in 320 aa overlap (24-329:8-322) 10 20 30 40 50 60 pF1KE6 MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPSPIFL :: . :. .:. : . .. : :.. : : :: CCDS34 MRQLCRGRVLGISVAIAHGVFSGSLNILLKFLISRYQFS---FL 10 20 30 40 70 80 90 100 110 pF1KE6 GIGQ-MAATIMILYVSKLNKI--IHFPDFDKKIPVKLFPLPLLYVGNHISGLSSTSKLSL . : .... : . : .. : : : .. .. . .: . . : : ::: CCDS34 TLVQCLTSSTAALSLELLRRLGLIAVPPFGLSLARSFAGVAVLSTLQSSLTLWSLRGLSL 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE6 PMFTVLRKFTIPLTLLLETIILGKQY--SLNIILSVFAIILGAFIAAGSDLAFNLEGYIF ::..:... .::. .: ... :. : ... .:. :: .:...::. . ::. CCDS34 PMYVVFKR-CLPLVTMLIGVLVLKNGAPSPGVLAAVLITTCGAALAGAGDLTGDPIGYVT 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE6 VFLNDIFTAANGVYTKQKMDPKELGKYGVLFYNACFMIIPTLII-SVSTGDLQQATEFNQ : . :: : .. : : . . : : :.: : .. : .: : CCDS34 GVLAVLVHAAYLVLIQKASADTEHGPLTAQYVIAV-SATPLLVICSFASTDSIHAWTFPG 170 180 190 200 210 240 250 260 270 280 290 pF1KE6 WKNVVFILQFLLSCFLGFLLMYSTVLCSYYNSALTTAVVGAIKNVSVAYIGILIGGDYIF ::. ... :. ..: . ..:. :.: :::.::. ::..:.... .:.. .: CCDS34 WKDPAMVCIFVACILIGCAMNFTTLHCTYINSAVTTSFVGVVKSIATITVGMVAFSDVEP 220 230 240 250 260 270 300 310 320 330 pF1KE6 SLLNFVGLNICMAGGLRY---SFLTLSSQ-----LKPKPVGEENICLDLKS . : ..:. . :.. : .:. .: :. .: ::: CCDS34 TSLFIAGVVVNTLGSIIYCVAKFMETRKQSNYEDLEAQPRGEEAQLSGDQLPFVMEELPG 280 290 300 310 320 330 CCDS34 EGGNGRSEGGEAAGGPAQESRQEVRGSPRGVPLVAGSSEEGSRRSLKDAYLEVWRLVRGT 340 350 360 370 380 390 337 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:41:35 2016 done: Tue Nov 8 12:41:35 2016 Total Scan time: 1.960 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]