FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6360, 324 aa 1>>>pF1KE6360 324 - 324 aa - 324 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0059+/-0.000572; mu= 14.0348+/- 0.035 mean_var=80.4167+/-16.432, 0's: 0 Z-trim(114.0): 12 B-trim: 0 in 0/54 Lambda= 0.143021 statistics sampled from 14613 (14625) to 14613 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.769), E-opt: 0.2 (0.449), width: 16 Scan time: 2.500 The best scores are: opt bits E(32554) CCDS4231.1 SLC35A4 gene_id:113829|Hs108|chr5 ( 324) 2127 447.5 6.7e-126 CCDS762.1 SLC35A3 gene_id:23443|Hs108|chr1 ( 325) 322 75.0 8.8e-14 CCDS60204.1 SLC35A3 gene_id:23443|Hs108|chr1 ( 367) 322 75.1 9.7e-14 CCDS65253.1 SLC35A2 gene_id:7355|Hs108|chrX ( 332) 294 69.3 4.9e-12 CCDS43937.1 SLC35A2 gene_id:7355|Hs108|chrX ( 393) 295 69.5 4.9e-12 CCDS14311.1 SLC35A2 gene_id:7355|Hs108|chrX ( 396) 295 69.5 4.9e-12 CCDS75973.1 SLC35A2 gene_id:7355|Hs108|chrX ( 406) 295 69.5 5.1e-12 CCDS65254.1 SLC35A2 gene_id:7355|Hs108|chrX ( 421) 295 69.5 5.2e-12 >>CCDS4231.1 SLC35A4 gene_id:113829|Hs108|chr5 (324 aa) initn: 2127 init1: 2127 opt: 2127 Z-score: 2373.8 bits: 447.5 E(32554): 6.7e-126 Smith-Waterman score: 2127; 100.0% identity (100.0% similar) in 324 aa overlap (1-324:1-324) 10 20 30 40 50 60 pF1KE6 MSVEDGGMPGLGRPRQARWTLMLLLSTAMYGAHAPLLALCHVDGRVPFRPSSAVLLTELT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MSVEDGGMPGLGRPRQARWTLMLLLSTAMYGAHAPLLALCHVDGRVPFRPSSAVLLTELT 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 KLLLCAFSLLVGWQAWPQGPPPWRQAAPFALSALLYGANNNLVIYLQRYMDPSTYQVLSN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 KLLLCAFSLLVGWQAWPQGPPPWRQAAPFALSALLYGANNNLVIYLQRYMDPSTYQVLSN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 LKIGSTAVLYCLCLRHRLSVRQGLALLLLMAAGACYAAGGLQVPGNTLPSPPPAAAASPM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LKIGSTAVLYCLCLRHRLSVRQGLALLLLMAAGACYAAGGLQVPGNTLPSPPPAAAASPM 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 PLHITPLGLLLLILYCLISGLSSVYTELLMKRQRLPLALQNLFLYTFGVLLNLGLHAGGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 PLHITPLGLLLLILYCLISGLSSVYTELLMKRQRLPLALQNLFLYTFGVLLNLGLHAGGG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 SGPGLLEGFSGWAALVVLSQALNGLLMSAVMKHGSSITRLFVVSCSLVVNAVLSAVLLRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SGPGLLEGFSGWAALVVLSQALNGLLMSAVMKHGSSITRLFVVSCSLVVNAVLSAVLLRL 250 260 270 280 290 300 310 320 pF1KE6 QLTAAFFLATLLIGLAMRLYYGSR :::::::::::::::::::::::: CCDS42 QLTAAFFLATLLIGLAMRLYYGSR 310 320 >>CCDS762.1 SLC35A3 gene_id:23443|Hs108|chr1 (325 aa) initn: 304 init1: 130 opt: 322 Z-score: 360.9 bits: 75.0 E(32554): 8.8e-14 Smith-Waterman score: 322; 30.0% identity (65.8% similar) in 237 aa overlap (89-320:87-312) 60 70 80 90 100 110 pF1KE6 LTKLLLCAFSLLVGWQAWPQGPPPWRQAAPFALSALLYGANNNLVIYLQRYMDPSTYQVL .:. . .: .:::. .: .:::: CCDS76 LVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIYTLQNNLLYVALSNLDAATYQVT 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 SNLKIGSTAVLYCLCLRHRLSVRQGLALLLLMAAGACYAAGGLQVPGNTLPSPPPAAAAS .::: .::.. : ..:.: : :.:..::. :. . .: :... . .:.: CCDS76 YQLKILTTALFSVSMLSKKLGVYQWLSLVILMT-GVAF----VQWPSDSQLDSKELSAGS 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 PMPLHITPLGLLLLILYCLISGLSSVYTELLMKRQRLPLALQNLFLYTFGVLLNL-GLHA . .::. .. :. ::...:: : ..:. . . ..:. : :: ...: :.. CCDS76 QF------VGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLGFFGSIFGLMGVYI 180 190 200 210 220 240 250 260 270 280 290 pF1KE6 GGG---SGPGLLEGFSGWAALVVLSQALNGLLMSAVMKHGSSITRLFVVSCSLVVNAVLS : : :...:.. . .::. :::.::...::.:....: . :..: :.......: CCDS76 YDGELVSKNGFFQGYNRLTWIVVVLQALGGLVIAAVIKYADNILKGFATSLSIILSTLIS 230 240 250 260 270 280 300 310 320 pF1KE6 AVLLR-LQLTAAFFLATLLIGLAMRLYYGSR :. . :..:::...:. : :: CCDS76 YFWLQDFVPTSVFFLGAILVITATFLYGYDPKPAGNPTKA 290 300 310 320 >>CCDS60204.1 SLC35A3 gene_id:23443|Hs108|chr1 (367 aa) initn: 304 init1: 130 opt: 322 Z-score: 360.1 bits: 75.1 E(32554): 9.7e-14 Smith-Waterman score: 322; 30.0% identity (65.8% similar) in 237 aa overlap (89-320:129-354) 60 70 80 90 100 110 pF1KE6 LTKLLLCAFSLLVGWQAWPQGPPPWRQAAPFALSALLYGANNNLVIYLQRYMDPSTYQVL .:. . .: .:::. .: .:::: CCDS60 LVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIYTLQNNLLYVALSNLDAATYQVT 100 110 120 130 140 150 120 130 140 150 160 170 pF1KE6 SNLKIGSTAVLYCLCLRHRLSVRQGLALLLLMAAGACYAAGGLQVPGNTLPSPPPAAAAS .::: .::.. : ..:.: : :.:..::. :. . .: :... . .:.: CCDS60 YQLKILTTALFSVSMLSKKLGVYQWLSLVILMT-GVAF----VQWPSDSQLDSKELSAGS 160 170 180 190 200 210 180 190 200 210 220 230 pF1KE6 PMPLHITPLGLLLLILYCLISGLSSVYTELLMKRQRLPLALQNLFLYTFGVLLNL-GLHA . .::. .. :. ::...:: : ..:. . . ..:. : :: ...: :.. CCDS60 QF------VGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLGFFGSIFGLMGVYI 220 230 240 250 260 240 250 260 270 280 290 pF1KE6 GGG---SGPGLLEGFSGWAALVVLSQALNGLLMSAVMKHGSSITRLFVVSCSLVVNAVLS : : :...:.. . .::. :::.::...::.:....: . :..: :.......: CCDS60 YDGELVSKNGFFQGYNRLTWIVVVLQALGGLVIAAVIKYADNILKGFATSLSIILSTLIS 270 280 290 300 310 320 300 310 320 pF1KE6 AVLLR-LQLTAAFFLATLLIGLAMRLYYGSR :. . :..:::...:. : :: CCDS60 YFWLQDFVPTSVFFLGAILVITATFLYGYDPKPAGNPTKA 330 340 350 360 >>CCDS65253.1 SLC35A2 gene_id:7355|Hs108|chrX (332 aa) initn: 290 init1: 123 opt: 294 Z-score: 329.6 bits: 69.3 E(32554): 4.9e-12 Smith-Waterman score: 294; 33.2% identity (61.8% similar) in 238 aa overlap (89-320:54-277) 60 70 80 90 100 110 pF1KE6 LTKLLLCAFSLLVGWQAWPQGPPPWRQAAPFALSALLYGANNNLVIYLQRYMDPSTYQVL .:. .:.: .::: . .:.:: CCDS65 EPGTASAGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVT 30 40 50 60 70 80 120 130 140 150 160 170 pF1KE6 SNLKIGSTAVLYCLCLRHRLSVRQGLALLLLMAAGACYAAGGLQVPGNTLPSPPPAAAAS .::: .::.. : : . :: : .::::.. : .. . :.... CCDS65 YQLKILTTALFSVLMLNRSLSRLQWASLLLLFT-------------GVAIVQAQQAGGGG 90 100 110 120 130 180 190 200 210 220 230 pF1KE6 PMPLHITP-LGLLLLILYCLISGLSSVYTELLMKRQRLPLALQNLFLYTFGVLLNL-GLH : :: .: :: .. :: ::...:: : ..: . . :.:: : ::. :.: :: CCDS65 PRPLDQNPGAGLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLFGTALGLVGLW 140 150 160 170 180 190 240 250 260 270 280 290 pF1KE6 AGGGSGP---GLLEGFSGWAALVVLSQALNGLLMSAVMKHGSSITRLFVVSCSLVVNAVL . :.. :.. :.. . :::.::..:::...:.:....: . :..: :.:...: CCDS65 WAEGTAVATRGFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFATSLSIVLSTVA 200 210 220 230 240 250 300 310 320 pF1KE6 SAVLLRLQLTAAFFL-ATLLIGLAMRLYYGSR : :. ... : : : :.:: :. :: CCDS65 SIRLFGFHVDPLFALGAGLVIG-AVYLYSLPRGAAKAIASASASASGPCVHQQPPGQPPP 260 270 280 290 300 >>CCDS43937.1 SLC35A2 gene_id:7355|Hs108|chrX (393 aa) initn: 290 init1: 123 opt: 295 Z-score: 329.5 bits: 69.5 E(32554): 4.9e-12 Smith-Waterman score: 295; 30.6% identity (59.0% similar) in 288 aa overlap (48-320:65-338) 20 30 40 50 60 70 pF1KE6 RWTLMLLLSTAMYGAHAPLLALCHVDGRVPFRPSSAVLLTELTKLLLCAFSLLVGWQAWP : ..::...:. : : : . :.. .. CCDS43 LKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNV 40 50 60 70 80 90 80 90 100 110 120 pF1KE6 QGPPPWRQAA---------PFALSALLYGANNNLVIYLQRYMDPSTYQVLSNLKIGSTAV . . . : .:. .:.: .::: . .:.:: .::: .::. CCDS43 KHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTAL 100 110 120 130 140 150 130 140 150 160 170 180 pF1KE6 LYCLCLRHRLSVRQGLALLLLMAAGACYAAGGLQVPGNTLPSPPPAAAASPMPLHITP-L . : : . :: : .::::.. : .. . :....: :: .: CCDS43 FSVLMLNRSLSRLQWASLLLLFT-------------GVAIVQAQQAGGGGPRPLDQNPGA 160 170 180 190 200 190 200 210 220 230 240 pF1KE6 GLLLLILYCLISGLSSVYTELLMKRQRLPLALQNLFLYTFGVLLNL-GLHAGGGSGP--- :: .. :: ::...:: : ..: . . :.:: : ::. :.: :: . :.. CCDS43 GLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLFGTALGLVGLWWAEGTAVATR 210 220 230 240 250 260 250 260 270 280 290 300 pF1KE6 GLLEGFSGWAALVVLSQALNGLLMSAVMKHGSSITRLFVVSCSLVVNAVLSAVLLRLQLT :.. :.. . :::.::..:::...:.:....: . :..: :.:...: : :. ... CCDS43 GFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFATSLSIVLSTVASIRLFGFHVD 270 280 290 300 310 320 310 320 pF1KE6 AAFFL-ATLLIGLAMRLYYGSR : : : :.:: :. :: CCDS43 PLFALGAGLVIG-AVYLYSLPRGAAKAIASASASASGPCVHQQPPGQPPPPQLSSHRGDL 330 340 350 360 370 380 >>CCDS14311.1 SLC35A2 gene_id:7355|Hs108|chrX (396 aa) initn: 290 init1: 123 opt: 295 Z-score: 329.5 bits: 69.5 E(32554): 4.9e-12 Smith-Waterman score: 295; 30.6% identity (59.0% similar) in 288 aa overlap (48-320:65-338) 20 30 40 50 60 70 pF1KE6 RWTLMLLLSTAMYGAHAPLLALCHVDGRVPFRPSSAVLLTELTKLLLCAFSLLVGWQAWP : ..::...:. : : : . :.. .. CCDS14 LKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNV 40 50 60 70 80 90 80 90 100 110 120 pF1KE6 QGPPPWRQAA---------PFALSALLYGANNNLVIYLQRYMDPSTYQVLSNLKIGSTAV . . . : .:. .:.: .::: . .:.:: .::: .::. CCDS14 KHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTAL 100 110 120 130 140 150 130 140 150 160 170 180 pF1KE6 LYCLCLRHRLSVRQGLALLLLMAAGACYAAGGLQVPGNTLPSPPPAAAASPMPLHITP-L . : : . :: : .::::.. : .. . :....: :: .: CCDS14 FSVLMLNRSLSRLQWASLLLLFT-------------GVAIVQAQQAGGGGPRPLDQNPGA 160 170 180 190 200 190 200 210 220 230 240 pF1KE6 GLLLLILYCLISGLSSVYTELLMKRQRLPLALQNLFLYTFGVLLNL-GLHAGGGSGP--- :: .. :: ::...:: : ..: . . :.:: : ::. :.: :: . :.. CCDS14 GLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLFGTALGLVGLWWAEGTAVATR 210 220 230 240 250 260 250 260 270 280 290 300 pF1KE6 GLLEGFSGWAALVVLSQALNGLLMSAVMKHGSSITRLFVVSCSLVVNAVLSAVLLRLQLT :.. :.. . :::.::..:::...:.:....: . :..: :.:...: : :. ... CCDS14 GFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFATSLSIVLSTVASIRLFGFHVD 270 280 290 300 310 320 310 320 pF1KE6 AAFFL-ATLLIGLAMRLYYGSR : : : :.:: :. :: CCDS14 PLFALGAGLVIG-AVYLYSLPRGAAKAIASASASASGPCVHQQPPGQPPPPQLSSHRGDL 330 340 350 360 370 380 >>CCDS75973.1 SLC35A2 gene_id:7355|Hs108|chrX (406 aa) initn: 290 init1: 123 opt: 295 Z-score: 329.3 bits: 69.5 E(32554): 5.1e-12 Smith-Waterman score: 295; 30.6% identity (59.0% similar) in 288 aa overlap (48-320:78-351) 20 30 40 50 60 70 pF1KE6 RWTLMLLLSTAMYGAHAPLLALCHVDGRVPFRPSSAVLLTELTKLLLCAFSLLVGWQAWP : ..::...:. : : : . :.. .. CCDS75 LKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNV 50 60 70 80 90 100 80 90 100 110 120 pF1KE6 QGPPPWRQAA---------PFALSALLYGANNNLVIYLQRYMDPSTYQVLSNLKIGSTAV . . . : .:. .:.: .::: . .:.:: .::: .::. CCDS75 KHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTAL 110 120 130 140 150 160 130 140 150 160 170 180 pF1KE6 LYCLCLRHRLSVRQGLALLLLMAAGACYAAGGLQVPGNTLPSPPPAAAASPMPLHITP-L . : : . :: : .::::.. : .. . :....: :: .: CCDS75 FSVLMLNRSLSRLQWASLLLLFT-------------GVAIVQAQQAGGGGPRPLDQNPGA 170 180 190 200 210 190 200 210 220 230 240 pF1KE6 GLLLLILYCLISGLSSVYTELLMKRQRLPLALQNLFLYTFGVLLNL-GLHAGGGSGP--- :: .. :: ::...:: : ..: . . :.:: : ::. :.: :: . :.. CCDS75 GLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLFGTALGLVGLWWAEGTAVATR 220 230 240 250 260 270 250 260 270 280 290 300 pF1KE6 GLLEGFSGWAALVVLSQALNGLLMSAVMKHGSSITRLFVVSCSLVVNAVLSAVLLRLQLT :.. :.. . :::.::..:::...:.:....: . :..: :.:...: : :. ... CCDS75 GFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFATSLSIVLSTVASIRLFGFHVD 280 290 300 310 320 330 310 320 pF1KE6 AAFFL-ATLLIGLAMRLYYGSR : : : :.:: :. :: CCDS75 PLFALGAGLVIG-AVYLYSLPRGAAKAIASASASASGPCVHQQPPGQPPPPQLSSHRGDL 340 350 360 370 380 390 >>CCDS65254.1 SLC35A2 gene_id:7355|Hs108|chrX (421 aa) initn: 280 init1: 113 opt: 295 Z-score: 329.1 bits: 69.5 E(32554): 5.2e-12 Smith-Waterman score: 295; 30.6% identity (59.0% similar) in 288 aa overlap (48-320:93-366) 20 30 40 50 60 70 pF1KE6 RWTLMLLLSTAMYGAHAPLLALCHVDGRVPFRPSSAVLLTELTKLLLCAFSLLVGWQAWP : ..::...:. : : : . :.. .. CCDS65 LKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNV 70 80 90 100 110 120 80 90 100 110 120 pF1KE6 QGPPPWRQAA---------PFALSALLYGANNNLVIYLQRYMDPSTYQVLSNLKIGSTAV . . . : .:. .:.: .::: . .:.:: .::: .::. CCDS65 KHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTAL 130 140 150 160 170 180 130 140 150 160 170 180 pF1KE6 LYCLCLRHRLSVRQGLALLLLMAAGACYAAGGLQVPGNTLPSPPPAAAASPMPLHITP-L . : : . :: : .::::.. : .. . :....: :: .: CCDS65 FSVLMLNRSLSRLQWASLLLLFT-------------GVAIVQAQQAGGGGPRPLDQNPGA 190 200 210 220 190 200 210 220 230 240 pF1KE6 GLLLLILYCLISGLSSVYTELLMKRQRLPLALQNLFLYTFGVLLNL-GLHAGGGSGP--- :: .. :: ::...:: : ..: . . :.:: : ::. :.: :: . :.. CCDS65 GLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLFGTALGLVGLWWAEGTAVATR 230 240 250 260 270 280 250 260 270 280 290 300 pF1KE6 GLLEGFSGWAALVVLSQALNGLLMSAVMKHGSSITRLFVVSCSLVVNAVLSAVLLRLQLT :.. :.. . :::.::..:::...:.:....: . :..: :.:...: : :. ... CCDS65 GFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFATSLSIVLSTVASIRLFGFHVD 290 300 310 320 330 340 310 320 pF1KE6 AAFFL-ATLLIGLAMRLYYGSR : : : :.:: :. :: CCDS65 PLFALGAGLVIG-AVYLYSLPRGAAKAIASASASASGPCVHQQPPGQPPPPQLSSHRGDL 350 360 370 380 390 400 324 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 12:27:20 2016 done: Tue Nov 8 12:27:21 2016 Total Scan time: 2.500 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]