FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6276, 220 aa 1>>>pF1KE6276 220 - 220 aa - 220 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.0491+/-0.00108; mu= 14.5865+/- 0.064 mean_var=54.7906+/-11.057, 0's: 0 Z-trim(101.2): 40 B-trim: 0 in 0/50 Lambda= 0.173269 statistics sampled from 6389 (6416) to 6389 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.555), E-opt: 0.2 (0.197), width: 16 Scan time: 1.390 The best scores are: opt bits E(32554) CCDS60205.1 SLC35A3 gene_id:23443|Hs108|chr1 ( 220) 1366 349.8 8.2e-97 CCDS762.1 SLC35A3 gene_id:23443|Hs108|chr1 ( 325) 1313 336.6 1.1e-92 CCDS60204.1 SLC35A3 gene_id:23443|Hs108|chr1 ( 367) 1313 336.6 1.2e-92 CCDS43937.1 SLC35A2 gene_id:7355|Hs108|chrX ( 393) 690 180.9 1e-45 CCDS14311.1 SLC35A2 gene_id:7355|Hs108|chrX ( 396) 690 180.9 1e-45 CCDS75973.1 SLC35A2 gene_id:7355|Hs108|chrX ( 406) 690 180.9 1e-45 CCDS65254.1 SLC35A2 gene_id:7355|Hs108|chrX ( 421) 690 180.9 1.1e-45 CCDS5010.1 SLC35A1 gene_id:10559|Hs108|chr6 ( 337) 526 139.8 1.9e-33 CCDS65253.1 SLC35A2 gene_id:7355|Hs108|chrX ( 332) 503 134.1 1e-31 CCDS55043.1 SLC35A1 gene_id:10559|Hs108|chr6 ( 278) 420 113.3 1.5e-25 CCDS75974.1 SLC35A2 gene_id:7355|Hs108|chrX ( 224) 371 101.0 6.2e-22 CCDS75975.1 SLC35A2 gene_id:7355|Hs108|chrX ( 218) 364 99.3 2e-21 CCDS35247.1 SLC35A2 gene_id:7355|Hs108|chrX ( 242) 364 99.3 2.2e-21 >>CCDS60205.1 SLC35A3 gene_id:23443|Hs108|chr1 (220 aa) initn: 1366 init1: 1366 opt: 1366 Z-score: 1851.6 bits: 349.8 E(32554): 8.2e-97 Smith-Waterman score: 1366; 100.0% identity (100.0% similar) in 220 aa overlap (1-220:1-220) 10 20 30 40 50 60 pF1KE6 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEGPRYLSSTAVVVAELLKIMACILLVYK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEGPRYLSSTAVVVAELLKIMACILLVYK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 DSKCSLRALNRVLHDEILNKPMETLKLAIPSGIYTLQNNLLYVALSNLDAATYQVTYQLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 DSKCSLRALNRVLHDEILNKPMETLKLAIPSGIYTLQNNLLYVALSNLDAATYQVTYQLK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 ILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFVQWPSDSQLDSKELSAGSQFVGLMAVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 ILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFVQWPSDSQLDSKELSAGSQFVGLMAVL 130 140 150 160 170 180 190 200 210 220 pF1KE6 TACFSSGFAGVYFEKILKETKQSVWIRNIQLVSFSLEPSL :::::::::::::::::::::::::::::::::::::::: CCDS60 TACFSSGFAGVYFEKILKETKQSVWIRNIQLVSFSLEPSL 190 200 210 220 >>CCDS762.1 SLC35A3 gene_id:23443|Hs108|chr1 (325 aa) initn: 1312 init1: 1312 opt: 1313 Z-score: 1777.3 bits: 336.6 E(32554): 1.1e-92 Smith-Waterman score: 1313; 99.1% identity (99.1% similar) in 214 aa overlap (1-214:1-214) 10 20 30 40 50 60 pF1KE6 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEGPRYLSSTAVVVAELLKIMACILLVYK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEGPRYLSSTAVVVAELLKIMACILLVYK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 DSKCSLRALNRVLHDEILNKPMETLKLAIPSGIYTLQNNLLYVALSNLDAATYQVTYQLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 DSKCSLRALNRVLHDEILNKPMETLKLAIPSGIYTLQNNLLYVALSNLDAATYQVTYQLK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 ILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFVQWPSDSQLDSKELSAGSQFVGLMAVL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 ILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFVQWPSDSQLDSKELSAGSQFVGLMAVL 130 140 150 160 170 180 190 200 210 220 pF1KE6 TACFSSGFAGVYFEKILKETKQSVWIRNIQLVSFSLEPSL ::::::::::::::::::::::::::::::: : CCDS76 TACFSSGFAGVYFEKILKETKQSVWIRNIQLGFFGSIFGLMGVYIYDGELVSKNGFFQGY 190 200 210 220 230 240 CCDS76 NRLTWIVVVLQALGGLVIAAVIKYADNILKGFATSLSIILSTLISYFWLQDFVPTSVFFL 250 260 270 280 290 300 >>CCDS60204.1 SLC35A3 gene_id:23443|Hs108|chr1 (367 aa) initn: 1312 init1: 1312 opt: 1313 Z-score: 1776.5 bits: 336.6 E(32554): 1.2e-92 Smith-Waterman score: 1313; 99.1% identity (99.1% similar) in 214 aa overlap (1-214:43-256) 10 20 30 pF1KE6 MFANLKYVSLGILVFQTTSLVLTMRYSRTL :::::::::::::::::::::::::::::: CCDS60 TDAPDQHLELKKPQELKEMERLPLANEDKTMFANLKYVSLGILVFQTTSLVLTMRYSRTL 20 30 40 50 60 70 40 50 60 70 80 90 pF1KE6 KEEGPRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 KEEGPRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIP 80 90 100 110 120 130 100 110 120 130 140 150 pF1KE6 SGIYTLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 SGIYTLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTG 140 150 160 170 180 190 160 170 180 190 200 210 pF1KE6 VAFVQWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS60 VAFVQWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQ 200 210 220 230 240 250 220 pF1KE6 LVSFSLEPSL : : CCDS60 LGFFGSIFGLMGVYIYDGELVSKNGFFQGYNRLTWIVVVLQALGGLVIAAVIKYADNILK 260 270 280 290 300 310 >>CCDS43937.1 SLC35A2 gene_id:7355|Hs108|chrX (393 aa) initn: 699 init1: 440 opt: 690 Z-score: 934.4 bits: 180.9 E(32554): 1e-45 Smith-Waterman score: 690; 54.8% identity (81.4% similar) in 210 aa overlap (5-214:35-241) 10 20 30 pF1KE6 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG :::.::..:: :..::.:..::.::: : CCDS43 GAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLP--G 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: CCDS43 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE6 TLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFV :::::: :::.::: :::.:::::::::::::::: ::...:. :: ::..:.::::.: CCDS43 TLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVAIV 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE6 QWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLVSF : . . . :. . .:: ::...:.:::::::::::::: .. :::.::.:: : CCDS43 QAQQAGGGGPRPLDQNPG-AGLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLF 190 200 210 220 230 240 220 pF1KE6 SLEPSL CCDS43 GTALGLVGLWWAEGTAVATRGFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFAT 250 260 270 280 290 300 >>CCDS14311.1 SLC35A2 gene_id:7355|Hs108|chrX (396 aa) initn: 699 init1: 440 opt: 690 Z-score: 934.3 bits: 180.9 E(32554): 1e-45 Smith-Waterman score: 690; 54.8% identity (81.4% similar) in 210 aa overlap (5-214:35-241) 10 20 30 pF1KE6 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG :::.::..:: :..::.:..::.::: : CCDS14 GAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLP--G 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: CCDS14 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE6 TLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFV :::::: :::.::: :::.:::::::::::::::: ::...:. :: ::..:.::::.: CCDS14 TLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVAIV 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE6 QWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLVSF : . . . :. . .:: ::...:.:::::::::::::: .. :::.::.:: : CCDS14 QAQQAGGGGPRPLDQNPG-AGLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLF 190 200 210 220 230 240 220 pF1KE6 SLEPSL CCDS14 GTALGLVGLWWAEGTAVATRGFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFAT 250 260 270 280 290 300 >>CCDS75973.1 SLC35A2 gene_id:7355|Hs108|chrX (406 aa) initn: 699 init1: 440 opt: 690 Z-score: 934.2 bits: 180.9 E(32554): 1e-45 Smith-Waterman score: 690; 54.8% identity (81.4% similar) in 210 aa overlap (5-214:48-254) 10 20 30 pF1KE6 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG :::.::..:: :..::.:..::.::: : CCDS75 VSAGALEPGTASAGETVCPSSRMGGGAHRRLKYISLAVLVVQNASLILSIRYARTLP--G 20 30 40 50 60 70 40 50 60 70 80 90 pF1KE6 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: CCDS75 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY 80 90 100 110 120 130 100 110 120 130 140 150 pF1KE6 TLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFV :::::: :::.::: :::.:::::::::::::::: ::...:. :: ::..:.::::.: CCDS75 TLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVAIV 140 150 160 170 180 190 160 170 180 190 200 210 pF1KE6 QWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLVSF : . . . :. . .:: ::...:.:::::::::::::: .. :::.::.:: : CCDS75 QAQQAGGGGPRPLDQNPG-AGLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLF 200 210 220 230 240 250 220 pF1KE6 SLEPSL CCDS75 GTALGLVGLWWAEGTAVATRGFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFAT 260 270 280 290 300 310 >>CCDS65254.1 SLC35A2 gene_id:7355|Hs108|chrX (421 aa) initn: 699 init1: 440 opt: 690 Z-score: 933.9 bits: 180.9 E(32554): 1.1e-45 Smith-Waterman score: 690; 54.8% identity (81.4% similar) in 210 aa overlap (5-214:63-269) 10 20 30 pF1KE6 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG :::.::..:: :..::.:..::.::: : CCDS65 LLTWEEAEARGQGLPQPLPDTSVRIPAHRRLKYISLAVLVVQNASLILSIRYARTLP--G 40 50 60 70 80 90 40 50 60 70 80 90 pF1KE6 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: CCDS65 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY 100 110 120 130 140 150 100 110 120 130 140 150 pF1KE6 TLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFV :::::: :::.::: :::.:::::::::::::::: ::...:. :: ::..:.::::.: CCDS65 TLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVAIV 160 170 180 190 200 210 160 170 180 190 200 210 pF1KE6 QWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLVSF : . . . :. . .:: ::...:.:::::::::::::: .. :::.::.:: : CCDS65 QAQQAGGGGPRPLDQNPG-AGLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQLGLF 220 230 240 250 260 220 pF1KE6 SLEPSL CCDS65 GTALGLVGLWWAEGTAVATRGFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKGFAT 270 280 290 300 310 320 >>CCDS5010.1 SLC35A1 gene_id:10559|Hs108|chr6 (337 aa) initn: 464 init1: 306 opt: 526 Z-score: 713.9 bits: 139.8 E(32554): 1.9e-33 Smith-Waterman score: 526; 41.3% identity (75.5% similar) in 208 aa overlap (5-211:12-213) 10 20 30 40 50 pF1KE6 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEGPRYLSSTAVVVAELLKIMA .: :........ ....::.:: .: :.:.::: ..:..:.. CCDS50 MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKE--LYFSTTAVCITEVIKLLL 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 CILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIYTLQNNLLYVALSNLDAATY . .. :.. :: .. :....:..: : :::..:: .:..:::. ..::::::::.: CCDS50 SVGILAKETG-SLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVY 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 QVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFVQW-PSDSQLDSKELSAGSQ :::::::: ::: .: ::.. :. ::.:. .: .::..::: :... .: . . CCDS50 QVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQA---TKVVVEQNP 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 FVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLVSFSLEPSL ..:. :. : . ::::::::::.:: . :.:.::::. CCDS50 LLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIK 180 190 200 210 220 230 CCDS50 EKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQ 240 250 260 270 280 290 >>CCDS65253.1 SLC35A2 gene_id:7355|Hs108|chrX (332 aa) initn: 509 init1: 345 opt: 503 Z-score: 682.9 bits: 134.1 E(32554): 1e-31 Smith-Waterman score: 503; 56.9% identity (78.4% similar) in 153 aa overlap (62-214:29-180) 40 50 60 70 80 90 pF1KE6 EEGPRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPS : ... : ::. .: . ..:::::.:: CCDS65 MAAVGAGGSTAAPGPGAVSAGALEPGTASAGNVKHLVLFLHEAVLVQYVDTLKLAVPS 10 20 30 40 50 100 110 120 130 140 150 pF1KE6 GIYTLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGV :::::::: :::.::: :::.:::::::::::::::: ::...:. :: ::..:.::: CCDS65 LIYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGV 60 70 80 90 100 110 160 170 180 190 200 210 pF1KE6 AFVQWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQL :.:: . . . :. . .:: ::...:.:::::::::::::: .. :::.::.:: CCDS65 AIVQAQQAGGGGPRPLDQNPG-AGLAAVVASCLSSGFAGVYFEKILKGSSGSVWLRNLQL 120 130 140 150 160 170 220 pF1KE6 VSFSLEPSL : CCDS65 GLFGTALGLVGLWWAEGTAVATRGFFFGYTPAVWGVVLNQAFGGLLVAVVVKYADNILKG 180 190 200 210 220 230 >>CCDS55043.1 SLC35A1 gene_id:10559|Hs108|chr6 (278 aa) initn: 373 init1: 306 opt: 420 Z-score: 572.0 bits: 113.3 E(32554): 1.5e-25 Smith-Waterman score: 420; 37.4% identity (71.4% similar) in 203 aa overlap (5-200:12-208) 10 20 30 40 50 pF1KE6 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEGPRYLSSTAVVVAELLKIMA .: :........ ....::.:: .: :.:.::: ..:..:.. CCDS55 MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKE--LYFSTTAVCITEVIKLLL 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 CILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIYTLQNNLLYVALSNLDAATY . .. :.. :: .. :....:..: : :::..:: .:..:::. ..::::::::.: CCDS55 SVGILAKETG-SLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVY 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 QVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFVQW-PSDSQLDSKELSAGSQ :::::::: ::: .: ::.. :. ::.:. .: .::..::: :... .: . . CCDS55 QVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQA---TKVVVEQNP 120 130 140 150 160 170 180 190 200 210 220 pF1KE6 FVGLMAVLTACFSSGFA------GVYFEKILKETKQSVWIRNIQLVSFSLEPSL ..:. :. : . :::: :.: ..: : CCDS55 LLGFGAIAIAVLCSGFAVLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGL 180 190 200 210 220 230 CCDS55 QITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGV 240 250 260 270 220 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:42:37 2016 done: Tue Nov 8 11:42:38 2016 Total Scan time: 1.390 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]