FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6279, 242 aa 1>>>pF1KE6279 242 - 242 aa - 242 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5939+/-0.000616; mu= 15.0501+/- 0.038 mean_var=91.4975+/-18.383, 0's: 0 Z-trim(114.1): 15 B-trim: 0 in 0/55 Lambda= 0.134082 statistics sampled from 14712 (14726) to 14712 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.786), E-opt: 0.2 (0.452), width: 16 Scan time: 2.250 The best scores are: opt bits E(32554) CCDS35247.1 SLC35A2 gene_id:7355|Hs108|chrX ( 242) 1592 316.9 7.6e-87 CCDS75975.1 SLC35A2 gene_id:7355|Hs108|chrX ( 218) 1403 280.3 7.1e-76 CCDS75974.1 SLC35A2 gene_id:7355|Hs108|chrX ( 224) 1227 246.3 1.3e-65 CCDS43937.1 SLC35A2 gene_id:7355|Hs108|chrX ( 393) 875 178.4 6.1e-45 CCDS14311.1 SLC35A2 gene_id:7355|Hs108|chrX ( 396) 875 178.4 6.2e-45 CCDS75973.1 SLC35A2 gene_id:7355|Hs108|chrX ( 406) 719 148.2 7.6e-36 CCDS65254.1 SLC35A2 gene_id:7355|Hs108|chrX ( 421) 686 141.9 6.5e-34 CCDS60205.1 SLC35A3 gene_id:23443|Hs108|chr1 ( 220) 364 79.3 2.3e-15 CCDS762.1 SLC35A3 gene_id:23443|Hs108|chr1 ( 325) 364 79.5 3e-15 CCDS60204.1 SLC35A3 gene_id:23443|Hs108|chr1 ( 367) 364 79.5 3.3e-15 CCDS65253.1 SLC35A2 gene_id:7355|Hs108|chrX ( 332) 315 70.0 2.2e-12 >>CCDS35247.1 SLC35A2 gene_id:7355|Hs108|chrX (242 aa) initn: 1592 init1: 1592 opt: 1592 Z-score: 1672.7 bits: 316.9 E(32554): 7.6e-87 Smith-Waterman score: 1592; 100.0% identity (100.0% similar) in 242 aa overlap (1-242:1-242) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 WRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS35 WRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLP 190 200 210 220 230 240 pF1KE6 GH :: CCDS35 GH >>CCDS75975.1 SLC35A2 gene_id:7355|Hs108|chrX (218 aa) initn: 1403 init1: 1403 opt: 1403 Z-score: 1475.7 bits: 280.3 E(32554): 7.1e-76 Smith-Waterman score: 1403; 100.0% identity (100.0% similar) in 212 aa overlap (31-242:7-218) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::: CCDS75 MKLCRDAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE6 WRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 WRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLP 160 170 180 190 200 210 pF1KE6 GH :: CCDS75 GH >>CCDS75974.1 SLC35A2 gene_id:7355|Hs108|chrX (224 aa) initn: 1250 init1: 1226 opt: 1227 Z-score: 1291.5 bits: 246.3 E(32554): 1.3e-65 Smith-Waterman score: 1230; 86.3% identity (90.2% similar) in 234 aa overlap (1-231:1-222) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 WRPHHGALSAKVSAGEVRAGSNGGTQGRG---TGVEGVGHLQDPSRHPPGPGSSGFGRWS ::::::::::::. ::.: .:.: .: : . : ::... CCDS75 WRPHHGALSAKVAH-----------QGEGFLAAGIEDIG-LASFSLLALGPAGTKL 190 200 210 220 240 pF1KE6 FLPGH >>CCDS43937.1 SLC35A2 gene_id:7355|Hs108|chrX (393 aa) initn: 907 init1: 875 opt: 875 Z-score: 920.3 bits: 178.4 E(32554): 6.1e-45 Smith-Waterman score: 875; 100.0% identity (100.0% similar) in 142 aa overlap (1-142:1-142) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS43 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::: CCDS43 IYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVA 130 140 150 160 170 180 >>CCDS14311.1 SLC35A2 gene_id:7355|Hs108|chrX (396 aa) initn: 907 init1: 875 opt: 875 Z-score: 920.2 bits: 178.4 E(32554): 6.2e-45 Smith-Waterman score: 875; 100.0% identity (100.0% similar) in 142 aa overlap (1-142:1-142) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::: CCDS14 IYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVA 130 140 150 160 170 180 >>CCDS75973.1 SLC35A2 gene_id:7355|Hs108|chrX (406 aa) initn: 893 init1: 686 opt: 719 Z-score: 757.0 bits: 148.2 E(32554): 7.6e-36 Smith-Waterman score: 839; 91.6% identity (91.6% similar) in 155 aa overlap (1-142:1-155) 10 20 30 40 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASA-------------AHRRLKYISLAVLVVQN :::::::::::::::::::::::::::::: ::::::::::::::::: CCDS75 MAAVGAGGSTAAPGPGAVSAGALEPGTASAGETVCPSSRMGGGAHRRLKYISLAVLVVQN 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 ASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS75 ASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLV 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 QYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAA ::::::::::::::::::::::::::::::::::: CCDS75 QYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRL 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE6 SRAATTTAAVFPPWRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPG CCDS75 QWASLLLLFTGVAIVQAQQAGGGGPRPLDQNPGAGLAAVVASCLSSGFAGVYFEKILKGS 190 200 210 220 230 240 >>CCDS65254.1 SLC35A2 gene_id:7355|Hs108|chrX (421 aa) initn: 893 init1: 686 opt: 686 Z-score: 722.3 bits: 141.9 E(32554): 6.5e-34 Smith-Waterman score: 809; 83.5% identity (83.5% similar) in 170 aa overlap (1-142:1-170) 10 20 30 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASA----------------------------AH :::::::::::::::::::::::::::::: :: CCDS65 MAAVGAGGSTAAPGPGAVSAGALEPGTASAELLLTWEEAEARGQGLPQPLPDTSVRIPAH 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 RRLKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 RRLKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRG 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE6 NVKHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQPSPRCSQSHS :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS65 NVKHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTT 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE6 LCLCLRLRALRSPAASRAATTTAAVFPPWRPHHGALSAKVSAGEVRAGSNGGTQGRGTGV CCDS65 ALFSVLMLNRSLSRLQWASLLLLFTGVAIVQAQQAGGGGPRPLDQNPGAGLAAVVASCLS 190 200 210 220 230 240 >>CCDS60205.1 SLC35A3 gene_id:23443|Hs108|chr1 (220 aa) initn: 358 init1: 247 opt: 364 Z-score: 389.4 bits: 79.3 E(32554): 2.3e-15 Smith-Waterman score: 364; 40.1% identity (70.9% similar) in 172 aa overlap (35-200:5-175) 10 20 30 40 50 60 pF1KE6 GAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLP--G :::.::..:: :..::.:..::.::: : CCDS60 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG 10 20 30 70 80 90 100 110 120 pF1KE6 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: CCDS60 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY 40 50 60 70 80 90 130 140 150 160 170 pF1KE6 TLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAA---SRAATTTAAVFP :::::: :::.::: :::.: . . . . . . . . . . : . :...: CCDS60 TLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFV 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 PWRPHHGALSAK-VSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSF : : . :..: .::: .: CCDS60 QW-PSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLVS 160 170 180 190 200 210 >>CCDS762.1 SLC35A3 gene_id:23443|Hs108|chr1 (325 aa) initn: 342 init1: 247 opt: 364 Z-score: 387.2 bits: 79.5 E(32554): 3e-15 Smith-Waterman score: 364; 40.1% identity (70.9% similar) in 172 aa overlap (35-200:5-175) 10 20 30 40 50 60 pF1KE6 GAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLP--G :::.::..:: :..::.:..::.::: : CCDS76 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG 10 20 30 70 80 90 100 110 120 pF1KE6 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: CCDS76 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY 40 50 60 70 80 90 130 140 150 160 170 pF1KE6 TLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAA---SRAATTTAAVFP :::::: :::.::: :::.: . . . . . . . . . . : . :...: CCDS76 TLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFV 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 PWRPHHGALSAK-VSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSF : : . :..: .::: .: CCDS76 QW-PSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLGF 160 170 180 190 200 210 >>CCDS60204.1 SLC35A3 gene_id:23443|Hs108|chr1 (367 aa) initn: 342 init1: 247 opt: 364 Z-score: 386.5 bits: 79.5 E(32554): 3.3e-15 Smith-Waterman score: 364; 40.1% identity (70.9% similar) in 172 aa overlap (35-200:47-217) 10 20 30 40 50 60 pF1KE6 GAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLP--G :::.::..:: :..::.:..::.::: : CCDS60 DQHLELKKPQELKEMERLPLANEDKTMFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE6 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: CCDS60 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY 80 90 100 110 120 130 130 140 150 160 170 pF1KE6 TLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAA---SRAATTTAAVFP :::::: :::.::: :::.: . . . . . . . . . . : . :...: CCDS60 TLQNNLLYVALSNLDAATYQVTYQLKILTTALFSVSMLSKKLGVYQWLSLVILMTGVAFV 140 150 160 170 180 190 180 190 200 210 220 230 pF1KE6 PWRPHHGALSAK-VSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSF : : . :..: .::: .: CCDS60 QW-PSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEKILKETKQSVWIRNIQLGF 200 210 220 230 240 250 242 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:44:07 2016 done: Tue Nov 8 11:44:07 2016 Total Scan time: 2.250 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]