FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6279, 242 aa 1>>>pF1KE6279 242 - 242 aa - 242 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.5910+/-0.000269; mu= 14.7985+/- 0.017 mean_var=90.9601+/-18.583, 0's: 0 Z-trim(121.2): 22 B-trim: 2052 in 1/56 Lambda= 0.134477 statistics sampled from 37383 (37410) to 37383 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.781), E-opt: 0.2 (0.439), width: 16 Scan time: 5.610 The best scores are: opt bits E(85289) NP_001027460 (OMIM: 300896,314375) UDP-galactose t ( 242) 1592 317.9 1e-86 NP_001269577 (OMIM: 300896,314375) UDP-galactose t ( 218) 1403 281.2 1e-75 NP_001269576 (OMIM: 300896,314375) UDP-galactose t ( 224) 1227 247.1 2e-65 NP_001035963 (OMIM: 300896,314375) UDP-galactose t ( 393) 875 179.0 1.1e-44 NP_005651 (OMIM: 300896,314375) UDP-galactose tran ( 396) 875 179.0 1.1e-44 NP_001269579 (OMIM: 300896,314375) UDP-galactose t ( 406) 719 148.7 1.4e-35 NP_001269580 (OMIM: 300896,314375) UDP-galactose t ( 421) 686 142.3 1.2e-33 XP_016856360 (OMIM: 605632,615553) PREDICTED: UDP- ( 228) 367 80.2 3.4e-15 XP_016856358 (OMIM: 605632,615553) PREDICTED: UDP- ( 284) 367 80.3 4e-15 XP_011539437 (OMIM: 605632,615553) PREDICTED: UDP- ( 326) 367 80.3 4.4e-15 NP_001258613 (OMIM: 605632,615553) UDP-N-acetylglu ( 220) 364 79.6 4.9e-15 XP_011539440 (OMIM: 605632,615553) PREDICTED: UDP- ( 262) 364 79.7 5.6e-15 XP_016856359 (OMIM: 605632,615553) PREDICTED: UDP- ( 269) 364 79.7 5.7e-15 XP_011539439 (OMIM: 605632,615553) PREDICTED: UDP- ( 311) 364 79.7 6.3e-15 XP_011539438 (OMIM: 605632,615553) PREDICTED: UDP- ( 325) 364 79.8 6.6e-15 XP_005270748 (OMIM: 605632,615553) PREDICTED: UDP- ( 325) 364 79.8 6.6e-15 NP_036375 (OMIM: 605632,615553) UDP-N-acetylglucos ( 325) 364 79.8 6.6e-15 NP_001258614 (OMIM: 605632,615553) UDP-N-acetylglu ( 367) 364 79.8 7.2e-15 NP_001269578 (OMIM: 300896,314375) UDP-galactose t ( 332) 315 70.3 4.8e-12 NP_001161870 (OMIM: 603585,605634) CMP-sialic acid ( 278) 247 57.0 4e-08 NP_006407 (OMIM: 603585,605634) CMP-sialic acid tr ( 337) 247 57.1 4.6e-08 >>NP_001027460 (OMIM: 300896,314375) UDP-galactose trans (242 aa) initn: 1592 init1: 1592 opt: 1592 Z-score: 1678.0 bits: 317.9 E(85289): 1e-86 Smith-Waterman score: 1592; 100.0% identity (100.0% similar) in 242 aa overlap (1-242:1-242) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 WRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 WRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLP 190 200 210 220 230 240 pF1KE6 GH :: NP_001 GH >>NP_001269577 (OMIM: 300896,314375) UDP-galactose trans (218 aa) initn: 1403 init1: 1403 opt: 1403 Z-score: 1480.4 bits: 281.2 E(85289): 1e-75 Smith-Waterman score: 1403; 100.0% identity (100.0% similar) in 212 aa overlap (31-242:7-218) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::: NP_001 MKLCRDAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE6 WRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 WRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLP 160 170 180 190 200 210 pF1KE6 GH :: NP_001 GH >>NP_001269576 (OMIM: 300896,314375) UDP-galactose trans (224 aa) initn: 1250 init1: 1226 opt: 1227 Z-score: 1295.7 bits: 247.1 E(85289): 2e-65 Smith-Waterman score: 1230; 86.3% identity (90.2% similar) in 234 aa overlap (1-231:1-222) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 WRPHHGALSAKVSAGEVRAGSNGGTQGRG---TGVEGVGHLQDPSRHPPGPGSSGFGRWS ::::::::::::. ::.: .:.: .: : . : ::... NP_001 WRPHHGALSAKVAH-----------QGEGFLAAGIEDIG-LASFSLLALGPAGTKL 190 200 210 220 240 pF1KE6 FLPGH >>NP_001035963 (OMIM: 300896,314375) UDP-galactose trans (393 aa) initn: 907 init1: 875 opt: 875 Z-score: 923.3 bits: 179.0 E(85289): 1.1e-44 Smith-Waterman score: 875; 100.0% identity (100.0% similar) in 142 aa overlap (1-142:1-142) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::: NP_001 IYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVA 130 140 150 160 170 180 >>NP_005651 (OMIM: 300896,314375) UDP-galactose transloc (396 aa) initn: 907 init1: 875 opt: 875 Z-score: 923.3 bits: 179.0 E(85289): 1.1e-44 Smith-Waterman score: 875; 100.0% identity (100.0% similar) in 142 aa overlap (1-142:1-142) 10 20 30 40 50 60 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 MAAVGAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_005 PGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPP :::::::::::::::::::::: NP_005 IYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRLQWASLLLLFTGVA 130 140 150 160 170 180 >>NP_001269579 (OMIM: 300896,314375) UDP-galactose trans (406 aa) initn: 893 init1: 686 opt: 719 Z-score: 759.6 bits: 148.7 E(85289): 1.4e-35 Smith-Waterman score: 839; 91.6% identity (91.6% similar) in 155 aa overlap (1-142:1-155) 10 20 30 40 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASA-------------AHRRLKYISLAVLVVQN :::::::::::::::::::::::::::::: ::::::::::::::::: NP_001 MAAVGAGGSTAAPGPGAVSAGALEPGTASAGETVCPSSRMGGGAHRRLKYISLAVLVVQN 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE6 ASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 ASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLV 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE6 QYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQPSPRCSQSHSLCLCLRLRALRSPAA ::::::::::::::::::::::::::::::::::: NP_001 QYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTTALFSVLMLNRSLSRL 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE6 SRAATTTAAVFPPWRPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPG NP_001 QWASLLLLFTGVAIVQAQQAGGGGPRPLDQNPGAGLAAVVASCLSSGFAGVYFEKILKGS 190 200 210 220 230 240 >>NP_001269580 (OMIM: 300896,314375) UDP-galactose trans (421 aa) initn: 893 init1: 686 opt: 686 Z-score: 724.8 bits: 142.3 E(85289): 1.2e-33 Smith-Waterman score: 809; 83.5% identity (83.5% similar) in 170 aa overlap (1-142:1-170) 10 20 30 pF1KE6 MAAVGAGGSTAAPGPGAVSAGALEPGTASA----------------------------AH :::::::::::::::::::::::::::::: :: NP_001 MAAVGAGGSTAAPGPGAVSAGALEPGTASAELLLTWEEAEARGQGLPQPLPDTSVRIPAH 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE6 RRLKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RRLKYISLAVLVVQNASLILSIRYARTLPGDRFFATTAVVMAEVLKGLTCLLLLFAQKRG 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE6 NVKHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQPSPRCSQSHS :::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 NVKHLVLFLHEAVLVQYVDTLKLAVPSLIYTLQNNLQYVAISNLPAATFQVTYQLKILTT 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE6 LCLCLRLRALRSPAASRAATTTAAVFPPWRPHHGALSAKVSAGEVRAGSNGGTQGRGTGV NP_001 ALFSVLMLNRSLSRLQWASLLLLFTGVAIVQAQQAGGGGPRPLDQNPGAGLAAVVASCLS 190 200 210 220 230 240 >>XP_016856360 (OMIM: 605632,615553) PREDICTED: UDP-N-ac (228 aa) initn: 342 init1: 247 opt: 367 Z-score: 393.9 bits: 80.2 E(85289): 3.4e-15 Smith-Waterman score: 367; 50.8% identity (80.3% similar) in 122 aa overlap (35-153:5-126) 10 20 30 40 50 60 pF1KE6 GAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLP--G :::.::..:: :..::.:..::.::: : XP_016 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG 10 20 30 70 80 90 100 110 120 pF1KE6 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: XP_016 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 TLQNNLQYVAISNLPAATFQ-PSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPPW :::::: :::.::: :::.: :: .:. : XP_016 TLQNNLLYVALSNLDAATYQWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEK 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE6 RPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLPG XP_016 ILKETKQSVWIRNIQLGFFGSIFGLMGVYIYDGELVSKNGFFQGYNRLTWIVVVLQCLFP 160 170 180 190 200 210 >>XP_016856358 (OMIM: 605632,615553) PREDICTED: UDP-N-ac (284 aa) initn: 342 init1: 247 opt: 367 Z-score: 392.6 bits: 80.3 E(85289): 4e-15 Smith-Waterman score: 367; 50.8% identity (80.3% similar) in 122 aa overlap (35-153:5-126) 10 20 30 40 50 60 pF1KE6 GAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLP--G :::.::..:: :..::.:..::.::: : XP_016 MFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG 10 20 30 70 80 90 100 110 120 pF1KE6 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: XP_016 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE6 TLQNNLQYVAISNLPAATFQ-PSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPPW :::::: :::.::: :::.: :: .:. : XP_016 TLQNNLLYVALSNLDAATYQWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEK 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE6 RPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLPG XP_016 ILKETKQSVWIRNIQLGFFGSIFGLMGVYIYDGELVSKNGFFQGYNRLTWIVVVLQALGG 160 170 180 190 200 210 >>XP_011539437 (OMIM: 605632,615553) PREDICTED: UDP-N-ac (326 aa) initn: 342 init1: 247 opt: 367 Z-score: 391.8 bits: 80.3 E(85289): 4.4e-15 Smith-Waterman score: 367; 50.8% identity (80.3% similar) in 122 aa overlap (35-153:47-168) 10 20 30 40 50 60 pF1KE6 GAGGSTAAPGPGAVSAGALEPGTASAAHRRLKYISLAVLVVQNASLILSIRYARTLP--G :::.::..:: :..::.:..::.::: : XP_011 DQHLELKKPQELKEMERLPLANEDKTMFANLKYVSLGILVFQTTSLVLTMRYSRTLKEEG 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE6 DRFFATTAVVMAEVLKGLTCLLLLFAQKRGNVKHLVLFLHEAVLVQYVDTLKLAVPSLIY :....::::.::.:: ..:.::.. ... ... : ::. .: . ..:::::.:: :: XP_011 PRYLSSTAVVVAELLKIMACILLVYKDSKCSLRALNRVLHDEILNKPMETLKLAIPSGIY 80 90 100 110 120 130 130 140 150 160 170 180 pF1KE6 TLQNNLQYVAISNLPAATFQ-PSPRCSQSHSLCLCLRLRALRSPAASRAATTTAAVFPPW :::::: :::.::: :::.: :: .:. : XP_011 TLQNNLLYVALSNLDAATYQWPSDSQLDSKELSAGSQFVGLMAVLTACFSSGFAGVYFEK 140 150 160 170 180 190 190 200 210 220 230 240 pF1KE6 RPHHGALSAKVSAGEVRAGSNGGTQGRGTGVEGVGHLQDPSRHPPGPGSSGFGRWSFLPG XP_011 ILKETKQSVWIRNIQLGFFGSIFGLMGVYIYDGELVSKNGFFQGYNRLTWIVVVLQALGG 200 210 220 230 240 250 242 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 11:44:08 2016 done: Tue Nov 8 11:44:09 2016 Total Scan time: 5.610 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]