FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE3605, 458 aa
 1>>>pF1KE3605 458 - 458 aa - 458 aa
Library: /omim/omim.rfq.tfa
 60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2119+/-0.000379; mu= 17.9962+/- 0.024
 mean_var=64.4669+/-12.955, 0's: 0 Z-trim(111.7): 37 B-trim: 553 in 1/52
 Lambda= 0.159737
 statistics sampled from 20365 (20402) to 20365 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.62), E-opt: 0.2 (0.239), width: 16
 Scan time: 9.260

The best scores are: opt bits E(85289)
NP_002035 (OMIM: 137028) N-acetylgalactosamine kin ( 458) 3023 705.7 6.4e-203
XP_005254336 (OMIM: 137028) PREDICTED: N-acetylgal ( 462) 2979 695.6 7.3e-200
XP_006720524 (OMIM: 137028) PREDICTED: N-acetylgal ( 458) 2971 693.7 2.6e-199
NP_001001556 (OMIM: 137028) N-acetylgalactosamine ( 447) 2914 680.6 2.3e-195
NP_001275960 (OMIM: 137028) N-acetylgalactosamine ( 434) 2873 671.1 1.6e-192
NP_001275959 (OMIM: 137028) N-acetylgalactosamine ( 434) 2873 671.1 1.6e-192
XP_005254337 (OMIM: 137028) PREDICTED: N-acetylgal ( 451) 2870 670.5 2.6e-192
XP_006720525 (OMIM: 137028) PREDICTED: N-acetylgal ( 438) 2829 661.0 1.8e-189
XP_011519743 (OMIM: 137028) PREDICTED: N-acetylgal ( 333) 2114 496.2 5.7e-140
XP_016877553 (OMIM: 137028) PREDICTED: N-acetylgal ( 333) 2114 496.2 5.7e-140
XP_016877555 (OMIM: 137028) PREDICTED: N-acetylgal ( 329) 2106 494.3 2e-139
XP_016877554 (OMIM: 137028) PREDICTED: N-acetylgal ( 329) 2106 494.3 2e-139
XP_016877551 (OMIM: 137028) PREDICTED: N-acetylgal ( 401) 1693 399.2 1.1e-110
XP_005254341 (OMIM: 137028) PREDICTED: N-acetylgal ( 418) 1649 389.1 1.2e-107
XP_006720526 (OMIM: 137028) PREDICTED: N-acetylgal ( 429) 1649 389.1 1.3e-107
XP_016877552 (OMIM: 137028) PREDICTED: N-acetylgal ( 367) 1333 316.2 9.3e-86
XP_006720527 (OMIM: 137028) PREDICTED: N-acetylgal ( 378) 1333
316.2 9.5e-86 NP_000145 (OMIM: 230200,604313) galactokinase [Hom ( 392) 382 97.1 9.2e-20 >>NP_002035 (OMIM: 137028) N-acetylgalactosamine kinase (458 aa) initn: 3023 init1: 3023 opt: 3023 Z-score: 3764.0 bits: 705.7 E(85289): 6.4e-203 Smith-Waterman score: 3023; 99.8% identity (99.8% similar) in 458 aa overlap (1-458:1-458) 10 20 30 40 50 60 pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL :::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::: NP_002 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_002 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF 
370 380 390 400 410 420 430 440 450 pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA :::::::::::::::::::::::::::::::::::::: NP_002 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA 430 440 450 >>XP_005254336 (OMIM: 137028) PREDICTED: N-acetylgalacto (462 aa) initn: 2979 init1: 2979 opt: 2979 Z-score: 3709.2 bits: 695.6 E(85289): 7.3e-200 Smith-Waterman score: 2979; 99.8% identity (99.8% similar) in 450 aa overlap (1-450:1-450) 10 20 30 40 50 60 pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL :::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::: XP_005 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF 370 380 390 400 410 420 430 440 450 pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA :::::::::::::::::::::::::::::: XP_005 LANVHKAYYQRSDGSLAPEKQSLFATKPGGALEIPASSCILR 430 440 450 460 >>XP_006720524 (OMIM: 137028) PREDICTED: N-acetylgalacto (458 aa) initn: 2971 init1: 2971 opt: 2971 Z-score: 3699.2 bits: 693.7 E(85289): 2.6e-199 Smith-Waterman score: 2971; 99.8% identity (99.8% similar) in 449 aa overlap (1-449:1-449) 10 20 30 40 50 60 pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL :::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::: XP_006 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL 310 320 330 340 350 360 
370 380 390 400 410 420 pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF 370 380 390 400 410 420 430 440 450 pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA ::::::::::::::::::::::::::::: XP_006 LANVHKAYYQRSDGSLAPEKQSLFATKPGASERYENNT 430 440 450 >>NP_001001556 (OMIM: 137028) N-acetylgalactosamine kina (447 aa) initn: 2914 init1: 2914 opt: 2914 Z-score: 3628.4 bits: 680.6 E(85289): 2.3e-195 Smith-Waterman score: 2914; 99.3% identity (99.5% similar) in 443 aa overlap (16-458:5-447) 10 20 30 40 50 60 pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP . ::::::::::::::::::::::::::::::::::::::::::: NP_001 MPVLYDRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP 10 20 30 40 70 80 90 100 110 120 pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER 110 120 130 140 150 160 190 200 210 220 230 240 pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF 170 180 190 200 210 220 250 260 270 280 290 300 pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI 230 240 250 260 270 280 310 320 330 340 350 360 pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL :::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::: NP_001 
CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL 290 300 310 320 330 340 370 380 390 400 410 420 pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF 350 360 370 380 390 400 430 440 450 pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA :::::::::::::::::::::::::::::::::::::: NP_001 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA 410 420 430 440 >>NP_001275960 (OMIM: 137028) N-acetylgalactosamine kina (434 aa) initn: 2873 init1: 2873 opt: 2873 Z-score: 3577.5 bits: 671.1 E(85289): 1.6e-192 Smith-Waterman score: 2873; 99.8% identity (99.8% similar) in 434 aa overlap (25-458:1-434) 10 20 30 40 50 60 pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP :::::::::::::::::::::::::::::::::::: NP_001 MFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP 10 20 30 70 80 90 100 110 120 pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL :::::::::::::::::::::::::::::: 
::::::::::::::::::::::::::::: NP_001 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL 280 290 300 310 320 330 370 380 390 400 410 420 pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF 340 350 360 370 380 390 430 440 450 pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA :::::::::::::::::::::::::::::::::::::: NP_001 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA 400 410 420 430 >>NP_001275959 (OMIM: 137028) N-acetylgalactosamine kina (434 aa) initn: 2873 init1: 2873 opt: 2873 Z-score: 3577.5 bits: 671.1 E(85289): 1.6e-192 Smith-Waterman score: 2873; 99.8% identity (99.8% similar) in 434 aa overlap (25-458:1-434) 10 20 30 40 50 60 pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP :::::::::::::::::::::::::::::::::::: NP_001 MFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP 10 20 30 70 80 90 100 110 120 pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE3 
CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL :::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::: NP_001 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL 280 290 300 310 320 330 370 380 390 400 410 420 pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF 340 350 360 370 380 390 430 440 450 pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA :::::::::::::::::::::::::::::::::::::: NP_001 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA 400 410 420 430 >>XP_005254337 (OMIM: 137028) PREDICTED: N-acetylgalacto (451 aa) initn: 2870 init1: 2870 opt: 2870 Z-score: 3573.6 bits: 670.5 E(85289): 2.6e-192 Smith-Waterman score: 2870; 99.3% identity (99.5% similar) in 435 aa overlap (16-450:5-439) 10 20 30 40 50 60 pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP . ::::::::::::::::::::::::::::::::::::::::::: XP_005 MPVLYDRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP 10 20 30 40 70 80 90 100 110 120 pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE 50 60 70 80 90 100 130 140 150 160 170 180 pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER 110 120 130 140 150 160 190 200 210 220 230 240 pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF 170 180 190 200 210 220 250 260 270 280 290 300 pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 
NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI 230 240 250 260 270 280 310 320 330 340 350 360 pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL :::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::: XP_005 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL 290 300 310 320 330 340 370 380 390 400 410 420 pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF 350 360 370 380 390 400 430 440 450 pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA :::::::::::::::::::::::::::::: XP_005 LANVHKAYYQRSDGSLAPEKQSLFATKPGGALEIPASSCILR 410 420 430 440 450 >>XP_006720525 (OMIM: 137028) PREDICTED: N-acetylgalacto (438 aa) initn: 2829 init1: 2829 opt: 2829 Z-score: 3522.7 bits: 661.0 E(85289): 1.8e-189 Smith-Waterman score: 2829; 99.8% identity (99.8% similar) in 426 aa overlap (25-450:1-426) 10 20 30 40 50 60 pF1KE3 MATESPATRRVQVAEHPRLLKLKEMFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP :::::::::::::::::::::::::::::::::::: XP_006 MFNSKFGSIPKFYVRAPGRVNIIGEHIDYCGYSVLP 10 20 30 70 80 90 100 110 120 pF1KE3 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 MAVEQDVLIAVEPVKTYALQLANTNPLYPDFSTSANNIQIDKTKPLWHNYFLCGLKGIQE 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE3 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 HFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTLTVLGRNLSKVELAEICAKSER 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE3 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 YIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPSGAVFVIANSCVEMNKAATSHF 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE3 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 NIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISLEEMLLVTEDALHPEPYNPEEI 220 230 240 250 260 270 310 320 330 340 350 360 pF1KE3 CRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEAARVLQFKKICEEAPENMVQLL :::::::::::::::::::::::::::::: ::::::::::::::::::::::::::::: XP_006 CRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEAARVLQFKKICEEAPENMVQLL 280 290 300 310 320 330 370 380 390 400 410 420 pF1KE3 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_006 GELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRLTGAGWGGCTVSMVPADKLPSF 340 350 360 370 380 390 430 440 450 pF1KE3 LANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA :::::::::::::::::::::::::::::: XP_006 LANVHKAYYQRSDGSLAPEKQSLFATKPGGALEIPASSCILR 400 410 420 430 >>XP_011519743 (OMIM: 137028) PREDICTED: N-acetylgalacto (333 aa) initn: 2114 init1: 2114 opt: 2114 Z-score: 2633.9 bits: 496.2 E(85289): 5.7e-140 Smith-Waterman score: 2114; 99.7% identity (99.7% similar) in 321 aa overlap (130-450:1-321) 100 110 120 130 140 150 pF1KE3 IDKTKPLWHNYFLCGLKGIQEHFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTL :::::::::::::::::::::::::::::: XP_011 MNCLVDGNIPPSSGLSSSSALVCCAGLVTL 10 20 30 160 170 180 190 200 210 pF1KE3 TVLGRNLSKVELAEICAKSERYIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 TVLGRNLSKVELAEICAKSERYIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPS 40 50 60 70 80 90 220 230 240 250 260 270 pF1KE3 GAVFVIANSCVEMNKAATSHFNIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 GAVFVIANSCVEMNKAATSHFNIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISL 100 110 120 130 140 150 280 290 300 310 320 330 pF1KE3 EEMLLVTEDALHPEPYNPEEICRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEA ::::::::::::::::::::::::::::::::::::::::::::::::::: :::::::: XP_011 EEMLLVTEDALHPEPYNPEEICRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEA 160 170 180 190 200 210 340 350 360 370 380 390 pF1KE3 
ARVLQFKKICEEAPENMVQLLGELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 ARVLQFKKICEEAPENMVQLLGELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRL 220 230 240 250 260 270 400 410 420 430 440 450 pF1KE3 TGAGWGGCTVSMVPADKLPSFLANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA ::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 TGAGWGGCTVSMVPADKLPSFLANVHKAYYQRSDGSLAPEKQSLFATKPGGALEIPASSC 280 290 300 310 320 330 XP_011 ILR >>XP_016877553 (OMIM: 137028) PREDICTED: N-acetylgalacto (333 aa) initn: 2114 init1: 2114 opt: 2114 Z-score: 2633.9 bits: 496.2 E(85289): 5.7e-140 Smith-Waterman score: 2114; 99.7% identity (99.7% similar) in 321 aa overlap (130-450:1-321) 100 110 120 130 140 150 pF1KE3 IDKTKPLWHNYFLCGLKGIQEHFGLSNLTGMNCLVDGNIPPSSGLSSSSALVCCAGLVTL :::::::::::::::::::::::::::::: XP_016 MNCLVDGNIPPSSGLSSSSALVCCAGLVTL 10 20 30 160 170 180 190 200 210 pF1KE3 TVLGRNLSKVELAEICAKSERYIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 TVLGRNLSKVELAEICAKSERYIGTEGGGMDQSISFLAEEGTAKLIEFSPLRATDVKLPS 40 50 60 70 80 90 220 230 240 250 260 270 pF1KE3 GAVFVIANSCVEMNKAATSHFNIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 GAVFVIANSCVEMNKAATSHFNIRVMECRLAAKLLAKYKSLQWDKVLRLEEVQAKLGISL 100 110 120 130 140 150 280 290 300 310 320 330 pF1KE3 EEMLLVTEDALHPEPYNPEEICRCLGISLEELRTQILSPNTQDVLIFKLYQWAKHVYSEA ::::::::::::::::::::::::::::::::::::::::::::::::::: :::::::: XP_016 EEMLLVTEDALHPEPYNPEEICRCLGISLEELRTQILSPNTQDVLIFKLYQRAKHVYSEA 160 170 180 190 200 210 340 350 360 370 380 390 pF1KE3 ARVLQFKKICEEAPENMVQLLGELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 ARVLQFKKICEEAPENMVQLLGELMNQSHMSCRDMYECSCPELDQLVDICRKFGAQGSRL 220 230 240 250 260 270 400 410 420 430 440 450 pF1KE3 TGAGWGGCTVSMVPADKLPSFLANVHKAYYQRSDGSLAPEKQSLFATKPGGGALVLLEA ::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 
TGAGWGGCTVSMVPADKLPSFLANVHKAYYQRSDGSLAPEKQSLFATKPGGALEIPASSC
280 290 300 310 320 330
XP_016 ILR

458 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 20:25:53 2016 done: Sun Nov 6 20:25:54 2016
 Total Scan time: 9.260 Total Display time: 0.070

Function used was FASTA [36.3.4 Apr, 2011]