Result of FASTA (omim) for pFN21AE0337
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0337, 509 aa
  1>>>pF1KE0337 509 - 509 aa - 509 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.5715+/-0.000317; mu= 18.3981+/- 0.020
 mean_var=94.5855+/-19.422, 0's: 0 Z-trim(118.2): 127  B-trim: 0 in 0/53
 Lambda= 0.131875
 statistics sampled from 30737 (30867) to 30737 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.727), E-opt: 0.2 (0.362), width:  16
 Scan time: 10.280

The best scores are:                                      opt bits E(85289)
NP_001078896 (OMIM: 250100,607574) arylsulfatase A ( 509) 3545 684.6 1.8e-196
NP_000478 (OMIM: 250100,607574) arylsulfatase A is ( 509) 3545 684.6 1.8e-196
NP_001078894 (OMIM: 250100,607574) arylsulfatase A ( 509) 3545 684.6 1.8e-196
NP_001078895 (OMIM: 250100,607574) arylsulfatase A ( 509) 3545 684.6 1.8e-196
XP_011528992 (OMIM: 250100,607574) PREDICTED: aryl ( 423) 2972 575.5  1e-163
NP_001078897 (OMIM: 250100,607574) arylsulfatase A ( 423) 2972 575.5  1e-163
XP_016884289 (OMIM: 250100,607574) PREDICTED: aryl ( 547) 2802 543.3 6.7e-154
XP_011528993 (OMIM: 250100,607574) PREDICTED: aryl ( 387) 2557 496.6 5.6e-140
NP_000503 (OMIM: 253000,612222) N-acetylgalactosam ( 522)  958 192.4 2.6e-48
XP_005256358 (OMIM: 253000,612222) PREDICTED: N-ac ( 575)  946 190.2 1.4e-47
NP_001310473 (OMIM: 253000,612222) N-acetylgalacto ( 528)  889 179.3 2.4e-44
XP_016878601 (OMIM: 253000,612222) PREDICTED: N-ac ( 508)  877 177.0 1.1e-43
XP_016878600 (OMIM: 253000,612222) PREDICTED: N-ac ( 511)  877 177.0 1.1e-43
XP_011521284 (OMIM: 253000,612222) PREDICTED: N-ac ( 581)  877 177.1 1.2e-43
XP_016879857 (OMIM: 610008) PREDICTED: arylsulfata ( 413)  709 145.0 4.1e-34
XP_011522847 (OMIM: 610008) PREDICTED: arylsulfata ( 413)  709 145.0 4.1e-34
XP_011522845 (OMIM: 610008) PREDICTED: arylsulfata ( 427)  709 145.0 4.2e-34
XP_011522846 (OMIM: 610008) PREDICTED: arylsulfata ( 427)  709 145.0 4.2e-34
XP_016879855 (OMIM: 610008) PREDICTED: arylsulfata ( 461)  709 145.0 4.4e-34
XP_016879856 (OMIM: 610008) PREDICTED: arylsulfata ( 461)  709 145.0 4.4e-34
XP_011522843 (OMIM: 610008) PREDICTED: arylsulfata ( 488)  709 145.1 4.6e-34
XP_011522844 (OMIM: 610008) PREDICTED: arylsulfata ( 488)  709 145.1 4.6e-34
XP_016879850 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  709 145.1 4.8e-34
NP_001254656 (OMIM: 610008) arylsulfatase G [Homo  ( 525)  709 145.1 4.8e-34
XP_016879853 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  709 145.1 4.8e-34
XP_005257227 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  709 145.1 4.8e-34
NP_055775 (OMIM: 610008) arylsulfatase G [Homo sap ( 525)  709 145.1 4.8e-34
XP_016879852 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  709 145.1 4.8e-34
XP_016879851 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  709 145.1 4.8e-34
XP_016879854 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  709 145.1 4.8e-34
XP_011522839 (OMIM: 610008) PREDICTED: arylsulfata ( 552)  709 145.1   5e-34
XP_011522838 (OMIM: 610008) PREDICTED: arylsulfata ( 552)  709 145.1   5e-34
XP_011522840 (OMIM: 610008) PREDICTED: arylsulfata ( 552)  709 145.1   5e-34
XP_011522837 (OMIM: 610008) PREDICTED: arylsulfata ( 552)  709 145.1   5e-34
XP_016879849 (OMIM: 610008) PREDICTED: arylsulfata ( 552)  709 145.1   5e-34
XP_006721840 (OMIM: 610008) PREDICTED: arylsulfata ( 552)  709 145.1   5e-34
XP_011522842 (OMIM: 610008) PREDICTED: arylsulfata ( 551)  698 143.0 2.1e-33
XP_005257229 (OMIM: 610008) PREDICTED: arylsulfata ( 344)  692 141.7 3.3e-33
XP_006721842 (OMIM: 610008) PREDICTED: arylsulfata ( 329)  689 141.1 4.8e-33
NP_001310472 (OMIM: 253000,612222) N-acetylgalacto ( 337)  554 115.4 2.6e-25
XP_016878602 (OMIM: 253000,612222) PREDICTED: N-ac ( 390)  541 113.0 1.6e-24
XP_011541694 (OMIM: 253200,611542) PREDICTED: aryl ( 405)  517 108.4   4e-23
NP_942002 (OMIM: 253200,611542) arylsulfatase B is ( 413)  517 108.5   4e-23
XP_011541693 (OMIM: 253200,611542) PREDICTED: aryl ( 442)  517 108.5 4.2e-23
XP_016864960 (OMIM: 253200,611542) PREDICTED: aryl ( 451)  517 108.5 4.3e-23
XP_011541692 (OMIM: 253200,611542) PREDICTED: aryl ( 533)  517 108.6 4.9e-23
NP_000037 (OMIM: 253200,611542) arylsulfatase B is ( 533)  517 108.6 4.9e-23
XP_011541695 (OMIM: 253200,611542) PREDICTED: aryl ( 382)  503 105.8 2.4e-22
NP_000038 (OMIM: 300180,302950) arylsulfatase E is ( 589)  422 90.5 1.4e-17
XP_005274576 (OMIM: 300180,302950) PREDICTED: aryl ( 589)  422 90.5 1.4e-17


>>NP_001078896 (OMIM: 250100,607574) arylsulfatase A iso  (509 aa)
 initn: 3545 init1: 3545 opt: 3545  Z-score: 3648.4  bits: 684.6 E(85289): 1.8e-196
Smith-Waterman score: 3545; 100.0% identity (100.0% similar) in 509 aa overlap (1-509:1-509)

               10        20        30        40        50        60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE0 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
              430       440       450       460       470       480

              490       500         
pF1KE0 RGEDPALQICCHPGCTPRPACCHCPDPHA
       :::::::::::::::::::::::::::::
NP_001 RGEDPALQICCHPGCTPRPACCHCPDPHA
              490       500         

>>NP_000478 (OMIM: 250100,607574) arylsulfatase A isofor  (509 aa)
 initn: 3545 init1: 3545 opt: 3545  Z-score: 3648.4  bits: 684.6 E(85289): 1.8e-196
Smith-Waterman score: 3545; 100.0% identity (100.0% similar) in 509 aa overlap (1-509:1-509)

               10        20        30        40        50        60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE0 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
              430       440       450       460       470       480

              490       500         
pF1KE0 RGEDPALQICCHPGCTPRPACCHCPDPHA
       :::::::::::::::::::::::::::::
NP_000 RGEDPALQICCHPGCTPRPACCHCPDPHA
              490       500         

>>NP_001078894 (OMIM: 250100,607574) arylsulfatase A iso  (509 aa)
 initn: 3545 init1: 3545 opt: 3545  Z-score: 3648.4  bits: 684.6 E(85289): 1.8e-196
Smith-Waterman score: 3545; 100.0% identity (100.0% similar) in 509 aa overlap (1-509:1-509)

               10        20        30        40        50        60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE0 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
              430       440       450       460       470       480

              490       500         
pF1KE0 RGEDPALQICCHPGCTPRPACCHCPDPHA
       :::::::::::::::::::::::::::::
NP_001 RGEDPALQICCHPGCTPRPACCHCPDPHA
              490       500         

>>NP_001078895 (OMIM: 250100,607574) arylsulfatase A iso  (509 aa)
 initn: 3545 init1: 3545 opt: 3545  Z-score: 3648.4  bits: 684.6 E(85289): 1.8e-196
Smith-Waterman score: 3545; 100.0% identity (100.0% similar) in 509 aa overlap (1-509:1-509)

               10        20        30        40        50        60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KE0 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
              430       440       450       460       470       480

              490       500         
pF1KE0 RGEDPALQICCHPGCTPRPACCHCPDPHA
       :::::::::::::::::::::::::::::
NP_001 RGEDPALQICCHPGCTPRPACCHCPDPHA
              490       500         

>>XP_011528992 (OMIM: 250100,607574) PREDICTED: arylsulf  (423 aa)
 initn: 2972 init1: 2972 opt: 2972  Z-score: 3060.2  bits: 575.5 E(85289): 1e-163
Smith-Waterman score: 2972; 100.0% identity (100.0% similar) in 423 aa overlap (87-509:1-423)

         60        70        80        90       100       110      
pF1KE0 GGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAAR
                                     ::::::::::::::::::::::::::::::
XP_011                               MGMYPGVLVPSSRGGLPLEEVTVAEVLAAR
                                             10        20        30

        120       130       140       150       160       170      
pF1KE0 GYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQ
               40        50        60        70        80        90

        180       190       200       210       220       230      
pF1KE0 GLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFS
              100       110       120       130       140       150

        240       250       260       270       280       290      
pF1KE0 GQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGC
              160       170       180       190       200       210

        300       310       320       330       340       350      
pF1KE0 SGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 SGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLD
              220       230       240       250       260       270

        360       370       380       390       400       410      
pF1KE0 GFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC
              280       290       300       310       320       330

        420       430       440       450       460       470      
pF1KE0 HASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 HASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGP
              340       350       360       370       380       390

        480       490       500         
pF1KE0 SQVARGEDPALQICCHPGCTPRPACCHCPDPHA
       :::::::::::::::::::::::::::::::::
XP_011 SQVARGEDPALQICCHPGCTPRPACCHCPDPHA
              400       410       420   

>>NP_001078897 (OMIM: 250100,607574) arylsulfatase A iso  (423 aa)
 initn: 2972 init1: 2972 opt: 2972  Z-score: 3060.2  bits: 575.5 E(85289): 1e-163
Smith-Waterman score: 2972; 100.0% identity (100.0% similar) in 423 aa overlap (87-509:1-423)

         60        70        80        90       100       110      
pF1KE0 GGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAAR
                                     ::::::::::::::::::::::::::::::
NP_001                               MGMYPGVLVPSSRGGLPLEEVTVAEVLAAR
                                             10        20        30

        120       130       140       150       160       170      
pF1KE0 GYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQ
               40        50        60        70        80        90

        180       190       200       210       220       230      
pF1KE0 GLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFS
              100       110       120       130       140       150

        240       250       260       270       280       290      
pF1KE0 GQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGC
              160       170       180       190       200       210

        300       310       320       330       340       350      
pF1KE0 SGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLD
              220       230       240       250       260       270

        360       370       380       390       400       410      
pF1KE0 GFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC
              280       290       300       310       320       330

        420       430       440       450       460       470      
pF1KE0 HASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGP
              340       350       360       370       380       390

        480       490       500         
pF1KE0 SQVARGEDPALQICCHPGCTPRPACCHCPDPHA
       :::::::::::::::::::::::::::::::::
NP_001 SQVARGEDPALQICCHPGCTPRPACCHCPDPHA
              400       410       420   

>>XP_016884289 (OMIM: 250100,607574) PREDICTED: arylsulf  (547 aa)
 initn: 2797 init1: 2797 opt: 2802  Z-score: 2884.0  bits: 543.3 E(85289): 6.7e-154
Smith-Waterman score: 3403; 93.0% identity (93.0% similar) in 541 aa overlap (1-503:1-541)

               10        20        30        40        50        60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
              310       320       330       340       350       360

              370       380       390       400                    
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQG----------------
       ::::::::::::::::::::::::::::::::::::::::::::                
XP_016 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGNPSPWIPPPDLLTPPR
              370       380       390       400       410       420

                                410       420       430       440  
pF1KE0 ----------------------SAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYN
                             ::::::::::::::::::::::::::::::::::::::
XP_016 SPRSLAPPLALALCTELAPSPGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYN
              430       440       450       460       470       480

            450       460       470       480       490       500  
pF1KE0 LLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 LLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACC
              490       500       510       520       530       540

              
pF1KE0 HCPDPHA
       :      
XP_016 HCPDPHA
              

>>XP_011528993 (OMIM: 250100,607574) PREDICTED: arylsulf  (387 aa)
 initn: 2580 init1: 2553 opt: 2557  Z-score: 2634.0  bits: 496.6 E(85289): 5.6e-140
Smith-Waterman score: 2557; 96.6% identity (97.4% similar) in 387 aa overlap (1-382:1-384)

               10        20        30        40        50        60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
              310       320       330       340       350       360

              370            380       390       400       410     
pF1KE0 SPLLLGTGKS-----PRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPA
       :::::::::.     : :.:   :. :                                 
XP_011 SPLLLGTGKALPTVIPLQTL---PATPPAL                              
              370       380                                        

>>NP_000503 (OMIM: 253000,612222) N-acetylgalactosamine-  (522 aa)
 initn: 709 init1: 323 opt: 958  Z-score: 988.2  bits: 192.4 E(85289): 2.6e-48
Smith-Waterman score: 968; 37.1% identity (59.9% similar) in 534 aa overlap (9-506:13-512)

                   10         20           30        40        50  
pF1KE0     MSMGAPRSLLLAL-AAGLAVA---RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLD
                   :::.: :::....   .::::.:.. ::.:.:::: ::.::  :::::
NP_000 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD
               10        20        30        40        50        60

             60        70        80        90              100     
pF1KE0 QLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYP------GVLVPSS-RGGLPLE
       ..:: :: : .::    ::.::::::::::::.: :.:       .. .:.   ::.:  
NP_000 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS
               70        80        90       100       110       120

         110       120       130       140       150       160     
pF1KE0 EVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFP
       :  . :.:   ::.. ..::::::  :.  : : ..:: ...: :  :  :: .: .   
NP_000 EQLLPELLKKAGYVSKIVGKWHLGHRPQ--FHPLKHGFDEWFGSPNCH-FGPYDNKA--R
              130       140         150       160        170       

         170       180       190       200           210       220 
pF1KE0 PATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEAR----YMAFAHDLMADAQRQDRPF
       :  :     :  .:      .   :  :  :   ::     :.  : :..    :. .::
NP_000 PNIPVYR--DWEMV------GRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARH-HPF
         180               190       200       210       220       

             230       240       250       260       270       280 
pF1KE0 FLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFT
       :::.:   :: : .... :   : :: .::.. :.: ..: ..  . :: . ..:.:.::
NP_000 FLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFT
        230       240       250       260       270       280      

             290        300       310       320        330         
pF1KE0 ADNGPETMRMS-RGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPG-VTHELASSLDLL
       .:::   .    .:: .: . ::: ::.:::.::::::.::::.. : :.:.:.: .::.
NP_000 SDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLF
        290       300       310       320       330       340      

     340        350       360       370       380         390      
pF1KE0 PTLAALAG-APLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRG--VFAVRTGKY
        :  :::: .:  . ..::..: : ::  :.   . .:.:       ::  ..:.  :..
NP_000 TTSLALAGLTPPSDRAIDGLNLLPTLL-QGRLMDRPIFYY-------RGDTLMAATLGQH
        350       360       370        380              390        

        400       410          420             430       440       
pF1KE0 KAHFFTQGSAHSDTTAD-PACHAS--SSLTAH------EPPLLYDLSKDPGENYNLLGGV
       ::::.:  ..  .       : ..  :..:.:      . ::.. :..:::: . :    
NP_000 KAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPL----
      400       410       420       430       440       450        

       450       460       470       480       490              500
pF1KE0 AGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQIC-------CHPGCTPRPA
       . :. :  .::...  .  : . :..  :.:      : :..:         :::     
NP_000 SFASAEYQEALSRITSVVQQHQEALV--PAQ------PQLNVCNWAVMNWAPPGCEKLGK
          460       470       480               490       500      

                       
pF1KE0 CCHCPDPHA       
       :   :.          
NP_000 CLTPPESIPKKCLWSH
        510       520  

>>XP_005256358 (OMIM: 253000,612222) PREDICTED: N-acetyl  (575 aa)
 initn: 657 init1: 323 opt: 946  Z-score: 975.3  bits: 190.2 E(85289): 1.4e-47
Smith-Waterman score: 961; 37.4% identity (61.4% similar) in 529 aa overlap (9-499:13-519)

                   10         20           30        40        50  
pF1KE0     MSMGAPRSLLLAL-AAGLAVA---RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLD
                   :::.: :::....   .::::.:.. ::.:.:::: ::.::  :::::
XP_005 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD
               10        20        30        40        50        60

             60        70        80        90              100     
pF1KE0 QLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYP------GVLVPSS-RGGLPLE
       ..:: :: : .::    ::.::::::::::::.: :.:       .. .:.   ::.:  
XP_005 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS
               70        80        90       100       110       120

         110       120       130       140       150       160     
pF1KE0 EVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFP
       :  . :.:   ::.. ..::::::  :.  : : ..:: ...: :  :  :: .: .   
XP_005 EQLLPELLKKAGYVSKIVGKWHLGHRPQ--FHPLKHGFDEWFGSPNCH-FGPYDNKA--R
              130       140         150       160        170       

         170        180        190       200       210         220 
pF1KE0 PATPCDGGCDQ-GLVPIPLLANLSV-EAQPPWLPGLEARYMAFAHDLMADAQRQDR--PF
       :  :     .. :     .  ::.. ::.      :   :.  : :..   .:: :  ::
XP_005 PNIPVYRDWEMVGRYYEEFPINLKTGEAN------LTQIYLQEALDFI---KRQARHHPF
         180       190       200             210          220      

             230       240       250       260       270       280 
pF1KE0 FLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFT
       :::.:   :: : .... :   : :: .::.. :.: ..: ..  . :: . ..:.:.::
XP_005 FLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFT
        230       240       250       260       270       280      

             290        300       310       320        330         
pF1KE0 ADNGPETMRMS-RGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPG-VTHELASSLDLL
       .:::   .    .:: .: . ::: ::.:::.::::::.::::.. : :.:.:.: .::.
XP_005 SDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLF
        290       300       310       320       330       340      

     340        350       360       370       380         390      
pF1KE0 PTLAALAG-APLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRG--VFAVRTGKY
        :  :::: .:  . ..::..: : ::  :.   . .:.:       ::  ..:.  :..
XP_005 TTSLALAGLTPPSDRAIDGLNLLPTLL-QGRLMDRPIFYY-------RGDTLMAATLGQH
        350       360       370        380              390        

        400       410          420             430       440       
pF1KE0 KAHFFTQGSAHSDTTAD-PACHAS--SSLTAH------EPPLLYDLSKDPGENYNLLGGV
       ::::.:  ..  .       : ..  :..:.:      . ::.. :..:::: . :  . 
XP_005 KAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFAS
      400       410       420       430       440       450        

          450       460         470           480       490        
pF1KE0 AG---ATPEVLQALKQLQ--LLKAQLDAAV----TFGPSQVARGEDPALQICCHPGCTPR
       :    :  .. ....: :  :. :: .  :    ...:.: . . .:     : :    :
XP_005 AEYQEALSRITSVVQQHQEALVPAQPQLNVCNWAVMAPTQSSTSVQPDKTRPCSPPVEKR
      460       470       480       490       500       510        

      500                                                       
pF1KE0 PACCHCPDPHA                                              
       :                                                        
XP_005 PHHASIGQKHKHRRASKVSLKSVFRPQRMRRRLVQPWGPQQSGPLRSSFSVPCLGVP
      520       530       540       550       560       570     




509 residues in 1 query   sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov  3 15:20:23 2016 done: Thu Nov  3 15:20:24 2016
 Total Scan time: 10.280 Total Display time:  0.090

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com