FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0337, 509 aa
1>>>pF1KE0337 509 - 509 aa - 509 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.5715+/-0.000317; mu= 18.3981+/- 0.020
mean_var=94.5855+/-19.422, 0's: 0 Z-trim(118.2): 127 B-trim: 0 in 0/53
Lambda= 0.131875
statistics sampled from 30737 (30867) to 30737 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.727), E-opt: 0.2 (0.362), width: 16
Scan time: 10.280
The best scores are: opt bits E(85289)
NP_001078896 (OMIM: 250100,607574) arylsulfatase A ( 509) 3545 684.6 1.8e-196
NP_000478 (OMIM: 250100,607574) arylsulfatase A is ( 509) 3545 684.6 1.8e-196
NP_001078894 (OMIM: 250100,607574) arylsulfatase A ( 509) 3545 684.6 1.8e-196
NP_001078895 (OMIM: 250100,607574) arylsulfatase A ( 509) 3545 684.6 1.8e-196
XP_011528992 (OMIM: 250100,607574) PREDICTED: aryl ( 423) 2972 575.5 1e-163
NP_001078897 (OMIM: 250100,607574) arylsulfatase A ( 423) 2972 575.5 1e-163
XP_016884289 (OMIM: 250100,607574) PREDICTED: aryl ( 547) 2802 543.3 6.7e-154
XP_011528993 (OMIM: 250100,607574) PREDICTED: aryl ( 387) 2557 496.6 5.6e-140
NP_000503 (OMIM: 253000,612222) N-acetylgalactosam ( 522) 958 192.4 2.6e-48
XP_005256358 (OMIM: 253000,612222) PREDICTED: N-ac ( 575) 946 190.2 1.4e-47
NP_001310473 (OMIM: 253000,612222) N-acetylgalacto ( 528) 889 179.3 2.4e-44
XP_016878601 (OMIM: 253000,612222) PREDICTED: N-ac ( 508) 877 177.0 1.1e-43
XP_016878600 (OMIM: 253000,612222) PREDICTED: N-ac ( 511) 877 177.0 1.1e-43
XP_011521284 (OMIM: 253000,612222) PREDICTED: N-ac ( 581) 877 177.1 1.2e-43
XP_016879857 (OMIM: 610008) PREDICTED: arylsulfata ( 413) 709 145.0 4.1e-34
XP_011522847 (OMIM: 610008) PREDICTED: arylsulfata ( 413) 709 145.0 4.1e-34
XP_011522845 (OMIM: 610008) PREDICTED: arylsulfata ( 427) 709 145.0 4.2e-34
XP_011522846 (OMIM: 610008) PREDICTED: arylsulfata ( 427) 709 145.0 4.2e-34
XP_016879855 (OMIM: 610008) PREDICTED: arylsulfata ( 461) 709 145.0 4.4e-34
XP_016879856 (OMIM: 610008) PREDICTED: arylsulfata ( 461) 709 145.0 4.4e-34
XP_011522843 (OMIM: 610008) PREDICTED: arylsulfata ( 488) 709 145.1 4.6e-34
XP_011522844 (OMIM: 610008) PREDICTED: arylsulfata ( 488) 709 145.1 4.6e-34
XP_016879850 (OMIM: 610008) PREDICTED: arylsulfata ( 525) 709 145.1 4.8e-34
NP_001254656 (OMIM: 610008) arylsulfatase G [Homo ( 525) 709 145.1 4.8e-34
XP_016879853 (OMIM: 610008) PREDICTED: arylsulfata ( 525) 709 145.1 4.8e-34
XP_005257227 (OMIM: 610008) PREDICTED: arylsulfata ( 525) 709 145.1 4.8e-34
NP_055775 (OMIM: 610008) arylsulfatase G [Homo sap ( 525) 709 145.1 4.8e-34
XP_016879852 (OMIM: 610008) PREDICTED: arylsulfata ( 525) 709 145.1 4.8e-34
XP_016879851 (OMIM: 610008) PREDICTED: arylsulfata ( 525) 709 145.1 4.8e-34
XP_016879854 (OMIM: 610008) PREDICTED: arylsulfata ( 525) 709 145.1 4.8e-34
XP_011522839 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 709 145.1 5e-34
XP_011522838 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 709 145.1 5e-34
XP_011522840 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 709 145.1 5e-34
XP_011522837 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 709 145.1 5e-34
XP_016879849 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 709 145.1 5e-34
XP_006721840 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 709 145.1 5e-34
XP_011522842 (OMIM: 610008) PREDICTED: arylsulfata ( 551) 698 143.0 2.1e-33
XP_005257229 (OMIM: 610008) PREDICTED: arylsulfata ( 344) 692 141.7 3.3e-33
XP_006721842 (OMIM: 610008) PREDICTED: arylsulfata ( 329) 689 141.1 4.8e-33
NP_001310472 (OMIM: 253000,612222) N-acetylgalacto ( 337) 554 115.4 2.6e-25
XP_016878602 (OMIM: 253000,612222) PREDICTED: N-ac ( 390) 541 113.0 1.6e-24
XP_011541694 (OMIM: 253200,611542) PREDICTED: aryl ( 405) 517 108.4 4e-23
NP_942002 (OMIM: 253200,611542) arylsulfatase B is ( 413) 517 108.5 4e-23
XP_011541693 (OMIM: 253200,611542) PREDICTED: aryl ( 442) 517 108.5 4.2e-23
XP_016864960 (OMIM: 253200,611542) PREDICTED: aryl ( 451) 517 108.5 4.3e-23
XP_011541692 (OMIM: 253200,611542) PREDICTED: aryl ( 533) 517 108.6 4.9e-23
NP_000037 (OMIM: 253200,611542) arylsulfatase B is ( 533) 517 108.6 4.9e-23
XP_011541695 (OMIM: 253200,611542) PREDICTED: aryl ( 382) 503 105.8 2.4e-22
NP_000038 (OMIM: 300180,302950) arylsulfatase E is ( 589) 422 90.5 1.4e-17
XP_005274576 (OMIM: 300180,302950) PREDICTED: aryl ( 589) 422 90.5 1.4e-17
>>NP_001078896 (OMIM: 250100,607574) arylsulfatase A iso (509 aa)
initn: 3545 init1: 3545 opt: 3545 Z-score: 3648.4 bits: 684.6 E(85289): 1.8e-196
Smith-Waterman score: 3545; 100.0% identity (100.0% similar) in 509 aa overlap (1-509:1-509)
10 20 30 40 50 60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE0 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
430 440 450 460 470 480
490 500
pF1KE0 RGEDPALQICCHPGCTPRPACCHCPDPHA
:::::::::::::::::::::::::::::
NP_001 RGEDPALQICCHPGCTPRPACCHCPDPHA
490 500
>>NP_000478 (OMIM: 250100,607574) arylsulfatase A isofor (509 aa)
initn: 3545 init1: 3545 opt: 3545 Z-score: 3648.4 bits: 684.6 E(85289): 1.8e-196
Smith-Waterman score: 3545; 100.0% identity (100.0% similar) in 509 aa overlap (1-509:1-509)
10 20 30 40 50 60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE0 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
430 440 450 460 470 480
490 500
pF1KE0 RGEDPALQICCHPGCTPRPACCHCPDPHA
:::::::::::::::::::::::::::::
NP_000 RGEDPALQICCHPGCTPRPACCHCPDPHA
490 500
>>NP_001078894 (OMIM: 250100,607574) arylsulfatase A iso (509 aa)
initn: 3545 init1: 3545 opt: 3545 Z-score: 3648.4 bits: 684.6 E(85289): 1.8e-196
Smith-Waterman score: 3545; 100.0% identity (100.0% similar) in 509 aa overlap (1-509:1-509)
10 20 30 40 50 60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE0 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
430 440 450 460 470 480
490 500
pF1KE0 RGEDPALQICCHPGCTPRPACCHCPDPHA
:::::::::::::::::::::::::::::
NP_001 RGEDPALQICCHPGCTPRPACCHCPDPHA
490 500
>>NP_001078895 (OMIM: 250100,607574) arylsulfatase A iso (509 aa)
initn: 3545 init1: 3545 opt: 3545 Z-score: 3648.4 bits: 684.6 E(85289): 1.8e-196
Smith-Waterman score: 3545; 100.0% identity (100.0% similar) in 509 aa overlap (1-509:1-509)
10 20 30 40 50 60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPACHASS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE0 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVA
430 440 450 460 470 480
490 500
pF1KE0 RGEDPALQICCHPGCTPRPACCHCPDPHA
:::::::::::::::::::::::::::::
NP_001 RGEDPALQICCHPGCTPRPACCHCPDPHA
490 500
>>XP_011528992 (OMIM: 250100,607574) PREDICTED: arylsulf (423 aa)
initn: 2972 init1: 2972 opt: 2972 Z-score: 3060.2 bits: 575.5 E(85289): 1e-163
Smith-Waterman score: 2972; 100.0% identity (100.0% similar) in 423 aa overlap (87-509:1-423)
60 70 80 90 100 110
pF1KE0 GGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAAR
::::::::::::::::::::::::::::::
XP_011 MGMYPGVLVPSSRGGLPLEEVTVAEVLAAR
10 20 30
120 130 140 150 160 170
pF1KE0 GYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQ
40 50 60 70 80 90
180 190 200 210 220 230
pF1KE0 GLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFS
100 110 120 130 140 150
240 250 260 270 280 290
pF1KE0 GQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGC
160 170 180 190 200 210
300 310 320 330 340 350
pF1KE0 SGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 SGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLD
220 230 240 250 260 270
360 370 380 390 400 410
pF1KE0 GFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC
280 290 300 310 320 330
420 430 440 450 460 470
pF1KE0 HASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 HASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGP
340 350 360 370 380 390
480 490 500
pF1KE0 SQVARGEDPALQICCHPGCTPRPACCHCPDPHA
:::::::::::::::::::::::::::::::::
XP_011 SQVARGEDPALQICCHPGCTPRPACCHCPDPHA
400 410 420
>>NP_001078897 (OMIM: 250100,607574) arylsulfatase A iso (423 aa)
initn: 2972 init1: 2972 opt: 2972 Z-score: 3060.2 bits: 575.5 E(85289): 1e-163
Smith-Waterman score: 2972; 100.0% identity (100.0% similar) in 423 aa overlap (87-509:1-423)
60 70 80 90 100 110
pF1KE0 GGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAAR
::::::::::::::::::::::::::::::
NP_001 MGMYPGVLVPSSRGGLPLEEVTVAEVLAAR
10 20 30
120 130 140 150 160 170
pF1KE0 GYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQ
40 50 60 70 80 90
180 190 200 210 220 230
pF1KE0 GLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFS
100 110 120 130 140 150
240 250 260 270 280 290
pF1KE0 GQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGC
160 170 180 190 200 210
300 310 320 330 340 350
pF1KE0 SGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 SGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLD
220 230 240 250 260 270
360 370 380 390 400 410
pF1KE0 GFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC
280 290 300 310 320 330
420 430 440 450 460 470
pF1KE0 HASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGP
340 350 360 370 380 390
480 490 500
pF1KE0 SQVARGEDPALQICCHPGCTPRPACCHCPDPHA
:::::::::::::::::::::::::::::::::
NP_001 SQVARGEDPALQICCHPGCTPRPACCHCPDPHA
400 410 420
>>XP_016884289 (OMIM: 250100,607574) PREDICTED: arylsulf (547 aa)
initn: 2797 init1: 2797 opt: 2802 Z-score: 2884.0 bits: 543.3 E(85289): 6.7e-154
Smith-Waterman score: 3403; 93.0% identity (93.0% similar) in 541 aa overlap (1-503:1-541)
10 20 30 40 50 60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
310 320 330 340 350 360
370 380 390 400
pF1KE0 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQG----------------
::::::::::::::::::::::::::::::::::::::::::::
XP_016 SPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGNPSPWIPPPDLLTPPR
370 380 390 400 410 420
410 420 430 440
pF1KE0 ----------------------SAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYN
::::::::::::::::::::::::::::::::::::::
XP_016 SPRSLAPPLALALCTELAPSPGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYN
430 440 450 460 470 480
450 460 470 480 490 500
pF1KE0 LLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 LLGGVAGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACC
490 500 510 520 530 540
pF1KE0 HCPDPHA
:
XP_016 HCPDPHA
>>XP_011528993 (OMIM: 250100,607574) PREDICTED: arylsulf (387 aa)
initn: 2580 init1: 2553 opt: 2557 Z-score: 2634.0 bits: 496.6 E(85289): 5.6e-140
Smith-Waterman score: 2557; 96.6% identity (97.4% similar) in 387 aa overlap (1-382:1-384)
10 20 30 40 50 60
pF1KE0 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLR
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 FTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 GMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 IPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSF
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 AERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLL
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE0 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 RCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDL
310 320 330 340 350 360
370 380 390 400 410
pF1KE0 SPLLLGTGKS-----PRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPA
:::::::::. : :.: :. :
XP_011 SPLLLGTGKALPTVIPLQTL---PATPPAL
370 380
>>NP_000503 (OMIM: 253000,612222) N-acetylgalactosamine- (522 aa)
initn: 709 init1: 323 opt: 958 Z-score: 988.2 bits: 192.4 E(85289): 2.6e-48
Smith-Waterman score: 968; 37.1% identity (59.9% similar) in 534 aa overlap (9-506:13-512)
10 20 30 40 50
pF1KE0 MSMGAPRSLLLAL-AAGLAVA---RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLD
:::.: :::.... .::::.:.. ::.:.:::: ::.:: :::::
NP_000 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD
10 20 30 40 50 60
60 70 80 90 100
pF1KE0 QLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYP------GVLVPSS-RGGLPLE
..:: :: : .:: ::.::::::::::::.: :.: .. .:. ::.:
NP_000 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE0 EVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFP
: . :.: ::.. ..:::::: :. : : ..:: ...: : : :: .: .
NP_000 EQLLPELLKKAGYVSKIVGKWHLGHRPQ--FHPLKHGFDEWFGSPNCH-FGPYDNKA--R
130 140 150 160 170
170 180 190 200 210 220
pF1KE0 PATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEAR----YMAFAHDLMADAQRQDRPF
: : : .: . : : : :: :. : :.. :. .::
NP_000 PNIPVYR--DWEMV------GRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARH-HPF
180 190 200 210 220
230 240 250 260 270 280
pF1KE0 FLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFT
:::.: :: : .... : : :: .::.. :.: ..: .. . :: . ..:.:.::
NP_000 FLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFT
230 240 250 260 270 280
290 300 310 320 330
pF1KE0 ADNGPETMRMS-RGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPG-VTHELASSLDLL
.::: . .:: .: . ::: ::.:::.::::::.::::.. : :.:.:.: .::.
NP_000 SDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLF
290 300 310 320 330 340
340 350 360 370 380 390
pF1KE0 PTLAALAG-APLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRG--VFAVRTGKY
: :::: .: . ..::..: : :: :. . .:.: :: ..:. :..
NP_000 TTSLALAGLTPPSDRAIDGLNLLPTLL-QGRLMDRPIFYY-------RGDTLMAATLGQH
350 360 370 380 390
400 410 420 430 440
pF1KE0 KAHFFTQGSAHSDTTAD-PACHAS--SSLTAH------EPPLLYDLSKDPGENYNLLGGV
::::.: .. . : .. :..:.: . ::.. :..:::: . :
NP_000 KAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPL----
400 410 420 430 440 450
450 460 470 480 490 500
pF1KE0 AGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQIC-------CHPGCTPRPA
. :. : .::... . : . :.. :.: : :..: :::
NP_000 SFASAEYQEALSRITSVVQQHQEALV--PAQ------PQLNVCNWAVMNWAPPGCEKLGK
460 470 480 490 500
pF1KE0 CCHCPDPHA
: :.
NP_000 CLTPPESIPKKCLWSH
510 520
>>XP_005256358 (OMIM: 253000,612222) PREDICTED: N-acetyl (575 aa)
initn: 657 init1: 323 opt: 946 Z-score: 975.3 bits: 190.2 E(85289): 1.4e-47
Smith-Waterman score: 961; 37.4% identity (61.4% similar) in 529 aa overlap (9-499:13-519)
10 20 30 40 50
pF1KE0 MSMGAPRSLLLAL-AAGLAVA---RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLD
:::.: :::.... .::::.:.. ::.:.:::: ::.:: :::::
XP_005 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD
10 20 30 40 50 60
60 70 80 90 100
pF1KE0 QLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYP------GVLVPSS-RGGLPLE
..:: :: : .:: ::.::::::::::::.: :.: .. .:. ::.:
XP_005 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS
70 80 90 100 110 120
110 120 130 140 150 160
pF1KE0 EVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFP
: . :.: ::.. ..:::::: :. : : ..:: ...: : : :: .: .
XP_005 EQLLPELLKKAGYVSKIVGKWHLGHRPQ--FHPLKHGFDEWFGSPNCH-FGPYDNKA--R
130 140 150 160 170
170 180 190 200 210 220
pF1KE0 PATPCDGGCDQ-GLVPIPLLANLSV-EAQPPWLPGLEARYMAFAHDLMADAQRQDR--PF
: : .. : . ::.. ::. : :. : :.. .:: : ::
XP_005 PNIPVYRDWEMVGRYYEEFPINLKTGEAN------LTQIYLQEALDFI---KRQARHHPF
180 190 200 210 220
230 240 250 260 270 280
pF1KE0 FLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFT
:::.: :: : .... : : :: .::.. :.: ..: .. . :: . ..:.:.::
XP_005 FLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFT
230 240 250 260 270 280
290 300 310 320 330
pF1KE0 ADNGPETMRMS-RGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPG-VTHELASSLDLL
.::: . .:: .: . ::: ::.:::.::::::.::::.. : :.:.:.: .::.
XP_005 SDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLF
290 300 310 320 330 340
340 350 360 370 380 390
pF1KE0 PTLAALAG-APLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRG--VFAVRTGKY
: :::: .: . ..::..: : :: :. . .:.: :: ..:. :..
XP_005 TTSLALAGLTPPSDRAIDGLNLLPTLL-QGRLMDRPIFYY-------RGDTLMAATLGQH
350 360 370 380 390
400 410 420 430 440
pF1KE0 KAHFFTQGSAHSDTTAD-PACHAS--SSLTAH------EPPLLYDLSKDPGENYNLLGGV
::::.: .. . : .. :..:.: . ::.. :..:::: . : .
XP_005 KAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFAS
400 410 420 430 440 450
450 460 470 480 490
pF1KE0 AG---ATPEVLQALKQLQ--LLKAQLDAAV----TFGPSQVARGEDPALQICCHPGCTPR
: : .. ....: : :. :: . : ...:.: . . .: : : :
XP_005 AEYQEALSRITSVVQQHQEALVPAQPQLNVCNWAVMAPTQSSTSVQPDKTRPCSPPVEKR
460 470 480 490 500 510
500
pF1KE0 PACCHCPDPHA
:
XP_005 PHHASIGQKHKHRRASKVSLKSVFRPQRMRRRLVQPWGPQQSGPLRSSFSVPCLGVP
520 530 540 550 560 570
509 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 15:20:23 2016 done: Thu Nov 3 15:20:24 2016
Total Scan time: 10.280 Total Display time: 0.090
Function used was FASTA [36.3.4 Apr, 2011]