FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5681, 522 aa
  1>>>pF1KE5681 522 - 522 aa - 522 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2257+/-0.000348; mu= 18.8185+/- 0.022
 mean_var=75.1442+/-15.215, 0's: 0 Z-trim(115.6): 126 B-trim: 491 in 1/53
 Lambda= 0.147954
 statistics sampled from 26039 (26171) to 26039 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.676), E-opt: 0.2 (0.307), width: 16
 Scan time: 9.390

The best scores are:                                       opt bits E(85289)
NP_000503 (OMIM: 253000,612222) N-acetylgalactosam ( 522) 3665 791.9        0
XP_005256358 (OMIM: 253000,612222) PREDICTED: N-ac ( 575) 3443 744.5 1.9e-214
NP_001310473 (OMIM: 253000,612222) N-acetylgalacto ( 528) 3403 736.0 6.6e-212
XP_011521284 (OMIM: 253000,612222) PREDICTED: N-ac ( 581) 3181 688.6 1.3e-197
XP_016878601 (OMIM: 253000,612222) PREDICTED: N-ac ( 508) 3180 688.4 1.4e-197
XP_016878600 (OMIM: 253000,612222) PREDICTED: N-ac ( 511) 3176 687.5 2.5e-197
NP_001310472 (OMIM: 253000,612222) N-acetylgalacto ( 337) 2350 511.1 2.1e-144
XP_016878602 (OMIM: 253000,612222) PREDICTED: N-ac ( 390) 2128 463.7 4.4e-130
NP_001078896 (OMIM: 250100,607574) arylsulfatase A ( 509)  958 214.1  8.2e-55
NP_001078895 (OMIM: 250100,607574) arylsulfatase A ( 509)  958 214.1  8.2e-55
NP_001078894 (OMIM: 250100,607574) arylsulfatase A ( 509)  958 214.1  8.2e-55
NP_000478 (OMIM: 250100,607574) arylsulfatase A is ( 509)  958 214.1  8.2e-55
XP_016884289 (OMIM: 250100,607574) PREDICTED: aryl ( 547)  906 203.0  1.9e-51
XP_011528993 (OMIM: 250100,607574) PREDICTED: aryl ( 387)  862 193.5  9.7e-49
NP_001078897 (OMIM: 250100,607574) arylsulfatase A ( 423)  570 131.2    6e-30
XP_011528992 (OMIM: 250100,607574) PREDICTED: aryl ( 423)  570 131.2    6e-30
XP_005274578 (OMIM: 300180,302950) PREDICTED: aryl ( 535)  552 127.4    1e-28
NP_001269560 (OMIM: 300180,302950) arylsulfatase E ( 544)  552 127.4  1.1e-28
XP_011522842 (OMIM: 610008) PREDICTED: arylsulfata ( 551)  548 126.6  1.9e-28
XP_011541694 (OMIM: 253200,611542) PREDICTED: aryl ( 405)  537 124.1  7.7e-28
NP_942002 (OMIM: 253200,611542) arylsulfatase B is ( 413)  537 124.1  7.8e-28
XP_011541693 (OMIM: 253200,611542) PREDICTED: aryl ( 442)  537 124.2  8.2e-28
XP_016864960 (OMIM: 253200,611542) PREDICTED: aryl ( 451)  537 124.2  8.4e-28
XP_011541692 (OMIM: 253200,611542) PREDICTED: aryl ( 533)  537 124.2  9.5e-28
NP_000037 (OMIM: 253200,611542) arylsulfatase B is ( 533)  537 124.2  9.5e-28
XP_011541695 (OMIM: 253200,611542) PREDICTED: aryl ( 382)  530 122.6  2.1e-27
XP_016879857 (OMIM: 610008) PREDICTED: arylsulfata ( 413)  495 115.2  3.9e-25
XP_011522847 (OMIM: 610008) PREDICTED: arylsulfata ( 413)  495 115.2  3.9e-25
XP_011522846 (OMIM: 610008) PREDICTED: arylsulfata ( 427)  495 115.2    4e-25
XP_011522845 (OMIM: 610008) PREDICTED: arylsulfata ( 427)  495 115.2    4e-25
XP_016879855 (OMIM: 610008) PREDICTED: arylsulfata ( 461)  487 113.5  1.4e-24
XP_016879856 (OMIM: 610008) PREDICTED: arylsulfata ( 461)  487 113.5  1.4e-24
XP_011522843 (OMIM: 610008) PREDICTED: arylsulfata ( 488)  487 113.5  1.5e-24
XP_011522844 (OMIM: 610008) PREDICTED: arylsulfata ( 488)  487 113.5  1.5e-24
XP_016879850 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  487 113.5  1.5e-24
XP_016879854 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  487 113.5  1.5e-24
XP_005257227 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  487 113.5  1.5e-24
XP_016879853 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  487 113.5  1.5e-24
NP_055775 (OMIM: 610008) arylsulfatase G [Homo sap ( 525)  487 113.5  1.5e-24
NP_001254656 (OMIM: 610008) arylsulfatase G [Homo  ( 525)  487 113.5  1.5e-24
XP_016879851 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  487 113.5  1.5e-24
XP_016879852 (OMIM: 610008) PREDICTED: arylsulfata ( 525)  487 113.5  1.5e-24
XP_011522837 (OMIM: 610008) PREDICTED: arylsulfata ( 552)  487 113.6  1.6e-24
XP_006721840 (OMIM: 610008)
PREDICTED: arylsulfata ( 552) 487 113.6 1.6e-24 XP_011522839 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 487 113.6 1.6e-24 XP_011522838 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 487 113.6 1.6e-24 XP_016879849 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 487 113.6 1.6e-24 XP_011522840 (OMIM: 610008) PREDICTED: arylsulfata ( 552) 487 113.6 1.6e-24 XP_006721842 (OMIM: 610008) PREDICTED: arylsulfata ( 329) 476 111.0 5.5e-24 XP_005257229 (OMIM: 610008) PREDICTED: arylsulfata ( 344) 476 111.1 5.6e-24 >>NP_000503 (OMIM: 253000,612222) N-acetylgalactosamine- (522 aa) initn: 3665 init1: 3665 opt: 3665 Z-score: 4227.7 bits: 791.9 E(85289): 0 Smith-Waterman score: 3665; 100.0% identity (100.0% similar) in 522 aa overlap (1-522:1-522) 10 20 30 40 50 60 pF1KE5 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 EQLLPELLKKAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 EQLLPELLKKAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 YRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 YRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 ASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 ASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 
GSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 GSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSD 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 RAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 RAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE5 GQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 GQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALV 430 440 450 460 470 480 490 500 510 520 pF1KE5 PAQPQLNVCNWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH :::::::::::::::::::::::::::::::::::::::::: NP_000 PAQPQLNVCNWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH 490 500 510 520 >>XP_005256358 (OMIM: 253000,612222) PREDICTED: N-acetyl (575 aa) initn: 3437 init1: 3437 opt: 3443 Z-score: 3971.0 bits: 744.5 E(85289): 1.9e-214 Smith-Waterman score: 3443; 96.2% identity (96.5% similar) in 521 aa overlap (1-515:1-519) 10 20 30 40 50 60 pF1KE5 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 EQLLPELLKKAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 EQLLPELLKKAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPV 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 YRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 
YRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 ASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 ASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 GSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 GSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSD 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 RAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 RAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCP 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE5 GQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 GQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALV 430 440 450 460 470 480 490 500 510 520 pF1KE5 PAQPQLNVCNWAVMNWAPPGC------EKLGKCLTPPESIPKKCLWSH :::::::::::::: :: .: : : :. 
: XP_005 PAQPQLNVCNWAVM--APTQSSTSVQPDKTRPCSPPVEKRPHHASIGQKHKHRRASKVSL 490 500 510 520 530 XP_005 KSVFRPQRMRRRLVQPWGPQQSGPLRSSFSVPCLGVP 540 550 560 570 >>NP_001310473 (OMIM: 253000,612222) N-acetylgalactosami (528 aa) initn: 3403 init1: 3403 opt: 3403 Z-score: 3925.4 bits: 736.0 E(85289): 6.6e-212 Smith-Waterman score: 3403; 99.8% identity (100.0% similar) in 483 aa overlap (40-522:46-528) 10 20 30 40 50 60 pF1KE5 WWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLF .::::::::::::::::::::::::::::: NP_001 PHLKTKQKWRRKTAWADRGAAPSCTHAGDAEMGWGDLGVYGEPSRETPNLDRMAAEGLLF 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE5 PNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLK 80 90 100 110 120 130 130 140 150 160 170 180 pF1KE5 KAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGR 140 150 160 170 180 190 190 200 210 220 230 240 pF1KE5 YYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 YYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTS 200 210 220 230 240 250 250 260 270 280 290 300 pF1KE5 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG 260 270 280 290 300 310 310 320 330 340 350 360 pF1KE5 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLL 320 330 340 350 360 370 370 380 390 400 410 420 pF1KE5 PTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 
PTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTT 380 390 400 410 420 430 430 440 450 460 470 480 pF1KE5 HNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVC 440 450 460 470 480 490 490 500 510 520 pF1KE5 NWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH ::::::::::::::::::::::::::::::::: NP_001 NWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH 500 510 520 >>XP_011521284 (OMIM: 253000,612222) PREDICTED: N-acetyl (581 aa) initn: 3175 init1: 3175 opt: 3181 Z-score: 3668.7 bits: 688.6 E(85289): 1.3e-197 Smith-Waterman score: 3181; 95.6% identity (96.3% similar) in 482 aa overlap (40-515:46-525) 10 20 30 40 50 60 pF1KE5 WWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLF .::::::::::::::::::::::::::::: XP_011 PHLKTKQKWRRKTAWADRGAAPSCTHAGDAEMGWGDLGVYGEPSRETPNLDRMAAEGLLF 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE5 PNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 PNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLK 80 90 100 110 120 130 130 140 150 160 170 180 pF1KE5 KAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 KAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGR 140 150 160 170 180 190 190 200 210 220 230 240 pF1KE5 YYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 YYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTS 200 210 220 230 240 250 250 260 270 280 290 300 pF1KE5 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG 260 270 280 290 300 310 310 320 330 340 350 360 pF1KE5 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLL 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLL 320 330 340 350 360 370 370 380 390 400 410 420 pF1KE5 PTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 PTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTT 380 390 400 410 420 430 430 440 450 460 470 480 pF1KE5 HNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 HNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVC 440 450 460 470 480 490 490 500 510 520 pF1KE5 NWAVMNWAPPGC------EKLGKCLTPPESIPKKCLWSH ::::: :: .: : : :. : XP_011 NWAVM--APTQSSTSVQPDKTRPCSPPVEKRPHHASIGQKHKHRRASKVSLKSVFRPQRM 500 510 520 530 540 550 XP_011 RRRLVQPWGPQQSGPLRSSFSVPCLGVP 560 570 580 >>XP_016878601 (OMIM: 253000,612222) PREDICTED: N-acetyl (508 aa) initn: 3175 init1: 3175 opt: 3180 Z-score: 3668.4 bits: 688.4 E(85289): 1.4e-197 Smith-Waterman score: 3180; 99.1% identity (99.3% similar) in 459 aa overlap (40-498:46-504) 10 20 30 40 50 60 pF1KE5 WWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLF .::::::::::::::::::::::::::::: XP_016 PHLKTKQKWRRKTAWADRGAAPSCTHAGDAEMGWGDLGVYGEPSRETPNLDRMAAEGLLF 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE5 PNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 PNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLK 80 90 100 110 120 130 130 140 150 160 170 180 pF1KE5 KAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 KAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGR 140 150 160 170 180 190 190 200 210 220 230 240 pF1KE5 YYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 
YYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTS 200 210 220 230 240 250 250 260 270 280 290 300 pF1KE5 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG 260 270 280 290 300 310 310 320 330 340 350 360 pF1KE5 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLL 320 330 340 350 360 370 370 380 390 400 410 420 pF1KE5 PTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 PTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTT 380 390 400 410 420 430 430 440 450 460 470 480 pF1KE5 HNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 HNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVC 440 450 460 470 480 490 490 500 510 520 pF1KE5 NWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH ::::: : XP_016 NWAVMPQRPSHQT 500 >>XP_016878600 (OMIM: 253000,612222) PREDICTED: N-acetyl (511 aa) initn: 3175 init1: 3175 opt: 3176 Z-score: 3663.7 bits: 687.5 E(85289): 2.5e-197 Smith-Waterman score: 3176; 98.3% identity (98.9% similar) in 465 aa overlap (40-504:46-506) 10 20 30 40 50 60 pF1KE5 WWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLF .::::::::::::::::::::::::::::: XP_016 PHLKTKQKWRRKTAWADRGAAPSCTHAGDAEMGWGDLGVYGEPSRETPNLDRMAAEGLLF 20 30 40 50 60 70 70 80 90 100 110 120 pF1KE5 PNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 PNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLK 80 90 100 110 120 130 130 140 150 160 170 180 pF1KE5 KAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 
KAGYVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGR 140 150 160 170 180 190 190 200 210 220 230 240 pF1KE5 YYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 YYEEFPINLKTGEANLTQIYLQEALDFIKRQARHHPFFLYWAVDATHAPVYASKPFLGTS 200 210 220 230 240 250 250 260 270 280 290 300 pF1KE5 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCG 260 270 280 290 300 310 310 320 330 340 350 360 pF1KE5 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 KQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLL 320 330 340 350 360 370 370 380 390 400 410 420 pF1KE5 PTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 PTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTT 380 390 400 410 420 430 430 440 450 460 470 480 pF1KE5 HNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 HNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLNVC 440 450 460 470 480 490 490 500 510 520 pF1KE5 NWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH ::::: .: :. 
: XP_016 NWAVM--GP--CDDLQTPGL 500 510 >>NP_001310472 (OMIM: 253000,612222) N-acetylgalactosami (337 aa) initn: 2350 init1: 2350 opt: 2350 Z-score: 2713.4 bits: 511.1 E(85289): 2.1e-144 Smith-Waterman score: 2350; 100.0% identity (100.0% similar) in 337 aa overlap (186-522:1-337) 160 170 180 190 200 210 pF1KE5 FDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALD :::::::::::::::::::::::::::::: NP_001 MVGRYYEEFPINLKTGEANLTQIYLQEALD 10 20 30 220 230 240 250 260 270 pF1KE5 FIKRQARHHPFFLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FIKRQARHHPFFLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDL 40 50 60 70 80 90 280 290 300 310 320 330 pF1KE5 HVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQV 100 110 120 130 140 150 340 350 360 370 380 390 pF1KE5 SHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATL 160 170 180 190 200 210 400 410 420 430 440 450 pF1KE5 GQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLS 220 230 240 250 260 270 460 470 480 490 500 510 pF1KE5 FASAEYQEALSRITSVVQQHQEALVPAQPQLNVCNWAVMNWAPPGCEKLGKCLTPPESIP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FASAEYQEALSRITSVVQQHQEALVPAQPQLNVCNWAVMNWAPPGCEKLGKCLTPPESIP 280 290 300 310 320 330 520 pF1KE5 KKCLWSH ::::::: NP_001 KKCLWSH >>XP_016878602 (OMIM: 253000,612222) PREDICTED: N-acetyl (390 aa) initn: 2122 init1: 2122 opt: 2128 Z-score: 2456.4 bits: 463.7 E(85289): 4.4e-130 Smith-Waterman score: 2128; 94.0% identity (94.6% similar) in 336 aa overlap (186-515:1-334) 160 170 180 190 200 210 pF1KE5 
FDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALD :::::::::::::::::::::::::::::: XP_016 MVGRYYEEFPINLKTGEANLTQIYLQEALD 10 20 30 220 230 240 250 260 270 pF1KE5 FIKRQARHHPFFLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 FIKRQARHHPFFLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDL 40 50 60 70 80 90 280 290 300 310 320 330 pF1KE5 HVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 HVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQV 100 110 120 130 140 150 340 350 360 370 380 390 pF1KE5 SHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 SHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATL 160 170 180 190 200 210 400 410 420 430 440 450 pF1KE5 GQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 GQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLS 220 230 240 250 260 270 460 470 480 490 500 pF1KE5 FASAEYQEALSRITSVVQQHQEALVPAQPQLNVCNWAVMNWAPPGC------EKLGKCLT ::::::::::::::::::::::::::::::::::::::: :: .: : XP_016 FASAEYQEALSRITSVVQQHQEALVPAQPQLNVCNWAVM--APTQSSTSVQPDKTRPCSP 280 290 300 310 320 510 520 pF1KE5 PPESIPKKCLWSH : :. : XP_016 PVEKRPHHASIGQKHKHRRASKVSLKSVFRPQRMRRRLVQPWGPQQSGPLRSSFSVPCLG 330 340 350 360 370 380 >>NP_001078896 (OMIM: 250100,607574) arylsulfatase A iso (509 aa) initn: 709 init1: 323 opt: 958 Z-score: 1105.1 bits: 214.1 E(85289): 8.2e-55 Smith-Waterman score: 968; 37.3% identity (60.5% similar) in 534 aa overlap (13-512:9-506) 10 20 30 40 50 60 pF1KE5 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD :::.: :::.... .::::.:.. 
::.:.:::: ::.:: ::::: NP_001 MSMGAPRSLLLAL-AAGLAVA---RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLD 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS ..:: :: : .:: ::.::::::::::::.: :.: .. .:. ::.: NP_001 QLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYP------GVLVPSS-RGGLPLE 60 70 80 90 100 130 140 150 160 170 pF1KE5 EQLLPELLKKAGYVSKIVGKWHLGHRPQ--FHPLKHGFDEWFGSPNCH-FGPYDNKA--R : . :.: ::.. ..:::::: :. : : ..:: ...: : : :: .: . NP_001 EVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFP 110 120 130 140 150 160 180 190 200 210 220 pF1KE5 PNIPVYRDWEMVGRYYEEFPINLKTGEAN------LTQIYLQEALDFI---KRQARHHPF : : .. : . ::.. ::. : :. : :.. .:: : :: NP_001 PATPCDGGCDQ-GLVPIPLLANLSV-EAQPPWLPGLEARYMAFAHDLMADAQRQDR--PF 170 180 190 200 210 220 230 240 250 260 270 280 pF1KE5 FLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFT :::.: :: : .... : : :: .::.. :.: ..: .. . :: . ..:.:.:: NP_001 FLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFT 230 240 250 260 270 280 290 300 310 320 330 340 pF1KE5 SDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLF .::: . .:: .: . ::: ::.:::.::::::.::::.. : :.:.:.: .::. NP_001 ADNGPETMRMS-RGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPG-VTHELASSLDLL 290 300 310 320 330 350 360 370 380 390 pF1KE5 TTSLALAGLTPPSDRAIDGLNLLPTLL-QGRLMDRPIFYY-------RGDTLMAATLGQH : :::: .: . ..::..: : :: :. . .:.: :: ..:. :.. NP_001 PTLAALAG-APLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRG--VFAVRTGKY 340 350 360 370 380 390 400 410 420 430 440 450 pF1KE5 KAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPL---- ::::.: .. . : . :..:.: . ::.. :..:::: . : NP_001 KAHFFTQGSAHSDTTAD-PAC--HASSSLTAH------EPPLLYDLSKDPGENYNLLGGV 400 410 420 430 440 460 470 480 490 500 pF1KE5 SFASAEYQEALSRITSVVQQHQEALV--PAQ------PQLNVCNWAVMNWAPPGCEKLGK . :. : .::... . : . :.. :.: : :..: ::: NP_001 AGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQIC-------CHPGCTPRPA 450 460 470 480 490 500 510 520 pF1KE5 CLTPPESIPKKCLWSH : :. 
NP_001 CCHCPDPHA >>NP_001078895 (OMIM: 250100,607574) arylsulfatase A iso (509 aa) initn: 709 init1: 323 opt: 958 Z-score: 1105.1 bits: 214.1 E(85289): 8.2e-55 Smith-Waterman score: 968; 37.3% identity (60.5% similar) in 534 aa overlap (13-512:9-506) 10 20 30 40 50 60 pF1KE5 MAAVVAATRWWQLLLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLD :::.: :::.... .::::.:.. ::.:.:::: ::.:: ::::: NP_001 MSMGAPRSLLLAL-AAGLAVA---RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLD 10 20 30 40 50 70 80 90 100 110 120 pF1KE5 RMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDS ..:: :: : .:: ::.::::::::::::.: :.: .. .:. ::.: NP_001 QLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYP------GVLVPSS-RGGLPLE 60 70 80 90 100 130 140 150 160 170 pF1KE5 EQLLPELLKKAGYVSKIVGKWHLGHRPQ--FHPLKHGFDEWFGSPNCH-FGPYDNKA--R : . :.: ::.. ..:::::: :. : : ..:: ...: : : :: .: . NP_001 EVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFP 110 120 130 140 150 160 180 190 200 210 220 pF1KE5 PNIPVYRDWEMVGRYYEEFPINLKTGEAN------LTQIYLQEALDFI---KRQARHHPF : : .. : . ::.. ::. : :. : :.. .:: : :: NP_001 PATPCDGGCDQ-GLVPIPLLANLSV-EAQPPWLPGLEARYMAFAHDLMADAQRQDR--PF 170 180 190 200 210 220 230 240 250 260 270 280 pF1KE5 FLYWAVDATHAPVYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFT :::.: :: : .... : : :: .::.. :.: ..: .. . :: . ..:.:.:: NP_001 FLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFT 230 240 250 260 270 280 290 300 310 320 330 340 pF1KE5 SDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLF .::: . .:: .: . ::: ::.:::.::::::.::::.. : :.:.:.: .::. NP_001 ADNGPETMRMS-RGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPG-VTHELASSLDLL 290 300 310 320 330 350 360 370 380 390 pF1KE5 TTSLALAGLTPPSDRAIDGLNLLPTLL-QGRLMDRPIFYY-------RGDTLMAATLGQH : :::: .: . ..::..: : :: :. . .:.: :: ..:. :.. NP_001 PTLAALAG-APLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRG--VFAVRTGKY 340 350 360 370 380 390 400 410 420 430 440 450 pF1KE5 KAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPL---- ::::.: .. . : . :..:.: . ::.. :..:::: . 
:
NP_001 KAHFFTQGSAHSDTTAD-PAC--HASSSLTAH------EPPLLYDLSKDPGENYNLLGGV
       400 410 420 430 440

       460 470 480 490 500
pF1KE5 SFASAEYQEALSRITSVVQQHQEALV--PAQ------PQLNVCNWAVMNWAPPGCEKLGK
       . :. : .::... . : . :.. :.: : :..: :::
NP_001 AGATPEVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQIC-------CHPGCTPRPA
       450 460 470 480 490 500

       510 520
pF1KE5 CLTPPESIPKKCLWSH
       : :.
NP_001 CCHCPDPHA

522 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 05:48:54 2016 done: Tue Nov 8 05:48:55 2016
 Total Scan time: 9.390 Total Display time: 0.070

Function used was FASTA [36.3.4 Apr, 2011]