# /usr/local/bin/fasta34_t -T 4 -b50 -d10 -E0.01 -H -O./tmp/mbj00115.fasta.nr -Q ../query/mKIAA4172.ptfa /cdna4/rodent/rouge_util/new.rouge/nfasta/nr 2
FASTA searches a protein or DNA sequence data bank
 version 34.26.5 April 26, 2007
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

mKIAA4172, 515 aa
 vs /cdna4/rodent/rouge_util/new.rouge/nfasta/nr library

2727779818 residues in 7921681 sequences
 statistics sampled from 60000 to 7917441 sequences
 Expectation_n fit: rho(ln(x))= 5.3497+/-0.000186; mu= 11.4885+/- 0.010
 mean_var=75.5709+/-14.965, 0's: 39 Z-trim: 67 B-trim: 3136 in 1/66
 Lambda= 0.147536

FASTA (3.5 Sept 2006) function [optimized, BL50 matrix (15:-5)] ktup: 2
 join: 37, opt: 25, open/ext: -10/-2, width: 16

The best scores are:                                       opt bits E(7921681)
gi|12835776|dbj|BAB23356.1| unnamed protein produc ( 506) 3518 758.1 1.3e-216
gi|1703420|sp|P50428.1|ARSA_MOUSE RecName: Full=Ar ( 506) 3506 755.5 7.5e-216
gi|149017571|gb|EDL76575.1| arylsulfatase A, isofo ( 507) 3377 728.1 1.4e-207
gi|45686371|gb|AAL58668.2|AF316108_1 arylsulfatase ( 507) 3113 671.9 1.1e-190
gi|119593988|gb|EAW73582.1| arylsulfatase A, isofo ( 509) 3112 671.7 1.3e-190
gi|1399961|gb|AAB03341.1| arylsulfatase A [Homo sa ( 509) 3111 671.5 1.5e-190
gi|33874703|gb|AAH14210.2| Arylsulfatase A [Homo s ( 507) 3109 671.0 2.1e-190
gi|114221|sp|P15289.3|ARSA_HUMAN RecName: Full=Ary ( 507) 3108 670.8 2.4e-190
gi|52545967|emb|CAH56144.1| hypothetical protein [ ( 509) 3105 670.2 3.7e-190
gi|149759319|ref|XP_001490513.1| PREDICTED: simila ( 507) 3094 667.8 1.9e-189
gi|122132221|sp|Q08DD1.1|ARSA_BOVIN RecName: Full= ( 507) 3084 665.7 8.2e-189
gi|109094666|ref|XP_001113032.1| PREDICTED: arylsu ( 507) 3071 662.9 5.6e-188
gi|114687134|ref|XP_515228.2| PREDICTED: arylsulfa ( 680) 3069 662.6 9.5e-188
gi|12084623|pdb|1E2S|P Chain P, Crystal Structure ( 489) 3043 657.0 3.4e-186
gi|14277878|pdb|1E1Z|P Chain P, Crystal Structure ( 489) 3043 657.0 3.4e-186
gi|114326188|ref|NP_001041548.1| arylsulfatase A [
( 507) 3043 657.0 3.5e-186
gi|40889086|pdb|1N2K|A Chain A, Crystal Structure ( 489) 3042 656.8 3.9e-186
gi|15826832|pdb|1E3C|P Chain P, Crystal Structure ( 489) 3038 655.9 7.1e-186
gi|14488709|pdb|1E33|P Chain P, Crystal Structure ( 489) 3023 652.7 6.5e-185
gi|76780271|gb|AAI05853.1| Arylsulfatase A [Rattus ( 497) 2637 570.6 3.5e-160
gi|146229327|ref|NP_001078897.1| arylsulfatase A i ( 423) 2634 569.9 4.9e-160
gi|126339031|ref|XP_001366628.1| PREDICTED: simila ( 506) 2622 567.4 3.3e-159
gi|109094674|ref|XP_001113004.1| PREDICTED: arylsu ( 421) 2597 562.0 1.1e-157
gi|118081865|ref|XP_424471.2| PREDICTED: similar t ( 503) 2216 481.0 3.4e-133
gi|94732633|emb|CAK05315.1| novel protein similar ( 500) 2047 445.0 2.3e-122
gi|148726008|emb|CAN88512.1| novel protein similar ( 503) 2047 445.0 2.3e-122
gi|120537984|gb|AAI29614.1| LOC100036898 protein [ ( 507) 2044 444.4 3.6e-122
gi|60552320|gb|AAH90818.1| Zgc:101575 [Danio rerio ( 499) 2041 443.7 5.5e-122
gi|50927224|gb|AAH79772.1| MGC86251 protein [Xenop ( 507) 2036 442.6 1.2e-121
gi|148672388|gb|EDL04335.1| arylsulfatase A, isofo ( 304) 1947 423.6 3.9e-116
gi|72110136|ref|XP_780327.1| PREDICTED: hypothetic ( 527) 1777 387.5 4.7e-105
gi|115715640|ref|XP_796991.2| PREDICTED: hypotheti ( 537) 1697 370.5 6.4e-100
gi|149017572|gb|EDL76576.1| arylsulfatase A, isofo ( 256) 1666 363.7 3.5e-98
gi|72079186|ref|XP_788588.1| PREDICTED: similar to ( 525) 1587 347.1 7e-93
gi|72079188|ref|XP_788607.1| PREDICTED: similar to ( 522) 1566 342.6 1.5e-91
gi|47216038|emb|CAG11369.1| unnamed protein produc ( 474) 1397 306.6 9.7e-81
gi|115978323|ref|XP_001186506.1| PREDICTED: simila ( 764) 1340 294.6 6.3e-77
gi|146332016|gb|ABQ22514.1| arylsulfatase A precur ( 203) 1192 262.7 6.8e-68
gi|223894428|gb|EEF60880.1| sulfatase [bacterium E ( 460) 1135 250.8 5.8e-64
gi|171910115|ref|ZP_02925585.1| arylsulfatase A [V ( 460) 1130 249.8 1.2e-63
gi|114738898|gb|ABI77023.1| sulfatase family prote ( 505) 1128 249.4 1.7e-63
gi|119451878|gb|EAW33111.1|
arylsulfatase A [marin ( 479) 1122 248.1 4.1e-63 gi|126578336|gb|EAZ82500.1| arylsulfatase A [Algor ( 467) 1074 237.9 4.7e-60 gi|148843728|gb|EDL58087.1| arylsulfatase A [Planc ( 474) 1070 237.0 8.7e-60 gi|88783631|gb|EAR14802.1| arylsulfatase A [Robigi ( 492) 1063 235.5 2.5e-59 gi|210093195|gb|EEA41403.1| hypothetical protein B ( 514) 1060 234.9 4e-59 gi|149140467|gb|EDM28865.1| arylsulfatase A [Lenti ( 454) 1044 231.5 3.9e-58 gi|85830641|gb|EAQ49099.1| arylsulfatase A [Leeuwe ( 477) 1032 228.9 2.4e-57 gi|88710176|gb|EAR02408.1| arylsulfatase A [Flavob ( 469) 1025 227.4 6.6e-57 gi|115660827|ref|XP_798513.2| PREDICTED: similar t ( 706) 1026 227.8 7.8e-57 >>gi|12835776|dbj|BAB23356.1| unnamed protein product [M (506 aa) initn: 3518 init1: 3518 opt: 3518 Z-score: 4045.3 bits: 758.1 E(): 1.3e-216 Smith-Waterman score: 3518; 100.000% identity (100.000% similar) in 506 aa overlap (10-515:1-506) 10 20 30 40 50 60 mKIAA4 GHRPICISVMALGTLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL ::::::::::::::::::::::::::::::::::::::::::::::::::: gi|128 MALGTLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL 10 20 30 40 50 70 80 90 100 110 120 mKIAA4 AEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|128 AEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLA 60 70 80 90 100 110 130 140 150 160 170 180 mKIAA4 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|128 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGC 120 130 140 150 160 170 190 200 210 220 230 240 mKIAA4 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|128 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ 180 190 200 210 220 230 250 260 270 280 290 300 mKIAA4 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMSNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 
gi|128 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMSNG 240 250 260 270 280 290 310 320 330 340 350 360 mKIAA4 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|128 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNVT 300 310 320 330 340 350 370 380 390 400 410 420 mKIAA4 LDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTSDP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|128 LDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTSDP 360 370 380 390 400 410 430 440 450 460 470 480 mKIAA4 ACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAAMTF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|128 ACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAAMTF 420 430 440 450 460 470 490 500 510 mKIAA4 GPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS ::::::::::::::::::::::::::::::::::: gi|128 GPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS 480 490 500 >>gi|1703420|sp|P50428.1|ARSA_MOUSE RecName: Full=Arylsu (506 aa) initn: 3506 init1: 3506 opt: 3506 Z-score: 4031.5 bits: 755.5 E(): 7.5e-216 Smith-Waterman score: 3506; 99.605% identity (100.000% similar) in 506 aa overlap (10-515:1-506) 10 20 30 40 50 60 mKIAA4 GHRPICISVMALGTLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL ::::::::::::::::::::::::::::::::::::::::::::::::::: gi|170 MALGTLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL 10 20 30 40 50 70 80 90 100 110 120 mKIAA4 AEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLA :::::::::::::::::::::::::::::::::.::::::::::::::::::.::::::: gi|170 AEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSAMYPGVLGPSSQGGLPLEELTLAEVLA 60 70 80 90 100 110 130 140 150 160 170 180 mKIAA4 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|170 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGC 120 130 140 150 160 170 190 200 210 220 230 240 mKIAA4 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|170 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ 180 190 200 210 220 230 250 260 270 280 290 300 mKIAA4 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMSNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|170 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMSNG 240 250 260 270 280 290 310 320 330 340 350 360 mKIAA4 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNVT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|170 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNVT 300 310 320 330 340 350 370 380 390 400 410 420 mKIAA4 LDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTSDP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|170 LDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTSDP 360 370 380 390 400 410 430 440 450 460 470 480 mKIAA4 ACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAAMTF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|170 ACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAAMTF 420 430 440 450 460 470 490 500 510 mKIAA4 GPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS ::::::::::::::::::::::::::::::::::: gi|170 GPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS 480 490 500 >>gi|149017571|gb|EDL76575.1| arylsulfatase A, isoform C (507 aa) initn: 3377 init1: 3377 opt: 3377 Z-score: 3883.1 bits: 728.1 E(): 1.4e-207 Smith-Waterman score: 3377; 95.644% identity (98.416% similar) in 505 aa overlap (11-515:3-507) 10 20 30 40 50 60 mKIAA4 GHRPICISVMALGTLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL ::::: :::::::::::::::.:::::::::::::::::::::::::::: gi|149 MGALGTLVLALAAGLSTASPPNIMLIFADDLGYGDLGSYGHPSSTTPNLDQL 10 20 30 40 50 70 80 90 100 110 120 mKIAA4 AEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLA : :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|149 AAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLA 60 70 80 90 100 110 130 140 150 160 170 180 mKIAA4 
ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGC :::::::::::::::::::::::::::::::::::::::::::::::::::::: :.::: gi|149 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDITCSGGC 120 130 140 150 160 170 190 200 210 220 230 240 mKIAA4 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|149 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ 180 190 200 210 220 230 250 260 270 280 290 300 mKIAA4 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMSNG :::::::::::::::::::::::::::::::.::::::: ::::::::::::::::::.: gi|149 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTAVGDLGLLGETLVIFTADNGPELMRMSDG 240 250 260 270 280 290 310 320 330 340 350 360 mKIAA4 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNVT ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::.: gi|149 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNIT 300 310 320 330 340 350 370 380 390 400 410 420 mKIAA4 LDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTSDP ::::::::::::::::::.:::::::.:::::::::::::::::::::::::::::: :: gi|149 LDGVDISPLLLGTGKSPRNSVFFYPPFPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTPDP 360 370 380 390 400 410 430 440 450 460 470 480 mKIAA4 ACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAAMTF ::::::::::::::::::::.:::::::.:.: : ::::::::::::.::::.::::::: gi|149 ACHAANRLTAHEPPLLYDLSKDPGENYNLLDSTEEVSPEALQALKHIELLKAEYDAAMTF 420 430 440 450 460 470 490 500 510 mKIAA4 GPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS .:::::::::::::::::::::::::::::: :.: gi|149 SPSQIAKGEDPALQICCQPSCTPHPVCCHCPDSHS 480 490 500 >>gi|45686371|gb|AAL58668.2|AF316108_1 arylsulfatase A [ (507 aa) initn: 3112 init1: 3112 opt: 3113 Z-score: 3579.4 bits: 671.9 E(): 1.1e-190 Smith-Waterman score: 3113; 86.364% identity (96.245% similar) in 506 aa overlap (9-514:1-506) 10 20 30 40 50 60 mKIAA4 GHRPICISVMALGTLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL ..:: .: ::::.::...:::::.:::::::::::::::::::::::::::: gi|456 
MVALWALTLALASGLAATSPPNIVLIFADDLGYGDLGSYGHPSSTTPNLDQL 10 20 30 40 50 70 80 90 100 110 120 mKIAA4 AEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLA : :::::::::::::::::::::::::::::: :.::::: :::.::::::::::::::: gi|456 AAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGLYPGVLEPSSRGGLPLEEVTLAEVLA 60 70 80 90 100 110 130 140 150 160 170 180 mKIAA4 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGC ::::::::::::::::::::::::::::::::::::::::::::::::::::. :: :.: gi|456 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPSTPCDGSC 120 130 140 150 160 170 190 200 210 220 230 240 mKIAA4 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ ::::::.::::::.::::::::::::::::.:.::::::::::::::::::::::::::: gi|456 DQGLVPVPLLANLSVEAQPPWLPGLEARYVAFARDLMADAQRQGRPFFLYYASHHTHYPQ 180 190 200 210 220 230 250 260 270 280 290 300 mKIAA4 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMSNG ::::::. .::::::::::::::.:::::::.::::::: ::::::::::::: ::::.: gi|456 FSGQSFSGHSGRGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFTADNGPETMRMSHG 240 250 260 270 280 290 310 320 330 340 350 360 mKIAA4 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNVT :::::::::::::::::::::::..:::::.::::::::::::::::::::.:::::::: gi|456 GCSGLLRCGKGTTFEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVT 300 310 320 330 340 350 370 380 390 400 410 420 mKIAA4 LDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTSDP :::::.::::::::::::...:::: ::::..::::::.:::::::::::: :::::.:: gi|456 LDGVDLSPLLLGTGKSPRRTLFFYPAYPDEVRGVFAVRSGKYKAHFFTQGSIHSDTTADP 360 370 380 390 400 410 430 440 450 460 470 480 mKIAA4 ACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAAMTF ::::.. :::::::::.:::.:::::::.: .. 
:.::.::.::..::::::.:::.:: gi|456 ACHASSPLTAHEPPLLFDLSEDPGENYNLLGGVAEVAPEVLQVLKQLQLLKAQFDAAVTF 420 430 440 450 460 470 490 500 510 mKIAA4 GPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS .:::::.::::::::::::::::.: ::::: : gi|456 SPSQIARGEDPALQICCQPSCTPRPSCCHCPELQP 480 490 500 >>gi|119593988|gb|EAW73582.1| arylsulfatase A, isoform C (509 aa) initn: 3109 init1: 3109 opt: 3112 Z-score: 3578.3 bits: 671.7 E(): 1.3e-190 Smith-Waterman score: 3112; 86.139% identity (96.040% similar) in 505 aa overlap (10-511:1-505) 10 20 30 40 50 mKIAA4 GHRPICISVMALG---TLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNL :..: .:.:::::::..: ::::.::::::::::::: ::::::::::: gi|119 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNL 10 20 30 40 50 60 70 80 90 100 110 mKIAA4 DQLAEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAE :::: :::::::::::::::::::::::::::::: ::::::: :::.:::::::::.:: gi|119 DQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAE 60 70 80 90 100 110 120 130 140 150 160 170 mKIAA4 VLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCK ::::::::::::::::::::::::::::::::::::::::::::::::::::::: :: gi|119 VLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCD 120 130 140 150 160 170 180 190 200 210 220 230 mKIAA4 GGCDQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTH ::::::::::::::::.:::::::::::::::..:..::::::::: ::::::::::::: gi|119 GGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTH 180 190 200 210 220 230 240 250 260 270 280 290 mKIAA4 YPQFSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRM :::::::::..:::::::::::::::.:::.:::..:::::::::::::::::::: ::: gi|119 YPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRM 240 250 260 270 280 290 300 310 320 330 340 350 mKIAA4 SNGGCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLP : ::::::::::::::.:::::::::..:::::.::::::::::::::::::::.::::: gi|119 SRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLP 300 310 320 330 340 350 360 370 380 390 400 410 mKIAA4 
NVTLDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTT :::::: :.::::::::::::.:.:::: ::::..::::::.:::::::::::::::::: gi|119 NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRSGKYKAHFFTQGSAHSDTT 360 370 380 390 400 410 420 430 440 450 460 470 mKIAA4 SDPACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAA .::::::.. :::::::::::::.:::::::.: .. :..::.:::::..:::::: ::: gi|119 ADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAA 420 430 440 450 460 470 480 490 500 510 mKIAA4 MTFGPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS .::::::.:.::::::::::.:.:::.:.::::: gi|119 VTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA 480 490 500 >>gi|1399961|gb|AAB03341.1| arylsulfatase A [Homo sapien (509 aa) initn: 3108 init1: 3108 opt: 3111 Z-score: 3577.1 bits: 671.5 E(): 1.5e-190 Smith-Waterman score: 3111; 86.139% identity (96.040% similar) in 505 aa overlap (10-511:1-505) 10 20 30 40 50 mKIAA4 GHRPICISVMALG---TLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNL :..: .:.:::::::..: ::::.::::::::::::: ::::::::::: gi|139 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNL 10 20 30 40 50 60 70 80 90 100 110 mKIAA4 DQLAEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAE :::: :::::::::::::::::::::::::::::: ::::::: :::.:::::::::.:: gi|139 DQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAE 60 70 80 90 100 110 120 130 140 150 160 170 mKIAA4 VLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCK ::::::::::::::::::::::::::::::::::::::::::::::::::::::: :: gi|139 VLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCD 120 130 140 150 160 170 180 190 200 210 220 230 mKIAA4 GGCDQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTH ::::::::::::::::.:::::::::::::::..:..::::::::: ::::::::::::: gi|139 GGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTH 180 190 200 210 220 230 240 250 260 270 280 290 mKIAA4 YPQFSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRM :::::::::..:::::::::::::::.:::.:::..:::::::::::::::::::: ::: gi|139 
YPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRM 240 250 260 270 280 290 300 310 320 330 340 350 mKIAA4 SNGGCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLP : ::::::::::::::.:::::::::..:::::.::::::::::::::::::::.::::: gi|139 SRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLP 300 310 320 330 340 350 360 370 380 390 400 410 mKIAA4 NVTLDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTT :::::: :.::::::::::::.:.:::: ::::..::::::.:::::::::::::::::: gi|139 NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTT 360 370 380 390 400 410 420 430 440 450 460 470 mKIAA4 SDPACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAA .::::::.. :::::::::::::.:::::::.: .. :..::.:::::..:::::: ::: gi|139 ADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAA 420 430 440 450 460 470 480 490 500 510 mKIAA4 MTFGPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS .::::::.:.::::::::::.:.:::.:.::::: gi|139 VTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA 480 490 500 >>gi|33874703|gb|AAH14210.2| Arylsulfatase A [Homo sapie (507 aa) initn: 3109 init1: 3109 opt: 3109 Z-score: 3574.8 bits: 671.0 E(): 2.1e-190 Smith-Waterman score: 3109; 86.948% identity (96.586% similar) in 498 aa overlap (14-511:6-503) 10 20 30 40 50 60 mKIAA4 GHRPICISVMALGTLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL .:.:::::::..: ::::.::::::::::::: :::::::::::::: gi|338 MGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQL 10 20 30 40 50 70 80 90 100 110 120 mKIAA4 AEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLA : :::::::::::::::::::::::::::::: ::::::: :::.:::::::::.::::: gi|338 AAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLA 60 70 80 90 100 110 130 140 150 160 170 180 mKIAA4 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGC :::::::::::::::::::::::::::::::::::::::::::::::::::: :: ::: gi|338 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGC 120 130 140 150 160 170 190 200 210 220 230 240 mKIAA4 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ 
:::::::::::::.:::::::::::::::..:..::::::::: :::::::::::::::: gi|338 DQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQ 180 190 200 210 220 230 250 260 270 280 290 300 mKIAA4 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMSNG ::::::..:::::::::::::::.:::.:::..:::::::::::::::::::: :::: : gi|338 FSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRG 240 250 260 270 280 290 310 320 330 340 350 360 mKIAA4 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNVT :::::::::::::.:::::::::..:::::.::::::::::::::::::::.:::::::: gi|338 GCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVT 300 310 320 330 340 350 370 380 390 400 410 420 mKIAA4 LDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTSDP ::: :.::::::::::::.:.:::: ::::..::::::.::::::::::::::::::.:: gi|338 LDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRSGKYKAHFFTQGSAHSDTTADP 360 370 380 390 400 410 430 440 450 460 470 480 mKIAA4 ACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAAMTF ::::.. :::::::::::::.:::::::.: .. 
:..::.:::::..:::::: :::.:: gi|338 ACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTF 420 430 440 450 460 470 490 500 510 mKIAA4 GPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS ::::.:.::::::::::.:.:::.:.::::: gi|338 GPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA 480 490 500 >>gi|114221|sp|P15289.3|ARSA_HUMAN RecName: Full=Arylsul (507 aa) initn: 3108 init1: 3108 opt: 3108 Z-score: 3573.7 bits: 670.8 E(): 2.4e-190 Smith-Waterman score: 3108; 86.948% identity (96.586% similar) in 498 aa overlap (14-511:6-503) 10 20 30 40 50 60 mKIAA4 GHRPICISVMALGTLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL .:.:::::::..: ::::.::::::::::::: :::::::::::::: gi|114 MGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQL 10 20 30 40 50 70 80 90 100 110 120 mKIAA4 AEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLA : :::::::::::::::::::::::::::::: ::::::: :::.:::::::::.::::: gi|114 AAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLA 60 70 80 90 100 110 130 140 150 160 170 180 mKIAA4 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGC :::::::::::::::::::::::::::::::::::::::::::::::::::: :: ::: gi|114 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGC 120 130 140 150 160 170 190 200 210 220 230 240 mKIAA4 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ :::::::::::::.:::::::::::::::..:..::::::::: :::::::::::::::: gi|114 DQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQ 180 190 200 210 220 230 250 260 270 280 290 300 mKIAA4 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMSNG ::::::..:::::::::::::::.:::.:::..:::::::::::::::::::: :::: : gi|114 FSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRG 240 250 260 270 280 290 310 320 330 340 350 360 mKIAA4 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNVT :::::::::::::.:::::::::..:::::.::::::::::::::::::::.:::::::: gi|114 GCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLPNVT 300 310 320 330 340 350 370 380 390 400 410 420 mKIAA4 
LDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTSDP ::: :.::::::::::::.:.:::: ::::..::::::.::::::::::::::::::.:: gi|114 LDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADP 360 370 380 390 400 410 430 440 450 460 470 480 mKIAA4 ACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAAMTF ::::.. :::::::::::::.:::::::.: .. :..::.:::::..:::::: :::.:: gi|114 ACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAAVTF 420 430 440 450 460 470 490 500 510 mKIAA4 GPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS ::::.:.::::::::::.:.:::.:.::::: gi|114 GPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA 480 490 500 >>gi|52545967|emb|CAH56144.1| hypothetical protein [Homo (509 aa) initn: 3102 init1: 3102 opt: 3105 Z-score: 3570.2 bits: 670.2 E(): 3.7e-190 Smith-Waterman score: 3105; 85.941% identity (96.040% similar) in 505 aa overlap (10-511:1-505) 10 20 30 40 50 mKIAA4 GHRPICISVMALG---TLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNL :..: .:.:::::::..: ::::.::::::::::::: ::::::::::: gi|525 MSMGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSSTTPNL 10 20 30 40 50 60 70 80 90 100 110 mKIAA4 DQLAEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAE :::: :::::::::::::::::::::::::::::: ::::::: :::.:::::::::.:: gi|525 DQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAE 60 70 80 90 100 110 120 130 140 150 160 170 mKIAA4 VLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCK ::::::::::::::::::::::::::::::::::::::::::::::::::::::: :: gi|525 VLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCD 120 130 140 150 160 170 180 190 200 210 220 230 mKIAA4 GGCDQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTH ::::::::::::::::.:::::::::::::::..:..::::::::: ::::::::::::: gi|525 GGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTH 180 190 200 210 220 230 240 250 260 270 280 290 mKIAA4 YPQFSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRM :::::::::..:::::::::::::::.:::.:::..:::::::::::::::::::: ::: gi|525 YPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRM 240 250 
260 270 280 290 300 310 320 330 340 350 mKIAA4 SNGGCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLP : ::::::::::::::.:::::::::..:::::.::::::::::::::::::::.::::: gi|525 SRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLP 300 310 320 330 340 350 360 370 380 390 400 410 mKIAA4 NVTLDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTT :::::: :.::::::::::::.:.:::: ::::..::::::.:::::::.:::::::::: gi|525 NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRSGKYKAHFLTQGSAHSDTT 360 370 380 390 400 410 420 430 440 450 460 470 mKIAA4 SDPACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAA .::::::.. :::::::::::::.:::::::.: .. :..::.:::::..:::::: ::: gi|525 ADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEVLQALKQLQLLKAQLDAA 420 430 440 450 460 470 480 490 500 510 mKIAA4 MTFGPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS .::::::.:.::::::::::.:.:::.:.::::: gi|525 VTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA 480 490 500 >>gi|149759319|ref|XP_001490513.1| PREDICTED: similar to (507 aa) initn: 3094 init1: 3094 opt: 3094 Z-score: 3557.6 bits: 667.8 E(): 1.9e-189 Smith-Waterman score: 3094; 87.525% identity (95.976% similar) in 497 aa overlap (15-511:7-503) 10 20 30 40 50 60 mKIAA4 GHRPICISVMALGTLFLALAAGLSTASPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL : ::::.::..:.::::::::::::::::::::::::::::::::: gi|149 MVAPWALALALASGLAAANPPNILLIFADDLGYGDLGSYGHPSSTTPNLDQL 10 20 30 40 50 70 80 90 100 110 120 mKIAA4 AEGGLRFTDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLA : :::::::::::::::::::::::::::::: :.::::: :::.::::::::::::::: gi|149 AAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGLYPGVLEPSSRGGLPLEEVTLAEVLA 60 70 80 90 100 110 130 140 150 160 170 180 mKIAA4 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGC :::::::::::::::::::::::::::::::::::::::::::::::::::: :: :.: gi|149 ARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGSC 120 130 140 150 160 170 190 200 210 220 230 240 mKIAA4 DQGLVPIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTHYPQ :::::::::::::.: .:::::::::::::.:.::::::::::::::::::::::::::: gi|149 
DQGLVPIPLLANLSVVVQPPWLPGLEARYVAFARDLMADAQRQGRPFFLYYASHHTHYPQ 180 190 200 210 220 230 250 260 270 280 290 300 mKIAA4 FSGQSFTKRSGRGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNGPELMRMSNG ::::::. :::::::::::::::.:::::::.::::::: ::::::.:::::: :::: : gi|149 FSGQSFAGRSGRGPFGDSLMELDAAVGALMTAVGDLGLLGETLVIFAADNGPETMRMSRG 240 250 260 270 280 290 310 320 330 340 350 360 mKIAA4 GCSGLLRCGKGTTFEGGVREPALVYWPGHITPGVTHELASSLDLLPTLAALTGAPLPNVT :::::::::::::::::::::::..:::::.:::::::::::::::::::::::::::.: gi|149 GCSGLLRCGKGTTFEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALTGAPLPNIT 300 310 320 330 340 350 370 380 390 400 410 420 mKIAA4 LDGVDISPLLLGTGKSPRKSVFFYPPYPDEIHGVFAVRNGKYKAHFFTQGSAHSDTTSDP :::::.::::::::::::..::::: :::..::::::.::::::::::::::::::.:: gi|149 LDGVDLSPLLLGTGKSPRQTVFFYPANPDEVRGVFAVRSGKYKAHFFTQGSAHSDTTADP 360 370 380 390 400 410 430 440 450 460 470 480 mKIAA4 ACHAANRLTAHEPPLLYDLSQDPGENYNVLESIEGVSPEALQALKHIQLLKAQYDAAMTF ::::.. :::::::::.:::.::.::::.::.. :. :.::::::.::::::.::.::: gi|149 ACHASSPLTAHEPPLLFDLSEDPSENYNLLEGVAKVTSETLQALKHLQLLKAQFDATMTF 420 430 440 450 460 470 490 500 510 mKIAA4 GPSQIAKGEDPALQICCQPSCTPHPVCCHCPGSQS .:::.:.:::::::::::::::: : ::::: gi|149 SPSQMARGEDPALQICCQPSCTPWPSCCHCPEPHT 480 490 500 515 residues in 1 query sequences 2727779818 residues in 7921681 library sequences Tcomplib [34.26] (2 proc) start: Tue Mar 17 22:17:46 2009 done: Tue Mar 17 22:25:01 2009 Total Scan time: 971.940 Total Display time: 0.160 Function used was FASTA [version 34.26.5 April 26, 2007]