FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE4445, 461 aa
  1>>>pF1KE4445 461 - 461 aa - 461 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.8106+/-0.000334; mu= 20.7491+/- 0.021
 mean_var=69.4440+/-13.530, 0's: 0 Z-trim(115.5): 15 B-trim: 0 in 0/55
 Lambda= 0.153906
 statistics sampled from 25993 (26007) to 25993 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.667), E-opt: 0.2 (0.305), width: 16
 Scan time: 8.560

The best scores are:                                       opt  bits E(85289)
NP_000138 (OMIM: 230000,612280) tissue alpha-L-fuc ( 466) 3321 746.5 3.5e-215
XP_016856394 (OMIM: 230000,612280) PREDICTED: tiss ( 365) 2379 537.2 2.7e-152
XP_005245878 (OMIM: 230000,612280) PREDICTED: tiss ( 341) 2377 536.8 3.5e-152
NP_114409 (OMIM: 136820) plasma alpha-L-fucosidase ( 467) 1860 422.1 1.6e-117
XP_011539469 (OMIM: 230000,612280) PREDICTED: tiss ( 255) 1798 408.1 1.4e-113

>>NP_000138 (OMIM: 230000,612280) tissue alpha-L-fucosid (466 aa)
 initn: 3321 init1: 3321 opt: 3321 Z-score: 3984.1 bits: 746.5 E(85289): 3.5e-215
Smith-Waterman score: 3321; 100.0% identity (100.0% similar) in 461 aa overlap (1-461:6-466)

                    10        20        30        40        50
pF1KE4      MRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGV
            :::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 MRAPGMRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGV
               10        20        30        40        50        60

          60        70        80        90       100       110
pF1KE4 FIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 FIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEW
               70        80        90       100       110       120

         120       130       140       150       160       170
pF1KE4 ADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKRNIRYG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 ADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKRNIRYG
              130       140       150       160       170       180

         180       190       200       210       220       230
pF1KE4 LYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 LYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYW
              190       200       210       220       230       240

         240       250       260       270       280       290
pF1KE4 NSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSID
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 NSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSID
              250       260       270       280       290       300

         300       310       320       330       340       350
pF1KE4 KFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 KFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGK
              310       320       330       340       350       360

         360       370       380       390       400       410
pF1KE4 WLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIFLHWPENGVLNLESPITTSTTK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 WLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIFLHWPENGVLNLESPITTSTTK
              370       380       390       400       410       420

         420       430       440       450       460
pF1KE4 ITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK
       ::::::::::::::::::::::::::::::::::::::::::::::
NP_000 ITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK
              430       440       450       460

>>XP_016856394 (OMIM: 230000,612280) PREDICTED: tissue a (365 aa)
 initn: 2377 init1: 2377 opt: 2379 Z-score: 2855.1 bits: 537.2 E(85289): 2.7e-152
Smith-Waterman score: 2379; 96.8% identity (98.0% similar) in 348 aa overlap (114-461:18-365)

            90       100       110       120       130       140
pF1KE4 YQRFMRDNYPPGFSYADFGPQFTARFFHPEEWADLFQAAGAKYVVLTTKHHEGFTNWPSP
                                     .:..       .::::::::::::::::::
XP_016              MTSRLFPLHQGLQTGQRDWSSASFLRPMRYVVLTTKHHEGFTNWPSP
                            10        20        30        40

           150       160       170       180       190       200
pF1KE4 VSWNWNSKDVGPHRDLVGELGTALRKRNIRYGLYHSLLEWFHPLYLLDKKNGFKTQHFVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 VSWNWNSKDVGPHRDLVGELGTALRKRNIRYGLYHSLLEWFHPLYLLDKKNGFKTQHFVS
        50        60        70        80        90       100

           210       220       230       240       250       260
pF1KE4 AKTMPELYDLVNSYKPDLIWSDGEWECPDTYWNSTNFLSWLYNDSPVKDEVVVNDRWGQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 AKTMPELYDLVNSYKPDLIWSDGEWECPDTYWNSTNFLSWLYNDSPVKDEVVVNDRWGQN
       110       120       130       140       150       160

           270       280       290       300       310       320
pF1KE4 CSCHHGGYYNCEDKFKPQSLPDHKWEMCTSIDKFSWGYRRDMALSDVTEESEIISELVQT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 CSCHHGGYYNCEDKFKPQSLPDHKWEMCTSIDKFSWGYRRDMALSDVTEESEIISELVQT
       170       180       190       200       210       220

           330       340       350       360       370       380
pF1KE4 VSLGGNYLLNIGPTKDGLIVPIFQERLLAVGKWLSINGEAIYASKPWRVQWEKNTTSVWY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 VSLGGNYLLNIGPTKDGLIVPIFQERLLAVGKWLSINGEAIYASKPWRVQWEKNTTSVWY
       230       240       250       260       270       280

           390       400       410       420       430       440
pF1KE4 TSKGSAVYAIFLHWPENGVLNLESPITTSTTKITMLGIQGDLKWSTDPDKGLFISLPQLP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_016 TSKGSAVYAIFLHWPENGVLNLESPITTSTTKITMLGIQGDLKWSTDPDKGLFISLPQLP
       290       300       310       320       330       340

           450       460
pF1KE4 PSAVPAEFAWTIKLTGVK
       ::::::::::::::::::
XP_016 PSAVPAEFAWTIKLTGVK
       350       360

>>XP_005245878 (OMIM: 230000,612280) PREDICTED: tissue a (341 aa)
 initn: 2377 init1: 2377 opt: 2377 Z-score: 2853.1 bits: 536.8 E(85289): 3.5e-152
Smith-Waterman score: 2377; 99.7% identity (100.0% similar) in 337 aa overlap (125-461:5-341)

          100       110       120       130       140       150
pF1KE4 GFSYADFGPQFTARFFHPEEWADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVG
                                     .:::::::::::::::::::::::::::::
XP_005                           MRRRRYVVLTTKHHEGFTNWPSPVSWNWNSKDVG
                                         10        20        30

          160       170       180       190       200       210
pF1KE4 PHRDLVGELGTALRKRNIRYGLYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 PHRDLVGELGTALRKRNIRYGLYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLV
           40        50        60        70        80        90

          220       230       240       250       260       270
pF1KE4 NSYKPDLIWSDGEWECPDTYWNSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 NSYKPDLIWSDGEWECPDTYWNSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNC
          100       110       120       130       140       150

          280       290       300       310       320       330
pF1KE4 EDKFKPQSLPDHKWEMCTSIDKFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 EDKFKPQSLPDHKWEMCTSIDKFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNI
          160       170       180       190       200       210

          340       350       360       370       380       390
pF1KE4 GPTKDGLIVPIFQERLLAVGKWLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 GPTKDGLIVPIFQERLLAVGKWLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIF
          220       230       240       250       260       270

          400       410       420       430       440       450
pF1KE4 LHWPENGVLNLESPITTSTTKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_005 LHWPENGVLNLESPITTSTTKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWT
          280       290       300       310       320       330

          460
pF1KE4 IKLTGVK
       :::::::
XP_005 IKLTGVK
          340

>>NP_114409 (OMIM: 136820) plasma alpha-L-fucosidase pre (467 aa)
 initn: 1074 init1: 929 opt: 1860 Z-score: 2230.9 bits: 422.1 E(85289): 1.6e-117
Smith-Waterman score: 1860; 54.9% identity (77.7% similar) in 461 aa overlap (4-460:8-466)

                   10        20        30        40        50
pF1KE4     MRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGVF
              : : : ::::::.:        :.   :. : : :::.: ::::::.::::.:
NP_114 MRPQELPRLAFPLLLLLLLLLPPPPC--PAHSATRFDPTWESLDARQLPAWFDQAKFGIF
               10        20          30        40        50

         60        70        80        90       100       110
pF1KE4 IHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEWA
       :::::::::..:::::::.:: :  :.: .::.:::::.:.: :::: :::.::. ..::
NP_114 IHWGVFSVPSFGSEWFWWYWQKEKIPKYVEFMKDNYPPSFKYEDFGPLFTAKFFNANQWA
       60        70        80        90       100       110

        120       130       140       150       160       170
pF1KE4 DLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKR-NIRYG
       :.:::.::::.:::.::::::: : :  :::::. : ::.::.: :: .:.:.: ..:.:
NP_114 DIFQASGAKYIVLTSKHHEGFTLWGSEYSWNWNAIDEGPKRDIVKELEVAIRNRTDLRFG
      120       130       140       150       160       170

         180       190       200       210       220       230
pF1KE4 LYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYW
       ::.::.::::::.: :....:. ..:  .::.::::.:::.:.:...::::.   :: ::
NP_114 LYYSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQYW
      180       190       200       210       220       230

         240       250       260       270       280       290
pF1KE4 NSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSID
       :::.::.::::.:::.  ::.::::: .  :.:::.:.: :...:  :  :::: : .::
NP_114 NSTGFLAWLYNESPVRGTVVTNDRWGAGSICKHGGFYTCSDRYNPGHLLPHKWENCMTID
      240       250       260       270       280       290

         300       310       320       330       340       350
pF1KE4 KFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGK
       :.::::::. ..::     :....::.::: ::: :.::::: :: :  .:.:::  .:.
NP_114 KLSWGYRREAGISDYLTIEELVKQLVETVSCGGNLLMNIGPTLDGTISVVFEERLRQMGS
      300       310       320       330       340       350

         360       370       380         390       400        410
pF1KE4 WLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSA--VYAIFLHWPENGVLNLESP-ITTS
       ::..:::::: .. :: : .  : .:::::: .   ::::::.:: .: : :  :    .
NP_114 WLKVNGEAIYETHTWRSQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAILG
      360       370       380       390       400       410

            420       430       440       450       460
pF1KE4 TTKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK
       .:.. .::    :.: .  ..:... ::::    .: ...:.. ::.:
NP_114 ATEVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCKWGWALALTNVI
      420       430       440       450       460

>>XP_011539469 (OMIM: 230000,612280) PREDICTED: tissue a (255 aa)
 initn: 1798 init1: 1798 opt: 1798 Z-score: 2160.0 bits: 408.1 E(85289): 1.4e-113
Smith-Waterman score: 1798; 100.0% identity (100.0% similar) in 255 aa overlap (207-461:1-255)

        180       190       200       210       220       230
pF1KE4 YHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYWN
                                     ::::::::::::::::::::::::::::::
XP_011                               MPELYDLVNSYKPDLIWSDGEWECPDTYWN
                                             10        20        30

        240       250       260       270       280       290
pF1KE4 STNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSIDK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 STNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSIDK
               40        50        60        70        80        90

        300       310       320       330       340       350
pF1KE4 FSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGKW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 FSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGKW
              100       110       120       130       140       150

        360       370       380       390       400       410
pF1KE4 LSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIFLHWPENGVLNLESPITTSTTKI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 LSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIFLHWPENGVLNLESPITTSTTKI
              160       170       180       190       200       210

        420       430       440       450       460
pF1KE4 TMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK
       :::::::::::::::::::::::::::::::::::::::::::::
XP_011 TMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK
              220       230       240       250


461 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 01:05:44 2016 done: Sun Nov 6 01:05:45 2016
 Total Scan time: 8.560 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]