FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5533, 467 aa
  1>>>pF1KE5533 467 - 467 aa - 467 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.6985+/-0.000301; mu= 21.6053+/- 0.019
 mean_var=65.5448+/-13.112, 0's: 0 Z-trim(116.6): 12 B-trim: 6 in 1/50
 Lambda= 0.158418
 statistics sampled from 27929 (27936) to 27929 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.684), E-opt: 0.2 (0.328), width: 16
 Scan time: 8.420

The best scores are:                                       opt bits E(85289)
NP_114409 (OMIM: 136820) plasma alpha-L-fucosidase ( 467) 3322 767.9       0
NP_000138 (OMIM: 230000,612280) tissue alpha-L-fuc ( 466) 1866 435.1 1.9e-121
XP_016856394 (OMIM: 230000,612280) PREDICTED: tiss ( 365) 1290 303.4 6.9e-82
XP_005245878 (OMIM: 230000,612280) PREDICTED: tiss ( 341) 1286 302.4 1.2e-81
XP_011539469 (OMIM: 230000,612280) PREDICTED: tiss ( 255)  939 223.0 7.4e-58

>>NP_114409 (OMIM: 136820) plasma alpha-L-fucosidase pre (467 aa)
 initn: 3322 init1: 3322 opt: 3322 Z-score: 4099.6 bits: 767.9 E(85289): 0
Smith-Waterman score: 3322; 99.6% identity (100.0% similar) in 467 aa overlap (1-467:1-467)

               10        20        30        40        50        60
pF1KE5 MRPQELPRLAFPLLLLLLLLLPPPPCPAHSATRFDPTWESLDARQLPAWFDQAKFGIFIH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_114 MRPQELPRLAFPLLLLLLLLLPPPPCPAHSATRFDPTWESLDARQLPAWFDQAKFGIFIH
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 WGVFSVPSFGSEWFWWYWQKEKIPKYVEFMKDNYPPSFKYEDFGPLFTAKFFNANQWADI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_114 WGVFSVPSFGSEWFWWYWQKEKIPKYVEFMKDNYPPSFKYEDFGPLFTAKFFNANQWADI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 FQASGAKYIVLTSKHHEGFTLWGSEYSWNWNAIDEGPKRDIVKELEVAIRNRTDLRFGLY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_114 FQASGAKYIVLTSKHHEGFTLWGSEYSWNWNAIDEGPKRDIVKELEVAIRNRTDLRFGLY
130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 YSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQYWNS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 YSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQYWNS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE5 TGFLAWLYNESPVRGTVVTNDRWGAGSICKHGGFYTCSDRYNPGHLLPHKWENCMTIDKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 TGFLAWLYNESPVRGTVVTNDRWGAGSICKHGGFYTCSDRYNPGHLLPHKWENCMTIDKL 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE5 SWGYRREAGISDYLTIEELVKQLVETVSCGGNLLMNIGPTLDGTISVVFEERLRQVGSWL :::::::::::::::::::::::::::::::::::::::::::::::::::::::.:::: NP_114 SWGYRREAGISDYLTIEELVKQLVETVSCGGNLLMNIGPTLDGTISVVFEERLRQMGSWL 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE5 KVNGEAIYETYTWRSQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAILGAT ::::::::::.::::::::::::::::::::::::::::::::::::::::::::::::: NP_114 KVNGEAIYETHTWRSQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAILGAT 370 380 390 400 410 420 430 440 450 460 pF1KE5 EVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCKWGWALALTNVI ::::::::::::::::::::::::::::::::::::::::::::::: NP_114 EVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCKWGWALALTNVI 430 440 450 460 >>NP_000138 (OMIM: 230000,612280) tissue alpha-L-fucosid (466 aa) initn: 1076 init1: 931 opt: 1866 Z-score: 2301.2 bits: 435.1 E(85289): 1.9e-121 Smith-Waterman score: 1866; 54.8% identity (76.8% similar) in 469 aa overlap (1-466:1-465) 10 20 30 40 50 pF1KE5 MR-PQELPRLAFPLLLLLLLLLPPPPCP--AHSATRFDPTWESLDARQLPAWFDQAKFGI :: : : : : ::::::.: :. :. : : :::.: ::::::.::::. NP_000 MRAPGMRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 FIHWGVFSVPSFGSEWFWWYWQKEKIPKYVEFMKDNYPPSFKYEDFGPLFTAKFFNANQW ::::::::::..:::::::.:: : :.: .::.:::::.:.: :::: :::.::. 
..: NP_000 FIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEW 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 ADIFQASGAKYIVLTSKHHEGFTLWGSEYSWNWNAIDEGPKRDIVKELEVAIRNRTDLRF ::.:::.::::.:::.::::::: : : :::::. : ::.::.: :: .:.:.: ..:. NP_000 ADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKR-NIRY 130 140 150 160 170 180 190 200 210 220 230 pF1KE5 GLYYSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQY :::.::.::::::.: :....:. ..: .::.::::.:::.:.:...::::. :: : NP_000 GLYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTY 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 WNSTGFLAWLYNESPVRGTVVTNDRWGAGSICKHGGFYTCSDRYNPGHLLPHKWENCMTI ::::.::.::::.:::. ::.::::: . :.:::.:.: :...: : :::: : .: NP_000 WNSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSI 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE5 DKLSWGYRREAGISDYLTIEELVKQLVETVSCGGNLLMNIGPTLDGTISVVFEERLRQVG ::.::::::. ..:: :....::.::: ::: :.::::: :: : .:.::: :: NP_000 DKFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVG 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE5 SWLKVNGEAIYETYTWRSQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAIL .::..:::::: . :: : . : .:::::: . ::::::.:: .: : : : NP_000 KWLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSA--VYAIFLHWPENGVLNLESP-ITT 360 370 380 390 400 410 420 430 440 450 460 pF1KE5 GATEVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCKWGWALALTNVI ..:.. .:: :.: . ..:... :::: .: ...:.. ::.: NP_000 STTKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK 420 430 440 450 460 >>XP_016856394 (OMIM: 230000,612280) PREDICTED: tissue a (365 aa) initn: 1076 init1: 931 opt: 1290 Z-score: 1591.2 bits: 303.4 E(85289): 6.9e-82 Smith-Waterman score: 1290; 49.7% identity (74.3% similar) in 362 aa overlap (105-466:7-364) 80 90 100 110 120 130 pF1KE5 WWYWQKEKIPKYVEFMKDNYPPSFKYEDFGPLFTAKFFNANQWADIFQASGAKYIVLTSK :: . . .:.. 
.:.:::.: XP_016 MTSRLFPLHQGLQTGQRDWSSASFLRPMRYVVLTTK 10 20 30 140 150 160 170 180 190 pF1KE5 HHEGFTLWGSEYSWNWNAIDEGPKRDIVKELEVAIRNRTDLRFGLYYSLFEWFHPLFLED :::::: : : :::::. : ::.::.: :: .:.:.: ..:.:::.::.::::::.: : XP_016 HHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKR-NIRYGLYHSLLEWFHPLYLLD 40 50 60 70 80 90 200 210 220 230 240 250 pF1KE5 ESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQYWNSTGFLAWLYNESPVR ....:. ..: .::.::::.:::.:.:...::::. :: :::::.::.::::.:::. XP_016 KKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYWNSTNFLSWLYNDSPVK 100 110 120 130 140 150 260 270 280 290 300 310 pF1KE5 GTVVTNDRWGAGSICKHGGFYTCSDRYNPGHLLPHKWENCMTIDKLSWGYRREAGISDYL ::.::::: . :.:::.:.: :...: : :::: : .:::.::::::. ..:: XP_016 DEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSIDKFSWGYRRDMALSDVT 160 170 180 190 200 210 320 330 340 350 360 370 pF1KE5 TIEELVKQLVETVSCGGNLLMNIGPTLDGTISVVFEERLRQVGSWLKVNGEAIYETYTWR :....::.::: ::: :.::::: :: : .:.::: ::.::..:::::: . :: XP_016 EESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGKWLSINGEAIYASKPWR 220 230 240 250 260 270 380 390 400 410 420 430 pF1KE5 SQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAILGATEVKLLGHGQPLNWI : . : .:::::: . ::::::.:: .: : : : ..:.. .:: :.: XP_016 VQWEKNTTSVWYTSKGSA--VYAIFLHWPENGVLNLESP-ITTSTTKITMLGIQGDLKWS 280 290 300 310 320 330 440 450 460 pF1KE5 SLEQNGIMVELPQLTIHQMPCKWGWALALTNVI . ..:... :::: .: ...:.. ::.: XP_016 TDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK 340 350 360 >>XP_005245878 (OMIM: 230000,612280) PREDICTED: tissue a (341 aa) initn: 1076 init1: 931 opt: 1286 Z-score: 1586.6 bits: 302.4 E(85289): 1.2e-81 Smith-Waterman score: 1286; 52.1% identity (76.8% similar) in 340 aa overlap (127-466:5-340) 100 110 120 130 140 150 pF1KE5 SFKYEDFGPLFTAKFFNANQWADIFQASGAKYIVLTSKHHEGFTLWGSEYSWNWNAIDEG .:.:::.::::::: : : :::::. : : XP_005 MRRRRYVVLTTKHHEGFTNWPSPVSWNWNSKDVG 10 20 30 160 170 180 190 200 210 pF1KE5 PKRDIVKELEVAIRNRTDLRFGLYYSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYEL :.::.: :: .:.:.: ..:.:::.::.::::::.: :....:. 
..: .::.::::.: XP_005 PHRDLVGELGTALRKR-NIRYGLYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDL 40 50 60 70 80 90 220 230 240 250 260 270 pF1KE5 VNNYQPEVLWSDGDGGAPDQYWNSTGFLAWLYNESPVRGTVVTNDRWGAGSICKHGGFYT ::.:.:...::::. :: :::::.::.::::.:::. ::.::::: . :.:::.:. XP_005 VNSYKPDLIWSDGEWECPDTYWNSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYN 100 110 120 130 140 150 280 290 300 310 320 330 pF1KE5 CSDRYNPGHLLPHKWENCMTIDKLSWGYRREAGISDYLTIEELVKQLVETVSCGGNLLMN : :...: : :::: : .:::.::::::. ..:: :....::.::: ::: :.: XP_005 CEDKFKPQSLPDHKWEMCTSIDKFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLN 160 170 180 190 200 210 340 350 360 370 380 390 pF1KE5 IGPTLDGTISVVFEERLRQVGSWLKVNGEAIYETYTWRSQNDTVTPDVWYTSKPKEKLVY :::: :: : .:.::: ::.::..:::::: . :: : . : .:::::: . :: XP_005 IGPTKDGLIVPIFQERLLAVGKWLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSA--VY 220 230 240 250 260 270 400 410 420 430 440 450 pF1KE5 AIFLKWPTSGQLFLGHPKAILGATEVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCK ::::.:: .: : : : ..:.. .:: :.: . ..:... :::: .: . XP_005 AIFLHWPENGVLNLESP-ITTSTTKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAE 280 290 300 310 320 330 460 pF1KE5 WGWALALTNVI ..:.. ::.: XP_005 FAWTIKLTGVK 340 >>XP_011539469 (OMIM: 230000,612280) PREDICTED: tissue a (255 aa) initn: 931 init1: 786 opt: 939 Z-score: 1159.7 bits: 223.0 E(85289): 7.4e-58 Smith-Waterman score: 939; 50.6% identity (74.3% similar) in 257 aa overlap (210-466:1-254) 180 190 200 210 220 230 pF1KE5 YYSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQYWN .::::.:::.:.:...::::. :: ::: XP_011 MPELYDLVNSYKPDLIWSDGEWECPDTYWN 10 20 30 240 250 260 270 280 290 pF1KE5 STGFLAWLYNESPVRGTVVTNDRWGAGSICKHGGFYTCSDRYNPGHLLPHKWENCMTIDK ::.::.::::.:::. ::.::::: . :.:::.:.: :...: : :::: : .::: XP_011 STNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSIDK 40 50 60 70 80 90 300 310 320 330 340 350 pF1KE5 LSWGYRREAGISDYLTIEELVKQLVETVSCGGNLLMNIGPTLDGTISVVFEERLRQVGSW .::::::. 
..:: :....::.::: ::: :.::::: :: : .:.::: ::.: XP_011 FSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGKW 100 110 120 130 140 150 360 370 380 390 400 410 pF1KE5 LKVNGEAIYETYTWRSQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAILGA :..:::::: . :: : . : .:::::: . ::::::.:: .: : : : .. XP_011 LSINGEAIYASKPWRVQWEKNTTSVWYTSKGSA--VYAIFLHWPENGVLNLESP-ITTST 160 170 180 190 200 420 430 440 450 460 pF1KE5 TEVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCKWGWALALTNVI :.. .:: :.: . ..:... :::: .: ...:.. ::.: XP_011 TKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVK 210 220 230 240 250 467 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 01:33:41 2016 done: Tue Nov 8 01:33:42 2016 Total Scan time: 8.420 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]
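
Each hit in the report above carries a summary of the form ">>accession ... (N aa) initn: ... opt: ... Z-score: ... bits: ... E(...): ..." followed by a "Smith-Waterman score" line. A minimal sketch of pulling those fields into records is given below. It is not part of the FASTA package: the pattern HIT_RE, the function parse_hits, and the dictionary keys are hypothetical names chosen for illustration, and the pattern assumes only the field layout visible in this report. Whitespace between fields is matched loosely so that the same pattern should work whether the report keeps its original line breaks or has been flattened onto long lines.

```python
import re
from typing import Dict, List

# Pattern for one per-hit summary as it appears in this report.
# (?<!>) and [^\s>]+ keep the "1>>>pF1KE5533" query line from matching.
# [^>]*? stops the description from running into the next ">>" header.
HIT_RE = re.compile(
    r"(?<!>)>>(?P<acc>[^\s>]+)\s[^>]*?\(\s*(?P<length>\d+) aa\)\s+"
    r"initn:\s*(?P<initn>\d+)\s+init1:\s*(?P<init1>\d+)\s+"
    r"opt:\s*(?P<opt>\d+)\s+"
    r"Z-score:\s*(?P<zscore>[\d.]+)\s+bits:\s*(?P<bits>[\d.]+)\s+"
    r"E\(\d+\):\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Smith-Waterman score:\s*\d+;\s*(?P<identity>[\d.]+)% identity\s+"
    r"\((?P<similar>[\d.]+)% similar\)\s+in\s+(?P<overlap>\d+) aa overlap\s+"
    r"\((?P<qstart>\d+)-(?P<qend>\d+):(?P<sstart>\d+)-(?P<send>\d+)\)"
)


def parse_hits(report: str) -> List[Dict[str, object]]:
    """Return one record per '>>' hit summary found in the report text."""
    hits = []
    for m in HIT_RE.finditer(report):
        hits.append({
            "accession": m["acc"],
            "length": int(m["length"]),
            "opt": int(m["opt"]),
            "z_score": float(m["zscore"]),
            "bits": float(m["bits"]),
            "e_value": float(m["evalue"]),
            "identity_pct": float(m["identity"]),
            "similar_pct": float(m["similar"]),
            "overlap": int(m["overlap"]),
            # (start, end) coordinates of the aligned region
            "query_range": (int(m["qstart"]), int(m["qend"])),
            "subject_range": (int(m["sstart"]), int(m["send"])),
        })
    return hits


if __name__ == "__main__":
    # Two summaries copied from the report above, in flattened form.
    sample = (
        "Query: pF1KE5533, 467 aa 1>>>pF1KE5533 467 - 467 aa - 467 aa "
        ">>NP_114409 (OMIM: 136820) plasma alpha-L-fucosidase pre (467 aa) "
        "initn: 3322 init1: 3322 opt: 3322 Z-score: 4099.6 bits: 767.9 "
        "E(85289): 0 Smith-Waterman score: 3322; 99.6% identity "
        "(100.0% similar) in 467 aa overlap (1-467:1-467) "
        ">>NP_000138 (OMIM: 230000,612280) tissue alpha-L-fucosid (466 aa) "
        "initn: 1076 init1: 931 opt: 1866 Z-score: 2301.2 bits: 435.1 "
        "E(85289): 1.9e-121 Smith-Waterman score: 1866; 54.8% identity "
        "(76.8% similar) in 469 aa overlap (1-466:1-465)"
    )
    for hit in parse_hits(sample):
        print(hit["accession"], hit["opt"], hit["bits"], hit["e_value"])
```

Run against the whole report, parse_hits should yield one record per ">>" block; the aligned residues themselves are not parsed, because their column spacing may not survive flattening.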