FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4554, 669 aa 1>>>pF1KE4554 669 - 669 aa - 669 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2631+/-0.000418; mu= 19.6994+/- 0.026 mean_var=65.9806+/-13.783, 0's: 0 Z-trim(110.9): 7 B-trim: 860 in 1/52 Lambda= 0.157894 statistics sampled from 19303 (19309) to 19303 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.575), E-opt: 0.2 (0.226), width: 16 Scan time: 10.930 The best scores are: opt bits E(85289) NP_000144 (OMIM: 245200,606890) galactocerebrosida ( 685) 4572 1050.9 0 XP_011534920 (OMIM: 245200,606890) PREDICTED: gala ( 629) 4251 977.8 0 NP_001188331 (OMIM: 245200,606890) galactocerebros ( 659) 4251 977.8 0 NP_001188330 (OMIM: 245200,606890) galactocerebros ( 662) 4101 943.6 0 >>NP_000144 (OMIM: 245200,606890) galactocerebrosidase i (685 aa) initn: 4572 init1: 4572 opt: 4572 Z-score: 5623.6 bits: 1050.9 E(85289): 0 Smith-Waterman score: 4572; 99.7% identity (99.9% similar) in 669 aa overlap (1-669:17-685) 10 20 30 40 pF1KE4 MTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDGLGREFDGIGA :::::::::::::::::::::::::::::::::::::::::::: NP_000 MAEWLLSASWQRRAKAMTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDGLGREFDGIGA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 VSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASLHILKVEIGGDGQTTDGTEPSHMHYA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 VSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASLHILKVEIGGDGQTTDGTEPSHMHYA 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 LDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPGWLGKGFDWPYVNLQLTAYYVVTWIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 LDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPGWLGKGFDWPYVNLQLTAYYVVTWIV 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE4 GAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNYQGLQRVKIIASDNLWESISASMLLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 GAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNYQGLQRVKIIASDNLWESISASMLLD 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE4 AELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSEDFSTLNSDMGAGCWGRILNQNYINGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 AELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSEDFSTLNSDMGAGCWGRILNQNYINGY 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE4 MTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESPVWVSAHTTQFTQPGWYYLKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 MTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESPVWVSAHTTQFTQPGWYYLKT 310 320 330 340 350 360 350 360 370 380 390 400 pF1KE4 VGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCIRPFLPYFNVSQQFATFVLKGSFSEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 VGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCIRPFLPYFNVSQQFATFVLKGSFSEI 370 380 390 400 410 420 410 420 430 440 450 460 pF1KE4 PELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFTLSLHEDELFTLTTLTTGRKGSYPLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 PELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFTLSLHEDELFTLTTLTTGRKGSYPLP 430 440 450 460 470 480 470 480 490 500 510 520 pF1KE4 PKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFEYFTNIEDPGEHHFTLRQVLNQRPIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_000 PKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFEYFTNIEDPGEHHFTLRQVLNQRPIT 490 500 510 520 530 540 530 540 550 560 570 580 pF1KE4 WAADASNTISIIGDYNWTNLTTKCDVYIETPDTGGVFIAGRVNKGGILIRSARGIFFWIF ::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::: NP_000 WAADASNTISIIGDYNWTNLTIKCDVYIETPDTGGVFIAGRVNKGGILIRSARGIFFWIF 550 560 570 580 590 600 590 600 610 620 630 640 pF1KE4 ANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLTIKGHFASGMLNDKSLWTDIPVNFPK ::::::::::::::::::::::::::::::::::::::::.::::::::::::::::::: NP_000 ANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLTIKGHFTSGMLNDKSLWTDIPVNFPK 610 620 630 640 650 660 650 660 pF1KE4 NGWAAIGTHSFEFAQFDNFLVEATR ::::::::::::::::::::::::: NP_000 NGWAAIGTHSFEFAQFDNFLVEATR 670 680 >>XP_011534920 (OMIM: 245200,606890) PREDICTED: galactoc (629 aa) initn: 4251 init1: 4251 opt: 4251 Z-score: 5229.0 bits: 977.8 E(85289): 0 Smith-Waterman score: 4251; 99.7% identity (99.8% similar) in 620 aa overlap (50-669:10-629) 20 30 40 50 60 70 pF1KE4 LLAPGGAYVLDDSDGLGREFDGIGAVSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASL :::::::::::::::::::::::::::::: XP_011 MGFMVADLWATSRLLVNYPEPYRSQILDYLFKPNFGASL 10 20 30 80 90 100 110 120 130 pF1KE4 HILKVEIGGDGQTTDGTEPSHMHYALDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 HILKVEIGGDGQTTDGTEPSHMHYALDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPG 40 50 60 70 80 90 140 150 160 170 180 190 pF1KE4 WLGKGFDWPYVNLQLTAYYVVTWIVGAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 WLGKGFDWPYVNLQLTAYYVVTWIVGAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNY 100 110 120 130 140 150 200 210 220 230 240 250 pF1KE4 QGLQRVKIIASDNLWESISASMLLDAELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 QGLQRVKIIASDNLWESISASMLLDAELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSED 160 170 180 190 200 210 260 270 280 290 300 310 pF1KE4 FSTLNSDMGAGCWGRILNQNYINGYMTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 FSTLNSDMGAGCWGRILNQNYINGYMTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHY 220 230 240 250 260 270 320 330 340 350 360 370 pF1KE4 VVESPVWVSAHTTQFTQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 VVESPVWVSAHTTQFTQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCI 280 290 300 310 320 330 380 390 400 410 420 430 pF1KE4 RPFLPYFNVSQQFATFVLKGSFSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 RPFLPYFNVSQQFATFVLKGSFSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFT 340 350 360 370 380 390 440 450 460 470 480 490 pF1KE4 LSLHEDELFTLTTLTTGRKGSYPLPPKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 LSLHEDELFTLTTLTTGRKGSYPLPPKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFE 400 410 420 430 440 450 500 510 520 530 540 550 pF1KE4 YFTNIEDPGEHHFTLRQVLNQRPITWAADASNTISIIGDYNWTNLTTKCDVYIETPDTGG :::::::::::::::::::::::::::::::::::::::::::::: ::::::::::::: XP_011 YFTNIEDPGEHHFTLRQVLNQRPITWAADASNTISIIGDYNWTNLTIKCDVYIETPDTGG 460 470 480 490 500 510 560 570 580 590 600 610 pF1KE4 VFIAGRVNKGGILIRSARGIFFWIFANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 VFIAGRVNKGGILIRSARGIFFWIFANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLT 520 530 540 550 560 570 620 630 640 650 660 pF1KE4 IKGHFASGMLNDKSLWTDIPVNFPKNGWAAIGTHSFEFAQFDNFLVEATR :::::.:::::::::::::::::::::::::::::::::::::::::::: XP_011 IKGHFTSGMLNDKSLWTDIPVNFPKNGWAAIGTHSFEFAQFDNFLVEATR 580 590 600 610 620 >>NP_001188331 (OMIM: 245200,606890) galactocerebrosidas (659 aa) initn: 4251 init1: 4251 opt: 4251 Z-score: 5228.7 bits: 977.8 E(85289): 0 Smith-Waterman score: 4251; 99.7% identity (99.8% similar) in 620 aa overlap (50-669:40-659) 20 30 40 50 60 70 pF1KE4 LLAPGGAYVLDDSDGLGREFDGIGAVSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASL :::::::::::::::::::::::::::::: NP_001 THGPLPLADLGIHLPCVKVLHQVTPEEKPAATSRLLVNYPEPYRSQILDYLFKPNFGASL 10 20 30 40 50 60 80 90 100 110 120 130 pF1KE4 HILKVEIGGDGQTTDGTEPSHMHYALDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HILKVEIGGDGQTTDGTEPSHMHYALDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPG 70 80 90 100 110 120 140 150 160 170 180 190 pF1KE4 WLGKGFDWPYVNLQLTAYYVVTWIVGAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 WLGKGFDWPYVNLQLTAYYVVTWIVGAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNY 130 140 150 160 170 180 200 210 220 230 240 250 pF1KE4 QGLQRVKIIASDNLWESISASMLLDAELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 QGLQRVKIIASDNLWESISASMLLDAELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSED 190 200 210 220 230 240 260 270 280 290 300 310 pF1KE4 FSTLNSDMGAGCWGRILNQNYINGYMTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 FSTLNSDMGAGCWGRILNQNYINGYMTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHY 250 260 270 280 290 300 320 330 340 350 360 370 pF1KE4 VVESPVWVSAHTTQFTQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VVESPVWVSAHTTQFTQPGWYYLKTVGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCI 310 320 330 340 350 360 380 390 400 410 420 430 pF1KE4 RPFLPYFNVSQQFATFVLKGSFSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 RPFLPYFNVSQQFATFVLKGSFSEIPELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFT 370 380 390 400 410 420 440 450 460 470 480 490 pF1KE4 LSLHEDELFTLTTLTTGRKGSYPLPPKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LSLHEDELFTLTTLTTGRKGSYPLPPKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFE 430 440 450 460 470 480 500 510 520 530 540 550 pF1KE4 YFTNIEDPGEHHFTLRQVLNQRPITWAADASNTISIIGDYNWTNLTTKCDVYIETPDTGG :::::::::::::::::::::::::::::::::::::::::::::: ::::::::::::: NP_001 YFTNIEDPGEHHFTLRQVLNQRPITWAADASNTISIIGDYNWTNLTIKCDVYIETPDTGG 490 500 510 520 530 540 560 570 580 590 600 610 pF1KE4 VFIAGRVNKGGILIRSARGIFFWIFANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VFIAGRVNKGGILIRSARGIFFWIFANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLT 550 560 570 580 590 600 620 630 640 650 660 pF1KE4 IKGHFASGMLNDKSLWTDIPVNFPKNGWAAIGTHSFEFAQFDNFLVEATR :::::.:::::::::::::::::::::::::::::::::::::::::::: NP_001 IKGHFTSGMLNDKSLWTDIPVNFPKNGWAAIGTHSFEFAQFDNFLVEATR 610 620 630 640 650 >>NP_001188330 (OMIM: 245200,606890) galactocerebrosidas (662 aa) initn: 4408 init1: 4101 opt: 4101 Z-score: 5044.0 bits: 943.6 E(85289): 0 Smith-Waterman score: 4366; 96.3% identity (96.4% similar) in 669 aa overlap (1-669:17-662) 10 20 30 40 pF1KE4 MTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDGLGREFDGIGA :::::::::::::::::::::::::::::::::::::::::::: NP_001 MAEWLLSASWQRRAKAMTAAAGSAGRAAVPLLLCALLAPGGAYVLDDSDGLGREFDGIGA 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 VSGGGATSRLLVNYPEPYRSQILDYLFKPNFGASLHILKVEIGGDGQTTDGTEPSHMHYA ::::: :::::::::::::::::::::::::::::::: NP_001 VSGGG-----------------------PNFGASLHILKVEIGGDGQTTDGTEPSHMHYA 70 80 90 110 120 130 140 150 160 pF1KE4 LDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPGWLGKGFDWPYVNLQLTAYYVVTWIV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 LDENYFRGYEWWLMKEAKKRNPNITLIGLPWSFPGWLGKGFDWPYVNLQLTAYYVVTWIV 100 110 120 130 140 150 170 180 190 200 210 220 pF1KE4 GAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNYQGLQRVKIIASDNLWESISASMLLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GAKRYHDLDIDYIGIWNERSYNANYIKILRKMLNYQGLQRVKIIASDNLWESISASMLLD 160 170 180 190 200 210 230 240 250 260 270 280 pF1KE4 AELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSEDFSTLNSDMGAGCWGRILNQNYINGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AELFKVVDVIGAHYPGTHSAKDAKLTGKKLWSSEDFSTLNSDMGAGCWGRILNQNYINGY 220 230 240 250 260 270 290 300 310 320 330 340 pF1KE4 MTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESPVWVSAHTTQFTQPGWYYLKT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MTSTIAWNLVASYYEQLPYGRCGLMTAQEPWSGHYVVESPVWVSAHTTQFTQPGWYYLKT 280 290 300 310 320 330 350 360 370 380 390 400 pF1KE4 VGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCIRPFLPYFNVSQQFATFVLKGSFSEI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 VGHLEKGGSYVALTDGLGNLTIIIETMSHKHSKCIRPFLPYFNVSQQFATFVLKGSFSEI 340 350 360 370 380 390 410 420 430 440 450 460 pF1KE4 PELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFTLSLHEDELFTLTTLTTGRKGSYPLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PELQVWYTKLGKTSERFLFKQLDSLWLLDSDGSFTLSLHEDELFTLTTLTTGRKGSYPLP 400 410 420 430 440 450 470 480 490 500 510 520 pF1KE4 PKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFEYFTNIEDPGEHHFTLRQVLNQRPIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PKSQPFPSTYKDDFNVDYPFFSEAPNFADQTGVFEYFTNIEDPGEHHFTLRQVLNQRPIT 460 470 480 490 500 510 530 540 550 560 570 580 pF1KE4 WAADASNTISIIGDYNWTNLTTKCDVYIETPDTGGVFIAGRVNKGGILIRSARGIFFWIF ::::::::::::::::::::: :::::::::::::::::::::::::::::::::::::: NP_001 WAADASNTISIIGDYNWTNLTIKCDVYIETPDTGGVFIAGRVNKGGILIRSARGIFFWIF 520 530 540 550 560 570 590 600 610 620 630 640 pF1KE4 ANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLTIKGHFASGMLNDKSLWTDIPVNFPK ::::::::::::::::::::::::::::::::::::::::.::::::::::::::::::: NP_001 ANGSYRVTGDLAGWIIYALGRVEVTAKKWYTLTLTIKGHFTSGMLNDKSLWTDIPVNFPK 580 590 600 610 620 630 650 660 pF1KE4 NGWAAIGTHSFEFAQFDNFLVEATR ::::::::::::::::::::::::: NP_001 NGWAAIGTHSFEFAQFDNFLVEATR 640 650 660 669 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 18:36:25 2016 done: Mon Nov 7 18:36:26 2016 Total Scan time: 10.930 Total Display time: 0.060 Function used was FASTA [36.3.4 Apr, 2011]