FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE1230, 343 aa
 1>>>pF1KE1230 343 - 343 aa - 343 aa
Library: /omim/omim.rfq.tfa
 60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.2740+/-0.000419; mu= 16.1566+/- 0.026
 mean_var=59.6592+/-11.983, 0's: 0 Z-trim(111.0): 12 B-trim: 343 in 1/48
 Lambda= 0.166049
 statistics sampled from 19425 (19435) to 19425 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.592), E-opt: 0.2 (0.228), width: 16
 Scan time: 5.250

The best scores are:                                       opt bits E(85289)
NP_000502 (OMIM: 182100,612542) galactoside 2-alph ( 343) 2362 574.4 1.2e-163
NP_001091107 (OMIM: 182100,612542) galactoside 2-a ( 343) 2362 574.4 1.2e-163
NP_000139 (OMIM: 211100,616754) galactoside 2-alph ( 365) 1413 347.1 3.5e-95
NP_001316806 (OMIM: 211100,616754) galactoside 2-a ( 365) 1413 347.1 3.5e-95
XP_016882042 (OMIM: 211100,616754) PREDICTED: gala ( 365) 1413 347.1 3.5e-95
XP_006723190 (OMIM: 211100,616754) PREDICTED: gala ( 488) 1413 347.1 4.6e-95
XP_016882041 (OMIM: 211100,616754) PREDICTED: gala ( 488) 1413 347.1 4.6e-95

>>NP_000502 (OMIM: 182100,612542) galactoside 2-alpha-L- (343 aa)
 initn: 2362 init1: 2362 opt: 2362 Z-score: 3058.8 bits: 574.4 E(85289): 1.2e-163
Smith-Waterman score: 2362; 100.0% identity (100.0% similar) in 343 aa overlap (1-343:1-343)

               10        20        30        40        50        60
pF1KE1 MLVVQMPFSFPMAHFILFVFTVSTIFHVQQRLAKIQAMWELPVQIPVLASTSKALGPSQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 MLVVQMPFSFPMAHFILFVFTVSTIFHVQQRLAKIQAMWELPVQIPVLASTSKALGPSQL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 RGMWTINAIGRLGNQMGEYATLYALAKMNGRPAFIPAQMHSTLAPIFRITLPVLHSATAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 RGMWTINAIGRLGNQMGEYATLYALAKMNGRPAFIPAQMHSTLAPIFRITLPVLHSATAS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 RIPWQNYHLNDWMEEEYRHIPGEYVRFTGYPCSWTFYHHLRQEILQEFTLHDHVREEAQK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 RIPWQNYHLNDWMEEEYRHIPGEYVRFTGYPCSWTFYHHLRQEILQEFTLHDHVREEAQK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 FLRGLQVNGSRPGTFVGVHVRRGDYVHVMPKVWKGVVADRRYLQQALDWFRARYSSLIFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 FLRGLQVNGSRPGTFVGVHVRRGDYVHVMPKVWKGVVADRRYLQQALDWFRARYSSLIFV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE1 VTSNGMAWCRENIDTSHGDVVFAGDGIEGSPAKDFALLTQCNHTIMTIGTFGIWAAYLTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_000 VTSNGMAWCRENIDTSHGDVVFAGDGIEGSPAKDFALLTQCNHTIMTIGTFGIWAAYLTG
              250       260       270       280       290       300

              310       320       330       340
pF1KE1 GDTIYLANYTLPDSPFLKIFKPEAAFLPEWTGIAADLSPLLKH
       :::::::::::::::::::::::::::::::::::::::::::
NP_000 GDTIYLANYTLPDSPFLKIFKPEAAFLPEWTGIAADLSPLLKH
              310       320       330       340

>>NP_001091107 (OMIM: 182100,612542) galactoside 2-alpha (343 aa)
 initn: 2362 init1: 2362 opt: 2362 Z-score: 3058.8 bits: 574.4 E(85289): 1.2e-163
Smith-Waterman score: 2362; 100.0% identity (100.0% similar) in 343 aa overlap (1-343:1-343)

               10        20        30        40        50        60
pF1KE1 MLVVQMPFSFPMAHFILFVFTVSTIFHVQQRLAKIQAMWELPVQIPVLASTSKALGPSQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MLVVQMPFSFPMAHFILFVFTVSTIFHVQQRLAKIQAMWELPVQIPVLASTSKALGPSQL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE1 RGMWTINAIGRLGNQMGEYATLYALAKMNGRPAFIPAQMHSTLAPIFRITLPVLHSATAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RGMWTINAIGRLGNQMGEYATLYALAKMNGRPAFIPAQMHSTLAPIFRITLPVLHSATAS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE1 RIPWQNYHLNDWMEEEYRHIPGEYVRFTGYPCSWTFYHHLRQEILQEFTLHDHVREEAQK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 RIPWQNYHLNDWMEEEYRHIPGEYVRFTGYPCSWTFYHHLRQEILQEFTLHDHVREEAQK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE1 FLRGLQVNGSRPGTFVGVHVRRGDYVHVMPKVWKGVVADRRYLQQALDWFRARYSSLIFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 FLRGLQVNGSRPGTFVGVHVRRGDYVHVMPKVWKGVVADRRYLQQALDWFRARYSSLIFV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE1 VTSNGMAWCRENIDTSHGDVVFAGDGIEGSPAKDFALLTQCNHTIMTIGTFGIWAAYLTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 VTSNGMAWCRENIDTSHGDVVFAGDGIEGSPAKDFALLTQCNHTIMTIGTFGIWAAYLTG
              250       260       270       280       290       300

              310       320       330       340
pF1KE1 GDTIYLANYTLPDSPFLKIFKPEAAFLPEWTGIAADLSPLLKH
       :::::::::::::::::::::::::::::::::::::::::::
NP_001 GDTIYLANYTLPDSPFLKIFKPEAAFLPEWTGIAADLSPLLKH
              310       320       330       340

>>NP_000139 (OMIM: 211100,616754) galactoside 2-alpha-L- (365 aa)
 initn: 1436 init1: 866 opt: 1413 Z-score: 1829.7 bits: 347.1 E(85289): 3.5e-95
Smith-Waterman score: 1413; 67.7% identity (85.4% similar) in 294 aa overlap (49-340:66-359)

       20        30        40        50        60        70
pF1KE1 VFTVSTIFHVQQRLAKIQAMWELPVQIPVLASTSKALGPSQLRGMWTINAIGRLGNQMGE
                                     ::.:    :..: : ::.   ::.:::::.
NP_000 LGLSILCPDRRLVTPPVAIFCLPGTAMGPNASSSCPQHPASLSGTWTVYPNGRFGNQMGQ
          40        50        60        70        80        90

       80        90       100       110       120       130
pF1KE1 YATLYALAKMNGRPAFIPAQMHSTLAPIFRITLPVLHSATASRIPWQNYHLNDWMEEEYR
       :::: :::..::: :::   ::..:::.::::::::   . :: ::.. .:.::: :::
NP_000 YATLLALAQLNGRRAFILPAMHAALAPVFRITLPVLAPEVDSRTPWRELQLHDWMSEEYA
         100       110       120       130       140       150

      140       150       160       170       180         190
pF1KE1 HIPGEYVRFTGYPCSWTFYHHLRQEILQEFTLHDHVREEAQKFLRGLQVN--GSRPGTFV
        .   .....:.::::::.::::..: .:::::::.:::::. :  :...  :.:: :::
NP_000 DLRDPFLKLSGFPCSWTFFHHLREQIRREFTLHDHLREEAQSVLGQLRLGRTGDRPRTFV
         160       170       180       190       200       210

        200       210       220       230       240       250
pF1KE1 GVHVRRGDYVHVMPKVWKGVVADRRYLQQALDWFRARYSSLIFVVTSNGMAWCRENIDTS
       :::::::::..:::. :::::.:  ::.::.::::::. . .:::::::: ::.::::::
NP_000 GVHVRRGDYLQVMPQRWKGVVGDSAYLRQAMDWFRARHEAPVFVVTSNGMEWCKENIDTS
         220       230       240       250       260       270

        260       270       280       290       300       310
pF1KE1 HGDVVFAGDGIEGSPAKDFALLTQCNHTIMTIGTFGIWAAYLTGGDTIYLANYTLPDSPF
       .:::.::::: :..: ::::::::::::::::::::.:::::.::::.::::.::::: :
NP_000 QGDVTFAGDGQEATPWKDFALLTQCNHTIMTIGTFGFWAAYLAGGDTVYLANFTLPDSEF
         280       290       300       310       320       330

        320       330       340
pF1KE1 LKIFKPEAAFLPEWTGIAADLSPLLKH
       ::::::::::::::.:: ::::::
NP_000 LKIFKPEAAFLPEWVGINADLSPLWTLAKP
         340       350       360

>>NP_001316806 (OMIM: 211100,616754) galactoside 2-alpha (365 aa)
 initn: 1436 init1: 866 opt: 1413 Z-score: 1829.7 bits: 347.1 E(85289): 3.5e-95
Smith-Waterman score: 1413; 67.7% identity (85.4% similar) in 294 aa overlap (49-340:66-359)

       20        30        40        50        60        70
pF1KE1 VFTVSTIFHVQQRLAKIQAMWELPVQIPVLASTSKALGPSQLRGMWTINAIGRLGNQMGE
                                     ::.:    :..: : ::.   ::.:::::.
NP_001 LGLSILCPDRRLVTPPVAIFCLPGTAMGPNASSSCPQHPASLSGTWTVYPNGRFGNQMGQ
          40        50        60        70        80        90

       80        90       100       110       120       130
pF1KE1 YATLYALAKMNGRPAFIPAQMHSTLAPIFRITLPVLHSATASRIPWQNYHLNDWMEEEYR
       :::: :::..::: :::   ::..:::.::::::::   . :: ::.. .:.::: :::
NP_001 YATLLALAQLNGRRAFILPAMHAALAPVFRITLPVLAPEVDSRTPWRELQLHDWMSEEYA
         100       110       120       130       140       150

      140       150       160       170       180         190
pF1KE1 HIPGEYVRFTGYPCSWTFYHHLRQEILQEFTLHDHVREEAQKFLRGLQVN--GSRPGTFV
        .   .....:.::::::.::::..: .:::::::.:::::. :  :...  :.:: :::
NP_001 DLRDPFLKLSGFPCSWTFFHHLREQIRREFTLHDHLREEAQSVLGQLRLGRTGDRPRTFV
         160       170       180       190       200       210

        200       210       220       230       240       250
pF1KE1 GVHVRRGDYVHVMPKVWKGVVADRRYLQQALDWFRARYSSLIFVVTSNGMAWCRENIDTS
       :::::::::..:::. :::::.:  ::.::.::::::. . .:::::::: ::.::::::
NP_001 GVHVRRGDYLQVMPQRWKGVVGDSAYLRQAMDWFRARHEAPVFVVTSNGMEWCKENIDTS
         220       230       240       250       260       270

        260       270       280       290       300       310
pF1KE1 HGDVVFAGDGIEGSPAKDFALLTQCNHTIMTIGTFGIWAAYLTGGDTIYLANYTLPDSPF
       .:::.::::: :..: ::::::::::::::::::::.:::::.::::.::::.::::: :
NP_001 QGDVTFAGDGQEATPWKDFALLTQCNHTIMTIGTFGFWAAYLAGGDTVYLANFTLPDSEF
         280       290       300       310       320       330

        320       330       340
pF1KE1 LKIFKPEAAFLPEWTGIAADLSPLLKH
       ::::::::::::::.:: ::::::
NP_001 LKIFKPEAAFLPEWVGINADLSPLWTLAKP
         340       350       360

>>XP_016882042 (OMIM: 211100,616754) PREDICTED: galactos (365 aa)
 initn: 1436 init1: 866 opt: 1413 Z-score: 1829.7 bits: 347.1 E(85289): 3.5e-95
Smith-Waterman score: 1413; 67.7% identity (85.4% similar) in 294 aa overlap (49-340:66-359)

       20        30        40        50        60        70
pF1KE1 VFTVSTIFHVQQRLAKIQAMWELPVQIPVLASTSKALGPSQLRGMWTINAIGRLGNQMGE
                                     ::.:    :..: : ::.   ::.:::::.
XP_016 LGLSILCPDRRLVTPPVAIFCLPGTAMGPNASSSCPQHPASLSGTWTVYPNGRFGNQMGQ
          40        50        60        70        80        90

       80        90       100       110       120       130
pF1KE1 YATLYALAKMNGRPAFIPAQMHSTLAPIFRITLPVLHSATASRIPWQNYHLNDWMEEEYR
       :::: :::..::: :::   ::..:::.::::::::   . :: ::.. .:.::: :::
XP_016 YATLLALAQLNGRRAFILPAMHAALAPVFRITLPVLAPEVDSRTPWRELQLHDWMSEEYA
         100       110       120       130       140       150

      140       150       160       170       180         190
pF1KE1 HIPGEYVRFTGYPCSWTFYHHLRQEILQEFTLHDHVREEAQKFLRGLQVN--GSRPGTFV
        .   .....:.::::::.::::..: .:::::::.:::::. :  :...  :.:: :::
XP_016 DLRDPFLKLSGFPCSWTFFHHLREQIRREFTLHDHLREEAQSVLGQLRLGRTGDRPRTFV
         160       170       180       190       200       210

        200       210       220       230       240       250
pF1KE1 GVHVRRGDYVHVMPKVWKGVVADRRYLQQALDWFRARYSSLIFVVTSNGMAWCRENIDTS
       :::::::::..:::. :::::.:  ::.::.::::::. . .:::::::: ::.::::::
XP_016 GVHVRRGDYLQVMPQRWKGVVGDSAYLRQAMDWFRARHEAPVFVVTSNGMEWCKENIDTS
         220       230       240       250       260       270

        260       270       280       290       300       310
pF1KE1 HGDVVFAGDGIEGSPAKDFALLTQCNHTIMTIGTFGIWAAYLTGGDTIYLANYTLPDSPF
       .:::.::::: :..: ::::::::::::::::::::.:::::.::::.::::.::::: :
XP_016 QGDVTFAGDGQEATPWKDFALLTQCNHTIMTIGTFGFWAAYLAGGDTVYLANFTLPDSEF
         280       290       300       310       320       330

        320       330       340
pF1KE1 LKIFKPEAAFLPEWTGIAADLSPLLKH
       ::::::::::::::.:: ::::::
XP_016 LKIFKPEAAFLPEWVGINADLSPLWTLAKP
         340       350       360

>>XP_006723190 (OMIM: 211100,616754) PREDICTED: galactos (488 aa)
 initn: 1436 init1: 866 opt: 1413 Z-score: 1827.7 bits: 347.1 E(85289): 4.6e-95
Smith-Waterman score: 1413; 67.7% identity (85.4% similar) in 294 aa overlap (49-340:189-482)

       20        30        40        50        60        70
pF1KE1 VFTVSTIFHVQQRLAKIQAMWELPVQIPVLASTSKALGPSQLRGMWTINAIGRLGNQMGE
                                     ::.:    :..: : ::.   ::.:::::.
XP_006 LGLSILCPDRRLVTPPVAIFCLPGTAMGPNASSSCPQHPASLSGTWTVYPNGRFGNQMGQ
      160       170       180       190       200       210

       80        90       100       110       120       130
pF1KE1 YATLYALAKMNGRPAFIPAQMHSTLAPIFRITLPVLHSATASRIPWQNYHLNDWMEEEYR
       :::: :::..::: :::   ::..:::.::::::::   . :: ::.. .:.::: :::
XP_006 YATLLALAQLNGRRAFILPAMHAALAPVFRITLPVLAPEVDSRTPWRELQLHDWMSEEYA
      220       230       240       250       260       270

      140       150       160       170       180         190
pF1KE1 HIPGEYVRFTGYPCSWTFYHHLRQEILQEFTLHDHVREEAQKFLRGLQVN--GSRPGTFV
        .   .....:.::::::.::::..: .:::::::.:::::. :  :...  :.:: :::
XP_006 DLRDPFLKLSGFPCSWTFFHHLREQIRREFTLHDHLREEAQSVLGQLRLGRTGDRPRTFV
      280       290       300       310       320       330

        200       210       220       230       240       250
pF1KE1 GVHVRRGDYVHVMPKVWKGVVADRRYLQQALDWFRARYSSLIFVVTSNGMAWCRENIDTS
       :::::::::..:::. :::::.:  ::.::.::::::. . .:::::::: ::.::::::
XP_006 GVHVRRGDYLQVMPQRWKGVVGDSAYLRQAMDWFRARHEAPVFVVTSNGMEWCKENIDTS
      340       350       360       370       380       390

        260       270       280       290       300       310
pF1KE1 HGDVVFAGDGIEGSPAKDFALLTQCNHTIMTIGTFGIWAAYLTGGDTIYLANYTLPDSPF
       .:::.::::: :..: ::::::::::::::::::::.:::::.::::.::::.::::: :
XP_006 QGDVTFAGDGQEATPWKDFALLTQCNHTIMTIGTFGFWAAYLAGGDTVYLANFTLPDSEF
      400       410       420       430       440       450

        320       330       340
pF1KE1 LKIFKPEAAFLPEWTGIAADLSPLLKH
       ::::::::::::::.:: ::::::
XP_006 LKIFKPEAAFLPEWVGINADLSPLWTLAKP
      460       470       480

>>XP_016882041 (OMIM: 211100,616754) PREDICTED: galactos (488 aa)
 initn: 1436 init1: 866 opt: 1413 Z-score: 1827.7 bits: 347.1 E(85289): 4.6e-95
Smith-Waterman score: 1413; 67.7% identity (85.4% similar) in 294 aa overlap (49-340:189-482)

       20        30        40        50        60        70
pF1KE1 VFTVSTIFHVQQRLAKIQAMWELPVQIPVLASTSKALGPSQLRGMWTINAIGRLGNQMGE
                                     ::.:    :..: : ::.   ::.:::::.
XP_016 LGLSILCPDRRLVTPPVAIFCLPGTAMGPNASSSCPQHPASLSGTWTVYPNGRFGNQMGQ
      160       170       180       190       200       210

       80        90       100       110       120       130
pF1KE1 YATLYALAKMNGRPAFIPAQMHSTLAPIFRITLPVLHSATASRIPWQNYHLNDWMEEEYR
       :::: :::..::: :::   ::..:::.::::::::   . :: ::.. .:.::: :::
XP_016 YATLLALAQLNGRRAFILPAMHAALAPVFRITLPVLAPEVDSRTPWRELQLHDWMSEEYA
      220       230       240       250       260       270

      140       150       160       170       180         190
pF1KE1 HIPGEYVRFTGYPCSWTFYHHLRQEILQEFTLHDHVREEAQKFLRGLQVN--GSRPGTFV
        .   .....:.::::::.::::..: .:::::::.:::::. :  :...  :.:: :::
XP_016 DLRDPFLKLSGFPCSWTFFHHLREQIRREFTLHDHLREEAQSVLGQLRLGRTGDRPRTFV
      280       290       300       310       320       330

        200       210       220       230       240       250
pF1KE1 GVHVRRGDYVHVMPKVWKGVVADRRYLQQALDWFRARYSSLIFVVTSNGMAWCRENIDTS
       :::::::::..:::. :::::.:  ::.::.::::::. . .:::::::: ::.::::::
XP_016 GVHVRRGDYLQVMPQRWKGVVGDSAYLRQAMDWFRARHEAPVFVVTSNGMEWCKENIDTS
      340       350       360       370       380       390

        260       270       280       290       300       310
pF1KE1 HGDVVFAGDGIEGSPAKDFALLTQCNHTIMTIGTFGIWAAYLTGGDTIYLANYTLPDSPF
       .:::.::::: :..: ::::::::::::::::::::.:::::.::::.::::.::::: :
XP_016 QGDVTFAGDGQEATPWKDFALLTQCNHTIMTIGTFGFWAAYLAGGDTVYLANFTLPDSEF
      400       410       420       430       440       450

        320       330       340
pF1KE1 LKIFKPEAAFLPEWTGIAADLSPLLKH
       ::::::::::::::.:: ::::::
XP_016 LKIFKPEAAFLPEWVGINADLSPLWTLAKP
      460       470       480

343 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 03:17:39 2016 done: Tue Nov 8 03:17:40 2016
 Total Scan time: 5.250 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]