FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6620, 327 aa
  1>>>pF1KE6620 327 - 327 aa - 327 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 4.9615+/-0.00035; mu= 17.4521+/- 0.022
 mean_var=56.0297+/-11.352, 0's: 0 Z-trim(113.7): 8 B-trim: 33 in 1/53
 Lambda= 0.171343
 statistics sampled from 23168 (23175) to 23168 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.645), E-opt: 0.2 (0.272), width: 16
 Scan time: 5.760

The best scores are:                                       opt  bits E(85289)
NP_612422 (OMIM: 613597,613616) 4-hydroxy-2-oxoglu ( 327) 2188 548.9 5.4e-156
NP_001128142 (OMIM: 613597,613616) 4-hydroxy-2-oxo ( 164)  671 173.7 2.3e-43
NP_110396 (OMIM: 611412) N-acetylneuraminate lyase ( 320)  311  84.9 2.5e-16
NP_001186985 (OMIM: 611412) N-acetylneuraminate ly ( 284)  270  74.7 2.6e-13
NP_001186979 (OMIM: 611412) N-acetylneuraminate ly ( 301)  257  71.5 2.5e-12
NP_001186981 (OMIM: 611412) N-acetylneuraminate ly ( 230)  247  69.0 1.1e-11
NP_001186980 (OMIM: 611412) N-acetylneuraminate ly ( 240)  223  63.1 7e-10

>>NP_612422 (OMIM: 613597,613616) 4-hydroxy-2-oxoglutara (327 aa)
 initn: 2188 init1: 2188 opt: 2188 Z-score: 2921.5 bits: 548.9 E(85289): 5.4e-156
Smith-Waterman score: 2188; 100.0% identity (100.0% similar) in 327 aa overlap (1-327:1-327)

               10        20        30        40        50        60
pF1KE6 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 LHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 LHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 MTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 MTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPV
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 DAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 DAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVC
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE6 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_612 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG
              250       260       270       280       290       300

              310       320
pF1KE6 PCRAPLQELSPAEEEALRMDFTSNGWL
       :::::::::::::::::::::::::::
NP_612 PCRAPLQELSPAEEEALRMDFTSNGWL
              310       320

>>NP_001128142 (OMIM: 613597,613616) 4-hydroxy-2-oxoglut (164 aa)
 initn: 671 init1: 671 opt: 671 Z-score: 899.5 bits: 173.7 E(85289): 2.3e-43
Smith-Waterman score: 806; 50.2% identity (50.2% similar) in 327 aa overlap (1-327:1-164)

               10        20        30        40        50        60
pF1KE6 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MLGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEEN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 LHKLGTFPFRGFVVQGSNGEFPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVE
       :::::::::::
NP_001 LHKLGTFPFRG-------------------------------------------------
               70

              130       140       150       160       170       180
pF1KE6 MTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSPIPVVLYSVPANTGLDLPV

NP_001 ------------------------------------------------------------

              190       200       210       220       230       240
pF1KE6 DAVVTLSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVC
                                                             ::::::
NP_001 ------------------------------------------------------AVGGVC

              250       260       270       280       290       300
pF1KE6 ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001
ALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFGYYGG 80 90 100 110 120 130 310 320 pF1KE6 PCRAPLQELSPAEEEALRMDFTSNGWL ::::::::::::::::::::::::::: NP_001 PCRAPLQELSPAEEEALRMDFTSNGWL 140 150 160 >>NP_110396 (OMIM: 611412) N-acetylneuraminate lyase iso (320 aa) initn: 117 init1: 67 opt: 311 Z-score: 414.1 bits: 84.9 E(85289): 2.5e-16 Smith-Waterman score: 320; 27.5% identity (55.6% similar) in 306 aa overlap (32-323:5-307) 10 20 30 40 50 60 pF1KE6 LGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENL : . :. . ::.: ..:.... . . . NP_110 MAFPKKKLQGLVAATITPMTENGEINFSVIGQYV 10 20 30 70 80 90 100 110 pF1KE6 HKLGTFP-FRGFVVQGSNGEFPFLTSSERLEVVSR-VRQAMPKNRLLLAGSGCESTQATV : ... :.:..:: :. ::: .:. . : .. : .. : : . . NP_110 DYLVKEQGVKNIFVNGTTGEGLSLSVSERRQVAEEWVTKGKDKLDQVIIHVGALSLKESQ 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE6 EMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDL :.. :..:::. :..: . . .. ::. .:: .: .: : .:: ::. . NP_110 ELAQHAAEIGADGIAVIAPFFLKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKI 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 PVDAVVT--LSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAV .. .. :.. :.. :.: : :. .: : ..:.:.: : : :... ..::. NP_110 RAEELLDGILDKIPTFQGLKFSDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGAT 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE6 GGVCALANVLGAQVCQLERLCCTGQWEDAQKLQ---HRLIEPNAAVTRRFGIPGLKKIMD :.: . : :: .. :. . .. : . : .:.: : .: ::. : :: NP_110 GAVGSTYNYLGKKTNQMLEAFEQKDFSLALNYQFCIQRFI--NFVVKLGFGVSQTKAIMT 220 230 240 250 260 270 300 310 320 pF1KE6 WF-GYYGGPCRAPLQ----ELSPAEEEALR-MDFTSNGWL : :: : ::: :.. . : :. .:: : NP_110 LVSGIPMGPPRLPLQKASREFTDSAEAKLKSLDFLSFTDLKDGNLEAGS 280 290 300 310 320 >>NP_001186985 (OMIM: 611412) N-acetylneuraminate lyase (284 aa) initn: 67 init1: 67 opt: 270 Z-score: 360.1 bits: 74.7 E(85289): 2.6e-13 Smith-Waterman score: 270; 26.4% identity (58.6% similar) in 227 aa overlap (32-253:5-230) 10 20 30 40 50 60 pF1KE6 LGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENL : . :. 
. ::.: ..:.... . . . NP_001 MAFPKKKLQGLVAATITPMTENGEINFSVIGQYV 10 20 30 70 80 90 100 110 pF1KE6 HKLGTFP-FRGFVVQGSNGEFPFLTSSERLEVVSR-VRQAMPKNRLLLAGSGCESTQATV : ... :.:..:: :. ::: .:. . : .. : .. : : . . NP_001 DYLVKEQGVKNIFVNGTTGEGLSLSVSERRQVAEEWVTKGKDKLDQVIIHVGALSLKESQ 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE6 EMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDL :.. :..:::. :..: . . .. ::. .:: .: .: : .:: ::. . NP_001 ELAQHAAEIGADGIAVIAPFFLKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKI 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 PVDAVVT--LSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAV .. .. :.. :.. :.: : :. .: : ..:.:.: : : :... ..::. NP_001 RAEELLDGILDKIPTFQGLKFSDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGAT 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE6 GGVCALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFG :.: . : :: .. :. NP_001 GAVGSTYNYLGKKTNQMLEAFEQKDFSLALNYQFCIQRFINFVVKLENSKLKVSKNQRTL 220 230 240 250 260 270 >>NP_001186979 (OMIM: 611412) N-acetylneuraminate lyase (301 aa) initn: 117 init1: 67 opt: 257 Z-score: 342.4 bits: 71.5 E(85289): 2.5e-12 Smith-Waterman score: 266; 29.3% identity (55.6% similar) in 225 aa overlap (111-323:67-288) 90 100 110 120 130 140 pF1KE6 FPFLTSSERLEVVSRVRQAMPKNRLLLAGSGCESTQATVEMTVSMAQVGADAAMVVTPCY : : . . :.. :..:::. :..: . NP_001 GNCFLPVYKASPLTVTRLWAERLDQVIIHVGALSLKESQELAQHAAEIGADGIAVIAPFF 40 50 60 70 80 90 150 160 170 180 190 pF1KE6 YRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDLPVDAVVT--LSQHPNIVGMKD . .. ::. .:: .: .: : .:: ::. . .. .. :.. :.. :.: NP_001 LKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKIRAEELLDGILDKIPTFQGLKF 100 110 120 130 140 150 200 210 220 230 240 250 pF1KE6 SGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAVGGVCALANVLGAQVCQLERLC : :. .: : ..:.:.: : : :... ..::.:.: . : :: .. :. . NP_001 SDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGATGAVGSTYNYLGKKTNQMLEAF 160 170 180 190 200 210 260 270 280 290 300 pF1KE6 CTGQWEDAQKLQ---HRLIEPNAAVTRRFGIPGLKKIMDWF-GYYGGPCRAPLQ----EL .. : . : .:.: : .: ::. : :: : :: : ::: :. 
NP_001 EQKDFSLALNYQFCIQRFI--NFVVKLGFGVSQTKAIMTLVSGIPMGPPRLPLQKASREF 220 230 240 250 260 270 310 320 pF1KE6 SPAEEEALR-MDFTSNGWL . . : :. .:: : NP_001 TDSAEAKLKSLDFLSFTDLKDGNLEAGS 280 290 300 >>NP_001186981 (OMIM: 611412) N-acetylneuraminate lyase (230 aa) initn: 67 init1: 67 opt: 247 Z-score: 330.8 bits: 69.0 E(85289): 1.1e-11 Smith-Waterman score: 247; 26.3% identity (58.7% similar) in 213 aa overlap (32-239:5-216) 10 20 30 40 50 60 pF1KE6 LGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENL : . :. . ::.: ..:.... . . . NP_001 MAFPKKKLQGLVAATITPMTENGEINFSVIGQYV 10 20 30 70 80 90 100 110 pF1KE6 HKLGTFP-FRGFVVQGSNGEFPFLTSSERLEVVSR-VRQAMPKNRLLLAGSGCESTQATV : ... :.:..:: :. ::: .:. . : .. : .. : : . . NP_001 DYLVKEQGVKNIFVNGTTGEGLSLSVSERRQVAEEWVTKGKDKLDQVIIHVGALSLKESQ 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE6 EMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDL :.. :..:::. :..: . . .. ::. .:: .: .: : .:: ::. . NP_001 ELAQHAAEIGADGIAVIAPFFLKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKI 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 PVDAVVT--LSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAV .. .. :.. :.. :.: : :. .: : ..:.:.: : : :... ..::. NP_001 RAEELLDGILDKIPTFQGLKFSDTDLLDFGQCVDQNRQQQFAFLFGVDEQLLSALVMGAT 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE6 GGVCALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFG :.: NP_001 GAVGSFVSRDLSTLLSN 220 230 >>NP_001186980 (OMIM: 611412) N-acetylneuraminate lyase (240 aa) initn: 67 init1: 67 opt: 223 Z-score: 298.4 bits: 63.1 E(85289): 7e-10 Smith-Waterman score: 223; 25.4% identity (56.6% similar) in 205 aa overlap (32-231:5-208) 10 20 30 40 50 60 pF1KE6 LGPQVWSSVRQGLSRSLSRNVGVWASGEGKKVDIAGIYPPVTTPFTATAEVDYGKLEENL : . :. . ::.: ..:.... . . . NP_001 MAFPKKKLQGLVAATITPMTENGEINFSVIGQYV 10 20 30 70 80 90 100 110 pF1KE6 HKLGTFP-FRGFVVQGSNGEFPFLTSSERLEVVSR-VRQAMPKNRLLLAGSGCESTQATV : ... :.:..:: :. ::: .:. . : .. : .. : : . . 
NP_001 DYLVKEQGVKNIFVNGTTGEGLSLSVSERRQVAEEWVTKGKDKLDQVIIHVGALSLKESQ 40 50 60 70 80 90 120 130 140 150 160 170 pF1KE6 EMTVSMAQVGADAAMVVTPCYYRGRMSSAALIHHYTKVADLSP-IPVVLYSVPANTGLDL :.. :..:::. :..: . . .. ::. .:: .: .: : .:: ::. . NP_001 ELAQHAAEIGADGIAVIAPFFLKP-WTKDILINFLKEVAAAAPALPFYYYHIPALTGVKI 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE6 PVDAVVT--LSQHPNIVGMKDSGGDVTRIGLIVHKTRKQDFQVLAGSAGFLMASYALGAV .. .. :.. :.. :.: : :. .: : ..:.:.: : : : . . NP_001 RAEELLDGILDKIPTFQGLKFSDTDLLDFGQCVDQNRQQQFAFLFGVDEFCIQRFINFVV 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE6 GGVCALANVLGAQVCQLERLCCTGQWEDAQKLQHRLIEPNAAVTRRFGIPGLKKIMDWFG NP_001 KLENSKLKVSKNQRTLPLGTTNFPFLH 220 230 240 327 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:50:58 2016 done: Tue Nov 8 14:50:58 2016 Total Scan time: 5.760 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]