Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC000191A_C01 KCC000191A_c01
(1479 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_229072.1| thiH protein, putative [Thermotoga maritima] gi... 258 2e-67
ref|ZP_00059905.1| COG1060: Thiamine biosynthesis enzyme ThiH an... 256 7e-67
ref|NP_781953.1| thiH protein [Clostridium tetani E88] gi|282034... 251 3e-65
ref|NP_719454.1| thiH protein, putative [Shewanella oneidensis M... 244 2e-63
ref|ZP_00129808.1| COG1060: Thiamine biosynthesis enzyme ThiH an... 243 6e-63
>ref|NP_229072.1| thiH protein, putative [Thermotoga maritima] gi|7462818|pir||B72274
hypothetical protein TM1267 - Thermotoga maritima (strain
MSB8) gi|4981824|gb|AAD36342.1|AE001782_3 thiH protein,
putative [Thermotoga maritima]
Length = 473
Score = 258 bits (659), Expect = 2e-67
Identities = 134/290 (46%), Positives = 189/290 (64%)
Frame = +3
Query: 609 WINEAAIHKALETSKADAQDAGRVREILAKAKEKAFVTEHAPVNAESKSEFVQGLTLEEC 788
+I E I + LE +K D RVREI+ K+ +K L EE
Sbjct: 16 FIPEEKIFELLEKTKNP--DPARVREIIQKSLDK------------------NRLEPEET 55
Query: 789 ATLINVDSNNVELMNEIFDTALAIKERIYGNRVVLFAPLYIANHCMNTCTYCAFRSANKG 968
ATL+NV+ + EL+ EIF+ A +KERIYGNR+VLFAPLYI N C+N C YC FR +NK
Sbjct: 56 ATLLNVE--DPELLEEIFEAARTLKERIYGNRIVLFAPLYIGNDCINDCVYCGFRVSNKV 113
Query: 969 MERSILTDDDLREEVAALQRQGHRRILALTGEHPKYTFDNFLHAVNVIASVKTEPEGSIR 1148
+ER LT++ L+EEV AL QGH+R++ + GEHP Y+ + ++++ + K G IR
Sbjct: 114 VERRTLTEEQLKEEVKALVSQGHKRLIVVYGEHPNYSPEFIARTIDIVYNTKYG-NGEIR 172
Query: 1149 RINVEIPPLSVSDMRRLKNTDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDFRVLTQDRA 1328
R+NV P ++ + +K+ +GTF +FQETYHR+T+ +HP GPKS++++R+ DRA
Sbjct: 173 RVNVNAAPQTIEGYKIIKSV-GIGTFQIFQETYHRETYLKLHPRGPKSNYNWRLYGLDRA 231
Query: 1329 MRAGLDDVGIGALFGLYDYRYEVCAMLMHSEHLEREYNAGPHTISVPRMR 1478
M AG+DDVGIGALFGLYD+++EV +L H+ HLE + GPHTIS PR++
Sbjct: 232 MMAGIDDVGIGALFGLYDWKFEVMGLLYHTIHLEERFGVGPHTISFPRIK 281
>ref|ZP_00059905.1| COG1060: Thiamine biosynthesis enzyme ThiH and related
uncharacterized enzymes [Clostridium thermocellum ATCC
27405]
Length = 473
Score = 256 bits (654), Expect = 7e-67
Identities = 133/291 (45%), Positives = 184/291 (62%)
Frame = +3
Query: 606 EWINEAAIHKALETSKADAQDAGRVREILAKAKEKAFVTEHAPVNAESKSEFVQGLTLEE 785
++I E I LE K D +REILAKA+E +G++L E
Sbjct: 21 DFIKEDLIFSLLEKGKIT--DRNEIREILAKARE------------------CKGISLGE 60
Query: 786 CATLINVDSNNVELMNEIFDTALAIKERIYGNRVVLFAPLYIANHCMNTCTYCAFRSANK 965
A L+ ++ EL+ E++D A IK +IYG RVVLFAPLY +N C N C YC FR NK
Sbjct: 61 VAKLLYLEDE--ELLEELYDVAKYIKNKIYGKRVVLFAPLYTSNECTNNCLYCGFRHDNK 118
Query: 966 GMERSILTDDDLREEVAALQRQGHRRILALTGEHPKYTFDNFLHAVNVIASVKTEPEGSI 1145
+ R L+ +++ EE A++RQGH+R+L + GE P+ T N H + + ++ + I
Sbjct: 119 ELHRKTLSLEEIVEEAKAIERQGHKRLLLICGEDPRKT--NVKHFTDAMEAIYKSTD--I 174
Query: 1146 RRINVEIPPLSVSDMRRLKNTDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDFRVLTQDR 1325
RRINVE P++V D R LK +GT+V+FQETYHR+T+++MHP G K+++D+R+ DR
Sbjct: 175 RRINVEAAPMTVEDYRELKKA-GIGTYVIFQETYHRETYRIMHPVGKKANYDWRITAIDR 233
Query: 1326 AMRAGLDDVGIGALFGLYDYRYEVCAMLMHSEHLEREYNAGPHTISVPRMR 1478
A G+DDVG+GALFGLYDYR+EV +LMH H E +Y GPHTISVPR+R
Sbjct: 234 AFEGGIDDVGVGALFGLYDYRFEVLGLLMHCMHFEEKYGVGPHTISVPRLR 284
>ref|NP_781953.1| thiH protein [Clostridium tetani E88] gi|28203448|gb|AAO35890.1| thiH
protein [Clostridium tetani E88]
Length = 478
Score = 251 bits (640), Expect = 3e-65
Identities = 130/292 (44%), Positives = 188/292 (63%), Gaps = 1/292 (0%)
Frame = +3
Query: 606 EWINEAAIHKALETSKADAQDAGRVREILAKAKEKAFVTEHAPVNAESKSEFVQGLTLEE 785
E+I + I KAL+ + A++ VRE+L KA E +GLT EE
Sbjct: 15 EFIIHSDIEKALDKGREKAKNKDYVRELLNKALE------------------CKGLTYEE 56
Query: 786 CATLINVDSNNVELMNEIFDTALAIKERIYGNRVVLFAPLYIANHCMNTCTYCAFRSANK 965
A L+NV+ ++ + +I+ A IKE+IYG R+VLFAPLYI+++C+N C YC ++ +N
Sbjct: 57 GAVLLNVEDEHI--LEDIYKAAKIIKEKIYGKRIVLFAPLYISSYCVNNCKYCGYKCSNN 114
Query: 966 GMERSILTDDDLREEVAALQRQGHRRILALTGEHP-KYTFDNFLHAVNVIASVKTEPEGS 1142
+R+ LT D++ EEV L+ GH+R+ GE + D L ++ I S+K GS
Sbjct: 115 TFKRNKLTMDEIAEEVKILESLGHKRLALEVGEDDVNCSIDYVLKSIKKIYSLKFN-NGS 173
Query: 1143 IRRINVEIPPLSVSDMRRLKNTDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDFRVLTQD 1322
IRRINV I ++ + ++LK + +GT++LFQETYH++T++ MHP+GPKSD+++ D
Sbjct: 174 IRRINVNIAATTIENYKKLKEAE-IGTYILFQETYHKETYEKMHPTGPKSDYNYHTTAMD 232
Query: 1323 RAMRAGLDDVGIGALFGLYDYRYEVCAMLMHSEHLEREYNAGPHTISVPRMR 1478
RA AG+DDVGIG L+GLYDY+Y+ AMLMH EHLE+ GPHTISVPR+R
Sbjct: 233 RARMAGIDDVGIGVLYGLYDYKYDTVAMLMHGEHLEKATGVGPHTISVPRLR 284
>ref|NP_719454.1| thiH protein, putative [Shewanella oneidensis MR-1]
gi|24350249|gb|AAN56898.1|AE015824_9 thiH protein,
putative [Shewanella oneidensis MR-1]
Length = 479
Score = 244 bits (624), Expect = 2e-63
Identities = 129/297 (43%), Positives = 183/297 (61%), Gaps = 2/297 (0%)
Frame = +3
Query: 591 YRNPAEWINEAAIHKALETSKADAQDAGR--VREILAKAKEKAFVTEHAPVNAESKSEFV 764
Y +I++ AI + +E DA D R V IL KA++
Sbjct: 14 YNPNVNFIDDKAIWQTIE----DASDPSREQVLAILDKARQ------------------C 51
Query: 765 QGLTLEECATLINVDSNNVELMNEIFDTALAIKERIYGNRVVLFAPLYIANHCMNTCTYC 944
+GL++ E A L+ ++ M +F A IK IYGNR+V+FAPLY++NHC N+C+YC
Sbjct: 52 EGLSISETALLLQNQDKTLDEM--LFSVAREIKNTIYGNRIVMFAPLYVSNHCANSCSYC 109
Query: 945 AFRSANKGMERSILTDDDLREEVAALQRQGHRRILALTGEHPKYTFDNFLHAVNVIASVK 1124
F + N ++R L D++R+EVA L+ GH+RILA+ GEHP+ + ++ + SVK
Sbjct: 110 GFNADNHELKRKTLKQDEIRQEVAILEEMGHKRILAVYGEHPRNNVQAIVESIQTMYSVK 169
Query: 1125 TEPEGSIRRINVEIPPLSVSDMRRLKNTDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDF 1304
G IRRINV P+SV D ++LK T ++GT+ FQETYH+DT+ +H G K+DF +
Sbjct: 170 QGKGGEIRRINVNCAPMSVEDFKQLK-TAAIGTYQCFQETYHQDTYSQVHLKGKKTDFLY 228
Query: 1305 RVLTQDRAMRAGLDDVGIGALFGLYDYRYEVCAMLMHSEHLEREYNAGPHTISVPRM 1475
R+ RAM AG+DDVGIGALFGLYD+R+E+ AML H + LE++ GPHTIS PR+
Sbjct: 229 RLYAMHRAMEAGIDDVGIGALFGLYDHRFELLAMLTHVQQLEKDCGVGPHTISFPRI 285
>ref|ZP_00129808.1| COG1060: Thiamine biosynthesis enzyme ThiH and related
uncharacterized enzymes [Desulfovibrio desulfuricans G20]
Length = 469
Score = 243 bits (620), Expect = 6e-63
Identities = 122/270 (45%), Positives = 167/270 (61%)
Frame = +3
Query: 666 DAGRVREILAKAKEKAFVTEHAPVNAESKSEFVQGLTLEECATLINVDSNNVELMNEIFD 845
DA RVREILAKA+E +GL EE ATL+ +D N EL E+F
Sbjct: 28 DAVRVREILAKARE------------------AKGLDAEETATLLQLD--NEELDAELFA 67
Query: 846 TALAIKERIYGNRVVLFAPLYIANHCMNTCTYCAFRSANKGMERSILTDDDLREEVAALQ 1025
TA +K+ IYGNR+VLFAPLYI N C N C YC F + N ++R L++D++R EV L+
Sbjct: 68 TAKKVKQTIYGNRLVLFAPLYITNECYNRCAYCGFNATNSDLKRRTLSEDEIRAEVEVLE 127
Query: 1026 RQGHRRILALTGEHPKYTFDNFLHAVNVIASVKTEPEGSIRRINVEIPPLSVSDMRRLKN 1205
R GH+R+L + GEHP+ D + V+ +E G IRR+N+ P +V R+L +
Sbjct: 128 RLGHKRLLLVYGEHPRLDADWMARTIQVVYDTVSEKSGEIRRVNINCAPQTVDGFRKLHD 187
Query: 1206 TDSVGTFVLFQETYHRDTFKVMHPSGPKSDFDFRVLTQDRAMRAGLDDVGIGALFGLYDY 1385
+GT+ FQETYH+ T+ H GPK D+ +R+ RAM AG+DDVG+G L GLYDY
Sbjct: 188 V-GIGTYQCFQETYHKATYDKAHLGGPKKDYLWRLYAMHRAMEAGIDDVGMGPLLGLYDY 246
Query: 1386 RYEVCAMLMHSEHLEREYNAGPHTISVPRM 1475
R+E+ A++ H+ LE+ + GPHTIS PR+
Sbjct: 247 RFEILALMQHAADLEKHFGVGPHTISFPRL 276