
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC119408.7 + phase: 0 /pseudo
(218 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAA97176.1| unnamed protein product [Arabidopsis thaliana] 337 1e-91
gb|AAM45111.1| unknown protein [Arabidopsis thaliana] gi|1942400... 336 2e-91
gb|AAO64042.1| unknown protein [Arabidopsis thaliana] gi|2839369... 317 1e-85
gb|AAM63696.1| unknown [Arabidopsis thaliana] 314 1e-84
gb|AAK71553.1| unknown protein [Oryza sativa (japonica cultivar-... 280 2e-74
emb|CAB10523.1| hypothetical protein [Arabidopsis thaliana] gi|7... 246 2e-64
ref|ZP_00245212.1| COG2013: Uncharacterized conserved protein [R... 128 1e-28
gb|AAM07986.1| conserved hypothetical protein [Methanosarcina ac... 128 1e-28
ref|YP_038905.1| hypothetical protein BT9727_4593 [Bacillus thur... 123 3e-27
ref|YP_086190.1| hypothetical protein BCZK4615 [Bacillus cereus ... 123 3e-27
ref|NP_981313.1| hypothetical protein BCE5020 [Bacillus cereus A... 122 5e-27
gb|AAP11760.1| hypothetical protein [Bacillus cereus ATCC 14579]... 122 7e-27
gb|AAK79504.1| Uncharacterized conserved protein [Clostridium ac... 122 9e-27
ref|YP_148732.1| hypothetical protein GK2879 [Geobacillus kausto... 121 1e-26
gb|AAN86542.1| unknown [Eubacterium acidaminophilum] 120 3e-26
ref|ZP_00418039.1| Protein of unknown function DUF124 [Azotobact... 114 1e-24
ref|NP_633330.1| HTH DNA-binding protein [Methanosarcina mazei G... 114 3e-24
gb|EAM71983.1| Protein of unknown function DUF124 [Desulfuromona... 113 4e-24
ref|ZP_00129004.2| COG2013: Uncharacterized conserved protein [D... 112 1e-23
gb|AAT50532.1| PA3696 [synthetic construct] 109 5e-23
>dbj|BAA97176.1| unnamed protein product [Arabidopsis thaliana]
Length = 268
Score = 337 bits (864), Expect = 1e-91
Identities = 163/216 (75%), Positives = 190/216 (87%), Gaps = 1/216 (0%)
Query: 1 MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60
MAAPFFSTPFQPYVYQSQQD I PFQILGGESQVVQIMLK EK+IAKP SMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQQDTITPFQILGGESQVVQIMLKSEEKVIAKPASMCYMSGSIE 60
Query: 61 MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120
MEN Y PE EVG+ QW+ GK+V+++V+RN+G +DGFVGIAAPY ARILPIDLA F GEIL
Sbjct: 61 MENTYTPEQEVGVLQWILGKSVSSVVLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTVDQRGRNVV-AGAEVFLRQKLSGQGLAFILGGGSVVQKILEV 179
CQPDAFLCSV+DVKV N+VDQR RN+V AGAE FLRQ+LSGQGLAFIL GGSVVQK+LEV
Sbjct: 121 CQPDAFLCSVHDVKVVNSVDQRARNIVAAGAEGFLRQRLSGQGLAFILAGGSVVQKVLEV 180
Query: 180 GEVLAVDVSCIVAVTSTVDIQIKYNGPARRTMFGVS 215
GEV ++DVSCI A+T ++D++IK N P RR +FG++
Sbjct: 181 GEVFSIDVSCIAALTPSIDVRIKNNAPFRRALFGLT 216
>gb|AAM45111.1| unknown protein [Arabidopsis thaliana] gi|19424009|gb|AAL87286.1|
unknown protein [Arabidopsis thaliana]
gi|22327636|ref|NP_199553.2| expressed protein
[Arabidopsis thaliana]
Length = 282
Score = 336 bits (862), Expect = 2e-91
Identities = 163/214 (76%), Positives = 188/214 (87%), Gaps = 1/214 (0%)
Query: 1 MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60
MAAPFFSTPFQPYVYQSQQD I PFQILGGESQVVQIMLK EK+IAKP SMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQQDTITPFQILGGESQVVQIMLKSEEKVIAKPASMCYMSGSIE 60
Query: 61 MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120
MEN Y PE EVG+ QW+ GK+V+++V+RN+G +DGFVGIAAPY ARILPIDLA F GEIL
Sbjct: 61 MENTYTPEQEVGVLQWILGKSVSSVVLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTVDQRGRNVV-AGAEVFLRQKLSGQGLAFILGGGSVVQKILEV 179
CQPDAFLCSV+DVKV N+VDQR RN+V AGAE FLRQ+LSGQGLAFIL GGSVVQK+LEV
Sbjct: 121 CQPDAFLCSVHDVKVVNSVDQRARNIVAAGAEGFLRQRLSGQGLAFILAGGSVVQKVLEV 180
Query: 180 GEVLAVDVSCIVAVTSTVDIQIKYNGPARRTMFG 213
GEV ++DVSCI A+T ++D++IK N P RR +FG
Sbjct: 181 GEVFSIDVSCIAALTPSIDVRIKNNAPFRRALFG 214
>gb|AAO64042.1| unknown protein [Arabidopsis thaliana] gi|28393695|gb|AAO42260.1|
unknown protein [Arabidopsis thaliana]
gi|18414878|ref|NP_567527.1| expressed protein
[Arabidopsis thaliana]
Length = 285
Score = 317 bits (813), Expect = 1e-85
Identities = 159/217 (73%), Positives = 185/217 (84%), Gaps = 4/217 (1%)
Query: 1 MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60
MAAPFFSTPFQPYVYQSQ+D I PFQILGGE+QVVQIMLKP EK+IAKPGSMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60
Query: 61 MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120
M+N Y PE EVG+ QW+ GK+V++IV+RN+G +DGFVGIAAP ARILP+DLA F G+IL
Sbjct: 61 MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120
Query: 121 CQPDAFLCSVNDVKVSNTVDQ--RGRNV-VAGAEVFLRQKLSGQGLAFILGGGSVVQKIL 177
CQPDAFLCSV+DVKV NTV Q R RN+ AGAE LRQ+LSGQGLAFI+ GGSVVQK L
Sbjct: 121 CQPDAFLCSVHDVKVVNTVYQRHRARNIAAAGAEGVLRQRLSGQGLAFIIAGGSVVQKNL 180
Query: 178 EVGEVLAVDVSCIVAVTSTVDIQIKYN-GPARRTMFG 213
EVGEVL +DVSCI A+T +++ QIKYN P RR +FG
Sbjct: 181 EVGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFG 217
>gb|AAM63696.1| unknown [Arabidopsis thaliana]
Length = 285
Score = 314 bits (804), Expect = 1e-84
Identities = 157/217 (72%), Positives = 184/217 (84%), Gaps = 4/217 (1%)
Query: 1 MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60
MAAPFFSTPFQPYVYQSQ+D I PFQILGGE+QVVQIMLKP EK+IAKPGSMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60
Query: 61 MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120
M+N Y PE EVG+ QW+ GK+V++IV+RN+G +DGFVGIAAP ARILP+DLA F G+IL
Sbjct: 61 MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120
Query: 121 CQPDAFLCSVNDVKVSNTVDQ--RGRNV-VAGAEVFLRQKLSGQGLAFILGGGSVVQKIL 177
CQPDAFLCSV+DVKV N+V Q R RN+ AGAE LRQ+LSGQGLAFI+ GGSVVQK L
Sbjct: 121 CQPDAFLCSVHDVKVVNSVYQRHRARNIAAAGAEGVLRQRLSGQGLAFIIAGGSVVQKNL 180
Query: 178 EVGEVLAVDVSCIVAVTSTVDIQIKYN-GPARRTMFG 213
EVGE L +DVSCI A+T +++ QIKYN P RR +FG
Sbjct: 181 EVGEFLTIDVSCIAALTPSINFQIKYNAAPVRRAVFG 217
>gb|AAK71553.1| unknown protein [Oryza sativa (japonica cultivar-group)]
gi|50918671|ref|XP_469732.1| unknown protein [Oryza
sativa (japonica cultivar-group)]
Length = 282
Score = 280 bits (716), Expect = 2e-74
Identities = 128/212 (60%), Positives = 171/212 (80%)
Query: 1 MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60
MAAPFFSTPFQPYVYQSQ+ ++ FQI GG+ QV+Q+M+K EK+ KPG+MC+MSG+++
Sbjct: 1 MAAPFFSTPFQPYVYQSQEGSVTAFQISGGDVQVLQVMVKSQEKLTVKPGTMCYMSGNIQ 60
Query: 61 MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120
+N YLPEN+ G+WQW+FGK++++ V N G DG+VGI+AP+ RILP+DLA F GE+L
Sbjct: 61 TDNNYLPENDGGVWQWIFGKSISSSVFFNPGSDDGYVGISAPFPGRILPMDLANFGGELL 120
Query: 121 CQPDAFLCSVNDVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVG 180
CQ DAFLCSVNDV V++TV+QR RN+ GAEV L+QKL GQG+AF++GGGSV+QKIL
Sbjct: 121 CQADAFLCSVNDVSVTSTVEQRPRNIEIGAEVILKQKLRGQGMAFLVGGGSVMQKILAPR 180
Query: 181 EVLAVDVSCIVAVTSTVDIQIKYNGPARRTMF 212
EV+ VD +CIVA+T+T++ Q+K RR +F
Sbjct: 181 EVITVDAACIVAMTTTINFQLKTPNQPRRVVF 212
>emb|CAB10523.1| hypothetical protein [Arabidopsis thaliana]
gi|7268494|emb|CAB78745.1| hypothetical protein
[Arabidopsis thaliana] gi|7485161|pir||F71443
hypothetical protein - Arabidopsis thaliana
Length = 270
Score = 246 bits (629), Expect = 2e-64
Identities = 138/236 (58%), Positives = 164/236 (69%), Gaps = 35/236 (14%)
Query: 1 MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60
MAAPFFSTPFQPYVYQSQ+D I PFQILGGE+QVVQIMLKP EK+IAKPGSMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60
Query: 61 MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120
M+N Y PE EVG+ QW+ GK+V++IV+RN+G +DGFVGIAAP ARILP
Sbjct: 61 MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILP----------- 109
Query: 121 CQPDAFLCSVNDVKVSNTVDQ--RGRNV-VAGAE------------------VFLRQKLS 159
PDAFLCSV+DVKV NTV Q R RN+ AGAE + R +L
Sbjct: 110 --PDAFLCSVHDVKVVNTVYQRHRARNIAAAGAERVVVVGGSETKAFWSGSCFYYRWRLW 167
Query: 160 GQGLAFILGGGSVVQKILEVGEVLAVDVSCIVAVTSTVDIQIKYN-GPARRTMFGV 214
L + VVQK LEVGEVL +DVSCI A+T +++ QIKYN P RR +FG+
Sbjct: 168 PFKLILLHLLHKVVQKNLEVGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGL 223
>ref|ZP_00245212.1| COG2013: Uncharacterized conserved protein [Rubrivivax gelatinosus
PM1]
Length = 268
Score = 128 bits (322), Expect = 1e-28
Identities = 72/202 (35%), Positives = 109/202 (53%), Gaps = 10/202 (4%)
Query: 22 IIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAY--LPENEVGIWQWLFG 79
+I ++I G E Q V++ L P E I + GSM FM + M+ + + G + L G
Sbjct: 3 VIDYEIKGAEMQFVEVELDPGEAAIGEAGSMMFMDAGIGMDTVFGDASAQQGGFFGKLLG 62
Query: 80 --------KTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSVN 131
+++ V N+ S V AAPY +ILP+DL T G ++CQ DAFLC+
Sbjct: 63 AGKRLVTGESLFTTVYTNNVGSKQRVAFAAPYPGKILPMDLRTLGGTLICQKDAFLCAAR 122
Query: 132 DVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCIV 191
V + ++ G E F+ QKL G GLAF+ GG+V+++ L+ G+ L VD C+V
Sbjct: 123 GVSLGIAFQRKLSVGFFGGEGFIMQKLDGDGLAFVHAGGTVLKRELQPGQTLLVDTGCVV 182
Query: 192 AVTSTVDIQIKYNGPARRTMFG 213
A T +VD +I+Y G + +FG
Sbjct: 183 AYTQSVDFEIQYVGKVKTALFG 204
>gb|AAM07986.1| conserved hypothetical protein [Methanosarcina acetivorans str.
C2A] gi|20093431|ref|NP_619506.1| hypothetical protein
MA4652 [Methanosarcina acetivorans C2A]
Length = 263
Score = 128 bits (321), Expect = 1e-28
Identities = 73/200 (36%), Positives = 113/200 (56%), Gaps = 11/200 (5%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPEN-------EVGIWQ 75
I ++I+G + Q+V+I L P E + A+ G+M +M ++M+ + E + G+ +
Sbjct: 5 IDYEIIGNDMQIVEIELDPGEAVQAEAGAMAYMGPGIQMQTSMGSEGGGLLGGLKKGLKR 64
Query: 76 WLFGKT--VTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSVNDV 133
L G++ +TN + + SG G V AAPY +I+P+DL+ F G ILCQ DAFLC+ V
Sbjct: 65 ALTGESFFITNFIHKGSGK--GHVAFAAPYPGKIIPLDLSKFGGSILCQKDAFLCAARGV 122
Query: 134 KVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCIVAV 193
V +R + G E F+ Q+L G G AF+ GG+VV+K L GE VD C+ A
Sbjct: 123 DVEVAFTRRIGTGLFGGEGFILQRLRGNGFAFVHIGGTVVRKDLAPGETYHVDTGCVAAF 182
Query: 194 TSTVDIQIKYNGPARRTMFG 213
T TV+ I ++ + +FG
Sbjct: 183 TETVNYDITWSKDFKNALFG 202
>ref|YP_038905.1| hypothetical protein BT9727_4593 [Bacillus thuringiensis serovar
konkukian str. 97-27] gi|49330902|gb|AAT61548.1|
conserved hypothetical protein [Bacillus thuringiensis
serovar konkukian str. 97-27]
Length = 260
Score = 123 bits (309), Expect = 3e-27
Identities = 69/203 (33%), Positives = 109/203 (52%), Gaps = 14/203 (6%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEVGIWQWLFGK-- 80
I +++ G + Q V+I L P E +IA+ G+M M +EME + + G LFGK
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63
Query: 81 ----------TVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSV 130
++ V N+G V AAPY +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCI 190
V + ++ G E F+ QKL G GLAF+ GG+V ++ L+ GE L +D C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183
Query: 191 VAVTSTVDIQIKYNGPARRTMFG 213
VA+T V+ +++ G + +FG
Sbjct: 184 VAMTKDVNYDVEFVGKVKTALFG 206
>ref|YP_086190.1| hypothetical protein BCZK4615 [Bacillus cereus E33L]
gi|51974108|gb|AAU15658.1| conserved hypothetical
protein [Bacillus cereus E33L]
Length = 263
Score = 123 bits (309), Expect = 3e-27
Identities = 69/203 (33%), Positives = 109/203 (52%), Gaps = 14/203 (6%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEVGIWQWLFGK-- 80
I +++ G + Q V+I L P E +IA+ G+M M +EME + + G LFGK
Sbjct: 9 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 66
Query: 81 ----------TVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSV 130
++ V N+G V AAPY +I+P+DL + G+++CQ DAFLC+
Sbjct: 67 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 126
Query: 131 NDVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCI 190
V + ++ G E F+ QKL G GLAF+ GG+V ++ L+ GE L +D C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 186
Query: 191 VAVTSTVDIQIKYNGPARRTMFG 213
VA+T V+ +++ G + +FG
Sbjct: 187 VAMTKDVNYDVEFVGKVKTALFG 209
>ref|NP_981313.1| hypothetical protein BCE5020 [Bacillus cereus ATCC 10987]
gi|42739997|gb|AAS43921.1| conserved hypothetical
protein [Bacillus cereus ATCC 10987]
Length = 260
Score = 122 bits (307), Expect = 5e-27
Identities = 68/203 (33%), Positives = 109/203 (53%), Gaps = 14/203 (6%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEVGIWQWLFGK-- 80
I +++ G + Q V+I L P E +IA+ G+M M +EME + + G LFGK
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63
Query: 81 ----------TVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSV 130
++ V N+G V AAPY +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCI 190
V + ++ G E F+ QKL G GLAF+ GG+V ++ L+ GE L +D C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKHGEKLRIDTGCL 183
Query: 191 VAVTSTVDIQIKYNGPARRTMFG 213
VA+T ++ +++ G + +FG
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFG 206
>gb|AAP11760.1| hypothetical protein [Bacillus cereus ATCC 14579]
gi|30022928|ref|NP_834559.1| hypothetical protein BC4860
[Bacillus cereus ATCC 14579]
Length = 260
Score = 122 bits (306), Expect = 7e-27
Identities = 68/203 (33%), Positives = 109/203 (53%), Gaps = 14/203 (6%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEVGIWQWLFGK-- 80
I +++ G + Q V+I L P E +IA+ G+M M +EME + + G LFGK
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63
Query: 81 ----------TVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSV 130
++ V N+G V AAPY +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCI 190
V + ++ G E F+ QKL G GLAF+ GG+V ++ L+ GE L +D C+
Sbjct: 124 KGVSLGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183
Query: 191 VAVTSTVDIQIKYNGPARRTMFG 213
VA+T ++ +++ G + +FG
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFG 206
>gb|AAK79504.1| Uncharacterized conserved protein [Clostridium acetobutylicum ATCC
824] gi|15894815|ref|NP_348164.1| hypothetical protein
CAC1537 [Clostridium acetobutylicum ATCC 824]
gi|25499006|pir||E97089 uncharacterized conserved
protein CAC1537 [imported] - Clostridium acetobutylicum
Length = 260
Score = 122 bits (305), Expect = 9e-27
Identities = 67/203 (33%), Positives = 110/203 (54%), Gaps = 12/203 (5%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEVGIWQWLF---- 78
I ++I+G E Q+V++ L P+E ++A+ G+M +M S+EME + ++ G L
Sbjct: 5 IDYKIIGSEMQIVEVELDPYESVVAEAGAMMYMDSSIEMETIFGDGSDKGSSGGLVSKLM 64
Query: 79 --------GKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSV 130
G+++ + N G V AAPY +I+P+DL++FN ++CQ D FLC+
Sbjct: 65 GAGKRLVTGESLFMTIFTNRGVGKQKVAFAAPYPGKIIPMDLSSFNNYMICQKDCFLCAA 124
Query: 131 NDVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCI 190
V + ++ + G E F+ QKL G GL F+ GG++VQ+ L EV+ VD C+
Sbjct: 125 KGVSIGVEFTRKIGVGLFGGEGFMLQKLEGDGLTFVHSGGTIVQRELLPKEVIKVDTGCL 184
Query: 191 VAVTSTVDIQIKYNGPARRTMFG 213
VA T V+ I+ + +FG
Sbjct: 185 VAFTRDVNYDIEMVKGIKSAIFG 207
>ref|YP_148732.1| hypothetical protein GK2879 [Geobacillus kaustophilus HTA426]
gi|56381256|dbj|BAD77164.1| hypothetical conserved
protein [Geobacillus kaustophilus HTA426]
Length = 264
Score = 121 bits (304), Expect = 1e-26
Identities = 66/201 (32%), Positives = 106/201 (51%), Gaps = 10/201 (4%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEVGIW-------- 74
I +++ G + Q V+I L P+E ++A+ G M M + ME + + G
Sbjct: 6 IDYRLYGDDMQFVEIELDPYESVVAEAGGMMMMDDGIVMETVFGDGSSSGKGLLGRLVGA 65
Query: 75 --QWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSVND 132
+ L G+++ V N G V AAPY +I+P+DL+ G+++CQ D+FLC+
Sbjct: 66 GKRLLTGESLFMTVFTNQGSGKRRVAFAAPYPGKIIPVDLSELGGKLICQKDSFLCAAKG 125
Query: 133 VKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCIVA 192
V V ++ G E F+ QKL G GLAF+ GG++ ++ L+ GE L +D C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIHRRDLQPGETLRIDTGCLVA 185
Query: 193 VTSTVDIQIKYNGPARRTMFG 213
+T V+ I+Y G + FG
Sbjct: 186 MTKDVNYDIEYVGNIKTAFFG 206
>gb|AAN86542.1| unknown [Eubacterium acidaminophilum]
Length = 268
Score = 120 bits (301), Expect = 3e-26
Identities = 68/203 (33%), Positives = 105/203 (51%), Gaps = 12/203 (5%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEVGIWQWLFGK-- 80
I + + G + Q+V+I L P E +IA+ G+M +M ++ME + G L GK
Sbjct: 7 IDYTLHGDDLQLVEIELDPGESVIAEAGAMLYMENGIQMEAVLGDASGKGEGTGLMGKLL 66
Query: 81 ----------TVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSV 130
++ + N G V AAPY +I+P DL + G I+CQ DAFLC+
Sbjct: 67 GAGKRVIMGESLFMTLFTNKGSQKQKVAFAAPYPGKIVPFDLNAYGGRIICQKDAFLCAA 126
Query: 131 NDVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCI 190
+ V ++ + G E F+ ++L G G AF+ GG++V+K L E+L VD C+
Sbjct: 127 KGISVQMEFQKKIGVGLFGGEGFIMERLEGDGFAFLHAGGAIVEKELSQAELLKVDTGCL 186
Query: 191 VAVTSTVDIQIKYNGPARRTMFG 213
VA TS VD I++ G + +FG
Sbjct: 187 VAFTSGVDYDIQFMGDIKSAIFG 209
>ref|ZP_00418039.1| Protein of unknown function DUF124 [Azotobacter vinelandii AvOP]
gi|67086205|gb|EAM05675.1| Protein of unknown function
DUF124 [Azotobacter vinelandii AvOP]
Length = 248
Score = 114 bits (286), Expect = 1e-24
Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 9/199 (4%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEVGI----W---- 74
+ ++ILG +Q V+I+L P E +IA+ G+M +M+ V E +E G+ W
Sbjct: 6 LDYEILGAHAQSVEIILDPGETVIAEAGAMNYMTEGVRFETRMGDGSESGVLGKLWGMGK 65
Query: 75 QWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSVNDVK 134
+ L G+++ N G V AAPY ++P+DLA G ++CQ D+FLC+ +
Sbjct: 66 RMLTGESLFMTHFSNHGKRQARVAFAAPYPGTVVPVDLAEIGGTLICQKDSFLCAARGTE 125
Query: 135 VSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCIVAVT 194
+ + +R G E F+ Q+LSG GLAF+ GG+V++K L E L +D C+V +
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLSGDGLAFLHAGGAVIRKELR-DETLRLDTGCLVGFS 184
Query: 195 STVDIQIKYNGPARRTMFG 213
+D I+ G + +FG
Sbjct: 185 RGIDYDIQLAGGLKSMLFG 203
>ref|NP_633330.1| HTH DNA-binding protein [Methanosarcina mazei Go1]
gi|20905772|gb|AAM31002.1| HTH DNA-binding protein
[Methanosarcina mazei Goe1]
Length = 250
Score = 114 bits (284), Expect = 3e-24
Identities = 68/190 (35%), Positives = 104/190 (53%), Gaps = 11/190 (5%)
Query: 33 QVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPEN-------EVGIWQWLFGKT--VT 83
Q+V+I L P E + A+ G+M +M + M+ E + G+ + L G++ +T
Sbjct: 2 QIVEIELDPGEAVQAEAGAMAYMGPGILMQTGMGNEGGGLFGGLKKGLKRALTGESFFIT 61
Query: 84 NIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSVNDVKVSNTVDQRG 143
+ + + SG G V AAPY +ILP+DL+ F G ILCQ DAFLC+ ++V ++
Sbjct: 62 SFIHKGSGK--GHVAFAAPYPGKILPLDLSKFGGSILCQKDAFLCAAKGIEVELAFTRKL 119
Query: 144 RNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCIVAVTSTVDIQIKY 203
+ G E F+ Q+L G GLAFI GG+V++K L GE VD C+ A T V I +
Sbjct: 120 GAGLFGGEGFILQRLRGDGLAFIHIGGTVIRKDLAPGETYKVDTGCVAAFTENVTYDITW 179
Query: 204 NGPARRTMFG 213
+ + +FG
Sbjct: 180 SRDFKNALFG 189
>gb|EAM71983.1| Protein of unknown function DUF124 [Desulfuromonas acetoxidans DSM
684]
Length = 333
Score = 113 bits (282), Expect = 4e-24
Identities = 66/201 (32%), Positives = 102/201 (49%), Gaps = 10/201 (4%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAY---------LPENEVGI 73
I F I G E Q V+I L P E +A+ G+M + + ++ ME + +G
Sbjct: 75 IDFTIHGTEMQFVEIELDPGESAVAEAGAMMYKASTISMETVFGDGGPQTGGFMGKLLGA 134
Query: 74 WQWLF-GKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSVND 132
+ L G+++ V + G V APY I+P+ L G ++CQ DAFLC+
Sbjct: 135 GKRLVTGESLFTTVFTHQGHGKAHVAFGAPYPGNIIPVALDAMGGSLICQKDAFLCAARG 194
Query: 133 VKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCIVA 192
V + + +R + G E F+ QKL G G+AF+ GGS+V++ L+ GE L VD C+VA
Sbjct: 195 VSIGLHLQKRILTGLFGGEGFIMQKLEGDGMAFLHAGGSIVERELKPGEELHVDTGCVVA 254
Query: 193 VTSTVDIQIKYNGPARRTMFG 213
V I+ G + ++FG
Sbjct: 255 YEPKVSFDIQQAGGIKTSLFG 275
>ref|ZP_00129004.2| COG2013: Uncharacterized conserved protein [Desulfovibrio
desulfuricans G20]
Length = 242
Score = 112 bits (279), Expect = 1e-23
Identities = 71/202 (35%), Positives = 105/202 (51%), Gaps = 13/202 (6%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEV---GIWQWLFG 79
+ ++I GG+ QVV++ L P E +IA+ G+MC+M G +E A + + G + L G
Sbjct: 6 VEYRITGGDLQVVEVELDPGETVIAEAGAMCWMDGDIEFA-ARMGDGSAADGGFFGKLLG 64
Query: 80 ---KTVTNIVV-----RNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSVN 131
+ VT + N G + V A ++P+DLA GE++CQ DAFLC+
Sbjct: 65 AGKRLVTGESLFMTHFTNQGQAKASVAFAGQVPGHVVPVDLAEIGGELICQRDAFLCAAR 124
Query: 132 DVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCIV 191
++ +R G E F+ Q+L G GLAF+ GG+VV+K L G L VD C+V
Sbjct: 125 GTRIDVAFSKRLGAGFFGGEGFVLQRLRGDGLAFVHAGGAVVRKELH-GGTLRVDTGCLV 183
Query: 192 AVTSTVDIQIKYNGPARRTMFG 213
A T V I +G + MFG
Sbjct: 184 AFTPGVSYDIGLSGGLKSMMFG 205
>gb|AAT50532.1| PA3696 [synthetic construct]
Length = 249
Score = 109 bits (273), Expect = 5e-23
Identities = 66/200 (33%), Positives = 103/200 (51%), Gaps = 11/200 (5%)
Query: 23 IPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVEMENAYLPENEVG-----IW--- 74
+ ++ILGG Q V+I L P E +IA+ G+M +M+G + A + + G +W
Sbjct: 6 LDYRILGGSMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64
Query: 75 -QWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEILCQPDAFLCSVNDV 133
+ L G++V N G V AAPY ++ +DL G + CQ D+FLC+
Sbjct: 65 KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124
Query: 134 KVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVGEVLAVDVSCIVAV 193
+V +R G E F+ QKL G GL F+ GG+++++ L GE L VD C+VA
Sbjct: 125 RVGIAFTKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183
Query: 194 TSTVDIQIKYNGPARRTMFG 213
T +D ++ G + +FG
Sbjct: 184 TDGIDYDVQLAGGLKSMLFG 203
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.322 0.139 0.412
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 364,261,558
Number of Sequences: 2540612
Number of extensions: 14410674
Number of successful extensions: 26324
Number of sequences better than 10.0: 79
Number of HSP's better than 10.0 without gapping: 50
Number of HSP's successfully gapped in prelim test: 29
Number of HSP's that attempted gapping in prelim test: 26202
Number of HSP's gapped (non-prelim): 80
length of query: 218
length of database: 863,360,394
effective HSP length: 123
effective length of query: 95
effective length of database: 550,865,118
effective search space: 52332186210
effective search space used: 52332186210
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 72 (32.3 bits)
Medicago: description of AC119408.7