
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146757.12 + phase: 0 /partial
(192 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAF05861.1| unknown protein [Arabidopsis thaliana] gi|1522921... 162 5e-39
ref|ZP_00520355.1| Pyridoxamine 5'-phosphate oxidase-related [So... 39 0.10
emb|CAB61996.1| putative protein [Arabidopsis thaliana] gi|15229... 38 0.13
ref|ZP_00323630.1| COG1609: Transcriptional regulators [Pediococ... 36 0.50
ref|XP_640361.1| hypothetical protein DDB0205141 [Dictyostelium ... 34 2.5
ref|ZP_00324480.1| COG3210: Large exoproteins involved in heme u... 33 4.2
gb|AAD10268.1| putative histidine kinase [Lactobacillus sakei] 33 5.5
ref|NP_978181.1| sensor histidine kinase, putative [Bacillus cer... 33 5.5
ref|YP_098680.1| hypothetical protein BF1395 [Bacteroides fragil... 33 5.5
ref|ZP_00287487.1| COG1609: Transcriptional regulators [Enteroco... 32 7.2
dbj|BAA09310.1| CTP: phosphoethanolamine cytidylyltransferase [S... 32 7.2
ref|ZP_00236562.1| MW1208, putative [Bacillus cereus G9241] gi|4... 32 7.2
gb|AAP08709.1| Sensory Transduction Protein Kinase [Bacillus cer... 32 9.5
ref|NP_703540.1| hypothetical protein [Plasmodium falciparum 3D7... 32 9.5
ref|NP_816580.1| sugar-binding transcriptional regulator, LacI f... 32 9.5
emb|CAG62304.1| unnamed protein product [Candida glabrata CBS138... 32 9.5
gb|AAO78030.1| acetyl-CoA synthetase [Bacteroides thetaiotaomicr... 32 9.5
dbj|BAD82359.1| unknown protein [Oryza sativa (japonica cultivar... 32 9.5
ref|NP_143151.1| hypothetical protein PH1257 [Pyrococcus horikos... 32 9.5
>gb|AAF05861.1| unknown protein [Arabidopsis thaliana] gi|15229211|ref|NP_187052.1|
expressed protein [Arabidopsis thaliana]
Length = 236
Score = 162 bits (410), Expect = 5e-39
Identities = 87/192 (45%), Positives = 127/192 (65%), Gaps = 5/192 (2%)
Query: 3 ALTEKVHEVIRSEEKATRKFSYTVSGVLSSGGSS-TSRSDNLQKLL-EVTEKYSVYRFKT 60
A+ + V E I+SE KA +V +L+S SR D+L+ L+ + EKY +Y+F
Sbjct: 48 AVKKYVEEAIQSEMKAISDTPNSVRSILNSSDQMYASRCDSLRALINDAKEKYVIYKFVP 107
Query: 61 RSCTFIDGHGGTFDVNIEDLGTSKADLLAPFSAKLIDGINQSEARRRALVLLCFVYMNTN 120
SC FID +G T +++++ L SK D L +S KL+DGIN++E+RRRAL+L C +++ N
Sbjct: 108 SSCMFIDPNG-TKEIDLKVLELSKPDPLGTWSTKLVDGINKNESRRRALILFCLYFLDIN 166
Query: 121 AKDAYVTSVDRKGFDVLAKVTGPVSKDGVGQYQWKELRFMFEQEANDVETFCQHLVQMEE 180
A+DAY+ SVDRKGF +L KV P ++ +YQW+E RF FE+E DVE FC LV+ME+
Sbjct: 167 ARDAYMVSVDRKGFHLLGKV--PSEQEAGDEYQWREFRFEFEEEVKDVEAFCHQLVEMEQ 224
Query: 181 EVIKKVSASSGL 192
EV+ K + +GL
Sbjct: 225 EVVSKFTDHTGL 236
>ref|ZP_00520355.1| Pyridoxamine 5'-phosphate oxidase-related [Solibacter usitatus
Ellin6076] gi|67865705|gb|EAM60705.1| Pyridoxamine
5'-phosphate oxidase-related [Solibacter usitatus
Ellin6076]
Length = 272
Score = 38.5 bits (88), Expect = 0.10
Identities = 32/102 (31%), Positives = 47/102 (45%), Gaps = 3/102 (2%)
Query: 38 SRSDNLQKLLEVTEKYSVYRFKTRSCTFIDGHGGTFDVNIEDLGTSKADLLAPFSAKLID 97
SR N + + T+ ++ +R + FI G G V D + D LA + +I
Sbjct: 139 SRYSNARAWQDYTD-FAYFRLEISGVYFIGGFGVMGWVTAADYTAASPDPLAEAAPGIIR 197
Query: 98 GINQSEARRRALVLLCFVYMNTNAKDAYVTSVDRKGFDVLAK 139
+N A ALVL+ + A +A +TSVDR GF V K
Sbjct: 198 HMNADHAD--ALVLIARHFAGETADEASMTSVDRLGFHVRLK 237
>emb|CAB61996.1| putative protein [Arabidopsis thaliana] gi|15229097|ref|NP_190483.1|
pentatricopeptide (PPR) repeat-containing protein
[Arabidopsis thaliana] gi|11358281|pir||T46116
hypothetical protein T2J13.20 - Arabidopsis thaliana
Length = 1229
Score = 38.1 bits (87), Expect = 0.13
Identities = 26/107 (24%), Positives = 50/107 (46%), Gaps = 2/107 (1%)
Query: 31 SSGGSSTSRSDNLQKLLEVTEKYSVYRFKTRSCTFIDGHGGTFDVNIEDLGTSKADLLAP 90
S G S D+ Q + + Y+ + I G +V +ED+ ++ D +A
Sbjct: 1078 SKAGGDESEIDSSQDE-KARNVVAFYKLEMIRIQLITAQGDQTEVEVEDVRKAQPDAIAH 1136
Query: 91 FSAKLIDGINQSEAR-RRALVLLCFVYMNTNAKDAYVTSVDRKGFDV 136
SA++I + +S + AL LC+ + + A++ + +D GFD+
Sbjct: 1137 ASAEIISRLEESGDKITEALKSLCWRHNSIQAEEVKLIGIDSLGFDL 1183
>ref|ZP_00323630.1| COG1609: Transcriptional regulators [Pediococcus pentosaceus ATCC
25745]
Length = 332
Score = 36.2 bits (82), Expect = 0.50
Identities = 16/70 (22%), Positives = 35/70 (49%)
Query: 80 LGTSKADLLAPFSAKLIDGINQSEARRRALVLLCFVYMNTNAKDAYVTSVDRKGFDVLAK 139
+G D+ PF ++L+ GI ++ + +LC ++ +DAY+ + R+G D
Sbjct: 67 IGVLVPDITNPFFSELVRGIENIFYQKGFITMLCNADLDQEKEDAYLAELSRRGVDGFII 126
Query: 140 VTGPVSKDGV 149
+ +S + +
Sbjct: 127 ASSVISNESI 136
>ref|XP_640361.1| hypothetical protein DDB0205141 [Dictyostelium discoideum]
gi|60468377|gb|EAL66383.1| hypothetical protein
DDB0205141 [Dictyostelium discoideum]
Length = 1640
Score = 33.9 bits (76), Expect = 2.5
Identities = 26/102 (25%), Positives = 42/102 (40%), Gaps = 6/102 (5%)
Query: 6 EKVHEVIRSEEKATRKFSYTVSGVLSSGGSSTSRSDNLQKLLEVTEKYSVYRFKTRSCTF 65
EKV E I+S+ + T + +S S NL+ LL +K +V + K +
Sbjct: 15 EKVDEPIKSKNPQENVYKRTYENIGNSSRRSKPLDINLETLLNDNKKKTVKKIKKEDKSD 74
Query: 66 IDGHGGTFDVNIEDLGTSKADLLAPFSAKLIDGINQSEARRR 107
I+ T +N E L P L D N+ E+ ++
Sbjct: 75 INSSSNTIKINEE------VSELKPTKTGLNDSTNEIESEKK 110
>ref|ZP_00324480.1| COG3210: Large exoproteins involved in heme utilization or adhesion
[Trichodesmium erythraeum IMS101]
Length = 3101
Score = 33.1 bits (74), Expect = 4.2
Identities = 19/66 (28%), Positives = 33/66 (49%), Gaps = 2/66 (3%)
Query: 22 FSYTVSGVLSSGGSSTSRSDNLQKLLEVTEKYSVYRFKTRSCTFIDGHGGTFDVNIEDLG 81
+SY+ G +GG ++ ++ + EK +++ F R F +G GG DVNI
Sbjct: 1050 YSYSEKGNSRNGGDIKLNANKIKPTSDNEEKLTIHTFSFRKNKFGEGKGG--DVNITTNN 1107
Query: 82 TSKADL 87
SK ++
Sbjct: 1108 LSKTEI 1113
>gb|AAD10268.1| putative histidine kinase [Lactobacillus sakei]
Length = 478
Score = 32.7 bits (73), Expect = 5.5
Identities = 19/57 (33%), Positives = 33/57 (57%), Gaps = 2/57 (3%)
Query: 49 VTEKYSVYRF-KTRSCTFIDGHGGTFDVNIEDLGTSKADLLAPFSAKLIDGINQSEA 104
V +Y +YR + R+ T + G FDV+I++ G + D LA +++ +N+SEA
Sbjct: 186 VLARYQIYRISRLRNATHLVAEGD-FDVHIDNPGKDELDDLATDFNEMVHSLNESEA 241
>ref|NP_978181.1| sensor histidine kinase, putative [Bacillus cereus ATCC 10987]
gi|42736855|gb|AAS40789.1| sensor histidine kinase,
putative [Bacillus cereus ATCC 10987]
Length = 366
Score = 32.7 bits (73), Expect = 5.5
Identities = 28/98 (28%), Positives = 44/98 (44%), Gaps = 15/98 (15%)
Query: 43 LQKLLEVTEKYSVYRFKTRSCTFIDGHGGTFDVNIEDLGTSKADLLAPFSAKLIDGINQS 102
L+ + TE+ ++R K FI+ G DVNI + D+++P K+I G N +
Sbjct: 236 LKNMKPPTEQIGIHRMKL----FIEEFAGKHDVNIPFVYKGNLDMISPIQWKII-GENVT 290
Query: 103 EARRRALVLLCFVYMNTNAKDAYVTSVDRKGFDVLAKV 140
EA A+ DA V S+D + + KV
Sbjct: 291 EALTNAM----------KYADATVISIDIHVLNKMVKV 318
>ref|YP_098680.1| hypothetical protein BF1395 [Bacteroides fragilis YCH46]
gi|52215553|dbj|BAD48146.1| conserved hypothetical
protein [Bacteroides fragilis YCH46]
Length = 857
Score = 32.7 bits (73), Expect = 5.5
Identities = 24/83 (28%), Positives = 38/83 (44%), Gaps = 12/83 (14%)
Query: 81 GTSKADLLAPFSAKLIDGINQSEARRRA---LVLLCFVYMNTNAKDAYVTSVD---RKGF 134
G K + PF+ + DG+N+ E+R L+L+ + N Y S D RK
Sbjct: 393 GKEKIPVQTPFALSVRDGMNEVESRHNMLTDLLLMSEIKGYVNNPQYYFESRDDTRRKAI 452
Query: 135 DVLAKVTGPVSKDGVGQYQWKEL 157
D+L +V G +Y WK++
Sbjct: 453 DLLLRV------QGWRRYSWKQM 469
>ref|ZP_00287487.1| COG1609: Transcriptional regulators [Enterococcus faecium]
Length = 337
Score = 32.3 bits (72), Expect = 7.2
Identities = 19/79 (24%), Positives = 35/79 (44%)
Query: 80 LGTSKADLLAPFSAKLIDGINQSEARRRALVLLCFVYMNTNAKDAYVTSVDRKGFDVLAK 139
+G D+ PF + L+ GI ++ + +LC + + Y+ + R+G D
Sbjct: 67 IGVLVPDITNPFFSTLMRGIEDILYKQNFVTILCNADSDHQKEIEYLAELTRRGVDGFII 126
Query: 140 VTGPVSKDGVGQYQWKELR 158
T VS D + + K+ R
Sbjct: 127 ATSAVSTDAINENLKKQGR 145
>dbj|BAA09310.1| CTP: phosphoethanolamine cytidylyltransferase [Saccharomyces
cerevisiae]
Length = 323
Score = 32.3 bits (72), Expect = 7.2
Identities = 31/111 (27%), Positives = 49/111 (43%), Gaps = 11/111 (9%)
Query: 56 YRFKTRSCTFIDGHGGTFDV-NIEDLGTSKADLLAPFSAKLIDGINQSE-------ARRR 107
Y+F C ++DG F + +I+ L K DL KLI GI S+ + R
Sbjct: 192 YKFDAEDCVYVDGDFDLFHMGDIDQLRKLKMDLHP--DKKLIVGITTSDYSSTIMTMKER 249
Query: 108 ALVLLCFVYMNTNAKDAYVTSVDRKGFDVLAKVTGPVSKDG-VGQYQWKEL 157
L++L Y++ DA TS+ + + T ++ G +Y KEL
Sbjct: 250 VLIVLSCKYVDAVIIDADATSMSQYNCEKYHIGTAVLTAAGKFSEYLTKEL 300
>ref|ZP_00236562.1| MW1208, putative [Bacillus cereus G9241] gi|47557511|gb|EAL15838.1|
MW1208, putative [Bacillus cereus G9241]
Length = 366
Score = 32.3 bits (72), Expect = 7.2
Identities = 27/98 (27%), Positives = 44/98 (44%), Gaps = 15/98 (15%)
Query: 43 LQKLLEVTEKYSVYRFKTRSCTFIDGHGGTFDVNIEDLGTSKADLLAPFSAKLIDGINQS 102
L+ + TE+ ++R K FI+ G D+NI + D+++P K+I G N +
Sbjct: 236 LKNMKPPTEQIGIHRMKL----FIEEFAGKHDINIPFVYKGNLDMISPIQWKII-GENVT 290
Query: 103 EARRRALVLLCFVYMNTNAKDAYVTSVDRKGFDVLAKV 140
EA A+ DA V S+D + + KV
Sbjct: 291 EALTNAM----------KYADATVISIDIHVLNKMVKV 318
>gb|AAP08709.1| Sensory Transduction Protein Kinase [Bacillus cereus ATCC 14579]
gi|30019877|ref|NP_831508.1| Sensory Transduction
Protein Kinase [Bacillus cereus ATCC 14579]
Length = 366
Score = 32.0 bits (71), Expect = 9.5
Identities = 28/98 (28%), Positives = 44/98 (44%), Gaps = 15/98 (15%)
Query: 43 LQKLLEVTEKYSVYRFKTRSCTFIDGHGGTFDVNIEDLGTSKADLLAPFSAKLIDGINQS 102
L+ + TE+ ++R K FI+ G DVNI + D+++P K+I G N +
Sbjct: 236 LKNMKPPTEQIGIHRMKL----FIEEFAGKNDVNIPFVYKGNLDMISPIQWKII-GENVT 290
Query: 103 EARRRALVLLCFVYMNTNAKDAYVTSVDRKGFDVLAKV 140
EA A+ DA V S+D + + KV
Sbjct: 291 EALTNAM----------KYADATVISIDIHVLNKMVKV 318
>ref|NP_703540.1| hypothetical protein [Plasmodium falciparum 3D7]
gi|23504682|emb|CAD51560.1| hypothetical protein
[Plasmodium falciparum 3D7]
Length = 2488
Score = 32.0 bits (71), Expect = 9.5
Identities = 22/71 (30%), Positives = 33/71 (45%), Gaps = 8/71 (11%)
Query: 74 DVNIE---DLGTSKADLLAPFSAKLIDGINQSEARRRALVLLCFVYMN-----TNAKDAY 125
D++IE DL S +LL P++ +IDGIN + + MN N K AY
Sbjct: 1937 DISIESSKDLNISSNNLLTPYTNNMIDGINIQSTNEARTINTKMILMNPLINADNNKIAY 1996
Query: 126 VTSVDRKGFDV 136
++ K +V
Sbjct: 1997 KNMLNNKYLNV 2007
>ref|NP_816580.1| sugar-binding transcriptional regulator, LacI family [Enterococcus
faecalis V583] gi|29344893|gb|AAO82650.1| sugar-binding
transcriptional regulator, LacI family [Enterococcus
faecalis V583]
Length = 331
Score = 32.0 bits (71), Expect = 9.5
Identities = 16/72 (22%), Positives = 32/72 (44%)
Query: 80 LGTSKADLLAPFSAKLIDGINQSEARRRALVLLCFVYMNTNAKDAYVTSVDRKGFDVLAK 139
+G D+ PF A+LI GI + +++LC + + Y+T + R+ D
Sbjct: 67 IGVIVPDITNPFFAQLIRGIESVLYKENFILMLCNADQDVTREHEYLTELIRRSVDGFVI 126
Query: 140 VTGPVSKDGVGQ 151
+ +S + +
Sbjct: 127 ASSEISNQTINE 138
>emb|CAG62304.1| unnamed protein product [Candida glabrata CBS138]
gi|50293837|ref|XP_449330.1| unnamed protein product
[Candida glabrata]
Length = 885
Score = 32.0 bits (71), Expect = 9.5
Identities = 20/62 (32%), Positives = 33/62 (52%)
Query: 1 AKALTEKVHEVIRSEEKATRKFSYTVSGVLSSGGSSTSRSDNLQKLLEVTEKYSVYRFKT 60
AKAL+ ++ + E+A T S V +STSR+ ++ K+L+ E+ + R K
Sbjct: 138 AKALSVGTEKLFTASEEARHAGQKTYSIVSQVSAASTSRAMDMSKVLKGAERKAESRLKE 197
Query: 61 RS 62
RS
Sbjct: 198 RS 199
>gb|AAO78030.1| acetyl-CoA synthetase [Bacteroides thetaiotaomicron VPI-5482]
gi|29348333|ref|NP_811836.1| acetyl-CoA synthetase
[Bacteroides thetaiotaomicron VPI-5482]
Length = 686
Score = 32.0 bits (71), Expect = 9.5
Identities = 16/32 (50%), Positives = 19/32 (59%)
Query: 119 TNAKDAYVTSVDRKGFDVLAKVTGPVSKDGVG 150
+N K+ V R GF V+AKV GPV K VG
Sbjct: 505 SNKKEEVVDFARRCGFPVVAKVVGPVHKSDVG 536
>dbj|BAD82359.1| unknown protein [Oryza sativa (japonica cultivar-group)]
Length = 448
Score = 32.0 bits (71), Expect = 9.5
Identities = 19/86 (22%), Positives = 44/86 (51%), Gaps = 1/86 (1%)
Query: 54 SVYRFKTRSCTFIDGHGGTFDVNIEDLGTSKADLLAPFSAKLIDGINQSEAR-RRALVLL 112
++Y+ + + +G ++ +D ++ D+LA ++++I+ I +++ + AL L
Sbjct: 317 TIYKLEIMTVELFSIYGKQLMIDPQDFQDAEPDILANSASEIINRIKENDDQCAMALRSL 376
Query: 113 CFVYMNTNAKDAYVTSVDRKGFDVLA 138
C ++A + S+D G DV A
Sbjct: 377 CHRKKGLTVEEASLISIDSLGIDVRA 402
>ref|NP_143151.1| hypothetical protein PH1257 [Pyrococcus horikoshii OT3]
gi|3257676|dbj|BAA30359.1| 227aa long hypothetical
protein [Pyrococcus horikoshii OT3]
gi|7450311|pir||E71070 hypothetical protein PH1257 -
Pyrococcus horikoshii
Length = 227
Score = 32.0 bits (71), Expect = 9.5
Identities = 19/67 (28%), Positives = 35/67 (51%), Gaps = 9/67 (13%)
Query: 42 NLQKLLEVTEKYSVYRFKTRSCTFIDGHGGTFDVNIEDLGTSKADLLAPFSAKLIDGINQ 101
NL++L +++EKY ++ I G GG F+ + D+ KA ++ + K DG++
Sbjct: 165 NLEELKKLSEKYGIH---------IAGEGGEFETFVLDMPFFKAKIVIDDAEKFWDGLSG 215
Query: 102 SEARRRA 108
+RA
Sbjct: 216 KFIIKRA 222
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.316 0.131 0.362
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 289,335,284
Number of Sequences: 2540612
Number of extensions: 10830651
Number of successful extensions: 31320
Number of sequences better than 10.0: 19
Number of HSP's better than 10.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 31311
Number of HSP's gapped (non-prelim): 19
length of query: 192
length of database: 863,360,394
effective HSP length: 121
effective length of query: 71
effective length of database: 555,946,342
effective search space: 39472190282
effective search space used: 39472190282
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 71 (32.0 bits)
Medicago: description of AC146757.12