
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146588.13 - phase: 0
(107 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q9C5K0 Hypothetical protein At1g70160 [Arabidopsis tha... 171 4e-42
UniRef100_O04530 F20P5.12 protein [Arabidopsis thaliana] 165 3e-40
UniRef100_Q7EZU2 Hypothetical protein P0456B03.107 [Oryza sativa] 153 1e-36
UniRef100_Q6H416 Hypothetical protein B1175F05.20 [Oryza sativa] 150 7e-36
UniRef100_Q5ZBU1 Hypothetical protein P0684C02.6-2 [Oryza sativa] 138 3e-32
UniRef100_Q8S1L2 Hypothetical protein P0684C02.6-1 [Oryza sativa] 138 3e-32
UniRef100_Q9SZ41 Hypothetical protein F10M23.360 [Arabidopsis th... 128 3e-29
UniRef100_Q94B24 Hypothetical protein MBG8.13 [Arabidopsis thali... 124 4e-28
UniRef100_Q9FFU3 Arabidopsis thaliana genomic DNA, chromosome 5,... 124 4e-28
UniRef100_Q93ZG0 AT4g27020/F10M23_360 [Arabidopsis thaliana] 55 5e-07
UniRef100_Q8PJG4 Hypothetical protein XAC2568 [Xanthomonas axono... 33 1.2
UniRef100_UPI000033B19C UPI000033B19C UniRef100 entry 33 2.1
UniRef100_UPI00003217F3 UPI00003217F3 UniRef100 entry 32 2.7
UniRef100_Q95LP0 Hypothetical protein [Macaca fascicularis] 32 3.5
UniRef100_UPI0000369645 UPI0000369645 UniRef100 entry 31 6.1
UniRef100_UPI0000140CE7 UPI0000140CE7 UniRef100 entry 31 6.1
UniRef100_Q5V3E7 A/G-specific adenine glycosylase [Haloarcula ma... 31 6.1
UniRef100_UPI00002E496E UPI00002E496E UniRef100 entry 31 7.9
UniRef100_Q9HPQ6 A/G specific adenine glycosylase, repair protei... 31 7.9
>UniRef100_Q9C5K0 Hypothetical protein At1g70160 [Arabidopsis thaliana]
Length = 523
Score = 171 bits (433), Expect = 4e-42
Identities = 85/112 (75%), Positives = 92/112 (81%), Gaps = 6/112 (5%)
Query: 1 MPSGMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHSG 57
MPSGMLGTLLS ID +PLFS+TAWG NL FL KHMGAT RSQPWR+ +P DVHSG
Sbjct: 163 MPSGMLGTLLSLIDVLPLFSNTAWGQNANLAFLTKHMGATFEKRSQPWRSMINPEDVHSG 222
Query: 58 DFLAVSKICGRWGGFETLEKLVT---ADHAAVCLKDEMGSLWVGESGHENEK 106
DFLAVSKI GRWGGFETLEK VT A H AVCLKD++G+LWVGESGHENEK
Sbjct: 223 DFLAVSKIRGRWGGFETLEKWVTGAFAGHTAVCLKDDLGNLWVGESGHENEK 274
>UniRef100_O04530 F20P5.12 protein [Arabidopsis thaliana]
Length = 551
Score = 165 bits (417), Expect = 3e-40
Identities = 82/109 (75%), Positives = 89/109 (81%), Gaps = 6/109 (5%)
Query: 4 GMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHSGDFL 60
GMLGTLLS ID +PLFS+TAWG NL FL KHMGAT RSQPWR+ +P DVHSGDFL
Sbjct: 194 GMLGTLLSLIDVLPLFSNTAWGQNANLAFLTKHMGATFEKRSQPWRSMINPEDVHSGDFL 253
Query: 61 AVSKICGRWGGFETLEKLVT---ADHAAVCLKDEMGSLWVGESGHENEK 106
AVSKI GRWGGFETLEK VT A H AVCLKD++G+LWVGESGHENEK
Sbjct: 254 AVSKIRGRWGGFETLEKWVTGAFAGHTAVCLKDDLGNLWVGESGHENEK 302
>UniRef100_Q7EZU2 Hypothetical protein P0456B03.107 [Oryza sativa]
Length = 547
Score = 153 bits (386), Expect = 1e-36
Identities = 77/112 (68%), Positives = 86/112 (76%), Gaps = 6/112 (5%)
Query: 1 MPSGMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHSG 57
MPSGMLGTLLS ID +PLFS+T WG NL FL KHMGA+ R+QPW A DVHSG
Sbjct: 187 MPSGMLGTLLSLIDVIPLFSNTIWGQDANLAFLQKHMGASFEKRTQPWSANIRKEDVHSG 246
Query: 58 DFLAVSKICGRWGGFETLEKLVT---ADHAAVCLKDEMGSLWVGESGHENEK 106
DFLA+SKI GRWGGF+TLEK VT A H AVCLKDE G+LWV ESG+EN+K
Sbjct: 247 DFLALSKIRGRWGGFQTLEKWVTGAFAGHTAVCLKDENGTLWVAESGYENKK 298
>UniRef100_Q6H416 Hypothetical protein B1175F05.20 [Oryza sativa]
Length = 546
Score = 150 bits (379), Expect = 7e-36
Identities = 77/112 (68%), Positives = 83/112 (73%), Gaps = 6/112 (5%)
Query: 1 MPSGMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHSG 57
MPSGMLGTLLS ID +PLFS+T WG NL FL KHMGA+ RSQPW T D+HSG
Sbjct: 186 MPSGMLGTLLSLIDVLPLFSNTGWGQHSNLAFLEKHMGASFEKRSQPWVTTIRKEDIHSG 245
Query: 58 DFLAVSKICGRWGGFETLEKLVT---ADHAAVCLKDEMGSLWVGESGHENEK 106
DFLA+SKI GRWG FETLEK VT A H AVCLKDE G +WV ESG ENEK
Sbjct: 246 DFLALSKIRGRWGAFETLEKWVTGAFAGHTAVCLKDEKGEVWVAESGFENEK 297
>UniRef100_Q5ZBU1 Hypothetical protein P0684C02.6-2 [Oryza sativa]
Length = 539
Score = 138 bits (348), Expect = 3e-32
Identities = 69/112 (61%), Positives = 80/112 (70%), Gaps = 6/112 (5%)
Query: 1 MPSGMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHSG 57
MPSG +GTL + D PLF++T WG NL FL KHMGAT R +PW + + D+HSG
Sbjct: 177 MPSGTIGTLRALWDVFPLFTNTQWGENSNLAFLKKHMGATFEERPKPWVSELNVDDIHSG 236
Query: 58 DFLAVSKICGRWGGFETLEKLVT---ADHAAVCLKDEMGSLWVGESGHENEK 106
DFL +SKI GRWGGFETLEK VT A H AVCL+D G LWVGESGHENE+
Sbjct: 237 DFLVLSKIRGRWGGFETLEKWVTGAYAGHTAVCLRDSEGKLWVGESGHENEQ 288
>UniRef100_Q8S1L2 Hypothetical protein P0684C02.6-1 [Oryza sativa]
Length = 545
Score = 138 bits (348), Expect = 3e-32
Identities = 69/112 (61%), Positives = 80/112 (70%), Gaps = 6/112 (5%)
Query: 1 MPSGMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHSG 57
MPSG +GTL + D PLF++T WG NL FL KHMGAT R +PW + + D+HSG
Sbjct: 183 MPSGTIGTLRALWDVFPLFTNTQWGENSNLAFLKKHMGATFEERPKPWVSELNVDDIHSG 242
Query: 58 DFLAVSKICGRWGGFETLEKLVT---ADHAAVCLKDEMGSLWVGESGHENEK 106
DFL +SKI GRWGGFETLEK VT A H AVCL+D G LWVGESGHENE+
Sbjct: 243 DFLVLSKIRGRWGGFETLEKWVTGAYAGHTAVCLRDSEGKLWVGESGHENEQ 294
>UniRef100_Q9SZ41 Hypothetical protein F10M23.360 [Arabidopsis thaliana]
Length = 523
Score = 128 bits (322), Expect = 3e-29
Identities = 64/112 (57%), Positives = 76/112 (67%), Gaps = 6/112 (5%)
Query: 1 MPSGMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHSG 57
M +GMLGTL + D PLF++T WG N+ FL HMGA R +PW ++HSG
Sbjct: 161 MEAGMLGTLRALWDVFPLFTNTGWGENSNIAFLKNHMGANFYPRPKPWVTNITTDEIHSG 220
Query: 58 DFLAVSKICGRWGGFETLEKLVT---ADHAAVCLKDEMGSLWVGESGHENEK 106
D LA+SKI GRWGGFETLEK V+ A H AVCL+D G LWVGESG+ENEK
Sbjct: 221 DLLAISKIRGRWGGFETLEKWVSGAYAGHTAVCLRDSEGKLWVGESGNENEK 272
>UniRef100_Q94B24 Hypothetical protein MBG8.13 [Arabidopsis thaliana]
Length = 318
Score = 124 bits (312), Expect = 4e-28
Identities = 64/112 (57%), Positives = 75/112 (66%), Gaps = 6/112 (5%)
Query: 1 MPSGMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHSG 57
M +GMLGTL + D PLFS+T WG NL FL KHMGA R +PW + SG
Sbjct: 169 MHAGMLGTLQALWDVFPLFSNTGWGESSNLAFLEKHMGANFEPRPEPWVTNVTTDQIQSG 228
Query: 58 DFLAVSKICGRWGGFETLEKLVT---ADHAAVCLKDEMGSLWVGESGHENEK 106
D LA+SKI GRWGGFETLEK V+ A H+AV L+D G LWVGESG+EN+K
Sbjct: 229 DLLAISKIRGRWGGFETLEKWVSGAYAGHSAVALRDSEGKLWVGESGNENDK 280
>UniRef100_Q9FFU3 Arabidopsis thaliana genomic DNA, chromosome 5, P1 clone:MBG8
[Arabidopsis thaliana]
Length = 531
Score = 124 bits (312), Expect = 4e-28
Identities = 64/112 (57%), Positives = 75/112 (66%), Gaps = 6/112 (5%)
Query: 1 MPSGMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHSG 57
M +GMLGTL + D PLFS+T WG NL FL KHMGA R +PW + SG
Sbjct: 169 MHAGMLGTLQALWDVFPLFSNTGWGESSNLAFLEKHMGANFEPRPEPWVTNVTTDQIQSG 228
Query: 58 DFLAVSKICGRWGGFETLEKLVT---ADHAAVCLKDEMGSLWVGESGHENEK 106
D LA+SKI GRWGGFETLEK V+ A H+AV L+D G LWVGESG+EN+K
Sbjct: 229 DLLAISKIRGRWGGFETLEKWVSGAYAGHSAVALRDSEGKLWVGESGNENDK 280
>UniRef100_Q93ZG0 AT4g27020/F10M23_360 [Arabidopsis thaliana]
Length = 222
Score = 54.7 bits (130), Expect = 5e-07
Identities = 26/59 (44%), Positives = 34/59 (57%), Gaps = 3/59 (5%)
Query: 1 MPSGMLGTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGAT---RSQPWRATTDPADVHS 56
M +GMLGTL + D PLF++T WG N+ FL HMGA R +PW ++HS
Sbjct: 161 MEAGMLGTLRALWDVFPLFTNTGWGENSNIAFLKNHMGANFYPRPKPWVTNITTDEIHS 219
>UniRef100_Q8PJG4 Hypothetical protein XAC2568 [Xanthomonas axonopodis]
Length = 384
Score = 33.5 bits (75), Expect = 1.2
Identities = 24/86 (27%), Positives = 41/86 (46%), Gaps = 3/86 (3%)
Query: 13 IDAVPLFSDTAW-GHKVNLDFLNKHMGATRSQPWRATTDPADVHSGDFLAVSKICGRWGG 71
+ A+P + D+A ++ F+N + T W P ++ + A ++ RWGG
Sbjct: 159 VTALPYYRDSATLAGRLLAQFVNNNAALTYLG-WLPFAAPLTINGMCYAARTEQLRRWGG 217
Query: 72 FETLEKLVTADHA-AVCLKDEMGSLW 96
F L + +T D A A ++D G LW
Sbjct: 218 FTALLEQLTDDLAFATLVRDRGGRLW 243
>UniRef100_UPI000033B19C UPI000033B19C UniRef100 entry
Length = 222
Score = 32.7 bits (73), Expect = 2.1
Identities = 25/75 (33%), Positives = 33/75 (43%), Gaps = 9/75 (12%)
Query: 19 FSDTAWGHKVNLDFLNKHMGATRSQPWRATTDPADVHSGDFLAVSKICGRWGGFETLEK- 77
F + W +K L +H G + A TD D HSGD + V W E L+K
Sbjct: 151 FDNVLWSNKPTL----QHAGRRDNPNLVAWTDGQDTHSGDSMVVLP----WNDIEILDKR 202
Query: 78 LVTADHAAVCLKDEM 92
L D AAV ++ M
Sbjct: 203 LSKGDIAAVIMEPAM 217
>UniRef100_UPI00003217F3 UPI00003217F3 UniRef100 entry
Length = 274
Score = 32.3 bits (72), Expect = 2.7
Identities = 19/65 (29%), Positives = 30/65 (45%), Gaps = 5/65 (7%)
Query: 7 GTLLSQIDAVPLFSDTAWGHKVNLDFLNKHMGATRSQPWRATTDPADVHSGDFLAVSKIC 66
G ++ID++PL SD + +F +H+ R+ W +D DV L + C
Sbjct: 106 GISWNEIDSMPLHSDFFDSSSESKEFYKQHI-LIRTNSWLKDSDLEDVS----LIIDSAC 160
Query: 67 GRWGG 71
G W G
Sbjct: 161 GSWSG 165
>UniRef100_Q95LP0 Hypothetical protein [Macaca fascicularis]
Length = 447
Score = 32.0 bits (71), Expect = 3.5
Identities = 20/64 (31%), Positives = 30/64 (46%), Gaps = 8/64 (12%)
Query: 28 VNLDFLNKHMGATRSQPWRATTDPADV-------HSGDFLAVSKICGRWGGFETLEKLVT 80
+ L F +KH+ QPW + T PAD+ H+G+ LA GR F+ +
Sbjct: 1 MGLSFFSKHLPIQEGQPWASKT-PADIISTVEFNHTGELLATGDKGGRVVIFQREPESKN 59
Query: 81 ADHA 84
A H+
Sbjct: 60 APHS 63
>UniRef100_UPI0000369645 UPI0000369645 UniRef100 entry
Length = 129
Score = 31.2 bits (69), Expect = 6.1
Identities = 15/43 (34%), Positives = 23/43 (52%), Gaps = 2/43 (4%)
Query: 25 GHKVNLDFLNKHMGATRSQPWRATTDPADVHSGDFLAVSKICG 67
GH++ FL ++G+ + P + T PA+ HS F K CG
Sbjct: 82 GHRIKSSFLACYLGSFSASPHKFLTHPAEAHS--FRYSPKCCG 122
>UniRef100_UPI0000140CE7 UPI0000140CE7 UniRef100 entry
Length = 447
Score = 31.2 bits (69), Expect = 6.1
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 8/64 (12%)
Query: 28 VNLDFLNKHMGATRSQPWRATTDPADV-------HSGDFLAVSKICGRWGGFETLEKLVT 80
+ L F +KH+ QPW T PAD+ H+G+ LA GR F+ +
Sbjct: 1 MGLSFFSKHLPIQEGQPWALKT-PADIISTVEFNHTGELLATGDKGGRVVIFQREPESKN 59
Query: 81 ADHA 84
A H+
Sbjct: 60 APHS 63
>UniRef100_Q5V3E7 A/G-specific adenine glycosylase [Haloarcula marismortui]
Length = 311
Score = 31.2 bits (69), Expect = 6.1
Identities = 19/69 (27%), Positives = 33/69 (47%), Gaps = 7/69 (10%)
Query: 41 RSQPWRATTDPADVHSGDFLA----VSKICGRWGGFETLEKLVTADHAAVCLKDEMGSLW 96
RS PWR TTDP ++ + ++ + ++ W F L++ TA A + ++ W
Sbjct: 34 RSYPWRETTDPYEILVSEVMSQQTQLDRVVDAWEDF--LDRWPTAAALAEADRSDVVGFW 91
Query: 97 VGES-GHEN 104
S G+ N
Sbjct: 92 TSHSLGYNN 100
>UniRef100_UPI00002E496E UPI00002E496E UniRef100 entry
Length = 285
Score = 30.8 bits (68), Expect = 7.9
Identities = 24/88 (27%), Positives = 36/88 (40%), Gaps = 5/88 (5%)
Query: 20 SDTAWGHKVNLDFLNKHMGATRSQPWRATTDPADVHSGDFLAVSKICGRWGGFETLEKLV 79
S+ + K N+ F + A DPA +SGD L S+ G + E +
Sbjct: 33 SENSQNIKDNILFTETIQSLSNINAASAAIDPATGYSGDLLNFSEFMG-----QNDEVYI 87
Query: 80 TADHAAVCLKDEMGSLWVGESGHENEKV 107
ADH V +E+G+ S + N V
Sbjct: 88 NADHGFVSSGNELGNFQGSGSAYYNSIV 115
>UniRef100_Q9HPQ6 A/G specific adenine glycosylase, repair protein [Halobacterium
sp.]
Length = 312
Score = 30.8 bits (68), Expect = 7.9
Identities = 19/69 (27%), Positives = 33/69 (47%), Gaps = 7/69 (10%)
Query: 41 RSQPWRATTDPADVHSGDFLA----VSKICGRWGGFETLEKLVTADHAAVCLKDEMGSLW 96
RS PWR TTDP ++ + ++ +S++ W F L++ T A + ++ W
Sbjct: 34 RSFPWRETTDPYEILVSEVMSQQTQLSRVIDAWRAF--LDRWPTTAALAAADRSDVVGFW 91
Query: 97 VGES-GHEN 104
S G+ N
Sbjct: 92 SAHSLGYNN 100
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.317 0.134 0.427
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 185,134,129
Number of Sequences: 2790947
Number of extensions: 6446636
Number of successful extensions: 11330
Number of sequences better than 10.0: 19
Number of HSP's better than 10.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 11302
Number of HSP's gapped (non-prelim): 19
length of query: 107
length of database: 848,049,833
effective HSP length: 83
effective length of query: 24
effective length of database: 616,401,232
effective search space: 14793629568
effective search space used: 14793629568
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 68 (30.8 bits)
Medicago: description of AC146588.13