
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149210.5 + phase: 0
(128 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q9FJF4 Arabidopsis thaliana genomic DNA, chromosome 5,... 64 8e-10
UniRef100_Q6ZJ98 Hypothetical protein OJ1734_E04.2 [Oryza sativa] 52 4e-06
UniRef100_Q9LTJ1 Similarity to unknown protein [Arabidopsis thal... 45 3e-04
UniRef100_Q7XLC7 OSJNBa0070C17.22 protein [Oryza sativa] 42 0.003
UniRef100_Q9LKA7 Gb|AAC80581.1 [Arabidopsis thaliana] 39 0.026
UniRef100_O95983 Methyl-CpG binding protein 3 [Homo sapiens] 38 0.045
UniRef100_Q7XWX3 OSJNBb0058J09.3 protein [Oryza sativa] 38 0.059
UniRef100_Q9Z2D8 Methyl-CpG binding protein 3 [Mus musculus] 37 0.100
UniRef100_Q9SNC0 Hypothetical protein F12A12.100 [Arabidopsis th... 37 0.13
UniRef100_Q8LCG6 Hypothetical protein [Arabidopsis thaliana] 37 0.13
UniRef100_Q6EQ81 Hypothetical protein P0453B09.50 [Oryza sativa] 36 0.17
UniRef100_Q6TGX7 Methyl-CpG binding domain protein 3 [Brachydani... 35 0.50
UniRef100_Q6IQ79 Methyl-CpG binding domain protein 3a [Brachydan... 35 0.50
UniRef100_Q8L791 Hypothetical protein At5g52230 [Arabidopsis tha... 35 0.50
UniRef100_Q9LTJ8 Arabidopsis thaliana genomic DNA, chromosome 5,... 35 0.50
UniRef100_Q6EPE3 Hypothetical protein B1151D08.6 [Oryza sativa] 34 0.84
UniRef100_Q94IQ7 Putative methyl-binding domain protein MBD108 [... 34 0.84
UniRef100_UPI0000436BAF UPI0000436BAF UniRef100 entry 33 1.1
UniRef100_Q8AYP2 Methyl-CpG binding protein MBD3 [Xenopus laevis] 33 1.1
UniRef100_Q7XWP5 OSJNBa0072D08.3 protein [Oryza sativa] 33 1.1
>UniRef100_Q9FJF4 Arabidopsis thaliana genomic DNA, chromosome 5, P1 clone:MMN10
[Arabidopsis thaliana]
Length = 306
Score = 63.9 bits (154), Expect = 8e-10
Identities = 34/77 (44%), Positives = 49/77 (63%), Gaps = 1/77 (1%)
Query: 3 RSMDVEMEHLNLNRNVMNLREKELQIVVQSPSKVEVPDGWLVEQRPRPSNPNHIDRYYIE 62
RS+ +L +RN + + + LQ +PDGW+VE++PR S+ +HIDR YIE
Sbjct: 147 RSLVSVERYLRESRNSIEQQLRVLQNRRGHSKDFRLPDGWIVEEKPRRSS-SHIDRSYIE 205
Query: 63 PHTGQKFRSLVSVQRYL 79
P TG KFRS+ +V+RYL
Sbjct: 206 PGTGNKFRSMAAVERYL 222
Score = 58.5 bits (140), Expect = 3e-08
Identities = 35/85 (41%), Positives = 52/85 (61%), Gaps = 4/85 (4%)
Query: 22 REKELQIVVQSPSK-VEVPDGWLVEQRPRPSNPNHIDRYYIEPHTGQKFRSLVSVQRYLN 80
R K+ V+ SK +P GW VE+ PR N ++ID+YY+E TG++FRSLVSV+RYL
Sbjct: 99 RTKQDHCGVEYASKGFRLPRGWSVEEVPR-KNSHYIDKYYVERKTGKRFRSLVSVERYLR 157
Query: 81 GGETGDYLPTQRIISENNNNVSSNF 105
E+ + + Q + +N S +F
Sbjct: 158 --ESRNSIEQQLRVLQNRRGHSKDF 180
Score = 53.9 bits (128), Expect = 8e-07
Identities = 28/65 (43%), Positives = 39/65 (59%), Gaps = 1/65 (1%)
Query: 22 REKELQIVVQSPSKVEVPDGWLVEQRPRPSNPNHIDRYYIEPHTGQKFRSLVSVQRYLNG 81
RE +LQI + ++ GW V RPR SN +D Y+IEP TG++F SL ++ R+L
Sbjct: 15 RETQLQIADPTSFCGKIMPGWTVVNRPRSSNNGVVDTYFIEPGTGRQFSSLEAIHRHL-A 73
Query: 82 GETGD 86
GE D
Sbjct: 74 GEVND 78
>UniRef100_Q6ZJ98 Hypothetical protein OJ1734_E04.2 [Oryza sativa]
Length = 422
Score = 51.6 bits (122), Expect = 4e-06
Identities = 31/74 (41%), Positives = 42/74 (55%), Gaps = 8/74 (10%)
Query: 38 VPDGWLVEQRPRPSNPN-HIDRYYIEPHTGQKFRSLVSVQRYLNGGETGDYL--PTQRII 94
+PDGW E RPR + N D YYI+P +G +FRSL V R+L G+ + P +R I
Sbjct: 343 LPDGWWKEDRPRKNGSNLKTDPYYIDPVSGYEFRSLKDVHRFLKSGDIYKCIVRPRKRTI 402
Query: 95 S-----ENNNNVSS 103
EN ++VSS
Sbjct: 403 QDPCTIENQSHVSS 416
>UniRef100_Q9LTJ1 Similarity to unknown protein [Arabidopsis thaliana]
Length = 225
Score = 45.4 bits (106), Expect = 3e-04
Identities = 23/53 (43%), Positives = 30/53 (56%), Gaps = 2/53 (3%)
Query: 32 SPSKVEVPDGWLVEQRPRPSNPN--HIDRYYIEPHTGQKFRSLVSVQRYLNGG 82
+P +P GW VE + R S +D+YY EP+TG+KFRS V YL G
Sbjct: 75 APGDNWLPPGWRVEDKIRTSGATAGSVDKYYYEPNTGRKFRSRTEVLYYLEHG 127
>UniRef100_Q7XLC7 OSJNBa0070C17.22 protein [Oryza sativa]
Length = 438
Score = 42.0 bits (97), Expect = 0.003
Identities = 22/54 (40%), Positives = 29/54 (52%), Gaps = 3/54 (5%)
Query: 33 PSKVEVPDGWLVEQRPRPSNPNHI---DRYYIEPHTGQKFRSLVSVQRYLNGGE 83
P+ +P GWL E RPR + D +YI+P +FRS VQRYL G+
Sbjct: 30 PAGEGLPHGWLKEYRPRKNQSGSRVKGDTFYIDPTNMYEFRSQKDVQRYLESGD 83
>UniRef100_Q9LKA7 Gb|AAC80581.1 [Arabidopsis thaliana]
Length = 1145
Score = 38.9 bits (89), Expect = 0.026
Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
Query: 1 MNRSMDVEMEHLNLNRNVMNLREKE--LQIVVQSPSKVEVPDGWLVEQRPRPSNPNHIDR 58
+N V L +++ + + KE ++ + KV W +E+R R + H+D
Sbjct: 213 LNNGSKVSPNELIMSKTCLKIDPKEDPRPLLYKYVCKVLTAARWKIEKRERSAGRKHVDT 272
Query: 59 YYIEPHTGQKFRSLVSVQRYLNG 81
+YI P G+KFR S + L G
Sbjct: 273 FYISPE-GRKFREFGSAWKALGG 294
>UniRef100_O95983 Methyl-CpG binding protein 3 [Homo sapiens]
Length = 291
Score = 38.1 bits (87), Expect = 0.045
Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 6/68 (8%)
Query: 38 VPDGWLVEQRPRPS--NPNHIDRYYIEPHTGQKFRSLVSVQRYLNGG---ETGDYLPTQR 92
+P GW E+ PR S + H D +Y P +G+KFRS + RYL G T D+ +
Sbjct: 11 LPQGWEREEVPRRSGLSAGHRDVFYYSP-SGKKFRSKPQLARYLGGSMDLSTFDFRTGKM 69
Query: 93 IISENNNN 100
++S+ N +
Sbjct: 70 LMSKMNKS 77
>UniRef100_Q7XWX3 OSJNBb0058J09.3 protein [Oryza sativa]
Length = 385
Score = 37.7 bits (86), Expect = 0.059
Identities = 18/70 (25%), Positives = 32/70 (45%)
Query: 38 VPDGWLVEQRPRPSNPNHIDRYYIEPHTGQKFRSLVSVQRYLNGGETGDYLPTQRIISEN 97
+P GW++E R N N + ++Y+ P G + S V Y+N E + +
Sbjct: 125 LPKGWIIEVRAGGKNMNKMYKFYVYPPAGVRLFSKEDVLLYINKSEITGFDTNGECDTRT 184
Query: 98 NNNVSSNFTF 107
+N+ +N F
Sbjct: 185 KDNILANVEF 194
Score = 35.4 bits (80), Expect = 0.29
Identities = 28/94 (29%), Positives = 45/94 (47%), Gaps = 10/94 (10%)
Query: 23 EKELQIVVQSPSKVE-----VPDGWLVEQRPRPSNPNHIDRYYIEPHTGQKFRSLVSVQR 77
+KE+Q V+ + E +PDGW++E + I RYY P +G F V +
Sbjct: 25 KKEIQQDVEDLEEEEERPGWLPDGWIMEVYQ--GDDGTIYRYYTSPISGLTFTMKSEVLQ 82
Query: 78 YLNGGETGDYLPTQRIISEN---NNNVSSNFTFT 108
YL G +L ++ ++N NN+V + T T
Sbjct: 83 YLFSGMDERFLESKNCAADNQLINNSVYVSPTMT 116
>UniRef100_Q9Z2D8 Methyl-CpG binding protein 3 [Mus musculus]
Length = 285
Score = 37.0 bits (84), Expect = 0.100
Identities = 23/68 (33%), Positives = 37/68 (53%), Gaps = 6/68 (8%)
Query: 38 VPDGWLVEQRPRPS--NPNHIDRYYIEPHTGQKFRSLVSVQRYLNGG---ETGDYLPTQR 92
+P GW E+ PR S + H D +Y P +G+KFRS + RYL G T D+ +
Sbjct: 11 LPQGWEREEVPRRSGLSAGHRDVFYYSP-SGKKFRSKPQLARYLGGSMDLSTFDFRTGKM 69
Query: 93 IISENNNN 100
++++ N +
Sbjct: 70 LMNKMNKS 77
>UniRef100_Q9SNC0 Hypothetical protein F12A12.100 [Arabidopsis thaliana]
Length = 182
Score = 36.6 bits (83), Expect = 0.13
Identities = 20/47 (42%), Positives = 25/47 (52%), Gaps = 2/47 (4%)
Query: 38 VPDGWLVEQRPRPSNPNH--IDRYYIEPHTGQKFRSLVSVQRYLNGG 82
+P W E R R S +D++Y EP TG+KFRS V YL G
Sbjct: 35 LPPDWRTEIRVRTSGTKAGTVDKFYYEPITGRKFRSKNEVLYYLEHG 81
>UniRef100_Q8LCG6 Hypothetical protein [Arabidopsis thaliana]
Length = 182
Score = 36.6 bits (83), Expect = 0.13
Identities = 20/47 (42%), Positives = 25/47 (52%), Gaps = 2/47 (4%)
Query: 38 VPDGWLVEQRPRPSNPNH--IDRYYIEPHTGQKFRSLVSVQRYLNGG 82
+P W E R R S +D++Y EP TG+KFRS V YL G
Sbjct: 35 LPPDWRTEIRVRTSGTKAGTVDKFYYEPITGRKFRSKNEVLYYLEHG 81
>UniRef100_Q6EQ81 Hypothetical protein P0453B09.50 [Oryza sativa]
Length = 293
Score = 36.2 bits (82), Expect = 0.17
Identities = 23/65 (35%), Positives = 30/65 (45%), Gaps = 2/65 (3%)
Query: 29 VVQSPSKVEV-PDGWLVEQRPRPSNPNHIDRYYIEPHTGQKFRSLVSVQRYLNGGETGDY 87
++ P K EV P GW +E R D+YY KFRS VQ +L+ G+
Sbjct: 151 ILVHPRKDEVDPTGWDIENHLRNDQKTK-DKYYRHKDYNHKFRSKPEVQYFLDTGKAAGA 209
Query: 88 LPTQR 92
P QR
Sbjct: 210 TPIQR 214
>UniRef100_Q6TGX7 Methyl-CpG binding domain protein 3 [Brachydanio rerio]
Length = 273
Score = 34.7 bits (78), Expect = 0.50
Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 6/68 (8%)
Query: 38 VPDGWLVEQRPRPS--NPNHIDRYYIEPHTGQKFRSLVSVQRYLNGG---ETGDYLPTQR 92
+P+GW +E+ R S + D YY P TG+KFRS + RYL + D+ +
Sbjct: 11 LPNGWKMEEVTRKSGLSAGKSDVYYFSP-TGKKFRSKPQLVRYLGKSMDLSSFDFRTGKM 69
Query: 93 IISENNNN 100
++S+ N N
Sbjct: 70 LMSKLNKN 77
>UniRef100_Q6IQ79 Methyl-CpG binding domain protein 3a [Brachydanio rerio]
Length = 273
Score = 34.7 bits (78), Expect = 0.50
Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 6/68 (8%)
Query: 38 VPDGWLVEQRPRPS--NPNHIDRYYIEPHTGQKFRSLVSVQRYLNGG---ETGDYLPTQR 92
+P+GW +E+ R S + D YY P TG+KFRS + RYL + D+ +
Sbjct: 11 LPNGWKMEEVTRKSGLSAGKSDVYYFSP-TGKKFRSKPQLVRYLGKSMDLSSFDFRTGKM 69
Query: 93 IISENNNN 100
++S+ N N
Sbjct: 70 LMSKLNKN 77
>UniRef100_Q8L791 Hypothetical protein At5g52230 [Arabidopsis thaliana]
Length = 746
Score = 34.7 bits (78), Expect = 0.50
Identities = 16/63 (25%), Positives = 31/63 (48%), Gaps = 2/63 (3%)
Query: 27 QIVVQSPSKVEVPDGWL--VEQRPRPSNPNHIDRYYIEPHTGQKFRSLVSVQRYLNGGET 84
+++V+ + +P+GW+ +E R D ++I+P + F+S RY+ G
Sbjct: 28 KVIVEKSAAQGLPEGWIKKLEITNRSGRKTRRDPFFIDPKSEYIFQSFKDASRYVETGNI 87
Query: 85 GDY 87
G Y
Sbjct: 88 GHY 90
>UniRef100_Q9LTJ8 Arabidopsis thaliana genomic DNA, chromosome 5, BAC clone:F17P19
[Arabidopsis thaliana]
Length = 746
Score = 34.7 bits (78), Expect = 0.50
Identities = 16/63 (25%), Positives = 31/63 (48%), Gaps = 2/63 (3%)
Query: 27 QIVVQSPSKVEVPDGWL--VEQRPRPSNPNHIDRYYIEPHTGQKFRSLVSVQRYLNGGET 84
+++V+ + +P+GW+ +E R D ++I+P + F+S RY+ G
Sbjct: 28 KVIVEKSAAQGLPEGWIKKLEITNRSGRKTRRDPFFIDPKSEYIFQSFKDASRYVETGNI 87
Query: 85 GDY 87
G Y
Sbjct: 88 GHY 90
>UniRef100_Q6EPE3 Hypothetical protein B1151D08.6 [Oryza sativa]
Length = 254
Score = 33.9 bits (76), Expect = 0.84
Identities = 21/52 (40%), Positives = 27/52 (51%), Gaps = 2/52 (3%)
Query: 33 PSKVEV-PDGWLVEQRPRPSNPNHIDRYYIEPHTGQKFRSLVSVQRYLNGGE 83
P K EV PDGW+VE R D+YY KFRS V+ +L+ G+
Sbjct: 171 PRKHEVDPDGWVVEIHLRNDQKTK-DKYYRHKDYNHKFRSKPEVKSFLDTGK 221
>UniRef100_Q94IQ7 Putative methyl-binding domain protein MBD108 [Zea mays]
Length = 297
Score = 33.9 bits (76), Expect = 0.84
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 2/51 (3%)
Query: 30 VQSPSKVEVPDGWLVEQRPR-PSNPNHIDRYYIEPHTGQKFRSLVSVQRYL 79
+ P+ + P GW + R R D YY P TG+K RSLV V R+L
Sbjct: 133 IDKPNIAQPPHGWERQIRIRGEGGTKFADVYYTSP-TGRKLRSLVEVDRFL 182
>UniRef100_UPI0000436BAF UPI0000436BAF UniRef100 entry
Length = 762
Score = 33.5 bits (75), Expect = 1.1
Identities = 20/58 (34%), Positives = 30/58 (51%), Gaps = 3/58 (5%)
Query: 10 EHLNLNRNVMNLR---EKELQIVVQSPSKVEVPDGWLVEQRPRPSNPNHIDRYYIEPH 64
E L N++V+N R KE +I++QS +++ PD +L P H D Y PH
Sbjct: 490 EALQTNQSVINQRFMENKEKEIILQSALEIDQPDHFLQYMSRVKDPPVHQDSLYPGPH 547
>UniRef100_Q8AYP2 Methyl-CpG binding protein MBD3 [Xenopus laevis]
Length = 283
Score = 33.5 bits (75), Expect = 1.1
Identities = 23/68 (33%), Positives = 34/68 (49%), Gaps = 6/68 (8%)
Query: 38 VPDGWLVEQRPRPS--NPNHIDRYYIEPHTGQKFRSLVSVQRYLNGG---ETGDYLPTQR 92
+P GW E+ R S + D YY P +G+KFRS + RYL T D+ +
Sbjct: 11 LPQGWKKEEVTRRSGLSAGKSDVYYYSP-SGKKFRSKPQLARYLGNSMDLSTFDFRTGKM 69
Query: 93 IISENNNN 100
++S+ N N
Sbjct: 70 LMSKINKN 77
>UniRef100_Q7XWP5 OSJNBa0072D08.3 protein [Oryza sativa]
Length = 505
Score = 33.5 bits (75), Expect = 1.1
Identities = 21/85 (24%), Positives = 40/85 (46%), Gaps = 4/85 (4%)
Query: 13 NLNRNVMNLREKELQIVVQSPSKVEVPDGWLVEQRPRPSNPNHIDRYYIEPHTGQKFRSL 72
++ N ++K+ +++ + P + PDGW++E R + +I RYY P +G F +
Sbjct: 17 SVEANEAEEKDKDAEVLEELPDWL--PDGWIMEVRC--GDNGNIYRYYTSPVSGYTFSTK 72
Query: 73 VSVQRYLNGGETGDYLPTQRIISEN 97
+ YL L +Q +N
Sbjct: 73 METLHYLFSEMDERVLESQACADDN 97
Score = 33.5 bits (75), Expect = 1.1
Identities = 15/46 (32%), Positives = 24/46 (51%)
Query: 38 VPDGWLVEQRPRPSNPNHIDRYYIEPHTGQKFRSLVSVQRYLNGGE 83
+PDGW +E R + ++Y+ TG +F S +V Y N G+
Sbjct: 106 LPDGWAIEVRAGGKKMEKMYKFYVHLPTGMRFLSKENVLLYSNEGK 151
Score = 32.0 bits (71), Expect = 3.2
Identities = 20/59 (33%), Positives = 30/59 (49%), Gaps = 3/59 (5%)
Query: 38 VPDGWLVEQRPRPSNPN-HIDRYYIEPHTGQKFRSLVSVQRYLNGGETGD--YLPTQRI 93
+P+GW+ E R N D YY +P + FR+L SV YL G+ Y+P + +
Sbjct: 180 LPEGWVKEIIFRKCNDGIRKDPYYTDPVSRHVFRTLKSVINYLETGQITKHAYIPRRSV 238
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.320 0.137 0.417
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 236,082,732
Number of Sequences: 2790947
Number of extensions: 10004811
Number of successful extensions: 22614
Number of sequences better than 10.0: 54
Number of HSP's better than 10.0 without gapping: 28
Number of HSP's successfully gapped in prelim test: 26
Number of HSP's that attempted gapping in prelim test: 22572
Number of HSP's gapped (non-prelim): 62
length of query: 128
length of database: 848,049,833
effective HSP length: 104
effective length of query: 24
effective length of database: 557,791,345
effective search space: 13386992280
effective search space used: 13386992280
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 67 (30.4 bits)
Medicago: description of AC149210.5