
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146789.8 + phase: 0 /pseudo
(2263 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At3g01410 putative RNase H 74 7e-13
At1g37200 hypothetical protein 74 1e-12
At2g13330 F14O4.9 72 3e-12
At1g20390 hypothetical protein 72 4e-12
At3g31530 hypothetical protein 67 2e-10
At2g15410 putative retroelement pol polyprotein 65 3e-10
At1g24090 unknown protein 62 5e-09
At2g12920 pseudogene 57 9e-08
At5g51080 unknown protein 56 3e-07
At4g32220 hypothetical protein 49 3e-05
At4g04060 putative transposon protein 46 3e-04
At2g14400 putative retroelement pol polyprotein 38 0.059
At4g21420 hypothetical protein 37 0.10
At5g60250 unknown protein 37 0.17
At3g18810 protein kinase, putative 37 0.17
At2g11940 putative retroelement gag/pol polyprotein 35 0.50
At4g37620 putative protein 35 0.66
At4g35940 putative protein 34 1.1
At2g35940 putative homeodomain transcription factor 33 1.5
At2g15100 putative retroelement pol polyprotein 33 1.5
>At3g01410 putative RNase H
Length = 290
Score = 74.3 bits (181), Expect = 7e-13
Identities = 39/118 (33%), Positives = 64/118 (54%), Gaps = 2/118 (1%)
Query: 1705 GIGAVLLTPKGAHIPFTARLRFDCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDSALVIN 1764
G GAVL + + + + TNN+AEY A ++G+ A+D KN+ + GDS LV
Sbjct: 167 GAGAVLRASDNSVLFYLREGVGNATNNVAEYRALLLGLRSALDKGFKNVHVLGDSMLVCM 226
Query: 1765 QIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMAD--ALATVFNDQGES 1820
Q++G W+T H + A+ L+ F ++ HI R++N AD A + +F G++
Sbjct: 227 QVQGAWKTNHPKMAELCKQAKELMNSFKTFDIKHIAREKNSEADKQANSAIFLADGQT 284
>At1g37200 hypothetical protein
Length = 1564
Score = 73.6 bits (179), Expect = 1e-12
Identities = 43/110 (39%), Positives = 61/110 (55%)
Query: 1704 NGIGAVLLTPKGAHIPFTARLRFDCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDSALVI 1763
+G G L +P G I + RL F+ +NN +EYEA I GI+ A +RI++I + DS LV
Sbjct: 1054 SGEGIQLTSPTGEVIEQSFRLGFNASNNESEYEALIDGIKLAQGMRIRDIHAHSDSQLVT 1113
Query: 1764 NQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADALATV 1813
+Q G++E + Y + + L F EL IPR EN DALA +
Sbjct: 1114 SQFHGEYEAKDERMEAYLELVKTLTQQFESFELTRIPRGENTSTDALAAL 1163
>At2g13330 F14O4.9
Length = 889
Score = 72.4 bits (176), Expect = 3e-12
Identities = 41/110 (37%), Positives = 60/110 (54%)
Query: 1704 NGIGAVLLTPKGAHIPFTARLRFDCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDSALVI 1763
+G+G L +P G I + +L F+ +NN +EYEA I GI+ A + I+ I Y DS LV
Sbjct: 656 SGVGIQLTSPTGEVIEQSLQLGFNASNNESEYEALIAGIKLAQEKGIREIHAYSDSQLVT 715
Query: 1764 NQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADALATV 1813
+Q G++E + Y + + L F +L IPR EN AD LA +
Sbjct: 716 SQFHGEYEAKDERMEAYLELVKTLAQQFESFKLTRIPRGENTSADTLAAL 765
>At1g20390 hypothetical protein
Length = 1791
Score = 72.0 bits (175), Expect = 4e-12
Identities = 39/110 (35%), Positives = 60/110 (54%)
Query: 1704 NGIGAVLLTPKGAHIPFTARLRFDCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDSALVI 1763
+GIG L++P + + RLRF TNN+AEYE I G+ A ++I I + DS L+
Sbjct: 1223 SGIGIRLVSPTAEVLEQSFRLRFVATNNVAEYEVLIAGLRLAAGMQITTIHAFTDSQLIA 1282
Query: 1764 NQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADALATV 1813
Q+ G++E + + Y + + F +L IPR +N ADALA +
Sbjct: 1283 GQLSGEYEAKNEKMDAYLKIVQLMTKDFENFKLSKIPRGDNAPADALAAL 1332
>At3g31530 hypothetical protein
Length = 831
Score = 66.6 bits (161), Expect = 2e-10
Identities = 39/110 (35%), Positives = 57/110 (51%), Gaps = 11/110 (10%)
Query: 1704 NGIGAVLLTPKGAHIPFTARLRFDCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDSALVI 1763
+G+G L +P G + TNN+AEYEA + G+ A L+I + DS L+
Sbjct: 427 SGVGIRLTSPTG-----------EATNNVAEYEALVAGLNLAWGLKIGKTRAFCDSQLIA 475
Query: 1764 NQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADALATV 1813
NQ G++ T + Y + + L F++ EL IPR EN ADALA +
Sbjct: 476 NQFNGEYTTQDKKMEAYLIHVQNLAKNFDEFELTRIPRGENTSADALAAL 525
>At2g15410 putative retroelement pol polyprotein
Length = 1787
Score = 65.5 bits (158), Expect = 3e-10
Identities = 37/89 (41%), Positives = 48/89 (53%)
Query: 1725 RFDCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDSALVINQIKGKWETLHAGLIPYRDYA 1784
RF +NN AEYEA I G+ A + +K I+ Y DS LV +Q G +E + + Y
Sbjct: 1189 RFPASNNEAEYEALIAGLRLAHGIEVKKIQAYCDSQLVASQFSGNYEAKNERMDAYLKVV 1248
Query: 1785 RRLLTFFNKVELHHIPRDENQMADALATV 1813
R L F EL IPR +N ADALA +
Sbjct: 1249 RELSYNFEVFELTKIPRSDNAPADALAVL 1277
>At1g24090 unknown protein
Length = 535
Score = 61.6 bits (148), Expect = 5e-09
Identities = 42/113 (37%), Positives = 58/113 (51%), Gaps = 3/113 (2%)
Query: 1704 NGIGAVLLTPKGAHIPFTARLRFDCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDSALVI 1763
+G AVL T G+ I + TNN AEY A I+G++ AI+ KNI++ GDS LV
Sbjct: 232 SGAAAVLKTEDGSLICRVRQGLGIATNNAAEYHALILGLKYAIEKGYKNIKVKGDSKLVC 291
Query: 1764 ---NQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADALATV 1813
QIKG+W+ H L A+ L E+ H+ R+ N AD A +
Sbjct: 292 MQKQQIKGQWKVNHEVLAKLHKEAKLLCNKCVSFEISHVLRNLNADADEQANL 344
>At2g12920 pseudogene
Length = 863
Score = 57.4 bits (137), Expect = 9e-08
Identities = 31/81 (38%), Positives = 48/81 (58%)
Query: 1733 AEYEACIMGIEEAIDLRIKNIEIYGDSALVINQIKGKWETLHAGLIPYRDYARRLLTFFN 1792
AEYEA + G++ A L+I I + DS+LV NQ G++ + Y +++ L F+
Sbjct: 559 AEYEALVAGLKLAQGLKIAKIRAFCDSSLVANQFNGEYTARDERMGAYLTHSQDLAKQFD 618
Query: 1793 KVELHHIPRDENQMADALATV 1813
+ EL IPR +N+ ADALA +
Sbjct: 619 EFELSRIPRGKNKSADALAAL 639
>At5g51080 unknown protein
Length = 322
Score = 55.8 bits (133), Expect = 3e-07
Identities = 37/110 (33%), Positives = 54/110 (48%)
Query: 1704 NGIGAVLLTPKGAHIPFTARLRFDCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDSALVI 1763
+G AVL T G+ I + TNN AEY I+G++ AI+ I++ DS LV
Sbjct: 201 SGAAAVLKTEDGSLIFKMRQGLGIATNNAAEYHGLILGLKHAIEKGYTKIKVKTDSKLVC 260
Query: 1764 NQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADALATV 1813
Q+KG+W+ H L A++L E+ H+ R N AD A +
Sbjct: 261 MQMKGQWKVNHEVLSKLHKEAKQLSDKCLSFEISHVLRSLNSDADEQANM 310
>At4g32220 hypothetical protein
Length = 452
Score = 49.3 bits (116), Expect = 3e-05
Identities = 25/72 (34%), Positives = 43/72 (59%)
Query: 1700 SMYTNGIGAVLLTPKGAHIPFTARLRFDCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDS 1759
S+ + +G +L +P + + RL+F +NN AEYEA + G+ A L K I+ + DS
Sbjct: 218 SLQGSSLGILLQSPTREILEQSLRLQFKASNNEAEYEALLAGLRLAKGLGAKQIKAFSDS 277
Query: 1760 ALVINQIKGKWE 1771
LV+++ G++E
Sbjct: 278 QLVVSRFSGEFE 289
>At4g04060 putative transposon protein
Length = 375
Score = 45.8 bits (107), Expect = 3e-04
Identities = 25/64 (39%), Positives = 35/64 (54%)
Query: 1750 IKNIEIYGDSALVINQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADA 1809
I++I + DS LVI+Q G++E + Y + + L F EL IPR EN ADA
Sbjct: 3 IRDIHAHSDSQLVISQFHGEYEAKDERMEAYLELVKTLTQQFESFELTMIPRGENTSADA 62
Query: 1810 LATV 1813
LA +
Sbjct: 63 LAAL 66
>At2g14400 putative retroelement pol polyprotein
Length = 1466
Score = 38.1 bits (87), Expect = 0.059
Identities = 19/53 (35%), Positives = 28/53 (51%)
Query: 1761 LVINQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADALATV 1813
+++NQ G +E + + Y + L F K EL IPR +N ADALA +
Sbjct: 790 IIVNQFNGDYEAKDSRMEAYLHVVKDLAKNFRKFELIRIPRGQNTTADALAAL 842
>At4g21420 hypothetical protein
Length = 229
Score = 37.4 bits (85), Expect = 0.10
Identities = 20/85 (23%), Positives = 46/85 (53%)
Query: 1727 DCTNNIAEYEACIMGIEEAIDLRIKNIEIYGDSALVINQIKGKWETLHAGLIPYRDYARR 1786
+ T+ +AE+ A G+E A++ + ++ + GD+ ++++ I + + +Y +
Sbjct: 108 EATSTMAEFAALKRGLELALENGLTDLWLEGDAKIIMDIISRRGRLRCEKTNKHVNYIKV 167
Query: 1787 LLTFFNKVELHHIPRDENQMADALA 1811
++ N L H+ R+ N++AD LA
Sbjct: 168 VMPELNNCVLSHVYREGNRVADKLA 192
>At5g60250 unknown protein
Length = 655
Score = 36.6 bits (83), Expect = 0.17
Identities = 22/79 (27%), Positives = 39/79 (48%)
Query: 1733 AEYEACIMGIEEAIDLRIKNIEIYGDSALVINQIKGKWETLHAGLIPYRDYARRLLTFFN 1792
AE +A I G+ EA+ L IK+I + DS + + GKW + D + ++ F+
Sbjct: 198 AELKALIRGLTEALKLGIKHIVFFCDSYPIFQYVTGKWMAKQKKISLLLDDLQSIMQHFS 257
Query: 1793 KVELHHIPRDENQMADALA 1811
+ + R++ + A LA
Sbjct: 258 SYQHVLVARNDVKFAYKLA 276
>At3g18810 protein kinase, putative
Length = 700
Score = 36.6 bits (83), Expect = 0.17
Identities = 17/33 (51%), Positives = 21/33 (63%)
Query: 438 NNHTNNNHTNNSHTNNTLNKISNNNLINNAHNN 470
+N+ NNN NN+ NN NK +NNN NN NN
Sbjct: 83 DNNNNNNGNNNNDNNNGNNKDNNNNGNNNNGNN 115
Score = 34.3 bits (77), Expect = 0.86
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 438 NNHTNNNHTNNSHTNNTLNKISNNNLINNAHNN 470
NN+ +NN+ NN + NN N +N + NN +NN
Sbjct: 79 NNNNDNNNNNNGNNNNDNNNGNNKDNNNNGNNN 111
Score = 34.3 bits (77), Expect = 0.86
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 438 NNHTNNNHTNNSHTNNTLNKISNNNLINNAHNN 470
N++ NNN+ NN++ NN N NNN NN + N
Sbjct: 82 NDNNNNNNGNNNNDNNNGNNKDNNNNGNNNNGN 114
Score = 33.1 bits (74), Expect = 1.9
Identities = 14/32 (43%), Positives = 21/32 (64%)
Query: 438 NNHTNNNHTNNSHTNNTLNKISNNNLINNAHN 469
NN+ + N+ NN++ NN N +NNN NN +N
Sbjct: 70 NNNNDGNNGNNNNDNNNNNNGNNNNDNNNGNN 101
Score = 32.7 bits (73), Expect = 2.5
Identities = 15/33 (45%), Positives = 19/33 (57%)
Query: 438 NNHTNNNHTNNSHTNNTLNKISNNNLINNAHNN 470
NN+ NNN+ NN+ N N NNN NN + N
Sbjct: 87 NNNGNNNNDNNNGNNKDNNNNGNNNNGNNNNGN 119
Score = 32.3 bits (72), Expect = 3.3
Identities = 15/33 (45%), Positives = 18/33 (54%)
Query: 438 NNHTNNNHTNNSHTNNTLNKISNNNLINNAHNN 470
NN NNN NN+ +N N + NN NN NN
Sbjct: 106 NNGNNNNGNNNNGNDNNGNNNNGNNNDNNNQNN 138
Score = 32.3 bits (72), Expect = 3.3
Identities = 17/38 (44%), Positives = 26/38 (67%), Gaps = 2/38 (5%)
Query: 433 HHKILNNHTNNNHTNNSHTNNTLNKISNNNLINNAHNN 470
++K NN+ NNN+ NN++ N+ N +NNN NN +NN
Sbjct: 100 NNKDNNNNGNNNNGNNNNGND--NNGNNNNGNNNDNNN 135
Score = 31.2 bits (69), Expect = 7.2
Identities = 16/35 (45%), Positives = 19/35 (53%), Gaps = 2/35 (5%)
Query: 438 NNHTNNNHTNNSHTNNTLNKISNNN--LINNAHNN 470
NN+ NNN N + NN N NNN NN +NN
Sbjct: 92 NNNDNNNGNNKDNNNNGNNNNGNNNNGNDNNGNNN 126
Score = 30.8 bits (68), Expect = 9.5
Identities = 14/33 (42%), Positives = 21/33 (63%), Gaps = 2/33 (6%)
Query: 438 NNHTNNNHTNNSHTNNTLNKISNNNLINNAHNN 470
NN+ NN+ +N++ NN N + NN NN +NN
Sbjct: 86 NNNNGNNNNDNNNGNNKDNNNNGNN--NNGNNN 116
>At2g11940 putative retroelement gag/pol polyprotein
Length = 1212
Score = 35.0 bits (79), Expect = 0.50
Identities = 20/54 (37%), Positives = 25/54 (46%)
Query: 1760 ALVINQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADALATV 1813
A + G +E + Y D R L F K EL +PR EN ADALA +
Sbjct: 740 ATTASDYSGDYEAKDNRMEAYLDLLRELAEKFEKFELIKVPRAENSAADALAAL 793
>At4g37620 putative protein
Length = 132
Score = 34.7 bits (78), Expect = 0.66
Identities = 25/82 (30%), Positives = 40/82 (48%), Gaps = 8/82 (9%)
Query: 1733 AEYEACIMGIEEAIDLRIKNIEIYGDSALVINQIKGKWETLHAGLIPYRDYARRLLTFFN 1792
AE A ++ +++A DL ++ I DS +I I T L +L +
Sbjct: 37 AEALAMLLALQQAKDLGFTSLSIASDSQQLIKAI-----TSRTPLTELHGILHDILLLAS 91
Query: 1793 K---VELHHIPRDENQMADALA 1811
+ V H IPR+EN++ADAL+
Sbjct: 92 EIGFVRFHSIPRNENRLADALS 113
>At4g35940 putative protein
Length = 451
Score = 33.9 bits (76), Expect = 1.1
Identities = 19/62 (30%), Positives = 26/62 (41%), Gaps = 9/62 (14%)
Query: 423 HNILNSHKILHHKILNNHTNNNHT---------NNSHTNNTLNKISNNNLINNAHNNLDH 473
HN N +I + LN NNN+ N H NN +I +N HNN +
Sbjct: 188 HNNNNEKRIEKQQPLNGRHNNNNEKLMEKQQPLNGRHNNNNEKRIEKQQPLNGRHNNKEK 247
Query: 474 QE 475
Q+
Sbjct: 248 QK 249
>At2g35940 putative homeodomain transcription factor
Length = 680
Score = 33.5 bits (75), Expect = 1.5
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 438 NNHTNNNHTNNSHTNNTLNKISNNN 462
N+ NNN++NNS+ NNT +NNN
Sbjct: 39 NDSNNNNNSNNSNNNNTNTNTNNNN 63
>At2g15100 putative retroelement pol polyprotein
Length = 1329
Score = 33.5 bits (75), Expect = 1.5
Identities = 18/50 (36%), Positives = 27/50 (54%)
Query: 1764 NQIKGKWETLHAGLIPYRDYARRLLTFFNKVELHHIPRDENQMADALATV 1813
N+ G++E A + Y + R + F + EL IPR EN A+ALA +
Sbjct: 790 NRYSGEYEAKDACMEAYLNLVREVSGRFEQFELTRIPRAENSAANALAAL 839
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.357 0.157 0.557
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 41,834,434
Number of Sequences: 26719
Number of extensions: 1545237
Number of successful extensions: 7645
Number of sequences better than 10.0: 28
Number of HSP's better than 10.0 without gapping: 17
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 7443
Number of HSP's gapped (non-prelim): 102
length of query: 2263
length of database: 11,318,596
effective HSP length: 115
effective length of query: 2148
effective length of database: 8,245,911
effective search space: 17712216828
effective search space used: 17712216828
T: 11
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.7 bits)
S2: 68 (30.8 bits)
Medicago: description of AC146789.8