
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0014.18
(181 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
                                                                Score   E
Sequences producing significant alignments:                     (bits)  Value
At4g08830 putative protein                                          57  5e-09
At2g22350 putative non-LTR retroelement reverse transcriptase       52  1e-07
At2g27870 putative non-LTR retroelement reverse transcriptase       51  4e-07
At5g33360 putative protein                                          49  2e-06
At3g32110 non-LTR reverse transcriptase, putative                   49  2e-06
At1g17390 hypothetical protein                                      48  3e-06
At2g31080 putative non-LTR retroelement reverse transcriptase       47  6e-06
At2g07730 putative non-LTR retroelement reverse transcriptase       44  6e-05
At1g47320 hypothetical protein                                      40  9e-04
At5g61080 putative protein                                          37  0.006
At2g45230 putative non-LTR retroelement reverse transcriptase       36  0.013
At4g07516 putative protein                                          34  0.049
At4g15580 splicing factor like protein                              32  0.19
At2g01840 putative non-LTR retroelement reverse transcriptase       32  0.19
At1g26950 hypothetical protein                                      32  0.19
At1g27870 hypothetical protein                                      31  0.42
At1g04630 unknown protein                                           30  0.54
At3g30570 putative reverse transcriptase                            30  0.71
At4g29090 putative protein                                          29  1.2
At3g29820 hypothetical protein                                      29  1.2
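The summary lines above can be read programmatically. The following is a minimal sketch, not part of the BLAST output, assuming each line ends with the bit score and the E-value as its last two whitespace-separated fields. The names Hit and parse_hit_line and the 1e-3 cutoff are illustrative choices, not anything defined by BLAST.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    seq_id: str
    description: str
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    """Split one summary line into identifier, description, score and E-value."""
    # The last two fields are the bit score and the E-value; everything
    # before them is the identifier followed by a free-text description.
    head, bits, evalue = line.rsplit(None, 2)
    seq_id, _, description = head.partition(" ")
    return Hit(seq_id, description.strip(), float(bits), float(evalue))


# Three lines copied from the summary table above.
lines = [
    "At4g08830 putative protein 57 5e-09",
    "At3g32110 non-LTR reverse transcriptase, putative 49 2e-06",
    "At4g15580 splicing factor like protein 32 0.19",
]
hits = [parse_hit_line(line) for line in lines]

# Hypothetical significance cutoff, chosen only for illustration.
significant = [h.seq_id for h in hits if h.evalue < 1e-3]
print(significant)
```

Splitting from the right keeps descriptions containing spaces or commas intact, so the same function works whether the columns are separated by one space or padded to a fixed width.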
>At4g08830 putative protein
Length = 947
Score = 57.0 bits (136), Expect = 5e-09
Identities = 29/84 (34%), Positives = 45/84 (53%)
Query: 20 IRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYSTLQAELWGI 79
+ W P D WVKLN+DGA + +A G++RD + +A +G S AELWG+
Sbjct: 839 VAWKPPDGEWVKLNTDGASRGNLGLATTGGVLRDGIGHWCGGFALDIGVCSAPLAELWGV 898
Query: 80 LCGARLLQERDY*RVLIETNLSIL 103
G + ER + RV +E + ++
Sbjct: 899 YYGLYMAWERRFTRVELEVDSELV 922
>At2g22350 putative non-LTR retroelement reverse transcriptase
Length = 321
Score = 52.4 bits (124), Expect = 1e-07
Identities = 33/98 (33%), Positives = 50/98 (50%), Gaps = 9/98 (9%)
Query: 2 LRVAQVTRVVSPVRDAPAIRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIA 61
+RV++V R+VS W + WVKLN+DGA + A G++RD A++
Sbjct: 142 VRVSRVERLVS---------WVSPEDGWVKLNTDGASRGNPGFATAGGVLRDHNGAWIGG 192
Query: 62 YARRLGSYSTLQAELWGILCGARLLQERDY*RVLIETN 99
+A +G S AELWG+ G + R RV +E +
Sbjct: 193 FAVNIGVCSAPLAELWGVYYGLFIAWGRGARRVELEVD 230
>At2g27870 putative non-LTR retroelement reverse transcriptase
Length = 314
Score = 50.8 bits (120), Expect = 4e-07
Identities = 29/84 (34%), Positives = 44/84 (51%)
Query: 20 IRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYSTLQAELWGI 79
I W + W KLN+DGA + +A+ G++RD A+ +A +G S AELWG+
Sbjct: 144 IAWSKPEEGWWKLNTDGASRGNPGLASAGGVLRDEEGAWRGGFALNIGVCSAPLAELWGV 203
Query: 80 LCGARLLQERDY*RVLIETNLSIL 103
G + ER R+ IE + I+
Sbjct: 204 YYGLYIAWERRVTRLEIEVDSEIV 227
>At5g33360 putative protein
Length = 306
Score = 48.5 bits (114), Expect = 2e-06
Identities = 24/80 (30%), Positives = 40/80 (50%)
Query: 20 IRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYSTLQAELWGI 79
+RW W KLN+DGA + +A G +R+ + + +A +G S AELWG+
Sbjct: 136 VRWSKPSLGWCKLNTDGASHGNPGLAIAGGALRNEYGEWCFGFALNIGRCSAPLAELWGV 195
Query: 80 LCGARLLQERDY*RVLIETN 99
G + +R R+ +E +
Sbjct: 196 YYGLFMAWDRGITRLELEVD 215
>At3g32110 non-LTR reverse transcriptase, putative
Length = 1911
Score = 48.5 bits (114), Expect = 2e-06
Identities = 25/72 (34%), Positives = 39/72 (53%)
Query: 11 VSPVRDAPAIRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYS 70
+S +R IRW P W K+N+DGA + +A+ G++R+ A+ +A +G S
Sbjct: 1732 LSGLRVNKPIRWTPPMEGWYKINTDGASRGNPGLASAGGVLRNSAGAWCGGFAVNIGRCS 1791
Query: 71 TLQAELWGILCG 82
AELWG+ G
Sbjct: 1792 APLAELWGVYYG 1803
>At1g17390 hypothetical protein
Length = 322
Score = 47.8 bits (112), Expect = 3e-06
Identities = 25/75 (33%), Positives = 36/75 (47%)
Query: 15 RDAPAIRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYSTLQA 74
R+ I W P W KLN+DGA + +A G++RD + ++ +G S A
Sbjct: 97 REERLIAWSPPRVGWFKLNTDGASRGNPRLATAGGVVRDGDGNWCYGFSLNIGICSAPLA 156
Query: 75 ELWGILCGARLLQER 89
ELWG G + ER
Sbjct: 157 ELWGAYYGLNIAWER 171
>At2g31080 putative non-LTR retroelement reverse transcriptase
Length = 1231
Score = 47.0 bits (110), Expect = 6e-06
Identities = 27/90 (30%), Positives = 46/90 (51%)
Query: 14 VRDAPAIRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYSTLQ 73
VR IRW WVK+ +DGA + +AA G IR+ ++ +A +GS +
Sbjct: 1055 VRVERMIRWQVPSDGWVKITTDGASRGNHGLAAAGGAIRNGQGEWLGGFALNIGSCAAPL 1114
Query: 74 AELWGILCGARLLQERDY*RVLIETNLSIL 103
AELWG G + ++ + RV ++ + ++
Sbjct: 1115 AELWGAYYGLLIAWDKGFRRVELDLDCKLV 1144
>At2g07730 putative non-LTR retroelement reverse transcriptase
Length = 970
Score = 43.5 bits (101), Expect = 6e-05
Identities = 31/97 (31%), Positives = 43/97 (43%), Gaps = 2/97 (2%)
Query: 4 VAQVTRVVSPVRDAPAIRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYA 63
V + V R IRW WVKL +DGA +AA G I + ++ +A
Sbjct: 784 VGTLNNHVKRARVERMIRWKAPSDRWVKLTTDGASRGHQGLAAASGAILNLQGEWLGGFA 843
Query: 64 RRLGSYSTLQAELWGILCGARLLQERDY*RVLIETNL 100
+GS AELWG G + ++ + RV E NL
Sbjct: 844 LNIGSCDAPLAELWGAYYGLLIAWDKGFRRV--ELNL 878
>At1g47320 hypothetical protein
Length = 259
Score = 39.7 bits (91), Expect = 9e-04
Identities = 21/74 (28%), Positives = 37/74 (49%)
Query: 30 VKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYSTLQAELWGILCGARLLQER 89
+K+N+DGA + +A G+++D + ++ +G S AELWG G L ER
Sbjct: 101 LKINTDGASRGNPGLATAGGVLQDNEGRWCGGFSLNIGRSSAPMAELWGAYYGLYLAWER 160
Query: 90 DY*RVLIETNLSIL 103
+ +E + I+
Sbjct: 161 KSSHIELEVDSEIV 174
>At5g61080 putative protein
Length = 348
Score = 37.0 bits (84), Expect = 0.006
Identities = 30/103 (29%), Positives = 44/103 (42%), Gaps = 9/103 (8%)
Query: 30 VKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYSTLQAELWGILCGARLLQER 89
VKLN G +D+ GI+RD+ +V Y R S + A L I G + L +
Sbjct: 195 VKLNIQGTSNPLSDLTRSAGIVRDQSGKWVFGYIRCHKSIPEVVAGLLAIYQGLKYLWDS 254
Query: 90 DY*RVLIETNLSILGVVLVLIPVLLLLTK*SRLFSRVSPRLGA 132
+ R+ +ET ++ LT S LF + LGA
Sbjct: 255 GFRRIHLET---------TSFEIINALTTKSSLFCKSKTLLGA 288
>At2g45230 putative non-LTR retroelement reverse transcriptase
Length = 1374
Score = 35.8 bits (81), Expect = 0.013
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 2/96 (2%)
Query: 9 RVVSPVRDAPAIRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGS 68
+V S RD ++W P WVK N+DGA+ ++R+ + R L S
Sbjct: 1203 QVTSSTRDR-CVKWQPPSHGWVKCNTDGAWSKDLGNCGVGWVLRNHTGRLLWLGLRALPS 1261
Query: 69 -YSTLQAELWGILCGARLLQERDY*RVLIETNLSIL 103
S L+ E+ + L +Y RV+ E++ L
Sbjct: 1262 QQSVLETEVEALRWAVLSLSRFNYRRVIFESDSQYL 1297
>At4g07516 putative protein
Length = 740
Score = 33.9 bits (76), Expect = 0.049
Identities = 22/83 (26%), Positives = 37/83 (44%)
Query: 35 DGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYSTLQAELWGILCGARLLQERDY*RV 94
DGA + +A G++RD + +A +G S AELWG+ + E R+
Sbjct: 606 DGASPGNPGLATASGVLRDEHGNWRGDFALNIGICSAPLAELWGVYYKLYIAWEMRITRL 665
Query: 95 LIETNLSILGVVLVLIPVLLLLT 117
+E + I+ + I L + T
Sbjct: 666 ELEVDSEIVSAFFMCIERLTVTT 688
>At4g15580 splicing factor like protein
Length = 559
Score = 32.0 bits (71), Expect = 0.19
Identities = 17/47 (36%), Positives = 25/47 (53%)
Query: 20 IRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRL 66
I W WVK+++DGA + AA G+IRD +V +A +L
Sbjct: 26 IAWTKPPEGWVKVSTDGASRGNPGPAAAGGVIRDEDGLWVGGFALQL 72
>At2g01840 putative non-LTR retroelement reverse transcriptase
Length = 1715
Score = 32.0 bits (71), Expect = 0.19
Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 2/80 (2%)
Query: 29 WVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLG-SYSTLQAELWGILCGARLLQ 87
++K N D Y+ D + I+RD + + +L SYS LQAE G L +++
Sbjct: 1563 FLKCNFDSGYVQGRDYTSTGWILRDCNGRVLHSGCAKLQQSYSALQAEALGFLHALQMVW 1622
Query: 88 ERDY*RVLIE-TNLSILGVV 106
R Y V E NL + ++
Sbjct: 1623 IRGYCYVWFEGDNLELTNLI 1642
>At1g26950 hypothetical protein
Length = 158
Score = 32.0 bits (71), Expect = 0.19
Identities = 16/56 (28%), Positives = 30/56 (53%)
Query: 48 RGIIRDRFSAFVIAYARRLGSYSTLQAELWGILCGARLLQERDY*RVLIETNLSIL 103
RG +RD + + ++A LG + AELWG+ G + E+ R+ +E + ++
Sbjct: 16 RGALRDEYGDWRGSFALNLGRCTAPLAELWGVYYGLVIAWEKGITRLELEVDSKLV 71
>At1g27870 hypothetical protein
Length = 213
Score = 30.8 bits (68), Expect = 0.42
Identities = 18/85 (21%), Positives = 42/85 (49%), Gaps = 3/85 (3%)
Query: 21 RWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSY--STLQAELWG 78
RW + W+K N DG++++ + ++RD ++++A + +G + L++E+
Sbjct: 51 RWRRPERGWIKCNFDGSFVNGDVKSKAGWVVRDSNGSYLLA-GQAIGRKVDNALESEIQA 109
Query: 79 ILCGARLLQERDY*RVLIETNLSIL 103
++ + Y RV E + +L
Sbjct: 110 LIISMQHCWSHGYKRVCFEGDNKML 134
>At1g04630 unknown protein
Length = 356
Score = 30.4 bits (67), Expect = 0.54
Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 5/83 (6%)
Query: 18 PAIRWWPLDAV-WVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYST--LQA 74
P W + W K N DG Y S+A A ++RD F+ A A +GS +T +++
Sbjct: 190 PTANLWTTPPIGWTKCNYDGTYHSNAPSKA-GWLLRDDRGTFLGA-AHAIGSITTNPMES 247
Query: 75 ELWGILCGARLLQERDY*RVLIE 97
EL ++ + R Y ++ E
Sbjct: 248 ELQALVMAMQHCWSRGYRKIYFE 270
>At3g30570 putative reverse transcriptase
Length = 1099
Score = 30.0 bits (66), Expect = 0.71
Identities = 15/34 (44%), Positives = 18/34 (52%)
Query: 20 IRWWPLDAVWVKLNSDGAYISSADMAACRGIIRD 53
I W A W KLN DGA + +AA G +RD
Sbjct: 1019 IGWRVPSAGWYKLNMDGASRGNPGLAAAGGALRD 1052
>At4g29090 putative protein
Length = 575
Score = 29.3 bits (64), Expect = 1.2
Identities = 23/88 (26%), Positives = 36/88 (40%), Gaps = 1/88 (1%)
Query: 21 RWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSY-STLQAELWGI 79
RW P WVK N+D + + ++R+ AR L S L+AEL +
Sbjct: 419 RWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLKSVLEAELEAM 478
Query: 80 LCGARLLQERDY*RVLIETNLSILGVVL 107
L Y V+ E++ +L +L
Sbjct: 479 RWAVLSLSRFQYNYVIFESDSQVLIEIL 506
>At3g29820 hypothetical protein
Length = 332
Score = 29.3 bits (64), Expect = 1.2
Identities = 20/89 (22%), Positives = 37/89 (41%), Gaps = 17/89 (19%)
Query: 15 RDAPAIRWWPLDAVWVKLNSDGAYISSADMAACRGIIRDRFSAFVIAYARRLGSYSTLQA 74
R+ I W W+K+N++GA + +A G+ ++
Sbjct: 200 REERMIGWSAPQVGWIKVNTNGASRGNLGLATSAGVCE-----------------IVMEL 242
Query: 75 ELWGILCGARLLQERDY*RVLIETNLSIL 103
ELWG+ G L ER +V +E + +++
Sbjct: 243 ELWGVYYGLYLAWERMATQVELEIDSNMV 271
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
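As a consistency check, the seven database volumes listed above can be summed and compared with the totals in the report header (26,719 sequences; 11,318,596 total letters). This is a small sketch using only the figures printed in this report; the dictionary keys are the volume names exactly as they appear above.

```python
# (letters, sequences) per database volume, copied from the report.
volumes = {
    "ara_mips": (2_978_382, 6832),
    "/data/blast2/ara_mips_chr2": (1_737_135, 4184),
    "/data/blast2/ara_mips_chr3": (2_236_886, 5377),
    "/data/blast2/ara_mips_chr4": (1_748_816, 4030),
    "/data/blast2/ara_mips_chr5": (2_569_679, 6098),
    "/data/blast2/ara_mips_chl": (25_951, 85),
    "/data/blast2/ara_mips_mit": (21_747, 113),
}

total_letters = sum(letters for letters, _ in volumes.values())
total_sequences = sum(seqs for _, seqs in volumes.values())

print(total_letters, total_sequences)
```

Both sums agree with the header line of the report.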
Lambda   K        H
0.336    0.145    0.454
Gapped
Lambda   K        H
0.267    0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,361,096
Number of Sequences: 26719
Number of extensions: 116472
Number of successful extensions: 359
Number of sequences better than 10.0: 25
Number of HSP's better than 10.0 without gapping: 19
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 340
Number of HSP's gapped (non-prelim): 25
length of query: 181
length of database: 11,318,596
effective HSP length: 93
effective length of query: 88
effective length of database: 8,833,729
effective search space: 777368152
effective search space used: 777368152
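The effective lengths reported above follow from the raw lengths and the effective HSP length. The sketch below assumes the usual length correction (subtract the effective HSP length once from the query and once per database sequence); it reproduces the figures in this report, but it is not a description of everything the BLAST code does, for example any clamping of very short sequences.

```python
# Values copied from the report.
query_length = 181
database_letters = 11_318_596
database_sequences = 26_719
effective_hsp_length = 93

# Assumed length correction: one HSP length off the query,
# and one HSP length off every database sequence.
effective_query_length = query_length - effective_hsp_length
effective_database_length = (
    database_letters - database_sequences * effective_hsp_length
)
effective_search_space = effective_query_length * effective_database_length

print(effective_query_length)      # 88
print(effective_database_length)   # 8833729
print(effective_search_space)      # 777368152
```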
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.7 bits)
S2: 57 (26.6 bits)
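The bit scores and E-values in the alignments can be approximately recomputed from the gapped Lambda and K and the effective search space printed above. This is a sketch assuming the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits). Because Lambda and K are printed to only a few digits, the results match the report only after rounding.

```python
import math

# Gapped statistical parameters and search space, copied from the report.
LAMBDA = 0.267
K = 0.0410
SEARCH_SPACE = 777_368_152


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


def expect(bits: float) -> float:
    """Expected number of chance hits with at least this bit score."""
    return SEARCH_SPACE * 2.0 ** (-bits)


# Raw scores of the first two alignments (136 and 124).
for raw in (136, 124):
    bits = bit_score(raw)
    print(f"raw={raw}  bits={bits:.1f}  E={expect(bits):.0e}")
```

For the top hit (raw score 136) this gives about 57.0 bits and an E-value near 5e-09, in line with the reported values.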