
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0331c.6
(426 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At3g18720 unknown protein 58 1e-08
At2g31080 putative non-LTR retroelement reverse transcriptase 50 3e-06
At1g49360 unknown protein 47 2e-05
At2g07730 putative non-LTR retroelement reverse transcriptase 43 3e-04
At5g61080 putative protein 43 4e-04
At2g22350 putative non-LTR retroelement reverse transcriptase 42 8e-04
At1g17390 hypothetical protein 40 0.003
At5g33360 putative protein 38 0.012
At2g27870 putative non-LTR retroelement reverse transcriptase 37 0.021
At4g09710 RNA-directed DNA polymerase -like protein 36 0.035
At3g56470 putative protein 36 0.046
At3g32110 non-LTR reverse transcriptase, putative 36 0.046
At4g12370 putative protein 35 0.10
At1g47320 hypothetical protein 34 0.13
At4g18320 hypothetical protein 33 0.23
At4g08830 putative protein 33 0.23
At4g22400 putative protein 33 0.30
At1g26950 hypothetical protein 33 0.30
At2g17690 hypothetical protein 31 1.5
At1g04630 unknown protein 31 1.5
>At3g18720 unknown protein
Length = 380
Score = 57.8 bits (138), Expect = 1e-08
Identities = 67/265 (25%), Positives = 106/265 (39%), Gaps = 36/265 (13%)
Query: 132 QSYPWLLYVHNNLAIIRS----ILDPT--ETYSRSIPELSG-ANLLTVQHGGWCVLEKGG 184
QS PWL Y + + + +P+ +T+ PEL+G N L GW ++ K
Sbjct: 90 QSRPWLFYPQSQRGGPKEGDYVLFNPSRSQTHHLKFPELTGYRNKLACAKDGWLLVVKDN 149
Query: 185 KEKVRELLLWNPFLQEKIHLPSLKHENRIRNCI-LSCSPTETDQVCSIFLFPHEAYGYGP 243
+ V L NPF E+I LP + +N R+C+ S +PT T C + F +++ Y
Sbjct: 150 PDVVFFL---NPFTGERICLPQVP-QNSTRDCLTFSAAPTSTS--CCVISFTPQSFLYAV 203
Query: 244 NILFYCEVGDKQWFAVDYRKDLVRASPPEEDGVVYNPFLYSPVYFNGRLYAVSYS-RLLV 302
+ G+ W + D Y + ++ NG Y +S S RL V
Sbjct: 204 VKVDTWRPGESVWTTHHF------------DQKRYGEVINRCIFSNGMFYCLSTSGRLSV 251
Query: 303 IIEKLEPPNGLQIHSADVYLPDPFPHTFPCLRQLLVCNHE--LFHIQISMEYNKIFGIAV 360
E N L + F +RQ+ + HE +F + N+ +
Sbjct: 252 FDPSRETWNVLPVKPCRA-----FRRKIMLVRQVFMTEHEGDIFVVTTRRVNNR--KLLA 304
Query: 361 HKLNFSHMIWERVKSIKDMVFFLSD 385
KLN +WE +K + F SD
Sbjct: 305 FKLNLQGNVWEEMKVPNGLTVFSSD 329
>At2g31080 putative non-LTR retroelement reverse transcriptase
Length = 1231
Score = 49.7 bits (117), Expect = 3e-06
Identities = 36/109 (33%), Positives = 53/109 (48%), Gaps = 3/109 (2%)
Query: 3 GGGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVV 62
GG IR G W+ GFA+ G+ A ++L GLL W+ GFR V +DC V V
Sbjct: 1089 GGAIRNGQGEWLGGFALNIGSCAAPLAELWGAYYGLLIAWDKGFRRVELDLDCKLV--VG 1146
Query: 63 ITNSDVQN-YWHGAVILLIREVIARNLSVSVTPVPKEQKMVVDALAKLA 110
++ V N + ++ L + R+ V V+ V +E + D LA A
Sbjct: 1147 FLSTGVSNAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGLANYA 1195
>At1g49360 unknown protein
Length = 481
Score = 47.4 bits (111), Expect = 2e-05
Identities = 62/277 (22%), Positives = 105/277 (37%), Gaps = 33/277 (11%)
Query: 136 WLLY--VHNNLAIIRSILDPTE---TYSRSIPELSGANLLTVQHGGWCVLEKGGKEKVRE 190
WLLY + + DP E T ++PELS ++ + GW +L + +
Sbjct: 153 WLLYHDAFQDKGVSYGFFDPVEKKKTKEMNLPELSKSSGILYSKDGW-LLMNDSLSLIAD 211
Query: 191 LLLWNPFLQEKIHLPSLK-HENRIRNCILSCSPTETDQVCSIFLFPHEAYGYGPNILFYC 249
+ +NPF +E+I LP + E+ N SC+PT+ + C +F + + I +
Sbjct: 212 MYFFNPFTRERIDLPRNRIMESVHTNFAFSCAPTK--KSCLVFGINNISSSVAIKISTW- 268
Query: 250 EVGDKQWFAVDYRKDLVRASPPEEDGVVYNPFLYSPVYFNGRLYAVSYSRLLVIIEKLEP 309
G W +D P Y L + +Y +G Y S + L V
Sbjct: 269 RPGATTWL----HEDFPNLFPS------YFRRLGNILYSDGLFYTASETALGVFDPTART 318
Query: 310 PNGLQIHSADVYLPDPFPHTFPCLRQLLVCNHELFHIQISMEYNKIFGIAVHKLNFSHMI 369
N L + P P +R + +F + S V++LN +
Sbjct: 319 WNVLPV--------QPIPMAPRSIRWMTEYEGHIFLVDASS-----LEPMVYRLNRLESV 365
Query: 370 WERVKSIKDMVFFLSDKDLPFARRVNPGGGGLIYFTS 406
WE+ +++ FLSD + ++YF S
Sbjct: 366 WEKKETLDGSSIFLSDGSCVMTYGLTGSMSNILYFWS 402
>At2g07730 putative non-LTR retroelement reverse transcriptase
Length = 970
Score = 43.1 bits (100), Expect = 3e-04
Identities = 32/107 (29%), Positives = 51/107 (46%), Gaps = 1/107 (0%)
Query: 4 GGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVVI 63
G I G W+ GFA+ G+ DA ++L GLL W+ GFR V +D SE+ +
Sbjct: 829 GAILNLQGEWLGGFALNIGSCDAPLAELWGAYYGLLIAWDKGFRRVELNLD-SELVVGFL 887
Query: 64 TNSDVQNYWHGAVILLIREVIARNLSVSVTPVPKEQKMVVDALAKLA 110
+ + + ++ L + R+ V V+ V +E + D LA A
Sbjct: 888 STGISKAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGLANYA 934
>At5g61080 putative protein
Length = 348
Score = 42.7 bits (99), Expect = 4e-04
Identities = 26/106 (24%), Positives = 50/106 (46%), Gaps = 3/106 (2%)
Query: 4 GGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVVI 63
G +R G+WV G+ + + LLA+ +GL ++W+ GFR ++ E+ +
Sbjct: 214 GIVRDQSGKWVFGYIRCHKSIPEVVAGLLAIYQGLKYLWDSGFRRIHLETTSFEIINALT 273
Query: 64 TNSDVQNYWHGAVIL-LIREVIARNLSVSVTPVPKEQKMVVDALAK 108
T S + + +L +++I + + + KEQ + LAK
Sbjct: 274 TKSSL--FCKSKTLLGACKDMILKEWECDIYHISKEQNSCAEWLAK 317
>At2g22350 putative non-LTR retroelement reverse transcriptase
Length = 321
Score = 41.6 bits (96), Expect = 8e-04
Identities = 29/108 (26%), Positives = 50/108 (45%), Gaps = 1/108 (0%)
Query: 3 GGGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVV 62
GG +R G W+ GFA+ G A ++L + GL W G R V VD S++
Sbjct: 179 GGVLRDHNGAWIGGFAVNIGVCSAPLAELWGVYYGLFIAWGRGARRVELEVD-SKMVVGF 237
Query: 63 ITNSDVQNYWHGAVILLIREVIARNLSVSVTPVPKEQKMVVDALAKLA 110
+T ++ ++ L + +++ V ++ V +E + D LA A
Sbjct: 238 LTTGIADSHPLSFLLRLCYDFLSKGWIVRISHVYREANRLADGLANYA 285
>At1g17390 hypothetical protein
Length = 322
Score = 39.7 bits (91), Expect = 0.003
Identities = 28/108 (25%), Positives = 49/108 (44%), Gaps = 1/108 (0%)
Query: 3 GGGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVV 62
GG +R G W GF++ G A ++L GL WE G + +D V +
Sbjct: 130 GGVVRDGDGNWCYGFSLNIGICSAPLAELWGAYYGLNIAWERGVTQLEMEIDSEMVVGFL 189
Query: 63 ITNSDVQNYWHGAVILLIREVIARNLSVSVTPVPKEQKMVVDALAKLA 110
T D ++ ++ L +++++ SV ++ V +E + D LA A
Sbjct: 190 RTGID-DSHPLSFLVRLCHGLLSKDWSVRISHVYREANRLADGLANYA 236
>At5g33360 putative protein
Length = 306
Score = 37.7 bits (86), Expect = 0.012
Identities = 27/110 (24%), Positives = 49/110 (44%), Gaps = 1/110 (0%)
Query: 1 MGGGGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQK 60
+ GG +R + G W GFA+ G A ++L + GL W+ G + VD SE+
Sbjct: 162 IAGGALRNEYGEWCFGFALNIGRCSAPLAELWGVYYGLFMAWDRGITRLELEVD-SEMVV 220
Query: 61 VVITNSDVQNYWHGAVILLIREVIARNLSVSVTPVPKEQKMVVDALAKLA 110
+ ++ ++ + ++R+ V + V +E + D LA A
Sbjct: 221 GFLRTGIGSSHPLSFLVRMCHGFLSRDWIVRIGHVYREANRLADELANYA 270
>At2g27870 putative non-LTR retroelement reverse transcriptase
Length = 314
Score = 37.0 bits (84), Expect = 0.021
Identities = 29/108 (26%), Positives = 49/108 (44%), Gaps = 1/108 (0%)
Query: 3 GGGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVV 62
GG +R + G W GFA+ G A ++L + GL WE + VD SE+
Sbjct: 172 GGVLRDEEGAWRGGFALNIGVCSAPLAELWGVYYGLYIAWERRVTRLEIEVD-SEIVVGF 230
Query: 63 ITNSDVQNYWHGAVILLIREVIARNLSVSVTPVPKEQKMVVDALAKLA 110
+ + + ++ L + I+R+ V ++ V +E + D LA A
Sbjct: 231 LKIGINEVHPLSFLVRLCHDFISRDWRVRISHVYREANRLADGLANYA 278
>At4g09710 RNA-directed DNA polymerase -like protein
Length = 1274
Score = 36.2 bits (82), Expect = 0.035
Identities = 30/95 (31%), Positives = 47/95 (48%), Gaps = 16/95 (16%)
Query: 24 GDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVVITNSDVQNYWHGAVILLIREV 83
G A ++ LA+ L+ G R +N F DC E+ + + NS G I+ +R +
Sbjct: 1178 GSALMAETLAVHLALVDALSTGVRQLNVFSDCKEL--ISLLNS-------GKSIVELRGL 1228
Query: 84 I--ARNLSVSVTP-----VPKEQKMVVDALAKLAV 111
+ R LSVS T +P+ +V D+LAK A+
Sbjct: 1229 LHDIRELSVSFTHLCFFFIPRLSNVVADSLAKSAL 1263
>At3g56470 putative protein
Length = 367
Score = 35.8 bits (81), Expect = 0.046
Identities = 38/163 (23%), Positives = 67/163 (40%), Gaps = 16/163 (9%)
Query: 125 CLLPPTTQSYPWLLYVHNNLAIIRSILDPT--ETYSRSIPELSGANLLTVQHGGWCVLEK 182
C+ + PWL+Y + + DP+ + + PELSG + GW ++
Sbjct: 63 CVSLRVIHTSPWLIYF-SKTDDSYELYDPSMQKNCNLHFPELSGFRVC-YSKDGWLLMYN 120
Query: 183 GGKEKVRELLLWNPFLQEKIHLPSL--KHENRIRNCILSCSPTETDQVCSIFLFPHEAYG 240
+LL +NPF ++ + +P+L ++ R+ SC+PT T C +F +
Sbjct: 121 PNSY---QLLFFNPFTRDCVPMPTLWMAYDQRM---AFSCAPTSTS--CLLFTVTSVTWN 172
Query: 241 YGPNILFYCEVGDKQWFAVDYRKDLVRASPPEEDGVVYNPFLY 283
Y ++ K+W ++ L R E V N Y
Sbjct: 173 YITIKTYFANA--KEWKTSVFKNRLQRNFNTFEQIVFSNGVFY 213
>At3g32110 non-LTR reverse transcriptase, putative
Length = 1911
Score = 35.8 bits (81), Expect = 0.046
Identities = 27/108 (25%), Positives = 47/108 (43%), Gaps = 1/108 (0%)
Query: 3 GGGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVV 62
GG +R G W GFA+ G A ++L + GL W + VD SEV
Sbjct: 1769 GGVLRNSAGAWCGGFAVNIGRCSAPLAELWGVYYGLYMAWAKQLTHLELEVD-SEVVVGF 1827
Query: 63 ITNSDVQNYWHGAVILLIREVIARNLSVSVTPVPKEQKMVVDALAKLA 110
+ + + ++ L ++++ +V ++ V +E + D LA A
Sbjct: 1828 LKTGIGETHPLSFLVRLCHNFLSKDWTVRISHVYREANSLADGLANHA 1875
>At4g12370 putative protein
Length = 300
Score = 34.7 bits (78), Expect = 0.10
Identities = 24/83 (28%), Positives = 40/83 (47%), Gaps = 8/83 (9%)
Query: 157 YSRSIPELSGANLLTVQHGGWCVLEKGGKEKVRELLLWNPFLQEKIHLPSLKHENRIRNC 216
Y+ ++PEL+ + + GW ++ K RE+ +NPF +E I++P K
Sbjct: 21 YTLNLPELAKSTVC-YSRDGWLLMRKTIS---REMFFFNPFTRELINVP--KCTLSYDAI 74
Query: 217 ILSCSPTETDQVCSIFLFPHEAY 239
SC+P T C + F H +Y
Sbjct: 75 AFSCAP--TSGTCVLLAFKHVSY 95
>At1g47320 hypothetical protein
Length = 259
Score = 34.3 bits (77), Expect = 0.13
Identities = 26/105 (24%), Positives = 45/105 (42%), Gaps = 1/105 (0%)
Query: 3 GGGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVV 62
GG ++ + GRW GF++ G A ++L GL WE + VD SE+
Sbjct: 119 GGVLQDNEGRWCGGFSLNIGRSSAPMAELWGAYYGLYLAWERKSSHIELEVD-SEIVVGF 177
Query: 63 ITNSDVQNYWHGAVILLIREVIARNLSVSVTPVPKEQKMVVDALA 107
+ ++ ++ L I+++ V + V +E D LA
Sbjct: 178 LKTGISDHHPLSFLVRLCHGFISKDWRVRIFHVYREANRFADGLA 222
>At4g18320 hypothetical protein
Length = 320
Score = 33.5 bits (75), Expect = 0.23
Identities = 28/97 (28%), Positives = 41/97 (41%), Gaps = 9/97 (9%)
Query: 135 PWLLYVHNNLAIIRSILDPT--ETYSRSIPELSGANLLTVQHGGWCVLEKGG---KEKVR 189
PWLLY + DP E + P L GA L G W V+ + +
Sbjct: 66 PWLLYRERGSRVAH-FYDPVREELHHGKDPYLEGAKFLGSTLG-WVVMSNSSAIHRRQDH 123
Query: 190 ELLLWNPFLQEKIHLPSLKHENR--IRNCILSCSPTE 224
+ L+NPF ++ LP L +N IR +L+ P +
Sbjct: 124 QAFLYNPFTSQRYQLPLLTTDNPCVIRYGVLTGDPRQ 160
>At4g08830 putative protein
Length = 947
Score = 33.5 bits (75), Expect = 0.23
Identities = 20/52 (38%), Positives = 25/52 (47%), Gaps = 1/52 (1%)
Query: 4 GGIRGDG-GRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVD 54
GG+ DG G W GFA+ G A ++L + GL WE F V VD
Sbjct: 867 GGVLRDGIGHWCGGFALDIGVCSAPLAELWGVYYGLYMAWERRFTRVELEVD 918
>At4g22400 putative protein
Length = 327
Score = 33.1 bits (74), Expect = 0.30
Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 13/114 (11%)
Query: 135 PWLLYVHNNLAIIRSILDPTE--TYSRSIPELSGANLLTVQHGGWCVLEKG---GKEKVR 189
PWLLY R DP + + P L+ A L G W V+ + K
Sbjct: 66 PWLLYRERGSGETR-FFDPVRERVHRGNDPRLADARFLGSTLG-WLVMSESLDLNPRKTH 123
Query: 190 ELLLWNPFLQEKIHLPSLKHENR-----IRNCILSCSPTETD-QVCSIFLFPHE 237
+ L+NPF+ E LP L ++ +R +L+ +P++ + C I + HE
Sbjct: 124 QTFLYNPFISELQQLPELTLDSPYPTGVLRYGVLTGNPSDNNTYACLITDYLHE 177
>At1g26950 hypothetical protein
Length = 158
Score = 33.1 bits (74), Expect = 0.30
Identities = 27/107 (25%), Positives = 47/107 (43%), Gaps = 1/107 (0%)
Query: 4 GGIRGDGGRWVSGFAMAQGAGDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVVI 63
G +R + G W FA+ G A ++L + GL+ WE G + VD S++ +
Sbjct: 17 GALRDEYGDWRGSFALNLGRCTAPLAELWGVYYGLVIAWEKGITRLELEVD-SKLVAGFL 75
Query: 64 TNSDVQNYWHGAVILLIREVIARNLSVSVTPVPKEQKMVVDALAKLA 110
T ++ ++ L +++ V V+ V +E D LA A
Sbjct: 76 TTGIEDSHLLSFLVRLCYGFSSKDWIVRVSHVYREANRFADELANYA 122
>At2g17690 hypothetical protein
Length = 421
Score = 30.8 bits (68), Expect = 1.5
Identities = 22/72 (30%), Positives = 34/72 (46%), Gaps = 4/72 (5%)
Query: 354 KIFGIAVHKLNFSHMIWERVKSIKDMVFFLSDKDLPFARRVNPGGGGL---IYFTSMNNK 410
K G V+K + + W VKS+ D ++ D F+ + G L IYFT +
Sbjct: 332 KTIGFKVYKNDEELLKWVEVKSLGDKAIVIAT-DACFSVSAHEFYGCLPNSIYFTDKKEE 390
Query: 411 FVHIFNLEDNSL 422
V +F L+D S+
Sbjct: 391 EVKVFKLDDGSI 402
>At1g04630 unknown protein
Length = 356
Score = 30.8 bits (68), Expect = 1.5
Identities = 33/108 (30%), Positives = 50/108 (45%), Gaps = 11/108 (10%)
Query: 6 IRGDGGRWVSGFAMAQGA--GDAFKSKLLALQKGLLHVWELGFRPVNCFVDCSEVQKVVI 63
+R D G ++ G A A G+ + +S+L AL + H W G+R + D EV ++V
Sbjct: 223 LRDDRGTFL-GAAHAIGSITTNPMESELQALVMAMQHCWSRGYRKIYFEGDNKEVSEIV- 280
Query: 64 TNSDVQNYWHGAVILLIREVIA---RNLSVSVTPVPKEQKMVVDALAK 108
N N+ AV IR++ A + T + M DALAK
Sbjct: 281 -NGRSSNF---AVFNWIRDISAWRSKFEECKFTWTRRFSNMAADALAK 324
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.323 0.142 0.452
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,853,770
Number of Sequences: 26719
Number of extensions: 505542
Number of successful extensions: 1315
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 17
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 1293
Number of HSP's gapped (non-prelim): 43
length of query: 426
length of database: 11,318,596
effective HSP length: 102
effective length of query: 324
effective length of database: 8,593,258
effective search space: 2784215592
effective search space used: 2784215592
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (28.1 bits)
Lotus: description of TM0331c.6