
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0299.5
(547 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAC63844.1| putative non-LTR retroelement reverse transcripta... 162 3e-38
dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like ... 146 2e-33
ref|NP_680357.1| RNase H domain-containing protein [Arabidopsis ... 144 9e-33
gb|AAC26674.1| putative non-LTR retroelement reverse transcripta... 130 1e-28
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta... 127 1e-27
pir||A96682 protein F1E22.12 [imported] - Arabidopsis thaliana g... 125 5e-27
ref|NP_680149.1| reverse transcriptase-related [Arabidopsis thal... 117 7e-25
emb|CAB79667.1| putative protein [Arabidopsis thaliana] gi|49720... 112 2e-23
gb|AAD03565.2| putative non-LTR retroelement reverse transcripta... 110 1e-22
pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi... 110 1e-22
gb|AAT38702.1| putative RNase H domain containing protein [Solan... 108 6e-22
gb|AAD21778.1| putative non-LTR retroelement reverse transcripta... 107 7e-22
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta... 104 8e-21
gb|AAF18538.1| Very similar to retrotransposon reverse transcrip... 103 2e-20
gb|AAD24831.1| putative non-LTR retroelement reverse transcripta... 101 5e-20
emb|CAB78094.1| RNA-directed DNA polymerase-like protein [Arabid... 101 5e-20
pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW1... 100 2e-19
gb|AAD32950.1| putative non-LTR retroelement reverse transcripta... 99 3e-19
gb|AAF23283.1| putative non-LTR reverse transcriptase [Arabidops... 98 6e-19
gb|AAP54692.1| putative reverse transcriptase [Oryza sativa (jap... 87 1e-15
>gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25408124|pir||C84716 hypothetical protein
At2g31080 [imported] - Arabidopsis thaliana
Length = 1231
Score = 162 bits (410), Expect = 3e-38
Identities = 128/473 (27%), Positives = 211/473 (44%), Gaps = 35/473 (7%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKY----LRDTSIHKVQAHQHDSPIWKGI-LWARDMID 119
+V K ++++ + LW RV+ KY ++DTS K Q S W+ + + R+++
Sbjct: 728 LVAKVGWRLLQDKESLWARVVRKKYKVGGVQDTSWLKPQPRW--SSTWRSVAVGLREVVV 785
Query: 120 QRFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVHISD--------VNLTLCDLIQDNKWN 170
+ + G G T W W Q P V + + + + + WN
Sbjct: 786 KGVGWVPGDGCTIRFWLDRW------LLQEPLVELGTDMIPEGERIKVAADYWLPGSGWN 839
Query: 171 LQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWIN-HLAHNPIE 229
L+ L LP +++++ L+V Q+ + D WK G ++VR AY + + P
Sbjct: 840 LEILGLYLPETVKRRLLSVVVQVFLGNGDEISWKGTQDGAFTVRSAYSLLQGDVGDRPNM 899
Query: 230 DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLR 289
N +WKL PE++R+F W V N I N VR HL+ +A C+ C E LH LR
Sbjct: 900 GSFFNRIWKLITPERVRVFIWLVSQNVIMTNVERVRRHLSENAICSVCNGAEETILHVLR 959
Query: 290 DCSFS*DLWRRMGAI-NWRNFRYNNIISW-FSSM--ARGVHGIQFLAGVWGAWKWRCNWL 345
DC +WRR+ + F +++ W F++M +G+ F G+W AWKWRC +
Sbjct: 960 DCPAMEPIWRRLLPLRRHHEFFSQSLLEWLFTNMDPVKGIWPTLFGMGIWWAWKWRCCDV 1019
Query: 346 LDSQRWP------IEVVWRRIAHDHDDWAWCAPSN-DLLLCHPWSPPPPDTVKCNSDGSF 398
++ I+ + + H P+ + W P VK +DG+
Sbjct: 1020 FGERKICRDRLKFIKDMAEEVRRVHVGAVGNRPNGVRVERMIRWQVPSDGWVKITTDGAS 1079
Query: 399 REDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDV 458
R + G IR+ QG W+ G L + A AE L +AW++G+ R+ D+
Sbjct: 1080 RGNHGLAAAGGAIRNGQGEWLGGFALNIGSCAAPLAELWGAYYGLLIAWDKGFRRVELDL 1139
Query: 459 NCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHL 511
+C LV + H ++ RDW VR++ V+R++N +AD L
Sbjct: 1140 DC-KLVVGFLSTGVSNAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGL 1191
>dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
thaliana]
Length = 676
Score = 146 bits (368), Expect = 2e-33
Identities = 127/488 (26%), Positives = 219/488 (44%), Gaps = 34/488 (6%)
Query: 80 LWVRVLEHKYLRDTSIHK---VQAHQHDSPIWKGI-LWARDMIDQRFEFRIGKGDTSVWY 135
LW RVL KY + IH + S +W+ + + R+++++ + +G G ++
Sbjct: 184 LWARVLRSKY-KIGDIHDSAWMTPKGTWSALWRSVNVGLREVVNRGIGWVLGDGKIIRFW 242
Query: 136 QD-W----SGIGIIANQIPFVHISDVNLTLCDL-IQDNKWNLQRLYTNLPHSLQQQFLAV 189
QD W + +++Q+P + + + D I+ W+++R+ LP ++Q+ LAV
Sbjct: 243 QDRWLLSTPLLEWVSDQLP---VEERGQRVADYWIEGVGWDMERIAVFLPEFMRQRLLAV 299
Query: 190 QPQICMNREDAWIWKDGSSGRYSVRDAY--EWINHLAHNPIEDRKLNWVWKLRVPEKIRM 247
C ED W +GR++V AY + ++ ++ + R + VW++ VPE+ R+
Sbjct: 300 VIGGCYGVEDKMSWVGTENGRFTVSSAYLIQSVDEISKQCMS-RFFDRVWRVMVPERARI 358
Query: 248 FTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRR-MGAINW 306
F W V + + N VR H+A C C E +H LRDC +W R + +
Sbjct: 359 FLWLVGNQVVLTNAERVRRHMADSDVCPLCKGASESLIHVLRDCPAMMGIWMRVVPVMEQ 418
Query: 307 RNFRYNNIISWF-------SSMARGVHGIQFLAGVWGAWKWRCNWLL-DSQRWPIEVVWR 358
R F +++ W S R F VW WKWRC ++ + R V +
Sbjct: 419 RRFFETSLLEWMYGNLKERSDSERRSWPTLFALTVWWGWKWRCGYVFGEDSRCRDRVKFL 478
Query: 359 RIAHDHDDWAWCAPSND------LLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIR 412
+ A + A A + D + W P V N+DG+ + + GVIR
Sbjct: 479 KSAVAEVEAAHLAANGDAREDVLVERMIAWRKPAEGWVTMNTDGASHGNPGQATAGGVIR 538
Query: 413 DHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEA 472
D G W+ G L +A AE + L +AW RG+ R+ +V+ LV +++
Sbjct: 539 DEHGSWLVGFALNIGVCSAPLAELWGVYYGLVVAWERGWRRVRLEVD-SALVVGFLQSGI 597
Query: 473 VQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHLVRRGAAAMSSES*IIQSQDHD 532
H ++ +++DW VRI V+R++N +AD L A + ++ S
Sbjct: 598 GDSHPLAFLVRLCHGFISKDWIVRITHVYREANRLADGLANY-AFTLPFGFLLLDSCPEH 656
Query: 533 VEYLLLKD 540
V +LL+D
Sbjct: 657 VSSILLED 664
>ref|NP_680357.1| RNase H domain-containing protein [Arabidopsis thaliana]
Length = 633
Score = 144 bits (362), Expect = 9e-33
Identities = 126/476 (26%), Positives = 208/476 (43%), Gaps = 37/476 (7%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKY----LRDTSIHKVQAHQHDSPIWKGILWA-RDMID 119
++ K +++K LW RVL KY LRDT+ + ++ S W+ I R+++
Sbjct: 113 LLSKVGWRLMKDRTSLWARVLRSKYRIGGLRDTTW--INTKRNASSTWRSIKSGLREVVI 170
Query: 120 QRFEFRIGKG-DTSVWYQDWSGIGIIANQIPFVHISDVN-LTLCDLIQDNK-WNLQRLYT 176
+ +G G D W W I + +D + + +L + W+L ++
Sbjct: 171 PGMNWVVGDGKDICFWDDKWLVEDPIRDLAAVELPADFQGIKIRELWHEGSGWDLAKIIP 230
Query: 177 NLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHNPIEDRKL-NW 235
+ ++ + L++ D W ++G+++V+ AY ++ R+ +
Sbjct: 231 YVSEGVRLRLLSMVVDTVTGSNDRTSWGATANGQFTVKSAYSFLLQSETQAQNMRQFFDR 290
Query: 236 VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS* 295
VW++ E++R+F W V+H I + R HL++ C C E LH LRDC
Sbjct: 291 VWRVTTTERVRVFIWLVVHQVIMTDVERRRRHLSASGVCQVCKGGDETILHVLRDCPSIA 350
Query: 296 DLWRRM---GAINWRNFRYNNIISWFSSMARGVHGIQ-------FLAGVWGAWKWRC-NW 344
+W R+ G I F +NI+ W V I+ F VW AWKWRC N
Sbjct: 351 GIWGRLVPRGKIT--AFFASNILDWVYQNLSDVTEIRGCPWATLFAIVVWWAWKWRCGNV 408
Query: 345 LLDSQRWPIEVVWRRIAHDHDDWAWCAPSN---------DLLLCHPWSPPPPDTVKCNSD 395
++ R V R D W A N ++ + W+PP K N+D
Sbjct: 409 FGENGRCRDRV---RFVVDQAREIWIAHLNLRRGAMRGSEVEMSIKWTPPSTGWFKLNTD 465
Query: 396 GSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLI 455
G+ R + GV+RD +G+W G L +A AE + L +AW RG RL
Sbjct: 466 GASRGNPGLATAGGVVRDGEGQWCVGFVLNIGICSAPLAELWGVYYGLHIAWERGIRRLE 525
Query: 456 CDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHL 511
+V+ LV ++A H ++ ++RDW VRI+ V+R++N +AD L
Sbjct: 526 LEVD-STLVVGFLQAGIEDSHPLSFLVRLCYGFISRDWIVRISHVYREANRLADGL 580
>gb|AAC26674.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25411326|pir||C84488 hypothetical protein
At2g07730 [imported] - Arabidopsis thaliana
Length = 970
Score = 130 bits (326), Expect = 1e-28
Identities = 95/327 (29%), Positives = 151/327 (46%), Gaps = 19/327 (5%)
Query: 199 DAWIWKDGSSGRYSVRDAYEWINHLAHN-PIEDRKLNWVWKLRVPEKIRMFTWQVLHNAI 257
D WK +G ++VR AYE + A P+ L +WKL PE++R+F W V H I
Sbjct: 609 DELSWKGTQNGDFTVRSAYELLKPEAEERPLIGSFLKQIWKLVAPERVRVFIWLVSHMVI 668
Query: 258 PVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRRMGAINWRNFRYNNIISW 317
N VR HL+ ATC+ C E LH LRDC +W+R+ +N ++
Sbjct: 669 MTNVERVRRHLSDIATCSVCNGADESILHVLRDCPAMTPIWQRLLPQRRQNEFFSQFEWL 728
Query: 318 FSSM--ARGVHGIQFLAGVWGAWKWRCNWLLDSQRWP------IEVVWRRIAHDH----D 365
F+++ A+G F G+W AWKWRC + ++ I+ + + H +
Sbjct: 729 FTNLDPAKGDWPTLFSMGIWWAWKWRCGDVFGERKLCRDRLKFIKDIAEEVRKAHVGTLN 788
Query: 366 DWAWCAPSNDLLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLG 425
+ A ++ W P VK +DG+ R G I + QG W+ G L
Sbjct: 789 NHVKRARVERMI---RWKAPSDRWVKLTTDGASRGHQGLAAASGAILNLQGEWLGGFALN 845
Query: 426 EAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDN-LVTILVEAEAVQMHSEFHVLHS 484
+ +A AE L +AW++G+ R+ ++N D+ LV + + H ++
Sbjct: 846 IGSCDAPLAELWGAYYGLLIAWDKGFRRV--ELNLDSELVVGFLSTGISKAHPLSFLVRL 903
Query: 485 ITQLLARDWHVRINSVHRDSNAVADHL 511
RDW VR++ V+R++N +AD L
Sbjct: 904 CQGFFTRDWLVRVSHVYREANRLADGL 930
>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25408936|pir||A84888 hypothetical protein
At2g45230 [imported] - Arabidopsis thaliana
Length = 1374
Score = 127 bits (318), Expect = 1e-27
Identities = 128/499 (25%), Positives = 202/499 (39%), Gaps = 42/499 (8%)
Query: 59 E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMI 118
E F ++GK + +MI + D L +V + +Y + S WK I A+ +I
Sbjct: 859 EAFNIALLGKQLWRMITEKDSLMAKVFKSRYFSKSDPLNAPLGSRPSFAWKSIYEAQVLI 918
Query: 119 DQRFEFRIGKGDT-SVWYQDWSGI--GIIANQIPFVHI------SDVNLTLCDLIQDNK- 168
Q IG G+T +VW W G A + H+ + +++ L+ D +
Sbjct: 919 KQGIRAVIGNGETINVWTDPWIGAKPAKAAQAVKRSHLVSQYAANSIHVVKDLLLPDGRD 978
Query: 169 WNLQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAY----EWINHLA 224
WN + P + Q+ LA++P R D + W+ SG YSV+ Y E IN
Sbjct: 979 WNWNLVSLLFPDNTQENILALRPGGKETR-DRFTWEYSRSGHYSVKSGYWVMTEIINQ-R 1036
Query: 225 HNPIE------DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCG 278
+NP E D +WKL VP KI F W+ ++N + V HLA + +C RC
Sbjct: 1037 NNPQEVLQPSLDPIFQQIWKLDVPPKIHHFLWRCVNNCLSVASNLAYRHLAREKSCVRCP 1096
Query: 279 NVVEDGLHCLRDCSFS*DLWR-----RMGAINWRNFRYNN---IISWFSSM-ARGVHGIQ 329
+ E H L C F+ W W + N ++S S H
Sbjct: 1097 SHGETVNHLLFKCPFARLTWAISPLPAPPGGEWAESLFRNMHHVLSVHKSQPEESDHHAL 1156
Query: 330 FLAGVWGAWKWRCNWLLDSQRWPIEVVWRRIAHDHDDW------AWCAPSNDLLLCHPWS 383
+W WK R + + + + V + D D W S+ C W
Sbjct: 1157 IPWILWRLWKNRNDLVFKGREFTAPQVILKATEDMDAWNNRKEPQPQVTSSTRDRCVKWQ 1216
Query: 384 PPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGR--WVAGCYLGEAAGNAFRAEAKALLD 441
PP VKCN+DG++ +D+ G V+R+H GR W+ G + + E +AL
Sbjct: 1217 PPSHGWVKCNTDGAWSKDLGNCGVGWVLRNHTGRLLWL-GLRALPSQQSVLETEVEALRW 1275
Query: 442 VLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVH 501
+ Y R+I + + LV+++ + + S + I LL V+
Sbjct: 1276 AVLSLSRFNYRRVIFESDSQYLVSLI--QNEMDIPSLAPRIQDIRNLLRHFEEVKFQFTR 1333
Query: 502 RDSNAVADHLVRRGAAAMS 520
R+ N VAD R + M+
Sbjct: 1334 REGNNVADRTARESLSLMN 1352
>pir||A96682 protein F1E22.12 [imported] - Arabidopsis thaliana
gi|6686397|gb|AAF23831.1| F1E22.12 [Arabidopsis
thaliana]
Length = 1055
Score = 125 bits (313), Expect = 5e-27
Identities = 122/486 (25%), Positives = 198/486 (40%), Gaps = 57/486 (11%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKY----LRDTSIHKVQAHQHDSPIWKGI-LWARDMID 119
++ K +++++ + LW VL+ KY +RD+ + S W+ I + RD++
Sbjct: 222 LISKVGWRLLQEKNSLWTLVLQKKYHVGEIRDSRWLIPKGSW--SSTWRSIAIGLRDVVS 279
Query: 120 QRFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVHISDVNL-TLCDL-------IQDNKWN 170
+ G G W W + P + + + T CD I W+
Sbjct: 280 HGVGWIPGDGQQIRFWTDRW------VSGKPLLELDNGERPTDCDTVVAKDLWIPGRGWD 333
Query: 171 LQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWIN-HLAHNPIE 229
++ ++ + + AV + D WK G++SVR AYE + P
Sbjct: 334 FAKIDPYTTNNTRLELRAVVLDLVTGARDRLSWKFSQDGQFSVRSAYEMLTVDEVPRPNM 393
Query: 230 DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLR 289
N +WK+RVPE+++ F W V + A+ E R HL++ C C VE LH LR
Sbjct: 394 ASFFNCLWKVRVPERVKTFLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLR 453
Query: 290 DCSFS*DLW-RRMGAINWRNFRYNNIISWFSSMARGVHGIQ-------FLAGVWGAWKWR 341
DC +W R + + F ++ W G + F +W WKWR
Sbjct: 454 DCPAQLGIWVRVVPQRRQQGFFSKSLFEWLYDNLGDRSGCEDIPWSTIFAVIIWWGWKWR 513
Query: 342 CNWLLDS-----------QRWPIEVVWRRIAHDHDDWAWCA-PSNDLLLCHPWSPPPPDT 389
C + + W +EV AH + P + ++ W P
Sbjct: 514 CGNIFGENTKCRDRVKFVKEWAVEVY---RAHSGNVLVGITQPRVERMI--GWVSPCVGW 568
Query: 390 VKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNR 449
VK N+DG+ R + GV+RD G W G L +A +AE + L AW +
Sbjct: 569 VKVNTDGASRGNPGLASAGGVLRDCTGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEK 628
Query: 450 GYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQL----LARDWHVRINSVHRDSN 505
R+ +V+ + +V L S+ H L + +L L +DW VRI V+R++N
Sbjct: 629 KVPRVELEVDSEVIVGFLKTG-----ISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREAN 683
Query: 506 AVADHL 511
+AD L
Sbjct: 684 RLADGL 689
>ref|NP_680149.1| reverse transcriptase-related [Arabidopsis thaliana]
Length = 594
Score = 117 bits (294), Expect = 7e-25
Identities = 95/369 (25%), Positives = 164/369 (43%), Gaps = 28/369 (7%)
Query: 105 SPIWKGI-LWARDMIDQRFEFRIGKGDTSVWYQD-W----SGIGIIANQIPFVHISDVNL 158
S +W+ + + R+++++ + +G G ++QD W + +++Q+P + +
Sbjct: 198 SALWRSVNVGLREVVNRGIGWVLGDGKIIRFWQDRWLLSTPLLEWVSDQLP---VEERGQ 254
Query: 159 TLCDL-IQDNKWNLQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAY 217
+ D I+ W+++R+ LP ++Q+ LAV C ED W +GR++V AY
Sbjct: 255 RVADYWIEGVGWDMERIAVFLPEFMRQRLLAVVIGGCYGVEDKMSWVGTENGRFTVSSAY 314
Query: 218 --EWINHLAHNPIEDRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCA 275
+ ++ ++ + R + VW++ VPE+ R+F W V + + N VR H+A C
Sbjct: 315 LIQSVDEISKQCMS-RFFDRVWRVMVPERARIFLWLVGNQVVLTNAERVRRHMADSDVCP 373
Query: 276 RCGNVVEDGLHCLRDCSFS*DLWRR-MGAINWRNFRYNNIISWF-------SSMARGVHG 327
C E +H LRDC +W R + + R F +++ W S R
Sbjct: 374 LCKGASESLIHVLRDCPAMMGIWMRVVPVMEQRRFFETSLLEWMYGNLKERSDSERRSWP 433
Query: 328 IQFLAGVWGAWKWRCNWLL-DSQRWPIEVVWRRIAHDHDDWAWCAPSND------LLLCH 380
F VW WKWRC ++ + R V + + A + A A + D +
Sbjct: 434 TLFALTVWWGWKWRCGYVFGEDSRCRDRVKFLKSAVAEVEAAHLAANGDAREDVLVERMI 493
Query: 381 PWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALL 440
W P V N+DG+ + + GVIRD G W+ G L +A AE +
Sbjct: 494 AWRKPAEGWVTMNTDGASHGNPGQATAGGVIRDEHGSWLVGFALNIGVCSAPLAELWGVY 553
Query: 441 DVLELAWNR 449
L +AW R
Sbjct: 554 YGLVVAWER 562
>emb|CAB79667.1| putative protein [Arabidopsis thaliana] gi|4972055|emb|CAB43923.1|
putative protein [Arabidopsis thaliana]
gi|67633766|gb|AAY78807.1| putative reverse
transcriptase/RNA-dependent DNA polymerase [Arabidopsis
thaliana] gi|15233451|ref|NP_194638.1| reverse
transcriptase, putative / RNA-dependent DNA polymerase,
putative [Arabidopsis thaliana] gi|7485741|pir||T08964
hypothetical protein F19B15.120 - Arabidopsis thaliana
Length = 575
Score = 112 bits (281), Expect = 2e-23
Identities = 110/505 (21%), Positives = 214/505 (41%), Gaps = 52/505 (10%)
Query: 59 E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMI 118
E F ++GK + +M+ + + L +V + +Y + S +WK I +++++
Sbjct: 62 EAFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEIL 121
Query: 119 DQRFEFRIGKG-DTSVWYQDW-----SGIGIIANQIPFVHISDVN--LTLCDLIQDN--K 168
Q +G G D +W W + + ++P + V+ L + DLI ++ +
Sbjct: 122 RQGARAVVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILKVSDLIDESGRE 181
Query: 169 WN---LQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAH 225
W ++ L+ + L + +I D++ W SSG Y+V+ Y + + +
Sbjct: 182 WRKDVIEMLFPEVERKLIGELRPGGRRIL----DSYTWDYTSSGDYTVKSGYWVLTQIIN 237
Query: 226 N-----PIEDRKLN----WVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCAR 276
+ + LN +WK + KI+ F W+ L N++PV HL+ ++ C R
Sbjct: 238 KRSSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYRHLSKESACIR 297
Query: 277 CGNVVEDGLHCLRDCSFS*DLWR------RMGAINWRNFRYNNIISWFSSMARGVHGIQF 330
C + E H L C+F+ W +G W + Y N+ W ++ G +
Sbjct: 298 CPSCKETVNHLLFKCTFARLTWAISSIPIPLGG-EWADSIYVNLY-WVFNLGNGNPQWEK 355
Query: 331 LAG-----VWGAWKWRCNWLLDSQRWPIEVVWRRIAHDHDDW--------AWCAPSNDLL 377
+ +W WK R + + + + V RR D ++W P +
Sbjct: 356 ASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVNRS 415
Query: 378 LCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQG--RWVAGCYLGEAAGNAFRAE 435
C W PPP VKCN+D ++ D +R G V+R+ +G +W+ L + + AE
Sbjct: 416 SCGRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLK-SVLEAE 474
Query: 436 AKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHV 495
+A+ + Y+ +I + + L+ IL E S + + +LL++ V
Sbjct: 475 LEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNNDEI--WPSLKPTIQDLQRLLSQFTEV 532
Query: 496 RINSVHRDSNAVADHLVRRGAAAMS 520
+ + R+ N +A+ + R + ++
Sbjct: 533 KFVFIPREGNTLAERVARESLSFLN 557
>gb|AAD03565.2| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25411819|pir||H84557 hypothetical protein
At2g17910 [imported] - Arabidopsis thaliana
Length = 1344
Score = 110 bits (275), Expect = 1e-22
Identities = 93/384 (24%), Positives = 162/384 (41%), Gaps = 32/384 (8%)
Query: 61 F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
F ++ K +++++ L+ RV + +Y ++ S W+ IL+ R+++ Q
Sbjct: 839 FNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDFLSATRGSRPSYAWRSILFGRELLMQ 898
Query: 121 RFEFRIGKGD-TSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQ--DNKWNLQRLYTN 177
IG G T VW W G + +V+L + LI WNL L
Sbjct: 899 GLRTVIGNGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLKVSQLIDPTSRNWNLNMLRDL 958
Query: 178 LPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHN---------PI 228
P + L +P +ED++ W +G YSV+ YE+++ H+ P
Sbjct: 959 FPWKDVEIILKQRPLFF--KEDSFCWLHSHNGLYSVKTGYEFLSKQVHHRLYQEAKVKPS 1016
Query: 229 EDRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCL 288
+ + +W L KIR+F W+ LH AIPV + + SD C C E H L
Sbjct: 1017 VNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRSDDGCLMCDTENETINHIL 1076
Query: 289 RDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGV---------HGIQFLAG--VWGA 337
+C + +W + ++ ++N S +++M+R + H ++F++ +W
Sbjct: 1077 FECPLARQVW-AITHLSSAGSEFSN--SVYTNMSRLIDLTQQNDLPHHLRFVSPWILWFL 1133
Query: 338 WKWRCNWLLDSQRWPIEVVWRRIAHDHDDW--AWCAPSND--LLLCHPWSPPPPDTVKCN 393
WK R L + + + + + +W A ND L W PP P +KCN
Sbjct: 1134 WKNRNALLFEGKGSITTTLVDKAYEAYHEWFSAQTHMQNDEKHLKITKWCPPLPGELKCN 1193
Query: 394 SDGSFREDVQRMGGVGVIRDHQGR 417
++ + G V+RD QG+
Sbjct: 1194 IGFAWSKQHHFSGASWVVRDSQGK 1217
>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana
(fragment)
Length = 1365
Score = 110 bits (275), Expect = 1e-22
Identities = 93/384 (24%), Positives = 162/384 (41%), Gaps = 32/384 (8%)
Query: 61 F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
F ++ K +++++ L+ RV + +Y ++ S W+ IL+ R+++ Q
Sbjct: 860 FNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDFLSATRGSRPSYAWRSILFGRELLMQ 919
Query: 121 RFEFRIGKGD-TSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQ--DNKWNLQRLYTN 177
IG G T VW W G + +V+L + LI WNL L
Sbjct: 920 GLRTVIGNGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLKVSQLIDPTSRNWNLNMLRDL 979
Query: 178 LPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHN---------PI 228
P + L +P +ED++ W +G YSV+ YE+++ H+ P
Sbjct: 980 FPWKDVEIILKQRPLFF--KEDSFCWLHSHNGLYSVKTGYEFLSKQVHHRLYQEAKVKPS 1037
Query: 229 EDRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCL 288
+ + +W L KIR+F W+ LH AIPV + + SD C C E H L
Sbjct: 1038 VNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRSDDGCLMCDTENETINHIL 1097
Query: 289 RDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGV---------HGIQFLAG--VWGA 337
+C + +W + ++ ++N S +++M+R + H ++F++ +W
Sbjct: 1098 FECPLARQVW-AITHLSSAGSEFSN--SVYTNMSRLIDLTQQNDLPHHLRFVSPWILWFL 1154
Query: 338 WKWRCNWLLDSQRWPIEVVWRRIAHDHDDW--AWCAPSND--LLLCHPWSPPPPDTVKCN 393
WK R L + + + + + +W A ND L W PP P +KCN
Sbjct: 1155 WKNRNALLFEGKGSITTTLVDKAYEAYHEWFSAQTHMQNDEKHLKITKWCPPLPGELKCN 1214
Query: 394 SDGSFREDVQRMGGVGVIRDHQGR 417
++ + G V+RD QG+
Sbjct: 1215 IGFAWSKQHHFSGASWVVRDSQGK 1238
>gb|AAT38702.1| putative RNase H domain containing protein [Solanum demissum]
Length = 722
Score = 108 bits (269), Expect = 6e-22
Identities = 108/494 (21%), Positives = 207/494 (41%), Gaps = 57/494 (11%)
Query: 69 AVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHD--SPIWKGILWARDMIDQRFEFRI 126
A C +++ D LW + K + + +H V + S W +L R ++ + I
Sbjct: 26 AKCTDLERKD-LWASLEATKRIYCSRVHPVAKAKSSKQSHTWSKMLKIRHSVENNILWII 84
Query: 127 GKGDTSVWYQDWSGIGIIANQI-PFVHISDVNLTLCDLIQDNKWNLQRLYTNLPHSLQQQ 185
G+ S+W+ +W G G ++N + P H + N+ D I +W+ +L LP + Q
Sbjct: 85 YAGNVSMWWDNWMGNGALSNILPPPSHYNKDNVK--DFIHKREWDFDKLSDILPPQVVNQ 142
Query: 186 FLAVQPQICMNREDAWIWKDGSSGRYSVRDAY-EWINHLAHNPIEDRKLNWVWKLRVPEK 244
+++ P N+ D IW +G ++ + AY + N N + N +W + P K
Sbjct: 143 IVSI-PIGDPNQSDYAIWIPSENGHFTTKSAYVDCSNTREKNDMR----NKIWHGKFPFK 197
Query: 245 IRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGL-HCLRDCSFS*DLWRRMGA 303
+ TW+++ N +P + + D+ C C N+ + + H + + LW++ G
Sbjct: 198 MSFLTWRLVQNKLPFYDTVGKFVDNIDSNCVCCKNMKTETINHVFLNSDVASYLWKKFGG 257
Query: 304 INWRNFRYNNIIS-----WFSSMARGVHGIQF----LAGVWGAWKWRCNWLLDSQRWPIE 354
+ R ++ I+ W +H + + W WK RC Q+
Sbjct: 258 TLGIDTRASSTINLLKTWWNVQTHNSIHNVIIHTLPILIFWEIWKRRCACKYGDQK---- 313
Query: 355 VVWRRIAHDHDDW-----------------AWCAPSNDLLLCHP--------WSPPPPDT 389
+W R +H W +W N + P W+ P +
Sbjct: 314 KMWYRTMENHVWWNLKMSLRMTFPSFEIGNSWRDLLNKVESLRPYPKWKIVHWNTPNINC 373
Query: 390 VKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNR 449
VK N+DGSF +G ++RDH R + + + + AEA A + +
Sbjct: 374 VKINTDGSFSSGNAGLG--WIVRDHTRRMIMAFSIPSSCSSNNLAEALAARFGILWCLQQ 431
Query: 450 GYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVAD 509
G+ +++ +V ++ +A + + V+ I Q++A+ + +N +R++N VAD
Sbjct: 432 GFHNCYLELDSKLVVDMVRNGQATNLKIK-GVVEDIIQVVAK-MNCEVNHCYREANQVAD 489
Query: 510 HLVRRGAAAMSSES 523
L + A +S+E+
Sbjct: 490 ALAKH--AVISNEA 501
>gb|AAD21778.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25410938|pir||G84429 hypothetical protein
At2g01840 [imported] - Arabidopsis thaliana
Length = 1715
Score = 107 bits (268), Expect = 7e-22
Identities = 120/523 (22%), Positives = 217/523 (40%), Gaps = 70/523 (13%)
Query: 51 WRLRHQGD------E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHD 104
W ++GD F ++ K +++ L R+ + Y +T+ + H
Sbjct: 1189 WGKENEGDLGFKDLHQFNRALLAKQAWRILTNPQSLLARLYKGLYYPNTTYLRANKGGHA 1248
Query: 105 SPIWKGILWARDMIDQRFEFRIGKGDTS-VWYQDWSGIGIIANQIPFVHISDVNLTLCDL 163
S W I + ++ Q R+G G T+ +W W + + + I D ++ + DL
Sbjct: 1249 SYGWNSIQEGKLLLQQGLRVRLGDGQTTKIWEDPW--LPTLPPRPARGPILDEDMKVADL 1306
Query: 164 IQDNK--WNLQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWIN 221
++NK W+ ++ + + QQ D++ W + +Y+VR Y
Sbjct: 1307 WRENKREWD-PVIFEGVLNPEDQQLAKSLYLSNYAARDSYKWAYTRNTQYTVRSGYWVAT 1365
Query: 222 HL------AHNPIE-DRKLNW-VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDAT 273
H+ NP+E D L +W+L++ KI+ F W+ L A+ ++ +D T
Sbjct: 1366 HVNLTEEEIINPLEGDVPLKQEIWRLKITPKIKHFIWRCLSGALSTTTQLRNRNIPADPT 1425
Query: 274 CARCGNVVEDGLHCLRDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGV-------- 325
C RC N E H + CS++ +WR NF +N + + ++ +
Sbjct: 1426 CQRCCNADETINHIIFTCSYAQVVWRS------ANFSGSNRLCFTDNLEENIRLILQGKK 1479
Query: 326 -------HGIQFLAGVWGAWKWRCNWLLDS-QRWPIEVVWRRIAHDHDDWAW-------- 369
+G+ +W WK R +L R+P +V ++ + +W
Sbjct: 1480 NQNLPILNGLMPFWIMWRLWKSRNEYLFQQLDRFPWKVA-QKAEQEATEWVETMVNDTAI 1538
Query: 370 ---CAPSND--LLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGRWV-AGCY 423
A SND L WS PP +KCN D + + ++RD GR + +GC
Sbjct: 1539 SHNTAQSNDRPLSRSKQWSSPPEGFLKCNFDSGYVQGRDYTSTGWILRDCNGRVLHSGCA 1598
Query: 424 LGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLH 483
+ + +A +AEA L L++ W RGY + + + L ++ + E + H+L
Sbjct: 1599 KLQQSYSALQAEALGFLHALQMVWIRGYCYVWFEGDNLELTNLINKTE------DHHLLE 1652
Query: 484 SITQLLARDWHVR-----INSVHRDSNAVADHLVRRGAAAMSS 521
++ + R W + I V+R+ N AD L + A +MSS
Sbjct: 1653 TLLYDI-RFWMTKLPFSSIGYVNRERNLAADKLTKY-ANSMSS 1693
>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25412331|pir||G84649 hypothetical protein
At2g25550 [imported] - Arabidopsis thaliana
Length = 1750
Score = 104 bits (259), Expect = 8e-21
Identities = 119/493 (24%), Positives = 197/493 (39%), Gaps = 57/493 (11%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQRFEF 124
++ K ++I+ + L+ RV++ +Y +D SI + + S W +L ++ +
Sbjct: 1247 LLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASLLDGIALLKKGTRH 1306
Query: 125 RIGKGDTSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDNKWNLQRLYTNLPHSLQQ 184
IG G G+ I + P ++ T ++ +N + + Y S
Sbjct: 1307 LIGDGQNIR-----IGLDNIVDSHPPRPLNTEE-TYKEMTINNLFERKGSYYFWDDSKIS 1360
Query: 185 QFLAVQPQICMNR--------EDAWIWKDGSSGRYSVRDAYEWINH------LAHNPIE- 229
QF+ ++R D IW ++G Y+VR Y + H A NP
Sbjct: 1361 QFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHG 1420
Query: 230 --DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHC 287
D K +W L + K++ F W+ L A+ E + D +C RC E H
Sbjct: 1421 SIDLKTR-IWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHA 1479
Query: 288 LRDCSFS*DLWRRMGAINWRN------FRYN--NIISWFSSMARG-VHGIQFLAGVWGAW 338
L C F+ WR + RN F N NI+++ H + + +W W
Sbjct: 1480 LFTCPFATMAWRLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIW 1539
Query: 339 KWRCNWLLDSQR-WPIEVVWRRIAHDHDDWAWC--------APSNDLLLCH---PWSPPP 386
K R N + + R P + V A HD W PS + W PP
Sbjct: 1540 KARNNVVFNKFRESPSKTVLSAKAETHD---WLNATQSHKKTPSPTRQIAENKIEWRNPP 1596
Query: 387 PDTVKCNSDGSFREDVQRMGGVG--VIRDHQGRWVA-GCYLGEAAGNAFRAEAKALLDVL 443
VKCN D F DVQ++ G +IR+H G ++ G N AE KALL L
Sbjct: 1597 ATYVKCNFDAGF--DVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAAL 1654
Query: 444 ELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEF-HVLHSITQLLARDWHVRINSVHR 502
+ W RGY+++ + +C L+ ++ + HS + L I+ + ++ + +
Sbjct: 1655 QQTWIRGYTQVFMEGDCQTLINLI---NGISFHSSLANHLEDISFWANKFASIQFGFIRK 1711
Query: 503 DSNAVADHLVRRG 515
N +A L + G
Sbjct: 1712 KGNKLAHVLAKYG 1724
>gb|AAF18538.1| Very similar to retrotransposon reverse transcriptase [Arabidopsis
thaliana] gi|25518314|pir||A86359 hypothetical protein
F12K8.9 - Arabidopsis thaliana
Length = 1231
Score = 103 bits (256), Expect = 2e-20
Identities = 102/403 (25%), Positives = 166/403 (40%), Gaps = 57/403 (14%)
Query: 61 F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
F ++ K ++++ D L+ R+++ +Y S W+ IL RD++ +
Sbjct: 709 FNQALLAKQAWRLLQFPDCLFARLIKSRYFPVGEFLDSDVGSRPSFGWRSILHGRDLLCR 768
Query: 121 RFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVH--ISDVNLTLCDLIQDNK--WNLQRLY 175
R+G G + VW W + + P++ I +V+L + DLI K W L +L
Sbjct: 769 GLVKRVGNGKSIRVWIDYWLDDNGL--RAPWIKNPIINVDLLVSDLIDYEKRDWRLDKLE 826
Query: 176 TNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWI------NHLAHNPIE 229
+ +P + + +D WIWK SG YSV+ Y W+ +A +
Sbjct: 827 EQFFPDDVVKIRENRPVVSL--DDFWIWKHNKSGDYSVKLGY-WLASNQNLGQVAIEAMM 883
Query: 230 DRKLN----WVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGL 285
LN VWKL+ KI++F W+VL AIPV +L + D+ C CG E
Sbjct: 884 QPSLNDLKTQVWKLQTEPKIKVFLWKVLSGAIPVVDLLSYRGMKLDSRCQTCGCEGESIQ 943
Query: 286 HCLRDCSFS*DLWR-----------RMGAI--NWRNFRYN-NIISWFSSMARGVHGIQFL 331
H L CSF +W G++ N +F N + + W + R I
Sbjct: 944 HVLFSCSFPRQVWAMSNIHVPLLGFECGSVYANLYHFLINRDNLKWPVELRRSFPWI--- 1000
Query: 332 AGVWGAWKWRCNWLLDSQRWPIEVVWRRIAHDHDDWAWC----------APSNDLLLCHP 381
+W WK R + + +R+ + ++ D +DW +D + P
Sbjct: 1001 --IWRIWKNRNLFFFEGKRFTVLETILKVRKDVEDWFAAQVVEKERRAEVGQSDQQVFSP 1058
Query: 382 --------WSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQG 416
W PPP D VKCN S+ + G V+R+ +G
Sbjct: 1059 RNVSPVVRWLPPPTDWVKCNVGLSWSRRNRLAGVAWVLRNDRG 1101
>gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25408166|pir||G84721 hypothetical protein
At2g31520 [imported] - Arabidopsis thaliana
Length = 1524
Score = 101 bits (252), Expect = 5e-20
Identities = 119/493 (24%), Positives = 195/493 (39%), Gaps = 57/493 (11%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQRFEF 124
++ K ++I+ + L+ RV++ +Y +D SI + + S W +L ++ +
Sbjct: 1021 LLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASLLDGIALLKKGTRH 1080
Query: 125 RIGKGDTSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDNKWNLQRLYTNLPHSLQQ 184
IG G G+ I + P ++ T ++ +N + + Y S
Sbjct: 1081 LIGDGQNIR-----IGLDNIVDSHPPRPLNTEE-TYKEMTINNLFERKGSYYFWDDSKIS 1134
Query: 185 QFLAVQPQICMNR--------EDAWIWKDGSSGRYSVRDAYEWINH------LAHNPIE- 229
QF+ ++R D IW ++G Y+VR Y + H A NP
Sbjct: 1135 QFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHG 1194
Query: 230 --DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHC 287
D K +W L + K++ F W+ L A+ E + D C RC E H
Sbjct: 1195 SIDLKTR-IWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPICPRCHRENESINHA 1253
Query: 288 LRDCSFS*DLWRRMGAINWRN------FRYN--NIISWFSSMARG-VHGIQFLAGVWGAW 338
L C F+ W + RN F N NI+++ H + + +W W
Sbjct: 1254 LFTCPFATMAWWLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIW 1313
Query: 339 KWRCNWLLDSQR-WPIEVVWRRIAHDHDDWAWC--------APSNDLLLCH---PWSPPP 386
K R N + + R P + V A HD W PS + W PP
Sbjct: 1314 KARNNVVFNKFRESPSKTVLSAKAETHD---WLNATQSHKKTPSPTRQIAENKIEWRNPP 1370
Query: 387 PDTVKCNSDGSFREDVQRMGGVG--VIRDHQGRWVA-GCYLGEAAGNAFRAEAKALLDVL 443
VKCN D F DVQ++ G +IR+H G ++ G N AE KALL L
Sbjct: 1371 ATYVKCNFDAGF--DVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAAL 1428
Query: 444 ELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEF-HVLHSITQLLARDWHVRINSVHR 502
+ W RGY+++ + +C L+ ++ + HS + L I+ + ++ + R
Sbjct: 1429 QQTWIRGYTQVFMEGDCQTLINLI---NGISFHSSLANHLEDISFWANKFASIQFGFIRR 1485
Query: 503 DSNAVADHLVRRG 515
N +A L + G
Sbjct: 1486 KGNKLAHVLAKYG 1498
>emb|CAB78094.1| RNA-directed DNA polymerase-like protein [Arabidopsis thaliana]
gi|4538901|emb|CAB39638.1| RNA-directed DNA
polymerase-like protein [Arabidopsis thaliana]
gi|7485606|pir||T04018 hypothetical protein F17A8.60 -
Arabidopsis thaliana
Length = 1274
Score = 101 bits (252), Expect = 5e-20
Identities = 114/491 (23%), Positives = 204/491 (41%), Gaps = 51/491 (10%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAH-QHDSPIWKGILWARDMIDQRFE 123
+ K +++K+ L RVL KY +S A S W+GIL RD++ +
Sbjct: 799 IEAKLSWRILKEPHSLLSRVLLGKYCNTSSFMDCSASPSFASHGWRGILAGRDLLRKGLG 858
Query: 124 FRIGKGDT-SVWYQDWSGIGIIANQIPFVHISDVN--LTLCDLIQDN--KWNLQRLYTNL 178
+ IG+GD+ +VW + W + + Q P ++ N L++ DLI + WN++ + +L
Sbjct: 859 WSIGQGDSINVWTEAW--LSPSSPQTPIGPPTETNKDLSVHDLICHDVKSWNVEAIRKHL 916
Query: 179 PHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHNPIEDRKLNW--- 235
P + Q + + +D+ +W SG Y+ + Y + L P NW
Sbjct: 917 PQ-YEDQIRKITIN-ALPLQDSLVWLPVKSGEYTTKTGYA-LAKLNSFPASQLDFNWQKN 973
Query: 236 VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS* 295
+WK+ K++ F W+ + A+PV E R ++ ++ TC RCG E LH + C ++
Sbjct: 974 IWKIHTSPKVKHFLWKAMKGALPVGEALSRRNIEAEVTCKRCGQ-TESSLHLMLLCPYAK 1032
Query: 296 DLWRRMGAINWRNFRYNNIISWFSSMARGVHGIQFLAG---------------VWGAWKW 340
+W + +N + SS+A + + + +W WK
Sbjct: 1033 KVWELAPVL------FNPSEATHSSVALLLVDAKRMVALPPTGLGSAPLYPWLLWHLWKA 1086
Query: 341 RCNWLLDSQRWPIEVVWRRIAHDHDDWAWCAPSNDLLLCHP-----WSPPPPD--TVKCN 393
R + D+ E + + D W LL+ HP + P P+ C
Sbjct: 1087 RNRLIFDNHSCSEEGLVLKAILDARAWM----EAQLLIHHPSPISDYPSPTPNLKVTSCF 1142
Query: 394 SDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSR 453
D ++ G + ++ + G+A AE A+ L A + G +
Sbjct: 1143 VDAAWTTSGYCGMGWFLQDPYKVKIKENQSSSSFVGSALMAETLAVHLALVDALSTGVRQ 1202
Query: 454 LICDVNCDNLVTILVEAEA-VQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHLV 512
L +C L+++L ++ V++ +LH I +L H+ + R SN VAD L
Sbjct: 1203 LNVFSDCKELISLLNSGKSIVELRG---LLHDIRELSVSFTHLCFFFIPRLSNVVADSLA 1259
Query: 513 RRGAAAMSSES 523
+ + + S S
Sbjct: 1260 KSALSVILSSS 1270
>pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW15) - Arabidopsis
thaliana retrotransposon Ta11-1 gi|976278|gb|AAA75254.1|
reverse transcriptase
Length = 1333
Score = 99.8 bits (247), Expect = 2e-19
Identities = 98/405 (24%), Positives = 158/405 (38%), Gaps = 63/405 (15%)
Query: 59 E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMI 118
E F ++ K ++++ + L+ R + +Y + + S W+ IL RD++
Sbjct: 861 ESFNQALLAKQAWRLLQFPNSLFARFFKSRYYDEEDFLDAELKATPSYAWRSILHGRDLL 920
Query: 119 DQRFEFRIGKGD-TSVWYQDWSGIGIIAN--QIPFVHISDVNLTLC--DLIQDNKWNLQR 173
+ F ++G G TSVW W I N ++P VNL L DLI +R
Sbjct: 921 IKGFRKKVGNGSSTSVWMDPW----IYDNDPRLPLQKHFSVNLDLRVHDLINVEDRCRRR 976
Query: 174 LYTNLPHSLQQQFLAVQPQICMNR------EDAWIWKDGSSGRYSVRDAY---------E 218
L++ F +I + R +D W+W SG YSV+ Y E
Sbjct: 977 ------DRLEELFYPADIEIIVKRNPVVSMDDFWVWLHSKSGEYSVKSGYWLAFQTNKPE 1030
Query: 219 WINHLAHNPIEDRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCG 278
I P + +W KI++F W++L +A+PV +R + D C CG
Sbjct: 1031 LIREARVQPSTNGLKEKIWSTLTSPKIKLFLWRILSSALPVAYQIIRRGMPIDPRCQVCG 1090
Query: 279 NVVEDGLHCLRDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGVHGIQFLAG----- 333
E H L CS + +W G + F + N SS+ + + L G
Sbjct: 1091 EEGESINHVLFTCSLARQVWALSG-VPTSQFGFQN-----SSIFANIQYLLELKGKGLIP 1144
Query: 334 ----------VWGAWKWRCNWLLDSQRW-PIEVVWRRIAHDHDDW------AWCAPSNDL 376
+W WK R + + P++ + +I D +W + +
Sbjct: 1145 EQIKKSWPWVLWRLWKNRDKLFFEGTIFSPLKSI-EKIRDDVQEWFLAQALVASVDAGET 1203
Query: 377 LLCHP----WSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGR 417
+ P W PPP VKCN G + + GG V+RD G+
Sbjct: 1204 VCSAPCPSSWEPPPLGWVKCNISGVWSGKKRVCGGAWVLRDDHGK 1248
>gb|AAD32950.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana] gi|25411805|pir||C84554 hypothetical protein
At2g17610 [imported] - Arabidopsis thaliana
Length = 773
Score = 99.0 bits (245), Expect = 3e-19
Identities = 78/327 (23%), Positives = 132/327 (39%), Gaps = 23/327 (7%)
Query: 61 F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
F ++ K +++++ L RV + KY + +A S WK IL +I +
Sbjct: 316 FNIALLAKQSWRILQQPFSLMARVFKAKYFPKERLLDAKATSQSSYAWKSILHGTKLISR 375
Query: 121 RFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDNKWNLQRLYTNLP 179
++ G G+ +W +W + + L + DL+ + +WN L +
Sbjct: 376 GLKYIAGNGNNIQLWKDNWLPLNPPRPPVGTCDSIYSQLKVSDLLIEGRWNEDLLCKLIH 435
Query: 180 HSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAH---------NPIED 230
+ A++P I DA W G YSV+ Y + L+ N +
Sbjct: 436 QNDIPHIRAIRPSIT-GANDAITWIYTHDGNYSVKSGYHLLRKLSQQQHASLPSPNEVSA 494
Query: 231 RKL-NWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLR 289
+ + +WK P KI+ F W+ HNA+P R L +D TC RCG ED H L
Sbjct: 495 QTVFTNIWKQNAPPKIKHFWWRSAHNALPTAGNLKRRRLITDDTCQRCGEASEDVNHLLF 554
Query: 290 DCSFS*DLWRRM-------GAINWRNFRYN--NIISWFSSMARGVHGIQFLAGVWGAWKW 340
C S ++W + ++ +F N +I S + V F+ W WK
Sbjct: 555 QCRVSKEIWEQAHIKLCPGDSLMSNSFNQNLESIQKLNQSARKDVSLFPFIG--WRIWKM 612
Query: 341 RCNWLLDSQRWPIEVVWRRIAHDHDDW 367
R + + +++RW I ++ D W
Sbjct: 613 RNDLIFNNKRWSIPDSIQKALIDQQQW 639
>gb|AAF23283.1| putative non-LTR reverse transcriptase [Arabidopsis thaliana]
gi|15232695|ref|NP_187562.1| hypothetical protein
[Arabidopsis thaliana]
Length = 484
Score = 98.2 bits (243), Expect = 6e-19
Identities = 95/353 (26%), Positives = 144/353 (39%), Gaps = 43/353 (12%)
Query: 197 REDAWIWKDGSSGRYSVRDAYEWINH------LAHNPIE---DRKLNWVWKLRVPEKIRM 247
+ D IW ++G Y+VR Y + H A NP D K +W L + K++
Sbjct: 115 KPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTR-IWNLPIMPKLKH 173
Query: 248 FTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRRMGAINWR 307
F W+ L A+ E + D +C RC E H L C F+ WR + R
Sbjct: 174 FLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSLIR 233
Query: 308 N------FRYN--NIISWFSSMARG-VHGIQFLAGVWGAWKWRCNWLLDSQR-WPIEVVW 357
N F N NI+++ H + + +W WK R N + + R P + V
Sbjct: 234 NQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVL 293
Query: 358 RRIAHDHDDWAWC--------APSNDLLLCH---PWSPPPPDTVKCNSDGSFREDVQRMG 406
A HD W PS + W PP VKCN D F DVQ++
Sbjct: 294 SAKAETHD---WLNATQSHKKTPSPTRQIAENKIEWRNPPATYVKCNFDAGF--DVQKLE 348
Query: 407 GVG--VIRDHQGRWVA-GCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNL 463
G +IR+H G ++ G N AE KALL L+ W RGY+++ + +C L
Sbjct: 349 ATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTL 408
Query: 464 VTILVEAEAVQMHSEF-HVLHSITQLLARDWHVRINSVHRDSNAVADHLVRRG 515
+ ++ + HS + L I+ + ++ + R N +A L + G
Sbjct: 409 INLI---NGISFHSSLANHLEDISFWANKFASIQFGFIRRKGNKLAHVLAKYG 458
>gb|AAP54692.1| putative reverse transcriptase [Oryza sativa (japonica
cultivar-group)] gi|37536206|ref|NP_922405.1| putative
reverse transcriptase [Oryza sativa (japonica
cultivar-group)] gi|27311287|gb|AAO00713.1|
retrotransposon protein, putative, unclassified [Oryza
sativa (japonica cultivar-group)]
Length = 1557
Score = 87.0 bits (214), Expect = 1e-15
Identities = 80/302 (26%), Positives = 129/302 (42%), Gaps = 34/302 (11%)
Query: 61 F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
F ++ + ++I D L RVL+ KY + SI + SP W+ I +++ +
Sbjct: 1230 FNQALLARQAWRLIDNPDSLCARVLKAKYYPNGSIVDTSFGGNASPGWQAIEHGLELVKK 1289
Query: 121 RFEFRIGKG-DTSVWYQDWSGIGIIANQIPFVHISDVNLT-LCDLIQDN-KWNLQRLYTN 177
+RIG G VW W + ++ P ++ + + DL+ DN W+ ++
Sbjct: 1290 GIIWRIGNGRSVRVWQDPWLPRDL--SRRPITPKNNCRIKWVADLMLDNGMWDANKI--- 1344
Query: 178 LPHSLQQQFLAVQPQICM-------NREDAWIWKDGSSGRYSVRDAYE----WINHLAHN 226
Q FL V +I + + ED W G +SVR AY W A +
Sbjct: 1345 -----NQIFLPVDVEIILKLRTSSRDEEDFIAWHPDKLGNFSVRTAYRLAENWAKEEASS 1399
Query: 227 PIEDRKLN--W--VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVE 282
D + W +WK VP K+++FTW+ N +P + + +L TC CG E
Sbjct: 1400 SSSDVNIRKAWELLWKCNVPSKVKIFTWRATSNCLPTWDNKKKRNLEISDTCVICGMEKE 1459
Query: 283 DGLHCLRDCSFS*DLWRRMGAINWRNFRYNNII---SW-FSSMARGVHGIQ--FLAGVWG 336
D +H L C + LW M N + R ++ + SW F+ +A Q FL +W
Sbjct: 1460 DTMHALCRCPQAKHLWLAMKESNDLSLRMDDHLLGPSWLFNRLALLPDHEQPMFLMVLWR 1519
Query: 337 AW 338
W
Sbjct: 1520 IW 1521
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.332 0.143 0.506
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 987,946,954
Number of Sequences: 2540612
Number of extensions: 42533091
Number of successful extensions: 98759
Number of sequences better than 10.0: 279
Number of HSP's better than 10.0 without gapping: 108
Number of HSP's successfully gapped in prelim test: 171
Number of HSP's that attempted gapping in prelim test: 98258
Number of HSP's gapped (non-prelim): 353
length of query: 547
length of database: 863,360,394
effective HSP length: 133
effective length of query: 414
effective length of database: 525,458,998
effective search space: 217540025172
effective search space used: 217540025172
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 78 (34.7 bits)
Lotus: description of TM0299.5