Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0299.5
         (547 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAC63844.1| putative non-LTR retroelement reverse transcripta...   162  3e-38
dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like ...   146  2e-33
ref|NP_680357.1| RNase H domain-containing protein [Arabidopsis ...   144  9e-33
gb|AAC26674.1| putative non-LTR retroelement reverse transcripta...   130  1e-28
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...   127  1e-27
pir||A96682 protein F1E22.12 [imported] - Arabidopsis thaliana g...   125  5e-27
ref|NP_680149.1| reverse transcriptase-related [Arabidopsis thal...   117  7e-25
emb|CAB79667.1| putative protein [Arabidopsis thaliana] gi|49720...   112  2e-23
gb|AAD03565.2| putative non-LTR retroelement reverse transcripta...   110  1e-22
pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...   110  1e-22
gb|AAT38702.1| putative RNase H domain containing protein [Solan...   108  6e-22
gb|AAD21778.1| putative non-LTR retroelement reverse transcripta...   107  7e-22
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...   104  8e-21
gb|AAF18538.1| Very similar to retrotransposon reverse transcrip...   103  2e-20
gb|AAD24831.1| putative non-LTR retroelement reverse transcripta...   101  5e-20
emb|CAB78094.1| RNA-directed DNA polymerase-like protein [Arabid...   101  5e-20
pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW1...   100  2e-19
gb|AAD32950.1| putative non-LTR retroelement reverse transcripta...    99  3e-19
gb|AAF23283.1| putative non-LTR reverse transcriptase [Arabidops...    98  6e-19
gb|AAP54692.1| putative reverse transcriptase [Oryza sativa (jap...    87  1e-15

>gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana] gi|25408124|pir||C84716 hypothetical protein
            At2g31080 [imported] - Arabidopsis thaliana
          Length = 1231

 Score =  162 bits (410), Expect = 3e-38
 Identities = 128/473 (27%), Positives = 211/473 (44%), Gaps = 35/473 (7%)

Query: 65   VVGKAVCQMIKKSDKLWVRVLEHKY----LRDTSIHKVQAHQHDSPIWKGI-LWARDMID 119
            +V K   ++++  + LW RV+  KY    ++DTS  K Q     S  W+ + +  R+++ 
Sbjct: 728  LVAKVGWRLLQDKESLWARVVRKKYKVGGVQDTSWLKPQPRW--SSTWRSVAVGLREVVV 785

Query: 120  QRFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVHISD--------VNLTLCDLIQDNKWN 170
            +   +  G G T   W   W        Q P V +          + +     +  + WN
Sbjct: 786  KGVGWVPGDGCTIRFWLDRW------LLQEPLVELGTDMIPEGERIKVAADYWLPGSGWN 839

Query: 171  LQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWIN-HLAHNPIE 229
            L+ L   LP +++++ L+V  Q+ +   D   WK    G ++VR AY  +   +   P  
Sbjct: 840  LEILGLYLPETVKRRLLSVVVQVFLGNGDEISWKGTQDGAFTVRSAYSLLQGDVGDRPNM 899

Query: 230  DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLR 289
                N +WKL  PE++R+F W V  N I  N   VR HL+ +A C+ C    E  LH LR
Sbjct: 900  GSFFNRIWKLITPERVRVFIWLVSQNVIMTNVERVRRHLSENAICSVCNGAEETILHVLR 959

Query: 290  DCSFS*DLWRRMGAI-NWRNFRYNNIISW-FSSM--ARGVHGIQFLAGVWGAWKWRCNWL 345
            DC     +WRR+  +     F   +++ W F++M   +G+    F  G+W AWKWRC  +
Sbjct: 960  DCPAMEPIWRRLLPLRRHHEFFSQSLLEWLFTNMDPVKGIWPTLFGMGIWWAWKWRCCDV 1019

Query: 346  LDSQRWP------IEVVWRRIAHDHDDWAWCAPSN-DLLLCHPWSPPPPDTVKCNSDGSF 398
               ++        I+ +   +   H       P+   +     W  P    VK  +DG+ 
Sbjct: 1020 FGERKICRDRLKFIKDMAEEVRRVHVGAVGNRPNGVRVERMIRWQVPSDGWVKITTDGAS 1079

Query: 399  REDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDV 458
            R +       G IR+ QG W+ G  L   +  A  AE       L +AW++G+ R+  D+
Sbjct: 1080 RGNHGLAAAGGAIRNGQGEWLGGFALNIGSCAAPLAELWGAYYGLLIAWDKGFRRVELDL 1139

Query: 459  NCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHL 511
            +C  LV   +       H    ++        RDW VR++ V+R++N +AD L
Sbjct: 1140 DC-KLVVGFLSTGVSNAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGL 1191


>dbj|BAB09815.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
           thaliana]
          Length = 676

 Score =  146 bits (368), Expect = 2e-33
 Identities = 127/488 (26%), Positives = 219/488 (44%), Gaps = 34/488 (6%)

Query: 80  LWVRVLEHKYLRDTSIHK---VQAHQHDSPIWKGI-LWARDMIDQRFEFRIGKGDTSVWY 135
           LW RVL  KY +   IH    +      S +W+ + +  R+++++   + +G G    ++
Sbjct: 184 LWARVLRSKY-KIGDIHDSAWMTPKGTWSALWRSVNVGLREVVNRGIGWVLGDGKIIRFW 242

Query: 136 QD-W----SGIGIIANQIPFVHISDVNLTLCDL-IQDNKWNLQRLYTNLPHSLQQQFLAV 189
           QD W      +  +++Q+P   + +    + D  I+   W+++R+   LP  ++Q+ LAV
Sbjct: 243 QDRWLLSTPLLEWVSDQLP---VEERGQRVADYWIEGVGWDMERIAVFLPEFMRQRLLAV 299

Query: 190 QPQICMNREDAWIWKDGSSGRYSVRDAY--EWINHLAHNPIEDRKLNWVWKLRVPEKIRM 247
               C   ED   W    +GR++V  AY  + ++ ++   +  R  + VW++ VPE+ R+
Sbjct: 300 VIGGCYGVEDKMSWVGTENGRFTVSSAYLIQSVDEISKQCMS-RFFDRVWRVMVPERARI 358

Query: 248 FTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRR-MGAINW 306
           F W V +  +  N   VR H+A    C  C    E  +H LRDC     +W R +  +  
Sbjct: 359 FLWLVGNQVVLTNAERVRRHMADSDVCPLCKGASESLIHVLRDCPAMMGIWMRVVPVMEQ 418

Query: 307 RNFRYNNIISWF-------SSMARGVHGIQFLAGVWGAWKWRCNWLL-DSQRWPIEVVWR 358
           R F   +++ W        S   R      F   VW  WKWRC ++  +  R    V + 
Sbjct: 419 RRFFETSLLEWMYGNLKERSDSERRSWPTLFALTVWWGWKWRCGYVFGEDSRCRDRVKFL 478

Query: 359 RIAHDHDDWAWCAPSND------LLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIR 412
           + A    + A  A + D      +     W  P    V  N+DG+   +  +    GVIR
Sbjct: 479 KSAVAEVEAAHLAANGDAREDVLVERMIAWRKPAEGWVTMNTDGASHGNPGQATAGGVIR 538

Query: 413 DHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEA 472
           D  G W+ G  L     +A  AE   +   L +AW RG+ R+  +V+   LV   +++  
Sbjct: 539 DEHGSWLVGFALNIGVCSAPLAELWGVYYGLVVAWERGWRRVRLEVD-SALVVGFLQSGI 597

Query: 473 VQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHLVRRGAAAMSSES*IIQSQDHD 532
              H    ++      +++DW VRI  V+R++N +AD L    A  +     ++ S    
Sbjct: 598 GDSHPLAFLVRLCHGFISKDWIVRITHVYREANRLADGLANY-AFTLPFGFLLLDSCPEH 656

Query: 533 VEYLLLKD 540
           V  +LL+D
Sbjct: 657 VSSILLED 664


>ref|NP_680357.1| RNase H domain-containing protein [Arabidopsis thaliana]
          Length = 633

 Score =  144 bits (362), Expect = 9e-33
 Identities = 126/476 (26%), Positives = 208/476 (43%), Gaps = 37/476 (7%)

Query: 65  VVGKAVCQMIKKSDKLWVRVLEHKY----LRDTSIHKVQAHQHDSPIWKGILWA-RDMID 119
           ++ K   +++K    LW RVL  KY    LRDT+   +   ++ S  W+ I    R+++ 
Sbjct: 113 LLSKVGWRLMKDRTSLWARVLRSKYRIGGLRDTTW--INTKRNASSTWRSIKSGLREVVI 170

Query: 120 QRFEFRIGKG-DTSVWYQDWSGIGIIANQIPFVHISDVN-LTLCDLIQDNK-WNLQRLYT 176
               + +G G D   W   W     I +       +D   + + +L  +   W+L ++  
Sbjct: 171 PGMNWVVGDGKDICFWDDKWLVEDPIRDLAAVELPADFQGIKIRELWHEGSGWDLAKIIP 230

Query: 177 NLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHNPIEDRKL-NW 235
            +   ++ + L++         D   W   ++G+++V+ AY ++          R+  + 
Sbjct: 231 YVSEGVRLRLLSMVVDTVTGSNDRTSWGATANGQFTVKSAYSFLLQSETQAQNMRQFFDR 290

Query: 236 VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS* 295
           VW++   E++R+F W V+H  I  +    R HL++   C  C    E  LH LRDC    
Sbjct: 291 VWRVTTTERVRVFIWLVVHQVIMTDVERRRRHLSASGVCQVCKGGDETILHVLRDCPSIA 350

Query: 296 DLWRRM---GAINWRNFRYNNIISWFSSMARGVHGIQ-------FLAGVWGAWKWRC-NW 344
            +W R+   G I    F  +NI+ W       V  I+       F   VW AWKWRC N 
Sbjct: 351 GIWGRLVPRGKIT--AFFASNILDWVYQNLSDVTEIRGCPWATLFAIVVWWAWKWRCGNV 408

Query: 345 LLDSQRWPIEVVWRRIAHDHDDWAWCAPSN---------DLLLCHPWSPPPPDTVKCNSD 395
             ++ R    V   R   D     W A  N         ++ +   W+PP     K N+D
Sbjct: 409 FGENGRCRDRV---RFVVDQAREIWIAHLNLRRGAMRGSEVEMSIKWTPPSTGWFKLNTD 465

Query: 396 GSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLI 455
           G+ R +       GV+RD +G+W  G  L     +A  AE   +   L +AW RG  RL 
Sbjct: 466 GASRGNPGLATAGGVVRDGEGQWCVGFVLNIGICSAPLAELWGVYYGLHIAWERGIRRLE 525

Query: 456 CDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHL 511
            +V+   LV   ++A     H    ++      ++RDW VRI+ V+R++N +AD L
Sbjct: 526 LEVD-STLVVGFLQAGIEDSHPLSFLVRLCYGFISRDWIVRISHVYREANRLADGL 580


>gb|AAC26674.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
           thaliana] gi|25411326|pir||C84488 hypothetical protein
           At2g07730 [imported] - Arabidopsis thaliana
          Length = 970

 Score =  130 bits (326), Expect = 1e-28
 Identities = 95/327 (29%), Positives = 151/327 (46%), Gaps = 19/327 (5%)

Query: 199 DAWIWKDGSSGRYSVRDAYEWINHLAHN-PIEDRKLNWVWKLRVPEKIRMFTWQVLHNAI 257
           D   WK   +G ++VR AYE +   A   P+    L  +WKL  PE++R+F W V H  I
Sbjct: 609 DELSWKGTQNGDFTVRSAYELLKPEAEERPLIGSFLKQIWKLVAPERVRVFIWLVSHMVI 668

Query: 258 PVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRRMGAINWRNFRYNNIISW 317
             N   VR HL+  ATC+ C    E  LH LRDC     +W+R+     +N  ++     
Sbjct: 669 MTNVERVRRHLSDIATCSVCNGADESILHVLRDCPAMTPIWQRLLPQRRQNEFFSQFEWL 728

Query: 318 FSSM--ARGVHGIQFLAGVWGAWKWRCNWLLDSQRWP------IEVVWRRIAHDH----D 365
           F+++  A+G     F  G+W AWKWRC  +   ++        I+ +   +   H    +
Sbjct: 729 FTNLDPAKGDWPTLFSMGIWWAWKWRCGDVFGERKLCRDRLKFIKDIAEEVRKAHVGTLN 788

Query: 366 DWAWCAPSNDLLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLG 425
           +    A    ++    W  P    VK  +DG+ R         G I + QG W+ G  L 
Sbjct: 789 NHVKRARVERMI---RWKAPSDRWVKLTTDGASRGHQGLAAASGAILNLQGEWLGGFALN 845

Query: 426 EAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDN-LVTILVEAEAVQMHSEFHVLHS 484
             + +A  AE       L +AW++G+ R+  ++N D+ LV   +     + H    ++  
Sbjct: 846 IGSCDAPLAELWGAYYGLLIAWDKGFRRV--ELNLDSELVVGFLSTGISKAHPLSFLVRL 903

Query: 485 ITQLLARDWHVRINSVHRDSNAVADHL 511
                 RDW VR++ V+R++N +AD L
Sbjct: 904 CQGFFTRDWLVRVSHVYREANRLADGL 930


>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana] gi|25408936|pir||A84888 hypothetical protein
            At2g45230 [imported] - Arabidopsis thaliana
          Length = 1374

 Score =  127 bits (318), Expect = 1e-27
 Identities = 128/499 (25%), Positives = 202/499 (39%), Gaps = 42/499 (8%)

Query: 59   E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMI 118
            E F   ++GK + +MI + D L  +V + +Y   +           S  WK I  A+ +I
Sbjct: 859  EAFNIALLGKQLWRMITEKDSLMAKVFKSRYFSKSDPLNAPLGSRPSFAWKSIYEAQVLI 918

Query: 119  DQRFEFRIGKGDT-SVWYQDWSGI--GIIANQIPFVHI------SDVNLTLCDLIQDNK- 168
             Q     IG G+T +VW   W G      A  +   H+      + +++    L+ D + 
Sbjct: 919  KQGIRAVIGNGETINVWTDPWIGAKPAKAAQAVKRSHLVSQYAANSIHVVKDLLLPDGRD 978

Query: 169  WNLQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAY----EWINHLA 224
            WN   +    P + Q+  LA++P     R D + W+   SG YSV+  Y    E IN   
Sbjct: 979  WNWNLVSLLFPDNTQENILALRPGGKETR-DRFTWEYSRSGHYSVKSGYWVMTEIINQ-R 1036

Query: 225  HNPIE------DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCG 278
            +NP E      D     +WKL VP KI  F W+ ++N + V       HLA + +C RC 
Sbjct: 1037 NNPQEVLQPSLDPIFQQIWKLDVPPKIHHFLWRCVNNCLSVASNLAYRHLAREKSCVRCP 1096

Query: 279  NVVEDGLHCLRDCSFS*DLWR-----RMGAINWRNFRYNN---IISWFSSM-ARGVHGIQ 329
            +  E   H L  C F+   W            W    + N   ++S   S      H   
Sbjct: 1097 SHGETVNHLLFKCPFARLTWAISPLPAPPGGEWAESLFRNMHHVLSVHKSQPEESDHHAL 1156

Query: 330  FLAGVWGAWKWRCNWLLDSQRWPIEVVWRRIAHDHDDW------AWCAPSNDLLLCHPWS 383
                +W  WK R + +   + +    V  +   D D W           S+    C  W 
Sbjct: 1157 IPWILWRLWKNRNDLVFKGREFTAPQVILKATEDMDAWNNRKEPQPQVTSSTRDRCVKWQ 1216

Query: 384  PPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGR--WVAGCYLGEAAGNAFRAEAKALLD 441
            PP    VKCN+DG++ +D+   G   V+R+H GR  W+ G     +  +    E +AL  
Sbjct: 1217 PPSHGWVKCNTDGAWSKDLGNCGVGWVLRNHTGRLLWL-GLRALPSQQSVLETEVEALRW 1275

Query: 442  VLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVH 501
             +       Y R+I + +   LV+++     + + S    +  I  LL     V+     
Sbjct: 1276 AVLSLSRFNYRRVIFESDSQYLVSLI--QNEMDIPSLAPRIQDIRNLLRHFEEVKFQFTR 1333

Query: 502  RDSNAVADHLVRRGAAAMS 520
            R+ N VAD   R   + M+
Sbjct: 1334 REGNNVADRTARESLSLMN 1352


>pir||A96682 protein F1E22.12 [imported] - Arabidopsis thaliana
           gi|6686397|gb|AAF23831.1| F1E22.12 [Arabidopsis
           thaliana]
          Length = 1055

 Score =  125 bits (313), Expect = 5e-27
 Identities = 122/486 (25%), Positives = 198/486 (40%), Gaps = 57/486 (11%)

Query: 65  VVGKAVCQMIKKSDKLWVRVLEHKY----LRDTSIHKVQAHQHDSPIWKGI-LWARDMID 119
           ++ K   +++++ + LW  VL+ KY    +RD+     +     S  W+ I +  RD++ 
Sbjct: 222 LISKVGWRLLQEKNSLWTLVLQKKYHVGEIRDSRWLIPKGSW--SSTWRSIAIGLRDVVS 279

Query: 120 QRFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVHISDVNL-TLCDL-------IQDNKWN 170
               +  G G     W   W       +  P + + +    T CD        I    W+
Sbjct: 280 HGVGWIPGDGQQIRFWTDRW------VSGKPLLELDNGERPTDCDTVVAKDLWIPGRGWD 333

Query: 171 LQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWIN-HLAHNPIE 229
             ++     ++ + +  AV   +     D   WK    G++SVR AYE +       P  
Sbjct: 334 FAKIDPYTTNNTRLELRAVVLDLVTGARDRLSWKFSQDGQFSVRSAYEMLTVDEVPRPNM 393

Query: 230 DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLR 289
               N +WK+RVPE+++ F W V + A+   E   R HL++   C  C   VE  LH LR
Sbjct: 394 ASFFNCLWKVRVPERVKTFLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLR 453

Query: 290 DCSFS*DLW-RRMGAINWRNFRYNNIISWFSSMARGVHGIQ-------FLAGVWGAWKWR 341
           DC     +W R +     + F   ++  W         G +       F   +W  WKWR
Sbjct: 454 DCPAQLGIWVRVVPQRRQQGFFSKSLFEWLYDNLGDRSGCEDIPWSTIFAVIIWWGWKWR 513

Query: 342 CNWLLDS-----------QRWPIEVVWRRIAHDHDDWAWCA-PSNDLLLCHPWSPPPPDT 389
           C  +              + W +EV     AH  +       P  + ++   W  P    
Sbjct: 514 CGNIFGENTKCRDRVKFVKEWAVEVY---RAHSGNVLVGITQPRVERMI--GWVSPCVGW 568

Query: 390 VKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNR 449
           VK N+DG+ R +       GV+RD  G W  G  L     +A +AE   +   L  AW +
Sbjct: 569 VKVNTDGASRGNPGLASAGGVLRDCTGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEK 628

Query: 450 GYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQL----LARDWHVRINSVHRDSN 505
              R+  +V+ + +V  L         S+ H L  + +L    L +DW VRI  V+R++N
Sbjct: 629 KVPRVELEVDSEVIVGFLKTG-----ISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREAN 683

Query: 506 AVADHL 511
            +AD L
Sbjct: 684 RLADGL 689


>ref|NP_680149.1| reverse transcriptase-related [Arabidopsis thaliana]
          Length = 594

 Score =  117 bits (294), Expect = 7e-25
 Identities = 95/369 (25%), Positives = 164/369 (43%), Gaps = 28/369 (7%)

Query: 105 SPIWKGI-LWARDMIDQRFEFRIGKGDTSVWYQD-W----SGIGIIANQIPFVHISDVNL 158
           S +W+ + +  R+++++   + +G G    ++QD W      +  +++Q+P   + +   
Sbjct: 198 SALWRSVNVGLREVVNRGIGWVLGDGKIIRFWQDRWLLSTPLLEWVSDQLP---VEERGQ 254

Query: 159 TLCDL-IQDNKWNLQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAY 217
            + D  I+   W+++R+   LP  ++Q+ LAV    C   ED   W    +GR++V  AY
Sbjct: 255 RVADYWIEGVGWDMERIAVFLPEFMRQRLLAVVIGGCYGVEDKMSWVGTENGRFTVSSAY 314

Query: 218 --EWINHLAHNPIEDRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCA 275
             + ++ ++   +  R  + VW++ VPE+ R+F W V +  +  N   VR H+A    C 
Sbjct: 315 LIQSVDEISKQCMS-RFFDRVWRVMVPERARIFLWLVGNQVVLTNAERVRRHMADSDVCP 373

Query: 276 RCGNVVEDGLHCLRDCSFS*DLWRR-MGAINWRNFRYNNIISWF-------SSMARGVHG 327
            C    E  +H LRDC     +W R +  +  R F   +++ W        S   R    
Sbjct: 374 LCKGASESLIHVLRDCPAMMGIWMRVVPVMEQRRFFETSLLEWMYGNLKERSDSERRSWP 433

Query: 328 IQFLAGVWGAWKWRCNWLL-DSQRWPIEVVWRRIAHDHDDWAWCAPSND------LLLCH 380
             F   VW  WKWRC ++  +  R    V + + A    + A  A + D      +    
Sbjct: 434 TLFALTVWWGWKWRCGYVFGEDSRCRDRVKFLKSAVAEVEAAHLAANGDAREDVLVERMI 493

Query: 381 PWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALL 440
            W  P    V  N+DG+   +  +    GVIRD  G W+ G  L     +A  AE   + 
Sbjct: 494 AWRKPAEGWVTMNTDGASHGNPGQATAGGVIRDEHGSWLVGFALNIGVCSAPLAELWGVY 553

Query: 441 DVLELAWNR 449
             L +AW R
Sbjct: 554 YGLVVAWER 562


>emb|CAB79667.1| putative protein [Arabidopsis thaliana] gi|4972055|emb|CAB43923.1|
           putative protein [Arabidopsis thaliana]
           gi|67633766|gb|AAY78807.1| putative reverse
           transcriptase/RNA-dependent DNA polymerase [Arabidopsis
           thaliana] gi|15233451|ref|NP_194638.1| reverse
           transcriptase, putative / RNA-dependent DNA polymerase,
           putative [Arabidopsis thaliana] gi|7485741|pir||T08964
           hypothetical protein F19B15.120 - Arabidopsis thaliana
          Length = 575

 Score =  112 bits (281), Expect = 2e-23
 Identities = 110/505 (21%), Positives = 214/505 (41%), Gaps = 52/505 (10%)

Query: 59  E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMI 118
           E F   ++GK + +M+ + + L  +V + +Y   +           S +WK I  +++++
Sbjct: 62  EAFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEIL 121

Query: 119 DQRFEFRIGKG-DTSVWYQDW-----SGIGIIANQIPFVHISDVN--LTLCDLIQDN--K 168
            Q     +G G D  +W   W     +   +   ++P    + V+  L + DLI ++  +
Sbjct: 122 RQGARAVVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILKVSDLIDESGRE 181

Query: 169 WN---LQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAH 225
           W    ++ L+  +   L  +      +I     D++ W   SSG Y+V+  Y  +  + +
Sbjct: 182 WRKDVIEMLFPEVERKLIGELRPGGRRIL----DSYTWDYTSSGDYTVKSGYWVLTQIIN 237

Query: 226 N-----PIEDRKLN----WVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCAR 276
                  + +  LN     +WK +   KI+ F W+ L N++PV       HL+ ++ C R
Sbjct: 238 KRSSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYRHLSKESACIR 297

Query: 277 CGNVVEDGLHCLRDCSFS*DLWR------RMGAINWRNFRYNNIISWFSSMARGVHGIQF 330
           C +  E   H L  C+F+   W        +G   W +  Y N+  W  ++  G    + 
Sbjct: 298 CPSCKETVNHLLFKCTFARLTWAISSIPIPLGG-EWADSIYVNLY-WVFNLGNGNPQWEK 355

Query: 331 LAG-----VWGAWKWRCNWLLDSQRWPIEVVWRRIAHDHDDW--------AWCAPSNDLL 377
            +      +W  WK R   +   + +  + V RR   D ++W            P  +  
Sbjct: 356 ASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVNRS 415

Query: 378 LCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQG--RWVAGCYLGEAAGNAFRAE 435
            C  W PPP   VKCN+D ++  D +R G   V+R+ +G  +W+    L +   +   AE
Sbjct: 416 SCGRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLK-SVLEAE 474

Query: 436 AKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHV 495
            +A+   +       Y+ +I + +   L+ IL   E     S    +  + +LL++   V
Sbjct: 475 LEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNNDEI--WPSLKPTIQDLQRLLSQFTEV 532

Query: 496 RINSVHRDSNAVADHLVRRGAAAMS 520
           +   + R+ N +A+ + R   + ++
Sbjct: 533 KFVFIPREGNTLAERVARESLSFLN 557


>gb|AAD03565.2| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana] gi|25411819|pir||H84557 hypothetical protein
            At2g17910 [imported] - Arabidopsis thaliana
          Length = 1344

 Score =  110 bits (275), Expect = 1e-22
 Identities = 93/384 (24%), Positives = 162/384 (41%), Gaps = 32/384 (8%)

Query: 61   F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
            F   ++ K   +++++   L+ RV + +Y  ++           S  W+ IL+ R+++ Q
Sbjct: 839  FNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDFLSATRGSRPSYAWRSILFGRELLMQ 898

Query: 121  RFEFRIGKGD-TSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQ--DNKWNLQRLYTN 177
                 IG G  T VW   W   G     +      +V+L +  LI      WNL  L   
Sbjct: 899  GLRTVIGNGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLKVSQLIDPTSRNWNLNMLRDL 958

Query: 178  LPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHN---------PI 228
             P    +  L  +P     +ED++ W    +G YSV+  YE+++   H+         P 
Sbjct: 959  FPWKDVEIILKQRPLFF--KEDSFCWLHSHNGLYSVKTGYEFLSKQVHHRLYQEAKVKPS 1016

Query: 229  EDRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCL 288
             +   + +W L    KIR+F W+ LH AIPV +      + SD  C  C    E   H L
Sbjct: 1017 VNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRSDDGCLMCDTENETINHIL 1076

Query: 289  RDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGV---------HGIQFLAG--VWGA 337
             +C  +  +W  +  ++     ++N  S +++M+R +         H ++F++   +W  
Sbjct: 1077 FECPLARQVW-AITHLSSAGSEFSN--SVYTNMSRLIDLTQQNDLPHHLRFVSPWILWFL 1133

Query: 338  WKWRCNWLLDSQRWPIEVVWRRIAHDHDDW--AWCAPSND--LLLCHPWSPPPPDTVKCN 393
            WK R   L + +      +  +    + +W  A     ND   L    W PP P  +KCN
Sbjct: 1134 WKNRNALLFEGKGSITTTLVDKAYEAYHEWFSAQTHMQNDEKHLKITKWCPPLPGELKCN 1193

Query: 394  SDGSFREDVQRMGGVGVIRDHQGR 417
               ++ +     G   V+RD QG+
Sbjct: 1194 IGFAWSKQHHFSGASWVVRDSQGK 1217


>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana
            (fragment)
          Length = 1365

 Score =  110 bits (275), Expect = 1e-22
 Identities = 93/384 (24%), Positives = 162/384 (41%), Gaps = 32/384 (8%)

Query: 61   F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
            F   ++ K   +++++   L+ RV + +Y  ++           S  W+ IL+ R+++ Q
Sbjct: 860  FNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDFLSATRGSRPSYAWRSILFGRELLMQ 919

Query: 121  RFEFRIGKGD-TSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQ--DNKWNLQRLYTN 177
                 IG G  T VW   W   G     +      +V+L +  LI      WNL  L   
Sbjct: 920  GLRTVIGNGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLKVSQLIDPTSRNWNLNMLRDL 979

Query: 178  LPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHN---------PI 228
             P    +  L  +P     +ED++ W    +G YSV+  YE+++   H+         P 
Sbjct: 980  FPWKDVEIILKQRPLFF--KEDSFCWLHSHNGLYSVKTGYEFLSKQVHHRLYQEAKVKPS 1037

Query: 229  EDRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCL 288
             +   + +W L    KIR+F W+ LH AIPV +      + SD  C  C    E   H L
Sbjct: 1038 VNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRSDDGCLMCDTENETINHIL 1097

Query: 289  RDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGV---------HGIQFLAG--VWGA 337
             +C  +  +W  +  ++     ++N  S +++M+R +         H ++F++   +W  
Sbjct: 1098 FECPLARQVW-AITHLSSAGSEFSN--SVYTNMSRLIDLTQQNDLPHHLRFVSPWILWFL 1154

Query: 338  WKWRCNWLLDSQRWPIEVVWRRIAHDHDDW--AWCAPSND--LLLCHPWSPPPPDTVKCN 393
            WK R   L + +      +  +    + +W  A     ND   L    W PP P  +KCN
Sbjct: 1155 WKNRNALLFEGKGSITTTLVDKAYEAYHEWFSAQTHMQNDEKHLKITKWCPPLPGELKCN 1214

Query: 394  SDGSFREDVQRMGGVGVIRDHQGR 417
               ++ +     G   V+RD QG+
Sbjct: 1215 IGFAWSKQHHFSGASWVVRDSQGK 1238


>gb|AAT38702.1| putative RNase H domain containing protein [Solanum demissum]
          Length = 722

 Score =  108 bits (269), Expect = 6e-22
 Identities = 108/494 (21%), Positives = 207/494 (41%), Gaps = 57/494 (11%)

Query: 69  AVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHD--SPIWKGILWARDMIDQRFEFRI 126
           A C  +++ D LW  +   K +  + +H V   +    S  W  +L  R  ++    + I
Sbjct: 26  AKCTDLERKD-LWASLEATKRIYCSRVHPVAKAKSSKQSHTWSKMLKIRHSVENNILWII 84

Query: 127 GKGDTSVWYQDWSGIGIIANQI-PFVHISDVNLTLCDLIQDNKWNLQRLYTNLPHSLQQQ 185
             G+ S+W+ +W G G ++N + P  H +  N+   D I   +W+  +L   LP  +  Q
Sbjct: 85  YAGNVSMWWDNWMGNGALSNILPPPSHYNKDNVK--DFIHKREWDFDKLSDILPPQVVNQ 142

Query: 186 FLAVQPQICMNREDAWIWKDGSSGRYSVRDAY-EWINHLAHNPIEDRKLNWVWKLRVPEK 244
            +++ P    N+ D  IW    +G ++ + AY +  N    N +     N +W  + P K
Sbjct: 143 IVSI-PIGDPNQSDYAIWIPSENGHFTTKSAYVDCSNTREKNDMR----NKIWHGKFPFK 197

Query: 245 IRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGL-HCLRDCSFS*DLWRRMGA 303
           +   TW+++ N +P  +   +     D+ C  C N+  + + H   +   +  LW++ G 
Sbjct: 198 MSFLTWRLVQNKLPFYDTVGKFVDNIDSNCVCCKNMKTETINHVFLNSDVASYLWKKFGG 257

Query: 304 INWRNFRYNNIIS-----WFSSMARGVHGIQF----LAGVWGAWKWRCNWLLDSQRWPIE 354
               + R ++ I+     W       +H +      +   W  WK RC      Q+    
Sbjct: 258 TLGIDTRASSTINLLKTWWNVQTHNSIHNVIIHTLPILIFWEIWKRRCACKYGDQK---- 313

Query: 355 VVWRRIAHDHDDW-----------------AWCAPSNDLLLCHP--------WSPPPPDT 389
            +W R   +H  W                 +W    N +    P        W+ P  + 
Sbjct: 314 KMWYRTMENHVWWNLKMSLRMTFPSFEIGNSWRDLLNKVESLRPYPKWKIVHWNTPNINC 373

Query: 390 VKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNR 449
           VK N+DGSF      +G   ++RDH  R +    +  +  +   AEA A    +     +
Sbjct: 374 VKINTDGSFSSGNAGLG--WIVRDHTRRMIMAFSIPSSCSSNNLAEALAARFGILWCLQQ 431

Query: 450 GYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVAD 509
           G+     +++   +V ++   +A  +  +  V+  I Q++A+  +  +N  +R++N VAD
Sbjct: 432 GFHNCYLELDSKLVVDMVRNGQATNLKIK-GVVEDIIQVVAK-MNCEVNHCYREANQVAD 489

Query: 510 HLVRRGAAAMSSES 523
            L +   A +S+E+
Sbjct: 490 ALAKH--AVISNEA 501


>gb|AAD21778.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana] gi|25410938|pir||G84429 hypothetical protein
            At2g01840 [imported] - Arabidopsis thaliana
          Length = 1715

 Score =  107 bits (268), Expect = 7e-22
 Identities = 120/523 (22%), Positives = 217/523 (40%), Gaps = 70/523 (13%)

Query: 51   WRLRHQGD------E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHD 104
            W   ++GD        F   ++ K   +++     L  R+ +  Y  +T+  +     H 
Sbjct: 1189 WGKENEGDLGFKDLHQFNRALLAKQAWRILTNPQSLLARLYKGLYYPNTTYLRANKGGHA 1248

Query: 105  SPIWKGILWARDMIDQRFEFRIGKGDTS-VWYQDWSGIGIIANQIPFVHISDVNLTLCDL 163
            S  W  I   + ++ Q    R+G G T+ +W   W  +  +  +     I D ++ + DL
Sbjct: 1249 SYGWNSIQEGKLLLQQGLRVRLGDGQTTKIWEDPW--LPTLPPRPARGPILDEDMKVADL 1306

Query: 164  IQDNK--WNLQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWIN 221
             ++NK  W+   ++  + +   QQ             D++ W    + +Y+VR  Y    
Sbjct: 1307 WRENKREWD-PVIFEGVLNPEDQQLAKSLYLSNYAARDSYKWAYTRNTQYTVRSGYWVAT 1365

Query: 222  HL------AHNPIE-DRKLNW-VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDAT 273
            H+        NP+E D  L   +W+L++  KI+ F W+ L  A+         ++ +D T
Sbjct: 1366 HVNLTEEEIINPLEGDVPLKQEIWRLKITPKIKHFIWRCLSGALSTTTQLRNRNIPADPT 1425

Query: 274  CARCGNVVEDGLHCLRDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGV-------- 325
            C RC N  E   H +  CS++  +WR        NF  +N + +  ++   +        
Sbjct: 1426 CQRCCNADETINHIIFTCSYAQVVWRS------ANFSGSNRLCFTDNLEENIRLILQGKK 1479

Query: 326  -------HGIQFLAGVWGAWKWRCNWLLDS-QRWPIEVVWRRIAHDHDDWAW-------- 369
                   +G+     +W  WK R  +L     R+P +V  ++   +  +W          
Sbjct: 1480 NQNLPILNGLMPFWIMWRLWKSRNEYLFQQLDRFPWKVA-QKAEQEATEWVETMVNDTAI 1538

Query: 370  ---CAPSND--LLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGRWV-AGCY 423
                A SND  L     WS PP   +KCN D  + +         ++RD  GR + +GC 
Sbjct: 1539 SHNTAQSNDRPLSRSKQWSSPPEGFLKCNFDSGYVQGRDYTSTGWILRDCNGRVLHSGCA 1598

Query: 424  LGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLH 483
              + + +A +AEA   L  L++ W RGY  +  + +   L  ++ + E      + H+L 
Sbjct: 1599 KLQQSYSALQAEALGFLHALQMVWIRGYCYVWFEGDNLELTNLINKTE------DHHLLE 1652

Query: 484  SITQLLARDWHVR-----INSVHRDSNAVADHLVRRGAAAMSS 521
            ++   + R W  +     I  V+R+ N  AD L +  A +MSS
Sbjct: 1653 TLLYDI-RFWMTKLPFSSIGYVNRERNLAADKLTKY-ANSMSS 1693


>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana] gi|25412331|pir||G84649 hypothetical protein
            At2g25550 [imported] - Arabidopsis thaliana
          Length = 1750

 Score =  104 bits (259), Expect = 8e-21
 Identities = 119/493 (24%), Positives = 197/493 (39%), Gaps = 57/493 (11%)

Query: 65   VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQRFEF 124
            ++ K   ++I+  + L+ RV++ +Y +D SI   +  +  S  W  +L    ++ +    
Sbjct: 1247 LLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASLLDGIALLKKGTRH 1306

Query: 125  RIGKGDTSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDNKWNLQRLYTNLPHSLQQ 184
             IG G          G+  I +  P   ++    T  ++  +N +  +  Y     S   
Sbjct: 1307 LIGDGQNIR-----IGLDNIVDSHPPRPLNTEE-TYKEMTINNLFERKGSYYFWDDSKIS 1360

Query: 185  QFLAVQPQICMNR--------EDAWIWKDGSSGRYSVRDAYEWINH------LAHNPIE- 229
            QF+       ++R         D  IW   ++G Y+VR  Y  + H       A NP   
Sbjct: 1361 QFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHG 1420

Query: 230  --DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHC 287
              D K   +W L +  K++ F W+ L  A+   E      +  D +C RC    E   H 
Sbjct: 1421 SIDLKTR-IWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHA 1479

Query: 288  LRDCSFS*DLWRRMGAINWRN------FRYN--NIISWFSSMARG-VHGIQFLAGVWGAW 338
            L  C F+   WR   +   RN      F  N  NI+++         H +  +  +W  W
Sbjct: 1480 LFTCPFATMAWRLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIW 1539

Query: 339  KWRCNWLLDSQR-WPIEVVWRRIAHDHDDWAWC--------APSNDLLLCH---PWSPPP 386
            K R N + +  R  P + V    A  HD   W          PS    +      W  PP
Sbjct: 1540 KARNNVVFNKFRESPSKTVLSAKAETHD---WLNATQSHKKTPSPTRQIAENKIEWRNPP 1596

Query: 387  PDTVKCNSDGSFREDVQRMGGVG--VIRDHQGRWVA-GCYLGEAAGNAFRAEAKALLDVL 443
               VKCN D  F  DVQ++   G  +IR+H G  ++ G        N   AE KALL  L
Sbjct: 1597 ATYVKCNFDAGF--DVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAAL 1654

Query: 444  ELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEF-HVLHSITQLLARDWHVRINSVHR 502
            +  W RGY+++  + +C  L+ ++     +  HS   + L  I+    +   ++   + +
Sbjct: 1655 QQTWIRGYTQVFMEGDCQTLINLI---NGISFHSSLANHLEDISFWANKFASIQFGFIRK 1711

Query: 503  DSNAVADHLVRRG 515
              N +A  L + G
Sbjct: 1712 KGNKLAHVLAKYG 1724


>gb|AAF18538.1| Very similar to retrotransposon reverse transcriptase [Arabidopsis
            thaliana] gi|25518314|pir||A86359 hypothetical protein
            F12K8.9 - Arabidopsis thaliana
          Length = 1231

 Score =  103 bits (256), Expect = 2e-20
 Identities = 102/403 (25%), Positives = 166/403 (40%), Gaps = 57/403 (14%)

Query: 61   F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
            F   ++ K   ++++  D L+ R+++ +Y               S  W+ IL  RD++ +
Sbjct: 709  FNQALLAKQAWRLLQFPDCLFARLIKSRYFPVGEFLDSDVGSRPSFGWRSILHGRDLLCR 768

Query: 121  RFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVH--ISDVNLTLCDLIQDNK--WNLQRLY 175
                R+G G +  VW   W     +  + P++   I +V+L + DLI   K  W L +L 
Sbjct: 769  GLVKRVGNGKSIRVWIDYWLDDNGL--RAPWIKNPIINVDLLVSDLIDYEKRDWRLDKLE 826

Query: 176  TNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWI------NHLAHNPIE 229
                     +    +P + +  +D WIWK   SG YSV+  Y W+        +A   + 
Sbjct: 827  EQFFPDDVVKIRENRPVVSL--DDFWIWKHNKSGDYSVKLGY-WLASNQNLGQVAIEAMM 883

Query: 230  DRKLN----WVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGL 285
               LN     VWKL+   KI++F W+VL  AIPV +L     +  D+ C  CG   E   
Sbjct: 884  QPSLNDLKTQVWKLQTEPKIKVFLWKVLSGAIPVVDLLSYRGMKLDSRCQTCGCEGESIQ 943

Query: 286  HCLRDCSFS*DLWR-----------RMGAI--NWRNFRYN-NIISWFSSMARGVHGIQFL 331
            H L  CSF   +W              G++  N  +F  N + + W   + R    I   
Sbjct: 944  HVLFSCSFPRQVWAMSNIHVPLLGFECGSVYANLYHFLINRDNLKWPVELRRSFPWI--- 1000

Query: 332  AGVWGAWKWRCNWLLDSQRWPIEVVWRRIAHDHDDWAWC----------APSNDLLLCHP 381
              +W  WK R  +  + +R+ +     ++  D +DW                +D  +  P
Sbjct: 1001 --IWRIWKNRNLFFFEGKRFTVLETILKVRKDVEDWFAAQVVEKERRAEVGQSDQQVFSP 1058

Query: 382  --------WSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQG 416
                    W PPP D VKCN   S+    +  G   V+R+ +G
Sbjct: 1059 RNVSPVVRWLPPPTDWVKCNVGLSWSRRNRLAGVAWVLRNDRG 1101


>gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana] gi|25408166|pir||G84721 hypothetical protein
            At2g31520 [imported] - Arabidopsis thaliana
          Length = 1524

 Score =  101 bits (252), Expect = 5e-20
 Identities = 119/493 (24%), Positives = 195/493 (39%), Gaps = 57/493 (11%)

Query: 65   VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQRFEF 124
            ++ K   ++I+  + L+ RV++ +Y +D SI   +  +  S  W  +L    ++ +    
Sbjct: 1021 LLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASLLDGIALLKKGTRH 1080

Query: 125  RIGKGDTSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDNKWNLQRLYTNLPHSLQQ 184
             IG G          G+  I +  P   ++    T  ++  +N +  +  Y     S   
Sbjct: 1081 LIGDGQNIR-----IGLDNIVDSHPPRPLNTEE-TYKEMTINNLFERKGSYYFWDDSKIS 1134

Query: 185  QFLAVQPQICMNR--------EDAWIWKDGSSGRYSVRDAYEWINH------LAHNPIE- 229
            QF+       ++R         D  IW   ++G Y+VR  Y  + H       A NP   
Sbjct: 1135 QFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHG 1194

Query: 230  --DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHC 287
              D K   +W L +  K++ F W+ L  A+   E      +  D  C RC    E   H 
Sbjct: 1195 SIDLKTR-IWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPICPRCHRENESINHA 1253

Query: 288  LRDCSFS*DLWRRMGAINWRN------FRYN--NIISWFSSMARG-VHGIQFLAGVWGAW 338
            L  C F+   W    +   RN      F  N  NI+++         H +  +  +W  W
Sbjct: 1254 LFTCPFATMAWWLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIW 1313

Query: 339  KWRCNWLLDSQR-WPIEVVWRRIAHDHDDWAWC--------APSNDLLLCH---PWSPPP 386
            K R N + +  R  P + V    A  HD   W          PS    +      W  PP
Sbjct: 1314 KARNNVVFNKFRESPSKTVLSAKAETHD---WLNATQSHKKTPSPTRQIAENKIEWRNPP 1370

Query: 387  PDTVKCNSDGSFREDVQRMGGVG--VIRDHQGRWVA-GCYLGEAAGNAFRAEAKALLDVL 443
               VKCN D  F  DVQ++   G  +IR+H G  ++ G        N   AE KALL  L
Sbjct: 1371 ATYVKCNFDAGF--DVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAAL 1428

Query: 444  ELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEF-HVLHSITQLLARDWHVRINSVHR 502
            +  W RGY+++  + +C  L+ ++     +  HS   + L  I+    +   ++   + R
Sbjct: 1429 QQTWIRGYTQVFMEGDCQTLINLI---NGISFHSSLANHLEDISFWANKFASIQFGFIRR 1485

Query: 503  DSNAVADHLVRRG 515
              N +A  L + G
Sbjct: 1486 KGNKLAHVLAKYG 1498


>emb|CAB78094.1| RNA-directed DNA polymerase-like protein [Arabidopsis thaliana]
            gi|4538901|emb|CAB39638.1| RNA-directed DNA
            polymerase-like protein [Arabidopsis thaliana]
            gi|7485606|pir||T04018 hypothetical protein F17A8.60 -
            Arabidopsis thaliana
          Length = 1274

 Score =  101 bits (252), Expect = 5e-20
 Identities = 114/491 (23%), Positives = 204/491 (41%), Gaps = 51/491 (10%)

Query: 65   VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAH-QHDSPIWKGILWARDMIDQRFE 123
            +  K   +++K+   L  RVL  KY   +S     A     S  W+GIL  RD++ +   
Sbjct: 799  IEAKLSWRILKEPHSLLSRVLLGKYCNTSSFMDCSASPSFASHGWRGILAGRDLLRKGLG 858

Query: 124  FRIGKGDT-SVWYQDWSGIGIIANQIPFVHISDVN--LTLCDLIQDN--KWNLQRLYTNL 178
            + IG+GD+ +VW + W  +   + Q P    ++ N  L++ DLI  +   WN++ +  +L
Sbjct: 859  WSIGQGDSINVWTEAW--LSPSSPQTPIGPPTETNKDLSVHDLICHDVKSWNVEAIRKHL 916

Query: 179  PHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHNPIEDRKLNW--- 235
            P   + Q   +     +  +D+ +W    SG Y+ +  Y  +  L   P      NW   
Sbjct: 917  PQ-YEDQIRKITIN-ALPLQDSLVWLPVKSGEYTTKTGYA-LAKLNSFPASQLDFNWQKN 973

Query: 236  VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS* 295
            +WK+    K++ F W+ +  A+PV E   R ++ ++ TC RCG   E  LH +  C ++ 
Sbjct: 974  IWKIHTSPKVKHFLWKAMKGALPVGEALSRRNIEAEVTCKRCGQ-TESSLHLMLLCPYAK 1032

Query: 296  DLWRRMGAINWRNFRYNNIISWFSSMARGVHGIQFLAG---------------VWGAWKW 340
             +W     +      +N   +  SS+A  +   + +                 +W  WK 
Sbjct: 1033 KVWELAPVL------FNPSEATHSSVALLLVDAKRMVALPPTGLGSAPLYPWLLWHLWKA 1086

Query: 341  RCNWLLDSQRWPIEVVWRRIAHDHDDWAWCAPSNDLLLCHP-----WSPPPPD--TVKCN 393
            R   + D+     E +  +   D   W        LL+ HP     +  P P+     C 
Sbjct: 1087 RNRLIFDNHSCSEEGLVLKAILDARAWM----EAQLLIHHPSPISDYPSPTPNLKVTSCF 1142

Query: 394  SDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSR 453
             D ++        G  +   ++ +           G+A  AE  A+   L  A + G  +
Sbjct: 1143 VDAAWTTSGYCGMGWFLQDPYKVKIKENQSSSSFVGSALMAETLAVHLALVDALSTGVRQ 1202

Query: 454  LICDVNCDNLVTILVEAEA-VQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHLV 512
            L    +C  L+++L   ++ V++     +LH I +L     H+    + R SN VAD L 
Sbjct: 1203 LNVFSDCKELISLLNSGKSIVELRG---LLHDIRELSVSFTHLCFFFIPRLSNVVADSLA 1259

Query: 513  RRGAAAMSSES 523
            +   + + S S
Sbjct: 1260 KSALSVILSSS 1270


>pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW15) - Arabidopsis
            thaliana retrotransposon Ta11-1 gi|976278|gb|AAA75254.1|
            reverse transcriptase
          Length = 1333

 Score = 99.8 bits (247), Expect = 2e-19
 Identities = 98/405 (24%), Positives = 158/405 (38%), Gaps = 63/405 (15%)

Query: 59   E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMI 118
            E F   ++ K   ++++  + L+ R  + +Y  +      +     S  W+ IL  RD++
Sbjct: 861  ESFNQALLAKQAWRLLQFPNSLFARFFKSRYYDEEDFLDAELKATPSYAWRSILHGRDLL 920

Query: 119  DQRFEFRIGKGD-TSVWYQDWSGIGIIAN--QIPFVHISDVNLTLC--DLIQDNKWNLQR 173
             + F  ++G G  TSVW   W    I  N  ++P      VNL L   DLI       +R
Sbjct: 921  IKGFRKKVGNGSSTSVWMDPW----IYDNDPRLPLQKHFSVNLDLRVHDLINVEDRCRRR 976

Query: 174  LYTNLPHSLQQQFLAVQPQICMNR------EDAWIWKDGSSGRYSVRDAY---------E 218
                    L++ F     +I + R      +D W+W    SG YSV+  Y         E
Sbjct: 977  ------DRLEELFYPADIEIIVKRNPVVSMDDFWVWLHSKSGEYSVKSGYWLAFQTNKPE 1030

Query: 219  WINHLAHNPIEDRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCG 278
             I      P  +     +W      KI++F W++L +A+PV    +R  +  D  C  CG
Sbjct: 1031 LIREARVQPSTNGLKEKIWSTLTSPKIKLFLWRILSSALPVAYQIIRRGMPIDPRCQVCG 1090

Query: 279  NVVEDGLHCLRDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGVHGIQFLAG----- 333
               E   H L  CS +  +W   G +    F + N     SS+   +  +  L G     
Sbjct: 1091 EEGESINHVLFTCSLARQVWALSG-VPTSQFGFQN-----SSIFANIQYLLELKGKGLIP 1144

Query: 334  ----------VWGAWKWRCNWLLDSQRW-PIEVVWRRIAHDHDDW------AWCAPSNDL 376
                      +W  WK R     +   + P++ +  +I  D  +W           + + 
Sbjct: 1145 EQIKKSWPWVLWRLWKNRDKLFFEGTIFSPLKSI-EKIRDDVQEWFLAQALVASVDAGET 1203

Query: 377  LLCHP----WSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGR 417
            +   P    W PPP   VKCN  G +    +  GG  V+RD  G+
Sbjct: 1204 VCSAPCPSSWEPPPLGWVKCNISGVWSGKKRVCGGAWVLRDDHGK 1248


>gb|AAD32950.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
           thaliana] gi|25411805|pir||C84554 hypothetical protein
           At2g17610 [imported] - Arabidopsis thaliana
          Length = 773

 Score = 99.0 bits (245), Expect = 3e-19
 Identities = 78/327 (23%), Positives = 132/327 (39%), Gaps = 23/327 (7%)

Query: 61  F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
           F   ++ K   +++++   L  RV + KY     +   +A    S  WK IL    +I +
Sbjct: 316 FNIALLAKQSWRILQQPFSLMARVFKAKYFPKERLLDAKATSQSSYAWKSILHGTKLISR 375

Query: 121 RFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDNKWNLQRLYTNLP 179
             ++  G G+   +W  +W  +      +         L + DL+ + +WN   L   + 
Sbjct: 376 GLKYIAGNGNNIQLWKDNWLPLNPPRPPVGTCDSIYSQLKVSDLLIEGRWNEDLLCKLIH 435

Query: 180 HSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAH---------NPIED 230
            +      A++P I     DA  W     G YSV+  Y  +  L+          N +  
Sbjct: 436 QNDIPHIRAIRPSIT-GANDAITWIYTHDGNYSVKSGYHLLRKLSQQQHASLPSPNEVSA 494

Query: 231 RKL-NWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLR 289
           + +   +WK   P KI+ F W+  HNA+P      R  L +D TC RCG   ED  H L 
Sbjct: 495 QTVFTNIWKQNAPPKIKHFWWRSAHNALPTAGNLKRRRLITDDTCQRCGEASEDVNHLLF 554

Query: 290 DCSFS*DLWRRM-------GAINWRNFRYN--NIISWFSSMARGVHGIQFLAGVWGAWKW 340
            C  S ++W +         ++   +F  N  +I     S  + V    F+   W  WK 
Sbjct: 555 QCRVSKEIWEQAHIKLCPGDSLMSNSFNQNLESIQKLNQSARKDVSLFPFIG--WRIWKM 612

Query: 341 RCNWLLDSQRWPIEVVWRRIAHDHDDW 367
           R + + +++RW I    ++   D   W
Sbjct: 613 RNDLIFNNKRWSIPDSIQKALIDQQQW 639


>gb|AAF23283.1| putative non-LTR reverse transcriptase [Arabidopsis thaliana]
           gi|15232695|ref|NP_187562.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 484

 Score = 98.2 bits (243), Expect = 6e-19
 Identities = 95/353 (26%), Positives = 144/353 (39%), Gaps = 43/353 (12%)

Query: 197 REDAWIWKDGSSGRYSVRDAYEWINH------LAHNPIE---DRKLNWVWKLRVPEKIRM 247
           + D  IW   ++G Y+VR  Y  + H       A NP     D K   +W L +  K++ 
Sbjct: 115 KPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTR-IWNLPIMPKLKH 173

Query: 248 FTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRRMGAINWR 307
           F W+ L  A+   E      +  D +C RC    E   H L  C F+   WR   +   R
Sbjct: 174 FLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSLIR 233

Query: 308 N------FRYN--NIISWFSSMARG-VHGIQFLAGVWGAWKWRCNWLLDSQR-WPIEVVW 357
           N      F  N  NI+++         H +  +  +W  WK R N + +  R  P + V 
Sbjct: 234 NQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVL 293

Query: 358 RRIAHDHDDWAWC--------APSNDLLLCH---PWSPPPPDTVKCNSDGSFREDVQRMG 406
              A  HD   W          PS    +      W  PP   VKCN D  F  DVQ++ 
Sbjct: 294 SAKAETHD---WLNATQSHKKTPSPTRQIAENKIEWRNPPATYVKCNFDAGF--DVQKLE 348

Query: 407 GVG--VIRDHQGRWVA-GCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNL 463
             G  +IR+H G  ++ G        N   AE KALL  L+  W RGY+++  + +C  L
Sbjct: 349 ATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTL 408

Query: 464 VTILVEAEAVQMHSEF-HVLHSITQLLARDWHVRINSVHRDSNAVADHLVRRG 515
           + ++     +  HS   + L  I+    +   ++   + R  N +A  L + G
Sbjct: 409 INLI---NGISFHSSLANHLEDISFWANKFASIQFGFIRRKGNKLAHVLAKYG 458


>gb|AAP54692.1| putative reverse transcriptase [Oryza sativa (japonica
            cultivar-group)] gi|37536206|ref|NP_922405.1| putative
            reverse transcriptase [Oryza sativa (japonica
            cultivar-group)] gi|27311287|gb|AAO00713.1|
            retrotransposon protein, putative, unclassified [Oryza
            sativa (japonica cultivar-group)]
          Length = 1557

 Score = 87.0 bits (214), Expect = 1e-15
 Identities = 80/302 (26%), Positives = 129/302 (42%), Gaps = 34/302 (11%)

Query: 61   F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
            F   ++ +   ++I   D L  RVL+ KY  + SI       + SP W+ I    +++ +
Sbjct: 1230 FNQALLARQAWRLIDNPDSLCARVLKAKYYPNGSIVDTSFGGNASPGWQAIEHGLELVKK 1289

Query: 121  RFEFRIGKG-DTSVWYQDWSGIGIIANQIPFVHISDVNLT-LCDLIQDN-KWNLQRLYTN 177
               +RIG G    VW   W    +  ++ P    ++  +  + DL+ DN  W+  ++   
Sbjct: 1290 GIIWRIGNGRSVRVWQDPWLPRDL--SRRPITPKNNCRIKWVADLMLDNGMWDANKI--- 1344

Query: 178  LPHSLQQQFLAVQPQICM-------NREDAWIWKDGSSGRYSVRDAYE----WINHLAHN 226
                  Q FL V  +I +       + ED   W     G +SVR AY     W    A +
Sbjct: 1345 -----NQIFLPVDVEIILKLRTSSRDEEDFIAWHPDKLGNFSVRTAYRLAENWAKEEASS 1399

Query: 227  PIEDRKLN--W--VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVE 282
               D  +   W  +WK  VP K+++FTW+   N +P  +   + +L    TC  CG   E
Sbjct: 1400 SSSDVNIRKAWELLWKCNVPSKVKIFTWRATSNCLPTWDNKKKRNLEISDTCVICGMEKE 1459

Query: 283  DGLHCLRDCSFS*DLWRRMGAINWRNFRYNNII---SW-FSSMARGVHGIQ--FLAGVWG 336
            D +H L  C  +  LW  M   N  + R ++ +   SW F+ +A      Q  FL  +W 
Sbjct: 1460 DTMHALCRCPQAKHLWLAMKESNDLSLRMDDHLLGPSWLFNRLALLPDHEQPMFLMVLWR 1519

Query: 337  AW 338
             W
Sbjct: 1520 IW 1521


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.332    0.143    0.506 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 987,946,954
Number of Sequences: 2540612
Number of extensions: 42533091
Number of successful extensions: 98759
Number of sequences better than 10.0: 279
Number of HSP's better than 10.0 without gapping: 108
Number of HSP's successfully gapped in prelim test: 171
Number of HSP's that attempted gapping in prelim test: 98258
Number of HSP's gapped (non-prelim): 353
length of query: 547
length of database: 863,360,394
effective HSP length: 133
effective length of query: 414
effective length of database: 525,458,998
effective search space: 217540025172
effective search space used: 217540025172
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 78 (34.7 bits)


Lotus: description of TM0299.5