Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0378b.9
         (768 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAL66754.1| putative copia-like retrotransposon Hopscotch pol...   674  0.0
gb|AAK51235.1| polyprotein [Arabidopsis thaliana]                     625  e-177
ref|XP_507316.1| PREDICTED P0623F08.22 gene product [Oryza sativ...   621  e-176
ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (jap...   615  e-174
emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|32692...   612  e-173
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja...   608  e-172
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia...   592  e-167
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   588  e-166
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]         585  e-165
gb|AAD43604.1| T3P18.3 [Arabidopsis thaliana] gi|25301688|pir||H...   584  e-165
emb|CAE05956.3| OSJNBb0088C09.15 [Oryza sativa (japonica cultiva...   578  e-163
gb|AAW56918.1| putative polyprotein [Oryza sativa (japonica cult...   570  e-161
gb|AAT40550.1| putative receptor kinase [Solanum demissum]            537  e-151
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]               535  e-150
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi...   529  e-148
pir||E96608 probable retroelement polyprotein F25P12.89 [importe...   528  e-148
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ...   526  e-147
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   525  e-147
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...   515  e-144
pir||G86301 probable retroelement polyprotein [imported] - Arabi...   513  e-144

>gb|AAL66754.1| putative copia-like retrotransposon Hopscotch polyprotein [Zea mays]
          Length = 1313

 Score =  674 bits (1740), Expect = 0.0
 Identities = 376/850 (44%), Positives = 508/850 (59%), Gaps = 105/850 (12%)

Query: 2    FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
            F  + GI H ++CP+ HQQNG  E KHRHI E GL+LLAHA MPL FW +AF  AAYLIN
Sbjct: 441  FFNKIGISHLVSCPHAHQQNGSAERKHRHIVEVGLSLLAHASMPLKFWDEAFLAAAYLIN 500

Query: 62   QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
            + P   L+L TP ++L+ K+PDYS L+ FGC+C+P+ RPYN+HKL FRS  C FLGYS +
Sbjct: 501  RTPTKILNLDTPFERLFHKQPDYSVLRIFGCVCWPNLRPYNSHKLQFRSKQCVFLGYSSL 560

Query: 122  HKGYNCL-TTEGKVIITRNVLFDETVFPFQFQTSKS-----------SSDIL-------- 161
            HKG+ CL  + G+V ++R+V FDE  FPF    S +           S D+L        
Sbjct: 561  HKGFKCLDVSTGRVYVSRDVTFDENFFPFASLHSNAGARLRSEIQLLSPDLLNPATFNSG 620

Query: 162  LDSSLPTTSVIPLLP---------------------ISNASPNVMPHANSV-MPTVAT-N 198
            +DS +   + +   P                     ++ + PN  P A++V +P  A+ +
Sbjct: 621  VDSLIDHAADMSTDPNQISGDNSVQDQAEHTPNMDVLAPSVPNTAPEADAVHIPGSASHS 680

Query: 199  GSAPVVQTTVSDFSQPA*DSGV-VHASAQ-----------------------PTEPPVPP 234
            GSAP   +T +       DS V  H SA                        PT    P 
Sbjct: 681  GSAPAAPSTAAPSPTCRGDSRVATHPSASHDYVASVDMSRGIAGREESFLTAPTTATSPA 740

Query: 235  SS---------------------NVHSM---TTRAKT----GIHRPKAYAAST------A 260
            SS                      V SM   +TR +T    G+ +PK Y   T       
Sbjct: 741  SSEPSGSLPATAADAQVPEPILGGVSSMDPASTRPRTRLQQGVRKPKVYTDGTIRYGCFT 800

Query: 261  LSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDG 320
             S  P     AL   +W  AM  EY ALM N TW LVP   GR+ IGCKWV++ K  +DG
Sbjct: 801  SSGEPYDLNEALGDVNWKDAMDIEYSALMKNKTWHLVPPKKGRNVIGCKWVYKIKRKADG 860

Query: 321  SINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLN 380
            S++++K RLVAKG+ Q  G DY++TFSPVVK  TIR +LS+A+   W + Q D  NAFL+
Sbjct: 861  SLDRYKARLVAKGYKQQYGIDYDDTFSPVVKHATIRIILSIAVSRGWSLCQLDVQNAFLH 920

Query: 381  GELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCD 439
            G L EEV+MQQPPG+   +  + VCKL KALYGLKQAPRAW+ +LS+ L   GFQA+K D
Sbjct: 921  GVLEEEVYMQQPPGYEDSTKLNYVCKLDKALYGLKQAPRAWYSRLSNKLLSLGFQASKAD 980

Query: 440  TSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEAT 499
            TSLF     S T+++LVYVDDII+  S+  A  +L++DLN  F LKDLG L+YFLGIE  
Sbjct: 981  TSLFFYNKGSVTIFVLVYVDDIIVASSTHKATEALLSDLNKEFALKDLGDLNYFLGIEVN 1040

Query: 500  HTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGA-LFPKPLL-YRSVV 557
              +D G+IL+Q KY +DLL K  M    P  TP+++  KLS + G+ L  K +  YRS+V
Sbjct: 1041 KVRD-GIILTQDKYASDLLKKVGMSDCKPISTPLSTSEKLSIHEGSPLGEKDITQYRSIV 1099

Query: 558  GALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSI 617
            GAL Y+T+TRP+IAF +NKVCQF+H+P   HW AVKRILRY+      GL++ R  S  +
Sbjct: 1100 GALQYLTLTRPDIAFSVNKVCQFLHAPTTLHWAAVKRILRYIKQCTNLGLHIHRSDSTLV 1159

Query: 618  TAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIM 677
            +AF+D+DW    DDR+ST G  +++G NLVSW+A+KQ  V+RSSTE+EY++LA    E++
Sbjct: 1160 SAFSDADWAGSVDDRKSTGGFAVFLGSNLVSWSARKQPTVSRSSTESEYKALANATAELI 1219

Query: 678  WLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTV 737
            W++ LL+E+ + +  ++ ++CDNL   +L ANPI H +TKH  +D HFVR+RVA K + +
Sbjct: 1220 WVQILLTEISIKSPRAAKLWCDNLGAKYLSANPIFHARTKHIEVDYHFVRDRVAKKLLDI 1279

Query: 738  VNIPAPSQVS 747
              +P   QV+
Sbjct: 1280 EYVPTGDQVA 1289


>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  625 bits (1612), Expect = e-177
 Identities = 345/776 (44%), Positives = 471/776 (60%), Gaps = 38/776 (4%)

Query: 3    LTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQ 62
            LT CGIQHR++CPY  QQNG  E KHRH  E GL+++ H+H PL FW +AF TA++L N 
Sbjct: 598  LTDCGIQHRISCPYTPQQNGIAERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASFLSNM 657

Query: 63   LPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMH 122
            LP P+L   +PL+ L  +KP+Y+ L+ FG  CYP  RP   HK   RS  C FLGY+  +
Sbjct: 658  LPSPSLGNVSPLEALLKQKPNYAMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYNSQY 717

Query: 123  KGYNCL-TTEGKVIITRNVLFDETVFPF----QFQTSKSSSDILL--DSSLPTT--SVIP 173
            KGY CL    G+V I+R+V+FDE  FPF    QF   +  S +L    SS+P    S+IP
Sbjct: 718  KGYRCLYPPTGRVYISRHVIFDEETFPFKQKYQFLVPQYESSLLSAWQSSIPQADQSLIP 777

Query: 174  LLPISNASPNVMPHA--------NSVMPTVATNG---------SAPVVQT-TVSDFSQPA 215
                        P +         +  P + T G         S    +T ++++ +   
Sbjct: 778  QAEEGKIESLAKPPSIQKNTIQDTTTQPAILTEGVLNEEEEEDSFEETETESLNEETHTQ 837

Query: 216  *DSGVVHASAQPTEPPVPPSSNVHSMTTRAKTGIHRPKA-YAASTALSSV--PTSAK*AL 272
             D   V    +  + P     N H MTTR+K GIH+    YA  T+  SV  P S   AL
Sbjct: 838  NDEAEVTVEEEVQQEP----ENTHPMTTRSKAGIHKSNTRYALLTSKFSVEEPKSIDEAL 893

Query: 273  SIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAK 332
            + P W+ A+ +E + +   +TW LV      + +GC+WVF+TK   DGS++K K RLVAK
Sbjct: 894  NHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNILGCRWVFKTKLKPDGSVDKLKARLVAK 953

Query: 333  GFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQP 392
            GFHQ EG DY ETFSPVV+  TIR VL +A    W I+Q D +NAFL+GEL E V+M QP
Sbjct: 954  GFHQEEGLDYLETFSPVVRTATIRLVLDVATAKGWNIKQLDVSNAFLHGELKEPVYMLQP 1013

Query: 393  PGF-SKGSSHLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSET 451
            PGF  +     VC+L KALYGLKQAPRAWFD +S+ L  FGF  +K D SLF      +T
Sbjct: 1014 PGFVDQEKPSYVCRLTKALYGLKQAPRAWFDTISNYLLDFGFSCSKSDPSLFTYHKNGKT 1073

Query: 452  LYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQT 511
            L +L+YVDDI++TGS    +  L+  LN  F +KDLG+  YFLG+E   + +G L L QT
Sbjct: 1074 LVLLLYVDDILLTGSDHNLLQELLMSLNKRFSMKDLGAPSYFLGVEIESSPEG-LFLHQT 1132

Query: 512  KYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIA 571
             Y  D+L +A M + N  PTP+   ++ + NS  LFP+P  +RS+ G L Y+TITRP+I 
Sbjct: 1133 AYAKDILHQAAMSNCNSMPTPLPQHIE-NLNSD-LFPEPTYFRSLAGKLQYLTITRPDIQ 1190

Query: 572  FPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDD 631
            F +N +CQ MHSP    +  +KRILRY+ GT   GL++++  ++S+ A++DSDW    + 
Sbjct: 1191 FAVNFICQRMHSPTTADFGLLKRILRYVKGTIHLGLHIKKNQNLSLVAYSDSDWAGCKET 1250

Query: 632  RRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTS 691
            RRST+G C  +G NL+SW+AK+Q  V++SSTEAEYR+L  V  E+ WL  LL ++ V  +
Sbjct: 1251 RRSTTGFCTLLGCNLISWSAKRQETVSKSSTEAEYRALTAVAQELTWLSFLLRDIGVTQT 1310

Query: 692  PSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
              +L+ CDNLS V+L ANP LH ++KHF  D H++RE+VA   V   +I A  Q++
Sbjct: 1311 HPTLVKCDNLSAVYLSANPALHNRSKHFDTDYHYIREQVALGLVETKHISATLQLA 1366


>ref|XP_507316.1| PREDICTED P0623F08.22 gene product [Oryza sativa (japonica
           cultivar-group)]
          Length = 821

 Score =  621 bits (1601), Expect = e-176
 Identities = 343/792 (43%), Positives = 462/792 (58%), Gaps = 89/792 (11%)

Query: 44  MPLTFWGDAFETAAYLINQLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNT 103
           MP+ FW +A   AAYLIN+ P   ++ + PL++L+ +KP+Y+ L+ FGC  +P+ RPYN 
Sbjct: 1   MPIKFWDEAVLAAAYLINRTPSKVINFACPLEQLFKEKPNYTALRIFGCAVWPNLRPYNK 60

Query: 104 HKLAFRSAPCTFLGYSPMHKGYNCLT-TEGKVIITRNVLFDETVFPFQFQTSKSSSDILL 162
           HKLAFRS  C FLGYS +HKG+ CL    G+V ++R+V FDE++FPF    S + + +  
Sbjct: 61  HKLAFRSKRCVFLGYSNLHKGFKCLEIATGRVYVSRDVTFDESIFPFSELHSNAGACLRA 120

Query: 163 DSSLPTTSVIPLLP--------------------------------ISNASPNVMPHANS 190
           + SL   S++P L                                 ++N   N    A+ 
Sbjct: 121 EISLLPPSLVPHLSSLGGEQNNHVLNYPPNVTDQFGEENAEIGEEIVANGEENAAAAADE 180

Query: 191 VMPTVATNG----------------SAPVVQTTVSDFSQ----PA*DSGVVHASAQP--- 227
                A  G                S+PV     +  ++    P  +  +V AS Q    
Sbjct: 181 NAAAAANGGAQDDVHGVAYDASPEHSSPVTDDATASAAEQHGNPIQEEHLVQASPQTASS 240

Query: 228 TEPPVPPSSNVHSMTT-----------------------RAKTGIHRPKAYAASTAL--- 261
           T P V  S+ VH   T                       R ++GI + K Y   T     
Sbjct: 241 TSPSVASSAGVHDDVTTDQSDQTDQAMPEAAVAPIRPKTRLQSGIRKEKVYTDGTVKWLN 300

Query: 262 ---SSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNS 318
              S  P S + A++  HW +AM  EY AL+ N TW LVP   GR+ I CKWV++ K  +
Sbjct: 301 FTSSGEPQSLEEAVNNKHWKEAMDAEYMALIENKTWHLVPPQKGRNVIDCKWVYKVKRKA 360

Query: 319 DGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAF 378
           DGS++++K RLVAKGF Q  G DY +TFSPVVK  TIR VLSLA+   W +RQ D  NAF
Sbjct: 361 DGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRIVLSLAVSRGWSLRQLDVKNAF 420

Query: 379 LNGELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAK 437
           L+G L EEV+M+QPPG+ K S  + VCKL KALYGLKQAPRAW+ +LS+ L+  GF  +K
Sbjct: 421 LHGVLEEEVYMEQPPGYEKKSMPNYVCKLDKALYGLKQAPRAWYSRLSTKLSELGFVPSK 480

Query: 438 CDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIE 497
            DTSLF       ++++L+YVDDII+  S   A  +L+ +L+  F LKDLG LHYFLGIE
Sbjct: 481 ADTSLFFYKKGQVSIFLLIYVDDIIMASSVPDATSTLLQELSKDFALKDLGDLHYFLGIE 540

Query: 498 ATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALF--PKPLLYRS 555
               KDG L+LSQ KY +DLL +  M    P  TP+++  KLS N G L        YRS
Sbjct: 541 VHKVKDG-LMLSQEKYASDLLRRVGMYECKPVSTPLSTSEKLSVNEGTLLGPQDSTQYRS 599

Query: 556 VVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSM 615
           VVGAL Y+T+TRP+I+F +NKVCQF+H+P   HW AVKRILRY+  T   GL   R  S+
Sbjct: 600 VVGALQYLTLTRPDISFSINKVCQFLHAPTTTHWAAVKRILRYVKYTVDTGLKFCRNPSL 659

Query: 616 SITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITE 675
            ++ F+D+DW   PDDRRST G  +++G NLVSW+A+KQ  V+RSSTEAEY++LA    E
Sbjct: 660 LVSGFSDADWAGSPDDRRSTGGFAVFLGPNLVSWSARKQATVSRSSTEAEYKALANATAE 719

Query: 676 IMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQV 735
           IMW+++LL EL V +  ++ ++CDNL   +L ANPI H +TKH  +D HFVRERVA K +
Sbjct: 720 IMWVQTLLQELGVESPRAAKLWCDNLGAKYLSANPIFHARTKHIEVDFHFVRERVARKLL 779

Query: 736 TVVNIPAPSQVS 747
            +  I    QV+
Sbjct: 780 EIAYISTKDQVA 791


>ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1090

 Score =  615 bits (1587), Expect = e-174
 Identities = 341/792 (43%), Positives = 473/792 (59%), Gaps = 46/792 (5%)

Query: 2    FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
            FL   G Q RL+CPY   QNG+ E   R I  +   LL  A MP ++W +A  TA YL+N
Sbjct: 261  FLASRGTQLRLSCPYTSPQNGKAERMLRTINNSIRTLLIQASMPPSYWAEALATATYLLN 320

Query: 62   QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
            + P  ++  S P Q L+   PD+S L+ FGCLCYP+      HKL+ RS  C FLGY   
Sbjct: 321  RRPSSSIHQSLPFQLLHRTIPDFSHLRVFGCLCYPNLSATTPHKLSPRSTACVFLGYPTS 380

Query: 122  HKGYNCLT-TEGKVIITRNVLFDETVFPFQFQTSKSSS-DILLDSSLPTTSVIPLLPISN 179
            HKGY CL  +  ++II+R+V+FDE+ FPF      +SS D LL    P  +  P L +  
Sbjct: 381  HKGYRCLDLSTHRIIISRHVVFDESQFPFAATPPAASSFDFLLQGLSPADA--PSLEVEQ 438

Query: 180  ASP-NVMPHANSVMP--------------TVAT---NGSAPVVQTTVSDFSQPA*D---- 217
              P  V P      P              TVA+   +  AP+V T+ +D + P       
Sbjct: 439  PRPLTVAPSTEVEQPYLPLPSRRLSAGTVTVASEAPSAGAPLVGTSSADATPPGSATRAS 498

Query: 218  ---SGVVHASAQPTEPPVPPSSNV-----------HSMTTRAKTGIHRPK---AYAASTA 260
               S   H   +     VPPSS+            HSM TR+++G  RP     Y A+ A
Sbjct: 499  TIVSPFRHVYTRRPVTTVPPSSSTAVTNAVAAPQPHSMVTRSQSGSLRPVDRLTYTATQA 558

Query: 261  LSS-VPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSD 319
             +S VP +   AL+ P+W  AM +EY+ L++N TW LV      +    KW+F+ K++SD
Sbjct: 559  AASPVPANYHSALADPNWRAAMADEYKELVDNGTWRLVSRPPRANIATGKWIFKHKFHSD 618

Query: 320  GSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFL 379
            GS+ ++K R V +G+ Q  G DY+ETFSPVVK  TIR VLS+A    WPI Q D  NAFL
Sbjct: 619  GSLARYKARWVVRGYSQQHGIDYDETFSPVVKLATIRVVLSIAASRAWPIHQLDVKNAFL 678

Query: 380  NGELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKC 438
            +G L E V+ QQP GF   ++   VC L+K+LYGLKQAPRAW+ + ++ + + GF  +  
Sbjct: 679  HGHLKETVYCQQPSGFVDPTAPDAVCLLQKSLYGLKQAPRAWYQRFATYIRQMGFMPSAS 738

Query: 439  DTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEA 498
            DTSLF         Y+L+YVDDII+T S++T +  L A L++ F + DLG LH+FLGI  
Sbjct: 739  DTSLFVYKDGDRIAYLLLYVDDIILTASTTTLLQQLTARLHSEFAMTDLGDLHFFLGISV 798

Query: 499  THTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLYRSVVG 558
              + DG L LSQ +Y  DLL +A M   + + TP+ +  KLSA  G     P  YRS+ G
Sbjct: 799  KRSPDG-LFLSQRQYAVDLLQRAGMAECHSTSTPVDTHAKLSATDGLPVADPSAYRSIAG 857

Query: 559  ALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSIT 618
            AL Y+T+TRP++A+ + +VC FMH P EPH   VKRILRY+ G+ + GL++      S+T
Sbjct: 858  ALQYLTLTRPDLAYAVQQVCLFMHDPREPHLALVKRILRYVKGSLSIGLHIGSGPIQSLT 917

Query: 619  AFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMW 678
            A++D+DW   P+ RRSTSG C+Y+G NLVSW++K+QT V+RSS EAEYR++A  + E  W
Sbjct: 918  AYSDADWAGCPNSRRSTSGYCVYLGDNLVSWSSKRQTTVSRSSAEAEYRAVAHAVAECCW 977

Query: 679  LRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTVV 738
            LR LL EL VP + ++++YCDN+S V++ ANP+ H +TKH  +D+HFVRE+VA  QV V+
Sbjct: 978  LRQLLQELHVPIASATIVYCDNVSAVYMTANPVHHRRTKHIEIDIHFVREKVALGQVRVL 1037

Query: 739  NIPAPSQVSSTL 750
             +P+  Q +  +
Sbjct: 1038 YVPSSHQFADIM 1049


>emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|3269282|emb|CAA19715.1|
            putative protein [Arabidopsis thaliana]
            gi|7444417|pir||T05745 hypothetical protein M4I22.20 -
            Arabidopsis thaliana
          Length = 1318

 Score =  612 bits (1578), Expect = e-173
 Identities = 347/789 (43%), Positives = 460/789 (57%), Gaps = 54/789 (6%)

Query: 7    GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
            GIQ +L+CP+  QQNG  E KHRH+ E GL++L  +H+P  FW +AF TA +LIN LP  
Sbjct: 414  GIQQQLSCPHTPQQNGLAERKHRHLVELGLSMLFQSHVPHKFWVEAFFTANFLINLLPTS 473

Query: 67   TLDLS-TPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGY 125
             L  S +P +KLYDKKPDY+ L+ FG  C+P  R Y  +K    S  C FLGY+  +KGY
Sbjct: 474  ALKESISPYEKLYDKKPDYTSLRSFGSACFPTLRDYAENKFNPCSLKCVFLGYNEKYKGY 533

Query: 126  NCL-TTEGKVIITRNVLFDETVFPFQF----------------------------QTSKS 156
             CL    G++ I+R+V+FDE+V+PF                               TS S
Sbjct: 534  RCLYPPTGRLYISRHVIFDESVYPFSHTYKHLHPQPRTPLLAAWLRSSDSPAPSTSTSPS 593

Query: 157  SSDILLDSS----LP--TTSVIP-LLPISNASPNVMPHANSV----MPTVATNGSAPVVQ 205
            S   L  S+    LP   T ++P L+PIS+ S     HA+++     P   +  +     
Sbjct: 594  SRSPLFTSADFPPLPQRKTPLLPTLVPISSVS-----HASNITTQQSPDFDSERTTDFDS 648

Query: 206  TTVSD---FSQPA*DSGVVHASAQPTEPPVPPSSNVHSMTTRAKTGIHRPK---AYAAST 259
             ++ D    SQ   DS      A         S+NVH M TRAK GI +P     + +  
Sbjct: 649  ASIGDSSHSSQAGSDSEETIQQASVNVHQTHASTNVHPMVTRAKVGISKPNPRYVFLSHK 708

Query: 260  ALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSD 319
                 P +   AL  P W  AM EE        TW LVP       +G KWVFRTK ++D
Sbjct: 709  VSYPEPKTVTAALKHPGWTGAMTEEIGNCSETQTWSLVPYKSDMHVLGSKWVFRTKLHAD 768

Query: 320  GSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFL 379
            G++NK K R+VAKGF Q EG DY ET+SPVV+  T+R VL LA    W I+Q D  NAFL
Sbjct: 769  GTLNKLKARIVAKGFLQEEGIDYLETYSPVVRTPTVRLVLHLATALNWDIKQMDVKNAFL 828

Query: 380  NGELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKC 438
            +G+L E V+M QP GF   S    VC L K++YGLKQ+PRAWFDK S+ L  FGF  +K 
Sbjct: 829  HGDLKETVYMTQPAGFVDPSKPDHVCLLHKSIYGLKQSPRAWFDKFSTFLLEFGFFCSKS 888

Query: 439  DTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEA 498
            D SLF     +  + +L+YVDD++ITG+SS  + SL+A LN  F + D+G LHYFLGI+ 
Sbjct: 889  DPSLFIYAHNNNLILLLLYVDDMVITGNSSQTLTSLLAALNKEFRMTDMGQLHYFLGIQ- 947

Query: 499  THTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLYRSVVG 558
               +  GL +SQ KY  DLLI A M+   P PTP+   L    +   LF  P  +RS+ G
Sbjct: 948  VQRQQNGLFMSQQKYAEDLLIAASMEHCTPLPTPLPVQLDRVPHQEELFSDPTYFRSIAG 1007

Query: 559  ALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSIT 618
             L Y+T+TRP+I F +N VCQ MH P    +  +KRILRY+ GT T G+   R S   + 
Sbjct: 1008 KLQYLTLTRPDIQFAVNFVCQKMHQPTISDFHLLKRILRYIKGTITMGISYSRDSPTLLQ 1067

Query: 619  AFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMW 678
            A++DSDWG+    RRS  G+C ++G NLVSW++KK   V+RSSTEAEY+SL+   +EI+W
Sbjct: 1068 AYSDSDWGNCKQTRRSVGGLCTFMGTNLVSWSSKKHPTVSRSSTEAEYKSLSDAASEILW 1127

Query: 679  LRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTVV 738
            L +LL ELR+P   +  ++CDNLS V+L ANP  H +TKHF +D HFVRERVA K + V 
Sbjct: 1128 LSTLLRELRIPLPDTPELFCDNLSAVYLTANPAFHARTKHFDIDFHFVRERVALKALVVK 1187

Query: 739  NIPAPSQVS 747
            +IP   Q++
Sbjct: 1188 HIPGSEQIA 1196


>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|37530764|ref|NP_919684.1| putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)]
          Length = 1803

 Score =  608 bits (1569), Expect = e-172
 Identities = 335/774 (43%), Positives = 460/774 (59%), Gaps = 44/774 (5%)

Query: 11   RLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIPTLDL 70
            RL+CPY  QQNG+ E   R I +    +L H+  PL+FW +A +TA +LIN+ P      
Sbjct: 645  RLSCPYSSQQNGKAERILRTINDCVRTMLVHSAAPLSFWAEALQTAMHLINRRPCRATGS 704

Query: 71   STPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYNCLTT 130
              P Q L    P Y  L+ FGCLCYP+T     HKL+ RS  C F+GY   H+GY C   
Sbjct: 705  LKPYQLLLGAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLACVFIGYPADHRGYRCYDM 764

Query: 131  EGKVIIT-RNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNASPNVMPHAN 189
              + + T R+V F E VFPF+            D+  P  S  P  P  +    ++    
Sbjct: 765  VSRRVFTSRHVTFVEDVFPFR------------DAPSPRPSAPP--PPDHGDDTIV---- 806

Query: 190  SVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTE----PPVPPSSNV------- 238
             ++P  A +   PV      D + P   +    +SA P      PP P +S+        
Sbjct: 807  -LLPAPAQHVVTPVGTAPAHDAASPPSPASSTPSSAAPAHDVAPPPSPETSSPASASPPR 865

Query: 239  HSMTTRAKTGIHRPK---AYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWE 295
            H+MTTRA+ GI +P    A  A++ LS  P+S + AL  P+W  AMQ E+ AL+ N TW 
Sbjct: 866  HAMTTRARAGISKPNPRYAMTATSTLSPTPSSVRVALRDPNWRAAMQAEFDALLANRTWT 925

Query: 296  LVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTI 355
            LVP   G   I  KWVF+TK ++DGS++K+K R V +GF+Q  G D+ ETFSPVVKP TI
Sbjct: 926  LVPRPPGARIITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPVVKPATI 985

Query: 356  RTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGSSHL-VCKLKKALYGLK 414
            RTVL+L    QWP  Q D +NAFL+G L E V  QQP GF   +    VC L ++LYGL+
Sbjct: 986  RTVLTLISSKQWPAHQLDVSNAFLHGHLQERVLCQQPTGFEDAARPADVCLLSRSLYGLR 1045

Query: 415  QAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSL 474
            QAPRAWF + +      GF  ++ D SLF     S+T Y+L+YVDD+I++ SSS+ +  +
Sbjct: 1046 QAPRAWFKRFADHATSLGFVQSRADPSLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRI 1105

Query: 475  IADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMT 534
            I  L A F +KD+G L YFLGIE   T DG  +LSQ+KY  D+L +A M +     TP  
Sbjct: 1106 IDRLQAEFKVKDMGPLKYFLGIEVQRTADG-FVLSQSKYATDVLERAGMANCKAVATPAD 1164

Query: 535  SGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKR 594
            +  KLS++ G LF     YRS+ GAL Y+T+TRP+IA+ + +VC  MH+P E H   +KR
Sbjct: 1165 AKPKLSSDEGPLFQDSSWYRSIAGALQYLTLTRPDIAYAVQQVCLHMHAPREAHVTLLKR 1224

Query: 595  ILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQ 654
            ILRY+ GTA  GL+LR  +S ++TAF+D+DW   PD RRSTSG CI++G +L+SW++K+Q
Sbjct: 1225 ILRYIKGTAAFGLHLRASTSPTLTAFSDADWAGCPDTRRSTSGFCIFLGDSLISWSSKRQ 1284

Query: 655  TVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHT 714
            T V+RSS EAEYR +A  + E  WLR LL EL      +++ YCDN+S V++  NP+ H 
Sbjct: 1285 TTVSRSSAEAEYRGVANAVAECTWLRQLLGELHCRVPQATIAYCDNISSVYMSKNPVHHK 1344

Query: 715  KTKHFALDLHFVRERVADKQVTVVNIPAPSQ--------VSSTLFHCFKTKLRV 760
            +TKH  LD+HFVRE+VA  ++ V+ IP+  Q        + S++F+ F+  L V
Sbjct: 1345 RTKHIELDIHFVREKVALGELRVLPIPSAHQFADVFTKGLPSSMFNEFRASLCV 1398


>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
            gi|4539447|emb|CAB40035.1| retrotransposon like protein
            [Arabidopsis thaliana] gi|7444419|pir||T04204
            hypothetical protein T4F9.150 - Arabidopsis thaliana
          Length = 1515

 Score =  592 bits (1527), Expect = e-167
 Identities = 337/808 (41%), Positives = 474/808 (57%), Gaps = 66/808 (8%)

Query: 3    LTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQ 62
            L  CGI+  ++CP+  QQNG  E +HR++TE GL+L+ H+ +P   W +AF T+ +L N 
Sbjct: 596  LASCGIKQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHKLWVEAFFTSNFLSNL 655

Query: 63   LPIPTL-DLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
            LP  TL D  +P + L+   P Y+ L+ FG  CYP+ RPY  +K   +S  C FLGY+  
Sbjct: 656  LPSSTLSDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKFDPKSLLCVFLGYNNK 715

Query: 122  HKGYNCL-TTEGKVIITRNVLFDETVFPF-----QFQT----------SKSSSDILLDSS 165
            +KGY CL    GKV I R+VLFDE  FP+     QFQT           K  S   L   
Sbjct: 716  YKGYRCLHPPTGKVYICRHVLFDERKFPYSDIYSQFQTISGSPLFTAWQKGFSSTALSRE 775

Query: 166  LPTTSV----IPLLPISNA-----SPNVMPHANS------------VMPTVATNGSAPVV 204
             P+T+V     P   +S++     +PN+   A +            V P+  T+ S P  
Sbjct: 776  TPSTNVEDIIFPSATVSSSVPTGCAPNIAETATAPDVDVAAAHDMVVPPSPITSTSLPTQ 835

Query: 205  -QTTVSDFSQPA*DSGVVHASAQPTEP---------PVPPSSNV-----------HSMTT 243
             + + SD +  + DS    +SA   +            PP  +V           H M T
Sbjct: 836  PEESTSDQNHYSTDSETAISSAMTPQSINVSLFEDSDFPPLQSVISSTTAAPETSHPMIT 895

Query: 244  RAKTGIHRPKAYAASTALSS---VPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLL 300
            RAK+GI +P    A  ++ S    P S K AL    W  AM EE   +   +TW+LVP  
Sbjct: 896  RAKSGITKPNPKYALFSVKSNYPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWDLVPPE 955

Query: 301  HGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLS 360
                 +GCKWVF+TK NSDGS+++ K RLVA+G+ Q EG DY ET+SPVV+  T+R++L 
Sbjct: 956  MVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATVRSILH 1015

Query: 361  LALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGSS-HLVCKLKKALYGLKQAPRA 419
            +A +++W ++Q D  NAFL+ EL E VFM QPPGF   S    VCKLKKA+Y LKQAPRA
Sbjct: 1016 VATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFEDPSRPDYVCKLKKAIYDLKQAPRA 1075

Query: 420  WFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLN 479
            WFDK SS L ++GF  +  D SLF      + +++L+YVDD+I+TG++   +  L+  L+
Sbjct: 1076 WFDKFSSYLLKYGFICSFSDPSLFVYLKGRDVMFLLLYVDDMILTGNNDVLLQQLLNILS 1135

Query: 480  AVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKL 539
              F +KD+G+LHYFLGI+A H  + GL LSQ KY +DLL+ A M   +  PTP+   L L
Sbjct: 1136 TEFRMKDMGALHYFLGIQA-HYHNDGLFLSQEKYTSDLLVNAGMSDCSSMPTPLQ--LDL 1192

Query: 540  SANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYL 599
               +   FP+P  +R + G L Y+T+TRP+I F +N VCQ MH+P    +  +KRIL YL
Sbjct: 1193 LQGNNKPFPEPTYFRRLAGKLQYLTLTRPDIQFAVNFVCQKMHAPTMSDFHLLKRILHYL 1252

Query: 600  HGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVAR 659
             GT T G+ L   +   +  ++DSDW    D RRST G C ++G N++SW+AK+   V++
Sbjct: 1253 KGTMTMGINLSSNTDSVLRCYSDSDWAGCKDTRRSTGGFCTFLGYNIISWSAKRHPTVSK 1312

Query: 660  SSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHF 719
            SSTEAEYR+L+   +E+ W+  LL E+ +P      +YCDNLS V+L ANP LH+++KHF
Sbjct: 1313 SSTEAEYRTLSFAASEVSWIGFLLQEIGLPQQQIPEMYCDNLSAVYLSANPALHSRSKHF 1372

Query: 720  ALDLHFVRERVADKQVTVVNIPAPSQVS 747
             +D ++VRERVA   +TV +IPA  Q++
Sbjct: 1373 QVDYYYVRERVALGALTVKHIPASQQLA 1400


>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078
            gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
            Arabidopsis thaliana
          Length = 1415

 Score =  588 bits (1517), Expect = e-166
 Identities = 327/754 (43%), Positives = 450/754 (59%), Gaps = 17/754 (2%)

Query: 3    LTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQ 62
            L++ GI HR++CPY  QQNG  E KHRH+ E GL++L H+H P  FW ++F TA Y+IN+
Sbjct: 595  LSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYIINR 654

Query: 63   LPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMH 122
            LP   L   +P + L+ +KPDYS L+ FG  CYP  RP   +K   RS  C FLGY+  +
Sbjct: 655  LPSSVLKNLSPYEALFGEKPDYSSLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQY 714

Query: 123  KGYNCL-TTEGKVIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNAS 181
            KGY C     GKV I+RNV+F+E+  PF+    +    ++   S P         IS  S
Sbjct: 715  KGYRCFYPPTGKVYISRNVIFNESELPFK----EKYQSLVPQYSTPLLQAWQHNKISEIS 770

Query: 182  PNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGV---VHASAQPTEPPVPPSSNV 238
                P      P      +   V   ++D    + + G    V+  A+          N 
Sbjct: 771  VPAAPVQLFSKPIDLNTYAGSQVTEQLTDPEPTSNNEGSDEEVNPVAEEIAANQEQVINS 830

Query: 239  HSMTTRAKTGIHRPKA-YAASTALSSV--PTSAK*ALSIPHWHQAMQEEYQALMNNNTWE 295
            H+MTTR+K GI +P   YA  T+  +   P +   A+  P W++A+ EE   +   +TW 
Sbjct: 831  HAMTTRSKAGIQKPNTRYALITSRMNTAEPKTLASAMKHPGWNEAVHEEINRVHMLHTWS 890

Query: 296  LVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTI 355
            LVP     + +  KWVF+TK + DGSI+K K RLVAKGF Q EG DY ETFSPVV+  TI
Sbjct: 891  LVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATI 950

Query: 356  RTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF--SKGSSHLVCKLKKALYGL 413
            R VL ++    WPI+Q D +NAFL+GEL E VFM QP GF   +  +H VC+L KA+YGL
Sbjct: 951  RLVLDVSTSKGWPIKQLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTH-VCRLTKAIYGL 1009

Query: 414  KQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFS 473
            KQAPRAWFD  S+ L  +GF  +K D SLF      + LY+L+YVDDI++TGS  + +  
Sbjct: 1010 KQAPRAWFDTFSNFLLDYGFVCSKSDPSLFVCHQDGKILYLLLYVDDILLTGSDQSLLED 1069

Query: 474  LIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPM 533
            L+  L   F +KDLG   YFLGI+     +G L L QT Y  D+L +A M   NP PTP+
Sbjct: 1070 LLQALKNRFSMKDLGPPRYFLGIQIEDYANG-LFLHQTAYATDILQQAGMSDCNPMPTPL 1128

Query: 534  TSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVK 593
                +L   +  LF +P  +RS+ G L Y+TITRP+I F +N +CQ MHSP    +  +K
Sbjct: 1129 PQ--QLDNLNSELFAEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTTSDFGLLK 1186

Query: 594  RILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKK 653
            RILRY+ GT   GL ++R S+++++A++DSD     + RRST+G CI +G NL+SW+AK+
Sbjct: 1187 RILRYIKGTIGMGLPIKRNSTLTLSAYSDSDHAGCKNTRRSTTGFCILLGSNLISWSAKR 1246

Query: 654  QTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILH 713
            Q  V+ SSTEAEYR+L     EI W+  LL +L +P    + +YCDNLS V+L ANP LH
Sbjct: 1247 QPTVSNSSTEAEYRALTYAAREITWISFLLRDLGIPQYLPTQVYCDNLSAVYLSANPALH 1306

Query: 714  TKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
             ++KHF  D H++RE+VA   +   +I A  Q++
Sbjct: 1307 NRSKHFDTDYHYIREQVALGLIETQHISATFQLA 1340


>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  585 bits (1508), Expect = e-165
 Identities = 328/761 (43%), Positives = 446/761 (58%), Gaps = 23/761 (3%)

Query: 7    GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
            GI HR++CPY  QQNG  E KHRH+ E GL++L H+H PL FW +AF TA YL N LP  
Sbjct: 601  GIHHRISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSS 660

Query: 67   TLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYN 126
             L   +P + L+ +K DY+ L+ FG  CYP  RP   +K   RS  C FLGY   +KGY 
Sbjct: 661  VLKEISPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYR 720

Query: 127  CL-TTEGKVIITRNVLFDETVFPFQ---------FQTSKSSSDILLDSSLPTTSVIPLLP 176
            CL    GKV I+R+V+FDE  FPF+         +QT+   +    D + P+     L P
Sbjct: 721  CLYPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPSVPSSQLQP 780

Query: 177  ISNASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*D------SGVVHASAQPTEP 230
            ++     +    N  M    T  +  V   T SD    + D      + V++   +    
Sbjct: 781  LARQMTPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPVLNDQNEDNAL 840

Query: 231  PVPPSSNVHSMTTRAKTGIHRPKA-YAASTALSSV--PTSAK*ALSIPHWHQAMQEEYQA 287
                  N+H M TR+K GI +P   YA   + SS   P +   A+  P W+ A+ +E   
Sbjct: 841  GQGSLENLHPMITRSKDGIQKPNPRYALIVSKSSFDEPKTITTAMKHPSWNAAVMDEIDR 900

Query: 288  LMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFS 347
            +   NTW LVP     + +  KWVF+TK   DG+I+K K RLVAKGF Q EG DY ETFS
Sbjct: 901  IHMLNTWSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFS 960

Query: 348  PVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF-SKGSSHLVCKL 406
            PVV+  TIR VL  A  ++WP++Q D +NAFL+GEL E VFM QP GF      + VC+L
Sbjct: 961  PVVRTATIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRL 1020

Query: 407  KKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGS 466
             KALYGLKQAPRAWFD  S+ L  FGF+ +  D SLF      ++L +L+YVDDI++TGS
Sbjct: 1021 TKALYGLKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVCHQNGQSLILLLYVDDILLTGS 1080

Query: 467  SSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSI 526
                +  L+  LN  F +KDLG   YFLGIE   + + GL L Q  Y +D+L +A M   
Sbjct: 1081 DQLLMDKLLQALNNRFSMKDLGPPRYFLGIEI-ESYNNGLFLHQHAYASDILHQAGMTEC 1139

Query: 527  NPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLE 586
            NP PTP+   L+   NS   F +P  +RS+ G L Y+TITRP+I + +N +CQ MH+P  
Sbjct: 1140 NPMPTPLPQHLE-DLNSEP-FEEPTYFRSLAGKLQYLTITRPDIQYAVNFICQRMHAPTN 1197

Query: 587  PHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNL 646
              +  +KRILRY+ GT   GL +R+  +  ++ F DSD+    D RRST+G CI +G  L
Sbjct: 1198 SDFGLLKRILRYVKGTINMGLPIRKHHNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTL 1257

Query: 647  VSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHL 706
            +SW+AK+Q  ++ SSTEAEYR+L+    EI W+ SLL +L +     + ++CDNLS V+L
Sbjct: 1258 ISWSAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQPTRVFCDNLSAVYL 1317

Query: 707  VANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
             ANP LH ++KHF  D H++RERVA   +   +IPA  Q++
Sbjct: 1318 SANPALHKRSKHFDKDFHYIRERVALGLIETQHIPATIQLA 1358


>gb|AAD43604.1| T3P18.3 [Arabidopsis thaliana] gi|25301688|pir||H96650 protein
            T3P18.3 [imported] - Arabidopsis thaliana
          Length = 1309

 Score =  584 bits (1506), Expect = e-165
 Identities = 328/761 (43%), Positives = 446/761 (58%), Gaps = 23/761 (3%)

Query: 7    GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
            GI HR++CPY  QQNG  E KHRH+ E GL++L H+H PL FW +AF TA YL N LP  
Sbjct: 444  GIHHRISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSS 503

Query: 67   TLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYN 126
             L   +P + L+ +K DY+ L+ FG  CYP  RP   +K   RS  C FLGY   +KGY 
Sbjct: 504  VLKEISPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYR 563

Query: 127  CL-TTEGKVIITRNVLFDETVFPFQ---------FQTSKSSSDILLDSSLPTTSVIPLLP 176
            CL    GKV I+R+V+FDE  FPF+         +QT+   +    D + P+     L P
Sbjct: 564  CLYPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPSVPSSQLQP 623

Query: 177  ISNASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*D------SGVVHASAQPTEP 230
            ++     +    N  M    T  +  V   T SD    + D      + V++   +    
Sbjct: 624  LARQVTPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPVLNDQNEDNAL 683

Query: 231  PVPPSSNVHSMTTRAKTGIHRPKA-YAASTALSSV--PTSAK*ALSIPHWHQAMQEEYQA 287
                  N+H M TR+K GI +P   YA   + SS   P +   A+  P W+ A+ +E   
Sbjct: 684  GQGSLENLHPMITRSKDGIQKPNPRYALIVSKSSFDEPKTITTAMKHPGWNAAVMDEIDR 743

Query: 288  LMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFS 347
            +   NTW LVP     + +  KWVF+TK   DG+I+K K RLVAKGF Q EG DY ETFS
Sbjct: 744  IHMLNTWSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFS 803

Query: 348  PVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF-SKGSSHLVCKL 406
            PVV+  TIR VL  A  ++WP++Q D +NAFL+GEL E VFM QP GF      + VC+L
Sbjct: 804  PVVRTATIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRL 863

Query: 407  KKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGS 466
             KALYGLKQAPRAWFD  S+ L  FGF+ +  D SLF      ++L +L+YVDDI++TGS
Sbjct: 864  TKALYGLKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVCHQNGQSLILLLYVDDILLTGS 923

Query: 467  SSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSI 526
                +  L+  LN  F +KDLG   YFLGIE   + + GL L Q  Y +D+L +A M   
Sbjct: 924  DQLLMDKLLQALNNRFSMKDLGPPRYFLGIEI-ESYNNGLFLHQHAYASDILHQAGMTEC 982

Query: 527  NPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLE 586
            NP PTP+   L+   NS   F +P  +RS+ G L Y+TITRP+I + +N +CQ MH+P  
Sbjct: 983  NPMPTPLPQHLE-DLNSEP-FEEPTYFRSLAGKLQYLTITRPDIQYAVNFICQRMHAPTN 1040

Query: 587  PHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNL 646
              +  +KRILRY+ GT   GL +R+  +  ++ F DSD+    D RRST+G CI +G  L
Sbjct: 1041 SDFGLLKRILRYVKGTINMGLPIRKHHNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTL 1100

Query: 647  VSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHL 706
            +SW+AK+Q  ++ SSTEAEYR+L+    EI W+ SLL +L +     + ++CDNLS V+L
Sbjct: 1101 ISWSAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQPTRVFCDNLSAVYL 1160

Query: 707  VANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
             ANP LH ++KHF  D H++RERVA   +   +IPA  Q++
Sbjct: 1161 SANPALHKRSKHFDKDFHYIRERVALGLIETQHIPATIQLA 1201


>emb|CAE05956.3| OSJNBb0088C09.15 [Oryza sativa (japonica cultivar-group)]
            gi|32487794|emb|CAE05417.1| OSJNBa0035I04.5 [Oryza sativa
            (japonica cultivar-group)]
          Length = 1246

 Score =  578 bits (1490), Expect = e-163
 Identities = 324/760 (42%), Positives = 445/760 (57%), Gaps = 21/760 (2%)

Query: 2    FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
            FL   G   RL+CPY   QNG+ E   R +  +   LL  A MP ++W +   TA YL+N
Sbjct: 458  FLAGRGSLLRLSCPYTSPQNGKAERMIRTLNNSIRTLLLQASMPPSYWAEGLATATYLLN 517

Query: 62   QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
            + P  +++ S P Q L+ K P+YS L+ FGCLCYP+      HKLA  SA C FLGY   
Sbjct: 518  RRPSSSVNNSIPFQLLHRKIPNYSMLRVFGCLCYPNLSATAAHKLAPYSAACVFLGYPSS 577

Query: 122  HKGYNCLT-TEGKVIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNA 180
            HKGY CL  +  ++II+ +V+FDET FPF      +SS   L    P  SVI        
Sbjct: 578  HKGYCCLNISTRRIIISCHVIFDETQFPFSGDPVDASSLDFLLQDAPAPSVIAPSLAGVE 637

Query: 181  SPNVMPHA------NSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASA--------Q 226
             P+ +PHA         +PT A +     +   V   +    D G  H +         Q
Sbjct: 638  QPH-LPHAPFPVNVEQRLPTGAPSTKDEHLPYYVQPAAHCGQDDGKFHTAGCHVSIFLKQ 696

Query: 227  PTEPPVPPSSNVHSMTTRAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQ 286
             T     P+ +V   T+R    I  P         S+         + P    AM EE++
Sbjct: 697  GTIVTSYPAVHVFFTTSRHGGAIPVPLPTTTGADDSAPSHRCGCTHTSP---AAMAEEFK 753

Query: 287  ALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETF 346
            AL++N TW LVP   G + +  KW+F+ K++SDGS+ +HK R V  G+ Q  G DY+ETF
Sbjct: 754  ALIDNGTWRLVPRPPGANVVTGKWIFKHKFHSDGSLARHKARWVVHGYSQQHGIDYDETF 813

Query: 347  SPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF-SKGSSHLVCK 405
            SPVVKP+TIR VLS+A    WPI Q D  NAFL+G L E V+ QQ  GF    +   VC 
Sbjct: 814  SPVVKPSTIRVVLSIATSRSWPIHQLDVKNAFLHGTLDETVYCQQSSGFIDPAAPDAVCL 873

Query: 406  LKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITG 465
            L+++LYGLKQAPRAW+ + ++ + + GF  +  DTSLF         Y+L+YVDDII+  
Sbjct: 874  LQRSLYGLKQAPRAWYQRFATYIRQLGFTPSASDTSLFILRDGDRLAYLLLYVDDIILMA 933

Query: 466  SSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDS 525
            SS+  +  + A L+  F + DLG LH+FLGI    T D  L LSQ +Y  DLL +A M  
Sbjct: 934  SSAELLCHITARLHTEFAMTDLGDLHFFLGISVRRTPDD-LFLSQRQYAVDLLQRAGMSE 992

Query: 526  INPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPL 585
             +P+ TP+ +  KLSA+ GA    P  YRS+  AL Y+T+TRPEIA+ + +VC+FMH P 
Sbjct: 993  YHPTATPVDARAKLSASEGAPVADPTEYRSLADALQYLTLTRPEIAYAVQQVCRFMHDPH 1052

Query: 586  EPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGN 645
            EPH   VKRI+RY+ G+ + GL++      S+TA++D+DW   PD RRSTSG CI++G N
Sbjct: 1053 EPHLALVKRIMRYIKGSLSAGLHIGTGPVGSLTAYSDADWAGCPDSRRSTSGYCIFLGDN 1112

Query: 646  LVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVH 705
            LVSW++K+QT V+RSS EAEYR++A  I E   LR LL EL  P S +++++CDN+S  +
Sbjct: 1113 LVSWSSKRQTTVSRSSAEAEYRAVAHAIAECCCLRQLLQELHAPISTTTVVFCDNVSAAY 1172

Query: 706  LVANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQ 745
            + ANP+ H  TKH  +D+HFVRE+VA  QV V+++P+  Q
Sbjct: 1173 MTANPVHHRCTKHIEIDIHFVREKVALGQVRVLHVPSTHQ 1212


>gb|AAW56918.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1018

 Score =  570 bits (1468), Expect = e-161
 Identities = 317/705 (44%), Positives = 432/705 (60%), Gaps = 42/705 (5%)

Query: 40   AHAHMPLTFWGDAFETAAYLINQLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTR 99
            AHA MPL FW +AF TA+YLIN+ P   ++  TPL++L+ + P+YSFL+ FGC C+P+ R
Sbjct: 338  AHASMPLKFWDEAFYTASYLINRTPSKVINFETPLERLFHQPPNYSFLRVFGCACWPNLR 397

Query: 100  PYNTHKLAFRSAPCTFLGYSPMHKGYNCLTTE-GKVIITRNVLFDETVFPFQFQTSKSSS 158
            PYN HKL FR           +HKG+ CL    G+V I+R+V FDE VFPF    S + +
Sbjct: 398  PYNKHKLQFRH----------LHKGFKCLDVPTGRVYISRDVTFDENVFPFAQLHSNAGA 447

Query: 159  DILLDSSLPTTSVIPLLPISNASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DS 218
             +  + SL  +S++P+   S    ++    NSV    ATN S         D      D 
Sbjct: 448  RLRNEISLLPSSLLPIT-YSGGEQSITSMFNSVPN--ATNLSDAGSAENGGDL-----DV 499

Query: 219  GVVHASAQPTEPPVPPSSNVHSMTTRAKTGIHRPKAYAASTALSSVPTSAK*ALS----- 273
             V H     T      S ++ SM   +   + +   +  S +  ++PT A  A S     
Sbjct: 500  SVTHDGVHTTHGDAL-SEDLGSMPQESLGSVSQAPLHHVSGSPPTMPTPAPSAPSPRQSP 558

Query: 274  --------IPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKH 325
                      HW +AM  EYQALM N TW LVP   GR+ I CKWV++ K  +DGS++++
Sbjct: 559  NQPDEAFATNHWREAMDAEYQALMKNQTWHLVPPQQGRNIIDCKWVYKVKRKADGSLDRY 618

Query: 326  KPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVE 385
            K RLVAKGF Q  G DY +TFSPVVK TTIRT+LS+ +   W +RQ D  NAFL+G L E
Sbjct: 619  KARLVAKGFKQRYGIDYEDTFSPVVKATTIRTILSITVSRGWSLRQLDVQNAFLHGILEE 678

Query: 386  EVFMQQPPGF--SKGSSHLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLF 443
            EV+M+Q PG+  SK  +H +CKL KA+YGLKQAPRAW+ +LS+ L   GF+ +K DTSLF
Sbjct: 679  EVYMKQTPGYENSKFPNH-ICKLDKAIYGLKQAPRAWYSRLSTKLQALGFKPSKVDTSLF 737

Query: 444  CKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKD 503
                   T+++L+YVDDII+  S+ +A  +L+ DL   F LKDL  LHYFLGIE T T+D
Sbjct: 738  FYSKGDVTVFVLIYVDDIIVASSTPSATSALLRDLTKEFALKDLRELHYFLGIEVTRTED 797

Query: 504  GGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALF-PKPLL-YRSVVGALH 561
             G++L+Q KY  D+L +  M    P  TP+ +  KLS N G L  PK    YRS+VGAL 
Sbjct: 798  -GILLTQAKYAYDVLRRVGMQDCKPVNTPLLTSEKLSVNEGDLLGPKDATEYRSIVGALQ 856

Query: 562  YVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFA 621
            Y+T+TR +I+F +NKVCQ    P   HW AVKRILRYL  T + GL + +  ++ +++F+
Sbjct: 857  YLTLTRLDISFSVNKVCQ---CPTTVHWAAVKRILRYLKHTVSCGLKINKSPTLLVSSFS 913

Query: 622  DSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRS 681
            D+DW S   DRRST G  +++G NLVSW+A+KQ  V+RSSTEAE ++LA   TEIMWL++
Sbjct: 914  DADWASCLADRRSTGGFTVFLGSNLVSWSARKQATVSRSSTEAECKALADATTEIMWLQT 973

Query: 682  LLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFV 726
            LL E+ +       ++ DN+   +L ANP+ H +TKH  +D HFV
Sbjct: 974  LLCEIGINAPKVVKMWFDNIGAKYLSANPVFHARTKHIEVDYHFV 1018


>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
          Length = 1358

 Score =  537 bits (1383), Expect = e-151
 Identities = 297/751 (39%), Positives = 426/751 (56%), Gaps = 47/751 (6%)

Query: 2    FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
            F+T  GI H+ TCPY  QQNG  E K+RH+ ET   LL  +++PL FWGDA  T+ YLIN
Sbjct: 622  FMTHQGIIHQTTCPYTPQQNGVAERKNRHLIETARTLLLESNVPLRFWGDAVLTSCYLIN 681

Query: 62   QLPIPTLDLSTPLQKLYDKKPDYSFL-KCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSP 120
            ++P  ++    P   L+ +   Y    + FG  C+ H       KLA R+  C FLGYS 
Sbjct: 682  RMPSSSIQNQVPHSILFPQSHLYPIPPRVFGSTCFVHNLAPGKDKLAPRALKCVFLGYSR 741

Query: 121  MHKGYNCLTTE-GKVIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISN 179
            + KGY C + +  + +++ +V F E+     + TS +  D+ +           +LPI  
Sbjct: 742  VQKGYRCYSHDLHRYLMSADVTFFESQ---PYYTSSNHPDVSM-----------VLPI-- 785

Query: 180  ASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPSSNVH 239
              P V+P    V  TV +  ++PVV   +  +          H   +PT   + P  + H
Sbjct: 786  --PQVLPVPTFVESTVTS--TSPVVVPPLLTY----------HRRPRPT---LVPDDSCH 828

Query: 240  SMTTRAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPL 299
            +        +  P    A             ALS   W QAM +E  AL  + TWELV L
Sbjct: 829  APDPAPTADLPPPSQPLALQKGE--------ALSHSGWRQAMVDEMSALHKSGTWELVSL 880

Query: 300  LHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVL 359
              G+  +GC+WV+  K   DG +++ K RLVAKG+ Q  G DY++TF+PV K  ++R  L
Sbjct: 881  PAGKSTVGCRWVYAVKIGPDGQVDRLKARLVAKGYTQIFGLDYSDTFAPVAKIASVRLFL 940

Query: 360  SLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGF-SKG-SSHLVCKLKKALYGLKQAP 417
            S+A +  WP+ Q D  NAFL+G+L EEV+M+QPPGF ++G SS LVC+L+++LYGLKQ+P
Sbjct: 941  SMAAVRHWPLHQLDIKNAFLHGDLEEEVYMEQPPGFVAQGESSSLVCRLRRSLYGLKQSP 1000

Query: 418  RAWFDKLSSTLARFGFQAAKCDTSLFCKFTK-SETLYILVYVDDIIITGSSSTAIFSLIA 476
            RAWF K S+ +  FG   +  D S+F + +  S  +Y++VYVDDI+ITG+    I  L  
Sbjct: 1001 RAWFGKFSTVIQEFGMTRSGADHSVFYRHSAPSRCIYLVVYVDDIVITGNDQDGITDLKQ 1060

Query: 477  DLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSG 536
             L   F  KDLG L YFLGIE   ++ G +++SQ KY  D+L +  M    P  TPM   
Sbjct: 1061 HLFKHFQTKDLGRLKYFLGIEVAQSRSG-IVISQRKYALDILEETGMMGCRPVDTPMDPN 1119

Query: 537  LKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRIL 596
            +KL    G     P  YR +VG L+Y+T+TRP+I+FP++ V QFM SP + HW+AV RIL
Sbjct: 1120 VKLLPGQGEPLSNPERYRRLVGKLNYLTVTRPDISFPVSVVSQFMTSPCDSHWEAVVRIL 1179

Query: 597  RYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTV 656
            RY+      GL         I  + D+DW   P DRRSTSG C+ +GGNLVSW +KKQ V
Sbjct: 1180 RYIKSAPGKGLLFEDQGHEHIIGYTDADWAGSPSDRRSTSGYCVLVGGNLVSWKSKKQNV 1239

Query: 657  VARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKT 716
            VARSS E+EYR++A    E++W++ LL EL+        + CDN + +H+ +NP+ H +T
Sbjct: 1240 VARSSAESEYRAMATATCELVWIKQLLGELKFGKVDKMELVCDNQAALHIASNPVFHERT 1299

Query: 717  KHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
            KH  +D HFVRE++    +    + +  Q++
Sbjct: 1300 KHIEIDCHFVREKILSGDIVTKFVKSNDQLA 1330


>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
          Length = 1382

 Score =  535 bits (1379), Expect = e-150
 Identities = 307/753 (40%), Positives = 442/753 (57%), Gaps = 33/753 (4%)

Query: 7    GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
            G  H+ +C    +QNG  E KHRHI ET  +LL  A +   FWG+A  TA  LIN +P  
Sbjct: 624  GTIHQTSCTDTPEQNGVAERKHRHIVETARSLLLSAFVLSEFWGEAVLTAVSLINTIPSS 683

Query: 67   TLDLSTPLQKLYDKKPDYSFLKCFGC---LCYPHTRPYNTHKLAFRSAPCTFLGYSPMHK 123
                 +P +KLY   PDYS  + FGC   + +PH      +KL+ RSA C FLGY    K
Sbjct: 684  HSSGLSPFEKLYGHVPDYSSFRVFGCTYFVLHPHVE---RNKLSSRSAICVFLGYGEGKK 740

Query: 124  GYNCLTT-EGKVIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNASP 182
            GY C      K+ ++ +V+F E +  F   ++  S        L  + +I + P S  S 
Sbjct: 741  GYRCFDPITQKLYVSHHVVFLEHIPFFSIPSTTHS--------LTKSDLIHIDPFSEDSG 792

Query: 183  N-VMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPSSNVHSM 241
            N   P+  S+     T+ SA    T +S   + +  S    AS++  +PP   S  +   
Sbjct: 793  NDTSPYVRSI----CTHNSAGT-GTLLSGTPEASFSSTAPQASSEIVDPPPRQSIRIRKS 847

Query: 242  TTRAKTGIHRPKAYAAS--TALSSV-----PTSAK*ALSIPHWHQAMQEEYQALMNNNTW 294
            T   K        Y++S  + L+ +     P+S K A+  P   QAM EE  AL   +TW
Sbjct: 848  T---KLPDFAYSCYSSSFTSFLAYIHCLFEPSSYKEAILDPLGQQAMDEELSALHKTDTW 904

Query: 295  ELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTT 354
            +LVPL  G+  +GC+WV++ K NSDGSI ++K RLVAKG+ Q  G DY ETF+P+ K TT
Sbjct: 905  DLVPLPPGKSVVGCRWVYKIKTNSDGSIERYKARLVAKGYSQQYGMDYEETFAPIAKMTT 964

Query: 355  IRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGSSHLVCKLKKALYGLK 414
            IRT++++A + QW I Q D  NAFLNG+L EEV+M  PPG S  S + VCKLKKALYGLK
Sbjct: 965  IRTLIAVASIRQWHISQLDVKNAFLNGDLQEEVYMAPPPGISHDSGY-VCKLKKALYGLK 1023

Query: 415  QAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSL 474
            QAPRAWF+K S  ++  GF ++  D++LF K T +  + + +YVDD+IITG     I  L
Sbjct: 1024 QAPRAWFEKFSIVISSLGFVSSSHDSALFIKCTDAGRIILSLYVDDMIITGDDIDGISVL 1083

Query: 475  IADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMT 534
              +L   F +KDLG L YFLGIE  ++  G L LSQ+KY+ ++L +A++       TP+ 
Sbjct: 1084 KTELARRFEMKDLGYLRYFLGIEVAYSPRGYL-LSQSKYVANILERARLTDNKTVDTPIE 1142

Query: 535  SGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKR 594
               + S++ G     P LYR++VG+L Y+TIT P+IA+ ++ V QF+ SP   HW AV R
Sbjct: 1143 VNARYSSSDGLPLIDPTLYRTIVGSLVYLTITHPDIAYAVHVVSQFVASPTTIHWAAVLR 1202

Query: 595  ILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQ 654
            ILRYL GT    L L   SS+ + A++D+D GSDP DR+S +G CI++G +L+SW +KKQ
Sbjct: 1203 ILRYLRGTVFQSLLLSSTSSLELRAYSDADHGSDPTDRKSVTGFCIFLGDSLISWKSKKQ 1262

Query: 655  TVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHT 714
            ++V++SSTEAEY ++A    EI+W R LL+++ +  S  + +YCDN S + +  N + H 
Sbjct: 1263 SIVSQSSTEAEYCAMASTTKEIVWSRWLLADMGISFSHLTPMYCDNQSSIQIAHNSVFHE 1322

Query: 715  KTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
            +TKH  +D H  R  +    + +  +P+  Q++
Sbjct: 1323 RTKHIEIDCHLTRHHLKHGTIALPFVPSSLQIA 1355


>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301701|pir||E84589 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1461

 Score =  529 bits (1363), Expect = e-148
 Identities = 295/752 (39%), Positives = 431/752 (57%), Gaps = 33/752 (4%)

Query: 7    GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
            GI    +CP   +QN  VE KH+HI     AL+  ++M L +WGD   TA +LIN+ P  
Sbjct: 709  GIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYWGDCVLTAVFLINRTPSA 768

Query: 67   TLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYN 126
             L   TP + L  K PDYS LK FGCLCY  T     HK   RS  C FLGY    KGY 
Sbjct: 769  LLSNKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRACVFLGYPFGFKGYK 828

Query: 127  CLTTEGKVI-ITRNVLFDETVFPFQF--QTSKSSSDILLDSSLPTTSVIPLLPISNASPN 183
             L  E  V+ I+RNV F E +FP     Q++ ++SD+            P+ P+S+ +  
Sbjct: 829  LLDLESNVVHISRNVEFHEELFPLASSQQSATTASDVFT----------PMDPLSSGNS- 877

Query: 184  VMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPSSNVHSMTT 243
                  S +P+   + S  + +  ++ F     D      +   + P    SS  +S  +
Sbjct: 878  ----ITSHLPSPQISPSTQISKRRITKFPAHLQDYHCYFVNKDDSHPI--SSSLSYSQIS 931

Query: 244  RAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGR 303
             +         Y  + +   +P S   A     W  A+ +E  A+   +TWE+  L  G+
Sbjct: 932  PSHM------LYINNISKIPIPQSYHEAKDSKEWCGAIDQEIGAMERTDTWEITSLPPGK 985

Query: 304  DAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLAL 363
             A+GCKWVF  K+++DGS+ + K R+VAKG+ Q EG DY ETFSPV K  T++ +L ++ 
Sbjct: 986  KAVGCKWVFTVKFHADGSLERFKARIVAKGYTQKEGLDYTETFSPVAKMATVKLLLKVSA 1045

Query: 364  MHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFS--KGSS---HLVCKLKKALYGLKQAPR 418
              +W + Q D +NAFLNG+L E ++M+ P G++  KG+S   ++VC+LKK++YGLKQA R
Sbjct: 1046 SKKWYLNQLDISNAFLNGDLEETIYMKLPDGYADIKGTSLPPNVVCRLKKSIYGLKQASR 1105

Query: 419  AWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADL 478
             WF K S++L   GF+    D +LF +   SE + +LVYVDDI+I  ++  A  SL   L
Sbjct: 1106 QWFLKFSNSLLALGFEKQHGDHTLFVRCIGSEFIVLLVYVDDIVIASTTEQAAQSLTEAL 1165

Query: 479  NAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLK 538
             A F L++LG L YFLG+E   T +G + LSQ KY  +LL  A M    PS  PMT  ++
Sbjct: 1166 KASFKLRELGPLKYFLGLEVARTSEG-ISLSQRKYALELLTSADMLDCKPSSIPMTPNIR 1224

Query: 539  LSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRY 598
            LS N G L     +YR +VG L Y+TITRP+I F +NK+CQF  +P   H  AV ++L+Y
Sbjct: 1225 LSKNDGLLLEDKEMYRRLVGKLMYLTITRPDITFAVNKLCQFSSAPRTAHLAAVYKVLQY 1284

Query: 599  LHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVA 658
            + GT   GL+      +++  + D+DWG+ PD RRST+G  +++G +L+SW +KKQ  V+
Sbjct: 1285 IKGTVGQGLFYSAEDDLTLKGYTDADWGTCPDSRRSTTGFTMFVGSSLISWRSKKQPTVS 1344

Query: 659  RSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKH 718
            RSS EAEYR+LAL   E+ WL +LL  LRV  S   ++Y D+ + V++  NP+ H +TKH
Sbjct: 1345 RSSAEAEYRALALASCEMAWLSTLLLALRV-HSGVPILYSDSTAAVYIATNPVFHERTKH 1403

Query: 719  FALDLHFVRERVADKQVTVVNIPAPSQVSSTL 750
              +D H VRE++ + Q+ ++++    QV+  L
Sbjct: 1404 IEIDCHTVREKLDNGQLKLLHVKTKDQVADIL 1435


>pir||E96608 probable retroelement polyprotein F25P12.89 [imported] - Arabidopsis
            thaliana gi|9954746|gb|AAG09097.1| Putative retroelement
            polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score =  528 bits (1360), Expect = e-148
 Identities = 305/826 (36%), Positives = 440/826 (52%), Gaps = 82/826 (9%)

Query: 2    FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
            F  Q GI H  +C    QQNGRVE KHRHI     AL   + +P+ FW     TAAYLIN
Sbjct: 641  FFAQKGIIHETSCVGTPQQNGRVERKHRHILNVARALRFQSGLPIEFWSYCALTAAYLIN 700

Query: 62   QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
            + P P L   TP + +Y++ P    ++ FGC+CY H   +   K A RS    FLGY   
Sbjct: 701  RTPTPLLKGKTPFELIYNRPPPLQHIRIFGCICYVHNLKHGGDKFASRSNKSIFLGYPFA 760

Query: 122  HKGYNCLTTE-GKVIITRNVLFDETVFPFQFQTSKSSSD---ILLDSS------------ 165
             KG+     E G V ++R+V+F ET F F      SS     +L+DSS            
Sbjct: 761  KKGWRVYNIETGVVSVSRDVVFRETEFHFPISVMDSSPSLDPVLVDSSELEEISMTPPVT 820

Query: 166  -----------LPTTSVIPLLPISNASPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQP 214
                        P++ V P  P+S +SP V P ++ V P  +T  SA +   T+ D +  
Sbjct: 821  PSSPATPSSPVTPSSPVTPSSPVSPSSP-VTP-SSPVTPVSSTTTSAAI--DTIEDITTD 876

Query: 215  A*DSGVV--------HASAQPTEPPVPPSSNVHSMTTRAK-------------------- 246
              DS  +          S   TE P   SS VH    + +                    
Sbjct: 877  LEDSTSMDFFPDDEDEFSPTATESPASSSSPVHPPAVQLELLGKGHRPKRPPVKLADYVT 936

Query: 247  TGIHRP----------------------KAYAASTALSSVPTSAK*ALSIPHWHQAMQEE 284
            T +H+P                      +AY  +    + P +   A+   HW  A+  E
Sbjct: 937  TLLHQPFPSATPYPLDNYISSSRFSDNYQAYILAITSGNEPRNYNEAMLDDHWKGAVSHE 996

Query: 285  YQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNE 344
              +L N  TW +  L  G+ A+GCKWVFR KY SDG++ +HK RLV  G +QTEG DY E
Sbjct: 997  IGSLENLGTWTVEDLPPGKKALGCKWVFRLKYKSDGTLERHKARLVVLGNNQTEGLDYTE 1056

Query: 345  TFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGSSHLVC 404
            TF+PV K  T+R  L   +   W + Q D +NAFL+G+L EEV+MQ PPGF  G    VC
Sbjct: 1057 TFAPVAKMVTVRAFLQQVVSLDWEVHQMDVHNAFLHGDLDEEVYMQFPPGFRTGDKTKVC 1116

Query: 405  KLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIIT 464
            +L+K+LYGLKQAPR WF KL+S L  +GF     D SLF        L++LVYVDD+IIT
Sbjct: 1117 RLRKSLYGLKQAPRCWFAKLTSALKNYGFIQDISDYSLFIFHKNGVRLHVLVYVDDLIIT 1176

Query: 465  GSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMD 524
            G++   I      L++ F +KDLG L YFLGIE   + + G+ L Q KY  D++ +  + 
Sbjct: 1177 GTTIAVITEFKHYLSSCFYMKDLGILRYFLGIEVARSPE-GIYLCQRKYALDIITETGLL 1235

Query: 525  SINPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSP 584
             + P+  P+    KL+  +G     PL YR +VG + Y+  TRPE+++ ++ + QFMH+P
Sbjct: 1236 GVKPASFPLDQNHKLAFATGETIDDPLRYRRLVGRIIYLATTRPELSYVIHILSQFMHNP 1295

Query: 585  LEPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGG 644
               HW+A  R++RYL  +   G+ LR  + + ++A+ DSD+G+ P   RS +G  I +GG
Sbjct: 1296 KPAHWEAALRVVRYLKSSPGQGILLRANTPLVLSAWCDSDFGACPHSDRSLTGWFIQLGG 1355

Query: 645  NLVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVV 704
            + +SW  +KQ VV+RSS EAEYR++A  ++EI+W+R LL  L +P +  + ++ D+LS +
Sbjct: 1356 SPLSWKTQKQNVVSRSSAEAEYRAMAETVSEIIWIRELLPALGIPCTAPTTLHSDSLSAI 1415

Query: 705  HLVANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVSSTL 750
             L ANP+ H +TKH   D+HF+R+ + +  +   ++   SQ++  L
Sbjct: 1416 SLAANPVYHARTKHVRRDVHFIRDELVNGTIATKHVSTTSQLADIL 1461


>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|37534632|ref|NP_921618.1| putative pol polyprotein
            [Oryza sativa (japonica cultivar-group)]
          Length = 1688

 Score =  526 bits (1355), Expect = e-147
 Identities = 303/779 (38%), Positives = 427/779 (53%), Gaps = 35/779 (4%)

Query: 2    FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
            FL   G   +L+CP  H QNG  E KHRHI ET   LL  + +P  FW +A  TA YLIN
Sbjct: 463  FLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFWAEAISTAVYLIN 522

Query: 62   QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
              P  +L   +P + L+   P Y  L+ FGC CY    P    KL  +S  C FLGYS  
Sbjct: 523  MQPSSSLQGRSPGEVLFGSPPRYDHLRVFGCTCYVLLAPRERTKLTAQSVECVFLGYSLE 582

Query: 122  HKGYNCLTTEGKVI-ITRNVLFDETVFPFQFQTSKSSSD-----------ILLDSSLPTT 169
            HKGY C     + I I+R+V FDE    F   T++ SS            I    SLP++
Sbjct: 583  HKGYRCYDPSARRIRISRDVTFDENKPFFYSSTNQPSSPENSISFLYLPPIPSPESLPSS 642

Query: 170  SVIPL---LPISNASPNVMP------HANSVMPT---VATNGSAPVVQTTVSDFSQPA*D 217
             + P    +P S  SP  +P        + V P    +  + S P V +T++  + P   
Sbjct: 643  PITPSPSPIPPSVPSPTYVPPPPPSPSPSPVSPPPSHIPASSSPPHVPSTITLDTFPFHY 702

Query: 218  SG--VVHASAQPTEPP-------VPPSSNVHSMTTRAKTGIHRPKAYAASTALSSVPTSA 268
            S    +   +QP++P        V  SS       RA+  +  P        +   P++ 
Sbjct: 703  SRRPKIPNESQPSQPTLEDPTCSVDDSSPAPRYNLRARDALRAPNRDDFVVGVVFEPSTY 762

Query: 269  K*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNSDGSINKHKPR 328
            + A+ +PHW  AM EE  AL   NTW++VPL      I CKWV++ K  SDG + ++K R
Sbjct: 763  QEAIVLPHWKLAMSEELAALERTNTWDVVPLPSHAVPITCKWVYKVKTKSDGQVERYKAR 822

Query: 329  LVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAFLNGELVEEVF 388
            LVA+GF Q  G DY+ETF+PV   TT+RT++++A    W I Q D  NAFL+G+L EEV+
Sbjct: 823  LVARGFQQAHGRDYDETFAPVAHMTTVRTLIAVAATRSWTISQMDVKNAFLHGDLHEEVY 882

Query: 389  MQQPPGFSKGSSHLVCKLKKALYGLKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTK 448
            M  PPG      H V +L++ALYGLKQAPRAWF + SS +   GF  +  D +LF   + 
Sbjct: 883  MHPPPGVEAPPGH-VFRLRRALYGLKQAPRAWFARFSSVVLAAGFSPSDHDPALFIHTSS 941

Query: 449  SETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLIL 508
                 +L+YVDD++ITG     I  +   L+  F + DLG L YFLGIE T T DG   L
Sbjct: 942  RGRTLLLLYVDDMLITGDDLEYIAFVKGKLSEQFMMSDLGPLSYFLGIEVTSTVDG-YYL 1000

Query: 509  SQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRP 568
            SQ +YI DLL ++ +     + TPM   ++L +  G     P  YR +VG+L Y+T+TRP
Sbjct: 1001 SQHRYIEDLLAQSGLTDSRTTTTPMELHVRLRSTDGTPLDDPSRYRHLVGSLVYLTVTRP 1060

Query: 569  EIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSD 628
            +IA+ ++ + QF+ +P+  H+  + R+LRYL GT T  L+    S + + AF+DS W SD
Sbjct: 1061 DIAYAVHILSQFVSAPISVHYGHLLRVLRYLRGTTTQCLFYAASSPLQLRAFSDSTWASD 1120

Query: 629  PDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRV 688
            P DRRS +G CI++G +L++W +KKQT V+RSSTEAE R+LA   +EI+WLR LL++  V
Sbjct: 1121 PIDRRSVTGYCIFLGTSLLTWKSKKQTAVSRSSTEAELRALATTTSEIVWLRWLLADFGV 1180

Query: 689  PTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
                 + + CDN   + +  +PI H  TKH  +D  F R       + +  +P+  QV+
Sbjct: 1181 SCDVPTPLLCDNTGAIQIANDPIKHELTKHIGVDASFTRSHCQQSTIALHYVPSELQVA 1239


>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301694|pir||E84535 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1454

 Score =  525 bits (1351), Expect = e-147
 Identities = 294/774 (37%), Positives = 426/774 (54%), Gaps = 33/774 (4%)

Query: 2    FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
            F  + GI    +CP   +QN  VE KH+HI     AL+  + +PL+ WGD   TA +LIN
Sbjct: 693  FYAEKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSQVPLSLWGDCVLTAVFLIN 752

Query: 62   QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
            + P   L   TP + L    P Y  L+ FGCLCY  T P   HK   RS  C FLGY   
Sbjct: 753  RTPSQLLMNKTPYEILTGTAPVYEQLRTFGCLCYSSTSPKQRHKFQPRSRACLFLGYPSG 812

Query: 122  HKGYNCLTTEGK-VIITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNA 180
            +KGY  +  E   V I+RNV F E VFP        SS  L    +P +S I  +  +  
Sbjct: 813  YKGYKLMDLESNTVFISRNVQFHEEVFPLAKNPGSESSLKLFTPMVPVSSGI--ISDTTH 870

Query: 181  SPNVMPHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPSSNVHS 240
            SP+ +P   S +P   ++         ++D+          H +   ++   P SS +  
Sbjct: 871  SPSSLPSQISDLPPQISSQRVRKPPAHLNDY----------HCNTMQSDHKYPISSTISY 920

Query: 241  MTTRAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLL 300
                          Y  +     +PT+   A     W +A+  E  A+   NTWE+  L 
Sbjct: 921  SKISPSH-----MCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTWEITTLP 975

Query: 301  HGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLS 360
             G+ A+GCKWVF  K+ +DG++ ++K RLVAKG+ Q EG DY +TFSPV K TTI+ +L 
Sbjct: 976  KGKKAVGCKWVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTTIKLLLK 1035

Query: 361  LALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFSKGS-----SHLVCKLKKALYGLKQ 415
            ++   +W ++Q D +NAFLNGEL EE+FM+ P G+++       S++V +LK+++YGLKQ
Sbjct: 1036 VSASKKWFLKQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPSNVVLRLKRSIYGLKQ 1095

Query: 416  APRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLI 475
            A R WF K SS+L   GF+    D +LF K    E + +LVYVDDI+I  +S  A   L 
Sbjct: 1096 ASRQWFKKFSSSLLSLGFKKTHGDHTLFLKMYDGEFVIVLVYVDDIVIASTSEAAAAQLT 1155

Query: 476  ADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTS 535
             +L+  F L+DLG L YFLG+E   T   G+ + Q KY  +LL    M +  P   PM  
Sbjct: 1156 EELDQRFKLRDLGDLKYFLGLEVARTT-AGISICQRKYALELLQSTGMLACKPVSVPMIP 1214

Query: 536  GLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRI 595
             LK+  + G L      YR +VG L Y+TITRP+I F +NK+CQF  +P   H  A  R+
Sbjct: 1215 NLKMRKDDGDLIEDIEQYRRIVGKLMYLTITRPDITFAVNKLCQFSSAPRTTHLTAAYRV 1274

Query: 596  LRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQT 655
            L+Y+ GT   GL+    S +++  FADSDW S  D RRST+   +++G +L+SW +KKQ 
Sbjct: 1275 LQYIKGTVGQGLFYSASSDLTLKGFADSDWASCQDSRRSTTSFTMFVGDSLISWRSKKQH 1334

Query: 656  VVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTK 715
             V+RSS EAEYR+LAL   E++WL +LL  L+  + P  ++Y D+ + +++  NP+ H +
Sbjct: 1335 TVSRSSAEAEYRALALATCEMVWLFTLLVSLQA-SPPVPILYSDSTAAIYIATNPVFHER 1393

Query: 716  TKHFALDLHFVRERVADKQVTVVNIPAPSQVSSTL--------FHCFKTKLRVL 761
            TKH  LD H VRER+ + ++ ++++    QV+  L        F   K+K+ +L
Sbjct: 1394 TKHIKLDCHTVRERLDNGELKLLHVRTEDQVADILTKPLFPYQFEHLKSKMSIL 1447


>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score:
            11.19) [Arabidopsis thaliana] gi|7486705|pir||T01879
            hypothetical protein F8M12.17 - Arabidopsis thaliana
          Length = 1633

 Score =  515 bits (1327), Expect = e-144
 Identities = 298/797 (37%), Positives = 436/797 (54%), Gaps = 67/797 (8%)

Query: 2    FLTQCGIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLIN 61
            F+ + G+ H+ +C Y  QQN  VE KH+H+     +LL  +++PL +W D   TAAYLIN
Sbjct: 635  FVKEQGMIHQFSCAYTPQQNSVVERKHQHLLNIARSLLFQSNVPLQYWSDCVLTAAYLIN 694

Query: 62   QLPIPTLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPM 121
            +LP P LD  TP + L  K PDY+ LK   CLCY  T  ++ +K + R+ PC FLGY   
Sbjct: 695  RLPSPLLDNKTPFELLLKKIPDYTLLK--SCLCYASTNVHDRNKFSPRARPCVFLGYPSG 752

Query: 122  HKGYNCLTTEGKVI-ITRNVLFDETVFPFQFQTS-KSSSDILLDSSLPTTSVIPLLPISN 179
            +KGY  L  E   I ITRNV+F ET FPF+     K S D+  +S LP  + +  +    
Sbjct: 753  YKGYKVLDLESHSISITRNVVFHETKFPFKTSKFLKESVDMFPNSILPLPAPLHFVESMP 812

Query: 180  ASPNVMPHAN--SVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHAS-AQPTEPPVPPS- 235
               ++    N  S   + ++  S P + +TV+  +  A D        A+P      P+ 
Sbjct: 813  LDDDLRADDNNASTSNSASSASSIPPLPSTVNTQNTDALDIDTNSVPIARPKRNAKAPAY 872

Query: 236  -SNVH--------SMTTRAKTGIHRPKA----------YAASTALS-------------- 262
             S  H        S++    T I  P +          Y  STA+S              
Sbjct: 873  LSEYHCNSVPFLSSLSPTTSTSIETPSSSIPPKKITTPYPMSTAISYDKLTPLFHSYICA 932

Query: 263  ----SVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELVPLLHGRDAIGCKWVFRTKYNS 318
                + P +   A+    W +A  EE  AL  N TW +  L  G++ +GCKWVF  KYN 
Sbjct: 933  YNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWIVESLTEGKNVVGCKWVFTIKYNP 992

Query: 319  DGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRTVLSLALMHQWPIRQFDFNNAF 378
            DGSI ++K RLVA+GF Q EG DY ETFSPV K  +++ +L LA    W + Q D +NAF
Sbjct: 993  DGSIERYKARLVAQGFTQQEGIDYMETFSPVAKFGSVKLLLGLAAATGWSLTQMDVSNAF 1052

Query: 379  LNGELVEEVFMQQPPGFSKGS-----SHLVCKLKKALYGLKQAPRAWFDKLSSTLARFGF 433
            L+GEL EE++M  P G++  +     S  VC+L K+LYGLKQA R W+ +LSS      F
Sbjct: 1053 LHGELDEEIYMSLPQGYTPPTGISLPSKPVCRLLKSLYGLKQASRQWYKRLSSVFLGANF 1112

Query: 434  QAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIFSLIADLNAVFPLKDLGSLHYF 493
              +  D ++F K + +  + +LVYVDD++I  + S+A+ +L   L + F +KDLG   +F
Sbjct: 1113 IQSPADNTMFVKVSCTSIIVVLVYVDDLMIASNDSSAVENLKELLRSEFKIKDLGPARFF 1172

Query: 494  LGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTPMTSGLKLSANSGALFPKPLLY 553
            LG+E   + +G + + Q KY  +LL    +    PS  PM   L L+   G L P    Y
Sbjct: 1173 LGLEIARSSEG-ISVCQRKYAQNLLEDVGLSGCKPSSIPMDPNLHLTKEMGTLLPNATSY 1231

Query: 554  RSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAVKRILRYLHGTATHGLYLRRPS 613
            R +VG L Y+ ITRP+I F ++ + QF+ +P + H QA  ++LRYL G            
Sbjct: 1232 RELVGRLLYLCITRPDITFAVHTLSQFLSAPTDIHMQAAHKVLRYLKGNPGQ-------- 1283

Query: 614  SMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAKKQTVVARSSTEAEYRSLALVI 673
                    D+DWG+  D RRS +G CIY+G +L++W +KKQ+VV+RSSTE+EYRSLA   
Sbjct: 1284 --------DADWGTCKDSRRSVTGFCIYLGTSLITWKSKKQSVVSRSSTESEYRSLAQAT 1335

Query: 674  TEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPILHTKTKHFALDLHFVRERVADK 733
             EI+WL+ LL +L V  +  + ++CDN S +HL  NP+ H +TKH  +D H VR+++   
Sbjct: 1336 CEIIWLQQLLKDLHVTMTCPAKLFCDNKSALHLATNPVFHERTKHIEIDCHTVRDQIKAG 1395

Query: 734  QVTVVNIPAPSQVSSTL 750
            ++  +++P  +Q++  L
Sbjct: 1396 KLKTLHVPTGNQLADIL 1412


>pir||G86301 probable retroelement polyprotein [imported] - Arabidopsis thaliana
            gi|9989054|gb|AAG10817.1| Putative retroelement
            polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score =  513 bits (1322), Expect = e-144
 Identities = 284/755 (37%), Positives = 419/755 (54%), Gaps = 41/755 (5%)

Query: 7    GIQHRLTCPYVHQQNGRVEWKHRHITETGLALLAHAHMPLTFWGDAFETAAYLINQLPIP 66
            GI    +CP   +QN  VE KH+HI     ALL  + +PL++WGD   TA ++IN+ P P
Sbjct: 674  GIVAYHSCPETPEQNSVVERKHQHILNVARALLFQSQIPLSYWGDCILTAVFIINRTPSP 733

Query: 67   TLDLSTPLQKLYDKKPDYSFLKCFGCLCYPHTRPYNTHKLAFRSAPCTFLGYSPMHKGYN 126
             +   T  + L  K PDY+ LK FGCLCY  T P   HK   R+  C FLGY   +KGY 
Sbjct: 734  VISNKTLFEMLTKKVPDYTHLKSFGCLCYASTSPKQRHKFEDRARTCAFLGYPSGYKGYK 793

Query: 127  CLTTEGKVI-ITRNVLFDETVFPFQFQTSKSSSDILLDSSLPTTSVIPLLPISNASPNVM 185
             L  E   I I+RNV+F E +FPF+ + +++    +           P + +        
Sbjct: 794  LLDLESHTIFISRNVVFYEDLFPFKTKPAENEESSVF---------FPHIYVDRNDS--- 841

Query: 186  PHANSVMPTVATNGSAPVVQTTVSDFSQPA*DSGVVHASAQPTEPPVPPS--------SN 237
             H +  +P   T+ S    +   S  S+P       H ++  +    P S        S+
Sbjct: 842  -HPSQPLPVQETSASNVPAEKQNSRVSRPPAYLKDYHCNSVTSSTDHPISEVLSYSSLSD 900

Query: 238  VHSMTTRAKTGIHRPKAYAASTALSSVPTSAK*ALSIPHWHQAMQEEYQALMNNNTWELV 297
             + +   A   I  P  YA              A  I  W  AM  E  AL +N TW + 
Sbjct: 901  PYMIFINAVNKIPEPHTYAQ-------------ARQIKEWCDAMGMEITALEDNGTWVVC 947

Query: 298  PLLHGRDAIGCKWVFRTKYNSDGSINKHKPRLVAKGFHQTEGYDYNETFSPVVKPTTIRT 357
             L  G+ A+GCKWV++ K N+DGS+ ++K RLVAKG+ QTEG DY +TFSPV K TT++ 
Sbjct: 948  SLPVGKKAVGCKWVYKIKLNADGSLERYKARLVAKGYTQTEGLDYVDTFSPVAKLTTVKL 1007

Query: 358  VLSLALMHQWPIRQFDFNNAFLNGELVEEVFMQQPPGFS--KGSS---HLVCKLKKALYG 412
            ++++A    W + Q D +NAFLNG L EE++M  PPG+S  +G S   + VC+LKK+LYG
Sbjct: 1008 LIAVAAAKGWSLSQLDISNAFLNGSLDEEIYMTLPPGYSPRQGDSFPPNAVCRLKKSLYG 1067

Query: 413  LKQAPRAWFDKLSSTLARFGFQAAKCDTSLFCKFTKSETLYILVYVDDIIITGSSSTAIF 472
            LKQA R W+ K S +L   GF  +  D +LF + +K+  + +LVYVDDIII  S      
Sbjct: 1068 LKQASRQWYLKFSESLKALGFTQSSGDHTLFTRKSKNSYMAVLVYVDDIIIASSCDRETE 1127

Query: 473  SLIADLNAVFPLKDLGSLHYFLGIEATHTKDGGLILSQTKYINDLLIKAKMDSINPSPTP 532
             L   L     L+DLG+L YFLG+E     DG + + Q KY  +LL +  +     S  P
Sbjct: 1128 LLRDALQRSSKLRDLGTLRYFLGLEIARNTDG-ISICQRKYTLELLAETGLLGCKSSSVP 1186

Query: 533  MTSGLKLSANSGALFPKPLLYRSVVGALHYVTITRPEIAFPLNKVCQFMHSPLEPHWQAV 592
            M    KLS   G L      YR +VG L Y+T TRP+I + ++++CQF  +P  PH +AV
Sbjct: 1187 MEPNQKLSQEDGELIDDAEHYRKLVGKLMYLTFTRPDITYAVHRLCQFTSAPRVPHLKAV 1246

Query: 593  KRILRYLHGTATHGLYLRRPSSMSITAFADSDWGSDPDDRRSTSGMCIYIGGNLVSWAAK 652
             +I+ YL GT   GL+      + ++ FADSD+ S  D R+ T+G C+++G +LV+W +K
Sbjct: 1247 YKIIYYLKGTVGQGLFYSANVDLKLSGFADSDFSSCSDSRKLTTGYCMFLGTSLVAWKSK 1306

Query: 653  KQTVVARSSTEAEYRSLALVITEIMWLRSLLSELRVPTSPSSLIYCDNLSVVHLVANPIL 712
            KQ V++ SS EAEY+++++ + E+MWLR LL +L +  S +S++YCDN + +H+  NP+ 
Sbjct: 1307 KQEVISMSSAEAEYKAMSMAVREMMWLRFLLEDLWIDVSEASVLYCDNTAAIHIANNPVF 1366

Query: 713  HTKTKHFALDLHFVRERVADKQVTVVNIPAPSQVS 747
            H +TKH   D H +RE++    +  +++   +Q++
Sbjct: 1367 HERTKHIERDYHHIREKIILGLIRTLHVRTENQLA 1401


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.322    0.135    0.416 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,317,730,480
Number of Sequences: 2540612
Number of extensions: 55900775
Number of successful extensions: 138493
Number of sequences better than 10.0: 1769
Number of HSP's better than 10.0 without gapping: 1605
Number of HSP's successfully gapped in prelim test: 170
Number of HSP's that attempted gapping in prelim test: 131928
Number of HSP's gapped (non-prelim): 3051
length of query: 768
length of database: 863,360,394
effective HSP length: 136
effective length of query: 632
effective length of database: 517,837,162
effective search space: 327273086384
effective search space used: 327273086384
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 79 (35.0 bits)


Lotus: description of TM0378b.9