Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0003.6
         (1307 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult...   970  0.0
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu...   960  0.0
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-...   952  0.0
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-...   920  0.0
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi...   888  0.0
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi...   881  0.0
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi...   875  0.0
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult...   869  0.0
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]                             865  0.0
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu...   863  0.0
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi...   857  0.0
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi...   855  0.0
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1...   853  0.0
gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis ...   845  0.0
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]               811  0.0
emb|CAB79135.1| putative transposable element [Arabidopsis thali...   795  0.0
pir||T02206 hypothetical protein - common tobacco retrotransposo...   782  0.0
gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]         762  0.0
emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir|...   746  0.0
emb|CAA37924.1| unnamed protein product [Arabidopsis thaliana] g...   690  0.0

>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1241

 Score =  970 bits (2507), Expect = 0.0
 Identities = 554/1249 (44%), Positives = 770/1249 (61%), Gaps = 60/1249 (4%)

Query: 91   MSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSL 150
            M+K   +K+  KQ+L+  K+Q+ G +  H+ AF  I+AD+  + V  D+ED  +ILLCSL
Sbjct: 1    MTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSL 60

Query: 151  PGSYDHLVTTLTYGKDSITLDSISSTLLPHAQRRQSV-EEGGGSSGEGLFVKGGQDRGRG 209
            P SY +   T+ Y +D++TL  +   L    + ++ V  EG  S  EGL V+G Q     
Sbjct: 61   PSSYANFRDTILYSRDTLTLKEVYDALHAKEKMKKMVPSEGSNSQAEGLVVRGRQQEKNT 120

Query: 210  KGKAVDSGK---KKRSKSKDRKTTECYSCKQIGH-----WK-RDCPNRSGK-----SGNN 255
              K+ D      + RSKS+ R  + C  CK+ GH     WK +D   R+ K         
Sbjct: 121  NNKSRDKSSSIYRGRSKSRGRYKS-CKYCKRDGHDISECWKLQDKDKRTRKYIPKGKKEE 179

Query: 256  SSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYL 315
               A VV  + S +E  +      + +D W+LD+ C+YHM  +R+WF ++++   G V +
Sbjct: 180  EGKAAVVTDEKSDAELLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEAVQGGTVLM 239

Query: 316  GDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDI 375
            GDD PC + G+  V+I + DG +RTL  VR++  + ++LISL TL   GY +   +   I
Sbjct: 240  GDDTPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYKYSGGDG--I 297

Query: 376  LRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVE---TDDDATKLWHMRLGHLSERGM 432
            L+V+KG++ VM+A     N+Y L G TI+G+VA+V    ++ DAT LWHMRLGH+SE G+
Sbjct: 298  LKVTKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGL 357

Query: 433  MELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV 492
             EL KR +L G     +  C++C+ GK  RV+F T  H T+GILDYVHSD+ GP ++ S 
Sbjct: 358  AELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPARKTSF 417

Query: 493  *GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTD 552
             G RY +T  DD+SRKVW YF+K+K + F  FK WK  VE QT RK+K LR+DNG E   
Sbjct: 418  GGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKILRTDNGMELCS 477

Query: 553  KNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMA 612
            K F  +C+  GI RH++V  TPQQNGVAERMNRT+  KARC+  NA L K  WA  ++ A
Sbjct: 478  KIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCMLSNASLPKQFWAEAVSTA 537

Query: 613  CYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIII 672
            CYL+NRSP  ++D K   EVW+G+P + S+LR+F   AY H+   D  KL+P+   CI +
Sbjct: 538  CYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTAYAHV---DNGKLEPRVIKCIFL 594

Query: 673  GYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVPDTEVENSSQDKIQVDIEE 732
            GY  GVKGYKLW P  KKV++SR+VVF E  ML        P T V   SQ+K+ V +E 
Sbjct: 595  GYLSGVKGYKLWCPETKKVVISRNVVFHESIMLHDK-----PSTNVPVESQEKVSVQVEH 649

Query: 733  TPVS----PRQIVAQQQSEP---GSDSGEVQD---YTLVRDREPSRITPPVRYGFE-DLA 781
               S     ++ VA  Q  P    SDS  VQ     ++ +DR      PP RY  E ++ 
Sbjct: 650  LISSGHAPEKEDVAINQDAPVIEDSDSSIVQQSPKRSIAKDRPKRNTKPPRRYIEEANIV 709

Query: 782  AYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVI 836
            AYAL     +  + +PSTY +A+ S + ++W++AM +EMESL+KN +  L +LP  K+ I
Sbjct: 710  AYALSVAEEIEGNAEPSTYSDAIVSDDCNRWITAMHDEMESLEKNHSWELEKLPKEKKPI 769

Query: 837  GCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASM 896
             CKW++K+K  ++  +  ++KA LV KGYSQ  GID++++FSPVV+H+SIR +L++VA  
Sbjct: 770  RCKWIFKRKEGMSPSDEARYKARLVAKGYSQIPGIDFNDVFSPVVKHSSIRTLLSIVAMH 829

Query: 897  DMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRF 956
            D  LEQMDVKT FLHG LEE IY+EQPEGF   G   LVC+LK+SLYGLKQSPRQWYKRF
Sbjct: 830  DYELEQMDVKTAFLHGELEEDIYMEQPEGFVVPGKENLVCRLKKSLYGLKQSPRQWYKRF 889

Query: 957  DSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFD 1016
            DS+ML   +RR +YD CVY + + DGS I+LLLYVDDMLIAA    ++ +LK +L  EF+
Sbjct: 890  DSFMLSQKFRRSNYDSCVY-LKVVDGSAIYLLLYVDDMLIAAKDKSEIAKLKAQLSSEFE 948

Query: 1017 MKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLE 1076
            MKDLGAAKKILGMEI ++R + KL+LSQK Y+E VL RF+M  A  VST L  HF+LS +
Sbjct: 949  MKDLGAAKKILGMEITRERHSGKLYLSQKCYIEKVLHRFNMHDAKLVSTLLAAHFRLSSD 1008

Query: 1077 QSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQV 1134
              P+   +IE MS++PY  AV  ++  M+    D + H   V  +      +    + Q 
Sbjct: 1009 LCPQSAYDIEYMSRVPYSSAVSSLMYAMVCSRPD-LSHALSVVSRYMANPGKEHWKAVQW 1067

Query: 1135 DPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICL----YSCGG-PICW 1189
              + L G + +     +       S G+ G         +  R  L    ++ GG  + W
Sbjct: 1068 IFRYLRGTSSACLQFGRS------SDGLVGYVDSDFAGDLDRRRSLTGYVFTVGGCAVSW 1121

Query: 1190 KSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQ 1249
            K+S+Q+TVA+STTEAEYMA++EA KE +WL GL  EL      + + CDSQSAI LT +Q
Sbjct: 1122 KASLQATVALSTTEAEYMAISEACKEVIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQ 1181

Query: 1250 VYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
            ++H RTKHID+R+H IR ++A   + + KI T +N  D +TKPV + KF
Sbjct: 1182 MFHERTKHIDLRYHFIRGVIAEGDVKVCKISTHDNPVDMMTKPVPATKF 1230


>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|53370655|gb|AAU89150.1| integrase core domain
            containing protein [Oryza sativa (japonica
            cultivar-group)] gi|40538906|gb|AAR87163.1| putative
            polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1322

 Score =  960 bits (2482), Expect = 0.0
 Identities = 557/1334 (41%), Positives = 780/1334 (57%), Gaps = 67/1334 (5%)

Query: 5    KFEVTRFDGTGNFGLWQRMAKDLLAQKS-LQKALRD--EKPADIATVDWNEMKEKAAGLI 61
            K+++   D    F LWQ   + +LAQ S L +AL    +K     T +      KA  LI
Sbjct: 5    KYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 64

Query: 62   TLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVY 121
             L +S+D++  +L   T  ++W KLES+ MSK   +K+  K +L+S K+ E G +  H+ 
Sbjct: 65   QLHLSNDILQEVLQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHIS 124

Query: 122  AFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPHA 181
             F  I+AD+  + V  DDED  ++LLCSLP SY +   T+   +D +TL  +   L    
Sbjct: 125  VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSRDELTLAEVYEALQNRE 184

Query: 182  QRRQSVEEGGGSS-GEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQIGH 240
            + +  V+    SS GE L V+G  ++            + R +SK R    C  CK+  H
Sbjct: 185  KMKGMVQSYASSSKGEALQVRGRSEQRTYNDSNDHDKSQSRGRSKSRGKKFCKYCKKKNH 244

Query: 241  WKRDCPNRSGKSGNNSSS-ANVVQSDGSCSEEDLLCVSS--VKCTDAWVLDSGCSYHMTQ 297
            +  +C     K    S   A+VV S  +    D L V +  V   D W+LD+ CS+H+  
Sbjct: 245  FIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASHDEWILDTACSFHICI 304

Query: 298  HREWFNSFKS-GDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLIS 356
            +R+WF+S+KS  +   V +GDD P  I G+  V+I   DG  RTL  VR++  + +NLIS
Sbjct: 305  NRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMARNLIS 364

Query: 357  LGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDV--ASVETDD 414
            L TL   GY +       +++VSKG++  M     + N+Y L G T+ G V  A+V  D+
Sbjct: 365  LSTLDAEGYKYSGSGG--VVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAAVTKDE 422

Query: 415  DA-TKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTK 473
             + T LWHMRLGH+SE GM EL KRN+L G     +  C++CV GK  RV+F T  H+TK
Sbjct: 423  PSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTSVHRTK 482

Query: 474  GILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVEN 533
            GILDYVH+D+ GP+++PS+ G RY +T  DD+SRK W YF+K+K + FA FK  K  +E 
Sbjct: 483  GILDYVHADLWGPSRKPSLGGARYMLTIIDDYSRKEWPYFLKHKDDTFAAFKERKVMIER 542

Query: 534  QTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARC 593
            QT +++K L +DNG E+    F  +C + GI RH ++  TPQQNGVAERMNRT+  KARC
Sbjct: 543  QTEKEVKVLCTDNGGEFCSDAFDDYCRKEGIVRHHTIPYTPQQNGVAERMNRTIISKARC 602

Query: 594  LRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVH 653
            +  NA ++K  WA   N ACYL+NRSP   L+ K   E+W+G P D S LR+F   AY H
Sbjct: 603  MLSNARMNKRFWAEAANTACYLINRSPSIPLNKKTPIEIWSGMPADYSQLRVFGCTAYAH 662

Query: 654  ISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVT-V 712
            +   D  KL+P++  C+ +GY  GVKGYKLW+P   K  +SR+V+F+E  M   S  T V
Sbjct: 663  V---DNGKLEPRAIKCLFLGYGSGVKGYKLWNPETNKTFMSRNVIFNEFVMFNDSLPTDV 719

Query: 713  VPDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSGEV---QDYTLVRDREPSRI 769
            +P    E   Q  + V +E       +IV    ++    S  V   QD  +   R     
Sbjct: 720  IPGGSDEE--QQYVSVQVEHVDDQETEIVGNDVNDTVQHSPSVLQPQDEPIAHRRTKRSC 777

Query: 770  TPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET 823
              PVR   E D+  YA      + ++ +P+TY EA+ S +++KW+SA+ EEM+SL+KN T
Sbjct: 778  GAPVRLIEECDMVYYAFSYAEQVENTLEPATYTEAVVSGDREKWISAIQEEMQSLEKNGT 837

Query: 824  *NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRH 883
              LV LP  K+ + CKW++K+K  ++  E  +FK  LV KG+SQ  G+DY+++FSPVV+H
Sbjct: 838  WELVHLPKQKKPVRCKWIFKRKEGLSPSEPPRFKVRLVAKGFSQIAGVDYNDVFSPVVKH 897

Query: 884  TSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLY 943
            +SIR   ++V   D+ LEQ+DVKTTFLHG LEE+IY++QPEGF   G    VCKLKRSLY
Sbjct: 898  SSIRTFFSIVTMHDLELEQLDVKTTFLHGELEEEIYMDQPEGFIVPGKEDYVCKLKRSLY 957

Query: 944  GLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHD 1003
            GLKQSPRQWYKRFDS+ML  G++R ++D CVY+  ++ GS I+LLLYVDDMLIAA     
Sbjct: 958  GLKQSPRQWYKRFDSFMLSHGFKRSEFDSCVYIKFVN-GSPIYLLLYVDDMLIAAKSKEQ 1016

Query: 1004 VNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHV 1063
            +  LK +L  EFDMKDLGAAKKILGMEI +DR +  L+LSQ+SY++ VL RF+M  A  V
Sbjct: 1017 ITTLKKQLSSEFDMKDLGAAKKILGMEITRDRNSGLLFLSQQSYIKKVLQRFNMHDAKPV 1076

Query: 1064 STPLTNHFKLSLEQSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVH 1121
            STP+  HFKLS  Q    D ++E MS++PY  AV  ++  M+    D      LV   + 
Sbjct: 1077 STPIAPHFKLSALQCASTDEDVEYMSRVPYSSAVGSLMYAMVCSWPDLSHAMSLVSRYMA 1136

Query: 1122 VQARE-----------------AALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICG 1164
               +E                 A L   ++D K L GY  S +      R       + G
Sbjct: 1137 NPGKEHWKAVQWIFRYLRGTADACLKFGRID-KGLVGYVDSDFAADLDKRR-----SLTG 1190

Query: 1165 LCR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVK 1224
                           + SC   + WK+++Q  VA STTEAEYMA+AEA KE++WL GL  
Sbjct: 1191 YV-----------FTIGSCA--VSWKATLQPVVAQSTTEAEYMAIAEACKESVWLKGLFA 1237

Query: 1225 ELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSEN 1284
            EL      + L CDSQSAI LT +Q++H RTKHID+++H +R+++A  ++ + KI   +N
Sbjct: 1238 ELCGVDSCINLFCDSQSAICLTKDQMFHERTKHIDIKYHYVRDIVAQGKLKVCKISIHDN 1297

Query: 1285 TTDKLTKPVTSDKF 1298
              D +TKP+   KF
Sbjct: 1298 PADMMTKPIPVAKF 1311


>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
            (japonica cultivar-group)]
          Length = 1373

 Score =  952 bits (2462), Expect = 0.0
 Identities = 543/1309 (41%), Positives = 767/1309 (58%), Gaps = 61/1309 (4%)

Query: 5    KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDWNEMK--EKAAGLIT 62
            KF++   +    F LWQ   + +LAQ        D      A     E++  +KA  LI 
Sbjct: 2    KFDLPLLNYDTRFSLWQVKMRGILAQTHDYDEALDNFGKRRAEWTAEEIRKDQKALALIQ 61

Query: 63   LCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYA 122
            L + +D++   L   T  ++W KLES+ MSK   +K+  K +L++LKM+E   +  H+  
Sbjct: 62   LHLHNDILQECLTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMAE 121

Query: 123  FNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPHAQ 182
            F  I+AD+  + V  DDED  ++LLCSLP SY +   T+   +D +TL  +   L    +
Sbjct: 122  FKKIVADLVSMEVKYDDEDLGLLLLCSLPNSYANFRDTILLSRDELTLKEVYDALQNKEK 181

Query: 183  RRQSVEEGGGSS--GEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQIGH 240
             +  V+  G SS  GE L V+G  +      K  D   + +SK    K   C  CK   H
Sbjct: 182  MKIMVQNDGSSSSKGEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-CVYCKLKNH 240

Query: 241  WKRDCPNRSGKSGNNSSSANVVQSDGSCSEED-----LLCVSSVKCTDAWVLDSGCSYHM 295
               +C     K   N     V  +  + S++D     ++    V   D W+LDS CS+H+
Sbjct: 241  NIDECKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGCVAGHDEWILDSACSFHI 300

Query: 296  TQHREWFNSFKSGDLG-YVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNL 354
               R WF+S+K    G  V +GDD PC I G+  V+I  DDG  RTL  VRY+  + +NL
Sbjct: 301  CTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYIPGMSRNL 360

Query: 355  ISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGD--VASVET 412
            ISL TL   GY +   +   +L+VSKG++  ++    +  +Y L G T+ G    A+  T
Sbjct: 361  ISLSTLDAEGYKYSGSDG--VLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGSDSAAAAIT 418

Query: 413  DDDATK--LWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHH 470
            +D+ +K  LWHMRLGH+S  GM EL KRN+LKG  S  I  C++C+ GK  RV+F T  H
Sbjct: 419  NDEPSKTNLWHMRLGHMSHLGMTELMKRNLLKGCTSSKIKFCEHCIFGKHKRVQFNTSVH 478

Query: 471  KTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAE 530
             TKG LDYVH+D+ GP+K+PS+ G RY +T  DD+SRKVW YF+K+K + F  FK WK  
Sbjct: 479  TTKGTLDYVHADLWGPSKKPSLGGARYMLTIIDDYSRKVWPYFLKHKDDTFTAFKNWKVM 538

Query: 531  VENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEK 590
            +E QT RK+K LR+DNG E+    F  +C + GI RH ++  TPQQNGVAERMNRT+  +
Sbjct: 539  IERQTERKVKLLRTDNGGEFCSHAFNDYCRQEGIVRHHTIPHTPQQNGVAERMNRTIISR 598

Query: 591  ARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPA 650
            ARC+  +A ++K  WA   + ACYL+NRSP   L+ K   EVW+G P D S L++F   A
Sbjct: 599  ARCMLSHARMNKRFWAEAASTACYLINRSPSIPLNKKTPIEVWSGTPADYSQLKVFGCTA 658

Query: 651  YVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQS-D 709
            Y H+   D  KL+P++  C+ +GY  GVKGYKLW+P   K  +SR VVF+E  M   S  
Sbjct: 659  YAHV---DNGKLEPRAVKCLFLGYGSGVKGYKLWNPETGKTFMSRSVVFNESVMFTNSLP 715

Query: 710  VTVVPDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSG---------------E 754
               VP+ E++        VD ++T V    +   +Q +  +D                 +
Sbjct: 716  SEHVPEKELQRMHMQVEHVD-DDTGVQVEPV--HEQDDHNNDVADDDAHDDVQQTPPILQ 772

Query: 755  VQDYTLVRDREPSRITPPVRYGFED--LAAYAL-----LTSSGDPSTYHEAMAS*EKDKW 807
            +++   +  R+  R T P +   E+  L+ YAL     + +  +P+TY EA+   + + W
Sbjct: 773  LEEELPIAQRKSKRTTKPPKRLIEECNLSYYALSCAEQVENVHEPATYKEAVRCGDSENW 832

Query: 808  MSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQ 867
            +SAM EEM+SL+KN T  +V LP  K+ I CKW++K+K  ++  E  K+KA LV +GYSQ
Sbjct: 833  ISAMHEEMQSLEKNSTWEVVPLPKKKKTISCKWIFKRKEGLSSSEPPKYKARLVARGYSQ 892

Query: 868  HKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFS 927
              G+DY+++FSPVV+H+SIR  L++VAS D+ LEQ+DVKT FLHG LEE IY++QPEGF 
Sbjct: 893  IPGVDYNDVFSPVVKHSSIRTFLSIVASHDLELEQLDVKTAFLHGELEEDIYMDQPEGFI 952

Query: 928  ETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFL 987
              G  + VCKLKRSLYGLKQSPRQW KRFDS+ML   ++R  YD CVY+  + +GS I+L
Sbjct: 953  VPGKEKYVCKLKRSLYGLKQSPRQWNKRFDSFMLSHSFKRSKYDSCVYIKHV-NGSPIYL 1011

Query: 988  LLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSY 1047
            LLYVDDMLIAA    ++ +LK  L  EFDMKDLG+AKKILGMEI +DR +  L+LSQ +Y
Sbjct: 1012 LLYVDDMLIAAKSKIEITKLKKLLSSEFDMKDLGSAKKILGMEISRDRKSGLLFLSQHNY 1071

Query: 1048 VEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPY--AVQLVV*CMLWF 1105
            ++ VL RF+M  A  VSTP+  HFKLS  Q P ID+EIE MS++PY  AV  ++  M+  
Sbjct: 1072 IKKVLQRFNMQNAKAVSTPIAPHFKLSAAQCPSIDAEIEYMSRVPYSSAVGSLMYAMVCS 1131

Query: 1106 ALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGL 1165
              D  +   LV   +    +E      +    I     G+ Y   +  R      G+ G 
Sbjct: 1132 RPDLSYAMSLVSRYMSNPGKE----HWRAVQWIFRYLRGTTYSCLKFGR---TDKGLIGY 1184

Query: 1166 CR*SR**KVYNRICL----YSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLT 1220
                    +  R  L    ++ G   + W++++QS VA+STTEAEYMA+ EA KE +WL 
Sbjct: 1185 VDSDYAADLDRRRSLTGYVFTIGSCAVSWRATLQSVVALSTTEAEYMAICEACKELIWLK 1244

Query: 1221 GLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELL 1269
            GL  EL   +  + LHCDS+SAIYLT +Q++H RTKHID+++H +R+++
Sbjct: 1245 GLYAELSGVESCISLHCDSESAIYLTKDQMFHERTKHIDIKYHFVRDVI 1293


>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
            (japonica cultivar-group)]
          Length = 2340

 Score =  920 bits (2378), Expect = 0.0
 Identities = 511/1116 (45%), Positives = 705/1116 (62%), Gaps = 53/1116 (4%)

Query: 17   FGLWQRMAKDLLAQKSLQKALRDEKPADIATVDW-NEMKE---KAAGLITLCVSDDVMNH 72
            F LWQ   + +LAQ+ L  AL      D  T DW N+ K+   KA   I L +S++++  
Sbjct: 224  FSLWQVKMRAVLAQQDLDDALSG---FDKRTHDWSNDEKKRDRKAMSYIHLHLSNNILQE 280

Query: 73   ILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTR 132
            +L       +W KLE + M+K   +K+  KQ L+  K+Q+ G +  H+ AF  I+AD+  
Sbjct: 281  VLKEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSAFKEIIADLES 340

Query: 133  LGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPHAQRRQSV-EEGG 191
            + V  D+ED  +ILLCSLP SY +   T+ Y +D++TL  +   L    + ++ V  EG 
Sbjct: 341  MEVKYDEEDLGLILLCSLPSSYANFRDTILYSRDTLTLKEVYDALHVKEKMKKMVPSEGS 400

Query: 192  GSSGEGLFVKGGQDRGRGKGKAVD---SGKKKRSKSKDRKTTECYSCKQIGH-----WK- 242
             S  EGL V G Q     K ++ D   S  + RSKS+ R  + C  CK+ GH     WK 
Sbjct: 401  NSQAEGLIVWGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKS-CKYCKRDGHDIFECWKL 459

Query: 243  RDCPNRSGK-----SGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQ 297
             D   R+GK            A VV  + S +E  +      + +D W+L++ C YHM  
Sbjct: 460  HDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTSDQWILNTACIYHMCP 519

Query: 298  HREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISL 357
            +R+WF ++++  +G V +GDD PC + G+  V+I + DG +RTLS VR++  + ++LISL
Sbjct: 520  NRDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRHIPNLKRSLISL 579

Query: 358  GTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVE---TDD 414
             TL   GY +   +   IL+V+KG++ VM+A   + N+Y L G TI+G+VA+V    ++ 
Sbjct: 580  CTLDRKGYKYSGGDG--ILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDSLSNS 637

Query: 415  DATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKG 474
            DAT LWHMRLGH++E G+ EL KR +L G     +  C++C+ GK  RV+F T  H T+G
Sbjct: 638  DATNLWHMRLGHMTEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEG 697

Query: 475  ILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQ 534
            ILDYVHSD+ GP ++ S  G RY +T  DD+SRKVW YF+K+K + F  FK WK  VE Q
Sbjct: 698  ILDYVHSDLWGPARKTSFGGTRYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQ 757

Query: 535  TGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCL 594
            T RK+K LR+DNG E+  K F  +C+  GI RH++V  TPQQNGVAERMNRT+  KARCL
Sbjct: 758  TERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCL 817

Query: 595  RLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHI 654
              NA L K  WA  ++ ACYL+NRSP  ++D K   EVW+G+P + S+LR+F   AY H+
Sbjct: 818  LSNAGLPKQFWAEAVSTACYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTAYAHV 877

Query: 655  SSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVP 714
               D  KL+P++  CI +GY  GVKGYKLW P  KKV++SR+VVF E  ML        P
Sbjct: 878  ---DNGKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHESVMLHDK-----P 929

Query: 715  DTEVENSSQDKIQVDIEETPVS----PRQIVAQQQSEP---GSDSGEVQD---YTLVRDR 764
             T V   SQ+K  V +E    S     ++ VA  Q EP    S+S  VQ     ++ +DR
Sbjct: 930  STNVPVESQEKASVQVEHLISSGHAPEKEDVAINQDEPVIEDSNSSIVQQSPKRSIAKDR 989

Query: 765  EPSRITPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESL 818
                I PP RY  E ++ AYAL     +  + +PSTY EA+ S + ++W++AM +EMESL
Sbjct: 990  PKRNIKPPQRYIEEANIVAYALSVAEEIEGNAEPSTYSEAIVSDDCNRWITAMHDEMESL 1049

Query: 819  KKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFS 878
            +KN T  LV+LP  K+ I CKW++K+K  ++  +  ++KA L+ KGYSQ  GID++++FS
Sbjct: 1050 EKNHTWELVKLPKEKKPIRCKWIFKRKEGISSSDEARYKARLIAKGYSQIPGIDFNDVFS 1109

Query: 879  PVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKL 938
            PVV+H+SIR +L++VA  D  LEQMDVKT FLHG LEE IY+EQP+GF   G   LVC+L
Sbjct: 1110 PVVKHSSIRTLLSIVAMHDYELEQMDVKTAFLHGELEEDIYMEQPKGFVVPGKENLVCRL 1169

Query: 939  KRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAA 998
            K+SLYGLKQSPRQWYKRFDS+ML   +RR +YD CVY + + DGS I+LLLYVDDMLIAA
Sbjct: 1170 KKSLYGLKQSPRQWYKRFDSFMLSQKFRRSNYDSCVY-LKVVDGSAIYLLLYVDDMLIAA 1228

Query: 999  NHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMS 1058
                ++ +LK +L  EF+MKDLGAAKKILGMEI + R + KL+LSQK Y+E VL RF+M 
Sbjct: 1229 KDKSEIAKLKAQLSSEFEMKDLGAAKKILGMEITRKRHSFKLYLSQKGYIEKVLRRFNMH 1288

Query: 1059 KANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
             A  VSTPL  HF+LS +  P+ D +IE MS++PY+
Sbjct: 1289 DAKPVSTPLAAHFRLSSDLCPQSDYDIEYMSRVPYS 1324



 Score =  126 bits (316), Expect = 6e-27
 Identities = 62/120 (51%), Positives = 85/120 (70%), Gaps = 1/120 (0%)

Query: 1180 LYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCD 1238
            +++ GG  + WK+S+Q+TVA+STTEAEYMA++EA KEA+WL GL   L      + + CD
Sbjct: 1409 VFTIGGCAVSWKASLQATVALSTTEAEYMAISEACKEAIWLRGLYTVLCAVTSCINIFCD 1468

Query: 1239 SQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
            SQSAI LT +Q++H RTKHIDVR+H IR L+A   + + KI   +N  D +TKPV + KF
Sbjct: 1469 SQSAICLTKDQMFHERTKHIDVRYHFIRGLIAEGDVKICKISIHDNPADMMTKPVPATKF 1528


>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
          Length = 1342

 Score =  888 bits (2294), Expect = 0.0
 Identities = 537/1381 (38%), Positives = 768/1381 (54%), Gaps = 148/1381 (10%)

Query: 7    EVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADI--ATVDWN------------- 51
            EV +FDG G++ LW+      +    L + L +E+ A +  +T + +             
Sbjct: 7    EVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETATSK 66

Query: 52   -------EMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQR 104
                   E + KA   I L + ++V+  ++   T   +   L+ L+M+K+  N+++ KQR
Sbjct: 67   LEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYLKQR 126

Query: 105  LYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYG 164
            LY  KM E   ++ +V  F  +++D+  + V V DED+AI+LL SLP  +D L  TL Y 
Sbjct: 127  LYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLKYC 186

Query: 165  KDSITLDSISSTLLPHAQRRQSVEEGGGS------SGEGLFVKGGQDRGRGKGKAVDSGK 218
            K ++ L+ I+S +      R  + E G S      + +GLFV   QDRGR + +     K
Sbjct: 187  KTTLHLEEITSAI------RSKILELGASGKLLKNNSDGLFV---QDRGRSETRGKGPNK 237

Query: 219  KK-RSKSKDRKTTECYSCKQIGHWKRDC-----PNRSGKSGNNSSSANVVQ--SDGSCSE 270
             K RSKSK    T C+ C + GH+K+ C      N+ G +     ++ V    +D +   
Sbjct: 238  NKSRSKSKGAGKT-CWICGKEGHFKKQCYVWKERNKQGSTSERGEASTVTARVTDAAALV 296

Query: 271  EDLLCVSSVKCT-DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQV 329
                 +   + T D W+LD+GCS+HMT  ++W   FK    G V +G+D    +KG+  V
Sbjct: 297  VSRALLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEVKGIGDV 356

Query: 330  KIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAK 389
            +I  +DG    L+ VRY+ E+ KNLISLGTL + G  F+S++   IL + K  +TV+  K
Sbjct: 357  RIKNEDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKG--ILTIFKNDLTVLTGK 414

Query: 390  RTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCII 449
            + +  +Y L G T+ G+   ++ + D T LWH RLGH+  +G+  L  +  L        
Sbjct: 415  KES-TLYFLQGTTLAGEANVIDKEKDETSLWHSRLGHIGAKGLQVLVSKGHLD------- 466

Query: 450  GLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEP-SV*GFRYFVTFTDDFSRK 508
                     K   + F    H TK  LDYVHSD+ G T  P S+   +YF+TF DDF+R+
Sbjct: 467  ---------KNIMISFGAAKHVTKDKLDYVHSDLWGSTNVPFSIGKCQYFITFIDDFTRR 517

Query: 509  VWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHF 568
             W+YF++ K E F+KF  WK ++ENQ  +K+K L +DNG E+ ++ F  FC + G+ RH 
Sbjct: 518  TWIYFIRTKDEAFSKFVEWKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKEGVIRHR 577

Query: 569  SVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKV 628
            +   TPQQNGVAERMNRT+  K RC+   + L K  WA   + A +L+N+SP +S++  +
Sbjct: 578  TCAYTPQQNGVAERMNRTIMNKVRCMLSESGLGKQFWAEAASTAVFLINKSPSSSIEFDI 637

Query: 629  AEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVK 688
             EE WTG+P D   L+ F   AY+H    D+ KL+P++K  I +GY  GVK +K+W    
Sbjct: 638  PEEKWTGHPPDYKILKKFGSVAYIH---SDQGKLNPRAKKGIFLGYPDGVKRFKVWLLED 694

Query: 689  KKVIVSRDVVFDE*SMLKQSDVTVVPDTEVENSSQDKIQVDIEETPVSPRQIVA--QQQS 746
            +K +VSRD+VF E  M K+           + S +DK   ++E T +  + + A  + QS
Sbjct: 695  RKCVVSRDIVFQENQMYKELQKN-------DMSEEDKQLTEVERTLIELKNLSADDENQS 747

Query: 747  EPGSDSGEVQ------------------------DYTLVRDREPSRITPPVRYGFED--L 780
            E G +S + Q                        +Y L RDR   +I  P R+  ED  L
Sbjct: 748  EGGDNSNQEQASTTRSASKDKQVEETDSDDDCLENYLLARDRIRRQIRAPQRFVEEDDSL 807

Query: 781  AAYAL-LTSSGD---PSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVI 836
              +AL +T  G+   P TY EAM S E +KW  A +EEM+S+KKN+T +++  P GKRVI
Sbjct: 808  VGFALTMTEDGEVYEPETYEEAMRSPECEKWKQATIEEMDSMKKNDTWDVIDKPEGKRVI 867

Query: 837  GCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASM 896
            GCKW++K+K  +   E  ++KA LV KG+SQ +GIDY EIFSPVV+H SIR +L++V   
Sbjct: 868  GCKWIFKRKAGIPGVEPPRYKARLVAKGFSQREGIDYQEIFSPVVKHVSIRYLLSIVVQF 927

Query: 897  DMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRF 956
            DM LEQ+DVKT FLHGNL+E I + QPEG+ +      VC LK+SLYGLKQSPRQW +RF
Sbjct: 928  DMELEQLDVKTAFLHGNLDEYILMSQPEGYEDEDSTEKVCLLKKSLYGLKQSPRQWNQRF 987

Query: 957  DSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFD 1016
            DS+M+  GY+R  Y+ CVY   L+DGS+I+LLLYVDDMLIA+ +   + +LK  L +EF+
Sbjct: 988  DSFMINSGYQRSKYNPCVYTQQLNDGSYIYLLLYVDDMLIASQNKDQIQKLKESLNREFE 1047

Query: 1017 MKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLE 1076
            MKDLG A+KILGMEI ++R    L LSQ  YV GVL  F M ++    TPL  HFKL   
Sbjct: 1048 MKDLGPARKILGMEITRNREQGILDLSQSEYVAGVLRAFGMDQSKVSQTPLGAHFKLRAA 1107

Query: 1077 QSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIW---------------HKQLVKCQ 1119
                +  + E M  +PY  A+  ++  M+    D  +               H Q VK  
Sbjct: 1108 NEKTLARDAEYMKLVPYPNAIGSIMYSMIGSRPDLAYPVGVVSRFMSKPSKEHWQAVKWV 1167

Query: 1120 VHVQ--AREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNR 1177
            +      ++  L   + D   + GY  S Y     A        I G             
Sbjct: 1168 MRYMKGTQDTCLRFKKDDKFEIRGYCDSDY-----ATDLDRRRSITGF------------ 1210

Query: 1178 ICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLH 1236
              +++ GG  I WKS +Q  VA+STTEAEYMA+AEA KEA+WL GL  E+G EQ  V++ 
Sbjct: 1211 --VFTAGGNTISWKSGLQRVVALSTTEAEYMALAEAVKEAIWLRGLAAEMGFEQDAVEVM 1268

Query: 1237 CDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSD 1296
            CDSQSAI L+ N V+H RTKHIDVR+H IRE +A  +I + KI T+ N  D  TK V   
Sbjct: 1269 CDSQSAIALSKNSVHHERTKHIDVRYHFIREKIADGEIQVVKISTTWNPADIFTKTVPVS 1328

Query: 1297 K 1297
            K
Sbjct: 1329 K 1329


>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi|11278366|pir||T47492
            copia-like polyprotein - Arabidopsis thaliana
          Length = 1363

 Score =  881 bits (2276), Expect = 0.0
 Identities = 526/1365 (38%), Positives = 770/1365 (55%), Gaps = 85/1365 (6%)

Query: 3    GAKFEVTRFDGTGNFGLWQRMAK---DLLA--------------QKSLQKALRDEKPADI 45
            GA+ EV +FDG G++ +W+       D+L               ++  +K+  DEK    
Sbjct: 3    GARIEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEERE 62

Query: 46   ATVDWNEMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRL 105
                + E K KA   I L VSD V+  I   T+   + + L+ LYMSK   N+++ KQ+L
Sbjct: 63   KMEAFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQKL 122

Query: 106  YSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTY-- 163
            YS KM E   ++ ++  F +I+AD+  L V V DED+AI+LL SLP  +D L  TL Y  
Sbjct: 123  YSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYSS 182

Query: 164  GKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSK 223
            GK  ++LD +++ +        SV++      EGL+VK   +  RG+ +  D GK KRSK
Sbjct: 183  GKTVLSLDEVAAAIYSRELEFGSVKKSIKGQAEGLYVKDKAEN-RGRSEQKDKGKGKRSK 241

Query: 224  SKDRKTTECYSCKQIGHWKRDCPNRS----------------GKSGNNSSSANVVQSDGS 267
            SK ++   C+ C + GH K  CPN++                GK      S N V+S G 
Sbjct: 242  SKSKRG--CWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKGNLVEGSVNFVESAGM 299

Query: 268  CSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMR 327
               E  L  + +   D W++D+GC YHMT  REW   F     G V +G+     +KG+ 
Sbjct: 300  FVSE-ALSSTDIHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGSVRMGNKSISRVKGVG 358

Query: 328  QVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMR 387
             V+I  D+G   TL  VRY+ ++ +NL+SLGT  + G+ F+SE    +LR+  G   ++ 
Sbjct: 359  TVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFEKAGHKFESENG--MLRIKSGNQVLLE 416

Query: 388  AKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSC 447
             +R    +Y L G     +  +V   +D T LWH RL H+S++ M  L K+  L   +  
Sbjct: 417  GRRY-DTLYILHGKPATDESLAVARANDDTVLWHRRLCHMSQKNMSLLIKKGFLDKKKVS 475

Query: 448  IIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRG-PTKEPSV*GFRYFVTFTDDFS 506
            ++  C+ C+ G+  ++ F    H TK  L+YVHSD+ G PT   S+   +YF++F DD++
Sbjct: 476  MLDTCEDCIYGRAKKIGFNLAQHDTKKKLEYVHSDLWGAPTVPMSLGNCQYFISFIDDYT 535

Query: 507  RKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQR 566
            RKVWVYF+K K E F KF  W + VENQ+G ++K LR+DNG E+ ++ F  FCEE G QR
Sbjct: 536  RKVWVYFLKTKDEAFEKFVSWISLVENQSGERVKTLRTDNGLEFCNRMFDGFCEEKGFQR 595

Query: 567  HFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDG 626
            H +   TPQQNGV ERMNRT+ EK R +  ++ L K  WA   + A  L+N++P ++++ 
Sbjct: 596  HRTCAYTPQQNGVVERMNRTIMEKVRSMLCDSGLPKRFWAEATHTAVLLINKTPCSAINF 655

Query: 627  KVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDP 686
            +  ++ W+G     S LR +    +VH    D  KL+ ++K  ++IGY  GVKGYK+W  
Sbjct: 656  EFPDKRWSGKAPIYSYLRRYGCVTFVHT---DGGKLNLRAKKGVLIGYPSGVKGYKVWLI 712

Query: 687  VKKKVIVSRDVVFDE*SMLKQ-SDVTVVPDTEVENSSQDKIQVDIE------------ET 733
             +KK +VSR+V F E ++ K           E ++ +   I +D+E            + 
Sbjct: 713  EEKKCVVSRNVSFQENAVYKDLMQRKEQVSCEEDDHAGSYIDLDLEADKDNSSGGEQSQA 772

Query: 734  PVSPRQIVAQQQSEPGSDSGEVQD---------YTLVRDREPSRITPPVRYGFEDLAAYA 784
             V+P    A   + P  ++ ++++         Y LVRDRE   I  P R+  ED  A A
Sbjct: 773  QVTPATRGAVTSTPPRYETDDIEETDVHQSPLSYHLVRDRERREIRAPRRFDDEDYYAEA 832

Query: 785  LLTSSG----DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKW 840
            L T+      +P+ Y EA+     DKW  AM EE+ES  KN+T   V  P  +R+IG +W
Sbjct: 833  LYTTEDGDAVEPADYKEAVRDENWDKWRLAMNEEIESQLKNDTWTTVTRPEKQRIIGSRW 892

Query: 841  VYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHL 900
            +YK K  +   E  +FKA LV KGY+Q +G+DY EIF+PVV+H SIR++L++VA  ++ L
Sbjct: 893  IYKYKQGIPGVEEPRFKARLVAKGYAQREGVDYHEIFAPVVKHVSIRILLSIVAQENLEL 952

Query: 901  EQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYM 960
            EQ+DVKT FLHG L+E+IY+  PEG         VC L +SLYGLKQ+PRQW ++F+ YM
Sbjct: 953  EQLDVKTAFLHGELKEKIYMMPPEGCESLFKENEVCLLNKSLYGLKQAPRQWNEKFNHYM 1012

Query: 961  LRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDL 1020
              IG++R DYD C Y   L D S ++LL YVDDML+AAN++  ++ LK +L  +F+MKDL
Sbjct: 1013 TEIGFKRSDYDSCAYTKKLSDDSTMYLLFYVDDMLVAANNMQAIDALKKELSIKFEMKDL 1072

Query: 1021 GAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPK 1080
            GAAKKILG+EI  DR A  LWLSQ+SY+  VL  F+M ++    TPL  H K+      K
Sbjct: 1073 GAAKKILGIEIIIDREAGVLWLSQESYLNKVLKTFNMLESKPALTPLGAHLKMKSATEEK 1132

Query: 1081 IDSEIEGMSKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKI 1138
            + +E E M+ +PY  AV  ++  M+    D  +   +V   +   A+E  LG   V  + 
Sbjct: 1133 LSTEEEYMNSVPYSSAVGSIMYAMIGTRPDLAYPVGVVSRFMSQPAKEHWLGVKWV-LRY 1191

Query: 1139 LEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICL----YSCGG-PICWKSSV 1193
            ++G   +R    + +        ICG C       +  R  +    ++ GG  I WKS +
Sbjct: 1192 IKGTVDTRLCYKRNS-----DFSICGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGL 1246

Query: 1194 QSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHA 1253
            Q  VA S+TE EYM++ EA KEA+WL GL+K+ G EQ  V++ CDSQSAI L+ N V+H 
Sbjct: 1247 QRVVAQSSTECEYMSLTEAVKEAIWLKGLLKDFGYEQKNVEIFCDSQSAIALSKNNVHHE 1306

Query: 1254 RTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
            RTKHIDV+FH IRE++A  ++ + KI T +N  D  TK +  +KF
Sbjct: 1307 RTKHIDVKFHFIREIIADGKVEVSKISTEKNPADIFTKVLPVNKF 1351


>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301696|pir||F84486 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1356

 Score =  875 bits (2261), Expect = 0.0
 Identities = 516/1360 (37%), Positives = 758/1360 (54%), Gaps = 84/1360 (6%)

Query: 4    AKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPA-------DIATVDWNEM--- 53
            A+ EV +FDG G++ +W+      +    L  AL++ +         D +  D+ E    
Sbjct: 4    ARIEVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEK 63

Query: 54   -------KEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLY 106
                   K+KA   I L V+D V+  I   +T   +   L+ LYMSK   N+++ KQ+LY
Sbjct: 64   FEALEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQKLY 123

Query: 107  SLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTY--G 164
            S KM E   ++ ++  F  I+ D+  + V + DED+AI+LL +LP ++D L  TL Y  G
Sbjct: 124  SFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKYSSG 183

Query: 165  KDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKS 224
            K  +TLD +++ +        SV++      EGL+VK   D+   KGK    GK K  K 
Sbjct: 184  KSILTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVK---DKNENKGKGEQKGKGKGKKG 240

Query: 225  KDRKTTECYSCKQIGHWKRDCPNRSG---------KSGNNSSSANVVQSDGSCSEEDLLC 275
            K +K   C++C + GH++  CPN++          K  ++    N+ ++ G    E  L 
Sbjct: 241  KSKKKPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAAGYYVSE-ALS 299

Query: 276  VSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDD 335
             + V   D W+LD+GCSYHMT  REWF+ F     G V +G+     ++G+  +++   D
Sbjct: 300  STEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGTIRVKNSD 359

Query: 336  GGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNI 395
            G    L+ VRY+ ++ +NL+SLGT  + GY F+SE+   ILR+  G   ++  +R    +
Sbjct: 360  GLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDG--ILRIKAGNQVLLTGRRY-DTL 416

Query: 396  YKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYC 455
            Y L    +  +  +V    D T LWH RL H+S++ M  L ++  L   +   + +C+ C
Sbjct: 417  YLLNWKPVASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDKKKVSSLDVCEDC 476

Query: 456  VLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEP-SV*GFRYFVTFTDDFSRKVWVYFM 514
            + GK  R  F   HH TK  L+Y+HSD+ G    P S+   +YF++  DDF+RKVWVYFM
Sbjct: 477  IYGKAKRKSFSLAHHDTKEKLEYIHSDLWGAPFVPLSLGKCQYFMSIIDDFTRKVWVYFM 536

Query: 515  KYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTP 574
            K K E F KF  W   VENQT R++K LR+DNG E+ +K F  FCE  GI RH +   TP
Sbjct: 537  KTKDEAFEKFVEWVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCESIGIHRHRTCAYTP 596

Query: 575  QQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWT 634
            QQNGVAERMNRT+ EK R +  ++ L K  WA   +    L+N++P ++L+ ++ ++ W+
Sbjct: 597  QQNGVAERMNRTIMEKVRSMLSDSGLPKRFWAEATHTTVLLINKTPSSALNFEIPDKKWS 656

Query: 635  GNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVS 694
            GNP   S LR +   A+VH    D  KL+P++K  ++IGY  GVKGYK+W   ++K +VS
Sbjct: 657  GNPPVYSYLRRYGCVAFVHT---DDGKLEPRAKKGVLIGYPVGVKGYKVWILDERKCVVS 713

Query: 695  RDVVFDE*SMLK-----QSDVTVVPDTEVEN-------------SSQDKIQVDIEETPVS 736
            R+++F E ++ K     Q +V+   D +  +             S  D+  V+    P S
Sbjct: 714  RNIIFQENAVYKDLMQRQENVSTEEDDQTGSYLEFDLEAERDVISGGDQEMVNTIPAPES 773

Query: 737  PRQIVAQQQSEPGSDSGEVQD----YTLVRDREPSRITPPVRYGFEDLAAYALLTSSG-- 790
            P       Q     +  +V      Y LVRDR+   I  P R+  ED  A AL T+    
Sbjct: 774  PVVSTPTTQDTNDDEDSDVNQSPLSYHLVRDRDKREIRAPRRFDDEDYYAEALYTTEDGE 833

Query: 791  --DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAV 848
              +P  Y +A      DKW  AM EE++S +KN T  +V  P  +R+IGC+W++K KL +
Sbjct: 834  AVEPENYRKAKLDANFDKWKLAMDEEIDSQEKNNTWTIVTRPENQRIIGCRWIFKYKLGI 893

Query: 849  T*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTT 908
               E  +FKA LV KGY+Q +GIDY EIF+PVV+H SIRV+L++VA  D+ LEQ+DVKT 
Sbjct: 894  LGVEEPRFKARLVAKGYAQKEGIDYHEIFAPVVKHVSIRVLLSIVAQEDLELEQLDVKTA 953

Query: 909  FLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRC 968
            FLHG L+E+IY+  PEG+        VC L ++LYGLKQ+P+QW ++FD++M  I + + 
Sbjct: 954  FLHGELKEKIYMSPPEGYESMFKANEVCLLNKALYGLKQAPKQWNEKFDNFMKEICFVKS 1013

Query: 969  DYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILG 1028
             YD C Y   L DGS ++LL+YVDD+L+A+ +   +  LK  LG  F+MKDLGAAKKILG
Sbjct: 1014 AYDSCAYTKVLPDGSVMYLLIYVDDILVASKNKEAITALKANLGMRFEMKDLGAAKKILG 1073

Query: 1029 MEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGM 1088
            MEI +DR    LWLSQ+ Y+  +L  ++M++A    TPL  HFK       K+  + + M
Sbjct: 1074 MEIIRDRTLGVLWLSQEGYLNKILETYNMAEAKPAMTPLGAHFKFQAATEQKLIRDEDFM 1133

Query: 1089 SKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSR 1146
              +PY  AV  ++  ML    D  +   ++   +    +E  LG   V    L    G+ 
Sbjct: 1134 KSVPYSSAVGSIMYAMLGTRPDLAYPVGIISRFMSQPIKEHWLGVKWV----LRYIKGT- 1188

Query: 1147 YHV*QGARCC---SISCGICGLCR*SR**KVYNRICL----YSCGG-PICWKSSVQSTVA 1198
                   R C   S S  I G C       +  R  +    ++ GG  I WKS +Q  VA
Sbjct: 1189 ----LKTRLCYKKSSSFSIVGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGLQRVVA 1244

Query: 1199 MSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHI 1258
             STTE+EYM++ EA KEA+WL GL+K+ G EQ  V++ CDSQSAI L+ N V+H RTKHI
Sbjct: 1245 QSTTESEYMSLTEAVKEAIWLKGLLKDFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHI 1304

Query: 1259 DVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
            DV++H IRE+++   + + KI T +N  D  TK +   KF
Sbjct: 1305 DVKYHFIREIISDGTVEVLKISTEKNPADIFTKVLAVSKF 1344


>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
            gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
          Length = 1280

 Score =  869 bits (2246), Expect = 0.0
 Identities = 486/1091 (44%), Positives = 676/1091 (61%), Gaps = 53/1091 (4%)

Query: 5    KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDW-NEMKEK---AAGL 60
            K+++   D    F LWQ   + +LAQ+ L  AL      D  T DW N+ K+K   A   
Sbjct: 40   KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSG---FDKRTQDWSNDEKKKDRKAMSY 96

Query: 61   ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
            I L +S++++  +L   T   +W KLE + M+K   +K+  KQ+L+  K+Q+ G +  H+
Sbjct: 97   IHLHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHL 156

Query: 121  YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPH 180
              F  I+AD+  + V  D+ED  +ILLCSLP SY +   T+ Y  D++ L  +   L   
Sbjct: 157  STFKEIVADLESIEVKYDEEDLGLILLCSLPSSYANFRDTILYSHDTLILKEVYDALHAK 216

Query: 181  AQRRQSV-EEGGGSSGEGLFVKGGQDRGRGKGKAVD---SGKKKRSKSKDRKTTECYSCK 236
             + ++ V  EG  S  EGL V+G Q     K ++ D   S  + RSKS+ R  + C  CK
Sbjct: 217  EKMKKMVPSEGSNSQAEGLVVRGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKS-CKYCK 275

Query: 237  QIGH-----WK-RDCPNRSGK-----SGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAW 285
            + GH     WK +D   R+GK            A VV  + S +E  +      + +D W
Sbjct: 276  RDGHDISECWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDTELLVAYAGCAQTSDQW 335

Query: 286  VLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVR 345
            +LD+  +YHM  +R+WF ++++   G V +GDD PC + G+  V+I + DG +RTLS VR
Sbjct: 336  ILDTAWTYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLSDVR 395

Query: 346  YVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMG 405
            ++  + ++LISL TL   GY +   +   IL+V+KG++ VM+A   + N+Y L G TI+G
Sbjct: 396  HIPNLKRSLISLCTLDRKGYKYSGGDG--ILKVTKGSLVVMKADIKSANLYHLRGTTILG 453

Query: 406  DVASVE---TDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCR 462
            +VA+V    ++ DAT LWHMRLGH+SE G+ EL KR +L G     +  C++C+ GK  R
Sbjct: 454  NVAAVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRELLDGQSIGKLKFCEHCIFGKHKR 513

Query: 463  VRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFA 522
            V+F T  H T+GILDYVHSD+ GP  + S  G RY +T  DD+SRKVW YF+K+K + F 
Sbjct: 514  VKFNTSTHTTEGILDYVHSDLWGPACKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFD 573

Query: 523  KFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAER 582
             FK WK  VE QT +K+K LR+DNG E+  K F  +C+  GI  H++V  TPQQNGVAER
Sbjct: 574  VFKEWKTMVERQTEKKVKILRTDNGMEFCSKIFKSYCKSEGIVHHYTVPHTPQQNGVAER 633

Query: 583  MNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSN 642
            MN  +  KARC+  NA L K  WA  ++  CYL+NRSP  + D K   EVW+G+P + S+
Sbjct: 634  MNMAIISKARCMLSNADLPKQFWAEAVSTTCYLINRSPSYATDKKTPIEVWSGSPANYSD 693

Query: 643  LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE* 702
            LR+F   AY H+   D  KL+P++  CI +GY  GVKGYKLW P  KKV++SR+VVF E 
Sbjct: 694  LRVFGCTAYAHV---DNGKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHES 750

Query: 703  SMLKQSDVTVVPDTEVENSSQDKIQVDIEETPVS----PRQIVAQQQSEP---GSDSGEV 755
             +L        P T V   SQ+K  V +E    S     ++ VA  Q  P    SDS  V
Sbjct: 751  VILHDK-----PSTNVPVESQEKASVQVEHLISSGHAPEKENVAINQDAPVIEDSDSSIV 805

Query: 756  QD---YTLVRDREPSRITPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDK 806
            Q     ++ +D+    I PP RY  E ++ AYAL     +  + +PSTY EA+ S + ++
Sbjct: 806  QQSSKRSIAKDKPKRNIKPPRRYIEEANIVAYALSVAEEIEGNAEPSTYSEAIVSDDCNR 865

Query: 807  WMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYS 866
            W++AM +EMESLKKN T   V+LP  K+ I CKW++K+K  ++  +  ++KA LV KGYS
Sbjct: 866  WITAMHDEMESLKKNHTWEFVKLPKEKKPIRCKWIFKRKEGMSPSDEARYKARLVAKGYS 925

Query: 867  QHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF 926
            Q  GID++++FSP+++H+SIR +L +VA  D  LEQMDVKT FLHG LEE IY+EQPEGF
Sbjct: 926  QIPGIDFNDVFSPIMKHSSIRTLLGIVAMHDYELEQMDVKTAFLHGELEEDIYMEQPEGF 985

Query: 927  SETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIF 986
               G   LVC+LK+SLYGLKQSPRQWYKRFDS+ML   +RR +YD CVY + + DGS I+
Sbjct: 986  VVLGKENLVCRLKKSLYGLKQSPRQWYKRFDSFMLSQKFRRSNYDSCVY-LKVVDGSAIY 1044

Query: 987  LLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKS 1046
            LLLYVDDMLIAA    ++ +LK +L  EF MKDLGAAKKILGMEI ++R + KL+LSQK 
Sbjct: 1045 LLLYVDDMLIAAKDKSEIAKLKAQLSSEFGMKDLGAAKKILGMEITRERHSGKLYLSQKG 1104

Query: 1047 YVEGVLSRFDM 1057
            Y++ VL RF+M
Sbjct: 1105 YIKKVLRRFNM 1115



 Score =  127 bits (320), Expect = 2e-27
 Identities = 62/120 (51%), Positives = 87/120 (71%), Gaps = 1/120 (0%)

Query: 1180 LYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCD 1238
            +++ GG  + WK+S+Q+TVA+STTEAEYMA++EA KEA+WL GL  EL      + + CD
Sbjct: 1150 VFTIGGCDVSWKASLQATVALSTTEAEYMAISEACKEAIWLRGLYTELCGVTSCINIFCD 1209

Query: 1239 SQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
            SQSAI LT +Q++H RTKHIDVR+H IR ++    + + KI T +N  D +TKPV++ KF
Sbjct: 1210 SQSAICLTKDQMFHERTKHIDVRYHIIRGVIVEGDVKVCKISTHDNPADMMTKPVSATKF 1269


>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
          Length = 1298

 Score =  865 bits (2236), Expect = 0.0
 Identities = 469/1100 (42%), Positives = 686/1100 (61%), Gaps = 29/1100 (2%)

Query: 4    AKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIAT-VDWNEMKEKAAGLIT 62
            AKFE+ +F+G  NF LW+   K +L + +   A+  E+P D      W+EM E A   + 
Sbjct: 3    AKFEIEKFNGK-NFSLWKLKVKAILRKDNCLAAI-SERPVDFTDDKKWSEMNEDAMADLY 60

Query: 63   LCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYA 122
            L ++D V++ I +  T  ++WD L  LY +K+  NK+F K++LY+L+M E   +  H+  
Sbjct: 61   LSIADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTEHLNT 120

Query: 123  FNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYG--KDSITLDSISSTLLPH 180
             N + + +T L   ++ +++A +LL SLP SYD L+  LT     D +  D +++ +L  
Sbjct: 121  LNTLFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDYLVFDDVAAAVLEE 180

Query: 181  AQRRQSVEEGGGS--SGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQI 238
              RR++ E+   +    E L V  G+   RG+     S  + RSKS  +  T CY+C + 
Sbjct: 181  ESRRKNKEDRQVNLQQAEALTVMRGRSTERGQ-----SSGRGRSKSSKKNLT-CYNCGKK 234

Query: 239  GHWKRDCPNRSGKSGNNSSSANVVQSDGS--CSEEDLLCVSSVKCTDAWVLDSGCSYHMT 296
            GH K+DC N +  S    + A+    DGS  C E  +      +  D W++DSG +YHMT
Sbjct: 235  GHLKKDCWNLAQNSNPQGNVAST-SDDGSALCCEASIAREGRKRFADIWLIDSGATYHMT 293

Query: 297  QHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLIS 356
              +EWF+ ++    G VY  DD    I G+  +K+ + DG V+T+  VR+V  + KNL+S
Sbjct: 294  SRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGLKKNLLS 353

Query: 357  LGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVET-DDD 415
             G L  +    ++++   ++++ +GA+ VM+ ++ A N+Y L G T+    ASV     D
Sbjct: 354  YGILDNSATQIETQKG--VMKIFQGALVVMKGEKIAANLYMLKGETLQEAEASVAACSPD 411

Query: 416  ATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGI 475
            +T LWH +LGH+S++GM  L ++ ++ G+    + LC++C+  KQ R++F T + + K +
Sbjct: 412  STLLWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFSTSNSRGKVV 471

Query: 476  LDYVHSDV-RGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQ 534
            L+ VHSDV + P   PS+ G +YFV+F DD+SR+ WVY +K KS+VFA FK +KA VE  
Sbjct: 472  LELVHSDVWQAPV--PSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAFKARVELD 529

Query: 535  TGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCL 594
            +G+KIK  R+DNG EYT + F  FC++ GI+R F+V  TPQQNGVAERMNRTL E+ R +
Sbjct: 530  SGKKIKCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYTPQQNGVAERMNRTLLERTRAM 589

Query: 595  RLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHI 654
               A L K  WA  +N ACYLVNR+P  +++ K   E+WTG P+D SNL IF    Y   
Sbjct: 590  LRAAGLEKSFWAEAVNTACYLVNRAPSTAIELKTPMEMWTGKPVDYSNLHIFGSIVYAMY 649

Query: 655  SSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVP 714
            ++++ +KLDPKS+ C  +GY  GVKGY+LWDP   KV++SRDV+F E   L++ +V    
Sbjct: 650  NAQEITKLDPKSRKCRFLGYADGVKGYRLWDPTAHKVVISRDVIFVE-DRLQRGEVD--D 706

Query: 715  DTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSGEVQDYTLVRDREPSRITPPVR 774
             TE E     +IQV+ E    S     A ++ EP S        T   DRE  R T    
Sbjct: 707  STEKEKPETTQIQVEEEFEQDSSEAEPAHEEPEPESSGAPT---TRQSDREKRRPTWHSD 763

Query: 775  YGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKR 834
            Y  E   AY LLT  G+PST+ EA+ S +  +W +AM EE+E+L KN T +LV LP G++
Sbjct: 764  YVMEGNVAYCLLTEDGEPSTFQEAINSSDVSQWTAAMQEEIEALHKNNTWDLVPLPQGRK 823

Query: 835  VIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVA 894
             IG KWV+K K      + E+++A LV KGY+Q +GID++EIFSPVVR T++RVVLA+ A
Sbjct: 824  PIGNKWVFKIKRNGD-DQVERYRARLVVKGYAQKEGIDFNEIFSPVVRLTTVRVVLAMCA 882

Query: 895  SMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYK 954
            + ++HLEQ+DVKT FLHG+LEE+IY+ QPEGF +  +  LVC+L +SLYGLKQ+PR WYK
Sbjct: 883  TFNLHLEQLDVKTAFLHGDLEEEIYMLQPEGFEDKENQNLVCRLNKSLYGLKQAPRCWYK 942

Query: 955  RFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKE 1014
            RFDS+++ +GY R + D C Y       +F+ LLLYVDDML+A  +   ++ELK +L +E
Sbjct: 943  RFDSFIMCLGYNRLNADPCAYFKRFGKDNFVILLLYVDDMLVAGPNKDHIDELKAQLARE 1002

Query: 1015 FDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLS 1074
            F+MKDLG A KILGM+IH+DRG +K+WLSQK+Y++ +LSRF M     +STPL  + K+S
Sbjct: 1003 FEMKDLGPANKILGMQIHRDRGNRKIWLSQKNYLKKILSRFSMQDCKSISTPLPINLKVS 1062

Query: 1075 LEQSPKIDSEIEGMSKIPYA 1094
               SP  +     MS++PYA
Sbjct: 1063 SSMSPSNEEGRMEMSRVPYA 1082



 Score =  125 bits (315), Expect = 7e-27
 Identities = 61/115 (53%), Positives = 82/115 (71%)

Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
            GG + W S +Q+ VA STTEAEY+A  +A+KEA+WL  L++ELG +Q  V L CDSQSA+
Sbjct: 1172 GGAVSWVSKLQAVVATSTTEAEYVAATQASKEAIWLKMLLEELGHKQEFVSLFCDSQSAL 1231

Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
            +L  N  +H+RTKHI V++H IRE +    + LQKIHT++N  D LTK +  DKF
Sbjct: 1232 HLARNPAFHSRTKHIRVQYHFIREKVKEGTVDLQKIHTADNVADFLTKIINVDKF 1286


>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
          Length = 1243

 Score =  863 bits (2230), Expect = 0.0
 Identities = 493/1123 (43%), Positives = 688/1123 (60%), Gaps = 71/1123 (6%)

Query: 5    KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDW-NEMKE---KAAGL 60
            K+++   D    F LWQ   + +LAQ+ L  AL      D  T DW N+ K+   KA   
Sbjct: 5    KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSG---FDKRTQDWSNDEKKRDRKAISY 61

Query: 61   ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
            I L +S++++  +L   T   +W KLE + M+K   +K+  KQ+L+  K+Q+   +  H+
Sbjct: 62   IHLHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHL 121

Query: 121  YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPH 180
             AF  I+AD+  + V  D++D  +ILLCSLP SY +   T+ Y +D++TL  +       
Sbjct: 122  SAFKEIVADLESMEVKYDEDDLGLILLCSLPSSYANFRGTILYSRDTLTLKEVYDAFHAK 181

Query: 181  AQRRQSV-EEGGGSSGEGLFVKGGQDRGRGKGKAVD---SGKKKRSKSKDRKTTECYSCK 236
             + ++ V  EG  S  EGL V+G Q +   K ++ D   S  + R+KS+ R  + C  CK
Sbjct: 182  EKMKKMVTSEGSNSQAEGLVVRGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYKS-CKYCK 240

Query: 237  QIGH-----WK-RDCPNRSGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSG 290
            + GH     WK +D   R+GK              G   EE    V + + +DA +L + 
Sbjct: 241  RDGHDISECWKLQDKDKRTGK----------YIPKGKKEEEGKAAVVTDEKSDAELLVAY 290

Query: 291  CSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEV 350
                 T  ++WF ++++   G V +GDD PC + G+  V+I + DG +RTLS V+++  +
Sbjct: 291  AGCAQTSDQDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNL 350

Query: 351  MKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASV 410
             ++LISL                 IL+V+KG++ VM+    + N+Y L G TI+G+VA+V
Sbjct: 351  KRSLISL---------------YGILKVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAV 395

Query: 411  E---TDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKT 467
                ++ DAT LWHMRLGH+SE G+ EL KR +L G     +  C++C+ GK  RV+F T
Sbjct: 396  FDSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNT 455

Query: 468  GHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLW 527
              H T+GILDYVHSD+ GP  + S  G RY +T  DD+SRKVW YF+K+K + F  FK W
Sbjct: 456  STHTTEGILDYVHSDLWGPAHKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEW 515

Query: 528  KAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTL 587
            K  VE QT RK+K LR+DNG E+  K F  +C+  GI  H++   TPQQN VAERMNRT+
Sbjct: 516  KTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVCHYTAPHTPQQNDVAERMNRTI 575

Query: 588  TEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFW 647
              KARC+  NA L K  WA  ++ ACYL+NRSP  ++D K   EVW+G+P + S+LR+F 
Sbjct: 576  ISKARCMLSNAGLPKQFWAEAVSTACYLINRSPGYAIDKKTPIEVWSGSPTNYSDLRVFG 635

Query: 648  RPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQ 707
              AY H+   D  KL+P++  CI +GY  GVKGYKLW P  KKV++SR+VVF E  +L  
Sbjct: 636  CTAYAHV---DNGKLEPRAIKCIFLGYASGVKGYKLWCPETKKVVISRNVVFHESVILHD 692

Query: 708  SDVTVVPDTEVENSSQDKIQVDIEETPVS----PRQIVAQQQSEP---GSDSGEVQD--- 757
                  P T V   SQ+K  V +E    S     ++ VA  Q  P    SDS  V     
Sbjct: 693  K-----PSTNVPVESQEKASVQVEHLISSGHAPEKEDVAINQDAPVIEDSDSSIVHQSPK 747

Query: 758  YTLVRDREPSRITPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAM 811
             ++ +D+    I PP RY  E  + AYAL     +  + +PSTY EA+ S + ++W++AM
Sbjct: 748  RSIAKDKPKRNIKPPRRYIEEAKIVAYALSVAEKIEGNAEPSTYSEAIVSDDCNRWITAM 807

Query: 812  VEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGI 871
             +EMESL+KN T  LV+LP  K+ I CKW++K+K  ++  +  ++KA LV KGYSQ  GI
Sbjct: 808  HDEMESLEKNHTWELVKLPKEKKPIRCKWIFKRKEGMSPSDEARYKARLVAKGYSQIPGI 867

Query: 872  DYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGD 931
            D++++FSPVV+H+SIR +L +VA  D  LEQM+VKT FLHG LEE IY+EQPEGF   G 
Sbjct: 868  DFNDVFSPVVKHSSIRTLLGIVAMHDYELEQMNVKTAFLHGELEEDIYMEQPEGFVVPGK 927

Query: 932  GRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYV 991
              LVC+LK+SLYGLKQSPRQWYKRFDS+ML   +R  +YD CVY + + DGS I+LLLYV
Sbjct: 928  ENLVCRLKKSLYGLKQSPRQWYKRFDSFMLSQKFRISNYDSCVY-LKVVDGSVIYLLLYV 986

Query: 992  DDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGV 1051
            DDMLIAA    ++ +LK +L  EF+MKDLGAAKKILGMEI ++R + KL+LSQK Y+E V
Sbjct: 987  DDMLIAAKDKSEIEKLKAQLSSEFEMKDLGAAKKILGMEITRERHSGKLYLSQKGYIEKV 1046

Query: 1052 LSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
            L RF+M  A  VSTPL  HF+LS +  P  D +IE MS++PY+
Sbjct: 1047 LRRFNMHDAKPVSTPLAAHFRLSSDLCPLSDYDIEYMSRVPYS 1089



 Score = 79.3 bits (194), Expect = 8e-13
 Identities = 39/70 (55%), Positives = 52/70 (73%), Gaps = 1/70 (1%)

Query: 1180 LYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCD 1238
            +++ GG  + WK+S+Q+TVA+STTEAEYMA+ EA KEA+WL GL  EL      + + CD
Sbjct: 1174 VFTIGGCAVSWKASLQATVALSTTEAEYMAIFEACKEAIWLRGLYTELCGVTSCINIFCD 1233

Query: 1239 SQSAIYLTNN 1248
            SQSAIYLT +
Sbjct: 1234 SQSAIYLTKD 1243


>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301702|pir||E84601 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1333

 Score =  857 bits (2213), Expect = 0.0
 Identities = 526/1347 (39%), Positives = 760/1347 (56%), Gaps = 81/1347 (6%)

Query: 4    AKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDE-----KPADIATVDWNEMKE--- 55
            A+ EV +FDG G++ +W+      L    L  AL++E     K A++   +  E +E   
Sbjct: 4    ARIEVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLR 63

Query: 56   ---------KAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLY 106
                     KA   I L V+D V+  I    +   +   L+ LYMSK   N+++ KQ+LY
Sbjct: 64   RELLEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLY 123

Query: 107  SLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKD 166
            S KM E   ++ ++  F  I+AD+    V V DED+AI+LL SLP  +D L  TL YG  
Sbjct: 124  SFKMSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLG 183

Query: 167  SITL--DSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQD-RGRGKGKAVDSGKKKRSK 223
             +TL  D + + +        S ++      EGLFVK   + RGR + +  ++  KK S+
Sbjct: 184  RVTLSLDEVVAAIYSKELELGSNKKSIKGQAEGLFVKEKTETRGRTEQRGNNNNNKK-SR 242

Query: 224  SKDRKTTECYSCKQIGHWKRDCPNRSGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTD 283
            SK R    C+ C              G+S N SS  N  +++G    E  L  + +   D
Sbjct: 243  SKSRSKKGCWIC--------------GESSNGSS--NYSEANGLYVSE-ALSSTDIHLED 285

Query: 284  AWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQ 343
             WV+D+GCSYHMT  REWF        G V +G+     ++G+  +++  + G V  L+ 
Sbjct: 286  EWVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTVSKVRGIGTIRVKNEAGMVVRLTN 345

Query: 344  VRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTI 403
            VRY+ E+ +NL+SLGT  ++GYSFK E     L +  G   ++  +R    +Y L    +
Sbjct: 346  VRYIPEMDRNLLSLGTFEKSGYSFKLENGT--LSIIAGDSVLLTVRRCY-TLYLLQWRPV 402

Query: 404  MGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRV 463
              +  SV    D T LWH RLGH+S++ M  L K+ +L   +   +  C+ C+ GK  R+
Sbjct: 403  TEESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKKKVSKLETCEDCIYGKAKRI 462

Query: 464  RFKTGHHKTKGILDYVHSDVRGPTKEP-SV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFA 522
             F    H T+  L+YVHSD+ G    P S+   +YF++F DD++RKV +YF+K K E F 
Sbjct: 463  GFNLAQHDTREKLEYVHSDLWGAPSVPFSLGKCQYFISFIDDYTRKVRIYFLKTKDEAFD 522

Query: 523  KFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAER 582
            KF  W   VENQT ++IK LR+DNG E+ +++F  FC + GI  H +   TPQQNGVAER
Sbjct: 523  KFVEWANLVENQTDKRIKTLRTDNGLEFCNRSFDEFCSQKGILWHRTCAYTPQQNGVAER 582

Query: 583  MNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSN 642
            MNRTL EK R +  ++ L K  WA   +    L+N++P ++L+ +V ++ W+G     S 
Sbjct: 583  MNRTLMEKVRSMLSDSGLPKKFWAEATHTTAILINKTPSSALNYEVPDKRWSGKSPIYSY 642

Query: 643  LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE* 702
            LR F   A+VH    D  KL+P++K  I++GY  GVKGYK+W   +KK +VSR+V+F E 
Sbjct: 643  LRRFGCIAFVHT---DDGKLNPRAKKGILVGYPIGVKGYKIWLLEEKKCVVSRNVIFQEN 699

Query: 703  SMLK---QSDVTVVPDTEVENSSQDKIQVDIEE--------------TPVSPRQIVAQQQ 745
            +  K   QS      + E   SS   + +D EE              +P +P     Q  
Sbjct: 700  ASYKDMMQSKDAEKDENEAPPSSYLDLDLDHEEVITSGGDDPIVEAQSPFNPSPATTQTY 759

Query: 746  SEP-GSDSGEVQD---YTLVRDREPSRITPPVRYGFEDLAAYALLTSSG----DPSTYHE 797
            SE   S++  +Q    Y LVRDR+   I  PVR+  ED  A AL T+      +P+ Y E
Sbjct: 760  SEGVNSETDIIQSPLSYQLVRDRDRRTIRAPVRFDDEDYLAEALYTTEDSGEIEPADYSE 819

Query: 798  AMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFK 857
            A  S   +KW  AM EEMES  KN T  +V+ P  ++VIG +W+YK KL +   E  +FK
Sbjct: 820  AKRSMNWNKWKLAMNEEMESQIKNHTWTVVKRPQHQKVIGSRWIYKFKLGIPGVEEGRFK 879

Query: 858  AHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQ 917
            A LV KGY+Q KGIDY EIF+PVV+H SIR+++++VA  D+ LEQ+DVKT FLHG L+E+
Sbjct: 880  ARLVAKGYAQRKGIDYHEIFAPVVKHVSIRILMSIVAQEDLELEQLDVKTAFLHGELKEK 939

Query: 918  IYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVM 977
            IY+  PEG+ E      VC L +SLYGLKQ+P+QW ++F++YM  IG+ R  YD C Y+ 
Sbjct: 940  IYMVPPEGYEEMFKEDEVCLLNKSLYGLKQAPKQWNEKFNAYMSEIGFIRSLYDSCAYIK 999

Query: 978  SLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGA 1037
             L DGS ++LLLYVDDML+AA +  D+++LK +L + FDMKDLGAAK+ILGMEI ++R  
Sbjct: 1000 ELSDGSRVYLLLYVDDMLVAAKNKEDISQLKEELSQRFDMKDLGAAKRILGMEIIRNREE 1059

Query: 1038 KKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPY--AV 1095
              LWLSQ  Y+  +L  ++M+++ HV TPL  H K+      K + + + M  IPY  AV
Sbjct: 1060 NTLWLSQNGYLNKILETYNMAESKHVVTPLGAHLKMRAATVEKQEQDEDYMKSIPYSSAV 1119

Query: 1096 QLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARC 1155
              ++  M+    D  +   ++   +   ARE  LG   V  + ++G  G++    + +  
Sbjct: 1120 GSIMYAMIGTRPDLAYPVGIISRYMSQPAREHWLGVKWV-LRYIKGSLGTKLQYKRSSDF 1178

Query: 1156 CSISCGIC----GLCR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAE 1211
              +  G C      C+  R  +    +     G  I WKS  Q  VA+STTEAEYM++ E
Sbjct: 1179 KVV--GYCDADHAACKDRR--RSITGLVFTLGGSTISWKSGQQRVVALSTTEAEYMSLTE 1234

Query: 1212 AAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLAS 1271
            A KEA+W+ GL+KE G EQ  V++ CDSQSAI L+ N V+H RTKHIDVR+  IR+++A+
Sbjct: 1235 AVKEAVWMKGLLKEFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVRYQYIRDIIAN 1294

Query: 1272 RQILLQKIHTSENTTDKLTKPVTSDKF 1298
                + KI T +N  D  TK V  +KF
Sbjct: 1295 GDGDVVKIDTEKNPADIFTKIVPVNKF 1321


>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301697|pir||B84512 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1335

 Score =  855 bits (2208), Expect = 0.0
 Identities = 522/1323 (39%), Positives = 736/1323 (55%), Gaps = 97/1323 (7%)

Query: 41   KPADIATVDWNEMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLF 100
            K  D   V   E  +KA  +I L V+D V+  I    T  + W+ L+ L+M ++  ++++
Sbjct: 33   KKRDADEVARLERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETLDRLFMIRSLPHRVY 92

Query: 101  AKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTT 160
             +   Y+ KMQE   +  ++  F  I+AD+  L + V DE +AI+LL SLP  YD LV T
Sbjct: 93   TQLSFYTFKMQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILLLSSLPARYDGLVET 152

Query: 161  LTYG--KDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGK 218
            + Y   ++ + LD +   ++    + + + +      EG F +G  D G+   +      
Sbjct: 153  MKYSNSREKLRLDDV---MVAARDKERELSQNNRPVVEGHFARGRPD-GKNNNQGNKGKN 208

Query: 219  KKRSKSKDRKTTECYSCKQIGHWKRDC-----PNRSGKSGNNSSSANVVQSDGS------ 267
            + RSKS D K   C+ C + GH+K+ C      N+S + G+++  +++ +S  +      
Sbjct: 209  RSRSKSADGKRV-CWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAKSTEAFNPAMV 267

Query: 268  --CSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKG 325
               ++E L+   S+   + WVLD+GCS+HMT  ++WF  FK    GYV +G+D    +KG
Sbjct: 268  LLATDETLVVTDSI--ANEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYSPVKG 325

Query: 326  MRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTV 385
            +  +KI   DG    L+ VRY+  + +NLISLGTL + G  FKS++   IL++ KG  T+
Sbjct: 326  IGSIKIRNSDGSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDG--ILKIVKGCSTI 383

Query: 386  MRAKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVR 445
            ++ ++    +Y L G T  G+  S     D T LWH RLGH+S++GM  L K+  L+   
Sbjct: 384  LKGQKR-DTLYILDGVTEEGESHSSAEVKDETALWHSRLGHMSQKGMEILVKKGCLRREV 442

Query: 446  SCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV*G-FRYFVTFTDD 504
               +  C+ CV GKQ RV F    H TK  L YVHSD+ G    P+  G  +YF++F DD
Sbjct: 443  IKELEFCEDCVYGKQHRVSFAPAQHVTKEKLAYVHSDLWGSPHNPASLGNSQYFISFVDD 502

Query: 505  FSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGI 564
            +SRKVW+YF++ K E F KF  WK  VENQ+ RK+K LR+DNG EY +  F  FC+E GI
Sbjct: 503  YSRKVWIYFLRKKDEAFEKFVEWKKMVENQSDRKVKKLRTDNGLEYCNHYFEKFCKEEGI 562

Query: 565  QRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASL 624
             RH +   TPQQNG+AER+NRT+ +K R +   + + K  WA   + A YL+NRSP  ++
Sbjct: 563  VRHKTCAYTPQQNGIAERLNRTIMDKVRSMLSRSGMEKKFWAEAASTAVYLINRSPSTAI 622

Query: 625  DGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLW 684
            +  + EE WTG   DLS+LR F   AY+H    D+ KL+P+SK  I   Y +GVKGYK+W
Sbjct: 623  NFDLPEEKWTGALPDLSSLRKFGCLAYIHA---DQGKLNPRSKKGIFTSYPEGVKGYKVW 679

Query: 685  DPVKKKVIVSRDVVFDE*SMLKQ--------------SDVTVVPD------TEVENSSQD 724
                KK ++SR+V+F E  M K                D+ V PD      T+   ++QD
Sbjct: 680  VLEDKKCVISRNVIFREQVMFKDLKGDSQNTISESDLEDLRVNPDMNDQEFTDQGGATQD 739

Query: 725  KI---QVDIEETPVSPRQIVAQQQSEPGSDSGEVQD---YTLVRDREPSRITPPVRYGFE 778
                 +      PV      +Q +     DS  V+D   Y LVRDR    I    +Y   
Sbjct: 740  NSNPSEATTSHNPVLNSPTHSQDEESEEEDSDAVEDLSTYQLVRDRVRRTIKANPKYNES 799

Query: 779  DLAAYALLTSSG---DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRV 835
            ++  +A  +      +P +Y EA+   + +KW +AM EEM S+ KN T +LV  P   ++
Sbjct: 800  NMVGFAYYSEDDGKPEPKSYQEALLDPDWEKWNAAMKEEMVSMSKNHTWDLVTKPEKVKL 859

Query: 836  IGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVAS 895
            IGC+WV+ +K  +   E  +F A LV KG++Q +G+DY+EIFSPVV+H SIR +L++V  
Sbjct: 860  IGCRWVFTRKAGIPGVEAPRFIARLVAKGFTQKEGVDYNEIFSPVVKHVSIRYLLSMVVH 919

Query: 896  MDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKR 955
             +M L+QMDVKT FLHG LEE+IY+ QPEGF        VC LKRSLYGLKQSPRQW  R
Sbjct: 920  YNMELQQMDVKTAFLHGFLEEEIYMAQPEGFEIKRGSNKVCLLKRSLYGLKQSPRQWNLR 979

Query: 956  FDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEF 1015
            FD +M  I Y R  YD CVY    +  ++I+LLLYVDDMLIA+ +  +VNELK  L +EF
Sbjct: 980  FDEFMRGIKYTRSAYDSCVYFKKCNGDTYIYLLLYVDDMLIASANKSEVNELKQLLSREF 1039

Query: 1016 DMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSL 1075
            +MKDLG AKKILGMEI +DR A  L LSQ+ YV+ VL  F M  A  VSTPL  HFKL  
Sbjct: 1040 EMKDLGDAKKILGMEISRDRDAGLLTLSQEGYVKKVLRSFQMDNAKPVSTPLGIHFKLKA 1099

Query: 1076 EQSPKIDSEIEGMSKIPYA--VQLVV*CMLWFALDQIW---------------HKQLVKC 1118
                + + + E M  +PYA  +  ++  M+    D  +               H Q VK 
Sbjct: 1100 ATDKEYEEQFERMKIVPYANTIGSIMYSMIGTRPDLAYSLGVISRFMSKPLKDHWQAVKW 1159

Query: 1119 QVHVQ--AREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYN 1176
             +       +  L   + +  +L GY  S Y      R       I G            
Sbjct: 1160 VLRYMRGTEKKKLCFRKQEDFLLRGYCDSDYGSNFDTR-----RSITGY----------- 1203

Query: 1177 RICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQL 1235
               +++ GG  I WKS +Q  VA+S+TEAEYMA+ EA KEALWL G   ELG  Q  V++
Sbjct: 1204 ---VFTVGGNTISWKSKLQKVVAISSTEAEYMALTEAVKEALWLKGFAAELGHSQDYVEV 1260

Query: 1236 HCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTS 1295
            H DSQSAI L  N V+H RTKHID+R H IR+++ +  I + KI T  N  +  TK V  
Sbjct: 1261 HSDSQSAITLAKNSVHHERTKHIDIRLHFIRDIICAGLIKVVKIATECNPANIFTKTVPL 1320

Query: 1296 DKF 1298
             KF
Sbjct: 1321 AKF 1323


>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
            gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
            polyprotein from transposon TNT 1-94 [Contains: Protease
            ; Reverse transcriptase ; Endonuclease]
          Length = 1328

 Score =  853 bits (2204), Expect = 0.0
 Identities = 478/1129 (42%), Positives = 679/1129 (59%), Gaps = 56/1129 (4%)

Query: 3    GAKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALR--DEKPADIATVDWNEMKEKAAGL 60
            G K+EV +F+G   F  WQR  +DLL Q+ L K L    +KP  +   DW ++ E+AA  
Sbjct: 3    GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASA 62

Query: 61   ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
            I L +SDDV+N+I+D  T + +W +LESLYMSKT  NKL+ K++LY+L M EG +  +H+
Sbjct: 63   IRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHL 122

Query: 121  YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPH 180
              FN ++  +  LGV +++EDKAI+LL SLP SYD+L TT+ +GK +I L  ++S LL +
Sbjct: 123  NVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLLN 182

Query: 181  AQRRQSVEEGGGSSGEGLFVKG-GQDRGRGKGKAVDSGKKKRSKSKDR-KTTECYSCKQI 238
             + R+  E    + G+ L  +G G+   R       SG + +SK++ + +   CY+C Q 
Sbjct: 183  EKMRKKPE----NQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQP 238

Query: 239  GHWKRDCPN-RSGK---SG--NNSSSANVVQSDGSC-----SEEDLLCVSSVKCTDAWVL 287
            GH+KRDCPN R GK   SG  N+ ++A +VQ++ +       EE+ + +S  +    WV+
Sbjct: 239  GHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPE--SEWVV 296

Query: 288  DSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYV 347
            D+  S+H T  R+ F  + +GD G V +G+     I G+  + I  + G    L  VR+V
Sbjct: 297  DTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHV 356

Query: 348  LEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDV 407
             ++  NLIS   L  +GY       +   R++KG++ + +     G +Y+       G++
Sbjct: 357  PDLRMNLISGIALDRDGYESYFANQK--WRLTKGSLVIAKGV-ARGTLYRTNAEICQGEL 413

Query: 408  ASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKT 467
             + + D+ +  LWH R+GH+SE+G+  L K++++   +   +  C YC+ GKQ RV F+T
Sbjct: 414  NAAQ-DEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQT 472

Query: 468  GHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLW 527
               +   ILD V+SDV GP +  S+ G +YFVTF DD SRK+WVY +K K +VF  F+ +
Sbjct: 473  SSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKF 532

Query: 528  KAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTL 587
             A VE +TGRK+K LRSDNG EYT + F  +C  +GI+   +V  TPQ NGVAERMNRT+
Sbjct: 533  HALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTI 592

Query: 588  TEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFW 647
             EK R +   A L K  W   +  ACYL+NRSP   L  ++ E VWT   +  S+L++F 
Sbjct: 593  VEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFG 652

Query: 648  RPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQ 707
              A+ H+  E R+KLD KS  CI IGY     GY+LWDPVKKKVI SRDVVF E  +   
Sbjct: 653  CRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTA 712

Query: 708  SDVT------------VVPDTEVENSSQDKIQVDIEETPVSPRQIVAQ-QQSEPGSD--- 751
            +D++             +P T    +S +    ++ E    P +++ Q +Q + G +   
Sbjct: 713  ADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVE 772

Query: 752  ---SGEVQDYTLVRDREP---SRITPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKD 805
                GE Q   L R   P   SR  P           Y L++   +P +  E ++  EK+
Sbjct: 773  HPTQGEEQHQPLRRSERPRVESRRYPSTE--------YVLISDDREPESLKEVLSHPEKN 824

Query: 806  KWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGY 865
            + M AM EEMESL+KN T  LV+LP GKR + CKWV+K K     K   ++KA LV KG+
Sbjct: 825  QLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCK-LVRYKARLVVKGF 883

Query: 866  SQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEG 925
             Q KGID+DEIFSPVV+ TSIR +L+L AS+D+ +EQ+DVKT FLHG+LEE+IY+EQPEG
Sbjct: 884  EQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEG 943

Query: 926  FSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFI 985
            F   G   +VCKL +SLYGLKQ+PRQWY +FDS+M    Y +   D CVY     + +FI
Sbjct: 944  FEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFI 1003

Query: 986  FLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQK 1045
             LLLYVDDMLI       + +LK  L K FDMKDLG A++ILGM+I ++R ++KLWLSQ+
Sbjct: 1004 ILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQE 1063

Query: 1046 SYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
             Y+E VL RF+M  A  VSTPL  H KLS +  P    E   M+K+PY+
Sbjct: 1064 KYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYS 1112



 Score =  125 bits (315), Expect = 7e-27
 Identities = 61/115 (53%), Positives = 80/115 (69%)

Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
            GG I W+S +Q  VA+STTEAEY+A  E  KE +WL   ++ELG+ Q    ++CDSQSAI
Sbjct: 1202 GGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAI 1261

Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
             L+ N +YHARTKHIDVR+H IRE++    + + KI T+EN  D LTK V  +KF
Sbjct: 1262 DLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKF 1316


>gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
            gi|25301707|pir||E86490 hypothetical protein F28L22.3 -
            Arabidopsis thaliana
          Length = 1356

 Score =  845 bits (2182), Expect = 0.0
 Identities = 516/1382 (37%), Positives = 754/1382 (54%), Gaps = 133/1382 (9%)

Query: 5    KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDWNEMK---------- 54
            + E+  F+G  +F LW+   +  L    L+  L D        +  +E K          
Sbjct: 7    RVEIKVFNGDRDFSLWKIRIQAQLGVLGLKDTLTDFSLTKTVPLTKSEAKQESGDGESSG 66

Query: 55   -------------EKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFA 101
                         E+A  +I   +SD V+  +    T  D+W  L   YM  +  N+++ 
Sbjct: 67   TKEVPDPVKIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIYT 126

Query: 102  KQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTL 161
            + +LYS KM     +  +V  F  I+A++  L + VD+E +AI++L SLP S+  L  TL
Sbjct: 127  QLKLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTL 186

Query: 162  TYGKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGL----------FVKGGQDRGRGKG 211
             YG  ++T+  ++S+    +  R+  E      G+             V+  Q  G+GKG
Sbjct: 187  KYGNKTLTVQDVTSSA--KSLERELAEAVDLDKGQAAVLYTTERGRPLVRNNQKGGQGKG 244

Query: 212  KAVDSGKKKRSKSKDRKTTECYSCKQIGHWKRDCPNRSGK-SGNNSSSANVVQSDGSCSE 270
                     RS+S  +    C+ CK+ GH K+DC +R  K        A V+      SE
Sbjct: 245  ---------RSRSNSKTKVPCWYCKKEGHVKKDCYSRKKKMESEGQGEAGVITEKLVFSE 295

Query: 271  EDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVK 330
               L V+     D W+LDSGC+ HMT  R+WF SF+      + LGDD     +G   ++
Sbjct: 296  A--LSVNEQMVKDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHSVESQGQGTIR 353

Query: 331  IALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKR 390
            I    G ++ L  V+YV  + +NLIS GTL + GY  +  E +  +R  K   T +R   
Sbjct: 354  IDTHGGTIKILENVKYVPHLRRNLISTGTLDKLGYRHEGGEGK--VRYFKNNKTALRGSL 411

Query: 391  TAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIG 450
            + G +Y L G T+M ++ + ETD   T LWH RLGH+S   +  L  + ++       + 
Sbjct: 412  SNG-LYVLDGSTVMSELCNAETDKVKTALWHSRLGHMSMNNLKVLAGKGLIDRKEINELE 470

Query: 451  LCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRG-PTKEPSV*GFRYFVTFTDDFSRKV 509
             C++CV+GK  +V F  G H ++  L YVH+D+ G P   PS+ G +YF++  DD +RKV
Sbjct: 471  FCEHCVMGKSKKVSFNVGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLSIIDDKTRKV 530

Query: 510  WVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFS 569
            W+YF+K K E F KF  WK+ VENQ  +K+K LR+DNG E+ +  F  +C+E+GI+RH +
Sbjct: 531  WLYFLKSKDETFDKFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERHRT 590

Query: 570  VRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVA 629
               TPQQNGVAERMNRT+ EK RCL   + + +  WA     A YL+NRSP ++++  V 
Sbjct: 591  CTYTPQQNGVAERMNRTIMEKVRCLLNKSGVEEVFWAEAAATAAYLINRSPASAINHNVP 650

Query: 630  EEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKK 689
            EE+W        +LR F   AYVH   +D+ KL P++     +GY  G KGYK+W   ++
Sbjct: 651  EEMWLNRKPGYKHLRKFGSIAYVH---QDQGKLKPRALKGFFLGYPAGTKGYKVWLLEEE 707

Query: 690  KVIVSRDVVFDE*SMLKQSDVTVVPD-----TEVENSSQDKIQVDIEETPVSPRQIVAQQ 744
            K ++SR+VVF E  + +  D+ V  D      + E +S +  Q    E   S   I  Q 
Sbjct: 708  KCVISRNVVFQESVVYR--DLKVKEDDTDNLNQKETTSSEVEQNKFAEASGSGGVIQLQS 765

Query: 745  QSEP------GSDSGEVQDYT----------------LVRDREPSRITPPVRYGFEDLAA 782
             SEP       SDS E  +Y+                L RDR    I PP R+  E    
Sbjct: 766  DSEPITEGEQSSDSEEEVEYSEKTQETPKRTGLTTYKLARDRVRRNINPPTRFTEESSVT 825

Query: 783  YALLTSSG----DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGC 838
            +AL+        +P +Y EAM S + +KW  A  +EM+SL KN T +LV  P  +++IGC
Sbjct: 826  FALVVVENCIVQEPQSYQEAMESQDCEKWDMATHDEMDSLMKNGTWDLVDKPKDRKIIGC 885

Query: 839  KWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDM 898
            +W++K K  +   E  +FKA LV KGY+Q +G+DY EIF+PVV+H SIR++++LV   D+
Sbjct: 886  RWLFKLKSGIPGVEPTRFKARLVAKGYTQREGVDYQEIFAPVVKHVSIRILMSLVVDKDL 945

Query: 899  HLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDS 958
             LEQMDVKTTFLHG+LEE++Y+EQPEGF        VC+LK+SLYGLKQSPRQW KRFD 
Sbjct: 946  ELEQMDVKTTFLHGDLEEELYMEQPEGFVSDSSENKVCRLKKSLYGLKQSPRQWNKRFDR 1005

Query: 959  YMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMK 1018
            +M    + R ++D CVYV  + +  FI+LLLYVDDMLIA     ++N +K +L  EF+MK
Sbjct: 1006 FMSSQQFIRSEHDACVYVKHVSEHDFIYLLLYVDDMLIAGASKAEINRVKEQLSTEFEMK 1065

Query: 1019 DLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSL--E 1076
            D+G A +ILG++I++DR    L LSQ+ Y+  VL RF+MS A   + P+  HFKL+   E
Sbjct: 1066 DMGGASRILGIDIYRDRKGGVLKLSQEIYIRKVLDRFNMSGAKMTNAPVGAHFKLAAVRE 1125

Query: 1077 QSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIW---------------HKQLVKCQ 1119
            +   +D+++     +PY  AV  ++  ML    D  +               H + VK  
Sbjct: 1126 EDECVDTDV-----VPYSSAVGSIMYAMLGTRPDLAYAICLISRYMSKPGSMHWEAVKWV 1180

Query: 1120 VHV--QAREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNR 1177
            +     A++  L  ++     + GY  S Y     A        I G             
Sbjct: 1181 MRYLKGAQDLNLVFTKEKDFTVTGYCDSNY-----AADLDRRRSISGY------------ 1223

Query: 1178 ICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLH 1236
              +++ GG  + WK+S+Q  VAMSTTEAEY+A+AEAAKEA+W+ GL++++G++Q  V++ 
Sbjct: 1224 --VFTIGGNTVSWKASLQPVVAMSTTEAEYIALAEAAKEAMWIKGLLQDMGMQQDKVKIW 1281

Query: 1237 CDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSD 1296
            CDSQSAI L+ N VYH RTKHIDVRF+ IR+++ S  + + KIHTS N  D LTK +  +
Sbjct: 1282 CDSQSAICLSKNSVYHERTKHIDVRFNYIRDVVESGDVDVLKIHTSRNPVDALTKCIPVN 1341

Query: 1297 KF 1298
            KF
Sbjct: 1342 KF 1343


>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
          Length = 1328

 Score =  811 bits (2096), Expect = 0.0
 Identities = 468/1129 (41%), Positives = 655/1129 (57%), Gaps = 55/1129 (4%)

Query: 3    GAKFEVTRFDGTGN-FGLWQRMAKDLLAQKSLQKAL--RDEKPADIATVDWNEMKEKAAG 59
            G K+EV +F+G    F +WQR  KDLL Q+ L KAL  + +KP  +   DW E+ EKAA 
Sbjct: 3    GVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAAS 62

Query: 60   LITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAH 119
             I L ++DDV+N+I+D  +   +W KLE+LYMSKT  NKL+ K++LY+L M EG +  +H
Sbjct: 63   AIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSH 122

Query: 120  VYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLP 179
            +   N ++  +  LGV +++EDK I+LL SLP SYD L TT+ +GKDSI L  ++S LL 
Sbjct: 123  LNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLKDVTSALLL 182

Query: 180  HAQRRQSVEEGG--------GSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTE 231
            + + R+  E  G        G S +      G+   RGK K        RSKSK R    
Sbjct: 183  NEKMRKKPENHGQVFITESRGRSYQRSSSNYGRSGARGKSKV-------RSKSKARN--- 232

Query: 232  CYSCKQIGHWKRDCPN-------RSGKSGNNSSSANVVQSDGSCS--EEDLLCVSSVKCT 282
            CY+C Q GH+KRDCPN        SG+  +++++A V  +D       E+  C+      
Sbjct: 233  CYNCDQPGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTE 292

Query: 283  DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLS 342
              WV+D+  SYH T  R+ F  + +GD G V +G+     I G+  +    + G    L 
Sbjct: 293  SEWVVDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLK 352

Query: 343  QVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGT 402
             VR+V ++  NLIS   L ++GY       +   R++KGA+ + +     G +Y+     
Sbjct: 353  DVRHVPDLRMNLISGIALDQDGYENYFANQK--WRLTKGALVIAKGV-ARGTLYRTNAEI 409

Query: 403  IMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCR 462
              G++ +   ++ A  LWH R+GH SE+G+  L K++++   +   I  C Y + GKQ R
Sbjct: 410  CQGELNAAHEENSAD-LWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHR 468

Query: 463  VRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFA 522
            V F+T   +   ILD V+SDV GP +  S+ G +YFVTF DD SRK+WVY  + K +VF 
Sbjct: 469  VSFQTSSERKSNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQ 528

Query: 523  KFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAER 582
             F+ + A VE +TGRK K LR+DNG EYT + F  +C  +GI+   +V  TPQ NGVAER
Sbjct: 529  VFQKFHALVERETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGTPQHNGVAER 588

Query: 583  MNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSN 642
            MNRT+ EK R +   A L K  W   +  ACYL+NRSP   L+  + E VWT   +  S+
Sbjct: 589  MNRTIVEKVRSMLRMAKLPKTFWGEAVRTACYLINRSPSVPLEFDIPERVWTNKEMSYSH 648

Query: 643  LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE* 702
            L++F   A+ H+  E R+KLD KS  CI IGY     GY+LWD VKKKVI SRDV+F E 
Sbjct: 649  LKVFGCKAFAHVPKEQRTKLDDKSVPCIFIGYGDEEFGYRLWDLVKKKVIRSRDVIFRES 708

Query: 703  SMLKQSDVT--------------VVPDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEP 748
             +   +D++               +P +    +S +    ++ E    P +IV  +Q E 
Sbjct: 709  EVGTAADLSEKAKKKNGIIPNLVTIPSSSNHPTSAESTIDEVVEQEEQPDEIV--EQGEQ 766

Query: 749  GSDSGEVQDYTLVRDREPSRITPPVRYGFEDL--AAYALLTSSGDPSTYHEAMAS*EKDK 806
              D+ E  +Y      +P R +   R        + Y L+   G+P    E ++  EK +
Sbjct: 767  LGDNTEQMEYPEEEQSQPLRRSERQRVESTKYPSSEYVLIKYEGEPENLKEVLSHPEKSQ 826

Query: 807  WMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYS 866
            WM AM EEM SL+KN T  LV+LP GKR + CKWV+K K     K   ++KA LV KG+ 
Sbjct: 827  WMKAMHEEMGSLQKNGTYQLVELPKGKRPLKCKWVFKLKKDGNGK-LVRYKARLVVKGFE 885

Query: 867  QHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF 926
            Q KGID+DEIFSPVV+ TSIR +L++ AS+D+ +EQ+DVKT FLHG+LEE+IY+EQ EGF
Sbjct: 886  QKKGIDFDEIFSPVVKMTSIRTILSIAASLDLEVEQLDVKTAFLHGDLEEEIYMEQGEGF 945

Query: 927  SETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIF 986
              +G   +VCKL +SLYGLKQ+PRQWYK+FDS+M    YR      CVY     D +FI 
Sbjct: 946  EVSGKKHMVCKLNKSLYGLKQAPRQWYKKFDSFMKSQTYRNTYSHPCVYFKRFSDKNFII 1005

Query: 987  LLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKS 1046
            LLLY D MLI       + +L+    K FDMKDLG AK+ILGM+I ++   KKL LS + 
Sbjct: 1006 LLLYTDYMLIVGKDKELIAKLRKDFSKSFDMKDLGPAKQILGMKIAREE-QKKLGLSHEK 1064

Query: 1047 YVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSP-KIDSEIEGMSKIPYA 1094
            Y+E VL RF+M  A  +STPL ++ KL+ +  P K   E   M+K+PY+
Sbjct: 1065 YIERVLERFNMKSAKPISTPLVSYLKLTKQMFPTKKKGEKGDMAKVPYS 1113



 Score =  112 bits (279), Expect = 1e-22
 Identities = 55/115 (47%), Positives = 78/115 (67%)

Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
            GG I W+S +Q  VA+STTEA+Y+A  E  KE LWL   ++E G+ Q    ++C+SQSA+
Sbjct: 1202 GGDISWQSKLQKYVALSTTEAKYIAGTEVCKEMLWLKRFLQEHGLHQKEYVVYCESQSAM 1261

Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
             L+   +YHA TKHID+R+H IRE++    + + KI TSEN  D +TK V ++KF
Sbjct: 1262 DLSKKAMYHATTKHIDMRYHWIREMVDDGSLQVVKIPTSENPADMVTKVVQNEKF 1316


>emb|CAB79135.1| putative transposable element [Arabidopsis thaliana]
            gi|3402755|emb|CAA20201.1| putative transposable element
            [Arabidopsis thaliana] gi|7444415|pir||T05178
            hypothetical protein T6K22.90 - Arabidopsis thaliana
          Length = 1308

 Score =  795 bits (2054), Expect = 0.0
 Identities = 488/1347 (36%), Positives = 741/1347 (54%), Gaps = 108/1347 (8%)

Query: 5    KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRD------------EKPADIATVDWNE 52
            K E+  F+G  +F LW+   +  L    L+ AL D            EK    +  D  +
Sbjct: 6    KVEIKTFNGDRDFSLWKIRIEAQLGVLGLKPALSDFTLTKTILVVKSEKKESESEDDETD 65

Query: 53   MK--EKAAGLITLCVSDD----VMNHILDLTTLK--------DVWDKLESLYMSKTPMNK 98
             K  E+    I    SD     ++NHI D   LK        ++W  L  L+M  +  N+
Sbjct: 66   SKKTEEVPDPIKFEQSDQAKNFIINHITDTVLLKVQHCVTAAELWATLNKLFMETSLPNR 125

Query: 99   LFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLV 158
            ++ + RLYS KM +   +  +   F  I+A++  L + V +E +AI++L SLP SY  L 
Sbjct: 126  IYTQLRLYSFKMVDNLSIDQNTDEFLRIVAELGSLQIQVGEEVQAILILNSLPPSYIQLK 185

Query: 159  TTLTYGKDSITL-DSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSG 217
             TL YG  ++++ D +SS      +  +  E     +   L+     +RGR + K     
Sbjct: 186  HTLKYGNKTLSVQDVVSSAKSLERELSEQKETIRAPASTALYTA---ERGRPQTKNTQGQ 242

Query: 218  KKKRSKSKDRKTTECYSCKQIGHWKRDCPNRSGKSGNNSSS-ANVVQSDGSCSEEDLLCV 276
             K R +S  +    C+ CK+ GH K+DC     K  N     A V+      SE   L +
Sbjct: 243  GKGRGRSNSKSRLTCWFCKKEGHVKKDCYAGKRKLENEGQGKAGVITEKLVYSEA--LSM 300

Query: 277  SSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDG 336
               +  D WV+DSGC+YHMT   +WF+ F   +   + LGDD     KG   VK+    G
Sbjct: 301  YDQEAKDKWVIDSGCTYHMTSRMDWFSEFNENETTMILLGDDHTVESKGSGTVKVNTHGG 360

Query: 337  GVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIY 396
             +R L  VR+V  + +NLIS GTL + GY  +  + +  +R  K   T +      G +Y
Sbjct: 361  SIRVLKNVRFVPNLRRNLISTGTLDKLGYKHEGGDGK--VRFYKENKTALCGNLVNG-LY 417

Query: 397  KLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCV 456
             L G T++ +  +VE  ++ T+LWH RLGH+S   M  L ++ +L+      +  C+ CV
Sbjct: 418  VLDGHTVVNENCNVEGSNEKTELWHCRLGHMSLNNMKILAEKGLLEKKDIKELSFCENCV 477

Query: 457  LGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKY 516
            +GK  ++ F  G H T  +L Y+H+D+ G          +YF++  DD SRKVW+ F+K 
Sbjct: 478  MGKSKKLSFNVGKHITDEVLGYIHADLWGK---------QYFLSIIDDKSRKVWLMFLKT 528

Query: 517  KSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQ 576
            K E F +F  WK  VENQ  +K+K LR+DNG E+ +  F  FC++NGI+RH +   TPQQ
Sbjct: 529  KDETFERFCEWKELVENQVNKKVKILRTDNGLEFCNLKFDEFCKQNGIERHRTCTYTPQQ 588

Query: 577  NGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGN 636
            NGVA+RMNRTL EK RCL   + L +  WA     A YLVNRSP +++D  V EE+W   
Sbjct: 589  NGVAKRMNRTLMEKVRCLLNESGLEEVFWAEAAATAAYLVNRSPASAVDHNVPEELWLDK 648

Query: 637  PIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRD 696
                 +LR F   AYVH+   D+ KL P++   + +GY +G KGYK+W   ++K ++SR+
Sbjct: 649  KPGYKHLRRFGCIAYVHL---DQGKLKPRALKGVFLGYPQGTKGYKVWLLDEEKCVISRN 705

Query: 697  VVFDE*SMLK------QSDVTVVPDTEVENSSQDKIQ----------VDIEETPVSPRQI 740
            +VF+E  + K      +  V  + D E  N  Q  ++          V IEE        
Sbjct: 706  IVFNENQVYKDIRESSEQSVKDISDLEGYNEFQVSVKEHGECSKTGGVTIEEIDQESDSE 765

Query: 741  VAQQQSEPGSDSGEVQDYTLVRDREPSRITPPVRYGFEDLAAYALLTS----SGDPSTYH 796
             +  Q EP   S ++ +Y   RDRE     PP +       A AL+ +    S +P  YH
Sbjct: 766  NSVTQ-EPLIASIDLSNYQSARDRERRAPNPPQKLADYTHFALALVMAEEIESEEPQCYH 824

Query: 797  EAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKF 856
            +A       KW   M EE++SL KN T ++V+ P  ++VI C+W++K K  +   E +++
Sbjct: 825  DAKKDKHWIKWNGGMKEEIDSLLKNGTWDIVEWPKEQKVISCRWLFKLKPGIPGVEAQRY 884

Query: 857  KAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEE 916
            KA LV +G++Q KGIDY+E+F+PVV+H SIR++++ V   DM LEQMDVKTTFLHG L++
Sbjct: 885  KARLVARGFTQQKGIDYEEVFAPVVKHISIRILMSAVVKDDMELEQMDVKTTFLHGELDQ 944

Query: 917  QIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYV 976
             +Y+EQPEGF    +   VC LK+SLYGLKQ+PRQW K+F ++ML + + R ++D CVYV
Sbjct: 945  VLYMEQPEGFEVNPEKDQVCLLKKSLYGLKQAPRQWNKKFHAFMLSLQFARSEHDSCVYV 1004

Query: 977  MSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRG 1036
              ++ G F++LLLYVDDML+AA    ++++LK  L  +F+MKD+GAA +ILG++I ++R 
Sbjct: 1005 KEVNPGEFVYLLLYVDDMLLAAKSKSEISKLKEALSLKFEMKDMGAASRILGIDIIRNRK 1064

Query: 1037 AKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLS--LEQSPKIDSEIEGMSKIPY- 1093
               L LSQ  YV+ V+ RF M+ A  VSTP+  HFKL+  +++   +D E+     +PY 
Sbjct: 1065 EGTLRLSQTRYVDKVIQRFRMADAKVVSTPMGAHFKLTSLIDEIGSVDPEV-----VPYS 1119

Query: 1094 -AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QG 1152
             AV  V+  M+    D  +   LV       +R  +   + ++   ++GY  S +     
Sbjct: 1120 SAVGSVMYAMIGTIPDVAYAMGLV-------SRFMSRPGANLE---VQGYCDSDH----- 1164

Query: 1153 ARCCSISCGICGLCR*SR**KVYNRICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAE 1211
            A        I G               +++ GG  + WKSS+Q  VA+S+T+AE++A+ E
Sbjct: 1165 AADLDKRRSISGY--------------VFTVGGNTVSWKSSLQHVVALSSTQAEFIALTE 1210

Query: 1212 AAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLAS 1271
            A KEA+W+ GL++++G++     + CDSQSAI L+ N  +H RTKH++V+F+ IR+++ +
Sbjct: 1211 AVKEAIWIRGLLEDMGLQPKPATVWCDSQSAICLSKNNAFHDRTKHVEVKFYFIRDIIEA 1270

Query: 1272 RQILLQKIHTSENTTDKLTKPVTSDKF 1298
             ++ ++KIHTS N  D LTK +   KF
Sbjct: 1271 GEVKVRKIHTSVNPADMLTKCIPVKKF 1297


>pir||T02206 hypothetical protein - common tobacco retrotransposon Tto1
            gi|1167523|dbj|BAA11674.1| ORF(AA 1-1338) [Nicotiana
            tabacum]
          Length = 1338

 Score =  782 bits (2020), Expect = 0.0
 Identities = 463/1132 (40%), Positives = 652/1132 (56%), Gaps = 69/1132 (6%)

Query: 7    EVTRFDGTGNFGLWQRMAKDLL-AQKSLQKALRDEKPADIATVDWNEMKEKAAGLITLCV 65
            ++   +GT N+ LW+   KDLL   K        +KP D +  DW     +  G I   V
Sbjct: 7    KMVNLNGT-NYHLWRNKMKDLLFVTKMHLPVFSSQKPEDKSDEDWEFEHNQVCGYIRQFV 65

Query: 66   SDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNN 125
             D+V NHI  +T  + +WDKLE LY SKT  NKLF   +L  +K  EG  +  H+     
Sbjct: 66   EDNVYNHISGVTHARSLWDKLEELYASKTGNNKLFYLTKLMQVKYVEGTTVADHLNEIQG 125

Query: 126  ILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDS--ITLDSISSTLLPHAQR 183
            I+  ++ +G+  DDE  A+++L +LP S++ L  ++T    +  + ++++ S +L    R
Sbjct: 126  IVDQLSGMGIKFDDEVLALMVLATLPESWETLKVSITNSAPNGVVNMETVKSGILNEEMR 185

Query: 184  RQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQIGHWKR 243
            R+S     G+S     V     RGR + K+  +  K R KS      EC+ CK+ GH KR
Sbjct: 186  RRSQ----GTSSSQSEVLAVTTRGRSQNKSQSNRDKSRGKSNKFANVECHYCKKKGHIKR 241

Query: 244  DC--------PNRSGKSGNNSSSANVVQSDGSCS---EEDLLCVSSVKCTDAWVLDSGCS 292
             C         N+  K     SS +   S G  +   ++D++ +++ + T  WV+DSG +
Sbjct: 242  FCRQFQNDQKKNKGKKVKPEESSDDETNSFGEFNVVYDDDIINLTTQEMT--WVIDSGAT 299

Query: 293  YHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMK 352
             H T  RE F+S+  GD G V +G+     + G   V +   +G    L  VR+V ++  
Sbjct: 300  IHATPRRELFSSYTLGDFGRVKMGNANFSTVVGKGDVCLETMNGMKLLLRDVRHVPDMRL 359

Query: 353  NLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVET 412
            NLIS+  L E GY       +   +++KG++ V R  + +  +Y +   +I   V +V  
Sbjct: 360  NLISVDKLDEEGYCNTFHNGQ--WKLTKGSLMVARGTKQS-KLY-VTQASISQQVINVAE 415

Query: 413  DDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKT-GHHK 471
            +D   KLWH RLGH+SE+ M  L K+N L G+    +  C  C+ GKQ RV FK     +
Sbjct: 416  NDSNIKLWHRRLGHMSEKSMARLVKKNALPGLNQIQLKKCADCLAGKQNRVSFKRFPPSR 475

Query: 472  TKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEV 531
             + +LD VHSDV GP K+ S+ G RYFVTF DD SRK WVY +K K +VF  FK +   V
Sbjct: 476  RQNVLDLVHSDVCGPFKK-SLGGARYFVTFIDDHSRKTWVYTLKTKDQVFQVFKQFLTLV 534

Query: 532  ENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKA 591
            E +TG+K+K +R+DNG EY  + F  +C+E+GI+  F+  KTPQ NG+AERMNRTL E+ 
Sbjct: 535  ERETGKKLKCIRTDNGGEYQGQ-FDAYCKEHGIRHQFTPPKTPQLNGLAERMNRTLIERT 593

Query: 592  RCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAY 651
            RCL  ++ L K  W   +  A Y++N SP   L  K  E++W G  I    LR+F   AY
Sbjct: 594  RCLLSHSKLPKAFWGEALVTAAYVLNHSPCVPLQYKAPEKIWLGRDISYDQLRVFGCKAY 653

Query: 652  VHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVT 711
            VH+  ++RSKLD K++ C+ IGY + + GYK +DPV+KK++ SRDVVF E       D T
Sbjct: 654  VHVPKDERSKLDVKTRECVFIGYGQDMLGYKFYDPVEKKLVRSRDVVFVE-------DQT 706

Query: 712  VVPDTEVENSSQDKIQVDIEETPVSPRQI---VAQQQSE-PG-------------SDSGE 754
            +    +VE S+ D  + ++  T V PRQ+   V   Q E PG              D+G+
Sbjct: 707  IEDIDKVEKSTDDSAEFELPPTVV-PRQVGDDVQDNQPEAPGLPNEDELADTEGNEDNGD 765

Query: 755  VQDYTLVRDREPSRITPP------------VRYGFEDLAAYALLTSSGDPSTYHEAMAS* 802
                   + + P    PP             RY   +   Y LLT  G+P ++ EA+   
Sbjct: 766  DDADEEDQPQPPILNNPPYHTRSGRVVQQSTRYSPHE---YVLLTDGGEPDSFEEAIDDE 822

Query: 803  EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVT 862
             K+KW+ AM +E++SL +N+T  LV+LP GKR +  KWV+K K         +FKA LV 
Sbjct: 823  HKEKWIEAMQDEIKSLHENKTFELVKLPKGKRALKNKWVFKMKHDEH-NSLPRFKARLVV 881

Query: 863  KGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQ 922
            KG++Q KGID+DEIFSPVV+ TSIR VL L AS+++ +EQMDVKT FLHG+LEE+IY+EQ
Sbjct: 882  KGFNQRKGIDFDEIFSPVVKMTSIRTVLGLAASLNLEVEQMDVKTAFLHGDLEEEIYMEQ 941

Query: 923  PEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDG 982
            P+GF + G    VC+L++SLYGLKQ+PRQWYK+F+S M + GY++   D CV+     D 
Sbjct: 942  PDGFQQKGKEDYVCRLRKSLYGLKQAPRQWYKKFESVMGQHGYKKTTSDHCVFAQKFSDD 1001

Query: 983  SFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWL 1042
             FI LLLYVDDMLI   ++  +N LK +L K F MKDLG AK+ILGM I +DR AKKLWL
Sbjct: 1002 DFIILLLYVDDMLIVGRNVSRINSLKEQLSKFFAMKDLGPAKQILGMRIMRDREAKKLWL 1061

Query: 1043 SQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
            SQ+ Y+E VL RF+M K   VS PL NHF+LS +QSP  D E   M +IPYA
Sbjct: 1062 SQEKYIEKVLQRFNMEKTKAVSCPLANHFRLSTKQSPSTDDERRKMERIPYA 1113



 Score =  129 bits (324), Expect = 7e-28
 Identities = 60/117 (51%), Positives = 82/117 (69%)

Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
            GG + W+S +Q  VA+STTEAE++A  EA KE +W+   + ELG  Q G QL CDSQSAI
Sbjct: 1203 GGAVSWQSKLQKCVALSTTEAEFIAATEACKELIWMKKFLTELGFSQDGYQLFCDSQSAI 1262

Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKFMF 1300
            +L  N  +H+R+KHIDVR++ IR++L  + + L+KIHT EN +D LTK +   KF F
Sbjct: 1263 HLAKNASFHSRSKHIDVRYNWIRDVLEKKMLRLEKIHTDENGSDMLTKTLPKGKFEF 1319


>gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]
          Length = 1415

 Score =  762 bits (1968), Expect = 0.0
 Identities = 439/1113 (39%), Positives = 646/1113 (57%), Gaps = 51/1113 (4%)

Query: 16   NFGLWQRMAKDLLAQKSLQ-KALRDEKPADIATVDWNEMKEKAAGLITLCVSDDVMNHIL 74
            N+ +W+   KDLL  K L        KP +++  +W+   ++  G I   V D+V+NHI+
Sbjct: 15   NYHIWKAKMKDLLFVKKLHLPVFASAKPENMSDEEWDFEHQQVCGYIRQWVEDNVLNHII 74

Query: 75   DLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLG 134
            + T  + +W+KLE+LY SKT  NKLF  +++ +++ +EG  +  HV  F  +L  ++ +G
Sbjct: 75   NETHARSLWNKLETLYASKTGNNKLFLLKQMMNIRYREGTLINDHVNDFQGVLDQLSGMG 134

Query: 135  VTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDS--ITLDSISSTLLPHAQRRQSVEEGGG 192
            +  +DE   + LL +LP S++    +LT    +  +T++ + S +L    RR+S +    
Sbjct: 135  IKFEDEVLGLWLLNTLPDSWETFRVSLTNSAPNGVVTMEYVKSGILNEEARRRSQDTSTS 194

Query: 193  SSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDR-KTTECYSCKQIGH-------WKRD 244
             S     +    DRGR K K      K RSKS+ R K  EC+ C +  H       WKR+
Sbjct: 195  QSD----ILVTDDRGRNKQKGQRGRDKSRSKSRSRYKDIECHYCGKKSHIKKYSFKWKRE 250

Query: 245  CPNRSGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDA-WVLDSGCSYHMTQHREWFN 303
               +  K G+  +    V++D   + +D   V +V C +  W++DSG +YH+T  +E+F 
Sbjct: 251  -KKQDNKDGDTGNQVATVRADLLVACDDN--VINVACHETTWIVDSGAAYHVTPRKEFFT 307

Query: 304  SFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHEN 363
            S+  GD G + +G+D    + G   V +   +G    L  V++  ++  NLIS G L ++
Sbjct: 308  SYTPGDFGELRMGNDGQVKVTGTGTVCLETSNGTKLVLKNVKHAPDIRLNLISTGKLDDD 367

Query: 364  GYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMR 423
            G+     +     +++KG++ V R  +++ N+Y L   ++  D  +V   + A++LWH R
Sbjct: 368  GFCCFFGDGH--WKITKGSLVVARGNKSS-NLYSLQS-SVSDDSVNVVEKECASELWHKR 423

Query: 424  LGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGI--LDYVHS 481
            LGH+S +G+  L K++ L GV+   +  C +C+ GKQ RV F + H  T+    LD +HS
Sbjct: 424  LGHMSVKGIDYLAKKSKLSGVKEAKLDKCVHCLAGKQRRVSFMS-HPPTRKSEPLDLIHS 482

Query: 482  DVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKY 541
            DV GP K  S+ G  YFVTF DD+SRK+WVY +K+KS+V   FK + A VE QTG+K+K 
Sbjct: 483  DVCGPMKVRSLGGASYFVTFIDDYSRKLWVYTLKHKSDVLGVFKEFHALVERQTGKKLKC 542

Query: 542  LRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLS 601
            +R+DNG EY    F  +C   GI+   +  K PQ NG+AERMNRT+ E+ RC+  +A L 
Sbjct: 543  IRTDNGGEYCGP-FDEYCRRYGIRHQKTPPKIPQLNGLAERMNRTIMERVRCMLDDAKLP 601

Query: 602  KCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSK 661
               WA  ++ A +++N SP  +L  +V ++VW G  +   +LR+F   A+VH+  ++RSK
Sbjct: 602  SSFWAEAVSTAVHVINLSPVIALKNEVPDKVWCGKDVSYDHLRVFGCKAFVHVPRDERSK 661

Query: 662  LDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVPDTEVENS 721
            LD K++ CI IGY     GY+L+DPV+KK++ SRDVVF E   ++  D    P++    S
Sbjct: 662  LDSKTRQCIFIGYGFDEFGYRLYDPVEKKLVRSRDVVFFENQTIEDIDKVKQPESRDSGS 721

Query: 722  SQDKIQV------DIEETP--VSPRQIVAQQQSEPGSDSGEVQDYTLVRDREPSRIT--- 770
              D   V      D++E    V     V   Q +     G   D        PS++    
Sbjct: 722  LVDIEPVSRRYTDDVDEVQENVQNGDPVPDYQGDTVDVDGHADDVVHQEQEVPSQVPVDL 781

Query: 771  ---------PPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKN 821
                     P  RY     + Y LLT  G+P +Y EAM S +K +W  AM EEM SL  N
Sbjct: 782  PRRSDRERRPSTRYS---PSQYVLLTDGGEPESYEEAMESDQKRQWFEAMQEEMNSLYVN 838

Query: 822  ET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVV 881
            +T  LV+ P  ++ +  +WVY+ K         +FKA LV KG+SQ KGID+DEIFSPVV
Sbjct: 839  DTFELVKAPKNRKALKNRWVYRVKHEEG-TSVPRFKARLVVKGFSQKKGIDFDEIFSPVV 897

Query: 882  RHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRS 941
            + +SIRVVL L A +D+ +EQMDVKT FLHG+L+E+IY+EQPEGF   G    VC+LK+S
Sbjct: 898  KFSSIRVVLGLAARLDIEIEQMDVKTAFLHGDLDEEIYMEQPEGFKVKGKEDYVCRLKKS 957

Query: 942  LYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHL 1001
            LYGLKQ+PRQWYK+F S M + GY++   D CV+V    D  F+ LLLYVDDMLI   + 
Sbjct: 958  LYGLKQAPRQWYKKFTSVMSKHGYKKTSSDHCVFVNRYSDDDFVILLLYVDDMLIVGRNA 1017

Query: 1002 HDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKAN 1061
              + ELK +L K F MKD+G AK+ILGM+I +DR  KKLWLSQ+ Y+E VL RF M++A 
Sbjct: 1018 SRIQELKQELSKSFSMKDMGPAKQILGMKIIRDRQNKKLWLSQEKYIEKVLERFHMNEAK 1077

Query: 1062 HVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
             VSTPL  HFKL  +Q P  + E E M ++PY+
Sbjct: 1078 PVSTPLDMHFKLCKKQCPSSEKEKEEMQRVPYS 1110



 Score = 84.3 bits (207), Expect = 2e-14
 Identities = 44/115 (38%), Positives = 65/115 (56%), Gaps = 22/115 (19%)

Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
            GG + W+S +Q  V +STTEAE++A  EA+KE LW+   ++ELG  Q             
Sbjct: 1200 GGAVSWQSRLQKCVDLSTTEAEFIASVEASKEMLWMKKFLQELGFVQD------------ 1247

Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
                      R+KHID R+H IR++L  + + L+KIHT +N +D +TK +   KF
Sbjct: 1248 ----------RSKHIDTRYHWIRDILECKMLELEKIHTDDNGSDMMTKALPRGKF 1292


>emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir||S05465
            retrovirus-related polyprotein - Arabidopsis thaliana
            retrotransposon Ta1-3
          Length = 1291

 Score =  746 bits (1925), Expect = 0.0
 Identities = 461/1243 (37%), Positives = 680/1243 (54%), Gaps = 91/1243 (7%)

Query: 52   EMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQ 111
            E  E A  +I   V D V+  I    +  ++W+ L   YM  +  N+++ + + YS KM 
Sbjct: 85   EKSENAMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKFYSFKMN 144

Query: 112  EGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLD 171
            +   +  +V  F  I+A+++ L + V +E +AI+ L  L   Y  L  TL YG  +++L 
Sbjct: 145  DTKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGNKALSLK 204

Query: 172  SISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGK----KKRSKSKDR 227
             + S      +     +E   ++   L+     +R R + +  +  K    + RSKS   
Sbjct: 205  DVISAARSLERELNEQKETDKNTSTVLYTN---ERSRPQTRNQNHNKGGQGRGRSKSNSN 261

Query: 228  KTTECYSCKQIGHWKRDCPNRSGK-SGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAWV 286
                C+ CK+ GH K+D   R  K    N   A V+      SE   L V+ +   D WV
Sbjct: 262  AKLTCWYCKKEGHVKKDYFARKRKLESENPGEAGVITEKLVFSEA--LSVNDLAVRDIWV 319

Query: 287  LDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRY 346
            LDSGC+ HM+  R+WF SF+      + LGDD     +G   +KI    G +  L  V+Y
Sbjct: 320  LDSGCTSHMSARRDWFCSFREDGGPTILLGDDHSVKSQGQGSIKIETHGGTIIGLENVKY 379

Query: 347  VLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGD 406
            V E+ +NLIS GTL + GY  +  + +  +R  K   T +R +   G +Y L G T++ +
Sbjct: 380  VPELRRNLISTGTLDKRGYKHEGGDGK--VRYFKNQKTALRGELVNG-LYILDGNTVLSE 436

Query: 407  VASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFK 466
                E     T+LWH RLGH+    M  L  + ++      ++  C+ CV+GK  +V F 
Sbjct: 437  TCVAEGSKGKTELWHSRLGHIGLNNMKVLAGKGLVSKEEIRVLDFCENCVMGKAKKVSFN 496

Query: 467  TGHHKTKGILDYVHSDVRGPTK-EPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFK 525
             G H ++ +L YVH+D+ G T   PS+ G +YF++  DD +RKVW+YF++ K E F +F 
Sbjct: 497  VGKHNSEDVLRYVHADLWGSTNVTPSLSGNKYFLSIIDDKTRKVWLYFLRSKDETFDRFC 556

Query: 526  LWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNR 585
             WK  VENQ  +K+K LR+DNG E+ +  F  +C+E+GI+RH +   TPQQNGVAERMNR
Sbjct: 557  EWKELVENQQNKKVKCLRTDNGLEFCNLKFDAYCKEHGIERHKTCTYTPQQNGVAERMNR 616

Query: 586  TLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRI 645
            T+ EK RC+   + L +  WA     A YL+NRSP +++D  V EE+W        +LR 
Sbjct: 617  TIMEKVRCMLNESGLGEEFWAEAAATAAYLINRSPASAIDHNVPEELWLNKKPGYKHLRR 676

Query: 646  FWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SML 705
            F   AYVHI   D+ KL P++   I IGY  G KGYK+W   + K ++SR+V+F E S+ 
Sbjct: 677  FGSIAYVHI---DQGKLKPRALKGIFIGYPAGTKGYKIWLLEEHKCVISRNVLFHEESVY 733

Query: 706  KQS-DVTVVPDTEVENSSQDK-----------------IQVDIEETPVSPRQIVAQQQSE 747
            K +     V ++E E +S  K                 IQV  EE   S   +  +Q+ E
Sbjct: 734  KDTMKKERVVESEAEPASHSKSTLIKVKTPGNLNSGEVIQVSDEEE--SDESVEEEQEPE 791

Query: 748  PGSDSGEVQ------DYTLVRDREPSRITPPVRYGFEDLAAYALLT----SSGDPSTYHE 797
               +  E Q      +Y L RDRE  +I PP R+  E   A+AL+T    S  +P +Y E
Sbjct: 792  TQVELPETQTTSSLANYQLARDRERRQIHPPARFTEESGVAFALVTVETLSMEEPQSYQE 851

Query: 798  AMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFK 857
            A +  E  KW  A  EEM+SL KN T  LV  P  +++IGC+W++K K      E  ++K
Sbjct: 852  ATSDKEWKKWKLATHEEMDSLIKNGTWVLVDKPQNRKIIGCRWLFKLKSGSPGVEPVRYK 911

Query: 858  AHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQ 917
            A LV KGY+  +G+DY EIF+ VV+HTSIR+++++V   D+ LEQMDVKT FLHG LEE+
Sbjct: 912  AQLVAKGYTHREGVDYQEIFALVVKHTSIRILMSVVVDQDLELEQMDVKTAFLHGELEEE 971

Query: 918  IYIEQPEG-FSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYV 976
            +Y+EQPEG  SE G+ + VC LK+SLYGLKQSPRQW KRF+ +M+   + R ++D CVYV
Sbjct: 972  LYMEQPEGCISEDGENK-VCLLKKSLYGLKQSPRQWNKRFNRFMIDQNFIRSEHDACVYV 1030

Query: 977  MSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRG 1036
              + +   ++LLLYVDDMLIA     ++N++K +L  EF+MKD+G A +ILG++I +D  
Sbjct: 1031 KQVSEQEHLYLLLYVDDMLIAGKSKSEINKVKEQLSMEFEMKDMGPASRILGIDIIRDMK 1090

Query: 1037 AKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPY--A 1094
               L +SQ SY+  V+ RF+M++A    +P+  HFKL+  +    D E    + +PY  A
Sbjct: 1091 NGVLRMSQASYIHNVVQRFNMAEAKVTRSPIGAHFKLAAVRD---DDECIDNNAVPYASA 1147

Query: 1095 VQLVV*CMLWFALDQIWHKQLVKC------QVHVQAREAAL----GSSQVDPKILE---- 1140
            V  ++  M+    D  +   LV         +H +A +  L    GS  ++    +    
Sbjct: 1148 VGSIMYAMIGIRPDLAYVICLVSRYMARPGSIHWEAVKWILRYMRGSQDLNLVFTKEKEF 1207

Query: 1141 ---GYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICLYSCGG-PICWKSSVQST 1196
               GY  S Y      R       + G               +++ GG  + WK+++QS 
Sbjct: 1208 RVTGYCDSDYAADLDRRR-----SVSGY--------------VFTVGGNTVSWKANLQSV 1248

Query: 1197 VAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDS 1239
             A+STTEAE+MA+ EAAKEALW+ GL+K+LG+EQ  V L CDS
Sbjct: 1249 TALSTTEAEFMALTEAAKEALWIKGLMKDLGLEQDKVTLWCDS 1291


>emb|CAA37924.1| unnamed protein product [Arabidopsis thaliana] gi|99719|pir||S23319
            hypothetical protein 2 - Arabidopsis thaliana
            retrotransposon Ta1-2 (strain Landsberg) (fragment)
          Length = 1084

 Score =  690 bits (1780), Expect = 0.0
 Identities = 399/1049 (38%), Positives = 598/1049 (56%), Gaps = 53/1049 (5%)

Query: 81   DVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDE 140
            ++W+ L   YM  +  N+++ + + YS KM +   +  +V  F  I+A+++ L + V +E
Sbjct: 14   EMWETLNKQYMETSLPNRIYVQLKFYSFKMNDSKSINENVNEFLKIVAELSSLEINVVEE 73

Query: 141  DKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFV 200
             +AI+ L  L   Y  L  TL YG  +++L  + S+     +     +E   ++   L+ 
Sbjct: 74   VRAILFLNGLSSRYSQLKHTLKYGNKALSLQDVISSARSLERELDEQKETDKNTSTVLYT 133

Query: 201  KGGQDRGRGKGKAVDSGKKKR----SKSKDRKTTECYSCKQIGHWKRDCPNRSGK-SGNN 255
                +RGR + +  +  K+ +    SKS       C+ CK+ GH K+DC  R  K    N
Sbjct: 134  N---ERGRPQTRNQNQNKEGQGRGISKSNSNAKLTCWYCKKEGHVKKDCFARKRKLESEN 190

Query: 256  SSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYL 315
               A V+      SE   L V+ +   D WVLDSGC+ HM+  ++WF +F+      + L
Sbjct: 191  PGEAGVITEKLVFSEA--LSVNDLAVRDIWVLDSGCTSHMSARKDWFCNFRKDGGTTILL 248

Query: 316  GDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDI 375
            GDD     +G   +KI    G +  L  V+YV E+ +NLIS GTL + GY  +  + +  
Sbjct: 249  GDDHSVKSQGQGSIKIDTHGGTITVLENVKYVPELRRNLISTGTLDKRGYKHEGGDGK-- 306

Query: 376  LRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMEL 435
            +R  K   T +R +   G +Y L G TI+ +    E     T+LWH RLGH+    M  L
Sbjct: 307  VRYFKNQKTALRGEIVNG-LYILDGNTILSETCVAEGSKGKTELWHSRLGHMGLNNMKVL 365

Query: 436  HKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTK-EPSV*G 494
              + ++       +  C+ CV+GK  +V F  G H ++ +L YVH+D+ G T   PS+ G
Sbjct: 366  AGKGLVSKEEIRELDFCENCVMGKAKKVSFNMGKHNSEYVLSYVHADLWGSTNVTPSLSG 425

Query: 495  FRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKN 554
             +YF++  DD +RKVW+YF++ K E F +F   K  VENQ  +K+K LR+DNG E+ +  
Sbjct: 426  NKYFLSIIDDKTRKVWLYFLRSKDETFDRFCERKELVENQQNKKVKCLRTDNGLEFCNLK 485

Query: 555  FMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACY 614
            F  +C+++GI+RH +   TPQQNGVA+RMNRT+ EK RC+   + L +  WA     A Y
Sbjct: 486  FDAYCKDHGIERHMTCTYTPQQNGVADRMNRTIMEKVRCMLNESGLGEEFWAEAAATAAY 545

Query: 615  LVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGY 674
            L++RSP +++D  V EE+W        +LR F   AYVHI   D+ KL P++   I IGY
Sbjct: 546  LISRSPASAIDHNVPEELWLNKKPGYKHLRRFGSIAYVHI---DQGKLKPRALKGIFIGY 602

Query: 675  NKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQS-DVTVVPDTEVENSSQDK-------- 725
              G KGYK+W   ++K ++SR+V+F E  + K + +   V ++E E +S  K        
Sbjct: 603  PSGTKGYKIWLLEEQKCVISRNVLFHEELVYKDTIEKERVVESEAEPASHSKSTLIKMKT 662

Query: 726  ---------IQVDIEETPVSPRQIVAQQQSEPGSDSGEVQ------DYTLVRDREPSRIT 770
                     IQV  EE   S   +  +Q+ EP  +  E Q      +Y LVRDRE  +I 
Sbjct: 663  PGNLNSGGVIQVSDEEE--SDESVDEEQEPEPQVELPETQTTSSLANYQLVRDRERRQIH 720

Query: 771  PPVRYGFEDLAAYALLT----SSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NL 826
            PP R+  E   A+AL+T    S  +P +Y EA +  E  KW  A  EE  SL KN T  L
Sbjct: 721  PPARFTEESGVAFALVTVETLSMEEPQSYQEATSDKEWKKWKLATHEEKNSLIKNGTWVL 780

Query: 827  VQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSI 886
            V  P  +++IGC+W++K K  +   E  ++KA LV KGY+Q +G+DY EIF+PV +HTSI
Sbjct: 781  VDKPKDRKIIGCRWLFKMKSGIPGVEPVRYKARLVAKGYTQREGVDYQEIFAPVAKHTSI 840

Query: 887  RVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF-SETGDGRLVCKLKRSLYGL 945
            R+++++V   D+ LEQMDVKTTFLHG LEE++Y+EQP+GF SE G+ + VC LK+SLYGL
Sbjct: 841  RILMSVVVDQDLELEQMDVKTTFLHGELEEELYMEQPDGFISEDGENK-VCLLKKSLYGL 899

Query: 946  KQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVN 1005
            KQSPRQW KRF  +M+   + R ++D CVYV    +   ++LLLYVDDMLIA     ++N
Sbjct: 900  KQSPRQWNKRFSRFMIDQNFIRSEHDACVYVKQAGEQDHLYLLLYVDDMLIAGKSKSEIN 959

Query: 1006 ELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVST 1065
            +++ +L  EF+MKD+G A +ILG++I +DR    L +SQ  Y+  V+ RF+M++A    +
Sbjct: 960  KVE-QLSMEFEMKDMGPASRILGIDIIRDRKNGVLRMSQARYIHNVVQRFNMAEAKVTRS 1018

Query: 1066 PLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
            P+  HFKL+   + + D E    + +PYA
Sbjct: 1019 PIGAHFKLA---AVRDDDECIDNNDVPYA 1044


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.323    0.138    0.419 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,170,957,551
Number of Sequences: 2540612
Number of extensions: 90774210
Number of successful extensions: 252065
Number of sequences better than 10.0: 4486
Number of HSP's better than 10.0 without gapping: 3740
Number of HSP's successfully gapped in prelim test: 750
Number of HSP's that attempted gapping in prelim test: 238968
Number of HSP's gapped (non-prelim): 8642
length of query: 1307
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1167
effective length of database: 507,674,714
effective search space: 592456391238
effective search space used: 592456391238
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 81 (35.8 bits)


Lotus: description of TM0003.6