Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC146720.2 - phase: 0 
         (1333 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cult...  1134  0.0
gb|AAP54850.1| putative gag-pol polyprotein [Oryza sativa (japon...  1080  0.0
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja...  1075  0.0
ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (jap...  1045  0.0
ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (jap...   961  0.0
dbj|BAB10876.1| polyprotein [Arabidopsis thaliana]                    816  0.0
gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein ...   813  0.0
gb|AAF99727.1| F17L21.7 [Arabidopsis thaliana]                        800  0.0
gb|AAG46116.1| putative copia-like retrotransposon polyprotein [...   798  0.0
gb|AAC02664.1| polyprotein [Arabidopsis thaliana]                     785  0.0
gb|AAC02666.1| polyprotein [Arabidopsis thaliana]                     779  0.0
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]         779  0.0
gb|AAC02669.1| polyprotein [Arabidopsis thaliana]                     778  0.0
gb|AAK43485.1| polyprotein, putative [Arabidopsis thaliana]           771  0.0
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   766  0.0
gb|AAK51235.1| polyprotein [Arabidopsis thaliana]                     760  0.0
gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis ...   741  0.0
gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir|...   736  0.0
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ...   686  0.0
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]               624  e-177

>gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
            gi|51854440|gb|AAU10819.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
          Length = 1679

 Score = 1134 bits (2932), Expect = 0.0
 Identities = 645/1441 (44%), Positives = 832/1441 (56%), Gaps = 160/1441 (11%)

Query: 30   RGIAFLIFFDNKNSRALYLEQEFSKISMEQFSTASSYCQHIKSLSDQLANVGAPVSNERM 89
            RG+ +  F  N   RAL L  EF        S    YC+ +K+ +D L +VG PV +  +
Sbjct: 250  RGLEYQ-FLGNSEQRALNLTTEFHTFQQGDLSV-DEYCRKMKTFADSLGDVGEPVRDRTL 307

Query: 90   VLQLVSGLSEAYDTVGSQIRHSDTLPPFYKARSMVVLEETARAKRTAVTTANSTFLAAQE 149
            VL  ++GLSE ++ + S +      P F + RS++ LEE ++    A + +++ FLA   
Sbjct: 308  VLNTLNGLSEKFNNLRSLVPMQRPFPTFAELRSLLRLEELSKPNHAA-SASSAVFLATGS 366

Query: 150  DNNSGNSPSHRPNRNNNSNNGSRGGRGGSGNNH-GGRGRGRGGGRGGRSHNYQQGSWQQP 208
              N G   +        + N   GG+G S     GG G G GGG GG + N   GS    
Sbjct: 367  TTNGGKGANPAHGTGYGAGNQGGGGKGNSNRRRRGGNGGGGGGGGGGGNSNQGGGSTVPA 426

Query: 209  QAPQHQWAYPPYQYPSWNAPW----QPWATP-----------PCPY-----------PTT 242
            QA   QW     Q+PS   PW      W  P           P P+           P  
Sbjct: 427  QAQGAQWRAGS-QWPSPQNPWAGTIHMWPGPLGRGVLGPRPAPSPFAGAALAGPSGGPLP 485

Query: 243  GNWQRPVVPNRQAGILGAKPQQAHVAATPSSYAPTDIQ------------AAMHSLSLA- 289
            G      VP + A  LG + Q   V     S  PT +             +     SLA 
Sbjct: 486  GQVYPAPVPYQTAAGLGQQAQLGGVGQFGPSQPPTSVALQQPLGWFGGAPSQWDQASLAG 545

Query: 290  ---------PPDEQWYMDTGATSHMTANGGNLT----SYSNISNNITVGSGHNIPVIGCG 336
                     P    WYMDTGAT+HMT++ G L+       N  ++I VG+G  IPV   G
Sbjct: 546  SFNTTTLHQPATNDWYMDTGATAHMTSDTGILSLSHPPNPNSPSHIVVGNGSTIPVTSIG 605

Query: 337  NALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPLMR 396
            ++ + +P    TL ++L +P +IKNL+SVR+F IDN  SVEFDPFGFSVKD +T   + R
Sbjct: 606  HSKICHPNCSFTLRDILCSPAIIKNLISVRRFVIDNWCSVEFDPFGFSVKDLRTRTVIAR 665

Query: 397  CNSSGDLYPLA-TRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILNSLHRNNSILCNKFR 455
             NSSG LY L    P  P+ T     + ++ +WH RLGH G   LN L    + +    R
Sbjct: 666  FNSSGPLYSLHHALPPPPAATALLANSSTDLLWHRRLGHLGHDALNRL----AAVVPMTR 721

Query: 456  NNF--FCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIWTSPILSSGGHHYYILFLDDFT 513
             +    C +CQLG+ ++LPF  S S     F++ H D+WTSP++S+ G  YY++ LDD +
Sbjct: 722  GDLTGVCHACQLGRHVRLPFASSTSRASTNFELFHCDLWTSPVVSASGFKYYLVILDDCS 781

Query: 514  DFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMIF 573
             ++WTFPL  KS   +    F  ++KTQF  +I+  QCDNG+E+DNS    F   NG+  
Sbjct: 782  HYVWTFPLRFKSDTFTTLSHFFAYVKTQFATNIRSIQCDNGREFDNSAARTFFLTNGVHL 841

Query: 574  RFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNILPTKKLAL 633
            R SCPHTSPQNGK ER +R++NNI+R++L  A LP SFW  AL  AT+L+N  PTK L  
Sbjct: 842  RMSCPHTSPQNGKAERILRSLNNIVRSMLFQAKLPGSFWVEALHTATHLINRHPTKTLDR 901

Query: 634  QTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYKCFEL 693
             TP   LY   PSYSHL+VFGC C+P + +TT +KL  RST CVFLGYP  H+GY+CF+ 
Sbjct: 902  HTPHFALYGTHPSYSHLRVFGCKCYPNLSATTPHKLAPRSTMCVFLGYPLYHKGYRCFDP 961

Query: 694  SSRKIIISRHVIFDENTFPFSN--SNIPESSCYNFLD----------------------- 728
             S ++IISRHV+FDE++FPF+   + +  ++  +FL+                       
Sbjct: 962  LSNRVIISRHVVFDEHSFPFTELTNGVSNATDLDFLEDFTAPAQAPIGATRRPAVAPTTQ 1021

Query: 729  TSDTPFPYHLFQHTSNLPT---NEPNSHDQPPTTTATPLTTPYSI---NTTTHQPTVAPS 782
            T+ +P  + L +     PT   + P     P +    P  TP  I   +T+   P+  P+
Sbjct: 1022 TASSPMVHGLERPPPCSPTRPVSTPGGPSSPDSRLGPPSPTPALIGPASTSPGPPSAGPA 1081

Query: 783  PQIQTSPQLTITPNTASPHQIPPPNPLLANPPLSPQ------------------------ 818
            P   T P  +     A P   PP  P L  P L P                         
Sbjct: 1082 PSASTCPASSTWETVARP-PTPPGLPRLDGPHLPPAPHVPRRLRSVRATGAPTPLSGLEI 1140

Query: 819  --------MTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPINALQDHNWKMAMKDEYDA 870
                    MTTRA+ G  KP   LNLH +    +S +P     AL D  W+ AM++EY+A
Sbjct: 1141 SPVVNDHVMTTRAKSGHHKPVHRLNLHAAP---LSLVPKTYRAALADPLWRAAMEEEYNA 1197

Query: 871  LIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTGVDCGETFS 930
            L+ N+TWDLVPRP+  N++   WIF+HK  ADGS +RYKAR V  G  Q+ GVD  ETFS
Sbjct: 1198 LLANRTWDLVPRPAGVNVVTGKWIFKHKFHADGSLDRYKARWVLRGFTQRPGVDFDETFS 1257

Query: 931  PVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQHPDYVCLL 990
            PVVKPAT+RTVLS+A+S+ W +HQLDVKNAFLHG L ETVY  QPPGF D   PD VC L
Sbjct: 1258 PVVKPATVRTVLSLAVSRDWPVHQLDVKNAFLHGTLQETVYCTQPPGFVDSAKPDMVCCL 1317

Query: 991  KKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYVDDIILTAS 1050
             KSLYGLKQAPRAWY RFT ++ ++GF  +  D SLFI H G+DT Y+LLYVDDI+LTAS
Sbjct: 1318 NKSLYGLKQAPRAWYSRFTTFLQSIGFVEAKSDTSLFILHRGNDTVYLLLYVDDIVLTAS 1377

Query: 1051 SDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSD----------------------- 1087
            S TL    +S L  EF+MKDLG L +FLG+SVTR+S                        
Sbjct: 1378 SRTLLHWTISALQGEFSMKDLGALHHFLGVSVTRNSAGLVLSQRQYCIDILERAGMADCK 1437

Query: 1088 ------DTKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISYAVQQVCLFMHDPKTQ 1141
                  DT AKLS + G P  DP+++RSLAGALQYLTFTRPDISYAVQQVCL MHDP+  
Sbjct: 1438 PCNTPVDTTAKLSSSDGPPVADPTDFRSLAGALQYLTFTRPDISYAVQQVCLHMHDPREP 1497

Query: 1142 HMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYCVYLGDNLV 1201
            H+ ALKRI+ YI+G+   GLH+  S+   L  Y+DADW GCPDTR+STSGY V+LGDNLV
Sbjct: 1498 HLAALKRILHYIRGSVDLGLHIQRSSACDLAVYSDADWAGCPDTRRSTSGYAVFLGDNLV 1557

Query: 1202 SWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCDNVSAVYLS 1261
            SWS+KRQ T+SRSSAEAEYR VAN V+E  WLR LL EL  P ++ATLVYCDNVSAVYLS
Sbjct: 1558 SWSSKRQHTVSRSSAEAEYRAVANAVAEVTWLRQLLQELHSPPSRATLVYCDNVSAVYLS 1617

Query: 1262 GNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLFDDFRDSLH 1321
             NP+QHQRTKH+E+D+HFVRE+VA G VRV+HVP+  Q ADIFTKGLP  +F +FR SL+
Sbjct: 1618 SNPVQHQRTKHVEIDLHFVRERVAVGAVRVLHVPTTSQYADIFTKGLPTPVFTEFRSSLN 1677

Query: 1322 I 1322
            +
Sbjct: 1678 V 1678


>gb|AAP54850.1| putative gag-pol polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|37536522|ref|NP_922563.1| putative gag-pol polyprotein
            [Oryza sativa (japonica cultivar-group)]
            gi|13310887|gb|AAG13591.2| putative gag/pol polyprotein
            [Oryza sativa]
          Length = 1417

 Score = 1080 bits (2792), Expect = 0.0
 Identities = 598/1308 (45%), Positives = 798/1308 (60%), Gaps = 64/1308 (4%)

Query: 36   IFFDNKNSRALYLEQEFSKISMEQFSTASSYCQHIKSLSDQLANVGAPVSNERMVLQLVS 95
            +F DN   R++YLE EF  I+     T + Y   +K L+D L ++  PVS    VL L+ 
Sbjct: 120  VFRDNAVQRSVYLETEFRSINQGDM-TITQYTAKLKQLADGLRDINMPVSEPSQVLNLLR 178

Query: 96   GLSEAYDTVGSQIRHSDTLPPFYKARSMVVLEETARAKRTAVTTANSTFLAAQEDNNSGN 155
            GL+  + ++ + I   +    F  ARS ++L E  + +  A   A     A    ++  +
Sbjct: 179  GLNTKFRSLRASIADRNPPHTFMTARSYLLLAEL-QMQHDAKAEAGEALYAGTGSSSGTS 237

Query: 156  SPSHRPNRNNNSNNGSRGGRGGSGNNHGGRGRGRGGGRGGRSHNYQQGSWQQPQAPQHQW 215
              + +P        G R GRGG G   GG     GGG G   H+ Q     +P AP   W
Sbjct: 238  DTTGQPRPKGR---GKRRGRGG-GAPPGGAPSTPGGGAGA-GHDGQP----RPPAP---W 285

Query: 216  AYPPYQ--YPSWNAPWQ-PWATPPCPYPTTGNWQRPVVPNRQAGILGAKPQQAHVAATPS 272
             Y P+     +W  P++ P A    P P     Q     +    +  A P      A  +
Sbjct: 286  GYNPWTGFVQAWPFPFRAPGAGVLGPRPPFQAQQAMTAQHLLPALPPASPGVHSTGAWDN 345

Query: 273  SYAPTDIQAAMHSLSLAPPDEQWYMDTGATSHMTANGGNLTSYSNI--SNNITVGSGHNI 330
            S   + +Q+A  + +  P    W++DTGA++HM++  G L     +  S+ ITVG+G  +
Sbjct: 346  SALYSALQSAGVATTTPPSAADWFLDTGASAHMSSTPGILAHPRPLPFSSCITVGNGAKL 405

Query: 331  PVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQT 390
            PV    +  +      L L+NVL +P LIKNL+SV+K T DN VS+EFDP GFS+KD QT
Sbjct: 406  PVTHTASTHIPTSSTDLHLHNVLVSPPLIKNLISVKKLTRDNNVSIEFDPTGFSIKDLQT 465

Query: 391  GMPLMRCNSSGDLYPLAT-RPYYPSTTPSTFAALSNEIWHNRLGHPGVSILNSLHRNNSI 449
             +  +RC+S GDLYPL    P+  S T S     S E WH RLGHPG + L+ +  +   
Sbjct: 466  QVVKLRCDSPGDLYPLRLPSPHALSATSSP----SVEHWHLRLGHPGSASLSKVLGSFDF 521

Query: 450  LCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIWTSPILSSGGHHYYILFL 509
             CNK   +  C +C +G  ++LPF+ S S TL PF +VH+D+WTSPI S+ G+ YY++FL
Sbjct: 522  QCNKSAPHH-CSACHVGTNVRLPFHSSSSQTLFPFQLVHTDVWTSPIYSNSGYKYYVVFL 580

Query: 510  DDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQN 569
            DDFT ++WTFP+ NKS+       F  +  TQF   +   Q DNGKEYD+         +
Sbjct: 581  DDFTHYIWTFPVRNKSEVFHTVRSFFAYAHTQFGLPVLALQTDNGKEYDSYALRSLLSLH 640

Query: 570  GMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNILPTK 629
            G + R SCP++S QNGK ER +RTIN+ +RT+L H++ P SFW  ALQ AT+L+N  P +
Sbjct: 641  GAVLRLSCPYSSQQNGKAERILRTINDYVRTMLVHSAAPLSFWAEALQTATHLINRRPCR 700

Query: 630  KLALQTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYK 689
                 TP  +L    P+Y HL+VFGCLC+P   +T  +KL  RS  CVF+GYP++HRGY+
Sbjct: 701  ATGSLTPYQLLLGAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLACVFIGYPADHRGYR 760

Query: 690  CFELSSRKIIISRHVIFDENTFPFSNSNIPESSCYNFLDTSDTPFPYHLFQHTSNLPTNE 749
            C+++ SR++  SRHV F E+ FPF ++  P  S          P P H       LP   
Sbjct: 761  CYDMVSRRVFTSRHVTFVEDVFPFRDAPSPRPSA--------PPPPDHGDDTIVLLPA-- 810

Query: 750  PNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTSPQLTITPNTASP-HQI-PPPN 807
            P  H   P  TA             H     PSP        + TP++A+P H + PPP+
Sbjct: 811  PAQHVVTPVGTAP-----------AHDAASPPSPA-------SSTPSSAAPAHDVAPPPS 852

Query: 808  PLLANP----PLSPQMTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPINALQDHNWKMA 863
            P  ++P    P    MTTRA+ GI KP     +  +S+  +SP P++   AL+D NW+ A
Sbjct: 853  PETSSPASASPPRHAMTTRARAGISKPNPRYAMTATST--LSPTPSSVRAALRDPNWRAA 910

Query: 864  MKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTGV 923
            M+ E+DAL+ N+TW LVPRP  A II   W+F+ K  ADGS ++YKAR V  G NQ+ GV
Sbjct: 911  MQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGV 970

Query: 924  DCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQH 983
            D GETFSPVVKPATIRTVL++  SK W  HQLDV NAFLHG+L E V   QP GF D   
Sbjct: 971  DFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLHGHLQERVLCQQPTGFEDAAR 1030

Query: 984  PDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYVD 1043
            P  VCLL +SLYGL+QAPRAW++RF D+  +LGF  S  D SLF+   G DTAY+LLYVD
Sbjct: 1031 PADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRADPSLFVLRRGSDTAYLLLYVD 1090

Query: 1044 DIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSDDTKAKLSGTSGNPYHD 1103
            D+IL+ASS +L Q I+ +L +EF +KD+GPL YFLGI V R +D      S  + +   D
Sbjct: 1091 DMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYFLGIEVQRTADGFVLSQSKYATD---D 1147

Query: 1104 PSEYRSLAGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHL 1163
             S YRS+AGALQYLT TRPDI+YAVQQVCL MH P+  H+T LKRI+RYIKGT+  GLHL
Sbjct: 1148 SSWYRSIAGALQYLTLTRPDIAYAVQQVCLHMHAPREAHVTLLKRILRYIKGTAAFGLHL 1207

Query: 1164 YPSTVDKLTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGV 1223
              ST   LT ++DADW GCPDTR+STSG+C++LGD+L+SWS+KRQ T+SRSSAEAEYRGV
Sbjct: 1208 RASTSPTLTAFSDADWAGCPDTRRSTSGFCIFLGDSLISWSSKRQTTVSRSSAEAEYRGV 1267

Query: 1224 ANVVSESCWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREK 1283
            AN V+E  WLR LL EL C V +AT+ YCDN+S+VY+S NP+ H+RTKHIE+DIHFVREK
Sbjct: 1268 ANAVAECTWLRQLLGELHCRVPQATIAYCDNISSVYMSKNPVHHKRTKHIELDIHFVREK 1327

Query: 1284 VARGQVRVMHVPSRYQIADIFTKGLPLQLFDDFRDSLHIRQPPVSTTG 1331
            VA G++RV+ +PS +Q AD+FTKGLP  +F++FR SL +     +T G
Sbjct: 1328 VALGELRVLPIPSAHQFADVFTKGLPSSMFNEFRASLCVGDTTAATAG 1375


>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|37530764|ref|NP_919684.1| putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)]
          Length = 1803

 Score = 1075 bits (2779), Expect = 0.0
 Identities = 599/1326 (45%), Positives = 796/1326 (59%), Gaps = 90/1326 (6%)

Query: 36   IFFDNKNSRALYLEQEFSKISMEQFSTASSYCQHIKSLSDQLANVGAPVSNERMVLQLVS 95
            +F DN   R++YLE EF  I+     T + Y   +K L+D L ++  PVS    VL L+ 
Sbjct: 120  VFRDNAVQRSVYLETEFRSINQGDM-TITQYTAKLKQLADGLRDINMPVSEPSQVLNLLR 178

Query: 96   GLSEAYDTVGSQIRHSDTLPPFYKARSMVVLEETARAKRTAVTTANSTFLAAQEDNNSGN 155
            GL+  + ++ + I   +    F  ARS ++L E  + +  A   A     A    ++  +
Sbjct: 179  GLNTKFRSLRASIADRNPPHTFMTARSYLLLAEL-QMQHDAKAEAGEALYAGTGSSSGTS 237

Query: 156  SPSHRPNRNNNSNNGSRGGRGGSGNNHGGRGRGRGGGRGGRSHNYQQGSWQQPQAPQHQW 215
              + +P        G R GRGG G   GG     GGG G   H+ Q     +P AP   W
Sbjct: 238  DTTGQPRPKGR---GKRRGRGG-GAPPGGAPSTPGGGAGA-GHDGQP----RPPAP---W 285

Query: 216  AYPPYQ--YPSWNAPWQ-PWATPPCPYPTTGNWQRPVVPNRQAGILGAKPQQAHVAATPS 272
             Y P+     +W  P++ P A    P P     Q     +    +  A P      A  +
Sbjct: 286  GYNPWTGFVQAWPFPFRAPGAGVLGPRPPFQAQQAMTAQHLLPALPPASPGVHSTGAWDN 345

Query: 273  SYAPTDIQAAMHSLSLAPPDEQWYMDTGATSHMTANGGNLTSYSNI--SNNITVGSGHNI 330
            S   + +Q+A  + +  P    W++DTGA++HM++  G L     +  S+ ITVG+G  +
Sbjct: 346  SALYSALQSAGVATTTPPSAADWFLDTGASAHMSSTPGILAHPRPLPFSSCITVGNGAKL 405

Query: 331  PVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQT 390
            PV    +  +      L L+NVL +P LIKNL+SV+K T DN VS+EFDP GFS+KD QT
Sbjct: 406  PVTHTASTHIPTSSTDLHLHNVLVSPPLIKNLISVKKLTRDNNVSIEFDPTGFSIKDLQT 465

Query: 391  GMPLMRCNSSGDLYPLAT-RPYYPSTTPSTFAALSNEIWHNRLGHPGVSILNSLHRNNSI 449
             +  +RC+S GDLYPL    P+  S T S     S E WH RLGHPG + L+ +  +   
Sbjct: 466  QVVKLRCDSPGDLYPLRLPSPHALSATSSP----SVEHWHLRLGHPGSASLSKVLGSFDF 521

Query: 450  LCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIWTSPILSSGGHHYYILFL 509
             CNK   +  C +C +G  ++LPF+ S S TL PF +VH+D+WTSPI S+ G+ YY++FL
Sbjct: 522  QCNKSAPHH-CSACHVGTNVRLPFHSSSSQTLFPFQLVHTDVWTSPIYSNSGYKYYVVFL 580

Query: 510  DDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQN 569
            DDFT ++WTFP+ NKS+       F  +  TQF   +   Q DNGKEYD+         +
Sbjct: 581  DDFTHYIWTFPVRNKSEVFHTVRSFFAYAHTQFGLPVLALQTDNGKEYDSYALRSLLSLH 640

Query: 570  GMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNILPTK 629
            G + R SCP++S QNGK ER +RTIN+ +RT+L H++ P SFW  ALQ A +L+N  P +
Sbjct: 641  GAVLRLSCPYSSQQNGKAERILRTINDCVRTMLVHSAAPLSFWAEALQTAMHLINRRPCR 700

Query: 630  KLALQTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYK 689
                  P  +L    P+Y HL+VFGCLC+P   +T  +KL  RS  CVF+GYP++HRGY+
Sbjct: 701  ATGSLKPYQLLLGAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLACVFIGYPADHRGYR 760

Query: 690  CFELSSRKIIISRHVIFDENTFPFSNSNIPESSCYNFLDTSDTPFPYHLFQHTSNLPTNE 749
            C+++ SR++  SRHV F E+ FPF ++  P  S          P P H       LP   
Sbjct: 761  CYDMVSRRVFTSRHVTFVEDVFPFRDAPSPRPSA--------PPPPDHGDDTIVLLPA-- 810

Query: 750  PNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTSPQLTITPNTASP-HQI-PPPN 807
            P  H   P  TA             H     PSP        + TP++A+P H + PPP+
Sbjct: 811  PAQHVVTPVGTAP-----------AHDAASPPSPA-------SSTPSSAAPAHDVAPPPS 852

Query: 808  PLLANP----PLSPQMTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPINALQDHNWKMA 863
            P  ++P    P    MTTRA+ GI KP     +  +S+  +SP P++   AL+D NW+ A
Sbjct: 853  PETSSPASASPPRHAMTTRARAGISKPNPRYAMTATST--LSPTPSSVRVALRDPNWRAA 910

Query: 864  MKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTGV 923
            M+ E+DAL+ N+TW LVPRP  A II   W+F+ K  ADGS ++YKAR V  G NQ+ GV
Sbjct: 911  MQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGV 970

Query: 924  DCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQH 983
            D GETFSPVVKPATIRTVL++  SK W  HQLDV NAFLHG+L E V   QP GF D   
Sbjct: 971  DFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLHGHLQERVLCQQPTGFEDAAR 1030

Query: 984  PDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYVD 1043
            P  VCLL +SLYGL+QAPRAW++RF D+  +LGF  S  D SLF+   G DTAY+LLYVD
Sbjct: 1031 PADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRADPSLFVLRRGSDTAYLLLYVD 1090

Query: 1044 DIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSD---------------- 1087
            D+IL+ASS +L Q I+ +L +EF +KD+GPL YFLGI V R +D                
Sbjct: 1091 DMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYFLGIEVQRTADGFVLSQSKYATDVLER 1150

Query: 1088 -------------DTKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISYAVQQVCLF 1134
                         D K KLS   G  + D S YRS+AGALQYLT TRPDI+YAVQQVCL 
Sbjct: 1151 AGMANCKAVATPADAKPKLSSDEGPLFQDSSWYRSIAGALQYLTLTRPDIAYAVQQVCLH 1210

Query: 1135 MHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYCV 1194
            MH P+  H+T LKRI+RYIKGT+  GLHL  ST   LT ++DADW GCPDTR+STSG+C+
Sbjct: 1211 MHAPREAHVTLLKRILRYIKGTAAFGLHLRASTSPTLTAFSDADWAGCPDTRRSTSGFCI 1270

Query: 1195 YLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCDN 1254
            +LGD+L+SWS+KRQ T+SRSSAEAEYRGVAN V+E  WLR LL EL C V +AT+ YCDN
Sbjct: 1271 FLGDSLISWSSKRQTTVSRSSAEAEYRGVANAVAECTWLRQLLGELHCRVPQATIAYCDN 1330

Query: 1255 VSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLFD 1314
            +S+VY+S NP+ H+RTKHIE+DIHFVREKVA G++RV+ +PS +Q AD+FTKGLP  +F+
Sbjct: 1331 ISSVYMSKNPVHHKRTKHIELDIHFVREKVALGELRVLPIPSAHQFADVFTKGLPSSMFN 1390

Query: 1315 DFRDSL 1320
            +FR SL
Sbjct: 1391 EFRASL 1396


>ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1090

 Score = 1045 bits (2703), Expect = 0.0
 Identities = 549/1066 (51%), Positives = 691/1066 (64%), Gaps = 79/1066 (7%)

Query: 323  TVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFG 382
            + G G  +P  G G   V NP               ++NL+SVR+FT DN+ S+EFD FG
Sbjct: 37   STGGGGALPGGGGGYQGVANP---------------VRNLLSVRQFTRDNKCSIEFDEFG 81

Query: 383  FSVKDFQTGMPLMRCNSSGDLYPLATRPYYPSTTPSTFA----ALSNEIWHNRLGHPGVS 438
            FSVKD QT   ++RCNS G+LY L      P+ TPS+ A    A S+ +WH RLGHPG +
Sbjct: 82   FSVKDLQTRRVILRCNSRGELYTL------PAATPSSAAHGLLATSSTLWHCRLGHPGPA 135

Query: 439  ILNSLHRNNSILCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIWTSPILS 498
             ++ L    SI CNK   +  C +CQLGK  +LPF+ S S T +PF++VH D+WTSP++S
Sbjct: 136  AIHGLRNIASISCNKIDTSL-CHACQLGKHTRLPFHNSSSRTSVPFELVHCDVWTSPVMS 194

Query: 499  SGGHHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYD 558
            + G  YY++ LDDF+ F WTF L  KS  H   ++F  ++ TQF   +K FQ DNG+E+ 
Sbjct: 195  TSGFKYYLVVLDDFSHFCWTFLLRLKSDVHRHIVEFVEYVSTQFGLPLKSFQADNGREFV 254

Query: 559  NSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQM 618
            N+    F    G   R SCP+TSPQNGK ER +RTINN IRTLL  AS+PPS+W  AL  
Sbjct: 255  NTAITTFLASRGTQLRLSCPYTSPQNGKAERMLRTINNSIRTLLIQASMPPSYWAEALAT 314

Query: 619  ATYLLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVF 678
            ATYLLN  P+  +    P  +L++  P +SHL+VFGCLC+P + +TT +KL  RST CVF
Sbjct: 315  ATYLLNRRPSSSIHQSLPFQLLHRTIPDFSHLRVFGCLCYPNLSATTPHKLSPRSTACVF 374

Query: 679  LGYPSNHRGYKCFELSSRKIIISRHVIFDENTFPFSNSNIPESSCYNFLDTSDTPFPYHL 738
            LGYP++H+GY+C +LS+ +IIISRHV+FDE+ FPF+ +  P +S ++FL    +P     
Sbjct: 375  LGYPTSHKGYRCLDLSTHRIIISRHVVFDESQFPFA-ATPPAASSFDFLLQGLSPADAPS 433

Query: 739  FQHTSNLP-TNEPNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTSPQLTITPNT 797
             +     P T  P++  + P              T   +   A +P + TS      P +
Sbjct: 434  LEVEQPRPLTVAPSTEVEQPYLPLPSRRLSAGTVTVASEAPSAGAPLVGTSSADATPPGS 493

Query: 798  AS----------------PHQIPPPNPLLA--NPPLSPQ---MTTRAQHGIFKPRQLLNL 836
            A+                P    PP+   A  N   +PQ   M TR+Q G  +P   L  
Sbjct: 494  ATRASTIVSPFRHVYTRRPVTTVPPSSSTAVTNAVAAPQPHSMVTRSQSGSLRPVDRLT- 552

Query: 837  HTSSSNQISPLPTNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFR 896
            +T++    SP+P N  +AL D NW+ AM DEY  L+DN TW LV RP  ANI    WIF+
Sbjct: 553  YTATQAAASPVPANYHSALADPNWRAAMADEYKELVDNGTWRLVSRPPRANIATGKWIFK 612

Query: 897  HKKKADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLD 956
            HK  +DGS  RYKAR V  G +QQ G+D  ETFSPVVK ATIR VLSIA S++W +HQLD
Sbjct: 613  HKFHSDGSLARYKARWVVRGYSQQHGIDYDETFSPVVKLATIRVVLSIAASRAWPIHQLD 672

Query: 957  VKNAFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLG 1016
            VKNAFLHG+L ETVY  QP GF DP  PD VCLL+KSLYGLKQAPRAWYQRF  Y+  +G
Sbjct: 673  VKNAFLHGHLKETVYCQQPSGFVDPTAPDAVCLLQKSLYGLKQAPRAWYQRFATYIRQMG 732

Query: 1017 FSHSVCDHSLFIYHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSY 1076
            F  S  D SLF+Y  GD  AY+LLYVDDIILTAS+ TL Q + ++L+SEFAM DLG L +
Sbjct: 733  FMPSASDTSLFVYKDGDRIAYLLLYVDDIILTASTTTLLQQLTARLHSEFAMTDLGDLHF 792

Query: 1077 FLGISVTRHSD-----------------------------DTKAKLSGTSGNPYHDPSEY 1107
            FLGISV R  D                             DT AKLS T G P  DPS Y
Sbjct: 793  FLGISVKRSPDGLFLSQRQYAVDLLQRAGMAECHSTSTPVDTHAKLSATDGLPVADPSAY 852

Query: 1108 RSLAGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPST 1167
            RS+AGALQYLT TRPD++YAVQQVCLFMHDP+  H+  +KRI+RY+KG+ + GLH+    
Sbjct: 853  RSIAGALQYLTLTRPDLAYAVQQVCLFMHDPREPHLALVKRILRYVKGSLSIGLHIGSGP 912

Query: 1168 VDKLTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVV 1227
            +  LT Y+DADW GCP++R+STSGYCVYLGDNLVSWS+KRQ T+SRSSAEAEYR VA+ V
Sbjct: 913  IQSLTAYSDADWAGCPNSRRSTSGYCVYLGDNLVSWSSKRQTTVSRSSAEAEYRAVAHAV 972

Query: 1228 SESCWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARG 1287
            +E CWLR LL EL  P+  AT+VYCDNVSAVY++ NP+ H+RTKHIE+DIHFVREKVA G
Sbjct: 973  AECCWLRQLLQELHVPIASATIVYCDNVSAVYMTANPVHHRRTKHIEIDIHFVREKVALG 1032

Query: 1288 QVRVMHVPSRYQIADIFTKGLPLQLFDDFRDSLHIRQPPVSTTGVY 1333
            QVRV++VPS +Q ADI TKGLP+QLF DFR SL +R PP +T G Y
Sbjct: 1033 QVRVLYVPSSHQFADIMTKGLPVQLFTDFRSSLCVRAPPAATAGRY 1078


>ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1373

 Score =  961 bits (2483), Expect = 0.0
 Identities = 515/1031 (49%), Positives = 642/1031 (61%), Gaps = 94/1031 (9%)

Query: 382  GFSVKDFQTGMPLMRCNSSGDLYPLATRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILN 441
            G +  D +T   + RCNSSGDLYP     Y P+T+     A    +WH RLGH G   L+
Sbjct: 350  GGAALDLETRNVIARCNSSGDLYPF----YPPATSTHALLAAPTSLWHRRLGHLGREALS 405

Query: 442  SLHRNNSILCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIWTSPILSSGG 501
             L R++ I C K      C +CQLG   +LPF  S S     FD++H D+WTSPI+S  G
Sbjct: 406  KLIRSSVISCTKDDLPHLCHACQLGHHTRLPFSSSSSRASNNFDLIHCDLWTSPIVSVSG 465

Query: 502  HHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSH 561
            + YY++ LDD + ++WTFPL  KS   S    F  H+KTQF   IK  QCDNG+E+DNS 
Sbjct: 466  YKYYLVILDDCSHYIWTFPLRLKSDTFSTIANFFAHVKTQFGTTIKSVQCDNGREFDNSP 525

Query: 562  FHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATY 621
               F   +G+ FR SCP+TS QNG+ ER +RT+NNI+R+LL  A LPP +W  AL  AT 
Sbjct: 526  ARTFFLSHGVAFRMSCPYTSQQNGRAERSLRTLNNILRSLLFQACLPPVYWVEALHTATL 585

Query: 622  LLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGY 681
            L+N +PTK L+  TP   LY   P+Y HL+VFGC C+P + ST  +KL  RS+ CVFLGY
Sbjct: 586  LVNRIPTKTLSSSTPYFHLYSTQPTYDHLRVFGCACYPNMSSTAPHKLAPRSSLCVFLGY 645

Query: 682  PSNHRGYKCFELSSRKIIISRHVIFDENTFPF---SNSNIPESSCYNFLDTSDTPF--PY 736
             S H+GY+C EL S +II SRHV+FDE+ FPF   S S +  S+   FLD ++     P 
Sbjct: 646  SSEHKGYRCLELGSNRIITSRHVVFDESFFPFADMSTSPMASSALDIFLDDNELTAQPPR 705

Query: 737  HLFQHTS-----------NLPTNEPNS-HDQPPTTTATPLTTPYS---INTTTHQP---- 777
              F H             + P   P+S   + P T A P   P+        T QP    
Sbjct: 706  AKFVHAGTSSAARGAVEPSTPPPAPSSIGPRSPATLAGPEAGPHGGSPAGAATSQPGAIS 765

Query: 778  ---TVAPSPQIQTSPQLTITPNTASPHQIPPPNPL-----------------------LA 811
               T APS    T+  +T  P  A+    P  +PL                       LA
Sbjct: 766  PARTAAPSAATSTTRAVTSAPRAATSGTTPSLSPLAGTAAPPPRAEVAASSTAATGRTLA 825

Query: 812  NPPLS-------PQMTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPINALQDHNWKMAM 864
              P+S         M TR + G+ +P   LNLH +    +SP+P +   AL D NW+ AM
Sbjct: 826  TRPVSIAPVDNAHSMRTRGKAGMAQPVDRLNLHAA---PLSPVPRSVREALSDPNWRAAM 882

Query: 865  KDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTGVD 924
            + E+DAL+ N TW LVPRP   N++   WIFRHK  +DGS +RYKAR V  G  Q+ GVD
Sbjct: 883  QAEFDALLANDTWSLVPRPRGVNLVTGKWIFRHKLHSDGSLDRYKARWVLRGFTQRPGVD 942

Query: 925  CGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQHP 984
              ETFSPVVKPAT+R VLS+ALS+ W +HQLDVKNAFLHG L+ETVY  QP GF DP H 
Sbjct: 943  YDETFSPVVKPATVRVVLSLALSQDWPIHQLDVKNAFLHGTLSETVYCIQPTGFADPSHA 1002

Query: 985  DYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYVDD 1044
            D VC L KSLYGLKQAPRAW+ RF  ++ +LGF  +  D SLFI+  G+DT  +LLYVDD
Sbjct: 1003 DLVCRLNKSLYGLKQAPRAWHHRFASHLISLGFIEAQSDSSLFIHRRGNDTVLLLLYVDD 1062

Query: 1045 IILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSD----------------- 1087
            I+LTASS +L Q +++ L  EFAM D+GPL +FLGI+VTR +                  
Sbjct: 1063 IVLTASSASLLQQVIAALQREFAMTDMGPLHHFLGITVTRFASGLFLSQRQYSQDILERA 1122

Query: 1088 ------------DTKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISYAVQQVCLFM 1135
                        D  +KLS   G P  D ++YRSLAGALQYLTFTRPDI++AVQQVCL+M
Sbjct: 1123 GMGECKPCSTPVDVHSKLS-ADGPPVADSTQYRSLAGALQYLTFTRPDIAFAVQQVCLYM 1181

Query: 1136 HDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYCVY 1195
            HDP+  H+ ALKRI+RYI+GT + GL +  S    L  YTDADW GCPDTR+STSGY V+
Sbjct: 1182 HDPREPHLAALKRILRYIQGTLSLGLTMRRSPPTDLVVYTDADWAGCPDTRRSTSGYAVF 1241

Query: 1196 LGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCDNV 1255
            LGDNLVSWS+KRQ T+SRSSAEAEYR VAN V+E+ WLR LL+EL  P   AT+VYCDNV
Sbjct: 1242 LGDNLVSWSSKRQHTVSRSSAEAEYRAVANGVAEATWLRQLLMELHRPPRTATVVYCDNV 1301

Query: 1256 SAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLFDD 1315
            SA+YLS NP+QHQRTKH+E+D+HFVREKVA G VRV+HVP+  Q AD+FTKGLP  LF +
Sbjct: 1302 SAMYLSSNPVQHQRTKHVEIDLHFVREKVALGHVRVLHVPTTSQYADVFTKGLPTSLFQE 1361

Query: 1316 FRDSLHIRQPP 1326
            FR SL I   P
Sbjct: 1362 FRTSLTISDAP 1372


>dbj|BAB10876.1| polyprotein [Arabidopsis thaliana]
          Length = 1429

 Score =  816 bits (2108), Expect = 0.0
 Identities = 497/1348 (36%), Positives = 702/1348 (51%), Gaps = 147/1348 (10%)

Query: 75   DQLANVGAPVSNERMVLQLVSGLSEAYDTVGSQIRHSDTLPPFYKARSMVV-LEETARAK 133
            DQLA +   + +E  +  ++ GLS+ Y  V  QI   D  P   +    ++  E   +A 
Sbjct: 135  DQLALLEEAIPHEDQIAYILGGLSDDYRRVIDQIEGRDISPSITELHEKLINFELKLQAM 194

Query: 134  RTAVTTANSTFLAAQEDNNSG--NSPSHRPNRNNN-SNNGSRGGRGGSGNNHGGRGRGRG 190
                +T  +   A+  +NN+G  N  S R N+NN    N ++  R    NN G +G+G  
Sbjct: 195  VPDSSTPVTANAASYNNNNNGRNNRSSSRGNQNNQWQQNQTQQSRS---NNRGSQGKGYQ 251

Query: 191  GGR---GGRSHNYQQGSWQQPQAPQHQWAYPPYQYPSWNAPWQPWATPPCPYPTTGNWQR 247
            G     G   H+ ++ S  QP                         + P  YPT G    
Sbjct: 252  GRCQICGVHGHSARRCSQFQPYGGSGGSQ-----------------SVPSGYPTNGY--- 291

Query: 248  PVVPNRQAGILGAKPQQAHVAATPSSYAPTDIQAAMHSLSLAPPDEQWYMDTGATSHMTA 307
                                  +PS  AP   +A   +++ APP   W +D+GAT H+T+
Sbjct: 292  ----------------------SPSPMAPWQPRA---NIATAPPFNPWVLDSGATHHLTS 326

Query: 308  NGGNLTSYSNISNN--ITVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSV 365
            +  NL+ +   +    +T+  G  +P+   G+AL+  P   L L ++L+ P + KNL+SV
Sbjct: 327  DLANLSMHQPYTGGEEVTIADGSGLPISHTGSALLPTPSRSLALKDILYVPNVSKNLISV 386

Query: 366  RKFTIDNEVSVEFDPFGFSVKDFQTGMPLMRCNSSGDLY--PLATRPYYPST-TPSTFAA 422
             +    N+VSVEF P  F VKD  TG  L++  +  +LY  P+  +     T +PS    
Sbjct: 387  YRLCNANQVSVEFFPAHFQVKDLNTGARLLQGRTRNELYEWPVNQKSITILTASPSPKTD 446

Query: 423  LSNEIWHNRLGHPGVSILNSLHRNNSI-LCNKFRNNFFCQSCQLGKQIKLPFYESLSSTL 481
            LS+  WH RLGHP + IL  +  +  + L N       C  C + K  KLPF+ +   + 
Sbjct: 447  LSS--WHQRLGHPALPILKDVVSHFHLPLSNTIPKQLPCSDCSINKSHKLPFFTNTIVSS 504

Query: 482  LPFDIVHSDIWTSPILSSGGHHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQ 541
             P + +++D+WTSP +S   + YY++ +D FT + W +PL  KSQ   +F+ F   ++ +
Sbjct: 505  QPLEYLYTDVWTSPCISVDNYKYYLVIVDHFTRYTWMYPLKQKSQVKDVFVAFKALVENR 564

Query: 542  FERDIKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTL 601
            F+  I+    DNG E+       F   +G+    S PHT   NG  ERK R I      L
Sbjct: 565  FQSRIRTLYSDNGGEFIG--LRPFLAAHGISHLTSPPHTPEHNGLAERKHRHIVETGLAL 622

Query: 602  LAHASLPPSFWHHALQMATYLLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCLCFPLI 661
            L HASLP +FW +A   A YL+N +PT+ L   +P   L+Q SP+Y  L+VFGCLC+P +
Sbjct: 623  LTHASLPKTFWTYAFATAVYLINRMPTEVLQGTSPYVKLFQMSPNYLKLRVFGCLCYPWL 682

Query: 662  PSTTRNKLQARSTPCVFLGYPSNHRGYKCFELSSRKIIISRHVIFDENTFPFSNSNIPES 721
                 NKL+ARST CVFLGY      Y C ++++ +I  SRHV F E++FPF++    E+
Sbjct: 683  RPYNTNKLEARSTMCVFLGYSLTQSAYLCLDIATNRIYTSRHVQFVESSFPFASPRTSET 742

Query: 722  SCYNFLDTSDTPFPYHLFQHTSNL--------------PTNEPNSHDQPPT--------- 758
                 +    T     L Q   ++              P + P+S   PP+         
Sbjct: 743  DSTQTMSQPTTTNVIPLLQRPPHIAPPTALPLCPIFHSPPHSPSSPASPPSEHVPLTAAS 802

Query: 759  ----------------------TTATPLTTPYSINTTTHQPTVAPSPQIQTSPQLTITPN 796
                                  T+ +P TTP + NT+    +  P+   Q+      T  
Sbjct: 803  SSSNAINDDNISSTGQVSVSGPTSQSPHTTPTNQNTSPLSKSPNPTNTNQSQNSTPPTSP 862

Query: 797  TASPHQ-IPPPNPLLANPPLSP------QMTTRAQHGIFKPRQLLNLHTSSSNQISPLPT 849
            T S HQ  P P+PL  NPPL P       M TRA++ I KP+   NL TS ++    +PT
Sbjct: 863  TTSVHQHSPTPSPLPQNPPLPPPPQNDHPMRTRAKNQITKPKTKFNLTTSLTSSKPTIPT 922

Query: 850  NPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYK 909
                AL+D NW+ AM +E +A + N TWDLV      ++I   WIF  K   DGS  RYK
Sbjct: 923  TVAQALKDPNWRNAMSEEINAQMKNHTWDLVSPEEAKHVISCKWIFTLKYNVDGSIARYK 982

Query: 910  ARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNET 969
            ARLV  G NQQ G+D  ETFSPV+K  TIRTVL +A+ ++W +HQ+D+ NAFL G LNE 
Sbjct: 983  ARLVARGFNQQYGIDYSETFSPVIKSTTIRTVLEVAVKRNWSIHQVDINNAFLQGTLNEE 1042

Query: 970  VYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIY 1029
            VY+ QPPGF D   P +VC L K+LYGLKQAPRAWYQ    ++   GF +S+ D SLFIY
Sbjct: 1043 VYVSQPPGFIDRDRPSHVCRLNKALYGLKQAPRAWYQELRRFLLQAGFVNSLADASLFIY 1102

Query: 1030 HSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSD-- 1087
            +  +   Y+L+YVDDII+ A  + L Q+  + L S F++KDLGPLSYFLGI  TR S   
Sbjct: 1103 NRHNTFMYVLVYVDDIII-AGENALVQAFNASLASRFSLKDLGPLSYFLGIEATRTSRGL 1161

Query: 1088 ------------------DTK---------AKLSGTSGNPYHDPSEYRSLAGALQYLTFT 1120
                              DTK          KLS  SG    D +EYR++ G+LQYL FT
Sbjct: 1162 HLMQRKYITDLLKKHNMLDTKPVSTPMSPTPKLSLLSGTALDDATEYRTVLGSLQYLAFT 1221

Query: 1121 RPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWG 1180
            RPDI++AV ++  FMH P  +H  A KRI+RY+ GT +HG+ L   T   +  ++DADWG
Sbjct: 1222 RPDIAFAVNRLSQFMHRPTNEHWQAAKRILRYLAGTKSHGIFLRSDTPLTIHAFSDADWG 1281

Query: 1181 GCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLEL 1240
               D   ST+ Y VY G + VSWS+K+Q +++RSS EAEYR VAN  SE  WL +LLLE+
Sbjct: 1282 CDLDAYLSTNAYIVYFGGSPVSWSSKKQRSVARSSTEAEYRAVANTASELRWLCSLLLEM 1341

Query: 1241 QCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQI 1300
                T   ++YCDN+ A YL  NP+ H R KH+ +D HFVR  +  G +RV HV ++ Q+
Sbjct: 1342 GISQTTVPVIYCDNIGATYLCANPVFHSRMKHVALDYHFVRGYIQSGALRVSHVSTKDQL 1401

Query: 1301 ADIFTKGLPLQLFDDFRDSLHIRQPPVS 1328
            AD  TK LP   F +    + +++ P S
Sbjct: 1402 ADALTKPLPRPRFTELNSKIGVQELPPS 1429


>gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein [Zea mays]
            gi|7444442|pir||T02087 gag/pol polyprotein - maize
            retrotransposon Hopscotch
          Length = 1439

 Score =  813 bits (2099), Expect = 0.0
 Identities = 503/1352 (37%), Positives = 708/1352 (52%), Gaps = 162/1352 (11%)

Query: 61   STASSYCQHIKSLSDQLANVGAPVSNERMVLQLVSGLSEAYDT-VGSQIRHSDTLPPFYK 119
            S+ + Y   ++  +D+L   G P+ +E  V  L++GL E ++  V + +  SD + P   
Sbjct: 141  SSVAEYFAKMRGFADELGAAGKPLDDEEFVSFLLTGLDEDFNPLVTAVVARSDPITPGDL 200

Query: 120  ARSMVVLEETARAKRTAVTTANSTFLAAQEDNNSGNSPSHRPNRNNNSNNGSRGGRGGSG 179
               ++  E      R  + T +S+ +      +S N+ S  P R  +   G  GGRG S 
Sbjct: 201  YTQLLSYEN-----RMHLQTGSSSLM-----QSSANARS--PGRGMSW--GRSGGRGFSR 246

Query: 180  NNHGGRGRGRGGGRGG-----RSHNYQQGSWQQPQAPQHQWAYPPYQYPSWNAPWQPWAT 234
                GRGRGRG  RGG     R +NY   +     +           + + N        
Sbjct: 247  ----GRGRGRGPSRGGFQSFGRGNNYSGATDADTSSRPRCQVCSRVGHTALN-------- 294

Query: 235  PPCPYPTTGNWQRPVVPNRQAGILGAKPQQAHVAATPSSYAPTDIQAAMHSLSLAPPDEQ 294
              C Y    N+    VP++++    A    ++V                           
Sbjct: 295  --CWYRFDENY----VPDQRSANSAAHQNGSNVP-------------------------- 322

Query: 295  WYMDTGATSHMTANGGNLTSYSNIS--NNITVGSGHNIPVIGCGNALVQNPQYPLTLNNV 352
            WY DTGAT H+T +   LT +   +  + I   +G  + +   GNA+V      L L +V
Sbjct: 323  WYTDTGATDHITGDLDRLTMHDKYTGTDQIIAANGTGMTISNIGNAIVPTSSRSLHLRSV 382

Query: 353  LHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPLMRCNSSGDLYPLATRPYY 412
            LH P   KNL+SV + T DN+V +EF    F +KD QT   L+       LYPL   P  
Sbjct: 383  LHVPSTHKNLISVHRLTNDNDVFIEFHSSHFLIKDRQTKAVLLHGKCRDGLYPLPPHPDL 442

Query: 413  PSTTPSTFAALSNEIWHNRLGHPGVSILNSLHRNNSILC-NKFRNNFFCQSCQLGKQIKL 471
                  +   +  E WH RLGHP   I++ +  NN++ C +       C +C   K  +L
Sbjct: 443  RLKHNFSSTRVPLEHWHKRLGHPSRDIVHRVISNNNLPCLSNNSTTSVCDACLQAKAHQL 502

Query: 472  PFYESLSSTLLPFDIVHSDIWTSPILSSGGHHYYILFLDDFTDFLWTFPLTNKSQAHSIF 531
            P+  S+S +  P  ++ SD++   I S G + YY+ F+DD++ F W + L +KS  +  F
Sbjct: 503  PYTISMSQSSAPLMLIFSDVFGPAIDSFGRYKYYVSFIDDYSKFTWIYLLRHKSDVYKSF 562

Query: 532  LQFCNHIKTQFERDIKCFQCDNGKEYD--NSHFHQFCKQNGMIFRFSCPHTSPQNGKVER 589
             +F + ++  F R I  FQ D G EY+  N+HF    K  G+  + SCPHT  QNG  ER
Sbjct: 563  CEFQHLVERMFGRKIIAFQSDWGGEYEKLNAHF----KTIGIHHQVSCPHTHQQNGAAER 618

Query: 590  KIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNILPTKKLALQTPTTILYQKSPSYSH 649
            K R I  +   LLA +S+P  +W HA   A YL+N  P+K +A  TP   L   +P YS 
Sbjct: 619  KHRHIVEVGLALLAQSSMPLKYWDHAFLAAVYLINRTPSKTIAHDTPLHKLTGATPDYSS 678

Query: 650  LKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYKCFELSSRKIIISRHVIFDEN 709
            L++FGC C+P +    ++KLQ RST CVFLGY + H+G+KC ++S+ +I ISR V+FDE+
Sbjct: 679  LRIFGCACWPNLRPYNQHKLQFRSTRCVFLGYSNMHKGFKCLDISTGRIYISRDVVFDEH 738

Query: 710  TFPFSNSN-------------IPESSCYNFLDT--------SDTPFPY---HLFQHTSNL 745
             FPF++ N             +P  SC N + T        S +P P+   H  Q  S +
Sbjct: 739  VFPFASLNKNAGVKYTSEVLLLPHDSCGNNMLTDHANNLPGSSSPLPFLAQHFLQGNSEV 798

Query: 746  PTNE----------PNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQ----------- 784
            PT+           PN    PP    + L    S   T       P+P+           
Sbjct: 799  PTSNNTAMALPASGPNEVSVPPALVPSSLVPAASPAPTGVSANAEPAPEADSLSSGPPVA 858

Query: 785  ------IQTSPQLTITPNTASPHQIPPPNPLLANPPLSPQMTTRAQHGIFKPRQLLNLHT 838
                  +  +  L   P ++  HQ P   PL A  P      TR QHGI KP+Q  +   
Sbjct: 859  TESVTGVPDADPLLQAPGSSVAHQTPDSAPLSAAAP-----RTRLQHGISKPKQFTDGTV 913

Query: 839  SSSNQISPL--PTNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFR 896
               N  + +  P++   AL D  W+ AM+ E+ AL  N TW LVP     N+I   W+F+
Sbjct: 914  RYGNAAARITEPSSVSEALADPQWRAAMEAEFQALQKNNTWTLVPPDRTRNLIDCKWVFK 973

Query: 897  HKKKADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLD 956
             K  ADGS +R KARLV  G  QQ G+D  +TFSPVVK +TIR VLS+A+S+ W L QLD
Sbjct: 974  VKYNADGSIDRLKARLVAKGFKQQYGIDYDDTFSPVVKHSTIRLVLSLAVSQKWSLRQLD 1033

Query: 957  VKNAFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLG 1016
            V+NAFLHG L ETVYM QPPGF D  HP+Y C L+KSLYGLKQ PRAWY R ++ + +LG
Sbjct: 1034 VQNAFLHGILEETVYMKQPPGFADTTHPNYHCHLQKSLYGLKQRPRAWYSRLSEKLQSLG 1093

Query: 1017 FSHSVCDHSLFIYHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSY 1076
            F  S  D SLFIY++     YIL+YVDDII+T SS     ++++KL  +FA+KDLG L Y
Sbjct: 1094 FVPSKADVSLFIYNAHSTAIYILVYVDDIIITGSSPHAIDNVLAKLKDDFAIKDLGDLHY 1153

Query: 1077 FLGISVTRHSDD-----------------------------TKAKLSGTSGN--PYHDPS 1105
            FLGI V R  D                              T  KLS ++G      + +
Sbjct: 1154 FLGIEVHRKGDGLLLCQEKYARDLLKRVGMECCKPVHTPVATSEKLSASAGTLLSPEETT 1213

Query: 1106 EYRSLAGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYP 1165
            +YRS+ GALQYLT TRPD+SYA+ +VC F+H P   H TA+KRI+R I+ T   GL + P
Sbjct: 1214 KYRSVVGALQYLTLTRPDLSYAINRVCQFLHAPTDLHWTAVKRILRNIQHTIGLGLTIRP 1273

Query: 1166 STVDKLTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVAN 1225
            S    L+ ++DADW GCPD RKST GY ++LG NL+SW++K+Q T+SRSS EAEY+ +AN
Sbjct: 1274 SLSLMLSAFSDADWAGCPDDRKSTGGYALFLGPNLISWNSKKQSTVSRSSTEAEYKAMAN 1333

Query: 1226 VVSESCWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVA 1285
              +E  WL++LL EL   +T    ++CDN+ A YLS  PI + RTKHIE+D HFVR++V 
Sbjct: 1334 ATAEVIWLQSLLHELGIRLTGIPRLWCDNLGATYLSSKPIFNARTKHIEVDFHFVRDRVL 1393

Query: 1286 RGQVRVMHVPSRYQIADIFTKGLPLQLFDDFR 1317
              ++ +  + +  Q+AD FTK L +   ++FR
Sbjct: 1394 SKKLDIRLISTNDQVADGFTKALTIGRLNEFR 1425


>gb|AAF99727.1| F17L21.7 [Arabidopsis thaliana]
          Length = 1534

 Score =  800 bits (2067), Expect = 0.0
 Identities = 498/1338 (37%), Positives = 694/1338 (51%), Gaps = 142/1338 (10%)

Query: 75   DQLANVGAPVSNERMVLQLVSGLSEAYDTVGSQIRHSDTLPPFYKARSMVVLEETARAKR 134
            DQLA +G P+ +E  +  +V GLS+ Y  V  QI+  +  P      S+  + E      
Sbjct: 255  DQLALLGKPMESEEQMEVIVEGLSDDYKQVIDQIQGREVPP------SLTEIHEKLLNHE 308

Query: 135  TAVTTANSTFLAAQEDNNSGNSPSHRPNRNNNSNNGSRGGRGGSGNNHGGRGRGRGGGRG 194
              +  A S+         S N+ S+RP  NN  NN          NN+ G+ R     RG
Sbjct: 309  VKLQAAASSLPI------SANAASYRPPANNKHNNS---------NNYRGQNRNNNN-RG 352

Query: 195  GRSHNYQQGSWQQPQAPQHQWAYPPYQYPSWNAPWQPWATPPCPY-PTTGNWQRPVVPNR 253
              S  YQQ    QP +  +Q                      C      G+  R     +
Sbjct: 353  ANS--YQQPRNDQPSSRGYQGK--------------------CQICGVFGHSARRCSQLQ 390

Query: 254  QAGILGAKPQQAHVAATPSSYAPTDIQAAMHSLSLAPPDEQWYMDTGATSHMTANGGNLT 313
             +G         +    P++  P   +A M ++S  P    W +D+GAT H+T +  NL 
Sbjct: 391  MSGAYSTPSPSQY----PNATVPWQPRANMAAMSYNP----WLLDSGATHHLTTDLNNLA 442

Query: 314  SYS--NISNNITVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTID 371
             +   N    +T+  G  +P+   G++ +      L LNN+L+ P L KNL+SV K    
Sbjct: 443  LHQPYNGGEEVTIADGSTLPITHTGSSTLSTQSRSLALNNILYVPNLHKNLISVYKLCNA 502

Query: 372  NEVSVEFDPFGFSVKDFQTGMPLMRCNSSGDLYPLATRPYYPSTTP-STFAALSNEI--- 427
            N+VSVEF P  F VKD  TG  L++  +  +LY        PS TP S FA+ + +    
Sbjct: 503  NKVSVEFFPAHFQVKDLSTGARLLQGRTKDELYEWPV----PSNTPISLFASPTPKTTLP 558

Query: 428  -WHNRLGHPGVSILNSLHRNNSI-LCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFD 485
             WH+RLGHP   +L SL    S+ + N  + +F C  C + K  KLPFY +   +  P +
Sbjct: 559  SWHSRLGHPSPPVLKSLVSQFSLPVSNSSQKHFPCSHCLINKSHKLPFYSNTIISYTPLE 618

Query: 486  IVHSDIWTSPILSSGGHHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERD 545
             V+SD+WTSP+ S     YY++ +D +T + W +PL  KSQ    F+ F   ++ +F+  
Sbjct: 619  YVYSDVWTSPVTSVDNFKYYLILVDHYTRYTWLYPLKQKSQVRETFVAFKALVENRFQTK 678

Query: 546  IKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHA 605
            I+    DNG E+      QF   +G+    S PHT   NG  ERK R I     TLL  A
Sbjct: 679  IRTLYSDNGGEF--IALRQFLLTHGISHLTSLPHTPEHNGIAERKHRHILETGLTLLTQA 736

Query: 606  SLPPSFWHHALQMATYLLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCLCFPLIPSTT 665
            S+P S+W +A   A YL+N LP+  L  ++P + L++ SP+Y  L+VFGC CFP +   T
Sbjct: 737  SIPTSYWTYAFGTAVYLINRLPSSVLNNESPYSKLFKTSPNYLKLRVFGCSCFPWLRPYT 796

Query: 666  RNKLQARSTPCVFLGYPSNHRGYKCFELSSRKIIISRHVIFDENTFPF--------SNSN 717
             +KL+ RS PCVFLGY      Y C + SS ++  SRHV F E+ FPF        SNS+
Sbjct: 797  NHKLERRSQPCVFLGYSLTQSAYLCLDRSSGRVYTSRHVQFVEDQFPFSISDTHSVSNSS 856

Query: 718  IPESSCYNFLDTSDTPFPYH---LFQHTSNLPTNEPNSHDQPPTTTATPLTT-------- 766
              E+S       S  P       L Q  S+LP    +SH +P   T++  ++        
Sbjct: 857  PEEASPSCHQPPSRIPIQSSSPPLVQAPSSLPPLSSDSHRRPNAETSSSSSSTNNDVVVS 916

Query: 767  --------------PYSINTTTHQPTVAPSPQIQT----SPQLTITPNTASPHQIPP--- 805
                          P S ++   Q    PS  IQT    +P  + TP  +SP   P    
Sbjct: 917  KDNTQVDNRNNFIGPTSSSSAQSQNNSNPSSSIQTQNEPNPSPSPTPQNSSPESSPSSST 976

Query: 806  ------PNPLLANPPLSPQMTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPINALQDHN 859
                  PNP    P  +  M TRA++ I KP+  L+L   +      +P     AL+D  
Sbjct: 977  SATSTVPNPPPPPPTNNHPMRTRAKNHITKPKTKLSLLAKTVQTRPQIPNTVNQALRDEK 1036

Query: 860  WKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQ 919
            W+ AM +E +A I N T++LVP   N N+I + WIF  K   +G+ +RYKARLV  G  Q
Sbjct: 1037 WRNAMGEEINAQIRNNTFELVPPKPNQNVISTKWIFTLKYLPNGTLDRYKARLVARGFRQ 1096

Query: 920  QTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFR 979
            Q G+   ETFSPVVK  TIR VL +A+S+SW + QLDV NAFL G L + VY+ QPPGF 
Sbjct: 1097 QYGLHYSETFSPVVKSLTIRLVLQLAVSRSWTIKQLDVNNAFLQGTLTDEVYVTQPPGFI 1156

Query: 980  DPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYIL 1039
            DP  P +VC LKK+LYGLKQAPRAWYQ   ++V +LGF++S+ D S+F+Y +     Y L
Sbjct: 1157 DPDRPHHVCRLKKALYGLKQAPRAWYQELRNFVCSLGFTNSLADTSVFVYINDIQIVYCL 1216

Query: 1040 LYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSDD----------- 1088
            +YVDDII+T SSD L  + ++ L+  F++KD   L YFLGI  TR S             
Sbjct: 1217 VYVDDIIVTGSSDALVMAFITALSRRFSLKDPTDLVYFLGIEATRTSQGLHLMQHKYVYD 1276

Query: 1089 ------------------TKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISYAVQQ 1130
                              T  KLS  SG    +P EYR++ G+LQYL FTRPDI+YAV +
Sbjct: 1277 LLSRMKMLDAKPVSTPMATHPKLSLYSGIALDEPGEYRTVIGSLQYLAFTRPDIAYAVNR 1336

Query: 1131 VCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTS 1190
            +  FMH P   H  A KR++RY+ GT+THG+ L  ++   L  ++DADW G  D   ST+
Sbjct: 1337 LSQFMHRPTDIHWQAAKRVLRYLAGTATHGILLRSNSPLSLHAFSDADWAGDNDDFVSTN 1396

Query: 1191 GYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLV 1250
             Y VYLG   ++WS+K+Q  ++RSS EAEYR VAN  SE  W+ +LL EL   + K  ++
Sbjct: 1397 AYIVYLGSTPIAWSSKKQKGVARSSTEAEYRAVANTTSEIRWVCSLLTELGITLPKMPVI 1456

Query: 1251 YCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPL 1310
            YCDNV A YLS NP+ H R KH+ +D HF+R+ V+ G +RV H+ +  Q+AD  TK LP 
Sbjct: 1457 YCDNVGATYLSANPVFHSRMKHLALDYHFIRDNVSAGALRVSHISTHDQLADALTKPLPR 1516

Query: 1311 QLFDDFRDSLHIRQPPVS 1328
            Q F  F   + + + P S
Sbjct: 1517 QHFLQFSSKIGVSKLPPS 1534


>gb|AAG46116.1| putative copia-like retrotransposon polyprotein [Oryza sativa]
          Length = 1302

 Score =  798 bits (2062), Expect = 0.0
 Identities = 412/787 (52%), Positives = 529/787 (66%), Gaps = 39/787 (4%)

Query: 540  TQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINNIIR 599
            ++F   +   Q DNGKEYD+         +G + R SCP++S QNGK ER +RTIN+ +R
Sbjct: 513  SKFGLPVLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDYVR 572

Query: 600  TLLAHASLPPSFWHHALQMATYLLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCLCFP 659
            T+L H++ P SFW  ALQ AT+L+N  P +     TP  +L    P+Y HL+VFGCLC+P
Sbjct: 573  TMLVHSAAPLSFWAEALQTATHLINRRPCRATGSLTPYQLLLGAPPTYDHLRVFGCLCYP 632

Query: 660  LIPSTTRNKLQARSTPCVFLGYPSNHRGYKCFELSSRKIIISRHVIFDENTFPFSNSNIP 719
               +T  +KL  RS  CVF+GYP++HRGY+C+++ SR++  SRHV F E+ FPF ++  P
Sbjct: 633  NTIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPFRDAPSP 692

Query: 720  ESSCYNFLDTSDTPFPYHLFQHTSNLPTNEPNSHDQPPTTTATPLTTPYSINTTTHQPTV 779
              S          P P H       LP   P  H   P  TA             H    
Sbjct: 693  RPSA--------PPPPDHGDDTIVLLPA--PAQHVVTPVGTAP-----------AHDAAS 731

Query: 780  APSPQIQTSPQLTITPNTASP-HQI-PPPNPLLANP----PLSPQMTTRAQHGIFKPRQL 833
             PSP        + TP++A+P H + PPP+P  ++P    P    MTTRA+ GI KP   
Sbjct: 732  PPSPA-------SSTPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKPNPR 784

Query: 834  LNLHTSSSNQISPLPTNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLW 893
              +  +S+  +SP P++   AL+D NW+ AM+ E+DAL+ N+TW LVPRP  A II   W
Sbjct: 785  YAMTATST--LSPTPSSVRAALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIITGKW 842

Query: 894  IFRHKKKADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLH 953
            +F+ K  ADGS ++YKAR V  G NQ+ GVD GETFSPVVKPATIRTVL++  SK W  H
Sbjct: 843  VFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQWPAH 902

Query: 954  QLDVKNAFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVA 1013
            QLDV NAFLHG+L E V   QP GF D   P  VCLL +SLYGL+QAPRAW++RF D+  
Sbjct: 903  QLDVSNAFLHGHLQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFADHAT 962

Query: 1014 TLGFSHSVCDHSLFIYHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGP 1073
            +LGF  S  D SLF+   G DTAY+LLYVDD+IL+ASS +L Q I+ +L +EF +KD+GP
Sbjct: 963  SLGFVQSRADPSLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKDMGP 1022

Query: 1074 LSYFLGISVTRHSDDTKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISYAVQQVCL 1133
            L YFLGI V R +D      S  + +   D S YRS+AGALQYLT TRPDI+YAVQQVCL
Sbjct: 1023 LKYFLGIEVQRTADGFVLSQSKYATD---DSSWYRSIAGALQYLTLTRPDIAYAVQQVCL 1079

Query: 1134 FMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYC 1193
             MH P+  H+T LKRI+RYIKGT+  GLHL  ST   LT ++DADW GCPDTR+STSG+C
Sbjct: 1080 HMHAPREAHVTLLKRILRYIKGTAAFGLHLRASTSPTLTAFSDADWAGCPDTRRSTSGFC 1139

Query: 1194 VYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCD 1253
            ++LGD+L+SWS+KRQ T+SRSSAEAEYRGVAN V+E  WLR LL EL C V +AT+ YCD
Sbjct: 1140 IFLGDSLISWSSKRQTTVSRSSAEAEYRGVANAVAECTWLRQLLGELHCRVPQATIAYCD 1199

Query: 1254 NVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLF 1313
            N+S+VY+S NP+ H+RTKHIE+DIHFVREKVA G++RV+ +PS +Q AD+FTKGLP  +F
Sbjct: 1200 NISSVYMSKNPVHHKRTKHIELDIHFVREKVALGELRVLPIPSAHQFADVFTKGLPSSMF 1259

Query: 1314 DDFRDSL 1320
            ++FR SL
Sbjct: 1260 NEFRASL 1266



 Score =  189 bits (479), Expect = 7e-46
 Identities = 141/412 (34%), Positives = 204/412 (49%), Gaps = 24/412 (5%)

Query: 36  IFFDNKNSRALYLEQEFSKISMEQFSTASSYCQHIKSLSDQLANVGAPVSNERMVLQLVS 95
           +F DN   R++YLE EF  I+     T + Y   +K L+D L ++  PVS    VL L+ 
Sbjct: 120 VFRDNAVQRSVYLETEFRSINQGDM-TITQYTAKLKQLADGLRDINMPVSEPSQVLNLLR 178

Query: 96  GLSEAYDTVGSQIRHSDTLPPFYKARSMVVLEETARAKRTAVTTANSTFLAAQEDNNSGN 155
           GL+  + ++ + I   +    F  ARS ++L E  + +  A   A     A    ++  +
Sbjct: 179 GLNTKFRSLRASIADRNPPHTFMTARSYLLLAEL-QMQHDAKAEAGEALYAGTGSSSGTS 237

Query: 156 SPSHRPNRNNNSNNGSRGGRGGSGNNHGGRGRGRGGGRGGRSHNYQQGSWQQPQAPQHQW 215
             + +P        G R GRGG G   GG     GGG G   H+ Q     +P AP   W
Sbjct: 238 DTTGQPRPKGR---GKRRGRGG-GAPPGGAPSTPGGGAGA-GHDGQP----RPPAP---W 285

Query: 216 AYPPYQ--YPSWNAPWQ-PWATPPCPYPTTGNWQRPVVPNRQAGILGAKPQQAHVAATPS 272
            Y P+     +W  P++ P A    P P     Q     +    +  A P      A  +
Sbjct: 286 GYNPWTGFVQAWPFPFRAPGAGVLGPRPPFQAQQAMTAQHLLPALPPASPGVHSTGAWDN 345

Query: 273 SYAPTDIQAAMHSLSLAPPDEQWYMDTGATSHMTANGGNLTSYSNI--SNNITVGSGHNI 330
           S   + +Q+A  + +  P    W++DTGA++HM++  G L     +  S+ ITVG+G  +
Sbjct: 346 SALYSALQSAGVATTTPPSAADWFLDTGASAHMSSTPGILAHPRPLPFSSCITVGNGAKL 405

Query: 331 PVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQT 390
           PV    +  +      L L+NVL +P LIKNL+SV+K T DN VS+EFDP GFS+KD QT
Sbjct: 406 PVTHTASTHIPTSSTDLHLHNVLVSPPLIKNLISVKKLTRDNNVSIEFDPTGFSIKDLQT 465

Query: 391 GMPLMRCNSSGDLYPL-ATRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILN 441
            +  +RC+S GDLYPL    P+  S T S     S E WH RLGHPG + L+
Sbjct: 466 QVVKLRCDSPGDLYPLRLPSPHALSATSSP----SVEHWHLRLGHPGSASLS 513


>gb|AAC02664.1| polyprotein [Arabidopsis thaliana]
          Length = 1451

 Score =  785 bits (2026), Expect = 0.0
 Identities = 487/1358 (35%), Positives = 683/1358 (49%), Gaps = 151/1358 (11%)

Query: 65   SYCQHIKSLSDQLANVGAPVSNERMVLQLVSGLSEAYDTVGSQIRHSDTLPPFYKARSMV 124
            +Y Q   +  D LA +G     E  +  ++ GL E Y TV  QI   +  P   +    +
Sbjct: 151  TYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIEGRENPPALTEVLEKL 210

Query: 125  VLEETARAKRTAVTTANSTFLAAQEDNNSGNSPSHRPNRNNNSNNGSRGGRGGSGNNHGG 184
            +  E   A +   T+   T           N+ ++R N NNN+N+ S G     GN    
Sbjct: 211  INHEVKLAAKAEATSVPVT----------ANAVNYRGNNNNNNNSRSNGRNNSRGNT--- 257

Query: 185  RGRGRGGGRGGRSHNYQQGSWQQPQAPQHQWAYPPYQYPSWNAPWQPWATPPCPYPTTGN 244
                               SWQ  Q+  ++  Y P  Y              C   +   
Sbjct: 258  -------------------SWQNSQSTSNRQQYTPRPYQG-----------KCQICSVHG 287

Query: 245  WQRPVVPNRQAGILGAKPQQAHVAATPSSYAPTDIQAAMHSLSLAPPDE-QWYMDTGATS 303
                  P  Q         Q+  A    SYAP   +A M  +S  P +   W +D+GAT 
Sbjct: 288  HSARRCPQLQQHAGSYASNQSSSA----SYAPWQPRANM--VSATPYNSGNWLLDSGATH 341

Query: 304  HMTANGGNLTSYS--NISNNITVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKN 361
            H+T+N  NL  +   N    +T+  G  +P+   G+AL+  P   L L +VL+ P + KN
Sbjct: 342  HLTSNLNNLALHQPYNGDEEVTIADGSGLPISHSGSALLPTPTRSLALKDVLYVPDIQKN 401

Query: 362  LVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPLMRCNSSGDLYPLATRPYYPSTTPSTFA 421
            L+SV +    N VSVEF P  F VKD  TG  L++  +  +LY     P   S   S FA
Sbjct: 402  LISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQGKTKNELYEW---PVNSSIATSMFA 458

Query: 422  ALSNEI----WHNRLGHPGVSILNSLHRNNSI-LCNKFRNNFFCQSCQLGKQIKLPFYES 476
            + + +     WH RLGHP + IL +L    S+ + +  +N   C  C + K  KLPFY +
Sbjct: 459  SPTPKTDLPSWHARLGHPSLPILKALISKFSLPISHSLQNQLLCSDCSINKSHKLPFYSN 518

Query: 477  LSSTLLPFDIVHSDIWTSPILSSGGHHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCN 536
              ++  P + +++D+WTSPI S   + YY++ +D +T + W +PL  KSQ   +F+ F  
Sbjct: 519  TIASSHPLEYLYTDVWTSPITSIDNYKYYLVIVDHYTRYTWLYPLRKKSQVREMFITFTA 578

Query: 537  HIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINN 596
             ++ +F+  I     DNG E+       F   +G+    + PHT   NG  ERK R I  
Sbjct: 579  LVENKFKFKIGTLYSDNGGEF--IAMRSFLASHGISHMTTPPHTPELNGISERKHRHIVE 636

Query: 597  IIRTLLAHASLPPSFWHHALQMATYLLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCL 656
               TLL+ AS+P  +W +A   A YL+N + T  L  ++P   L+ + P+Y  L++FGCL
Sbjct: 637  TGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLGNESPYVKLFGQPPNYLKLRIFGCL 696

Query: 657  CFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYKCFELSSRKIIISRHVIFDENTFPFSNS 716
            CFP +   T +KL  RS PCV LGY  +   Y C + ++ ++  SRHV F E++FPFS +
Sbjct: 697  CFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFAESSFPFSTT 756

Query: 717  N---IPESSCYNFLDTSDTPFPYHLFQHTSNLPTNEPN-----------SHDQPPT---- 758
            +    P S      DT     P  L +  +  P + P+            +  PP     
Sbjct: 757  SPSVTPPSDPPLSQDTRPVSVPL-LARPLTTAPPSSPSCSAPHRSPSQSENLSPPAPLQP 815

Query: 759  ----TTATPLTTPY-----------SINTTTHQPTVAPSPQ--IQTSPQLT--------- 792
                +  +P+T+P                T   P ++P PQ     SPQ T         
Sbjct: 816  SLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLSPQPQRPQPQSPQSTSPHSSSPQP 875

Query: 793  -----------ITPN-TASPHQIPPPNPLLANPPLSPQMTTRAQHGIFKPR-QLLNLHTS 839
                       +TP  T+SP   PPPNP    PP+   M TR+++ I KP  +  NL T 
Sbjct: 876  NSPNPQHSPRSLTPTLTSSPSPSPPPNP--NPPPIQHTMRTRSKNNIVKPNPKFANLATK 933

Query: 840  SSNQISPLPTNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKK 899
             +     +P   + AL D NW+ AM DE +A   N T+DLVP   N N++   W+F  K 
Sbjct: 934  PTPLKPIIPKTVVEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVVGCKWVFTLKY 993

Query: 900  KADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKN 959
             ++G  +RYKARLV  G +QQ G D  ETFSPV+K  T+R+VL IA+SK W + Q+DV N
Sbjct: 994  LSNGVLDRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSVLHIAVSKGWSIRQIDVNN 1053

Query: 960  AFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSH 1019
            AFL G L++ VY+ QPPGF D  +  +VC L K+LYGLKQAPRAWYQ    Y+ T GF +
Sbjct: 1054 AFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLYKALYGLKQAPRAWYQELRSYLLTQGFVN 1113

Query: 1020 SVCDHSLFIYHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLG 1079
            SV D SLF         Y+L+YVDD+++T S   +    ++ L + F++KDLG +SYFLG
Sbjct: 1114 SVADTSLFTLRHERTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLG 1173

Query: 1080 ISVTRHSD-----------------------------DTKAKLSGTSGNPYHDPSEYRSL 1110
            I  TR S                                  KLS TSG P   PSEYR++
Sbjct: 1174 IEATRTSKGLHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAV 1233

Query: 1111 AGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDK 1170
             G+LQYL FTRPDI+YAV ++  +MH P   H  A KRI+RY+ GT +HG+ +   T   
Sbjct: 1234 LGSLQYLLFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLT 1293

Query: 1171 LTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSES 1230
            L  Y+DADW G  D   ST+ Y +YLG N +SWS+K+Q  ++RSS EAEYR VAN  SE 
Sbjct: 1294 LHAYSDADWAGDIDNYNSTNAYILYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEI 1353

Query: 1231 CWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVR 1290
             W+ +LL EL   ++   +VYCDNV A YLS NP+ H R KHI +D HFVRE V  G +R
Sbjct: 1354 RWVCSLLTELGITLSSPPVVYCDNVGATYLSANPVFHSRMKHIALDFHFVRESVQAGALR 1413

Query: 1291 VMHVPSRYQIADIFTKGLPLQLFDDFRDSLHIRQPPVS 1328
            V HV ++ Q+AD  TK LP Q F      + + + P S
Sbjct: 1414 VTHVSTKDQLADALTKPLPRQPFTTLISKIGVAKAPPS 1451


>gb|AAC02666.1| polyprotein [Arabidopsis thaliana]
          Length = 1451

 Score =  779 bits (2012), Expect = 0.0
 Identities = 485/1358 (35%), Positives = 682/1358 (49%), Gaps = 151/1358 (11%)

Query: 65   SYCQHIKSLSDQLANVGAPVSNERMVLQLVSGLSEAYDTVGSQIRHSDTLPPFYKARSMV 124
            +Y Q   +  D LA +G     E  +  ++ GL E Y TV  QI   +  P   +    +
Sbjct: 151  TYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIEGRENPPALTEVLEKL 210

Query: 125  VLEETARAKRTAVTTANSTFLAAQEDNNSGNSPSHRPNRNNNSNNGSRGGRGGSGNNHGG 184
            +  E   A +   T+   T           N+ ++R N NNN+N+ S G     GN    
Sbjct: 211  INHEVKLAAKAEATSVPVT----------ANAVNYRGNNNNNNNSRSNGRNNSRGNT--- 257

Query: 185  RGRGRGGGRGGRSHNYQQGSWQQPQAPQHQWAYPPYQYPSWNAPWQPWATPPCPYPTTGN 244
                               SWQ  Q+  ++  Y P  Y              C   +   
Sbjct: 258  -------------------SWQNSQSTSNRQQYTPRPYQG-----------KCQICSVHG 287

Query: 245  WQRPVVPNRQAGILGAKPQQAHVAATPSSYAPTDIQAAMHSLSLAPPDE-QWYMDTGATS 303
                  P  Q         Q+  A    SYAP   +A M  +S  P +   W +D+GAT 
Sbjct: 288  HSARRCPQLQQHAGSYASNQSSSA----SYAPWQPRANM--VSATPYNSGNWLLDSGATH 341

Query: 304  HMTANGGNLTSYS--NISNNITVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKN 361
            H+T++  NL  +   N    +T+  G  +P+   G+AL+  P   L L +VL+ P + KN
Sbjct: 342  HLTSDLNNLALHQPYNGDEEVTIADGSGLPISHSGSALLPTPTRSLALKDVLYVPDIQKN 401

Query: 362  LVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPLMRCNSSGDLYPLATRPYYPSTTPSTFA 421
            L+SV +    N VSVEF P  F VKD  TG  L++  +  +LY     P   S   S FA
Sbjct: 402  LISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQGKTKNELYEW---PVNSSIATSMFA 458

Query: 422  ALSNEI----WHNRLGHPGVSILNSLHRNNSI-LCNKFRNNFFCQSCQLGKQIKLPFYES 476
            + + +     WH RLGHP + IL +L    S+ + +  +N   C  C + K  KLPFY +
Sbjct: 459  SPTPKTDLPSWHARLGHPSLPILKALISKFSLPISHSLQNQLLCSDCSINKSHKLPFYSN 518

Query: 477  LSSTLLPFDIVHSDIWTSPILSSGGHHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCN 536
              ++  P + +++D+WTSPI S   + YY++ +D +T + W +PL  KSQ   +F+ F  
Sbjct: 519  TIASSHPLEYLYTDVWTSPITSIDNYKYYLVIVDHYTRYTWLYPLRKKSQVREMFITFTA 578

Query: 537  HIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINN 596
             ++ +F+  I     DNG E+       F   +G+    + PHT   NG  ERK R I  
Sbjct: 579  LVENKFKFKIGTLYSDNGGEF--IAMRSFLASHGISHMTTPPHTPELNGISERKHRHIVE 636

Query: 597  IIRTLLAHASLPPSFWHHALQMATYLLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCL 656
               TLL+ AS+P  +W +A   A YL+N + T  L  ++P   L+ + P+Y  L++FGCL
Sbjct: 637  TGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLGNESPYVKLFGQPPNYLKLRIFGCL 696

Query: 657  CFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYKCFELSSRKIIISRHVIFDENTFPFSNS 716
            CFP +   T +KL  RS PCV LGY  +   Y C + ++ ++  SRHV F E++FPFS +
Sbjct: 697  CFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFAESSFPFSTT 756

Query: 717  N---IPESSCYNFLDTSDTPFPYHLFQHTSNLPTNEPN-----------SHDQPPT---- 758
            +    P S      DT     P  L +  +  P + P+            +  PP     
Sbjct: 757  SPSVTPPSDPPLSQDTRPVSVPL-LARPLTTAPPSSPSCSAPHRSPSQSENLSPPAPLQP 815

Query: 759  ----TTATPLTTPY-----------SINTTTHQPTVAPSPQ--IQTSPQLT--------- 792
                +  +P+T+P                T   P ++P PQ     SPQ T         
Sbjct: 816  SLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLSPQPQRPQPQSPQSTSPHSSSPQP 875

Query: 793  -----------ITPN-TASPHQIPPPNPLLANPPLSPQMTTRAQHGIFKPR-QLLNLHTS 839
                       +TP  T+SP   PPPNP    PP+   M TR+++ I KP  +  NL T 
Sbjct: 876  NSPNPQHSPRSLTPTLTSSPSPSPPPNP--NPPPIQHTMRTRSKNNIVKPNPKFANLATK 933

Query: 840  SSNQISPLPTNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKK 899
             +     +P   + AL D NW+ AM DE +A   N T+DLVP   N N++   W+F  K 
Sbjct: 934  PTPLKPIIPKTVVEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVVGCKWVFTLKY 993

Query: 900  KADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKN 959
             ++G  +RYKARLV  G +QQ G D  ETFSPV+K  T+R+VL IA+SK W + Q+DV N
Sbjct: 994  LSNGVLDRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSVLHIAVSKGWSIRQIDVNN 1053

Query: 960  AFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSH 1019
            AFL G L++ VY+ QPPGF D  +  +VC L K+LYGLKQAPRAWYQ    Y+ T GF +
Sbjct: 1054 AFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLYKALYGLKQAPRAWYQELRSYLLTQGFVN 1113

Query: 1020 SVCDHSLFIYHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLG 1079
            SV D SLF         Y+L+YVDD+++T S   +    ++ L + F++KDLG +SYFLG
Sbjct: 1114 SVADTSLFTLRHERTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLG 1173

Query: 1080 ISVTRHSD-----------------------------DTKAKLSGTSGNPYHDPSEYRSL 1110
            I  TR S                                  KLS TSG P   PSEYR++
Sbjct: 1174 IEATRTSKGLHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAV 1233

Query: 1111 AGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDK 1170
             G+LQYL FTRPDI+YAV ++  +MH P   H  A KRI+RY+ GT +HG+ +   T   
Sbjct: 1234 LGSLQYLLFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLT 1293

Query: 1171 LTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSES 1230
            L  Y+DADW G  D   ST+ Y +YLG N +SWS+K+Q  ++RSS EAEYR VAN  SE 
Sbjct: 1294 LHAYSDADWAGDIDNYNSTNAYILYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEI 1353

Query: 1231 CWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVR 1290
             W+ +LL EL   ++   +VYCDNV A YLS NP+   R KHI +D HFVRE V  G +R
Sbjct: 1354 RWVCSLLTELGITLSSPPVVYCDNVGATYLSANPVFDSRMKHIALDFHFVRESVQAGALR 1413

Query: 1291 VMHVPSRYQIADIFTKGLPLQLFDDFRDSLHIRQPPVS 1328
            V HV ++ Q+AD  TK LP Q F      + + + P S
Sbjct: 1414 VTHVSTKDQLADALTKPLPRQPFTTLISKIGVAKAPPS 1451


>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  779 bits (2011), Expect = 0.0
 Identities = 483/1344 (35%), Positives = 683/1344 (49%), Gaps = 126/1344 (9%)

Query: 40   NKNSRALYLEQEFS-----KISMEQFSTASSYCQHIKSLSDQLANVGAPVSNERMVLQLV 94
            NK+S A    +EFS     ++  ++  + S YC+  K + D L+++G PV     +   +
Sbjct: 117  NKSSIA----REFSLRRNLQLLTKKDKSLSVYCRDFKIICDSLSSIGKPVEESMKIFGFL 172

Query: 95   SGLSEAYDTVGSQIRHSDT---LPPFYKARSMVVLEETARAKRTAVTTANSTFLAAQEDN 151
            +GL   YD + + I+ S +    P F    S V   ++         + N       E +
Sbjct: 173  NGLGREYDPITTVIQSSLSKLPAPTFNDVISEVQGFDSKLQSYDDTVSVNPHLAFNTERS 232

Query: 152  NSGNSPSHRPNRNNNSNNGSRGGRGGSGNNHGGRGRGRGG----GRGGRSHNYQQGSWQQ 207
            NSG      P  N+NS      GRG SG N     RGRGG    GRG             
Sbjct: 233  NSG-----APQYNSNSR-----GRGRSGQN-----RGRGGYSTRGRG------------- 264

Query: 208  PQAPQHQWAYPPYQYPSWNAPWQPWATPPCPYPTTGNWQRPVVPNRQAGILGAKPQQAHV 267
                QHQ A P                         + QRPV   +  G +G    + + 
Sbjct: 265  --FSQHQSASP------------------------SSGQRPVC--QICGRIGHTAIKCYN 296

Query: 268  AATPSSYAPTDIQAAMHSLSLAPPDEQWYMDTGATSHMTANGGNLTSYSNISNN--ITVG 325
                +  +    QA           ++WY D+ AT+H+TA+   L + +    N  + VG
Sbjct: 297  RFDNNYQSEVPTQAFSALRVSDETGKEWYPDSAATAHITASTSGLQNATTYEGNDAVLVG 356

Query: 326  SGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSV 385
             G  +P+   G+  + + +  + LN VL  P + K+L+SV K   D    V FD     +
Sbjct: 357  DGTYLPITHVGSTTISSSKGTIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDANKVCI 416

Query: 386  KDFQTGMPLMRCNSSGDLYPLATRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILNSLHR 445
             D  T   + +   +  LY L    +    +    AA S E WH+RLGH    IL  L  
Sbjct: 417  IDLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQCAA-SMETWHHRLGHSNSKILQQLLT 475

Query: 446  NNSILCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIW-TSPILSSGGHHY 504
               I  NK R +  C+ CQ+GK  +L F+ S    L P D VH D+W  SP++S+ G  Y
Sbjct: 476  RKEIQVNKSRTSPVCEPCQMGKSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKY 535

Query: 505  YILFLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQ 564
            Y +F+DDF+ F W FPL  KS+  S+F+ +   ++ Q    IK FQ D G E+ ++   +
Sbjct: 536  YAVFVDDFSRFSWFFPLRMKSKFISVFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLKE 595

Query: 565  FCKQNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLN 624
              +++G+  R SCP+T  QNG  ERK R +  +  ++L H+  P  FW  A   A YL N
Sbjct: 596  HFREHGIHHRISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSN 655

Query: 625  ILPTKKLALQTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSN 684
            +LP+  L   +P   L+Q+   Y+ L+VFG  C+P +    +NK   RS  CVFLGY + 
Sbjct: 656  LLPSSVLKEISPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQ 715

Query: 685  HRGYKCFELSSRKIIISRHVIFDENTFPFSNSNIPESSCYNFLDTSDTPFPYHLFQHTSN 744
            ++GY+C    + K+ ISRHVIFDE  FPF          Y+ L           +QHT  
Sbjct: 716  YKGYRCLYPPTGKVYISRHVIFDEAQFPFKEK-------YHSLVPKYQTTLLQAWQHTDL 768

Query: 745  LPTNEPNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTSPQLTITPNTASPHQIP 804
             P + P+S  QP     TP+ T  +     ++   A +  ++TS       N    H++ 
Sbjct: 769  TPPSVPSSQLQPLARQMTPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVA 828

Query: 805  P-------PNPL----LANPPLSPQMTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPIN 853
            P        N L    L N  L P M TR++ GI KP     L  S S+   P       
Sbjct: 829  PVLNDQNEDNALGQGSLEN--LHP-MITRSKDGIQKPNPRYALIVSKSSFDEPKTIT--T 883

Query: 854  ALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLV 913
            A++  +W  A+ DE D +    TW LVP   + NI+ S W+F+ K K DG+ ++ KARLV
Sbjct: 884  AMKHPSWNAAVMDEIDRIHMLNTWSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKARLV 943

Query: 914  GNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMY 973
              G +Q+ GVD  ETFSPVV+ ATIR VL  A +  W L QLDV NAFLHG L E V+M+
Sbjct: 944  AKGFDQEEGVDYLETFSPVVRTATIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVFMF 1003

Query: 974  QPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGD 1033
            QP GF DP  P++VC L K+LYGLKQAPRAW+  F++++   GF  S  D SLF+ H   
Sbjct: 1004 QPSGFVDPNKPNHVCRLTKALYGLKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVCHQNG 1063

Query: 1034 DTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSD------ 1087
             +  +LLYVDDI+LT S   L   ++  LN+ F+MKDLGP  YFLGI +  +++      
Sbjct: 1064 QSLILLLYVDDILLTGSDQLLMDKLLQALNNRFSMKDLGPPRYFLGIEIESYNNGLFLHQ 1123

Query: 1088 ---------------------DTKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISY 1126
                                      L   +  P+ +P+ +RSLAG LQYLT TRPDI Y
Sbjct: 1124 HAYASDILHQAGMTECNPMPTPLPQHLEDLNSEPFEEPTYFRSLAGKLQYLTITRPDIQY 1183

Query: 1127 AVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTR 1186
            AV  +C  MH P       LKRI+RY+KGT   GL +       L+ + D+D+ GC DTR
Sbjct: 1184 AVNFICQRMHAPTNSDFGLLKRILRYVKGTINMGLPIRKHHNPVLSGFCDSDYAGCKDTR 1243

Query: 1187 KSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTK 1246
            +ST+G+C+ LG  L+SWSAKRQPT+S SS EAEYR +++   E  W+ +LL +L     +
Sbjct: 1244 RSTTGFCILLGSTLISWSAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQ 1303

Query: 1247 ATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTK 1306
             T V+CDN+SAVYLS NP  H+R+KH + D H++RE+VA G +   H+P+  Q+AD+FTK
Sbjct: 1304 PTRVFCDNLSAVYLSANPALHKRSKHFDKDFHYIRERVALGLIETQHIPATIQLADVFTK 1363

Query: 1307 GLPLQLFDDFRDSLHIRQPPVSTT 1330
             LP + F   R  L +   PVS T
Sbjct: 1364 SLPRRPFITLRAKLGVSASPVSPT 1387


>gb|AAC02669.1| polyprotein [Arabidopsis thaliana]
          Length = 1451

 Score =  778 bits (2010), Expect = 0.0
 Identities = 485/1358 (35%), Positives = 682/1358 (49%), Gaps = 151/1358 (11%)

Query: 65   SYCQHIKSLSDQLANVGAPVSNERMVLQLVSGLSEAYDTVGSQIRHSDTLPPFYKARSMV 124
            +Y Q   +  D LA +G     E  +  ++ GL E Y TV  QI   +  P   +    +
Sbjct: 151  TYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIEGRENPPALTEVLEKL 210

Query: 125  VLEETARAKRTAVTTANSTFLAAQEDNNSGNSPSHRPNRNNNSNNGSRGGRGGSGNNHGG 184
            +  E   A +   T+   T           N+ ++R N NNN+N+ S G     GN    
Sbjct: 211  INHEVKLAAKAEATSVPVT----------ANAVNYRGNNNNNNNSRSNGRNNSRGNT--- 257

Query: 185  RGRGRGGGRGGRSHNYQQGSWQQPQAPQHQWAYPPYQYPSWNAPWQPWATPPCPYPTTGN 244
                               SWQ  Q+  ++  Y P  Y              C   +   
Sbjct: 258  -------------------SWQNSQSTSNRQQYTPRPYQG-----------KCQICSVHG 287

Query: 245  WQRPVVPNRQAGILGAKPQQAHVAATPSSYAPTDIQAAMHSLSLAPPDE-QWYMDTGATS 303
                  P  Q         Q+  A    SYAP   +A M  +S  P +   W +D+GAT 
Sbjct: 288  HSARRCPQLQQHAGSYASNQSSSA----SYAPWQPRANM--VSATPYNSGNWLLDSGATH 341

Query: 304  HMTANGGNLTSYS--NISNNITVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKN 361
            H+T++  NL  +   N    +T+  G  +P+   G+AL+  P   L L +VL+ P + KN
Sbjct: 342  HLTSDLNNLALHQPYNGDEEVTIADGSGLPISHSGSALLPTPTRSLALKDVLYVPDIQKN 401

Query: 362  LVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPLMRCNSSGDLYPLATRPYYPSTTPSTFA 421
            L+SV +    N VSVEF P  F VKD  TG  L++  +  +LY     P   S   S FA
Sbjct: 402  LISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQGKTKNELYEW---PVNSSIATSMFA 458

Query: 422  ALSNEI----WHNRLGHPGVSILNSLHRNNSI-LCNKFRNNFFCQSCQLGKQIKLPFYES 476
            + + +     WH RLGHP + IL +L    S+ + +  +N   C  C + K  KLPFY +
Sbjct: 459  SPTPKTDLPSWHARLGHPSLPILKALISKFSLPISHSLQNQLLCSDCSINKSHKLPFYSN 518

Query: 477  LSSTLLPFDIVHSDIWTSPILSSGGHHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCN 536
              ++  P + +++D+WTSPI S   + YY++ +D +T + W +PL  KSQ   +F+ F  
Sbjct: 519  TIASSHPLEYLYTDVWTSPITSIDNYKYYLVIVDHYTRYTWLYPLRKKSQVREMFITFTA 578

Query: 537  HIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINN 596
             ++ +F+  I     DNG E+       F   +G+    + PHT   NG  ERK R I  
Sbjct: 579  LVENKFKFKIGTLYSDNGGEF--IAMRSFLASHGISHMTTPPHTPELNGISERKHRHIVE 636

Query: 597  IIRTLLAHASLPPSFWHHALQMATYLLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCL 656
               TLL+ AS+P  +W +A   A YL+N + T  L  ++P   L+ + P+Y  L++FGCL
Sbjct: 637  TGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLGNESPYVKLFGQPPNYLKLRIFGCL 696

Query: 657  CFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYKCFELSSRKIIISRHVIFDENTFPFSNS 716
            CFP +   T +KL  RS PCV LGY  +   Y C + ++ ++  SRHV F E++FPFS +
Sbjct: 697  CFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFAESSFPFSTT 756

Query: 717  N---IPESSCYNFLDTSDTPFPYHLFQHTSNLPTNEPN-----------SHDQPPT---- 758
            +    P S      DT     P  L +  +  P + P+            +  PP     
Sbjct: 757  SPSVTPPSDPPLSQDTRPVSVPL-LARPLTTAPPSSPSCSAPHRSPSQSENLSPPAPLQP 815

Query: 759  ----TTATPLTTPY-----------SINTTTHQPTVAPSPQ--IQTSPQLT--------- 792
                +  +P+T+P                T   P ++P PQ     SPQ T         
Sbjct: 816  SLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLSPQPQRPQPQSPQSTSPHSSSPQP 875

Query: 793  -----------ITPN-TASPHQIPPPNPLLANPPLSPQMTTRAQHGIFKPR-QLLNLHTS 839
                       +TP  T+SP   PPPNP    PP+   M TR+++ I KP  +  NL T 
Sbjct: 876  NSPNPQHSPRSLTPTLTSSPSPSPPPNP--NPPPIQHTMRTRSKNNIVKPNPKFANLATK 933

Query: 840  SSNQISPLPTNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKK 899
             +     +P   + AL D NW+ AM DE +A   N T+DLVP   N N++   W+F  K 
Sbjct: 934  PTPLKPIIPKTVVEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVVGCKWVFTLKY 993

Query: 900  KADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKN 959
             ++G  +RYKARLV  G +QQ G D  ETFSPV+K  T+R+VL IA+SK W + Q+DV N
Sbjct: 994  LSNGVLDRYKARLVAKGFHQQYGHDFKETFSPVIKLTTVRSVLHIAVSKGWSIRQIDVNN 1053

Query: 960  AFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSH 1019
            AFL G L++ VY+ QPPGF D  +  +VC L K+LYGLKQAPRAWYQ    Y+ T GF +
Sbjct: 1054 AFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLYKALYGLKQAPRAWYQELRSYLLTQGFVN 1113

Query: 1020 SVCDHSLFIYHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLG 1079
            SV D SLF         Y+L+YVDD+++T S   +    ++ L + F++KDLG +SYFLG
Sbjct: 1114 SVADTSLFTLRHERTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLG 1173

Query: 1080 ISVTRHSD-----------------------------DTKAKLSGTSGNPYHDPSEYRSL 1110
            I  TR S                                  KLS TSG P   PSEYR++
Sbjct: 1174 IEATRTSKGLHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAV 1233

Query: 1111 AGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDK 1170
             G+LQYL FTRPDI+YAV ++  +MH P   H  A KRI+RY+ GT +HG+ +   T   
Sbjct: 1234 LGSLQYLLFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLT 1293

Query: 1171 LTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSES 1230
            L  Y+DADW G  D   ST+ Y +YLG N +SWS+K+Q  ++RSS EAEYR VAN  SE 
Sbjct: 1294 LHAYSDADWAGDIDNYNSTNAYILYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEI 1353

Query: 1231 CWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVR 1290
             W+ +LL EL   ++   +VYCDNV A YLS NP+   R KHI +D HFVRE V  G +R
Sbjct: 1354 RWVCSLLTELGITLSSPPVVYCDNVGATYLSANPVFDSRMKHIALDFHFVRESVQAGALR 1413

Query: 1291 VMHVPSRYQIADIFTKGLPLQLFDDFRDSLHIRQPPVS 1328
            V HV ++ Q+AD  TK LP Q F      + + + P S
Sbjct: 1414 VTHVSTKDQLADALTKPLPRQPFTTLISKIGVAKAPPS 1451


>gb|AAK43485.1| polyprotein, putative [Arabidopsis thaliana]
          Length = 1459

 Score =  771 bits (1992), Expect = 0.0
 Identities = 484/1384 (34%), Positives = 687/1384 (48%), Gaps = 162/1384 (11%)

Query: 48   LEQEFSKISMEQFSTASSYCQHIKSLSDQLANVGAPVSNERMVLQLVSGLSEAYDTVGSQ 107
            L Q+  +++ +   T   Y Q   +  DQLA +G P+ +E  V  ++ GL E Y TV  Q
Sbjct: 135  LRQQIQRLT-KGTKTIDEYVQSHTTRLDQLAILGKPMEHEEQVEHILKGLPEEYKTVVDQ 193

Query: 108  IRHSDTLPPFYKARSMVVLEETARAKRTAVTTANSTFLAAQEDNNSGNSPSHRPNRNNNS 167
            I   D  P   +    ++  E+         +++           S N+   R N NNN 
Sbjct: 194  IEGKDNTPTITEIHERLINHESKLLSDEVPPSSSFPM--------SANAVQQR-NFNNNC 244

Query: 168  NNGSRGGRGGSGNNHGGRGRGRGGGRGGRSHNYQQGSWQQPQA--PQHQWAYPPYQYPSW 225
            N           N H  R +G        +HN    +  QP       Q  + PY     
Sbjct: 245  NQ----------NQHKNRYQGN-------THNNNTNTNSQPSTYNKSGQRTFKPYLGKCQ 287

Query: 226  NAPWQPWATPPCPYPTTGNWQRPVVPNRQAGILGAKPQQAHVAATPSSYAPTDIQAAMHS 285
                Q  +   CP                      + Q   + A+ S+++P        +
Sbjct: 288  ICSVQGHSARRCP----------------------QLQAMQLPASSSAHSPFTPWQPRAN 325

Query: 286  LSLAPP--DEQWYMDTGATSHMTANGGNLTSYS--NISNNITVGSGHNIPVIGCGNALVQ 341
            L++  P     W +D+GAT H+T++   L+ +   N    + +  G  + +   G+  + 
Sbjct: 326  LAIGSPYAANPWLLDSGATHHITSDLNALSLHQPYNGGEYVMIADGTGLTIKQTGSTFLP 385

Query: 342  NPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPLMRCNSSG 401
            +    L L+ VL+ P + KNL+SV +    N+VSVEF P  F VKD  TG  L++  +  
Sbjct: 386  SQNRDLALHKVLYVPDIRKNLISVYRLCNTNQVSVEFFPASFQVKDLNTGTLLLQGRTKD 445

Query: 402  DLY------PLATRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILNSLHRNNSILCNKFR 455
            DLY      P AT  +   T+PS    LS+  WH+RLGHP  SILN+L    S+  +   
Sbjct: 446  DLYEWPVTNPPATALF---TSPSPKTTLSS--WHSRLGHPSASILNTLLSKFSLPVSVAS 500

Query: 456  NN-FFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIWTSPILSSGGHHYYILFLDDFTD 514
            +N   C  C + K  KLPF  S   +  P + + +D+WTSPI+S   + YY++ +D +T 
Sbjct: 501  SNKTSCSDCLINKSHKLPFATSSIHSSSPLEYIFTDVWTSPIISHDNYKYYLVLVDHYTR 560

Query: 515  FLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMIFR 574
            + W +PL  KSQ  + F+ F   ++ +F+  I+    DNG E+       F   NG+   
Sbjct: 561  YTWLYPLQQKSQVKATFIAFKALVENRFQAKIRTLYSDNGGEF--IALRDFLVSNGISHL 618

Query: 575  FSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNILPTKKLALQ 634
             S PHT   NG  ERK R I     TLL  AS+P  +W +A   A YL+N +PT  L LQ
Sbjct: 619  TSPPHTPEHNGLSERKHRHIVETGLTLLTQASVPREYWTYAFATAVYLINRMPTPVLCLQ 678

Query: 635  TPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYKCFELS 694
            +P   L+  SP+Y  L+VFGCLCFP +   TRNKL+ RS  CVFLGY      Y C ++ 
Sbjct: 679  SPFQKLFGSSPNYQRLRVFGCLCFPWLRPYTRNKLEERSKRCVFLGYSLTQTAYLCLDVD 738

Query: 695  SRKIIISRHVIFDENTFPFSNS----------NIPESSCYNFLDTSDTPFPYHLFQHTS- 743
            + ++  SRHV+FDE+T+PF+ S            PESS  +    +++ FP  + +  S 
Sbjct: 739  NNRLYTSRHVMFDESTYPFAASIREQSQSSLVTPPESSSSS--SPANSGFPCSVLRLQSP 796

Query: 744  -----NLPTNEPNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTS---------- 788
                   P+     +D P +   T   TP S ++     T++PSP +  S          
Sbjct: 797  PASSPETPSPPQQQNDSPVSPRQTGSPTP-SHHSQVRDSTLSPSPSVSNSEPTAPHENGP 855

Query: 789  -PQLTITPNTASPHQIPPPNP-----------------LLANPP---------------- 814
             P+    PN+     +P PNP                   A PP                
Sbjct: 856  EPEAQSNPNSPFIGPLPNPNPETNPSSSIEQRPVDKSTTTALPPNQTTIAATSNSRSQPP 915

Query: 815  -LSPQMTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPINALQDHNWKMAMKDEYDALID 873
              + QM TR+++ I KP+   +L  + +      P     AL+D  W+ AM DE+DA   
Sbjct: 916  KNNHQMKTRSKNNITKPKTKTSLTVALTQPHLSEPNTVTQALKDKKWRFAMSDEFDAQQR 975

Query: 874  NKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTGVDCGETFSPVV 933
            N TWDLVP     +++   W+F+ K   +G  ++YKARLV  G NQQ GVD  ETFSPV+
Sbjct: 976  NHTWDLVPPNPTQHLVGCRWVFKLKYLPNGLIDKYKARLVAKGFNQQYGVDYAETFSPVI 1035

Query: 934  KPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKS 993
            K  TIR VL +A+ K+W L QLDV NAFL G L E VYM QPPGF D   P +VC L+K+
Sbjct: 1036 KATTIRVVLDVAVKKNWPLKQLDVNNAFLQGTLTEEVYMAQPPGFVDKDRPSHVCRLRKA 1095

Query: 994  LYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYVDDIILTASSDT 1053
            +YGLKQAPRAWY     ++  +GF +S+ D SLFIY  G    Y+L+YVDDII+T S   
Sbjct: 1096 IYGLKQAPRAWYMELKQHLLNIGFVNSLADTSLFIYSHGTTLLYLLVYVDDIIVTGSDHK 1155

Query: 1054 LRQSIMSKLNSEFAMKDLGPLSYFLGISVTRHSD-------------------------- 1087
               +++S L   F++KD   L YFLGI  TR +                           
Sbjct: 1156 SVSAVLSSLAERFSIKDPTDLHYFLGIEATRTNTGLHLMQRKYMTDLLAKHNMLDAKPVA 1215

Query: 1088 ---DTKAKLSGTSGNPYHDPSEYRSLAGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMT 1144
                T  KL+   G   +D SEYRS+ G+LQYL FTRPDI++AV ++  FMH P + H  
Sbjct: 1216 TPLPTSPKLTLHGGTKLNDASEYRSVVGSLQYLAFTRPDIAFAVNRLSQFMHQPTSDHWQ 1275

Query: 1145 ALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWS 1204
            A KR++RY+ GT+THG+ L  S+   L  ++DADW G      ST+ Y +YLG N +SWS
Sbjct: 1276 AAKRVLRYLAGTTTHGIFLNSSSPIHLHAFSDADWAGDSADYVSTNAYVIYLGRNPISWS 1335

Query: 1205 AKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNP 1264
            +K+Q  +SRSS E+EYR VAN  SE  WL +LL EL   +     ++CDN+ A Y+  NP
Sbjct: 1336 SKKQRGVSRSSTESEYRAVANAASEIRWLCSLLTELHIRLPHGPTIFCDNIGATYICANP 1395

Query: 1265 IQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLFDDFRDSLHIRQ 1324
            + H R KHI +D HFVR  +    +RV HV +  Q+AD  TK L    F   R  + +RQ
Sbjct: 1396 VFHSRMKHIALDYHFVRGMIQSRALRVSHVSTNDQLADALTKSLSRPHFLSARSKIGVRQ 1455

Query: 1325 PPVS 1328
             P S
Sbjct: 1456 LPPS 1459


>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078
            gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
            Arabidopsis thaliana
          Length = 1415

 Score =  766 bits (1977), Expect = 0.0
 Identities = 473/1331 (35%), Positives = 679/1331 (50%), Gaps = 126/1331 (9%)

Query: 40   NKNS--RALYLEQEFSKISMEQFSTASSYCQHIKSLSDQLANVGAPVSNERMVLQLVSGL 97
            NK+S  R   L Q    +S ++    S YC+  K++ D L+++G PV     +   ++GL
Sbjct: 117  NKSSVAREFSLRQNLQLLSKKE-KPFSVYCREFKTICDALSSIGKPVDESMKIFGFLNGL 175

Query: 98   SEAYDTVGSQIRHSDTLPPFYKARSMVVLEETARAKRTAVTTANST--FLAAQEDNNSGN 155
               YD + + I+ S +  P      +V   +   +K  +   A S    LA   + +   
Sbjct: 176  GRDYDPITTVIQSSLSKLPTPTFNDVVSEVQGFDSKLQSYEEAASVTPHLAFNIERSESG 235

Query: 156  SPSHRPNRNNNSNNGSRGGRGGSGNNHGGRGRGRGGGRGGRSHNYQQGSWQQPQAPQHQW 215
            SP + PN+     +G   GRGG    +  RGRG                       QHQ 
Sbjct: 236  SPQYNPNQKGRGRSGQNKGRGG----YSTRGRGFS---------------------QHQS 270

Query: 216  AYPPYQYPSWNAPWQPWATPPCPYPTTGNWQRPVVPNRQAGILGAKPQQAHVAATPSSYA 275
            +      P  + P                  RPV       I G   +  H A    +  
Sbjct: 271  S------PQVSGP------------------RPVCQ-----ICG---RTGHTALKCYNRF 298

Query: 276  PTDIQAAMHSLS-LAPPDE---QWYMDTGATSHMTANGGNLTSYSNISNN--ITVGSGHN 329
              + QA + + S L   D+   +W+ D+ AT+H+T++   L S +    +  + VG G  
Sbjct: 299  DNNYQAEIQAFSTLRVSDDTGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTY 358

Query: 330  IPVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQ 389
            +P+   G+  +++    + LN VL  P + K+L+SV K   D    V FD     + D Q
Sbjct: 359  LPITHTGSTTIKSSNGKIPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLQ 418

Query: 390  TGMPLMRCNSSGDLYPLATRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILNSLHRNNSI 449
            T   +        LY L  + +    +    AA + E+WH+RLGH     L  L  + +I
Sbjct: 419  TQKVVTTGPRRNGLYVLENQEFVALYSNRQCAA-TEEVWHHRLGHANSKALQHLQNSKAI 477

Query: 450  LCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIW-TSPILSSGGHHYYILF 508
              NK R +  C+ CQ+GK  +LPF  S S  L P D +H D+W  SP++S+ G  YY +F
Sbjct: 478  QINKSRTSPVCEPCQMGKSSRLPFLISDSRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIF 537

Query: 509  LDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQ 568
            +DD++ + W +PL NKS+  S+F+ F   ++ Q    IK FQ D G E+ ++       +
Sbjct: 538  VDDYSRYSWFYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQSDGGGEFVSNKLKTHLSE 597

Query: 569  NGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNILPT 628
            +G+  R SCP+T  QNG  ERK R +  +  ++L H+  P  FW  +   A Y++N LP+
Sbjct: 598  HGIHHRISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYIINRLPS 657

Query: 629  KKLALQTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHRGY 688
              L   +P   L+ + P YS L+VFG  C+P +    +NK   RS  CVFLGY S ++GY
Sbjct: 658  SVLKNLSPYEALFGEKPDYSSLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQYKGY 717

Query: 689  KCFELSSRKIIISRHVIFDENTFPFSN---SNIPESSCYNFLDTSDTPFPYHLFQHTSNL 745
            +CF   + K+ ISR+VIF+E+  PF     S +P+ S         TP     +QH    
Sbjct: 718  RCFYPPTGKVYISRNVIFNESELPFKEKYQSLVPQYS---------TPL-LQAWQHNKI- 766

Query: 746  PTNEPNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTSPQLTITPNTASPHQIPP 805
                  S    P       + P  +NT           +  T P+ T     +     P 
Sbjct: 767  ------SEISVPAAPVQLFSKPIDLNTYAGSQVT----EQLTDPEPTSNNEGSDEEVNPV 816

Query: 806  PNPLLANPPL---SPQMTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPINALQDHNWKM 862
               + AN      S  MTTR++ GI KP     L TS  N   P      +A++   W  
Sbjct: 817  AEEIAANQEQVINSHAMTTRSKAGIQKPNTRYALITSRMNTAEPKTL--ASAMKHPGWNE 874

Query: 863  AMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTG 922
            A+ +E + +    TW LVP   + NI+ S W+F+ K   DGS ++ KARLV  G +Q+ G
Sbjct: 875  AVHEEINRVHMLHTWSLVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEG 934

Query: 923  VDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQ 982
            VD  ETFSPVV+ ATIR VL ++ SK W + QLDV NAFLHG L E V+MYQP GF DPQ
Sbjct: 935  VDYLETFSPVVRTATIRLVLDVSTSKGWPIKQLDVSNAFLHGELQEPVFMYQPSGFIDPQ 994

Query: 983  HPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYV 1042
             P +VC L K++YGLKQAPRAW+  F++++   GF  S  D SLF+ H      Y+LLYV
Sbjct: 995  KPTHVCRLTKAIYGLKQAPRAWFDTFSNFLLDYGFVCSKSDPSLFVCHQDGKILYLLLYV 1054

Query: 1043 DDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISV-----------TRHSDDTKA 1091
            DDI+LT S  +L + ++  L + F+MKDLGP  YFLGI +           T ++ D   
Sbjct: 1055 DDILLTGSDQSLLEDLLQALKNRFSMKDLGPPRYFLGIQIEDYANGLFLHQTAYATDILQ 1114

Query: 1092 KLSGTSGNP----------------YHDPSEYRSLAGALQYLTFTRPDISYAVQQVCLFM 1135
            +   +  NP                + +P+ +RSLAG LQYLT TRPDI +AV  +C  M
Sbjct: 1115 QAGMSDCNPMPTPLPQQLDNLNSELFAEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRM 1174

Query: 1136 HDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCPDTRKSTSGYCVY 1195
            H P T     LKRI+RYIKGT   GL +  ++   L+ Y+D+D  GC +TR+ST+G+C+ 
Sbjct: 1175 HSPTTSDFGLLKRILRYIKGTIGMGLPIKRNSTLTLSAYSDSDHAGCKNTRRSTTGFCIL 1234

Query: 1196 LGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCDNV 1255
            LG NL+SWSAKRQPT+S SS EAEYR +     E  W+  LL +L  P    T VYCDN+
Sbjct: 1235 LGSNLISWSAKRQPTVSNSSTEAEYRALTYAAREITWISFLLRDLGIPQYLPTQVYCDNL 1294

Query: 1256 SAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLFDD 1315
            SAVYLS NP  H R+KH + D H++RE+VA G +   H+ + +Q+AD+FTK LP + F D
Sbjct: 1295 SAVYLSANPALHNRSKHFDTDYHYIREQVALGLIETQHISATFQLADVFTKSLPRRAFVD 1354

Query: 1316 FRDSLHIRQPP 1326
             R  L +   P
Sbjct: 1355 LRSKLGVSGSP 1365


>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  760 bits (1963), Expect = 0.0
 Identities = 483/1346 (35%), Positives = 679/1346 (49%), Gaps = 124/1346 (9%)

Query: 40   NKNS--RALYLEQEFSKISMEQFSTASSYCQHIKSLSDQLANVGAPVSNERMVLQLVSGL 97
            NK+S  R   L +    +S +   T S+YC+   ++ D L+++G PV     +   ++GL
Sbjct: 117  NKSSVAREFTLRRTLQLLSKKD-KTLSAYCREFIAVCDALSSIGKPVDESMKIFGFLNGL 175

Query: 98   SEAYDTVGSQIRHS--DTLPPFYKARSMVVLEETARAKRTAVTTANSTFLAAQEDNNSGN 155
               YD + + I+ S     PP ++     V     + +    +   +  +A         
Sbjct: 176  GREYDPITTVIQSSLSKISPPTFRDVISEVKGFDVKLQSYEESVTANPHMAFN------- 228

Query: 156  SPSHRPNRNNNSNNGSRG-GRGGSGNNHGGRGRGRGGGRGGRSHNYQQGSWQQPQAPQHQ 214
              + R    +N  +G+RG GRGG G N G  G    G RG   H     +          
Sbjct: 229  --TQRSEYTDNYTSGNRGKGRGGYGQNRGRSGYSTRG-RGFSQHQTNSNN---------- 275

Query: 215  WAYPPYQYPSWNAPWQPWATPPCPYPTTGNWQRPVVPNRQAGILGAKPQQAHVAATPSSY 274
                                       TG  +RPV   +  G  G    + +      +Y
Sbjct: 276  ---------------------------TG--ERPVC--QICGRTGHTALKCY-NRFDHNY 303

Query: 275  APTDIQAAMHSLSLAPPD-EQWYMDTGATSHMTANGGNLTSYS--NISNNITVGSGHNIP 331
               D   A  SL ++    ++W  D+ AT+H+T++  NL + S  N S+ + VG G  +P
Sbjct: 304  QSVDTAQAFSSLRVSDSSGKEWVPDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLP 363

Query: 332  VIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQTG 391
            +   G+  + +    L LN VL  P + K+L+SV K   D    V FD     + D  T 
Sbjct: 364  ITHVGSTTISSDSGTLPLNEVLVCPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQ 423

Query: 392  MPLMRCNSSGDLYPLATRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILNSLHRNNSILC 451
              + +   S  LY L  + +    +    AA S EIWH+RLGH    IL  L  +  I  
Sbjct: 424  KVVSKGPRSNGLYVLENQEFVAFYSNRQCAA-SEEIWHHRLGHSNSRILQQLKSSKEISF 482

Query: 452  NKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIW-TSPILSSGGHHYYILFLD 510
            NK R +  C+ CQ+GK  KL F+ S S  L     +H D+W  SP++S  G  YY++F+D
Sbjct: 483  NKSRMSPVCEPCQMGKSSKLQFFSSNSRELDLLGRIHCDLWGPSPVVSKQGFKYYVVFVD 542

Query: 511  DFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNG 570
            D++ + W +PL  KS   ++F+ F N ++ QF   IK FQ D G E+ ++   +     G
Sbjct: 543  DYSRYSWFYPLKAKSDFFAVFVAFQNLVENQFNTKIKVFQSDGGGEFTSNLMKKHLTDCG 602

Query: 571  MIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNILPTKK 630
            +  R SCP+T  QNG  ERK R    +  +++ H+  P  FW  A   A++L N+LP+  
Sbjct: 603  IQHRISCPYTPQQNGIAERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASFLSNMLPSPS 662

Query: 631  LALQTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYKC 690
            L   +P   L ++ P+Y+ L+VFG  C+P +     +K + RS  CVFLGY S ++GY+C
Sbjct: 663  LGNVSPLEALLKQKPNYAMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYNSQYKGYRC 722

Query: 691  FELSSRKIIISRHVIFDENTFPFSNSNIPESSCYNFLDTSDTPFPYHLFQHTSNLPTNEP 750
                + ++ ISRHVIFDE TFPF          Y FL           +Q  S++P  + 
Sbjct: 723  LYPPTGRVYISRHVIFDEETFPFKQK-------YQFLVPQYESSLLSAWQ--SSIPQADQ 773

Query: 751  NSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTSPQL------------------- 791
            +   Q        L  P SI   T Q T    P I T   L                   
Sbjct: 774  SLIPQAEEGKIESLAKPPSIQKNTIQDTTT-QPAILTEGVLNEEEEEDSFEETETESLNE 832

Query: 792  -TITPNTASPHQIPPPNPLLANPPLSPQMTTRAQHGIFKPRQLLNLHTSSSNQISPLPTN 850
             T T N  +  ++     +   P  +  MTTR++ GI K      L TS  +   P   +
Sbjct: 833  ETHTQNDEA--EVTVEEEVQQEPENTHPMTTRSKAGIHKSNTRYALLTSKFSVEEPKSID 890

Query: 851  PINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERYKA 910
               AL    W  A+ DE   +    TW LV    + NI+   W+F+ K K DGS ++ KA
Sbjct: 891  --EALNHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNILGCRWVFKTKLKPDGSVDKLKA 948

Query: 911  RLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNETV 970
            RLV  G +Q+ G+D  ETFSPVV+ ATIR VL +A +K W + QLDV NAFLHG L E V
Sbjct: 949  RLVAKGFHQEEGLDYLETFSPVVRTATIRLVLDVATAKGWNIKQLDVSNAFLHGELKEPV 1008

Query: 971  YMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYH 1030
            YM QPPGF D + P YVC L K+LYGLKQAPRAW+   ++Y+   GFS S  D SLF YH
Sbjct: 1009 YMLQPPGFVDQEKPSYVCRLTKALYGLKQAPRAWFDTISNYLLDFGFSCSKSDPSLFTYH 1068

Query: 1031 SGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISV-------- 1082
                T  +LLYVDDI+LT S   L Q ++  LN  F+MKDLG  SYFLG+ +        
Sbjct: 1069 KNGKTLVLLLYVDDILLTGSDHNLLQELLMSLNKRFSMKDLGAPSYFLGVEIESSPEGLF 1128

Query: 1083 ---TRHSDDT--KAKLSGTSGNP--------------YHDPSEYRSLAGALQYLTFTRPD 1123
               T ++ D   +A +S  +  P              + +P+ +RSLAG LQYLT TRPD
Sbjct: 1129 LHQTAYAKDILHQAAMSNCNSMPTPLPQHIENLNSDLFPEPTYFRSLAGKLQYLTITRPD 1188

Query: 1124 ISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADWGGCP 1183
            I +AV  +C  MH P T     LKRI+RY+KGT   GLH+  +    L  Y+D+DW GC 
Sbjct: 1189 IQFAVNFICQRMHSPTTADFGLLKRILRYVKGTIHLGLHIKKNQNLSLVAYSDSDWAGCK 1248

Query: 1184 DTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCP 1243
            +TR+ST+G+C  LG NL+SWSAKRQ T+S+SS EAEYR +  V  E  WL  LL ++   
Sbjct: 1249 ETRRSTTGFCTLLGCNLISWSAKRQETVSKSSTEAEYRALTAVAQELTWLSFLLRDIGVT 1308

Query: 1244 VTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADI 1303
             T  TLV CDN+SAVYLS NP  H R+KH + D H++RE+VA G V   H+ +  Q+ADI
Sbjct: 1309 QTHPTLVKCDNLSAVYLSANPALHNRSKHFDTDYHYIREQVALGLVETKHISATLQLADI 1368

Query: 1304 FTKGLPLQLFDDFRDSLHIRQPPVST 1329
            FTK LP + F D R  L + +PP ++
Sbjct: 1369 FTKPLPRRAFIDLRIKLGVAEPPTTS 1394


>gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis thaliana]
            gi|25301689|pir||C96578 hypothetical protein T18A20.5
            [imported] - Arabidopsis thaliana
          Length = 1522

 Score =  741 bits (1914), Expect = 0.0
 Identities = 467/1388 (33%), Positives = 693/1388 (49%), Gaps = 168/1388 (12%)

Query: 38   FDNKNSRALYLEQEFSKISMEQFSTASSYCQHIKSLSDQLANVGAPVSNERMVLQLVSGL 97
            F+  +S  L+  Q   +   ++ +T   + + +K + DQLA+VG+PV  +  +   ++GL
Sbjct: 114  FNRVSSSRLFELQRRLQTLEKKDNTMEVFLKDLKHICDQLASVGSPVPEKMKIFSALNGL 173

Query: 98   SEAYDTVGSQIRHSDTLPPFYKARSMVVLEETARAKRTAVTTANSTFLAAQEDNNSGNSP 157
               Y+ + + I +S    P       + L+E A   R       S         +   + 
Sbjct: 174  GREYEPIKTTIENSVDSNP------SLSLDEVASKLRGYDDRLQSYVTEPTISPHVAFNV 227

Query: 158  SHRPNRNNNSNNGSRGGRGGSGNNHGGRGRGRGGGRGGRSHNYQQGSWQQPQAPQHQWAY 217
            +H  +   ++NN       G G ++ G G+     RG   H  QQ S             
Sbjct: 228  THSDSGYYHNNNR------GKGRSNSGSGKSSFSTRGRGFH--QQIS------------- 266

Query: 218  PPYQYPSWNAPWQPWATPPCPYPTTGNWQRPVVPNRQAGILGAKPQQAHVAAT------- 270
                                  PT+G+         QAG  G   Q    A         
Sbjct: 267  ----------------------PTSGS---------QAGNSGLVCQICGKAGHHALKCWH 295

Query: 271  --PSSYAPTDIQAAMHSLSLAPPDE----QWYMDTGATSHMTANGGNLTSYSNI--SNNI 322
               +SY   D+  A+ ++ +    +    +W  D+ A++H+T N   L        S++I
Sbjct: 296  RFDNSYQHEDLPMALATMRITDVTDHHGHEWIPDSAASAHVTNNRHVLQQSQPYHGSDSI 355

Query: 323  TVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFG 382
             V  G+ +P+   G+  + +    + L  VL  P ++K+L+SV K T D   SVEFD   
Sbjct: 356  MVADGNFLPITHTGSGSIASSSGKIPLKEVLVCPDIVKSLLSVSKLTSDYPCSVEFDADS 415

Query: 383  FSVKDFQTGMPLMRCNSSGDLYPLATRPYYPSTTPSTFAALSNEIWHNRLGHPGVSILNS 442
              + D  T   L+   +   LY L   P       +   + S+E+WH RLGH    +L+ 
Sbjct: 416  VRINDKATKKLLVMGRNRDGLYSLE-EPKLQVLYSTRQNSASSEVWHRRLGHANAEVLHQ 474

Query: 443  LHRNNSILCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIW-TSPILSSGG 501
            L  + SI+         C++C LGK  +LPF  S  +   P + +H D+W  SP  S  G
Sbjct: 475  LASSKSIIIINKVVKTVCEACHLGKSTRLPFMLSTFNASRPLERIHCDLWGPSPTSSVQG 534

Query: 502  HHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSH 561
              YY++F+D ++ F W +PL  KS   S F+ F   ++ Q    IK FQCD G E+ +S 
Sbjct: 535  FRYYVVFIDHYSRFTWFYPLKLKSDFFSTFVMFQKLVENQLGHKIKIFQCDGGGEFISSQ 594

Query: 562  FHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATY 621
            F +  + +G+    SCP+T  QNG  ERK R I  +  +++  + LP  +W  +   A +
Sbjct: 595  FLKHLQDHGIQQNMSCPYTPQQNGMAERKHRHIVELGLSMIFQSKLPLKYWLESFFTANF 654

Query: 622  LLNILPTKKLAL-QTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLG 680
            ++N+LPT  L   ++P   LY K+P YS L+VFGC C+P +      K   RS  CVFLG
Sbjct: 655  VINLLPTSSLDNNESPYQKLYGKAPEYSALRVFGCACYPTLRDYASTKFDPRSLKCVFLG 714

Query: 681  YPSNHRGYKCFELSSRKIIISRHVIFDENTFPFSNSNIPESSCYNFLDTSD-TPFPYHLF 739
            Y   ++GY+C    + +I ISRHV+FDENT PF        S Y+ L   D TP     F
Sbjct: 715  YNEKYKGYRCLYPPTGRIYISRHVVFDENTHPFE-------SIYSHLHPQDKTPLLEAWF 767

Query: 740  QHTSNLPTNEPNSHDQPPTTTATPLTT-----PYSINTTTHQPTVAPSPQ--------IQ 786
            +   ++   +P+    P ++   P TT     P S+   T  P  +            + 
Sbjct: 768  KSFHHVTPTQPDQSRYPVSSIPQPETTDLSAAPASVAAETAGPNASDDTSQDNETISVVS 827

Query: 787  TSPQLTITPNTAS---PHQIPP-----PNPLLANPPLSPQ-------------------- 818
             SP+ T   ++AS    +  P      P+P  ++P  SPQ                    
Sbjct: 828  GSPERTTGLDSASIGDSYHSPTADSSHPSPARSSPASSPQGSPIQMAPAQQVQAPVTNEH 887

Query: 819  -MTTRAQHGIFKPRQLLNLHTSSSNQISPLPTNPINALQDHNWKMAMKDEYDALIDNKTW 877
             M TR + GI KP +   L T   +   P P     AL+   W  AM++E     + +TW
Sbjct: 888  AMVTRGKEGISKPNKRYVLLTHKVS--IPEPKTVTEALKHPGWNNAMQEEMGNCKETETW 945

Query: 878  DLVPRPSNANIIRSLWIFRHKKKADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPAT 937
             LVP   N N++ S+W+FR K  ADGS ++ KARLV  G  Q+ G+D  ET+SPVV+  T
Sbjct: 946  TLVPYSPNMNVLGSMWVFRTKLHADGSLDKLKARLVAKGFKQEEGIDYLETYSPVVRTPT 1005

Query: 938  IRTVLSIALSKSWCLHQLDVKNAFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGL 997
            +R +L +A    W L Q+DVKNAFLHG+L ETVYM QP GF D   PD+VCLL KSLYGL
Sbjct: 1006 VRLILHVATVLKWELKQMDVKNAFLHGDLTETVYMRQPAGFVDKSKPDHVCLLHKSLYGL 1065

Query: 998  KQAPRAWYQRFTDYVATLGFSHSVCDHSLFIYHSGDDTAYILLYVDDIILTASSDTLRQS 1057
            KQ+PRAW+ RF++++   GF  S+ D SLF+Y S +D   +LLYVDD+++T ++      
Sbjct: 1066 KQSPRAWFDRFSNFLLEFGFICSLFDPSLFVYSSNNDVILLLLYVDDMVITGNNSQSLTH 1125

Query: 1058 IMSKLNSEFAMKDLGPLSYFLGISV-----------TRHSDDTKAKLSGTSGNP------ 1100
            +++ LN EF MKD+G + YFLGI +            ++++D     S  + +P      
Sbjct: 1126 LLAALNKEFRMKDMGQVHYFLGIQIQTYDGGLFMSQQKYAEDLLITASMANCSPMPTPLP 1185

Query: 1101 ------------YHDPSEYRSLAGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKR 1148
                        + DP+ +RSLAG LQYLT TRPDI +AV  VC  MH P       LKR
Sbjct: 1186 LQLDRVSNQDEVFSDPTYFRSLAGKLQYLTLTRPDIQFAVNFVCQKMHQPSVSDFNLLKR 1245

Query: 1149 IIRYIKGTSTHGLH----------LYPSTVDKLTTYTDADWGGCPDTRKSTSGYCVYLGD 1198
            I+RYIKGT + G+            Y S  D L+ Y+D+D+  C +TR+S  GYC ++G 
Sbjct: 1246 ILRYIKGTVSMGIQYNSNSSSVVSAYESDYD-LSAYSDSDYANCKETRRSVGGYCTFMGQ 1304

Query: 1199 NLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLELQCPVTKATLVYCDNVSAV 1258
            N++SWS+K+QPT+SRSS EAEYR ++   SE  W+ ++L E+   +     ++CDN+SAV
Sbjct: 1305 NIISWSSKKQPTVSRSSTEAEYRSLSETASEIKWMSSILREIGVSLPDTPELFCDNLSAV 1364

Query: 1259 YLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQIADIFTKGLPLQLFDDFRD 1318
            YL+ NP  H RTKH ++D H++RE+VA   + V H+P   Q+ADIFTK LP + F   R 
Sbjct: 1365 YLTANPAFHARTKHFDVDHHYIRERVALKTLVVKHIPGHLQLADIFTKSLPFEAFTRLRF 1424

Query: 1319 SLHIRQPP 1326
             L +  PP
Sbjct: 1425 KLGVDFPP 1432


>gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir||T31353 polyprotein
            - Arabidopsis arenosa Evelknievel retrotransposon
            (fragment)
          Length = 1390

 Score =  736 bits (1901), Expect = 0.0
 Identities = 462/1297 (35%), Positives = 653/1297 (49%), Gaps = 147/1297 (11%)

Query: 65   SYCQHIKSLSDQLANVGAPVSNERMVLQLVSGLSEAYDTVGSQIRHSDTLPPFYKARSMV 124
            +Y Q   +  D LA +G     E  +  ++ GL E Y TV  QI   +  P   +    +
Sbjct: 151  TYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIESRENPPALTEVLEKL 210

Query: 125  VLEETARAKRTAVTTANSTFLAAQEDNNSGNSPSHRPNRNNNSNNGSRGGRGGSGNNHGG 184
            +  E   A +   T++            + N+ ++R N NNN++                
Sbjct: 211  INHEVKLAAKAEATSSVPI---------TANAVNYRGNNNNNNS---------------- 245

Query: 185  RGRGRGGGRGGRSHNYQQGSWQQPQAPQHQWAYPPYQYPSWNAPWQPWATPPCPYPTTGN 244
            R  GR   RG  S       WQ  Q+  ++  Y P  Y              C   +   
Sbjct: 246  RSNGRNNSRGNTS-------WQNNQSTTNRQQYTPRPYQG-----------KCQICSVHG 287

Query: 245  WQRPVVPNRQAGILGAKPQQAHVAATPSSYAPTDIQAAMHSLSLAPPDE-QWYMDTGATS 303
                  P  Q         Q+    + SSYAP   +A M  +S  P +   W +D+GAT 
Sbjct: 288  HSARRCPQLQQHAGSYASNQS----SSSSYAPWQPRANM--VSATPYNSGNWLLDSGATH 341

Query: 304  HMTANGGNLTSYS--NISNNITVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKN 361
            H+T++  NL  +   N    +T+  G  +P+   G+AL+  P   L L +VL+ P + KN
Sbjct: 342  HLTSDLNNLALHQPYNGGEEVTIADGSGLPISHSGSALLPTPTRSLDLKDVLYVPDIQKN 401

Query: 362  LVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPLMRCNSSGDLYPLATRPYYPSTTPSTFA 421
            L+SV +    N VSVEF P  F VKD  TG  L++  +  +LY     P   S   S FA
Sbjct: 402  LISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQGKTKNELYEW---PVNSSIATSMFA 458

Query: 422  ALSNEI----WHNRLGHPGVSILNSLHRNNSI-LCNKFRNNFFCQSCQLGKQIKLPFYES 476
            + + +     WH RLGHP + IL +L    S+ + +  +N   C  C + K  KLPFY +
Sbjct: 459  SPTPKTDLPSWHARLGHPSLPILKTLISKFSLPISHSLQNQLLCSDCSINKSHKLPFYSN 518

Query: 477  LSSTLLPFDIVHSDIWTSPILSSGGHHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCN 536
              ++  P + +++D+WTSPI S   + YY++ +D +T + W +PL  KSQ    F+ F  
Sbjct: 519  TIASSHPLEYLYTDVWTSPITSIDNYKYYLVIVDHYTRYTWLYPLRQKSQVRETFITFTA 578

Query: 537  HIKTQFERDIKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINN 596
             ++ +F+  I     DNG E+       F   +G+    + PHT   NG  ERK R I  
Sbjct: 579  LVENKFKSKIGTLYSDNGGEF--IALRSFLASHGISHMTTPPHTPELNGISERKHRHIVE 636

Query: 597  IIRTLLAHASLPPSFWHHALQMATYLLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCL 656
               TLL+ AS+   +W +A   A YL+N + T  L  ++P   L+ + P+Y  L+VFGCL
Sbjct: 637  TGLTLLSTASMSKEYWSYAFTTAVYLINRMLTPVLGNESPYMKLFGQPPNYLKLRVFGCL 696

Query: 657  CFPLIPSTTRNKLQARSTPCVFLGYPSNHRGYKCFELSSRKIIISRHVIFDENTFPFSNS 716
            CFP +   T +KL  RS PCV LGY  +   Y C + ++ ++  SRHV F E+ FPFS +
Sbjct: 697  CFPWLRPYTAHKLDNRSMPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFAESIFPFSTT 756

Query: 717  N---IPESSCYNFLDTSDTPFPYHLFQHTSNLPTNEPNS-------------------HD 754
            +    P S      DT     P  L +  +  P + P+                      
Sbjct: 757  SPSVTPPSDPPLSQDTRPISVPI-LARPLTTAPPSSPSCSAPHRSPSQPGILSPSAPFQP 815

Query: 755  QPPTTTATPLTTPYSINTTTH------------QPTVAPSPQIQ-------TSPQ----- 790
             PP++  +P+T+P S++  +H             P V+P PQ +       TSPQ     
Sbjct: 816  SPPSSPTSPITSP-SLSEESHVGHNQETGPTGSSPPVSPQPQSEQSTSPRSTSPQPNSPH 874

Query: 791  -----LTITPN-TASPHQIPPPNPLLANPPLSPQMTTRAQHGIFKPR-QLLNLHTSSSNQ 843
                  +ITP  T SP   PPPNP    PP+   M TR+++ I KP  +  NL T  +  
Sbjct: 875  TQHSPRSITPALTPSPSPSPPPNPN-PPPPIQHTMRTRSKNNIVKPNPKFANLATKPTPL 933

Query: 844  ISPLPTNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADG 903
               +P     AL D NW+ AM DE +A   N T+DLVP   N N+I   W+F  K   +G
Sbjct: 934  KPIIPKTVAEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVIGCKWVFTLKYLPNG 993

Query: 904  SFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLH 963
              +RYKARLV  G +QQ G D  ETFSPV+K  T+R+VL +A+SK W + Q+DV NAFL 
Sbjct: 994  VLDRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSVLHVAVSKGWSIRQIDVNNAFLQ 1053

Query: 964  GNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCD 1023
            G L++ VY+ QPPGF D  +P +VC L K+LYGLKQAPRAWYQ    Y+ T GF +S+ D
Sbjct: 1054 GTLSDEVYVMQPPGFVDKDNPHHVCRLHKALYGLKQAPRAWYQELRSYLLTQGFVNSIAD 1113

Query: 1024 HSLFIYHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVT 1083
             SLF         Y+L+YVDD+++T S   +    ++ L + F++KDLG +SYFLGI  T
Sbjct: 1114 TSLFTLRHKRTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEAT 1173

Query: 1084 RHSD-----------------------------DTKAKLSGTSGNPYHDPSEYRSLAGAL 1114
            R S                                  KLS TSG P   PSEYR++ G+L
Sbjct: 1174 RTSKGLHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGTPLDKPSEYRAVLGSL 1233

Query: 1115 QYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTY 1174
            QYL+FTRPDI+YAV ++  +MH P   H  A KRI+RY+ GT +HG+ +   T  KL  Y
Sbjct: 1234 QYLSFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLKLHAY 1293

Query: 1175 TDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLR 1234
            +DADW G  D   ST+ Y +YLG   +SWS+K+Q  ++RSS EAEYR VAN  SE  W+ 
Sbjct: 1294 SDADWAGDTDNYNSTNAYILYLGSTPISWSSKKQNGVARSSTEAEYRAVANATSEIRWVC 1353

Query: 1235 NLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTK 1271
            +LL EL   ++   +VYCDNV A YLS NP+   R K
Sbjct: 1354 SLLTELGITLSSPPVVYCDNVGATYLSANPVFDSRMK 1390


>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|37534632|ref|NP_921618.1| putative pol polyprotein
            [Oryza sativa (japonica cultivar-group)]
          Length = 1688

 Score =  686 bits (1771), Expect = 0.0
 Identities = 408/1087 (37%), Positives = 591/1087 (53%), Gaps = 71/1087 (6%)

Query: 281  AAMHSLSLAPP--DEQWYMDTGATSHMTANGGNLTSYSNISNNITVGSGHNIPVIGCGNA 338
            A+  S+S   P   + W +D+GA+ HM+ +   LTS   + N  TV + +          
Sbjct: 168  ASCSSISTVTPIASQPWILDSGASFHMSFDDSWLTSCRLVKNGATVHTANGTLCKVTHQG 227

Query: 339  LVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEVSVEFDPFGFSVKDFQTGMPL---M 395
             + +PQ+  T+ NV   PKL  NL+SV + T D    V FD     V+D  TG  +    
Sbjct: 228  SISSPQF--TVPNVSLVPKLSMNLISVGQLT-DTNCFVGFDDTSCFVQDRHTGAVIGTGH 284

Query: 396  RCNSSGDLYPL--ATRPYYPSTTPSTFAALSNEI------WHNRLGHPGVSILNSLHRNN 447
            R   S  LY L   + P   + TPS ++ + +        WH+RLGH   S L +L    
Sbjct: 285  RQKRSCGLYILDSLSLPSSSTNTPSVYSPMCSTACKSFPQWHHRLGHLCGSRLATLINQG 344

Query: 448  SILCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFDIVHSDIW-TSPILSSGGHHYYI 506
             +        F C+ C+LGKQ++LP+  S S +  PFD+VHSD+W  SP  S GGH+YY+
Sbjct: 345  VLGSVPVDTTFVCKGCKLGKQVQLPYPSSTSRSSRPFDLVHSDVWGKSPFPSKGGHNYYV 404

Query: 507  LFLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFERDIKCFQCDNGKEYDNSHFHQFC 566
            +F+DD++ + W + + ++SQ  SI+  F   I TQF   I+ F+ D+G EY ++ F +F 
Sbjct: 405  IFVDDYSRYTWIYFMKHRSQLISIYQSFAQMIHTQFSSAIRIFRSDSGGEYMSNAFREFL 464

Query: 567  KQNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAHASLPPSFWHHALQMATYLLNIL 626
               G + + SCP    QNG  ERK R I    RTLL  + +P  FW  A+  A YL+N+ 
Sbjct: 465  VSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFWAEAISTAVYLINMQ 524

Query: 627  PTKKLALQTPTTILYQKSPSYSHLKVFGCLCFPLIPSTTRNKLQARSTPCVFLGYPSNHR 686
            P+  L  ++P  +L+   P Y HL+VFGC C+ L+    R KL A+S  CVFLGY   H+
Sbjct: 525  PSSSLQGRSPGEVLFGSPPRYDHLRVFGCTCYVLLAPRERTKLTAQSVECVFLGYSLEHK 584

Query: 687  GYKCFELSSRKIIISRHVIFDEN-TFPFSNSNIPES--SCYNFLDTSDTPFPYHL----- 738
            GY+C++ S+R+I ISR V FDEN  F +S++N P S  +  +FL     P P  L     
Sbjct: 585  GYRCYDPSARRIRISRDVTFDENKPFFYSSTNQPSSPENSISFLYLPPIPSPESLPSSPI 644

Query: 739  FQHTSNLPTNEPNS---HDQPPTTTATPLTTPYSINTTTHQPTVAPSPQIQTSPQLTITP 795
                S +P + P+       PP+ + +P++ P S     H P  +  P + ++  L   P
Sbjct: 645  TPSPSPIPPSVPSPTYVPPPPPSPSPSPVSPPPS-----HIPASSSPPHVPSTITLDTFP 699

Query: 796  NTAS-----PHQIPPPNPLLANPPLSPQMTTRAQHGIFKPRQLLNLHTSSSNQISPL--P 848
               S     P++  P  P L +P  S   ++ A     + R  L         +  +  P
Sbjct: 700  FHYSRRPKIPNESQPSQPTLEDPTCSVDDSSPAPRYNLRARDALRAPNRDDFVVGVVFEP 759

Query: 849  TNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHKKKADGSFERY 908
            +    A+   +WK+AM +E  AL    TWD+VP PS+A  I   W+++ K K+DG  ERY
Sbjct: 760  STYQEAIVLPHWKLAMSEELAALERTNTWDVVPLPSHAVPITCKWVYKVKTKSDGQVERY 819

Query: 909  KARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVKNAFLHGNLNE 968
            KARLV  G  Q  G D  ETF+PV    T+RT++++A ++SW + Q+DVKNAFLHG+L+E
Sbjct: 820  KARLVARGFQQAHGRDYDETFAPVAHMTTVRTLIAVAATRSWTISQMDVKNAFLHGDLHE 879

Query: 969  TVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFSHSVCDHSLFI 1028
             VYM+ PPG   P  P +V  L+++LYGLKQAPRAW+ RF+  V   GFS S  D +LFI
Sbjct: 880  EVYMHPPPGVEAP--PGHVFRLRRALYGLKQAPRAWFARFSSVVLAAGFSPSDHDPALFI 937

Query: 1029 YHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFLGISVT----- 1083
            + S      +LLYVDD+++T         +  KL+ +F M DLGPLSYFLGI VT     
Sbjct: 938  HTSSRGRTLLLLYVDDMLITGDDLEYIAFVKGKLSEQFMMSDLGPLSYFLGIEVTSTVDG 997

Query: 1084 ------RHSDDTKA------------------KLSGTSGNPYHDPSEYRSLAGALQYLTF 1119
                  R+ +D  A                  +L  T G P  DPS YR L G+L YLT 
Sbjct: 998  YYLSQHRYIEDLLAQSGLTDSRTTTTPMELHVRLRSTDGTPLDDPSRYRHLVGSLVYLTV 1057

Query: 1120 TRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVDKLTTYTDADW 1179
            TRPDI+YAV  +  F+  P + H   L R++RY++GT+T  L    S+  +L  ++D+ W
Sbjct: 1058 TRPDIAYAVHILSQFVSAPISVHYGHLLRVLRYLRGTTTQCLFYAASSPLQLRAFSDSTW 1117

Query: 1180 GGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSESCWLRNLLLE 1239
               P  R+S +GYC++LG +L++W +K+Q  +SRSS EAE R +A   SE  WLR LL +
Sbjct: 1118 ASDPIDRRSVTGYCIFLGTSLLTWKSKKQTAVSRSSTEAELRALATTTSEIVWLRWLLAD 1177

Query: 1240 LQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQVRVMHVPSRYQ 1299
                    T + CDN  A+ ++ +PI+H+ TKHI +D  F R    +  + + +VPS  Q
Sbjct: 1178 FGVSCDVPTPLLCDNTGAIQIANDPIKHELTKHIGVDASFTRSHCQQSTIALHYVPSELQ 1237

Query: 1300 IADIFTK 1306
            +AD FTK
Sbjct: 1238 VADFFTK 1244


>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
          Length = 1382

 Score =  624 bits (1610), Expect = e-177
 Identities = 402/1164 (34%), Positives = 585/1164 (49%), Gaps = 97/1164 (8%)

Query: 201  QQGSW--QQPQAPQHQWAYPPYQYPSWNAPWQPWATPPCPYPTTGNWQRP---VVPNRQA 255
            Q+G W  Q P+  Q   A+        NA   P    P P+  T     P     PN  A
Sbjct: 250  QKGHWKAQCPKLRQQNQAWKSGSQSQSNAHRSPQGYKP-PHHNTAAVASPGSITDPNTLA 308

Query: 256  GILGAKPQQAHVAATPSSYAPTDIQAAMHSLSLAPPDEQWYMDTGATSHMTANGGNLTSY 315
                 +  Q  ++  P + + + I    HS S     E W +D+GA+ HM+ +  + TS 
Sbjct: 309  -----EQFQKFLSLQPQAMSASSIGQLPHSSSGISHSE-WVLDSGASHHMSPDSSSFTSV 362

Query: 316  SNISN-NITVGSGHNIPVIGCGNALVQNPQYPLTLNNVLHAPKLIKNLVSVRKFTIDNEV 374
            S +S+  +    G  +P+ G G+ +  +    L+L NV   PKL  NL S+ +     + 
Sbjct: 363  SPLSSIPVMTADGTPMPLAGVGSVVTLH----LSLPNVYLIPKLKLNLASIGQICDSGDY 418

Query: 375  SVEFDPFGFSVKDFQTGMPLMRCNSSGDLY---PLATRPYYPSTTPS------TFAALSN 425
             V F      V+D Q+   +        LY    L       +TT        + ++ S 
Sbjct: 419  LVMFSGSFCCVQDLQSQKLIGTGRRENGLYILDELKVPVVVAATTVDLSFFRLSLSSSSF 478

Query: 426  EIWHNRLGHPGVSILNSLHRNNSILCNKFRNNFFCQSCQLGKQIKLPFYESLSSTLLPFD 485
             +WH+RLGH   S L  L    ++   K  +   C  C+L K   LPF  S S +  PFD
Sbjct: 479  YLWHSRLGHVSSSRLRFLASTGALGNLKTCDISDCSGCKLAKFSALPFNRSTSVSSSPFD 538

Query: 486  IVHSDIW-TSPILSSGGHHYYILFLDDFTDFLWTFPLTNKSQAHSIFLQFCNHIKTQFER 544
            ++HSD+W  SP+ + GG  YY+ F+DD T + W + + ++S+   I+  F   IKTQ   
Sbjct: 539  LIHSDVWGPSPVSTKGGSRYYVSFIDDHTRYCWVYLMKHRSEFFEIYAAFRALIKTQHSA 598

Query: 545  DIKCFQCDNGKEYDNSHFHQFCKQNGMIFRFSCPHTSPQNGKVERKIRTINNIIRTLLAH 604
             IKCF+CD G EY ++ F Q    +G I + SC  T  QNG  ERK R I    R+LL  
Sbjct: 599  VIKCFRCDLGGEYTSNKFCQMLALDGTIHQTSCTDTPEQNGVAERKHRHIVETARSLLLS 658

Query: 605  ASLPPSFWHHALQMATYLLNILPTKKLALQTPTTILYQKSPSYSHLKVFGCLCFPLIPST 664
            A +   FW  A+  A  L+N +P+   +  +P   LY   P YS  +VFGC  F L P  
Sbjct: 659  AFVLSEFWGEAVLTAVSLINTIPSSHSSGLSPFEKLYGHVPDYSSFRVFGCTYFVLHPHV 718

Query: 665  TRNKLQARSTPCVFLGYPSNHRGYKCFELSSRKIIISRHVIFDENTFPFSNSNIPESSCY 724
             RNKL +RS  CVFLGY    +GY+CF+  ++K+ +S HV+F E+               
Sbjct: 719  ERNKLSSRSAICVFLGYGEGKKGYRCFDPITQKLYVSHHVVFLEH--------------- 763

Query: 725  NFLDTSDTPFPYHLFQHTSNLPTNEPNSHDQPPTTTATPLTTPYSINTTTHQPTVAPSPQ 784
                      P+     T++  T     H  P +  +   T+PY  +  TH         
Sbjct: 764  ---------IPFFSIPSTTHSLTKSDLIHIDPFSEDSGNDTSPYVRSICTHNSA------ 808

Query: 785  IQTSPQLTITPNTASPHQIPPPNPLLANPPLSPQMTTRAQHGIFKPRQLLNLHTSSSNQI 844
              T   L+ TP  +     P  +  + +PP  P+ + R +     P    + ++SS    
Sbjct: 809  -GTGTLLSGTPEASFSSTAPQASSEIVDPP--PRQSIRIRKSTKLPDFAYSCYSSSFTSF 865

Query: 845  SPL------PTNPINALQDHNWKMAMKDEYDALIDNKTWDLVPRPSNANIIRSLWIFRHK 898
                     P++   A+ D   + AM +E  AL    TWDLVP P   +++   W+++ K
Sbjct: 866  LAYIHCLFEPSSYKEAILDPLGQQAMDEELSALHKTDTWDLVPLPPGKSVVGCRWVYKIK 925

Query: 899  KKADGSFERYKARLVGNGSNQQTGVDCGETFSPVVKPATIRTVLSIALSKSWCLHQLDVK 958
              +DGS ERYKARLV  G +QQ G+D  ETF+P+ K  TIRT++++A  + W + QLDVK
Sbjct: 926  TNSDGSIERYKARLVAKGYSQQYGMDYEETFAPIAKMTTIRTLIAVASIRQWHISQLDVK 985

Query: 959  NAFLHGNLNETVYMYQPPGFRDPQHPDYVCLLKKSLYGLKQAPRAWYQRFTDYVATLGFS 1018
            NAFL+G+L E VYM  PPG        YVC LKK+LYGLKQAPRAW+++F+  +++LGF 
Sbjct: 986  NAFLNGDLQEEVYMAPPPGISHDS--GYVCKLKKALYGLKQAPRAWFEKFSIVISSLGFV 1043

Query: 1019 HSVCDHSLFIYHSGDDTAYILLYVDDIILTASSDTLRQSIMSKLNSEFAMKDLGPLSYFL 1078
             S  D +LFI  +      + LYVDD+I+T         + ++L   F MKDLG L YFL
Sbjct: 1044 SSSHDSALFIKCTDAGRIILSLYVDDMIITGDDIDGISVLKTELARRFEMKDLGYLRYFL 1103

Query: 1079 GISVT---------------------RHSD--------DTKAKLSGTSGNPYHDPSEYRS 1109
            GI V                      R +D        +  A+ S + G P  DP+ YR+
Sbjct: 1104 GIEVAYSPRGYLLSQSKYVANILERARLTDNKTVDTPIEVNARYSSSDGLPLIDPTLYRT 1163

Query: 1110 LAGALQYLTFTRPDISYAVQQVCLFMHDPKTQHMTALKRIIRYIKGTSTHGLHLYPSTVD 1169
            + G+L YLT T PDI+YAV  V  F+  P T H  A+ RI+RY++GT    L L  ++  
Sbjct: 1164 IVGSLVYLTITHPDIAYAVHVVSQFVASPTTIHWAAVLRILRYLRGTVFQSLLLSSTSSL 1223

Query: 1170 KLTTYTDADWGGCPDTRKSTSGYCVYLGDNLVSWSAKRQPTLSRSSAEAEYRGVANVVSE 1229
            +L  Y+DAD G  P  RKS +G+C++LGD+L+SW +K+Q  +S+SS EAEY  +A+   E
Sbjct: 1224 ELRAYSDADHGSDPTDRKSVTGFCIFLGDSLISWKSKKQSIVSQSSTEAEYCAMASTTKE 1283

Query: 1230 SCWLRNLLLELQCPVTKATLVYCDNVSAVYLSGNPIQHQRTKHIEMDIHFVREKVARGQV 1289
              W R LL ++    +  T +YCDN S++ ++ N + H+RTKHIE+D H  R  +  G +
Sbjct: 1284 IVWSRWLLADMGISFSHLTPMYCDNQSSIQIAHNSVFHERTKHIEIDCHLTRHHLKHGTI 1343

Query: 1290 RVMHVPSRYQIADIFTKGLPLQLF 1313
             +  VPS  QIAD FTK   +  F
Sbjct: 1344 ALPFVPSSLQIADFFTKAHSISRF 1367


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.319    0.134    0.418 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,516,043,403
Number of Sequences: 2540612
Number of extensions: 120596425
Number of successful extensions: 947488
Number of sequences better than 10.0: 6743
Number of HSP's better than 10.0 without gapping: 3644
Number of HSP's successfully gapped in prelim test: 3439
Number of HSP's that attempted gapping in prelim test: 739759
Number of HSP's gapped (non-prelim): 77188
length of query: 1333
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1193
effective length of database: 507,674,714
effective search space: 605655933802
effective search space used: 605655933802
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 82 (36.2 bits)


Medicago: description of AC146720.2