Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0019a.6
         (1380 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]         910  0.0
gb|AAK51235.1| polyprotein [Arabidopsis thaliana]                     875  0.0
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   870  0.0
gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein ...   845  0.0
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia...   829  0.0
gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cult...   793  0.0
dbj|BAB10876.1| polyprotein [Arabidopsis thaliana]                    790  0.0
gb|AAC02664.1| polyprotein [Arabidopsis thaliana]                     787  0.0
gb|AAC02666.1| polyprotein [Arabidopsis thaliana]                     783  0.0
gb|AAC02669.1| polyprotein [Arabidopsis thaliana]                     783  0.0
gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsi...   775  0.0
emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|49720...   760  0.0
gb|AAK43485.1| polyprotein, putative [Arabidopsis thaliana]           758  0.0
gb|AAU43956.1| unknown protein [Oryza sativa (japonica cultivar-...   757  0.0
gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir|...   734  0.0
gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cult...   730  0.0
ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (jap...   667  0.0
gb|AAD14478.1| Strong similarity to gb|AF039376 Evelknievel retr...   542  e-152
ref|XP_507106.1| PREDICTED OJ1499_A07.20 gene product [Oryza sat...   506  e-141
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha...   447  e-124

>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  910 bits (2351), Expect = 0.0
 Identities = 558/1416 (39%), Positives = 763/1416 (53%), Gaps = 108/1416 (7%)

Query: 17   ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
            +T+KLN  N+LLW  Q  ++L +Q L   V+ +V PP++  L         V   PNP +
Sbjct: 17   VTLKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEV---PNPQY 73

Query: 77   VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
              W   D+ V S L  +L+EE L    + TT+R +W +L   +   + ARE  LR  LQL
Sbjct: 74   EDWFCTDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWISLAENFNKSSIAREFSLRRNLQL 133

Query: 137  MRRGSLSVSEYGR----------KIR*SVANDDKVHWFLRGLGPSYANFST---GQLDQV 183
            + +   S+S Y R           I   V    K+  FL GLG  Y   +T     L ++
Sbjct: 134  LTKKDKSLSVYCRDFKIICDSLSSIGKPVEESMKIFGFLNGLGREYDPITTVIQSSLSKL 193

Query: 184  PLPLFTDILWKVESHAIFQASLEDYVTPPSAAFH-ARNPTRS-SGSQSSGGGHRGNSSSG 241
            P P F D++ +V+    F + L+ Y    S   H A N  RS SG+       RG   SG
Sbjct: 194  PAPTFNDVISEVQG---FDSKLQSYDDTVSVNPHLAFNTERSNSGAPQYNSNSRGRGRSG 250

Query: 242  SRPRRDNGGSHCRG-------------SYTPRCQLCRKQGHYAAKCPVRWDRPSESANLT 288
                R  GG   RG                P CQ+C + GH A KC  R+D   +S   T
Sbjct: 251  QN--RGRGGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDNNYQSEVPT 308

Query: 289  HSFAAGCSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIG 348
             +F+A   +++    + Y D+ AT+H+T S S L  +  Y GND VLVG+G  L ITH+G
Sbjct: 309  QAFSA-LRVSDETGKEWYPDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYLPITHVG 367

Query: 349  SRSASHS---VPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGK 405
            S + S S   +PL+ VLV P +  +L+SVSKL  D+     F+A+   I +  T  V+ K
Sbjct: 368  STTISSSKGTIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLTTQKVVSK 427

Query: 406  GPCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILP 465
            GP + G Y+L+  S+ +   S+    AS E WH RL H N  I+++L       V+    
Sbjct: 428  GPRNNGLYMLEN-SEFVALYSNRQCAASMETWHHRLGHSNSKILQQLLTRKEIQVNKSRT 486

Query: 466  KPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQ 525
             P+C   CQM KS RL  + +  RA   LD VHCDLWGPSPV S  GF Y+ +FVDDFS+
Sbjct: 487  SPVC-EPCQMGKSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSR 545

Query: 526  FTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHR 585
            F+WF+PL+ KS F  V + ++  VENQ    +K FQSD G EFT+NK++  F   G+ HR
Sbjct: 546  FSWFFPLRMKSKFISVFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLKEHFREHGIHHR 605

Query: 586  FLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQ 645
              CP+T  QNG  ERKHRH++ELGL+MLYHSH    +WV AF TA Y+ N +PS VL + 
Sbjct: 606  ISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSSVLKEI 665

Query: 646  IPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPT 705
             P++ LFQ    Y     FG   +PCLRP  KNKF PR   C+FLGY++ +KG++C  P 
Sbjct: 666  SPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPP 725

Query: 706  TSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPTG 765
            T + Y+SRH  FDE  FP               F E      P  Q  +L   + +  T 
Sbjct: 726  TGKVYISRHVIFDEAQFP---------------FKEKYHSLVPKYQTTLLQAWQHTDLTP 770

Query: 766  PLPCPSCVDPDV-QPVPVDDAPPSPVPH---NDAPPLPAPTS---PTPPATPLVVTQAPP 818
            P    S + P   Q  P+  +   P+ +    +A  +   TS    T          AP 
Sbjct: 771  PSVPSSQLQPLARQMTPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPV 830

Query: 819  TSPPTTPPAVTQVPLAPVDARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGF 878
             +      A+ Q  L  +     TRS +GI KPNPRYAL+         +  +   PK  
Sbjct: 831  LNDQNEDNALGQGSLENLHPM-ITRSKDGIQKPNPRYALI--------VSKSSFDEPKTI 881

Query: 879  KSAMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTC 938
             +AMKHP W  A+ DE+  +H   TW+LVP     N++ SK VF+TK   DGT+++LK  
Sbjct: 882  TTAMKHPSWNAAVMDEIDRIHMLNTWSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKAR 941

Query: 939  LVAQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVY 998
            LVA+G  Q  G DY  TFSPVV+  T+RL+L     N+  L QLDV NAFLHG L E V+
Sbjct: 942  LVAKGFDQEEGVDYLETFSPVVRTATIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVF 1001

Query: 999  MEQPHGFVD------------LVSRLMCVG*TRLSTASNRRLVLGF----SA*ALFSCVL 1042
            M QP GFVD             +  L         T SN  L  GF    S  +LF C  
Sbjct: 1002 MFQPSGFVDPNKPNHVCRLTKALYGLKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVC-- 1059

Query: 1043 VFRVVGLIHPCSFFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLG 1102
                          ++   +L LL+YVDDI+LTGSD  L+   +  LN+ F++K LG   
Sbjct: 1060 --------------HQNGQSLILLLYVDDILLTGSDQLLMDKLLQALNNRFSMKDLGPPR 1105

Query: 1103 YFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLVS-SGEGYSDPTH 1161
            YFLG+EI    +GLFL    YA D+L  A M E + + TPL    HL   + E + +PT+
Sbjct: 1106 YFLGIEIESYNNGLFLHQHAYASDILHQAGMTECNPMPTPLP--QHLEDLNSEPFEEPTY 1163

Query: 1162 YRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRT 1221
            +RSL G LQYLTITRPD+ YAVN + Q +  PT   F  +KRI+RYV    + GL  R+ 
Sbjct: 1164 FRSLAGKLQYLTITRPDIQYAVNFICQRMHAPTNSDFGLLKRILRYVKGTINMGLPIRKH 1223

Query: 1222 SSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANT 1281
             +P + G+ D+D+A C D RRST G+ I LG  L+SWSAK+QP+++HSS E+EYRA+++T
Sbjct: 1224 HNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTLISWSAKRQPTISHSSTEAEYRALSDT 1283

Query: 1282 ASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSS 1341
            A E+ W+ +LL +L +    P     DN SA++++ NP  HK +KH D D H++RE V+ 
Sbjct: 1284 AREITWISSLLRDLGISQHQPTRVFCDNLSAVYLSANPALHKRSKHFDKDFHYIRERVAL 1343

Query: 1342 GRLAVRHVPTSLQLADIFTKVLPRPLFDLFRSKLRV 1377
            G +  +H+P ++QLAD+FTK LPR  F   R+KL V
Sbjct: 1344 GLIETQHIPATIQLADVFTKSLPRRPFITLRAKLGV 1379


>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  875 bits (2260), Expect = 0.0
 Identities = 540/1428 (37%), Positives = 749/1428 (51%), Gaps = 124/1428 (8%)

Query: 17   ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
            +T+KLN  N+LLW  Q  ++L    L   V+  + PP   +           V   NP +
Sbjct: 17   VTLKLNDSNYLLWKTQFESLLSCHKLIGFVNGGITPPPRTLNVVTGDTS---VDVANPQY 73

Query: 77   VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
              W   D+ + S L  +L+EE L    +  T+RD+W +L   +   + ARE  LR  LQL
Sbjct: 74   ESWFCTDQLIRSWLFGTLSEEVLGYVHNLQTSRDIWISLAENFNKSSVAREFTLRRTLQL 133

Query: 137  MRRGSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFST---GQLDQV 183
            + +   ++S Y R+          I   V    K+  FL GLG  Y   +T     L ++
Sbjct: 134  LSKKDKTLSAYCREFIAVCDALSSIGKPVDESMKIFGFLNGLGREYDPITTVIQSSLSKI 193

Query: 184  PLPLFTDILWKVESHAIFQASLEDYVTP-PSAAFHARNPTRSSGSQSSGGGHRGNSSSGS 242
              P F D++ +V+   +   S E+ VT  P  AF   N  RS  + +   G+RG    G 
Sbjct: 194  SPPTFRDVISEVKGFDVKLQSYEESVTANPHMAF---NTQRSEYTDNYTSGNRGKGRGGY 250

Query: 243  RPRRDNGGSHCRG-------------SYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTH 289
               R   G   RG                P CQ+C + GH A KC  R+D   +S +   
Sbjct: 251  GQNRGRSGYSTRGRGFSQHQTNSNNTGERPVCQICGRTGHTALKCYNRFDHNYQSVDTAQ 310

Query: 290  SFAAGCSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
            +F++   +++S+  +   D+ AT+H+T S + L  +  Y+G+D VLVG+GA L ITH+GS
Sbjct: 311  AFSS-LRVSDSSGKEWVPDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGS 369

Query: 350  R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
                S S ++PL+ VLV P +  +L+SVSKL  D+     F+A+   I +  T  V+ KG
Sbjct: 370  TTISSDSGTLPLNEVLVCPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKG 429

Query: 407  PCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLH--KLGCFNVSSIL 464
            P   G YVL+   + +   S+    AS E+WH RL H N  I+++L   K   FN S + 
Sbjct: 430  PRSNGLYVLEN-QEFVAFYSNRQCAASEEIWHHRLGHSNSRILQQLKSSKEISFNKSRMS 488

Query: 465  PKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFS 524
            P    C  CQM KS +L  + ++ R   +L  +HCDLWGPSPV S  GF Y+V+FVDD+S
Sbjct: 489  P---VCEPCQMGKSSKLQFFSSNSRELDLLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYS 545

Query: 525  QFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLH 584
            +++WFYPLK KSDF+ V V F+  VENQF+  +KVFQSD G EFT+N ++   +  G+ H
Sbjct: 546  RYSWFYPLKAKSDFFAVFVAFQNLVENQFNTKIKVFQSDGGGEFTSNLMKKHLTDCGIQH 605

Query: 585  RFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSD 644
            R  CP+T  QNG  ERKHRH +ELGL+M++HSH    +WV AF TA ++ N +PS  L +
Sbjct: 606  RISCPYTPQQNGIAERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASFLSNMLPSPSLGN 665

Query: 645  QIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDP 704
              P + L +  P YA    FG   +PCLRP  ++KF PR   C+FLGYNS +KG++C  P
Sbjct: 666  VSPLEALLKQKPNYAMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYNSQYKGYRCLYP 725

Query: 705  TTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPA------AGSPPSLQPVILVVP 758
             T R Y+SRH  FDE  FP    K     LV    YE +      +  P + Q +I    
Sbjct: 726  PTGRVYISRHVIFDEETFPF---KQKYQFLV--PQYESSLLSAWQSSIPQADQSLIPQAE 780

Query: 759  ESSPPTGPLPCPSCVDPDVQPVPVDDAPPSPV----------------PHNDAPPLPAPT 802
            E    +   P      P +Q   + D    P                    +   L   T
Sbjct: 781  EGKIESLAKP------PSIQKNTIQDTTTQPAILTEGVLNEEEEEDSFEETETESLNEET 834

Query: 803  SPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDARPRTRS*NGIFKPNPRYALVHAQP 862
                    + V +     P  T P               TRS  GI K N RYA      
Sbjct: 835  HTQNDEAEVTVEEEVQQEPENTHPMT-------------TRSKAGIHKSNTRYA------ 875

Query: 863  TGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VF 922
              LLT+  +V  PK    A+ HP W  A+ DE+  +H   TW+LV      N++G + VF
Sbjct: 876  --LLTSKFSVEEPKSIDEALNHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNILGCRWVF 933

Query: 923  RTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQL 982
            +TK   DG+V++LK  LVA+G  Q  G DY  TFSPVV+  T+RL+L +       + QL
Sbjct: 934  KTKLKPDGSVDKLKARLVAKGFHQEEGLDYLETFSPVVRTATIRLVLDVATAKGWNIKQL 993

Query: 983  DVKNAFLHGHLTETVYMEQPHGFVDLVSRLMCVG*TR------------LSTASNRRLVL 1030
            DV NAFLHG L E VYM QP GFVD          T+              T SN  L  
Sbjct: 994  DVSNAFLHGELKEPVYMLQPPGFVDQEKPSYVCRLTKALYGLKQAPRAWFDTISNYLLDF 1053

Query: 1031 GFSA*ALFSCVLVFRVVGLIHPCSF-FYKGHITLYLLVYVDDIILTGSDPSLLT*FIACL 1089
            GFS                  P  F ++K   TL LL+YVDDI+LTGSD +LL   +  L
Sbjct: 1054 GFSC-------------SKSDPSLFTYHKNGKTLVLLLYVDDILLTGSDHNLLQELLMSL 1100

Query: 1090 NDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHL 1149
            N  F++K LG   YFLG+EI  + +GLFL    YA D+L  A M   + + TPL      
Sbjct: 1101 NKRFSMKDLGAPSYFLGVEIESSPEGLFLHQTAYAKDILHQAAMSNCNSMPTPLPQHIEN 1160

Query: 1150 VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVC 1209
            ++S + + +PT++RSL G LQYLTITRPD+ +AVN + Q + +PT   F  +KRI+RYV 
Sbjct: 1161 LNS-DLFPEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTTADFGLLKRILRYVK 1219

Query: 1210 AMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHS 1269
               H GL  ++  + +++ YSD+DWA C + RRST G+   LG NL+SWSAK+Q +V+ S
Sbjct: 1220 GTIHLGLHIKKNQNLSLVAYSDSDWAGCKETRRSTTGFCTLLGCNLISWSAKRQETVSKS 1279

Query: 1270 SCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHID 1329
            S E+EYRA+   A EL WL  LL ++ V  + P L   DN SA++++ NP  H  +KH D
Sbjct: 1280 STEAEYRALTAVAQELTWLSFLLRDIGVTQTHPTLVKCDNLSAVYLSANPALHNRSKHFD 1339

Query: 1330 LDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLFRSKLRV 1377
             D H++RE V+ G +  +H+  +LQLADIFTK LPR  F   R KL V
Sbjct: 1340 TDYHYIREQVALGLVETKHISATLQLADIFTKPLPRRAFIDLRIKLGV 1387


>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078
            gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
            Arabidopsis thaliana
          Length = 1415

 Score =  870 bits (2249), Expect = 0.0
 Identities = 540/1408 (38%), Positives = 746/1408 (52%), Gaps = 110/1408 (7%)

Query: 17   ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
            +T+KL   N+LLW  Q  ++L +Q L   V+  V  PS+  L             PNPL+
Sbjct: 17   VTLKLTDSNYLLWKTQFESLLSSQKLIGFVNGAVNAPSQSRLVVNGEVTSE---EPNPLY 73

Query: 77   VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
              W   D+ V S L  +L+EE L    + +T+R +W +L   +   + ARE  LR  LQL
Sbjct: 74   ESWFCTDQLVRSWLFGTLSEEVLGHVHNLSTSRQIWVSLAENFNKSSVAREFSLRQNLQL 133

Query: 137  MRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFST---GQLDQV 183
            + +     S Y R+ +            V    K+  FL GLG  Y   +T     L ++
Sbjct: 134  LSKKEKPFSVYCREFKTICDALSSIGKPVDESMKIFGFLNGLGRDYDPITTVIQSSLSKL 193

Query: 184  PLPLFTDILWKVESHAIFQASLEDYVTPPSAAFH-ARNPTRS-SGSQSSGGGHRGNSSSG 241
            P P F D++ +V+    F + L+ Y    S   H A N  RS SGS       +G   SG
Sbjct: 194  PTPTFNDVVSEVQG---FDSKLQSYEEAASVTPHLAFNIERSESGSPQYNPNQKGRGRSG 250

Query: 242  SRPRRDNGGSHCRG-----------SYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHS 290
                R    +  RG              P CQ+C + GH A KC  R+D      N    
Sbjct: 251  QNKGRGGYSTRGRGFSQHQSSPQVSGPRPVCQICGRTGHTALKCYNRFDN-----NYQAE 305

Query: 291  FAAGCSLNNSNRSDK--YMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIG 348
              A  +L  S+ + K  + D+ AT+H+T S + L  +  Y G+D VLVG+G  L ITH G
Sbjct: 306  IQAFSTLRVSDDTGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTG 365

Query: 349  S---RSASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGK 405
            S   +S++  +PL+ VLVVP +  +L+SVSKL  D+     F+A+   I + QT  V+  
Sbjct: 366  STTIKSSNGKIPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLQTQKVVTT 425

Query: 406  GPCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILP 465
            GP   G YVL+      L ++     A+ E+WH RL H N   ++ L       ++    
Sbjct: 426  GPRRNGLYVLENQEFVALYSNRQCA-ATEEVWHHRLGHANSKALQHLQNSKAIQINKSRT 484

Query: 466  KPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQ 525
             P+C   CQM KS RL    +  R    LD +HCDLWGPSPV S  G  Y+ IFVDD+S+
Sbjct: 485  SPVC-EPCQMGKSSRLPFLISDSRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSR 543

Query: 526  FTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHR 585
            ++WFYPL  KS+F  V + F+  VENQ +  +KVFQSD G EF +NK++   S  G+ HR
Sbjct: 544  YSWFYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQSDGGGEFVSNKLKTHLSEHGIHHR 603

Query: 586  FLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQ 645
              CP+T  QNG  ERKHRH++ELGL+ML+HSH    +WV +F TA YIINR+PS VL + 
Sbjct: 604  ISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYIINRLPSSVLKNL 663

Query: 646  IPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPT 705
             P++ LF   P Y++   FG   +PCLRP  +NKF PR   C+FLGYNS +KG++CF P 
Sbjct: 664  SPYEALFGEKPDYSSLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQYKGYRCFYPP 723

Query: 706  TSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPTG 765
            T + Y+SR+  F+E   P    K     LV      P   +P         + E S P  
Sbjct: 724  TGKVYISRNVIFNESELPF---KEKYQSLV------PQYSTPLLQAWQHNKISEISVPAA 774

Query: 766  PLPCPSCVDPDVQPVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQAPPTSPPTTP 825
            P+   S      +P+ ++    S V        P PTS    +   V       +P    
Sbjct: 775  PVQLFS------KPIDLNTYAGSQVTEQLTD--PEPTSNNEGSDEEV-------NPVAEE 819

Query: 826  PAVTQVPLAPVDARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHP 885
             A  Q  +    A   TRS  GI KPN RYAL+        T+      PK   SAMKHP
Sbjct: 820  IAANQEQVINSHAM-TTRSKAGIQKPNTRYALI--------TSRMNTAEPKTLASAMKHP 870

Query: 886  HWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLT 945
             W  A+ +E++ +H   TW+LVP     N++ SK VF+TK H DG++++LK  LVA+G  
Sbjct: 871  GWNEAVHEEINRVHMLHTWSLVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFD 930

Query: 946  QIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGF 1005
            Q  G DY  TFSPVV+  T+RL+L +       + QLDV NAFLHG L E V+M QP GF
Sbjct: 931  QEEGVDYLETFSPVVRTATIRLVLDVSTSKGWPIKQLDVSNAFLHGELQEPVFMYQPSGF 990

Query: 1006 VD------------LVSRLMCVG*TRLSTASNRRLVLGF----SA*ALFSCVLVFRVVGL 1049
            +D             +  L         T SN  L  GF    S  +LF C         
Sbjct: 991  IDPQKPTHVCRLTKAIYGLKQAPRAWFDTFSNFLLDYGFVCSKSDPSLFVC--------- 1041

Query: 1050 IHPCSFFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEI 1109
                   ++    LYLL+YVDDI+LTGSD SLL   +  L + F++K LG   YFLG++I
Sbjct: 1042 -------HQDGKILYLLLYVDDILLTGSDQSLLEDLLQALKNRFSMKDLGPPRYFLGIQI 1094

Query: 1110 TYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLVSSGEGYSDPTHYRSLVGAL 1169
               A+GLFL    YA D+L  A M + + + TPL      ++S E +++PT++RSL G L
Sbjct: 1095 EDYANGLFLHQTAYATDILQQAGMSDCNPMPTPLPQQLDNLNS-ELFAEPTYFRSLAGKL 1153

Query: 1170 QYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGY 1229
            QYLTITRPD+ +AVN + Q + +PT   F  +KRI+RY+      GL  +R S+  +  Y
Sbjct: 1154 QYLTITRPDIQFAVNFICQRMHSPTTSDFGLLKRILRYIKGTIGMGLPIKRNSTLTLSAY 1213

Query: 1230 SDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLL 1289
            SD+D A C + RRST G+ I LG NL+SWSAK+QP+V++SS E+EYRA+   A E+ W+ 
Sbjct: 1214 SDSDHAGCKNTRRSTTGFCILLGSNLISWSAKRQPTVSNSSTEAEYRALTYAAREITWIS 1273

Query: 1290 NLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHV 1349
             LL +L +    P     DN SA++++ NP  H  +KH D D H++RE V+ G +  +H+
Sbjct: 1274 FLLRDLGIPQYLPTQVYCDNLSAVYLSANPALHNRSKHFDTDYHYIREQVALGLIETQHI 1333

Query: 1350 PTSLQLADIFTKVLPRPLFDLFRSKLRV 1377
              + QLAD+FTK LPR  F   RSKL V
Sbjct: 1334 SATFQLADVFTKSLPRRAFVDLRSKLGV 1361


>gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein [Zea mays]
            gi|7444442|pir||T02087 gag/pol polyprotein - maize
            retrotransposon Hopscotch
          Length = 1439

 Score =  845 bits (2182), Expect = 0.0
 Identities = 548/1441 (38%), Positives = 756/1441 (52%), Gaps = 120/1441 (8%)

Query: 17   ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPI-VAPPSEFMLQPVSSALHHVVPAPNPL 75
            ++ KL  GN+LLW  QV   ++   L DI+  + + PP     + +S A    V   NP 
Sbjct: 20   VSEKLTKGNYLLWKAQVLPAIRAAQLDDILTGVEICPP-----KTISDASDRTVTVANPA 74

Query: 76   FVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQ 135
            + +W ARD+ V   LLSSL+ E LS  ++ +T+  VWT L   Y+S ++AR++  R  L 
Sbjct: 75   YGRWIARDQAVLGYLLSSLSREVLSSVVNCSTSASVWTTLSEMYSSHSRARKVNTRIALA 134

Query: 136  LMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFSTGQLDQ--- 182
              ++G+ SV+EY  K+R            + +++ V + L GL   +    T  + +   
Sbjct: 135  TTKKGASSVAEYFAKMRGFADELGAAGKPLDDEEFVSFLLTGLDEDFNPLVTAVVARSDP 194

Query: 183  -VPLPLFTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGG-------- 233
              P  L+T +L   E+    Q      +    ++ +AR+P R      SGG         
Sbjct: 195  ITPGDLYTQLL-SYENRMHLQTGSSSLM---QSSANARSPGRGMSWGRSGGRGFSRGRGR 250

Query: 234  ----HRGNSSSGSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTH 289
                 RG   S  R    +G +    S  PRCQ+C + GH A  C   W R  E+     
Sbjct: 251  GRGPSRGGFQSFGRGNNYSGATDADTSSRPRCQVCSRVGHTALNC---WYRFDENYVPDQ 307

Query: 290  SFAAGCSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
              A   +  N +    Y DTGAT H+T  L +LT    Y+G D+++  NG G+ I++IG+
Sbjct: 308  RSANSAAHQNGSNVPWYTDTGATDHITGDLDRLTMHDKYTGTDQIIAANGTGMTISNIGN 367

Query: 350  R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
                ++S S+ L +VL VP    NL+SV +LT D++V   F +  F+I++RQT AVL  G
Sbjct: 368  AIVPTSSRSLHLRSVLHVPSTHKNLISVHRLTNDNDVFIEFHSSHFLIKDRQTKAVLLHG 427

Query: 407  PCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKL---HKLGCFNVSSI 463
             C  G Y L       L  + S  R   E WH RL H + DI+ ++   + L C + +S 
Sbjct: 428  KCRDGLYPLPPHPDLRLKHNFSSTRVPLEHWHKRLGHPSRDIVHRVISNNNLPCLSNNST 487

Query: 464  LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
                  C +C  AK+ +L    +  ++SA L L+  D++GP+ + S   + Y+V F+DD+
Sbjct: 488  TS---VCDACLQAKAHQLPYTISMSQSSAPLMLIFSDVFGPA-IDSFGRYKYYVSFIDDY 543

Query: 524  SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
            S+FTW Y L+ KSD Y     F+  VE  F R +  FQSD G E+   K+ A F + G+ 
Sbjct: 544  SKFTWIYLLRHKSDVYKSFCEFQHLVERMFGRKIIAFQSDWGGEY--EKLNAHFKTIGIH 601

Query: 584  HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
            H+  CPHT  QNG  ERKHRH++E+GLA+L  S +   YW HAF  AVY+INR PSK ++
Sbjct: 602  HQVSCPHTHQQNGAAERKHRHIVEVGLALLAQSSMPLKYWDHAFLAAVYLINRTPSKTIA 661

Query: 644  DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
               P   L    P Y++   FGC  +P LRPY ++K   R T C+FLGY++ HKGFKC D
Sbjct: 662  HDTPLHKLTGATPDYSSLRIFGCACWPNLRPYNQHKLQFRSTRCVFLGYSNMHKGFKCLD 721

Query: 704  PTTSRTYVSRHAQFDEFCFPLTGSKSS------------PND------LVVFTFYEPAAG 745
             +T R Y+SR   FDE  FP      +            P+D      L       P + 
Sbjct: 722  ISTGRIYISRDVVFDEHVFPFASLNKNAGVKYTSEVLLLPHDSCGNNMLTDHANNLPGSS 781

Query: 746  SP---------------PSLQPVILVVPESSPPTGPLP---CPSCVDPDVQPVPVD-DAP 786
            SP               P+     + +P S P    +P    PS + P   P P    A 
Sbjct: 782  SPLPFLAQHFLQGNSEVPTSNNTAMALPASGPNEVSVPPALVPSSLVPAASPAPTGVSAN 841

Query: 787  PSPVPHNDA----PPLPAPT-SPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDARPR 841
              P P  D+    PP+   + +  P A PL+  QAP +S     P     PL+   A PR
Sbjct: 842  AEPAPEADSLSSGPPVATESVTGVPDADPLL--QAPGSSVAHQTP--DSAPLSA--AAPR 895

Query: 842  TRS*NGIFKPNP------RYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDEL 895
            TR  +GI KP        RY            A   +T P     A+  P W  AME E 
Sbjct: 896  TRLQHGISKPKQFTDGTVRYG----------NAAARITEPSSVSEALADPQWRAAMEAEF 945

Query: 896  SALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLT 955
             AL K  TWTLVP   T N++  K VF+ K+++DG+++RLK  LVA+G  Q  G DY  T
Sbjct: 946  QALQKNNTWTLVPPDRTRNLIDCKWVFKVKYNADGSIDRLKARLVAKGFKQQYGIDYDDT 1005

Query: 956  FSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDLVSRLMCV 1015
            FSPVVK +T+RL+LSL V     L QLDV+NAFLHG L ETVYM+QP GF D        
Sbjct: 1006 FSPVVKHSTIRLVLSLAVSQKWSLRQLDVQNAFLHGILEETVYMKQPPGFADTTHPNYHC 1065

Query: 1016 G*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGHIT-LYLLVYVDDIIL 1074
               +      +R    +S  +     L F V        F Y  H T +Y+LVYVDDII+
Sbjct: 1066 HLQKSLYGLKQRPRAWYSRLSEKLQSLGF-VPSKADVSLFIYNAHSTAIYILVYVDDIII 1124

Query: 1075 TGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALML 1134
            TGS P  +   +A L D+FAIK LG L YFLG+E+    DGL L   KYA DLL    M 
Sbjct: 1125 TGSSPHAIDNVLAKLKDDFAIKDLGDLHYFLGIEVHRKGDGLLLCQEKYARDLLKRVGME 1184

Query: 1135 EASHVLTPLAAGSHLVSSGEGYSDP---THYRSLVGALQYLTITRPDLSYAVNTVSQFLQ 1191
                V TP+A    L +S      P   T YRS+VGALQYLT+TRPDLSYA+N V QFL 
Sbjct: 1185 CCKPVHTPVATSEKLSASAGTLLSPEETTKYRSVVGALQYLTLTRPDLSYAINRVCQFLH 1244

Query: 1192 TPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFL 1251
             PT  H+ AVKRI+R +      GLT R + S  +  +SDADWA C D R+ST GYA+FL
Sbjct: 1245 APTDLHWTAVKRILRNIQHTIGLGLTIRPSLSLMLSAFSDADWAGCPDDRKSTGGYALFL 1304

Query: 1252 GYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQS 1311
            G NL+SW++KKQ +V+ SS E+EY+AMAN  +E++WL +LLHEL +RL+  P    DN  
Sbjct: 1305 GPNLISWNSKKQSTVSRSSTEAEYKAMANATAEVIWLQSLLHELGIRLTGIPRLWCDNLG 1364

Query: 1312 ALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLF 1371
            A +++  P+ +   KHI++D HFVR+ V S +L +R + T+ Q+AD FTK L     + F
Sbjct: 1365 ATYLSSKPIFNARTKHIEVDFHFVRDRVLSKKLDIRLISTNDQVADGFTKALTIGRLNEF 1424

Query: 1372 R 1372
            R
Sbjct: 1425 R 1425


>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
            gi|4539447|emb|CAB40035.1| retrotransposon like protein
            [Arabidopsis thaliana] gi|7444419|pir||T04204
            hypothetical protein T4F9.150 - Arabidopsis thaliana
          Length = 1515

 Score =  829 bits (2142), Expect = 0.0
 Identities = 516/1387 (37%), Positives = 735/1387 (52%), Gaps = 109/1387 (7%)

Query: 73   NPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRD 132
            N  F++W   D+ V + +  SL+EEAL   +   +A++VW  L   +   +  R+  L+ 
Sbjct: 66   NQEFLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQK 125

Query: 133  ELQLMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFST---GQ 179
             L    +   ++  Y  +++            V   +K+   L GLG  Y + +T     
Sbjct: 126  RLGTCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHS 185

Query: 180  LDQVPLPLFTDILWKVESHAIFQASLEDYVT----PPSAAFHARNPTRSSGSQSSGGGHR 235
            LD  P P F D+++K+ +   F   L  Y       P  AF+      S G+ +S GG  
Sbjct: 186  LDVYPGPCFDDVVYKLTT---FDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNSRGGRY 242

Query: 236  GN-------SSSGSRPRRDNGGSHCRGSYT---PRCQLCRKQGHYAAKCPVRWDRPSESA 285
            GN       SS G    +  G     GS     P CQ+CRK GH A KC  R++      
Sbjct: 243  GNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFKCYTRFEENYLPE 302

Query: 286  NLTHSFAA-GCSLNNSNRSDKYM-DTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLH 343
            +L ++FAA   S  N   S +++ D+ AT+H+T++   L  S  YSG+D V+VGNG  L 
Sbjct: 303  DLPNAFAAMRVSDQNQASSHEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLP 362

Query: 344  ITHIGS---RSASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTG 400
            ITHIG+     +  ++PL +VLV P +T +L+SVSKLT D+     F++DS VI++++T 
Sbjct: 363  ITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQ 422

Query: 401  AVLGKGPCDKGFYVL-DQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFN 459
             +L +G   KG YVL D   Q   +T      +  E+WH RL H N ++++ L K     
Sbjct: 423  QLLTQGNKHKGLYVLKDVPFQTYYSTRQQ--SSDDEVWHQRLGHPNKEVLQHLIKTKAIV 480

Query: 460  VSSILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIF 519
            V+        C +CQM K  RL    +   +S  L+ +HCDLWGP+PV S  GF Y+VIF
Sbjct: 481  VNKTSSN--MCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIF 538

Query: 520  VDDFSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSS 579
            +D++S+FTWFYPLK KSDF+ V V F+  VENQ+   + +FQ D G EF + K  A  +S
Sbjct: 539  IDNYSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGGEFVSYKFVAHLAS 598

Query: 580  SGVLHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPS 639
             G+     CPHT  QNG  ER+HR++ ELGL++++HS V    WV AF T+ ++ N +PS
Sbjct: 599  CGIKQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHKLWVEAFFTSNFLSNLLPS 658

Query: 640  KVLSD-QIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKG 698
              LSD + P+++L    P Y     FG   +P LRPY KNKF P+   C+FLGYN+ +KG
Sbjct: 659  STLSDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKFDPKSLLCVFLGYNNKYKG 718

Query: 699  FKCFDPTTSRTYVSRHAQFDEFCFPLTGSKSSPNDL---VVFTFYEPAAGS------PPS 749
            ++C  P T + Y+ RH  FDE  FP +   S    +    +FT ++    S       PS
Sbjct: 719  YRCLHPPTGKVYICRHVLFDERKFPYSDIYSQFQTISGSPLFTAWQKGFSSTALSRETPS 778

Query: 750  LQPVILVVP----ESSPPTGPLP--CPSCVDPDVQPVPVDD--APPSPVPHNDAPPLP-- 799
                 ++ P     SS PTG  P    +   PDV      D   PPSP+     P  P  
Sbjct: 779  TNVEDIIFPSATVSSSVPTGCAPNIAETATAPDVDVAAAHDMVVPPSPITSTSLPTQPEE 838

Query: 800  -----------APTSPTPPATP--LVVTQAPPTSPPTTPPAVTQVPLAPVDARPR-TRS* 845
                       + T+ +   TP  + V+    +  P     ++    AP  + P  TR+ 
Sbjct: 839  STSDQNHYSTDSETAISSAMTPQSINVSLFEDSDFPPLQSVISSTTAAPETSHPMITRAK 898

Query: 846  NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWT 905
            +GI KPNP+YA        L +       PK  K A+K   W  AM +E+  +H+  TW 
Sbjct: 899  SGITKPNPKYA--------LFSVKSNYPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWD 950

Query: 906  LVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTV 965
            LVP      ++G K VF+TK +SDG+++RLK  LVA+G  Q  G DY  T+SPVV++ TV
Sbjct: 951  LVPPEMVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATV 1010

Query: 966  RLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVD------------LVSRLM 1013
            R IL +  +N   L QLDVKNAFLH  L ETV+M QP GF D             +  L 
Sbjct: 1011 RSILHVATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFEDPSRPDYVCKLKKAIYDLK 1070

Query: 1014 CVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFY-KGHITLYLLVYVDDI 1072
                      S+  L  GF             +     P  F Y KG   ++LL+YVDD+
Sbjct: 1071 QAPRAWFDKFSSYLLKYGF-------------ICSFSDPSLFVYLKGRDVMFLLLYVDDM 1117

Query: 1073 ILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHAL 1132
            ILTG++  LL   +  L+ EF +K +G L YFLG++  Y  DGLFL   KY  DLL +A 
Sbjct: 1118 ILTGNNDVLLQQLLNILSTEFRMKDMGALHYFLGIQAHYHNDGLFLSQEKYTSDLLVNAG 1177

Query: 1133 MLEASHVLTPLAAGSHLVSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQT 1192
            M + S + TPL     L  + + + +PT++R L G LQYLT+TRPD+ +AVN V Q +  
Sbjct: 1178 MSDCSSMPTPLQL-DLLQGNNKPFPEPTYFRRLAGKLQYLTLTRPDIQFAVNFVCQKMHA 1236

Query: 1193 PTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLG 1252
            PT+  F  +KRI+ Y+      G+     +   +  YSD+DWA C D RRST G+  FLG
Sbjct: 1237 PTMSDFHLLKRILHYLKGTMTMGINLSSNTDSVLRCYSDSDWAGCKDTRRSTGGFCTFLG 1296

Query: 1253 YNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSA 1312
            YN++SWSAK+ P+V+ SS E+EYR ++  ASE+ W+  LL E+ +     P    DN SA
Sbjct: 1297 YNIISWSAKRHPTVSKSSTEAEYRTLSFAASEVSWIGFLLQEIGLPQQQIPEMYCDNLSA 1356

Query: 1313 LFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLFR 1372
            ++++ NP  H  +KH  +D ++VRE V+ G L V+H+P S QLADIFTK LP+  F   R
Sbjct: 1357 VYLSANPALHSRSKHFQVDYYYVRERVALGALTVKHIPASQQLADIFTKSLPQAPFCDLR 1416

Query: 1373 SKLRVGL 1379
             KL V L
Sbjct: 1417 FKLGVVL 1423


>gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1437

 Score =  793 bits (2049), Expect = 0.0
 Identities = 516/1446 (35%), Positives = 746/1446 (50%), Gaps = 118/1446 (8%)

Query: 17   ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
            ++ KL   N  +W  Q+ A ++   L   +     PP+  + +         V   NP +
Sbjct: 18   VSEKLGKSNHAVWKAQILATIRGARLEGHLTGDDQPPAPILRRKEGEK---EVVVSNPEY 74

Query: 77   VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
             +W A D+ V + LLSS+T++ L +  +  TA   W+ +   + S  +AR +  R  L  
Sbjct: 75   EEWVATDQQVLAYLLSSMTKDLLVQVATCRTAASAWSMIQGMFGSMTRARTINTRLSLST 134

Query: 137  MRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYA---NFSTGQLDQV 183
            +++G ++++ Y  K+R            V +D+ + +   GL   +    +   G+ D V
Sbjct: 135  LQKGDMNITTYVGKMRALADDLMAVGKPVDDDELIGYIFAGLDDEFEPVISTIVGRPDPV 194

Query: 184  PLPLFTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGGHRGNSSSGSR 243
             +      L   E     + S +   +  ++A  +R   +  GS+S G  +RG  +  + 
Sbjct: 195  TIGETYAQLISFEQRLAHRRSGDQ--SSVNSASRSRGQPQRGGSRSGGDSNRGRGAPSNG 252

Query: 244  PRRDNGGSHCRGSYT---------PRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAG 294
              R  G  +  G            P+CQLC K+GH    C   W R  E+      FA G
Sbjct: 253  ANRGRGRGNPSGGRANVGGGTDNRPKCQLCYKRGHTVCDC---WYRYDENFVPDERFA-G 308

Query: 295  CSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS---RS 351
             +++    ++ Y+DTGAT H+T  L +LT    Y GND+V   +GAG+ I+HIG+   ++
Sbjct: 309  TAVSYGVDTNWYLDTGATDHVTGELDKLTVRDKYHGNDQVHTASGAGMEISHIGNSVVKT 368

Query: 352  ASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKG 411
             S ++ L +VL VP+   NLVS  KLT D+          F I++      L +G C KG
Sbjct: 369  PSRNLHLKDVLYVPKANKNLVSAYKLTSDNLAFIELYRKFFFIKDLAMRRTLLRGRCHKG 428

Query: 412  FYVLDQGSQALLATSS--SLPRASFELWHSRLDHVNFDIIKKLHK---LGCFNVSSILPK 466
             Y L   S           + + SFE WHSRL H ++ +++K+ K   L C +VS  +  
Sbjct: 429  LYALPSPSSHHHQVKQVYGVTKPSFERWHSRLGHPSYTVVEKVIKSQNLPCLDVSEQVS- 487

Query: 467  PICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQF 526
               C +CQ AKS +L    +   +   L+LV  D+WGP+P  SV    Y+V F+DD+S+F
Sbjct: 488  --VCDACQKAKSHQLSFPKSTSESKYPLELVFSDVWGPAP-QSVGNNKYYVSFIDDYSKF 544

Query: 527  TWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRF 586
            TW Y LK KS+ +D    F++ VE  F+R +   Q+D G E+   K+ + F+  G+ H  
Sbjct: 545  TWIYLLKYKSEVFDKFHEFQSLVERLFNRKIVAMQTDWGGEY--QKLHSFFNKVGITHHV 602

Query: 587  LCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQI 646
             CPHT  QNG  ERKHRH++E+GLA+L +S +   +W  AF +AVY+INR PS+VL D  
Sbjct: 603  SCPHTHQQNGSAERKHRHIVEVGLALLAYSSMPLKFWGEAFLSAVYLINRTPSRVLHDVS 662

Query: 647  PFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTT 706
            P + L    P Y     FGC  +P LRPY K+K   R T C FLGY++ HKGFKC DP+T
Sbjct: 663  PLERLLGHKPDYNALRVFGCACWPNLRPYNKHKLQFRSTTCTFLGYSTLHKGFKCLDPST 722

Query: 707  SRTYVSRHAQFDEFCFPLTGSKSSPN-----DLVVFTFYEPAAGSPPSLQPVILVV--PE 759
             R Y+SR   FDE  FP T  K  PN        +    E AA  P  LQ +  V+  PE
Sbjct: 723  GRVYISRDVVFDETQFPFT--KLHPNVGAKLRAEIALVPELAASLPRGLQQISSVINTPE 780

Query: 760  SSPPTGPLPCPSCVDPDVQPVPVDDAPP-----SPVPHNDAPPLPAPTSP---------T 805
            ++  +           +      D AP      +P   + +PP+  P SP         +
Sbjct: 781  NANVSNENMQQDSTYDNEPETETDGAPDTVSANAPAESSGSPPINEPASPFGESDSATAS 840

Query: 806  PPATPLVVTQAPP-----TSPPTTPPAVTQVPLAPVD-------------ARPRTRS*NG 847
            P + P+     P      +S P    +    P   +D              RPRTR  +G
Sbjct: 841  PASAPVNSAPHPDAAASGSSAPRGSTSQGGTPSVAIDDPHPATTVTGQEAQRPRTRLQSG 900

Query: 848  IFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLV 907
            I K       V+   T     L +   P+  + A+++ +W  AM+ E  AL K  TW LV
Sbjct: 901  IRKEK-----VYTDGTVKWGMLTSTGEPENLQDALQNNNWKCAMDAEYMALIKNNTWHLV 955

Query: 908  PRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRL 967
            P     NV+  K V++ K   DG+++R K  LVA+G  Q  G DY  TFSPVVKA T+R+
Sbjct: 956  PPQQGRNVIDCKWVYKIKRKQDGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRI 1015

Query: 968  ILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGF-----VDLVSRL--MCVG*TRL 1020
            ILS+ V     L QLDV+NAFLHG L E VYM+QP G+      D V +L     G  + 
Sbjct: 1016 ILSIAVSRGWCLRQLDVQNAFLHGVLEEEVYMKQPPGYENPSTPDYVCKLDKALYGLKQA 1075

Query: 1021 STASNRRLV-----LGFSA*ALFSCVLVFRVVGLIHPCSFFYKGHITLYLLVYVDDIILT 1075
              A   RL      LGF      + +             F+ KG +T++LL+YVDDII+ 
Sbjct: 1076 PRAWYSRLSGKLHDLGFKGSKADTSLF------------FYNKGSLTIFLLIYVDDIIVV 1123

Query: 1076 GSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLE 1135
             S    ++  +  L  EFA+K LG L YFLG+E+T    G+ +   KYA DLL    M +
Sbjct: 1124 SSRKEAVSALLQDLQKEFALKDLGDLHYFLGIEVTKIPGGILMSQEKYASDLLKRVNMSD 1183

Query: 1136 ASHVLTPLAAGSHLVSSGE---GYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQT 1192
               V TPL+A   L++      G +D T YRS+VGALQYLT+TR D++++VN V QFL  
Sbjct: 1184 CKSVATPLSASEKLIAGKGTILGPNDATQYRSIVGALQYLTLTRLDIAFSVNKVCQFLHN 1243

Query: 1193 PTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLG 1252
            PT +H+ AVKRI+RY+      GL   ++SS  V GYSDADWA C D RRST G+A++LG
Sbjct: 1244 PTTEHWAAVKRILRYIKQCTGLGLRICKSSSMIVSGYSDADWAGCLDDRRSTGGFAVYLG 1303

Query: 1253 YNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSA 1312
             NL+SW+AKKQ +V+ SS E+EY+A+AN  +E++W+  LL EL +   A      DN  A
Sbjct: 1304 DNLVSWNAKKQATVSRSSTEAEYKALANATAEIMWVQTLLQELNIVSPAMAQLWCDNMGA 1363

Query: 1313 LFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLFR 1372
             +++ NPV H   KHI++D HFVRE V+   L V +V T+ Q+AD FTK LP    + F+
Sbjct: 1364 KYLSFNPVFHARTKHIEVDYHFVRERVARKLLQVDYVSTNDQVADGFTKALPVKQLENFK 1423

Query: 1373 SKLRVG 1378
              L +G
Sbjct: 1424 YNLNLG 1429


>dbj|BAB10876.1| polyprotein [Arabidopsis thaliana]
          Length = 1429

 Score =  790 bits (2041), Expect = 0.0
 Identities = 511/1459 (35%), Positives = 752/1459 (51%), Gaps = 142/1459 (9%)

Query: 20   KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
            KL S NFL+W +QV A+L   +L   VD  +        +P ++   H V +PNP +  W
Sbjct: 6    KLTSTNFLMWRRQVHALLDGYDLAGYVDGSIE-------EPHTTVTVHGVTSPNPEYKLW 58

Query: 80   QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQLMRR 139
            + +D+ + S L+ +++         ATT+  +W  L   YA+ ++  + ++R++++  ++
Sbjct: 59   KRQDKLIYSGLIGAISVAVQPLLSQATTSAQIWRKLVDTYANPSRGHKQQIREQIKQWKK 118

Query: 140  GSLSVSEYG----------RKIR*SVANDDKVHWFLRGLGPSYANFSTGQLDQVP----L 185
            GS S+ +Y             +  ++ ++D++ + L GL   Y       +DQ+      
Sbjct: 119  GSRSIDDYVLGLTTRFDQLALLEEAIPHEDQIAYILGGLSDDYRRV----IDQIEGRDIS 174

Query: 186  PLFTDILWKVESHAI-FQASLEDYVTPPSA-AFHARNPTRSSGSQSSGGGHRGNS---SS 240
            P  T++  K+ +  +  QA + D  TP +A A    N      ++SS  G++ N    + 
Sbjct: 175  PSITELHEKLINFELKLQAMVPDSSTPVTANAASYNNNNNGRNNRSSSRGNQNNQWQQNQ 234

Query: 241  GSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGCSLNNS 300
              + R +N GS  +G Y  RCQ+C   GH A +C     +P   +  + S  +G   N  
Sbjct: 235  TQQSRSNNRGSQGKG-YQGRCQICGVHGHSARRCSQF--QPYGGSGGSQSVPSGYPTNGY 291

Query: 301  NRSDK------------------YMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGL 342
            + S                     +D+GAT H+T  L+ L+    Y+G + V + +G+GL
Sbjct: 292  SPSPMAPWQPRANIATAPPFNPWVLDSGATHHLTSDLANLSMHQPYTGGEEVTIADGSGL 351

Query: 343  HITHIGSR---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQT 399
             I+H GS    + S S+ L ++L VP ++ NL+SV +L   + V   F    F +++  T
Sbjct: 352  PISHTGSALLPTPSRSLALKDILYVPNVSKNLISVYRLCNANQVSVEFFPAHFQVKDLNT 411

Query: 400  GAVLGKGPCDKGFYVLDQGSQAL-LATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCF 458
            GA L +G      Y      +++ + T+S  P+     WH RL H    I+K +      
Sbjct: 412  GARLLQGRTRNELYEWPVNQKSITILTASPSPKTDLSSWHQRLGHPALPILKDVVSHFHL 471

Query: 459  NVSSILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVI 518
             +S+ +PK + C+ C + KS +L  + N   +S  L+ ++ D+W  SP  SVD + Y+++
Sbjct: 472  PLSNTIPKQLPCSDCSINKSHKLPFFTNTIVSSQPLEYLYTDVW-TSPCISVDNYKYYLV 530

Query: 519  FVDDFSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFS 578
             VD F+++TW YPLK+KS   DV V FKA VEN+F   ++   SDNG EF    ++   +
Sbjct: 531  IVDHFTRYTWMYPLKQKSQVKDVFVAFKALVENRFQSRIRTLYSDNGGEFIG--LRPFLA 588

Query: 579  SSGVLHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVP 638
            + G+ H    PHT   NG  ERKHRH++E GLA+L H+ +  ++W +AF+TAVY+INR+P
Sbjct: 589  AHGISHLTSPPHTPEHNGLAERKHRHIVETGLALLTHASLPKTFWTYAFATAVYLINRMP 648

Query: 639  SKVLSDQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKG 698
            ++VL    P+  LFQ++P Y     FGC  +P LRPY  NK   R T C+FLGY+     
Sbjct: 649  TEVLQGTSPYVKLFQMSPNYLKLRVFGCLCYPWLRPYNTNKLEARSTMCVFLGYSLTQSA 708

Query: 699  FKCFDPTTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVP 758
            + C D  T+R Y SRH QF E  FP    ++S  D    T  +P   +  ++ P++   P
Sbjct: 709  YLCLDIATNRIYTSRHVQFVESSFPFASPRTSETDSTQ-TMSQP---TTTNVIPLLQRPP 764

Query: 759  ESSPPTGPLPCP----------SCVDPDVQPVPV-----------DD-----------AP 786
              +PPT    CP          S   P  + VP+           DD            P
Sbjct: 765  HIAPPTALPLCPIFHSPPHSPSSPASPPSEHVPLTAASSSSNAINDDNISSTGQVSVSGP 824

Query: 787  PSPVPH---NDAPPLPAPTSPTPPATPLVVTQAPPTSP--------PTTPPAVTQVPLAP 835
             S  PH    +    P   SP P  T       PPTSP        PT  P     PL P
Sbjct: 825  TSQSPHTTPTNQNTSPLSKSPNPTNTNQSQNSTPPTSPTTSVHQHSPTPSPLPQNPPLPP 884

Query: 836  V---DARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAME 892
                D   RTR+ N I KP  ++ L     T L ++  T+  P     A+K P+W  AM 
Sbjct: 885  PPQNDHPMRTRAKNQITKPKTKFNLT----TSLTSSKPTI--PTTVAQALKDPNWRNAMS 938

Query: 893  DELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDY 952
            +E++A  K  TW LV      +V+  K +F  K++ DG++ R K  LVA+G  Q  G DY
Sbjct: 939  EEINAQMKNHTWDLVSPEEAKHVISCKWIFTLKYNVDGSIARYKARLVARGFNQQYGIDY 998

Query: 953  SLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL---- 1008
            S TFSPV+K+TT+R +L + V  +  +HQ+D+ NAFL G L E VY+ QP GF+D     
Sbjct: 999  SETFSPVIKSTTIRTVLEVAVKRNWSIHQVDINNAFLQGTLNEEVYVSQPPGFIDRDRPS 1058

Query: 1009 -VSRL--MCVG*TRLSTA---SNRRLVL--GFSA*ALFSCVLVFRVVGLIHPCSFFYKGH 1060
             V RL     G  +   A     RR +L  GF             V  L     F Y  H
Sbjct: 1059 HVCRLNKALYGLKQAPRAWYQELRRFLLQAGF-------------VNSLADASLFIYNRH 1105

Query: 1061 IT-LYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLG 1119
             T +Y+LVYVDDII+ G + +L+  F A L   F++K LG L YFLG+E T T+ GL L 
Sbjct: 1106 NTFMYVLVYVDDIIIAGEN-ALVQAFNASLASRFSLKDLGPLSYFLGIEATRTSRGLHLM 1164

Query: 1120 HAKYAHDLLSHALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPD 1178
              KY  DLL    ML+   V TP++    L + SG    D T YR+++G+LQYL  TRPD
Sbjct: 1165 QRKYITDLLKKHNMLDTKPVSTPMSPTPKLSLLSGTALDDATEYRTVLGSLQYLAFTRPD 1224

Query: 1179 LSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCT 1238
            +++AVN +SQF+  PT +H+QA KRI+RY+   +  G+  R  +   +  +SDADW    
Sbjct: 1225 IAFAVNRLSQFMHRPTNEHWQAAKRILRYLAGTKSHGIFLRSDTPLTIHAFSDADWGCDL 1284

Query: 1239 DHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVR 1298
            D   ST  Y ++ G + +SWS+KKQ SVA SS E+EYRA+ANTASEL WL +LL E+ + 
Sbjct: 1285 DAYLSTNAYIVYFGGSPVSWSSKKQRSVARSSTEAEYRAVANTASELRWLCSLLLEMGIS 1344

Query: 1299 LSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADI 1358
             +  P+   DN  A ++  NPV H   KH+ LD HFVR  + SG L V HV T  QLAD 
Sbjct: 1345 QTTVPVIYCDNIGATYLCANPVFHSRMKHVALDYHFVRGYIQSGALRVSHVSTKDQLADA 1404

Query: 1359 FTKVLPRPLFDLFRSKLRV 1377
             TK LPRP F    SK+ V
Sbjct: 1405 LTKPLPRPRFTELNSKIGV 1423


>gb|AAC02664.1| polyprotein [Arabidopsis thaliana]
          Length = 1451

 Score =  787 bits (2033), Expect = 0.0
 Identities = 506/1449 (34%), Positives = 745/1449 (50%), Gaps = 112/1449 (7%)

Query: 14   VHMITV-KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAP 72
            V+M  V +L   NF++WS+QV A+L   +L   VD  +  P+     P  +    VV   
Sbjct: 24   VNMTNVTRLTDSNFVMWSRQVHALLDGYDLAGYVDGSIPIPT-----PTRTTADGVVTTN 78

Query: 73   NPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRD 132
            N  +  W+ +D+ + S LL +++         A T+ ++W  L S +A+ + A   +LR 
Sbjct: 79   ND-YTLWKRQDKLIYSALLGAISLSVQPLLSKANTSAEIWETLSSTFANPSWAHVQQLRQ 137

Query: 133  ELQLMRRGSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQ 182
            +L+   +G+ S+  Y +           +  +   ++++   L GL   Y         +
Sbjct: 138  QLKQWTKGTKSIVTYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIEGR 197

Query: 183  VPLPLFTDILWKVESHAIFQASLEDYVTPP---SAAFHARNPTRSSGSQSSGGGH-RGNS 238
               P  T++L K+ +H +  A+  +  + P   +A  +  N   ++ S+S+G  + RGN+
Sbjct: 198  ENPPALTEVLEKLINHEVKLAAKAEATSVPVTANAVNYRGNNNNNNNSRSNGRNNSRGNT 257

Query: 239  SSGSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGC--- 295
            S  +     N   +    Y  +CQ+C   GH A +CP         A+   S A+     
Sbjct: 258  SWQNSQSTSNRQQYTPRPYQGKCQICSVHGHSARRCPQLQQHAGSYASNQSSSASYAPWQ 317

Query: 296  ------SLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
                  S    N  +  +D+GAT H+T +L+ L     Y+G++ V + +G+GL I+H GS
Sbjct: 318  PRANMVSATPYNSGNWLLDSGATHHLTSNLNNLALHQPYNGDEEVTIADGSGLPISHSGS 377

Query: 350  R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
                + + S+ L +VL VP +  NL+SV ++   + V   F    F +++  TGA L +G
Sbjct: 378  ALLPTPTRSLALKDVLYVPDIQKNLISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQG 437

Query: 407  PCDKGFYVLDQGSQALLATS---SSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSI 463
                  Y     S   +ATS   S  P+     WH+RL H +  I+K L       +S  
Sbjct: 438  KTKNELYEWPVNSS--IATSMFASPTPKTDLPSWHARLGHPSLPILKALISKFSLPISHS 495

Query: 464  LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
            L   + C+ C + KS +L  Y N   +S  L+ ++ D+W  SP+ S+D + Y+++ VD +
Sbjct: 496  LQNQLLCSDCSINKSHKLPFYSNTIASSHPLEYLYTDVW-TSPITSIDNYKYYLVIVDHY 554

Query: 524  SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
            +++TW YPL++KS   ++ + F A VEN+F   +    SDNG EF    +++  +S G+ 
Sbjct: 555  TRYTWLYPLRKKSQVREMFITFTALVENKFKFKIGTLYSDNGGEFI--AMRSFLASHGIS 612

Query: 584  HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
            H    PHT   NG  ERKHRH++E GL +L  + +   YW +AF+TAVY+INR+ + VL 
Sbjct: 613  HMTTPPHTPELNGISERKHRHIVETGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLG 672

Query: 644  DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
            ++ P+  LF   P Y     FGC  FP LRPY  +K   R  PC+ LGY+     + C D
Sbjct: 673  NESPYVKLFGQPPNYLKLRIFGCLCFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLD 732

Query: 704  PTTSRTYVSRHAQFDEFCFPLTGSKSS---PND---------LVVFTFYEPAAGSPPS-- 749
              T R Y SRH QF E  FP + +  S   P+D         + V     P   +PPS  
Sbjct: 733  RATGRVYTSRHVQFAESSFPFSTTSPSVTPPSDPPLSQDTRPVSVPLLARPLTTAPPSSP 792

Query: 750  --------------LQPVILVVPE-SSPPTGPLPCPSCVD----------------PDVQ 778
                          L P   + P  S  PT P+  PS  +                P + 
Sbjct: 793  SCSAPHRSPSQSENLSPPAPLQPSLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLS 852

Query: 779  PVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDA 838
            P P    P SP   +     P P SP P  +P  +T    +SP  +PP     P  P+  
Sbjct: 853  PQPQRPQPQSPQSTSPHSSSPQPNSPNPQHSPRSLTPTLTSSPSPSPPPNPNPP--PIQH 910

Query: 839  RPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSAL 898
              RTRS N I KPNP++A +  +PT L   +     PK    A+  P+W  AM DE++A 
Sbjct: 911  TMRTRSKNNIVKPNPKFANLATKPTPLKPII-----PKTVVEALLDPNWRQAMCDEINAQ 965

Query: 899  HKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSP 958
             +  T+ LVP     NVVG K VF  K+ S+G ++R K  LVA+G  Q  G D+  TFSP
Sbjct: 966  TRNGTFDLVPPAPNQNVVGCKWVFTLKYLSNGVLDRYKARLVAKGFHQQYGHDFKETFSP 1025

Query: 959  VVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL-----VSRL- 1012
            V+K+TTVR +L + V     + Q+DV NAFL G L++ VY+ QP GFVD      V RL 
Sbjct: 1026 VIKSTTVRSVLHIAVSKGWSIRQIDVNNAFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLY 1085

Query: 1013 -MCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGH--ITLYLLVYV 1069
                G  +   A  +          L S +L    V  +   S F   H    LY+LVYV
Sbjct: 1086 KALYGLKQAPRAWYQE---------LRSYLLTQGFVNSVADTSLFTLRHERTILYVLVYV 1136

Query: 1070 DDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLS 1129
            DD+++TGSD +++T FIA L   F++K LG++ YFLG+E T T+ GL L   +Y  DLL 
Sbjct: 1137 DDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATRTSKGLHLMQKRYVLDLLE 1196

Query: 1130 HALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQ 1188
               ML A  VLTP++    L ++SG+    P+ YR+++G+LQYL  TRPD++YAVN +SQ
Sbjct: 1197 KTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAVLGSLQYLLFTRPDIAYAVNRLSQ 1256

Query: 1189 FLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYA 1248
            ++  PT  H+QA KRI+RY+      G+  R  +   +  YSDADWA   D+  ST  Y 
Sbjct: 1257 YMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLTLHAYSDADWAGDIDNYNSTNAYI 1316

Query: 1249 IFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSD 1308
            ++LG N +SWS+KKQ  VA SS E+EYRA+AN  SE+ W+ +LL EL + LS+PP+   D
Sbjct: 1317 LYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEIRWVCSLLTELGITLSSPPVVYCD 1376

Query: 1309 NQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLF 1368
            N  A +++ NPV H   KHI LD HFVRE V +G L V HV T  QLAD  TK LPR  F
Sbjct: 1377 NVGATYLSANPVFHSRMKHIALDFHFVRESVQAGALRVTHVSTKDQLADALTKPLPRQPF 1436

Query: 1369 DLFRSKLRV 1377
                SK+ V
Sbjct: 1437 TTLISKIGV 1445


>gb|AAC02666.1| polyprotein [Arabidopsis thaliana]
          Length = 1451

 Score =  783 bits (2022), Expect = 0.0
 Identities = 504/1449 (34%), Positives = 743/1449 (50%), Gaps = 112/1449 (7%)

Query: 14   VHMITV-KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAP 72
            V+M  V +L   NF++WS+QV A+L   +L   +D  +  P+     P  +    VV   
Sbjct: 24   VNMTNVTRLTDSNFVMWSRQVHALLDGYDLAGYIDGSIPIPT-----PTRTTADGVVTTN 78

Query: 73   NPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRD 132
            N  +  W+ +D+ + S LL +++         A T+ ++W  L S +A+ + A   +LR 
Sbjct: 79   ND-YTLWKRQDKLIYSALLGAISLSVQPLLSKANTSAEIWETLSSTFANPSWAHVQQLRQ 137

Query: 133  ELQLMRRGSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQ 182
            +L+   +G+ S+  Y +           +  +   ++++   L GL   Y         +
Sbjct: 138  QLKQWTKGTKSIVTYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIEGR 197

Query: 183  VPLPLFTDILWKVESHAIFQASLEDYVTPP---SAAFHARNPTRSSGSQSSGGGH-RGNS 238
               P  T++L K+ +H +  A+  +  + P   +A  +  N   ++ S+S+G  + RGN+
Sbjct: 198  ENPPALTEVLEKLINHEVKLAAKAEATSVPVTANAVNYRGNNNNNNNSRSNGRNNSRGNT 257

Query: 239  SSGSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGC--- 295
            S  +     N   +    Y  +CQ+C   GH A +CP         A+   S A+     
Sbjct: 258  SWQNSQSTSNRQQYTPRPYQGKCQICSVHGHSARRCPQLQQHAGSYASNQSSSASYAPWQ 317

Query: 296  ------SLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
                  S    N  +  +D+GAT H+T  L+ L     Y+G++ V + +G+GL I+H GS
Sbjct: 318  PRANMVSATPYNSGNWLLDSGATHHLTSDLNNLALHQPYNGDEEVTIADGSGLPISHSGS 377

Query: 350  R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
                + + S+ L +VL VP +  NL+SV ++   + V   F    F +++  TGA L +G
Sbjct: 378  ALLPTPTRSLALKDVLYVPDIQKNLISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQG 437

Query: 407  PCDKGFYVLDQGSQALLATS---SSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSI 463
                  Y     S   +ATS   S  P+     WH+RL H +  I+K L       +S  
Sbjct: 438  KTKNELYEWPVNSS--IATSMFASPTPKTDLPSWHARLGHPSLPILKALISKFSLPISHS 495

Query: 464  LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
            L   + C+ C + KS +L  Y N   +S  L+ ++ D+W  SP+ S+D + Y+++ VD +
Sbjct: 496  LQNQLLCSDCSINKSHKLPFYSNTIASSHPLEYLYTDVW-TSPITSIDNYKYYLVIVDHY 554

Query: 524  SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
            +++TW YPL++KS   ++ + F A VEN+F   +    SDNG EF    +++  +S G+ 
Sbjct: 555  TRYTWLYPLRKKSQVREMFITFTALVENKFKFKIGTLYSDNGGEFI--AMRSFLASHGIS 612

Query: 584  HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
            H    PHT   NG  ERKHRH++E GL +L  + +   YW +AF+TAVY+INR+ + VL 
Sbjct: 613  HMTTPPHTPELNGISERKHRHIVETGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLG 672

Query: 644  DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
            ++ P+  LF   P Y     FGC  FP LRPY  +K   R  PC+ LGY+     + C D
Sbjct: 673  NESPYVKLFGQPPNYLKLRIFGCLCFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLD 732

Query: 704  PTTSRTYVSRHAQFDEFCFPLTGSKSS---PND---------LVVFTFYEPAAGSPPS-- 749
              T R Y SRH QF E  FP + +  S   P+D         + V     P   +PPS  
Sbjct: 733  RATGRVYTSRHVQFAESSFPFSTTSPSVTPPSDPPLSQDTRPVSVPLLARPLTTAPPSSP 792

Query: 750  --------------LQPVILVVPE-SSPPTGPLPCPSCVD----------------PDVQ 778
                          L P   + P  S  PT P+  PS  +                P + 
Sbjct: 793  SCSAPHRSPSQSENLSPPAPLQPSLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLS 852

Query: 779  PVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDA 838
            P P    P SP   +     P P SP P  +P  +T    +SP  +PP     P  P+  
Sbjct: 853  PQPQRPQPQSPQSTSPHSSSPQPNSPNPQHSPRSLTPTLTSSPSPSPPPNPNPP--PIQH 910

Query: 839  RPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSAL 898
              RTRS N I KPNP++A +  +PT L   +     PK    A+  P+W  AM DE++A 
Sbjct: 911  TMRTRSKNNIVKPNPKFANLATKPTPLKPII-----PKTVVEALLDPNWRQAMCDEINAQ 965

Query: 899  HKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSP 958
             +  T+ LVP     NVVG K VF  K+ S+G ++R K  LVA+G  Q  G D+  TFSP
Sbjct: 966  TRNGTFDLVPPAPNQNVVGCKWVFTLKYLSNGVLDRYKARLVAKGFHQQYGHDFKETFSP 1025

Query: 959  VVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL-----VSRL- 1012
            V+K+TTVR +L + V     + Q+DV NAFL G L++ VY+ QP GFVD      V RL 
Sbjct: 1026 VIKSTTVRSVLHIAVSKGWSIRQIDVNNAFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLY 1085

Query: 1013 -MCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGH--ITLYLLVYV 1069
                G  +   A  +          L S +L    V  +   S F   H    LY+LVYV
Sbjct: 1086 KALYGLKQAPRAWYQE---------LRSYLLTQGFVNSVADTSLFTLRHERTILYVLVYV 1136

Query: 1070 DDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLS 1129
            DD+++TGSD +++T FIA L   F++K LG++ YFLG+E T T+ GL L   +Y  DLL 
Sbjct: 1137 DDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATRTSKGLHLMQKRYVLDLLE 1196

Query: 1130 HALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQ 1188
               ML A  VLTP++    L ++SG+    P+ YR+++G+LQYL  TRPD++YAVN +SQ
Sbjct: 1197 KTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAVLGSLQYLLFTRPDIAYAVNRLSQ 1256

Query: 1189 FLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYA 1248
            ++  PT  H+QA KRI+RY+      G+  R  +   +  YSDADWA   D+  ST  Y 
Sbjct: 1257 YMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLTLHAYSDADWAGDIDNYNSTNAYI 1316

Query: 1249 IFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSD 1308
            ++LG N +SWS+KKQ  VA SS E+EYRA+AN  SE+ W+ +LL EL + LS+PP+   D
Sbjct: 1317 LYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEIRWVCSLLTELGITLSSPPVVYCD 1376

Query: 1309 NQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLF 1368
            N  A +++ NPV     KHI LD HFVRE V +G L V HV T  QLAD  TK LPR  F
Sbjct: 1377 NVGATYLSANPVFDSRMKHIALDFHFVRESVQAGALRVTHVSTKDQLADALTKPLPRQPF 1436

Query: 1369 DLFRSKLRV 1377
                SK+ V
Sbjct: 1437 TTLISKIGV 1445


>gb|AAC02669.1| polyprotein [Arabidopsis thaliana]
          Length = 1451

 Score =  783 bits (2021), Expect = 0.0
 Identities = 505/1449 (34%), Positives = 742/1449 (50%), Gaps = 112/1449 (7%)

Query: 14   VHMITV-KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAP 72
            V+M  V +L   NF++WS+QV A+L   +L   VD  +  P+     P  +    VV   
Sbjct: 24   VNMTNVTRLTDSNFVMWSRQVHALLDGYDLAGYVDGSIPIPT-----PTRTTADGVVTTN 78

Query: 73   NPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRD 132
            N  +  W+ +D+ + S LL +++         A T+ ++W  L S +A+ + A   +LR 
Sbjct: 79   ND-YTLWKRQDKLIYSALLGAISLSVQPLLSKANTSAEIWETLSSTFANPSWAHVQQLRQ 137

Query: 133  ELQLMRRGSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQ 182
            +L+   +G+ S+  Y +           +  +   ++++   L GL   Y         +
Sbjct: 138  QLKQWTKGTKSIVTYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIEGR 197

Query: 183  VPLPLFTDILWKVESHAIFQASLEDYVTPP---SAAFHARNPTRSSGSQSSGGGH-RGNS 238
               P  T++L K+ +H +  A+  +  + P   +A  +  N   ++ S+S+G  + RGN+
Sbjct: 198  ENPPALTEVLEKLINHEVKLAAKAEATSVPVTANAVNYRGNNNNNNNSRSNGRNNSRGNT 257

Query: 239  SSGSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGC--- 295
            S  +     N   +    Y  +CQ+C   GH A +CP         A+   S A+     
Sbjct: 258  SWQNSQSTSNRQQYTPRPYQGKCQICSVHGHSARRCPQLQQHAGSYASNQSSSASYAPWQ 317

Query: 296  ------SLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
                  S    N  +  +D+GAT H+T  L+ L     Y+G++ V + +G+GL I+H GS
Sbjct: 318  PRANMVSATPYNSGNWLLDSGATHHLTSDLNNLALHQPYNGDEEVTIADGSGLPISHSGS 377

Query: 350  R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
                + + S+ L +VL VP +  NL+SV ++   + V   F    F +++  TGA L +G
Sbjct: 378  ALLPTPTRSLALKDVLYVPDIQKNLISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQG 437

Query: 407  PCDKGFYVLDQGSQALLATS---SSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSI 463
                  Y     S   +ATS   S  P+     WH+RL H +  I+K L       +S  
Sbjct: 438  KTKNELYEWPVNSS--IATSMFASPTPKTDLPSWHARLGHPSLPILKALISKFSLPISHS 495

Query: 464  LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
            L   + C+ C + KS +L  Y N   +S  L+ ++ D+W  SP+ S+D + Y+++ VD +
Sbjct: 496  LQNQLLCSDCSINKSHKLPFYSNTIASSHPLEYLYTDVW-TSPITSIDNYKYYLVIVDHY 554

Query: 524  SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
            +++TW YPL++KS   ++ + F A VEN+F   +    SDNG EF    +++  +S G+ 
Sbjct: 555  TRYTWLYPLRKKSQVREMFITFTALVENKFKFKIGTLYSDNGGEFI--AMRSFLASHGIS 612

Query: 584  HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
            H    PHT   NG  ERKHRH++E GL +L  + +   YW +AF+TAVY+INR+ + VL 
Sbjct: 613  HMTTPPHTPELNGISERKHRHIVETGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLG 672

Query: 644  DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
            ++ P+  LF   P Y     FGC  FP LRPY  +K   R  PC+ LGY+     + C D
Sbjct: 673  NESPYVKLFGQPPNYLKLRIFGCLCFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLD 732

Query: 704  PTTSRTYVSRHAQFDEFCFPLTGSKSS---PND---------LVVFTFYEPAAGSPPS-- 749
              T R Y SRH QF E  FP + +  S   P+D         + V     P   +PPS  
Sbjct: 733  RATGRVYTSRHVQFAESSFPFSTTSPSVTPPSDPPLSQDTRPVSVPLLARPLTTAPPSSP 792

Query: 750  --------------LQPVILVVPE-SSPPTGPLPCPSCVD----------------PDVQ 778
                          L P   + P  S  PT P+  PS  +                P + 
Sbjct: 793  SCSAPHRSPSQSENLSPPAPLQPSLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLS 852

Query: 779  PVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDA 838
            P P    P SP   +     P P SP P  +P  +T    +SP  +PP     P  P+  
Sbjct: 853  PQPQRPQPQSPQSTSPHSSSPQPNSPNPQHSPRSLTPTLTSSPSPSPPPNPNPP--PIQH 910

Query: 839  RPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSAL 898
              RTRS N I KPNP++A +  +PT L   +     PK    A+  P+W  AM DE++A 
Sbjct: 911  TMRTRSKNNIVKPNPKFANLATKPTPLKPII-----PKTVVEALLDPNWRQAMCDEINAQ 965

Query: 899  HKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSP 958
             +  T+ LVP     NVVG K VF  K+ S+G ++R K  LVA+G  Q  G D+  TFSP
Sbjct: 966  TRNGTFDLVPPAPNQNVVGCKWVFTLKYLSNGVLDRYKARLVAKGFHQQYGHDFKETFSP 1025

Query: 959  VVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL-----VSRL- 1012
            V+K TTVR +L + V     + Q+DV NAFL G L++ VY+ QP GFVD      V RL 
Sbjct: 1026 VIKLTTVRSVLHIAVSKGWSIRQIDVNNAFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLY 1085

Query: 1013 -MCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGH--ITLYLLVYV 1069
                G  +   A  +          L S +L    V  +   S F   H    LY+LVYV
Sbjct: 1086 KALYGLKQAPRAWYQE---------LRSYLLTQGFVNSVADTSLFTLRHERTILYVLVYV 1136

Query: 1070 DDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLS 1129
            DD+++TGSD +++T FIA L   F++K LG++ YFLG+E T T+ GL L   +Y  DLL 
Sbjct: 1137 DDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATRTSKGLHLMQKRYVLDLLE 1196

Query: 1130 HALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQ 1188
               ML A  VLTP++    L ++SG+    P+ YR+++G+LQYL  TRPD++YAVN +SQ
Sbjct: 1197 KTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAVLGSLQYLLFTRPDIAYAVNRLSQ 1256

Query: 1189 FLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYA 1248
            ++  PT  H+QA KRI+RY+      G+  R  +   +  YSDADWA   D+  ST  Y 
Sbjct: 1257 YMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLTLHAYSDADWAGDIDNYNSTNAYI 1316

Query: 1249 IFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSD 1308
            ++LG N +SWS+KKQ  VA SS E+EYRA+AN  SE+ W+ +LL EL + LS+PP+   D
Sbjct: 1317 LYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEIRWVCSLLTELGITLSSPPVVYCD 1376

Query: 1309 NQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLF 1368
            N  A +++ NPV     KHI LD HFVRE V +G L V HV T  QLAD  TK LPR  F
Sbjct: 1377 NVGATYLSANPVFDSRMKHIALDFHFVRESVQAGALRVTHVSTKDQLADALTKPLPRQPF 1436

Query: 1369 DLFRSKLRV 1377
                SK+ V
Sbjct: 1437 TTLISKIGV 1445


>gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301693|pir||F84480 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1402

 Score =  775 bits (2000), Expect = 0.0
 Identities = 491/1401 (35%), Positives = 708/1401 (50%), Gaps = 115/1401 (8%)

Query: 6    YSFLAYTWVHMITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSAL 65
            YS  +    + +TV L + N++LW  Q  + L  Q L   V   +  PS+  +  VS   
Sbjct: 4    YSVPSLNISNCVTVTLTAKNYILWKSQFESFLDGQGLLGFVTGSIPAPSQTSV--VSDID 61

Query: 66   HHVVPAPNPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKA 125
                 +PNP +  W   DR V S LL S  E+ LS  ++  T+ +VW ++ + +   + +
Sbjct: 62   GSTSASPNPEYYTWFKTDRVVKSWLLGSFLEDILSVVVNCNTSHEVWISVANHFNRVSSS 121

Query: 126  REMRLRDELQLMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANF 175
            R   L+  LQ + +   S+ EY + ++            V    K+   L GLG  Y   
Sbjct: 122  RLFELQRRLQNVSKRDKSMDEYLKDLKTICDQLASVGSPVTEKMKIFAALNGLGREYEPI 181

Query: 176  ST---GQLDQVPLPLFTDILWKVESHAI-FQASLEDYVTPPSAAFHARNPTRSSGS---- 227
             T     +D +P P   D++ K+  +    Q  LE+    P  AF+      S+ S    
Sbjct: 182  KTTIENSMDALPGPSLEDVIPKLTGYDDRLQGYLEETAVSPHVAFNITTSDDSNASGYFN 241

Query: 228  ---QSSGGGHRGNSSSGSRPR---------RDNGGSHCRGSYTPRCQLCRKQGHYAAKCP 275
               +  G  +RG +S  +R R           + GS   G+    CQ+C K GH A KC 
Sbjct: 242  AYNRGKGKSNRGRNSFSTRGRGFHQQISSTNSSSGSQSGGTSVV-CQICGKMGHPALKCW 300

Query: 276  VRWDRPSESANLTHSFAAG--CSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDR 333
             R++   +   L  + AA     + + + ++   D+ AT+H+T+S   L  S  Y G+D 
Sbjct: 301  HRFNNSYQYEELPRALAAMRITDITDQHGNEWLPDSAATAHVTNSPRSLQQSQPYHGSDA 360

Query: 334  VLVGNGAGLHITHIGSR---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEAD 390
            V+V +G  L ITH GS    S+S +VPL++VLV P +T +L+SVSKLT+D+     F++D
Sbjct: 361  VMVADGNFLPITHTGSTNLASSSGNVPLTDVLVCPSITKSLLSVSKLTQDYPCTVEFDSD 420

Query: 391  SFVIQNRQTGAVLGKGPCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIK 450
               I ++ T  +L  G    G Y L   SQ     S+    AS E+WH RL H +  +++
Sbjct: 421  GVRINDKATKKLLIMGSTCDGLYCLKDDSQFKAFFSTRQQSASDEVWHRRLGHPHPQVLQ 480

Query: 451  KLHKLGCFNVSSILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASV 510
            +L K    +++        C +CQ+ KS RL    +   ++  L+ VHCDLWGPSP+ SV
Sbjct: 481  QLVKTNSISINKTSKS--LCEACQLGKSTRLPFVSSSFTSNRPLERVHCDLWGPSPITSV 538

Query: 511  DGFSYFVIFVDDFSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTN 570
             GF Y+ +F+D +S+F+W YPLK KSDFY++ V F   VENQ +  + VFQ D G EF N
Sbjct: 539  QGFRYYAVFIDHYSRFSWIYPLKLKSDFYNIFVAFHKLVENQLNHKISVFQCDGGGEFVN 598

Query: 571  NKVQALFSSSGVLHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTA 630
            +K      + G+      PHT  QNG  ERKHRH++ELGL+ML+ S V   +WV AF TA
Sbjct: 599  HKFLQHLQNHGIQQHISYPHTPQQNGLAERKHRHLVELGLSMLFQSKVPLKFWVEAFFTA 658

Query: 631  VYIINRVPSKVLSDQI-PFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIF 689
             ++IN +P+  + D I P++ L Q  P Y     FGC  FP +R Y  NKF PR   C+F
Sbjct: 659  NFLINLLPTSAVEDAISPYEKLHQTTPDYTALRSFGCACFPTMRDYAMNKFDPRSLKCVF 718

Query: 690  LGYNSHHKGFKCFDPTTSRTYVSRHAQFDEFCFP-------LTGSKSSPNDLVVFTFYEP 742
            LGYN  +KG++C  P T R Y+SRH  FDE  +P       L    ++P     F  +E 
Sbjct: 719  LGYNDKYKGYRCLYPPTGRVYISRHVIFDETAYPFSHHYKHLHSQPTTPLLAAWFKGFES 778

Query: 743  AAG-SPPSLQPVILVVPESSPPTGPLPCPSCVDPDVQPVPVDDAPPSP-----VPHNDAP 796
            +   +PP + P      +++ PT PL                D PP P     +  N A 
Sbjct: 779  SVSQAPPKVSPAQPPQRKATLPTPPL------------FTAADFPPLPRRSPQLSQNSAA 826

Query: 797  PL---PAPTSPTPPATPLVVTQA------------------------PPTSPPTTPPAVT 829
             L   P+ T+      P VV ++                                 P  T
Sbjct: 827  ALVSQPSTTTINSTHPPAVVNESSERTINFDSASIGDSSHSSQLLVDDTVEDLMAAPVPT 886

Query: 830  QVPLAPVDARPR-TRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWL 888
            Q    P +  P  TR+  GI KPNPRY          L+   T   PK   +A+KHP W 
Sbjct: 887  QQAPPPTNTHPMITRAKVGITKPNPRYV--------FLSHKVTYPEPKTVTAALKHPGWT 938

Query: 889  PAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIP 948
             AM +E+    +  TW+LVP     +V+GSK VFRTK H+DGT+ +LK  +VA+   Q  
Sbjct: 939  GAMTEEMGNCSETNTWSLVPYTPNMHVLGSKWVFRTKLHADGTLNKLKARIVAKCFLQEE 998

Query: 949  GFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL 1008
            G  Y  T+SPVV+  TV+L+L L    + +L Q+DVKNAFLHG L ETVYM QP GFVD 
Sbjct: 999  GIGYLETYSPVVRTPTVQLVLHLATALNWELKQMDVKNAFLHGDLNETVYMTQPAGFVDK 1058

Query: 1009 VSRLMCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRV-----VGLIHPCSFFYKGHITL 1063
                     T +         L  S  A F     F +          P  F Y  +  L
Sbjct: 1059 SKP------THVCLLHKSIYGLKQSPRAWFDKFSTFLLEFGFFCSKSDPSLFIYAHNNNL 1112

Query: 1064 -YLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAK 1122
              LL+YVDD+++TG+    L+  +A LN EF +  +G+L YFLG+++     GLF+   K
Sbjct: 1113 ILLLLYVDDMVITGNSSQTLSSLLAALNKEFRMTDMGQLHYFLGIQVQRNQHGLFMSQQK 1172

Query: 1123 YAHDLLSHALMLEASHVLTPLAAG-SHLVSSGEGYSDPTHYRSLVGALQYLTITRPDLSY 1181
            YA DLL  + M   + + TPL      +    E ++DPT++RS+ G LQYLT+TRPD+ +
Sbjct: 1173 YAEDLLVASAMENCTPLPTPLPVQLDRVPHQEEPFTDPTYFRSIAGKLQYLTLTRPDIHF 1232

Query: 1182 AVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHR 1241
            AVN V Q +  PT+  F  +KRI+RY+      G+++ + S   +  YSD+DW  C   R
Sbjct: 1233 AVNFVCQKMHQPTMSDFHLLKRILRYIKGTITMGISYNQNSPTLLQAYSDSDWGNCKLTR 1292

Query: 1242 RSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSA 1301
            RS  G   F+  NL+SWS+KK P+V+ SS E+EYR +++ ASE++WL  LL EL + L  
Sbjct: 1293 RSVGGLCTFMATNLVSWSSKKHPTVSRSSTEAEYRTLSDAASEILWLSTLLRELGIPLPD 1352

Query: 1302 PPLFLSDNQSALFMAQNPVAH 1322
             P    DN SA++   NP  H
Sbjct: 1353 TPELFCDNLSAVYHTANPAFH 1373


>emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|4972079|emb|CAB43904.1|
            putative protein [Arabidopsis thaliana]
            gi|7444467|pir||T08945 hypothetical protein F25O24.20 -
            Arabidopsis thaliana
          Length = 1415

 Score =  760 bits (1962), Expect = 0.0
 Identities = 500/1420 (35%), Positives = 716/1420 (50%), Gaps = 154/1420 (10%)

Query: 10   AYTWVHMITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVV 69
            A  + H +T+KL++ N+LLW  Q    L NQ L   V    A P     + + +    V 
Sbjct: 10   ALCFSHYVTLKLSTANYLLWKIQFETWLNNQRLLGFVTG--ANPCPNATRSIRNG-DQVT 66

Query: 70   PAPNPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMR 129
             A NP F+ W   D+ +   LL SL+E+AL       T+R+VW +L   Y   + +R+  
Sbjct: 67   EATNPDFLTWVQNDQKIMGWLLGSLSEDALRSVYGLHTSREVWFSLAKKYNRVSASRKSD 126

Query: 130  LRDELQLMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFST-- 177
            L+  L  + +   S+ EY   ++            V  ++K+   L GLG  Y   ST  
Sbjct: 127  LQRRLNPVSKNEKSMLEYLNCVKQICDQLDSIGCPVPENEKIFGVLNGLGQEYMLVSTMI 186

Query: 178  -GQLDQVPLPLFTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGGHRG 236
             G +D  P+  F D+++K+ +   F   L++                    QS G   R 
Sbjct: 187  KGSMDTYPMS-FEDVVFKLIN---FDDKLQN-------------------GQSGGNRGRN 223

Query: 237  NSSSGSR--PRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAG 294
            N ++  R  P++ + GS       P CQ+C K GH A KC  R+D   +S + + +FAA 
Sbjct: 224  NYTTKGRGFPQQISSGSPSDSGTRPTCQICNKYGHSAYKCWKRFDHAFQSEDFSKAFAA- 282

Query: 295  CSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGSR---S 351
              +++   +    D+GATSH+T+S SQL  +  YSG D V+VGN   L ITHIGS    S
Sbjct: 283  MRVSDQKSNPWVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHIGSAVLTS 342

Query: 352  ASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKG 411
               ++PL +VLV P +T +L+SVSKLT D+     F++D  +++++ T  +L KG     
Sbjct: 343  NQGNLPLRDVLVCPNITKSLLSVSKLTSDYPCVIEFDSDGVIVKDKLTKQLLTKGTRHND 402

Query: 412  FYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILPKPICCT 471
             Y+L+   + +   SS     S E+WH RL H N D++++L +     +S        C 
Sbjct: 403  LYLLEN-PKFMACYSSRQQATSDEVWHMRLGHPNQDVLQQLLRNKAIVISKTSHS--LCD 459

Query: 472  SCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQFTWFYP 531
            +CQM K  +L    +   +S +L+ VHCDLWGP+PV S  GF Y+VIF+D++S+FTWFYP
Sbjct: 460  ACQMGKICKLPFASSDFVSSRLLERVHCDLWGPAPVVSSQGFRYYVIFIDNYSRFTWFYP 519

Query: 532  LKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRFLCPHT 591
            L+ KSDF+ V + F+  VENQ  + +  FQ D G EF +N+  +  +  G+     CP+T
Sbjct: 520  LRLKSDFFSVFLTFQKMVENQCQQKIASFQCDGGGEFISNQFVSHLAECGIRQLISCPYT 579

Query: 592  QAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQ-IPFQL 650
              QNG  ERKHRH+ ELG +M++   V    WV AF T+ ++ N +PS VL DQ  P+++
Sbjct: 580  PQQNGIAERKHRHITELGSSMMFQGKVPQFLWVEAFYTSNFLCNLLPSSVLKDQKSPYEV 639

Query: 651  LFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTTSRTY 710
            L   AP Y +   FGC  +P LRPY  NKF P+   C+F GYN  +KG+KCF P T + Y
Sbjct: 640  LMGKAPVYTSLRVFGCACYPNLRPYASNKFDPKSLLCVFTGYNEKYKGYKCFHPPTGKIY 699

Query: 711  VSRHAQFDE----FCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVP-------- 758
            ++RH  FDE    F    +   S  N  +V  +            P +L +         
Sbjct: 700  INRHVLFDESKFLFSDIYSDKVSGTNSTLVSAWQSNFLPKSIPATPEVLDISNTAASFSD 759

Query: 759  ---ESSPPTGPLPCPSCVDPDVQPVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQ 815
               E S   G   C    D D  P+  +  P SPV   ++P    P S          ++
Sbjct: 760  EQGEFSGAVGGGGCGCTADLDSVPIG-NSLPSSPVTQQNSPQPETPISSAGSGNDAEDSE 818

Query: 816  APPTSPPTT----PPAVTQVPLAP---VDARPR-TRS*NGIFKPNPRYALVHAQPTGLLT 867
                S  +       A T+   A      + P  TRS +GIFKPNP+YA        + T
Sbjct: 819  LSENSENSESSVFSEATTETEAADNTNDQSHPMITRSKSGIFKPNPKYA--------MFT 870

Query: 868  ALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFH 927
                   PK  K+A+K P W  AM +E  +  +  TW LVP       +G + VF+TK  
Sbjct: 871  VKSNYPVPKTVKTALKDPGWTDAMGEEYDSFEETHTWDLVPPDSFITPLGCRWVFKTKLK 930

Query: 928  SDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNA 987
            +DGT++RLK  LVA+G  Q  G DY  T+SPVV+  TVR IL +  +N  ++ QLDVKNA
Sbjct: 931  ADGTLDRLKARLVAKGYEQEEGVDYMETYSPVVRTATVRTILHVATINKWEIKQLDVKNA 990

Query: 988  FLHGHLTETVYMEQPHGF-----VDLVSRLMCVG*TRLSTASNRRLVLGFSA*ALFSCVL 1042
            FLHG L ETVYM QP GF      D V +L      +      +     F   + F    
Sbjct: 991  FLHGDLKETVYMYQPPGFENQDRPDYVCKL-----NKAIYGLKQAPRAWFDKFSTFLLEF 1045

Query: 1043 VFRVVGLIHPCSF-FYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKL 1101
             F +     P  F F KG   ++LL+Y+DD++LTG+                        
Sbjct: 1046 GF-ICTYSDPSLFVFLKGRDLMFLLLYMDDMLLTGN------------------------ 1080

Query: 1102 GYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLV-SSGEGYSDPT 1160
                              + KYA DLL  A M + + + TPL      V    E ++DPT
Sbjct: 1081 ------------------NKKYAMDLLVAAGMADCAPMPTPLPLQLDKVPGQQESFADPT 1122

Query: 1161 HYRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRR 1220
            ++RSL                AVN V Q + +PTV  F  +KR++RY+      GL    
Sbjct: 1123 YFRSL----------------AVNLVCQKMHSPTVADFNLLKRVLRYLKGKVQMGLNLHN 1166

Query: 1221 TSSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMAN 1280
             +   +  YSD+DWA C + RRS  G+  FLG N++SWSAK+ P+V+ SS E+EYR ++ 
Sbjct: 1167 NTDITLRAYSDSDWANCKETRRSVGGFCTFLGTNIISWSAKRHPTVSRSSTEAEYRTLSI 1226

Query: 1281 TASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVS 1340
             A+E+ W+ +LL E+ +   APP    DN SA+++  NP  H  +K  D+D H+VRE V+
Sbjct: 1227 AATEVKWISSLLREIGIYQPAPPELYCDNLSAVYLTANPAMHNRSKAFDVDFHYVRERVA 1286

Query: 1341 SGRLAVRHVPTSLQLADIFTKVLP-RPLFDLFRSKLRVGL 1379
             G L V+HVP S QLADIFTK LP RP FDL R KL V L
Sbjct: 1287 LGALVVKHVPASHQLADIFTKSLPQRPFFDL-RYKLGVVL 1325


>gb|AAK43485.1| polyprotein, putative [Arabidopsis thaliana]
          Length = 1459

 Score =  758 bits (1956), Expect = 0.0
 Identities = 508/1459 (34%), Positives = 730/1459 (49%), Gaps = 138/1459 (9%)

Query: 20   KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
            KL S N+L+WS Q+ A+L   +L   +D  V  P      P ++ ++ VV A NP F  W
Sbjct: 32   KLTSTNYLMWSIQIHALLDGYDLAGYLDNSVVIP------PETTTINSVVSA-NPSFTLW 84

Query: 80   QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQLMRR 139
            + +D+ + S L+ +++    S    AT +  +W+ L++ YA  +     +LR ++Q + +
Sbjct: 85   KRQDKLIFSALIGAISPAVQSLVSRATNSSQIWSTLNNTYAKPSYGHIKQLRQQIQRLTK 144

Query: 140  GSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQVPLPLFT 189
            G+ ++ EY +           +   + ++++V   L+GL   Y         +   P  T
Sbjct: 145  GTKTIDEYVQSHTTRLDQLAILGKPMEHEEQVEHILKGLPEEYKTVVDQIEGKDNTPTIT 204

Query: 190  DILWKVESHAIFQASLEDYVTPPSAAF-------HARNPTRSSGSQS-----SGGGHRGN 237
            +I  ++ +H   ++ L     PPS++F         RN   +           G  H  N
Sbjct: 205  EIHERLINH---ESKLLSDEVPPSSSFPMSANAVQQRNFNNNCNQNQHKNRYQGNTHNNN 261

Query: 238  SSSGSRPRRDN-GGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSF----- 291
            +++ S+P   N  G      Y  +CQ+C  QGH A +CP        +++  HS      
Sbjct: 262  TNTNSQPSTYNKSGQRTFKPYLGKCQICSVQGHSARRCPQLQAMQLPASSSAHSPFTPWQ 321

Query: 292  -AAGCSLNNSNRSDKYM-DTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
              A  ++ +   ++ ++ D+GAT H+T  L+ L+    Y+G + V++ +G GL I   GS
Sbjct: 322  PRANLAIGSPYAANPWLLDSGATHHITSDLNALSLHQPYNGGEYVMIADGTGLTIKQTGS 381

Query: 350  R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
                S +  + L  VL VP +  NL+SV +L   + V   F   SF +++  TG +L +G
Sbjct: 382  TFLPSQNRDLALHKVLYVPDIRKNLISVYRLCNTNQVSVEFFPASFQVKDLNTGTLLLQG 441

Query: 407  PCDKGFY---VLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSI 463
                  Y   V +  + AL  + S  P+ +   WHSRL H +  I+  L       VS  
Sbjct: 442  RTKDDLYEWPVTNPPATALFTSPS--PKTTLSSWHSRLGHPSASILNTLLSKFSLPVSVA 499

Query: 464  LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
                  C+ C + KS +L    +   +S+ L+ +  D+W  SP+ S D + Y+++ VD +
Sbjct: 500  SSNKTSCSDCLINKSHKLPFATSSIHSSSPLEYIFTDVW-TSPIISHDNYKYYLVLVDHY 558

Query: 524  SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
            +++TW YPL++KS      + FKA VEN+F   ++   SDNG EF    ++    S+G+ 
Sbjct: 559  TRYTWLYPLQQKSQVKATFIAFKALVENRFQAKIRTLYSDNGGEFI--ALRDFLVSNGIS 616

Query: 584  HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
            H    PHT   NG  ERKHRH++E GL +L  + V   YW +AF+TAVY+INR+P+ VL 
Sbjct: 617  HLTSPPHTPEHNGLSERKHRHIVETGLTLLTQASVPREYWTYAFATAVYLINRMPTPVLC 676

Query: 644  DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
             Q PFQ LF  +P Y     FGC  FP LRPY +NK   R   C+FLGY+     + C D
Sbjct: 677  LQSPFQKLFGSSPNYQRLRVFGCLCFPWLRPYTRNKLEERSKRCVFLGYSLTQTAYLCLD 736

Query: 704  PTTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPS-----LQPVILVVP 758
               +R Y SRH  FDE  +P   S    +   + T  E ++ S P+        + L  P
Sbjct: 737  VDNNRLYTSRHVMFDESTYPFAASIREQSQSSLVTPPESSSSSSPANSGFPCSVLRLQSP 796

Query: 759  ESSPPTGPLPCPSCVDPDVQPVPVDDAPPS---------------------PVPHNDAP- 796
             +S P  P P     D  V P       PS                       PH + P 
Sbjct: 797  PASSPETPSPPQQQNDSPVSPRQTGSPTPSHHSQVRDSTLSPSPSVSNSEPTAPHENGPE 856

Query: 797  -------------PLPAPTSPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPV------- 836
                         PLP P   T P++   + Q P     TT     Q  +A         
Sbjct: 857  PEAQSNPNSPFIGPLPNPNPETNPSSS--IEQRPVDKSTTTALPPNQTTIAATSNSRSQP 914

Query: 837  ---DARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMED 893
               + + +TRS N I KP  + +L  A     L+  +TVT       A+K   W  AM D
Sbjct: 915  PKNNHQMKTRSKNNITKPKTKTSLTVALTQPHLSEPNTVT------QALKDKKWRFAMSD 968

Query: 894  ELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYS 953
            E  A  +  TW LVP   T ++VG + VF+ K+  +G +++ K  LVA+G  Q  G DY+
Sbjct: 969  EFDAQQRNHTWDLVPPNPTQHLVGCRWVFKLKYLPNGLIDKYKARLVAKGFNQQYGVDYA 1028

Query: 954  LTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL----- 1008
             TFSPV+KATT+R++L + V  +  L QLDV NAFL G LTE VYM QP GFVD      
Sbjct: 1029 ETFSPVIKATTIRVVLDVAVKKNWPLKQLDVNNAFLQGTLTEEVYMAQPPGFVDKDRPSH 1088

Query: 1009 VSRLMCVG*TRLSTASNRRLVLGFSA------*ALFSCVLVFRVVGLIHPCSFFYKGHIT 1062
            V RL             R+ + G           L   +L    V  +   S F   H T
Sbjct: 1089 VCRL-------------RKAIYGLKQAPRAWYMELKQHLLNIGFVNSLADTSLFIYSHGT 1135

Query: 1063 --LYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGH 1120
              LYLLVYVDDII+TGSD   ++  ++ L + F+IK    L YFLG+E T T  GL L  
Sbjct: 1136 TLLYLLVYVDDIIVTGSDHKSVSAVLSSLAERFSIKDPTDLHYFLGIEATRTNTGLHLMQ 1195

Query: 1121 AKYAHDLLSHALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDL 1179
             KY  DLL+   ML+A  V TPL     L +  G   +D + YRS+VG+LQYL  TRPD+
Sbjct: 1196 RKYMTDLLAKHNMLDAKPVATPLPTSPKLTLHGGTKLNDASEYRSVVGSLQYLAFTRPDI 1255

Query: 1180 SYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVL-GYSDADWARCT 1238
            ++AVN +SQF+  PT DH+QA KR++RY+      G+ F  +SSP  L  +SDADWA  +
Sbjct: 1256 AFAVNRLSQFMHQPTSDHWQAAKRVLRYLAGTTTHGI-FLNSSSPIHLHAFSDADWAGDS 1314

Query: 1239 DHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVR 1298
                ST  Y I+LG N +SWS+KKQ  V+ SS ESEYRA+AN ASE+ WL +LL EL +R
Sbjct: 1315 ADYVSTNAYVIYLGRNPISWSSKKQRGVSRSSTESEYRAVANAASEIRWLCSLLTELHIR 1374

Query: 1299 LSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADI 1358
            L   P    DN  A ++  NPV H   KHI LD HFVR ++ S  L V HV T+ QLAD 
Sbjct: 1375 LPHGPTIFCDNIGATYICANPVFHSRMKHIALDYHFVRGMIQSRALRVSHVSTNDQLADA 1434

Query: 1359 FTKVLPRPLFDLFRSKLRV 1377
             TK L RP F   RSK+ V
Sbjct: 1435 LTKSLSRPHFLSARSKIGV 1453


>gb|AAU43956.1| unknown protein [Oryza sativa (japonica cultivar-group)]
            gi|52353503|gb|AAU44069.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
          Length = 1447

 Score =  757 bits (1954), Expect = 0.0
 Identities = 509/1467 (34%), Positives = 721/1467 (48%), Gaps = 144/1467 (9%)

Query: 17   ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
            I+ KL+  N  LW  QV A ++   L   +      P+  +           V   NP F
Sbjct: 18   ISEKLSKSNHALWKAQVMAAVRGARLEGHLTGATKTPNALITTTAGDKGEKEVTVRNPEF 77

Query: 77   VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
              W A D+ V   LLS+L  + L++  +  TA   W  L   Y+S  +AR +  R  L  
Sbjct: 78   DDWVATDQQVLGFLLSTLARDVLAQVATCGTAAAAWQMLEEMYSSVTRARFINTRIALSN 137

Query: 137  MRRGSLSVSEYGRKIR*S----------VANDDKVHWFLRGLGPSYANFSTGQL--DQVP 184
             ++G+LS++EY  K++            V +DD + + + GL  +Y    +  +  D + 
Sbjct: 138  TKKGTLSINEYVSKMKALADEMTAAGKIVDDDDLISYIIAGLDDTYEPVISTIVGKDTMT 197

Query: 185  LPLFTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGGHRGNSSSGS-- 242
            L      L   E     +   +  V   +         +  G+  +GG  RG +++G+  
Sbjct: 198  LGEAYSQLLSFEQRLALRHGGDSSVNLANRGRGGGGGQQRGGNTGNGGRGRGGNNNGANR 257

Query: 243  -RPRRDNGGSHCRGSYT--PRCQLCRKQGHYAAKCPVRWDR---PSESANLTHSFAAGCS 296
             R R +NGG+   G     P+CQLC K+GH    C  R+D    P E         AG +
Sbjct: 258  GRGRGNNGGARPPGGVDNRPKCQLCYKRGHTVINCWYRYDEDFVPDEKY-------AGSA 310

Query: 297  LNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS---RSAS 353
             +    ++ Y+DT AT H+T  L +LT    Y G D+V   +GAG+ I+HIG    R+ +
Sbjct: 311  TSYGIDTNWYVDTSATDHVTGELDKLTVRDRYKGQDQVHTASGAGMEISHIGHSTVRTPN 370

Query: 354  HSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKGFY 413
              + L N+L VP    NLVS ++L  D++      +  F +++  T  +L +GPC    Y
Sbjct: 371  RDIHLRNILYVPNANKNLVSANRLVSDNSAYMELYSKYFNLKDLATKKLLFRGPCRGRLY 430

Query: 414  VLDQGSQ----ALLATSSSLPRASFELWHSRLDHVNFDIIKKL---HKLGCFNVSSILPK 466
             L   S       L  +    + SFE WHSRL H    I++K+   + L C   S+   K
Sbjct: 431  ALPSSSPHERPRPLKEAFGAIKPSFERWHSRLGHPASPIVEKVISKNNLPCLAESN---K 487

Query: 467  PICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQF 526
               C +CQ  KS +L    +   +S  L+L++ D+WGP+ + SV G  Y+V F+ D+S+F
Sbjct: 488  QSVCDACQQGKSHQLPYSRSSSMSSHPLELIYSDVWGPA-LTSVGGKQYYVSFIGDYSKF 546

Query: 527  TWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRF 586
            TW Y +K KS+       F+A VE  F+R +   QSD G E+   K+ + F+  G+ H  
Sbjct: 547  TWLYLIKHKSEVIQKFHEFQALVERLFNRKIIAMQSDWGGEY--EKLHSFFTKIGITHHV 604

Query: 587  LCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQI 646
             CPHT  QNG  ERKHRH++E+GL +L +S +   +W  AF  AVY+INR P+K+L    
Sbjct: 605  SCPHTHQQNGSAERKHRHIVEVGLTLLAYSSMPLKFWDEAFQAAVYLINRTPTKLLQFLT 664

Query: 647  PFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTT 706
            P + LF   P Y++   FGC  +P LRPY  +K   R   C FLGY++ HKGFKC DP+T
Sbjct: 665  PLEHLFNQTPDYSSLRVFGCACWPHLRPYNTHKLQFRSKQCTFLGYSTLHKGFKCLDPST 724

Query: 707  SRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPTGP 766
             R Y+SR   FDE  FP            + T       S   L P  L+ P S+P    
Sbjct: 725  GRVYISRDVIFDETNFPFA---------KLHTNAGARLRSEILLLPSHLLNPTSNPGEQQ 775

Query: 767  L----------PCPSCVDPDVQPVPVD--------------------DAPPSPVPHNDAP 796
            L          P       +VQPV  +                    D   SP   + A 
Sbjct: 776  LDDNMANIPVNPANQIFGSNVQPVSAENDEADDSSGATENLAEQIHQDTAASPSASDTAA 835

Query: 797  PLPAPTSPTP----PATPLVVT-QAPPTSPPTTPPAVTQVPLAPVD-------------- 837
              P   +       PA+P V+T  +  +S P+ P   T  P +  D              
Sbjct: 836  SQPGAATSLDVVHFPASPDVMTHHSADSSSPSQPSHATATPASNDDVPSPSHISASEPFA 895

Query: 838  ---------ARPRTRS*NGIFKPNPRYALVHAQPTGLL--TALHTVT*PKGFKSAMKHPH 886
                     +RP TR   GI K       V+   T     T L     PK    A+++ +
Sbjct: 896  TGEEVTQAPSRPTTRLQRGIRKEK-----VYTDGTVKYKNTFLTVTGEPKNLTDALQNTN 950

Query: 887  WLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQ 946
            W  AM+ E  AL    TW LVP     NV+  K V++ K   DG+++R K  LVA+G  Q
Sbjct: 951  WKKAMDIEYEALMNNKTWHLVPPKQGRNVIDCKWVYKIKRKQDGSLDRYKARLVAKGFKQ 1010

Query: 947  IPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFV 1006
              G DY  TFSPVVKA T+R++LS+ V     + QLDV+NAFLHG L E VYM+QP G+ 
Sbjct: 1011 RYGIDYEDTFSPVVKAATIRIVLSIAVSRGWCMRQLDVQNAFLHGFLEEEVYMKQPPGYE 1070

Query: 1007 D------------LVSRLMCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCS 1054
            D             +  L        S  S +   LGF      + +             
Sbjct: 1071 DESFPGYVCKLDKALYGLKQAPRAWYSRLSKKLYDLGFQGSKGDTSLF------------ 1118

Query: 1055 FFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTAD 1114
            F+ KG + +++L+YVDDII+T S    ++  +  L  EFA+K LG L YFLG+E+    D
Sbjct: 1119 FYNKGGLIIFVLIYVDDIIVTSSRQEAVSALLQDLKKEFALKDLGDLHYFLGIEVNKVTD 1178

Query: 1115 GLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHL-VSSGE--GYSDPTHYRSLVGALQY 1171
             + L   KYA DLL    M +   V TPL+    L    G+  G  D T+YRS+VGALQY
Sbjct: 1179 EIILTQDKYACDLLRRVNMFDCKPVSTPLSTSEKLSAHEGDLLGPLDATNYRSVVGALQY 1238

Query: 1172 LTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSD 1231
            LT+TRPD+++ VN V QFL  PT  H+ AVKRI+RY+      GL   ++ S  V  YSD
Sbjct: 1239 LTLTRPDIAFPVNKVCQFLHAPTTVHWAAVKRILRYLKQCTKLGLKLCKSKSMLVSAYSD 1298

Query: 1232 ADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNL 1291
            ADWA   D RRST G+A+FLG NL+SW A+KQ +V+ SS ESEY+A+AN  +E++W+  L
Sbjct: 1299 ADWAGSLDDRRSTGGFAVFLGDNLVSWCARKQATVSRSSTESEYKALANATAEIMWVQTL 1358

Query: 1292 LHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPT 1351
            L EL+V+         DN  A +++ NPV H   KHI++D HFVRE VS   L +  VPT
Sbjct: 1359 LTELQVQSPPMAKLWCDNLGAKYLSSNPVFHARTKHIEVDYHFVRERVSQKLLEIDFVPT 1418

Query: 1352 SLQLADIFTKVLPRPLFDLFRSKLRVG 1378
              Q+AD FTK LP    + F+  L +G
Sbjct: 1419 GDQVADGFTKALPVRQLENFKHNLNLG 1445


>gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir||T31353 polyprotein
            - Arabidopsis arenosa Evelknievel retrotransposon
            (fragment)
          Length = 1390

 Score =  734 bits (1894), Expect = 0.0
 Identities = 482/1388 (34%), Positives = 714/1388 (50%), Gaps = 109/1388 (7%)

Query: 20   KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
            +L   NF++WS+QV A+L   +L   VD  V  P      P +      V   N  +  W
Sbjct: 31   RLTDSNFVMWSRQVHALLDGYDLAGYVDGSVPIP------PPTRTTDDGVVTTNNDYTLW 84

Query: 80   QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQLMRR 139
            + +D+ V S LL +++         A T+ +VW  L S +A+ + A   +LR +L+   +
Sbjct: 85   KRQDKLVYSALLGAISLSVQPLLSKANTSAEVWETLSSTFANPSWAHVQQLRQQLKQWTK 144

Query: 140  GSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQVPLPLFT 189
            G+ SV  Y +           +  +   ++++   L GL   Y         +   P  T
Sbjct: 145  GTKSVVTYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIESRENPPALT 204

Query: 190  DILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGGHRGNSSSGSRPRRDNG 249
            ++L K+ +H +  A+  +  +      +A N   ++ + +S    R N+S G+   ++N 
Sbjct: 205  EVLEKLINHEVKLAAKAEATSSVPITANAVNYRGNNNNNNSRSNGR-NNSRGNTSWQNNQ 263

Query: 250  GSHCRGSYTPR-----CQLCRKQGHYAAKCP-VRWDRPSESANLTHSFAAG--------C 295
             +  R  YTPR     CQ+C   GH A +CP ++    S ++N + S +           
Sbjct: 264  STTNRQQYTPRPYQGKCQICSVHGHSARRCPQLQQHAGSYASNQSSSSSYAPWQPRANMV 323

Query: 296  SLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGSR---SA 352
            S    N  +  +D+GAT H+T  L+ L     Y+G + V + +G+GL I+H GS    + 
Sbjct: 324  SATPYNSGNWLLDSGATHHLTSDLNNLALHQPYNGGEEVTIADGSGLPISHSGSALLPTP 383

Query: 353  SHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKGF 412
            + S+ L +VL VP +  NL+SV ++   + V   F    F +++  TGA L +G      
Sbjct: 384  TRSLDLKDVLYVPDIQKNLISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQGKTKNEL 443

Query: 413  YVLDQGSQALLATS---SSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILPKPIC 469
            Y     S   +ATS   S  P+     WH+RL H +  I+K L       +S  L   + 
Sbjct: 444  YEWPVNSS--IATSMFASPTPKTDLPSWHARLGHPSLPILKTLISKFSLPISHSLQNQLL 501

Query: 470  CTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQFTWF 529
            C+ C + KS +L  Y N   +S  L+ ++ D+W  SP+ S+D + Y+++ VD ++++TW 
Sbjct: 502  CSDCSINKSHKLPFYSNTIASSHPLEYLYTDVW-TSPITSIDNYKYYLVIVDHYTRYTWL 560

Query: 530  YPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRFLCP 589
            YPL++KS   +  + F A VEN+F   +    SDNG EF    +++  +S G+ H    P
Sbjct: 561  YPLRQKSQVRETFITFTALVENKFKSKIGTLYSDNGGEFI--ALRSFLASHGISHMTTPP 618

Query: 590  HTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQIPFQ 649
            HT   NG  ERKHRH++E GL +L  + +S  YW +AF+TAVY+INR+ + VL ++ P+ 
Sbjct: 619  HTPELNGISERKHRHIVETGLTLLSTASMSKEYWSYAFTTAVYLINRMLTPVLGNESPYM 678

Query: 650  LLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTTSRT 709
             LF   P Y     FGC  FP LRPY  +K   R  PC+ LGY+     + C D  T R 
Sbjct: 679  KLFGQPPNYLKLRVFGCLCFPWLRPYTAHKLDNRSMPCVLLGYSLSQSAYLCLDRATGRV 738

Query: 710  YVSRHAQFDEFCFPLTGSKSS---PND---------LVVFTFYEPAAGSPPS-------- 749
            Y SRH QF E  FP + +  S   P+D         + V     P   +PPS        
Sbjct: 739  YTSRHVQFAESIFPFSTTSPSVTPPSDPPLSQDTRPISVPILARPLTTAPPSSPSCSAPH 798

Query: 750  ---LQPVIL-------VVPESSPPTGPLPCPSC-----VDPDVQPVPVDDAPP-SPVPHN 793
                QP IL         P SS PT P+  PS      V  + +  P   +PP SP P +
Sbjct: 799  RSPSQPGILSPSAPFQPSPPSS-PTSPITSPSLSEESHVGHNQETGPTGSSPPVSPQPQS 857

Query: 794  DAPPLPAPTSPTPPA-----TPLVVTQAPPTSPPTTPPAVTQVPLAPVDARPRTRS*NGI 848
            +    P  TSP P +     +P  +T A   SP  +PP     P  P+    RTRS N I
Sbjct: 858  EQSTSPRSTSPQPNSPHTQHSPRSITPALTPSPSPSPPPNPNPP-PPIQHTMRTRSKNNI 916

Query: 849  FKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLVP 908
             KPNP++A +  +PT L   +     PK    A+  P+W  AM DE++A  +  T+ LVP
Sbjct: 917  VKPNPKFANLATKPTPLKPII-----PKTVAEALLDPNWRQAMCDEINAQTRNGTFDLVP 971

Query: 909  RPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRLI 968
                 NV+G K VF  K+  +G ++R K  LVA+G  Q  G D+  TFSPV+K+TTVR +
Sbjct: 972  PAPNQNVIGCKWVFTLKYLPNGVLDRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSV 1031

Query: 969  LSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL-----VSRL--MCVG*TRLS 1021
            L + V     + Q+DV NAFL G L++ VY+ QP GFVD      V RL     G  +  
Sbjct: 1032 LHVAVSKGWSIRQIDVNNAFLQGTLSDEVYVMQPPGFVDKDNPHHVCRLHKALYGLKQAP 1091

Query: 1022 TASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGH--ITLYLLVYVDDIILTGSDP 1079
             A  +          L S +L    V  I   S F   H    LY+LVYVDD+++TGSD 
Sbjct: 1092 RAWYQE---------LRSYLLTQGFVNSIADTSLFTLRHKRTILYVLVYVDDMLITGSDT 1142

Query: 1080 SLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHV 1139
            +++T FIA L   F++K LG++ YFLG+E T T+ GL L   +Y  DLL    ML A  V
Sbjct: 1143 NIITRFIANLAARFSLKDLGEMSYFLGIEATRTSKGLHLMQKRYVLDLLEKTNMLAAHPV 1202

Query: 1140 LTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHF 1198
            LTP++    L ++SG     P+ YR+++G+LQYL+ TRPD++YAVN +SQ++  PT  H+
Sbjct: 1203 LTPMSPTPKLSLTSGTPLDKPSEYRAVLGSLQYLSFTRPDIAYAVNRLSQYMHCPTDLHW 1262

Query: 1199 QAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSW 1258
            QA KRI+RY+      G+  R  +   +  YSDADWA  TD+  ST  Y ++LG   +SW
Sbjct: 1263 QAAKRILRYLAGTPSHGIFIRADTPLKLHAYSDADWAGDTDNYNSTNAYILYLGSTPISW 1322

Query: 1259 SAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQN 1318
            S+KKQ  VA SS E+EYRA+AN  SE+ W+ +LL EL + LS+PP+   DN  A +++ N
Sbjct: 1323 SSKKQNGVARSSTEAEYRAVANATSEIRWVCSLLTELGITLSSPPVVYCDNVGATYLSAN 1382

Query: 1319 PVAHKHAK 1326
            PV     K
Sbjct: 1383 PVFDSRMK 1390


>gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
            gi|51854440|gb|AAU10819.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
          Length = 1679

 Score =  730 bits (1884), Expect = 0.0
 Identities = 453/1169 (38%), Positives = 629/1169 (53%), Gaps = 91/1169 (7%)

Query: 275  PVRWDRPSESANLTHSFAAGCSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGND-- 332
            P +WD+    A+L  SF    +L+    +D YMDTGAT+HMT     L+ SH  + N   
Sbjct: 535  PSQWDQ----ASLAGSFNT-TTLHQPATNDWYMDTGATAHMTSDTGILSLSHPPNPNSPS 589

Query: 333  RVLVGNGAGLHITHIGSRSASH---SVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEA 389
             ++VGNG+ + +T IG     H   S  L ++L  P +  NL+SV +   D+     F+ 
Sbjct: 590  HIVVGNGSTIPVTSIGHSKICHPNCSFTLRDILCSPAIIKNLISVRRFVIDNWCSVEFDP 649

Query: 390  DSFVIQNRQTGAVLGKGPCDKGFYVLDQGSQALLATSSSLPRASFEL-WHSRLDHVNFDI 448
              F +++ +T  V+ +       Y L        A ++ L  +S +L WH RL H+  D 
Sbjct: 650  FGFSVKDLRTRTVIARFNSSGPLYSLHHALPPPPAATALLANSSTDLLWHRRLGHLGHDA 709

Query: 449  IKKLHKLGCFNVSSILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVA 508
            + +L  +       +      C +CQ+ +  RL    +  RAS   +L HCDLW  SPV 
Sbjct: 710  LNRLAAVVPMTRGDLTG---VCHACQLGRHVRLPFASSTSRASTNFELFHCDLW-TSPVV 765

Query: 509  SVDGFSYFVIFVDDFSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEF 568
            S  GF Y+++ +DD S + W +PL+ KSD +  L  F A+V+ QF+ +++  Q DNG EF
Sbjct: 766  SASGFKYYLVILDDCSHYVWTFPLRFKSDTFTTLSHFFAYVKTQFATNIRSIQCDNGREF 825

Query: 569  TNNKVQALFSSSGVLHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFS 628
             N+  +  F ++GV  R  CPHT  QNG+ ER  R +  +  +ML+ + +  S+WV A  
Sbjct: 826  DNSAARTFFLTNGVHLRMSCPHTSPQNGKAERILRSLNNIVRSMLFQAKLPGSFWVEALH 885

Query: 629  TAVYIINRVPSKVLSDQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCI 688
            TA ++INR P+K L    P   L+   P+Y++   FGC+ +P L     +K +PR T C+
Sbjct: 886  TATHLINRHPTKTLDRHTPHFALYGTHPSYSHLRVFGCKCYPNLSATTPHKLAPRSTMCV 945

Query: 689  FLGYNSHHKGFKCFDPTTSRTYVSRHAQFDEFCFP---LTGSKSSPNDLVVFTFYEPAAG 745
            FLGY  +HKG++CFDP ++R  +SRH  FDE  FP   LT   S+  DL     +   A 
Sbjct: 946  FLGYPLYHKGYRCFDPLSNRVIISRHVVFDEHSFPFTELTNGVSNATDLDFLEDFTAPAQ 1005

Query: 746  SPPSLQPVILVVPESSPPTGPL--------PC-----------PSCVDPDVQPVPVDDAP 786
            +P        V P +   + P+        PC           PS  D  + P     A 
Sbjct: 1006 APIGATRRPAVAPTTQTASSPMVHGLERPPPCSPTRPVSTPGGPSSPDSRLGPPSPTPAL 1065

Query: 787  PSPVPHNDAPPL--PAPTSPTPPATPLVVTQAPPTSPPTTP-------PAVTQVP----- 832
              P   +  PP   PAP++ T PA+    T A P +PP  P       P    VP     
Sbjct: 1066 IGPASTSPGPPSAGPAPSASTCPASSTWETVARPPTPPGLPRLDGPHLPPAPHVPRRLRS 1125

Query: 833  --------------LAPV--DARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PK 876
                          ++PV  D    TR+ +G  KP  R  L HA P  L+        PK
Sbjct: 1126 VRATGAPTPLSGLEISPVVNDHVMTTRAKSGHHKPVHRLNL-HAAPLSLV--------PK 1176

Query: 877  GFKSAMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLK 936
             +++A+  P W  AME+E +AL    TW LVPRP   NVV  K +F+ KFH+DG+++R K
Sbjct: 1177 TYRAALADPLWRAAMEEEYNALLANRTWDLVPRPAGVNVVTGKWIFKHKFHADGSLDRYK 1236

Query: 937  TCLVAQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTET 996
               V +G TQ PG D+  TFSPVVK  TVR +LSL V  D  +HQLDVKNAFLHG L ET
Sbjct: 1237 ARWVLRGFTQRPGVDFDETFSPVVKPATVRTVLSLAVSRDWPVHQLDVKNAFLHGTLQET 1296

Query: 997  VYMEQPHGFVDLVSRLMCVG*TRLSTASNRRLV-LGFSA*ALFSCVLVF-RVVGLIHPCS 1054
            VY  QP GFVD     M           N+ L  L  +  A +S    F + +G +   S
Sbjct: 1297 VYCTQPPGFVDSAKPDMV-------CCLNKSLYGLKQAPRAWYSRFTTFLQSIGFVEAKS 1349

Query: 1055 -----FFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEI 1109
                   ++G+ T+YLL+YVDDI+LT S  +LL   I+ L  EF++K LG L +FLG+ +
Sbjct: 1350 DTSLFILHRGNDTVYLLLYVDDIVLTASSRTLLHWTISALQGEFSMKDLGALHHFLGVSV 1409

Query: 1110 TYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLVSS-GEGYSDPTHYRSLVGA 1168
            T  + GL L   +Y  D+L  A M +     TP+   + L SS G   +DPT +RSL GA
Sbjct: 1410 TRNSAGLVLSQRQYCIDILERAGMADCKPCNTPVDTTAKLSSSDGPPVADPTDFRSLAGA 1469

Query: 1169 LQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLG 1228
            LQYLT TRPD+SYAV  V   +  P   H  A+KRI+ Y+      GL  +R+S+  +  
Sbjct: 1470 LQYLTFTRPDISYAVQQVCLHMHDPREPHLAALKRILHYIRGSVDLGLHIQRSSACDLAV 1529

Query: 1229 YSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWL 1288
            YSDADWA C D RRST GYA+FLG NL+SWS+K+Q +V+ SS E+EYRA+AN  +E+ WL
Sbjct: 1530 YSDADWAGCPDTRRSTSGYAVFLGDNLVSWSSKRQHTVSRSSAEAEYRAVANAVAEVTWL 1589

Query: 1289 LNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRH 1348
              LL EL    S   L   DN SA++++ NPV H+  KH+++D HFVRE V+ G + V H
Sbjct: 1590 RQLLQELHSPPSRATLVYCDNVSAVYLSSNPVQHQRTKHVEIDLHFVRERVAVGAVRVLH 1649

Query: 1349 VPTSLQLADIFTKVLPRPLFDLFRSKLRV 1377
            VPT+ Q ADIFTK LP P+F  FRS L V
Sbjct: 1650 VPTTSQYADIFTKGLPTPVFTEFRSSLNV 1678



 Score = 53.1 bits (126), Expect = 6e-05
 Identities = 48/191 (25%), Positives = 80/191 (41%), Gaps = 20/191 (10%)

Query: 79  WQARDRNVNSLLLSSLTEEALSETLSA-TTARDVWTALHSAYASRNKAREMRLRDELQLM 137
           W   D  V + L  +++ + L + L+  TTAR VW  L   +   ++ R + L  E    
Sbjct: 215 WVQMDCVVLAWLFGTISFDLLQDVLATDTTARLVWRGLEYQFLGNSEQRALNLTTEFHTF 274

Query: 138 RRGSLSVSEYGRKIR----------*SVANDDKVHWFLRGLGPSYANFSTGQLDQVPLPL 187
           ++G LSV EY RK++            V +   V   L GL   + N  +    Q P P 
Sbjct: 275 QQGDLSVDEYCRKMKTFADSLGDVGEPVRDRTLVLNTLNGLSEKFNNLRSLVPMQRPFPT 334

Query: 188 FTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSS-------GGGHRGNSSS 240
           F ++   +    + + +     +  SA F A   T + G  ++       G G++G    
Sbjct: 335 FAELRSLLRLEELSKPN--HAASASSAVFLATGSTTNGGKGANPAHGTGYGAGNQGGGGK 392

Query: 241 GSRPRRDNGGS 251
           G+  RR  GG+
Sbjct: 393 GNSNRRRRGGN 403


>ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1373

 Score =  667 bits (1722), Expect = 0.0
 Identities = 402/1028 (39%), Positives = 546/1028 (53%), Gaps = 95/1028 (9%)

Query: 412  FYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILPKPICCT 471
            FY     + ALLA  +SL       WH RL H+  + + KL +    + +     P  C 
Sbjct: 374  FYPPATSTHALLAAPTSL-------WHRRLGHLGREALSKLIRSSVISCTKD-DLPHLCH 425

Query: 472  SCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQFTWFYP 531
            +CQ+    RL    +  RAS   DL+HCDLW  SP+ SV G+ Y+++ +DD S + W +P
Sbjct: 426  ACQLGHHTRLPFSSSSSRASNNFDLIHCDLW-TSPIVSVSGYKYYLVILDDCSHYIWTFP 484

Query: 532  LKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRFLCPHT 591
            L+ KSD +  +  F A V+ QF  ++K  Q DNG EF N+  +  F S GV  R  CP+T
Sbjct: 485  LRLKSDTFSTIANFFAHVKTQFGTTIKSVQCDNGREFDNSPARTFFLSHGVAFRMSCPYT 544

Query: 592  QAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQIPFQLL 651
              QNGR ER  R +  +  ++L+ + +   YWV A  TA  ++NR+P+K LS   P+  L
Sbjct: 545  SQQNGRAERSLRTLNNILRSLLFQACLPPVYWVEALHTATLLVNRIPTKTLSSSTPYFHL 604

Query: 652  FQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTTSRTYV 711
            +   PTY +   FGC  +P +     +K +PR + C+FLGY+S HKG++C +  ++R   
Sbjct: 605  YSTQPTYDHLRVFGCACYPNMSSTAPHKLAPRSSLCVFLGYSSEHKGYRCLELGSNRIIT 664

Query: 712  SRHAQFDEFCFPLTGSKSSP---NDLVVFTFYEPAAGSPPSLQPVILVVPESS----PPT 764
            SRH  FDE  FP     +SP   + L +F         PP  + V      ++     P+
Sbjct: 665  SRHVVFDESFFPFADMSTSPMASSALDIFLDDNELTAQPPRAKFVHAGTSSAARGAVEPS 724

Query: 765  GPLPCPSCVDPDVQPVPVDDAPPSPVPHNDAP---------------------------- 796
             P P PS + P     P   A P   PH  +P                            
Sbjct: 725  TPPPAPSSIGPRS---PATLAGPEAGPHGGSPAGAATSQPGAISPARTAAPSAATSTTRA 781

Query: 797  ----PLPAPTSPTPPATPLVVTQAPP---------TSPPTTPPAVTQVPLAPVDARP--R 841
                P  A +  TP  +PL  T APP         T+      A   V +APVD     R
Sbjct: 782  VTSAPRAATSGTTPSLSPLAGTAAPPPRAEVAASSTAATGRTLATRPVSIAPVDNAHSMR 841

Query: 842  TRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKK 901
            TR   G+ +P  R  L HA P   +        P+  + A+  P+W  AM+ E  AL   
Sbjct: 842  TRGKAGMAQPVDRLNL-HAAPLSPV--------PRSVREALSDPNWRAAMQAEFDALLAN 892

Query: 902  FTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVK 961
             TW+LVPRP   N+V  K +FR K HSDG+++R K   V +G TQ PG DY  TFSPVVK
Sbjct: 893  DTWSLVPRPRGVNLVTGKWIFRHKLHSDGSLDRYKARWVLRGFTQRPGVDYDETFSPVVK 952

Query: 962  ATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVD-----LVSRL--MC 1014
              TVR++LSL +  D  +HQLDVKNAFLHG L+ETVY  QP GF D     LV RL    
Sbjct: 953  PATVRVVLSLALSQDWPIHQLDVKNAFLHGTLSETVYCIQPTGFADPSHADLVCRLNKSL 1012

Query: 1015 VG*TRLSTASNRRLV-----LGFSA*ALFSCVLVFRVVGLIHPCSFFYKGHITLYLLVYV 1069
             G  +   A + R       LGF      S + + R            +G+ T+ LL+YV
Sbjct: 1013 YGLKQAPRAWHHRFASHLISLGFIEAQSDSSLFIHR------------RGNDTVLLLLYV 1060

Query: 1070 DDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLS 1129
            DDI+LT S  SLL   IA L  EFA+  +G L +FLG+ +T  A GLFL   +Y+ D+L 
Sbjct: 1061 DDIVLTASSASLLQQVIAALQREFAMTDMGPLHHFLGITVTRFASGLFLSQRQYSQDILE 1120

Query: 1130 HALMLEASHVLTPLAAGSHLVSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQF 1189
             A M E     TP+   S L + G   +D T YRSL GALQYLT TRPD+++AV  V  +
Sbjct: 1121 RAGMGECKPCSTPVDVHSKLSADGPPVADSTQYRSLAGALQYLTFTRPDIAFAVQQVCLY 1180

Query: 1190 LQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAI 1249
            +  P   H  A+KRI+RY+      GLT RR+    ++ Y+DADWA C D RRST GYA+
Sbjct: 1181 MHDPREPHLAALKRILRYIQGTLSLGLTMRRSPPTDLVVYTDADWAGCPDTRRSTSGYAV 1240

Query: 1250 FLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDN 1309
            FLG NL+SWS+K+Q +V+ SS E+EYRA+AN  +E  WL  LL EL        +   DN
Sbjct: 1241 FLGDNLVSWSSKRQHTVSRSSAEAEYRAVANGVAEATWLRQLLMELHRPPRTATVVYCDN 1300

Query: 1310 QSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFD 1369
             SA++++ NPV H+  KH+++D HFVRE V+ G + V HVPT+ Q AD+FTK LP  LF 
Sbjct: 1301 VSAMYLSSNPVQHQRTKHVEIDLHFVREKVALGHVRVLHVPTTSQYADVFTKGLPTSLFQ 1360

Query: 1370 LFRSKLRV 1377
             FR+ L +
Sbjct: 1361 EFRTSLTI 1368


>gb|AAD14478.1| Strong similarity to gb|AF039376 Evelknievel retrotransposon
            polyprotein from Arabidopsis arenosa. [Arabidopsis
            thaliana] gi|25301711|pir||E96624 hypothetical protein
            T2K10.7 [imported] - Arabidopsis thaliana
          Length = 1194

 Score =  542 bits (1396), Expect = e-152
 Identities = 378/1230 (30%), Positives = 592/1230 (47%), Gaps = 145/1230 (11%)

Query: 20   KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
            KL   N+++W++QV A+L   +L   +D  V  PSE +           V A NP +  W
Sbjct: 30   KLTPTNYIMWNRQVHALLDGYDLAGYIDGSVTAPSEMITTAG-------VSAANPAYKFW 82

Query: 80   QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQLMRR 139
            + +D+ + S +L ++T         + TA ++W  L S YA+ +     ++R  ++   +
Sbjct: 83   KRQDKLIYSAILGTITTTIQPLLSRSNTAAEIWEKLKSIYATPSWGHIQQMRQHIKQWSK 142

Query: 140  GSLSVSEY--GRKIR*S--------VANDDKVHWFLRGLGPSYANFSTGQLDQVPL---- 185
            G+ +++EY  G   R          + + +++ + L GL   Y +     +DQ  +    
Sbjct: 143  GTKTITEYFQGHTTRFDELALLGKPLEHAEQIEFLLGGLSEDYKSV----VDQTEIRDKP 198

Query: 186  PLFTDILWKV---ESHAIFQASLEDYVTPPS--AAFHARNPTRSSGSQSSGGGHRGNSSS 240
            P  T++L K+   E+  +  A+     T PS  A  HA N   +S +      +R N S 
Sbjct: 199  PTLTELLEKLLNREAKLMCAAA-----TTPSLPATAHAANYKGNSNNNQYNNNNRNNKSH 253

Query: 241  GSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGCSLNNS 300
            GS        S     +TP                  W +P  +A +   +         
Sbjct: 254  GSSYNSQQ--SIPASPFTP------------------W-QPRANAAIASPY--------- 283

Query: 301  NRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGSR---SASHSVP 357
            N ++  +D+GAT H+T  L+ L+    Y+G + V + +G+GL I+H GS    + S S+ 
Sbjct: 284  NANNWLLDSGATHHITSDLNNLSLHQPYTGGEDVTIADGSGLSISHTGSALISTPSRSLA 343

Query: 358  LSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKGFYVLDQ 417
            L++VL VP +  NL+SV ++   + V   F    F +++ +TG  L +G      Y    
Sbjct: 344  LTDVLYVPNIHKNLISVYRMCNANKVSVEFFPAHFQVKDLKTGVQLLQGRTKDELYEWPV 403

Query: 418  GSQALLAT-SSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILPKPICCTSCQMA 476
                  +  +++ P+     WHSRL H +   +K +       VS+ L K   C+ C + 
Sbjct: 404  NPPKPSSHFTTTTPKTDLTSWHSRLGHPSLSTLKVVVSQFSLPVSNSLQKQFNCSDCLLN 463

Query: 477  KSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQFTWFYPLKRKS 536
            K+ +L  + N   ++  L+ ++ DLW  SP+ S+D F Y+++ VD +++++WFYP+K+KS
Sbjct: 464  KTHKLPFHTNTITSTQPLEYLYIDLW-TSPIVSIDNFKYYLVIVDHYTRYSWFYPIKQKS 522

Query: 537  DFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRFLCPHTQAQNG 596
               DV + FKA V N+F R +    SDNG EF    +++  SS+G+ H    PHT   NG
Sbjct: 523  HVKDVFMTFKALVANKFQRKIIHLYSDNGGEFI--ALRSFLSSNGITHLTTPPHTPEHNG 580

Query: 597  RVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQIPFQLLFQVAP 656
              ERKHRH++E GL +L  + +  SYW +AF+ A+Y+INR+ S V+    P++ LF  AP
Sbjct: 581  ISERKHRHIVETGLTLLGQASMPKSYWSYAFTIAIYLINRMSSDVIGGISPYKRLFGQAP 640

Query: 657  TYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTTSRTYVSRHAQ 716
             Y     FGC  FP LRPY  +K   R  PC+FLGY+     + C + TT R Y SRH Q
Sbjct: 641  NYLKLRVFGCLCFPWLRPYTTHKLDDRPAPCVFLGYSQTQSAYLCLNRTTGRVYTSRHVQ 700

Query: 717  FDEFCF--------PLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPTGPLP 768
            F E  +        P T  + S N  +  T   P     PS+ P     P   PP+ P P
Sbjct: 701  FVENTYPFTKPTLDPFTNLEESNNHSITTTVPSPPFVQLPSVPPPTR-DPHQPPPSQPAP 759

Query: 769  CPSCVDPDVQPVPVDDAPPS---------------------PVPHNDAPPLPAPT----- 802
             PS + P     PV  + P                        P N   P+ +PT     
Sbjct: 760  SPSPLSPPSMSSPVMTSSPQFSSNRDSTTLHGDYSHVDYGLSSPSNPPGPITSPTTSKSP 819

Query: 803  -------------SPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDARP-RTRS*NGI 848
                         + TPP +P   + +P   P  +P +    P  P +    RTR+ N I
Sbjct: 820  SEPTSSPSHSNQPNKTPPNSPSSSSSSPTPIPSPSPQSSNSPPPPPQNQHSMRTRAKNNI 879

Query: 849  FKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLVP 908
             KP  +  L  A P G      TV        A++ P+W  AM +E +A  +  T+ LVP
Sbjct: 880  TKPIKKLTLA-ATPKGKSKIPTTVA------EALRDPNWRNAMSEEFNAGLRNSTYDLVP 932

Query: 909  RPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRLI 968
                 N VG++ +F  K++ DG++ R K   +A+G  Q  G DYS TFSPV+K+TTV+ +
Sbjct: 933  PKPHQNFVGTRWIFTIKYNPDGSINRYKARFLAKGFHQQHGLDYSNTFSPVIKSTTVQTV 992

Query: 969  LSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVD------LVSRLMCVG*TRLST 1022
            L + V     + QLD+ NAFL G LTE VY+ QP GF++      +      +   + + 
Sbjct: 993  LDIAVSRSWDIRQLDINNAFLQGRLTEDVYVAQPPGFINPDRPNYVCHLKKALHGLKQAP 1052

Query: 1023 ASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGHIT--LYLLVYVDDIILTGSDPS 1080
             +  + + GF    L +C     V       S F + H    +Y+LVYVDD ++TGS+ +
Sbjct: 1053 RAWYQELRGF----LLTCGFTNSVAN----TSLFIRQHNKDYIYILVYVDDFLITGSNSN 1104

Query: 1081 LLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVL 1140
            L+  FI CL + F++K LG+L YFL +E T T  GL L   +Y  DLL+   ML+A  V 
Sbjct: 1105 LIAQFITCLANRFSLKDLGQLSYFLEIEATRTKAGLHLMQRRYVLDLLTKTKMLDAKTVS 1164

Query: 1141 TPLAAGSHL-VSSGEGYSDPTHYRSLVGAL 1169
            TP++    L ++SG    +P  YR ++G+L
Sbjct: 1165 TPMSPTPKLTLTSGTPIDNPGGYRQILGSL 1194


>ref|XP_507106.1| PREDICTED OJ1499_A07.20 gene product [Oryza sativa (japonica
            cultivar-group)]
          Length = 1427

 Score =  506 bits (1303), Expect = e-141
 Identities = 420/1458 (28%), Positives = 638/1458 (42%), Gaps = 189/1458 (12%)

Query: 20   KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
            +L + N+  W  +V A++++Q ++D ++P                      A +P     
Sbjct: 25   QLTATNYTSWCIRVQAMMEDQGVWDAIEPAAGV------------------AVDP----- 61

Query: 80   QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMR---LRDELQL 136
              RD+   S LL SL E+ L +     +A++VW  L + +   ++ RE R   L+ E   
Sbjct: 62   -RRDKKSKSHLLQSLPEDLLMQVAKKRSAKEVWDCLKTRFVGADRVREARLQTLKGEFGA 120

Query: 137  M----------RRGSLSVSEYGRKIR*SVANDDKVHWFLRGLGPS-YANFSTG-----QL 180
            M            G ++       +  S  +D  +   L    P  + +   G     ++
Sbjct: 121  MVMEPGETLDQYAGRITAMSVRHSVLGSTLSDSAIVKKLFDTVPEKFISLVAGIEQFYEI 180

Query: 181  DQVP-------LPLFTD------------------ILWKVESHAIFQASLEDYVTPPSAA 215
            D +P       L  + +                  +L + E  A F+    +  +P    
Sbjct: 181  DNMPFEEAVGRLKAYEERVRKKKAAAGGVTADGQVLLTQAEWEARFRKDGSESSSPQKNK 240

Query: 216  FHARNPTRSSGSQSSGGGHRGNSSSGSRPRRDNGGSHCRGSYTP---RCQLCRKQGHYAA 272
              +    R+ G +  G GH G    GS PR    G    G       +C  C + GHY+ 
Sbjct: 241  PPSDRGNRAQGGRGRGRGHGGGGGRGSAPRNSGAGGSGGGGRDKSHIKCYNCEEFGHYST 300

Query: 273  KCPVRWDRPSESANLTHSFAAGCSLNNSNRSDK--------------------------- 305
            +CP    +  E A+L  +  A  +L  +   D+                           
Sbjct: 301  QCPHPKKKKVE-AHLAQTDDANPALLLAVTEDEPASGLVVHEERVWPQLLLADSGAATGD 359

Query: 306  --YMDTGATSHMTHSLSQ-------LTYSHAYSGNDRVLV-GNGAGLHITHIGSRSASHS 355
              ++D GA++HMT   ++       +T S  +     V + G G+ L     G +     
Sbjct: 360  IWFLDNGASNHMTGDRAKFRDLDVSITGSVKFGDASTVKIQGKGSILFSCKNGDQWL--- 416

Query: 356  VPLSNVLVVPRLTNNLVSVSKLTR-------DHNVRAIFEADSF--VIQNRQTGAVLGKG 406
              L +V  +P L  N+VS+ +LT        D +V  +F+      V++ R+T   L + 
Sbjct: 417  --LQDVFYIPSLCCNMVSLGQLTETGHRVVMDEDVLEVFDKSPLRLVMRVRRTPNRLYR- 473

Query: 407  PCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFN-VSSILP 465
                    L   +   L T    P     LWH+RL HVNF  +K L   G    + +I  
Sbjct: 474  ------IELKLATPVCLLTRMDEPAW---LWHARLGHVNFQAMKLLADKGMAGGIPAITH 524

Query: 466  KPICCTSCQMAKSKRLVCYDNHK-RASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFS 524
                C +C +AK  R         RA   L+L+H DL GP    ++ G  YF++ VDDFS
Sbjct: 525  PNQLCQACLVAKQIRQPFPATANFRAEEPLELLHIDLCGPITPTTMAGNRYFMLIVDDFS 584

Query: 525  QFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLH 584
            ++ W + +K K    +   +FK   EN   R +K  +SD G EF + +   L   +G+  
Sbjct: 585  RWMWMFVIKTKDQALEAFTKFKPLAENTAGRRIKTLRSDRGGEFLSGEFAQLCEQAGIQR 644

Query: 585  RFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSD 644
                P++  QNG VER++R V+ +  +++    V   +W  A   AVY++NR+P+K + D
Sbjct: 645  HLTAPYSPQQNGVVERRNRSVMAMARSLMKGMSVPGRFWGEAVRHAVYLLNRLPTKAMGD 704

Query: 645  QIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDP 704
            + PF+      P   +   FGC     +    + K   R  P ++LG     K  + FDP
Sbjct: 705  RTPFEAWTGRKPQLGHLRVFGCIAHAKITTPNQKKLDDRSAPYVYLGVEEGSKAHRLFDP 764

Query: 705  TTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPT 764
               R +VSR   F+E         +   +   FT  E    +PP           +  P 
Sbjct: 765  RCGRIHVSRDVIFEENVPWQWSVVAGEQNSTEFTVEEDGVDAPP-----------AGAPA 813

Query: 765  GPLPCPSCVDPDVQPVPVDDAPPSPVPHNDAPPLPAPTSP--TPPATPLVVTQAPPTSPP 822
             P+P      P V   P    P SPV  + + P    +SP  TPP+TP   +  P  SP 
Sbjct: 814  YPVPRYRAPSPAVPQSP----PASPVGASSSLPTSPQSSPSSTPPSTPATGSAGPVASPG 869

Query: 823  TTPPAVTQVPLAPVDARPRTRS*NGIFKPNPRYALVHAQPTG--LLTALHTVT*PKGFKS 880
            +         L   +   R RS   I +  PR  LV  +  G  LL  +     P  ++ 
Sbjct: 870  SGG------DLRSDEGPVRFRSLEDIMREAPRVDLVEDEHDGDALLAEMEE---PSSYRE 920

Query: 881  AMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLV 940
            A   P W  AM  EL A+ K  TW L   P     +G K V++ K ++ G V + K  LV
Sbjct: 921  AAGQPAWENAMAQELQAIEKNSTWALTALPAGHKPIGLKWVYKLKKNTAGEVIKHKARLV 980

Query: 941  AQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYME 1000
            A+G  Q  G D+   F+PV +  TVR+IL++      ++H LDVK+AFL+G L E VY+ 
Sbjct: 981  AKGYVQRQGVDFEEVFAPVARLDTVRVILAIAADRRWEVHHLDVKSAFLNGDLEEEVYVA 1040

Query: 1001 QPHGFV-----DLVSRL--MCVG*TRLSTASNRRL-----VLGFSA*ALFSCVLVFRVVG 1048
            QP GFV      LV RL     G  +   A N RL      LGF+       V       
Sbjct: 1041 QPEGFVKRGEEHLVLRLSKALYGLRQAPRAWNTRLDKCLKELGFARCTQEQAVYTRG--- 1097

Query: 1049 LIHPCSFFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLE 1108
                     KG   + + VYVDD+I+TG +P  +  F   +  EF +  LG L Y+LG+E
Sbjct: 1098 ---------KGQAGVIVGVYVDDLIVTGENPHEIAMFKQQMMGEFEMSDLGLLSYYLGIE 1148

Query: 1109 ITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLVSSGEGYS-DPTHYRSLVG 1167
            +    +G+ +  A YA  +LS   M   +    P+   S L    +G   D T YR ++G
Sbjct: 1149 VIQGENGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPRSLLHKDADGNPIDATEYRRVIG 1208

Query: 1168 ALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTS-SPAV 1226
             L+YL  TRPDLSYAV   S+F++ PT  H +AVK I+RY+      GL F   S S  +
Sbjct: 1209 CLRYLLHTRPDLSYAVGVASRFMERPTTMHLKAVKMILRYLKGTLDSGLVFASGSGSLDI 1268

Query: 1227 LGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELV 1286
             G++D+D A   D RRST G A ++  +L+SW ++KQ +VA SSCE+E+ A    A   +
Sbjct: 1269 TGFTDSDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQKTVALSSCEAEFMAATAAACHAL 1328

Query: 1287 WLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAV 1346
            WL  LL E+    +       DN+SA+ + +NPV H  +KHID   HF+RE V SG++ +
Sbjct: 1329 WLRALLSEMMGTEAKRVKLFVDNKSAIALMKNPVFHGRSKHIDTRYHFIRECVESGQILI 1388

Query: 1347 RHVPTSLQLADIFTKVLP 1364
              V +  Q AD  TK LP
Sbjct: 1389 EFVRSEEQRADAMTKGLP 1406


>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
           gi|7444456|pir||T01908 hypothetical protein T12H20.12 -
           Arabidopsis thaliana
          Length = 1392

 Score =  447 bits (1151), Expect = e-124
 Identities = 270/766 (35%), Positives = 403/766 (52%), Gaps = 53/766 (6%)

Query: 16  MITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPL 75
           ++T+KL   N+LLW  Q  + L +  L   V      P+  ++              N  
Sbjct: 15  VVTLKLTPTNYLLWKTQFESYLSSHLLLGFVTGATPRPASTIIVTKDDIQSEEA---NQE 71

Query: 76  FVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQ 135
           F++W   D+ V + +  SL+EEAL   +   +A++VW  L   +   +  R+  L+  L 
Sbjct: 72  FLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQKRLG 131

Query: 136 LMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFST---GQLDQ 182
              +   ++  Y  +++            V   +K+   L GLG  Y + +T     LD 
Sbjct: 132 TCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHSLDV 191

Query: 183 VPLPLFTDILWKVESHAIFQASLEDYVT----PPSAAFHARNPTRSSGSQSSGGGHRGN- 237
            P P F D+++K+ +   F   L  Y       P  AF+      S G+ +S GG  GN 
Sbjct: 192 YPGPCFDDVVYKLTT---FDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNSRGGRYGNF 248

Query: 238 ------SSSGSRPRRDNGGSHCRGSYT---PRCQLCRKQGHYAAKCPVRWDRPSESANLT 288
                 SS G    +  G     GS     P CQ+CRK GH A KC  R++      +L 
Sbjct: 249 RGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFKCYTRFEENYLPEDLP 308

Query: 289 HSFAA-GCSLNNSNRSDKYM-DTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITH 346
           ++FAA   S  N   S +++ D+ AT+H+T++   L  S  YSG+D V+VGNG  L ITH
Sbjct: 309 NAFAAMRVSDQNQASSHEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITH 368

Query: 347 IGS---RSASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVL 403
           IG+     +  ++PL +VLV P +T +L+SVSKLT D+     F++DS VI++++T  +L
Sbjct: 369 IGTIPLNISQGTLPLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLL 428

Query: 404 GKGPCDKGFYVL-DQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSS 462
            +G   KG YVL D   Q   +T      +  E+WH RL H N ++++ L K     V+ 
Sbjct: 429 TQGNKHKGLYVLKDVPFQTYYSTRQQ--SSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNK 486

Query: 463 ILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDD 522
                  C +CQM K  RL    +   +S  L+ +HCDLWGP+PV S  GF Y+VIF+D+
Sbjct: 487 TSSN--MCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDN 544

Query: 523 FSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGV 582
           +S+FTWFYPLK KSDF+ V V F+  VENQ+   + +FQ D G EF + K  A  +S G+
Sbjct: 545 YSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGGEFVSYKFVAHLASCGI 604

Query: 583 LHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVL 642
                CPHT  QNG  ER+HR++ ELGL++++HS V    WV AF T+ ++ N +PS  L
Sbjct: 605 KQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHKLWVEAFFTSNFLSNLLPSSTL 664

Query: 643 SD-QIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKC 701
           SD + P+++L    P Y     FG   +P LRPY KNKF P+   C+FLGYN+ +KG++C
Sbjct: 665 SDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKFDPKSLLCVFLGYNNKYKGYRC 724

Query: 702 FDPTTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSP 747
             P T + Y+ RH  FDE  FP +          +++ ++  +GSP
Sbjct: 725 LHPPTGKVYICRHVLFDERKFPYSD---------IYSQFQTISGSP 761



 Score =  391 bits (1004), Expect = e-106
 Identities = 228/535 (42%), Positives = 314/535 (58%), Gaps = 15/535 (2%)

Query: 848  IFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLV 907
            I KPNP+YAL   +             PK  K A+K   W  AM +E+  +H+  TW LV
Sbjct: 778  ITKPNPKYALFSVKSN--------YPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWDLV 829

Query: 908  PRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRL 967
            P      ++G K VF+TK +SDG+++RLK  LVA+G  Q  G DY  T+SPVV++ TVR 
Sbjct: 830  PPEMVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATVRS 889

Query: 968  ILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDLVSRLMCVG*TRLSTASNRR 1027
            IL +  +N   L QLDVKNAFLH  L ETV+M QP GF D  SR   V   + +    ++
Sbjct: 890  ILHVATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFED-PSRPDYVCKLKKAIYDLKQ 948

Query: 1028 LVLGFSA*ALFSCVLVFR--VVGLIHPCSFFY-KGHITLYLLVYVDDIILTGSDPSLLT* 1084
                +     FS  L+    +     P  F Y KG   ++LL+YVDD+ILTG++  LL  
Sbjct: 949  APRAWFD--KFSSYLLKYGFICSFSDPSLFVYLKGRDVMFLLLYVDDMILTGNNDVLLQQ 1006

Query: 1085 FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLA 1144
             +  L+ EF +K +G L YFLG++  Y  DGLFL   KY  DLL +A M + S + TPL 
Sbjct: 1007 LLNILSTEFRMKDMGALHYFLGIQAHYHNDGLFLSQEKYTSDLLVNAGMSDCSSMPTPLQ 1066

Query: 1145 AGSHLVSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRI 1204
                L  + + + +PT++R L G LQYLT+TRPD+ +AVN V Q +  PT+  F  +KRI
Sbjct: 1067 LDL-LQGNNKPFPEPTYFRRLAGKLQYLTLTRPDIQFAVNFVCQKMHAPTMSDFHLLKRI 1125

Query: 1205 IRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQP 1264
            + Y+      G+     +   +  YSD+DWA C D RRST G+  FLGYN++SWSAK+ P
Sbjct: 1126 LHYLKGTMTMGINLSSNTDSVLRCYSDSDWAGCKDTRRSTGGFCTFLGYNIISWSAKRHP 1185

Query: 1265 SVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKH 1324
            +V+ SS E+EYR ++  ASE+ W+  LL E+ +     P    DN SA++++ NP  H  
Sbjct: 1186 TVSKSSTEAEYRTLSFAASEVSWIGFLLQEIGLPQQQIPEMYCDNLSAVYLSANPALHSR 1245

Query: 1325 AKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLFRSKLRVGL 1379
            +KH  +D ++VRE V+ G L V+H+P S QLADIFTK LP+  F   R KL V L
Sbjct: 1246 SKHFQVDYYYVRERVALGALTVKHIPASQQLADIFTKSLPQAPFCDLRFKLGVVL 1300


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.325    0.138    0.433 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,434,904,960
Number of Sequences: 2540612
Number of extensions: 112450920
Number of successful extensions: 870012
Number of sequences better than 10.0: 9431
Number of HSP's better than 10.0 without gapping: 3055
Number of HSP's successfully gapped in prelim test: 6999
Number of HSP's that attempted gapping in prelim test: 634233
Number of HSP's gapped (non-prelim): 73260
length of query: 1380
length of database: 863,360,394
effective HSP length: 141
effective length of query: 1239
effective length of database: 505,134,102
effective search space: 625861152378
effective search space used: 625861152378
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 82 (36.2 bits)


Lotus: description of TM0019a.6