Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC144645.3 + phase: 0 /pseudo
         (993 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAT93988.1| putative polyprotein [Oryza sativa (japonica cult...   671  0.0
gb|AAP54850.1| putative gag-pol polyprotein [Oryza sativa (japon...   661  0.0
ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (jap...   660  0.0
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja...   655  0.0
ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (jap...   624  e-177
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]         503  e-141
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   498  e-139
gb|AAP53032.1| putative copia-like retrotransposon polyprotein [...   495  e-138
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia...   488  e-136
gb|AAK51235.1| polyprotein [Arabidopsis thaliana]                     486  e-135
gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]           461  e-128
gb|AAG46116.1| putative copia-like retrotransposon polyprotein [...   458  e-127
gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsi...   450  e-125
emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|32692...   446  e-123
pir||F86470 probable retroelement polyprotein [imported] - Arabi...   437  e-120
gb|AAT39281.1| putative late blight resistance protein [Solanum ...   419  e-115
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]               414  e-114
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop...   398  e-109
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi...   397  e-108
gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cult...   389  e-106

>gb|AAT93988.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1480

 Score =  671 bits (1731), Expect = 0.0
 Identities = 371/820 (45%), Positives = 491/820 (59%), Gaps = 84/820 (10%)

Query: 20   IVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFDPF 79
            I VG+G  IP+   G +++       +L N+L AP L++NL+FVR+FT DN  + EFD F
Sbjct: 386  ITVGNGHTIPVICRGTSFLPIGTTRFALKNILVAPSLVRNLLFVRQFTRDNKCSFEFDEF 445

Query: 80   SFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPGANVLS 139
             FSV+D+ T   ++RC+S GDLY L TT    + +  +F A S TLWH+RLGHP    + 
Sbjct: 446  GFSVKDLPTRRVILRCNSRGDLYTLPTTVP--AITAHSFLAKSSTLWHHRLGHPSPAAVQ 503

Query: 140  FLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLWTSPILSSAG 199
             L+K   + C + S+  +C +C  GKH +L FS S+S+TS PF+++H D+WTSP+LS +G
Sbjct: 504  TLHKLAILSCTR-SNNKLCHACHLGKHTRLSFSKSSSSTSSPFELVHCDVWTSPVLSLSG 562

Query: 200  HKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEFNNEY 259
             KYYL  LDD+T+F WTFP+  KS V      F A++KTQF   I+CFQ DNGT+F N  
Sbjct: 563  FKYYLVVLDDFTHFCWTFPLRHKSDVHQHLLEFVAYVKTQFSLPIRCFQADNGTKFVNHA 622

Query: 260  FTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQITTY 319
             T F    G+V R SCP+TSPQNGKAER +R IN  IRT L  +SMPPS+W  AL   TY
Sbjct: 623  TTSFFASRGIVLRLSCPYTSPQNGKAERVLRTINKSIRTLLIQASMPPSYWAEALATATY 682

Query: 320  LQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCAFLGY 379
            L N  PS  + +  P Q L+++ P Y++L+VFGCLCYP   +   +KL  RS P  FLGY
Sbjct: 683  LLNRRPSTSVRNSIPYQLLHNKLPDYSNLQVFGCLCYPNLSAMTSHKLSPRSAPYVFLGY 742

Query: 380  PQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLNDSLHPLLHYHLQN 439
              +H+G++C D+S++++ ISRHV+FDE  FPFA     ++ +++FL      L  + +  
Sbjct: 743  SASHKGFRCLDISTRRLYISRHVVFDEKTFPFAAIPQDAS-SFDFL------LQGFSIAV 795

Query: 440  DPKQDEPEPR--------KIESPQP--------------------ATTP--ASPI----- 464
             P  +   PR        ++E P P                    A  P   SP+     
Sbjct: 796  APSSEVERPRFSSMTPSPEVEQPIPDDDTSGTELFQLLPGLRSSAAGRPLAGSPVDARLP 855

Query: 465  --------NVTNQSIL-----PPSPMSINQLPHPLVSTELTSP-THT--------PQQIH 502
                    N ++ S L     PP+   +   P    +T L SP  HT        P  IH
Sbjct: 856  GGCANDAANGSSSSNLSPVMDPPAASVVRPAPSEGPTTSLISPYRHTYLRRSQPAPTAIH 915

Query: 503  ----------------QEPPRTIATHSMHGIHKPKIQFNLT-TSITSSPLPHNPKAALSD 545
                            Q+   T+ T S  G  +P  +F  T T    SP+P N ++AL+D
Sbjct: 916  RPIRASRAFHSATDQQQQTGHTMVTRSQTGHLRPIQRFTYTATHDVVSPVPSNYRSALAD 975

Query: 546  SNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGR 605
             NW+AA  +E+ AL+ N TW LVPRP   N +   WIF+HK  SDG+  R+KAR V  G 
Sbjct: 976  PNWRAATANEYKALVDNNTWRLVPRPPGANVVTGKWIFKHKFHSDGTLARHKARWVVRGY 1035

Query: 606  SQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMG 665
            SQ  G+D DETFSPVVKP TI +VL+IA S+SWPIHQLDVKN FLHG L+ETVY  QP G
Sbjct: 1036 SQQHGIDYDETFSPVVKPATIHVVLSIAASRSWPIHQLDVKNAFLHGNLEETVYYQQPSG 1095

Query: 666  FRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTY 725
            F DP  P+ VCLL+KSLYGLKQAPRAWYQRFA +   +GF+ S S+ SLF+Y+ G+++ Y
Sbjct: 1096 FVDPSAPNAVCLLQKSLYGLKQAPRAWYQRFATYIRQLGFTSSASNTSLFVYKDGDNIAY 1155

Query: 726  ILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
            +LLYVDDIILTASS  L   I + L SEFAM DLG L +F
Sbjct: 1156 LLLYVDDIILTASSATLLHHITARLHSEFAMTDLGDLHFF 1195


>gb|AAP54850.1| putative gag-pol polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|37536522|ref|NP_922563.1| putative gag-pol polyprotein
            [Oryza sativa (japonica cultivar-group)]
            gi|13310887|gb|AAG13591.2| putative gag/pol polyprotein
            [Oryza sativa]
          Length = 1417

 Score =  661 bits (1706), Expect = 0.0
 Identities = 352/773 (45%), Positives = 472/773 (60%), Gaps = 34/773 (4%)

Query: 1    MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
            M+++ G L+    L  +  I VG+G ++P+     T+I      L L+NVL +P LIKNL
Sbjct: 378  MSSTPGILAHPRPLPFSSCITVGNGAKLPVTHTASTHIPTSSTDLHLHNVLVSPPLIKNL 437

Query: 61   IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
            I V++ T DNNV+IEFDP  FS++D+QT +  +RCDS GDLYPL   + H     A  A 
Sbjct: 438  ISVKKLTRDNNVSIEFDPTGFSIKDLQTQVVKLRCDSPGDLYPLRLPSPH-----ALSAT 492

Query: 121  LSPTL--WHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTT 178
             SP++  WH RLGHPG+  LS +  +   +C + S+P  C +C  G +V+LPF  S+S T
Sbjct: 493  SSPSVEHWHLRLGHPGSASLSKVLGSFDFQCNK-SAPHHCSACHVGTNVRLPFHSSSSQT 551

Query: 179  SKPFDIIHSDLWTSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKT 238
              PF ++H+D+WTSPI S++G+KYY+ FLDD+T+++WTFP+  KS+V     SF A+  T
Sbjct: 552  LFPFQLVHTDVWTSPIYSNSGYKYYVVFLDDFTHYIWTFPVRNKSEVFHTVRSFFAYAHT 611

Query: 239  QFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRT 298
            QFG  +   Q DNG E+++         +G V R SCP++S QNGKAER +R IN+++RT
Sbjct: 612  QFGLPVLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDYVRT 671

Query: 299  SLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPL 358
             L HS+ P SFW  ALQ  T+L N  P +     +P Q L    P+Y HLRVFGCLCYP 
Sbjct: 672  MLVHSAAPLSFWAEALQTATHLINRRPCRATGSLTPYQLLLGAPPTYDHLRVFGCLCYPN 731

Query: 359  FPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSS 418
              +TA +KL  RS  C F+GYP +HRGY+CYD+ S+++  SRHV F E  FPF       
Sbjct: 732  TIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPF------- 784

Query: 419  THTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVT--NQSILPPSP 476
                    D+  P         P  D  +   +  P PA    +P+     + +  PPSP
Sbjct: 785  -------RDAPSP----RPSAPPPPDHGDDTIVLLPAPAQHVVTPVGTAPAHDAASPPSP 833

Query: 477  MSINQLPHPLVSTELTSPTHTPQQ---IHQEPPR-TIATHSMHGIHKPKIQFNLTTSITS 532
             S    P         +P  +P+        PPR  + T +  GI KP  ++ +T + T 
Sbjct: 834  AS--STPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKPNPRYAMTATSTL 891

Query: 533  SPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGS 592
            SP P + +AAL D NW+AAM  EFDAL+ N+TW LVPRP     I   W+F+ K  +DGS
Sbjct: 892  SPTPSSVRAALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADGS 951

Query: 593  FERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHG 652
             ++YKAR V  G +Q  GVD  ETFSPVVKP TIR VLT+  SK WP HQLDV N FLHG
Sbjct: 952  LDKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLHG 1011

Query: 653  ELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDH 712
             LQE V   QP GF D   P  VCLL +SLYGL+QAPRAW++RFAD A ++GF  S++D 
Sbjct: 1012 HLQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRADP 1071

Query: 713  SLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
            SLF+ R+G+D  Y+LLYVDD+IL+ASS  L + I+  L +EF +KD+G L YF
Sbjct: 1072 SLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYF 1124


>ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (japonica
           cultivar-group)]
          Length = 1090

 Score =  660 bits (1704), Expect = 0.0
 Identities = 353/746 (47%), Positives = 456/746 (60%), Gaps = 47/746 (6%)

Query: 57  IKNLIFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPA 116
           ++NL+ VR+FT DN  +IEFD F FSV+D+QT   ++RC+S G+LY L   T   S++  
Sbjct: 58  VRNLLSVRQFTRDNKCSIEFDEFGFSVKDLQTRRVILRCNSRGELYTLPAATP--SSAAH 115

Query: 117 TFAALSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNS 176
              A S TLWH RLGHPG   +  L     I C +I + S+C +C  GKH +LPF  S+S
Sbjct: 116 GLLATSSTLWHCRLGHPGPAAIHGLRNIASISCNKIDT-SLCHACQLGKHTRLPFHNSSS 174

Query: 177 TTSKPFDIIHSDLWTSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFI 236
            TS PF+++H D+WTSP++S++G KYYL  LDD+++F WTF +  KS V      F  ++
Sbjct: 175 RTSVPFELVHCDVWTSPVMSTSGFKYYLVVLDDFSHFCWTFLLRLKSDVHRHIVEFVEYV 234

Query: 237 KTQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFI 296
            TQFG  +K FQ DNG EF N   T F    G   R SCP+TSPQNGKAER +R INN I
Sbjct: 235 STQFGLPLKSFQADNGREFVNTAITTFLASRGTQLRLSCPYTSPQNGKAERMLRTINNSI 294

Query: 297 RTSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCY 356
           RT L  +SMPPS+W  AL   TYL N  PS  +    P Q L+   P ++HLRVFGCLCY
Sbjct: 295 RTLLIQASMPPSYWAEALATATYLLNRRPSSSIHQSLPFQLLHRTIPDFSHLRVFGCLCY 354

Query: 357 PLFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHT 416
           P   +T  +KL  RST C FLGYP +H+GY+C DLS+ +IIISRHV+FDE+QFPFA T  
Sbjct: 355 PNLSATTPHKLSPRSTACVFLGYPTSHKGYRCLDLSTHRIIISRHVVFDESQFPFAATPP 414

Query: 417 SSTHTYEFLNDSLHPLLHYHLQNDPKQDEPEPR--------KIESP-------------- 454
           +++ +++FL   L P       + P  +  +PR        ++E P              
Sbjct: 415 AAS-SFDFLLQGLSP------ADAPSLEVEQPRPLTVAPSTEVEQPYLPLPSRRLSAGTV 467

Query: 455 ---QPATTPASPINVTNQSILPP-----------SPMSINQLPHPLVSTELTSPTHTPQQ 500
                A +  +P+  T+ +   P           SP        P+ +   +S T     
Sbjct: 468 TVASEAPSAGAPLVGTSSADATPPGSATRASTIVSPFRHVYTRRPVTTVPPSSSTAVTNA 527

Query: 501 IHQEPPRTIATHSMHGIHKPKIQFNLT-TSITSSPLPHNPKAALSDSNWKAAMLDEFDAL 559
           +    P ++ T S  G  +P  +   T T   +SP+P N  +AL+D NW+AAM DE+  L
Sbjct: 528 VAAPQPHSMVTRSQSGSLRPVDRLTYTATQAAASPVPANYHSALADPNWRAAMADEYKEL 587

Query: 560 IKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSP 619
           + N TW LV RP   N     WIF+HK  SDGS  RYKAR V  G SQ  G+D DETFSP
Sbjct: 588 VDNGTWRLVSRPPRANIATGKWIFKHKFHSDGSLARYKARWVVRGYSQQHGIDYDETFSP 647

Query: 620 VVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLK 679
           VVK  TIR+VL+IA S++WPIHQLDVKN FLHG L+ETVY  QP GF DP  PD VCLL+
Sbjct: 648 VVKLATIRVVLSIAASRAWPIHQLDVKNAFLHGHLKETVYCQQPSGFVDPTAPDAVCLLQ 707

Query: 680 KSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASS 739
           KSLYGLKQAPRAWYQRFA +   +GF  S SD SLF+Y+ G+ + Y+LLYVDDIILTAS+
Sbjct: 708 KSLYGLKQAPRAWYQRFATYIRQMGFMPSASDTSLFVYKDGDRIAYLLLYVDDIILTAST 767

Query: 740 DVLRRSIMSLLASEFAMKDLGTLSYF 765
             L + + + L SEFAM DLG L +F
Sbjct: 768 TTLLQQLTARLHSEFAMTDLGDLHFF 793


>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|37530764|ref|NP_919684.1| putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)]
          Length = 1803

 Score =  655 bits (1690), Expect = 0.0
 Identities = 350/773 (45%), Positives = 468/773 (60%), Gaps = 34/773 (4%)

Query: 1    MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
            M+++ G L+    L  +  I VG+G ++P+     T+I      L L+NVL +P LIKNL
Sbjct: 378  MSSTPGILAHPRPLPFSSCITVGNGAKLPVTHTASTHIPTSSTDLHLHNVLVSPPLIKNL 437

Query: 61   IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
            I V++ T DNNV+IEFDP  FS++D+QT +  +RCDS GDLYPL   + H     A  A 
Sbjct: 438  ISVKKLTRDNNVSIEFDPTGFSIKDLQTQVVKLRCDSPGDLYPLRLPSPH-----ALSAT 492

Query: 121  LSPTL--WHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTT 178
             SP++  WH RLGHPG+  LS +  +   +C + S+P  C +C  G +V+LPF  S+S T
Sbjct: 493  SSPSVEHWHLRLGHPGSASLSKVLGSFDFQCNK-SAPHHCSACHVGTNVRLPFHSSSSQT 551

Query: 179  SKPFDIIHSDLWTSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKT 238
              PF ++H+D+WTSPI S++G+KYY+ FLDD+T+++WTFP+  KS+V     SF A+  T
Sbjct: 552  LFPFQLVHTDVWTSPIYSNSGYKYYVVFLDDFTHYIWTFPVRNKSEVFHTVRSFFAYAHT 611

Query: 239  QFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRT 298
            QFG  +   Q DNG E+++         +G V R SCP++S QNGKAER +R IN+ +RT
Sbjct: 612  QFGLPVLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDCVRT 671

Query: 299  SLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPL 358
             L HS+ P SFW  ALQ   +L N  P +      P Q L    P+Y HLRVFGCLCYP 
Sbjct: 672  MLVHSAAPLSFWAEALQTAMHLINRRPCRATGSLKPYQLLLGAPPTYDHLRVFGCLCYPN 731

Query: 359  FPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSS 418
              +TA +KL  RS  C F+GYP +HRGY+CYD+ S+++  SRHV F E  FPF       
Sbjct: 732  TIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPF------- 784

Query: 419  THTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVT--NQSILPPSP 476
                    D+  P         P  D  +   +  P PA    +P+     + +  PPSP
Sbjct: 785  -------RDAPSP----RPSAPPPPDHGDDTIVLLPAPAQHVVTPVGTAPAHDAASPPSP 833

Query: 477  MSINQLPHPLVSTELTSPTHTPQQ---IHQEPPR-TIATHSMHGIHKPKIQFNLTTSITS 532
             S    P         +P  +P+        PPR  + T +  GI KP  ++ +T + T 
Sbjct: 834  AS--STPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKPNPRYAMTATSTL 891

Query: 533  SPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGS 592
            SP P + + AL D NW+AAM  EFDAL+ N+TW LVPRP     I   W+F+ K  +DGS
Sbjct: 892  SPTPSSVRVALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADGS 951

Query: 593  FERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHG 652
             ++YKAR V  G +Q  GVD  ETFSPVVKP TIR VLT+  SK WP HQLDV N FLHG
Sbjct: 952  LDKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLHG 1011

Query: 653  ELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDH 712
             LQE V   QP GF D   P  VCLL +SLYGL+QAPRAW++RFAD A ++GF  S++D 
Sbjct: 1012 HLQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRADP 1071

Query: 713  SLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
            SLF+ R+G+D  Y+LLYVDD+IL+ASS  L + I+  L +EF +KD+G L YF
Sbjct: 1072 SLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYF 1124


>ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1373

 Score =  624 bits (1610), Expect = e-177
 Identities = 344/748 (45%), Positives = 443/748 (58%), Gaps = 74/748 (9%)

Query: 85   DIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPGANVLSFLNKN 144
            D++T   + RC+S+GDLYP        +TS     A   +LWH RLGH G   LS L ++
Sbjct: 355  DLETRNVIARCNSSGDLYPFYPP----ATSTHALLAAPTSLWHRRLGHLGREALSKLIRS 410

Query: 145  KFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLWTSPILSSAGHKYYL 204
              I C +   P +C +C  G H +LPFS S+S  S  FD+IH DLWTSPI+S +G+KYYL
Sbjct: 411  SVISCTKDDLPHLCHACQLGHHTRLPFSSSSSRASNNFDLIHCDLWTSPIVSVSGYKYYL 470

Query: 205  FFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEFNNEYFTQFC 264
              LDD ++++WTFP+  KS   S  ++F A +KTQFGTTIK  QCDNG EF+N     F 
Sbjct: 471  VILDDCSHYIWTFPLRLKSDTFSTIANFFAHVKTQFGTTIKSVQCDNGREFDNSPARTFF 530

Query: 265  HKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQITTYLQNIL 324
              +G+ FR SCP+TS QNG+AER +R +NN +R+ L  + +PP +W  AL   T L N +
Sbjct: 531  LSHGVAFRMSCPYTSQQNGRAERSLRTLNNILRSLLFQACLPPVYWVEALHTATLLVNRI 590

Query: 325  PSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCAFLGYPQNHR 384
            P+K LS  +P  +LY   P+Y HLRVFGC CYP   STA +KL  RS+ C FLGY   H+
Sbjct: 591  PTKTLSSSTPYFHLYSTQPTYDHLRVFGCACYPNMSSTAPHKLAPRSSLCVFLGYSSEHK 650

Query: 385  GYKCYDLSSKKIIISRHVIFDETQFPFAKTHTS---STHTYEFLNDS-------LHPLLH 434
            GY+C +L S +II SRHV+FDE+ FPFA   TS   S+    FL+D+           +H
Sbjct: 651  GYRCLELGSNRIITSRHVVFDESFFPFADMSTSPMASSALDIFLDDNELTAQPPRAKFVH 710

Query: 435  YHLQN------DPKQDEPEPRKIESPQPAT-----------TPA----------SPINVT 467
                +      +P    P P  I    PAT           +PA          SP    
Sbjct: 711  AGTSSAARGAVEPSTPPPAPSSIGPRSPATLAGPEAGPHGGSPAGAATSQPGAISPARTA 770

Query: 468  NQSILPPSPMSINQLPHPLVSTELTSPTHTPQQIHQEPP--------------RTIAT-- 511
              S    +  ++   P    +T  T+P+ +P      PP              RT+AT  
Sbjct: 771  APSAATSTTRAVTSAPR--AATSGTTPSLSPLAGTAAPPPRAEVAASSTAATGRTLATRP 828

Query: 512  ---------HSMH-----GIHKPKIQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFD 557
                     HSM      G+ +P  + NL  +   SP+P + + ALSD NW+AAM  EFD
Sbjct: 829  VSIAPVDNAHSMRTRGKAGMAQPVDRLNLHAA-PLSPVPRSVREALSDPNWRAAMQAEFD 887

Query: 558  ALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETF 617
            AL+ N TW LVPRP+ VN +   WIFRHK  SDGS +RYKAR V  G +Q  GVD DETF
Sbjct: 888  ALLANDTWSLVPRPRGVNLVTGKWIFRHKLHSDGSLDRYKARWVLRGFTQRPGVDYDETF 947

Query: 618  SPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCL 677
            SPVVKP T+R+VL++ALS+ WPIHQLDVKN FLHG L ETVY  QP GF DP H D VC 
Sbjct: 948  SPVVKPATVRVVLSLALSQDWPIHQLDVKNAFLHGTLSETVYCIQPTGFADPSHADLVCR 1007

Query: 678  LKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTA 737
            L KSLYGLKQAPRAW+ RFA    ++GF  ++SD SLFI+R+GND   +LLYVDDI+LTA
Sbjct: 1008 LNKSLYGLKQAPRAWHHRFASHLISLGFIEAQSDSSLFIHRRGNDTVLLLLYVDDIVLTA 1067

Query: 738  SSDVLRRSIMSLLASEFAMKDLGTLSYF 765
            SS  L + +++ L  EFAM D+G L +F
Sbjct: 1068 SSASLLQQVIAALQREFAMTDMGPLHHF 1095


>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  503 bits (1296), Expect = e-141
 Identities = 293/777 (37%), Positives = 406/777 (51%), Gaps = 15/777 (1%)

Query: 1    MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
            +TAS   L   +    N  ++VG G  +PI   G T IS     + LN VL  P + K+L
Sbjct: 334  ITASTSGLQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVLVCPAIQKSL 393

Query: 61   IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
            + V +   D    + FD     + D+ T   + +      LY L  +      S    AA
Sbjct: 394  LSVSKLCDDYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQCAA 453

Query: 121  LSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSK 180
               T WH+RLGH  + +L  L   K I+  +  +  +C+ C  GK  +L F  S+    K
Sbjct: 454  SMET-WHHRLGHSNSKILQQLLTRKEIQVNKSRTSPVCEPCQMGKSTRLQFFSSDFRALK 512

Query: 181  PFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQ 239
            P D +H DLW  SP++S+ G KYY  F+DD++ F W FP+  KS+  S+F ++   ++ Q
Sbjct: 513  PLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLRMKSKFISVFIAYQKLVENQ 572

Query: 240  FGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTS 299
             GT IK FQ D G EF +    +   ++G+  R SCP+T  QNG AERK R +     + 
Sbjct: 573  LGTKIKEFQSDGGGEFTSNKLKEHFREHGIHHRISCPYTPQQNGVAERKHRHLVELGLSM 632

Query: 300  LAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLF 359
            L HS  P  FW  A     YL N+LPS +L   SP + L+ +   YT LRVFG  CYP  
Sbjct: 633  LYHSHTPLKFWVEAFFTANYLSNLLPSSVLKEISPYETLFQQKVDYTPLRVFGTACYPCL 692

Query: 360  PSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSST 419
               A NK   RS  C FLGY   ++GY+C    + K+ ISRHVIFDE QFPF + + S  
Sbjct: 693  RPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPPTGKVYISRHVIFDEAQFPFKEKYHSLV 752

Query: 420  HTYE------FLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATT--PASPINVTNQSI 471
              Y+      + +  L P      Q  P   +  P      QP         +NV  ++ 
Sbjct: 753  PKYQTTLLQAWQHTDLTPPSVPSSQLQPLARQMTPMATSENQPMMNYETEEAVNVNMETS 812

Query: 472  LPPSPMSINQLPH---PLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNLTT 528
                  S ++  H   P+++ +  +  +   Q   E    + T S  GI KP  ++ L  
Sbjct: 813  SDEETESNDEFDHEVAPVLNDQ--NEDNALGQGSLENLHPMITRSKDGIQKPNPRYALIV 870

Query: 529  SITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKK 588
            S +S   P     A+   +W AA++DE D +    TW LVP  +++N + S W+F+ K K
Sbjct: 871  SKSSFDEPKTITTAMKHPSWNAAVMDEIDRIHMLNTWSLVPATEDMNILTSKWVFKTKLK 930

Query: 589  SDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNV 648
             DG+ ++ KARLV  G  Q  GVD  ETFSPVV+  TIR+VL  A +  WP+ QLDV N 
Sbjct: 931  PDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDTATANEWPLKQLDVSNA 990

Query: 649  FLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHS 708
            FLHGELQE V+M QP GF DP  P++VC L K+LYGLKQAPRAW+  F++F    GF  S
Sbjct: 991  FLHGELQEPVFMFQPSGFVDPNKPNHVCRLTKALYGLKQAPRAWFDTFSNFLLDFGFECS 1050

Query: 709  KSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
             SD SLF+  +      +LLYVDDI+LT S  +L   ++  L + F+MKDLG   YF
Sbjct: 1051 TSDPSLFVCHQNGQSLILLLYVDDILLTGSDQLLMDKLLQALNNRFSMKDLGPPRYF 1107


>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078
            gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
            Arabidopsis thaliana
          Length = 1415

 Score =  498 bits (1283), Expect = e-139
 Identities = 289/784 (36%), Positives = 410/784 (51%), Gaps = 45/784 (5%)

Query: 1    MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
            +T+S   L + +    +  ++VG G  +PI   G T I      + LN VL  P + K+L
Sbjct: 332  VTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKSSNGKIPLNEVLVVPNIQKSL 391

Query: 61   IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
            + V +   D    + FD     + D+QT   +        LY L         S    AA
Sbjct: 392  LSVSKLCDDYPCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYVLENQEFVALYSNRQCAA 451

Query: 121  LSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSK 180
             +  +WH+RLGH  +  L  L  +K I+  +  +  +C+ C  GK  +LPF IS+S    
Sbjct: 452  -TEEVWHHRLGHANSKALQHLQNSKAIQINKSRTSPVCEPCQMGKSSRLPFLISDSRVLH 510

Query: 181  PFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQ 239
            P D IH DLW  SP++S+ G KYY  F+DDY+ + W +P+  KS+  S+F SF   ++ Q
Sbjct: 511  PLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYSWFYPLHNKSEFLSVFISFQKLVENQ 570

Query: 240  FGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTS 299
              T IK FQ D G EF +        ++G+  R SCP+T  QNG AERK R +     + 
Sbjct: 571  LNTKIKVFQSDGGGEFVSNKLKTHLSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSM 630

Query: 300  LAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLF 359
            L HS  P  FW  +     Y+ N LPS +L + SP + L+   P Y+ LRVFG  CYP  
Sbjct: 631  LFHSHTPQKFWVESFFTANYIINRLPSSVLKNLSPYEALFGEKPDYSSLRVFGSACYPCL 690

Query: 360  PSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSST 419
               A NK   RS  C FLGY   ++GY+C+   + K+ ISR+VIF+E++ PF + + S  
Sbjct: 691  RPLAQNKFDPRSLQCVFLGYNSQYKGYRCFYPPTGKVYISRNVIFNESELPFKEKYQSLV 750

Query: 420  HTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVTNQSILPPSPMSI 479
              Y        PLL     N   +              + PA+P+ + ++      P+ +
Sbjct: 751  PQYST------PLLQAWQHNKISE-------------ISVPAAPVQLFSK------PIDL 785

Query: 480  NQLPHPLVSTELTSPTHTP-------------QQIHQEPPRTIATHSMH-----GIHKPK 521
            N      V+ +LT P  T              ++I     + I +H+M      GI KP 
Sbjct: 786  NTYAGSQVTEQLTDPEPTSNNEGSDEEVNPVAEEIAANQEQVINSHAMTTRSKAGIQKPN 845

Query: 522  IQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMW 581
             ++ L TS  ++  P    +A+    W  A+ +E + +    TW LVP   ++N + S W
Sbjct: 846  TRYALITSRMNTAEPKTLASAMKHPGWNEAVHEEINRVHMLHTWSLVPPTDDMNILSSKW 905

Query: 582  IFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIH 641
            +F+ K   DGS ++ KARLV  G  Q  GVD  ETFSPVV+  TIR+VL ++ SK WPI 
Sbjct: 906  VFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDVSTSKGWPIK 965

Query: 642  QLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAF 701
            QLDV N FLHGELQE V+M+QP GF DP  P +VC L K++YGLKQAPRAW+  F++F  
Sbjct: 966  QLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTHVCRLTKAIYGLKQAPRAWFDTFSNFLL 1025

Query: 702  TIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGT 761
              GF  SKSD SLF+  +   + Y+LLYVDDI+LT S   L   ++  L + F+MKDLG 
Sbjct: 1026 DYGFVCSKSDPSLFVCHQDGKILYLLLYVDDILLTGSDQSLLEDLLQALKNRFSMKDLGP 1085

Query: 762  LSYF 765
              YF
Sbjct: 1086 PRYF 1089


>gb|AAP53032.1| putative copia-like retrotransposon polyprotein [Oryza sativa
           (japonica cultivar-group)] gi|37532886|ref|NP_920745.1|
           putative copia-like retrotransposon polyprotein [Oryza
           sativa (japonica cultivar-group)]
           gi|22655750|gb|AAN04167.1| Putative copia-like
           retrotransposon polyprotein [Oryza sativa (japonica
           cultivar-group)]
          Length = 1042

 Score =  495 bits (1275), Expect = e-138
 Identities = 293/785 (37%), Positives = 416/785 (52%), Gaps = 36/785 (4%)

Query: 1   MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
           +T+    L+T      +  I   SG  + I+  G   +  P  PL LNNVLH P+  KNL
Sbjct: 229 ITSQLEKLNTREVYKGHDQIHTASGAGMKIKHIGHAIVHTPTRPLHLNNVLHVPQAAKNL 288

Query: 61  IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
           I   +   DN+V +E     F ++D  T   +++      LYPL +T+S   T  A   A
Sbjct: 289 ISATKLASDNSVFVEIHSKYFLIKDRTTRSTVLKGPRRHGLYPLPSTSS---TKQAFAVA 345

Query: 121 LSPTLWHNRLGHPGAN-VLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTS 179
            S   WH+RLGHP    V+  ++ NK    ++ +  S+C +C   K  +LP+S S S ++
Sbjct: 346 PSLERWHSRLGHPSIPIVMKVISSNKLPCLRESNKESVCDACQKAKSHQLPYSNSMSVSN 405

Query: 180 KPFDIIHSDLWTSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQ 239
           KP ++I+SD+W     S  G K+Y+ F+D Y  F W + +  KS V   F  F   ++  
Sbjct: 406 KPLELIYSDVWGPASTSFGGKKFYVSFIDSYRKFSWIYFLKHKSDVFEKFHDFQQLVERL 465

Query: 240 FGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTS 299
           F   I   Q D G E+       F  K G+    SCPHT  QNG AERK R I       
Sbjct: 466 FDRKIIAMQTDWGGEYQK--LNSFFEKIGISHHVSCPHTHQQNGSAERKHRLIVEVGLAL 523

Query: 300 LAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLF 359
           LA++SMP  +W  A    T++ N +PS+IL + +P + L++    Y+  R+FGC C+P  
Sbjct: 524 LAYASMPLKYWDEAFLAATHIINRIPSRILQYDTPLECLFNHKLDYSSFRIFGCACWPNL 583

Query: 360 PSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSS- 418
                +KLQ RS  C FLG    H GYKC D+++ +I I R V+FDE  FP +K H+++ 
Sbjct: 584 RPYNAHKLQFRSMQCVFLGPSHTHNGYKCLDIATGRIYICRDVVFDENVFPLSKFHSNAG 643

Query: 419 ---------------THTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASP 463
                          +HT     +  + +L ++  +  + DE      +     TT  + 
Sbjct: 644 SRLRSEIALLPSHLLSHTSHQGGEHNNHMLDFYNVSSDQTDE----NADIDGGNTTDTTN 699

Query: 464 INVTNQSILPPSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQ 523
            ++ NQ  L     S+ Q  H     E  +     Q +    PRT       GI K K+ 
Sbjct: 700 DDLGNQ--LHELRSSVMQDMH--FGGEAATHATEDQSMVAAKPRT---RLQSGIRKEKVY 752

Query: 524 FNLTTS---ITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSM 580
            + T      TSS  P N   AL+D NWK AM  E+ AL+KNKTW LVP   + N I   
Sbjct: 753 TDGTVKYSCFTSSGEPQNLHEALNDKNWKHAMDSEYTALMKNKTWHLVPAKSDRNVIDCK 812

Query: 581 WIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPI 640
           W+++ K+K+DGS +RYKARLV  G  Q  G+D ++TFSPVVK  TIR++L+IA+S+ W +
Sbjct: 813 WVYKIKRKADGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRVILSIAVSRGWSL 872

Query: 641 HQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFA 700
            QLDV N FLHG L+E VYM QP+G+     P++VC L K+LYGLKQAPR WY R +   
Sbjct: 873 RQLDVSNAFLHGILEEEVYMRQPLGYEVSSLPNHVCKLDKALYGLKQAPRVWYSRLSTKL 932

Query: 701 FTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLG 760
             +GF  SK+D SLF Y KG    ++L+YVDDI + +S      +++  L  EFA+KDLG
Sbjct: 933 QELGFQASKADTSLFFYNKGVVSMFVLVYVDDIFVASSMQSATAALLQDLNKEFALKDLG 992

Query: 761 TLSYF 765
            L YF
Sbjct: 993 DLHYF 997


>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
            gi|4539447|emb|CAB40035.1| retrotransposon like protein
            [Arabidopsis thaliana] gi|7444419|pir||T04204
            hypothetical protein T4F9.150 - Arabidopsis thaliana
          Length = 1515

 Score =  488 bits (1256), Expect = e-136
 Identities = 293/815 (35%), Positives = 419/815 (50%), Gaps = 70/815 (8%)

Query: 10   TYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTID 69
            TYS    + +++VG+G  +PI   G   ++     L L +VL  P + K+L+ V + T D
Sbjct: 346  TYSG---DDSVIVGNGDFLPITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVSKLTDD 402

Query: 70   NNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNR 129
               +  FD  S  ++D +T   L + +    LY L      Q+       +    +WH R
Sbjct: 403  YPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDVP-FQTYYSTRQQSSDDEVWHQR 461

Query: 130  LGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDL 189
            LGHP   VL  L K K I   + SS ++C++C  GK  +LPF  S   +S+P + IH DL
Sbjct: 462  LGHPNKEVLQHLIKTKAIVVNKTSS-NMCEACQMGKVCRLPFVASEFVSSRPLERIHCDL 520

Query: 190  W-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQ 248
            W  +P+ S+ G +YY+ F+D+Y+ F W +P+  KS   S+F  F   ++ Q+   I  FQ
Sbjct: 521  WGPAPVTSAQGFQYYVIFIDNYSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQ 580

Query: 249  CDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPS 308
            CD G EF +  F       G+    SCPHT  QNG AER+ R +     + + HS +P  
Sbjct: 581  CDGGGEFVSYKFVAHLASCGIKQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHK 640

Query: 309  FWHHALQITTYLQNILPSKILSHH-SPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKL 367
             W  A   + +L N+LPS  LS + SP + L+   P YT LRVFG  CYP     A NK 
Sbjct: 641  LWVEAFFTSNFLSNLLPSSTLSDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKF 700

Query: 368  QARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLND 427
              +S  C FLGY   ++GY+C    + K+ I RHV+FDE +FP++  ++      +F   
Sbjct: 701  DPKSLLCVFLGYNNKYKGYRCLHPPTGKVYICRHVLFDERKFPYSDIYS------QFQTI 754

Query: 428  SLHPLL---HYHLQNDPKQDEPEPRKIES--------------------PQPATTPASPI 464
            S  PL         +     E     +E                      + AT P   +
Sbjct: 755  SGSPLFTAWQKGFSSTALSRETPSTNVEDIIFPSATVSSSVPTGCAPNIAETATAPDVDV 814

Query: 465  NVTNQSILPPSPMSINQLP-HPLVSTE-----------LTSPTHTPQQIH---------- 502
               +  ++PPSP++   LP  P  ST              S   TPQ I+          
Sbjct: 815  AAAHDMVVPPSPITSTSLPTQPEESTSDQNHYSTDSETAISSAMTPQSINVSLFEDSDFP 874

Query: 503  ------------QEPPRTIATHSMHGIHKPKIQFNLTTSITSSPLPHNPKAALSDSNWKA 550
                         E    + T +  GI KP  ++ L +  ++ P P + K AL D  W  
Sbjct: 875  PLQSVISSTTAAPETSHPMITRAKSGITKPNPKYALFSVKSNYPEPKSVKEALKDEGWTN 934

Query: 551  AMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VG 610
            AM +E   + +  TW+LVP       +   W+F+ K  SDGS +R KARLV  G  Q  G
Sbjct: 935  AMGEEMGTMHETDTWDLVPPEMVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEG 994

Query: 611  VDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFRDPI 670
            VD  ET+SPVV+  T+R +L +A    W + QLDVKN FLH EL+ETV+M QP GF DP 
Sbjct: 995  VDYVETYSPVVRSATVRSILHVATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFEDPS 1054

Query: 671  HPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYV 730
             PDYVC LKK++Y LKQAPRAW+ +F+ +    GF  S SD SLF+Y KG D+ ++LLYV
Sbjct: 1055 RPDYVCKLKKAIYDLKQAPRAWFDKFSSYLLKYGFICSFSDPSLFVYLKGRDVMFLLLYV 1114

Query: 731  DDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
            DD+ILT ++DVL + ++++L++EF MKD+G L YF
Sbjct: 1115 DDMILTGNNDVLLQQLLNILSTEFRMKDMGALHYF 1149


>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  486 bits (1252), Expect = e-135
 Identities = 300/788 (38%), Positives = 405/788 (51%), Gaps = 30/788 (3%)

Query: 1    MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
            +T+S   L   S  + +  ++VG G  +PI   G T IS     L LN VL  P + K+L
Sbjct: 335  VTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGSTTISSDSGTLPLNEVLVCPDIQKSL 394

Query: 61   IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
            + V +   D    + FD     + DI T   + +   +  LY L         S    AA
Sbjct: 395  LSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKGPRSNGLYVLENQEFVAFYSNRQCAA 454

Query: 121  LSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSK 180
             S  +WH+RLGH  + +L  L  +K I   +     +C+ C  GK  KL F  SNS    
Sbjct: 455  -SEEIWHHRLGHSNSRILQQLKSSKEISFNKSRMSPVCEPCQMGKSSKLQFFSSNSRELD 513

Query: 181  PFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQ 239
                IH DLW  SP++S  G KYY+ F+DDY+ + W +P+  KS   ++F +F   ++ Q
Sbjct: 514  LLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYSRYSWFYPLKAKSDFFAVFVAFQNLVENQ 573

Query: 240  FGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTS 299
            F T IK FQ D G EF +    +     G+  R SCP+T  QNG AERK R       + 
Sbjct: 574  FNTKIKVFQSDGGGEFTSNLMKKHLTDCGIQHRISCPYTPQQNGIAERKHRHFVELGLSM 633

Query: 300  LAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLF 359
            + HS  P  FW  A    ++L N+LPS  L + SP + L  + P+Y  LRVFG  CYP  
Sbjct: 634  MFHSHTPLQFWVEAFFTASFLSNMLPSPSLGNVSPLEALLKQKPNYAMLRVFGTACYPCL 693

Query: 360  PSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSST 419
                 +K + RS  C FLGY   ++GY+C    + ++ ISRHVIFDE  FPF + +    
Sbjct: 694  RPLGEHKFEPRSLQCVFLGYNSQYKGYRCLYPPTGRVYISRHVIFDEETFPFKQKYQFLV 753

Query: 420  HTYEFLNDSLHPLLHYHLQNDPKQDEP-----EPRKIES-PQPATTPASPIN--VTNQSI 471
              YE        LL     + P+ D+      E  KIES  +P +   + I    T  +I
Sbjct: 754  PQYE------SSLLSAWQSSIPQADQSLIPQAEEGKIESLAKPPSIQKNTIQDTTTQPAI 807

Query: 472  LPPSPMSINQLPHPLVSTE---LTSPTHTP---------QQIHQEPPRT--IATHSMHGI 517
            L    ++  +       TE   L   THT          +++ QEP  T  + T S  GI
Sbjct: 808  LTEGVLNEEEEEDSFEETETESLNEETHTQNDEAEVTVEEEVQQEPENTHPMTTRSKAGI 867

Query: 518  HKPKIQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFI 577
            HK   ++ L TS  S   P +   AL+   W  A+ DE   +    TW LV   +++N +
Sbjct: 868  HKSNTRYALLTSKFSVEEPKSIDEALNHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNIL 927

Query: 578  RSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKS 637
               W+F+ K K DGS ++ KARLV  G  Q  G+D  ETFSPVV+  TIR+VL +A +K 
Sbjct: 928  GCRWVFKTKLKPDGSVDKLKARLVAKGFHQEEGLDYLETFSPVVRTATIRLVLDVATAKG 987

Query: 638  WPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFA 697
            W I QLDV N FLHGEL+E VYM QP GF D   P YVC L K+LYGLKQAPRAW+   +
Sbjct: 988  WNIKQLDVSNAFLHGELKEPVYMLQPPGFVDQEKPSYVCRLTKALYGLKQAPRAWFDTIS 1047

Query: 698  DFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMK 757
            ++    GFS SKSD SLF Y K      +LLYVDDI+LT S   L + ++  L   F+MK
Sbjct: 1048 NYLLDFGFSCSKSDPSLFTYHKNGKTLVLLLYVDDILLTGSDHNLLQELLMSLNKRFSMK 1107

Query: 758  DLGTLSYF 765
            DLG  SYF
Sbjct: 1108 DLGAPSYF 1115


>gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]
          Length = 1212

 Score =  461 bits (1186), Expect = e-128
 Identities = 293/781 (37%), Positives = 409/781 (51%), Gaps = 70/781 (8%)

Query: 1    MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
            MT S   L           I + +G  +PI   G   I+P +      NV  +PKL  +L
Sbjct: 330  MTNSTSILKNVRKYQGPSQIQIANGSNLPITKVGD--ITPTF-----KNVFVSPKLSTSL 382

Query: 61   IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
            I V +  +DNN  + F      V+D  +G  + +    G L+P+  +     +   T  A
Sbjct: 383  ISVGQL-VDNNCDVNFSRNGCLVQDQVSGTIIAKGPKVGRLFPIHFSIPPVLSFACTSTA 441

Query: 121  LSPTLWHNRLGHPGANVLSFL-------NKNKFIECKQISSPSI-CQSCIYGKHVKLPFS 172
                +WH RLGHP + VLS +       NKNKF      S  SI C +C  GK   LPF 
Sbjct: 442  SKTEVWHKRLGHPNSVVLSHISNSGLLGNKNKF------SVASIDCSTCKLGKSKTLPFP 495

Query: 173  ISNSTTSKPFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSS 231
               S  +K FD+IHSD+W  SPI+S A  KY++ F+DDY+ F W + +  KS+V S+F +
Sbjct: 496  NFGSRATKCFDVIHSDVWGISPIISHAHFKYFMTFIDDYSRFTWVYFLRSKSEVFSMFKT 555

Query: 232  FHAFIKTQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRA 291
            F A+I+TQF T IK  + D+G E+ +  F +F    G+V + SCP+T  QNG AERK R 
Sbjct: 556  FLAYIETQFSTCIKLLRSDSGGEYMSYEFKKFLLDKGIVSQHSCPYTPQQNGVAERKNRH 615

Query: 292  INNFIRTSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVF 351
            + +  RT L  SS+P  +W  AL    YL N LPSK+L+  SP   LYH++P+Y+    F
Sbjct: 616  LLDVTRTLLIESSVPSKYWVEALSTAVYLINRLPSKVLNLESPYFRLYHQNPNYSDFHTF 675

Query: 352  GCLCYPLFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPF 411
            GC+C+   P +  NKL  +ST CAF+GY  + +G+ CYD  S K  ISR+V+F E Q+ F
Sbjct: 676  GCVCFVHLPPSQCNKLSVQSTKCAFMGYSTSQKGFICYDPCSHKFRISRNVVFFENQYFF 735

Query: 412  AKTHTSSTHTYEFLNDSLHPLL--HYHLQNDPKQDEP----EPRKIESPQPATTPASPIN 465
                  S         S+ PLL     L +  K+ +P    E R+   P P T P     
Sbjct: 736  PTIVDLS---------SVSPLLPTFEDLSSSFKRFKPGFVYERRRPTLPYPNTDP----- 781

Query: 466  VTNQSILPPSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFN 525
                   PP          P + +E  S    P +  +   R   T + +G         
Sbjct: 782  -------PPETA-------PQLESE-NSSRSGPLEPTRRSTRVSRTPNWYG--------- 817

Query: 526  LTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRH 585
             ++++++  +P     A     W+ AM +E  AL +N TW++V  P NV  I   W++  
Sbjct: 818  FSSTLSNISVPSCYSQASKHECWQKAMEEELLALKENDTWDIVSCPSNVRPIGCKWVYSI 877

Query: 586  KKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDV 645
            K  SDG+ +RYKARLV  G  Q  GVD +ETF+PV K TT+R ++ IA S++W ++Q DV
Sbjct: 878  KLHSDGTLDRYKARLVVLGNRQEYGVDYEETFAPVAKMTTVRTIIAIAASQNWSLYQKDV 937

Query: 646  KNVFLHGELQETVYMHQPMG-FRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIG 704
            KN FLHG+L+E +YM  P   F  P     VC LK+SLYGLKQAPRAW+ +F        
Sbjct: 938  KNAFLHGDLKEDIYMKPPPDLFSSPTSD--VCKLKRSLYGLKQAPRAWFDKFRSTLLQFS 995

Query: 705  FSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSY 764
            F  SK D SLF+ +       +L+YVDDII+T +   L   +   L   F MKDLGTL+Y
Sbjct: 996  FELSKYDSSLFLRKTSTSCVLLLVYVDDIIITGTDSSLITCLQQQLKDSFHMKDLGTLTY 1055

Query: 765  F 765
            F
Sbjct: 1056 F 1056


>gb|AAG46116.1| putative copia-like retrotransposon polyprotein [Oryza sativa]
          Length = 1302

 Score =  458 bits (1179), Expect = e-127
 Identities = 244/534 (45%), Positives = 318/534 (58%), Gaps = 26/534 (4%)

Query: 238  TQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIR 297
            ++FG  +   Q DNG E+++         +G V R SCP++S QNGKAER +R IN+++R
Sbjct: 513  SKFGLPVLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDYVR 572

Query: 298  TSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYP 357
            T L HS+ P SFW  ALQ  T+L N  P +     +P Q L    P+Y HLRVFGCLCYP
Sbjct: 573  TMLVHSAAPLSFWAEALQTATHLINRRPCRATGSLTPYQLLLGAPPTYDHLRVFGCLCYP 632

Query: 358  LFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTS 417
               +TA +KL  RS  C F+GYP +HRGY+CYD+ S+++  SRHV F E  FPF      
Sbjct: 633  NTIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPF------ 686

Query: 418  STHTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVT--NQSILPPS 475
                     D+  P         P  D  +   +  P PA    +P+     + +  PPS
Sbjct: 687  --------RDAPSP----RPSAPPPPDHGDDTIVLLPAPAQHVVTPVGTAPAHDAASPPS 734

Query: 476  PMSINQLPHPLVSTELTSPTHTPQQ---IHQEPPR-TIATHSMHGIHKPKIQFNLTTSIT 531
            P S    P         +P  +P+        PPR  + T +  GI KP  ++ +T + T
Sbjct: 735  PAS--STPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKPNPRYAMTATST 792

Query: 532  SSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDG 591
             SP P + +AAL D NW+AAM  EFDAL+ N+TW LVPRP     I   W+F+ K  +DG
Sbjct: 793  LSPTPSSVRAALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADG 852

Query: 592  SFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLH 651
            S ++YKAR V  G +Q  GVD  ETFSPVVKP TIR VLT+  SK WP HQLDV N FLH
Sbjct: 853  SLDKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLH 912

Query: 652  GELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSD 711
            G LQE V   QP GF D   P  VCLL +SLYGL+QAPRAW++RFAD A ++GF  S++D
Sbjct: 913  GHLQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRAD 972

Query: 712  HSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
             SLF+ R+G+D  Y+LLYVDD+IL+ASS  L + I+  L +EF +KD+G L YF
Sbjct: 973  PSLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYF 1026



 Score =  115 bits (289), Expect = 6e-24
 Identities = 64/141 (45%), Positives = 88/141 (62%), Gaps = 7/141 (4%)

Query: 1   MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
           M+++ G L+    L  +  I VG+G ++P+     T+I      L L+NVL +P LIKNL
Sbjct: 378 MSSTPGILAHPRPLPFSSCITVGNGAKLPVTHTASTHIPTSSTDLHLHNVLVSPPLIKNL 437

Query: 61  IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
           I V++ T DNNV+IEFDP  FS++D+QT +  +RCDS GDLYPL   + H     A  A 
Sbjct: 438 ISVKKLTRDNNVSIEFDPTGFSIKDLQTQVVKLRCDSPGDLYPLRLPSPH-----ALSAT 492

Query: 121 LSPTL--WHNRLGHPGANVLS 139
            SP++  WH RLGHPG+  LS
Sbjct: 493 SSPSVEHWHLRLGHPGSASLS 513


>gb|AAC61290.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301708|pir||B84523 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1149

 Score =  450 bits (1158), Expect = e-125
 Identities = 278/764 (36%), Positives = 392/764 (50%), Gaps = 35/764 (4%)

Query: 17   NKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEF 76
            N  ++   G  +PI   G   +      L L +VL  P + K+L+ V + T D   +  F
Sbjct: 340  NDTVMASDGNFLPITHIGSANLPSTSGNLPLKDVLVCPNIAKSLLSVSKLTKDYPCSFTF 399

Query: 77   DPFSFSVEDIQTGIPLMRCDSTGD-LYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPGA 135
            D     V+D  T   L +  ST + LY L         S     A +  +WH RLGHP  
Sbjct: 400  DADGVLVKDKATCKVLTKGSSTSEGLYKLENPKFQMFYSTRQVKA-TDEVWHMRLGHPNP 458

Query: 136  NVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLW-TSPI 194
             VL  L   K I+  + S+  +C+SC  GK  +LPF  S+   S+P + +H DLW  +P+
Sbjct: 459  QVLQLLANKKAIQINK-STSKMCESCRLGKSSRLPFIASDFIASRPLERVHCDLWGPAPV 517

Query: 195  LSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTE 254
             S  G +YY+ F+D+ + F W +P+  KS   S+F  F +F++    T I  FQ D G E
Sbjct: 518  SSIQGFQYYVIFIDNRSRFCWFYPLKHKSDFCSLFMKFQSFVENLLQTKIGTFQSDGGGE 577

Query: 255  FNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHAL 314
            F +  F Q   ++G+    SCPHT  QNG AERK R +     T +  S  P  FW  A 
Sbjct: 578  FTSNRFLQHLQESGIQHYISCPHTPQQNGLAERKHRQLTERGLTLMFQSKAPQRFWVEAF 637

Query: 315  QITTYLQNILPSKIL-SHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTP 373
                +L N+LP+  L S  +P Q L+ + P Y+ LR FGC C+P   + A NK   RS  
Sbjct: 638  FTANFLSNLLPTSALDSSTTPYQVLFGKAPDYSALRTFGCACFPTLRAYARNKFDPRSLK 697

Query: 374  CAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLNDSLHPLL 433
            C FLGY + ++GY+C+   + ++ +SRHV+FDE+ FPF  T+TS  H       S  P+ 
Sbjct: 698  CIFLGYTEKYKGYRCFFPPTNRVYLSRHVLFDESSFPFIDTYTSLQHP------SPTPMF 751

Query: 434  HYHLQNDPKQDEPEPRKIESPQPA--TTPASPINVTNQSILPPSPMSINQLPHPLVSTEL 491
               L++ P    P    +E+ Q A   + AS   +T Q   P   +S+   P+ L+    
Sbjct: 752  DAWLKSFPSSSSP----LENDQTAGFNSGASVPVITAQQTQPI--LSLKDGPNILLPEGE 805

Query: 492  TSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNL----------TTSITSSPLPHNPKA 541
             + +   Q I  EP       ++      K    L          T S    P+ +N   
Sbjct: 806  ITVSSNNQDIEDEPICVTPLQTLSSEDNAKSSETLSMGSEECSECTASFDLDPIGNN--- 862

Query: 542  ALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLV 601
            ALS S     +           T  +  R +      +    R K   DGS ++YKARLV
Sbjct: 863  ALSSSPRHDQLTSSIPRAATESTHPMTTRLKKGIIKLNQ---RVKLNVDGSLDKYKARLV 919

Query: 602  GDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMH 661
              G  Q  G+D  ET+SPVV+  T+R VL ++   +W + Q+DVKN FLHG+L ETVYM 
Sbjct: 920  AQGFKQEEGIDYLETYSPVVRSATVRAVLHLSTIMNWELKQMDVKNGFLHGDLTETVYMK 979

Query: 662  QPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGN 721
            QP GF D  HPD+VCLL K+LYGLKQAPRAW+ +F+ F  + GF  S SD SLF+  K  
Sbjct: 980  QPAGFIDKAHPDHVCLLHKALYGLKQAPRAWFDKFSKFLLSFGFVCSMSDPSLFVCVKNK 1039

Query: 722  DMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
            D+  +LLYVDD+++T +S  L  S++S L  +F MKDLG LSYF
Sbjct: 1040 DVIMLLLYVDDMVITGNSSKLLSSLLSELNKQFKMKDLGRLSYF 1083


>emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|3269282|emb|CAA19715.1|
           putative protein [Arabidopsis thaliana]
           gi|7444417|pir||T05745 hypothetical protein M4I22.20 -
           Arabidopsis thaliana
          Length = 1318

 Score =  446 bits (1146), Expect = e-123
 Identities = 274/789 (34%), Positives = 386/789 (48%), Gaps = 94/789 (11%)

Query: 20  IVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFDPF 79
           I+V  G  +PI   G T ++     + L +VL  P + K+L+ + + T D   T+EF+  
Sbjct: 206 IMVDDGNYLPITHTGSTNLASSSGTVPLTDVLVCPSITKSLLSMSKLTQDFPCTVEFEYD 265

Query: 80  SFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPGANVLS 139
              V D  T   L+   +   LY L      Q+       + S  +WH RLGHP   +L 
Sbjct: 266 GVRVNDKATKKLLLMGSNRDGLYCLKDDKQFQAFFSTRQRSASDEVWHRRLGHPHPQIL- 324

Query: 140 FLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLW-TSPILSSA 198
                                                   +P + +H DLW  + I S  
Sbjct: 325 ----------------------------------------QPLERVHCDLWGPTTITSVQ 344

Query: 199 GHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEFNNE 258
           G +YY  F+D Y+ F W +P+  KS   +IF +FH  ++ Q    I  FQCD G EF + 
Sbjct: 345 GFRYYAVFIDHYSRFSWIYPLKLKSDFYNIFLAFHKLVENQLSQKISVFQCDGGGEFVSH 404

Query: 259 YFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQITT 318
            F Q    +G+  + SCPHT  QNG AERK R +     + L  S +P  FW  A     
Sbjct: 405 KFLQHLQSHGIQQQLSCPHTPQQNGLAERKHRHLVELGLSMLFQSHVPHKFWVEAFFTAN 464

Query: 319 YLQNILPSKILSHH-SPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCAFL 377
           +L N+LP+  L    SP + LY + P YT LR FG  C+P     A NK    S  C FL
Sbjct: 465 FLINLLPTSALKESISPYEKLYDKKPDYTSLRSFGSACFPTLRDYAENKFNPCSLKCVFL 524

Query: 378 GYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLNDSLH-PLLHYH 436
           GY + ++GY+C    + ++ ISRHVIFDE+ +PF+       HTY+ L+     PLL   
Sbjct: 525 GYNEKYKGYRCLYPPTGRLYISRHVIFDESVYPFS-------HTYKHLHPQPRTPLLAAW 577

Query: 437 LQNDPKQDEPEPRKIESPQPAT---------------TPASPINVTNQSILPPSPMSINQ 481
           L++    D P P    SP   +               TP  P  V   S+   S ++  Q
Sbjct: 578 LRSS---DSPAPSTSTSPSSRSPLFTSADFPPLPQRKTPLLPTLVPISSVSHASNITTQQ 634

Query: 482 LPH-------PLVSTELTSPTHTPQ--------------QIHQEPPRT----IATHSMHG 516
            P           S  +   +H+ Q               +HQ    T    + T +  G
Sbjct: 635 SPDFDSERTTDFDSASIGDSSHSSQAGSDSEETIQQASVNVHQTHASTNVHPMVTRAKVG 694

Query: 517 IHKPKIQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNF 576
           I KP  ++   +   S P P    AAL    W  AM +E     + +TW LVP   +++ 
Sbjct: 695 ISKPNPRYVFLSHKVSYPEPKTVTAALKHPGWTGAMTEEIGNCSETQTWSLVPYKSDMHV 754

Query: 577 IRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSK 636
           + S W+FR K  +DG+  + KAR+V  G  Q  G+D  ET+SPVV+  T+R+VL +A + 
Sbjct: 755 LGSKWVFRTKLHADGTLNKLKARIVAKGFLQEEGIDYLETYSPVVRTPTVRLVLHLATAL 814

Query: 637 SWPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRF 696
           +W I Q+DVKN FLHG+L+ETVYM QP GF DP  PD+VCLL KS+YGLKQ+PRAW+ +F
Sbjct: 815 NWDIKQMDVKNAFLHGDLKETVYMTQPAGFVDPSKPDHVCLLHKSIYGLKQSPRAWFDKF 874

Query: 697 ADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAM 756
           + F    GF  SKSD SLFIY   N++  +LLYVDD+++T +S     S+++ L  EF M
Sbjct: 875 STFLLEFGFFCSKSDPSLFIYAHNNNLILLLLYVDDMVITGNSSQTLTSLLAALNKEFRM 934

Query: 757 KDLGTLSYF 765
            D+G L YF
Sbjct: 935 TDMGQLHYF 943


>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
            gi|9989049|gb|AAG10812.1| Putative retroelement
            polyprotein [Arabidopsis thaliana]
          Length = 1404

 Score =  437 bits (1123), Expect = e-120
 Identities = 268/788 (34%), Positives = 397/788 (50%), Gaps = 62/788 (7%)

Query: 19   NIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFDP 78
            ++++ +G ++PI G G   +         +     PK   NL+ V+R T D N    F P
Sbjct: 355  HVIIANGDKVPIEGIGNLKLFNKD-----SKAFFMPKFTSNLLSVKRTTRDLNCYAIFGP 409

Query: 79   FSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALS---PTLWHNRLGHPGA 135
                 +DI+TG  +    S G+LY L   + + S+  ++ + L     TLWH RLGHP  
Sbjct: 410  NDVYFQDIETGKVIGEGGSKGELYVLEDLSPNSSSCFSSKSHLGISFNTLWHARLGHPHT 469

Query: 136  NVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLWTSPIL 195
              L  +  N   +       + C++CI GKH K  F  S +   K FD++HSD+WTSP +
Sbjct: 470  RALKLMLPNISFD------HTSCEACILGKHCKSVFPKSLTIYEKCFDLVHSDVWTSPCV 523

Query: 196  SSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEF 255
            S   +KY++ F+++ + + W   +  K +V   F++F  ++  QF   IK F+ DNG E+
Sbjct: 524  SRDNNKYFVTFINEKSKYTWITLLPSKDRVFEAFTNFETYVTNQFNAKIKVFRTDNGGEY 583

Query: 256  NNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQ 315
             ++ F     K G++ + SCP+T  QNG AERK R +    R+ + H+S+P  FW  A+ 
Sbjct: 584  TSQKFRDHLAKRGIIHQTSCPYTPQQNGVAERKNRHLMEVARSMMFHTSVPKRFWGDAVL 643

Query: 316  ITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCA 375
               YL N  P+K+LS  SP + L +  P   HLRVFGC+C+ L P    +KL A+ST C 
Sbjct: 644  TACYLINRTPTKVLSDLSPFEVLNNTKPFIDHLRVFGCVCFVLIPGEQRSKLDAKSTKCM 703

Query: 376  FLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAK---------THTSS--THTYEF 424
            FLGY    +GYKC+D +  +  ISR V F E Q    K         TH++S    T +F
Sbjct: 704  FLGYSTTQKGYKCFDPTKNRTFISRDVKFLENQDYNNKKDWENLKDLTHSTSDRVETLKF 763

Query: 425  LNDSLHPLLHYHLQND-PKQDEPEPRKIESPQPATTPASPINVTNQSIL------PPSPM 477
            L D        HL ND     + +P   +  +        +++ +Q  L      PP+  
Sbjct: 764  LLD--------HLGNDSTSTTQHQPEMTQDQEDLNQENEEVSLQHQENLTHVQEDPPNTQ 815

Query: 478  SINQLPHPLVSTELTSPTHTPQQIHQEPP------RTIATHSMHGIHKPKIQFNLTTSIT 531
              ++  H     + +S    P Q+   PP      R          +     F  T S+ 
Sbjct: 816  EHSE--HVQEIQDDSSEDEEPTQVLPPPPPLRRSTRIRRKKEFFNSNAVAHPFQATCSLA 873

Query: 532  SSPLPHNP--------------KAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFI 577
              PL H                + A+    W+ A+ DE +A+ +N TW+    P+    +
Sbjct: 874  LVPLDHQAFLSKISEHWIPQTYEEAMEVKEWRDAIADEINAMKRNHTWDEDDLPKGKKTV 933

Query: 578  RSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKS 637
             S W+F  K KS+G  ERYK RLV  G +Q  G D  ETF+PV K  T+R+VL +A + S
Sbjct: 934  SSRWVFTIKYKSNGDIERYKTRLVARGFTQTYGSDYMETFAPVAKLHTVRVVLALATNLS 993

Query: 638  WPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFA 697
            W + Q+DVKN FL GEL++ VYM  P G  D I  D V  L+K++YGLKQ+PRAWY + +
Sbjct: 994  WGLWQMDVKNAFLQGELEDDVYMTPPPGLEDTIPCDKVLRLRKAIYGLKQSPRAWYHKLS 1053

Query: 698  DFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMK 757
                  GF  S+SDH+LF  +    +  +L+YVDD+I+T  +     S  + L S F +K
Sbjct: 1054 RTLKDHGFKKSESDHTLFTLQSPQGIVVVLIYVDDLIITGDNKDGIDSTKTFLKSCFDIK 1113

Query: 758  DLGTLSYF 765
            DLG L YF
Sbjct: 1114 DLGELKYF 1121


>gb|AAT39281.1| putative late blight resistance protein [Solanum demissum]
          Length = 1630

 Score =  419 bits (1078), Expect = e-115
 Identities = 268/800 (33%), Positives = 382/800 (47%), Gaps = 97/800 (12%)

Query: 18   KNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFD 77
            + I +G G  IPI   G T +S       L N L +  +  NL+ V +F  DN+ +IEF 
Sbjct: 313  EEIAMGDGNTIPISHTGNTNLSASNQQFKLLNTLCSHSIKNNLLSVSKFCRDNHTSIEFF 372

Query: 78   PFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPGANV 137
            PFS+ V+D+ TG PL R  +   LY     ++H +  P     +   LWH RLGHP    
Sbjct: 373  PFSYCVKDLSTGAPLFRGQNRDGLYEWPLGSAHHT--PQCNVVVPLHLWHRRLGHPNHRT 430

Query: 138  LSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLW-TSPILS 196
            L+ +     +      + SIC SC   K  +LPFS ++  + +P  II++DLW  SP+LS
Sbjct: 431  LNMIFHQFSLPVSHSRTASICNSCYSNKMHRLPFSENSLQSQRPLQIIYTDLWGPSPVLS 490

Query: 197  SAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEFN 256
                +YY  F+D Y+ ++  F I  K +V  +F + H  ++ +F T I     D G EF 
Sbjct: 491  IDNKRYYALFVDQYSKYMCLFTIKSKKEVLDVFQALHPLLERRFQTKIMSLYTDGGGEFQ 550

Query: 257  NEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQI 316
                + +    G+    + P+T  +    ER+ + +    +T L  +S+P SFW  A   
Sbjct: 551  G--LSSYLKIQGIEHLVTPPYTPQRVASVERRHKHVVETAKTLLHQASLPSSFWSFACHQ 608

Query: 317  TTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCAF 376
              YL N L +  L +  P + L+H  P Y  LRVFGCLCYP     A NKL+ +STPC +
Sbjct: 609  AVYLINRLTTPNLQNKCPYEILFHEAPKYESLRVFGCLCYPWLKPYAKNKLEPKSTPCVY 668

Query: 377  LGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKT-----HTSSTHTYEFLND---- 427
            LG+   H  ++C+D    K+ +SR V F E  +PF        +  ST ++E   D    
Sbjct: 669  LGFSTKHYCHQCFDPVKNKLYLSRDVQFLEDTYPFHNIFLNLKNQQSTDSWEICYDVLPV 728

Query: 428  --------SLHPL-----LHYHLQND-PKQDEPEPRKIESPQPATTPASPINVT------ 467
                    S H L     ++  L N  P + E       + Q  ++P+ P  +T      
Sbjct: 729  TNKPSSFDSCHTLPDALPVYSLLPNSMPARSEGVSIASGNSQTLSSPSLPHTITPPPDYT 788

Query: 468  ----------------NQSILP-PSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIA 510
                            + S+LP PSP+    LP    +   + PT         P  T +
Sbjct: 789  QPQPLITYQRKNHQQPSTSVLPLPSPIPPTNLPSQSSANNSSQPTLALAPSDPSPVVTTS 848

Query: 511  THSMHGIHK-----PKIQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTW 565
            +H M    K     PK QF++   ++SS +PH  K A    +W+ AM  EFDAL++N TW
Sbjct: 849  SHPMVTRSKTNSLQPK-QFSVNVQLSSSFVPHTYKQACPHPHWREAMHAEFDALVRNWTW 907

Query: 566  ELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTT 625
            +LVP   ++N +                                         PVVKP T
Sbjct: 908  DLVPVTHSMNVV----------------------------------------DPVVKPIT 927

Query: 626  IRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGL 685
            IR+VLTI    +WPIHQ+DV N FL G L+E VYM QP GF D     +VC L K +YGL
Sbjct: 928  IRLVLTIVTQYNWPIHQIDVNNAFLQGSLEEEVYMRQPPGFEDQSLSTHVCKLNKVIYGL 987

Query: 686  KQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRS 745
            KQAPRAWY     +  T+GF  S+SD SLFI        Y+L+YVD II+T +     R 
Sbjct: 988  KQAPRAWYNELKSYLLTVGFVKSQSDSSLFILHNFGFTVYVLIYVDAIIITGNQIHGVRH 1047

Query: 746  IMSLLASEFAMKDLGTLSYF 765
            I+  L + F++KDLG L YF
Sbjct: 1048 IIDGLFTRFSLKDLGQLHYF 1067


>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
          Length = 1382

 Score =  414 bits (1065), Expect = e-114
 Identities = 280/804 (34%), Positives = 405/804 (49%), Gaps = 53/804 (6%)

Query: 1    MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
            M+    + ++ S LS +  ++   G  +P+ G G          LSL NV   PKL  NL
Sbjct: 352  MSPDSSSFTSVSPLS-SIPVMTADGTPMPLAGVGSVVTLH----LSLPNVYLIPKLKLNL 406

Query: 61   IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPL---------TTTTSHQ 111
              + +     +  + F      V+D+Q+   +        LY L           TT   
Sbjct: 407  ASIGQICDSGDYLVMFSGSFCCVQDLQSQKLIGTGRRENGLYILDELKVPVVVAATTVDL 466

Query: 112  STSPATFAALSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPF 171
            S    + ++ S  LWH+RLGH  ++ L FL     +   +    S C  C   K   LPF
Sbjct: 467  SFFRLSLSSSSFYLWHSRLGHVSSSRLRFLASTGALGNLKTCDISDCSGCKLAKFSALPF 526

Query: 172  SISNSTTSKPFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFS 230
            + S S +S PFD+IHSD+W  SP+ +  G +YY+ F+DD+T + W + +  +S+   I++
Sbjct: 527  NRSTSVSSSPFDLIHSDVWGPSPVSTKGGSRYYVSFIDDHTRYCWVYLMKHRSEFFEIYA 586

Query: 231  SFHAFIKTQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIR 290
            +F A IKTQ    IKCF+CD G E+ +  F Q    +G + + SC  T  QNG AERK R
Sbjct: 587  AFRALIKTQHSAVIKCFRCDLGGEYTSNKFCQMLALDGTIHQTSCTDTPEQNGVAERKHR 646

Query: 291  AINNFIRTSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRV 350
             I    R+ L  + +   FW  A+     L N +PS   S  SP + LY   P Y+  RV
Sbjct: 647  HIVETARSLLLSAFVLSEFWGEAVLTAVSLINTIPSSHSSGLSPFEKLYGHVPDYSSFRV 706

Query: 351  FGCLCYPLFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFP 410
            FGC  + L P    NKL +RS  C FLGY +  +GY+C+D  ++K+ +S HV+F E   P
Sbjct: 707  FGCTYFVLHPHVERNKLSSRSAICVFLGYGEGKKGYRCFDPITQKLYVSHHVVFLE-HIP 765

Query: 411  FAKTHTSSTHTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVTNQS 470
            F     S+TH+    +D +H  +    ++      P  R I +   A T          +
Sbjct: 766  FFSI-PSTTHSLT-KSDLIH--IDPFSEDSGNDTSPYVRSICTHNSAGT---------GT 812

Query: 471  ILPPSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNLTTSI 530
            +L  +P +      P  S+E+  P          PPR  +         P   ++  +S 
Sbjct: 813  LLSGTPEASFSSTAPQASSEIVDP----------PPRQ-SIRIRKSTKLPDFAYSCYSSS 861

Query: 531  TSSPL--------PHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWI 582
             +S L        P + K A+ D   + AM +E  AL K  TW+LVP P   + +   W+
Sbjct: 862  FTSFLAYIHCLFEPSSYKEAILDPLGQQAMDEELSALHKTDTWDLVPLPPGKSVVGCRWV 921

Query: 583  FRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQ 642
            ++ K  SDGS ERYKARLV  G SQ  G+D +ETF+P+ K TTIR ++ +A  + W I Q
Sbjct: 922  YKIKTNSDGSIERYKARLVAKGYSQQYGMDYEETFAPIAKMTTIRTLIAVASIRQWHISQ 981

Query: 643  LDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFT 702
            LDVKN FL+G+LQE VYM  P G        YVC LKK+LYGLKQAPRAW+++F+    +
Sbjct: 982  LDVKNAFLNGDLQEEVYMAPPPGISH--DSGYVCKLKKALYGLKQAPRAWFEKFSIVISS 1039

Query: 703  IGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSL-LASEFAMKDLGT 761
            +GF  S  D +LFI         + LYVDD+I+T   D+   S++   LA  F MKDLG 
Sbjct: 1040 LGFVSSSHDSALFIKCTDAGRIILSLYVDDMIIT-GDDIDGISVLKTELARRFEMKDLGY 1098

Query: 762  LSYFFR-PCSYTSCRWLVS*SKKI 784
            L YF     +Y+   +L+S SK +
Sbjct: 1099 LRYFLGIEVAYSPRGYLLSQSKYV 1122


>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
            (gb|U12626). [Arabidopsis thaliana]
            gi|25301690|pir||G96722 hypothetical protein F20P5.25
            [imported] - Arabidopsis thaliana
          Length = 1315

 Score =  398 bits (1022), Expect = e-109
 Identities = 252/738 (34%), Positives = 363/738 (49%), Gaps = 43/738 (5%)

Query: 45   LSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPL 104
            L LN+VL  P+   NL+ V   T      I FD  S  ++D    + +       +LY +
Sbjct: 323  LILNDVLFIPQFKFNLLSVSSLTKSMGCRIWFDETSCVLQDATRELMVGMGKQVANLYIV 382

Query: 105  TTTT-SHQST-SPATFAAL-SPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSC 161
               + SH  T S  T A++ S  LWH RLGHP    L  ++       ++ ++   C+ C
Sbjct: 383  DLDSLSHPGTDSSITVASVTSHDLWHKRLGHPSVQKLQPMSSLLSFPKQKNNTDFHCRVC 442

Query: 162  IYGKHVKLPFSISNSTTSKPFDIIHSDLWTS-PILSSAGHKYYLFFLDDYTNFVWTFPIG 220
               K   LPF   N+ +S+PFD+IH D W    + +  G++Y+L  +DDY+   W + + 
Sbjct: 443  HISKQKHLPFVSHNNKSSRPFDLIHIDTWGPFSVQTHDGYRYFLTIVDDYSRATWVYLLR 502

Query: 221  RKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSP 280
             KS V ++  +F   ++ QF TTIK  + DN  E N   FTQF H  G+V   SCP T  
Sbjct: 503  NKSDVLTVIPTFVTMVENQFETTIKGVRSDNAPELN---FTQFYHSKGIVPYHSCPETPQ 559

Query: 281  QNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYH 340
            QN   ERK + I N  R+    S +P S+W   +    YL N LP+ IL    P + L  
Sbjct: 560  QNSVVERKHQHILNVARSLFFQSHIPISYWGDCILTAVYLINRLPAPILEDKCPFEVLTK 619

Query: 341  RDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISR 400
              P+Y H++VFGCLCY        +K   R+  CAF+GYP   +GYK  DL +  II+SR
Sbjct: 620  TVPTYDHIKVFGCLCYASTSPKDRHKFSPRAKACAFIGYPSGFKGYKLLDLETHSIIVSR 679

Query: 401  HVIFDETQFPFAKTHTSSTHTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTP 460
            HV+F E  FPF  +  S      F                P  +   P + +S    +  
Sbjct: 680  HVVFHEELFPFLGSDLSQEEQNFF----------------PDLNPTPPMQRQS----SDH 719

Query: 461  ASPINVTNQSILPPSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHK- 519
             +P + ++   + PS    N +P P V T        P  +      ++ + + H I K 
Sbjct: 720  VNPSDSSSSVEILPSANPTNNVPEPSVQTS-HRKAKKPAYLQDYYCHSVVSSTPHEIRKF 778

Query: 520  --------PKIQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRP 571
                    P + F      T  P  +     L    W+ AM  EFD L    TWE+   P
Sbjct: 779  LSYDRINDPYLTFLACLDKTKEPSNYTEAEKL--QVWRDAMGAEFDFLEGTHTWEVCSLP 836

Query: 572  QNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLT 631
             +   I   WIF+ K  SDGS ERYKARLV  G +Q  G+D +ETFSPV K  +++++L 
Sbjct: 837  ADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNSVKLLLG 896

Query: 632  IALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFR----DPIHPDYVCLLKKSLYGLKQ 687
            +A      + QLD+ N FL+G+L E +YM  P G+     D + P+ VC LKKSLYGLKQ
Sbjct: 897  VAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQGDSLPPNAVCRLKKSLYGLKQ 956

Query: 688  APRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIM 747
            A R WY +F+     +GF  S  DH+ F+         +L+Y+DDII+ +++D     + 
Sbjct: 957  ASRQWYLKFSSTLLGLGFIQSYCDHTCFLKISDGIFLCVLVYIDDIIIASNNDAAVDILK 1016

Query: 748  SLLASEFAMKDLGTLSYF 765
            S + S F ++DLG L YF
Sbjct: 1017 SQMKSFFKLRDLGELKYF 1034


>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301701|pir||E84589 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1461

 Score =  397 bits (1019), Expect = e-108
 Identities = 238/756 (31%), Positives = 376/756 (49%), Gaps = 45/756 (5%)

Query: 15   SINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTI 74
            SI   + + +G  + I G G   I+     + L NVL  P+   NLI +   T D    +
Sbjct: 465  SIVSFVNLPTGPNVRISGVGTVLINKD---IILQNVLFIPEFRLNLISISSLTTDLGTRV 521

Query: 75   EFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPG 134
             FDP    ++D+  G+ L      G+LY L T    QS + +  A +  ++WH RLGHP 
Sbjct: 522  IFDPSCCQIQDLTKGLTLGEGKRIGNLYVLDT----QSPAISVNAVVDVSVWHKRLGHPS 577

Query: 135  ANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLWTS-P 193
             + L  L++       +    + C  C   K  KL F  +N+  +  F+++H D+W    
Sbjct: 578  FSRLDSLSEVLGTTRHKNKKSAYCHVCHLAKQKKLSFPSANNICNSTFELLHIDVWGPFS 637

Query: 194  ILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGT 253
            + +  G+KY+L  +DD++   W + +  KS V ++F +F   ++ Q+ T +K  + DN  
Sbjct: 638  VETVEGYKYFLTIVDDHSRATWIYLLKSKSDVLTVFPAFIDLVENQYDTRVKSVRSDNAK 697

Query: 254  EFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHA 313
            E     FT+F    G+V   SCP T  QN   ERK + I N  R  +  S+M   +W   
Sbjct: 698  ELA---FTEFYKAKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYWGDC 754

Query: 314  LQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTP 373
            +    +L N  PS +LS+ +P + L  + P Y+ L+ FGCLCY    S   +K   RS  
Sbjct: 755  VLTAVFLINRTPSALLSNKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRA 814

Query: 374  CAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLNDSLHPLL 433
            C FLGYP   +GYK  DL S  + ISR+V F E  FP A +  S+T      +D   P+ 
Sbjct: 815  CVFLGYPFGFKGYKLLDLESNVVHISRNVEFHEELFPLASSQQSATTA----SDVFTPMD 870

Query: 434  HYHLQNDPKQDEPEPRKIESPQPATTPASPINVTNQSILPPSPMSINQLPHPLVSTELTS 493
                 N      P P+            SP    ++  +   P  +       V+ + + 
Sbjct: 871  PLSSGNSITSHLPSPQ-----------ISPSTQISKRRITKFPAHLQDYHCYFVNKDDSH 919

Query: 494  PTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNLTTSITSSPLPHNPKAALSDSNWKAAML 553
            P  +     Q  P    +H ++             +I+  P+P +   A     W  A+ 
Sbjct: 920  PISSSLSYSQISP----SHMLY-----------INNISKIPIPQSYHEAKDSKEWCGAID 964

Query: 554  DEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDC 613
             E  A+ +  TWE+   P     +   W+F  K  +DGS ER+KAR+V  G +Q  G+D 
Sbjct: 965  QEIGAMERTDTWEITSLPPGKKAVGCKWVFTVKFHADGSLERFKARIVAKGYTQKEGLDY 1024

Query: 614  DETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFRD----P 669
             ETFSPV K  T++++L ++ SK W ++QLD+ N FL+G+L+ET+YM  P G+ D     
Sbjct: 1025 TETFSPVAKMATVKLLLKVSASKKWYLNQLDISNAFLNGDLEETIYMKLPDGYADIKGTS 1084

Query: 670  IHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLY 729
            + P+ VC LKKS+YGLKQA R W+ +F++    +GF     DH+LF+   G++   +L+Y
Sbjct: 1085 LPPNVVCRLKKSIYGLKQASRQWFLKFSNSLLALGFEKQHGDHTLFVRCIGSEFIVLLVY 1144

Query: 730  VDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
            VDDI++ ++++   +S+   L + F +++LG L YF
Sbjct: 1145 VDDIVIASTTEQAAQSLTEALKASFKLRELGPLKYF 1180


>gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|50919599|ref|XP_470160.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
          Length = 1335

 Score =  389 bits (999), Expect = e-106
 Identities = 238/760 (31%), Positives = 379/760 (49%), Gaps = 42/760 (5%)

Query: 15   SINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTI 74
            S +  I +G+G      G G   +     P  + +VL  P L +NL+ + +  +++   +
Sbjct: 349  SYHAKIHMGNGSIAQSEGKGTVAVQTADGPKFIKDVLLVPDLKQNLLSIGQL-LEHGYAV 407

Query: 75   EFDPFSFSVEDIQTGIPLMRCDSTGD---LYPLTTTTSHQSTSPATFAALSPTLWHNRLG 131
             F+ FS  + D +    + + +   +   L  +  TT     S    +     LWH R+G
Sbjct: 408  YFEDFSCKILDRKNNRLVAKINMEKNRNFLLRMNHTTQMALRSEVDIS----DLWHKRMG 463

Query: 132  HPGANVLSFLNKNKFIECKQI----SSPSICQSCIYGKHVKLPFSISNS-TTSKPFDIIH 186
            H     L  L     ++        S P  C+ C++GK ++  F  S +   S P +++H
Sbjct: 464  HLNYRALKLLRTKGMVQGLPFITLKSDP--CEGCVFGKQIRASFPHSGAWRASAPLELVH 521

Query: 187  SDLWTS-PILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIK 245
            +D+    P +S  G+ Y++ F+DDYT  +W + +  KS    IF  F A ++ Q    IK
Sbjct: 522  ADIVGKVPTISEGGNWYFITFIDDYTRMIWVYFLKEKSAALEIFKKFKAMVENQSNRKIK 581

Query: 246  CFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSM 305
              + D G E+ ++ F ++C   G+  + +  +++ QNG AERK R IN+   + L    M
Sbjct: 582  VLRSDQGREYISKEFEKYCENAGIRRQLTAGYSAQQNGVAERKNRTINDMANSMLQDKGM 641

Query: 306  PPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAIN 365
            P SFW  A+    Y+ N  P+K +++ +P +  Y + P   H+RVFGC+CY   P+    
Sbjct: 642  PKSFWAEAVNTAVYILNRSPTKAVTNRTPFEAWYGKKPVIGHMRVFGCICYAQVPAQKRV 701

Query: 366  KLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFL 425
            K   +S  C F+GY    +GY+ Y+L  KKIIISR  IFDE          S+T  ++  
Sbjct: 702  KFDNKSDRCIFVGYADGIKGYRLYNLEKKKIIISRDAIFDE----------SATWNWKSP 751

Query: 426  NDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVTNQSILPPSPMSINQLPHP 485
              S  PLL        +       ++E   P+  P+SP++ ++ S    SP S  Q+   
Sbjct: 752  EASSTPLLPTTTITLGQPHMHGTHEVEDHTPSPQPSSPMSSSSASS-DSSPSSEEQI--- 807

Query: 486  LVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNLTTSITSSPLPHNPKAALSD 545
                  ++P   P+++        +T    G  + +       S+     P + + A   
Sbjct: 808  ------STPESAPRRVRSMVELLESTSQQRGSEQHEF---CNYSVVE---PQSFQEAEKH 855

Query: 546  SNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGR 605
             NW  AM DE   + KN TWELV RP++   I   W+++ K   DGS ++YKARLV  G 
Sbjct: 856  DNWIKAMEDEIHMIEKNNTWELVDRPRDREVIGVKWVYKTKLNPDGSVQKYKARLVAKGF 915

Query: 606  SQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMG 665
             Q  G+D  ET++PV +  TIR ++ +A  K W I+QLDVK+ FL+G L E +Y+ QP G
Sbjct: 916  KQKPGIDYYETYAPVARLETIRTIIALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQPEG 975

Query: 666  FRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTY 725
            F      + V  LKK+LYGLKQAPRAWY +   +    GF+ S S+ +L++ + G D+  
Sbjct: 976  FSVQGGENKVFRLKKALYGLKQAPRAWYSQIDKYFIQKGFAKSISEPTLYVNKTGTDILI 1035

Query: 726  ILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
            + LYVDD+I T +S+ + +     +   + M DLG L YF
Sbjct: 1036 VSLYVDDLIYTGNSEKMMQDFKKDMMHTYEMSDLGLLHYF 1075


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.332    0.142    0.470 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,748,971,649
Number of Sequences: 2540612
Number of extensions: 76521276
Number of successful extensions: 251689
Number of sequences better than 10.0: 2329
Number of HSP's better than 10.0 without gapping: 1828
Number of HSP's successfully gapped in prelim test: 527
Number of HSP's that attempted gapping in prelim test: 244530
Number of HSP's gapped (non-prelim): 4472
length of query: 993
length of database: 863,360,394
effective HSP length: 138
effective length of query: 855
effective length of database: 512,755,938
effective search space: 438406326990
effective search space used: 438406326990
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 80 (35.4 bits)


Medicago: description of AC144645.3