Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC124217.5 + phase: 0 /pseudo
         (1307 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultiv...   404  e-111
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult...   390  e-106
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi...   388  e-106
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-...   387  e-105
ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sa...   385  e-105
gb|AAP53029.1| putative retrotransposon-related protein [Oryza s...   378  e-103
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi...   377  e-102
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-...   376  e-102
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu...   374  e-101
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi...   372  e-101
emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana] ...   369  e-100
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu...   358  5e-97
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi...   353  2e-95
gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis ...   352  5e-95
gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult...   351  8e-95
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi...   350  2e-94
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1...   349  3e-94
gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi...   345  4e-93
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]                             340  1e-91
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]               338  7e-91

>ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultivar-group)]
           gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza
           sativa (japonica cultivar-group)]
          Length = 1181

 Score =  404 bits (1039), Expect = e-111
 Identities = 238/583 (40%), Positives = 341/583 (57%), Gaps = 36/583 (6%)

Query: 5   KWDIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEKREMIDKAKSAIV 64
           K+D+        F LW+VKM+AVL QQ+  +AL G        + +EK+    KA S I 
Sbjct: 5   KYDLPLLDRDTRFSLWQVKMRAVLAQQELDDALSGFDKRTQDWSNDEKKRD-RKAMSYIH 63

Query: 65  LCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQLTE 124
           L L + +L++V +E TAA +  KLE + MTK L  +  LKQ+L+  K+ +  S+ + L+ 
Sbjct: 64  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSA 123

Query: 125 FNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALRTKE 184
           F +I+ DL ++EV  +++D AL+LLCSLP S+ +F+DTILY ++ T TL+EV  AL  KE
Sbjct: 124 FKEIVADLESMEVKYDEKDLALILLCSLPSSYANFRDTILYSRD-TLTLKEVYDALHAKE 182

Query: 185 LTKFKDLKVDEGS----EGLNVARGRNEHRGKGKGKSRSKSRSKGFDKSK--YK-CFLCH 237
             K K +   EGS    EGL V   + E     K + +S S  +G  KS+  YK C  C 
Sbjct: 183 --KMKKMVPSEGSNSQAEGLVVRGSQQEKNTNNKSRDKSSSSYRGRSKSRGRYKSCKYCK 240

Query: 238 KQGHFKKDC---PDKGGDGSPSVQVAEASNEEGYESTGALVVTSWKSEKS---------- 284
           + GH    C    DK  D      + +   EE  +   A VVT  KS+            
Sbjct: 241 RDGHDISKCWKLQDK--DKRTGKYIPKGKKEEEGK---AAVVTDEKSDAELLVAYAGCAQ 295

Query: 285 ----WVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFL 340
               W+LD+ C+YHMCP +++F T  + +GG V +G++  C+V G+G V++KMFDG    
Sbjct: 296 TSDQWILDTACTYHMCPNRDWFATYEVVQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRT 355

Query: 341 LRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGS-KMNGLYILDGSI 399
           L DVR +P LKR+LISL   D  GY      G+ K++ G+L+ +K S K   LY L G+ 
Sbjct: 356 LSDVRHIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKASIKSANLYHLQGTT 415

Query: 400 VIGNASVASVVPHNN--SELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGK 457
           ++GN +  S    N+  + LWH+RLGH+SE GL EL+K+GLL    + KL+FCEHCI GK
Sbjct: 416 ILGNVATVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSISKLKFCEHCIFGK 475

Query: 458 QHRVKFGSGMHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSD 517
             RVKF +  H +  + +YVHSDL GP++  + GG  Y ++I+DDYSR+VW + LK K  
Sbjct: 476 HKRVKFNTSTHTTEGILDYVHSDLWGPARKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQ 535

Query: 518 TF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
            F  FKE  T++E Q   K+K LRTDNG+EF S+ F  + + E
Sbjct: 536 AFNVFKEWKTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSE 578


>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
           gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1280

 Score =  390 bits (1003), Expect = e-106
 Identities = 235/584 (40%), Positives = 336/584 (57%), Gaps = 38/584 (6%)

Query: 5   KWDIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEKREMIDKAKSAIV 64
           K+D+        F LW+VKM+AVL QQ   +AL G        + +EK++   KA S I 
Sbjct: 40  KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEKKKD-RKAMSYIH 98

Query: 65  LCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQLTE 124
           L L + +L++V +E TAA +  KLE + MTK L  +  LKQ+L+  K+ +  S+ + L+ 
Sbjct: 99  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLST 158

Query: 125 FNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALRTKE 184
           F +I+ DL +IEV  ++ED  L+LLCSLP S+ +F+DTILY  + T  L+EV  AL  KE
Sbjct: 159 FKEIVADLESIEVKYDEEDLGLILLCSLPSSYANFRDTILYSHD-TLILKEVYDALHAKE 217

Query: 185 LTKFKDLKVDEGS----EGLNVARGRNEHRG---KGKGKSRSKSRSKGFDKSKYK-CFLC 236
             K K +   EGS    EGL V RGR + +    + + KS S  R +   + +YK C  C
Sbjct: 218 --KMKKMVPSEGSNSQAEGL-VVRGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYC 274

Query: 237 HKQGHFKKDC---PDKGGDGSPSVQVAEASNEEGYESTGALVVTSWKSEKS--------- 284
            + GH   +C    DK  D      + +   EE  +   A VVT  KS+           
Sbjct: 275 KRDGHDISECWKLQDK--DKRTGKYIPKGKKEEEGK---AAVVTDEKSDTELLVAYAGCA 329

Query: 285 -----WVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREF 339
                W+LD+  +YHMCP +++F T    +GG V +G++  C+V G+G V++KMFDG   
Sbjct: 330 QTSDQWILDTAWTYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIR 389

Query: 340 LLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGS-KMNGLYILDGS 398
            L DVR +P LKR+LISL   D  GY      G+ K++ G+L+ +K   K   LY L G+
Sbjct: 390 TLSDVRHIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGT 449

Query: 399 IVIGNASVASVVPHNN--SELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILG 456
            ++GN +  S    N+  + LWH+RLGH+SE GL EL+K+ LL    + KL+FCEHCI G
Sbjct: 450 TILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRELLDGQSIGKLKFCEHCIFG 509

Query: 457 KQHRVKFGSGMHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKS 516
           K  RVKF +  H +  + +YVHSDL GP+   + GG  Y ++I+DDYSR+VW + LK K 
Sbjct: 510 KHKRVKFNTSTHTTEGILDYVHSDLWGPACKTSFGGARYMMTIVDDYSRKVWPYFLKHKY 569

Query: 517 DTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
             F  FKE  T++E Q   K+K LRTDNG+EF S+ F  + + E
Sbjct: 570 QAFDVFKEWKTMVERQTEKKVKILRTDNGMEFCSKIFKSYCKSE 613


>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301696|pir||F84486 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1356

 Score =  388 bits (996), Expect = e-106
 Identities = 227/584 (38%), Positives = 342/584 (57%), Gaps = 40/584 (6%)

Query: 7   DIEKFTGSNDFGLWKVKMQAVLTQQKCVEALK-----GEAAMPATLTQEEKREMIDK--- 58
           ++EKF G  D+ +WK K+ A +       ALK     GE       + E+  E ++K   
Sbjct: 7   EVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEKFEA 66

Query: 59  -------AKSAIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFK 111
                  A+SAIVL + D+VLR + +E+TAA+M   L+ LYM+K+L +R   KQ+LYSFK
Sbjct: 67  LEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQKLYSFK 126

Query: 112 MVESISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGK-EGT 170
           M E++S+   + EF +I+ DL N+ V   DED+A+LLL +LPK+F+  KDT+ Y   +  
Sbjct: 127 MSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKYSSGKSI 186

Query: 171 TTLEEVQAALRTKEL---TKFKDLKVDEGSEGLNVARGRNEHRGKGKGKSRSKSRSKGFD 227
            TL+EV AA+ +KEL   +  K +KV   +EGL V + +NE++GKG+ K + K + KG  
Sbjct: 187 LTLDEVAAAIYSKELELGSVKKSIKVQ--AEGLYV-KDKNENKGKGEQKGKGKGK-KGKS 242

Query: 228 KSKYKCFLCHKQGHFKKDCPD------------KGGDGSPSVQVAEASNEEGYESTGALV 275
           K K  C+ C ++GHF+  CP+            KG        +AEA+   GY  + AL 
Sbjct: 243 KKKPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAA---GYYVSEALS 299

Query: 276 VTSWKSEKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFD 335
            T    E  W+LD+GCSYHM  ++E+F       GG VR+GN    +V+G+G +R+K  D
Sbjct: 300 STEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGTIRVKNSD 359

Query: 336 GREFLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNGLYIL 395
           G   +L +VR++P++ RNL+SL  F+  GY    E G+ +I  G  + + G + + LY+L
Sbjct: 360 GLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDGILRIKAGNQVLLTGRRYDTLYLL 419

Query: 396 DGSIVIGNASVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCIL 455
           +    + + S+A V   +++ LWH RL H+S++ +  L ++G L K K+  L+ CE CI 
Sbjct: 420 NWK-PVASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDKKKVSSLDVCEDCIY 478

Query: 456 GKQHRVKFGSGMHHSSRLFEYVHSDLLGPSKTP-THGGGSYFLSIIDDYSRRVWVFVLKK 514
           GK  R  F    H +    EY+HSDL G    P + G   YF+SIIDD++R+VWV+ +K 
Sbjct: 479 GKAKRKSFSLAHHDTKEKLEYIHSDLWGAPFVPLSLGKCQYFMSIIDDFTRKVWVYFMKT 538

Query: 515 KSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQ 558
           K + F KF E   L+ENQ   ++K LRTDNGLEF ++ F+ F +
Sbjct: 539 KDEAFEKFVEWVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCE 582


>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
           sativa (japonica cultivar-group)]
          Length = 2340

 Score =  387 bits (994), Expect = e-105
 Identities = 234/583 (40%), Positives = 335/583 (57%), Gaps = 36/583 (6%)

Query: 5   KWDIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEKREMIDKAKSAIV 64
           K+D+        F LW+VKM+AVL QQ   +AL G        + +EK+    KA S I 
Sbjct: 212 KYDLPLLYRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTHDWSNDEKKRD-RKAMSYIH 270

Query: 65  LCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQLTE 124
           L L + +L++V +E  AA +  KLE + MTK L  +  LKQ L+  K+ +  S+ + L+ 
Sbjct: 271 LHLSNNILQEVLKEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSA 330

Query: 125 FNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALRTKE 184
           F +I+ DL ++EV  ++ED  L+LLCSLP S+ +F+DTILY ++ T TL+EV  AL  KE
Sbjct: 331 FKEIIADLESMEVKYDEEDLGLILLCSLPSSYANFRDTILYSRD-TLTLKEVYDALHVKE 389

Query: 185 LTKFKDLKVDEGS----EGLNVARGRNEHRGKGKGKSRSKSRSKGFDKSK--YK-CFLCH 237
             K K +   EGS    EGL V   + E   K + + +S S  +G  KS+  YK C  C 
Sbjct: 390 --KMKKMVPSEGSNSQAEGLIVWGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYCK 447

Query: 238 KQGHFKKDC---PDKGGDGSPSVQVAEASNEEGYESTGALVVTSWKSEKS---------- 284
           + GH   +C    DK  D      V +   EE  +   A VVT  KS+            
Sbjct: 448 RDGHDIFECWKLHDK--DKRTGKYVPKGKKEEEGK---AAVVTDEKSDAELLVAYAGCAQ 502

Query: 285 ----WVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFL 340
               W+L++ C YHMCP +++F T    + G V +G++  C+V G+G V++KMFDG    
Sbjct: 503 TSDQWILNTACIYHMCPNRDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRT 562

Query: 341 LRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGS-KMNGLYILDGSI 399
           L DVR +P LKR+LISL   D  GY      G+ K++ G+L+ +K   K   LY L G+ 
Sbjct: 563 LSDVRHIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTT 622

Query: 400 VIGNASVASVVPHNN--SELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGK 457
           ++GN +  S    N+  + LWH+RLGH++E GL EL+K+GLL    + KL+FCEHCI GK
Sbjct: 623 ILGNVAAVSDSLSNSDATNLWHMRLGHMTEIGLAELSKRGLLDGQSIGKLKFCEHCIFGK 682

Query: 458 QHRVKFGSGMHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSD 517
             RVKF +  H +  + +YVHSDL GP++  + GG  Y ++I+DDYSR+VW + LK K  
Sbjct: 683 HKRVKFNTSTHTTEGILDYVHSDLWGPARKTSFGGTRYMMTIVDDYSRKVWPYFLKHKYQ 742

Query: 518 TF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
            F  FKE  T++E Q   K+K LRTDNG+EF S+ F  + + E
Sbjct: 743 AFDVFKEWKTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSE 785


>ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sativa]
           gi|14029020|gb|AAK52561.1| Putative retroelement pol
           polyprotein [Oryza sativa]
          Length = 1326

 Score =  385 bits (989), Expect = e-105
 Identities = 229/574 (39%), Positives = 337/574 (57%), Gaps = 25/574 (4%)

Query: 5   KWDIEKFTGSNDFGLWKVKMQAVLTQQKCV-EALKGEAAMPATLTQEEKREMIDKAKSAI 63
           K+D+        F LW+VKM+A+L Q   + EAL+      +T    E++    KA   I
Sbjct: 5   KYDLPLLDYKTRFSLWQVKMRAILAQTSDLDEALESFGKKKSTEWTAEEKRKDRKALLLI 64

Query: 64  VLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQLT 123
            L L + +L++V +E TAA +  KLES+ M+K L  +  +K +L+S K+ ES S+   ++
Sbjct: 65  QLHLSNDILQEVLQEKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 124

Query: 124 EFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALRTK 183
            F +I+VDL +IEV  +DED  LLLLCSLP S+ +F+DTIL  ++   TL EV  AL+ +
Sbjct: 125 VFKEIVVDLVSIEVQFDDEDLGLLLLCSLPSSYANFRDTILLSRD-ELTLAEVYEALQNR 183

Query: 184 ELTKFKDLKVDEGSEGLNVA---RGRNEHRGKGKGKSRSKSRSKGFDKSKYK--CFLCHK 238
           E  K K +   + S     A   RGR+E R       R KS+S+G  KS+ K  C  C K
Sbjct: 184 E--KMKGMVQSDASSSKGEALQVRGRSEQRTYNDSSDRDKSQSRGRSKSRGKKFCKYCKK 241

Query: 239 QGHFKKDC------PDKGGDGSPSVQVAEASNEEGYESTGALVVTSW--KSEKSWVLDSG 290
           + HF ++C        +  DG  SV     ++ E  +S   LVV +    S   W+LD+ 
Sbjct: 242 KNHFIEECWKLQNKEKRKSDGKASV----VTSAENSDSGDCLVVFAGCVASHDEWILDTA 297

Query: 291 CSYHMCPRKEYFETL-TLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDVRFVPE 349
           CS+H+C  +++F +  +++ G VVR+G++   ++ G+G+V++K  DG    L+DVR +P 
Sbjct: 298 CSFHICINRDWFSSYKSVQNGDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPG 357

Query: 350 LKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNG-LYILDGSIVIGNASVAS 408
           + RNLISLS  D  GY      GV K+S G+L+ + G   +  LY+L GS + G+ + A+
Sbjct: 358 MARNLISLSTLDAEGYKYSSSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAA 417

Query: 409 VVPHN--NSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVKFGSG 466
           V       + LWH+RLGH+SE G+ EL K+ LL      K++FCEHC+ GK  RVKF + 
Sbjct: 418 VSKDEPIKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGKMKFCEHCVFGKHKRVKFNTS 477

Query: 467 MHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF*KFKE*H 526
           +H +  + +YVH+DL GPS+    GG  Y L+IIDDYSR+VW + LK K DTF  FKE  
Sbjct: 478 VHRTKGILDYVHTDLWGPSRKAYLGGARYMLTIIDDYSRKVWPYFLKHKDDTFAAFKEWK 537

Query: 527 TLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
             IE Q   ++K LRTDNG EF S+ F+D+ + E
Sbjct: 538 VRIERQTEKEVKVLRTDNGGEFCSDAFDDYCRKE 571


>gb|AAP53029.1| putative retrotransposon-related protein [Oryza sativa (japonica
           cultivar-group)] gi|37532880|ref|NP_920742.1| putative
           retrotransposon-related protein [Oryza sativa (japonica
           cultivar-group)] gi|22655747|gb|AAN04164.1| Putative
           retrotransposon protein [Oryza sativa (japonica
           cultivar-group)] gi|16905223|gb|AAL31093.1| putative
           retrotransposon-related protein [Oryza sativa]
          Length = 1229

 Score =  378 bits (971), Expect = e-103
 Identities = 227/574 (39%), Positives = 334/574 (57%), Gaps = 25/574 (4%)

Query: 5   KWDIEKFTGSNDFGLWKVKMQAVLTQQKCV-EALKGEAAMPATLTQEEKREMIDKAKSAI 63
           K+D+        F LW+VKM+AVL Q   + EAL+       T    E++    KA S I
Sbjct: 2   KYDLPLQDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 61

Query: 64  VLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQLT 123
            L L + +L+ V +E TAA +  KLES+ M+K L  +  +K +L+S K+ ES S+   ++
Sbjct: 62  QLHLSNDILQKVLQEKTAAELWFKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 121

Query: 124 EFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALRTK 183
            F +I+ DL ++EV  +DED  LLLLCSLP  + +F+DTIL  ++   TL EV  AL+ +
Sbjct: 122 VFKEIIADLVSMEVQFDDEDLGLLLLCSLPSLYANFRDTILLSRD-ELTLAEVYEALQNR 180

Query: 184 ELTKFKDLKVDEGSEGLNVA---RGRNEHRGKGKGKSRSKSRSKGFDKSKYK--CFLCHK 238
           E  K K +   + S     A   RGR+E R       R KS+S+G  KS+ K  C  C K
Sbjct: 181 E--KMKGMVQSDASSSKGKALQVRGRSEQRTYNDSNDRDKSQSRGRSKSRGKKFCKYCKK 238

Query: 239 QGHFKKDC------PDKGGDGSPSVQVAEASNEEGYESTGALVVTSW--KSEKSWVLDSG 290
           + HF ++C        +  DG  SV     ++ E  +S   LV  +    S   W+LD+ 
Sbjct: 239 KNHFIEECWKLQNKEKRKSDGKASV----VTSAENSDSADCLVFFAGCVASHDEWILDTA 294

Query: 291 CSYHMCPRKEYFET-LTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDVRFVPE 349
           C + +C  +++F +  +++ G VVR+G+N   ++ G+G+V++K  DG    L+DVR +P 
Sbjct: 295 CLFLICINRDWFSSHKSVQNGDVVRMGDNNPREIMGIGSVQIKTHDGMTRTLKDVRHIPG 354

Query: 350 LKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNG-LYILDGSIVIGNASVAS 408
           + RNLISLS  D  GY      GV K+S G+L+ + G   +  LY+L GS + G+ + A+
Sbjct: 355 MARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSLTAAA 414

Query: 409 VVPHNNSE--LWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVKFGSG 466
           V     S+  LWH+RLGH+SE G+ EL K+ LL       ++FCEHC+ GK  RVKF + 
Sbjct: 415 VSKDEPSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTS 474

Query: 467 MHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF*KFKE*H 526
           +H +  + +YVH+DL GPS+ P+ GG  Y L+IIDDYSR+VW + LK K DTF  FKE  
Sbjct: 475 VHRTKGILDYVHADLWGPSRKPSLGGACYMLTIIDDYSRKVWPYFLKHKDDTFAAFKEWK 534

Query: 527 TLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
            +IE Q   ++K LRTDNG EF S+ F+D+ + E
Sbjct: 535 VMIERQAEKEVKVLRTDNGGEFCSDAFDDYCRKE 568


>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]
           gi|11278366|pir||T47492 copia-like polyprotein -
           Arabidopsis thaliana
          Length = 1363

 Score =  377 bits (968), Expect = e-102
 Identities = 227/597 (38%), Positives = 342/597 (57%), Gaps = 51/597 (8%)

Query: 3   GSKWDIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMP---------ATLTQEEKR 53
           G++ ++EKF G  D+ +WK K+ A +        L+ E+  P         +   ++E+R
Sbjct: 3   GARIEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLR-ESETPMGKERDSEKSDEDEKEER 61

Query: 54  EMID-------KAKSAIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQ 106
           E ++       KA+S IVL + D+VLR + +E +AA+M   L+ LYM+K+L +R  LKQ+
Sbjct: 62  EKMEAFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQK 121

Query: 107 LYSFKMVESISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYG 166
           LYSFKM E++SI   + EF  I+ DL N+ V   DED+A+LLL SLPK F+  KDT+ Y 
Sbjct: 122 LYSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYS 181

Query: 167 KEGTT-TLEEVQAALRTKELTKFKDLK--VDEGSEGLNVA-----RGRNEHRGKGKGKSR 218
              T  +L+EV AA+ ++EL +F  +K  +   +EGL V      RGR+E + KGKGK R
Sbjct: 182 SGKTVLSLDEVAAAIYSREL-EFGSVKKSIKGQAEGLYVKDKAENRGRSEQKDKGKGK-R 239

Query: 219 SKSRSKGFDKSKYKCFLCHKQGHFKKDCPDK----------------GGDGSPSVQVAEA 262
           SKS      KSK  C++C + GH K  CP+K                GG G+        
Sbjct: 240 SKS------KSKRGCWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKGNLVEGSVNF 293

Query: 263 SNEEGYESTGALVVTSWKSEKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACK 322
               G   + AL  T    E  W++D+GC YHM  ++E+ E    + GG VR+GN    +
Sbjct: 294 VESAGMFVSEALSSTDIHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGSVRMGNKSISR 353

Query: 323 VQGMGNVRLKMFDGREFLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALI 382
           V+G+G VR+   +G    L++VR++P++ RNL+SL  F+  G+    E+G+ +I  G  +
Sbjct: 354 VKGVGTVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFEKAGHKFESENGMLRIKSGNQV 413

Query: 383 TVKGSKMNGLYILDGSIVIGNASVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKD 442
            ++G + + LYIL G     + S+A    ++++ LWH RL H+S++ +  L K+G L K 
Sbjct: 414 LLEGRRYDTLYILHGKPAT-DESLAVARANDDTVLWHRRLCHMSQKNMSLLIKKGFLDKK 472

Query: 443 KLDKLEFCEHCILGKQHRVKFGSGMHHSSRLFEYVHSDLLGPSKTP-THGGGSYFLSIID 501
           K+  L+ CE CI G+  ++ F    H + +  EYVHSDL G    P + G   YF+S ID
Sbjct: 473 KVSMLDTCEDCIYGRAKKIGFNLAQHDTKKKLEYVHSDLWGAPTVPMSLGNCQYFISFID 532

Query: 502 DYSRRVWVFVLKKKSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQ 558
           DY+R+VWV+ LK K + F KF    +L+ENQ G ++K LRTDNGLEF +  F+ F +
Sbjct: 533 DYTRKVWVYFLKTKDEAFEKFVSWISLVENQSGERVKTLRTDNGLEFCNRMFDGFCE 589


>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
           sativa (japonica cultivar-group)]
          Length = 1373

 Score =  376 bits (966), Expect = e-102
 Identities = 232/580 (40%), Positives = 336/580 (57%), Gaps = 35/580 (6%)

Query: 5   KWDIEKFTGSNDFGLWKVKMQAVLTQQKCV-EALKGEAAMPATLTQEEKREMIDKAKSAI 63
           K+D+        F LW+VKM+ +L Q     EAL       A  T EE R+   KA + I
Sbjct: 2   KFDLPLLNYDTRFSLWQVKMRGILAQTHDYDEALDNFGKRRAEWTAEEIRKD-QKALALI 60

Query: 64  VLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQLT 123
            L L + +L++   E T+A +  KLES+ M+K L  +  +K +L++ KM E  S+   + 
Sbjct: 61  QLHLHNDILQECLTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMA 120

Query: 124 EFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALRTK 183
           EF KI+ DL ++EV  +DED  LLLLCSLP S+ +F+DTIL  ++   TL+EV  AL+ K
Sbjct: 121 EFKKIVADLVSMEVKYDDEDLGLLLLCSLPNSYANFRDTILLSRD-ELTLKEVYDALQNK 179

Query: 184 ELTKFKDLKVDEGS-----EGLNVARGRNEHRGKGKG----KSRSKSRSKGFDKSKYKCF 234
           E  K K +  ++GS     E L+V RGR E+R   +     + RSKS+  G   +K  C 
Sbjct: 180 E--KMKIMVQNDGSSSSKGEALHV-RGRTENRTSNEKNYDRRGRSKSKPPG---NKKFCV 233

Query: 235 LCHKQGHFKKDCPDKGG-------DGSPSVQVAEASNEEGYESTGALVVTSW--KSEKSW 285
            C  + H   +C            DG  SV  A AS+++   S   LVV +        W
Sbjct: 234 YCKLKNHNIDECKKVQAKERKNKKDGKVSVASAAASDDD---SGDCLVVFAGCVAGHDEW 290

Query: 286 VLDSGCSYHMCPRKEYFETLT-LKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDV 344
           +LDS CS+H+C ++ +F +   +++G VVR+G++  C + G+G+V++K  DG    L++V
Sbjct: 291 ILDSACSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNV 350

Query: 345 RFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNG-LYILDGSIVIGN 403
           R++P + RNLISLS  D  GY      GV K+S G+L+ +KG   +  LY+L G  + G+
Sbjct: 351 RYIPGMSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGS 410

Query: 404 ASVASVVPHNN---SELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHR 460
            S A+ + ++    + LWH+RLGH+S  G+ EL K+ LL      K++FCEHCI GK  R
Sbjct: 411 DSAAAAITNDEPSKTNLWHMRLGHMSHLGMTELMKRNLLKGCTSSKIKFCEHCIFGKHKR 470

Query: 461 VKFGSGMHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF* 520
           V+F + +H +    +YVH+DL GPSK P+ GG  Y L+IIDDYSR+VW + LK K DTF 
Sbjct: 471 VQFNTSVHTTKGTLDYVHADLWGPSKKPSLGGARYMLTIIDDYSRKVWPYFLKHKDDTFT 530

Query: 521 KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
            FK    +IE Q   K+K LRTDNG EF S  FND+ + E
Sbjct: 531 AFKNWKVMIERQTERKVKLLRTDNGGEFCSHAFNDYCRQE 570


>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|53370655|gb|AAU89150.1| integrase core domain
           containing protein [Oryza sativa (japonica
           cultivar-group)] gi|40538906|gb|AAR87163.1| putative
           polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1322

 Score =  374 bits (960), Expect = e-101
 Identities = 223/572 (38%), Positives = 333/572 (57%), Gaps = 21/572 (3%)

Query: 5   KWDIEKFTGSNDFGLWKVKMQAVLTQQKCV-EALKGEAAMPATLTQEEKREMIDKAKSAI 63
           K+D+        F LW+VKM+AVL Q   + EAL+       T    E++    KA S I
Sbjct: 5   KYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 64

Query: 64  VLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQLT 123
            L L + +L++V ++ TAA +  KLES+ M+K L  +  +K +L+S K+ ES S+   ++
Sbjct: 65  QLHLSNDILQEVLQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHIS 124

Query: 124 EFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALRTK 183
            F +I+ DL ++EV  +DED  LLLLCSLP S+ +F+ TIL  ++   TL EV  AL+ +
Sbjct: 125 VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSRD-ELTLAEVYEALQNR 183

Query: 184 ELTKFKDLKVDEGSEGLNV-ARGRNEHRGKGKGKSRSKSRSKGFDKSKYK--CFLCHKQG 240
           E  K         S+G  +  RGR+E R         KS+S+G  KS+ K  C  C K+ 
Sbjct: 184 EKMKGMVQSYASSSKGEALQVRGRSEQRTYNDSNDHDKSQSRGRSKSRGKKFCKYCKKKN 243

Query: 241 HFKKDC------PDKGGDGSPSVQVAEASNEEGYESTGALVVTSW--KSEKSWVLDSGCS 292
           HF ++C        +  DG  SV     ++ E  +S   LVV +    S   W+LD+ CS
Sbjct: 244 HFIEECWKLQNKEKRKSDGKASV----VTSAENSDSGDCLVVFAGYVASHDEWILDTACS 299

Query: 293 YHMCPRKEYFETL-TLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDVRFVPELK 351
           +H+C  +++F +  +++   VVR+G++   ++ G+G+V++K  DG    L+DVR +P + 
Sbjct: 300 FHICINRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMA 359

Query: 352 RNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNG-LYILDGSIVIGNASVASVV 410
           RNLISLS  D  GY      GV K+S G+L+ + G   +  LY+L GS + G+ + A+V 
Sbjct: 360 RNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAAVT 419

Query: 411 PHNNSE--LWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVKFGSGMH 468
               S+  LWH+RLGH+SE G+ EL K+ LL       ++FCEHC+ GK  RVKF + +H
Sbjct: 420 KDEPSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTSVH 479

Query: 469 HSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF*KFKE*HTL 528
            +  + +YVH+DL GPS+ P+ GG  Y L+IIDDYSR+ W + LK K DTF  FKE   +
Sbjct: 480 RTKGILDYVHADLWGPSRKPSLGGARYMLTIIDDYSRKEWPYFLKHKDDTFAAFKERKVM 539

Query: 529 IENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
           IE Q   ++K L TDNG EF S+ F+D+ + E
Sbjct: 540 IERQTEKEVKVLCTDNGGEFCSDAFDDYCRKE 571


>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301702|pir||E84601 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1333

 Score =  372 bits (954), Expect = e-101
 Identities = 229/573 (39%), Positives = 325/573 (55%), Gaps = 44/573 (7%)

Query: 7   DIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAM-----PATLTQEEKREMI----- 56
           ++EKF G  D+ +WK K+ A L       ALK E  +        LT+EE++E +     
Sbjct: 7   EVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLRREL 66

Query: 57  -----DKAKSAIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFK 111
                 KA+SAIVL + D+VLR + +E +AA+M   L+ LYM+K+L +R   KQ+LYSFK
Sbjct: 67  LEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLYSFK 126

Query: 112 MVESISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYG-KEGT 170
           M E++SI   + EF +I+ DL N  V   DED+A+LLL SLPK F+  +DT+ YG    T
Sbjct: 127 MSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLGRVT 186

Query: 171 TTLEEVQAALRTKELTKFKDLKVDEG-SEGLNV-----ARGRNEHRGKGKGKSRSKSRSK 224
            +L+EV AA+ +KEL    + K  +G +EGL V      RGR E RG      +S+S+S 
Sbjct: 187 LSLDEVVAAIYSKELELGSNKKSIKGQAEGLFVKEKTETRGRTEQRGNNNNNKKSRSKS- 245

Query: 225 GFDKSKYKCFLCHKQGHFKKDCPDKGGDGSPSVQVAEASNEEGYESTGALVVTSWKSEKS 284
              +SK  C++C                G  S   +  S   G   + AL  T    E  
Sbjct: 246 ---RSKKGCWIC----------------GESSNGSSNYSEANGLYVSEALSSTDIHLEDE 286

Query: 285 WVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDV 344
           WV+D+GCSYHM  ++E+FE L    GG VR+GN    KV+G+G +R+K   G    L +V
Sbjct: 287 WVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTVSKVRGIGTIRVKNEAGMVVRLTNV 346

Query: 345 RFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNGLYILDGSIVIGNA 404
           R++PE+ RNL+SL  F+  GY  ++E+G   I  G  + +   +   LY+L    V    
Sbjct: 347 RYIPEMDRNLLSLGTFEKSGYSFKLENGTLSIIAGDSVLLTVRRCYTLYLLQWRPVT-EE 405

Query: 405 SVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVKFG 464
           S++ V   +++ LWH RLGH+S++ +  L K+GLL K K+ KLE CE CI GK  R+ F 
Sbjct: 406 SLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKKKVSKLETCEDCIYGKAKRIGFN 465

Query: 465 SGMHHSSRLFEYVHSDLLGPSKTP-THGGGSYFLSIIDDYSRRVWVFVLKKKSDTF*KFK 523
              H +    EYVHSDL G    P + G   YF+S IDDY+R+V ++ LK K + F KF 
Sbjct: 466 LAQHDTREKLEYVHSDLWGAPSVPFSLGKCQYFISFIDDYTRKVRIYFLKTKDEAFDKFV 525

Query: 524 E*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDF 556
           E   L+ENQ   ++K LRTDNGLEF +  F++F
Sbjct: 526 EWANLVENQTDKRIKTLRTDNGLEFCNRSFDEF 558


>emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana]
           gi|4539406|emb|CAB40039.1| putative retrotransposon
           [Arabidopsis thaliana] gi|7444416|pir||T04181
           hypothetical protein F7L13.40 - Arabidopsis thaliana
          Length = 1230

 Score =  369 bits (947), Expect = e-100
 Identities = 225/583 (38%), Positives = 321/583 (54%), Gaps = 58/583 (9%)

Query: 7   DIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEK-------------R 53
           ++EKF G  D+ LWK K+ A +       AL+   ++   L  EE+              
Sbjct: 7   EMEKFDGHGDYTLWKEKLMAHMDLLGLTVALRETQSVSDPLESEEEGKESEKGDKEALME 66

Query: 54  EMIDKAKSAIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMV 113
           E   KA+S IVL + D+VLR   +E TA SM   L+ LYM+K+L +R  LKQ+LYS+KM 
Sbjct: 67  EKRQKARSTIVLSVSDQVLRKSKKEKTAPSMLEALDKLYMSKALPNRIYLKQKLYSYKMQ 126

Query: 114 ESISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTT-T 172
           E++S+   + EF +++ DL N  V   DED+A+LLL SLPK F+  KDT+ YG   TT +
Sbjct: 127 ENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSLPKQFDQLKDTLKYGSGRTTLS 186

Query: 173 LEEVQAALRTKELTKFKDLKVDEG-SEGLNV-----ARGRNEHRGKGKGKSRSKSRSKGF 226
           ++EV AA+ +KEL    + K   G +EGL V      RG +E + KG  K RS+SRSKG+
Sbjct: 187 VDEVVAAIYSKELELGSNKKSIRGQAEGLYVKDKPETRGMSEQKEKG-NKGRSRSRSKGW 245

Query: 227 DKSKYKCFLCHKQGHFKKDCPDKGGDGSPSVQVAEASNEE------------GYESTGAL 274
                 C++C ++GHFK  CP+KG   +     A  S  E            GY  + AL
Sbjct: 246 K----GCWICGEEGHFKTSCPNKGKQQNKGKDQASGSKGEAATIKGNTSEGSGYYVSEAL 301

Query: 275 VVTSWKSEKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMF 334
             T       WV+D+GC+YHM  +KE+FE L+   GG VR+GN    K +          
Sbjct: 302 HSTDVNLGNEWVMDTGCNYHMTHKKEWFEELSEDAGGTVRMGNKSTSKFR---------- 351

Query: 335 DGREFLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNGLYI 394
                    V+++P++ RNL+S+   +  GY    ++GV  +  G    + GS+   LY+
Sbjct: 352 ---------VKYIPDMDRNLLSMGTLEEHGYSFESKNGVLVVKEGTRTLLIGSRHEKLYL 402

Query: 395 LDGSIVIGNASVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCI 454
           L G   + + S+     ++++ LWH RLGH+S++ +  L K+G L   K+ KLE CE CI
Sbjct: 403 LQGKPEVSH-SMTVERRNDDTVLWHRRLGHISQKNMDILVKKGYLDGKKVSKLELCEDCI 461

Query: 455 LGKQHRVKFGSGMHHSSRLFEYVHSDLLG-PSKTPTHGGGSYFLSIIDDYSRRVWVFVLK 513
            GK  R+ F    H++     YVHSDL G PS   + G   YF+S ID YSR+ WV+ LK
Sbjct: 462 YGKARRLSFVVATHNTEDKLNYVHSDLWGAPSVPLSLGKCQYFISFIDVYSRKTWVYFLK 521

Query: 514 KKSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDF 556
            K + F  F E   ++ENQ G K+K LR DNGLEF ++QFNDF
Sbjct: 522 HKDEAFGTFAEWSVMVENQTGRKIKILRIDNGLEFCNQQFNDF 564


>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1243

 Score =  358 bits (920), Expect = 5e-97
 Identities = 222/570 (38%), Positives = 327/570 (56%), Gaps = 38/570 (6%)

Query: 5   KWDIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEKREMIDKAKSAIV 64
           K+D+        F LW+VKM+AVL QQ   +AL G        + +EK+    KA S I 
Sbjct: 5   KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEKKRD-RKAISYIH 63

Query: 65  LCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQLTE 124
           L L + +L++V +E TAA +  KLE + MTK L  +  LKQ+L+  K+ +  S+ + L+ 
Sbjct: 64  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSA 123

Query: 125 FNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALRTKE 184
           F +I+ DL ++EV  +++D  L+LLCSLP S+ +F+ TILY ++ T TL+EV  A   KE
Sbjct: 124 FKEIVADLESMEVKYDEDDLGLILLCSLPSSYANFRGTILYSRD-TLTLKEVYDAFHAKE 182

Query: 185 LTKFKDLKVDEGS----EGLNVARGRNEHRG-KGKGKSRSKSRSKGFDKSK--YK-CFLC 236
             K K +   EGS    EGL V RGR + +  K + + +S S  +G  KS+  YK C  C
Sbjct: 183 --KMKKMVTSEGSNSQAEGL-VVRGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYKSCKYC 239

Query: 237 HKQGHFKKDC---PDKGGDGSPSVQVAEASNEEGYESTGALVVTSWKSEKSWVLDSGCSY 293
            + GH   +C    DK  D      + +   EE  E   A+V       +  V  +GC+ 
Sbjct: 240 KRDGHDISECWKLQDK--DKRTGKYIPKGKKEE--EGKAAVVTDEKSDAELLVAYAGCA- 294

Query: 294 HMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDVRFVPELKRN 353
                +++F T    +GG V +G++  C+V G+G V++KMFDG    L DV+ +P LKR+
Sbjct: 295 -QTSDQDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNLKRS 353

Query: 354 LISLSMFDGLGYCTRIEHGVCKISHGALITVKGS-KMNGLYILDGSIVIGNASVA--SVV 410
           LISL             +G+ K++ G+L+ +K   K   LY L G+ ++GN +    S+ 
Sbjct: 354 LISL-------------YGILKVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAVFDSLS 400

Query: 411 PHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVKFGSGMHHS 470
             + + LWH+RLGH+SE GL EL+K+GLL    + KL+FCEHCI GK  RVKF +  H +
Sbjct: 401 NSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNTSTHTT 460

Query: 471 SRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF*KFKE*HTLIE 530
             + +YVHSDL GP+   + GG  Y ++I+DDYSR+VW + LK K   F  FKE  T++E
Sbjct: 461 EGILDYVHSDLWGPAHKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEWKTMVE 520

Query: 531 NQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
            Q   K+K LRTDNG+EF S+ F  + + E
Sbjct: 521 RQTERKVKILRTDNGMEFCSKIFKSYCKSE 550


>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
          Length = 1342

 Score =  353 bits (907), Expect = 2e-95
 Identities = 219/586 (37%), Positives = 329/586 (55%), Gaps = 53/586 (9%)

Query: 7   DIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAM----------------PATLTQ- 49
           ++EKF G  D+ LWK K+ A +     +E L  E                   P T T  
Sbjct: 7   EVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETATSK 66

Query: 50  -EEK--REMIDKAKSAIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQ 106
            E+K  +E   KA+S I+L LG+ VLR V ++ TAA M   L+ L+M KSL +R  LKQ+
Sbjct: 67  LEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYLKQR 126

Query: 107 LYSFKMVESISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYG 166
           LY +KM E++++ E + +F K++ DL N++V   DED+A++LL SLP+ F+  K+T+ Y 
Sbjct: 127 LYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLKYC 186

Query: 167 KEGTTTLEEVQAALRTKELTKFKDLK-VDEGSEGLNVA-RGRNEHRGKGKGKSRSKSRSK 224
           K  T  LEE+ +A+R+K L      K +   S+GL V  RGR+E RGKG  K++S+S+SK
Sbjct: 187 KT-TLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDRGRSETRGKGPNKNKSRSKSK 245

Query: 225 GFDKSKYKCFLCHKQGHFKKDC---PDKGGDGSPSVQVAEASNEEGYESTGALVVT---- 277
           G  K+   C++C K+GHFKK C    ++   GS S +   ++         ALVV+    
Sbjct: 246 GAGKT---CWICGKEGHFKKQCYVWKERNKQGSTSERGEASTVTARVTDAAALVVSRALL 302

Query: 278 --SWKSEKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFD 335
             +  +  +W+LD+GCS+HM  RK++         G VR+GN+   +V+G+G+VR+K  D
Sbjct: 303 GFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEVKGIGDVRIKNED 362

Query: 336 GREFLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNGLYIL 395
           G   LL DVR++PE+ +NLISL   +  G     + G+  I    L  + G K + LY L
Sbjct: 363 GSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKGILTIFKNDLTVLTGKKESTLYFL 422

Query: 396 DGSIVIGNASVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCIL 455
            G+ + G A+V      + + LWH RLGH+  +GL  L  +G L K+ +           
Sbjct: 423 QGTTLAGEANVID-KEKDETSLWHSRLGHIGAKGLQVLVSKGHLDKNIM----------- 470

Query: 456 GKQHRVKFGSGMHHSSRLFEYVHSDLLGPSKTP-THGGGSYFLSIIDDYSRRVWVFVLKK 514
                + FG+  H +    +YVHSDL G +  P + G   YF++ IDD++RR W++ ++ 
Sbjct: 471 -----ISFGAAKHVTKDKLDYVHSDLWGSTNVPFSIGKCQYFITFIDDFTRRTWIYFIRT 525

Query: 515 KSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
           K + F KF E  T IENQ   KLK L TDNGLEF +++F+ F + E
Sbjct: 526 KDEAFSKFVEWKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKE 571


>gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
           gi|25301707|pir||E86490 hypothetical protein F28L22.3 -
           Arabidopsis thaliana
          Length = 1356

 Score =  352 bits (903), Expect = 5e-95
 Identities = 216/592 (36%), Positives = 326/592 (54%), Gaps = 46/592 (7%)

Query: 7   DIEKFTGSNDFGLWKVKMQAVL------------TQQKCVEALKGEAAMPA--------- 45
           +I+ F G  DF LWK+++QA L            +  K V   K EA   +         
Sbjct: 9   EIKVFNGDRDFSLWKIRIQAQLGVLGLKDTLTDFSLTKTVPLTKSEAKQESGDGESSGTK 68

Query: 46  TLTQEEKREMIDKAKSAIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQ 105
            +    K E  ++AK+ I+  + D VL  V   AT A + A L   YM  SL +R   + 
Sbjct: 69  EVPDPVKIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIYTQL 128

Query: 106 QLYSFKMVESISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILY 165
           +LYSFKMV +++I + + EF +I+ +L ++E+  ++E +A+L+L SLP S    K T+ Y
Sbjct: 129 KLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTLKY 188

Query: 166 GKEGTTTLEEVQAALRT--KELTKFKDLKVDEGSEGLNVARGR-----NEHRGKGKGKSR 218
           G + T T+++V ++ ++  +EL +  DL   + +      RGR     N+  G+GKG+SR
Sbjct: 189 GNK-TLTVQDVTSSAKSLERELAEAVDLDKGQAAVLYTTERGRPLVRNNQKGGQGKGRSR 247

Query: 219 SKSRSKGFDKSKYKCFLCHKQGHFKKDCPDKGGDGSPSVQVAEASNEEGYESTGALVVTS 278
           S S      K+K  C+ C K+GH KKDC  +        Q       E    + AL V  
Sbjct: 248 SNS------KTKVPCWYCKKEGHVKKDCYSRKKKMESEGQGEAGVITEKLVFSEALSVNE 301

Query: 279 WKSEKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGRE 338
              +  W+LDSGC+ HM  R+++F +   K    + LG++ + + QG G +R+    G  
Sbjct: 302 QMVKDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHSVESQGQGTIRIDTHGGTI 361

Query: 339 FLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNGLYILDGS 398
            +L +V++VP L+RNLIS    D LGY      G  +        ++GS  NGLY+LDGS
Sbjct: 362 KILENVKYVPHLRRNLISTGTLDKLGYRHEGGEGKVRYFKNNKTALRGSLSNGLYVLDGS 421

Query: 399 IVIG---NASVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCIL 455
            V+    NA    V     + LWH RLGH+S   L  LA +GL+ + ++++LEFCEHC++
Sbjct: 422 TVMSELCNAETDKV----KTALWHSRLGHMSMNNLKVLAGKGLIDRKEINELEFCEHCVM 477

Query: 456 GKQHRVKFGSGMHHSSRLFEYVHSDLLG-PSKTPTHGGGSYFLSIIDDYSRRVWVFVLKK 514
           GK  +V F  G H S     YVH+DL G P+ TP+  G  YFLSIIDD +R+VW++ LK 
Sbjct: 478 GKSKKVSFNVGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLSIIDDKTRKVWLYFLKS 537

Query: 515 KSDTF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQ---VERNQ 563
           K +TF KF E  +L+ENQ+  K+K LRTDNGLEF + +F+ + +   +ER++
Sbjct: 538 KDETFDKFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERHR 589


>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1241

 Score =  351 bits (901), Expect = 8e-95
 Identities = 201/495 (40%), Positives = 288/495 (57%), Gaps = 35/495 (7%)

Query: 93  MTKSLAHRQLLKQQLYSFKMVESISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSL 152
           MTK L  +  LKQ+L+  K+ +  S+ + L+ F +I+ DL ++EV  ++ED  L+LLCSL
Sbjct: 1   MTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSL 60

Query: 153 PKSFEHFKDTILYGKEGTTTLEEVQAALRTKELTKFKDLKVDEGS----EGLNVARGRNE 208
           P S+ +F+DTILY ++ T TL+EV  AL  KE  K K +   EGS    EGL V RGR +
Sbjct: 61  PSSYANFRDTILYSRD-TLTLKEVYDALHAKE--KMKKMVPSEGSNSQAEGL-VVRGRQQ 116

Query: 209 HRG---KGKGKSRSKSRSKGFDKSKYK-CFLCHKQGHFKKDC----------------PD 248
            +    K + KS S  R +   + +YK C  C + GH   +C                  
Sbjct: 117 EKNTNNKSRDKSSSIYRGRSKSRGRYKSCKYCKRDGHDISECWKLQDKDKRTRKYIPKGK 176

Query: 249 KGGDGSPSVQVAEASNEEGYESTGALVVTSWKSEKSWVLDSGCSYHMCPRKEYFETLTLK 308
           K  +G  +V   E S+ E   +      TS      W+LD+ C+YHMCP +++F T    
Sbjct: 177 KEEEGKAAVVTDEKSDAELLVAYAGCAQTS----DQWILDTACTYHMCPNRDWFATYEAV 232

Query: 309 EGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDVRFVPELKRNLISLSMFDGLGYCTR 368
           +GG V +G++  C+V G+G V++KMFDG    L DVR +P LKR+LISL   D  GY   
Sbjct: 233 QGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYKYS 292

Query: 369 IEHGVCKISHGALITVKGS-KMNGLYILDGSIVIGNASVASVVPHNN--SELWHLRLGHV 425
              G+ K++ G+L+ +K   K   LY L G+ ++GN +  S    N+  + LWH+RLGH+
Sbjct: 293 GGDGILKVTKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHM 352

Query: 426 SERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVKFGSGMHHSSRLFEYVHSDLLGPS 485
           SE GL EL+K+GLL    + KL+FCEHCI GK  RVKF +  H +  + +YVHSDL GP+
Sbjct: 353 SEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPA 412

Query: 486 KTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF*KFKE*HTLIENQMGTKLKGLRTDNG 545
           +  + GG  Y ++I+DDYSR+VW + LK K   F  FKE  T++E Q   K+K LRTDNG
Sbjct: 413 RKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKILRTDNG 472

Query: 546 LEFVSEQFNDFLQVE 560
           +E  S+ F  + + E
Sbjct: 473 MELCSKIFKSYCKSE 487


>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301697|pir||B84512 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1335

 Score =  350 bits (897), Expect = 2e-94
 Identities = 200/523 (38%), Positives = 296/523 (56%), Gaps = 22/523 (4%)

Query: 54  EMIDKAKSAIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMV 113
           E  DKAK+ I L + DKVLR +    TAA     L+ L+M +SL HR   +   Y+FKM 
Sbjct: 44  ERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETLDRLFMIRSLPHRVYTQLSFYTFKMQ 103

Query: 114 ESISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTT- 172
           E+  I E + +F KI+ DL +++++  DE +A+LLL SLP  ++   +T+ Y        
Sbjct: 104 ENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILLLSSLPARYDGLVETMKYSNSREKLR 163

Query: 173 LEEVQAALRTKELTKFKDLK-VDEGSEGLNVARGRNEHRG-KGKGKSRSKSRSKGFDKSK 230
           L++V  A R KE    ++ + V EG        G+N ++G KGK +SRSKS        K
Sbjct: 164 LDDVMVAARDKERELSQNNRPVVEGHFARGRPDGKNNNQGNKGKNRSRSKSAD-----GK 218

Query: 231 YKCFLCHKQGHFKKDC------PDKGGDGSPSVQVAEASNEEGYE------STGALVVTS 278
             C++C K+GHFKK C            GS + + + A + E +       +T   +V +
Sbjct: 219 RVCWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAKSTEAFNPAMVLLATDETLVVT 278

Query: 279 WKSEKSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGRE 338
                 WVLD+GCS+HM PRK++F+       G V++GN+    V+G+G+++++  DG +
Sbjct: 279 DSIANEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYSPVKGIGSIKIRNSDGSQ 338

Query: 339 FLLRDVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNGLYILDGS 398
            +L DVR++P + RNLISL   +  G   + + G+ KI  G    +KG K + LYILDG 
Sbjct: 339 VILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDGILKIVKGCSTILKGQKRDTLYILDGV 398

Query: 399 IVIGNASVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQ 458
              G  S +S    + + LWH RLGH+S++G+  L K+G L ++ + +LEFCE C+ GKQ
Sbjct: 399 TEEGE-SHSSAEVKDETALWHSRLGHMSQKGMEILVKKGCLRREVIKELEFCEDCVYGKQ 457

Query: 459 HRVKFGSGMHHSSRLFEYVHSDLLGPSKTPTH-GGGSYFLSIIDDYSRRVWVFVLKKKSD 517
           HRV F    H +     YVHSDL G    P   G   YF+S +DDYSR+VW++ L+KK +
Sbjct: 458 HRVSFAPAQHVTKEKLAYVHSDLWGSPHNPASLGNSQYFISFVDDYSRKVWIYFLRKKDE 517

Query: 518 TF*KFKE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
            F KF E   ++ENQ   K+K LRTDNGLE+ +  F  F + E
Sbjct: 518 AFEKFVEWKKMVENQSDRKVKKLRTDNGLEYCNHYFEKFCKEE 560


>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
           gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
           polyprotein from transposon TNT 1-94 [Contains: Protease
           ; Reverse transcriptase ; Endonuclease]
          Length = 1328

 Score =  349 bits (896), Expect = 3e-94
 Identities = 199/572 (34%), Positives = 321/572 (55%), Gaps = 29/572 (5%)

Query: 3   GSKWDIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEKREMIDKAKSA 62
           G K+++ KF G N F  W+ +M+ +L QQ   + L  ++  P T+  E+  ++ ++A SA
Sbjct: 3   GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASA 62

Query: 63  IVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQL 122
           I L L D V+ ++  E TA  +  +LESLYM+K+L ++  LK+QLY+  M E  +    L
Sbjct: 63  IRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHL 122

Query: 123 TEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALRT 182
             FN ++  LAN+ V  E+EDKA+LLL SLP S+++   TIL+GK  T  L++V +AL  
Sbjct: 123 NVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKT-TIELKDVTSALLL 181

Query: 183 KELTKFKDLKVDEGSEGLNVARGRNEHRGKGK-GKSRSKSRSKGFDKSKYK-CFLCHKQG 240
            E  K +    ++G   +   RGR+  R     G+S ++ +SK   KS+ + C+ C++ G
Sbjct: 182 NE--KMRKKPENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPG 239

Query: 241 HFKKDCPDKGGDGSPSVQVAEASNEEGYESTGALVVTSWK----------------SEKS 284
           HFK+DCP+      P     E S ++  ++T A+V  +                   E  
Sbjct: 240 HFKRDCPN------PRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESE 293

Query: 285 WVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDV 344
           WV+D+  S+H  P ++ F      + G V++GN    K+ G+G++ +K   G   +L+DV
Sbjct: 294 WVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDV 353

Query: 345 RFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNGLYILDGSIVIGNA 404
           R VP+L+ NLIS    D  GY +   +   +++ G+L+  KG     LY  +  I  G  
Sbjct: 354 RHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGEL 413

Query: 405 SVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVKFG 464
           + A      + +LWH R+GH+SE+GL  LAK+ L+   K   ++ C++C+ GKQHRV F 
Sbjct: 414 NAAQ--DEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQ 471

Query: 465 SGMHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF*KFKE 524
           +       + + V+SD+ GP +  + GG  YF++ IDD SR++WV++LK K   F  F++
Sbjct: 472 TSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQK 531

Query: 525 *HTLIENQMGTKLKGLRTDNGLEFVSEQFNDF 556
            H L+E + G KLK LR+DNG E+ S +F ++
Sbjct: 532 FHALVERETGRKLKRLRSDNGGEYTSREFEEY 563


>gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25412027|pir||G84599 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 838

 Score =  345 bits (886), Expect = 4e-93
 Identities = 199/525 (37%), Positives = 310/525 (58%), Gaps = 30/525 (5%)

Query: 7   DIEKFTGSNDFGLWKVKMQAV---------LTQQKCVEALKGEAAMPATLTQEEKR---E 54
           ++EK  G  D+ LWK K+ A          L + + +E  +  A   + LT+ E +   E
Sbjct: 7   EVEKLDGEGDYVLWKEKLLAHIELLGLLEGLEEDEAIEEEESTAETDSLLTKTEDKVLKE 66

Query: 55  MIDKAKSAIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVE 114
              KA+S ++L LG+ VLR V +E TAA M   L+ L+M KSL +R  LKQ+LY +KM +
Sbjct: 67  KRGKARSTVILSLGNHVLRKVIKEKTAAGMIRVLDKLFMAKSLPNRIYLKQRLYGYKMSD 126

Query: 115 SISISEQLTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLE 174
           S++I E + +F K++ DL N++V+  DED+A++LL SLPK F+  KDT+ YGK  T  L+
Sbjct: 127 SMTIEENVNDFFKLISDLENVKVSVPDEDQAIVLLMSLPKQFDQLKDTLKYGKT-TLALD 185

Query: 175 EVQAALRTKELTKFKDLK-VDEGSEGLNVA-RGRNEHRGKGKGKSRSKSRSKGFDKSKYK 232
           E+  A+R+K L      K +   S+ L V  RGR+E R K   +++S+SRSK   + K  
Sbjct: 186 EITGAIRSKVLELGASGKMLKNSSDALFVQDRGRSEKRDKSSERNKSQSRSK--SREKKV 243

Query: 233 CFLCHKQGHFKKDC---PDKGGDGSPSVQVAEASNEEGYESTGALVVTSWKS-------E 282
           C++C K+GHFKK C    +K   G+ S +  E+SN  G  +  A +    +S       +
Sbjct: 244 CWVCGKEGHFKKQCYVWKEKNKKGNNS-EKGESSNVIGQAADAAALAVREESNADNQEVD 302

Query: 283 KSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLR 342
             W++D+GCS+HM PR+++F      + G V++ N    +++G+G++R++  D    LL+
Sbjct: 303 NEWIMDTGCSFHMTPRRDWFVEFDESQTGRVKMANQTYSEIKGIGSIRIQNDDNTTVLLK 362

Query: 343 DVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNGLYILDGSIVIG 402
           +VR+VP + +NLIS+   +  G   + + G  K+  G +  +KG K+  LY+L G +V G
Sbjct: 363 NVRYVPSMSKNLISMGTLEDQGCWFQSKAGTLKVVKGCMTLLKGKKVGTLYLLQGVVVTG 422

Query: 403 NASVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVK 462
           NA+ A     + S++WH RL H+S+R +  L K+G L  +K++ LEFCE C+ GK HRV 
Sbjct: 423 NAN-AVTSSKDESKIWHSRLCHMSQRNIDVLIKKGCLQAEKINGLEFCEDCVYGKTHRVG 481

Query: 463 FGSGMHHSSRLFEYVHSDLLG-PSKTPTHGGGSYFLSIIDDYSRR 506
           FGS  H +    EY+HSDL G PS   + G   YF++ IDD +R+
Sbjct: 482 FGSAKHVTREKLEYIHSDLWGAPSVPNSLGNCQYFITFIDDLTRK 526


>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
          Length = 1298

 Score =  340 bits (873), Expect = 1e-91
 Identities = 210/574 (36%), Positives = 319/574 (54%), Gaps = 32/574 (5%)

Query: 2   MGSKWDIEKFTGSNDFGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEK-REMIDKAK 60
           M +K++IEKF G N F LWK+K++A+L +  C+ A+   +  P   T ++K  EM + A 
Sbjct: 1   MAAKFEIEKFNGKN-FSLWKLKVKAILRKDNCLAAI---SERPVDFTDDKKWSEMNEDAM 56

Query: 61  SAIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISE 120
           + + L + D VL  +  + TA  +   L  LY  KSL ++  LK++LY+ +M ES S++E
Sbjct: 57  ADLYLSIADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTE 116

Query: 121 QLTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHF---------KDTILYGKEGTT 171
            L   N +   L ++    E +++A LLL SLP S++            D +++      
Sbjct: 117 HLNTLNTLFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDYLVFDDVAAA 176

Query: 172 TLEEVQAALRTKELTKFKDLKVD-EGSEGLNVARGRNEHRGKGKGKSRSKSRSKGFDKSK 230
            LEE ++  + KE     D +V+ + +E L V RGR+  RG+  G+ RSKS  K      
Sbjct: 177 VLEE-ESRRKNKE-----DRQVNLQQAEALTVMRGRSTERGQSSGRGRSKSSKKNLT--- 227

Query: 231 YKCFLCHKQGHFKKDCPDKGGDGSPSVQVAEASNEEGYESTGALVVTSWKSEKS--WVLD 288
             C+ C K+GH KKDC +   + +P   VA  S++       A +    +   +  W++D
Sbjct: 228 --CYNCGKKGHLKKDCWNLAQNSNPQGNVASTSDDGSALCCEASIAREGRKRFADIWLID 285

Query: 289 SGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLRDVRFVP 348
           SG +YHM  RKE+F       GG V   ++ A ++ G+G ++LKM+DG    ++DVR V 
Sbjct: 286 SGATYHMTSRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVK 345

Query: 349 ELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKM-NGLYILDG-SIVIGNASV 406
            LK+NL+S  + D        + GV KI  GAL+ +KG K+   LY+L G ++    ASV
Sbjct: 346 GLKKNLLSYGILDNSATQIETQKGVMKIFQGALVVMKGEKIAANLYMLKGETLQEAEASV 405

Query: 407 ASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVKFGSG 466
           A+  P +++ LWH +LGH+S++G+  L +Q L+       L  CEHCI  KQHR+KF + 
Sbjct: 406 AACSP-DSTLLWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFSTS 464

Query: 467 MHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF*KFKE*H 526
                 + E VHSD +  +  P+ GG  YF+S IDDYSRR WV+ +KKKSD F  FK   
Sbjct: 465 NSRGKVVLELVHSD-VWQAPVPSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAFK 523

Query: 527 TLIENQMGTKLKGLRTDNGLEFVSEQFNDFLQVE 560
             +E   G K+K  RTDNG E+ SE+F+DF + E
Sbjct: 524 ARVELDSGKKIKCFRTDNGGEYTSEEFDDFCKKE 557


>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
          Length = 1328

 Score =  338 bits (867), Expect = 7e-91
 Identities = 199/574 (34%), Positives = 320/574 (55%), Gaps = 32/574 (5%)

Query: 3   GSKWDIEKFTGSND-FGLWKVKMQAVLTQQKCVEALKGEAAMPATLTQEEKREMIDKAKS 61
           G K+++ KF G    F +W+ +M+ +L QQ   +AL G++  P ++  E+  E+ +KA S
Sbjct: 3   GVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAAS 62

Query: 62  AIVLCLGDKVLRDVAREATAASM*AKLESLYMTKSLAHRQLLKQQLYSFKMVESISISEQ 121
           AI L L D V+ ++  E +A  +  KLE+LYM+K+L ++  LK+QLY+  M E  +    
Sbjct: 63  AIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSH 122

Query: 122 LTEFNKILVDLANIEVNTEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTLEEVQAALR 181
           L   N ++  LAN+ V  E+EDK ++LL SLP S++    TIL+GK+ +  L++V +AL 
Sbjct: 123 LNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKD-SIQLKDVTSALL 181

Query: 182 TKELTKFKDLKVDEGSEGLNVARGRNEHRGKGK-GKSRSKSRSKGFDKSKYK-CFLCHKQ 239
             E  K +    + G   +  +RGR+  R     G+S ++ +SK   KSK + C+ C + 
Sbjct: 182 LNE--KMRKKPENHGQVFITESRGRSYQRSSSNYGRSGARGKSKVRSKSKARNCYNCDQP 239

Query: 240 GHFKKDCPD-KGGDGSPSVQVAEASNEEGYESTGALVVTSWK----------------SE 282
           GHFK+DCP+ K G G       E+S ++  ++T A+V  +                  +E
Sbjct: 240 GHFKRDCPNPKRGKG-------ESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTE 292

Query: 283 KSWVLDSGCSYHMCPRKEYFETLTLKEGGVVRLGNNKACKVQGMGNVRLKMFDGREFLLR 342
             WV+D+  SYH  P ++ F      + G V++GN    K+ G+G++  K   G   +L+
Sbjct: 293 SEWVVDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLK 352

Query: 343 DVRFVPELKRNLISLSMFDGLGYCTRIEHGVCKISHGALITVKGSKMNGLYILDGSIVIG 402
           DVR VP+L+ NLIS    D  GY     +   +++ GAL+  KG     LY  +  I  G
Sbjct: 353 DVRHVPDLRMNLISGIALDQDGYENYFANQKWRLTKGALVIAKGVARGTLYRTNAEICQG 412

Query: 403 NASVASVVPHNNSELWHLRLGHVSERGLVELAKQGLLGKDKLDKLEFCEHCILGKQHRVK 462
             + A     N+++LWH R+GH SE+GL  L+K+ L+   K   ++ C + + GKQHRV 
Sbjct: 413 ELNAAH--EENSADLWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHRVS 470

Query: 463 FGSGMHHSSRLFEYVHSDLLGPSKTPTHGGGSYFLSIIDDYSRRVWVFVLKKKSDTF*KF 522
           F +     S + + V+SD+ GP +  + GG  YF++ IDD SR++WV++ + K   F  F
Sbjct: 471 FQTSSERKSNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQVF 530

Query: 523 KE*HTLIENQMGTKLKGLRTDNGLEFVSEQFNDF 556
           ++ H L+E + G K K LRTDNG E+ S +F ++
Sbjct: 531 QKFHALVERETGRKRKRLRTDNGGEYTSREFEEY 564


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.346    0.154    0.537 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,043,683,341
Number of Sequences: 2540612
Number of extensions: 81477081
Number of successful extensions: 293933
Number of sequences better than 10.0: 1076
Number of HSP's better than 10.0 without gapping: 736
Number of HSP's successfully gapped in prelim test: 340
Number of HSP's that attempted gapping in prelim test: 290172
Number of HSP's gapped (non-prelim): 2003
length of query: 1307
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1167
effective length of database: 507,674,714
effective search space: 592456391238
effective search space used: 592456391238
T: 11
A: 40
X1: 15 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.7 bits)
S2: 81 (35.8 bits)


Medicago: description of AC124217.5