Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC135504.5 - phase: 0 /pseudo
         (1283 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultiv...   400  e-109
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi...   390  e-106
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult...   387  e-105
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-...   382  e-104
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi...   380  e-103
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-...   378  e-103
ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sa...   377  e-102
emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana] ...   370  e-100
gb|AAP53029.1| putative retrotransposon-related protein [Oryza s...   368  e-100
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi...   367  1e-99
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu...   360  2e-97
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi...   359  3e-97
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu...   359  3e-97
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi...   353  2e-95
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1...   350  2e-94
gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult...   350  2e-94
gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis ...   347  1e-93
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]                             345  4e-93
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]               328  6e-88
emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir|...   328  9e-88

>ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultivar-group)]
           gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza
           sativa (japonica cultivar-group)]
          Length = 1181

 Score =  400 bits (1027), Expect = e-109
 Identities = 228/589 (38%), Positives = 349/589 (58%), Gaps = 24/589 (4%)

Query: 5   KWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSAII 64
           K+D+        F LW+VKMRA+L Q++  +AL    + +   +  EK + + KA+S I 
Sbjct: 5   KYDLPLLDRDTRFSLWQVKMRAVLAQQELDDALSGFDKRTQDWSNDEK-KRDRKAMSYIH 63

Query: 65  LCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLTE 124
           L L + +L+EV +E TA  +W KL+ + MTK L  +  LKQ+L+ +++ +   +M+ L+ 
Sbjct: 64  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSA 123

Query: 125 FNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITKE 184
           F +I+ DL +++V  +++D AL LLC+LP S+ NF+DT+LY ++ T+TL+EV   L  KE
Sbjct: 124 FKEIVADLESMEVKYDEKDLALILLCSLPSSYANFRDTILYSRD-TLTLKEVYDALHAKE 182

Query: 185 LTKFKDLKVDDSG---EGLNVSRGRNQNRGKGKGKNSKSKSRSKGDG-NKTKYK-CFICH 239
             K K +  + S    EGL V RG  Q +        KS S  +G   ++ +YK C  C 
Sbjct: 183 KMK-KMVPSEGSNSQAEGL-VVRGSQQEKNTNNKSRDKSSSSYRGRSKSRGRYKSCKYCK 240

Query: 240 NPGHFKKDC--PERKDNGGGNPSVQLASKDEGC------ESAGALTVTSW----EPEKGW 287
             GH    C   + KD   G    +   ++EG       E + A  + ++    +    W
Sbjct: 241 RDGHDISKCWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTSDQW 300

Query: 288 VLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVR 347
           +LD+ C+YH+ P + +F T E+ +GG V +G++  C++ GIGT+++KMFD     L DVR
Sbjct: 301 ILDTACTYHMCPNRDWFATYEVVQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVR 360

Query: 348 YIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHA 406
           +IP L+R+LIS+   D  GY      G+++++ G+LV+ K S K   LY L+G+TI+ + 
Sbjct: 361 HIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKASIKSANLYHLQGTTILGNV 420

Query: 407 SV--PSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVK 464
           +    S+   D T LWH+RLGH+SE GL EL+K+GLL  + ++KL FC++C  GK  +VK
Sbjct: 421 ATVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSISKLKFCEHCIFGKHKRVK 480

Query: 465 FGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKF 524
           F    H +    + VHSDL GPA   ++GG  Y  +I+DDYSR+VW Y LK+K  AF  F
Sbjct: 481 FNTSTHTTEGILDYVHSDLWGPARKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFNVF 540

Query: 525 KEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           KEW  +VE Q   K+K+LRTDNG EF  + F  +C+ +GI RH  V +T
Sbjct: 541 KEWKTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHT 589



 Score = 99.0 bits (245), Expect = 9e-19
 Identities = 43/91 (47%), Positives = 65/91 (71%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            EAIWL+G+  E+     C+ I CDSQSAI L   Q++HERTKHI++R HF+R +I   ++
Sbjct: 1087 EAIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQMFHERTKHIDVRYHFIRGVIAEGDV 1146

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            KV K+++ +NPAD+ TK +P ++F+ C  L+
Sbjct: 1147 KVCKISTHDNPADMMTKPVPATKFELCSSLV 1177


>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]
           gi|11278366|pir||T47492 copia-like polyprotein -
           Arabidopsis thaliana
          Length = 1363

 Score =  390 bits (1003), Expect = e-106
 Identities = 233/611 (38%), Positives = 346/611 (56%), Gaps = 51/611 (8%)

Query: 3   GSKWDIEKFTGSNLFGLWKVKM---------RAILIQEKCVEALKREAQMSAHLTPAEKT 53
           G++ ++EKF G   + +WK K+          A+L + +     +R+++ S      E+ 
Sbjct: 3   GARIEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEERE 62

Query: 54  EMND------KAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQL 107
           +M        KA S I+L + D++LR++ +ET+A +M   LD LYM+K+L +R  LKQ+L
Sbjct: 63  KMEAFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQKL 122

Query: 108 YFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGK 167
           Y ++M E+  I   + EF  I+ DL N++V + DED+A+ LL +LP+ F+  KDT+ Y  
Sbjct: 123 YSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYSS 182

Query: 168 EGTI-TLEEVQATLITKELTKFKDLKVDDSG--EGLNV-----SRGRNQNRGKGKGKNSK 219
             T+ +L+EV A + ++EL +F  +K    G  EGL V     +RGR++ + KGKGK SK
Sbjct: 183 GKTVLSLDEVAAAIYSREL-EFGSVKKSIKGQAEGLYVKDKAENRGRSEQKDKGKGKRSK 241

Query: 220 SKSRSKGDGNKTKYKCFICHNPGHFKKDCPERKD----NGGGNPSVQLASKD---EGC-- 270
           SKS         K  C+IC   GH K  CP +      N G N       K    EG   
Sbjct: 242 SKS---------KRGCWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKGNLVEGSVN 292

Query: 271 --ESAG-----ALTVTSWEPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKAC 323
             ESAG     AL+ T    E  W++D+GC YH++ ++ + E  + E GG VR+GN    
Sbjct: 293 FVESAGMFVSEALSSTDIHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGSVRMGNKSIS 352

Query: 324 KIQGIGTIRLKMFDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGAL 383
           +++G+GT+R+   +     L++VRYIP++ RNL+S+  F+  G+    E G++RI  G  
Sbjct: 353 RVKGVGTVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFEKAGHKFESENGMLRIKSGNQ 412

Query: 384 VIAKGSKIHGLYILEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGN 443
           V+ +G +   LYIL G       S+      D T LWH RL H+S++ +  L K+G L  
Sbjct: 413 VLLEGRRYDTLYILHGKPA-TDESLAVARANDDTVLWHRRLCHMSQKNMSLLIKKGFLDK 471

Query: 444 EKLNKLDFCDNCTLGKQHKVKFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSII 502
           +K++ LD C++C  G+  K+ F +  H + +  E VHSDL G P    + G   YF S I
Sbjct: 472 KKVSMLDTCEDCIYGRAKKIGFNLAQHDTKKKLEYVHSDLWGAPTVPMSLGNCQYFISFI 531

Query: 503 DDYSRRVWVYILKNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKK 562
           DDY+R+VWVY LK K +AFEKF  W  LVENQ G ++K LRTDNG EF    F+ FC +K
Sbjct: 532 DDYTRKVWVYFLKTKDEAFEKFVSWISLVENQSGERVKTLRTDNGLEFCNRMFDGFCEEK 591

Query: 563 GIKRHRIVAYT 573
           G +RHR  AYT
Sbjct: 592 GFQRHRTCAYT 602



 Score =  108 bits (271), Expect = 9e-22
 Identities = 48/102 (47%), Positives = 75/102 (73%)

Query: 1179 CVYIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLH 1238
            C Y+    +  EAIWLKG++ + G  Q  V+I CDSQSAI L+ + ++HERTKHI+++ H
Sbjct: 1257 CEYMSLTEAVKEAIWLKGLLKDFGYEQKNVEIFCDSQSAIALSKNNVHHERTKHIDVKFH 1316

Query: 1239 FVRDMIETKEIKVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            F+R++I   +++V K+++E+NPADIFTK LP ++F+  LD +
Sbjct: 1317 FIREIIADGKVEVSKISTEKNPADIFTKVLPVNKFQTALDFL 1358


>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
           gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1280

 Score =  387 bits (994), Expect = e-105
 Identities = 224/589 (38%), Positives = 335/589 (56%), Gaps = 24/589 (4%)

Query: 5   KWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSAII 64
           K+D+        F LW+VKMRA+L Q+   +AL    + +   +  EK + + KA+S I 
Sbjct: 40  KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEKKK-DRKAMSYIH 98

Query: 65  LCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLTE 124
           L L + +L+EV +E TA  +W KL+ + MTK L  +  LKQ+L+ +++ +   +M+ L+ 
Sbjct: 99  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLST 158

Query: 125 FNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITKE 184
           F +I+ DL +I+V  ++ED  L LLC+LP S+ NF+DT+LY  + T+ L+EV   L  KE
Sbjct: 159 FKEIVADLESIEVKYDEEDLGLILLCSLPSSYANFRDTILYSHD-TLILKEVYDALHAKE 217

Query: 185 LTKFKDLKVDDSG---EGLNVSRGRNQNRGKGKGKNSKSKSRSKGDG-NKTKYK-CFICH 239
             K K +  + S    EGL V RGR Q +        KS S  +G   ++ +YK C  C 
Sbjct: 218 KMK-KMVPSEGSNSQAEGL-VVRGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYCK 275

Query: 240 NPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPE------------KGW 287
             GH   +C + +D            K E    A  +T    + E              W
Sbjct: 276 RDGHDISECWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDTELLVAYAGCAQTSDQW 335

Query: 288 VLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVR 347
           +LD+  +YH+ P + +F T E  +GG V +G++  C++ GIGT+++KMFD     L DVR
Sbjct: 336 ILDTAWTYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLSDVR 395

Query: 348 YIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHA 406
           +IP L+R+LIS+   D  GY      G+++++ G+LV+ K   K   LY L G+TI+ + 
Sbjct: 396 HIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNV 455

Query: 407 SV--PSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVK 464
           +    S+   D T LWH+RLGH+SE GL EL+K+ LL  + + KL FC++C  GK  +VK
Sbjct: 456 AAVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRELLDGQSIGKLKFCEHCIFGKHKRVK 515

Query: 465 FGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKF 524
           F    H +    + VHSDL GPA   ++GG  Y  +I+DDYSR+VW Y LK+K  AF+ F
Sbjct: 516 FNTSTHTTEGILDYVHSDLWGPACKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDVF 575

Query: 525 KEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           KEW  +VE Q   K+K+LRTDNG EF  + F  +C+ +GI  H  V +T
Sbjct: 576 KEWKTMVERQTEKKVKILRTDNGMEFCSKIFKSYCKSEGIVHHYTVPHT 624



 Score = 93.2 bits (230), Expect = 5e-17
 Identities = 41/91 (45%), Positives = 63/91 (69%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            EAIWL+G+  E+     C+ I CDSQSAI L   Q++HERTKHI++R H +R +I   ++
Sbjct: 1186 EAIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQMFHERTKHIDVRYHIIRGVIVEGDV 1245

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            KV K+++ +NPAD+ TK +  ++F+ C  L+
Sbjct: 1246 KVCKISTHDNPADMMTKPVSATKFELCSSLV 1276


>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
           sativa (japonica cultivar-group)]
          Length = 2340

 Score =  382 bits (980), Expect = e-104
 Identities = 219/588 (37%), Positives = 333/588 (56%), Gaps = 22/588 (3%)

Query: 5   KWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSAII 64
           K+D+        F LW+VKMRA+L Q+   +AL    + +   +  EK + + KA+S I 
Sbjct: 212 KYDLPLLYRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTHDWSNDEK-KRDRKAMSYIH 270

Query: 65  LCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLTE 124
           L L + +L+EV +E  A  +W KL+ + MTK L  +  LKQ L+ +++ +   +M+ L+ 
Sbjct: 271 LHLSNNILQEVLKEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSA 330

Query: 125 FNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITKE 184
           F +II DL +++V  ++ED  L LLC+LP S+ NF+DT+LY ++ T+TL+EV   L  KE
Sbjct: 331 FKEIIADLESMEVKYDEEDLGLILLCSLPSSYANFRDTILYSRD-TLTLKEVYDALHVKE 389

Query: 185 LTKFKDLKVDDSG---EGLNVSRGRNQNRGKGKGKNSKSKSRSKGDGNKTKYK-CFICHN 240
             K K +  + S    EGL V   + +   K + ++  S S      ++ +YK C  C  
Sbjct: 390 KMK-KMVPSEGSNSQAEGLIVWGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYCKR 448

Query: 241 PGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPE------------KGWV 288
            GH   +C +  D            K E    A  +T    + E              W+
Sbjct: 449 DGHDIFECWKLHDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTSDQWI 508

Query: 289 LDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRY 348
           L++ C YH+ P + +F T E  + G V +G++  C++ GIGT+++KMFD     L DVR+
Sbjct: 509 LNTACIYHMCPNRDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRH 568

Query: 349 IPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHAS 407
           IP L+R+LIS+   D  GY      G+++++ G+LV+ K   K   LY L G+TI+ + +
Sbjct: 569 IPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNVA 628

Query: 408 V--PSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKF 465
               S+   D T LWH+RLGH++E GL EL+K+GLL  + + KL FC++C  GK  +VKF
Sbjct: 629 AVSDSLSNSDATNLWHMRLGHMTEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKF 688

Query: 466 GVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFK 525
               H +    + VHSDL GPA   ++GG  Y  +I+DDYSR+VW Y LK+K  AF+ FK
Sbjct: 689 NTSTHTTEGILDYVHSDLWGPARKTSFGGTRYMMTIVDDYSRKVWPYFLKHKYQAFDVFK 748

Query: 526 EWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           EW  +VE Q   K+K+LRTDNG EF  + F  +C+ +GI RH  V +T
Sbjct: 749 EWKTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHT 796



 Score = 96.3 bits (238), Expect = 6e-18
 Identities = 41/91 (45%), Positives = 63/91 (69%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            EAIWL+G+   +     C+ I CDSQSAI L   Q++HERTKHI++R HF+R +I   ++
Sbjct: 1445 EAIWLRGLYTVLCAVTSCINIFCDSQSAICLTKDQMFHERTKHIDVRYHFIRGLIAEGDV 1504

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            K+ K++  +NPAD+ TK +P ++F+ C  L+
Sbjct: 1505 KICKISIHDNPADMMTKPVPATKFELCSSLV 1535


>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301696|pir||F84486 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1356

 Score =  380 bits (977), Expect = e-103
 Identities = 224/599 (37%), Positives = 333/599 (55%), Gaps = 42/599 (7%)

Query: 7   DIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKT------------- 53
           ++EKF G   + +WK K+ A +       ALK         +  +++             
Sbjct: 7   EVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEKFEA 66

Query: 54  --EMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYR 111
             E   KA SAI+L + D++LR++ +E+TA +M   LD LYM+K+L +R   KQ+LY ++
Sbjct: 67  LEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQKLYSFK 126

Query: 112 MMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGK-EGT 170
           M E+  +   + EF +II DL N++V + DED+A+ LL ALP++F+  KDT+ Y   +  
Sbjct: 127 MSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKYSSGKSI 186

Query: 171 ITLEEVQATLITKEL---TKFKDLKVDDSG---EGLNVSRGRNQNRGKGKGKNSKSKSRS 224
           +TL+EV A + +KEL   +  K +KV   G   +  N ++G+ + +GKGKGK  KSK   
Sbjct: 187 LTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVKDKNENKGKGEQKGKGKGKKGKSKK-- 244

Query: 225 KGDGNKTKYKCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEG----CESAG-----A 275
                  K  C+ C   GHF+  CP +         V       G     E+AG     A
Sbjct: 245 -------KPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAAGYYVSEA 297

Query: 276 LTVTSWEPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKM 335
           L+ T    E  W+LD+GCSYH++ ++ +F     + GG VR+GN    +++G+GTIR+K 
Sbjct: 298 LSSTEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGTIRVKN 357

Query: 336 FDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLY 395
            D    +L +VRYIP++ RNL+S+  F+  GY    E G++RI  G  V+  G +   LY
Sbjct: 358 SDGLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDGILRIKAGNQVLLTGRRYDTLY 417

Query: 396 ILEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNC 455
           +L     +A  S+  V   D T LWH RL H+S++ +  L ++G L  +K++ LD C++C
Sbjct: 418 LLNWKP-VASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDKKKVSSLDVCEDC 476

Query: 456 TLGKQHKVKFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSIIDDYSRRVWVYIL 514
             GK  +  F +  H +    E +HSDL G P    + G   YF SIIDD++R+VWVY +
Sbjct: 477 IYGKAKRKSFSLAHHDTKEKLEYIHSDLWGAPFVPLSLGKCQYFMSIIDDFTRKVWVYFM 536

Query: 515 KNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           K K +AFEKF EW  LVENQ   ++K LRTDNG EF  + F+ FC   GI RHR  AYT
Sbjct: 537 KTKDEAFEKFVEWVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCESIGIHRHRTCAYT 595



 Score =  103 bits (257), Expect = 4e-20
 Identities = 46/91 (50%), Positives = 70/91 (76%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            EAIWLKG++ + G  Q  V+I CDSQSAI L+ + ++HERTKHI+++ HF+R++I    +
Sbjct: 1261 EAIWLKGLLKDFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVKYHFIREIISDGTV 1320

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            +V K+++E+NPADIFTK L  S+F+  L+L+
Sbjct: 1321 EVLKISTEKNPADIFTKVLAVSKFQAALNLL 1351


>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
           sativa (japonica cultivar-group)]
          Length = 1373

 Score =  378 bits (971), Expect = e-103
 Identities = 226/586 (38%), Positives = 332/586 (56%), Gaps = 23/586 (3%)

Query: 5   KWDIEKFTGSNLFGLWKVKMRAILIQEKCV-EALKREAQMSAHLTPAEKTEMNDKAVSAI 63
           K+D+        F LW+VKMR IL Q     EAL    +  A  T AE+   + KA++ I
Sbjct: 2   KFDLPLLNYDTRFSLWQVKMRGILAQTHDYDEALDNFGKRRAEWT-AEEIRKDQKALALI 60

Query: 64  ILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLT 123
            L L + +L+E   E T+  +W KL+ + M+K L  +  +K +L+  +M E   ++  + 
Sbjct: 61  QLHLHNDILQECLTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMA 120

Query: 124 EFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITK 183
           EF KI+ DL +++V  +DED  L LLC+LP S+ NF+DT+L  ++  +TL+EV   L  K
Sbjct: 121 EFKKIVADLVSMEVKYDDEDLGLLLLCSLPNSYANFRDTILLSRD-ELTLKEVYDALQNK 179

Query: 184 ELTKF---KDLKVDDSGEGLNVSRGRNQNRGKG-KGKNSKSKSRSKGDGNKTKYKCFICH 239
           E  K     D      GE L+V RGR +NR    K  + + +S+SK  GNK K+ C  C 
Sbjct: 180 EKMKIMVQNDGSSSSKGEALHV-RGRTENRTSNEKNYDRRGRSKSKPPGNK-KF-CVYCK 236

Query: 240 NPGHFKKDCP-----ERKDNGGGNPSVQLASKDEGCESAGALTVTSW--EPEKGWVLDSG 292
              H   +C      ERK+   G  SV  A+  +  +S   L V +        W+LDS 
Sbjct: 237 LKNHNIDECKKVQAKERKNKKDGKVSVASAAASDD-DSGDCLVVFAGCVAGHDEWILDSA 295

Query: 293 CSYHISPRKGYFETLE-LEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIPE 351
           CS+HI  ++ +F + + +++G VVR+G++  C I GIG++++K  D     LK+VRYIP 
Sbjct: 296 CSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYIPG 355

Query: 352 LRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHG-LYILEGSTIIAHASVPS 410
           + RNLIS+S  D  GY      GV+++S G+LV  KG      LY+L G T+    S  +
Sbjct: 356 MSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGSDSAAA 415

Query: 411 VDTLDI---TKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGV 467
             T D    T LWH+RLGH+S  G+ EL K+ LL     +K+ FC++C  GK  +V+F  
Sbjct: 416 AITNDEPSKTNLWHMRLGHMSHLGMTELMKRNLLKGCTSSKIKFCEHCIFGKHKRVQFNT 475

Query: 468 GVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEW 527
            VH +    + VH+DL GP+   + GG  Y  +IIDDYSR+VW Y LK+K D F  FK W
Sbjct: 476 SVHTTKGTLDYVHADLWGPSKKPSLGGARYMLTIIDDYSRKVWPYFLKHKDDTFTAFKNW 535

Query: 528 DILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
            +++E Q   K+K+LRTDNG EF    FN++CR++GI RH  + +T
Sbjct: 536 KVMIERQTERKVKLLRTDNGGEFCSHAFNDYCRQEGIVRHHTIPHT 581



 Score = 78.6 bits (192), Expect = 1e-12
 Identities = 32/56 (57%), Positives = 44/56 (78%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIE 1245
            E IWLKG+  E+   + C+ +HCDS+SAI+L   Q++HERTKHI+I+ HFVRD+IE
Sbjct: 1239 ELIWLKGLYAELSGVESCISLHCDSESAIYLTKDQMFHERTKHIDIKYHFVRDVIE 1294


>ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sativa]
           gi|14029020|gb|AAK52561.1| Putative retroelement pol
           polyprotein [Oryza sativa]
          Length = 1326

 Score =  377 bits (969), Expect = e-102
 Identities = 228/590 (38%), Positives = 336/590 (56%), Gaps = 33/590 (5%)

Query: 5   KWDIEKFTGSNLFGLWKVKMRAILIQEKCV-EALKREAQMSAHLTPAEKTEMNDKAVSAI 63
           K+D+        F LW+VKMRAIL Q   + EAL+   +  +    AE+   + KA+  I
Sbjct: 5   KYDLPLLDYKTRFSLWQVKMRAILAQTSDLDEALESFGKKKSTEWTAEEKRKDRKALLLI 64

Query: 64  ILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLT 123
            L L + +L+EV +E TA  +W KL+ + M+K L  +  +K +L+ +++ ES  ++  ++
Sbjct: 65  QLHLSNDILQEVLQEKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 124

Query: 124 EFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITK 183
            F +I+ DL +I+V  +DED  L LLC+LP S+ NF+DT+L  ++  +TL EV   L  +
Sbjct: 125 VFKEIVVDLVSIEVQFDDEDLGLLLLCSLPSSYANFRDTILLSRD-ELTLAEVYEALQNR 183

Query: 184 ELTKFKDLKVDDS----GEGLNVSRGRNQNRGKGKGKN---SKSKSRSKGDGNKTKYKCF 236
           E  K K +   D+    GE L V RGR++ R      +   S+S+ RSK  G K    C 
Sbjct: 184 E--KMKGMVQSDASSSKGEALQV-RGRSEQRTYNDSSDRDKSQSRGRSKSRGKKF---CK 237

Query: 237 ICHNPGHFKKDC------PERKDNGGGNPSVQLASKDEG-CESAGALTVTSWEPEKGWVL 289
            C    HF ++C       +RK +G  +      + D G C    A  V S +    W+L
Sbjct: 238 YCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGCVASHDE---WIL 294

Query: 290 DSGCSYHISPRKGYFETLE-LEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRY 348
           D+ CS+HI   + +F + + ++ G VVR+G++   +I GIG++++K  D     LKDVR+
Sbjct: 295 DTACSFHICINRDWFSSYKSVQNGDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRH 354

Query: 349 IPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHAS 407
           IP + RNLIS+S  D  GY      GV+++S G+LV   G      LY+L GST+  H S
Sbjct: 355 IPGMARNLISLSTLDAEGYKYSSSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTL--HGS 412

Query: 408 VP----SVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKV 463
           V     S D    T LWH+RLGH+SE G+ EL K+ LL      K+ FC++C  GK  +V
Sbjct: 413 VTAAAVSKDEPIKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGKMKFCEHCVFGKHKRV 472

Query: 464 KFGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEK 523
           KF   VH++    + VH+DL GP+     GG  Y  +IIDDYSR+VW Y LK+K D F  
Sbjct: 473 KFNTSVHRTKGILDYVHTDLWGPSRKAYLGGARYMLTIIDDYSRKVWPYFLKHKDDTFAA 532

Query: 524 FKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           FKEW + +E Q   ++KVLRTDNG EF  + F+++CRK+GI RH  + YT
Sbjct: 533 FKEWKVRIERQTEKEVKVLRTDNGGEFCSDAFDDYCRKEGIVRHHTIPYT 582



 Score = 64.3 bits (155), Expect = 3e-08
 Identities = 25/55 (45%), Positives = 39/55 (70%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMI 1244
            E++WLKG+  E+     C+ + CDSQSAI L    ++HER+KHI+I+ H+V D++
Sbjct: 1097 ESVWLKGLFAELCGVDSCINLFCDSQSAICLTKDHMFHERSKHIDIKYHYVHDVV 1151


>emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana]
           gi|4539406|emb|CAB40039.1| putative retrotransposon
           [Arabidopsis thaliana] gi|7444416|pir||T04181
           hypothetical protein F7L13.40 - Arabidopsis thaliana
          Length = 1230

 Score =  370 bits (951), Expect = e-100
 Identities = 222/598 (37%), Positives = 323/598 (53%), Gaps = 56/598 (9%)

Query: 7   DIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEK-------------T 53
           ++EKF G   + LWK K+ A +       AL+    +S  L   E+              
Sbjct: 7   EMEKFDGHGDYTLWKEKLMAHMDLLGLTVALRETQSVSDPLESEEEGKESEKGDKEALME 66

Query: 54  EMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMM 113
           E   KA S I+L + D++LR+  +E TA SM   LD LYM+K+L +R  LKQ+LY Y+M 
Sbjct: 67  EKRQKARSTIVLSVSDQVLRKSKKEKTAPSMLEALDKLYMSKALPNRIYLKQKLYSYKMQ 126

Query: 114 ESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGK-EGTIT 172
           E+  +   + EF ++I DL N +V + DED+A+ LL +LP+ F+  KDT+ YG    T++
Sbjct: 127 ENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSLPKQFDQLKDTLKYGSGRTTLS 186

Query: 173 LEEVQATLITKELTKFKDLK-VDDSGEGLNVS---RGRNQNRGKGKGKNSKSKSRSKGDG 228
           ++EV A + +KEL    + K +    EGL V      R  +  K KG   +S+SRSKG  
Sbjct: 187 VDEVVAAIYSKELELGSNKKSIRGQAEGLYVKDKPETRGMSEQKEKGNKGRSRSRSKGWK 246

Query: 229 NKTKYKCFICHNPGHFKKDCPER-------KDNGGGNPSVQLASKDEGCESAG-----AL 276
                 C+IC   GHFK  CP +       KD   G+       K    E +G     AL
Sbjct: 247 G-----CWICGEEGHFKTSCPNKGKQQNKGKDQASGSKGEAATIKGNTSEGSGYYVSEAL 301

Query: 277 TVTSWEPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMF 336
             T       WV+D+GC+YH++ +K +FE L  + GG VR+GN    K +          
Sbjct: 302 HSTDVNLGNEWVMDTGCNYHMTHKKEWFEELSEDAGGTVRMGNKSTSKFR---------- 351

Query: 337 DDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLYI 396
                    V+YIP++ RNL+S+   +  GY    + GV+ +  G   +  GS+   LY+
Sbjct: 352 ---------VKYIPDMDRNLLSMGTLEEHGYSFESKNGVLVVKEGTRTLLIGSRHEKLYL 402

Query: 397 LEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCT 456
           L+G   ++H S+      D T LWH RLGH+S++ +  L K+G L  +K++KL+ C++C 
Sbjct: 403 LQGKPEVSH-SMTVERRNDDTVLWHRRLGHISQKNMDILVKKGYLDGKKVSKLELCEDCI 461

Query: 457 LGKQHKVKFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSIIDDYSRRVWVYILK 515
            GK  ++ F V  H +      VHSDL G P+   + G   YF S ID YSR+ WVY LK
Sbjct: 462 YGKARRLSFVVATHNTEDKLNYVHSDLWGAPSVPLSLGKCQYFISFIDVYSRKTWVYFLK 521

Query: 516 NKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           +K +AF  F EW ++VENQ G K+K+LR DNG EF  +QFN+FC++KGI RH+  AYT
Sbjct: 522 HKDEAFGTFAEWSVMVENQTGRKIKILRIDNGLEFCNQQFNDFCKEKGIVRHQTCAYT 579



 Score = 93.2 bits (230), Expect = 5e-17
 Identities = 43/88 (48%), Positives = 64/88 (71%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            EAIWLKG++ + G  Q  V+I CDSQSAI L+ + ++H+RTKHI+I+ H +R++I    +
Sbjct: 1135 EAIWLKGLLQDFGYEQKTVEIFCDSQSAIALSKNNVHHDRTKHIDIKYHKIREVIADGVV 1194

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCL 1277
            +V+K+ +  N ADIFTK +P S+FK  L
Sbjct: 1195 EVKKICTLVNSADIFTKVVPVSKFKTAL 1222


>gb|AAP53029.1| putative retrotransposon-related protein [Oryza sativa (japonica
           cultivar-group)] gi|37532880|ref|NP_920742.1| putative
           retrotransposon-related protein [Oryza sativa (japonica
           cultivar-group)] gi|22655747|gb|AAN04164.1| Putative
           retrotransposon protein [Oryza sativa (japonica
           cultivar-group)] gi|16905223|gb|AAL31093.1| putative
           retrotransposon-related protein [Oryza sativa]
          Length = 1229

 Score =  368 bits (945), Expect = e-100
 Identities = 218/590 (36%), Positives = 333/590 (55%), Gaps = 23/590 (3%)

Query: 5   KWDIEKFTGSNLFGLWKVKMRAILIQEKCV-EALKREAQMSAHLTPAEKTEMNDKAVSAI 63
           K+D+        F LW+VKMRA+L Q   + EAL+   +       AE+   + KA+S I
Sbjct: 2   KYDLPLQDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 61

Query: 64  ILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLT 123
            L L + +L++V +E TA  +W KL+ + M+K L  +  +K +L+ +++ ES  ++  ++
Sbjct: 62  QLHLSNDILQKVLQEKTAAELWFKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 121

Query: 124 EFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITK 183
            F +II DL +++V  +DED  L LLC+LP  + NF+DT+L  ++  +TL EV   L  +
Sbjct: 122 VFKEIIADLVSMEVQFDDEDLGLLLLCSLPSLYANFRDTILLSRD-ELTLAEVYEALQNR 180

Query: 184 ELTKFKDLKVDDS----GEGLNVSRGRNQNRGKGKGKN---SKSKSRSKGDGNKTKYKCF 236
           E  K K +   D+    G+ L V RGR++ R      +   S+S+ RSK  G K    C 
Sbjct: 181 E--KMKGMVQSDASSSKGKALQV-RGRSEQRTYNDSNDRDKSQSRGRSKSRGKKF---CK 234

Query: 237 ICHNPGHFKKDC--PERKDNGGGNPSVQLASKDEGCESAGALTVTSW--EPEKGWVLDSG 292
            C    HF ++C   + K+    +    + +  E  +SA  L   +        W+LD+ 
Sbjct: 235 YCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSADCLVFFAGCVASHDEWILDTA 294

Query: 293 CSYHISPRKGYFETLE-LEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIPE 351
           C + I   + +F + + ++ G VVR+G+N   +I GIG++++K  D     LKDVR+IP 
Sbjct: 295 CLFLICINRDWFSSHKSVQNGDVVRMGDNNPREIMGIGSVQIKTHDGMTRTLKDVRHIPG 354

Query: 352 LRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAH--ASV 408
           + RNLIS+S  D  GY      GV+++S G+LV   G      LY+L GST+     A+ 
Sbjct: 355 MARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSLTAAA 414

Query: 409 PSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGVG 468
            S D    T LWH+RLGH+SE G+ EL K+ LL       + FC++C  GK  +VKF   
Sbjct: 415 VSKDEPSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTS 474

Query: 469 VHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEWD 528
           VH++    + VH+DL GP+   + GG  Y  +IIDDYSR+VW Y LK+K D F  FKEW 
Sbjct: 475 VHRTKGILDYVHADLWGPSRKPSLGGACYMLTIIDDYSRKVWPYFLKHKDDTFAAFKEWK 534

Query: 529 ILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYTSTERS 578
           +++E Q   ++KVLRTDNG EF  + F+++CRK+GI RH  + YT  + S
Sbjct: 535 VMIERQAEKEVKVLRTDNGGEFCSDAFDDYCRKEGIGRHHTIPYTPQQNS 584



 Score = 60.1 bits (144), Expect = 5e-07
 Identities = 25/55 (45%), Positives = 38/55 (68%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMI 1244
            E++WLKG+  E+      + + CDSQS I L   QI+HERTK+I+I+ H+V D++
Sbjct: 1174 ESVWLKGLFAELCRVDSYINLFCDSQSVICLTKDQIFHERTKYIDIKYHYVCDVV 1228


>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301702|pir||E84601 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1333

 Score =  367 bits (943), Expect = 1e-99
 Identities = 227/590 (38%), Positives = 333/590 (55%), Gaps = 46/590 (7%)

Query: 7   DIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDK-------- 58
           ++EKF G   + +WK K+ A L       ALK E  +   +   + TE  +K        
Sbjct: 7   EVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLRREL 66

Query: 59  -------AVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYR 111
                  A SAI+L + D++LR++ +E +A +M   LD LYM+K+L +R   KQ+LY ++
Sbjct: 67  LEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLYSFK 126

Query: 112 MMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGK-EGT 170
           M E+  I   + EF +II DL N +V + DED+A+ LL +LP+ F+  +DT+ YG    T
Sbjct: 127 MSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLGRVT 186

Query: 171 ITLEEVQATLITKELTKFKDLK-VDDSGEGLNV-----SRGRNQNRGKGKGKNSKSKSRS 224
           ++L+EV A + +KEL    + K +    EGL V     +RGR + RG     N+  KSRS
Sbjct: 187 LSLDEVVAAIYSKELELGSNKKSIKGQAEGLFVKEKTETRGRTEQRGNN---NNNKKSRS 243

Query: 225 KGDGNKTKYKCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPE 284
           K   +++K  C+IC               NG  N      S+  G   + AL+ T    E
Sbjct: 244 K---SRSKKGCWICGE-----------SSNGSSN-----YSEANGLYVSEALSSTDIHLE 284

Query: 285 KGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLK 344
             WV+D+GCSYH++ ++ +FE L  + GG VR+GN    K++GIGTIR+K        L 
Sbjct: 285 DEWVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTVSKVRGIGTIRVKNEAGMVVRLT 344

Query: 345 DVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLYILEGSTIIA 404
           +VRYIPE+ RNL+S+  F+  GY  ++E G + I  G  V+    + + LY+L+   +  
Sbjct: 345 NVRYIPEMDRNLLSLGTFEKSGYSFKLENGTLSIIAGDSVLLTVRRCYTLYLLQWRPV-T 403

Query: 405 HASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVK 464
             S+  V   D T LWH RLGH+S++ +  L K+GLL  +K++KL+ C++C  GK  ++ 
Sbjct: 404 EESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKKKVSKLETCEDCIYGKAKRIG 463

Query: 465 FGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEK 523
           F +  H +    E VHSDL G P+   + G   YF S IDDY+R+V +Y LK K +AF+K
Sbjct: 464 FNLAQHDTREKLEYVHSDLWGAPSVPFSLGKCQYFISFIDDYTRKVRIYFLKTKDEAFDK 523

Query: 524 FKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           F EW  LVENQ   ++K LRTDNG EF    F+EFC +KGI  HR  AYT
Sbjct: 524 FVEWANLVENQTDKRIKTLRTDNGLEFCNRSFDEFCSQKGILWHRTCAYT 573



 Score = 99.0 bits (245), Expect = 9e-19
 Identities = 44/91 (48%), Positives = 67/91 (73%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            EA+W+KG++ E G  Q  V+I CDSQSAI L+ + ++HERTKHI++R  ++RD+I   + 
Sbjct: 1238 EAVWMKGLLKEFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVRYQYIRDIIANGDG 1297

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
             V K+ +E+NPADIFTK +P ++F+  L L+
Sbjct: 1298 DVVKIDTEKNPADIFTKIVPVNKFQAALTLL 1328


>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1243

 Score =  360 bits (924), Expect = 2e-97
 Identities = 213/578 (36%), Positives = 328/578 (55%), Gaps = 30/578 (5%)

Query: 5   KWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSAII 64
           K+D+        F LW+VKMRA+L Q+   +AL    + +   +  EK + + KA+S I 
Sbjct: 5   KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEK-KRDRKAISYIH 63

Query: 65  LCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLTE 124
           L L + +L+EV +E TA  +W KL+ + MTK L  +  LKQ+L+ +++ + + +M+ L+ 
Sbjct: 64  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSA 123

Query: 125 FNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITKE 184
           F +I+ DL +++V  +++D  L LLC+LP S+ NF+ T+LY ++ T+TL+EV      KE
Sbjct: 124 FKEIVADLESMEVKYDEDDLGLILLCSLPSSYANFRGTILYSRD-TLTLKEVYDAFHAKE 182

Query: 185 LTKFKDLKVDDSG----EGLNVSRGRNQNRGKGKGKNSKSKSRSKG-DGNKTKYK-CFIC 238
             K K +   +      EGL V RGR Q +        KS S  +G   ++ +YK C  C
Sbjct: 183 --KMKKMVTSEGSNSQAEGL-VVRGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYKSCKYC 239

Query: 239 HNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPEKGWVLDSGCSYHIS 298
              GH   +C + +D            K E    A  +T    + E   V  +GC+   +
Sbjct: 240 KRDGHDISECWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDAEL-LVAYAGCAQ--T 296

Query: 299 PRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIPELRRNLIS 358
             + +F T E  +GG V +G++  C++ GIGT+++KMFD     L DV++IP L+R+LIS
Sbjct: 297 SDQDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNLKRSLIS 356

Query: 359 ISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHASV--PSVDTLD 415
           +              G+++++ G+LV+ K   K   LY L G+TI+ + +    S+   D
Sbjct: 357 LY-------------GILKVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAVFDSLSNSD 403

Query: 416 ITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGVGVHKSSRP 475
            T LWH+RLGH+SE GL EL+K+GLL  + + KL FC++C  GK  +VKF    H +   
Sbjct: 404 ATNLWHMRLGHMSEIGLAELSKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNTSTHTTEGI 463

Query: 476 FE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEWDILVENQI 535
            + VHSDL GPA   ++GG  Y  +I+DDYSR+VW Y LK+K  AF+ FKEW  +VE Q 
Sbjct: 464 LDYVHSDLWGPAHKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEWKTMVERQT 523

Query: 536 GTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
             K+K+LRTDNG EF  + F  +C+ +GI  H    +T
Sbjct: 524 ERKVKILRTDNGMEFCSKIFKSYCKSEGIVCHYTAPHT 561



 Score = 40.4 bits (93), Expect = 0.39
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query: 1181 YIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHL 1220
            Y+  + +  EAIWL+G+  E+     C+ I CDSQSAI+L
Sbjct: 1201 YMAIFEACKEAIWLRGLYTELCGVTSCINIFCDSQSAIYL 1240


>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301697|pir||B84512 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1335

 Score =  359 bits (922), Expect = 3e-97
 Identities = 207/551 (37%), Positives = 302/551 (54%), Gaps = 28/551 (5%)

Query: 38  KREAQMSAHLTPAEKTEMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSL 97
           KR+A   A L      E  DKA + I L + DK+LR++    TA   W  LD L+M +SL
Sbjct: 34  KRDADEVARL------ERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETLDRLFMIRSL 87

Query: 98  AHRQCLKQQLYFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFE 157
            HR   +   Y ++M E+K I E + +F KI+ DL ++ +++ DE +A+ LL +LP  ++
Sbjct: 88  PHRVYTQLSFYTFKMQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILLLSSLPARYD 147

Query: 158 NFKDTMLYGKEGT-ITLEEVQATLITKELTKFKDLKVDDSGEGLNVSRGRNQNRGKGKGK 216
              +TM Y      + L++V      KE    ++ +    G   + +RGR   +   +G 
Sbjct: 148 GLVETMKYSNSREKLRLDDVMVAARDKERELSQNNRPVVEG---HFARGRPDGKNNNQGN 204

Query: 217 NSKSKSRSKG-DGNKTKYKCFICHNPGHFKKDC------PERKDNGGGNPSVQLASKDEG 269
             K++SRSK  DG +    C+IC   GHFKK C       + K  G  N    LA   E 
Sbjct: 205 KGKNRSRSKSADGKRV---CWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAKSTEA 261

Query: 270 CESAGALTVTSW------EPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKAC 323
              A  L  T             WVLD+GCS+H++PRK +F+  +    G V++GN+   
Sbjct: 262 FNPAMVLLATDETLVVTDSIANEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYS 321

Query: 324 KIQGIGTIRLKMFDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGAL 383
            ++GIG+I+++  D    +L DVRY+P + RNLIS+   +  G   + + G+++I  G  
Sbjct: 322 PVKGIGSIKIRNSDGSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDGILKIVKGCS 381

Query: 384 VIAKGSKIHGLYILEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGN 443
            I KG K   LYIL+G T     S  S +  D T LWH RLGH+S++G+  L K+G L  
Sbjct: 382 TILKGQKRDTLYILDGVTEEGE-SHSSAEVKDETALWHSRLGHMSQKGMEILVKKGCLRR 440

Query: 444 EKLNKLDFCDNCTLGKQHKVKFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSII 502
           E + +L+FC++C  GKQH+V F    H +      VHSDL G P    + G   YF S +
Sbjct: 441 EVIKELEFCEDCVYGKQHRVSFAPAQHVTKEKLAYVHSDLWGSPHNPASLGNSQYFISFV 500

Query: 503 DDYSRRVWVYILKNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKK 562
           DDYSR+VW+Y L+ K +AFEKF EW  +VENQ   K+K LRTDNG E+    F +FC+++
Sbjct: 501 DDYSRKVWIYFLRKKDEAFEKFVEWKKMVENQSDRKVKKLRTDNGLEYCNHYFEKFCKEE 560

Query: 563 GIKRHRIVAYT 573
           GI RH+  AYT
Sbjct: 561 GIVRHKTCAYT 571



 Score =  108 bits (271), Expect = 9e-22
 Identities = 51/91 (56%), Positives = 72/91 (79%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            EA+WLKG   E+G SQ  V++H DSQSAI LA + ++HERTKHI+IRLHF+RD+I    I
Sbjct: 1240 EALWLKGFAAELGHSQDYVEVHSDSQSAITLAKNSVHHERTKHIDIRLHFIRDIICAGLI 1299

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            KV K+A+E NPA+IFTK++P ++F+  L+++
Sbjct: 1300 KVVKIATECNPANIFTKTVPLAKFEGALNML 1330


>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|53370655|gb|AAU89150.1| integrase core domain
           containing protein [Oryza sativa (japonica
           cultivar-group)] gi|40538906|gb|AAR87163.1| putative
           polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1322

 Score =  359 bits (922), Expect = 3e-97
 Identities = 215/586 (36%), Positives = 328/586 (55%), Gaps = 25/586 (4%)

Query: 5   KWDIEKFTGSNLFGLWKVKMRAILIQEKCV-EALKREAQMSAHLTPAEKTEMNDKAVSAI 63
           K+D+        F LW+VKMRA+L Q   + EAL+   +       AE+   + KA+S I
Sbjct: 5   KYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 64

Query: 64  ILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQLT 123
            L L + +L+EV ++ TA  +W KL+ + M+K L  +  +K +L+ +++ ES  ++  ++
Sbjct: 65  QLHLSNDILQEVLQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHIS 124

Query: 124 EFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLITK 183
            F +I+ DL +++V  +DED  L LLC+LP S+ NF+ T+L  ++  +TL EV   L  +
Sbjct: 125 VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSRD-ELTLAEVYEALQNR 183

Query: 184 ELTK--FKDLKVDDSGEGLNVSRGRNQNRGKGKGKN---SKSKSRSKGDGNKTKYKCFIC 238
           E  K   +       GE L V RGR++ R      +   S+S+ RSK  G K    C  C
Sbjct: 184 EKMKGMVQSYASSSKGEALQV-RGRSEQRTYNDSNDHDKSQSRGRSKSRGKKF---CKYC 239

Query: 239 HNPGHFKKDC------PERKDNGGGNPSVQLASKDEG-CESAGALTVTSWEPEKGWVLDS 291
               HF ++C       +RK +G  +      + D G C    A  V S +    W+LD+
Sbjct: 240 KKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASHDE---WILDT 296

Query: 292 GCSYHISPRKGYFETLE-LEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIP 350
            CS+HI   + +F + + ++   VVR+G++   +I GIG++++K  D     LKDVR+IP
Sbjct: 297 ACSFHICINRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIP 356

Query: 351 ELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGS-KIHGLYILEGSTIIAHASVP 409
            + RNLIS+S  D  GY      GV+++S G+LV   G      LY+L GST+    +  
Sbjct: 357 GMARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAA 416

Query: 410 SV--DTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGV 467
           +V  D    T LWH+RLGH+SE G+ EL K+ LL       + FC++C  GK  +VKF  
Sbjct: 417 AVTKDEPSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNT 476

Query: 468 GVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEW 527
            VH++    + VH+DL GP+   + GG  Y  +IIDDYSR+ W Y LK+K D F  FKE 
Sbjct: 477 SVHRTKGILDYVHADLWGPSRKPSLGGARYMLTIIDDYSRKEWPYFLKHKDDTFAAFKER 536

Query: 528 DILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
            +++E Q   ++KVL TDNG EF  + F+++CRK+GI RH  + YT
Sbjct: 537 KVMIERQTEKEVKVLCTDNGGEFCSDAFDDYCRKEGIVRHHTIPYT 582



 Score = 99.0 bits (245), Expect = 9e-19
 Identities = 41/94 (43%), Positives = 66/94 (69%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            E++WLKG+  E+     C+ + CDSQSAI L   Q++HERTKHI+I+ H+VRD++   ++
Sbjct: 1228 ESVWLKGLFAELCGVDSCINLFCDSQSAICLTKDQMFHERTKHIDIKYHYVRDIVAQGKL 1287

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLINFI 1283
            KV K++  +NPAD+ TK +P ++F+ C  L+  +
Sbjct: 1288 KVCKISIHDNPADMMTKPIPVAKFELCSSLVGIV 1321


>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
          Length = 1342

 Score =  353 bits (906), Expect = 2e-95
 Identities = 223/599 (37%), Positives = 321/599 (53%), Gaps = 55/599 (9%)

Query: 7   DIEKFTGSNLFGLWKVKMRAILI-----------QEKCVEALKREAQMSAHLTPAEKT-- 53
           ++EKF G   + LWK K+ A +            +E  VE    E     +  P   T  
Sbjct: 7   EVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETATSK 66

Query: 54  -------EMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQ 106
                  E   KA S IIL LG+ +LR+V ++ TA  M   LD L+M KSL +R  LKQ+
Sbjct: 67  LEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYLKQR 126

Query: 107 LYFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYG 166
           LY Y+M E+  + E + +F K+I DL N+ V + DED+A+ LL +LPR F+  K+T+ Y 
Sbjct: 127 LYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLKYC 186

Query: 167 KEGTITLEEVQATLITKELTKFKDLK-VDDSGEGLNV-SRGRNQNRGKGKGKNSKSKSRS 224
           K  T+ LEE+ + + +K L      K + ++ +GL V  RGR++ RGKG  KN KS+S+S
Sbjct: 187 KT-TLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDRGRSETRGKGPNKN-KSRSKS 244

Query: 225 KGDGNKTKYKCFICHNPGHFKKDC---PERKDNGGGNPSVQLASKDEGCESAGALTVT-- 279
           KG G      C+IC   GHFKK C    ER   G  +   + ++       A AL V+  
Sbjct: 245 KGAGK----TCWICGKEGHFKKQCYVWKERNKQGSTSERGEASTVTARVTDAAALVVSRA 300

Query: 280 ----SWEPEKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKM 335
               +      W+LD+GCS+H++ RK +    +    G VR+GN+   +++GIG +R+K 
Sbjct: 301 LLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEVKGIGDVRIKN 360

Query: 336 FDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLY 395
            D    LL DVRYIPE+ +NLIS+   +  G     ++G++ I    L +  G K   LY
Sbjct: 361 EDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKGILTIFKNDLTVLTGKKESTLY 420

Query: 396 ILEGSTIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNC 455
            L+G+T+   A+V   +  D T LWH RLGH+  +GL  L  +G                
Sbjct: 421 FLQGTTLAGEANVIDKEK-DETSLWHSRLGHIGAKGLQVLVSKG---------------- 463

Query: 456 TLGKQHKVKFGVGVHKSSRPFE*VHSDLLGPA*VK-TYGGGSYFTSIIDDYSRRVWVYIL 514
            L K   + FG   H +    + VHSDL G   V  + G   YF + IDD++RR W+Y +
Sbjct: 464 HLDKNIMISFGAAKHVTKDKLDYVHSDLWGSTNVPFSIGKCQYFITFIDDFTRRTWIYFI 523

Query: 515 KNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           + K +AF KF EW   +ENQ   KLK+L TDNG EF  ++F+ FCRK+G+ RHR  AYT
Sbjct: 524 RTKDEAFSKFVEWKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKEGVIRHRTCAYT 582



 Score =  105 bits (262), Expect = 1e-20
 Identities = 47/91 (51%), Positives = 68/91 (74%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            EAIWL+G+  EMG  Q  V++ CDSQSAI L+ + ++HERTKHI++R HF+R+ I   EI
Sbjct: 1247 EAIWLRGLAAEMGFEQDAVEVMCDSQSAIALSKNSVHHERTKHIDVRYHFIREKIADGEI 1306

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            +V K+++  NPADIFTK++P S+ +  L L+
Sbjct: 1307 QVVKISTTWNPADIFTKTVPVSKLQEALKLL 1337


>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
           gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
           polyprotein from transposon TNT 1-94 [Contains: Protease
           ; Reverse transcriptase ; Endonuclease]
          Length = 1328

 Score =  350 bits (897), Expect = 2e-94
 Identities = 211/594 (35%), Positives = 318/594 (53%), Gaps = 41/594 (6%)

Query: 3   GSKWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVSA 62
           G K+++ KF G N F  W+ +MR +LIQ+   + L  +++    +   +  +++++A SA
Sbjct: 3   GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASA 62

Query: 63  IILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQL 122
           I L L D ++  +  E TA  +W +L+ LYM+K+L ++  LK+QLY   M E    +  L
Sbjct: 63  IRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHL 122

Query: 123 TEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLIT 182
             FN +I  LAN+ V +E+EDKA+ LL +LP S++N   T+L+GK  TI L++V + L+ 
Sbjct: 123 NVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKT-TIELKDVTSALLL 181

Query: 183 KELTKFKDLKVDDSGEGL-NVSRGRNQNRGKGKGKNSKSKSRSKGDGNKTKYKCFICHNP 241
            E  + K    ++ G+ L    RGR+  R       S ++ +SK         C+ C+ P
Sbjct: 182 NEKMRKKP---ENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQP 238

Query: 242 GHFKKDCPERKDNGG----------------GNPSVQL-ASKDEGCESAGALTVTSWEPE 284
           GHFK+DCP  +   G                 N +V L  +++E C            PE
Sbjct: 239 GHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSG-------PE 291

Query: 285 KGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLK 344
             WV+D+  S+H +P +  F      + G V++GN    KI GIG I +K       +LK
Sbjct: 292 SEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLK 351

Query: 345 DVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLY-----ILEG 399
           DVR++P+LR NLIS    D  GY +       R++ G+LVIAKG     LY     I +G
Sbjct: 352 DVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQG 411

Query: 400 STIIAHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGK 459
               A   + SVD      LWH R+GH+SE+GL  LAK+ L+   K   +  CD C  GK
Sbjct: 412 ELNAAQDEI-SVD------LWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGK 464

Query: 460 QHKVKFGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSD 519
           QH+V F     +     + V+SD+ GP  +++ GG  YF + IDD SR++WVYILK K  
Sbjct: 465 QHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQ 524

Query: 520 AFEKFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
            F+ F+++  LVE + G KLK LR+DNG E+   +F E+C   GI+  + V  T
Sbjct: 525 VFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGT 578



 Score =  103 bits (257), Expect = 4e-20
 Identities = 44/100 (44%), Positives = 72/100 (72%)

Query: 1181 YIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFV 1240
            YI    +  E IWLK  + E+G+ Q    ++CDSQSAI L+ + +YH RTKHI++R H++
Sbjct: 1224 YIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWI 1283

Query: 1241 RDMIETKEIKVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            R+M++ + +KV K+++ ENPAD+ TK +PR++F+ C +L+
Sbjct: 1284 REMVDDESLKVLKISTNENPADMLTKVVPRNKFELCKELV 1323


>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1241

 Score =  350 bits (897), Expect = 2e-94
 Identities = 195/501 (38%), Positives = 289/501 (56%), Gaps = 23/501 (4%)

Query: 93  MTKSLAHRQCLKQQLYFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCAL 152
           MTK L  +  LKQ+L+ +++ +   +M+ L+ F +I+ DL +++V  ++ED  L LLC+L
Sbjct: 1   MTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSL 60

Query: 153 PRSFENFKDTMLYGKEGTITLEEVQATLITKELTKFKDLKVDDSG---EGLNVSRGRNQN 209
           P S+ NF+DT+LY ++ T+TL+EV   L  KE  K K +  + S    EGL V RGR Q 
Sbjct: 61  PSSYANFRDTILYSRD-TLTLKEVYDALHAKEKMK-KMVPSEGSNSQAEGL-VVRGRQQE 117

Query: 210 RGKGKGKNSKSKSRSKGDG-NKTKYK-CFICHNPGHFKKDCPERKDNGGGNPSVQLASKD 267
           +        KS S  +G   ++ +YK C  C   GH   +C + +D            K 
Sbjct: 118 KNTNNKSRDKSSSIYRGRSKSRGRYKSCKYCKRDGHDISECWKLQDKDKRTRKYIPKGKK 177

Query: 268 EGCESAGALTVTSWEPE------------KGWVLDSGCSYHISPRKGYFETLELEEGGVV 315
           E    A  +T    + E              W+LD+ C+YH+ P + +F T E  +GG V
Sbjct: 178 EEEGKAAVVTDEKSDAELLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEAVQGGTV 237

Query: 316 RLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYIPELRRNLISISMFDGLGYCTRIERGV 375
            +G++  C++ GIGT+++KMFD     L DVR+IP L+R+LIS+   D  GY      G+
Sbjct: 238 LMGDDTPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYKYSGGDGI 297

Query: 376 MRISHGALVIAKGS-KIHGLYILEGSTIIAHASV--PSVDTLDITKLWHLRLGHVSERGL 432
           ++++ G+LV+ K   K   LY L G+TI+ + +    S+   D T LWH+RLGH+SE GL
Sbjct: 298 LKVTKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGL 357

Query: 433 VELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGVGVHKSSRPFE*VHSDLLGPA*VKTY 492
            EL+K+GLL  + + KL FC++C  GK  +VKF    H +    + VHSDL GPA   ++
Sbjct: 358 AELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPARKTSF 417

Query: 493 GGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEWDILVENQIGTKLKVLRTDNGQEFVL 552
           GG  Y  +I+DDYSR+VW Y LK+K  AF+ FKEW  +VE Q   K+K+LRTDNG E   
Sbjct: 418 GGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKILRTDNGMELCS 477

Query: 553 EQFNEFCRKKGIKRHRIVAYT 573
           + F  +C+ +GI RH  V +T
Sbjct: 478 KIFKSYCKSEGIVRHYTVPHT 498



 Score = 95.5 bits (236), Expect = 1e-17
 Identities = 41/91 (45%), Positives = 63/91 (69%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            E IWL+G+  E+     C+ I CDSQSAI L   Q++HERTKHI++R HF+R +I   ++
Sbjct: 1147 EVIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQMFHERTKHIDLRYHFIRGVIAEGDV 1206

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            KV K+++ +NP D+ TK +P ++F+ C  L+
Sbjct: 1207 KVCKISTHDNPVDMMTKPVPATKFELCSSLV 1237


>gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
           gi|25301707|pir||E86490 hypothetical protein F28L22.3 -
           Arabidopsis thaliana
          Length = 1356

 Score =  347 bits (890), Expect = 1e-93
 Identities = 205/591 (34%), Positives = 320/591 (53%), Gaps = 29/591 (4%)

Query: 7   DIEKFTGSNLFGLWKVKMRAIL------------IQEKCVEALKREAQMSA--------- 45
           +I+ F G   F LWK++++A L               K V   K EA+  +         
Sbjct: 9   EIKVFNGDRDFSLWKIRIQAQLGVLGLKDTLTDFSLTKTVPLTKSEAKQESGDGESSGTK 68

Query: 46  HLTPAEKTEMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQ 105
            +    K E +++A + II  + D +L +V+   T   +W  L+  YM  SL +R   + 
Sbjct: 69  EVPDPVKIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIYTQL 128

Query: 106 QLYFYRMMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLY 165
           +LY ++M+ +  I + + EF +I+ +L ++++ +++E +A+ +L +LP S    K T+ Y
Sbjct: 129 KLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTLKY 188

Query: 166 GKEGTITLEEV--QATLITKELTKFKDLKVDDSGEGLNVSRGRNQNRGKGKGKNSKSKSR 223
           G + T+T+++V   A  + +EL +  DL    +       RGR   R   KG   K +SR
Sbjct: 189 GNK-TLTVQDVTSSAKSLERELAEAVDLDKGQAAVLYTTERGRPLVRNNQKGGQGKGRSR 247

Query: 224 SKGDGNKTKYKCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEP 283
           S    +KTK  C+ C   GH KKDC  RK         +     E    + AL+V     
Sbjct: 248 SN---SKTKVPCWYCKKEGHVKKDCYSRKKKMESEGQGEAGVITEKLVFSEALSVNEQMV 304

Query: 284 EKGWVLDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLL 343
           +  W+LDSGC+ H++ R+ +F + + +    + LG++ + + QG GTIR+        +L
Sbjct: 305 KDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHSVESQGQGTIRIDTHGGTIKIL 364

Query: 344 KDVRYIPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLYILEGSTII 403
           ++V+Y+P LRRNLIS    D LGY      G +R         +GS  +GLY+L+GST++
Sbjct: 365 ENVKYVPHLRRNLISTGTLDKLGYRHEGGEGKVRYFKNNKTALRGSLSNGLYVLDGSTVM 424

Query: 404 AHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKV 463
           +       D +  T LWH RLGH+S   L  LA +GL+  +++N+L+FC++C +GK  KV
Sbjct: 425 SELCNAETDKVK-TALWHSRLGHMSMNNLKVLAGKGLIDRKEINELEFCEHCVMGKSKKV 483

Query: 464 KFGVGVHKSSRPFE*VHSDLLG-PA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFE 522
            F VG H S      VH+DL G P    +  G  YF SIIDD +R+VW+Y LK+K + F+
Sbjct: 484 SFNVGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLSIIDDKTRKVWLYFLKSKDETFD 543

Query: 523 KFKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           KF EW  LVENQ+  K+K LRTDNG EF   +F+ +C++ GI+RHR   YT
Sbjct: 544 KFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERHRTCTYT 594



 Score =  102 bits (255), Expect = 6e-20
 Identities = 44/103 (42%), Positives = 72/103 (69%)

Query: 1181 YIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFV 1240
            YI    +  EA+W+KG++ +MG+ Q  VKI CDSQSAI L+ + +YHERTKHI++R +++
Sbjct: 1251 YIALAEAAKEAMWIKGLLQDMGMQQDKVKIWCDSQSAICLSKNSVYHERTKHIDVRFNYI 1310

Query: 1241 RDMIETKEIKVEKVASEENPADIFTKSLPRSRFKHCLDLINFI 1283
            RD++E+ ++ V K+ +  NP D  TK +P ++FK  L ++  +
Sbjct: 1311 RDVVESGDVDVLKIHTSRNPVDALTKCIPVNKFKSALGVLKLM 1353


>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
          Length = 1298

 Score =  345 bits (886), Expect = 4e-93
 Identities = 217/586 (37%), Positives = 329/586 (56%), Gaps = 32/586 (5%)

Query: 2   MGSKWDIEKFTGSNLFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEK-TEMNDKAV 60
           M +K++IEKF G N F LWK+K++AIL ++ C+ A+   ++     T  +K +EMN+ A+
Sbjct: 1   MAAKFEIEKFNGKN-FSLWKLKVKAILRKDNCLAAI---SERPVDFTDDKKWSEMNEDAM 56

Query: 61  SAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIME 120
           + + L + D +L  +  + TA  +W+ L+ LY  KSL ++  LK++LY  RM ES  + E
Sbjct: 57  ADLYLSIADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTE 116

Query: 121 QLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFE----NFKDTMLYGKEGTITLEEV 176
            L   N +   L ++   +E +++A  LL +LP S++    N  + +L      +  ++V
Sbjct: 117 HLNTLNTLFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNIL---TDYLVFDDV 173

Query: 177 QATLITKELTKF--KDLKVD-DSGEGLNVSRGRNQNRGKGKGKNSKSKSRSKGDGNKTKY 233
            A ++ +E  +   +D +V+    E L V RGR+  RG+  G+  +SKS      +K   
Sbjct: 174 AAAVLEEESRRKNKEDRQVNLQQAEALTVMRGRSTERGQSSGRG-RSKS------SKKNL 226

Query: 234 KCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPEKG----WVL 289
            C+ C   GH KKDC     N   NP   +AS  +   +       + E  K     W++
Sbjct: 227 TCYNCGKKGHLKKDCWNLAQNS--NPQGNVASTSDDGSALCCEASIAREGRKRFADIWLI 284

Query: 290 DSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYI 349
           DSG +YH++ RK +F   E   GG V   ++ A +I GIGTI+LKM+D     ++DVR++
Sbjct: 285 DSGATYHMTSRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHV 344

Query: 350 PELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHG-LYILEGSTII-AHAS 407
             L++NL+S  + D        ++GVM+I  GALV+ KG KI   LY+L+G T+  A AS
Sbjct: 345 KGLKKNLLSYGILDNSATQIETQKGVMKIFQGALVVMKGEKIAANLYMLKGETLQEAEAS 404

Query: 408 VPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGV 467
           V +    D T LWH +LGH+S++G+  L +Q L+       L  C++C   KQH++KF  
Sbjct: 405 VAACSP-DSTLLWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFST 463

Query: 468 GVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEW 527
              +     E VHSD+   A V + GG  YF S IDDYSRR WVY +K KSD F  FK +
Sbjct: 464 SNSRGKVVLELVHSDVW-QAPVPSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAF 522

Query: 528 DILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
              VE   G K+K  RTDNG E+  E+F++FC+K+GIKR   VAYT
Sbjct: 523 KARVELDSGKKIKCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYT 568



 Score = 86.3 bits (212), Expect = 6e-15
 Identities = 37/96 (38%), Positives = 61/96 (63%)

Query: 1181 YIVWYNSNLEAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFV 1240
            Y+    ++ EAIWLK ++ E+G  Q  V + CDSQSA+HLA +  +H RTKHI ++ HF+
Sbjct: 1194 YVAATQASKEAIWLKMLLEELGHKQEFVSLFCDSQSALHLARNPAFHSRTKHIRVQYHFI 1253

Query: 1241 RDMIETKEIKVEKVASEENPADIFTKSLPRSRFKHC 1276
            R+ ++   + ++K+ + +N AD  TK +   +F  C
Sbjct: 1254 REKVKEGTVDLQKIHTADNVADFLTKIINVDKFTWC 1289


>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
          Length = 1328

 Score =  328 bits (842), Expect = 6e-88
 Identities = 203/590 (34%), Positives = 311/590 (52%), Gaps = 32/590 (5%)

Query: 3   GSKWDIEKFTGSN-LFGLWKVKMRAILIQEKCVEALKREAQMSAHLTPAEKTEMNDKAVS 61
           G K+++ KF G   +F +W+ +M+ +LIQ+   +AL  +++    +   +  E+++KA S
Sbjct: 3   GVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAAS 62

Query: 62  AIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYRMMESKPIMEQ 121
           AI L L D ++  +  E +A  +W KL+ LYM+K+L ++  LK+QLY   M E    +  
Sbjct: 63  AIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSH 122

Query: 122 LTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTITLEEVQATLI 181
           L   N +I  LAN+ V +E+EDK + LL +LP S++    T+L+GK+ +I L++V + L+
Sbjct: 123 LNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKD-SIQLKDVTSALL 181

Query: 182 TKELTKFKDLKVDDSGEG-LNVSRGRNQNRGKGKGKNSKSKSRSKGDGNKTKYKCFICHN 240
             E  + K    ++ G+  +  SRGR+  R       S ++ +SK         C+ C  
Sbjct: 182 LNEKMRKKP---ENHGQVFITESRGRSYQRSSSNYGRSGARGKSKVRSKSKARNCYNCDQ 238

Query: 241 PGHFKKDCPERKDNGG--------GNPSVQLASKDEGC----ESAGALTVTSWEPEKGWV 288
           PGHFK+DCP  K   G         N +  + + D+      E    + +   E E  WV
Sbjct: 239 PGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTESE--WV 296

Query: 289 LDSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRY 348
           +D+  SYH +P +  F      + G V++GN    KI GIG I  K       +LKDVR+
Sbjct: 297 VDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLKDVRH 356

Query: 349 IPELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLY-----ILEGSTII 403
           +P+LR NLIS    D  GY         R++ GALVIAKG     LY     I +G    
Sbjct: 357 VPDLRMNLISGIALDQDGYENYFANQKWRLTKGALVIAKGVARGTLYRTNAEICQGELNA 416

Query: 404 AHASVPSVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKV 463
           AH         +   LWH R+GH SE+GL  L+K+ L+   K   +  C+    GKQH+V
Sbjct: 417 AHEE-------NSADLWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHRV 469

Query: 464 KFGVGVHKSSRPFE*VHSDLLGPA*VKTYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEK 523
            F     + S   + V+SD+ GP  +++ GG  YF + IDD SR++WVYI + K   F+ 
Sbjct: 470 SFQTSSERKSNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQV 529

Query: 524 FKEWDILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
           F+++  LVE + G K K LRTDNG E+   +F E+C   GI+  + V  T
Sbjct: 530 FQKFHALVERETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGT 579



 Score = 79.3 bits (194), Expect = 8e-13
 Identities = 34/91 (37%), Positives = 60/91 (65%)

Query: 1190 EAIWLKGMIGEMGISQGCVKIHCDSQSAIHLANHQIYHERTKHINIRLHFVRDMIETKEI 1249
            E +WLK  + E G+ Q    ++C+SQSA+ L+   +YH  TKHI++R H++R+M++   +
Sbjct: 1233 EMLWLKRFLQEHGLHQKEYVVYCESQSAMDLSKKAMYHATTKHIDMRYHWIREMVDDGSL 1292

Query: 1250 KVEKVASEENPADIFTKSLPRSRFKHCLDLI 1280
            +V K+ + ENPAD+ TK +   +F+   +L+
Sbjct: 1293 QVVKIPTSENPADMVTKVVQNEKFELWKELV 1323


>emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir||S05465
           retrovirus-related polyprotein - Arabidopsis thaliana
           retrotransposon Ta1-3
          Length = 1291

 Score =  328 bits (840), Expect = 9e-88
 Identities = 187/525 (35%), Positives = 292/525 (55%), Gaps = 6/525 (1%)

Query: 52  KTEMNDKAVSAIILCLGDKMLREVSRETTAVSMWNKLDLLYMTKSLAHRQCLKQQLYFYR 111
           K E ++ A++ II  +GD +LR++    +A  MW  L+  YM  SL +R  ++ + Y ++
Sbjct: 83  KIEKSENAMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKFYSFK 142

Query: 112 MMESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLLCALPRSFENFKDTMLYGKEGTI 171
           M ++K I E + EF KI+ +L+++++N+ +E +A+  L  L   +   K T+ YG +  +
Sbjct: 143 MNDTKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGNKA-L 201

Query: 172 TLEEV--QATLITKELTKFKDLKVDDSGEGLNVSRGRNQNRGKGKGKNSKSKSRSKGDGN 229
           +L++V   A  + +EL + K+   + S       R R Q R +   K  + + RSK + N
Sbjct: 202 SLKDVISAARSLERELNEQKETDKNTSTVLYTNERSRPQTRNQNHNKGGQGRGRSKSNSN 261

Query: 230 KTKYKCFICHNPGHFKKDCPERKDNGGGNPSVQLASKDEGCESAGALTVTSWEPEKGWVL 289
             K  C+ C   GH KKD   RK         +     E    + AL+V        WVL
Sbjct: 262 -AKLTCWYCKKEGHVKKDYFARKRKLESENPGEAGVITEKLVFSEALSVNDLAVRDIWVL 320

Query: 290 DSGCSYHISPRKGYFETLELEEGGVVRLGNNKACKIQGIGTIRLKMFDDRDFLLKDVRYI 349
           DSGC+ H+S R+ +F +   + G  + LG++ + K QG G+I+++        L++V+Y+
Sbjct: 321 DSGCTSHMSARRDWFCSFREDGGPTILLGDDHSVKSQGQGSIKIETHGGTIIGLENVKYV 380

Query: 350 PELRRNLISISMFDGLGYCTRIERGVMRISHGALVIAKGSKIHGLYILEGSTIIAHASVP 409
           PELRRNLIS    D  GY      G +R         +G  ++GLYIL+G+T+++   V 
Sbjct: 381 PELRRNLISTGTLDKRGYKHEGGDGKVRYFKNQKTALRGELVNGLYILDGNTVLSETCVA 440

Query: 410 SVDTLDITKLWHLRLGHVSERGLVELAKQGLLGNEKLNKLDFCDNCTLGKQHKVKFGVGV 469
              +   T+LWH RLGH+    +  LA +GL+  E++  LDFC+NC +GK  KV F VG 
Sbjct: 441 E-GSKGKTELWHSRLGHIGLNNMKVLAGKGLVSKEEIRVLDFCENCVMGKAKKVSFNVGK 499

Query: 470 HKSSRPFE*VHSDLLGPA*VK-TYGGGSYFTSIIDDYSRRVWVYILKNKSDAFEKFKEWD 528
           H S      VH+DL G   V  +  G  YF SIIDD +R+VW+Y L++K + F++F EW 
Sbjct: 500 HNSEDVLRYVHADLWGSTNVTPSLSGNKYFLSIIDDKTRKVWLYFLRSKDETFDRFCEWK 559

Query: 529 ILVENQIGTKLKVLRTDNGQEFVLEQFNEFCRKKGIKRHRIVAYT 573
            LVENQ   K+K LRTDNG EF   +F+ +C++ GI+RH+   YT
Sbjct: 560 ELVENQQNKKVKCLRTDNGLEFCNLKFDAYCKEHGIERHKTCTYT 604


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.343    0.153    0.518 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,022,752,737
Number of Sequences: 2540612
Number of extensions: 82392245
Number of successful extensions: 360308
Number of sequences better than 10.0: 1326
Number of HSP's better than 10.0 without gapping: 934
Number of HSP's successfully gapped in prelim test: 394
Number of HSP's that attempted gapping in prelim test: 355441
Number of HSP's gapped (non-prelim): 3048
length of query: 1283
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1143
effective length of database: 507,674,714
effective search space: 580272198102
effective search space used: 580272198102
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.5 bits)
S2: 81 (35.8 bits)


Medicago: description of AC135504.5