Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC144730.2 - phase: 0 /pseudo
         (866 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi...   263  2e-68
ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultiv...   256  2e-66
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi...   253  2e-65
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi...   251  6e-65
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi...   248  8e-64
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult...   246  2e-63
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-...   238  8e-61
ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cu...   237  1e-60
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi...   235  4e-60
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-...   232  3e-59
ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sa...   231  6e-59
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1...   230  2e-58
emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana] ...   229  2e-58
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu...   225  6e-57
gb|AAP53029.1| putative retrotransposon-related protein [Oryza s...   222  4e-56
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]                             219  2e-55
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu...   214  8e-54
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi...   214  1e-53
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]               214  1e-53
ref|NP_916849.1| retrovirus-related pol polyprotein from transpo...   211  6e-53

>gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25412027|pir||G84599 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 838

 Score =  263 bits (671), Expect = 2e-68
 Identities = 168/458 (36%), Positives = 260/458 (56%), Gaps = 30/458 (6%)

Query: 3   DIEKFTGSNDFGLWKVKMRAI---------LIQQKCVEALKGEAQMDVHLTPAEKT---E 50
           ++EK  G  D+ LWK K+ A          L + + +E  +  A+ D  LT  E     E
Sbjct: 7   EVEKLDGEGDYVLWKEKLLAHIELLGLLEGLEEDEAIEEEESTAETDSLLTKTEDKVLKE 66

Query: 51  MNDKAVSAIILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVE 110
              KA S +IL LG+ VLR+V +E TA  M   LD L+M KSL +R  LKQ+LY Y+M +
Sbjct: 67  KRGKARSTVILSLGNHVLRKVIKEKTAAGMIRVLDKLFMAKSLPNRIYLKQRLYGYKMSD 126

Query: 111 SKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLE 170
           S  I E + +F K+I DL N+ V++ DED+A+ L  +LP+ F+  KDT+ YGK  T+ L+
Sbjct: 127 SMTIEENVNDFFKLISDLENVKVSVPDEDQAIVLLMSLPKQFDQLKDTLKYGK-TTLALD 185

Query: 171 EVQAALRTKELTKFKELK-VEDSGEGLNV-SRERSQNRGKGKGKNSRSKSRSKGDGNKTQ 228
           E+  A+R+K L      K +++S + L V  R RS+ R K   +N +S+SRSK    K  
Sbjct: 186 EITGAIRSKVLELGASGKMLKNSSDALFVQDRGRSEKRDKSSERN-KSQSRSKSREKKV- 243

Query: 229 YKCFICHNPGHFKKDCP--ERKGNGGGNPSVQIASNEEGYES-AGALTV------TSWEP 279
             C++C   GHFKK C   + K   G N     +SN  G  + A AL V       + E 
Sbjct: 244 --CWVCGKEGHFKKQCYVWKEKNKKGNNSEKGESSNVIGQAADAAALAVREESNADNQEV 301

Query: 280 EKGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLL 339
           +  W++D+GCS+H+ PR+++F   +  + G V + N    +I+ IG+IR++  D+   LL
Sbjct: 302 DNEWIMDTGCSFHMTPRRDWFVEFDESQTGRVKMANQTYSEIKGIGSIRIQNDDNTTVLL 361

Query: 340 KDVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGSKIHGLYILEGSTVI 399
           K+VRY+P + +NLIS+   +  G   + + G +++  G + + KG K+  LY+L+G  V 
Sbjct: 362 KNVRYVPSMSKNLISMGTLEDQGCWFQSKAGTLKVVKGCMTLLKGKKVGTLYLLQGVVVT 421

Query: 400 ADASVASVDTLDVTKLWHLRLGHVSERGIWLN*LNKGC 437
            +A+ A   + D +K+WH RL H+S+R I +  + KGC
Sbjct: 422 GNAN-AVTSSKDESKIWHSRLCHMSQRNIDVL-IKKGC 457


>ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultivar-group)]
           gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza
           sativa (japonica cultivar-group)]
          Length = 1181

 Score =  256 bits (655), Expect = 2e-66
 Identities = 157/451 (34%), Positives = 254/451 (55%), Gaps = 30/451 (6%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAII 60
           K+D+        F LW+VKMRA+L QQ+  +AL G  +     +  EK + + KA+S I 
Sbjct: 5   KYDLPLLDRDTRFSLWQVKMRAVLAQQELDDALSGFDKRTQDWSNDEK-KRDRKAMSYIH 63

Query: 61  LCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 120
           L L + +L+EV +E TA  +  KL+ + MTK L  +  LKQ+L+ +++ +   +M+ L+ 
Sbjct: 64  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSA 123

Query: 121 FNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTKE 180
           F +I+ DL +++V  +++D AL L C+LP S+ NF+DT+LY +  T+TL+EV  AL  KE
Sbjct: 124 FKEIVADLESMEVKYDEKDLALILLCSLPSSYANFRDTILYSR-DTLTLKEVYDALHAKE 182

Query: 181 LTKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGN-------KTQYK-CF 232
                ++K     EG N   E    RG  + KN+ +KSR K   +       + +YK C 
Sbjct: 183 -----KMKKMVPSEGSNSQAEGLVVRGSQQEKNTNNKSRDKSSSSYRGRSKSRGRYKSCK 237

Query: 233 ICHNPGHFKKDC--PERKGNGGGNPSVQIASNEEGY------ESAGALTVTSW----EPE 280
            C   GH    C   + K    G    +    EEG       E + A  + ++    +  
Sbjct: 238 YCKRDGHDISKCWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTS 297

Query: 281 KGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLK 340
             W+LD+ C+YH+CP +++F   E+ +GG V +G++  C++  IGT+++KMFD     L 
Sbjct: 298 DQWILDTACTYHMCPNRDWFATYEVVQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLS 357

Query: 341 DVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGS-KIHGLYILEGSTVI 399
           DVR+IP L+R+LIS+   D  GY      G+++++ G+L++ K S K   LY L+G+T++
Sbjct: 358 DVRHIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKASIKSANLYHLQGTTIL 417

Query: 400 ADASVA--SVDTLDVTKLWHLRLGHVSERGI 428
            + +    S+   D T LWH+RLGH+SE G+
Sbjct: 418 GNVATVSDSLSNSDATNLWHMRLGHMSEIGL 448


>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301696|pir||F84486 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1356

 Score =  253 bits (645), Expect = 2e-65
 Identities = 163/457 (35%), Positives = 255/457 (55%), Gaps = 41/457 (8%)

Query: 3   DIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKT------------- 49
           ++EKF G  D+ +WK K+ A +       ALK         +  +++             
Sbjct: 7   EVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEKFEA 66

Query: 50  --EMNDKAVSAIILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYR 107
             E   KA SAI+L + D+VLR++ +ESTA +M   LD LYM+K+L +R   KQ+LY ++
Sbjct: 67  LEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQKLYSFK 126

Query: 108 MVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTI 167
           M E+  +   + EF +II DL N++V + DED+A+ L  ALP++F+  KDT+ Y    +I
Sbjct: 127 MSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKYSSGKSI 186

Query: 168 -TLEEVQAALRTKEL---TKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGD 223
            TL+EV AA+ +KEL   +  K +KV+   EGL V +++++N+GKG+ K    K + K  
Sbjct: 187 LTLDEVAAAIYSKELELGSVKKSIKVQ--AEGLYV-KDKNENKGKGEQKG---KGKGKKG 240

Query: 224 GNKTQYKCFICHNPGHFKKDCPER-----------KG-NGGGNPSVQIASNEEGYESAGA 271
            +K +  C+ C   GHF+  CP +           KG + GG  ++  A+   GY  + A
Sbjct: 241 KSKKKPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAA---GYYVSEA 297

Query: 272 LTVTSWEPEKGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKM 331
           L+ T    E  W+LD+GCSYH+  ++E+F     + GG V +GN    +++ +GTIR+K 
Sbjct: 298 LSSTEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGTIRVKN 357

Query: 332 FDDRDFLLKDVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGSKIHGLY 391
            D    +L +VRYIP + RNL+S+  F+  GY    E G++RI  G  ++  G +   LY
Sbjct: 358 SDGLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDGILRIKAGNQVLLTGRRYDTLY 417

Query: 392 ILEGSTVIADASVASVDTLDVTKLWHLRLGHVSERGI 428
           +L    V A  S+A V   D T LWH RL H+S++ +
Sbjct: 418 LLNWKPV-ASESLAVVKRADDTVLWHQRLCHMSQKNM 453


>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
          Length = 1342

 Score =  251 bits (642), Expect = 6e-65
 Identities = 164/457 (35%), Positives = 248/457 (53%), Gaps = 38/457 (8%)

Query: 3   DIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKT------------- 49
           ++EKF G  D+ LWK K+ A +     +E L  E +  V  +  E +             
Sbjct: 7   EVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETATSK 66

Query: 50  -------EMNDKAVSAIILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQ 102
                  E   KA S IIL LG+ VLR+V ++ TA  M   LD L+M KSL +R  LKQ+
Sbjct: 67  LEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYLKQR 126

Query: 103 LYFYRMVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYG 162
           LY Y+M E+  + E + +F K+I DL N+ V + DED+A+ L  +LPR F+  K+T+ Y 
Sbjct: 127 LYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLKYC 186

Query: 163 K*GTITLEEVQAALRTKELTKFKELK-VEDSGEGLNV-SRERSQNRGKGKGKNSRSKSRS 220
           K  T+ LEE+ +A+R+K L      K ++++ +GL V  R RS+ RGKG  KN +S+S+S
Sbjct: 187 K-TTLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDRGRSETRGKGPNKN-KSRSKS 244

Query: 221 KGDGNKTQYKCFICHNPGHFKKDC---PERKGNGGGNPSVQIASNEEGYESAGALTVT-- 275
           KG G      C+IC   GHFKK C    ER   G  +   + ++       A AL V+  
Sbjct: 245 KGAGK----TCWICGKEGHFKKQCYVWKERNKQGSTSERGEASTVTARVTDAAALVVSRA 300

Query: 276 ----SWEPEKGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKM 331
               +      W+LD+GCS+H+  RK++    +    G V +GN+   +++ IG +R+K 
Sbjct: 301 LLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEVKGIGDVRIKN 360

Query: 332 FDDRDFLLKDVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGSKIHGLY 391
            D    LL DVRYIP++ +NLIS+   +  G     ++G++ I    L +  G K   LY
Sbjct: 361 EDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKGILTIFKNDLTVLTGKKESTLY 420

Query: 392 ILEGSTVIADASVASVDTLDVTKLWHLRLGHVSERGI 428
            L+G+T+  +A+V   +  D T LWH RLGH+  +G+
Sbjct: 421 FLQGTTLAGEANVIDKEK-DETSLWHSRLGHIGAKGL 456


>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]
           gi|11278366|pir||T47492 copia-like polyprotein -
           Arabidopsis thaliana
          Length = 1363

 Score =  248 bits (632), Expect = 8e-64
 Identities = 158/466 (33%), Positives = 254/466 (53%), Gaps = 48/466 (10%)

Query: 3   DIEKFTGSNDFGLWKVKM---------RAILIQQKCVEALKGEAQMDVHLTPAEKTEMND 53
           ++EKF G  D+ +WK K+          A+L + +     + +++        E+ +M  
Sbjct: 7   EVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEEREKMEA 66

Query: 54  ------KAVSAIILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYR 107
                 KA S I+L + D+VLR++ +E++A +M   LD LYM+K+L +R  LKQ+LY ++
Sbjct: 67  FEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQKLYSFK 126

Query: 108 MVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTI 167
           M E+  I   + EF  I+ DL N++V + DED+A+ L  +LP+ F+  KDT+ Y    T+
Sbjct: 127 MSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYSSGKTV 186

Query: 168 -TLEEVQAALRTKELTKFKELK--VEDSGEGLNVSRERSQNRG----KGKGKNSRSKSRS 220
            +L+EV AA+ ++EL +F  +K  ++   EGL V +++++NRG    K KGK  RSKS+S
Sbjct: 187 LSLDEVAAAIYSREL-EFGSVKKSIKGQAEGLYV-KDKAENRGRSEQKDKGKGKRSKSKS 244

Query: 221 KGDGNKTQYKCFICHNPGHFKKDCPER-----------KGNGGGNPSVQIASNEEGYESA 269
           K         C+IC   GH K  CP +           KG   G     +  +    ESA
Sbjct: 245 KRG-------CWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKGNLVEGSVNFVESA 297

Query: 270 G-----ALTVTSWEPEKGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVI 324
           G     AL+ T    E  W++D+GC YH+  ++E+ E  + E GG V +GN    +++ +
Sbjct: 298 GMFVSEALSSTDIHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGSVRMGNKSISRVKGV 357

Query: 325 GTIRLKMFDDRDFLLKDVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKG 384
           GT+R+   +     L++VRYIP + RNL+S+  F+  G+    E G++RI  G  ++ +G
Sbjct: 358 GTVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFEKAGHKFESENGMLRIKSGNQVLLEG 417

Query: 385 SKIHGLYILEGSTVIADASVASVDTLDVTKLWHLRLGHVSERGIWL 430
            +   LYIL G     D S+A     D T LWH RL H+S++ + L
Sbjct: 418 RRYDTLYILHGKPA-TDESLAVARANDDTVLWHRRLCHMSQKNMSL 462


>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
           gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1280

 Score =  246 bits (628), Expect = 2e-63
 Identities = 150/451 (33%), Positives = 244/451 (53%), Gaps = 30/451 (6%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAII 60
           K+D+        F LW+VKMRA+L QQ   +AL G  +     +  EK + + KA+S I 
Sbjct: 40  KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEKKK-DRKAMSYIH 98

Query: 61  LCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 120
           L L + +L+EV +E TA  +  KL+ + MTK L  +  LKQ+L+ +++ +   +M+ L+ 
Sbjct: 99  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLST 158

Query: 121 FNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTKE 180
           F +I+ DL +I+V  ++ED  L L C+LP S+ NF+DT+LY    T+ L+EV  AL  KE
Sbjct: 159 FKEIVADLESIEVKYDEEDLGLILLCSLPSSYANFRDTILYSH-DTLILKEVYDALHAKE 217

Query: 181 LTKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDG-------NKTQYK-CF 232
                ++K     EG N   E    RG+ + KN++++SR K          ++ +YK C 
Sbjct: 218 -----KMKKMVPSEGSNSQAEGLVVRGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCK 272

Query: 233 ICHNPGHFKKDCPE------------RKGNGGGNPSVQIASNEEGYESAGALTVTSWEPE 280
            C   GH   +C +             KG         + ++E+             +  
Sbjct: 273 YCKRDGHDISECWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDTELLVAYAGCAQTS 332

Query: 281 KGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLK 340
             W+LD+  +YH+CP +++F   E  +GG V +G++  C++  IGT+++KMFD     L 
Sbjct: 333 DQWILDTAWTYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLS 392

Query: 341 DVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGS-KIHGLYILEGSTVI 399
           DVR+IP L+R+LIS+   D  GY      G+++++ G+L++ K   K   LY L G+T++
Sbjct: 393 DVRHIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTIL 452

Query: 400 ADASVA--SVDTLDVTKLWHLRLGHVSERGI 428
            + +    S+   D T LWH+RLGH+SE G+
Sbjct: 453 GNVAAVSDSLSNSDATNLWHMRLGHMSEIGL 483


>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
           sativa (japonica cultivar-group)]
          Length = 2340

 Score =  238 bits (606), Expect = 8e-61
 Identities = 150/451 (33%), Positives = 247/451 (54%), Gaps = 30/451 (6%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAII 60
           K+D+        F LW+VKMRA+L QQ   +AL G  +   H    ++ + + KA+S I 
Sbjct: 212 KYDLPLLYRDTRFSLWQVKMRAVLAQQDLDDALSGFDKR-THDWSNDEKKRDRKAMSYIH 270

Query: 61  LCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 120
           L L + +L+EV +E  A  +  KL+ + MTK L  +  LKQ L+ +++ +   +M+ L+ 
Sbjct: 271 LHLSNNILQEVLKEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSA 330

Query: 121 FNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTKE 180
           F +II DL +++V  ++ED  L L C+LP S+ NF+DT+LY +  T+TL+EV  AL  KE
Sbjct: 331 FKEIIADLESMEVKYDEEDLGLILLCSLPSSYANFRDTILYSR-DTLTLKEVYDALHVKE 389

Query: 181 LTKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGN-------KTQYK-CF 232
                ++K     EG N   E     G+ + KN++++SR K   +       + +YK C 
Sbjct: 390 -----KMKKMVPSEGSNSQAEGLIVWGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCK 444

Query: 233 ICHNPGHFKKDCPER--KGNGGGNPSVQIASNEEGY------ESAGALTVTSW----EPE 280
            C   GH   +C +   K    G    +    EEG       E + A  + ++    +  
Sbjct: 445 YCKRDGHDIFECWKLHDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTS 504

Query: 281 KGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLK 340
             W+L++ C YH+CP +++F   E  + G V +G++  C++  IGT+++KMFD     L 
Sbjct: 505 DQWILNTACIYHMCPNRDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLS 564

Query: 341 DVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGS-KIHGLYILEGSTVI 399
           DVR+IP L+R+LIS+   D  GY      G+++++ G+L++ K   K   LY L G+T++
Sbjct: 565 DVRHIPNLKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTIL 624

Query: 400 ADASVA--SVDTLDVTKLWHLRLGHVSERGI 428
            + +    S+   D T LWH+RLGH++E G+
Sbjct: 625 GNVAAVSDSLSNSDATNLWHMRLGHMTEIGL 655


>ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|48475188|gb|AAT44257.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1211

 Score =  237 bits (604), Expect = 1e-60
 Identities = 153/452 (33%), Positives = 247/452 (53%), Gaps = 32/452 (7%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAII 60
           K+D+        F LW+VKMRA+L QQ   +AL G  +     +  EK + + K +S I 
Sbjct: 5   KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEK-KRDRKTMSYIH 63

Query: 61  LCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 120
           L L + +L+EV +E TA  +  KL+ + MTK L  +  LKQ+L+ +++ +   +M+ L+ 
Sbjct: 64  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSA 123

Query: 121 FNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTKE 180
           F KI+ DL +++V  ++ED  L L C+LP S+ NF+DT+LY    T+TL+EV  AL  KE
Sbjct: 124 FKKIVADLESMEVKYDEEDLCLILLCSLPSSYANFRDTILYSC-DTLTLKEVYDALHAKE 182

Query: 181 LTKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDG-------NKTQYK-CF 232
                ++K     EG N   E    RG+ + KN+ SKSR K          ++ +YK C 
Sbjct: 183 -----KIKKMVPSEGSNSQAEGLVVRGRQQEKNTNSKSRDKSSSSYRGRSKSRGRYKSCK 237

Query: 233 ICHNPGHFKKDC--PERKGNGGGNPSVQIASNEEGY------ESAGALTVTSW----EPE 280
                GH   +C   + K    G    +    EEG       E + A  + ++    +  
Sbjct: 238 YYKRDGHDISECWKLQDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTS 297

Query: 281 KGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLK 340
             W+LD+ C+YH+C  +++F   E  +GG V +G++  C++  + T+++KMFD     L 
Sbjct: 298 DQWILDTACTYHMCLNRDWFATYEAVQGGTVLMGDDTPCEVAGVETVQIKMFDGCIRTLS 357

Query: 341 DVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGS-KIHGLYILEGSTVI 399
           DVR+IP L+R+LIS+   D   Y      G+++++ G+L++ K   K   LY + G+T++
Sbjct: 358 DVRHIPNLKRSLISLCTLDRKVYKYSGGDGILKVTKGSLVVMKADIKSANLYHVRGTTIL 417

Query: 400 ADASVASVDTL---DVTKLWHLRLGHVSERGI 428
            + +  S D+L   D T LWH+RLGH+SE G+
Sbjct: 418 GNIAAVS-DSLYNSDATNLWHMRLGHMSEIGL 448


>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301702|pir||E84601 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1333

 Score =  235 bits (600), Expect = 4e-60
 Identities = 158/446 (35%), Positives = 247/446 (54%), Gaps = 41/446 (9%)

Query: 3   DIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQM-----DVHLTPAEKTE------- 50
           ++EKF G  D+ +WK K+ A L       ALK E  +     ++ LT  E+ E       
Sbjct: 7   EVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLRREL 66

Query: 51  ---MNDKAVSAIILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYR 107
                 KA SAI+L + D+VLR++ +E +A +M   LD LYM+K+L +R   KQ+LY ++
Sbjct: 67  LEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLYSFK 126

Query: 108 MVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*G-T 166
           M E+  I   + EF +II DL N +V + DED+A+ L  +LP+ F+  +DT+ YG    T
Sbjct: 127 MSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLGRVT 186

Query: 167 ITLEEVQAALRTKELTKFKELK-VEDSGEGLNVSRERSQNRGKGK---GKNSRSKSRSKG 222
           ++L+EV AA+ +KEL      K ++   EGL V +E+++ RG+ +     N+  KSRSK 
Sbjct: 187 LSLDEVVAAIYSKELELGSNKKSIKGQAEGLFV-KEKTETRGRTEQRGNNNNNKKSRSK- 244

Query: 223 DGNKTQYKCFICHNPGHFKKDCPERKGNGGGNPSVQIASNEEGYESAGALTVTSWEPEKG 282
             ++++  C+IC               NG  N      S   G   + AL+ T    E  
Sbjct: 245 --SRSKKGCWIC-----------GESSNGSSN-----YSEANGLYVSEALSSTDIHLEDE 286

Query: 283 WVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDV 342
           WV+D+GCSYH+  ++E+FE L  + GG V +GN    K++ IGTIR+K        L +V
Sbjct: 287 WVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTVSKVRGIGTIRVKNEAGMVVRLTNV 346

Query: 343 RYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGSKIHGLYILEGSTVIADA 402
           RYIP++ RNL+S+  F+  GY  ++E G + I  G  ++    + + LY+L+   V  + 
Sbjct: 347 RYIPEMDRNLLSLGTFEKSGYSFKLENGTLSIIAGDSVLLTVRRCYTLYLLQWRPV-TEE 405

Query: 403 SVASVDTLDVTKLWHLRLGHVSERGI 428
           S++ V   D T LWH RLGH+S++ +
Sbjct: 406 SLSVVKRQDDTILWHRRLGHMSQKNM 431


>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
           sativa (japonica cultivar-group)]
          Length = 1373

 Score =  232 bits (592), Expect = 3e-59
 Identities = 165/457 (36%), Positives = 243/457 (53%), Gaps = 26/457 (5%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCV-EALKGEAQMDVHLTPAEKTEMNDKAVSAI 59
           K+D+        F LW+VKMR IL Q     EAL    +     T AE+   + KA++ I
Sbjct: 2   KFDLPLLNYDTRFSLWQVKMRGILAQTHDYDEALDNFGKRRAEWT-AEEIRKDQKALALI 60

Query: 60  ILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 119
            L L + +L+E   E T+  +  KL+S+ M+K L  +  +K +L+  +M E   ++  + 
Sbjct: 61  QLHLHNDILQECLTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMA 120

Query: 120 EFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTK 179
           EF KI+ DL +++V  +DED  L L C+LP S+ NF+DT+L  +   +TL+EV  AL+ K
Sbjct: 121 EFKKIVADLVSMEVKYDDEDLGLLLLCSLPNSYANFRDTILLSR-DELTLKEVYDALQNK 179

Query: 180 ELTKF---KELKVEDSGEGLNVSRERSQNR-GKGKGKNSRSKSRSKGDGNKTQYKCFIC- 234
           E  K     +      GE L+V R R++NR    K  + R +S+SK  GNK    C  C 
Sbjct: 180 EKMKIMVQNDGSSSSKGEALHV-RGRTENRTSNEKNYDRRGRSKSKPPGNKK--FCVYCK 236

Query: 235 ---HNPGHFKK-DCPERKGNGGGNPSVQIASNEEGYESAGALTVTSW--EPEKGWVLDSG 288
              HN    KK    ERK    G  SV  A+  +  +S   L V +        W+LDS 
Sbjct: 237 LKNHNIDECKKVQAKERKNKKDGKVSVASAAASDD-DSGDCLVVFAGCVAGHDEWILDSA 295

Query: 289 CSYHICPRKEYFEMLE-LEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVRYIPK 347
           CS+HIC ++ +F   + +++G VV +G++  C I  IG++++K  D     LK+VRYIP 
Sbjct: 296 CSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYIPG 355

Query: 348 LRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGS-KIHGLYILEGSTVIADASVAS 406
           + RNLIS+S  D  GY      GV+++S G+L+  KG      LY+L G T+    S A+
Sbjct: 356 MSRNLISLSTLDAEGYKYSGSDGVLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGSDSAAA 415

Query: 407 VDTLD---VTKLWHLRLGHVSERG---IWLN*LNKGC 437
             T D    T LWH+RLGH+S  G   +    L KGC
Sbjct: 416 AITNDEPSKTNLWHMRLGHMSHLGMTELMKRNLLKGC 452


>ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sativa]
           gi|14029020|gb|AAK52561.1| Putative retroelement pol
           polyprotein [Oryza sativa]
          Length = 1326

 Score =  231 bits (590), Expect = 6e-59
 Identities = 156/448 (34%), Positives = 245/448 (53%), Gaps = 31/448 (6%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCV-EALKGEAQMDVHLTPAEKTEMNDKAVSAI 59
           K+D+        F LW+VKMRAIL Q   + EAL+   +       AE+   + KA+  I
Sbjct: 5   KYDLPLLDYKTRFSLWQVKMRAILAQTSDLDEALESFGKKKSTEWTAEEKRKDRKALLLI 64

Query: 60  ILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 119
            L L + +L+EV +E TA  +  KL+S+ M+K L  +  +K +L+ +++ ES  ++  ++
Sbjct: 65  QLHLSNDILQEVLQEKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 124

Query: 120 EFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTK 179
            F +I+ DL +I+V  +DED  L L C+LP S+ NF+DT+L  +   +TL EV  AL+ +
Sbjct: 125 VFKEIVVDLVSIEVQFDDEDLGLLLLCSLPSSYANFRDTILLSR-DELTLAEVYEALQNR 183

Query: 180 ELTKFKELKVEDS----GEGLNV---SRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCF 232
           E  K K +   D+    GE L V   S +R+ N    + K S+S+ RSK  G K    C 
Sbjct: 184 E--KMKGMVQSDASSSKGEALQVRGRSEQRTYNDSSDRDK-SQSRGRSKSRGKKF---CK 237

Query: 233 ICHNPGHFKKDC------PERKGNGGGNPSVQIASNEEGYESAGALTVTSW--EPEKGWV 284
            C    HF ++C       +RK +G       + ++ E  +S   L V +        W+
Sbjct: 238 YCKKKNHFIEECWKLQNKEKRKSDG----KASVVTSAENSDSGDCLVVFAGCVASHDEWI 293

Query: 285 LDSGCSYHICPRKEYFEMLE-LEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVR 343
           LD+ CS+HIC  +++F   + ++ G VV +G++   +I  IG++++K  D     LKDVR
Sbjct: 294 LDTACSFHICINRDWFSSYKSVQNGDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVR 353

Query: 344 YIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGS-KIHGLYILEGSTVIADA 402
           +IP + RNLIS+S  D  GY      GV+++S G+L+   G      LY+L GST+    
Sbjct: 354 HIPGMARNLISLSTLDAEGYKYSSSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSV 413

Query: 403 SVASV--DTLDVTKLWHLRLGHVSERGI 428
           + A+V  D    T LWH+RLGH+SE G+
Sbjct: 414 TAAAVSKDEPIKTNLWHMRLGHMSELGM 441


>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
           gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
           polyprotein from transposon TNT 1-94 [Contains: Protease
           ; Reverse transcriptase ; Endonuclease]
          Length = 1328

 Score =  230 bits (586), Expect = 2e-58
 Identities = 159/449 (35%), Positives = 234/449 (51%), Gaps = 37/449 (8%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAII 60
           K+++ KF G N F  W+ +MR +LIQQ   + L  +++    +   +  +++++A SAI 
Sbjct: 5   KYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAIR 64

Query: 61  LCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 120
           L L D V+  +  E TA  +  +L+SLYM+K+L ++  LK+QLY   M E    +  L  
Sbjct: 65  LHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNV 124

Query: 121 FNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTKE 180
           FN +I  LAN+ V +E+EDKA+ L  +LP S++N   T+L+GK  TI L++V +AL   E
Sbjct: 125 FNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGK-TTIELKDVTSALLLNE 183

Query: 181 LTKFKELKVEDSG-----EGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICH 235
             +    K E+ G     EG   S +RS N     G   +SK+RSK         C+ C+
Sbjct: 184 KMR---KKPENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSK----SRVRNCYNCN 236

Query: 236 NPGHFKKDCPE-RKGNG---------------GGNPSVQIASNEEGYESAGALTVTSWEP 279
            PGHFK+DCP  RKG G                 N +V +  NEE  E    L+     P
Sbjct: 237 QPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEE--EECMHLS----GP 290

Query: 280 EKGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLL 339
           E  WV+D+  S+H  P ++ F      + G V +GN    KI  IG I +K       +L
Sbjct: 291 ESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVL 350

Query: 340 KDVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGSKIHGLYILEGSTVI 399
           KDVR++P LR NLIS    D  GY +       R++ G+L+IAKG     LY        
Sbjct: 351 KDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQ 410

Query: 400 ADASVASVDTLDVTKLWHLRLGHVSERGI 428
            + + A  D + V  LWH R+GH+SE+G+
Sbjct: 411 GELNAAQ-DEISV-DLWHKRMGHMSEKGL 437


>emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana]
           gi|4539406|emb|CAB40039.1| putative retrotransposon
           [Arabidopsis thaliana] gi|7444416|pir||T04181
           hypothetical protein F7L13.40 - Arabidopsis thaliana
          Length = 1230

 Score =  229 bits (585), Expect = 2e-58
 Identities = 155/457 (33%), Positives = 232/457 (49%), Gaps = 57/457 (12%)

Query: 3   DIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEK-------------T 49
           ++EKF G  D+ LWK K+ A +       AL+    +   L   E+              
Sbjct: 7   EMEKFDGHGDYTLWKEKLMAHMDLLGLTVALRETQSVSDPLESEEEGKESEKGDKEALME 66

Query: 50  EMNDKAVSAIILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMV 109
           E   KA S I+L + D+VLR+  +E TA SM   LD LYM+K+L +R  LKQ+LY Y+M 
Sbjct: 67  EKRQKARSTIVLSVSDQVLRKSKKEKTAPSMLEALDKLYMSKALPNRIYLKQKLYSYKMQ 126

Query: 110 ESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK-*GTIT 168
           E+  +   + EF ++I DL N +V + DED+A+ L  +LP+ F+  KDT+ YG    T++
Sbjct: 127 ENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSLPKQFDQLKDTLKYGSGRTTLS 186

Query: 169 LEEVQAALRTKELTKFKELK-VEDSGEGLNVSRERSQNRG----KGKGKNSRSKSRSKGD 223
           ++EV AA+ +KEL      K +    EGL V +++ + RG    K KG   RS+SRSKG 
Sbjct: 187 VDEVVAAIYSKELELGSNKKSIRGQAEGLYV-KDKPETRGMSEQKEKGNKGRSRSRSKG- 244

Query: 224 GNKTQYKCFICHNPGHFKKDCPER---------KGNGGGNPSVQIASNE---EGYESAGA 271
                  C+IC   GHFK  CP +         + +G    +  I  N     GY  + A
Sbjct: 245 ----WKGCWICGEEGHFKTSCPNKGKQQNKGKDQASGSKGEAATIKGNTSEGSGYYVSEA 300

Query: 272 LTVTSWEPEKGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKM 331
           L  T       WV+D+GC+YH+  +KE+FE L  + GG V +GN    K +         
Sbjct: 301 LHSTDVNLGNEWVMDTGCNYHMTHKKEWFEELSEDAGGTVRMGNKSTSKFR--------- 351

Query: 332 FDDRDFLLKDVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGSKIHGLY 391
                     V+YIP + RNL+S+   +  GY    + GV+ +  G   +  GS+   LY
Sbjct: 352 ----------VKYIPDMDRNLLSMGTLEEHGYSFESKNGVLVVKEGTRTLLIGSRHEKLY 401

Query: 392 ILEGSTVIADASVASVDTLDVTKLWHLRLGHVSERGI 428
           +L+G   ++  S+      D T LWH RLGH+S++ +
Sbjct: 402 LLQGKPEVSH-SMTVERRNDDTVLWHRRLGHISQKNM 437


>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|53370655|gb|AAU89150.1| integrase core domain
           containing protein [Oryza sativa (japonica
           cultivar-group)] gi|40538906|gb|AAR87163.1| putative
           polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1322

 Score =  225 bits (573), Expect = 6e-57
 Identities = 150/446 (33%), Positives = 240/446 (53%), Gaps = 27/446 (6%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCV-EALKGEAQMDVHLTPAEKTEMNDKAVSAI 59
           K+D+        F LW+VKMRA+L Q   + EAL+   +       AE+   + KA+S I
Sbjct: 5   KYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 64

Query: 60  ILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 119
            L L + +L+EV ++ TA  +  KL+S+ M+K L  +  +K +L+ +++ ES  ++  ++
Sbjct: 65  QLHLSNDILQEVLQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHIS 124

Query: 120 EFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTK 179
            F +I+ DL +++V  +DED  L L C+LP S+ NF+ T+L  +   +TL EV  AL+ +
Sbjct: 125 VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSR-DELTLAEVYEALQNR 183

Query: 180 ELTK--FKELKVEDSGEGLNV---SRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFIC 234
           E  K   +       GE L V   S +R+ N      K S+S+ RSK  G K    C  C
Sbjct: 184 EKMKGMVQSYASSSKGEALQVRGRSEQRTYNDSNDHDK-SQSRGRSKSRGKKF---CKYC 239

Query: 235 HNPGHFKKDC------PERKGNGGGNPSVQIASNEEGYESAGALTVTSW--EPEKGWVLD 286
               HF ++C       +RK +G       + ++ E  +S   L V +        W+LD
Sbjct: 240 KKKNHFIEECWKLQNKEKRKSDG----KASVVTSAENSDSGDCLVVFAGYVASHDEWILD 295

Query: 287 SGCSYHICPRKEYFEMLE-LEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVRYI 345
           + CS+HIC  +++F   + ++   VV +G++   +I  IG++++K  D     LKDVR+I
Sbjct: 296 TACSFHICINRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHI 355

Query: 346 PKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGS-KIHGLYILEGSTVIADASV 404
           P + RNLIS+S  D  GY      GV+++S G+L+   G      LY+L GST+    + 
Sbjct: 356 PGMARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTA 415

Query: 405 ASV--DTLDVTKLWHLRLGHVSERGI 428
           A+V  D    T LWH+RLGH+SE G+
Sbjct: 416 AAVTKDEPSKTNLWHMRLGHMSELGM 441


>gb|AAP53029.1| putative retrotransposon-related protein [Oryza sativa (japonica
           cultivar-group)] gi|37532880|ref|NP_920742.1| putative
           retrotransposon-related protein [Oryza sativa (japonica
           cultivar-group)] gi|22655747|gb|AAN04164.1| Putative
           retrotransposon protein [Oryza sativa (japonica
           cultivar-group)] gi|16905223|gb|AAL31093.1| putative
           retrotransposon-related protein [Oryza sativa]
          Length = 1229

 Score =  222 bits (566), Expect = 4e-56
 Identities = 152/448 (33%), Positives = 242/448 (53%), Gaps = 31/448 (6%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCV-EALKGEAQMDVHLTPAEKTEMNDKAVSAI 59
           K+D+        F LW+VKMRA+L Q   + EAL+   +       AE+   + KA+S I
Sbjct: 2   KYDLPLQDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 61

Query: 60  ILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 119
            L L + +L++V +E TA  +  KL+S+ M+K L  +  +K +L+ +++ ES  ++  ++
Sbjct: 62  QLHLSNDILQKVLQEKTAAELWFKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 121

Query: 120 EFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTK 179
            F +II DL +++V  +DED  L L C+LP  + NF+DT+L  +   +TL EV  AL+ +
Sbjct: 122 VFKEIIADLVSMEVQFDDEDLGLLLLCSLPSLYANFRDTILLSR-DELTLAEVYEALQNR 180

Query: 180 ELTKFKELKVEDS----GEGLNV---SRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCF 232
           E  K K +   D+    G+ L V   S +R+ N    + K S+S+ RSK  G K    C 
Sbjct: 181 E--KMKGMVQSDASSSKGKALQVRGRSEQRTYNDSNDRDK-SQSRGRSKSRGKKF---CK 234

Query: 233 ICHNPGHFKKDC------PERKGNGGGNPSVQIASNEEGYESAGALTVTSW--EPEKGWV 284
            C    HF ++C       +RK +G       + ++ E  +SA  L   +        W+
Sbjct: 235 YCKKKNHFIEECWKLQNKEKRKSDG----KASVVTSAENSDSADCLVFFAGCVASHDEWI 290

Query: 285 LDSGCSYHICPRKEYFEM-LELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVR 343
           LD+ C + IC  +++F     ++ G VV +G+N   +I  IG++++K  D     LKDVR
Sbjct: 291 LDTACLFLICINRDWFSSHKSVQNGDVVRMGDNNPREIMGIGSVQIKTHDGMTRTLKDVR 350

Query: 344 YIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGS-KIHGLYILEGSTVIADA 402
           +IP + RNLIS+S  D  GY      GV+++S G+L+   G      LY+L GST+    
Sbjct: 351 HIPGMARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSL 410

Query: 403 SVASV--DTLDVTKLWHLRLGHVSERGI 428
           + A+V  D    T LWH+RLGH+SE G+
Sbjct: 411 TAAAVSKDEPSKTNLWHMRLGHMSELGM 438


>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
          Length = 1298

 Score =  219 bits (559), Expect = 2e-55
 Identities = 155/442 (35%), Positives = 242/442 (54%), Gaps = 31/442 (7%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEK-TEMNDKAVSAI 59
           K++IEKF G N F LWK+K++AIL +  C+ A+   ++  V  T  +K +EMN+ A++ +
Sbjct: 4   KFEIEKFNGKN-FSLWKLKVKAILRKDNCLAAI---SERPVDFTDDKKWSEMNEDAMADL 59

Query: 60  ILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 119
            L + D VL  +  + TA  + + L+ LY  KSL ++  LK++LY  RM ES  + E L 
Sbjct: 60  YLSIADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTEHLN 119

Query: 120 EFNKIIDDLANIDVNLEDEDKALHLPCALPRSFE----NFKDTMLYGK*GTITLEEVQAA 175
             N +   L ++   +E +++A  L  +LP S++    N  + +L      +  ++V AA
Sbjct: 120 TLNTLFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDY---LVFDDVAAA 176

Query: 176 LRTKELT-KFKELKVED--SGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCF 232
           +  +E   K KE +  +    E L V R RS  RG+  G+  RSKS      +K    C+
Sbjct: 177 VLEEESRRKNKEDRQVNLQQAEALTVMRGRSTERGQSSGRG-RSKS------SKKNLTCY 229

Query: 233 ICHNPGHFKKDCPERKGNGGGNPSVQIASNEEGYESAGALTVTSWEPEKG----WVLDSG 288
            C   GH KKDC     N   NP   +AS  +   +       + E  K     W++DSG
Sbjct: 230 NCGKKGHLKKDCWNLAQNS--NPQGNVASTSDDGSALCCEASIAREGRKRFADIWLIDSG 287

Query: 289 CSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVRYIPKL 348
            +YH+  RKE+F   E   GG V   ++ A +I  IGTI+LKM+D     ++DVR++  L
Sbjct: 288 ATYHMTSRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGL 347

Query: 349 RRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGSKI-HGLYILEGSTV-IADASVAS 406
           ++NL+S  + D        ++GVM+I  GAL++ KG KI   LY+L+G T+  A+ASVA+
Sbjct: 348 KKNLLSYGILDNSATQIETQKGVMKIFQGALVVMKGEKIAANLYMLKGETLQEAEASVAA 407

Query: 407 VDTLDVTKLWHLRLGHVSERGI 428
               D T LWH +LGH+S++G+
Sbjct: 408 CSP-DSTLLWHQKLGHMSDQGM 428


>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1243

 Score =  214 bits (546), Expect = 8e-54
 Identities = 139/439 (31%), Positives = 238/439 (53%), Gaps = 34/439 (7%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAII 60
           K+D+        F LW+VKMRA+L QQ   +AL G  +     +  EK + + KA+S I 
Sbjct: 5   KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEK-KRDRKAISYIH 63

Query: 61  LCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 120
           L L + +L+EV +E TA  +  KL+ + MTK L  +  LKQ+L+ +++ + + +M+ L+ 
Sbjct: 64  LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSA 123

Query: 121 FNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTKE 180
           F +I+ DL +++V  +++D  L L C+LP S+ NF+ T+LY +  T+TL+EV  A   KE
Sbjct: 124 FKEIVADLESMEVKYDEDDLGLILLCSLPSSYANFRGTILYSR-DTLTLKEVYDAFHAKE 182

Query: 181 LTKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDG-------NKTQYK-CF 232
                ++K   + EG N   E    RG+ + KN++++SR K          ++ +YK C 
Sbjct: 183 -----KMKKMVTSEGSNSQAEGLVVRGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYKSCK 237

Query: 233 ICHNPGHFKKDCPERKGNGGGNPSVQIASNEEGYESAGALTVTSWEPEKGWVLDSGCSYH 292
            C   GH   +C + + +        I   ++  E   A+        +  V  +GC+  
Sbjct: 238 YCKRDGHDISECWKLQ-DKDKRTGKYIPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQ- 295

Query: 293 ICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVRYIPKLRRNL 352
               +++F   E  +GG V +G++  C++  IGT+++KMFD     L DV++IP L+R+L
Sbjct: 296 -TSDQDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNLKRSL 354

Query: 353 ISISMFDGLGYCTRIERGVMRISHGALIIAK-GSKIHGLYILEGSTVIADASVA--SVDT 409
           IS+              G+++++ G+L++ K   K   LY L G+T++ + +    S+  
Sbjct: 355 ISL-------------YGILKVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAVFDSLSN 401

Query: 410 LDVTKLWHLRLGHVSERGI 428
            D T LWH+RLGH+SE G+
Sbjct: 402 SDATNLWHMRLGHMSEIGL 420


>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301697|pir||B84512 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1335

 Score =  214 bits (545), Expect = 1e-53
 Identities = 135/404 (33%), Positives = 213/404 (52%), Gaps = 22/404 (5%)

Query: 48  KTEMNDKAVSAIILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYR 107
           + E  DKA + I L + DKVLR++    TA      LD L+M +SL HR   +   Y ++
Sbjct: 42  RLERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETLDRLFMIRSLPHRVYTQLSFYTFK 101

Query: 108 MVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GT- 166
           M E+K I E + +F KI+ DL ++ +++ DE +A+ L  +LP  ++   +TM Y      
Sbjct: 102 MQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILLLSSLPARYDGLVETMKYSNSREK 161

Query: 167 ITLEEVQAALRTKELTKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSRSKG-DGN 225
           + L++V  A R KE    +  +    G   + +R R   +   +G   +++SRSK  DG 
Sbjct: 162 LRLDDVMVAARDKERELSQNNRPVVEG---HFARGRPDGKNNNQGNKGKNRSRSKSADGK 218

Query: 226 KTQYKCFICHNPGHFKKDC------PERKGNGGGNPSVQIASNEEGYESAGALTVTSW-- 277
           +    C+IC   GHFKK C       + K  G  N    +A + E +  A  L  T    
Sbjct: 219 RV---CWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAKSTEAFNPAMVLLATDETL 275

Query: 278 ----EPEKGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFD 333
                    WVLD+GCS+H+ PRK++F+  +    G V +GN+    ++ IG+I+++  D
Sbjct: 276 VVTDSIANEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYSPVKGIGSIKIRNSD 335

Query: 334 DRDFLLKDVRYIPKLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGSKIHGLYIL 393
               +L DVRY+P + RNLIS+   +  G   + + G+++I  G   I KG K   LYIL
Sbjct: 336 GSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDGILKIVKGCSTILKGQKRDTLYIL 395

Query: 394 EGSTVIADASVASVDTLDVTKLWHLRLGHVSERGIWLN*LNKGC 437
           +G T   + S +S +  D T LWH RLGH+S++G+ +  + KGC
Sbjct: 396 DGVTEEGE-SHSSAEVKDETALWHSRLGHMSQKGMEIL-VKKGC 437


>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
          Length = 1328

 Score =  214 bits (545), Expect = 1e-53
 Identities = 146/442 (33%), Positives = 224/442 (50%), Gaps = 22/442 (4%)

Query: 1   KWDIEKFTGSND-FGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAI 59
           K+++ KF G    F +W+ +M+ +LIQQ   +AL G+++    +   +  E+++KA SAI
Sbjct: 5   KYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAASAI 64

Query: 60  ILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 119
            L L D V+  +  E +A  +  KL++LYM+K+L ++  LK+QLY   M E    +  L 
Sbjct: 65  RLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSHLN 124

Query: 120 EFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTK 179
             N +I  LAN+ V +E+EDK + L  +LP S++    T+L+GK  +I L++V +AL   
Sbjct: 125 VLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGK-DSIQLKDVTSALLLN 183

Query: 180 ELTKFKELKVEDSGE-GLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPG 238
           E  +    K E+ G+  +  SR RS  R       S ++ +SK         C+ C  PG
Sbjct: 184 EKMR---KKPENHGQVFITESRGRSYQRSSSNYGRSGARGKSKVRSKSKARNCYNCDQPG 240

Query: 239 HFKKDCPERKGNGG--------GNPSVQIASNEEGY----ESAGALTVTSWEPEKGWVLD 286
           HFK+DCP  K   G         N +  + +N++      E    + +   E E  WV+D
Sbjct: 241 HFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTESE--WVVD 298

Query: 287 SGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVRYIP 346
           +  SYH  P ++ F      + G V +GN    KI  IG I  K       +LKDVR++P
Sbjct: 299 TAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLKDVRHVP 358

Query: 347 KLRRNLISISMFDGLGYCTRIERGVMRISHGALIIAKGSKIHGLYILEGSTVIADASVAS 406
            LR NLIS    D  GY         R++ GAL+IAKG     LY    +  I    + +
Sbjct: 359 DLRMNLISGIALDQDGYENYFANQKWRLTKGALVIAKGVARGTLY--RTNAEICQGELNA 416

Query: 407 VDTLDVTKLWHLRLGHVSERGI 428
               +   LWH R+GH SE+G+
Sbjct: 417 AHEENSADLWHKRMGHTSEKGL 438


>ref|NP_916849.1| retrovirus-related pol polyprotein from transposon TNT 1-94-like
           [Oryza sativa (japonica cultivar-group)]
          Length = 425

 Score =  211 bits (538), Expect = 6e-53
 Identities = 133/403 (33%), Positives = 218/403 (54%), Gaps = 21/403 (5%)

Query: 1   KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAII 60
           K+++ KF G+ +F LW+++++ +L QQ   +AL  E  M   +   +  EM  +A + I 
Sbjct: 8   KFEMVKFDGTGNFVLWQMRLKDLLAQQGISKAL--EETMPEKMDAGKWEEMKAQAAATIR 65

Query: 61  LCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 120
           L L D V+ +V  E T   + +KL SLYM+KSL  +  LKQQLY  +M E   + + +  
Sbjct: 66  LSLSDSVMYQVMDEKTPKEIWDKLASLYMSKSLTSKLYLKQQLYGLQMQEESDLRKHVDV 125

Query: 121 FNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTKE 180
           FN+++ DL+ +DVNL DEDKA+ L C+LP S+E+   T+ +GK  TI  EE+ ++L  ++
Sbjct: 126 FNQLVVDLSKLDVNLYDEDKAIILLCSLPPSYEHVVTTLTHGK-DTIKTEEIISSLLARD 184

Query: 181 LTKFK--ELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPG 238
           L + K  E       E L V  +     G        SKS  KG       +C+ CH  G
Sbjct: 185 LRRSKKNEAMEASQAESLLVKAKHDHEAGV-------SKSNEKG------ARCYKCHEFG 231

Query: 239 HFKKDCPERKGNGGGNPSVQIASNEEGYESAGALTVTSWEPEKGWVLDSGCSYHICPRKE 298
           H +++CP  K    G  S+    ++    S   LTV++ +  + W+LDS  SYH+  ++E
Sbjct: 232 HIRRNCPLLKKRKDGIASLAARGDDSDSSSHEILTVSNEKSGEAWMLDSASSYHVTSKRE 291

Query: 299 YFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMFDDRDFLLKDVRYIPKLRRNLISISMF 358
           +F   +  + GVV LG++ +  +  +  ++ KM+D  + LL DVR++P LR++LIS+   
Sbjct: 292 WFFSYKSGDFGVVYLGDDTSYHVVGVDDVKFKMYDGNEVLLSDVRHVPGLRKSLISLGSL 351

Query: 359 DGLGYCTRI--ERGVMRISHGALIIAKGSKIHG-LYILEGSTV 398
              G+  ++  +R  M I      +  G +    LY L+G+ V
Sbjct: 352 HETGWLYQVDFDRKTMNIMKDGKTVMTGERTSSCLYKLQGNAV 394


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.344    0.151    0.505 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,280,109,185
Number of Sequences: 2540612
Number of extensions: 49254429
Number of successful extensions: 230256
Number of sequences better than 10.0: 873
Number of HSP's better than 10.0 without gapping: 451
Number of HSP's successfully gapped in prelim test: 425
Number of HSP's that attempted gapping in prelim test: 227754
Number of HSP's gapped (non-prelim): 1563
length of query: 866
length of database: 863,360,394
effective HSP length: 137
effective length of query: 729
effective length of database: 515,296,550
effective search space: 375651184950
effective search space used: 375651184950
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.6 bits)
S2: 80 (35.4 bits)


Medicago: description of AC144730.2