Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0220.7
         (720 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1...   579  e-163
gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]         573  e-162
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]               559  e-157
pir||T02206 hypothetical protein - common tobacco retrotransposo...   550  e-155
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]                             509  e-143
ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sa...   471  e-131
ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultiv...   464  e-129
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-...   464  e-129
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult...   462  e-128
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu...   461  e-128
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-...   451  e-125
gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult...   439  e-121
ref|XP_476137.1| putative polyprotein [Oryza sativa (japonica cu...   436  e-120
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi...   430  e-119
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu...   422  e-116
gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis ...   413  e-114
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi...   407  e-112
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi...   404  e-111
emb|CAB79135.1| putative transposable element [Arabidopsis thali...   401  e-110
emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir|...   398  e-109

>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
           gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
           polyprotein from transposon TNT 1-94 [Contains: Protease
           ; Reverse transcriptase ; Endonuclease]
          Length = 1328

 Score =  579 bits (1492), Expect = e-163
 Identities = 305/714 (42%), Positives = 446/714 (61%), Gaps = 27/714 (3%)

Query: 5   KVKIERFDGRD-FGFWKMLMEDYLYQKMLYQPLT--GKKPNDMKQEDWDLLDRQALGVIR 61
           K ++ +F+G + F  W+  M D L Q+ L++ L    KKP+ MK EDW  LD +A   IR
Sbjct: 5   KYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAIR 64

Query: 62  LTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINS 121
           L LS +V  NI++E T   +   L ++Y      NK++L ++L+ L M EG +   H+N 
Sbjct: 65  LHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNV 124

Query: 122 FNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSE 181
           FN +I+QL+++ +  + E   + LL SLP S+    T + +     +LK D    L+L+E
Sbjct: 125 FNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELK-DVTSALLLNE 183

Query: 182 DIRRKDSGESSNTFGSALNTESRGRGSQKS-HNQSQGRGRSKSRGRSQTRVRNDITCWNC 240
            +R+K   +     G AL TE RGR  Q+S +N  +   R KS+ RS++RVRN   C+NC
Sbjct: 184 KMRKKPENQ-----GQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRN---CYNC 235

Query: 241 DRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLI--------CSLDSPVDSWV 292
           ++ GHF   C  PRK K       +D++  A  +   + ++          L  P   WV
Sbjct: 236 NQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWV 295

Query: 293 IDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRH 352
           +D+ AS H  P ++L   Y+ G FG V + +     I GIGDI I+++ G    L +VRH
Sbjct: 296 VDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRH 355

Query: 353 VPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYM----VAEEDMIA 408
           VP ++ NLIS   LD +GY + F    W++TKG+LV+A+G  RG+LY     + + ++ A
Sbjct: 356 VPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGELNA 415

Query: 409 VTEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAG 468
             + I S  +WH+R+GHMSEKG++I+A K  +S  K   +  C++C+ GKQ +VSF  + 
Sbjct: 416 AQDEI-SVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSS 474

Query: 469 RKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWK 528
            + K   L+LV++DV GP  ++S+GG++Y+VTFIDD++RK+WVY LK+K  VF VF+K+ 
Sbjct: 475 ER-KLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFH 533

Query: 529 TEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLN 588
             VE +TG K+K L+SDNGGEY S+EF+++CS +GIR  KT+PGTP+ NGVAERMNRT+ 
Sbjct: 534 ALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIV 593

Query: 589 ERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGC 648
           E+ R M   + LPK FW +A+ TA YLINR PSVPL +++PE VW  KEVS SHLKVFGC
Sbjct: 594 EKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGC 653

Query: 649 VSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNES 702
            ++  +  ++R KLD K+I C FIGYG + +GYR WD   +K+IRS +V F ES
Sbjct: 654 RAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRES 707


>gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]
          Length = 1415

 Score =  573 bits (1477), Expect = e-162
 Identities = 298/718 (41%), Positives = 439/718 (60%), Gaps = 22/718 (3%)

Query: 1   LEEGKVKIERFDGRDFGFWKMLMEDYLYQKMLYQPL-TGKKPNDMKQEDWDLLDRQALGV 59
           +E     + R +GR++  WK  M+D L+ K L+ P+    KP +M  E+WD   +Q  G 
Sbjct: 1   METNTSNMVRLNGRNYHIWKAKMKDLLFVKKLHLPVFASAKPENMSDEEWDFEHQQVCGY 60

Query: 60  IRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHI 119
           IR  +  NV  +I+NE     L   L  +Y      NK+ L++++ N+R  EG  + +H+
Sbjct: 61  IRQWVEDNVLNHIINETHARSLWNKLETLYASKTGNNKLFLLKQMMNIRYREGTLINDHV 120

Query: 120 NSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLIL 179
           N F  ++ QLS + I F++E++ L LL +LPDSW     +++NSA +  +  + ++  IL
Sbjct: 121 NDFQGVLDQLSGMGIKFEDEVLGLWLLNTLPDSWETFRVSLTNSAPNGVVTMEYVKSGIL 180

Query: 180 SEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWN 239
           +E+ RR+ S ++S +    L T+ RGR  QK       RGR KSR +S++R + DI C  
Sbjct: 181 NEEARRR-SQDTSTSQSDILVTDDRGRNKQKGQ-----RGRDKSRSKSRSRYK-DIECHY 233

Query: 240 CDRKGHFTNQC-KAPRKKKNYQKR*DDDESANAATEEVADTLICSLDSPVD------SWV 292
           C +K H      K  R+KK   K   D ++ N      AD L+   D+ ++      +W+
Sbjct: 234 CGKKSHIKKYSFKWKREKKQDNK---DGDTGNQVATVRADLLVACDDNVINVACHETTWI 290

Query: 293 IDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRH 352
           +DSGA++H  P KE  ++Y  G FG++ + +   + + G G + + +SNGT   L NV+H
Sbjct: 291 VDSGAAYHVTPRKEFFTSYTPGDFGELRMGNDGQVKVTGTGTVCLETSNGTKLVLKNVKH 350

Query: 353 VPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYMV---AEEDMIAV 409
            P I+ NLIS G+LDD+G+   FG G WK+TKG+LVVARG K  +LY +     +D + V
Sbjct: 351 APDIRLNLISTGKLDDDGFCCFFGDGHWKITKGSLVVARGNKSSNLYSLQSSVSDDSVNV 410

Query: 410 TEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGR 469
            E   +S +WH+RLGHMS KG+  +A K K+S +K   L  C HC+ GKQR+VSF     
Sbjct: 411 VEKECASELWHKRLGHMSVKGIDYLAKKSKLSGVKEAKLDKCVHCLAGKQRRVSFMSHPP 470

Query: 470 KSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKT 529
             KSE L+L+H+DV GP  V+SLGG+ Y+VTFIDD +RK+WVY LK KSDV  VFK++  
Sbjct: 471 TRKSEPLDLIHSDVCGPMKVRSLGGASYFVTFIDDYSRKLWVYTLKHKSDVLGVFKEFHA 530

Query: 530 EVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNE 589
            VE QTG K+K +++DNGGEY    F ++C   GIR  KT P  P+ NG+AERMNRT+ E
Sbjct: 531 LVERQTGKKLKCIRTDNGGEY-CGPFDEYCRRYGIRHQKTPPKIPQLNGLAERMNRTIME 589

Query: 590 RARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCV 649
           R RCM   + LP  FW +A++TA ++IN  P + L  ++P++VW GK+VS  HL+VFGC 
Sbjct: 590 RVRCMLDDAKLPSSFWAEAVSTAVHVINLSPVIALKNEVPDKVWCGKDVSYDHLRVFGCK 649

Query: 650 SYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
           ++V +  D+R KLD K  +C FIGYG D +GYR +D   +K++RS +V F E+   +D
Sbjct: 650 AFVHVPRDERSKLDSKTRQCIFIGYGFDEFGYRLYDPVEKKLVRSRDVVFFENQTIED 707


>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
          Length = 1328

 Score =  559 bits (1441), Expect = e-157
 Identities = 293/715 (40%), Positives = 437/715 (60%), Gaps = 28/715 (3%)

Query: 5   KVKIERFDGRD--FGFWKMLMEDYLYQKMLYQPLTGK--KPNDMKQEDWDLLDRQALGVI 60
           K ++ +F+G    F  W+  M+D L Q+ L++ L GK  KP  MK EDW+ LD +A   I
Sbjct: 5   KYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAASAI 64

Query: 61  RLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHIN 120
           RL L+ +V  NIV+E++   +   L N+Y      NK++L ++L+ L M EG +   H+N
Sbjct: 65  RLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSHLN 124

Query: 121 SFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILS 180
             N +I+QL+++ +  + E   + LL SLP S+    T + +     +LK D    L+L+
Sbjct: 125 VLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLK-DVTSALLLN 183

Query: 181 EDIRRKDSGESSNTFGSALNTESRGRGSQKSH-NQSQGRGRSKSRGRSQTRVRNDITCWN 239
           E +R+K         G    TESRGR  Q+S  N  +   R KS+ RS+++ RN   C+N
Sbjct: 184 EKMRKKPENH-----GQVFITESRGRSYQRSSSNYGRSGARGKSKVRSKSKARN---CYN 235

Query: 240 CDRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLIC--------SLDSPVDSW 291
           CD+ GHF   C  P++ K       +D++  A  +   D ++          L      W
Sbjct: 236 CDQPGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTESEW 295

Query: 292 VIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVR 351
           V+D+ AS+H  P ++L   Y+ G +G V + +     I GIGDI  +++ G    L +VR
Sbjct: 296 VVDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLKDVR 355

Query: 352 HVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYM----VAEEDMI 407
           HVP ++ NLIS   LD +GY   F    W++TKG LV+A+G  RG+LY     + + ++ 
Sbjct: 356 HVPDLRMNLISGIALDQDGYENYFANQKWRLTKGALVIAKGVARGTLYRTNAEICQGELN 415

Query: 408 AVTEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKA 467
           A  E  NS+ +WH+R+GH SEKG++I++ K  +S  K   +  C + + GKQ +VSF  +
Sbjct: 416 AAHEE-NSADLWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHRVSFQTS 474

Query: 468 GRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKW 527
             + KS  L+LV++DV GP  ++S+GG++Y+VTFIDD++RK+WVY  ++K  VF VF+K+
Sbjct: 475 SER-KSNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQVFQKF 533

Query: 528 KTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTL 587
              VE +TG K K L++DNGGEY S+EF+++CS +GIR  KT+PGTP+ NGVAERMNRT+
Sbjct: 534 HALVERETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGTPQHNGVAERMNRTI 593

Query: 588 NERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFG 647
            E+ R M   + LPK FW +A+ TA YLINR PSVPL++ +PE VW  KE+S SHLKVFG
Sbjct: 594 VEKVRSMLRMAKLPKTFWGEAVRTACYLINRSPSVPLEFDIPERVWTNKEMSYSHLKVFG 653

Query: 648 CVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNES 702
           C ++  +  ++R KLD K++ C FIGYG + +GYR WD   +K+IRS +V F ES
Sbjct: 654 CKAFAHVPKEQRTKLDDKSVPCIFIGYGDEEFGYRLWDLVKKKVIRSRDVIFRES 708


>pir||T02206 hypothetical protein - common tobacco retrotransposon Tto1
           gi|1167523|dbj|BAA11674.1| ORF(AA 1-1338) [Nicotiana
           tabacum]
          Length = 1338

 Score =  550 bits (1418), Expect = e-155
 Identities = 292/724 (40%), Positives = 428/724 (58%), Gaps = 22/724 (3%)

Query: 1   LEEGKVKIERFDGRDFGFWKMLMEDYLYQKMLYQPL-TGKKPNDMKQEDWDLLDRQALGV 59
           +E    K+   +G ++  W+  M+D L+   ++ P+ + +KP D   EDW+    Q  G 
Sbjct: 1   MEARTSKMVNLNGTNYHLWRNKMKDLLFVTKMHLPVFSSQKPEDKSDEDWEFEHNQVCGY 60

Query: 60  IRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHI 119
           IR  +  NV  +I        L   L  +Y      NK+  + +L  ++  EG +V +H+
Sbjct: 61  IRQFVEDNVYNHISGVTHARSLWDKLEELYASKTGNNKLFYLTKLMQVKYVEGTTVADHL 120

Query: 120 NSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLIL 179
           N    I+ QLS + I FD+E++ L +L +LP+SW     +++NSA +  +  + ++  IL
Sbjct: 121 NEIQGIVDQLSGMGIKFDDEVLALMVLATLPESWETLKVSITNSAPNGVVNMETVKSGIL 180

Query: 180 SEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWN 239
           +E++RR+  G SS+     L   +RGR   KS +      R KSRG+S      ++ C  
Sbjct: 181 NEEMRRRSQGTSSSQ-SEVLAVTTRGRSQNKSQSN-----RDKSRGKSNKFA--NVECHY 232

Query: 240 CDRKGHFTNQCKAPR--KKKNYQKR*DDDESANAATEE------VADTLICSLDSPVDSW 291
           C +KGH    C+  +  +KKN  K+   +ES++  T        V D  I +L +   +W
Sbjct: 233 CKKKGHIKRFCRQFQNDQKKNKGKKVKPEESSDDETNSFGEFNVVYDDDIINLTTQEMTW 292

Query: 292 VIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVR 351
           VIDSGA+ H  P +EL S+Y  G FG+V + +     +VG GD+ + + NG    L +VR
Sbjct: 293 VIDSGATIHATPRRELFSSYTLGDFGRVKMGNANFSTVVGKGDVCLETMNGMKLLLRDVR 352

Query: 352 HVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYMVA---EEDMIA 408
           HVP ++ NLIS+ +LD+EGY  TF  G WK+TKG+L+VARG K+  LY+      + +I 
Sbjct: 353 HVPDMRLNLISVDKLDEEGYCNTFHNGQWKLTKGSLMVARGTKQSKLYVTQASISQQVIN 412

Query: 409 VTEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAG 468
           V E  ++  +WH+RLGHMSEK M  +  K  +  L  + L  C  C+ GKQ +VSF +  
Sbjct: 413 VAENDSNIKLWHRRLGHMSEKSMARLVKKNALPGLNQIQLKKCADCLAGKQNRVSFKRFP 472

Query: 469 RKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWK 528
              +   L+LVH+DV GP   KSLGG+RY+VTFIDD +RK WVY LK+K  VF VFK++ 
Sbjct: 473 PSRRQNVLDLVHSDVCGPFK-KSLGGARYFVTFIDDHSRKTWVYTLKTKDQVFQVFKQFL 531

Query: 529 TEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLN 588
           T VE +TG K+K +++DNGGEY  Q F  +C E+GIR   T P TP+ NG+AERMNRTL 
Sbjct: 532 TLVERETGKKLKCIRTDNGGEYQGQ-FDAYCKEHGIRHQFTPPKTPQLNGLAERMNRTLI 590

Query: 589 ERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGC 648
           ER RC+   S LPK FW +A+ TAAY++N  P VPL Y+ PE++W G+++S   L+VFGC
Sbjct: 591 ERTRCLLSHSKLPKAFWGEALVTAAYVLNHSPCVPLQYKAPEKIWLGRDISYDQLRVFGC 650

Query: 649 VSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDR 708
            +YV +  D+R KLD K  +C FIGYG DM GY+F+D   +K++RS +V F E    +D 
Sbjct: 651 KAYVHVPKDERSKLDVKTRECVFIGYGQDMLGYKFYDPVEKKLVRSRDVVFVEDQTIEDI 710

Query: 709 SSAE 712
              E
Sbjct: 711 DKVE 714


>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
          Length = 1298

 Score =  509 bits (1312), Expect = e-143
 Identities = 279/716 (38%), Positives = 418/716 (57%), Gaps = 32/716 (4%)

Query: 5   KVKIERFDGRDFGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWDLLDRQALGVIRLTL 64
           K +IE+F+G++F  WK+ ++  L +      ++ +  +    + W  ++  A+  + L++
Sbjct: 4   KFEIEKFNGKNFSLWKLKVKAILRKDNCLAAISERPVDFTDDKKWSEMNEDAMADLYLSI 63

Query: 65  SKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNT 124
           +  V  +I  +KT  ++   L+ +YE     NK+ L R+L+ LRM E  SVTEH+N+ NT
Sbjct: 64  ADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTEHLNTLNT 123

Query: 125 IISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSEDIR 184
           + SQL+S+    + +     LLQSLPDS+   +  ++N+   + L FDD+   +L E+ R
Sbjct: 124 LFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDYLVFDDVAAAVLEEESR 183

Query: 185 RKDSGESS-NTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRK 243
           RK+  +   N   +   T  RGR +++   QS GRGRSKS        + ++TC+NC +K
Sbjct: 184 RKNKEDRQVNLQQAEALTVMRGRSTERG--QSSGRGRSKSS-------KKNLTCYNCGKK 234

Query: 244 GHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLICSLDSP-------VDSWVIDSG 296
           GH    C    +  N Q          A+T +    L C             D W+IDSG
Sbjct: 235 GHLKKDCWNLAQNSNPQGN-------VASTSDDGSALCCEASIAREGRKRFADIWLIDSG 287

Query: 297 ASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGI 356
           A++H    KE   +Y     G VY  D   L+I+GIG I ++  +GT+ T+ +VRHV G+
Sbjct: 288 ATYHMTSRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGL 347

Query: 357 KRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKK-RGSLYMVAEEDMIAVTEAI-- 413
           K+NL+S G LD+         G  K+ +G LVV +G+K   +LYM+  E +     ++  
Sbjct: 348 KKNLLSYGILDNSATQIETQKGVMKIFQGALVVMKGEKIAANLYMLKGETLQEAEASVAA 407

Query: 414 ---NSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRK 470
              +S+ +WHQ+LGHMS++GMKI+  +  +  L  V L +CEHCI  KQ ++ FS +  +
Sbjct: 408 CSPDSTLLWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFSTSNSR 467

Query: 471 SKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTE 530
            K   LELVH+DVW  APV SLGG++Y+V+FIDD +R+ WVY +K KSDVF+ FK +K  
Sbjct: 468 GKVV-LELVHSDVW-QAPVPSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAFKAR 525

Query: 531 VENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNER 590
           VE  +G KIK  ++DNGGEY S+EF  FC + GI+   T+  TP+QNGVAERMNRTL ER
Sbjct: 526 VELDSGKKIKCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYTPQQNGVAERMNRTLLER 585

Query: 591 ARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVS 650
            R M   +GL K FW +A+NTA YL+NR PS  ++ + P E+W GK V  S+L +FG + 
Sbjct: 586 TRAMLRAAGLEKSFWAEAVNTACYLVNRAPSTAIELKTPMEMWTGKPVDYSNLHIFGSIV 645

Query: 651 YVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYK 706
           Y + ++ +  KLDPK+ KC F+GY   + GYR WD    K++ S +V F E  L +
Sbjct: 646 YAMYNAQEITKLDPKSRKCRFLGYADGVKGYRLWDPTAHKVVISRDVIFVEDRLQR 701


>ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sativa]
           gi|14029020|gb|AAK52561.1| Putative retroelement pol
           polyprotein [Oryza sativa]
          Length = 1326

 Score =  471 bits (1213), Expect = e-131
 Identities = 272/709 (38%), Positives = 408/709 (57%), Gaps = 29/709 (4%)

Query: 16  FGFWKMLMEDYLYQKM-LYQPLT--GKKPNDMKQEDWDLLDRQALGVIRLTLSKNVAFNI 72
           F  W++ M   L Q   L + L   GKK +     +    DR+AL +I+L LS ++   +
Sbjct: 17  FSLWQVKMRAILAQTSDLDEALESFGKKKSTEWTAEEKRKDRKALLLIQLHLSNDILQEV 76

Query: 73  VNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSV 132
           + EKT A+L   L ++       +K+H+  +LF+ ++ E  SV  HI+ F  I+  L S+
Sbjct: 77  LQEKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHISVFKEIVVDLVSI 136

Query: 133 KITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESS 192
           ++ FD+E + L LL SLP S+A     +  S RD     +    L   E ++     ++S
Sbjct: 137 EVQFDDEDLGLLLLCSLPSSYANFRDTILLS-RDELTLAEVYEALQNREKMKGMVQSDAS 195

Query: 193 NTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKA 252
           ++ G AL    RGR  Q+++N S  R +S+SRGRS++R +    C  C +K HF  +C  
Sbjct: 196 SSKGEALQV--RGRSEQRTYNDSSDRDKSQSRGRSKSRGKK--FCKYCKKKNHFIEECW- 250

Query: 253 PRKKKNYQKR*DDDESANAATEEVADTLICSLD-----SPVDSWVIDSGASFHTIPSKEL 307
             K +N +KR  D +++   + E +D+  C +      +  D W++D+  SFH   +++ 
Sbjct: 251 --KLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGCVASHDEWILDTACSFHICINRDW 308

Query: 308 LSNYICGKFGKVY-LADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQL 366
            S+Y   + G V  + D  P +IVGIG + I++ +G   TL +VRH+PG+ RNLIS+  L
Sbjct: 309 FSSYKSVQNGDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMARNLISLSTL 368

Query: 367 DDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYMVAEEDMI--AVTEAINS------SSI 418
           D EGY  +  GG  KV+KG+LV   G    +   V     +  +VT A  S      +++
Sbjct: 369 DAEGYKYSSSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAAVSKDEPIKTNL 428

Query: 419 WHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLEL 478
           WH RLGHMSE GM  +  +  +       +  CEHC+ GK ++V F+ +  ++K   L+ 
Sbjct: 429 WHMRLGHMSELGMAELMKRNLLDGCTQGKMKFCEHCVFGKHKRVKFNTSVHRTKGI-LDY 487

Query: 479 VHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLK 538
           VHTD+WGP+    LGG+RY +T IDD +RKVW YFLK K D F+ FK+WK  +E QT  +
Sbjct: 488 VHTDLWGPSRKAYLGGARYMLTIIDDYSRKVWPYFLKHKDDTFAAFKEWKVRIERQTEKE 547

Query: 539 IKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQS 598
           +K L++DNGGE+ S  F  +C + GI    TIP TP+QNGVAERMNRT+  +ARCM   +
Sbjct: 548 VKVLRTDNGGEFCSDAFDDYCRKEGIVRHHTIPYTPQQNGVAERMNRTIISKARCMLSNA 607

Query: 599 GLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDK 658
            + K FW +A NTA YLINR PS+PL+ + P EVW G     S L+VFGC +Y  +D+  
Sbjct: 608 RMNKRFWAEAANTACYLINRSPSIPLNKKTPIEVWSGMPADYSQLRVFGCTAYAHVDN-- 665

Query: 659 RDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
             KL+P+AIKC F+GYGS +  Y+ W+ +  K     +V FN+SV++ D
Sbjct: 666 -GKLEPRAIKCLFLGYGSGVKRYKLWNPETNKTFMRRSVVFNKSVMFND 713


>ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultivar-group)]
           gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza
           sativa (japonica cultivar-group)]
          Length = 1181

 Score =  464 bits (1193), Expect = e-129
 Identities = 271/732 (37%), Positives = 424/732 (57%), Gaps = 45/732 (6%)

Query: 16  FGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWD----LLDRQALGVIRLTLSKNVAFN 71
           F  W++ M   L Q+ L   L+G    D + +DW       DR+A+  I L LS N+   
Sbjct: 17  FSLWQVKMRAVLAQQELDDALSGF---DKRTQDWSNDEKKRDRKAMSYIHLHLSNNILQE 73

Query: 72  IVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSS 131
           ++ E+T A L   L  +       +K+HL ++LF  ++ +  SV +H+++F  I++ L S
Sbjct: 74  VLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLES 133

Query: 132 VKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGE 190
           +++ +D + + L LL SLP S+A     +  S RD  L   ++ D +   E +++    E
Sbjct: 134 MEVKYDEKDLALILLCSLPSSYANFRDTILYS-RDT-LTLKEVYDALHAKEKMKKMVPSE 191

Query: 191 SSNTFGSALNTESRGRGSQK---SHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFT 247
            SN+    L      RGSQ+   ++N+S+ +  S  RGRS++R R   +C  C R GH  
Sbjct: 192 GSNSQAEGLVV----RGSQQEKNTNNKSRDKSSSSYRGRSKSRGRYK-SCKYCKRDGHDI 246

Query: 248 NQC-KAPRKKKNYQK-----R*DDDESANAATEEVADTLI------CSLDSPVDSWVIDS 295
           ++C K   K K   K     + +++  A   T+E +D  +      C+  S  D W++D+
Sbjct: 247 SKCWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTS--DQWILDT 304

Query: 296 GASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPG 355
             ++H  P+++  + Y   + G V + D  P ++ GIG + I+  +G + TL +VRH+P 
Sbjct: 305 ACTYHMCPNRDWFATYEVVQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRHIPN 364

Query: 356 IKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGK-KRGSLYMVAEEDMIA----VT 410
           +KR+LIS+  LD +GY  + G G  KVTKG+LVV +   K  +LY +    ++     V+
Sbjct: 365 LKRSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKASIKSANLYHLQGTTILGNVATVS 424

Query: 411 EAINSS---SIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKA 467
           +++++S   ++WH RLGHMSE G+  ++ +G +       L  CEHCI GK ++V F+ +
Sbjct: 425 DSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSISKLKFCEHCIFGKHKRVKFNTS 484

Query: 468 GRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKW 527
              ++   L+ VH+D+WGPA   S GG+RY +T +DD +RKVW YFLK K   F+VFK+W
Sbjct: 485 THTTEGI-LDYVHSDLWGPARKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFNVFKEW 543

Query: 528 KTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTL 587
           KT VE QT  K+K L++DNG E+ S+ FK +C   GI    T+P TP+QNGVAERMNR +
Sbjct: 544 KTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRII 603

Query: 588 NERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFG 647
             +ARCM   +GLPK FW +A++TA YLINR PS   + + P EVW G   + S LKVFG
Sbjct: 604 ISKARCMLSNAGLPKQFWAEAVSTACYLINRSPSY-ANKKTPIEVWSGSPANYSDLKVFG 662

Query: 648 CVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
           C +Y  +D+    KL+P+AIKC F+GY S + GY+ W  + +K++ S NV F+ES++  D
Sbjct: 663 CTAYAHVDN---GKLEPRAIKCIFLGYPSSVKGYKLWCPETKKVVISRNVVFHESIMLHD 719

Query: 708 RSSAESMSSSKQ 719
           + S      S++
Sbjct: 720 KPSTNVPVESQE 731


>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
           sativa (japonica cultivar-group)]
          Length = 2340

 Score =  464 bits (1193), Expect = e-129
 Identities = 271/729 (37%), Positives = 418/729 (57%), Gaps = 38/729 (5%)

Query: 16  FGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWD----LLDRQALGVIRLTLSKNVAFN 71
           F  W++ M   L Q+ L   L+G    D +  DW       DR+A+  I L LS N+   
Sbjct: 224 FSLWQVKMRAVLAQQDLDDALSGF---DKRTHDWSNDEKKRDRKAMSYIHLHLSNNILQE 280

Query: 72  IVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSS 131
           ++ E+  A L   L  +       +K+HL + LF  ++ +  SV +H+++F  II+ L S
Sbjct: 281 VLKEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSAFKEIIADLES 340

Query: 132 VKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGE 190
           +++ +D E + L LL SLP S+A     +  S RD  L   ++ D + + E +++    E
Sbjct: 341 MEVKYDEEDLGLILLCSLPSSYANFRDTILYS-RDT-LTLKEVYDALHVKEKMKKMVPSE 398

Query: 191 SSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQC 250
            SN+    L    R +  + + NQS+ +  S  RGRS++R R   +C  C R GH   +C
Sbjct: 399 GSNSQAEGLIVWGRQQ-EKNTKNQSRDKSSSSYRGRSKSRGRYK-SCKYCKRDGHDIFEC 456

Query: 251 -----KAPRKKKNYQKR*DDDES-ANAATEEVADTLI------CSLDSPVDSWVIDSGAS 298
                K  R  K   K   ++E  A   T+E +D  +      C+  S  D W++++   
Sbjct: 457 WKLHDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTS--DQWILNTACI 514

Query: 299 FHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKR 358
           +H  P+++  + Y   + G V + D  P ++ GIG + I+  +G + TL +VRH+P +KR
Sbjct: 515 YHMCPNRDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRHIPNLKR 574

Query: 359 NLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGK-KRGSLYMVAEEDMI----AVTEAI 413
           +LIS+  LD +GY  + G G  KVTKG+LVV +   K  +LY +    ++    AV++++
Sbjct: 575 SLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDSL 634

Query: 414 NSS---SIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRK 470
           ++S   ++WH RLGHM+E G+  ++ +G +       L  CEHCI GK ++V F+ +   
Sbjct: 635 SNSDATNLWHMRLGHMTEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHT 694

Query: 471 SKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTE 530
           ++   L+ VH+D+WGPA   S GG+RY +T +DD +RKVW YFLK K   F VFK+WKT 
Sbjct: 695 TEGI-LDYVHSDLWGPARKTSFGGTRYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTM 753

Query: 531 VENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNER 590
           VE QT  K+K L++DNG E+ S+ FK +C   GI    T+P TP+QNGVAERMNRT+  +
Sbjct: 754 VERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISK 813

Query: 591 ARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVS 650
           ARC+   +GLPK FW +A++TA YLINR PS  +D + P EVW G   + S L+VFGC +
Sbjct: 814 ARCLLSNAGLPKQFWAEAVSTACYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTA 873

Query: 651 YVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRSS 710
           Y  +D+    KL+P+AIKC F+GY S + GY+ W  + +K++ S NV F+ESV+  D+ S
Sbjct: 874 YAHVDN---GKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHESVMLHDKPS 930

Query: 711 AESMSSSKQ 719
                 S++
Sbjct: 931 TNVPVESQE 939


>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
           gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1280

 Score =  462 bits (1188), Expect = e-128
 Identities = 270/730 (36%), Positives = 419/730 (56%), Gaps = 40/730 (5%)

Query: 16  FGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWD----LLDRQALGVIRLTLSKNVAFN 71
           F  W++ M   L Q+ L   L+G    D + +DW       DR+A+  I L LS N+   
Sbjct: 52  FSLWQVKMRAVLAQQDLDDALSGF---DKRTQDWSNDEKKKDRKAMSYIHLHLSNNILQE 108

Query: 72  IVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSS 131
           ++ E+T A L   L  +       +K+HL ++LF  ++ +  SV +H+++F  I++ L S
Sbjct: 109 VLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSTFKEIVADLES 168

Query: 132 VKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGE 190
           +++ +D E + L LL SLP S+A     +  S   + L   ++ D +   E +++    E
Sbjct: 169 IEVKYDEEDLGLILLCSLPSSYANFRDTILYS--HDTLILKEVYDALHAKEKMKKMVPSE 226

Query: 191 SSNTFGSALNTESRGRGSQKS-HNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQ 249
            SN+    L    RGR  +K+  NQS+ +  S  RGRS++R R   +C  C R GH  ++
Sbjct: 227 GSNSQAEGLVV--RGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYK-SCKYCKRDGHDISE 283

Query: 250 C-KAPRKKKNYQK-----R*DDDESANAATEEVADTLI------CSLDSPVDSWVIDSGA 297
           C K   K K   K     + +++  A   T+E +DT +      C+  S  D W++D+  
Sbjct: 284 CWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDTELLVAYAGCAQTS--DQWILDTAW 341

Query: 298 SFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIK 357
           ++H  P+++  + Y   + G V + D  P ++ GIG + I+  +G + TL +VRH+P +K
Sbjct: 342 TYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLSDVRHIPNLK 401

Query: 358 RNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGK-KRGSLYMVAEEDMI----AVTEA 412
           R+LIS+  LD +GY  + G G  KVTKG+LVV +   K  +LY +    ++    AV+++
Sbjct: 402 RSLISLCTLDRKGYKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDS 461

Query: 413 INSS---SIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGR 469
           +++S   ++WH RLGHMSE G+  ++ +  +       L  CEHCI GK ++V F+ +  
Sbjct: 462 LSNSDATNLWHMRLGHMSEIGLAELSKRELLDGQSIGKLKFCEHCIFGKHKRVKFNTSTH 521

Query: 470 KSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKT 529
            ++   L+ VH+D+WGPA   S GG+RY +T +DD +RKVW YFLK K   F VFK+WKT
Sbjct: 522 TTEGI-LDYVHSDLWGPACKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKT 580

Query: 530 EVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNE 589
            VE QT  K+K L++DNG E+ S+ FK +C   GI    T+P TP+QNGVAERMN  +  
Sbjct: 581 MVERQTEKKVKILRTDNGMEFCSKIFKSYCKSEGIVHHYTVPHTPQQNGVAERMNMAIIS 640

Query: 590 RARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCV 649
           +ARCM   + LPK FW +A++T  YLINR PS   D + P EVW G   + S L+VFGC 
Sbjct: 641 KARCMLSNADLPKQFWAEAVSTTCYLINRSPSYATDKKTPIEVWSGSPANYSDLRVFGCT 700

Query: 650 SYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRS 709
           +Y  +D+    KL+P+AIKC F+GY S + GY+ W  + +K++ S NV F+ESV+  D+ 
Sbjct: 701 AYAHVDN---GKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHESVILHDKP 757

Query: 710 SAESMSSSKQ 719
           S      S++
Sbjct: 758 STNVPVESQE 767


>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|53370655|gb|AAU89150.1| integrase core domain
           containing protein [Oryza sativa (japonica
           cultivar-group)] gi|40538906|gb|AAR87163.1| putative
           polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1322

 Score =  461 bits (1185), Expect = e-128
 Identities = 269/709 (37%), Positives = 407/709 (56%), Gaps = 29/709 (4%)

Query: 16  FGFWKMLMEDYLYQKM-LYQPLT--GKKPNDMKQEDWDLLDRQALGVIRLTLSKNVAFNI 72
           F  W++ M   L Q   L + L   GKK       +    DR+AL +I+L LS ++   +
Sbjct: 17  FSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLIQLHLSNDILQEV 76

Query: 73  VNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSV 132
           + +KT A+L   L ++       +K+H+  +LF+ ++ E  SV  HI+ F  I++ L S+
Sbjct: 77  LQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHISVFKEIVADLVSM 136

Query: 133 KITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESS 192
           ++ FD+E + L LL SLP S+A     +  S RD     +    L   E ++      +S
Sbjct: 137 EVQFDDEDLGLLLLCSLPSSYANFRHTILLS-RDELTLAEVYEALQNREKMKGMVQSYAS 195

Query: 193 NTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKA 252
           ++ G AL    RGR  Q+++N S    +S+SRGRS++R +    C  C +K HF  +C  
Sbjct: 196 SSKGEALQV--RGRSEQRTYNDSNDHDKSQSRGRSKSRGKK--FCKYCKKKNHFIEECW- 250

Query: 253 PRKKKNYQKR*DDDESANAATEEVADTLICSLD-----SPVDSWVIDSGASFHTIPSKEL 307
             K +N +KR  D +++   + E +D+  C +      +  D W++D+  SFH   +++ 
Sbjct: 251 --KLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASHDEWILDTACSFHICINRDW 308

Query: 308 LSNYICGKFGKVY-LADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQL 366
            S+Y   +   V  + D  P +IVGIG + I++ +G   TL +VRH+PG+ RNLIS+  L
Sbjct: 309 FSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMARNLISLSTL 368

Query: 367 DDEGYHTTFGGGAWKVTKGNLVVARGKKRGS-LYMVAEEDM------IAVT-EAINSSSI 418
           D EGY  +  GG  KV+KG+LV   G    + LY++    +       AVT +  + +++
Sbjct: 369 DAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAAVTKDEPSKTNL 428

Query: 419 WHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLEL 478
           WH RLGHMSE GM  +  +  +      ++  CEHC+ GK ++V F+ +  ++K   L+ 
Sbjct: 429 WHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTSVHRTKGI-LDY 487

Query: 479 VHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLK 538
           VH D+WGP+   SLGG+RY +T IDD +RK W YFLK K D F+ FK+ K  +E QT  +
Sbjct: 488 VHADLWGPSRKPSLGGARYMLTIIDDYSRKEWPYFLKHKDDTFAAFKERKVMIERQTEKE 547

Query: 539 IKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQS 598
           +K L +DNGGE+ S  F  +C + GI    TIP TP+QNGVAERMNRT+  +ARCM   +
Sbjct: 548 VKVLCTDNGGEFCSDAFDDYCRKEGIVRHHTIPYTPQQNGVAERMNRTIISKARCMLSNA 607

Query: 599 GLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDK 658
            + K FW +A NTA YLINR PS+PL+ + P E+W G     S L+VFGC +Y  +D+  
Sbjct: 608 RMNKRFWAEAANTACYLINRSPSIPLNKKTPIEIWSGMPADYSQLRVFGCTAYAHVDN-- 665

Query: 659 RDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
             KL+P+AIKC F+GYGS + GY+ W+ +  K   S NV FNE V++ D
Sbjct: 666 -GKLEPRAIKCLFLGYGSGVKGYKLWNPETNKTFMSRNVIFNEFVMFND 713


>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
           sativa (japonica cultivar-group)]
          Length = 1373

 Score =  451 bits (1161), Expect = e-125
 Identities = 265/717 (36%), Positives = 396/717 (54%), Gaps = 33/717 (4%)

Query: 16  FGFWKMLMEDYLYQKMLYQPLT---GKKPNDMKQEDWDLLDRQALGVIRLTLSKNVAFNI 72
           F  W++ M   L Q   Y       GK+  +   E+    D++AL +I+L L  ++    
Sbjct: 14  FSLWQVKMRGILAQTHDYDEALDNFGKRRAEWTAEEIRK-DQKALALIQLHLHNDILQEC 72

Query: 73  VNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSV 132
           + EKT+A+L   L ++       +K+ +  +LF L+M E +SV  H+  F  I++ L S+
Sbjct: 73  LTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMAEFKKIVADLVSM 132

Query: 133 KITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSED---IRRKDSG 189
           ++ +D+E + L LL SLP+S+A     +  S  +  LK  ++ D + +++   I  ++ G
Sbjct: 133 EVKYDDEDLGLLLLCSLPNSYANFRDTILLSRDELTLK--EVYDALQNKEKMKIMVQNDG 190

Query: 190 ESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQ 249
            SS+  G AL+   R      +      RGRSKS+     +      C  C  K H  ++
Sbjct: 191 SSSSK-GEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-----CVYCKLKNHNIDE 244

Query: 250 CKAPRKKKNYQKR*DDDESANAAT--EEVADTLICSLDSPV--DSWVIDSGASFHTIPSK 305
           CK  + K+   K+      A+AA   ++  D L+         D W++DS  SFH    +
Sbjct: 245 CKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGCVAGHDEWILDSACSFHICTKR 304

Query: 306 ELLSNYICGKFGKVY-LADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIG 364
              S+Y   + G V  + D  P  IVGIG + I++ +G   TL NVR++PG+ RNLIS+ 
Sbjct: 305 NWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYIPGMSRNLISLS 364

Query: 365 QLDDEGYHTTFGGGAWKVTKGNLVVARGKK--------RGSLYMVAEEDMIAVT-EAINS 415
            LD EGY  +   G  KV+KG+LV  +G          RG     ++    A+T +  + 
Sbjct: 365 TLDAEGYKYSGSDGVLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGSDSAAAAITNDEPSK 424

Query: 416 SSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEK 475
           +++WH RLGHMS  GM  +  +  +       +  CEHCI GK ++V F+ +   +K   
Sbjct: 425 TNLWHMRLGHMSHLGMTELMKRNLLKGCTSSKIKFCEHCIFGKHKRVQFNTSVHTTKGT- 483

Query: 476 LELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQT 535
           L+ VH D+WGP+   SLGG+RY +T IDD +RKVW YFLK K D F+ FK WK  +E QT
Sbjct: 484 LDYVHADLWGPSKKPSLGGARYMLTIIDDYSRKVWPYFLKHKDDTFTAFKNWKVMIERQT 543

Query: 536 GLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMR 595
             K+K L++DNGGE+ S  F  +C + GI    TIP TP+QNGVAERMNRT+  RARCM 
Sbjct: 544 ERKVKLLRTDNGGEFCSHAFNDYCRQEGIVRHHTIPHTPQQNGVAERMNRTIISRARCML 603

Query: 596 IQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLID 655
             + + K FW +A +TA YLINR PS+PL+ + P EVW G     S LKVFGC +Y  +D
Sbjct: 604 SHARMNKRFWAEAASTACYLINRSPSIPLNKKTPIEVWSGTPADYSQLKVFGCTAYAHVD 663

Query: 656 SDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRSSAE 712
           +    KL+P+A+KC F+GYGS + GY+ W+ +  K   S +V FNESV++ +   +E
Sbjct: 664 N---GKLEPRAVKCLFLGYGSGVKGYKLWNPETGKTFMSRSVVFNESVMFTNSLPSE 717


>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1241

 Score =  439 bits (1129), Expect = e-121
 Identities = 249/646 (38%), Positives = 386/646 (59%), Gaps = 33/646 (5%)

Query: 96  NKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAA 155
           +K+HL ++LF  ++ +  SV +H+++F  I++ L S+++ +D E + L LL SLP S+A 
Sbjct: 7   SKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSLPSSYAN 66

Query: 156 TVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGESSNTFGSALNTESRGRGSQKS-HN 213
               +  S RD  L   ++ D +   E +++    E SN+    L    RGR  +K+ +N
Sbjct: 67  FRDTILYS-RDT-LTLKEVYDALHAKEKMKKMVPSEGSNSQAEGLVV--RGRQQEKNTNN 122

Query: 214 QSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQC-----KAPRKKKNYQKR*DDDES 268
           +S+ +  S  RGRS++R R   +C  C R GH  ++C     K  R +K   K   ++E 
Sbjct: 123 KSRDKSSSIYRGRSKSRGRYK-SCKYCKRDGHDISECWKLQDKDKRTRKYIPKGKKEEEG 181

Query: 269 -ANAATEEVADTLI------CSLDSPVDSWVIDSGASFHTIPSKELLSNYICGKFGKVYL 321
            A   T+E +D  +      C+  S  D W++D+  ++H  P+++  + Y   + G V +
Sbjct: 182 KAAVVTDEKSDAELLVAYAGCAQTS--DQWILDTACTYHMCPNRDWFATYEAVQGGTVLM 239

Query: 322 ADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWK 381
            D  P ++ GIG + I+  +G + TL +VRH+P +KR+LIS+  LD +GY  + G G  K
Sbjct: 240 GDDTPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYKYSGGDGILK 299

Query: 382 VTKGNLVVARGK-KRGSLYMVAEEDMI----AVTEAINSS---SIWHQRLGHMSEKGMKI 433
           VTKG+LVV +   K  +LY +    ++    AV++++++S   ++WH RLGHMSE G+  
Sbjct: 300 VTKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGLAE 359

Query: 434 MASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLG 493
           ++ +G +       L  CEHCI GK ++V F+ +   ++   L+ VH+D+WGPA   S G
Sbjct: 360 LSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGI-LDYVHSDLWGPARKTSFG 418

Query: 494 GSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQ 553
           G+RY +T +DD +RKVW YFLK K   F VFK+WKT VE QT  K+K L++DNG E  S+
Sbjct: 419 GARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKILRTDNGMELCSK 478

Query: 554 EFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAA 613
            FK +C   GI    T+P TP+QNGVAERMNRT+  +ARCM   + LPK FW +A++TA 
Sbjct: 479 IFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCMLSNASLPKQFWAEAVSTAC 538

Query: 614 YLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIG 673
           YLINR PS  +D + P EVW G   + S L+VFGC +Y  +D+    KL+P+ IKC F+G
Sbjct: 539 YLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTAYAHVDN---GKLEPRVIKCIFLG 595

Query: 674 YGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRSSAESMSSSKQ 719
           Y S + GY+ W  + +K++ S NV F+ES++  D+ S      S++
Sbjct: 596 YLSGVKGYKLWCPETKKVVISRNVVFHESIMLHDKPSTNVPVESQE 641


>ref|XP_476137.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|48475101|gb|AAT44170.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
           gi|46576026|gb|AAT01387.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1175

 Score =  436 bits (1120), Expect = e-120
 Identities = 241/620 (38%), Positives = 373/620 (59%), Gaps = 28/620 (4%)

Query: 103 RLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSN 162
           +LF+ ++ E  S+  HI+ F  I++ L S+++ FD+E + L LL SLP S+A     +  
Sbjct: 2   KLFSHKLQESGSILNHISVFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRDTILL 61

Query: 163 SARDNKLKFDDIRDLILS-EDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRS 221
           S   ++L   ++ + + + E ++     ++S++ G AL    RGR  Q+++N S  R ++
Sbjct: 62  SR--SELTLAEVYEALQNREKMKGMVQSDASSSKGEALQV--RGRSEQRTYNDSNDRDKN 117

Query: 222 KSRGRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLI 281
           +SRGRS++R +    C  C +K HF  +C    K +N +KR  D +++   + + +D+  
Sbjct: 118 QSRGRSKSRGKK--FCKYCKKKNHFIEECW---KLQNKEKRKSDGKASVVTSADNSDSGD 172

Query: 282 CSLDSPV-----DSWVIDSGASFHTIPSKELLSNYICGKFGKVY-LADGKPLDIVGIGDI 335
           C +   V     D W++D+  SFH   +++  S+Y   + G V  + D  P +IVGIG +
Sbjct: 173 CLVVFVVCVSSHDEWILDTTCSFHICINRDWFSSYKSVQNGDVVRMGDDNPREIVGIGSV 232

Query: 336 DIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKR 395
            I++ +G   TL +VRH+P + RNLIS+  LD EGY  +  GG  KV+KG+LV   G   
Sbjct: 233 QIKTHDGMTRTLKDVRHIPRMARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMN 292

Query: 396 GS-LYMVAEEDMIA-VTEAINS------SSIWHQRLGHMSEKGMKIMASKGKMSNLKHVD 447
            + LY++    +   VT A+ S      +++WH RLGHMSE GM  +  +  +      +
Sbjct: 293 SANLYVLRGSTLHGYVTAAVVSKDEPSKTNMWHMRLGHMSELGMAELMKRNLLDGCTQGN 352

Query: 448 LGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTR 507
           +  CEHC+ GK ++V F+ +  ++K   L+ VH D+WGP+   SLGG+RY +T IDD +R
Sbjct: 353 MKFCEHCVFGKHKRVKFNTSVHRTKGI-LDYVHADLWGPSRKPSLGGARYMLTIIDDYSR 411

Query: 508 KVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMI 567
           KVW YFLK K D F+ FK+WK  ++ QT  ++K L++DNGG + S  F  +C + GI M 
Sbjct: 412 KVWPYFLKHKDDTFAAFKEWKVMIKRQTEKEVKVLRTDNGGGFCSDAFDDYCRKEGIVMH 471

Query: 568 KTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQ 627
            TIP TP+QNGVAERMNRT+  +ARCM   + + K FW +A  TA YLINR PS+ L+ +
Sbjct: 472 HTIPYTPQQNGVAERMNRTIISKARCMLSNARMNKRFWAEAAKTACYLINRSPSISLNKK 531

Query: 628 LPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQ 687
            P EVW G   + S L+VFGC +Y  +++    KL+P+AIKC F+GYGS + GY+ W+ +
Sbjct: 532 TPIEVWSGMPANYSQLRVFGCTAYAHVNN---GKLEPRAIKCLFLGYGSGVKGYKLWNPE 588

Query: 688 NRKIIRSINVTFNESVLYKD 707
             K   S +V FNESV++ D
Sbjct: 589 TNKTFMSRSVVFNESVMFND 608


>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301697|pir||B84512 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1335

 Score =  430 bits (1106), Expect = e-119
 Identities = 253/698 (36%), Positives = 393/698 (56%), Gaps = 38/698 (5%)

Query: 34  QPLTGKK---PNDMKQEDWDLLDR-----QALGVIRLTLSKNVAFNIVNEKTTADLMKAL 85
           +PLT ++   P   K+ D D + R     +A  VI L ++  V   I   KT A+  + L
Sbjct: 19  KPLTEEEEEDPEKRKKRDADEVARLERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETL 78

Query: 86  SNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSL 145
             ++      ++V+     +  +M E   + E+I+ F  I++ L+ ++I   +E+  + L
Sbjct: 79  DRLFMIRSLPHRVYTQLSFYTFKMQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILL 138

Query: 146 LQSLPDSWAATVTAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRG 205
           L SLP  +   V  +  S    KL+ DD+  ++ + D   K+   S N         +RG
Sbjct: 139 LSSLPARYDGLVETMKYSNSREKLRLDDV--MVAARD---KERELSQNNRPVVEGHFARG 193

Query: 206 RGSQKSHNQ-SQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQC-KAPRKKKNYQKR* 263
           R   K++NQ ++G+ RS+S+     RV     CW C ++GHF  QC K   + K+ Q+  
Sbjct: 194 RPDGKNNNQGNKGKNRSRSKSADGKRV-----CWICGKEGHFKKQCYKWIERNKSKQQGS 248

Query: 264 DDDESANAATEEV---------ADTLICSLDSPVDSWVIDSGASFHTIPSKELLSNYICG 314
           D+ ES+ A + E           D  +   DS  + WV+D+G SFH  P K+   ++   
Sbjct: 249 DNGESSLAKSTEAFNPAMVLLATDETLVVTDSIANEWVLDTGCSFHMTPRKDWFKDFKEL 308

Query: 315 KFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTT 374
             G V + +     + GIG I IR+S+G+   L +VR++P + RNLIS+G L+D G    
Sbjct: 309 SSGYVKMGNDTYSPVKGIGSIKIRNSDGSQVILTDVRYMPNMTRNLISLGTLEDRGCWFK 368

Query: 375 FGGGAWKVTKGNLVVARGKKRGSLYMV----AEEDMIAVTEAINSSSIWHQRLGHMSEKG 430
              G  K+ KG   + +G+KR +LY++     E +  +  E  + +++WH RLGHMS+KG
Sbjct: 369 SQDGILKIVKGCSTILKGQKRDTLYILDGVTEEGESHSSAEVKDETALWHSRLGHMSQKG 428

Query: 431 MKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWG-PAPV 489
           M+I+  KG +      +L  CE C+ GKQ +VSF+ A   +K EKL  VH+D+WG P   
Sbjct: 429 MEILVKKGCLRREVIKELEFCEDCVYGKQHRVSFAPAQHVTK-EKLAYVHSDLWGSPHNP 487

Query: 490 KSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGE 549
            SLG S+Y+++F+DD +RKVW+YFL+ K + F  F +WK  VENQ+  K+K L++DNG E
Sbjct: 488 ASLGNSQYFISFVDDYSRKVWIYFLRKKDEAFEKFVEWKKMVENQSDRKVKKLRTDNGLE 547

Query: 550 YDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAI 609
           Y +  F+KFC E GI   KT   TP+QNG+AER+NRT+ ++ R M  +SG+ K FW +A 
Sbjct: 548 YCNHYFEKFCKEEGIVRHKTCAYTPQQNGIAERLNRTIMDKVRSMLSRSGMEKKFWAEAA 607

Query: 610 NTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKC 669
           +TA YLINR PS  +++ LPEE W G    LS L+ FGC++Y+  D   + KL+P++ K 
Sbjct: 608 STAVYLINRSPSTAINFDLPEEKWTGALPDLSSLRKFGCLAYIHAD---QGKLNPRSKKG 664

Query: 670 FFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKD 707
            F  Y   + GY+ W  +++K + S NV F E V++KD
Sbjct: 665 IFTSYPEGVKGYKVWVLEDKKCVISRNVIFREQVMFKD 702


>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
           gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
           sativa (japonica cultivar-group)]
          Length = 1243

 Score =  422 bits (1085), Expect = e-116
 Identities = 257/723 (35%), Positives = 396/723 (54%), Gaps = 54/723 (7%)

Query: 16  FGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWD----LLDRQALGVIRLTLSKNVAFN 71
           F  W++ M   L Q+ L   L+G    D + +DW       DR+A+  I L LS N+   
Sbjct: 17  FSLWQVKMRAVLAQQDLDDALSGF---DKRTQDWSNDEKKRDRKAISYIHLHLSNNILQE 73

Query: 72  IVNEKTTADLMKALSNMYEKPFAANKVHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSS 131
           ++ E+T A L   L  +       +K+HL ++LF  ++ +  SV +H+++F  I++ L S
Sbjct: 74  VLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSAFKEIVADLES 133

Query: 132 VKITFDNELMVLSLLQSLPDSWAATVTAVSNSARDNKLKFDDIRDLI-LSEDIRRKDSGE 190
           +++ +D + + L LL SLP S+A     +  S RD  L   ++ D     E +++  + E
Sbjct: 134 MEVKYDEDDLGLILLCSLPSSYANFRGTILYS-RDT-LTLKEVYDAFHAKEKMKKMVTSE 191

Query: 191 SSNTFGSALNTESRGRGSQKS-HNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQ 249
            SN+    L    RGR  +K+  NQS+ +  S  RGR+++R R   +C  C R GH  ++
Sbjct: 192 GSNSQAEGLVV--RGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYK-SCKYCKRDGHDISE 248

Query: 250 C-----KAPRKKKNYQKR*DDDESANAATEEVADTLICSLDSPVDSWVIDSGASFHTIPS 304
           C     K  R  K   K   ++E   A             D   D+ ++ + A       
Sbjct: 249 CWKLQDKDKRTGKYIPKGKKEEEGKAAVVT----------DEKSDAELLVAYAGCAQTSD 298

Query: 305 KELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIG 364
           ++  + Y   + G V + D  P ++ GIG + I+  +G + TL +V+H+P +KR+LIS+ 
Sbjct: 299 QDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNLKRSLISLY 358

Query: 365 QLDDEGYHTTFGGGAWKVTKGNLVVARGK-KRGSLYMVAEEDMIAVTEAI-------NSS 416
                        G  KVTKG+LVV +   K  +LY +    ++    A+       +++
Sbjct: 359 -------------GILKVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAVFDSLSNSDAT 405

Query: 417 SIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKL 476
           ++WH RLGHMSE G+  ++ +G +       L  CEHCI GK ++V F+ +   ++   L
Sbjct: 406 NLWHMRLGHMSEIGLAELSKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNTSTHTTEGI-L 464

Query: 477 ELVHTDVWGPAPVKSLGGSRYYVTFIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTG 536
           + VH+D+WGPA   S GG+RY +T +DD +RKVW YFLK K   F  FK+WKT VE QT 
Sbjct: 465 DYVHSDLWGPAHKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEWKTMVERQTE 524

Query: 537 LKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRI 596
            K+K L++DNG E+ S+ FK +C   GI    T P TP+QN VAERMNRT+  +ARCM  
Sbjct: 525 RKVKILRTDNGMEFCSKIFKSYCKSEGIVCHYTAPHTPQQNDVAERMNRTIISKARCMLS 584

Query: 597 QSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDS 656
            +GLPK FW +A++TA YLINR P   +D + P EVW G   + S L+VFGC +Y  +D+
Sbjct: 585 NAGLPKQFWAEAVSTACYLINRSPGYAIDKKTPIEVWSGSPTNYSDLRVFGCTAYAHVDN 644

Query: 657 DKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINVTFNESVLYKDRSSAESMSS 716
               KL+P+AIKC F+GY S + GY+ W  + +K++ S NV F+ESV+  D+ S      
Sbjct: 645 ---GKLEPRAIKCIFLGYASGVKGYKLWCPETKKVVISRNVVFHESVILHDKPSTNVPVE 701

Query: 717 SKQ 719
           S++
Sbjct: 702 SQE 704


>gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
           gi|25301707|pir||E86490 hypothetical protein F28L22.3 -
           Arabidopsis thaliana
          Length = 1356

 Score =  413 bits (1062), Expect = e-114
 Identities = 252/747 (33%), Positives = 401/747 (52%), Gaps = 72/747 (9%)

Query: 5   KVKIERFDG-RDFGFWKMLMEDYLYQKMLYQPLTGKK--------PNDMKQEDWD----- 50
           +V+I+ F+G RDF  WK+ ++  L    L   LT            ++ KQE  D     
Sbjct: 7   RVEIKVFNGDRDFSLWKIRIQAQLGVLGLKDTLTDFSLTKTVPLTKSEAKQESGDGESSG 66

Query: 51  ----------LLDRQALGVIRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHL 100
                         QA  +I   +S  V   + +  TTADL   L+  Y +    N+++ 
Sbjct: 67  TKEVPDPVKIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIYT 126

Query: 101 IRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSW------- 153
             +L++ +M    ++ ++++ F  I+++L S++I  D E+  + +L SLP S        
Sbjct: 127 QLKLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTL 186

Query: 154 -----AATVTAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRGRGS 208
                  TV  V++SA+  + +  +  DL         D G+++      L T  RGR  
Sbjct: 187 KYGNKTLTVQDVTSSAKSLERELAEAVDL---------DKGQAA-----VLYTTERGRPL 232

Query: 209 QKSHNQSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDES 268
            ++ NQ  G+G+ +SR  S+T+V     CW C ++GH    C + +KK   + + +    
Sbjct: 233 VRN-NQKGGQGKGRSRSNSKTKV----PCWYCKKEGHVKKDCYSRKKKMESEGQGE---- 283

Query: 269 ANAATEEVADTLICSLDSPV--DSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKP 326
           A   TE++  +   S++  +  D W++DSG + H    ++   ++       + L D   
Sbjct: 284 AGVITEKLVFSEALSVNEQMVKDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHS 343

Query: 327 LDIVGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGN 386
           ++  G G I I +  GT+  L NV++VP ++RNLIS G LD  GY    G G  +  K N
Sbjct: 344 VESQGQGTIRIDTHGGTIKILENVKYVPHLRRNLISTGTLDKLGYRHEGGEGKVRYFKNN 403

Query: 387 LVVARGKKRGSLYM-----VAEEDMIAVTEAINSSSIWHQRLGHMSEKGMKIMASKGKMS 441
               RG     LY+     V  E   A T+ + ++ +WH RLGHMS   +K++A KG + 
Sbjct: 404 KTALRGSLSNGLYVLDGSTVMSELCNAETDKVKTA-LWHSRLGHMSMNNLKVLAGKGLID 462

Query: 442 NLKHVDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWG-PAPVKSLGGSRYYVT 500
             +  +L  CEHC++GK +KVSF+  G+ +  + L  VH D+WG P    S+ G +Y+++
Sbjct: 463 RKEINELEFCEHCVMGKSKKVSFN-VGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLS 521

Query: 501 FIDDSTRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCS 560
            IDD TRKVW+YFLKSK + F  F +WK+ VENQ   K+K L++DNG E+ +  F  +C 
Sbjct: 522 IIDDKTRKVWLYFLKSKDETFDKFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCK 581

Query: 561 ENGIRMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGP 620
           E+GI   +T   TP+QNGVAERMNRT+ E+ RC+  +SG+ ++FW +A  TAAYLINR P
Sbjct: 582 EHGIERHRTCTYTPQQNGVAERMNRTIMEKVRCLLNKSGVEEVFWAEAAATAAYLINRSP 641

Query: 621 SVPLDYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYG 680
           +  +++ +PEE+W  ++    HL+ FG ++YV  D   + KL P+A+K FF+GY +   G
Sbjct: 642 ASAINHNVPEEMWLNRKPGYKHLRKFGSIAYVHQD---QGKLKPRALKGFFLGYPAGTKG 698

Query: 681 YRFWDEQNRKIIRSINVTFNESVLYKD 707
           Y+ W  +  K + S NV F ESV+Y+D
Sbjct: 699 YKVWLLEEEKCVISRNVVFQESVVYRD 725


>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
           gi|25301696|pir||F84486 probable retroelement pol
           polyprotein [imported] - Arabidopsis thaliana
          Length = 1356

 Score =  407 bits (1045), Expect = e-112
 Identities = 241/750 (32%), Positives = 398/750 (52%), Gaps = 43/750 (5%)

Query: 1   LEEGKVKIERFDGR-DFGFWKMLMEDYLYQKMLYQPL-----TGKKPNDMKQEDWDLLDR 54
           +   ++++E+FDGR D+  WK  +  ++    L   L     TG+K + + + D D  ++
Sbjct: 1   MSTARIEVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEK 60

Query: 55  ------------QALGVIRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANKVHLIR 102
                       +A   I L+++  V   I  E T A ++ AL  +Y      N+++  +
Sbjct: 61  LEKFEALEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQ 120

Query: 103 RLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSN 162
           +L++ +M E  SV  +I+ F  II+ L ++ +   +E   + LL +LP ++      +  
Sbjct: 121 KLYSFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKY 180

Query: 163 SARDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSK 222
           S+  + L  D++   I S+++      +S       L          K  N+++G+G  K
Sbjct: 181 SSGKSILTLDEVAAAIYSKELELGSVKKSIKVQAEGLYV--------KDKNENKGKGEQK 232

Query: 223 SRGRSQT-RVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDES-------ANAATE 274
            +G+ +  + +    CW C  +GHF + C    K +  Q +    ES       A AA  
Sbjct: 233 GKGKGKKGKSKKKPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAAGY 292

Query: 275 EVADTLICSLDSPVDSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGD 334
            V++ L  +     D W++D+G S+H    +E    +     G V + +     + G+G 
Sbjct: 293 YVSEALSSTEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGT 352

Query: 335 IDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKK 394
           I +++S+G    L NVR++P + RNL+S+G  +  GY      G  ++  GN V+  G++
Sbjct: 353 IRVKNSDGLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDGILRIKAGNQVLLTGRR 412

Query: 395 RGSLYMV----AEEDMIAVTEAINSSSIWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGV 450
             +LY++       + +AV +  + + +WHQRL HMS+K M+I+  KG +   K   L V
Sbjct: 413 YDTLYLLNWKPVASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDKKKVSSLDV 472

Query: 451 CEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWG-PAPVKSLGGSRYYVTFIDDSTRKV 509
           CE CI GK ++ SFS A   +K EKLE +H+D+WG P    SLG  +Y+++ IDD TRKV
Sbjct: 473 CEDCIYGKAKRKSFSLAHHDTK-EKLEYIHSDLWGAPFVPLSLGKCQYFMSIIDDFTRKV 531

Query: 510 WVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKT 569
           WVYF+K+K + F  F +W   VENQT  ++K+L++DNG E+ ++ F  FC   GI   +T
Sbjct: 532 WVYFMKTKDEAFEKFVEWVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCESIGIHRHRT 591

Query: 570 IPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLP 629
              TP+QNGVAERMNRT+ E+ R M   SGLPK FW +A +T   LIN+ PS  L++++P
Sbjct: 592 CAYTPQQNGVAERMNRTIMEKVRSMLSDSGLPKRFWAEATHTTVLLINKTPSSALNFEIP 651

Query: 630 EEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNR 689
           ++ W G     S+L+ +GCV++V  D     KL+P+A K   IGY   + GY+ W    R
Sbjct: 652 DKKWSGNPPVYSYLRRYGCVAFVHTDD---GKLEPRAKKGVLIGYPVGVKGYKVWILDER 708

Query: 690 KIIRSINVTFNESVLYKDRSSAESMSSSKQ 719
           K + S N+ F E+ +YKD    +   S+++
Sbjct: 709 KCVVSRNIIFQENAVYKDLMQRQENVSTEE 738


>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
          Length = 1342

 Score =  404 bits (1039), Expect = e-111
 Identities = 238/756 (31%), Positives = 395/756 (51%), Gaps = 66/756 (8%)

Query: 1   LEEGKVKIERFDGR-DFGFWKMLMEDYLYQKMLYQPLTGKKPNDMKQEDWDLLDR----- 54
           +  G+ ++E+FDG  D+  WK  +  ++    L + L  ++   ++    ++ D      
Sbjct: 1   MSSGRAEVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDP 60

Query: 55  -----------------QALGVIRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANK 97
                            +A   I L+L  NV   ++ +KT A ++K L  ++      N+
Sbjct: 61  ETATSKLEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNR 120

Query: 98  VHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATV 157
           ++L +RL+  +M E  ++ E++N F  +IS L +VK+   +E   + LL SLP  +    
Sbjct: 121 IYLKQRLYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLK 180

Query: 158 TAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQG 217
             +        L  ++I   I S+ +    SG+        L  + RGR   +    ++ 
Sbjct: 181 ETLKYCK--TTLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDRGRSETRGKGPNKN 238

Query: 218 RGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVA 277
           + RSKS+G  +T       CW C ++GHF  QC    K++N Q    +   A+  T  V 
Sbjct: 239 KSRSKSKGAGKT-------CWICGKEGHFKKQCYV-WKERNKQGSTSERGEASTVTARVT 290

Query: 278 DT--------LICSLDSPVDSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDI 329
           D         L+   +   D+W++D+G SFH    K+ + ++     GKV + +    ++
Sbjct: 291 DAAALVVSRALLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEV 350

Query: 330 VGIGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVV 389
            GIGD+ I++ +G+   L +VR++P + +NLIS+G L+D+G       G   + K +L V
Sbjct: 351 KGIGDVRIKNEDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKGILTIFKNDLTV 410

Query: 390 ARGKKRGSLYMVAEEDMIAVTEAINS----SSIWHQRLGHMSEKGMKIMASKGKMSNLKH 445
             GKK  +LY +    +      I+     +S+WH RLGH+  KG++++ SKG      H
Sbjct: 411 LTGKKESTLYFLQGTTLAGEANVIDKEKDETSLWHSRLGHIGAKGLQVLVSKG------H 464

Query: 446 VDLGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVK-SLGGSRYYVTFIDD 504
           +D          K   +SF  A   +K +KL+ VH+D+WG   V  S+G  +Y++TFIDD
Sbjct: 465 LD----------KNIMISFGAAKHVTK-DKLDYVHSDLWGSTNVPFSIGKCQYFITFIDD 513

Query: 505 STRKVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGI 564
            TR+ W+YF+++K + FS F +WKT++ENQ   K+K L +DNG E+ +QEF  FC + G+
Sbjct: 514 FTRRTWIYFIRTKDEAFSKFVEWKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKEGV 573

Query: 565 RMIKTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPL 624
              +T   TP+QNGVAERMNRT+  + RCM  +SGL K FW +A +TA +LIN+ PS  +
Sbjct: 574 IRHRTCAYTPQQNGVAERMNRTIMNKVRCMLSESGLGKQFWAEAASTAVFLINKSPSSSI 633

Query: 625 DYQLPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFW 684
           ++ +PEE W G       LK FG V+Y+  D   + KL+P+A K  F+GY   +  ++ W
Sbjct: 634 EFDIPEEKWTGHPPDYKILKKFGSVAYIHSD---QGKLNPRAKKGIFLGYPDGVKRFKVW 690

Query: 685 DEQNRKIIRSINVTFNESVLYKDRSSAESMSSSKQL 720
             ++RK + S ++ F E+ +YK+    +     KQL
Sbjct: 691 LLEDRKCVVSRDIVFQENQMYKELQKNDMSEEDKQL 726


>emb|CAB79135.1| putative transposable element [Arabidopsis thaliana]
           gi|3402755|emb|CAA20201.1| putative transposable element
           [Arabidopsis thaliana] gi|7444415|pir||T05178
           hypothetical protein T6K22.90 - Arabidopsis thaliana
          Length = 1308

 Score =  401 bits (1031), Expect = e-110
 Identities = 241/746 (32%), Positives = 389/746 (51%), Gaps = 67/746 (8%)

Query: 5   KVKIERFDG-RDFGFWKMLME-------------DYLYQKMLYQPLTGKKPNDMKQEDWD 50
           KV+I+ F+G RDF  WK+ +E             D+   K +    + KK ++ + ++ D
Sbjct: 6   KVEIKTFNGDRDFSLWKIRIEAQLGVLGLKPALSDFTLTKTILVVKSEKKESESEDDETD 65

Query: 51  LLDR-------------QALGVIRLTLSKNVAFNIVNEKTTADLMKALSNMYEKPFAANK 97
                            QA   I   ++  V   + +  T A+L   L+ ++ +    N+
Sbjct: 66  SKKTEEVPDPIKFEQSDQAKNFIINHITDTVLLKVQHCVTAAELWATLNKLFMETSLPNR 125

Query: 98  VHLIRRLFNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATV 157
           ++   RL++ +M +  S+ ++ + F  I+++L S++I    E+  + +L SLP S+    
Sbjct: 126 IYTQLRLYSFKMVDNLSIDQNTDEFLRIVAELGSLQIQVGEEVQAILILNSLPPSYIQLK 185

Query: 158 TAVSNSARDNKLKFDDIRDLILSEDIRRKDSGESSNTF----GSALNTESRGRGSQKSHN 213
             +         K   ++D++ S     ++  E   T      +AL T  RGR  Q  + 
Sbjct: 186 HTLKYGN-----KTLSVQDVVSSAKSLERELSEQKETIRAPASTALYTAERGR-PQTKNT 239

Query: 214 QSQGRGRSKSRGRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDESANAAT 273
           Q QG+GR +S  +S+      +TCW C ++GH    C A ++K   + +      A   T
Sbjct: 240 QGQGKGRGRSNSKSR------LTCWFCKKEGHVKKDCYAGKRKLENEGQ----GKAGVIT 289

Query: 274 EEVADTLICSL--DSPVDSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVG 331
           E++  +   S+      D WVIDSG ++H     +  S +   +   + L D   ++  G
Sbjct: 290 EKLVYSEALSMYDQEAKDKWVIDSGCTYHMTSRMDWFSEFNENETTMILLGDDHTVESKG 349

Query: 332 IGDIDIRSSNGTLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVAR 391
            G + + +  G++  L NVR VP ++RNLIS G LD  GY    G G  +  K N     
Sbjct: 350 SGTVKVNTHGGSIRVLKNVRFVPNLRRNLISTGTLDKLGYKHEGGDGKVRFYKENKTALC 409

Query: 392 GKKRGSLYMVAEEDMIAVTEAINSSS----IWHQRLGHMSEKGMKIMASKGKMSNLKHVD 447
           G     LY++    ++     +  S+    +WH RLGHMS   MKI+A KG +      +
Sbjct: 410 GNLVNGLYVLDGHTVVNENCNVEGSNEKTELWHCRLGHMSLNNMKILAEKGLLEKKDIKE 469

Query: 448 LGVCEHCILGKQRKVSFSKAGRKSKSEKLELVHTDVWGPAPVKSLGGSRYYVTFIDDSTR 507
           L  CE+C++GK +K+SF+  G+    E L  +H D+WG          +Y+++ IDD +R
Sbjct: 470 LSFCENCVMGKSKKLSFN-VGKHITDEVLGYIHADLWG---------KQYFLSIIDDKSR 519

Query: 508 KVWVYFLKSKSDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMI 567
           KVW+ FLK+K + F  F +WK  VENQ   K+K L++DNG E+ + +F +FC +NGI   
Sbjct: 520 KVWLMFLKTKDETFERFCEWKELVENQVNKKVKILRTDNGLEFCNLKFDEFCKQNGIERH 579

Query: 568 KTIPGTPEQNGVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQ 627
           +T   TP+QNGVA+RMNRTL E+ RC+  +SGL ++FW +A  TAAYL+NR P+  +D+ 
Sbjct: 580 RTCTYTPQQNGVAKRMNRTLMEKVRCLLNESGLEEVFWAEAAATAAYLVNRSPASAVDHN 639

Query: 628 LPEEVWYGKEVSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQ 687
           +PEE+W  K+    HL+ FGC++YV +D   + KL P+A+K  F+GY     GY+ W   
Sbjct: 640 VPEELWLDKKPGYKHLRRFGCIAYVHLD---QGKLKPRALKGVFLGYPQGTKGYKVWLLD 696

Query: 688 NRKIIRSINVTFNESVLYKD-RSSAE 712
             K + S N+ FNE+ +YKD R S+E
Sbjct: 697 EEKCVISRNIVFNENQVYKDIRESSE 722


>emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir||S05465
           retrovirus-related polyprotein - Arabidopsis thaliana
           retrotransposon Ta1-3
          Length = 1291

 Score =  398 bits (1023), Expect = e-109
 Identities = 238/741 (32%), Positives = 386/741 (51%), Gaps = 48/741 (6%)

Query: 12  DGRDFGFWKMLMEDYL-----------YQKMLYQPLT---GKKPNDMKQEDWDLLDRQAL 57
           +  DF  WK  M+ +L           +   +  P+    GKK  D  ++      +   
Sbjct: 20  ENSDFSLWKTCMKAHLGLAGLKGIIDDFDLTMTVPIPKSEGKKIEDGDEQGDSSQTKIVP 79

Query: 58  GVIRLTLSKNVAFNIV-------------NEKTTADLMKALSNMYEKPFAANKVHLIRRL 104
            ++++  S+N A NI+             + K+ A++ + L+  Y +    N++++  + 
Sbjct: 80  DLVKIEKSEN-AMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKF 138

Query: 105 FNLRMGEGNSVTEHINSFNTIISQLSSVKITFDNELMVLSLLQSLPDSWAATVTAVSNSA 164
           ++ +M +  S+ E++N F  I+++LSS++I    E+  +  L  L   ++     +    
Sbjct: 139 YSFKMNDTKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGN 198

Query: 165 RDNKLKFDDIRDLILSEDIRRKDSGESSNTFGSALNTESRGRGSQKSHNQSQGRGRSKSR 224
           +   LK  D+     S +    +  E+     + L T  R R   ++ N ++G    + R
Sbjct: 199 KALSLK--DVISAARSLERELNEQKETDKNTSTVLYTNERSRPQTRNQNHNKG---GQGR 253

Query: 225 GRSQTRVRNDITCWNCDRKGHFTNQCKAPRKKKNYQKR*DDDESANAATEEVADTLICSL 284
           GRS++     +TCW C ++GH      A ++K   +    +   A   TE++  +   S+
Sbjct: 254 GRSKSNSNAKLTCWYCKKEGHVKKDYFARKRKLESE----NPGEAGVITEKLVFSEALSV 309

Query: 285 DSPV--DSWVIDSGASFHTIPSKELLSNYICGKFGKVYLADGKPLDIVGIGDIDIRSSNG 342
           +     D WV+DSG + H    ++   ++       + L D   +   G G I I +  G
Sbjct: 310 NDLAVRDIWVLDSGCTSHMSARRDWFCSFREDGGPTILLGDDHSVKSQGQGSIKIETHGG 369

Query: 343 TLWTLHNVRHVPGIKRNLISIGQLDDEGYHTTFGGGAWKVTKGNLVVARGKKRGSLYMVA 402
           T+  L NV++VP ++RNLIS G LD  GY    G G  +  K      RG+    LY++ 
Sbjct: 370 TIIGLENVKYVPELRRNLISTGTLDKRGYKHEGGDGKVRYFKNQKTALRGELVNGLYILD 429

Query: 403 EEDMIAVTEAINSSS----IWHQRLGHMSEKGMKIMASKGKMSNLKHVDLGVCEHCILGK 458
              +++ T     S     +WH RLGH+    MK++A KG +S  +   L  CE+C++GK
Sbjct: 430 GNTVLSETCVAEGSKGKTELWHSRLGHIGLNNMKVLAGKGLVSKEEIRVLDFCENCVMGK 489

Query: 459 QRKVSFSKAGRKSKSEKLELVHTDVWGPAPVK-SLGGSRYYVTFIDDSTRKVWVYFLKSK 517
            +KVSF+  G+ +  + L  VH D+WG   V  SL G++Y+++ IDD TRKVW+YFL+SK
Sbjct: 490 AKKVSFN-VGKHNSEDVLRYVHADLWGSTNVTPSLSGNKYFLSIIDDKTRKVWLYFLRSK 548

Query: 518 SDVFSVFKKWKTEVENQTGLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTIPGTPEQN 577
            + F  F +WK  VENQ   K+K L++DNG E+ + +F  +C E+GI   KT   TP+QN
Sbjct: 549 DETFDRFCEWKELVENQQNKKVKCLRTDNGLEFCNLKFDAYCKEHGIERHKTCTYTPQQN 608

Query: 578 GVAERMNRTLNERARCMRIQSGLPKMFWPDAINTAAYLINRGPSVPLDYQLPEEVWYGKE 637
           GVAERMNRT+ E+ RCM  +SGL + FW +A  TAAYLINR P+  +D+ +PEE+W  K+
Sbjct: 609 GVAERMNRTIMEKVRCMLNESGLGEEFWAEAAATAAYLINRSPASAIDHNVPEELWLNKK 668

Query: 638 VSLSHLKVFGCVSYVLIDSDKRDKLDPKAIKCFFIGYGSDMYGYRFWDEQNRKIIRSINV 697
               HL+ FG ++YV ID   + KL P+A+K  FIGY +   GY+ W  +  K + S NV
Sbjct: 669 PGYKHLRRFGSIAYVHID---QGKLKPRALKGIFIGYPAGTKGYKIWLLEEHKCVISRNV 725

Query: 698 TFNESVLYKDRSSAESMSSSK 718
            F+E  +YKD    E +  S+
Sbjct: 726 LFHEESVYKDTMKKERVVESE 746


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.317    0.134    0.396 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,206,169,069
Number of Sequences: 2540612
Number of extensions: 51343581
Number of successful extensions: 158306
Number of sequences better than 10.0: 6385
Number of HSP's better than 10.0 without gapping: 5447
Number of HSP's successfully gapped in prelim test: 946
Number of HSP's that attempted gapping in prelim test: 145834
Number of HSP's gapped (non-prelim): 10608
length of query: 720
length of database: 863,360,394
effective HSP length: 135
effective length of query: 585
effective length of database: 520,377,774
effective search space: 304420997790
effective search space used: 304420997790
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)


Lotus: description of TM0220.7