Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0171.6
         (799 letters)

Database: uniref100 
           2,790,947 sequences; 848,049,833 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef100_Q9FJV3 Retroelement pol polyprotein-like [Arabidopsis ...   424  e-117
UniRef100_O65468 Hypothetical protein F21P8.50 [Arabidopsis thal...   421  e-116
UniRef100_Q9FX79 Putative retroelement polyprotein [Arabidopsis ...   408  e-112
UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidop...   396  e-108
UniRef100_Q8L700 Hypothetical protein [Arabidopsis thaliana]          392  e-107
UniRef100_O22175 Putative retroelement pol polyprotein [Arabidop...   392  e-107
UniRef100_Q9SIM3 Putative retroelement pol polyprotein [Arabidop...   390  e-107
UniRef100_O23588 Retrotransposon like protein [Arabidopsis thali...   387  e-106
UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana]              385  e-105
UniRef100_Q5XWR5 Putative retroelement pol polyprotein-like [Sol...   379  e-103
UniRef100_Q9LVQ2 Retroelement pol polyprotein-like [Arabidopsis ...   377  e-103
UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidop...   375  e-102
UniRef100_Q8W153 Polyprotein [Oryza sativa]                           370  e-100
UniRef100_Q7X6S0 OSJNBb0011N17.2 protein [Oryza sativa]               369  e-100
UniRef100_Q9C692 Polyprotein, putative [Arabidopsis thaliana]         335  3e-90
UniRef100_Q9MAJ8 F27F5.19 [Arabidopsis thaliana]                      305  3e-81
UniRef100_Q9ZPG3 F5K24.2 protein [Arabidopsis thaliana]               304  6e-81
UniRef100_Q9SJ99 Putative retroelement pol polyprotein [Arabidop...   303  2e-80
UniRef100_O65452 LTR retrotransposon like protein [Arabidopsis t...   301  5e-80
UniRef100_Q9FL75 Retroelement pol polyprotein-like [Arabidopsis ...   301  5e-80
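
Note: the summary table above can be read programmatically. The following is a minimal sketch only, assuming each row has the form "identifier, description, bit score, E-value" separated by whitespace, as it does in this report. The names HIT_RE and parse_hit_line are hypothetical and are not part of BLAST. E-values printed without a mantissa (for example e-117) are treated here as 1e-117, which is an assumption about how this BLAST version abbreviates them.

```python
import re

# Hypothetical pattern for one row of the
# "Sequences producing significant alignments" table:
#   <id> <description ...> <bit score> <E-value>
HIT_RE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+(\S+)\s*$")


def parse_hit_line(line):
    """Return (id, description, bits, evalue) or None if the line is not a hit row."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    seq_id, description, bits, evalue = m.groups()
    # Assumption: a bare exponent such as "e-117" stands for "1e-117".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return seq_id, description, int(bits), float(evalue)


if __name__ == "__main__":
    row = ("UniRef100_Q9FJV3 Retroelement pol polyprotein-like "
           "[Arabidopsis ...   424  e-117")
    print(parse_hit_line(row))
```

Descriptions truncated with "..." in the table are kept as printed; the full description appears on the ">" line of the corresponding alignment record.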

>UniRef100_Q9FJV3 Retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1475

 Score =  424 bits (1089), Expect = e-117
 Identities = 227/555 (40%), Positives = 340/555 (60%), Gaps = 38/555 (6%)

Query: 262  LPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSGIKYPISNYMSY 321
            + +P S S ++    ++R +RP   P +L++Y  + V    K       ++YP++ Y++Y
Sbjct: 890  IENPPSTSESAPKVSSKRESRP---PGYLQDYFCNAVPDVTK------DVRYPLNAYINY 940

Query: 322  SNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVP 381
            + LS    AY  +++   EP TYA+A K K W+DAM +EI ALE+  TWS+  LP    P
Sbjct: 941  TQLSEEFTAYICAVNKYPEPCTYAQAKKIKEWLDAMEIEIDALESTNTWSVCSLPQGKKP 1000

Query: 382  IDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVN 441
            I  KWV+K+K  A+G++ER+KARLVA GY Q EG+ Y+DTFSP AK+T V+ +L++A++ 
Sbjct: 1001 IGCKWVFKVKLNADGSLERFKARLVAKGYTQREGLDYYDTFSPVAKMTTVKTLLSVAAIK 1060

Query: 442  NWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDSG-----KVCKLHKSSYGLKQASRQW 496
             W LHQLD++NAFL GDL E++YM +P G S    G      V KL KS YGLKQASRQW
Sbjct: 1061 EWSLHQLDISNAFLNGDLKEEIYMTLPPGYSMKQGGVLPQNPVLKLQKSLYGLKQASRQW 1120

Query: 497  YAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDN 556
            Y KF+S L   G+K++H+DH+LF++  G+++  LL+YVDDI++AGN  +    +K  L  
Sbjct: 1121 YLKFSSTLKKLGFKKSHADHTLFTRISGKAYIALLVYVDDIVIAGNNDENIEELKKDLAK 1180

Query: 557  AFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQ 616
            AFK++DLG +KYFLGLE++ + +GIS+CQRKY ++L+ D+G+LG +P + P++PS +LSQ
Sbjct: 1181 AFKLRDLGPMKYFLGLEIARTKEGISVCQRKYTMELLEDTGLLGCRPSTIPMEPSLKLSQ 1240

Query: 617  DGGGATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSE 676
                            H       Y ++   L         +    + LC  S + K S 
Sbjct: 1241 H------------NDEHVIDNPEVYRRLVGKLMYLTITRPDITYAINRLCQFSSSPKNSH 1288

Query: 677  RESKKRVI------------FPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNSLIC 724
             ++ ++V+            +  +    +        G CVD+RRS +  C F+G+SLI 
Sbjct: 1289 LKAAQKVVHYLKGTIGLGLFYSSKSDLCLKAYTDADWGSCVDSRRSTSGICMFLGDSLIS 1348

Query: 725  WRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAA 784
            W+SKKQ   S SS+E+ YRA+A  + E+  L  LL + Q++  KP  ++CD+ +A+HIA 
Sbjct: 1349 WKSKKQNMASSSSAESEYRAMAMGSREIAWLVKLLAEFQVKQTKPVPLFCDSTAAIHIAN 1408

Query: 785  NPVFHERTKHLEIEC 799
            N VFHERTKH+E +C
Sbjct: 1409 NAVFHERTKHIENDC 1423
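
Note: each alignment record carries a statistics line (Identities, Positives, Gaps) and Query coordinates. The sketch below, under the assumption that the statistics line has exactly the layout seen in this report, extracts the raw counts and computes the fraction of the query covered by an alignment. STATS_RE, parse_stats and query_coverage are hypothetical names used for illustration. The percentages are taken as printed and are not recomputed, since the rounding rule used by this BLAST version is not stated in the report.

```python
import re

# Hypothetical pattern for the per-alignment statistics line.
# The Gaps field is optional because ungapped alignments omit it.
STATS_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\)"
    r"(?:, Gaps = (\d+)/(\d+) \((\d+)%\))?"
)


def parse_stats(line):
    """Return the counts and printed percentages from a statistics line, or None."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    g = [int(x) if x is not None else 0 for x in m.groups()]
    return {
        "identities": g[0],
        "align_len": g[1],
        "identity_pct": g[2],
        "positives": g[3],
        "positive_pct": g[5],
        "gaps": g[6],
        "gap_pct": g[8],
    }


def query_coverage(q_start, q_end, q_len):
    """Fraction of the query spanned by an alignment (1-based inclusive coordinates)."""
    return (q_end - q_start + 1) / q_len


if __name__ == "__main__":
    line = " Identities = 227/555 (40%), Positives = 340/555 (60%), Gaps = 38/555 (6%)"
    print(parse_stats(line))
    # First alignment above spans query positions 262..799 of 799 letters.
    print(round(query_coverage(262, 799, 799), 3))
```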


>UniRef100_O65468 Hypothetical protein F21P8.50 [Arabidopsis thaliana]
          Length = 1240

 Score =  421 bits (1083), Expect = e-116
 Identities = 246/584 (42%), Positives = 343/584 (58%), Gaps = 63/584 (10%)

Query: 237 ASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLH 296
           AS+S+S  D+   +PSA+   +Q  +P P        S HT    R   +P++L++Y  H
Sbjct: 7   ASTSSSSIDI---MPSAN---IQNDVPEP--------SVHTSH--RRTRKPAYLQDYYCH 50

Query: 297 TVSSSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDA 356
           +V+S            + IS ++SY  +S  +H++ + ++   EP TY EA +   W  A
Sbjct: 51  SVASLTI---------HDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGA 101

Query: 357 MNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGI 416
           M+ EI A+E   TW +  LPPN  PI  KWVYKIK  ++GT+ERYKARLVA GY Q EGI
Sbjct: 102 MDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGI 161

Query: 417 YYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSC--- 473
            + +TFSP  KLT V+++LA++++ N+ LHQLD++NAFL GDL E++YMK+P G +    
Sbjct: 162 DFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQG 221

Query: 474 --VDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILL 531
             +    VC L KS YGLKQASRQW+ KF+  L+  G+ Q+HSDH+ F K     F  +L
Sbjct: 222 DSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL 281

Query: 532 IYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLD 591
           +YVDDII+  N       +K+ L + FK++DLG LKYFLGLE++ SA GI++CQRKY LD
Sbjct: 282 VYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALD 341

Query: 592 LVHDSGVLGSKPVSTPLDPSSRLSQDGGGATL*GCFFIQKTHRKTV-LSYYNQV*YNLCS 650
           L+ ++G+LG KP S P+DPS   S   GG      F   K +R+ +    Y Q+     S
Sbjct: 342 LLDETGLLGCKPSSVPMDPSVTFSAHSGGD-----FVDAKAYRRLIGRLMYLQITRLDIS 396

Query: 651 SAAKSVPLQSYCDSLCCSS*NSKV---------------SERESKKRVIFPKEFCSTIVG 695
            A   +   S    L       K+               S+ E + +V     F S    
Sbjct: 397 FAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQS---- 452

Query: 696 V**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RL 755
                   C DTRRS   YC F+G SLI W+SKKQQ +SKSS+EA YRAL+ AT E+  L
Sbjct: 453 --------CKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWL 504

Query: 756 TYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
               ++LQ+   KP++++CDN +A+HIA N VFHERTKH+E +C
Sbjct: 505 AQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDC 548


>UniRef100_Q9FX79 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score =  408 bits (1048), Expect = e-112
 Identities = 232/557 (41%), Positives = 334/557 (59%), Gaps = 49/557 (8%)

Query: 261  PLPSPE-SPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSGIKYPISNYM 319
            PLP  E S S         R +RP   P++L++Y  ++V+SS           +PIS  +
Sbjct: 846  PLPVQETSASNVPAEKQNSRVSRP---PAYLKDYHCNSVTSSTD---------HPISEVL 893

Query: 320  SYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNV 379
            SYS+LS P+  +  +++   EPHTYA+A + K W DAM +EI+ALE NGTW +  LP   
Sbjct: 894  SYSSLSDPYMIFINAVNKIPEPHTYAQARQIKEWCDAMGMEITALEDNGTWVVCSLPVGK 953

Query: 380  VPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALAS 439
              +  KWVYKIK  A+G++ERYKARLVA GY Q EG+ Y DTFSP AKLT V++++A+A+
Sbjct: 954  KAVGCKWVYKIKLNADGSLERYKARLVAKGYTQTEGLDYVDTFSPVAKLTTVKLLIAVAA 1013

Query: 440  VNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKSSYGLKQASR 494
               W L QLD++NAFL G L E++YM +P G S           VC+L KS YGLKQASR
Sbjct: 1014 AKGWSLSQLDISNAFLNGSLDEEIYMTLPPGYSPRQGDSFPPNAVCRLKKSLYGLKQASR 1073

Query: 495  QWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAAL 554
            QWY KF+  L   G+ Q+  DH+LF++    S+  +L+YVDDII+A +   E   ++ AL
Sbjct: 1074 QWYLKFSESLKALGFTQSSGDHTLFTRKSKNSYMAVLVYVDDIIIASSCDRETELLRDAL 1133

Query: 555  DNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRL 614
              + K++DLG L+YFLGLE++ +  GIS+CQRKY L+L+ ++G+LG K  S P++P+ +L
Sbjct: 1134 QRSSKLRDLGTLRYFLGLEIARNTDGISICQRKYTLELLAETGLLGCKSSSVPMEPNQKL 1193

Query: 615  SQDGGGATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKV 674
            SQ+ G                    +Y ++   L         +      LC  +   +V
Sbjct: 1194 SQEDGELI-------------DDAEHYRKLVGKLMYLTFTRPDITYAVHRLCQFTSAPRV 1240

Query: 675  SERESKKRVIFPKE-------FCSTIVGV**CRLGG--------CVDTRRSVTSYCFFIG 719
               ++  ++I+  +       F S  V +   +L G        C D+R+  T YC F+G
Sbjct: 1241 PHLKAVYKIIYYLKGTVGQGLFYSANVDL---KLSGFADSDFSSCSDSRKLTTGYCMFLG 1297

Query: 720  NSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSA 779
             SL+ W+SKKQ+ IS SS+EA Y+A++ A  E+  L +LL+DL I+  + SV+YCDN +A
Sbjct: 1298 TSLVAWKSKKQEVISMSSAEAEYKAMSMAVREMMWLRFLLEDLWIDVSEASVLYCDNTAA 1357

Query: 780  LHIAANPVFHERTKHLE 796
            +HIA NPVFHERTKH+E
Sbjct: 1358 IHIANNPVFHERTKHIE 1374


>UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1454

 Score =  396 bits (1018), Expect = e-108
 Identities = 226/564 (40%), Positives = 323/564 (57%), Gaps = 45/564 (7%)

Query: 253  ADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSGIK 312
            +D T     LPS  S     +S       R R  P+HL +Y  +T+ S  K         
Sbjct: 866  SDTTHSPSSLPSQISDLPPQISSQ-----RVRKPPAHLNDYHCNTMQSDHK--------- 911

Query: 313  YPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSL 372
            YPIS+ +SYS +S  H  Y  +++    P  YAEA   K W +A++ EI A+E   TW +
Sbjct: 912  YPISSTISYSKISPSHMCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTWEI 971

Query: 373  VPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVR 432
              LP     +  KWV+ +K  A+G +ERYKARLVA GY Q EG+ Y DTFSP AK+T ++
Sbjct: 972  TTLPKGKKAVGCKWVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTTIK 1031

Query: 433  MVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKSSY 487
            ++L +++   W L QLDV+NAFL G+L E+++MK+PEG +      + S  V +L +S Y
Sbjct: 1032 LLLKVSASKKWFLKQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPSNVVLRLKRSIY 1091

Query: 488  GLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEF 547
            GLKQASRQW+ KF+S L++ G+K+ H DH+LF K     F I+L+YVDDI++A       
Sbjct: 1092 GLKQASRQWFKKFSSSLLSLGFKKTHGDHTLFLKMYDGEFVIVLVYVDDIVIASTSEAAA 1151

Query: 548  TRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTP 607
             ++   LD  FK++DLG LKYFLGLEV+ +  GIS+CQRKY L+L+  +G+L  KPVS P
Sbjct: 1152 AQLTEELDQRFKLRDLGDLKYFLGLEVARTTAGISICQRKYALELLQSTGMLACKPVSVP 1211

Query: 608  LDPSSRLSQDGGGATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCC 667
            + P+ ++ +D G                  +  Y ++   L         +    + LC 
Sbjct: 1212 MIPNLKMRKDDGDLI-------------EDIEQYRRIVGKLMYLTITRPDITFAVNKLCQ 1258

Query: 668  SS*NSKVSERESKKRVI------------FPKEFCSTIVGV**CRLGGCVDTRRSVTSYC 715
             S   + +   +  RV+            +      T+ G        C D+RRS TS+ 
Sbjct: 1259 FSSAPRTTHLTAAYRVLQYIKGTVGQGLFYSASSDLTLKGFADSDWASCQDSRRSTTSFT 1318

Query: 716  FFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCD 775
             F+G+SLI WRSKKQ T+S+SS+EA YRALA ATCE+  L  LL  LQ  P  P ++Y D
Sbjct: 1319 MFVGDSLISWRSKKQHTVSRSSAEAEYRALALATCEMVWLFTLLVSLQASPPVP-ILYSD 1377

Query: 776  NQSALHIAANPVFHERTKHLEIEC 799
            + +A++IA NPVFHERTKH++++C
Sbjct: 1378 STAAIYIATNPVFHERTKHIKLDC 1401


>UniRef100_Q8L700 Hypothetical protein [Arabidopsis thaliana]
          Length = 776

 Score =  392 bits (1008), Expect = e-107
 Identities = 236/596 (39%), Positives = 341/596 (56%), Gaps = 42/596 (7%)

Query: 235 VPASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYV 294
           VP+SS +   D S S  SA  T+  +      +PS+  + +   +  R + +   L+++V
Sbjct: 140 VPSSSPSRSIDRSTSDLSASDTTELLSTGESSTPSSPGLPELLGKGCREKKKSVLLKDFV 199

Query: 295 LHTVS---------------------SSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAM 333
            +T S                     +S  A   S    YP+S++++ S  S  H A+  
Sbjct: 200 TNTTSKKKTASHNIHSPSQVLPSGLPTSLSADSVSGKTLYPLSDFLTNSGYSANHIAFMA 259

Query: 334 SLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRR 393
           ++   +EP  + +A   K W +AM+ EI ALEAN TW +  LP     I +KWVYK+K  
Sbjct: 260 AILDSNEPKHFKDAILIKEWCEAMSKEIDALEANHTWDITDLPHGKKAISSKWVYKLKYN 319

Query: 394 ANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNA 453
           ++GT+ER+KARLV  G +Q EG+ + +TF+P AKLT VR +LA+A+  +W +HQ+DV+NA
Sbjct: 320 SDGTLERHKARLVVMGNHQKEGVDFKETFAPVAKLTTVRTILAVAAAKDWEVHQMDVHNA 379

Query: 454 FLLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAH 513
           FL GDL E+VYM++P G  C D  KVC+L KS YGLKQA R W++K ++ L   G+ Q++
Sbjct: 380 FLHGDLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLKQAPRCWFSKLSTALRNIGFTQSY 439

Query: 514 SDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLE 573
            D+SLFS   G +   +L+YVDD+I+AGN LD   R K+ L   F +KDLG LKYFLGLE
Sbjct: 440 EDYSLFSLKNGDTIIHVLVYVDDLIVAGNNLDAIDRFKSQLHKCFHMKDLGKLKYFLGLE 499

Query: 574 VSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGGGA--------TL*G 625
           VS    G  L QRKY LD+V ++G+LG KP + P+  + +L+   G           L G
Sbjct: 500 VSRGPDGFCLSQRKYALDIVKETGLLGCKPSAVPIALNHKLASITGPVFTNPEQYRRLVG 559

Query: 626 CFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSERESKKRVIF 685
             FI  T  +  LSY   +      S     PL ++ ++        K S  +     IF
Sbjct: 560 -RFIYLTITRPDLSYAVHI-----LSQFMQAPLVAHWEAALRLVRYLKGSPAQG----IF 609

Query: 686 PKEFCSTIVGV**C--RLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYR 743
            +   S I+    C      C  TRRS+++Y  ++G+S I W++KKQ T+S SS+EA YR
Sbjct: 610 LRSDSSLIINA-YCDSDYNACPLTRRSLSAYVVYLGDSPISWKTKKQDTVSYSSAEAEYR 668

Query: 744 ALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
           A+A    EL  L  LLKDL +    P  ++CD+++A+HIAANPVFHERTKH+E +C
Sbjct: 669 AMAYTLKELKWLKALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHERTKHIESDC 724


>UniRef100_O22175 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1496

 Score =  392 bits (1008), Expect = e-107
 Identities = 236/596 (39%), Positives = 341/596 (56%), Gaps = 42/596 (7%)

Query: 235  VPASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYV 294
            VP+SS +   D S S  SA  T+  +      +PS+  + +   +  R + +   L+++V
Sbjct: 860  VPSSSPSRSIDRSTSDLSASDTTELLSTGESSTPSSPGLPELLGKGCREKKKSVLLKDFV 919

Query: 295  LHTVS---------------------SSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAM 333
             +T S                     +S  A   S    YP+S++++ S  S  H A+  
Sbjct: 920  TNTTSKKKTASHNIHSPSQVLPSGLPTSLSADSVSGKTLYPLSDFLTNSGYSANHIAFMA 979

Query: 334  SLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRR 393
            ++   +EP  + +A   K W +AM+ EI ALEAN TW +  LP     I +KWVYK+K  
Sbjct: 980  AILDSNEPKHFKDAILIKEWCEAMSKEIDALEANHTWDITDLPHGKKAISSKWVYKLKYN 1039

Query: 394  ANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNA 453
            ++GT+ER+KARLV  G +Q EG+ + +TF+P AKLT VR +LA+A+  +W +HQ+DV+NA
Sbjct: 1040 SDGTLERHKARLVVMGNHQKEGVDFKETFAPVAKLTTVRTILAVAAAKDWEVHQMDVHNA 1099

Query: 454  FLLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAH 513
            FL GDL E+VYM++P G  C D  KVC+L KS YGLKQA R W++K ++ L   G+ Q++
Sbjct: 1100 FLHGDLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLKQAPRCWFSKLSTALRNIGFTQSY 1159

Query: 514  SDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLE 573
             D+SLFS   G +   +L+YVDD+I+AGN LD   R K+ L   F +KDLG LKYFLGLE
Sbjct: 1160 EDYSLFSLKNGDTIIHVLVYVDDLIVAGNNLDAIDRFKSQLHKCFHMKDLGKLKYFLGLE 1219

Query: 574  VSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGGGA--------TL*G 625
            VS    G  L QRKY LD+V ++G+LG KP + P+  + +L+   G           L G
Sbjct: 1220 VSRGPDGFCLSQRKYALDIVKETGLLGCKPSAVPIALNHKLASITGPVFTNPEQYRRLVG 1279

Query: 626  CFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSERESKKRVIF 685
              FI  T  +  LSY   +      S     PL ++ ++        K S  +     IF
Sbjct: 1280 -RFIYLTITRPDLSYAVHI-----LSQFMQAPLVAHWEAALRLVRYLKGSPAQG----IF 1329

Query: 686  PKEFCSTIVGV**C--RLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYR 743
             +   S I+    C      C  TRRS+++Y  ++G+S I W++KKQ T+S SS+EA YR
Sbjct: 1330 LRSDSSLIINA-YCDSDYNACPLTRRSLSAYVVYLGDSPISWKTKKQDTVSYSSAEAEYR 1388

Query: 744  ALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
            A+A    EL  L  LLKDL +    P  ++CD+++A+HIAANPVFHERTKH+E +C
Sbjct: 1389 AMAYTLKELKWLKALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHERTKHIESDC 1444


>UniRef100_Q9SIM3 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1461

 Score =  390 bits (1002), Expect = e-107
 Identities = 230/586 (39%), Positives = 344/586 (58%), Gaps = 41/586 (6%)

Query: 229  LLPETSVPASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPS 288
            L P  S   S++ + D  +   P +   S+   LPSP+   +T +S   RR T+    P+
Sbjct: 849  LFPLASSQQSATTASDVFTPMDPLSSGNSITSHLPSPQISPSTQISK--RRITK---FPA 903

Query: 289  HLRNYVLHTVSSSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEAS 348
            HL++Y  + V+             +PIS+ +SYS +S  H  Y  ++S    P +Y EA 
Sbjct: 904  HLQDYHCYFVNKDDS---------HPISSSLSYSQISPSHMLYINNISKIPIPQSYHEAK 954

Query: 349  KHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA* 408
              K W  A++ EI A+E   TW +  LPP    +  KWV+ +K  A+G++ER+KAR+VA 
Sbjct: 955  DSKEWCGAIDQEIGAMERTDTWEITSLPPGKKAVGCKWVFTVKFHADGSLERFKARIVAK 1014

Query: 409  GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVP 468
            GY Q EG+ Y +TFSP AK+  V+++L +++   W+L+QLD++NAFL GDL E +YMK+P
Sbjct: 1015 GYTQKEGLDYTETFSPVAKMATVKLLLKVSASKKWYLNQLDISNAFLNGDLEETIYMKLP 1074

Query: 469  EGVSCVDS-----GKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQ 523
            +G + +         VC+L KS YGLKQASRQW+ KF++ L+  G+++ H DH+LF +  
Sbjct: 1075 DGYADIKGTSLPPNVVCRLKKSIYGLKQASRQWFLKFSNSLLALGFEKQHGDHTLFVRCI 1134

Query: 524  GQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISL 583
            G  F +LL+YVDDI++A         +  AL  +FK+++LG LKYFLGLEV+ +++GISL
Sbjct: 1135 GSEFIVLLVYVDDIVIASTTEQAAQSLTEALKASFKLRELGPLKYFLGLEVARTSEGISL 1194

Query: 584  CQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGG--------GATL*GCFFIQKTHRK 635
             QRKY L+L+  + +L  KP S P+ P+ RLS++ G           L G        R 
Sbjct: 1195 SQRKYALELLTSADMLDCKPSSIPMTPNIRLSKNDGLLLEDKEMYRRLVGKLMYLTITRP 1254

Query: 636  TVLSYYNQV*YNLC--SSAAKSVPLQSYCDSLCCSS*NSKVSERESKKRVIFPKEFCSTI 693
             +    N+    LC  SSA ++  L +    L       +  +    + + +  E   T+
Sbjct: 1255 DITFAVNK----LCQFSSAPRTAHLAAVYKVL-------QYIKGTVGQGLFYSAEDDLTL 1303

Query: 694  VGV**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL* 753
             G      G C D+RRS T +  F+G+SLI WRSKKQ T+S+SS+EA YRALA A+CE+ 
Sbjct: 1304 KGYTDADWGTCPDSRRSTTGFTMFVGSSLISWRSKKQPTVSRSSAEAEYRALALASCEMA 1363

Query: 754  RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
             L+ LL  L++    P ++Y D+ +A++IA NPVFHERTKH+EI+C
Sbjct: 1364 WLSTLLLALRVHSGVP-ILYSDSTAAVYIATNPVFHERTKHIEIDC 1408


>UniRef100_O23588 Retrotransposon like protein [Arabidopsis thaliana]
          Length = 1433

 Score =  387 bits (994), Expect = e-106
 Identities = 212/500 (42%), Positives = 302/500 (60%), Gaps = 26/500 (5%)

Query: 315  ISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVP 374
            + ++  Y+N + P HA+  +++    P  Y+EA   K W DAM  EI A+    TWS+V 
Sbjct: 891  LQDFHCYNNTTEPFHAFINNITNAVIPQRYSEAKDFKAWCDAMKEEIGAMVRTNTWSVVS 950

Query: 375  LPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMV 434
            LPPN   I  KWV+ IK  A+G++ERYKARLVA GY Q EG+ Y +TFSP AKLT VRM+
Sbjct: 951  LPPNKKAIGCKWVFTIKHNADGSIERYKARLVAKGYTQEEGLDYEETFSPVAKLTSVRMM 1010

Query: 435  LALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKSSYGL 489
            L LA+   W +HQLD++NAFL GDL E++YMK+P G +      +    +C+LHKS YGL
Sbjct: 1011 LLLAAKMKWSVHQLDISNAFLNGDLDEEIYMKIPPGYADLVGEALPPHAICRLHKSIYGL 1070

Query: 490  KQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTR 549
            KQASRQWY K ++ L   G++++++DH+LF K        +L+YVDDI++  N  D   +
Sbjct: 1071 KQASRQWYLKLSNTLKGMGFQKSNADHTLFIKYANGVLMGVLVYVDDIMIVSNSDDAVAQ 1130

Query: 550  IKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLD 609
              A L + FK++DLG  KYFLG+E++ S KGIS+CQRKY L+L+  +G LGSKP S PLD
Sbjct: 1131 FTAELKSYFKLRDLGAAKYFLGIEIARSEKGISICQRKYILELLSTTGFLGSKPSSIPLD 1190

Query: 610  PSSRLSQDGG--------GATL*GCFFIQKTHRKTVLSYYNQV*YNLC--SSAAKSVPLQ 659
            PS +L+++ G           L G     +  R  +    N     LC  S A  SV L 
Sbjct: 1191 PSVKLNKEDGVPLTDSTSYRKLVGKLMYLQITRPDIAYAVN----TLCQFSHAPTSVHLS 1246

Query: 660  SYCDSLCCSS*NSKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIG 719
            +    L       +  +    + + +  +    + G      G C D+RR V +YC FIG
Sbjct: 1247 AVHKVL-------RYLKGTVGQGLFYSADDKFDLRGYTDSDFGSCTDSRRCVAAYCMFIG 1299

Query: 720  NSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSA 779
            + L+ W+SKKQ T+S S++EA +RA++  T E+  L+ L  D ++  + P+ +YCDN +A
Sbjct: 1300 DYLVSWKSKKQDTVSMSTAEAEFRAMSQGTKEMIWLSRLFDDFKVPFIPPAYLYCDNTAA 1359

Query: 780  LHIAANPVFHERTKHLEIEC 799
            LHI  N VFHERTK +E++C
Sbjct: 1360 LHIVNNSVFHERTKFVELDC 1379


>UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana]
          Length = 1315

 Score =  385 bits (990), Expect = e-105
 Identities = 232/569 (40%), Positives = 329/569 (57%), Gaps = 47/569 (8%)

Query: 251  PSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSG 310
            PS  ++SV++ LPS  +P+        +   R   +P++L++Y  H+V SS         
Sbjct: 722  PSDSSSSVEI-LPSA-NPTNNVPEPSVQTSHRKAKKPAYLQDYYCHSVVSSTP------- 772

Query: 311  IKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTW 370
              + I  ++SY  ++ P+  +   L    EP  Y EA K + W DAM  E   LE   TW
Sbjct: 773  --HEIRKFLSYDRINDPYLTFLACLDKTKEPSNYTEAEKLQVWRDAMGAEFDFLEGTHTW 830

Query: 371  SLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTI 430
             +  LP +   I  +W++KIK  ++G+VERYKARLVA GY Q EGI Y +TFSP AKL  
Sbjct: 831  EVCSLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNS 890

Query: 431  VRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKS 485
            V+++L +A+     L QLD++NAFL GDL E++YM++P+G +      +    VC+L KS
Sbjct: 891  VKLLLGVAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQGDSLPPNAVCRLKKS 950

Query: 486  SYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLD 545
             YGLKQASRQWY KF+S L+  G+ Q++ DH+ F K     F  +L+Y+DDII+A N   
Sbjct: 951  LYGLKQASRQWYLKFSSTLLGLGFIQSYCDHTCFLKISDGIFLCVLVYIDDIIIASNNDA 1010

Query: 546  EFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVS 605
                +K+ + + FK++DLG LKYFLGLE+  S KGI + QRKY LDL+ ++G LG KP S
Sbjct: 1011 AVDILKSQMKSFFKLRDLGELKYFLGLEIVRSDKGIHISQRKYALDLLDETGQLGCKPSS 1070

Query: 606  TPLDPSSRLSQDGGG--------ATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAKSVP 657
             P+DPS   + D GG          L G        R  +    N++     S A +   
Sbjct: 1071 IPMDPSMVFAHDSGGDFVEVGPYRRLIGRLMYLNITRPDITFAVNKL--AQFSMAPRKAH 1128

Query: 658  LQSYCDSLCCSS*N-------SKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRS 710
            LQ+    L             S  SE + K  V    ++ S       CR     D+RRS
Sbjct: 1129 LQAVYKILQYIKGTIGQGLFYSATSELQLK--VYANADYNS-------CR-----DSRRS 1174

Query: 711  VTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPS 770
             + YC F+G+SLICW+S+KQ  +SKSS+EA YR+L+ AT EL  LT  LK+LQ+   KP+
Sbjct: 1175 TSGYCMFLGDSLICWKSRKQDVVSKSSAEAEYRSLSVATDELVWLTNFLKELQVPLSKPT 1234

Query: 771  VIYCDNQSALHIAANPVFHERTKHLEIEC 799
            +++CDN++A+HIA N VFHERTKH+E +C
Sbjct: 1235 LLFCDNEAAIHIANNHVFHERTKHIESDC 1263


>UniRef100_Q5XWR5 Putative retroelement pol polyprotein-like [Solanum tuberosum]
          Length = 1476

 Score =  379 bits (972), Expect = e-103
 Identities = 228/577 (39%), Positives = 322/577 (55%), Gaps = 27/577 (4%)

Query: 240  SNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVS 299
            S+  +D     P+   +   +P+ SP S    +VSD    P   R +        +    
Sbjct: 840  SSHTEDADAVQPAIITSEEIIPVASPPS----AVSDDHLHPPPERRRSYRTGKPPIWQKD 895

Query: 300  SSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNL 359
                ++  S+   YPIS+ + YS LS  +  Y  S S+++EP  Y +A+    WV AM  
Sbjct: 896  FITTSTSRSNHCLYPISDNIDYSCLSSTYQCYIASSSVETEPQFYYQAANDCRWVHAMKE 955

Query: 360  EISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYF 419
            EI ALE N TW +V LP     I  KWVYKIK +A+G +ER+KARLVA GYNQ EG+ Y 
Sbjct: 956  EIQALEDNKTWEVVSLPKGKKAIGCKWVYKIKYKASGEIERFKARLVAKGYNQKEGLDYQ 1015

Query: 420  DTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDSG-- 477
            +TFSP  K+  +R VL LA    W + Q+DV NAFL GDL E+VYM++P+G     +G  
Sbjct: 1016 ETFSPVVKMVTLRTVLTLAVSKGWDIQQMDVYNAFLQGDLIEEVYMQLPQGFQYDKTGDP 1075

Query: 478  KVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDI 537
            KVC+L KS YGLKQASRQW  K T+ L+  G++Q+H D+SL  K       I+LIYVDD+
Sbjct: 1076 KVCRLLKSLYGLKQASRQWNVKLTTALLAAGFQQSHLDYSLMLKRTADGIVIVLIYVDDL 1135

Query: 538  ILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSG 597
            ++ G+ L      K  L   FKIKDLG L+YFLG+E + +A G+ + QRKY L+L+ D G
Sbjct: 1136 LITGSSLQLIDDAKQVLKANFKIKDLGTLRYFLGMEFARNASGMLMHQRKYALELISDLG 1195

Query: 598  VLGSKPVSTPLDPSSRLSQDGGGATL*GCFFIQKTHRKTVL---SYYNQV*YNLCSSAAK 654
            + GSKP  TP++   +L+      T      +  +   ++L   + Y ++   L      
Sbjct: 1196 LGGSKPSVTPVELHLKLT------TREFDLHVGSSGADSLLADPTEYQRLVGRLLYLTIT 1249

Query: 655  SVPLQSYCDSLCCSS*NSKVSERESKKRVI------------FPKEFCSTIVGV**CRLG 702
               +      L       KVS  E+  RV+               +   T+        G
Sbjct: 1250 RPDISFAVQHLSQFMHAPKVSHMEAAIRVVKYVKQAPGLGLYMAVQTADTLQAYCDADWG 1309

Query: 703  GCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDL 762
             C++TR+S+T Y    G++L+ W+SKKQ TIS+SS+EA YR+LAS   EL  LT L K+L
Sbjct: 1310 SCINTRKSITGYMIQFGSALLSWKSKKQPTISRSSAEAEYRSLASTVAELVWLTGLFKEL 1369

Query: 763  QIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
             +    P  +YCD+++A+ IAANPVFHERTKH++I+C
Sbjct: 1370 DMPLSLPVSLYCDSKAAIQIAANPVFHERTKHIDIDC 1406


>UniRef100_Q9LVQ2 Retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score =  377 bits (969), Expect = e-103
 Identities = 232/589 (39%), Positives = 328/589 (55%), Gaps = 38/589 (6%)

Query: 236  PASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHT--RRPTRPRHQPSHLRNY 293
            P S S S+        S+ +TS    + SP    TT + ++T  R+  R   Q + L++Y
Sbjct: 864  PLSPSTSVTPTQTPTNSSSSTSPSTNV-SPPQQDTTPIIENTPPRQGKRQVQQLARLKDY 922

Query: 294  VLHTVSS--------SCKASQTSSGIK----YPISNYMSYSNLSIPHHAYAMSLSLDSEP 341
            +L+  S         S   SQ+SS I+    YP+++Y+     S  H  +  +++ + EP
Sbjct: 923  ILYNASCTPNTPHVLSPSTSQSSSSIQGNSQYPLTDYIFDECFSAGHKVFLAAITANDEP 982

Query: 342  HTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERY 401
              + EA K K W DAM  E+ ALE N TW +V LP   V I ++WVYK K  A+GTVERY
Sbjct: 983  KHFKEAVKVKVWNDAMYKEVDALEVNKTWDIVDLPTGKVAIGSQWVYKTKFNADGTVERY 1042

Query: 402  KARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYE 461
            KARLV  G NQIEG  Y +TF+P  K+T VR +L L + N W ++Q+DV+NAFL GDL E
Sbjct: 1043 KARLVVQGNNQIEGEDYTETFAPVVKMTTVRTLLRLVAANQWEVYQMDVHNAFLHGDLEE 1102

Query: 462  DVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSK 521
            +VYMK+P G       KVC+L KS YGLKQA R W+ K +  L   G+ Q + D+S FS 
Sbjct: 1103 EVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLSDALKRFGFIQGYEDYSFFSY 1162

Query: 522  TQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGI 581
            +       +L+YVDD+I+ GN      + K  L   F +KDLG LKYFLG+EVS    GI
Sbjct: 1163 SCKGIELRVLVYVDDLIICGNDEYMVQKFKEYLGRCFSMKDLGKLKYFLGIEVSRGPDGI 1222

Query: 582  SLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGGGATL*GCFF-------IQKTHR 634
             L QRKY LD++ DSG LG++P  TPL+ +  L+ D G        F       +   H 
Sbjct: 1223 FLSQRKYALDIISDSGTLGARPAYTPLEQNHHLASDDGPLLQDPKPFRRLVGRLLYLLHT 1282

Query: 635  KTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSERE----SKKRVIFPKEFC 690
            +  LSY   V      S     P +++ ++        K S  +    S  + +  + +C
Sbjct: 1283 RPELSYSVHV-----LSQFMQAPREAHLEAAMRIVRYLKGSPGQGILLSSNKDLTLEVYC 1337

Query: 691  STIVGV**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATC 750
             +           C  TRRS+++Y   +G S I W++KKQ T+S SS+EA YRA++ A  
Sbjct: 1338 DS-------DFQSCPLTRRSLSAYVVLLGGSPISWKTKKQDTVSHSSAEAEYRAMSVALK 1390

Query: 751  EL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
            E+  L  LLK+L I    P+ ++CD+++A+ IAANPVFHERTKH+E +C
Sbjct: 1391 EIKWLNKLLKELGITLAAPTRLFCDSKAAISIAANPVFHERTKHIERDC 1439


>UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1501

 Score =  375 bits (963), Expect = e-102
 Identities = 235/592 (39%), Positives = 330/592 (55%), Gaps = 41/592 (6%)

Query: 230  LPETSVPASSSNSLDDLSVS---IPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQ 286
            +P+ + P+S       LSVS    P+   T + VP+ SP   S        R+  R  H 
Sbjct: 877  VPDDTPPSSP------LSVSPSGSPNTPTTPIVVPVASPIPVSPPK----QRKSKRATHP 926

Query: 287  PSHLRNYVL----------HTVSSSCKASQTSSGIK-YPISNYMSYSNLSIPHHAYAMSL 335
            P  L +YVL          H + +    S T  G   +P+++Y+S +  S  H AY  ++
Sbjct: 927  PPKLNDYVLYNAMYTPSSIHALPADPSQSSTVPGKSLFPLTDYVSDAAFSSSHRAYLAAI 986

Query: 336  SLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRAN 395
            + + EP  + EA + K W DAM  E+ ALE N TW +V LPP  V I ++WV+K K  ++
Sbjct: 987  TDNVEPKHFKEAVQIKVWNDAMFTEVDALEINKTWDIVDLPPGKVAIGSQWVFKTKYNSD 1046

Query: 396  GTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFL 455
            GTVERYKARLV  G  Q+EG  Y +TF+P  ++T VR +L   + N W ++Q+DV+NAFL
Sbjct: 1047 GTVERYKARLVVQGNKQVEGEDYKETFAPVVRMTTVRTLLRNVAANQWEVYQMDVHNAFL 1106

Query: 456  LGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSD 515
             GDL E+VYMK+P G       KVC+L KS YGLKQA R W+ K +  L+  G+ Q++ D
Sbjct: 1107 HGDLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLSDSLLRFGFVQSYED 1166

Query: 516  HSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVS 575
            +SLFS T+      +LIYVDD+++ GN      + K  L   F +KDLG LKYFLG+EVS
Sbjct: 1167 YSLFSYTRNNIELRVLIYVDDLLICGNDGYMLQKFKDYLSRCFSMKDLGKLKYFLGIEVS 1226

Query: 576  HSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPL--------DPSSRLSQDGGGATL*GCF 627
               +GI L QRKY LD++ DSG LGS+P  TPL        D    LS       L G  
Sbjct: 1227 RGPEGIFLSQRKYALDVIADSGNLGSRPAHTPLEQNHHLASDDGPLLSDPKPYRRLVGRL 1286

Query: 628  FIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSYCDSLCCSS*NSKVSERESKKRVIFPK 687
             +   H +  LSY   V      +     P +++ D+        K S  +    ++   
Sbjct: 1287 -LYLLHTRPELSYSVHVLAQFMQN-----PREAHFDAALRVVRYLKGSPGQG---ILLNA 1337

Query: 688  EFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALAS 747
            +   T+          C  TRRS+++Y   +G S I W++KKQ T+S SS+EA YRA++ 
Sbjct: 1338 DPDLTLEVYCDSDWQSCPLTRRSISAYVVLLGGSPISWKTKKQDTVSHSSAEAEYRAMSY 1397

Query: 748  ATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
            A  E+  L  LLK+L IE   P+ +YCD+++A+HIAANPVFHERTKH+E +C
Sbjct: 1398 ALKEIKWLRKLLKELGIEQSTPARLYCDSKAAIHIAANPVFHERTKHIESDC 1449


>UniRef100_Q8W153 Polyprotein [Oryza sativa]
          Length = 1472

 Score =  370 bits (949), Expect = e-100
 Identities = 205/497 (41%), Positives = 295/497 (59%), Gaps = 15/497 (3%)

Query: 309  SGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANG 368
            SG +  I+NY+SY++LS  + A+  SL+    P  + EA +   W  AM  E+ ALE N 
Sbjct: 931  SGDENDIANYVSYTSLSSTYRAFVASLNSAIIPKDWKEAKQDPRWHQAMLDELEALEKNK 990

Query: 369  TWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKL 428
            TW LV  P     ++ KWVY +K+  +G VERYKARLVA GY+Q  GI Y +TF+P AK+
Sbjct: 991  TWDLVSYPNGKKVVNCKWVYAVKQNPDGKVERYKARLVAKGYSQTYGIDYDETFAPVAKM 1050

Query: 429  TIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDS-GKVCKLHKSSY 487
            + VR +++ A   +W LHQLDV NAFL GDL E+VYM++P G + + + GKV +L KS Y
Sbjct: 1051 STVRTIISCAVNFDWPLHQLDVKNAFLHGDLQEEVYMEIPPGFATLQTKGKVLRLKKSLY 1110

Query: 488  GLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEF 547
            GLKQ+ R W+ +F   +   GYKQ + DH++F    G   TIL +YVDD+I+ GN   E 
Sbjct: 1111 GLKQSPRAWFDRFRRAMCAMGYKQCNGDHTVFYHHSGDHITILAVYVDDMIITGNDCSEI 1170

Query: 548  TRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTP 607
            TR+K  L   F++KDLG LKYFLG+E++ S +GI L QRKY LDL+ D+G+LG +P STP
Sbjct: 1171 TRLKQNLSKEFEVKDLGQLKYFLGIEIARSPRGIVLSQRKYALDLLSDTGMLGCRPASTP 1230

Query: 608  LDPSSRLSQDGGGATL*GCF------FIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSY 661
            +D + +L  + G       +       I   H +  ++Y   +      S     P   +
Sbjct: 1231 VDQNHKLCAESGNPVNKERYQRLVGRLIYLCHTRPDITYAVSM-----VSRYMHDPRSGH 1285

Query: 662  CDSLCCSS*NSKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNS 721
             D++       +  +    K + F K     + G        C D RRS + YC F+G +
Sbjct: 1286 MDAVYRI---LRYLKGSPGKGLWFKKNGHLEVEGYCDAHWASCPDDRRSTSGYCVFVGGN 1342

Query: 722  LICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALH 781
            L+ WRSKKQ  +S+S++EA YRA++ +  EL  L  LL +L +    P  ++CDN+SA+ 
Sbjct: 1343 LVSWRSKKQPVVSRSTAEAEYRAMSVSLSELLWLRNLLSELMLPVDTPMKLWCDNKSAIS 1402

Query: 782  IAANPVFHERTKHLEIE 798
            IA NPV H+RTKH+E++
Sbjct: 1403 IANNPVQHDRTKHVELD 1419


>UniRef100_Q7X6S0 OSJNBb0011N17.2 protein [Oryza sativa]
          Length = 1262

 Score =  369 bits (946), Expect = e-100
 Identities = 205/497 (41%), Positives = 295/497 (59%), Gaps = 15/497 (3%)

Query: 309  SGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANG 368
            SG +  I+NY+SY++LS  + A+  SL+    P  + EA +   W  AM  E+ ALE N 
Sbjct: 721  SGDENDIANYVSYTSLSSTYKAFVASLNSAIIPKDWKEAKQDPRWHQAMLDELEALEKNK 780

Query: 369  TWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKL 428
            TW LV  P     ++ KWVY +K+  +G VERYKARLVA GY+Q  GI Y +TF+P AK+
Sbjct: 781  TWDLVSYPNGKKVVNCKWVYAVKQNPDGKVERYKARLVAKGYSQTYGIDYDETFAPVAKM 840

Query: 429  TIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDS-GKVCKLHKSSY 487
            + VR +++ A   +W LHQLDV NAFL GDL E+VYM++P G + + + GKV +L KS Y
Sbjct: 841  STVRTIISCAVNFDWPLHQLDVKNAFLHGDLQEEVYMEIPPGFATLQTKGKVLRLKKSLY 900

Query: 488  GLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEF 547
            GLKQ+ R W+ +F   +   GYKQ + DH++F    G   TIL +YVDD+I+ GN   E 
Sbjct: 901  GLKQSPRAWFDRFRRAMCAMGYKQCNGDHTVFYHHSGDHITILAVYVDDMIITGNDCSEI 960

Query: 548  TRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTP 607
            TR+K  L   F++KDLG LKYFLG+E++ S +GI L QRKY LDL+ D+G+LG +P STP
Sbjct: 961  TRLKQNLSKEFEVKDLGQLKYFLGIEIARSPRGIVLSQRKYALDLLSDTGMLGCRPASTP 1020

Query: 608  LDPSSRLSQDGGGATL*GCF------FIQKTHRKTVLSYYNQV*YNLCSSAAKSVPLQSY 661
            +D + +L  + G       +       I   H +  ++Y   +      S     P   +
Sbjct: 1021 VDQNHKLCAESGNPVNKERYQRLVGRLIYLCHTRPDITYAVSM-----VSRYMHDPRSGH 1075

Query: 662  CDSLCCSS*NSKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNS 721
             D++       +  +    K + F K     + G        C D RRS + YC F+G +
Sbjct: 1076 MDAVYRI---LRYLKGSPGKGLWFKKNGHLEVEGYCDADWASCPDDRRSTSGYCVFVGGN 1132

Query: 722  LICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYCDNQSALH 781
            L+ WRSKKQ  +S+S++EA YRA++ +  EL  L  LL +L +    P  ++CDN+SA+ 
Sbjct: 1133 LVSWRSKKQPVVSRSTAEAEYRAMSVSLSELLWLRNLLSELMLPVDTPMKLWCDNKSAIS 1192

Query: 782  IAANPVFHERTKHLEIE 798
            IA NPV H+RTKH+E++
Sbjct: 1193 IANNPVQHDRTKHVELD 1209


>UniRef100_Q9C692 Polyprotein, putative [Arabidopsis thaliana]
          Length = 1468

 Score =  335 bits (859), Expect = 3e-90
 Identities = 173/390 (44%), Positives = 251/390 (64%), Gaps = 8/390 (2%)

Query: 229  LLPETSVPASSSN---SLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRH 285
            ++PE +  +SS +   SL  L   + S+   +  +PL S     TT      RR +R   
Sbjct: 849  IIPEINQESSSPSEFVSLSSLDPFLASSTVQTADLPLSS-----TTPAPIQLRRSSRQTQ 903

Query: 286  QPSHLRNYVLHTVSSSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYA 345
            +P  L+N+V +TVS    + + SS   YPI  Y+     +  H A+  +++   EP TY 
Sbjct: 904  KPMKLKNFVTNTVSVESISPEASSSSLYPIEKYVDCHRFTSSHKAFLAAVTAGMEPTTYN 963

Query: 346  EASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARL 405
            EA   K W +AM+ EI +L  N T+S+V LPP    + NKWVYKIK R++G +ERYKARL
Sbjct: 964  EAMVDKAWREAMSAEIESLRVNQTFSIVNLPPGKRALGNKWVYKIKYRSDGAIERYKARL 1023

Query: 406  VA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYM 465
            V  G  Q EG+ Y +TF+P AK++ VR+ L +A+  +WH+HQ+DV+NAFL GDL E+VYM
Sbjct: 1024 VVLGNCQKEGVDYDETFAPVAKMSTVRLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYM 1083

Query: 466  KVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQ 525
            K+P+G  C D  KVC+LHKS YGLKQA R W++K +S L   G+ Q+ SD+SLFS     
Sbjct: 1084 KLPQGFQCDDPSKVCRLHKSLYGLKQAPRCWFSKLSSALKQYGFTQSLSDYSLFSYNNDG 1143

Query: 526  SFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQ 585
             F  +L+YVDD+I++G+  D   + K+ L++ F +KDLG+LKYFLG+EVS +A+G  L Q
Sbjct: 1144 IFVHVLVYVDDLIISGSCPDAVAQFKSYLESCFHMKDLGLLKYFLGIEVSRNAQGFYLSQ 1203

Query: 586  RKYCLDLVHDSGVLGSKPVSTPLDPSSRLS 615
            RKY LD++ + G+LG++P + PL+ + +LS
Sbjct: 1204 RKYVLDIISEMGLLGARPSAFPLEQNHKLS 1233



 Score = 92.4 bits (228), Expect = 5e-17
 Identities = 46/96 (47%), Positives = 68/96 (69%)

Query: 704  CVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQ 763
            C  TRRS+T Y   +G++ I W++KKQ T+S+SS+EA YRA+A  T EL  L  +L DL 
Sbjct: 1321 CPLTRRSLTGYFVQLGDTPISWKTKKQPTVSRSSAEAEYRAMAFLTQELMWLKRVLYDLG 1380

Query: 764  IEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
            +  V+   I+ D++SA+ ++ NPV HERTKH+E++C
Sbjct: 1381 VSHVQAMRIFSDSKSAIALSVNPVQHERTKHVEVDC 1416


>UniRef100_Q9MAJ8 F27F5.19 [Arabidopsis thaliana]
          Length = 1309

 Score =  305 bits (782), Expect = 3e-81
 Identities = 168/396 (42%), Positives = 239/396 (59%), Gaps = 23/396 (5%)

Query: 229  LLPETSVPASSSNSLDDLSVSIPSADATSVQVPLPSPESPSTTSVSDHTRRPTRPRHQPS 288
            L P    PA      DDL +     + TS+    P  +  S+ ++     +  R +  P 
Sbjct: 776  LFPLLQFPAKP----DDLPL-----EQTSLSDAHPHQDVSSSKALVPFDPQSKRQKKPPK 826

Query: 289  HLRNYVLHTVSSSCKASQTSSGIKYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEAS 348
            H +++  +  +S+         I YPI +Y+SYS +  P HA+  +++    P  Y+EA 
Sbjct: 827  HFQDFHCYNNTST---------ILYPIKDYISYSYIVEPFHAFINNITNAVVPQRYSEAK 877

Query: 349  KHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA* 408
              K W DAM  EI A+    TWS+V LPPN   I  KWV+ IK  A+G++ERYKARLVA 
Sbjct: 878  DFKAWCDAMKEEIGAMIQTNTWSVVSLPPNKKAIGCKWVFTIKHNADGSIERYKARLVAK 937

Query: 409  GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVP 468
            GY Q E + Y +TFSP AKLT VRM+L LA+   W + QLD++NAFL GDL E++YMK+P
Sbjct: 938  GYTQEESLDYEETFSPVAKLTSVRMMLLLAAKMKWSVLQLDISNAFLNGDLDEEIYMKIP 997

Query: 469  EGVS-----CVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQ 523
             G +      +    VC+LHKS YGLKQASRQWY K ++ L   G++++++DH+LF K  
Sbjct: 998  PGYADLIGESLPPHAVCRLHKSIYGLKQASRQWYLKLSNTLKGMGFQKSNADHTLFIKFA 1057

Query: 524  GQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEVSHSAKGISL 583
                  +L+YVDDI++  N  +  T+    L + FK++DL   KYF G+E++ SAKGIS+
Sbjct: 1058 SGVLMGVLVYVDDIMIVSNSDNAVTQFTTELKSYFKLRDLSAAKYFFGIEIARSAKGISI 1117

Query: 584  CQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGG 619
            CQRKY L+L+  +G LGSKP S PLD S +L+++ G
Sbjct: 1118 CQRKYILELLSTTGFLGSKPSSIPLDTSVKLNKEDG 1153


>UniRef100_Q9ZPG3 F5K24.2 protein [Arabidopsis thaliana]
          Length = 1366

 Score =  304 bits (779), Expect = 6e-81
 Identities = 208/606 (34%), Positives = 317/606 (51%), Gaps = 76/606 (12%)

Query: 203  LIISQKQCHGLFS-HLRMRILRALVLILLPETSVPASS-SNSLDDLSVSIPSADATSVQV 260
            L IS+KQ    F  +    +L   V  ++  T+VPA + + SL  LS ++ +    +   
Sbjct: 766  LPISEKQKENRFQIYDYFNVLNLEVCPVIEPTTVPAHTHTRSLAPLSTTVTNDQFGNDM- 824

Query: 261  PLPSPESPSTTSVSDHTRRPTRPRHQPSHLRNYVLHTVSSSCKASQTSSGIKYPISNYMS 320
                          D+T  P +    PS+L  Y  H  +   + S +  G  + +S+++S
Sbjct: 825  --------------DNTLMPRKETRAPSYLSQY--HCSNVLKEPSSSLHGTAHSLSSHLS 868

Query: 321  YSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVV 380
            Y  LS  +  +  ++  + EP T+ EA+  + W+DAMN+E+ AL +  T  +  L     
Sbjct: 869  YDKLSNEYRLFCFAIIAEKEPTTFKEAALLQKWLDAMNVELDALVSTSTREICSLHDGKR 928

Query: 381  PIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASV 440
             I  KWV+KIK +++GT+ERYKARLVA GY Q EG+ Y DTFSP AKLT VR++LALA++
Sbjct: 929  AIGCKWVFKIKYKSDGTIERYKARLVANGYTQQEGVDYIDTFSPIAKLTSVRLILALAAI 988

Query: 441  NNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVS-----CVDSGKVCKLHKSSYGLKQASRQ 495
            +NW + Q+DV NAFL GD  E++YM++P+G +      +    VC+L KS YGLKQASRQ
Sbjct: 989  HNWSISQMDVTNAFLHGDFEEEIYMQLPQGYTPRKGELLPKRPVCRLVKSLYGLKQASRQ 1048

Query: 496  WYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALD 555
            W+ KF+ +L+  G+ Q+  D +LF + +  +F  LL+YVDDI+L  N       +K  L 
Sbjct: 1049 WFHKFSGVLIQNGFMQSLFDPTLFVRVREDTFLALLVYVDDIMLVSNKDSAVIEVKQILA 1108

Query: 556  NAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLS 615
              FK+KDLG  +YFLGLE++ S +GIS+ QRKY L+L+ + G LG KPV TP++ + +LS
Sbjct: 1109 KEFKLKDLGQKRYFLGLEIARSKEGISISQRKYALELLEEFGFLGCKPVPTPMELNLKLS 1168

Query: 616  QDGGGATL*GCFFIQKTHRKTVLSYYNQV*YNLCSSAAK-----SVPLQSYCDSLCCSS* 670
            Q+ G   L    + +   R   L Y      ++C +  K     S P + +   L  +  
Sbjct: 1169 QEDGALLLDASHYRKLIGR---LVYLTVTRPDICFAVNKLNQYMSAPREPH---LMAARR 1222

Query: 671  NSKVSERESKKRVIFPKEFCSTIVGV**CRLGGCVDTRRSVTSYCFFIGNSLICWRSKKQ 730
              +  + +  + V +P     T           C ++  S+         S++ W     
Sbjct: 1223 ILRYLKNDPGQGVFYPASSTLTFRAFADADWSNCPESSISI---------SIVFW----- 1268

Query: 731  QTISKSSSEAVYRALASATCEL*RLTYLLKDLQIEPVKPSVIYC--DNQSALHIAANPVF 788
                K S+EA                +L+  L      P  I+   D++SALHIA N VF
Sbjct: 1269 ---LKLSTEA----------------WLVLSL------PDTIFVYYDDESALHIAKNSVF 1303

Query: 789  HERTKH 794
            HE TK+
Sbjct: 1304 HESTKN 1309


>UniRef100_Q9SJ99 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1156

 Score =  303 bits (775), Expect = 2e-80
 Identities = 172/405 (42%), Positives = 238/405 (58%), Gaps = 19/405 (4%)

Query: 232 ETSVPASSSNSLDDLSVSIPSADATSV----QVPLPSPESPSTTSVSDHTRRPTRPRHQP 287
           ++S P  +  S D L+    S D  S       PL   +SP   + S   R+  R   Q 
Sbjct: 515 DSSTPDKNLASGDTLAQIDDSPDIVSTPNRNNQPLFVVDSPFVEATSPRQRK--RQIRQS 572

Query: 288 SHLRNYVLHTVSSSC--------KASQTSSGIK-----YPISNYMSYSNLSIPHHAYAMS 334
             L++YVL+  + S          +SQ+SS ++     YP+S+Y+S    S  H A+  +
Sbjct: 573 VRLQDYVLYNATVSPINPHALPDSSSQSSSMVQGTSSLYPLSDYVSDDCFSAGHKAFLAA 632

Query: 335 LSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWSLVPLPPNVVPIDNKWVYKIKRRA 394
           ++ + EP  + EA + K W DAM  E+ ALE N TW +V LPP  V I ++WVYK K  A
Sbjct: 633 ITANDEPKHFKEAVRIKVWNDAMFKEVDALEINKTWDIVDLPPGKVAIGSQWVYKTKYNA 692

Query: 395 NGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIVRMVLALASVNNWHLHQLDVNNAF 454
           +G++ERYKARLV  G  Q+EG  Y +TF+P  K+T VR +L L + N W ++Q+DVNNAF
Sbjct: 693 DGSIERYKARLVVQGNKQVEGEDYNETFAPVVKMTTVRTLLRLVAANQWEVYQMDVNNAF 752

Query: 455 LLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQASRQWYAKFTSLLVTCGYKQAHS 514
           L GDL E+VYMK+P G       KVC+L KS YGLKQA R W+ K +  L+  G+ Q H 
Sbjct: 753 LHGDLDEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAPRCWFKKLSDALLRFGFVQGHE 812

Query: 515 DHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIKAALDNAFKIKDLGVLKYFLGLEV 574
           D+S FS T+      +L+YVDD+++ GN      + K  L   F +KDLG LKYFLG+EV
Sbjct: 813 DYSFFSYTRNGIELRVLVYVDDLLICGNDGYMLQKFKEYLGRCFSMKDLGKLKYFLGIEV 872

Query: 575 SHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPSSRLSQDGG 619
           S  ++GI L QRKY LD++ DSG LG +P  TPL+ +  L+ D G
Sbjct: 873 SRGSEGIFLSQRKYALDIITDSGNLGCRPALTPLEQNHHLATDDG 917



 Score = 75.9 bits (185), Expect = 4e-12
 Identities = 42/96 (43%), Positives = 56/96 (57%), Gaps = 21/96 (21%)

Query: 704  CVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQ 763
            C  TRRS+++Y   +G S I W++KKQ T+S SS+EA YRA++ A  E+  L  LLK+L 
Sbjct: 1001 CPKTRRSLSAYVVLLGGSPISWKTKKQDTVSHSSAEAEYRAMSVALREIKWLRKLLKEL- 1059

Query: 764  IEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
                                ANPVFHERTKH+E +C
Sbjct: 1060 --------------------ANPVFHERTKHIESDC 1075


>UniRef100_O65452 LTR retrotransposon like protein [Arabidopsis thaliana]
          Length = 1109

 Score =  301 bits (771), Expect = 5e-80
 Identities = 148/304 (48%), Positives = 212/304 (69%), Gaps = 1/304 (0%)

Query: 312 KYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWS 371
           ++P S  MS +  +  H A+  +++   EP TY EA   K W +AM+ EI +L  N T+S
Sbjct: 572 EFPYSK-MSCNRFTSSHKAFLAAVTAGMEPTTYNEAMVDKAWREAMSAEIESLRVNQTFS 630

Query: 372 LVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIV 431
           +V LPP    + NKWVYKIK R++G +ERYKARLV  G  Q EG+ Y +TF+P AK++ V
Sbjct: 631 IVNLPPGKRALGNKWVYKIKYRSDGAIERYKARLVVLGNCQKEGVDYDETFAPVAKMSTV 690

Query: 432 RMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQ 491
           R+ L +A+  +WH+HQ+DV+NAFL GDL E+VYMK+P+G  C D  KVC+LHKS YGLKQ
Sbjct: 691 RLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYMKLPQGFQCDDPSKVCRLHKSLYGLKQ 750

Query: 492 ASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIK 551
           A R W++K +S L   G+ Q+ SD+SLFS      F  +L+YVDD+I++G+  D   + K
Sbjct: 751 APRCWFSKLSSALKQYGFTQSLSDYSLFSYNNDGVFVHVLVYVDDLIISGSCPDAVAQFK 810

Query: 552 AALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPS 611
           + L++ F +KDLG+LKYFLG+EVS +A+G  L QRKY LD++ + G+LG++P + PL+ +
Sbjct: 811 SYLESCFHMKDLGLLKYFLGIEVSRNAQGFYLSQRKYVLDIISEMGLLGARPSAFPLEQN 870

Query: 612 SRLS 615
            +LS
Sbjct: 871 HKLS 874



 Score = 92.8 bits (229), Expect = 4e-17
 Identities = 47/96 (48%), Positives = 68/96 (69%)

Query: 704  CVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQ 763
            C  TRRS+T Y   +G++ I W++KKQ TIS+SS+EA YRA+A  T EL  L  +L DL 
Sbjct: 962  CPLTRRSLTGYFVQLGDTPISWKTKKQPTISRSSAEAEYRAMAFLTQELMWLKRVLYDLG 1021

Query: 764  IEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
            +  V+   I+ D++SA+ ++ NPV HERTKH+E++C
Sbjct: 1022 VSHVQAMRIFSDSKSAIALSVNPVQHERTKHVEVDC 1057


>UniRef100_Q9FL75 Retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1109

 Score =  301 bits (771), Expect = 5e-80
 Identities = 148/304 (48%), Positives = 212/304 (69%), Gaps = 1/304 (0%)

Query: 312 KYPISNYMSYSNLSIPHHAYAMSLSLDSEPHTYAEASKHKCWVDAMNLEISALEANGTWS 371
           ++P S  MS +  +  H A+  +++   EP TY EA   K W +AM+ EI +L  N T+S
Sbjct: 572 EFPYSK-MSCNRFTSSHKAFLAAVTAGMEPTTYNEAMVDKAWREAMSAEIESLRVNQTFS 630

Query: 372 LVPLPPNVVPIDNKWVYKIKRRANGTVERYKARLVA*GYNQIEGIYYFDTFSPTAKLTIV 431
           +V LPP    + NKWVYKIK R++G +ERYKARLV  G  Q EG+ Y +TF+P AK++ V
Sbjct: 631 IVNLPPGKRALGNKWVYKIKYRSDGAIERYKARLVVLGNCQKEGVDYDETFAPVAKMSTV 690

Query: 432 RMVLALASVNNWHLHQLDVNNAFLLGDLYEDVYMKVPEGVSCVDSGKVCKLHKSSYGLKQ 491
           R+ L +A+  +WH+HQ+DV+NAFL GDL E+VYMK+P+G  C D  KVC+LHKS YGLKQ
Sbjct: 691 RLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYMKLPQGFQCDDPSKVCRLHKSLYGLKQ 750

Query: 492 ASRQWYAKFTSLLVTCGYKQAHSDHSLFSKTQGQSFTILLIYVDDIILAGNFLDEFTRIK 551
           A R W++K +S L   G+ Q+ SD+SLFS      F  +L+YVDD+I++G+  D   + K
Sbjct: 751 APRCWFSKLSSALKQYGFTQSLSDYSLFSYNNDGVFVHVLVYVDDLIISGSCPDAVAQFK 810

Query: 552 AALDNAFKIKDLGVLKYFLGLEVSHSAKGISLCQRKYCLDLVHDSGVLGSKPVSTPLDPS 611
           + L++ F +KDLG+LKYFLG+EVS +A+G  L QRKY LD++ + G+LG++P + PL+ +
Sbjct: 811 SYLESCFHMKDLGLLKYFLGIEVSRNAQGFYLSQRKYVLDIISEMGLLGARPSAFPLEQN 870

Query: 612 SRLS 615
            +LS
Sbjct: 871 HKLS 874



 Score = 92.4 bits (228), Expect = 5e-17
 Identities = 46/96 (47%), Positives = 68/96 (69%)

Query: 704  CVDTRRSVTSYCFFIGNSLICWRSKKQQTISKSSSEAVYRALASATCEL*RLTYLLKDLQ 763
            C  TRRS+T Y   +G++ I W++KKQ T+S+SS+EA YRA+A  T EL  L  +L DL 
Sbjct: 962  CPLTRRSLTGYFVQLGDTPISWKTKKQPTVSRSSAEAEYRAMAFLTQELMWLKRVLYDLG 1021

Query: 764  IEPVKPSVIYCDNQSALHIAANPVFHERTKHLEIEC 799
            +  V+   I+ D++SA+ ++ NPV HERTKH+E++C
Sbjct: 1022 VSHVQAMRIFSDSKSAIALSVNPVQHERTKHVEVDC 1057


  Database: uniref100
    Posted date:  Jan 5, 2005  1:24 AM
  Number of letters in database: 848,049,833
  Number of sequences in database:  2,790,947
  
Lambda     K      H
   0.338    0.145    0.471 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,189,591,307
Number of Sequences: 2790947
Number of extensions: 46368661
Number of successful extensions: 208913
Number of sequences better than 10.0: 1462
Number of HSP's better than 10.0 without gapping: 1317
Number of HSP's successfully gapped in prelim test: 147
Number of HSP's that attempted gapping in prelim test: 204397
Number of HSP's gapped (non-prelim): 2601
length of query: 799
length of database: 848,049,833
effective HSP length: 136
effective length of query: 663
effective length of database: 468,481,041
effective search space: 310602930183
effective search space used: 310602930183
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 79 (35.0 bits)

