Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC149491.1 + phase: 0 /pseudo
         (1441 letters)

Database: uniref100 
           2,790,947 sequences; 848,049,833 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef100_Q9SIM3 Putative retroelement pol polyprotein [Arabidop...   259  4e-67
UniRef100_O23741 SLG-Sc and SLA-Sc genes and Melmoth retrotransp...   255  6e-66
UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidop...   248  7e-64
UniRef100_Q9FJV3 Retroelement pol polyprotein-like [Arabidopsis ...   234  1e-59
UniRef100_O82607 T2L5.9 protein [Arabidopsis thaliana]                231  1e-58
UniRef100_Q5XWR5 Putative retroelement pol polyprotein-like [Sol...   231  2e-58
UniRef100_O23588 Retrotransposon like protein [Arabidopsis thali...   219  5e-55
UniRef100_Q9C8F4 Ty1/copia-element polyprotein [Arabidopsis thal...   214  2e-53
UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana]              213  3e-53
UniRef100_Q9FX79 Putative retroelement polyprotein [Arabidopsis ...   212  6e-53
UniRef100_O81617 F8M12.17 protein [Arabidopsis thaliana]              212  6e-53
UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidop...   208  9e-52
UniRef100_Q9LVQ2 Retroelement pol polyprotein-like [Arabidopsis ...   206  4e-51
UniRef100_Q9ZVW0 Putative retroelement pol polyprotein [Arabidop...   205  9e-51
UniRef100_Q9MAJ8 F27F5.19 [Arabidopsis thaliana]                      199  7e-49
UniRef100_Q9LGZ8 Retroelement pol polyprotein-like [Arabidopsis ...   198  1e-48
UniRef100_O22175 Putative retroelement pol polyprotein [Arabidop...   193  4e-47
UniRef100_Q9FXB7 Putative retroelement polyprotein [Arabidopsis ...   182  9e-44
UniRef100_Q9XIM3 Putative retroelement pol polyprotein [Arabidop...   174  1e-41
UniRef100_Q9ZQN0 Putative retroelement pol polyprotein [Arabidop...   171  2e-40

>UniRef100_Q9SIM3 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1461

 Score =  259 bits (662), Expect = 4e-67
 Identities = 183/610 (30%), Positives = 294/610 (48%), Gaps = 72/610 (11%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S +++H ++ P    ++  L    Y  W+ +M+ +L  KNK  F+DGS+P P   D N  
Sbjct: 64  SPFFLHSADHPGLSIISHRLDETTYGDWSVAMRISLDAKNKLGFVDGSLPRPLESDPNFR 123

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
            W RCN+++ SW++NSVSPQI ++I+    A D+W +L +RF+  +  R  +L   I +L
Sbjct: 124 LWSRCNSMVKSWLLNSVSPQIYRSILRLNDATDIWRDLFDRFNLTNLPRTYNLTQEIQDL 183

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           +QG  S+ +Y+T +K+LW++L+S   +       PC C           + ++++FL GL
Sbjct: 184 RQGTMSLSEYYTLLKTLWDQLDSTEAL-----DDPCTCGKAVRLYQKAEKAKIMKFLAGL 238

Query: 181 NDSFSVVKTQVLLIDPLPSINKVYSMVIQEESN------IIPPTSLASNEDSSILVNASD 234
           N+S+++V+ Q++    LPS+ +VY ++ Q+ S       + PP +   +E     V+ S 
Sbjct: 239 NESYAIVRRQIIAKKALPSLAEVYHILDQDNSQKGFFNVVAPPAAFQVSE-----VSHSP 293

Query: 235 ARKPFLRGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPTA-SSNAVTSEH 293
              P +    SG ++ +     C+FC R  H  E CY KH FP    P   SS+      
Sbjct: 294 ITSPEIMYVQSGPNKGRPT---CSFCNRVGHIAERCYKKHGFPPGFTPKGKSSDKPPKPQ 350

Query: 294 AVDSHTSSEGTSSSSQT-----GLTQEQYVHLVSLLQQSSLVPSATPPNPASTNHVATS- 347
           AV +  +      + Q        + +Q  +L++L   S L P    P  AS+ H A+S 
Sbjct: 351 AVAAQVTLSPDKMTGQLETLAGNFSPDQIQNLIALF-SSQLQPQIVSPQTASSQHEASSS 409

Query: 348 ---FPSSIDFTS------GINTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYYRIKP 398
               PS I F+       GI  +   SL   SD W+IDSGA  H+     LF +      
Sbjct: 410 QSVAPSGILFSPSTYCFIGILAVSHNSL--SSDTWVIDSGATHHVSHDRKLFQTLDTSIV 467

Query: 399 ICVNLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNT 458
             VNLP G +V +   GTV+ +    + +VL+ P F++ LIS+S +   L   V F  + 
Sbjct: 468 SFVNLPTGPNVRISGVGTVLINKDIILQNVLFIPEFRLNLISISSLTTDLGTRVIFDPSC 527

Query: 459 YVIQDVKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSS 518
             IQD+     +G G     LY L   +PA                          +V++
Sbjct: 528 CQIQDLTKGLTLGEGKRIGNLYVLDTQSPAI-------------------------SVNA 562

Query: 519 IPSNAIWHFRLGHLSNQRLSMMHSLYSSITIDNK--AVCDICHFAKQRKLPY-------N 569
           +   ++WH RLGH S  RL  +  +  +    NK  A C +CH AKQ+KL +       N
Sbjct: 563 VVDVSVWHKRLGHPSFSRLDSLSEVLGTTRHKNKKSAYCHVCHLAKQKKLSFPSANNICN 622

Query: 570 LSTLLLHLNL 579
            +  LLH+++
Sbjct: 623 STFELLHIDV 632


>UniRef100_O23741 SLG-Sc and SLA-Sc genes and Melmoth retrotransposon sequence
           [Brassica oleracea]
          Length = 1131

 Score =  255 bits (652), Expect = 6e-66
 Identities = 180/588 (30%), Positives = 285/588 (47%), Gaps = 61/588 (10%)

Query: 4   YVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRTAWE 63
           ++H ++ P    ++  L G NY  WN +MK AL  KNK  F+DG++  P   D     W 
Sbjct: 16  FMHNADHPGLQLISLKLDGSNYDDWNAAMKIALDAKNKIGFVDGTLTRPDTSDPTFRLWS 75

Query: 64  RCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNLKQG 123
           RCN+++ SW++NSVSPQI ++I+    A D+W +L  RF   +  R  +L   I +LKQG
Sbjct: 76  RCNSMVKSWLLNSVSPQIYRSILRLNDAADIWRDLHGRFHMTNLPRTFNLTQEIQDLKQG 135

Query: 124 DKSVLDYFTSIKSLWEELNS-HRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGLND 182
             S+ DY+T++K+LW+ L S   P   C C      E ++   D     ++++FL GLND
Sbjct: 136 SMSLSDYYTTLKTLWDNLESVDEPDTPCVCG---NAEKLQKKVD---RAKIVKFLAGLND 189

Query: 183 SFSVVKTQVLLIDPLPSINKVYSMVIQEESN---IIPPTSLASNEDSSILVNASDARKPF 239
           S+++++ Q+++   LPS+ +VY+++ Q++S        T  A N   ++    ++A   +
Sbjct: 190 SYAIIRRQIIMKKVLPSLVEVYNILDQDDSQKGFSTAITPAAFNVSENVPPPMAEAGICY 249

Query: 240 LRGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFP----NANKPTASSNAVTSEHAV 295
           ++     T  +K     C+FC R  H  E CY KH FP    +  K  +S + +     V
Sbjct: 250 VQ-----TGPNKGRP-ICSFCNRVGHIAERCYKKHGFPPGFVSKYKSQSSGDRLQKPKQV 303

Query: 296 DSHTSSEGTSSSSQTGLTQEQYV--HLVSLLQQSSLVPSATPPNPASTNHVATSFPSSID 353
            +  S     +S Q+ +T +  V  H    LQQ   + S+  PN    ++ A+S    +D
Sbjct: 304 AAQVSF-SPPNSGQSPMTMDHLVGNHSKEQLQQFIALFSSQLPNVTMGSNEASSSKQPMD 362

Query: 354 FTSGIN----TIFSCSLHVPSDH------WLIDSGANEHICSSLHLFHSYYRIKPICVNL 403
             SGI+    T+    L   S H      W+IDSGA  H+C    ++ S        VNL
Sbjct: 363 -NSGISFNPTTLVFIGLLTVSRHTLANETWIIDSGATHHVCHDRSMYTSIDITTTSNVNL 421

Query: 404 PNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVIQD 463
           PNG  V +   G V  + H  + +VLY P F++ L+S+S +   +   V F +++  IQD
Sbjct: 422 PNGMIVKISGVGIVQLNEHITLHNVLYIPEFRLNLLSISSLTSDIGSQVIFDVSSCAIQD 481

Query: 464 VKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSIPSNA 523
                 IG G     LY L                         +  SS   ++++   +
Sbjct: 482 PTKGWTIGQGRRVANLYVL-------------------------DVKSSPMKINAVVDIS 516

Query: 524 IWHFRLGHLSNQRLSMMHSLYSSITIDNK--AVCDICHFAKQRKLPYN 569
           +WH RLGH S  RL  +     +    NK  A C +CH AKQ+KL Y+
Sbjct: 517 LWHKRLGHPSYTRLDKISEALGTTKHKNKGDAHCHVCHLAKQKKLSYS 564


>UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1454

 Score =  248 bits (634), Expect = 7e-64
 Identities = 175/588 (29%), Positives = 282/588 (47%), Gaps = 58/588 (9%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S +++H ++ P    ++  L   NY  W+ +M  +L  KNK  FIDG++  P   DLN  
Sbjct: 60  SPFFLHSADHPGLNIISHRLDETNYGDWSVAMLISLDAKNKTGFIDGTLSRPLESDLNFR 119

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
            W RCN+++ SW++NSVSPQI ++I+    A D+W +L  RF+  +  R  +L   I + 
Sbjct: 120 LWSRCNSMVKSWLLNSVSPQIYRSILRMNDASDIWRDLNSRFNVTNLPRTYNLTQEIQDF 179

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           +QG  S+ +Y+T +K+LW++L+S   +       PC C      +    + ++++FL GL
Sbjct: 180 RQGTLSLSEYYTRLKTLWDQLDSTEAL-----DEPCTCGKAMRLQQKAEQAKIVKFLAGL 234

Query: 181 NDSFSVVKTQVLLIDPLPSINKVYSMVIQEESN------IIPPTSLASNEDSSILVNASD 234
           N+S+++V+ Q++    LPS+ +VY ++ Q+ S       + PP +   +E     +  S 
Sbjct: 235 NESYAIVRRQIIAKKALPSLGEVYHILDQDNSQQSFSNVVAPPAAFQVSE-----ITQSP 289

Query: 235 ARKPFLRGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPTASSNAVTSE-- 292
           +  P +    +G ++ +     C+F  R  H  E CY KH FP    P   +     +  
Sbjct: 290 SMDPTVCYVQNGPNKGR---PICSFYNRVGHIAERCYKKHGFPPGFTPKGKAGEKLQKPK 346

Query: 293 --HAVDSHTSSEGTSSSSQTG-LTQEQYVHLVSL----LQQSSLVPSATPPNPASTNHVA 345
              A  + +S   TS  S  G L++EQ    +++    LQ +     AT     S N   
Sbjct: 347 PLAANVAESSEVNTSLESMVGNLSKEQLQQFIAMFSSQLQNTPPSTYATASTSQSDNLGI 406

Query: 346 TSFPSSIDFTSGINTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYYRIKPICVNLPN 405
              PS+  F  GI T+   +L   S  W+IDSGA  H+     LF S        VNLP 
Sbjct: 407 CFSPSTYSFI-GILTVARHTL--SSATWVIDSGATHHVSHDRSLFSSLDTSVLSAVNLPT 463

Query: 406 GSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVIQDVK 465
           G +V +   GT+  +    + +VL+ P F++ LIS+S +   +   V F  N+  IQD+ 
Sbjct: 464 GPTVKISGVGTLKLNDDILLKNVLFIPEFRLNLISISSLTDDIGSRVIFDKNSCEIQDLI 523

Query: 466 TQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSIPSNAIW 525
             +M+G G     LY L                      +   S+S    V+++   ++W
Sbjct: 524 KGRMLGQGRRVANLYLL---------------------DVGDQSIS----VNAVVDISMW 558

Query: 526 HFRLGHLSNQRLSMMHSLYSSITIDNKA--VCDICHFAKQRKLPYNLS 571
           H RLGH S QRL  +     +    NK    C +CH AKQRKL +  S
Sbjct: 559 HRRLGHASLQRLDAISDSLGTTRHKNKGSDFCHVCHLAKQRKLSFPTS 606


>UniRef100_Q9FJV3 Retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1475

 Score =  234 bits (598), Expect = 1e-59
 Identities = 170/593 (28%), Positives = 288/593 (47%), Gaps = 64/593 (10%)

Query: 9   EGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRTAWERCNNL 68
           + P +  V+ +L G N+ +W  +M  +L  KNK  F+DG++P PP  D +   W RCN++
Sbjct: 70  DSPGNTLVSEVLDGTNFSSWKIAMFVSLYAKNKIAFVDGTLPRPPESDPSFRVWSRCNSM 129

Query: 69  ILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNLKQGDKSVL 128
           + SWI+NSV+ QI ++I+    A ++W +L  RF   +  R   L   I +L+QG  S+ 
Sbjct: 130 VKSWILNSVTKQIYKSILRFNDAAEIWKDLDTRFHITNLPRSYQLTQQIWSLQQGTMSLS 189

Query: 129 DYFTSIKSLWEELNSHRPMPMCTCPYPCR-CESMRAARDFRMEDQVIQFLTGLNDSFSVV 187
           DY+T++K+LW++L+        +C   C+ C    A        ++++FL+GLN+S+S +
Sbjct: 190 DYYTALKTLWDDLDG------ASCVSTCKNCTCCIATASMIEHSKIVKFLSGLNESYSTI 243

Query: 188 KTQVLLIDPLPSINKVYSMVIQEES--NIIPPTSLASNEDSSILVNASDARKPFLRGKSS 245
           ++Q+++   +P + ++Y+++ Q+ S  NI+   + AS    ++    SD     L  KS 
Sbjct: 244 RSQIIMKKTIPDLAEIYNLLDQDHSQRNIVTMPTNAST--FNVSAPQSDQFAVNL-AKSF 300

Query: 246 GTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPTASSNAVTSE---HAVDSHTSSE 302
           GT         C+ C    H  + CY  H +P   K         SE     V +   ++
Sbjct: 301 GTQPKPKVQ--CSHCGYTGHNADTCYKIHGYPVGFKHKDKKTVTPSEKPKSVVANLALTD 358

Query: 303 GTSSSSQTGLTQEQYVHLVSLLQQSSLVP-----SATPPNP---------ASTNHVATSF 348
           G  S +Q G+  +  V LV  + +S +       S    NP         ASTN+   S 
Sbjct: 359 GKVSVTQ-GIGPDGIVELVGSMSKSQIQDVIAYFSTQLHNPAKPITVASFASTNNDNGST 417

Query: 349 PSSIDFTSGINTIFSCSLH-----VPSDHWLIDSGANEHICSSLHLFHSYYRIKPICVNL 403
            + I F+     +  CSL      +  + W+IDSGA  H+    +LF S        V L
Sbjct: 418 FTGISFSPSTLRLL-CSLTSSKKVLSLNTWIIDSGATHHVSYDRNLFESLSDGLSNEVTL 476

Query: 404 PNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVIQD 463
           P GS+V +   G +  + +  + +VLY P F++ L+SVS+  + +   ++F  +  VIQD
Sbjct: 477 PTGSNVKIAGIGVIKLNSNLTLKNVLYIPEFRLNLLSVSQQTKDMKCKIYFDEDCCVIQD 536

Query: 464 VKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNV------S 517
              ++ IG GN   GLY                  V  T+ + C SV  N++V      +
Sbjct: 537 PIKEQKIGRGNQIGGLY------------------VLDTSSVECTSVDINSSVTEKQYCN 578

Query: 518 SIPSNAIWHFRLGHLSNQRLSMMHSLYS--SITIDNKAVCDICHFAKQRKLPY 568
           ++  +A+WH RLGH S ++  ++H +        ++   C IC  AKQ+ L +
Sbjct: 579 AVVDSALWHSRLGHPSYEKNDVLHDVLGLPKRNKEDLVHCSICQKAKQKHLSF 631


>UniRef100_O82607 T2L5.9 protein [Arabidopsis thaliana]
          Length = 1244

 Score =  231 bits (589), Expect = 1e-58
 Identities = 164/565 (29%), Positives = 271/565 (47%), Gaps = 60/565 (10%)

Query: 31  SMKRALGTKNKFVFIDGSVPIPPMDDLNRTAWERCNNLILSWIINSVSPQIAQTIVFHEY 90
           +++ +L  KNK  F+DGS+P P   D N   W RCN+++ SW++NSVSPQI ++I+    
Sbjct: 75  ALRISLDAKNKIGFVDGSLPRPLESDGNFRLWSRCNSMVKSWLLNSVSPQIYRSILRMND 134

Query: 91  AIDVWIELQERFSKVDRIRVASLRSSINNLKQGDKSVLDYFTSIKSLWEELNS-HRPMPM 149
           A D+W ++  RF   +  R  +L   I + +QG  S+ +Y+T ++ LW+ L+S   P   
Sbjct: 135 ATDIWRDIYGRFHMTNLPRTYNLTQEIQDFRQGSLSLSEYYTQLRILWDLLDSTEEPDDP 194

Query: 150 CTCPYPCRCESMRAARDFRMEDQVIQFLTGLNDSFSVVKTQVLLIDPLPSINKVYSMVIQ 209
           CTC    R +  +A R      + ++FL GLN+S+S+V+ Q++    LPS+ +VY+++ Q
Sbjct: 195 CTCGKVLRLQ-QKAER-----AKTVKFLAGLNESYSIVRRQIIAKKALPSLVEVYNILDQ 248

Query: 210 EES------NIIPPTSLASNEDSSILVNASDARKPFLRGKSSGTSQSKNNSRYCTFCRRN 263
           + S      N+ PP +   ++ S      S A  P +    +G ++ +     C+FC + 
Sbjct: 249 DYSQKGFSTNVSPPAAFQVSKIS------STALTPKICYVQNGPNKGR---PICSFCNKV 299

Query: 264 NHTVEYCYLKHDFPNA------NKPTASSNAVTSEHAVDSHTSSEGTSSSSQTGLTQEQY 317
            H  E CY KH +P         K T           +     ++ T       L+ +Q 
Sbjct: 300 GHIAEKCYKKHGYPPGFKGKLPEKGTKPQPVAAQVSLLPPMVPTQATLDGLLGNLSNDQL 359

Query: 318 VHLVSLL-QQSSLVPSATPPNP----ASTNHVATSFPSSIDFTSGINTIFSCSLHVPSDH 372
            + ++L   Q    P+A+  +     +  ++   SF +S  +  GI  +   +L   ++ 
Sbjct: 360 QNFIALFSSQLKSQPTASSSDAGISRSPIDYTGISFSNSTYYFVGILNVSQHTL--STET 417

Query: 373 WLIDSGANEHICSSLHLFHSYYRIKPICVNLPNGSSVIVQYAGTVVFSPHFHITHVLYSP 432
           W+IDSGA  H+C    LF S        VNLP GS V +   G+V  + +  + +VL+ P
Sbjct: 418 WVIDSGATHHVCHDKSLFVSLDHSVVSYVNLPTGSRVKISGVGSVQINENILLRNVLFLP 477

Query: 433 SFKVYLISVSKICQSLPYHVHFLLNTYVIQDVKTQKMIGLGNLCDGLYRLHPFAPASPQA 492
            F++ LIS+S +   +   V F  +   IQD+     IG G     LY L    P     
Sbjct: 478 EFRLNLISISSLTSDIGSRVIFDPSCCEIQDLTKDLRIGRGRRIGNLYVLDTTPP----- 532

Query: 493 HFISSAVSPTNKMSCNSVSSNNNVSSIPSNAIWHFRLGHLSNQRLSMMHSLYSSITIDNK 552
                          +SVS    V+++   ++WH R+GH +  RL  +  +  +    NK
Sbjct: 533 --------------LDSVS----VNAVVDVSLWHMRMGHPAYSRLDAISDILRTTKHKNK 574

Query: 553 --AVCDICHFAKQRKLPYNLSTLLL 575
             A C ICH AKQ+KL +  S  +L
Sbjct: 575 GSAYCHICHLAKQKKLSFQSSNNIL 599


>UniRef100_Q5XWR5 Putative retroelement pol polyprotein-like [Solanum tuberosum]
          Length = 1476

 Score =  231 bits (588), Expect = 2e-58
 Identities = 171/604 (28%), Positives = 282/604 (46%), Gaps = 78/604 (12%)

Query: 20  LTG-PNYLAWNRSMKRALGTKNKFVFIDGSVPIPPM-DDLNRTAWERCNNLILSWIINSV 77
           LTG  NY  W+R+M+  L TKNK  FIDGS+      ++L +  W+RCN ++LSW++N+V
Sbjct: 21  LTGMENYSLWSRAMQLTLLTKNKMGFIDGSLRRDDFKEELEKKQWDRCNAMVLSWLMNNV 80

Query: 78  SPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNLKQGDKSVLDYFTSIKSL 137
           S  +   I+F   A  VW +L+ERF KV+  R+  L  +I    QG   V  Y++ +K L
Sbjct: 81  STDLVSGILFRSNATLVWNDLKERFDKVNMSRIFHLHKAIVTHVQGVSPVSVYYSKLKDL 140

Query: 138 WEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGLNDSFSVVKTQVLLIDPL 197
           W+E +S  P P C       CE      D  +  +++QFL GLND++   ++Q+L+++P 
Sbjct: 141 WDEYDSILPPPSCD------CEKSVDYTDSMLRQKLLQFLMGLNDNYGQARSQILMMNPS 194

Query: 198 PSINKVYSMVIQEES--------NIIPPTSLASNEDSSILVNASDARKPFLRGKSSGTSQ 249
           PS+N+ Y+M++Q+ES          I PT+L ++         S   +    G S+G S 
Sbjct: 195 PSVNQCYAMIVQDESQRSLSGSGQTIDPTALFTHRPGGSGF-GSQGSQGSGNGSSNGNSH 253

Query: 250 ---------------------SKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPTASSNA 288
                                + N  ++CT C    HT + CY    +P   K    +N 
Sbjct: 254 RFHKGGNIYCDFCNMKGHIRANCNKLKHCTHCNMQGHTKDTCYQLIGYPADYKGKKKANI 313

Query: 289 VTSEHAVDSHTSSEGTSSSSQTGLTQEQYVHLVSLLQ-------------QSSLVPSATP 335
           VT+        ++   + +     T +   H VS +Q               +  P + P
Sbjct: 314 VTAPSLPQMQHNNFNNNLNYPMQYTGDGIGHFVSPMQFTGNTNGHSSGSIAGNFGPGSVP 373

Query: 336 P-NPASTNHVATSF--PSSIDFTSGINTIFS----CSLHVPSDHWLIDSGANEHICSSLH 388
              P+  N++      P   + ++ +  IF+    C+ +  S  W++DSGA +H+ S+  
Sbjct: 374 QFTPSQYNNILQMLNKPMLSESSANVAGIFAGSSHCNSNTHSSAWIVDSGATDHMVSNTT 433

Query: 389 LF-HSYYRIKPICVNLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQS 447
           L  H      P  V LP G S +V ++G+   +    + +VL  P+F+  L+SVSK+ + 
Sbjct: 434 LLNHGLSVSHPGKVQLPTGDSAVVTHSGSSQLTGGDVVKNVLCVPTFQFNLLSVSKLTKE 493

Query: 448 LPYHVHFLLNTYVIQDVKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSC 507
           L   V F  + ++IQD+ T K+  +G   +GLY   P      Q H        T+K + 
Sbjct: 494 LNCCVIFFPDFFIIQDLFTGKVKEIGEEINGLYITRPH-----QHH-------DTSKKTL 541

Query: 508 NSVSSNNNVSSIPSNAIWHFRLGHLSNQRLSMMHSLYSSITIDNKAVCDICHFAKQRKLP 567
            ++             +WH RLGH+    L  +  ++ S        CD+C  A+Q +LP
Sbjct: 542 AAIKGCEEAE------MWHKRLGHIPMSVLRKI-KMFDSPQKLVLPSCDVCPLARQVRLP 594

Query: 568 YNLS 571
           + +S
Sbjct: 595 FPIS 598


>UniRef100_O23588 Retrotransposon like protein [Arabidopsis thaliana]
          Length = 1433

 Score =  219 bits (558), Expect = 5e-55
 Identities = 160/594 (26%), Positives = 279/594 (46%), Gaps = 82/594 (13%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S Y++H S+ P    V+ +L G NY  W+ +M+ +L  KNK  F+DGS+P P + D    
Sbjct: 66  SPYFLHSSDHPGLNIVSHILDGTNYNNWSIAMRMSLDAKNKLSFVDGSLPRPDVSDRMFK 125

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
            W RCN+++ +W++N V+              ++W +L  RF   +  R   L  SI+ L
Sbjct: 126 IWSRCNSMVKTWLLNVVT--------------EMWNDLFSRFRVSNLPRKYQLEQSIHTL 171

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           KQG+  +  Y+T  K+LWE+L + R + +      C CE ++   +     ++IQFL GL
Sbjct: 172 KQGNLDLSTYYTKKKTLWEQLANTRVLTV----RKCNCEHVKELLEEAETSRIIQFLMGL 227

Query: 181 NDSFSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPPTSLASNEDSSILVNASDARKPFL 240
           ND+F+ ++ Q+L + P P + ++Y+M+ Q+ES  +      SN  ++  V AS    P +
Sbjct: 228 NDNFAHIRGQILNMKPRPGLTEIYNMLDQDESQRLVGNPTLSNPTAAFQVQAS----PII 283

Query: 241 RGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPT-----ASSNAVTSEHAV 295
             + +  +Q       C++C +  H V+ CY KH +P  +K T      S+N  +++   
Sbjct: 284 DSQVN-MAQGSYKKPKCSYCNKLGHLVDKCYKKHGYPPGSKWTKGQTIGSTNLASTQLQP 342

Query: 296 DSHTSSEGTSSSSQTGLTQEQYVHLVSLLQQSSLVPSATPPNPASTNHVAT--SFPSSID 353
            + T +E T S  +   + +Q   ++S L     + SA+P    S+  ++   S P    
Sbjct: 343 VNETPNEKTDSYEE--FSTDQIQTMISYLSTKLHIASASPMPTTSSASISASPSVPMISQ 400

Query: 354 FTSGINTIFSCSLH------------VPSDHWLIDSGANEHICSSLHLFHSYYRIKPICV 401
            +    ++FS + +            V    W+IDSGA  H+  +  L+ ++  ++   V
Sbjct: 401 ISGTFLSLFSNAYYDMLISSVSQEPAVSPRGWVIDSGATHHVTHNRDLYLNFRSLENTFV 460

Query: 402 NLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVI 461
            LPN  +V +   G +  S    + +VLY P FK  LIS                     
Sbjct: 461 RLPNDCTVKIAGIGFIQLSDAISLHNVLYIPEFKFNLIS--------------------- 499

Query: 462 QDVKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSIPS 521
            ++  + MIG G+    LY L      +   H +S  +  T  M C   S  ++V  +  
Sbjct: 500 -ELTKELMIGRGSQVGNLYVL----DFNENNHTVS--LKGTTSM-CPEFSVCSSV--VVD 549

Query: 522 NAIWHFRLGHLSNQRLSMMHSLYS-SITIDNKA------VCDICHFAKQRKLPY 568
           +  WH RLGH +  ++ ++  + +  +   NK       VC +CH +KQ+ L +
Sbjct: 550 SVTWHKRLGHPAYSKIDLLSDVLNLKVKKINKEHSPVCHVCHVCHLSKQKHLSF 603


>UniRef100_Q9C8F4 Ty1/copia-element polyprotein [Arabidopsis thaliana]
          Length = 1152

 Score =  214 bits (544), Expect = 2e-53
 Identities = 164/600 (27%), Positives = 277/600 (45%), Gaps = 88/600 (14%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S YY+HPS+ P+ V    LL G NY  W +  +  L  K K  FIDG++  P  D  +  
Sbjct: 23  SPYYLHPSDHPHHVLTPMLLNGENYERWAKLTRNNLQAKQKLGFIDGTLTKPSSDSPDYP 82

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
            W + N++++ W+  S+ PQ+ ++I   + A  +W  L+ R+S  +  RV  L+  I   
Sbjct: 83  RWLQTNSMLVGWLYASLDPQVQKSISVVDNARVMWESLRTRYSVGNASRVHQLKYDIVAC 142

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           +Q  ++  +YF  +K +W++L+ + P+  C C  P     +R ++  R  +++ QFL GL
Sbjct: 143 RQDGQTAANYFGKLKVMWDDLDDYEPLLTCCCNRPSCTHRVRQSQR-RDHERIHQFLMGL 201

Query: 181 NDS-FSVVKTQV---LLIDPLPSINKVYSMVIQEESNIIPPTSLASNEDSSILVNASDAR 236
           + + F   +T +   L  D   S++ +YS +I EE ++   T   S E+    V  +   
Sbjct: 202 DAAKFGTSRTNILGRLSRDDNISLDSIYSEIIAEERHL---TITRSKEERVDAVGFA--- 255

Query: 237 KPFLRGKSSGTSQSK-NNSRYCTFCRRNNHTVEYCYLKHDFPN----------------- 278
                G ++  S ++ NN   CT C R+NH+ + C+  H  P                  
Sbjct: 256 --VQTGVNAIASVTRVNNMGPCTHCGRSNHSADTCFKLHGVPEWYTEKYGDTSSGRGRGR 313

Query: 279 ----ANKPTASSNAVTSEHAVDSHTSSEGTSSSSQTGLTQEQYVHLVSLLQQSSLVPSAT 334
                 +     N+  + +A  SH SS  +  S   G+++E +  + +LL+Q +      
Sbjct: 314 SSTPRGRGRGHGNSYKANNAQTSHPSSSASEFSDIPGVSKEAWSAIRNLLKQDT------ 367

Query: 335 PPNPASTNHVATSFPSSIDFTSGINTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYY 394
               A+++   +   + +DF                   LIDSGA+ H+   L L    Y
Sbjct: 368 ----ATSSEKLSGKTNCVDF-------------------LIDSGASHHMTGFLDLLTEIY 404

Query: 395 RIKPICVNLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHF 454
            I    V LPN    I    GT++   +  +THVL+ P     LISV+++ + L     F
Sbjct: 405 EIPHSVVVLPNAKHTIATKKGTLILGANMKLTHVLFVPDLSCTLISVARLLRELHCFAIF 464

Query: 455 LLNTYVIQDVKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNN 514
                VIQD  ++ +IG+G   +G+Y L        +A  +++        S N V    
Sbjct: 465 TDKVCVIQDRTSKMLIGVGTESNGVYHLQ-------RAEVVAT--------SANVVKWKT 509

Query: 515 NVSSIPSNAIWHFRLGHLSNQRL-SMMHSL--YSSITIDNKAVCDICHFAKQRKLPYNLS 571
           N       A+WH RLGH S++ L S++ SL  + S + D K +CD+C  AKQ +  ++ S
Sbjct: 510 N------KALWHMRLGHPSSKVLSSVLPSLEDFDSCSSDLKTICDVCVRAKQTRASFSES 563


>UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana]
          Length = 1315

 Score =  213 bits (543), Expect = 3e-53
 Identities = 159/549 (28%), Positives = 245/549 (43%), Gaps = 109/549 (19%)

Query: 32  MKRALGTKNKFVFIDGSVPIPPMDDLNRTAWERCNNLILSWIINSVSPQIAQTIVFHEYA 91
           M  ++  KNK  F+DGS+P P  DD     W RCN+++ SW++NSVS +I  +I++   A
Sbjct: 1   MTTSIEAKNKLGFVDGSIPKPDDDDPYCKIWRRCNSMVKSWLLNSVSKEIYTSILYFPTA 60

Query: 92  IDVWIELQERFSKVDRIRVASLRSSINNLKQGDKSVLDYFTSIKSLWEELNSHRPMPMCT 151
             +W +L  RF K    R+  LR  I++L+QG+  +  Y T  ++LWEEL S + +P   
Sbjct: 61  AAIWKDLYTRFHKSSLPRLYKLRQQIHSLRQGNLDLSSYHTRTQTLWEELTSLQAVP--- 117

Query: 152 CPYPCRCESMRAARDFRME---DQVIQFLTGLNDSFSVVKTQVLLIDPLPSINKVYSMVI 208
                     R   D  +E   ++VI FL GLND +  V++Q+L+   LPS+++V++M+ 
Sbjct: 118 ----------RTVEDLLIERETNRVIDFLMGLNDCYDTVRSQILMKKTLPSLSEVFNMID 167

Query: 209 QEESNIIPPTSLASNEDSSILVNASDARKPFLRGKSSGTSQSKNNSRYCTFCRRNNHTVE 268
           Q+E+      S      SS+   ++ + +  L    +G +  K     C++C R  H  +
Sbjct: 168 QDETQRSARISTTPGMTSSVFPVSNQSSQSAL----NGDTYQKKERPVCSYCSRPGHVED 223

Query: 269 YCYLKHDFPNA-------NKPTASSNAVTSEHAVDSHTSSEGTSSSSQTGLTQEQYVHLV 321
            CY KH +P +        KP+ S+NA      V ++T      S S   LT  Q   LV
Sbjct: 224 TCYKKHGYPTSFKSKQKFVKPSISANAAIGSEEVVNNT------SVSTGDLTTSQIQQLV 277

Query: 322 SLLQQSSLVPSATPPNPASTNHVATSFPSSIDFTSGINTIFSCSLHVPSDHWLIDSGANE 381
           S L  S L P +TP  P   +   +S PSS                         S    
Sbjct: 278 SFL-SSKLQPPSTPVQPEVHSISVSSDPSS-------------------------SSTVC 311

Query: 382 HICSSLHLFHSYYRIKPICVNLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISV 441
            I  S+HL                                H  +  VL+ P FK  L+SV
Sbjct: 312 PISGSVHL------------------------------GRHLILNDVLFIPQFKFNLLSV 341

Query: 442 SKICQSLPYHVHFLLNTYVIQDVKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSP 501
           S + +S+   + F   + V+QD   + M+G+G     LY                  +  
Sbjct: 342 SSLTKSMGCRIWFDETSCVLQDATRELMVGMGKQVANLY------------------IVD 383

Query: 502 TNKMSCNSVSSNNNVSSIPSNAIWHFRLGHLSNQRLSMMHSLYSSITIDNKA--VCDICH 559
            + +S     S+  V+S+ S+ +WH RLGH S Q+L  M SL S     N     C +CH
Sbjct: 384 LDSLSHPGTDSSITVASVTSHDLWHKRLGHPSVQKLQPMSSLLSFPKQKNNTDFHCRVCH 443

Query: 560 FAKQRKLPY 568
            +KQ+ LP+
Sbjct: 444 ISKQKHLPF 452


>UniRef100_Q9FX79 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score =  212 bits (540), Expect = 6e-53
 Identities = 147/507 (28%), Positives = 243/507 (46%), Gaps = 39/507 (7%)

Query: 4   YVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRTAWE 63
           ++H ++ P    V+  L G NY  W+ +MK AL  KNK  FIDGS P P   +     W 
Sbjct: 55  FLHNADHPGISIVSVQLDGANYNQWSSAMKIALDAKNKIAFIDGSCPRPEEGNHLLRIWS 114

Query: 64  RCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNLKQG 123
           RCN+++ SWI+NSV+ +I  +I+  + A  +W +L  RF   +  R   L   I +L+QG
Sbjct: 115 RCNSMVKSWILNSVNREIYGSILSFDDAAQIWNDLHNRFHMTNLPRTFQLVQQIQDLRQG 174

Query: 124 DKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGLNDS 183
             ++  Y+T++K+L + L+       C C     CES   A+      ++I+FL GLN+ 
Sbjct: 175 SMNLSTYYTTLKTLRDNLDGAEASVPCHCCKKSTCESQIFAKSNVNRGRIIKFLAGLNEK 234

Query: 184 FSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPPTSLASNEDSSILVNASDARKPFLRGK 243
           +S+++ Q+++  PLP + +VY+++ Q++S      ++AS   ++  V   D +   L   
Sbjct: 235 YSIIRGQIIMKKPLPDLAEVYNILDQDDSQRQFSNNVAS---AAFQVTKDDVQPGALASS 291

Query: 244 SS-------GTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFP-----------NANKPTAS 285
           S+       G  Q K+    C+      HT E CY  H +P              + + S
Sbjct: 292 SNMPQPGMLGAVQKKDKKSICSHYGYTGHTSERCYKLHGYPVGWKKGKSFYEKIAQASQS 351

Query: 286 SNAVTSEHAVDSHTSSEGTSSSSQTGL-------TQEQYVHLVSLLQQSSLVPSATPPN- 337
           S A     AV +  +  G S ++  GL       +++Q  +L++L   S L P++   N 
Sbjct: 352 SQAPKPNSAVTAQVT--GNSQNTPAGLESLIGNMSKDQIQNLIALF-SSQLQPASPVLNT 408

Query: 338 -PASTNHVATSFPSSIDFTSG----INTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHS 392
            P ST+H   + PS I F+S     I  +      +    W++DSGA  H+C    +F +
Sbjct: 409 APMSTSH--NNDPSGITFSSSTFSFIGILTVSETEMTHGTWIVDSGATHHVCHVKDMFLN 466

Query: 393 YYRIKPICVNLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHV 452
                   VNLP G+++ V   G +  +    + +VLY P F++ L+SVS +   +   V
Sbjct: 467 LDTSVQHHVNLPTGTTIRVGGVGNIAVNADLILKNVLYIPEFRLNLLSVSALTTDIGARV 526

Query: 453 HFLLNTYVIQDVKTQKMIGLGNLCDGL 479
            F     V+ D+     IG   L D L
Sbjct: 527 VFDPTCCVVHDLTKGSTIGSDLLTDVL 553


>UniRef100_O81617 F8M12.17 protein [Arabidopsis thaliana]
          Length = 1633

 Score =  212 bits (540), Expect = 6e-53
 Identities = 156/559 (27%), Positives = 252/559 (44%), Gaps = 71/559 (12%)

Query: 3   YYVHPSEGPNSVTVTPLL-TGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRTA 61
           +++H S+    V V+  L T  ++ +W RS+  AL  +NK  FIDG++  PP+D  +  A
Sbjct: 35  HHLHTSDHAGLVLVSERLNTASDFHSWRRSIWMALNVRNKLGFIDGTIVKPPLDHRDYGA 94

Query: 62  WERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNLK 121
           W RCN+ + +W++NSVS +I Q+++F   A  +W  +  RF + D  RV  +   ++ ++
Sbjct: 95  WSRCNDTVSTWLMNSVSKKIGQSLLFIPTAEGIWKNMLSRFKQDDAPRVYDIEQRLSKIE 154

Query: 122 QGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGLN 181
           QG   +  Y+T +++LWEE  ++  +P+CTC   C C++       +    V +FL GLN
Sbjct: 155 QGSMDISAYYTELQTLWEEHKNYVDLPVCTCGR-CECDAAVKWERLQQRSHVTKFLMGLN 213

Query: 182 DSFSVVKTQVLLIDPLPSINKVYSMVIQEE-SNIIPPTSLASNEDSSILVNASDARKPFL 240
           +S+   +  +L++ P+ +I + +++V Q+E    I PT    N+D   L           
Sbjct: 214 ESYEQTRRHILMLKPIRTIEEAFNIVTQDERQKAIRPTPKVDNQDQLKLP---------- 263

Query: 241 RGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPTASSNAVTSEHAVDSHTS 300
                           CT C +  HTV+ CY    +P   K      A TS       T 
Sbjct: 264 ---------------LCTNCGKVGHTVQKCYKIIGYPPGYK------AATSYRQPQIQTQ 302

Query: 301 SEGTSSSSQTGLTQEQYVHLVSLLQQSSLV--PSATP-----PNPASTNH-----VATS- 347
                        Q+   HL+S       V  P+AT      P    T H      +TS 
Sbjct: 303 PRMQMPQQSQPRMQQPIQHLISQFNAQVRVQEPAATSIYTSSPTATITEHGLMAQTSTSG 362

Query: 348 ---FPSSI------DFTSGINTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYYRIKP 398
              FPS+       + T   +T+ S    + SD W+IDSGA+ H+CS L +F     +  
Sbjct: 363 TIPFPSTSLKYENNNLTFQNHTLSSLQNVLSSDAWIIDSGASSHVCSDLTMFRELIHVSG 422

Query: 399 ICVNLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHF---- 454
           + V LPNG+ V + + GT+  +    + +VL  P FK  LISV   C  L   +      
Sbjct: 423 VTVTLPNGTRVAITHTGTICITSTLILHNVLLVPDFKFNLISVC--CLELTRGLMIGRGK 480

Query: 455 -LLNTYVIQDVKTQKMIGLGNLCD---GLYRLHPFAPASPQAHFISSA-----VSPTNKM 505
              N Y+++  +T     L         L  L     + P    +SS      +SP  K 
Sbjct: 481 TYNNLYILETQRTSFSPSLPAATSRHPSLPALQKLVSSIPSLKSVSSTASHCRISPLAKQ 540

Query: 506 SCNSVSSNNNVSSIPSNAI 524
              +  S+NN++S P + I
Sbjct: 541 KRLAYVSHNNLASSPFDLI 559


>UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1501

 Score =  208 bits (530), Expect = 9e-52
 Identities = 154/576 (26%), Positives = 258/576 (44%), Gaps = 46/576 (7%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S Y +  S+ P +V  +  L G NY  W   M  AL  K K  FI+G++P PP +D N  
Sbjct: 30  SPYTLASSDNPGAVISSVELNGDNYNQWATEMLNALQAKRKTGFINGTIPRPPPNDPNYE 89

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
            W   N++I+ WI  S+ P++  T+ F   A  +W +L++RFS  +++R+  +R+ +++ 
Sbjct: 90  NWTAVNSMIVGWIRTSIEPKVKATVTFISDAHLLWKDLKQRFSVGNKVRIHQIRAQLSSC 149

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           +Q  ++V++Y+  + +LWEE N ++P+ +CTC   CRC +       R E+++ QF+ GL
Sbjct: 150 RQDGQAVIEYYGRLSNLWEEYNIYKPVTVCTCGL-CRCGATSEPTKEREEEKIHQFVLGL 208

Query: 181 NDS-FSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPPTSLASNEDSSILVNASD----- 234
           ++S F  +   ++ +DPLPS+ ++YS VI+EE  +         E++   +   +     
Sbjct: 209 DESRFGGLCATLINMDPLPSLGEIYSRVIREEQRLASVHVREQKEEAVGFLARREQLDHH 268

Query: 235 ARKPFLRGKSSGTSQSKNNSRY-----CTFCRRNNHTVEYCYLKHDFPNANKPTASSNAV 289
           +R      +S  T  S++NS       C+ C R  H  + C+    FP+           
Sbjct: 269 SRVDASSSRSEHTGGSRSNSIIKGRVTCSNCGRTGHEKKECWQIVGFPDWWSERNGGRG- 327

Query: 290 TSEHAVDSHTSSEGTSSSSQTGLTQEQYVHLVSLLQQSSLVPSATPPNPASTNHVATSFP 349
                  +     G  S+   G  Q    H  S    SS+ P  T  +    + +     
Sbjct: 328 ------SNGRGRGGRGSNGGRGQGQVMAAHATS--SNSSVFPEFTEEHMRVLSQLVKE-- 377

Query: 350 SSIDFTSGINTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYYRIKPICVNLPNGSSV 409
            S   ++  N     S        ++DSGA+ H+  +L    +   + P  V   +GS  
Sbjct: 378 KSNSGSTSNNNSDRLSGKTKLGDIILDSGASHHMTGTLSSLTNVVPVPPCPVGFADGSKA 437

Query: 410 IVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVIQDVKTQKM 469
                G +  S    +T+VL+ PS    LISVSK+ +       F      +QD  ++ +
Sbjct: 438 FALSVGVLTLSNTVSLTNVLFVPSLNCTLISVSKLLKQTQCLATFTDTLCFLQDRSSKTL 497

Query: 470 IGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSIPSNAIWHFRL 529
           IG G    G+Y L    PA               K+   +V S+         A+WH RL
Sbjct: 498 IGSGEERGGVYYLTDVTPA---------------KIHTANVDSD--------QALWHQRL 534

Query: 530 GHLSNQRLSMMHSLYSSITIDNKAVCDICHFAKQRK 565
           GH S   LS +     + +      CD+C  AKQ +
Sbjct: 535 GHPSFSVLSSLPLFSKTSSTVTSHSCDVCFRAKQTR 570


>UniRef100_Q9LVQ2 Retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score =  206 bits (524), Expect = 4e-51
 Identities = 148/567 (26%), Positives = 254/567 (44%), Gaps = 40/567 (7%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S Y +  S+ P ++  + +LTG NY  W+  M  AL  K K  FI+GS+  PP+D+ +  
Sbjct: 25  SPYTLASSDNPGAMISSVMLTGDNYNEWSTEMLNALQAKRKTGFINGSISKPPLDNPDYE 84

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
            W+  N++I+ WI  S+ P++  T+ F   A  +W EL++RFS  +++RV  +++ +   
Sbjct: 85  NWQAVNSMIVGWIRASIEPKVKSTVTFISDAHQLWSELKQRFSVGNKVRVHQIKAQLAAC 144

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           +Q  + V+DY+  +  LWEE   ++P+ +C C   C C +       R E+++ QF+ GL
Sbjct: 145 RQDGQPVIDYYGRLCKLWEEFQIYKPITVCKCGL-CTCGATLEPSKEREEEKIHQFVLGL 203

Query: 181 NDS-FSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPPTSLASNEDSSILVNASDARKPF 239
           +DS F  +   ++ +DP PS+ ++YS V++EE   +    +   + S+I      +    
Sbjct: 204 DDSRFGGLSATLIAMDPFPSLGEIYSRVVREEQR-LASVQIREQQQSAIGFLTRQSEVTA 262

Query: 240 LRGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPTASSNAVTSEHAVDSHT 299
                S   +S++ S  C+ C R+ H  + C+    FP+      +     S     S  
Sbjct: 263 DGRTDSSIIKSRDRSVLCSHCGRSGHEKKDCWQIVGFPDWWTERTNGGGRGS----SSRG 318

Query: 300 SSEGTSSSSQTGLTQEQYVHLVSLLQQSSLVPSATPPN-PASTNHVATSFPSSIDFTSGI 358
               +S S+ +G  + Q     +     S  P  TP      T  +      + D  SG 
Sbjct: 319 RGGRSSGSNNSGRGRGQVTAAHATTSNLSSFPEFTPDQLRVITQMIQNKNNGTSDKLSG- 377

Query: 359 NTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYYRIKPICVNLPNGSSVIVQYAGTVV 418
                    +     ++D+GA+ H+   L L  +   I    V   +         GT  
Sbjct: 378 --------KMKLGDVILDTGASHHMTGQLSLLTNIVTIPSCSVGFADDRKTFAISMGTFK 429

Query: 419 FSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVIQDVKTQKMIGLGNLCDG 478
            S    +++VLY P+    LISVSK+ + +     F     V+QD  ++ +IG G   DG
Sbjct: 430 LSETVSLSNVLYVPALNCSLISVSKLVKQIKCLALFTDTICVLQDRFSRTLIGTGEERDG 489

Query: 479 LYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSIPSNAIWHFRLGHLSNQRLS 538
           +Y             +++ A + T           + V     +A+WH RLGH S   LS
Sbjct: 490 VY-------------YLTDAATTT----------VHKVDVTTDHALWHQRLGHPSFSVLS 526

Query: 539 MMHSLYSSITIDNKAVCDICHFAKQRK 565
            +     S    +   CD+C  AKQ +
Sbjct: 527 SLPLFSGSSCSVSSRSCDVCFRAKQTR 553


>UniRef100_Q9ZVW0 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score =  205 bits (521), Expect = 9e-51
 Identities = 148/567 (26%), Positives = 254/567 (44%), Gaps = 40/567 (7%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S Y +  S+ P ++  + +LTG NY  W+  M  AL  K K  FI+GS+  PP+D+ +  
Sbjct: 25  SPYTLASSDNPGAMISSVMLTGDNYNEWSTKMLNALQAKRKTGFINGSISKPPLDNPDYE 84

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
            W+  N++I+ WI  S+ P++  T+ F   A  +W EL++RFS  +++ V  +++ +   
Sbjct: 85  NWQAVNSMIVGWIRASIEPKVKSTVTFICDAHQLWSELKQRFSVGNKVHVHQIKTQLAAC 144

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           +Q  + V+DY+  +  LWEE   ++P+ +C C   C C +       R E+++ QF+ GL
Sbjct: 145 RQDGQPVIDYYGRLCKLWEEFQIYKPITVCKCGL-CTCGATLEPSKEREEEKIHQFVLGL 203

Query: 181 NDS-FSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPPTSLASNEDSSILVNASDARKPF 239
           +DS F  +   ++ +DP PS+ ++YS V++EE   +    +   + S+I      +    
Sbjct: 204 DDSRFGGLSATLIAMDPFPSLGEIYSRVVREEQR-LASVQIREQQQSAIGFLTRQSEVTA 262

Query: 240 LRGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPTASSNAVTSEHAVDSHT 299
                S   +S++ S  C+ C R+ H  + C+    FP+      +     S     S  
Sbjct: 263 DGRTDSSIIKSRDRSVLCSHCGRSGHEKKDCWQIVGFPDWWTERTNGGGRGS----SSRG 318

Query: 300 SSEGTSSSSQTGLTQEQYVHLVSLLQQSSLVPSATPPN-PASTNHVATSFPSSIDFTSGI 358
               +S S+ +G  + Q     +     S  P  TP      T  +      + D  SG 
Sbjct: 319 RGGRSSGSNNSGRGRGQVTAAHATTSNLSPFPEFTPDQLRVITQMIQNKNNGTSDKLSG- 377

Query: 359 NTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYYRIKPICVNLPNGSSVIVQYAGTVV 418
                    +     ++D+GA+ H+   L L  +   I    V   +G        GT  
Sbjct: 378 --------KMKLGDVILDTGASHHMTGQLSLLTNIVTIPSCSVGFADGRKTFAISMGTFK 429

Query: 419 FSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVIQDVKTQKMIGLGNLCDG 478
            S    +++VLY P+    LISVSK+ + +     F     V+QD  ++ +IG G   DG
Sbjct: 430 LSETVSLSNVLYVPALNCSLISVSKLVKQIKCLALFTDTICVLQDRFSRTLIGTGEERDG 489

Query: 479 LYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSIPSNAIWHFRLGHLSNQRLS 538
           +Y             +++ A + T           + V     +A+WH RLGH S   LS
Sbjct: 490 VY-------------YLTDAATTT----------VHKVDITTDHALWHQRLGHPSFSVLS 526

Query: 539 MMHSLYSSITIDNKAVCDICHFAKQRK 565
            +     S    +   CD+C  AKQ +
Sbjct: 527 SLPLFSGSSCSVSSRSCDVCFRAKQTR 553


>UniRef100_Q9MAJ8 F27F5.19 [Arabidopsis thaliana]
          Length = 1309

 Score =  199 bits (505), Expect = 7e-49
 Identities = 147/591 (24%), Positives = 275/591 (45%), Gaps = 100/591 (16%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S +++H S+      V+ +L G +Y  W+ +M+ +L  KNK  F+DGS+P P + D    
Sbjct: 66  SPFFLHSSDHLGLNIVSHVLDGTSYNNWSIAMRMSLDAKNKLSFVDGSLPRPDVSDRMFR 125

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
            W RCN+++ +W++N VS +I  +I+++E A+++W +L  RF   +  R   L  SI+ L
Sbjct: 126 IWSRCNSMVKTWLLNVVSKEIYDSILYYEDAVEMWNDLFSRFRVSNLPRKYQLEQSIHTL 185

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           KQ +  +  Y+T  K+LW +L + R + +      C C+ ++   +     ++IQFL GL
Sbjct: 186 KQRNLDLSTYYTKKKTLWVQLANTRVLTV----RKCNCDHVKELLEEAETSRIIQFLMGL 241

Query: 181 NDSFSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPPTSLASNEDSSILVNASDARKPFL 240
           ND+F+ ++ Q+L + P P + ++Y+M+ Q+ES  +  ++  SN  ++  V AS    P +
Sbjct: 242 NDNFAHIRGQILNMKPRPGLTEIYNMLDQDESQRLVGSTPLSNLTAAFQVQAS----PVI 297

Query: 241 RGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPT-----ASSNAVTSEHAV 295
             + +  +Q       C+FC +  H V+ CY KH +P  +K T      S+N  +++   
Sbjct: 298 DSQVN-MAQGSYKKPKCSFCNKLGHLVDKCYKKHGYPPGSKWTKAQTIGSTNLASTQLQP 356

Query: 296 DSHTSSEGTSSSSQ--TGLTQEQYVHLVSLLQQSSLVPSATPPNPASTNHVATSFPSSID 353
            + T SE T S  +  T   Q    +L + L  +S+ P   P   +++   + S P    
Sbjct: 357 VNETPSEKTDSCEEFSTDQIQTMISYLSTKLHTASISP--MPITSSASTSASPSVPMISQ 414

Query: 354 FTSGINTIFSCSLH------------VPSDHWLIDSGANEHICSSLHLFHSYYRIKPICV 401
            +S   ++FS + +            V    W+IDSGA  H+  +  L+  +  ++   V
Sbjct: 415 ISSTFLSLFSNAYYDMLISSISQEPAVSPRAWVIDSGAIHHVTHNRDLYLEFRILENTFV 474

Query: 402 NLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVI 461
            LPN  +V                         K+  I   ++  ++  H          
Sbjct: 475 RLPNDCTV-------------------------KIAGIGFIQLSDAISLH---------- 499

Query: 462 QDVKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSIPS 521
            ++  + MIG G                      ++++SP   + C+SV        +  
Sbjct: 500 NELTKELMIGRG----------------------TNSMSPEFSI-CSSV--------VVD 528

Query: 522 NAIWHFRLGHLSNQRLSMMHSLYS----SITIDNKAVCDICHFAKQRKLPY 568
           +  WH RLG+ +  ++ ++  + +     I  ++  VC +CH +KQ+ L +
Sbjct: 529 SITWHKRLGYPAYSKIDLLSDVLNLKDKKINKEHSPVCRVCHLSKQKHLSF 579


>UniRef100_Q9LGZ8 Retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1098

 Score =  198 bits (503), Expect = 1e-48
 Identities = 155/572 (27%), Positives = 251/572 (43%), Gaps = 44/572 (7%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S Y +  S+ P ++  + +L   NY  W   +  +L  K K  F+DG++P P  +    +
Sbjct: 12  SPYGITASDNPGALISSVILKEDNYSEWAEELMNSLQAKQKLGFLDGTIPKPTTEPA-LS 70

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
           +W+  N++I+ WI  S+ P I  T+ F   A D+W  L++RFS  + +R   L+  I   
Sbjct: 71  SWKAANSMIIGWIRTSIDPTIRSTVAFVSDAKDLWDSLKQRFSNGNGVRKQLLKDEILAC 130

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           KQ  +SVL Y+  +  LWEEL +++    CT      CE+       R +D+V QFL  L
Sbjct: 131 KQDGQSVLVYYGRLTKLWEELQNYKTSRTCT------CEAAPDIAKEREDDKVHQFLLNL 184

Query: 181 NDSFSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPP--TSLASNEDSSILVNASDARKP 238
           ++ F  +++ + + DPLP++N+VYS VI EE N+           E     V A+     
Sbjct: 185 DERFRPIRSTITVQDPLPALNQVYSRVIHEEQNLNASRIKDDIKTEAVGFTVQATPLPPT 244

Query: 239 FLRGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPN----ANKPTASSNAVTSEHA 294
                 S       +S  CT   R  H +  C+L H +P+     N    S+   TS   
Sbjct: 245 PQVAAVSAPRFRDRSSLTCTHYHRQGHDITECFLVHGYPDWWLEQNGSNGSAGRGTSGRG 304

Query: 295 VDSHTSSEGTSSSSQTGLTQEQYVHLVSLLQQSSLVPSATPPNPASTNHVATSFPSSIDF 354
            +   ++     SS +G   +   +  S    +   P++TP N    N + +   +    
Sbjct: 305 NNGRGNNNRGGRSSSSGSRGKGRANAAS----THPPPTSTPSNADQINQLISLLQAQNPA 360

Query: 355 TSGINTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYYRIKPICVNLPNGSSVIVQYA 414
           TS        S    + + +ID+GA+ H+   + L  +   I P  V  P+G++      
Sbjct: 361 TSS----QKLSGKTFTTYVIIDTGASHHMTGDITLLTNVEDIIPSPVTKPDGTASRATKR 416

Query: 415 GTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVIQDVKTQKMIGLGN 474
           GT+     + +  VL+ P F   LISV+K+ +       F      +QD  T+ +IG G 
Sbjct: 417 GTLALHNAYVLPDVLFVPDFNCTLISVAKLLKHTGCVAIFTDTLCFLQDRFTRTLIGAGE 476

Query: 475 LCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSIPSNAIWHFRLGHLS- 533
             +G+Y            +F     +  NK    S S+           +WH RLGH S 
Sbjct: 477 EREGVY------------YFTGVLAARVNKGFKESSSA----------TLWHHRLGHPST 514

Query: 534 NQRLSMMHSLYSSITIDNKAVCDICHFAKQRK 565
              LS      SS  ++    CDIC+ AKQ +
Sbjct: 515 GVLLSFPEFASSSSDLEIIKSCDICYRAKQAR 546


>UniRef100_O22175 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1496

 Score =  193 bits (490), Expect = 4e-47
 Identities = 155/593 (26%), Positives = 254/593 (42%), Gaps = 89/593 (15%)

Query: 3   YYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRTAW 62
           Y ++ S+ P ++  + +L   NY  W+  ++  L  K K  FIDGS+P P  D    + W
Sbjct: 22  YLINASDNPGALISSVVLKENNYAEWSEELQNFLRAKQKLGFIDGSIPKPAADP-ELSLW 80

Query: 63  ERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNLKQ 122
              N++I+ WI  S+ P I  T+ F   A  +W  L+ RFS  + +R   L+  I    Q
Sbjct: 81  IAINSMIVGWIRTSIDPTIRSTVGFVSEASQLWENLRRRFSVGNGVRKTLLKDEIAACTQ 140

Query: 123 GDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGLND 182
             + VL Y+  +  LWEEL +++          C+CE+       R +D+V +FL GL+ 
Sbjct: 141 DGQPVLAYYGRLIKLWEELQNYK------SGRECKCEAASDIEKEREDDRVHKFLLGLDS 194

Query: 183 SFSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPPTSLASNEDSSILVNASDARKPFLRG 242
            FS +++ +  I+PLP + +VYS V++EE N+    +    +  +I  +   +  P  R 
Sbjct: 195 RFSSIRSSITDIEPLPDLYQVYSRVVREEQNLNASRTKDVVKTEAIGFSVQSSTTPRFRD 254

Query: 243 KSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPN--------ANKPT---------AS 285
           KS         + +CT C R  H V  C+L H +P+         N+P+          S
Sbjct: 255 KS---------TLFCTHCNRKGHEVTQCFLVHGYPDWWLEQNPQENQPSTRGRGSNGRGS 305

Query: 286 SNAVTSEHAVDSHTSSEGTSSSSQ------TGLTQEQYVHLVSLLQQSSLVPSATPPNPA 339
           S+      +    T   G ++++Q      +G   +Q   L+SLLQ            P+
Sbjct: 306 SSGRGGNRSSAPTTRGRGRANNAQAAAPTVSGDGNDQIAQLISLLQAQ---------RPS 356

Query: 340 STNHVATSFPSSIDFTSGINTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYYRIKPI 399
           S++        +   T G+                ID+GA+ H+     +    + I P 
Sbjct: 357 SSSE---RLSGNTCLTDGV----------------IDTGASHHMTGDCSILVDVFDITPS 397

Query: 400 CVNLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTY 459
            V  P+G +      GT++    + +  VL+ P F   LISVSK+ +       F     
Sbjct: 398 PVTKPDGKASQATKCGTLLLHDSYKLHDVLFVPDFDCTLISVSKLLKQTSSIAIFTDTFC 457

Query: 460 VIQDVKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSI 519
            +QD   + +IG G   +G+Y  +     +P+ H  SS  +                   
Sbjct: 458 FLQDRFLRTLIGAGEEREGVY--YFTGVLAPRVHKASSDFA------------------- 496

Query: 520 PSNAIWHFRLGHLSNQ-RLSMMHSLYSSITIDNKAVCDICHFAKQRKLPYNLS 571
            S  +WH RLGH S    LS+     SS   D    CD C  +KQ +  + +S
Sbjct: 497 ISGDLWHRRLGHPSTSVLLSLPECNRSSQGFDKIDSCDTCFRSKQTREVFPIS 549


>UniRef100_Q9FXB7 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score =  182 bits (461), Expect = 9e-44
 Identities = 148/593 (24%), Positives = 263/593 (43%), Gaps = 82/593 (13%)

Query: 1   SIYYVHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRT 60
           S Y +   + P ++   PLL GPNY  W  +++ AL  + KF F DGS+P P   D +  
Sbjct: 23  SPYDLTSGDNPGTLISKPLLRGPNYDEWATNLRLALKARKKFGFADGSIPQPVETDPDFE 82

Query: 61  AWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNL 120
            W   N L++SW+  ++   ++ ++   + + ++W  +Q+RF   +  RV  L++ +   
Sbjct: 83  DWTANNALVVSWMKLTIDETVSTSMSHLDDSHELWTHIQKRFGVKNGQRVQRLKTELATC 142

Query: 121 KQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGL 180
           +Q   ++  Y+  +  LW  L  ++           + ++M   R  R ED++ QFL GL
Sbjct: 143 RQKGVAIETYYGRLSQLWRSLADYQ-----------QAKTMDDVRKEREEDKLHQFLMGL 191

Query: 181 NDS-FSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPPTSLASNEDSSILVNASDARKPF 239
           ++S +  VK+ +L   PLPS+ + Y+             +L  +E+S  L    + R   
Sbjct: 192 DESVYGAVKSALLSRVPLPSLEEAYN-------------ALTQDEESKSLSRLHNERVDG 238

Query: 240 LRGKSSGTSQSKNNS--RYCTFCRRNNHTVEYCYLKHDFPN--------ANKPTASSNAV 289
           +      TS+ +++S  R C+ C R  H  E C+    +P          N  ++S   +
Sbjct: 239 VSFAVQTTSRPRDSSENRVCSNCGRVGHLAEQCFKLIGYPPWLEEKLRLKNTASSSRGGL 298

Query: 290 TSEHAVDSHTSSEGTSSSSQTGLTQEQYVHLVSLLQQSSLVPSATPPNPASTNHVATSFP 349
           +S     SH      +  + +G+         +++  SSL    T     S + +  S  
Sbjct: 299 SSFKGKQSHGRGSSINHVASSGMA-------ANVVTNSSLTSPLT-----SDDRIGLSGL 346

Query: 350 SSIDFTSGINTIFSCSLHVPSDH---------WLIDSGANEHICSSLHLFHSYYRIKPIC 400
           +   +   + TI        +DH         W+IDSGA  H+  SL    +   + P+ 
Sbjct: 347 NDSQWKI-LQTILEERKSTSNDHQSGKYFLESWIIDSGATNHMTGSLAFLRNVCDMPPVL 405

Query: 401 VNLPNGSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYV 460
           + LP+G        G+V       +  VL+      +LISVS++ ++            +
Sbjct: 406 IKLPDGRFTTATKQGSVQLGSSLDLQDVLFVDGLHCHLISVSQLTRTRRCIFQITDKVCI 465

Query: 461 IQDVKTQKMIGLGNLCDGLYRLHPFAPASPQAHFISSAVSPTNKMSCNSVSSNNNVSSIP 520
           +QD  T  +IG G   +GLY              + +A + T+K             ++P
Sbjct: 466 VQDRTTLMLIGAGRELNGLYFFRG----------VETAAAVTSK-------------ALP 502

Query: 521 SNAIWHFRLGHLSNQRLSMM-HSLYSSITIDNKAVCDICHFAKQRKLPYNLST 572
           S+ +WH RLGH S++ L ++  S  +S T D+K  C+IC  AKQ + P+ LS+
Sbjct: 503 SSQLWHQRLGHPSSKALHLLPFSDVTSSTFDSK-TCEICIQAKQTRDPFPLSS 554


>UniRef100_Q9XIM3 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1100

 Score =  174 bits (442), Expect = 1e-41
 Identities = 136/495 (27%), Positives = 232/495 (46%), Gaps = 66/495 (13%)

Query: 8   SEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRTAWERCNN 67
           ++ P    ++  L   NY  WN +M  +L  KNK  FIDG++P P   D N   W RCN+
Sbjct: 23  ADHPGLNIISHRLDETNYGDWNVAMLISLDVKNKSGFIDGTLPRPLETDKNFCLWSRCNS 82

Query: 68  LILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNLKQGDKSV 127
           +I             ++I+    A D+W +L  RF+  +  R  +L   I +L+QG  S+
Sbjct: 83  MI------------CRSILRMNDASDIWRDLNSRFNMTNLPRTYNLTQEIQDLRQGTLSL 130

Query: 128 LDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGLNDSFSVV 187
            +Y+T +K+LW++L+S   +       PC  +            ++++FL GLN+S++++
Sbjct: 131 SEYYTRLKTLWDQLDSTEEL-----DDPCTRQK---------RAKIVKFLAGLNESYAII 176

Query: 188 KTQVLLIDPLPSINKVYSMVIQEESN------IIPPTSLASNEDSSILVNASDARKPFLR 241
           + Q++    LPS+ +VY +V Q+ S       +  P +   +E     V  +++  P + 
Sbjct: 177 RRQIIAKKILPSLAEVYHIVDQDNSQQGFSNVVARPAAFQVSE-----VTITNSIDPTIY 231

Query: 242 GKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFP-------------NANKPTASSNA 288
              +G ++ ++    C+FC R  H  E CY KH FP                KP AS+ A
Sbjct: 232 YVQNGPNKGRS---MCSFCNRVGHIAERCYKKHGFPPRFTPKGKVGDKTQKPKPVASNVA 288

Query: 289 VTSEHAVDSHT---SSEGTSSSSQTGLTQEQYVHLVSLLQQSSLVPSATPPNPASTNHVA 345
           + +  + D+H+   S  G  S  Q     +Q++ + S   Q     ++   + +  +++ 
Sbjct: 289 LATTESNDTHSGLKSLVGNLSKEQL----QQFIAMFSSQLQPQPHSNSAVASSSQADNIG 344

Query: 346 TSFPSSIDFTSGINTIFSCSLHVPSDHWLIDSGANEHICSSLHLFHSYYRIKPICVNLPN 405
            SF  S     GI T+   +L   S  W+IDSGA  H+   L L  S        VNLP 
Sbjct: 345 ISFSPSTYSFIGILTVAQHTL--SSKTWVIDSGAIHHVNLLLTLNTSVLS----SVNLPA 398

Query: 406 GSSVIVQYAGTVVFSPHFHITHVLYSPSFKVYLISVSKICQSLPYHVHFLLNTYVIQDVK 465
           G +V +   GT+  +    + +VL+ P F + L+S+S +   +   V F  +   IQD+ 
Sbjct: 399 GPTVKISGVGTLRLNDDNLLKNVLFIPEFCLNLMSISSLTDDIGSRVIFNQHACEIQDLI 458

Query: 466 TQKMIGLGNLCDGLY 480
             +M+G G     LY
Sbjct: 459 KGRMLGHGRRVANLY 473


>UniRef100_Q9ZQN0 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 411

 Score =  171 bits (432), Expect = 2e-40
 Identities = 100/336 (29%), Positives = 172/336 (50%), Gaps = 25/336 (7%)

Query: 5   VHPSEGPNSVTVTPLLTGPNYLAWNRSMKRALGTKNKFVFIDGSVPIPPMDDLNRTAWER 64
           +H S+ P    V  +L G NY +W+ +M+ +L  KNK  F+DGS+  P +DD     W R
Sbjct: 69  LHSSDHPGLSIVAHVLDGSNYNSWSIAMRISLDAKNKLGFVDGSLLRPSVDDSTFRIWSR 128

Query: 65  CNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVDRIRVASLRSSINNLKQGD 124
           CN+++ SWI+N V+ +I  +I+++E A+++W +L  RF   +  R   L  ++  LKQG 
Sbjct: 129 CNSMVKSWILNVVNKEIYDSILYYEDAVEMWTDLFTRFRVNNLPRKYQLEQAVMTLKQGS 188

Query: 125 KSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARDFRMEDQVIQFLTGLNDSF 184
            ++  YFT  K+LWE+L + +   +      C C+ ++   +     +VIQFL GLND F
Sbjct: 189 LNLSTYFTKKKTLWEQLLNTKTRSV----KKCDCDQVKELLEDAETSRVIQFLMGLNDDF 244

Query: 185 SVVKTQVLLIDPLPSINKVYSMVIQEESNII------PPTSLASNEDSSILVNASDARKP 238
           + + +Q+L + P P +N++Y+M+ Q+ES  +      P  S A+ +   +L      + P
Sbjct: 245 NTIMSQILNMKPRPGLNEIYNMLDQDESQRLVGHASKPTPSPAAFQTQGLLTE----QNP 300

Query: 239 FLRGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPTASSNAVTSEHAVDSH 298
            L       +Q       CT C R  HTV+ CY  H +P  +      ++      + S 
Sbjct: 301 IL------MAQGNFKKPKCTHCNRIGHTVDKCYKVHGYPPGHPRANQQSSCVGSTNLTSI 354

Query: 299 TSSEGTSSSSQTGLTQEQYV-----HLVSLLQQSSL 329
             SE  +   Q  +  + ++     +L + LQ +S+
Sbjct: 355 DQSENQAPVMQDEIMSKDHIQQLISYLSTKLQSASI 390


  Database: uniref100
    Posted date:  Jan 5, 2005  1:24 AM
  Number of letters in database: 848,049,833
  Number of sequences in database:  2,790,947
  
Lambda     K      H
   0.356    0.156    0.555 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,025,339,216
Number of Sequences: 2790947
Number of extensions: 76561164
Number of successful extensions: 474619
Number of sequences better than 10.0: 433
Number of HSP's better than 10.0 without gapping: 177
Number of HSP's successfully gapped in prelim test: 260
Number of HSP's that attempted gapping in prelim test: 472004
Number of HSP's gapped (non-prelim): 1552
length of query: 1441
length of database: 848,049,833
effective HSP length: 140
effective length of query: 1301
effective length of database: 457,317,253
effective search space: 594969746153
effective search space used: 594969746153
T: 11
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.7 bits)
S2: 81 (35.8 bits)


Medicago: description of AC149491.1