Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC145449.7 + phase: 0 /pseudo
         (965 letters)

Database: uniref100 
           2,790,947 sequences; 848,049,833 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef100_Q6L3H0 Putative receptor kinase [Solanum demissum]          604  e-171
UniRef100_Q6L3Q0 Putative polyprotein [Solanum demissum]              522  e-146
UniRef100_Q7XE85 Putative pol polyprotein [Oryza sativa]              399  e-109
UniRef100_Q5XWK9 Gag-pol polyprotein-like [Solanum tuberosum]         399  e-109
UniRef100_Q9FWZ5 Putative retroelement polyprotein [Arabidopsis ...   389  e-106
UniRef100_Q710T7 Gag-pol polyprotein [Populus deltoides]              385  e-105
UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidop...   367  e-100
UniRef100_Q9ZQK0 Putative retroelement pol polyprotein [Arabidop...   365  3e-99
UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana]              364  8e-99
UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidop...   362  2e-98
UniRef100_Q9FIC5 Retroelement pol polyprotein-like [Arabidopsis ...   358  4e-97
UniRef100_Q94KV0 Polyprotein [Arabidopsis thaliana]                   355  4e-96
UniRef100_Q9C692 Polyprotein, putative [Arabidopsis thaliana]         352  2e-95
UniRef100_O81617 F8M12.17 protein [Arabidopsis thaliana]              349  3e-94
UniRef100_Q9FXB7 Putative retroelement polyprotein [Arabidopsis ...   344  6e-93
UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana]             342  3e-92
UniRef100_Q9FLA4 Polyprotein [Arabidopsis thaliana]                   342  3e-92
UniRef100_Q9SSB1 T18A20.5 protein [Arabidopsis thaliana]              338  6e-91
UniRef100_O23302 Retrovirus-related like polyprotein [Arabidopsi...   332  3e-89
UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana]        331  7e-89

>UniRef100_Q6L3H0 Putative receptor kinase [Solanum demissum]
          Length = 1358

 Score =  604 bits (1558), Expect = e-171
 Identities = 319/599 (53%), Positives = 396/599 (65%), Gaps = 70/599 (11%)

Query: 1   DFNTGKTIGT*SISQGLYYLHSQSS-NICGVSASPDMIHRRLGHPSFDKLKVLVPQLSHL 59
           D +TG+ IGT   SQGLYYL S +S   C ++ SPD+IH+RLGH S  KL+ +VP LS L
Sbjct: 451 DRSTGQMIGTGHESQGLYYLTSSNSLAACSITDSPDLIHKRLGHSSLSKLQKMVPSLSSL 510

Query: 60  KSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFS 119
            +LDCESCQLGKH RA+F  S   RS+S F +VHSD+WGPSRV STLG RY+V+FID +S
Sbjct: 511 STLDCESCQLGKHTRATFSRSTEGRSESIFSLVHSDIWGPSRVSSTLGFRYFVSFIDDYS 570

Query: 120 RCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIH 179
           +CTW+ L+KDRS+LF  F +F +EI+NQFG  IR  RSDNA EY  + F  FM   GIIH
Sbjct: 571 KCTWVFLMKDRSELFSIFKSFFAEIQNQFGVSIRTFRSDNALEYLSSQFREFMTHQGIIH 630

Query: 180 QSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLDN 239
           Q++CP+TPQQNGVAERK+ HL++T RTLL+ ++ P +FWGDA+LT+CYLINRMPSS + N
Sbjct: 631 QTTCPYTPQQNGVAERKNRHLIETARTLLLESNVPLRFWGDAVLTSCYLINRMPSSSIQN 690

Query: 240 EIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCYS 299
           ++P S+LFP+  LY +  RV+GSTCFVH+L PG+DKL+ RA+KCVFLGYSR QKGYRCYS
Sbjct: 691 QVPHSILFPQSHLYPIPPRVFGSTCFVHNLAPGKDKLAPRALKCVFLGYSRVQKGYRCYS 750

Query: 300 PSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTPLFHPIFEPPVSTQSSPQL 359
               R+ +SADVTFFE  P++    T+S+  DV+    IP  L  P F     T +SP +
Sbjct: 751 HDLHRYLMSADVTFFESQPYY----TSSNHPDVSMVLPIPQVLPVPTFVESTVTSTSPVV 806

Query: 360 QSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSDSATAPISSPVVVPPEPSN 419
                                   P+ +    P+  T    DS  AP  +P    P PS 
Sbjct: 807 ----------------------VPPLLTYHRRPRP-TLVPDDSCHAPDPAPTADLPPPSQ 843

Query: 420 DLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQA 479
             P+AL KG                                        EALSH  W+QA
Sbjct: 844 --PLALQKG----------------------------------------EALSHSGWRQA 861

Query: 480 MIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQ 539
           M+DEM AL  + TWELVS   GKS VGCR V+ VK+G DGQVDRLKARLVA GYTQ++G 
Sbjct: 862 MVDEMSALHKSGTWELVSLPAGKSTVGCRWVYAVKIGPDGQVDRLKARLVAKGYTQIFGL 921

Query: 540 DYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGFVAWG 598
           DY+DTF+PVAK+ SVRLF++M A++  PL QLDIKNAFLHGDLEEE+YMEQP GFVA G
Sbjct: 922 DYSDTFAPVAKIASVRLFLSMAAVRHWPLHQLDIKNAFLHGDLEEEVYMEQPPGFVAQG 980



 Score = 36.2 bits (82), Expect = 4.9
 Identities = 14/22 (63%), Positives = 17/22 (76%)

Query: 944  SLRSPRITYICDKMDAYDMYAP 965
            SL  PRI YIC+K+  YD+YAP
Sbjct: 1336 SLTCPRINYICNKLGTYDLYAP 1357


>UniRef100_Q6L3Q0 Putative polyprotein [Solanum demissum]
          Length = 1336

 Score =  522 bits (1344), Expect = e-146
 Identities = 284/605 (46%), Positives = 365/605 (59%), Gaps = 50/605 (8%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQS--SNICGVSASPDMIHRRLGHPSFDKLKVLVPQLSH 58
            D  T + IG   +S GLY L   +  S  C    SP   H RLGHPS   LK L PQ  +
Sbjct: 456  DLMTKQIIGKRHVSDGLYILDEWTPPSVACSSIVSPFEAHCRLGHPSLPVLKKLCPQFHN 515

Query: 59   LKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGF 118
            + S+DCESC   KH R S     NKR+   F++VHSDVWGP  V+S +G RY+VTF+D F
Sbjct: 516  VPSIDCESCHFAKHHRISLSPRNNKRANFAFELVHSDVWGPCPVVSKVGFRYFVTFMDDF 575

Query: 119  SRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGII 178
            SR TWI  +K+RS++F  F  FC+EIK QF   + ILRSDNA+E+  A F ++M   GI+
Sbjct: 576  SRMTWIYFMKNRSEVFSHFSNFCAEIKTQFNASVHILRSDNAREFMSASFQNYMNQYGIL 635

Query: 179  HQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLD 238
            HQSSC  TP QNGVAERK+ HL++T R LL     P +FW D + TA +LINRMPS+VL+
Sbjct: 636  HQSSCVDTPSQNGVAERKNRHLLETARVLLFQMKVPKQFWADTVSTASFLINRMPSTVLN 695

Query: 239  NEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCY 298
             +IP  +LFP  PL+ ++ +V+GSTC+V D+ P   KL  +A+KCVFLGYSR QKGYRCY
Sbjct: 696  GDIPYGVLFPNKPLFPLEPKVFGSTCYVRDVRPHITKLDPKALKCVFLGYSRLQKGYRCY 755

Query: 299  SPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTPLFHPIFE-PPVSTQSSP 357
            SP+  R+ +S DV F E   FF+SP T  +     D + +       I+   P  ++   
Sbjct: 756  SPTLNRYMVSIDVVFSESISFFSSPDTFPTQGQQEDEEWL-------IYRTTPSRSEQHK 808

Query: 358  QLQSNPEFRRYGNIYERRHVEAP--ETSPIDSSDSAPKTVTTDSSDSATAPISSPVVVPP 415
            ++  + E        E    +AP  +T P      + + VT D+  + T   S P+ V P
Sbjct: 809  EVPGSVE-----QSMENVSSDAPLAQTKPPIVQVYSRRQVTNDTCPAPTLSSSDPLPVNP 863

Query: 416  EPSN--DLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSH 473
             P+   D+PIAL K                                S+ +PKTV EAL+H
Sbjct: 864  SPTENLDIPIALRK-------------------------------DSIFVPKTVREALNH 892

Query: 474  QEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGY 533
              W  AM+DE+ AL+ NHTW LV    GK  VGC+ VF +KV  DG + RLKARLVA GY
Sbjct: 893  PGWYDAMLDEIHALDDNHTWNLVDLPKGKKAVGCKWVFTIKVNPDGSMARLKARLVAKGY 952

Query: 534  TQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSG 593
             Q YG DY+DTFSPVAK+TSVRLFI++ A +  PL QL IKNAFLHGDL+EE+YMEQP G
Sbjct: 953  AQTYGVDYSDTFSPVAKLTSVRLFISLAASQNWPLHQLAIKNAFLHGDLQEEVYMEQPPG 1012

Query: 594  FVAWG 598
            FVA G
Sbjct: 1013 FVAQG 1017


>UniRef100_Q7XE85 Putative pol polyprotein [Oryza sativa]
          Length = 1688

 Score =  399 bits (1025), Expect = e-109
 Identities = 266/637 (41%), Positives = 344/637 (53%), Gaps = 47/637 (7%)

Query: 1   DFNTGKTIGT*---SISQGLYYLHSQS------------SNICGVSA-SPDMIHRRLGHP 44
           D +TG  IGT      S GLY L S S            S +C  +  S    H RLGH 
Sbjct: 273 DRHTGAVIGTGHRQKRSCGLYILDSLSLPSSSTNTPSVYSPMCSTACKSFPQWHHRLGHL 332

Query: 45  SFDKLKVLVPQLSHLKSLD------CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWG 98
              +L  L+ Q   L S+       C+ C+LGK V+  +PSS + RS  PFD+VHSDVWG
Sbjct: 333 CGSRLATLINQ-GVLGSVPVDTTFVCKGCKLGKQVQLPYPSSTS-RSSRPFDLVHSDVWG 390

Query: 99  PSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSD 158
            S   S  G  YYV F+D +SR TWI  +K RSQL   + +F   I  QF   IRI RSD
Sbjct: 391 KSPFPSKGGHNYYVIFVDDYSRYTWIYFMKHRSQLISIYQSFAQMIHTQFSSAIRIFRSD 450

Query: 159 NAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFW 218
           +  EY    F  F+ S G + Q SCP    QNGVAERKH H+++T RTLLI +  P  FW
Sbjct: 451 SGGEYMSNAFREFLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFW 510

Query: 219 GDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSA 278
            +AI TA YLIN  PSS L    P  +LF   P Y   LRV+G TC+V      R KL+A
Sbjct: 511 AEAISTAVYLINMQPSSSLQGRSPGEVLFGSPPRYD-HLRVFGCTCYVLLAPRERTKLTA 569

Query: 279 RAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVI 338
           ++V+CVFLGYS   KGYRCY PS RR  IS DVTF E+ PFF S T   S+ + + S + 
Sbjct: 570 QSVECVFLGYSLEHKGYRCYDPSARRIRISRDVTFDENKPFFYSSTNQPSSPENSISFLY 629

Query: 339 PTPLFHPIFEP--PVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSP-------IDSSD 389
             P+  P   P  P++   SP   S P        Y      +P  SP       I +S 
Sbjct: 630 LPPIPSPESLPSSPITPSPSPIPPSVP-----SPTYVPPPPPSPSPSPVSPPPSHIPASS 684

Query: 390 SAPKTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRST--ANPHPVYNFLSYHRL 447
           S P   +T + D+     S    +P E     P  L     S   ++P P YN  +   L
Sbjct: 685 SPPHVPSTITLDTFPFHYSRRPKIPNESQPSQP-TLEDPTCSVDDSSPAPRYNLRARDAL 743

Query: 448 -SPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSI-V 505
            +P+   FV  +  V  P T  EA+    WK AM +E+ ALE  +TW++V P P  ++ +
Sbjct: 744 RAPNRDDFV--VGVVFEPSTYQEAIVLPHWKLAMSEELAALERTNTWDVV-PLPSHAVPI 800

Query: 506 GCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR 565
            C+ V+ VK   DGQV+R KARLVA G+ Q +G+DY++TF+PVA MT+VR  IA+ A + 
Sbjct: 801 TCKWVYKVKTKSDGQVERYKARLVARGFQQAHGRDYDETFAPVAHMTTVRTLIAVAATRS 860

Query: 566 *PLFQLDIKNAFLHGDLEEEIYMEQPSGFVAWGGVVW 602
             + Q+D+KNAFLHGDL EE+YM  P G  A  G V+
Sbjct: 861 WTISQMDVKNAFLHGDLHEEVYMHPPPGVEAPPGHVF 897


>UniRef100_Q5XWK9 Gag-pol polyprotein-like [Solanum tuberosum]
          Length = 1212

 Score =  399 bits (1025), Expect = e-109
 Identities = 228/575 (39%), Positives = 339/575 (58%), Gaps = 74/575 (12%)

Query: 31  SASPDMIHRRLGHPSFDKLKVLVPQLSH-----------LKSLDCESCQLGKHVRASFPS 79
           ++  ++ H+RLGHP+     V++  +S+           + S+DC +C+LGK     FP+
Sbjct: 441 ASKTEVWHKRLGHPN----SVVLSHISNSGLLGNKNKFSVASIDCSTCKLGKSKTLPFPN 496

Query: 80  SPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLT 139
             ++ +K  FD++HSDVWG S ++S    +Y++TFID +SR TW+  L+ +S++F  F T
Sbjct: 497 FGSRATKC-FDVIHSDVWGISPIISHAHFKYFMTFIDDYSRFTWVYFLRSKSEVFSMFKT 555

Query: 140 FCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCH 199
           F + I+ QF   I++LRSD+  EY    F  F+   GI+ Q SCP+TPQQNGVAERK+ H
Sbjct: 556 FLAYIETQFSTCIKLLRSDSGGEYMSYEFKKFLLDKGIVSQHSCPYTPQQNGVAERKNRH 615

Query: 200 LVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRV 259
           L+D TRTLLI +  P K+W +A+ TA YLINR+PS VL+ E P   L+ ++P Y      
Sbjct: 616 LLDVTRTLLIESSVPSKYWVEALSTAVYLINRLPSKVLNLESPYFRLYHQNPNYS-DFHT 674

Query: 260 YGSTCFVHDLTPGR-DKLSARAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTP 318
           +G  CFVH L P + +KLS ++ KC F+GYS +QKG+ CY P + +F IS +V FFE+  
Sbjct: 675 FGCVCFVH-LPPSQCNKLSVQSTKCAFMGYSTSQKGFICYDPCSHKFRISRNVVFFENQY 733

Query: 319 FFASPTTTSSTTDVTDSQVIPTPLFHPIFEPPVSTQSSPQLQSNPEFRRY--GNIYERRH 376
           FF +    SS           +PL  P FE   S+           F+R+  G +YERR 
Sbjct: 734 FFPTIVDLSSV----------SPLL-PTFEDLSSS-----------FKRFKPGFVYERRR 771

Query: 377 VEAPETSPIDSSDSAPKTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRSTANPH 436
              P  +     ++AP+  + +SS S           P EP+          +RST    
Sbjct: 772 PTLPYPNTDPPPETAPQLESENSSRSG----------PLEPT----------RRSTRVSR 811

Query: 437 PVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELV 496
                      +P+++ F S LS++S+P    +A  H+ W++AM +E++AL+ N TW++V
Sbjct: 812 -----------TPNWYGFSSTLSNISVPSCYSQASKHECWQKAMEEELLALKENDTWDIV 860

Query: 497 SPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRL 556
           S       +GC+ V+++K+  DG +DR KARLV +G  Q YG DY +TF+PVAKMT+VR 
Sbjct: 861 SCPSNVRPIGCKWVYSIKLHSDGTLDRYKARLVVLGNRQEYGVDYEETFAPVAKMTTVRT 920

Query: 557 FIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQP 591
            IA+ A +   L+Q D+KNAFLHGDL+E+IYM+ P
Sbjct: 921 IIAIAASQNWSLYQKDVKNAFLHGDLKEDIYMKPP 955


>UniRef100_Q9FWZ5 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1404

 Score =  389 bits (1000), Expect = e-106
 Identities = 237/616 (38%), Positives = 332/616 (53%), Gaps = 33/616 (5%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPDMI--------HRRLGHPSFDKLKVL 52
            D  TGK IG       LY L   S N     +S   +        H RLGHP    LK++
Sbjct: 416  DIETGKVIGEGGSKGELYVLEDLSPNSSSCFSSKSHLGISFNTLWHARLGHPHTRALKLM 475

Query: 53   VPQLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYV 112
            +P +S      CE+C LGKH ++ FP S     K  FD+VHSDVW  S  +S    +Y+V
Sbjct: 476  LPNIS-FDHTSCEACILGKHCKSVFPKSLTIYEKC-FDLVHSDVW-TSPCVSRDNNKYFV 532

Query: 113  TFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFM 172
            TFI+  S+ TWI LL  + ++F AF  F + + NQF   I++ R+DN  EY    F   +
Sbjct: 533  TFINEKSKYTWITLLPSKDRVFEAFTNFETYVTNQFNAKIKVFRTDNGGEYTSQKFRDHL 592

Query: 173  ASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRM 232
            A  GIIHQ+SCP+TPQQNGVAERK+ HL++  R+++ +   P +FWGDA+LTACYLINR 
Sbjct: 593  AKRGIIHQTSCPYTPQQNGVAERKNRHLMEVARSMMFHTSVPKRFWGDAVLTACYLINRT 652

Query: 233  PSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPG--RDKLSARAVKCVFLGYSR 290
            P+ VL +  P  +L    P +   LRV+G  CFV  L PG  R KL A++ KC+FLGYS 
Sbjct: 653  PTKVLSDLSPFEVLNNTKP-FIDHLRVFGCVCFV--LIPGEQRSKLDAKSTKCMFLGYST 709

Query: 291  TQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDS-----QVIPTPLFHP 345
            TQKGY+C+ P+  R +IS DV F E+   + +     +  D+T S     + +   L H 
Sbjct: 710  TQKGYKCFDPTKNRTFISRDVKFLENQD-YNNKKDWENLKDLTHSTSDRVETLKFLLDHL 768

Query: 346  IFEPPVSTQSSPQLQS-----NPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSS 400
              +   +TQ  P++       N E       ++       E  P     S       D S
Sbjct: 769  GNDSTSTTQHQPEMTQDQEDLNQENEEVSLQHQENLTHVQEDPPNTQEHSEHVQEIQDDS 828

Query: 401  DSATAPISSPVVVPPEPSNDLPIALHKGK---RSTANPHPVYNFLSYHRLSPSYFAFVSA 457
                 P     V+PP P       + + K    S A  HP     S   +   + AF+S 
Sbjct: 829  SEDEEPTQ---VLPPPPPLRRSTRIRRKKEFFNSNAVAHPFQATCSLALVPLDHQAFLSK 885

Query: 458  LSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGL 517
            +S   IP+T  EA+  +EW+ A+ DE+ A++ NHTW+      GK  V  R VF +K   
Sbjct: 886  ISEHWIPQTYEEAMEVKEWRDAIADEINAMKRNHTWDEDDLPKGKKTVSSRWVFTIKYKS 945

Query: 518  DGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAF 577
            +G ++R K RLVA G+TQ YG DY +TF+PVAK+ +VR+ +A+       L+Q+D+KNAF
Sbjct: 946  NGDIERYKTRLVARGFTQTYGSDYMETFAPVAKLHTVRVVLALATNLSWGLWQMDVKNAF 1005

Query: 578  LHGDLEEEIYMEQPSG 593
            L G+LE+++YM  P G
Sbjct: 1006 LQGELEDDVYMTPPPG 1021


>UniRef100_Q710T7 Gag-pol polyprotein [Populus deltoides]
          Length = 1382

 Score =  385 bits (988), Expect = e-105
 Identities = 240/620 (38%), Positives = 333/620 (53%), Gaps = 57/620 (9%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPDMI--------------HRRLGHPSF 46
            D  + K IGT     GLY L      +   + + D+               H RLGH S 
Sbjct: 431  DLQSQKLIGTGRRENGLYILDELKVPVVVAATTVDLSFFRLSLSSSSFYLWHSRLGHVSS 490

Query: 47   DKLKVLVPQ--LSHLKSLD---CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSR 101
             +L+ L     L +LK+ D   C  C+L K     F  S +  S SPFD++HSDVWGPS 
Sbjct: 491  SRLRFLASTGALGNLKTCDISDCSGCKLAKFSALPFNRSTSV-SSSPFDLIHSDVWGPSP 549

Query: 102  VMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAK 161
            V +  G RYYV+FID  +R  W+ L+K RS+ F  +  F + IK Q    I+  R D   
Sbjct: 550  VSTKGGSRYYVSFIDDHTRYCWVYLMKHRSEFFEIYAAFRALIKTQHSAVIKCFRCDLGG 609

Query: 162  EYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDA 221
            EY    F   +A  G IHQ+SC  TP+QNGVAERKH H+V+T R+LL++A    +FWG+A
Sbjct: 610  EYTSNKFCQMLALDGTIHQTSCTDTPEQNGVAERKHRHIVETARSLLLSAFVLSEFWGEA 669

Query: 222  ILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAV 281
            +LTA  LIN +PSS      P   L+   P Y    RV+G T FV      R+KLS+R+ 
Sbjct: 670  VLTAVSLINTIPSSHSSGLSPFEKLYGHVPDYS-SFRVFGCTYFVLHPHVERNKLSSRSA 728

Query: 282  KCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTP 341
             CVFLGY   +KGYRC+ P T++ Y+S  V F E  PFF+ P+TT S T      + P  
Sbjct: 729  ICVFLGYGEGKKGYRCFDPITQKLYVSHHVVFLEHIPFFSIPSTTHSLTKSDLIHIDP-- 786

Query: 342  LFHPIFEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSD 401
                 F       +SP ++S       G            T  + S        T ++S 
Sbjct: 787  -----FSEDSGNDTSPYVRSICTHNSAG------------TGTLLSG-------TPEASF 822

Query: 402  SATAPISSPVVVPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSV 461
            S+TAP +S  +V P P   + I     ++ST  P       +Y   S S+ +F++ +  +
Sbjct: 823  SSTAPQASSEIVDPPPRQSIRI-----RKSTKLPD-----FAYSCYSSSFTSFLAYIHCL 872

Query: 462  SIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQV 521
              P +  EA+     +QAM +E+ AL    TW+LV   PGKS+VGCR V+ +K   DG +
Sbjct: 873  FEPSSYKEAILDPLGQQAMDEELSALHKTDTWDLVPLPPGKSVVGCRWVYKIKTNSDGSI 932

Query: 522  DRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGD 581
            +R KARLVA GY+Q YG DY +TF+P+AKMT++R  IA+ ++++  + QLD+KNAFL+GD
Sbjct: 933  ERYKARLVAKGYSQQYGMDYEETFAPIAKMTTIRTLIAVASIRQWHISQLDVKNAFLNGD 992

Query: 582  LEEEIYMEQPSGFVAWGGVV 601
            L+EE+YM  P G     G V
Sbjct: 993  LQEEVYMAPPPGISHDSGYV 1012


>UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1501

 Score =  367 bits (942), Expect = e-100
 Identities = 229/653 (35%), Positives = 336/653 (51%), Gaps = 80/653 (12%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQSS---NICGVSASPDMIHRRLGHPSFDKLKVLV---P 54
            D ++   IG+     G+YYL   +    +   V +   + H+RLGHPSF  L  L     
Sbjct: 491  DRSSKTLIGSGEERGGVYYLTDVTPAKIHTANVDSDQALWHQRLGHPSFSVLSSLPLFSK 550

Query: 55   QLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTF 114
              S + S  C+ C   K  R  FP S NK  +  F ++H DVWGP RV ++ G  Y++T 
Sbjct: 551  TSSTVTSHSCDVCFRAKQTREVFPESINKTEEC-FSLIHCDVWGPYRVPASCGAVYFLTI 609

Query: 115  IDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMAS 174
            +D +SR  W  LL ++S++      F    + QFGK ++++RSDN  E  F   +S+   
Sbjct: 610  VDDYSRAVWTYLLLEKSEVRQVLTNFLKYAEKQFGKTVKMVRSDNGTE--FMCLSSYFRE 667

Query: 175  LGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPS 234
             GIIHQ+SC  TPQQNG  ERKH H+++  R LL  A  P KFWG++ILTA YLINR PS
Sbjct: 668  NGIIHQTSCVGTPQQNGRVERKHRHILNVARALLFQASLPIKFWGESILTAAYLINRTPS 727

Query: 235  SVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKG 294
            S+L    P  +L    P+Y  QLRV+GS C+VH +T  +DK   R+  C+F+GY   +KG
Sbjct: 728  SILSGRTPYEVLHGSKPVYS-QLRVFGSACYVHRVTRDKDKFGQRSRSCIFVGYPFGKKG 786

Query: 295  YRCYSPSTRRFYISADVTFFEDT-PFFASPTTTSSTT----------------------D 331
            ++ Y      F +S DV F E+  P+    ++T ++T                      D
Sbjct: 787  WKVYDIERNEFLVSRDVIFREEVFPYAGVNSSTLASTSLPTVSEDDDWAIPPLEVRGSID 846

Query: 332  VTDSQVIPTPLFHPIFEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDS- 390
              +++ +       + +  VS    P  +  P+             + P +SP+  S S 
Sbjct: 847  SVETERVVCTTDEVVLDTSVSDSEIPNQEFVPD-------------DTPPSSPLSVSPSG 893

Query: 391  APKTVTTDSSDSATAPISSPVVV-------------PPEPSNDL--------PIALHK-- 427
            +P T TT        P++SP+ V             PP   ND         P ++H   
Sbjct: 894  SPNTPTTP----IVVPVASPIPVSPPKQRKSKRATHPPPKLNDYVLYNAMYTPSSIHALP 949

Query: 428  --GKRSTANP----HPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMI 481
                +S+  P     P+ +++S    S S+ A+++A++    PK   EA+  + W  AM 
Sbjct: 950  ADPSQSSTVPGKSLFPLTDYVSDAAFSSSHRAYLAAITDNVEPKHFKEAVQIKVWNDAMF 1009

Query: 482  DEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDY 541
             E+ ALE N TW++V   PGK  +G + VF  K   DG V+R KARLV  G  QV G+DY
Sbjct: 1010 TEVDALEINKTWDIVDLPPGKVAIGSQWVFKTKYNSDGTVERYKARLVVQGNKQVEGEDY 1069

Query: 542  NDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGF 594
             +TF+PV +MT+VR  +   A  +  ++Q+D+ NAFLHGDLEEE+YM+ P GF
Sbjct: 1070 KETFAPVVRMTTVRTLLRNVAANQWEVYQMDVHNAFLHGDLEEEVYMKLPPGF 1122


>UniRef100_Q9ZQK0 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1664

 Score =  365 bits (938), Expect = 3e-99
 Identities = 231/636 (36%), Positives = 335/636 (52%), Gaps = 57/636 (8%)

Query: 1    DFNTGKTIGT*SISQGLYYLH---------SQSSNICGVSASPDMIHRRLGHPSFDKLKV 51
            D  T + +G      GLY L          S  S+I G +A+ +  H RLGHP    LK+
Sbjct: 400  DIETSRVLGQGVTKDGLYVLEDTKPSVPLSSHFSSILG-NANSESWHARLGHPHSRALKL 458

Query: 52   LVPQLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYY 111
            L+P  S  K+ +CE+C LGKH ++ FP S     K  FD++HSDVW  S  +S    +Y+
Sbjct: 459  LLPSTS-FKNDECEACILGKHCKSVFPKSSTIYEKC-FDLIHSDVW-TSPCLSRENHKYF 515

Query: 112  VTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSF 171
            VTFID  S+ TW  LL  + ++  AF  F + + N +   I+ILRSDN  EY    F   
Sbjct: 516  VTFIDEKSKFTWFTLLPSKDRVLEAFTNFQTYVTNHYDAKIKILRSDNRGEYTSHAFKQH 575

Query: 172  MASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINR 231
            +   GIIHQ+SCP+TPQQNGVAERK+ HL++  R ++ + + P  FW D +++ACYLIN+
Sbjct: 576  LNKHGIIHQTSCPYTPQQNGVAERKNRHLMEVRRVMMFHTNVPKHFWIDGVVSACYLINQ 635

Query: 232  MPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRT 291
             P+ +L +  P  +L    P     LRV+G  CFV      R+KL  ++ K +F+GYS  
Sbjct: 636  TPTKILLDSSPFEVLNKVKPFIN-HLRVFGCVCFVLISGEQRNKLQPKSTKGMFIGYSIN 694

Query: 292  QKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTPLFHPIFE--- 348
            QKGY+CY   TR+  IS DV F E   ++          D+TDS          I E   
Sbjct: 695  QKGYKCYVLETRKVLISRDVKFLESKSYY-DKKNWEDIQDLTDSPSDRATNLRIILERLG 753

Query: 349  -PPVSTQSSPQLQSNPEF------------------RRYGNIYERRHVEAPETSPIDSSD 389
               + TQ++P+  SNPE                    + G   E   +E  E+S +   D
Sbjct: 754  VSNIQTQTTPRT-SNPETITQPENMEEEEEEEEEEEEKQGKEQELITLEETESSKVQEKD 812

Query: 390  SA---PKTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRSTAN---------PHP 437
            ++        T++ +  +     P +  P  S  L     K KR   N          HP
Sbjct: 813  TSLLNDDNGHTNNQEEDSNSREEPRI--PRRSEHL-----KDKRVYYNNQVYFDNVVEHP 865

Query: 438  VYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVS 497
            +    +   L   +  F   +    IP+T  EA++HQ W+ A+  E  A+E+NHTW+   
Sbjct: 866  IQVVCTLAHLPEEHQVFFGKVDQHWIPQTYEEAITHQVWRDAIAAEKQAMENNHTWDEDE 925

Query: 498  PSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLF 557
               GK +V  + VF +K   DG+++R KARLVA G+TQ YG+DY DTF+PVAK+ +VR+ 
Sbjct: 926  LPRGKKVVTSKWVFAIKYKSDGEIERYKARLVARGFTQTYGEDYLDTFAPVAKLHTVRVV 985

Query: 558  IAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSG 593
            +++T      L+Q+D+KNAFL G+LEE++YM+ P G
Sbjct: 986  LSLTTNLEWDLWQMDVKNAFLQGELEEKVYMKPPPG 1021


>UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana]
          Length = 1315

 Score =  364 bits (934), Expect = 8e-99
 Identities = 220/585 (37%), Positives = 301/585 (50%), Gaps = 69/585 (11%)

Query: 33  SPDMIHRRLGHPSFDKLKVLVPQLSHLKSLD-----CESCQLGKHVRASFPSSPNKRSKS 87
           S D+ H+RLGHPS  KL+ +   LS  K  +     C  C + K     F S  NK S+ 
Sbjct: 403 SHDLWHKRLGHPSVQKLQPMSSLLSFPKQKNNTDFHCRVCHISKQKHLPFVSHNNKSSR- 461

Query: 88  PFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQ 147
           PFD++H D WGP  V +  G RY++T +D +SR TW+ LL+++S +     TF + ++NQ
Sbjct: 462 PFDLIHIDTWGPFSVQTHDGYRYFLTIVDDYSRATWVYLLRNKSDVLTVIPTFVTMVENQ 521

Query: 148 FGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTL 207
           F   I+ +RSDNA E     F  F  S GI+   SCP TPQQN V ERKH H+++  R+L
Sbjct: 522 FETTIKGVRSDNAPEL---NFTQFYHSKGIVPYHSCPETPQQNSVVERKHQHILNVARSL 578

Query: 208 LINAHAPFKFWGDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVH 267
              +H P  +WGD ILTA YLINR+P+ +L+++ P  +L    P Y   ++V+G  C+  
Sbjct: 579 FFQSHIPISYWGDCILTAVYLINRLPAPILEDKCPFEVLTKTVPTYD-HIKVFGCLCYAS 637

Query: 268 DLTPGRDKLSARAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDT-PFFASPTTT 326
                R K S RA  C F+GY    KGY+     T    +S  V F E+  PF  S    
Sbjct: 638 TSPKDRHKFSPRAKACAFIGYPSGFKGYKLLDLETHSIIVSRHVVFHEELFPFLGS---- 693

Query: 327 SSTTDVTDSQVIPTPLFHPIFEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPID 386
               D++  +    P  +P   PP+  QSS  +                       +P D
Sbjct: 694 ----DLSQEEQNFFPDLNP--TPPMQRQSSDHV-----------------------NPSD 724

Query: 387 SSDSAPKTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRS------------TAN 434
           SS S               P ++P    PEPS  +  +  K K+             ++ 
Sbjct: 725 SSSSV-----------EILPSANPTNNVPEPS--VQTSHRKAKKPAYLQDYYCHSVVSST 771

Query: 435 PHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWE 494
           PH +  FLSY R++  Y  F++ L     P    EA   Q W+ AM  E   LE  HTWE
Sbjct: 772 PHEIRKFLSYDRINDPYLTFLACLDKTKEPSNYTEAEKLQVWRDAMGAEFDFLEGTHTWE 831

Query: 495 LVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSV 554
           + S    K  +GCR +F +K   DG V+R KARLVA GYTQ  G DYN+TFSPVAK+ SV
Sbjct: 832 VCSLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNSV 891

Query: 555 RLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGFVAWGG 599
           +L + + A  +  L QLDI NAFL+GDL+EEIYM  P G+ +  G
Sbjct: 892 KLLLGVAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQG 936


>UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1454

 Score =  362 bits (930), Expect = 2e-98
 Identities = 224/613 (36%), Positives = 324/613 (52%), Gaps = 60/613 (9%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPD--MIHRRLGHPSFDKLKVLVPQLSH 58
            D   G+ +G       LY L     +I  V+A  D  M HRRLGH S  +L  +   L  
Sbjct: 521  DLIKGRMLGQGRRVANLYLLDVGDQSI-SVNAVVDISMWHRRLGHASLQRLDAISDSLGT 579

Query: 59   LKSLD-----CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVT 113
             +  +     C  C L K  + SFP+S NK  K  FD++H DVWGP  V +  G +Y++T
Sbjct: 580  TRHKNKGSDFCHVCHLAKQRKLSFPTS-NKVCKEIFDLLHIDVWGPFSVETVEGYKYFLT 638

Query: 114  FIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMA 173
             +D  SR TW+ LLK +S++   F  F  +++NQ+   ++ +RSDNA E     F SF A
Sbjct: 639  IVDDHSRATWMYLLKTKSEVLTVFPAFIQQVENQYKVKVKAVRSDNAPEL---KFTSFYA 695

Query: 174  SLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMP 233
              GI+   SCP TP+QN V ERKH H+++  R L+  +  P   WGD +LTA +LINR P
Sbjct: 696  EKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSQVPLSLWGDCVLTAVFLINRTP 755

Query: 234  SSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQK 293
            S +L N+ P  +L    P+Y  QLR +G  C+       R K   R+  C+FLGY    K
Sbjct: 756  SQLLMNKTPYEILTGTAPVYE-QLRTFGCLCYSSTSPKQRHKFQPRSRACLFLGYPSGYK 814

Query: 294  GYRCYSPSTRRFYISADVTFFEDT-PFFASPTTTSSTTDVTDSQVIPTPLFHPIFEPPVS 352
            GY+     +   +IS +V F E+  P   +P + SS             LF P+   PVS
Sbjct: 815  GYKLMDLESNTVFISRNVQFHEEVFPLAKNPGSESS-----------LKLFTPMV--PVS 861

Query: 353  TQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSDSATAPISSPVV 412
            +               G I +  H  +P + P   SD  P+              S  V 
Sbjct: 862  S---------------GIISDTTH--SPSSLPSQISDLPPQI------------SSQRVR 892

Query: 413  VPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALS 472
             PP   ND     H     + + +P+ + +SY ++SPS+  +++ ++ + IP    EA  
Sbjct: 893  KPPAHLND----YHCNTMQSDHKYPISSTISYSKISPSHMCYINNITKIPIPTNYAEAQD 948

Query: 473  HQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIG 532
             +EW +A+  E+ A+E  +TWE+ +   GK  VGC+ VF +K   DG ++R KARLVA G
Sbjct: 949  TKEWCEAVDAEIGAMEKTNTWEITTLPKGKKAVGCKWVFTLKFLADGNLERYKARLVAKG 1008

Query: 533  YTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPS 592
            YTQ  G DY DTFSPVAKMT+++L + ++A K+  L QLD+ NAFL+G+LEEEI+M+ P 
Sbjct: 1009 YTQKEGLDYTDTFSPVAKMTTIKLLLKVSASKKWFLKQLDVSNAFLNGELEEEIFMKIPE 1068

Query: 593  GFVAWGGVVWYAN 605
            G+    G+V  +N
Sbjct: 1069 GYAERKGIVLPSN 1081


>UniRef100_Q9FIC5 Retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1462

 Score =  358 bits (919), Expect = 4e-97
 Identities = 234/641 (36%), Positives = 336/641 (51%), Gaps = 65/641 (10%)

Query: 8    IGT*SISQGLYYLHSQSS--NICGVSASPDMIHRRLGHPSFDKLKVLVPQLSHLKSLD-- 63
            IG      GLY+     +  ++  + +S  + H RLGHPS   LK+L    S   + D  
Sbjct: 440  IGAGKQQNGLYFFRGTETVASMTRMDSSSQLWHCRLGHPSSKVLKLLSFSDSTGHAFDSK 499

Query: 64   -CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCT 122
             CE C   K  R  FP S NK S SPF++VH D+WGP R  S  G  Y++T +D ++R  
Sbjct: 500  TCEICIKAKQTRDPFPLSNNKTS-SPFEMVHCDLWGPYRTTSICGSNYFLTLVDNYTRAV 558

Query: 123  WIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSS 182
            W+ LL  +         F S ++ QF   I+ +RSDN  E  F   +SF    GIIH++S
Sbjct: 559  WLYLLPSKQTAPMHLKNFISLVERQFSTKIKTIRSDNGTE--FVCLSSFFVDHGIIHETS 616

Query: 183  CPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLDNEIP 242
            C  TPQQNG  ERKH H+++  R L   A  P +FW    LTA YLINR P+ +L  + P
Sbjct: 617  CVGTPQQNGRVERKHRHILNVARALRFQARLPIEFWSYCALTAAYLINRTPTPLLQGKTP 676

Query: 243  QSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCYSPST 302
              LL+ + P     +RV+G  C+VH+   G DK  +R+ K +FLGY   +KG+R Y+  T
Sbjct: 677  FELLYNRPPPVN-HIRVFGCICYVHNQKHGGDKFESRSNKSIFLGYPFAKKGWRVYNFET 735

Query: 303  RRFYISADVTFFE-DTPFFAS-----PTTTSSTTDVTDSQVIPTPLFHPIFEPPVS---- 352
                +S DV F E + PF AS     P +  S ++   S  +P+ L  P    PVS    
Sbjct: 736  GVISVSRDVVFRETEFPFPASVFDSTPDSQLSPSNADQSFFLPSELQAPT---PVSITTT 792

Query: 353  ---TQSSPQLQSNPEFRRY--------GNIYERRHVEAP---ETSPIDSSDS-----APK 393
               TQSS     N +  R           + +   + +P   E+SP  S  S     +P 
Sbjct: 793  LELTQSSSSTNLNDDNFRIPSDESSSVNEMSDNEDLNSPTTNESSPFLSPASPSLPLSPA 852

Query: 394  TVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRS--------------------TA 433
            +++   S +A +P S P +  PEP  +L   L KGKR                     + 
Sbjct: 853  SLSLPLSPAAPSP-SLPKIAEPEPEPEL---LGKGKRKKTQPVRLADYATTLLHQPHPSV 908

Query: 434  NPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTW 493
             P+P+ N++S  + S +Y A+V A+S    PK+  EA+  + W+ A+ DE+V+LE+  TW
Sbjct: 909  TPYPLDNYVSSSQFSAAYQAYVFAISLGIEPKSYKEAILDENWRCAVSDEIVSLENLGTW 968

Query: 494  ELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTS 553
             +    PGK  +GC+ VF +K   DG ++R KARLV +G  Q  G DY++TF+PVAKM +
Sbjct: 969  TVEDLPPGKKALGCKWVFRLKYKSDGTLERHKARLVVLGNKQTEGIDYSETFAPVAKMVT 1028

Query: 554  VRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGF 594
            VR F+   A     + Q+D+ NAFLHGDL+EE+Y++ P GF
Sbjct: 1029 VRAFLQQVASLDWEVHQMDVHNAFLHGDLDEEVYIKFPPGF 1069


>UniRef100_Q94KV0 Polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score =  355 bits (911), Expect = 4e-96
 Identities = 228/623 (36%), Positives = 321/623 (50%), Gaps = 52/623 (8%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQ------SSNICGVSASPDMIHRRLGHPSFDKLKVLVP 54
            D NT K +     S GLY L +Q      S+  C  +AS ++ H RLGH +   L+ L  
Sbjct: 419  DINTQKVVSKGPRSNGLYVLENQEFVAFYSNRQC--AASEEIWHHRLGHSNSRILQQLKS 476

Query: 55   --QLSHLKSLD---CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*R 109
              ++S  KS     CE CQ+GK  +  F SS N R       +H D+WGPS V+S  G +
Sbjct: 477  SKEISFNKSRMSPVCEPCQMGKSSKLQFFSS-NSRELDLLGRIHCDLWGPSPVVSKQGFK 535

Query: 110  YYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFN 169
            YYV F+D +SR +W   LK +S  F  F+ F + ++NQF   I++ +SD   E+      
Sbjct: 536  YYVVFVDDYSRYSWFYPLKAKSDFFAVFVAFQNLVENQFNTKIKVFQSDGGGEFTSNLMK 595

Query: 170  SFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLI 229
              +   GI H+ SCP+TPQQNG+AERKH H V+   +++ ++H P +FW +A  TA +L 
Sbjct: 596  KHLTDCGIQHRISCPYTPQQNGIAERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASFLS 655

Query: 230  NRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYS 289
            N +PS  L N  P   L  + P Y   LRV+G+ C+      G  K   R+++CVFLGY+
Sbjct: 656  NMLPSPSLGNVSPLEALLKQKPNY-AMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYN 714

Query: 290  RTQKGYRCYSPSTRRFYISADVTFFEDT-PF-----FASPTTTSSTTDV-------TDSQ 336
               KGYRC  P T R YIS  V F E+T PF     F  P   SS            D  
Sbjct: 715  SQYKGYRCLYPPTGRVYISRHVIFDEETFPFKQKYQFLVPQYESSLLSAWQSSIPQADQS 774

Query: 337  VIPTP---LFHPIFEPP-VSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAP 392
            +IP         + +PP +   +     + P     G + E    ++ E +  +S +   
Sbjct: 775  LIPQAEEGKIESLAKPPSIQKNTIQDTTTQPAILTEGVLNEEEEEDSFEETETESLNEET 834

Query: 393  KTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYF 452
             T     +D A   +     V  EP N  P+      RS A  H           S + +
Sbjct: 835  HT----QNDEAEVTVEEE--VQQEPENTHPMT----TRSKAGIHK----------SNTRY 874

Query: 453  AFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFN 512
            A +++  SV  PK++ EAL+H  W  A+ DEM  +   HTW LV P+   +I+GCR VF 
Sbjct: 875  ALLTSKFSVEEPKSIDEALNHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNILGCRWVFK 934

Query: 513  VKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLD 572
             K+  DG VD+LKARLVA G+ Q  G DY +TFSPV +  ++RL + +   K   + QLD
Sbjct: 935  TKLKPDGSVDKLKARLVAKGFHQEEGLDYLETFSPVVRTATIRLVLDVATAKGWNIKQLD 994

Query: 573  IKNAFLHGDLEEEIYMEQPSGFV 595
            + NAFLHG+L+E +YM QP GFV
Sbjct: 995  VSNAFLHGELKEPVYMLQPPGFV 1017


>UniRef100_Q9C692 Polyprotein, putative [Arabidopsis thaliana]
          Length = 1468

 Score =  352 bits (904), Expect = 2e-95
 Identities = 203/590 (34%), Positives = 322/590 (54%), Gaps = 34/590 (5%)

Query: 30   VSASPDMIHRRLGHPSFDKLKVLVPQ--LSHLKSL---DCESCQLGKHVRASFPSSPNKR 84
            V A  D+ HRRLGH S DK+  L+P+  LS  K +    C++C   K  R +FP S N R
Sbjct: 509  VKAPFDLWHRRLGHAS-DKIVNLLPRELLSSGKEILENVCDTCMRAKQTRDTFPLSDN-R 566

Query: 85   SKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEI 144
            S   F ++H DVWGP R  S  G RY++T +D +SR  W+ L+ D+S+       F + +
Sbjct: 567  SMDSFQLIHCDVWGPYRAPSYSGARYFLTIVDDYSRGVWVYLMTDKSETQKHLKDFIALV 626

Query: 145  KNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTT 204
            + QF   I+I+RSDN  E  F     +    GI H++SC  TP QNG  ERKH H+++  
Sbjct: 627  ERQFDTEIKIVRSDNGTE--FLCMREYFLHKGIAHETSCVGTPHQNGRVERKHRHILNIA 684

Query: 205  RTLLINAHAPFKFWGDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTC 264
            R L   ++ P +FWG+ IL+A YLINR PS +L  + P  +L+   P Y   LRV+GS C
Sbjct: 685  RALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEMLYKTAPKYS-HLRVFGSLC 743

Query: 265  FVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPT 324
            + H+     DK +AR+ +CVF+GY   QKG+R +    ++F++S DV  F++T F  S  
Sbjct: 744  YAHNQNHKGDKFAARSRRCVFVGYPHGQKGWRLFDLEEQKFFVSRDV-IFQETEFPYSKM 802

Query: 325  TTSSTTDVTDSQVIPTPLFHPIFEP--------------------PVSTQSSPQLQSNPE 364
            + +   +      +  P       P                    P+  + + +  S  E
Sbjct: 803  SCNEEDERVLVDCVGPPFIEEAIGPRTIIGRNIGEATVGPNVATGPIIPEINQESSSPSE 862

Query: 365  FRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSDSATAPISSPVVVPPEPSNDLPIA 424
            F    ++         +T+ +  S + P  +    S   T     P+ +    +N + + 
Sbjct: 863  FVSLSSLDPFLASSTVQTADLPLSSTTPAPIQLRRSSRQT---QKPMKLKNFVTNTVSVE 919

Query: 425  LHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEM 484
                + S+++ +P+  ++  HR + S+ AF++A+++   P T +EA+  + W++AM  E+
Sbjct: 920  SISPEASSSSLYPIEKYVDCHRFTSSHKAFLAAVTAGMEPTTYNEAMVDKAWREAMSAEI 979

Query: 485  VALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDT 544
             +L  N T+ +V+  PGK  +G + V+ +K   DG ++R KARLV +G  Q  G DY++T
Sbjct: 980  ESLRVNQTFSIVNLPPGKRALGNKWVYKIKYRSDGAIERYKARLVVLGNCQKEGVDYDET 1039

Query: 545  FSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGF 594
            F+PVAKM++VRLF+ + A +   + Q+D+ NAFLHGDL+EE+YM+ P GF
Sbjct: 1040 FAPVAKMSTVRLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYMKLPQGF 1089


>UniRef100_O81617 F8M12.17 protein [Arabidopsis thaliana]
          Length = 1633

 Score =  349 bits (895), Expect = 3e-94
 Identities = 212/623 (34%), Positives = 320/623 (51%), Gaps = 39/623 (6%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPDMIHRRLGHPSFDKLKVLVPQLSHLK 60
            +   G  IG       LY L +Q +     S SP +      HPS   L+ LV  +  LK
Sbjct: 469  ELTRGLMIGRGKTYNNLYILETQRT-----SFSPSLPAATSRHPSLPALQKLVSSIPSLK 523

Query: 61   SLD-----CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFI 115
            S+      C    L K  R ++ S  N  S SPFD++H D+WGP  + S  G RY++T +
Sbjct: 524  SVSSTASHCRISPLAKQKRLAYVSHNNLAS-SPFDLIHLDIWGPFSIESVDGFRYFLTLV 582

Query: 116  DGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASL 175
            D  +R TW+ ++K++S++   F  F   I  Q+   I+ +RSDN KE     F  F+   
Sbjct: 583  DDCTRTTWVYMMKNKSEVSNIFPVFVKLIFTQYNAKIKAIRSDNVKEL---AFTKFVKEQ 639

Query: 176  GIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSS 235
            G+IHQ SC +TPQQN V ERKH HL++  R+LL  ++ P ++W D +LTA YLINR+PS 
Sbjct: 640  GMIHQFSCAYTPQQNSVVERKHQHLLNIARSLLFQSNVPLQYWSDCVLTAAYLINRLPSP 699

Query: 236  VLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGY 295
            +LDN+ P  LL  K P Y +   +    C+       R+K S RA  CVFLGY    KGY
Sbjct: 700  LLDNKTPFELLLKKIPDYTL---LKSCLCYASTNVHDRNKFSPRARPCVFLGYPSGYKGY 756

Query: 296  RCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTPL-FHPIFEPPVSTQ 354
            +     +    I+ +V F E    F +      + D+  + ++P P   H +   P+   
Sbjct: 757  KVLDLESHSISITRNVVFHETKFPFKTSKFLKESVDMFPNSILPLPAPLHFVESMPLDDD 816

Query: 355  SSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDS------SDSATAPI- 407
                L+++       N         P  S +++ ++    + T+S        +A AP  
Sbjct: 817  ----LRADDNNASTSNSASSASSIPPLPSTVNTQNTDALDIDTNSVPIARPKRNAKAPAY 872

Query: 408  -------SSPVVVPPEPSNDLPIALHKGK---RSTANPHPVYNFLSYHRLSPSYFAFVSA 457
                   S P +    P+    I         +    P+P+   +SY +L+P + +++ A
Sbjct: 873  LSEYHCNSVPFLSSLSPTTSTSIETPSSSIPPKKITTPYPMSTAISYDKLTPLFHSYICA 932

Query: 458  LSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGL 517
             +  + PK   +A+  ++W +A  +E+ ALE N TW + S + GK++VGC+ VF +K   
Sbjct: 933  YNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWIVESLTEGKNVVGCKWVFTIKYNP 992

Query: 518  DGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAF 577
            DG ++R KARLVA G+TQ  G DY +TFSPVAK  SV+L + + A     L Q+D+ NAF
Sbjct: 993  DGSIERYKARLVAQGFTQQEGIDYMETFSPVAKFGSVKLLLGLAAATGWSLTQMDVSNAF 1052

Query: 578  LHGDLEEEIYMEQPSGFVAWGGV 600
            LHG+L+EEIYM  P G+    G+
Sbjct: 1053 LHGELDEEIYMSLPQGYTPPTGI 1075


>UniRef100_Q9FXB7 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score =  344 bits (883), Expect = 6e-93
 Identities = 220/644 (34%), Positives = 326/644 (50%), Gaps = 54/644 (8%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQSSNICGVSA---SPDMIHRRLGHPSFDKLKVLV---P 54
            D  T   IG      GLY+     +     S    S  + H+RLGHPS   L +L     
Sbjct: 468  DRTTLMLIGAGRELNGLYFFRGVETAAAVTSKALPSSQLWHQRLGHPSSKALHLLPFSDV 527

Query: 55   QLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTF 114
              S   S  CE C   K  R  FP S NK S + F++VH D+WGP R  S  G RY++T 
Sbjct: 528  TSSTFDSKTCEICIQAKQTRDPFPLSSNKTSFA-FELVHCDLWGPYRTTSICGSRYFLTL 586

Query: 115  IDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMAS 174
            +D +SR  W+ LL  + +       F + ++ Q+   I+++RSDN  E  F   + F A 
Sbjct: 587  VDDYSRAVWLYLLPSKQEAPKHLKNFIALVERQYTTNIKMIRSDNGSE--FICLSDFFAQ 644

Query: 175  LGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPS 234
             GIIH++SC  TPQQNG  ERKH H+++  R L   +  P +FW    LTA YLINR P+
Sbjct: 645  KGIIHETSCVGTPQQNGRVERKHRHILNVARALRFQSGLPIEFWSYCALTAAYLINRTPT 704

Query: 235  SVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKG 294
             +L  + P  L++ + P  +  +R++G  C+VH+L  G DK ++R+ K +FLGY   +KG
Sbjct: 705  PLLKGKTPFELIYNRPPPLQ-HIRIFGCICYVHNLKHGGDKFASRSNKSIFLGYPFAKKG 763

Query: 295  YRCYSPSTRRFYISADVTFFE-DTPFFASPTTTSSTTD---VTDSQVIPTPLFHPIFEPP 350
            +R Y+  T    +S DV F E +  F  S   +S + D   V  S++    +  P+    
Sbjct: 764  WRVYNIETGVVSVSRDVVFRETEFHFPISVMDSSPSLDPVLVDSSELEEISMTPPVTPSS 823

Query: 351  VSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAP-----KTVTTDSSDSATA 405
             +T SSP   S+P               +   +P+ S+ ++      + +TTD  DS + 
Sbjct: 824  PATPSSPVTPSSPVTPSSPVSPSSPVTPSSPVTPVSSTTTSAAIDTIEDITTDLEDSTSM 883

Query: 406  PI------------------SSPVVVPPEPSNDLPIALHKGKRS---------------- 431
                                SS  V PP    +L    H+ KR                 
Sbjct: 884  DFFPDDEDEFSPTATESPASSSSPVHPPAVQLELLGKGHRPKRPPVKLADYVTTLLHQPF 943

Query: 432  -TANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESN 490
             +A P+P+ N++S  R S +Y A++ A++S + P+  +EA+    WK A+  E+ +LE+ 
Sbjct: 944  PSATPYPLDNYISSSRFSDNYQAYILAITSGNEPRNYNEAMLDDHWKGAVSHEIGSLENL 1003

Query: 491  HTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAK 550
             TW +    PGK  +GC+ VF +K   DG ++R KARLV +G  Q  G DY +TF+PVAK
Sbjct: 1004 GTWTVEDLPPGKKALGCKWVFRLKYKSDGTLERHKARLVVLGNNQTEGLDYTETFAPVAK 1063

Query: 551  MTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGF 594
            M +VR F+         + Q+D+ NAFLHGDL+EE+YM+ P GF
Sbjct: 1064 MVTVRAFLQQVVSLDWEVHQMDVHNAFLHGDLDEEVYMQFPPGF 1107


>UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana]
          Length = 1415

 Score =  342 bits (877), Expect = 3e-92
 Identities = 214/614 (34%), Positives = 322/614 (51%), Gaps = 57/614 (9%)

Query: 1   DFNTGKTIGT*SISQGLYYLHSQ------SSNICGVSASPDMIHRRLGHPSFDKLKVL-- 52
           D  T K + T     GLY L +Q      S+  C  +A+ ++ H RLGH +   L+ L  
Sbjct: 416 DLQTQKVVTTGPRRNGLYVLENQEFVALYSNRQC--AATEEVWHHRLGHANSKALQHLQN 473

Query: 53  --VPQLSHLKSLD-CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*R 109
               Q++  ++   CE CQ+GK  R  F  S + R   P D +H D+WGPS V+S  G +
Sbjct: 474 SKAIQINKSRTSPVCEPCQMGKSSRLPFLIS-DSRVLHPLDRIHCDLWGPSPVVSNQGLK 532

Query: 110 YYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFN 169
           YY  F+D +SR +W   L ++S+    F++F   ++NQ    I++ +SD   E+      
Sbjct: 533 YYAIFVDDYSRYSWFYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQSDGGGEFVSNKLK 592

Query: 170 SFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLI 229
           + ++  GI H+ SCP+TPQQNG+AERKH HLV+   ++L ++H P KFW ++  TA Y+I
Sbjct: 593 THLSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYII 652

Query: 230 NRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYS 289
           NR+PSSVL N  P   LF + P Y   LRV+GS C+       ++K   R+++CVFLGY+
Sbjct: 653 NRLPSSVLKNLSPYEALFGEKPDYS-SLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYN 711

Query: 290 RTQKGYRCYSPSTRRFYISADVTFFE-DTPFFASPTTTSSTTDVTDSQVIP---TPLFHP 345
              KGYRC+ P T + YIS +V F E + PF                 ++P   TPL   
Sbjct: 712 SQYKGYRCFYPPTGKVYISRNVIFNESELPF-----------KEKYQSLVPQYSTPLLQA 760

Query: 346 IFEPPVSTQSSP----QLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSD 401
                +S  S P    QL S P      N Y    V    T P  +S++       + SD
Sbjct: 761 WQHNKISEISVPAAPVQLFSKPIDL---NTYAGSQVTEQLTDPEPTSNN-------EGSD 810

Query: 402 SATAPISSPVVVPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSV 461
               P++  +    E   +      + K     P+  Y             A +++  + 
Sbjct: 811 EEVNPVAEEIAANQEQVINSHAMTTRSKAGIQKPNTRY-------------ALITSRMNT 857

Query: 462 SIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQV 521
           + PKT+  A+ H  W +A+ +E+  +   HTW LV P+   +I+  + VF  K+  DG +
Sbjct: 858 AEPKTLASAMKHPGWNEAVHEEINRVHMLHTWSLVPPTDDMNILSSKWVFKTKLHPDGSI 917

Query: 522 DRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGD 581
           D+LKARLVA G+ Q  G DY +TFSPV +  ++RL + ++  K  P+ QLD+ NAFLHG+
Sbjct: 918 DKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDVSTSKGWPIKQLDVSNAFLHGE 977

Query: 582 LEEEIYMEQPSGFV 595
           L+E ++M QPSGF+
Sbjct: 978 LQEPVFMYQPSGFI 991


>UniRef100_Q9FLA4 Polyprotein [Arabidopsis thaliana]
          Length = 1429

 Score =  342 bits (877), Expect = 3e-92
 Identities = 234/663 (35%), Positives = 325/663 (48%), Gaps = 86/663 (12%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPD------MIHRRLGHPSFDKLKVLVP 54
            D NTG  +        LY       +I  ++ASP         H+RLGHP+   LK +V 
Sbjct: 408  DLNTGARLLQGRTRNELYEWPVNQKSITILTASPSPKTDLSSWHQRLGHPALPILKDVVS 467

Query: 55   QLSHL-------KSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG 107
               HL       K L C  C + K  +  F ++    S+ P + +++DVW  S  +S   
Sbjct: 468  HF-HLPLSNTIPKQLPCSDCSINKSHKLPFFTNTIVSSQ-PLEYLYTDVW-TSPCISVDN 524

Query: 108  *RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAP 167
             +YY+  +D F+R TW+  LK +SQ+   F+ F + ++N+F   IR L SDN  E  F  
Sbjct: 525  YKYYLVIVDHFTRYTWMYPLKQKSQVKDVFVAFKALVENRFQSRIRTLYSDNGGE--FIG 582

Query: 168  FNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACY 227
               F+A+ GI H +S PHTP+ NG+AERKH H+V+T   LL +A  P  FW  A  TA Y
Sbjct: 583  LRPFLAAHGISHLTSPPHTPEHNGLAERKHRHIVETGLALLTHASLPKTFWTYAFATAVY 642

Query: 228  LINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLG 287
            LINRMP+ VL    P   LF   P Y ++LRV+G  C+        +KL AR+  CVFLG
Sbjct: 643  LINRMPTEVLQGTSPYVKLFQMSPNY-LKLRVFGCLCYPWLRPYNTNKLEARSTMCVFLG 701

Query: 288  YSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPT---PLFH 344
            YS TQ  Y C   +T R Y S  V F E +  FASP T+ + +  T SQ   T   PL  
Sbjct: 702  YSLTQSAYLCLDIATNRIYTSRHVQFVESSFPFASPRTSETDSTQTMSQPTTTNVIPLLQ 761

Query: 345  --------------PIFEPPVSTQSSPQ-----------LQSNPEFRRYGNIYERRHV-- 377
                          PIF  P  + SSP              S+       NI     V  
Sbjct: 762  RPPHIAPPTALPLCPIFHSPPHSPSSPASPPSEHVPLTAASSSSNAINDDNISSTGQVSV 821

Query: 378  -----EAPETSPIDSSDS----APKTVTTDSSDSATAPISSPVVV--------------- 413
                 ++P T+P + + S    +P    T+ S ++T P S    V               
Sbjct: 822  SGPTSQSPHTTPTNQNTSPLSKSPNPTNTNQSQNSTPPTSPTTSVHQHSPTPSPLPQNPP 881

Query: 414  -PPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALS 472
             PP P ND P+   + K     P   +N      L+ S  +     S  +IP TV +AL 
Sbjct: 882  LPPPPQNDHPMRT-RAKNQITKPKTKFN------LTTSLTS-----SKPTIPTTVAQALK 929

Query: 473  HQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIG 532
               W+ AM +E+ A   NHTW+LVSP   K ++ C+ +F +K  +DG + R KARLVA G
Sbjct: 930  DPNWRNAMSEEINAQMKNHTWDLVSPEEAKHVISCKWIFTLKYNVDGSIARYKARLVARG 989

Query: 533  YTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPS 592
            + Q YG DY++TFSPV K T++R  + +   +   + Q+DI NAFL G L EE+Y+ QP 
Sbjct: 990  FNQQYGIDYSETFSPVIKSTTIRTVLEVAVKRNWSIHQVDINNAFLQGTLNEEVYVSQPP 1049

Query: 593  GFV 595
            GF+
Sbjct: 1050 GFI 1052


>UniRef100_Q9SSB1 T18A20.5 protein [Arabidopsis thaliana]
          Length = 1522

 Score =  338 bits (866), Expect = 6e-91
 Identities = 223/624 (35%), Positives = 307/624 (48%), Gaps = 65/624 (10%)

Query: 17   LYYLHSQSSNICGVSASPDMIHRRLGHPSFDKLKVLVPQ-----LSHLKSLDCESCQLGK 71
            L  L+S   N    SAS ++ HRRLGH + + L  L        ++ +    CE+C LGK
Sbjct: 444  LQVLYSTRQN----SASSEVWHRRLGHANAEVLHQLASSKSIIIINKVVKTVCEACHLGK 499

Query: 72   HVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRS 131
              R  F  S    S+ P + +H D+WGPS   S  G RYYV FID +SR TW   LK +S
Sbjct: 500  STRLPFMLSTFNASR-PLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLKLKS 558

Query: 132  QLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNG 191
              F  F+ F   ++NQ G  I+I + D   E+  + F   +   GI    SCP+TPQQNG
Sbjct: 559  DFFSTFVMFQKLVENQLGHKIKIFQCDGGGEFISSQFLKHLQDHGIQQNMSCPYTPQQNG 618

Query: 192  VAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLD-NEIPQSLLFPKD 250
            +AERKH H+V+   +++  +  P K+W ++  TA ++IN +P+S LD NE P   L+ K 
Sbjct: 619  MAERKHRHIVELGLSMIFQSKLPLKYWLESFFTANFVINLLPTSSLDNNESPYQKLYGKA 678

Query: 251  PLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCYSPSTRRFYISAD 310
            P Y   LRV+G  C+         K   R++KCVFLGY+   KGYRC  P T R YIS  
Sbjct: 679  PEYSA-LRVFGCACYPTLRDYASTKFDPRSLKCVFLGYNEKYKGYRCLYPPTGRIYISRH 737

Query: 311  VTFFEDTPFFASPTTTSSTTDVTDSQVIPTPLFHPIFEPPVSTQSSPQLQSNPEFRRYGN 370
            V F E+T  F S  +     D T         FH +  P    QS   + S P+      
Sbjct: 738  VVFDENTHPFESIYSHLHPQDKTPLLEAWFKSFHHV-TPTQPDQSRYPVSSIPQPETTDL 796

Query: 371  IYERRHVEAPETSPIDSSD-------------SAPKTVTTDS------------------ 399
                  V A    P  S D             S  +T   DS                  
Sbjct: 797  SAAPASVAAETAGPNASDDTSQDNETISVVSGSPERTTGLDSASIGDSYHSPTADSSHPS 856

Query: 400  ---SDSATAPISSPVVVPPEPSNDLPIA-----LHKGKRSTANPHPVYNFLSYHRLSPSY 451
               S  A++P  SP+ + P      P+      + +GK   + P+  Y  L+ H++    
Sbjct: 857  PARSSPASSPQGSPIQMAPAQQVQAPVTNEHAMVTRGKEGISKPNKRYVLLT-HKV---- 911

Query: 452  FAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VF 511
                    S+  PKTV EAL H  W  AM +EM   +   TW LV  SP  +++G   VF
Sbjct: 912  --------SIPEPKTVTEALKHPGWNNAMQEEMGNCKETETWTLVPYSPNMNVLGSMWVF 963

Query: 512  NVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQL 571
              K+  DG +D+LKARLVA G+ Q  G DY +T+SPV +  +VRL + +  + +  L Q+
Sbjct: 964  RTKLHADGSLDKLKARLVAKGFKQEEGIDYLETYSPVVRTPTVRLILHVATVLKWELKQM 1023

Query: 572  DIKNAFLHGDLEEEIYMEQPSGFV 595
            D+KNAFLHGDL E +YM QP+GFV
Sbjct: 1024 DVKNAFLHGDLTETVYMRQPAGFV 1047


>UniRef100_O23302 Retrovirus-related like polyprotein [Arabidopsis thaliana]
          Length = 1489

 Score =  332 bits (851), Expect = 3e-89
 Identities = 197/592 (33%), Positives = 307/592 (51%), Gaps = 48/592 (8%)

Query: 38   HRRLGHPSFDKLKVLVPQLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVW 97
            H+RLGHPS     V++ +L  L  +                 S N  + +PFD+VH D+W
Sbjct: 603  HQRLGHPS----SVVLQKLKRLAYI-----------------SHNNLASNPFDLVHLDIW 641

Query: 98   GPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRS 157
            GP  + S  G RY++T +D  +R TW+ +L+++  +   F  F   +  QF   I+ +RS
Sbjct: 642  GPFSIESIEGFRYFLTVVDDCTRTTWVYMLRNKKDVSSVFPEFIKLVSTQFNAKIKAIRS 701

Query: 158  DNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKF 217
            DNA E     F   +   G++H  SC +TPQQN V ERKH H+++  R LL  ++ P ++
Sbjct: 702  DNAPEL---GFTEIVKEHGMLHHFSCAYTPQQNSVVERKHQHILNVARALLFQSNIPMQY 758

Query: 218  WGDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLS 277
            W D + TA +LINR+PS +L+N+ P  L+  K P Y + L+ +G  CFV      R K +
Sbjct: 759  WSDCVTTAVFLINRLPSPLLNNKSPYELILNKQPDYSL-LKNFGCLCFVSTNAHERTKFT 817

Query: 278  ARAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQV 337
             RA  CVFLGY    KGY+     +    +S +V F E    F +    +   D+  + +
Sbjct: 818  PRARACVFLGYPSGYKGYKVLDLESHSVTVSRNVVFKEHVFPFKTSELLNKAVDMFPNSI 877

Query: 338  IPTPL-FHPIFEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAP---------ETSPIDS 387
            +P P   H +   P+  + S  + +  + R   N         P         ET  IDS
Sbjct: 878  LPLPAPLHFVETMPLIDEDS-LIPTTTDSRTADNHASSSSSALPSIIPPSSNTETQDIDS 936

Query: 388  S----DSAPKTVTTDS--SDSATAPISSPVVVPPE----PSNDLPIALHKGKRSTANPHP 437
            +      + +T    S  S+   + + S   +PP     P + LP            P+P
Sbjct: 937  NAVPITRSKRTTRAPSYLSEYHCSLVPSISTLPPTDSSIPIHPLPEIFTASSPKKTTPYP 996

Query: 438  VYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVS 497
            +   +SY + +P   +++ A ++ + PKT  +A+  ++W +  ++E+ A+E N TW + S
Sbjct: 997  ISTVVSYDKYTPLCQSYIFAYNTETEPKTFSQAMKSEKWIRVAVEELQAMELNKTWSVES 1056

Query: 498  PSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLF 557
              P K++VGC+ VF +K   DG V+R KARLVA G+TQ  G D+ DTFSPVAK+TS ++ 
Sbjct: 1057 LPPDKNVVGCKWVFTIKYNPDGTVERYKARLVAQGFTQQEGIDFLDTFSPVAKLTSAKMM 1116

Query: 558  IAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGFVAWGGVVWYAN--CR 607
            + + A+    L Q+D+ +AFLHGDL+EEI+M  P G+    G +   N  CR
Sbjct: 1117 LGLAAITGWTLTQMDVSDAFLHGDLDEEIFMSLPQGYTPPAGTILPPNPVCR 1168


>UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score =  331 bits (848), Expect = 7e-89
 Identities = 221/622 (35%), Positives = 312/622 (49%), Gaps = 57/622 (9%)

Query: 1    DFNTGKTIGT*SISQGLYYLHSQ------SSNICGVSASPDMIHRRLGHPSFDKLKVLVP 54
            D  T K +     + GLY L +       S+  C  +AS +  H RLGH +   L+ L+ 
Sbjct: 418  DLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQC--AASMETWHHRLGHSNSKILQQLLT 475

Query: 55   QLS-----HLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*R 109
            +          S  CE CQ+GK  R  F SS + R+  P D VH D+WGPS V+S  G +
Sbjct: 476  RKEIQVNKSRTSPVCEPCQMGKSTRLQFFSS-DFRALKPLDRVHCDLWGPSPVVSNQGFK 534

Query: 110  YYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFN 169
            YY  F+D FSR +W   L+ +S+    F+ +   ++NQ G  I+  +SD   E+      
Sbjct: 535  YYAVFVDDFSRFSWFFPLRMKSKFISVFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLK 594

Query: 170  SFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLI 229
                  GI H+ SCP+TPQQNGVAERKH HLV+   ++L ++H P KFW +A  TA YL 
Sbjct: 595  EHFREHGIHHRISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLS 654

Query: 230  NRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYS 289
            N +PSSVL    P   LF +   Y   LRV+G+ C+       ++K   R+++CVFLGY 
Sbjct: 655  NLLPSSVLKEISPYETLFQQKVDY-TPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYH 713

Query: 290  RTQKGYRCYSPSTRRFYISADVTFFE-DTPF---FASPTTTSST--------TDVTDSQV 337
               KGYRC  P T + YIS  V F E   PF   + S      T        TD+T   V
Sbjct: 714  NQYKGYRCLYPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPSV 773

Query: 338  IPTPLFHPI---FEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKT 394
             P+    P+     P  ++++ P +    E             EA   +   SSD   +T
Sbjct: 774  -PSSQLQPLARQMTPMATSENQPMMNYETE-------------EAVNVNMETSSDE--ET 817

Query: 395  VTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPS-YFA 453
             + D  D   AP+           ND       G+ S  N HP+          P+  +A
Sbjct: 818  ESNDEFDHEVAPV----------LNDQNEDNALGQGSLENLHPMITRSKDGIQKPNPRYA 867

Query: 454  FVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNV 513
             + + SS   PKT+  A+ H  W  A++DE+  +   +TW LV  +   +I+  + VF  
Sbjct: 868  LIVSKSSFDEPKTITTAMKHPSWNAAVMDEIDRIHMLNTWSLVPATEDMNILTSKWVFKT 927

Query: 514  KVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDI 573
            K+  DG +D+LKARLVA G+ Q  G DY +TFSPV +  ++RL +        PL QLD+
Sbjct: 928  KLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDTATANEWPLKQLDV 987

Query: 574  KNAFLHGDLEEEIYMEQPSGFV 595
             NAFLHG+L+E ++M QPSGFV
Sbjct: 988  SNAFLHGELQEPVFMFQPSGFV 1009


  Database: uniref100
    Posted date:  Jan 5, 2005  1:24 AM
  Number of letters in database: 848,049,833
  Number of sequences in database:  2,790,947
  
Lambda     K      H
   0.339    0.146    0.489 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,512,022,912
Number of Sequences: 2790947
Number of extensions: 61071305
Number of successful extensions: 232333
Number of sequences better than 10.0: 1696
Number of HSP's better than 10.0 without gapping: 1230
Number of HSP's successfully gapped in prelim test: 477
Number of HSP's that attempted gapping in prelim test: 227166
Number of HSP's gapped (non-prelim): 3277
length of query: 965
length of database: 848,049,833
effective HSP length: 137
effective length of query: 828
effective length of database: 465,690,094
effective search space: 385591397832
effective search space used: 385591397832
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 80 (35.4 bits)

