Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC146527.6 - phase: 0 /pseudo
         (1019 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]           792  0.0
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ...   699  0.0
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]               670  0.0
gb|AAT40550.1| putative receptor kinase [Solanum demissum]            620  e-176
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi...   597  e-169
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   589  e-166
gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsi...   586  e-165
pir||G86301 probable retroelement polyprotein [imported] - Arabi...   584  e-165
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop...   580  e-163
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja...   578  e-163
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha...   570  e-160
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas...   567  e-160
emb|CAB10526.1| retrotransposon like protein [Arabidopsis thalia...   566  e-160
gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum]        558  e-157
gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cult...   547  e-154
gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25...   531  e-149
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] gi...   531  e-149
emb|CAB75932.1| putative protein [Arabidopsis thaliana] gi|11278...   530  e-148
gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis tha...   530  e-148
gb|AAF16534.1| T26F17.17 [Arabidopsis thaliana]                       524  e-147

>gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]
          Length = 1212

 Score =  792 bits (2045), Expect = 0.0
 Identities = 398/635 (62%), Positives = 485/635 (75%), Gaps = 18/635 (2%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GEYMS+ F++FL   GI+SQ SCP TP QNGVAERKNRHLLDV RTLL+ES VPS++W E
Sbjct: 577  GEYMSYEFKKFLLDKGIVSQHSCPYTPQQNGVAERKNRHLLDVTRTLLIESSVPSKYWVE 636

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            ALSTAV+LINR+PS  +  ESP+ RLY   PNYS    FGCVC+VHLPP +  K + QS 
Sbjct: 637  ALSTAVYLINRLPSKVLNLESPYFRLYHQNPNYSDFHTFGCVCFVHLPPSQCNKLSVQST 696

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSH 457
            +CAF+GYS  QKGF+CYDP   + R+SRNV+F EN+YFF +  DL SS   +LP F D  
Sbjct: 697  KCAFMGYSTSQKGFICYDPCSHKFRISRNVVFFENQYFFPTIVDL-SSVSPLLPTFEDLS 755

Query: 458  SRQQPSKPLLTYKRRSTA-----THGPPQ-------DNSLVAGPVEEPAPLRRSSRESKP 505
            S  +  KP   Y+RR        T  PP+       +NS  +GP+E   P RRS+R S+ 
Sbjct: 756  SSFKRFKPGFVYERRRPTLPYPNTDPPPETAPQLESENSSRSGPLE---PTRRSTRVSRT 812

Query: 506  PERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLG 565
            P  Y    ++TLS+I +PS Y QA +++CWQKA+E ELLAL+EN TWDIV CPS+V+P+G
Sbjct: 813  PNWY--GFSSTLSNISVPSCYSQASKHECWQKAMEEELLALKENDTWDIVSCPSNVRPIG 870

Query: 566  SKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAW 625
             K+V+SIKL SDG++DRYKA LVVLGN+QEYG+DY+ETFAPVAKMTTVRTI+AIAASQ W
Sbjct: 871  CKWVYSIKLHSDGTLDRYKARLVVLGNRQEYGVDYEETFAPVAKMTTVRTIIAIAASQNW 930

Query: 626  PLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRST 685
             L+Q DVKNAFLHGDL+E++Y+K P  + +   + VCKLKRSLYGLKQAPR WF+KFRST
Sbjct: 931  SLYQKDVKNAFLHGDLKEDIYMKPPPDLFSSPTSDVCKLKRSLYGLKQAPRAWFDKFRST 990

Query: 686  LLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKEL 745
            LL F F  S+YD SLFL++T    V+LLVYVDDI++TG+D   I+ ++  L  +FHMK+L
Sbjct: 991  LLQFSFELSKYDSSLFLRKTSTSCVLLLVYVDDIIITGTDSSLITCLQQQLKDSFHMKDL 1050

Query: 746  GRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLD 805
            G LTYFLGLEVH    GVFLNQ KY QDL+ LAGL  ++ VDTP+E+NVKYRR+EGD L 
Sbjct: 1051 GTLTYFLGLEVHNVASGVFLNQHKYTQDLISLAGLQVSSSVDTPLEMNVKYRREEGDLLP 1110

Query: 806  DPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLF 865
            DPT +R+LVGSL Y+TITRPDISFAV  VS+FMQAPRH HL AV  IIRYLLGT  RGLF
Sbjct: 1111 DPTIFRQLVGSLNYLTITRPDISFAVQQVSQFMQAPRHLHLVAVCHIIRYLLGTSTRGLF 1170

Query: 866  FPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLG 900
            FP GS I+L A+SD+DWAGCPDTR+S +GWCMFLG
Sbjct: 1171 FPSGSPIRLNAFSDSDWAGCPDTRRSVSGWCMFLG 1205


>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|37534632|ref|NP_921618.1| putative pol polyprotein
            [Oryza sativa (japonica cultivar-group)]
          Length = 1688

 Score =  699 bits (1804), Expect = 0.0
 Identities = 373/800 (46%), Positives = 503/800 (62%), Gaps = 62/800 (7%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GEYMS++F+EFL S G + Q SCP    QNGVAERK+RH+++  RTLL+ S VP+ FW E
Sbjct: 453  GEYMSNAFREFLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFWAE 512

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            A+STAV+LIN  PS S+   SP   L+G PP Y  LRVFGC CYV L P+ERTK TAQSV
Sbjct: 513  AISTAVYLINMQPSSSLQGRSPGEVLFGSPPRYDHLRVFGCTCYVLLAPRERTKLTAQSV 572

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFY--- 454
            EC FLGYS   KG+ CYDP+ RRIR+SR+V F ENK FF S  +  SSP + +   Y   
Sbjct: 573  ECVFLGYSLEHKGYRCYDPSARRIRISRDVTFDENKPFFYSSTNQPSSPENSISFLYLPP 632

Query: 455  -------------DSHSRQQPSKPLLTY------KRRSTATHGPPQDNSLVAGPVEEPA- 494
                          S S   PS P  TY          +    PP      + P   P+ 
Sbjct: 633  IPSPESLPSSPITPSPSPIPPSVPSPTYVPPPPPSPSPSPVSPPPSHIPASSSPPHVPST 692

Query: 495  ------PLRRSSR-----ESKPPERYINCMTATLS-SIPIP------------------- 523
                  P   S R     ES+P +  +   T ++  S P P                   
Sbjct: 693  ITLDTFPFHYSRRPKIPNESQPSQPTLEDPTCSVDDSSPAPRYNLRARDALRAPNRDDFV 752

Query: 524  -------SSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLRS 576
                   S+Y++A+    W+ A+  EL ALE   TWD+VP PS   P+  K+V+ +K +S
Sbjct: 753  VGVVFEPSTYQEAIVLPHWKLAMSEELAALERTNTWDVVPLPSHAVPITCKWVYKVKTKS 812

Query: 577  DGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNAF 636
            DG ++RYKA LV  G +Q +G DYDETFAPVA MTTVRT++A+AA+++W + QMDVKNAF
Sbjct: 813  DGQVERYKARLVARGFQQAHGRDYDETFAPVAHMTTVRTLIAVAATRSWTISQMDVKNAF 872

Query: 637  LHGDLQEEVYIKLPNGMPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQSRY 696
            LHGDL EEVY+  P G+  P P  V +L+R+LYGLKQAPR WF +F S +L   FS S +
Sbjct: 873  LHGDLHEEVYMHPPPGVEAP-PGHVFRLRRALYGLKQAPRAWFARFSSVVLAAGFSPSDH 931

Query: 697  DPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGLEV 756
            DP+LF+  + +G  +LL+YVDD+++TG D + I+ +K  L   F M +LG L+YFLG+EV
Sbjct: 932  DPALFIHTSSRGRTLLLLYVDDMLITGDDLEYIAFVKGKLSEQFMMSDLGPLSYFLGIEV 991

Query: 757  HYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLVGS 816
                +G +L+Q +YI+DL+  +GLT++    TPME++V+ R  +G  LDDP++YR LVGS
Sbjct: 992  TSTVDGYYLSQHRYIEDLLAQSGLTDSRTTTTPMELHVRLRSTDGTPLDDPSRYRHLVGS 1051

Query: 817  LIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKLQA 876
            L+Y+T+TRPDI++AVH +S+F+ AP   H   + +++RYL GT  + LF+   S ++L+A
Sbjct: 1052 LVYLTVTRPDIAYAVHILSQFVSAPISVHYGHLLRVLRYLRGTTTQCLFYAASSPLQLRA 1111

Query: 877  YSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEIIWL 936
            +SD+ WA  P  R+S TG+C+FLG + ++WK KKQ +VS+SSTEAE RA++   SEI+WL
Sbjct: 1112 FSDSTWASDPIDRRSVTGYCIFLGTSLLTWKSKKQTAVSRSSTEAELRALATTTSEIVWL 1171

Query: 937  RGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINLPH 996
            R LL + G S D PTPL  DNT AIQIA +P+ HE TKHI VD    R    +  I L +
Sbjct: 1172 RWLLADFGVSCDVPTPLLCDNTGAIQIANDPIKHELTKHIGVDASFTRSHCQQSTIALHY 1231

Query: 997  VSTSVQTADIFTKSLTRQRH 1016
            V + +Q AD FTK+ TR+ H
Sbjct: 1232 VPSELQVADFFTKAQTREHH 1251


>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
          Length = 1382

 Score =  670 bits (1728), Expect = 0.0
 Identities = 359/763 (47%), Positives = 494/763 (64%), Gaps = 22/763 (2%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GEY S+ F + L  +G I Q SC  TP QNGVAERK+RH+++  R+LLL + V S FW E
Sbjct: 609  GEYTSNKFCQMLALDGTIHQTSCTDTPEQNGVAERKHRHIVETARSLLLSAFVLSEFWGE 668

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            A+ TAV LIN +PS      SPF +LYGH P+YS+ RVFGC  +V  P  ER K +++S 
Sbjct: 669  AVLTAVSLINTIPSSHSSGLSPFEKLYGHVPDYSSFRVFGCTYFVLHPHVERNKLSSRSA 728

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFA---SHHDLVSSP-ISILPLF 453
             C FLGY   +KG+ C+DP  +++ VS +V+F E+  FF+   + H L  S  I I P  
Sbjct: 729  ICVFLGYGEGKKGYRCFDPITQKLYVSHHVVFLEHIPFFSIPSTTHSLTKSDLIHIDPFS 788

Query: 454  YDSHSRQQPS-KPLLTYKRRSTAT--HGPPQDNSLVAGP-----VEEPAPLR--RSSRES 503
             DS +   P  + + T+    T T   G P+ +     P     + +P P +  R  + +
Sbjct: 789  EDSGNDTSPYVRSICTHNSAGTGTLLSGTPEASFSSTAPQASSEIVDPPPRQSIRIRKST 848

Query: 504  KPPERYINCMTATLSSIPI-------PSSYKQAMENDCWQKAIESELLALEENQTWDIVP 556
            K P+   +C +++ +S          PSSYK+A+ +   Q+A++ EL AL +  TWD+VP
Sbjct: 849  KLPDFAYSCYSSSFTSFLAYIHCLFEPSSYKEAILDPLGQQAMDEELSALHKTDTWDLVP 908

Query: 557  CPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTI 616
             P     +G ++V+ IK  SDGSI+RYKA LV  G  Q+YG+DY+ETFAP+AKMTT+RT+
Sbjct: 909  LPPGKSVVGCRWVYKIKTNSDGSIERYKARLVAKGYSQQYGMDYEETFAPIAKMTTIRTL 968

Query: 617  LAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPSPNTVCKLKRSLYGLKQAPR 676
            +A+A+ + W + Q+DVKNAFL+GDLQEEVY+  P G+   S   VCKLK++LYGLKQAPR
Sbjct: 969  IAVASIRQWHISQLDVKNAFLNGDLQEEVYMAPPPGISHDS-GYVCKLKKALYGLKQAPR 1027

Query: 677  VWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLL 736
             WFEKF   +    F  S +D +LF++ T  G ++L +YVDD+++TG D D IS +K  L
Sbjct: 1028 AWFEKFSIVISSLGFVSSSHDSALFIKCTDAGRIILSLYVDDMIITGDDIDGISVLKTEL 1087

Query: 737  HSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKY 796
               F MK+LG L YFLG+EV Y   G  L+Q KY+ ++++ A LT+   VDTP+EVN +Y
Sbjct: 1088 ARRFEMKDLGYLRYFLGIEVAYSPRGYLLSQSKYVANILERARLTDNKTVDTPIEVNARY 1147

Query: 797  RRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYL 856
               +G  L DPT YR +VGSL+Y+TIT PDI++AVH VS+F+ +P   H +AV +I+RYL
Sbjct: 1148 SSSDGLPLIDPTLYRTIVGSLVYLTITHPDIAYAVHVVSQFVASPTTIHWAAVLRILRYL 1207

Query: 857  LGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSK 916
             GT+ + L     SS++L+AYSDAD    P  RKS TG+C+FLG++ ISWK KKQ  VS+
Sbjct: 1208 RGTVFQSLLLSSTSSLELRAYSDADHGSDPTDRKSVTGFCIFLGDSLISWKSKKQSIVSQ 1267

Query: 917  SSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHI 976
            SSTEAEY AM++   EI+W R LL ++G S    TP++ DN S+IQIA N V+HERTKHI
Sbjct: 1268 SSTEAEYCAMASTTKEIVWSRWLLADMGISFSHLTPMYCDNQSSIQIAHNSVFHERTKHI 1327

Query: 977  EVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQRHNFL 1019
            E+DCH  R       I LP V +S+Q AD FTK+ +  R  FL
Sbjct: 1328 EIDCHLTRHHLKHGTIALPFVPSSLQIADFFTKAHSISRFCFL 1370


>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
          Length = 1358

 Score =  620 bits (1598), Expect = e-176
 Identities = 339/759 (44%), Positives = 477/759 (62%), Gaps = 44/759 (5%)

Query: 279  EYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEA 338
            EY+S  F+EF+   GII Q +CP TP QNGVAERKNRHL++  RTLLLES+VP RFW +A
Sbjct: 613  EYLSSQFREFMTHQGIIHQTTCPYTPQQNGVAERKNRHLIETARTLLLESNVPLRFWGDA 672

Query: 339  LSTAVHLINRMPSPSIGNESPFT------RLYGHPPNYSTLRVFGCVCYVHLPPQERTKF 392
            + T+ +LINRMPS SI N+ P +       LY  PP     RVFG  C+VH     + K 
Sbjct: 673  VLTSCYLINRMPSSSIQNQVPHSILFPQSHLYPIPP-----RVFGSTCFVHNLAPGKDKL 727

Query: 393  TAQSVECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENK-YFFASHHDLVSSPISI-- 449
              ++++C FLGYS  QKG+ CY  +L R  +S +V F E++ Y+ +S+H  VS  + I  
Sbjct: 728  APRALKCVFLGYSRVQKGYRCYSHDLHRYLMSADVTFFESQPYYTSSNHPDVSMVLPIPQ 787

Query: 450  ---LPLFYDSHSRQQPS---KPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES 503
               +P F +S           PLLTY RR   T  P  D+S  A    +PAP    + + 
Sbjct: 788  VLPVPTFVESTVTSTSPVVVPPLLTYHRRPRPTLVP--DDSCHA---PDPAP----TADL 838

Query: 504  KPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKP 563
             PP +            P+     +A+ +  W++A+  E+ AL ++ TW++V  P+    
Sbjct: 839  PPPSQ------------PLALQKGEALSHSGWRQAMVDEMSALHKSGTWELVSLPAGKST 886

Query: 564  LGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQ 623
            +G ++V+++K+  DG +DR KA LV  G  Q +GLDY +TFAPVAK+ +VR  L++AA +
Sbjct: 887  VGCRWVYAVKIGPDGQVDRLKARLVAKGYTQIFGLDYSDTFAPVAKIASVRLFLSMAAVR 946

Query: 624  AWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTP--SPNTVCKLKRSLYGLKQAPRVWFEK 681
             WPLHQ+D+KNAFLHGDL+EEVY++ P G      S + VC+L+RSLYGLKQ+PR WF K
Sbjct: 947  HWPLHQLDIKNAFLHGDLEEEVYMEQPPGFVAQGESSSLVCRLRRSLYGLKQSPRAWFGK 1006

Query: 682  FRSTLLGFEFSQSRYDPSLFLQRT-PKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTF 740
            F + +  F  ++S  D S+F + + P   + L+VYVDDIV+TG+DQD I+ +K  L   F
Sbjct: 1007 FSTVIQEFGMTRSGADHSVFYRHSAPSRCIYLVVYVDDIVITGNDQDGITDLKQHLFKHF 1066

Query: 741  HMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDE 800
              K+LGRL YFLG+EV     G+ ++Q+KY  D+++  G+     VDTPM+ NVK    +
Sbjct: 1067 QTKDLGRLKYFLGIEVAQSRSGIVISQRKYALDILEETGMMGCRPVDTPMDPNVKLLPGQ 1126

Query: 801  GDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTL 860
            G+ L +P +YR+LVG L Y+T+TRPDISF V  VS+FM +P   H  AV +I+RY+    
Sbjct: 1127 GEPLSNPERYRRLVGKLNYLTVTRPDISFPVSVVSQFMTSPCDSHWEAVVRILRYIKSAP 1186

Query: 861  KRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTE 920
             +GL F       +  Y+DADWAG P  R+ST+G+C+ +G   +SWK KKQ+ V++SS E
Sbjct: 1187 GKGLLFEDQGHEHIIGYTDADWAGSPSDRRSTSGYCVLVGGNLVSWKSKKQNVVARSSAE 1246

Query: 921  AEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDC 980
            +EYRAM+ A  E++W++ LL EL F +     L  DN +A+ IA+NPV+HERTKHIE+DC
Sbjct: 1247 SEYRAMATATCELVWIKQLLGELKFGKVDKMELVCDNQAALHIASNPVFHERTKHIEIDC 1306

Query: 981  HSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQRHNFL 1019
            H +RE      I    V ++ Q ADIFTKSLT  R N++
Sbjct: 1307 HFVREKILSGDIVTKFVKSNDQLADIFTKSLTCPRINYI 1345


>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301701|pir||E84589 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1461

 Score =  597 bits (1539), Expect = e-169
 Identities = 318/745 (42%), Positives = 461/745 (61%), Gaps = 22/745 (2%)

Query: 284  SFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAV 343
            +F EF ++ GI+S  SCP TP QN V ERK++H+L+V R L+ +S++   +W + + TAV
Sbjct: 700  AFTEFYKAKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYWGDCVLTAV 759

Query: 344  HLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLG 403
             LINR PS  + N++PF  L G  P+YS L+ FGC+CY     ++R KF  +S  C FLG
Sbjct: 760  FLINRTPSALLSNKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRACVFLG 819

Query: 404  YSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISIL----PLFYDSHSR 459
            Y    KG+   D     + +SRNV F E  +  AS     ++   +     PL   +   
Sbjct: 820  YPFGFKGYKLLDLESNVVHISRNVEFHEELFPLASSQQSATTASDVFTPMDPLSSGNSIT 879

Query: 460  QQPSKPLLT-----YKRRSTATHGPPQDNSLVAGPVEEPAPLRRS---SRESKPPERYIN 511
                 P ++      KRR T      QD        ++  P+  S   S+ S     YIN
Sbjct: 880  SHLPSPQISPSTQISKRRITKFPAHLQDYHCYFVNKDDSHPISSSLSYSQISPSHMLYIN 939

Query: 512  CMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFS 571
                 +S IPIP SY +A ++  W  AI+ E+ A+E   TW+I   P   K +G K+VF+
Sbjct: 940  ----NISKIPIPQSYHEAKDSKEWCGAIDQEIGAMERTDTWEITSLPPGKKAVGCKWVFT 995

Query: 572  IKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMD 631
            +K  +DGS++R+KA +V  G  Q+ GLDY ETF+PVAKM TV+ +L ++AS+ W L+Q+D
Sbjct: 996  VKFHADGSLERFKARIVAKGYTQKEGLDYTETFSPVAKMATVKLLLKVSASKKWYLNQLD 1055

Query: 632  VKNAFLHGDLQEEVYIKLPNGMP-----TPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTL 686
            + NAFL+GDL+E +Y+KLP+G       +  PN VC+LK+S+YGLKQA R WF KF ++L
Sbjct: 1056 ISNAFLNGDLEETIYMKLPDGYADIKGTSLPPNVVCRLKKSIYGLKQASRQWFLKFSNSL 1115

Query: 687  LGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELG 746
            L   F +   D +LF++      +VLLVYVDDIV+  + + A   +   L ++F ++ELG
Sbjct: 1116 LALGFEKQHGDHTLFVRCIGSEFIVLLVYVDDIVIASTTEQAAQSLTEALKASFKLRELG 1175

Query: 747  RLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDD 806
             L YFLGLEV    EG+ L+Q+KY  +L+  A + +      PM  N++  +++G  L+D
Sbjct: 1176 PLKYFLGLEVARTSEGISLSQRKYALELLTSADMLDCKPSSIPMTPNIRLSKNDGLLLED 1235

Query: 807  PTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFF 866
               YR+LVG L+Y+TITRPDI+FAV+ + +F  APR  HL+AV ++++Y+ GT+ +GLF+
Sbjct: 1236 KEMYRRLVGKLMYLTITRPDITFAVNKLCQFSSAPRTAHLAAVYKVLQYIKGTVGQGLFY 1295

Query: 867  PVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAM 926
                 + L+ Y+DADW  CPD+R+STTG+ MF+G++ ISW+ KKQ +VS+SS EAEYRA+
Sbjct: 1296 SAEDDLTLKGYTDADWGTCPDSRRSTTGFTMFVGSSLISWRSKKQPTVSRSSAEAEYRAL 1355

Query: 927  SAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREA 986
            + A  E+ WL  LL  L      P  L++D+T+A+ IA NPV+HERTKHIE+DCH++RE 
Sbjct: 1356 ALASCEMAWLSTLLLALRVHSGVPI-LYSDSTAAVYIATNPVFHERTKHIEIDCHTVREK 1414

Query: 987  YDRRIINLPHVSTSVQTADIFTKSL 1011
             D   + L HV T  Q ADI TK L
Sbjct: 1415 LDNGQLKLLHVKTKDQVADILTKPL 1439


>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301694|pir||E84535 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1454

 Score =  589 bits (1519), Expect = e-166
 Identities = 319/761 (41%), Positives = 462/761 (59%), Gaps = 52/761 (6%)

Query: 285  FQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAVH 344
            F  F    GI+S  SCP TP QN V ERK++H+L+V R L+ +S VP   W + + TAV 
Sbjct: 690  FTSFYAEKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSQVPLSLWGDCVLTAVF 749

Query: 345  LINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLGY 404
            LINR PS  + N++P+  L G  P Y  LR FGC+CY    P++R KF  +S  C FLGY
Sbjct: 750  LINRTPSQLLMNKTPYEILTGTAPVYEQLRTFGCLCYSSTSPKQRHKFQPRSRACLFLGY 809

Query: 405  SPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSHSRQQPSK 464
                KG+   D     + +SRNV F E  +  A +    SS    L LF        P  
Sbjct: 810  PSGYKGYKLMDLESNTVFISRNVQFHEEVFPLAKNPGSESS----LKLF-------TPMV 858

Query: 465  PLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRESKPPERYIN------------- 511
            P+ +     T TH P    S +   + +  P   S R  KPP    +             
Sbjct: 859  PVSSGIISDT-THSP----SSLPSQISDLPPQISSQRVRKPPAHLNDYHCNTMQSDHKYP 913

Query: 512  ---------------CMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVP 556
                           C    ++ IPIP++Y +A +   W +A+++E+ A+E+  TW+I  
Sbjct: 914  ISSTISYSKISPSHMCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTWEITT 973

Query: 557  CPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTI 616
             P   K +G K+VF++K  +DG+++RYKA LV  G  Q+ GLDY +TF+PVAKMTT++ +
Sbjct: 974  LPKGKKAVGCKWVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTTIKLL 1033

Query: 617  LAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNG------MPTPSPNTVCKLKRSLYG 670
            L ++AS+ W L Q+DV NAFL+G+L+EE+++K+P G      +  PS N V +LKRS+YG
Sbjct: 1034 LKVSASKKWFLKQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPS-NVVLRLKRSIYG 1092

Query: 671  LKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAIS 730
            LKQA R WF+KF S+LL   F ++  D +LFL+      V++LVYVDDIV+  + + A +
Sbjct: 1093 LKQASRQWFKKFSSSLLSLGFKKTHGDHTLFLKMYDGEFVIVLVYVDDIVIASTSEAAAA 1152

Query: 731  RIKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPM 790
            ++   L   F +++LG L YFLGLEV     G+ + Q+KY  +L+Q  G+     V  PM
Sbjct: 1153 QLTEELDQRFKLRDLGDLKYFLGLEVARTTAGISICQRKYALELLQSTGMLACKPVSVPM 1212

Query: 791  EVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQ 850
              N+K R+D+GD ++D  QYR++VG L+Y+TITRPDI+FAV+ + +F  APR  HL+A  
Sbjct: 1213 IPNLKMRKDDGDLIEDIEQYRRIVGKLMYLTITRPDITFAVNKLCQFSSAPRTTHLTAAY 1272

Query: 851  QIIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKK 910
            ++++Y+ GT+ +GLF+   S + L+ ++D+DWA C D+R+STT + MF+G++ ISW+ KK
Sbjct: 1273 RVLQYIKGTVGQGLFYSASSDLTLKGFADSDWASCQDSRRSTTSFTMFVGDSLISWRSKK 1332

Query: 911  QDSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYH 970
            Q +VS+SS EAEYRA++ A  E++WL  LL  L  S   P  L++D+T+AI IA NPV+H
Sbjct: 1333 QHTVSRSSAEAEYRALALATCEMVWLFTLLVSLQASPPVPI-LYSDSTAAIYIATNPVFH 1391

Query: 971  ERTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSL 1011
            ERTKHI++DCH++RE  D   + L HV T  Q ADI TK L
Sbjct: 1392 ERTKHIKLDCHTVRERLDNGELKLLHVRTEDQVADILTKPL 1432


>gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301700|pir||G84542 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1333

 Score =  586 bits (1510), Expect = e-165
 Identities = 327/791 (41%), Positives = 447/791 (56%), Gaps = 66/791 (8%)

Query: 287  EFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAVHLI 346
            +F Q  G+I +RSC +TP +N   ERK+RHLL+V R L  ++++P +FW E + TA +LI
Sbjct: 526  KFFQEQGVIHERSCVATPERNDRVERKHRHLLNVARALRFQANLPIQFWGECVLTAAYLI 585

Query: 347  NRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLGYSP 406
            NR PS  + + +P+ RL+   P +  LRVFG +CY H   +   KF  +S  C F+GY  
Sbjct: 586  NRTPSSVLNDSTPYERLHKKQPRFDHLRVFGSLCYAHNRNRGGDKFAERSRRCVFVGYPH 645

Query: 407  HQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHD------------------------- 441
             QKG+  +D       VSR+V+F E ++ F   H+                         
Sbjct: 646  GQKGWRLFDLEQNEFFVSRDVVFSELEFPFRISHEQNVIEEEEEALWAPIVDGLIEEEVH 705

Query: 442  -----------LVSSPISILPLFYDSHSRQQPSKPLLTY-----KRRSTATHGPPQDNSL 485
                        VSSPIS  P    S S    S PL T         +T+   P    +L
Sbjct: 706  LGQNAGPTPPICVSSPIS--PSATSSRSEHSTSSPLDTEVVPTPATSTTSASSPSSPTNL 763

Query: 486  --------------VAGPVEEPAPLRRSSRESKPP----ERYINCMTATLSSIPIPSSYK 527
                             P   P P R+S+R   PP    +  +N      S   + S   
Sbjct: 764  QFLPLSRAKPTTAQAVAPPAVPPPRRQSTRNKAPPVTLKDFVVNTTVCQESPSKLNSILY 823

Query: 528  QAMENDCWQKAIESELL-----ALEENQTWDIVPCPSSVKPLGSKFVFSIKLRSDGSIDR 582
            Q  + D  ++   S        A EEN TW I   P   + +GS++V+ +K  SDGS++R
Sbjct: 824  QLQKRDDTRRFSASHTTYVAIDAQEENHTWTIEDLPPGKRAIGSQWVYKVKHNSDGSVER 883

Query: 583  YKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNAFLHGDLQ 642
            YKA LV LGNKQ+ G DY ETFAPVAKM TVR  L +A  + W +HQMDV NAFLHGDL+
Sbjct: 884  YKARLVALGNKQKEGEDYGETFAPVAKMATVRLFLDVAVKRNWEIHQMDVHNAFLHGDLR 943

Query: 643  EEVYIKLPNGMPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFL 702
            EEVY+KLP G     PN VC+L+++LYGLKQAPR WFEK  + L  + F QS  D SLF 
Sbjct: 944  EEVYMKLPPGFEASHPNKVCRLRKALYGLKQAPRCWFEKLTTALKRYGFQQSLADYSLFT 1003

Query: 703  QRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGLEVHYHHEG 762
                   + +L+YVDD+++TG+ Q A  + K  L S FHMK+LG L YFLG+EV     G
Sbjct: 1004 LVKGSVRIKILIYVDDLIITGNSQRATQQFKEYLASCFHMKDLGPLKYFLGIEVARSTTG 1063

Query: 763  VFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTI 822
            +++ Q+KY  D++   GL      + P+E N K        L DP +YR+LVG LIY+ +
Sbjct: 1064 IYICQRKYALDIISETGLLGVKPANFPLEQNHKLGLSTSPLLTDPQRYRRLVGRLIYLAV 1123

Query: 823  TRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKLQAYSDADW 882
            TR D++F+VH +++FMQ PR  H +A  +++RYL     +G+F       ++  + D+DW
Sbjct: 1124 TRLDLAFSVHILARFMQEPREDHWAAALRVVRYLKADPGQGVFLRRSGDFQITGWCDSDW 1183

Query: 883  AGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEIIWLRGLLTE 942
            AG P +R+S TG+ +  G++PISWK KKQD+VSKSS EAEYRAMS   SE++WL+ LL  
Sbjct: 1184 AGDPMSRRSVTGYFVQFGDSPISWKTKKQDTVSKSSAEAEYRAMSFLASELLWLKQLLFS 1243

Query: 943  LGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINLPHVSTSVQ 1002
            LG S  QP  +  D+ SAI IA NPV+HERTKHIE+D H +R+ + + +I   HV T+ Q
Sbjct: 1244 LGVSHVQPMIMCCDSKSAIYIATNPVFHERTKHIEIDYHFVRDEFVKGVITPRHVGTTSQ 1303

Query: 1003 TADIFTKSLTR 1013
             ADIFTK L R
Sbjct: 1304 LADIFTKPLGR 1314


>pir||G86301 probable retroelement polyprotein [imported] - Arabidopsis thaliana
            gi|9989054|gb|AAG10817.1| Putative retroelement
            polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score =  584 bits (1506), Expect = e-165
 Identities = 315/759 (41%), Positives = 451/759 (58%), Gaps = 50/759 (6%)

Query: 285  FQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAVH 344
            F+E  +  GI++  SCP TP QN V ERK++H+L+V R LL +S +P  +W + + TAV 
Sbjct: 666  FEELYRRKGIVAYHSCPETPEQNSVVERKHQHILNVARALLFQSQIPLSYWGDCILTAVF 725

Query: 345  LINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLGY 404
            +INR PSP I N++ F  L    P+Y+ L+ FGC+CY    P++R KF  ++  CAFLGY
Sbjct: 726  IINRTPSPVISNKTLFEMLTKKVPDYTHLKSFGCLCYASTSPKQRHKFEDRARTCAFLGY 785

Query: 405  SPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSHSRQQPSK 464
                KG+   D     I +SRNV+F E+ + F +            P  Y   +   PS+
Sbjct: 786  PSGYKGYKLLDLESHTIFISRNVVFYEDLFPFKTKPAENEESSVFFPHIYVDRNDSHPSQ 845

Query: 465  PLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRESKPP----ERYINCMTAT---- 516
            PL            P Q+ S    P E     +++SR S+PP    + + N +T++    
Sbjct: 846  PL------------PVQETSASNVPAE-----KQNSRVSRPPAYLKDYHCNSVTSSTDHP 888

Query: 517  --------------------LSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVP 556
                                ++ IP P +Y QA +   W  A+  E+ ALE+N TW +  
Sbjct: 889  ISEVLSYSSLSDPYMIFINAVNKIPEPHTYAQARQIKEWCDAMGMEITALEDNGTWVVCS 948

Query: 557  CPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTI 616
             P   K +G K+V+ IKL +DGS++RYKA LV  G  Q  GLDY +TF+PVAK+TTV+ +
Sbjct: 949  LPVGKKAVGCKWVYKIKLNADGSLERYKARLVAKGYTQTEGLDYVDTFSPVAKLTTVKLL 1008

Query: 617  LAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-----PNTVCKLKRSLYGL 671
            +A+AA++ W L Q+D+ NAFL+G L EE+Y+ LP G          PN VC+LK+SLYGL
Sbjct: 1009 IAVAAAKGWSLSQLDISNAFLNGSLDEEIYMTLPPGYSPRQGDSFPPNAVCRLKKSLYGL 1068

Query: 672  KQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISR 731
            KQA R W+ KF  +L    F+QS  D +LF +++    + +LVYVDDI++  S       
Sbjct: 1069 KQASRQWYLKFSESLKALGFTQSSGDHTLFTRKSKNSYMAVLVYVDDIIIASSCDRETEL 1128

Query: 732  IKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPME 791
            +++ L  +  +++LG L YFLGLE+  + +G+ + Q+KY  +L+   GL        PME
Sbjct: 1129 LRDALQRSSKLRDLGTLRYFLGLEIARNTDGISICQRKYTLELLAETGLLGCKSSSVPME 1188

Query: 792  VNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQ 851
             N K  +++G+ +DD   YRKLVG L+Y+T TRPDI++AVH + +F  APR  HL AV +
Sbjct: 1189 PNQKLSQEDGELIDDAEHYRKLVGKLMYLTFTRPDITYAVHRLCQFTSAPRVPHLKAVYK 1248

Query: 852  IIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQ 911
            II YL GT+ +GLF+     +KL  ++D+D++ C D+RK TTG+CMFLG + ++WK KKQ
Sbjct: 1249 IIYYLKGTVGQGLFYSANVDLKLSGFADSDFSSCSDSRKLTTGYCMFLGTSLVAWKSKKQ 1308

Query: 912  DSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHE 971
            + +S SS EAEY+AMS A  E++WLR LL +L     + + L+ DNT+AI IA NPV+HE
Sbjct: 1309 EVISMSSAEAEYKAMSMAVREMMWLRFLLEDLWIDVSEASVLYCDNTAAIHIANNPVFHE 1368

Query: 972  RTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKS 1010
            RTKHIE D H IRE     +I   HV T  Q ADI  KS
Sbjct: 1369 RTKHIERDYHHIREKIILGLIRTLHVRTENQLADIPYKS 1407


>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
            (gb|U12626). [Arabidopsis thaliana]
            gi|25301690|pir||G96722 hypothetical protein F20P5.25
            [imported] - Arabidopsis thaliana
          Length = 1315

 Score =  580 bits (1494), Expect = e-163
 Identities = 307/761 (40%), Positives = 459/761 (59%), Gaps = 37/761 (4%)

Query: 284  SFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAV 343
            +F +F  S GI+   SCP TP QN V ERK++H+L+V R+L  +SH+P  +W + + TAV
Sbjct: 538  NFTQFYHSKGIVPYHSCPETPQQNSVVERKHQHILNVARSLFFQSHIPISYWGDCILTAV 597

Query: 344  HLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLG 403
            +LINR+P+P + ++ PF  L    P Y  ++VFGC+CY    P++R KF+ ++  CAF+G
Sbjct: 598  YLINRLPAPILEDKCPFEVLTKTVPTYDHIKVFGCLCYASTSPKDRHKFSPRAKACAFIG 657

Query: 404  YSPHQKGFLCYDPNLRRIRVSRNVIF-------------QENKYFF------------AS 438
            Y    KG+   D     I VSR+V+F             QE + FF            +S
Sbjct: 658  YPSGFKGYKLLDLETHSIIVSRHVVFHEELFPFLGSDLSQEEQNFFPDLNPTPPMQRQSS 717

Query: 439  HH---DLVSSPISILPLFYDSHSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAP 495
             H      SS + ILP    +++  +PS      K +  A       +S+V+    E   
Sbjct: 718  DHVNPSDSSSSVEILPSANPTNNVPEPSVQTSHRKAKKPAYLQDYYCHSVVSSTPHEIRK 777

Query: 496  LRRSSRESKPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIV 555
                 R + P   ++ C+  T      PS+Y +A +   W+ A+ +E   LE   TW++ 
Sbjct: 778  FLSYDRINDPYLTFLACLDKTKE----PSNYTEAEKLQVWRDAMGAEFDFLEGTHTWEVC 833

Query: 556  PCPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRT 615
              P+  + +G +++F IK  SDGS++RYKA LV  G  Q+ G+DY+ETF+PVAK+ +V+ 
Sbjct: 834  SLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNSVKL 893

Query: 616  ILAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-----PNTVCKLKRSLYG 670
            +L +AA     L Q+D+ NAFL+GDL EE+Y++LP G  +       PN VC+LK+SLYG
Sbjct: 894  LLGVAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQGDSLPPNAVCRLKKSLYG 953

Query: 671  LKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAIS 730
            LKQA R W+ KF STLLG  F QS  D + FL+ +    + +LVY+DDI++  ++  A+ 
Sbjct: 954  LKQASRQWYLKFSSTLLGLGFIQSYCDHTCFLKISDGIFLCVLVYIDDIIIASNNDAAVD 1013

Query: 731  RIKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPM 790
             +K+ + S F +++LG L YFLGLE+    +G+ ++Q+KY  DL+   G         PM
Sbjct: 1014 ILKSQMKSFFKLRDLGELKYFLGLEIVRSDKGIHISQRKYALDLLDETGQLGCKPSSIPM 1073

Query: 791  EVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQ 850
            + ++ +  D G    +   YR+L+G L+Y+ ITRPDI+FAV+ +++F  APR  HL AV 
Sbjct: 1074 DPSMVFAHDSGGDFVEVGPYRRLIGRLMYLNITRPDITFAVNKLAQFSMAPRKAHLQAVY 1133

Query: 851  QIIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKK 910
            +I++Y+ GT+ +GLF+   S ++L+ Y++AD+  C D+R+ST+G+CMFLG++ I WK +K
Sbjct: 1134 KILQYIKGTIGQGLFYSATSELQLKVYANADYNSCRDSRRSTSGYCMFLGDSLICWKSRK 1193

Query: 911  QDSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYH 970
            QD VSKSS EAEYR++S A  E++WL   L EL     +PT L  DN +AI IA N V+H
Sbjct: 1194 QDVVSKSSAEAEYRSLSVATDELVWLTNFLKELQVPLSKPTLLFCDNEAAIHIANNHVFH 1253

Query: 971  ERTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSL 1011
            ERTKHIE DCHS+RE   + +  L H++T +Q AD FTK L
Sbjct: 1254 ERTKHIESDCHSVRERLLKGLFELYHINTELQIADPFTKPL 1294


>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|37530764|ref|NP_919684.1| putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
            copia-type polyprotein [Oryza sativa (japonica
            cultivar-group)]
          Length = 1803

 Score =  578 bits (1490), Expect = e-163
 Identities = 315/770 (40%), Positives = 441/770 (56%), Gaps = 37/770 (4%)

Query: 279  EYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEA 338
            EY S++ +  L  +G + + SCP +  QNG AER  R + D VRT+L+ S  P  FW EA
Sbjct: 627  EYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDCVRTMLVHSAAPLSFWAEA 686

Query: 339  LSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVE 398
            L TA+HLINR P  + G+  P+  L G PP Y  LRVFGC+CY +       K + +S+ 
Sbjct: 687  LQTAMHLINRRPCRATGSLKPYQLLLGAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLA 746

Query: 399  CAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFAS-------------HHD---- 441
            C F+GY    +G+ CYD   RR+  SR+V F E+ + F               H D    
Sbjct: 747  CVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPFRDAPSPRPSAPPPPDHGDDTIV 806

Query: 442  -------LVSSPISILPLFYDSHSRQQPSKPLLTYKRRSTATH------GPPQDNSLVAG 488
                    V +P+   P    +H    P  P  +    +   H       P   +   A 
Sbjct: 807  LLPAPAQHVVTPVGTAP----AHDAASPPSPASSTPSSAAPAHDVAPPPSPETSSPASAS 862

Query: 489  PVEEPAPLRRSSRESKPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEE 548
            P       R  +  SKP  RY    T+TLS  P PSS + A+ +  W+ A+++E  AL  
Sbjct: 863  PPRHAMTTRARAGISKPNPRYAMTATSTLS--PTPSSVRVALRDPNWRAAMQAEFDALLA 920

Query: 549  NQTWDIVPCPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVA 608
            N+TW +VP P   + +  K+VF  KL +DGS+D+YKA  VV G  Q  G+D+ ETF+PV 
Sbjct: 921  NRTWTLVPRPPGARIITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPVV 980

Query: 609  KMTTVRTILAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-PNTVCKLKRS 667
            K  T+RT+L + +S+ WP HQ+DV NAFLHG LQE V  + P G    + P  VC L RS
Sbjct: 981  KPATIRTVLTLISSKQWPAHQLDVSNAFLHGHLQERVLCQQPTGFEDAARPADVCLLSRS 1040

Query: 668  LYGLKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQD 727
            LYGL+QAPR WF++F        F QSR DPSLF+ R       LL+YVDD++++ S   
Sbjct: 1041 LYGLRQAPRAWFKRFADHATSLGFVQSRADPSLFVLRRGSDTAYLLLYVDDMILSASSSS 1100

Query: 728  AISRIKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVD 787
             + RI + L + F +K++G L YFLG+EV    +G  L+Q KY  D+++ AG+ N   V 
Sbjct: 1101 LLQRIIDRLQAEFKVKDMGPLKYFLGIEVQRTADGFVLSQSKYATDVLERAGMANCKAVA 1160

Query: 788  TPMEVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLS 847
            TP +   K   DEG    D + YR + G+L Y+T+TRPDI++AV  V   M APR  H++
Sbjct: 1161 TPADAKPKLSSDEGPLFQDSSWYRSIAGALQYLTLTRPDIAYAVQQVCLHMHAPREAHVT 1220

Query: 848  AVQQIIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWK 907
             +++I+RY+ GT   GL     +S  L A+SDADWAGCPDTR+ST+G+C+FLG++ ISW 
Sbjct: 1221 LLKRILRYIKGTAAFGLHLRASTSPTLTAFSDADWAGCPDTRRSTSGFCIFLGDSLISWS 1280

Query: 908  CKKQDSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANP 967
             K+Q +VS+SS EAEYR ++ A +E  WLR LL EL     Q T  + DN S++ ++ NP
Sbjct: 1281 SKRQTTVSRSSAEAEYRGVANAVAECTWLRQLLGELHCRVPQATIAYCDNISSVYMSKNP 1340

Query: 968  VYHERTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQRHN 1017
            V+H+RTKHIE+D H +RE      + +  + ++ Q AD+FTK L     N
Sbjct: 1341 VHHKRTKHIELDIHFVREKVALGELRVLPIPSAHQFADVFTKGLPSSMFN 1390


>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
            gi|7444456|pir||T01908 hypothetical protein T12H20.12 -
            Arabidopsis thaliana
          Length = 1392

 Score =  570 bits (1468), Expect = e-160
 Identities = 302/736 (41%), Positives = 450/736 (61%), Gaps = 41/736 (5%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GE++S+ F   L S GI    SCP TP QNG+AER++R+L ++  +L+  S VP + W E
Sbjct: 588  GEFVSYKFVAHLASCGIKQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHKLWVE 647

Query: 338  ALSTAVHLINRMPSPSIG-NESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQS 396
            A  T+  L N +PS ++  N+SP+  L+G PP Y+ LRVFG  CY +L P  + KF  +S
Sbjct: 648  AFFTSNFLSNLLPSSTLSDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKFDPKS 707

Query: 397  VECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDS 456
            + C FLGY+   KG+ C  P   ++ + R+V+F E K+ ++  +    + IS  PLF   
Sbjct: 708  LLCVFLGYNNKYKGYRCLHPPTGKVYICRHVLFDERKFPYSDIYSQFQT-ISGSPLF--- 763

Query: 457  HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRESKPPERYINCMTAT 516
                        +++  ++T                       SR +KP  +Y   + + 
Sbjct: 764  ----------TAWQKGFSST---------------------ALSRITKPNPKY--ALFSV 790

Query: 517  LSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLRS 576
             S+ P P S K+A++++ W  A+  E+  + E  TWD+VP     + LG K+VF  KL S
Sbjct: 791  KSNYPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWDLVPPEMVDRLLGCKWVFKTKLNS 850

Query: 577  DGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNAF 636
            DGS+DR KA LV  G +QE G+DY ET++PV +  TVR+IL +A    W L Q+DVKNAF
Sbjct: 851  DGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATVRSILHVATINKWSLKQLDVKNAF 910

Query: 637  LHGDLQEEVYIKLPNGMPTPS-PNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQSR 695
            LH +L+E V++  P G   PS P+ VCKLK+++Y LKQAPR WF+KF S LL + F  S 
Sbjct: 911  LHDELKETVFMTQPPGFEDPSRPDYVCKLKKAIYDLKQAPRAWFDKFSSYLLKYGFICSF 970

Query: 696  YDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGLE 755
             DPSLF+    + ++ LL+YVDD+++TG++   + ++ N+L + F MK++G L YFLG++
Sbjct: 971  SDPSLFVYLKGRDVMFLLLYVDDMILTGNNDVLLQQLLNILSTEFRMKDMGALHYFLGIQ 1030

Query: 756  VHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLVG 815
             HYH++G+FL+Q+KY  DL+  AG+++ + + TP+++++   +       +PT +R+L G
Sbjct: 1031 AHYHNDGLFLSQEKYTSDLLVNAGMSDCSSMPTPLQLDL--LQGNNKPFPEPTYFRRLAG 1088

Query: 816  SLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKLQ 875
             L Y+T+TRPDI FAV+ V + M AP       +++I+ YL GT+  G+     +   L+
Sbjct: 1089 KLQYLTLTRPDIQFAVNFVCQKMHAPTMSDFHLLKRILHYLKGTMTMGINLSSNTDSVLR 1148

Query: 876  AYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEIIW 935
             YSD+DWAGC DTR+ST G+C FLG   ISW  K+  +VSKSSTEAEYR +S A SE+ W
Sbjct: 1149 CYSDSDWAGCKDTRRSTGGFCTFLGYNIISWSAKRHPTVSKSSTEAEYRTLSFAASEVSW 1208

Query: 936  LRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINLP 995
            +  LL E+G  Q Q   ++ DN SA+ ++ANP  H R+KH +VD + +RE      + + 
Sbjct: 1209 IGFLLQEIGLPQQQIPEMYCDNLSAVYLSANPALHSRSKHFQVDYYYVRERVALGALTVK 1268

Query: 996  HVSTSVQTADIFTKSL 1011
            H+  S Q ADIFTKSL
Sbjct: 1269 HIPASQQLADIFTKSL 1284


>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
            Arabidopsis thaliana BAC gb|AF080119 and is a member of
            the reverse transcriptase family PF|00078
            gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
            Arabidopsis thaliana
          Length = 1415

 Score =  567 bits (1461), Expect = e-160
 Identities = 312/771 (40%), Positives = 456/771 (58%), Gaps = 38/771 (4%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GE++S+  +  L  +GI  + SCP TP QNG+AERK+RHL+++  ++L  SH P +FW E
Sbjct: 584  GEFVSNKLKTHLSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVE 643

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            +  TA ++INR+PS  + N SP+  L+G  P+YS+LRVFG  CY  L P  + KF  +S+
Sbjct: 644  SFFTANYIINRLPSSVLKNLSPYEALFGEKPDYSSLRVFGSACYPCLRPLAQNKFDPRSL 703

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSH 457
            +C FLGY+   KG+ C+ P   ++ +SRNVIF E++  F   +  +    S   L    H
Sbjct: 704  QCVFLGYNSQYKGYRCFYPPTGKVYISRNVIFNESELPFKEKYQSLVPQYSTPLLQAWQH 763

Query: 458  SR-----------QQPSKP--LLTYK-RRSTATHGPPQDNSLVAGPVEEPAPL------- 496
            ++           Q  SKP  L TY   + T     P+  S   G  EE  P+       
Sbjct: 764  NKISEISVPAAPVQLFSKPIDLNTYAGSQVTEQLTDPEPTSNNEGSDEEVNPVAEEIAAN 823

Query: 497  ------------RRSSRESKPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELL 544
                        R  +   KP  RY   +T+ +++   P +   AM++  W +A+  E+ 
Sbjct: 824  QEQVINSHAMTTRSKAGIQKPNTRYA-LITSRMNTAE-PKTLASAMKHPGWNEAVHEEIN 881

Query: 545  ALEENQTWDIVPCPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETF 604
             +    TW +VP    +  L SK+VF  KL  DGSID+ KA LV  G  QE G+DY ETF
Sbjct: 882  RVHMLHTWSLVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETF 941

Query: 605  APVAKMTTVRTILAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-PNTVCK 663
            +PV +  T+R +L ++ S+ WP+ Q+DV NAFLHG+LQE V++  P+G   P  P  VC+
Sbjct: 942  SPVVRTATIRLVLDVSTSKGWPIKQLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTHVCR 1001

Query: 664  LKRSLYGLKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTG 723
            L +++YGLKQAPR WF+ F + LL + F  S+ DPSLF+      ++ LL+YVDDI++TG
Sbjct: 1002 LTKAIYGLKQAPRAWFDTFSNFLLDYGFVCSKSDPSLFVCHQDGKILYLLLYVDDILLTG 1061

Query: 724  SDQDAISRIKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNA 783
            SDQ  +  +   L + F MK+LG   YFLG+++  +  G+FL+Q  Y  D++Q AG+++ 
Sbjct: 1062 SDQSLLEDLLQALKNRFSMKDLGPPRYFLGIQIEDYANGLFLHQTAYATDILQQAGMSDC 1121

Query: 784  TLVDTPMEVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRH 843
              + TP+   +     E     +PT +R L G L Y+TITRPDI FAV+ + + M +P  
Sbjct: 1122 NPMPTPLPQQLDNLNSE--LFAEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTT 1179

Query: 844  FHLSAVQQIIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAP 903
                 +++I+RY+ GT+  GL     S++ L AYSD+D AGC +TR+STTG+C+ LG+  
Sbjct: 1180 SDFGLLKRILRYIKGTIGMGLPIKRNSTLTLSAYSDSDHAGCKNTRRSTTGFCILLGSNL 1239

Query: 904  ISWKCKKQDSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQI 963
            ISW  K+Q +VS SSTEAEYRA++ A  EI W+  LL +LG  Q  PT ++ DN SA+ +
Sbjct: 1240 ISWSAKRQPTVSNSSTEAEYRALTYAAREITWISFLLRDLGIPQYLPTQVYCDNLSAVYL 1299

Query: 964  AANPVYHERTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQ 1014
            +ANP  H R+KH + D H IRE     +I   H+S + Q AD+FTKSL R+
Sbjct: 1300 SANPALHNRSKHFDTDYHYIREQVALGLIETQHISATFQLADVFTKSLPRR 1350


>emb|CAB10526.1| retrotransposon like protein [Arabidopsis thaliana]
            gi|7268497|emb|CAB78748.1| retrotransposon like protein
            [Arabidopsis thaliana] gi|7444421|pir||A71444 probable
            LTR retrotransposon - Arabidopsis thaliana
          Length = 1433

 Score =  567 bits (1460), Expect = e-160
 Identities = 299/745 (40%), Positives = 446/745 (59%), Gaps = 28/745 (3%)

Query: 285  FQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAVH 344
            F +   ++GI++  SCP TP QN V ERK++H+L+V R LL +S++P  FW + + TAV 
Sbjct: 676  FTDLFAAHGIVAYHSCPETPEQNSVVERKHQHILNVARALLFQSNIPLEFWGDCVLTAVF 735

Query: 345  LINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLGY 404
            LINR+P+P + N+SP+ +L   PP Y +L+ FGC+CY    P++R KF  ++  C FLGY
Sbjct: 736  LINRLPTPVLNNKSPYEKLKNIPPAYESLKTFGCLCYSSTSPKQRHKFEPRARACVFLGY 795

Query: 405  SPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSHSRQQPSK 464
                KG+   D     + +SR+VIF E+ + F S   +        PL       Q P++
Sbjct: 796  PLGYKGYKLLDIETHAVSISRHVIFHEDIFPFISS-TIKDDIKDFFPLL------QFPAR 848

Query: 465  PL-LTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRESKPPERY--INCMTAT----- 516
               L  ++ S     P QD S     V    PL  S R+ KPP+     +C   T     
Sbjct: 849  TDDLPLEQTSIIDTHPHQDVSSSKALVPFD-PL--SKRQKKPPKHLQDFHCYNNTTEPFH 905

Query: 517  -----LSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFS 571
                 +++  IP  Y +A +   W  A++ E+ A+    TW +V  P + K +G K+VF+
Sbjct: 906  AFINNITNAVIPQRYSEAKDFKAWCDAMKEEIGAMVRTNTWSVVSLPPNKKAIGCKWVFT 965

Query: 572  IKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMD 631
            IK  +DGSI+RYKA LV  G  QE GLDY+ETF+PVAK+T+VR +L +AA   W +HQ+D
Sbjct: 966  IKHNADGSIERYKARLVAKGYTQEEGLDYEETFSPVAKLTSVRMMLLLAAKMKWSVHQLD 1025

Query: 632  VKNAFLHGDLQEEVYIKLPNGMP-----TPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTL 686
            + NAFL+GDL EE+Y+K+P G          P+ +C+L +S+YGLKQA R W+ K  +TL
Sbjct: 1026 ISNAFLNGDLDEEIYMKIPPGYADLVGEALPPHAICRLHKSIYGLKQASRQWYLKLSNTL 1085

Query: 687  LGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELG 746
             G  F +S  D +LF++     ++ +LVYVDDI++  +  DA+++    L S F +++LG
Sbjct: 1086 KGMGFQKSNADHTLFIKYANGVLMGVLVYVDDIMIVSNSDDAVAQFTAELKSYFKLRDLG 1145

Query: 747  RLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDD 806
               YFLG+E+    +G+ + Q+KYI +L+   G   +     P++ +VK  +++G  L D
Sbjct: 1146 AAKYFLGIEIARSEKGISICQRKYILELLSTTGFLGSKPSSIPLDPSVKLNKEDGVPLTD 1205

Query: 807  PTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFF 866
             T YRKLVG L+Y+ ITRPDI++AV+T+ +F  AP   HLSAV +++RYL GT+ +GLF+
Sbjct: 1206 STSYRKLVGKLMYLQITRPDIAYAVNTLCQFSHAPTSVHLSAVHKVLRYLKGTVGQGLFY 1265

Query: 867  PVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAM 926
                   L+ Y+D+D+  C D+R+    +CMF+G+  +SWK KKQD+VS S+ EAE+RAM
Sbjct: 1266 SADDKFDLRGYTDSDFGSCTDSRRCVAAYCMFIGDYLVSWKSKKQDTVSMSTAEAEFRAM 1325

Query: 927  SAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREA 986
            S    E+IWL  L  +       P  L+ DNT+A+ I  N V+HERTK +E+DC+  REA
Sbjct: 1326 SQGTKEMIWLSRLFDDFKVPFIPPAYLYCDNTAALHIVNNSVFHERTKFVELDCYKTREA 1385

Query: 987  YDRRIINLPHVSTSVQTADIFTKSL 1011
             +   +    V T  Q AD  TK++
Sbjct: 1386 VESGFLKTMFVETGEQVADPLTKAI 1410


>gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum]
          Length = 1333

 Score =  558 bits (1438), Expect = e-157
 Identities = 303/754 (40%), Positives = 459/754 (60%), Gaps = 25/754 (3%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GE++S+ F  F + NGI  + + P TP QNGVAERKNR ++++ R+ L    +P  FW E
Sbjct: 575  GEFLSNDFNLFCEENGIRRELTAPYTPEQNGVAERKNRTVVEMARSSLKAKGLPDYFWGE 634

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            A++T V+ +N  P+  + N +P     G  P  S LR+FGC+ Y  +     +K   +S 
Sbjct: 635  AVATVVYFLNISPTKDVWNTTPLEAWNGKKPRVSHLRIFGCIAYALV--NFHSKLDEKST 692

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDS- 456
            +C F+GYS   K +  Y+P   ++ +SRNV+F E+  +  +  +++S+ I +LP   +S 
Sbjct: 693  KCIFVGYSLQSKAYRLYNPISGKVIISRNVVFNEDVSWNFNSGNMMSN-IQLLPTDEESA 751

Query: 457  --HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVE---EPAPLRRSSRESKPPERYIN 511
                    S P+      S++   P   ++ VA P E   EP PLRRS+RE KP  +Y N
Sbjct: 752  VDFGNSPNSSPV------SSSVSSPIAPSTTVA-PDESSVEPIPLRRSTREKKPNPKYSN 804

Query: 512  -----CMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGS 566
                 C  A L S PI   Y++A+E   W+ A+  E+ A+E N TW++V  P     +G 
Sbjct: 805  TVNTSCQFALLVSDPI--CYEEAVEQSEWKNAMIEEIQAIERNSTWELVDAPEGKNVIGL 862

Query: 567  KFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWP 626
            K+VF  K  +DGSI ++KA LV  G  Q+ G+D+DETF+PVA+  TVR +LA+AA    P
Sbjct: 863  KWVFRTKYNADGSIQKHKARLVAKGYSQQQGVDFDETFSPVARFETVRVVLALAAQLHLP 922

Query: 627  LHQMDVKNAFLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRST 685
            ++Q DVK+AFL+GDL+EEVY+  P G M T + N V KL+++LYGLKQAPR W+ K  S 
Sbjct: 923  VYQFDVKSAFLNGDLEEEVYVSQPQGFMITGNENKVYKLRKALYGLKQAPRAWYSKIDSF 982

Query: 686  LLGFEFSQSRYDPSLFLQRTPKGMVVLL-VYVDDIVVTGSDQDAISRIKNLLHSTFHMKE 744
              G  F +S  +P+L+L++      +L+ +YVDD++  GS +  ++  K+ +   F M +
Sbjct: 983  FQGSGFRRSDNEPTLYLKKQGTDEFLLVCLYVDDMIYIGSSKSLVNDFKSNMMRNFEMSD 1042

Query: 745  LGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHL 804
            LG L YFLGLEV    +G+F++Q+KY +DL++   + N  +  TPM +N K +R +G   
Sbjct: 1043 LGLLKYFLGLEVIQDKDGIFISQKKYAEDLLKKFQMMNCEVATTPMNINEKLQRADGTEK 1102

Query: 805  DDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGL 864
             +P  +R LVG L Y+T TRPDI+F+V  VS+F+Q+P   H  A ++++RY+ GT   G+
Sbjct: 1103 ANPKLFRSLVGGLNYLTHTRPDIAFSVSVVSRFLQSPTKQHFGAAKRVLRYVAGTTDFGI 1162

Query: 865  FFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYR 924
            ++    + +L  ++D+D+AGC D RKST+G C   G+  ++W  KKQ++V+ S++EAEY 
Sbjct: 1163 WYSKAPNFRLVGFTDSDYAGCLDDRKSTSGSCFSFGSGVVTWSSKKQETVALSTSEAEYT 1222

Query: 925  AMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIR 984
            A S A  + +WLR LL +  + Q + T + +D+ SAI +A NP +H RTKHI+V  H IR
Sbjct: 1223 AASLAARQALWLRKLLEDFSYEQKESTEIFSDSKSAIAMAKNPSFHGRTKHIDVQYHFIR 1282

Query: 985  EAYDRRIINLPHVSTSVQTADIFTKSLTRQRHNF 1018
                   I L   ST+ Q ADIFTKSL + +H +
Sbjct: 1283 TLVADGRIVLKFCSTNEQAADIFTKSLPQAKHEY 1316


>gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|50919599|ref|XP_470160.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
          Length = 1335

 Score =  547 bits (1410), Expect = e-154
 Identities = 285/760 (37%), Positives = 457/760 (59%), Gaps = 44/760 (5%)

Query: 276  QWGEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFW 335
            Q  EY+S  F+++ ++ GI  Q +   +  QNGVAERKNR + D+  ++L +  +P  FW
Sbjct: 587  QGREYISKEFEKYCENAGIRRQLTAGYSAQQNGVAERKNRTINDMANSMLQDKGMPKSFW 646

Query: 336  CEALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQ 395
             EA++TAV+++NR P+ ++ N +PF   YG  P    +RVFGC+CY  +P Q+R KF  +
Sbjct: 647  AEAVNTAVYILNRSPTKAVTNRTPFEAWYGKKPVIGHMRVFGCICYAQVPAQKRVKFDNK 706

Query: 396  SVECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISIL----- 450
            S  C F+GY+   KG+  Y+   ++I +SR+ IF E+  +     +  S+P+        
Sbjct: 707  SDRCIFVGYADGIKGYRLYNLEKKKIIISRDAIFDESATWNWKSPEASSTPLLPTTTITL 766

Query: 451  --PLFYDSHSRQ------QPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLR----- 497
              P  + +H  +      QPS P+ +    S ++   P     ++ P   P  +R     
Sbjct: 767  GQPHMHGTHEVEDHTPSPQPSSPMSS---SSASSDSSPSSEEQISTPESAPRRVRSMVEL 823

Query: 498  -RSSRESKPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVP 556
              S+ + +  E++  C  + +     P S+++A ++D W KA+E E+  +E+N TW++V 
Sbjct: 824  LESTSQQRGSEQHEFCNYSVVE----PQSFQEAEKHDNWIKAMEDEIHMIEKNNTWELVD 879

Query: 557  CPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTI 616
             P   + +G K+V+  KL  DGS+ +YKA LV  G KQ+ G+DY ET+APVA++ T+RTI
Sbjct: 880  RPRDREVIGVKWVYKTKLNPDGSVQKYKARLVAKGFKQKPGIDYYETYAPVARLETIRTI 939

Query: 617  LAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-PNTVCKLKRSLYGLKQAP 675
            +A+AA + W ++Q+DVK+AFL+G L EE+Y++ P G       N V +LK++LYGLKQAP
Sbjct: 940  IALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQPEGFSVQGGENKVFRLKKALYGLKQAP 999

Query: 676  RVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNL 735
            R W+ +     +   F++S  +P+L++ +T   ++++ +YVDD++ TG+ +  +   K  
Sbjct: 1000 RAWYSQIDKYFIQKGFAKSISEPTLYVNKTGTDILIVSLYVDDLIYTGNSEKMMQDFKKD 1059

Query: 736  LHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVK 795
            +  T+ M +LG L YFLG+EVH   EG+F++Q+KY +++++   + N   V TP+  N K
Sbjct: 1060 MMHTYEMSDLGLLHYFLGMEVHQSDEGIFISQRKYAENILKKFKMDNCKSVTTPLLPNEK 1119

Query: 796  YRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRY 855
             +  +G    DPT YR LVGSL+Y+T TRPDI FA   +S++M +P   + +A ++++RY
Sbjct: 1120 QKARDGADKADPTIYRSLVGSLLYLTATRPDIMFAASLLSRYMSSPSQLNFTAAKRVLRY 1179

Query: 856  LLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVS 915
            + GT   G+++      KL  Y+D+DWAGC D  KST+G+   LG+A             
Sbjct: 1180 IKGTADYGIWYKPVKESKLIGYTDSDWAGCLDDMKSTSGYAFSLGSA------------- 1226

Query: 916  KSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKH 975
                EAEY A S A S+++WLR ++ +LG  Q QPT ++ D+ SAI I+ NPV H+RTKH
Sbjct: 1227 ----EAEYVAASKAVSQVVWLRRIMEDLGEKQYQPTTIYCDSKSAIAISENPVSHDRTKH 1282

Query: 976  IEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQR 1015
            I +  H IREA DR+ + L    T  Q ADIFTK+L++++
Sbjct: 1283 IAIKYHYIREAVDRQEVKLEFCRTDEQLADIFTKALSKEK 1322


>gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25301681|pir||F86246
            hypothetical protein [imported] - Arabidopsis thaliana
          Length = 1352

 Score =  531 bits (1368), Expect = e-149
 Identities = 275/745 (36%), Positives = 437/745 (57%), Gaps = 13/745 (1%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GE+ S  F ++ + NGI  Q + P +P QNGV ERKNR +L++ R++L    +P   W E
Sbjct: 599  GEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAE 658

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            A++ AV+L+NR P+ S+  ++P     G  P  S LRVFG + + H+P ++R+K   +S 
Sbjct: 659  AVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSE 718

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIF-QENKYFFASHHDLVSSPISILPLFYDS 456
            +  F+GY  + KG+  Y+P+ ++  +SRN++F +E ++ + S+ +      +  P F + 
Sbjct: 719  KYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEE----DYNFFPHFEED 774

Query: 457  HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES-KPPERYINCMTA 515
                   +P          T      +S +     E  P  RS +E  +  E   N    
Sbjct: 775  EPEPTREEP----PSEEPTTPPTSPTSSQIEESSSERTPRFRSIQELYEVTENQENLTLF 830

Query: 516  TLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLR 575
             L +   P  +++A+E   W+ A++ E+ ++++N TW++   P+  K +G K+V+  K  
Sbjct: 831  CLFAECEPMDFQKAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKN 890

Query: 576  SDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNA 635
            S G ++RYKA LV  G  Q  G+DYDE FAPVA++ TVR I+++AA   W +HQMDVK+A
Sbjct: 891  SKGEVERYKARLVAKGYSQRVGIDYDEVFAPVARLETVRLIISLAAQNKWKIHQMDVKSA 950

Query: 636  FLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQS 694
            FL+GDL+EEVYI+ P G +     + V +LK+ LYGLKQAPR W  +        +F + 
Sbjct: 951  FLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKVLYGLKQAPRAWNTRIDKYFKEKDFIKC 1010

Query: 695  RYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGL 754
             Y+ +L+++   + +++  +YVDD++ TG++       K  +   F M ++G ++Y+LG+
Sbjct: 1011 PYEHALYIKIQKEDILIACLYVDDLIFTGNNPSIFEEFKKEMTKEFEMTDIGLMSYYLGI 1070

Query: 755  EVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLV 814
            EV     G+F+ Q+ Y +++++   + ++  V TPME  +K  + E     DPT ++ LV
Sbjct: 1071 EVKQEDNGIFITQEGYAKEVLKKFKMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLV 1130

Query: 815  GSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKL 874
            GSL Y+T TRPDI +AV  VS++M+ P   H  A ++I+RY+ GT+  GL +   S  KL
Sbjct: 1131 GSLRYLTCTRPDILYAVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKL 1190

Query: 875  QAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEII 934
              YSD+DW G  D RKST+G+  ++G+   +W  KKQ  V+ S+ EAEY A ++     I
Sbjct: 1191 VGYSDSDWGGDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAI 1250

Query: 935  WLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINL 994
            WLR LL EL   Q++PT +  DN SAI +A NPV+H+R+KHI+   H IRE   ++ + L
Sbjct: 1251 WLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQL 1310

Query: 995  PHVSTSVQTADIFTKSLTRQRHNFL 1019
             +V T  Q AD FTK L  +R NF+
Sbjct: 1311 EYVKTHDQVADFFTKPL--KRENFI 1333


>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] gi|11278364|pir||T47925
            copia-type polyprotein - Arabidopsis thaliana
          Length = 1352

 Score =  531 bits (1368), Expect = e-149
 Identities = 275/745 (36%), Positives = 437/745 (57%), Gaps = 13/745 (1%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GE+ S  F ++ + NGI  Q + P +P QNGV ERKNR +L++ R++L    +P   W E
Sbjct: 599  GEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAE 658

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            A++ AV+L+NR P+ S+  ++P     G  P  S LRVFG + + H+P ++R+K   +S 
Sbjct: 659  AVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSE 718

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIF-QENKYFFASHHDLVSSPISILPLFYDS 456
            +  F+GY  + KG+  Y+P+ ++  +SRN++F +E ++ + S+ +      +  P F + 
Sbjct: 719  KYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEE----DYNFFPHFEED 774

Query: 457  HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES-KPPERYINCMTA 515
                   +P          T      +S +     E  P  RS +E  +  E   N    
Sbjct: 775  EPEPTREEP----PSEEPTTPPTSPTSSQIEESSSERTPRFRSIQELYEVTENQENLTLF 830

Query: 516  TLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLR 575
             L +   P  +++A+E   W+ A++ E+ ++++N TW++   P+  K +G K+V+  K  
Sbjct: 831  CLFAECEPMDFQKAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKN 890

Query: 576  SDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNA 635
            S G ++RYKA LV  G  Q  G+DYDE FAPVA++ TVR I+++AA   W +HQMDVK+A
Sbjct: 891  SKGEVERYKARLVAKGYSQRVGIDYDEVFAPVARLETVRLIISLAAQNKWKIHQMDVKSA 950

Query: 636  FLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQS 694
            FL+GDL+EEVYI+ P G +     + V +LK+ LYGLKQAPR W  +        +F + 
Sbjct: 951  FLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKVLYGLKQAPRAWNTRIDKYFKEKDFIKC 1010

Query: 695  RYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGL 754
             Y+ +L+++   + +++  +YVDD++ TG++       K  +   F M ++G ++Y+LG+
Sbjct: 1011 PYEHALYIKIQKEDILIACLYVDDLIFTGNNPSIFEEFKKEMTKEFEMTDIGLMSYYLGI 1070

Query: 755  EVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLV 814
            EV     G+F+ Q+ Y +++++   + ++  V TPME  +K  + E     DPT ++ LV
Sbjct: 1071 EVKQEDNGIFITQEGYAKEVLKKFKIDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLV 1130

Query: 815  GSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKL 874
            GSL Y+T TRPDI +AV  VS++M+ P   H  A ++I+RY+ GT+  GL +   S  KL
Sbjct: 1131 GSLRYLTCTRPDILYAVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKL 1190

Query: 875  QAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEII 934
              YSD+DW G  D RKST+G+  ++G+   +W  KKQ  V+ S+ EAEY A ++     I
Sbjct: 1191 VGYSDSDWGGDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAI 1250

Query: 935  WLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINL 994
            WLR LL EL   Q++PT +  DN SAI +A NPV+H+R+KHI+   H IRE   ++ + L
Sbjct: 1251 WLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQL 1310

Query: 995  PHVSTSVQTADIFTKSLTRQRHNFL 1019
             +V T  Q AD FTK L  +R NF+
Sbjct: 1311 EYVKTHDQVADFFTKPL--KRENFI 1333


>emb|CAB75932.1| putative protein [Arabidopsis thaliana] gi|11278365|pir||T47841
            hypothetical protein T2O9.150 - Arabidopsis thaliana
          Length = 1339

 Score =  530 bits (1365), Expect = e-148
 Identities = 273/752 (36%), Positives = 443/752 (58%), Gaps = 18/752 (2%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GE+ S+ F EF +S+GI  Q +   TP QNGVAERKNR +++ VR++L E  VP  FW E
Sbjct: 567  GEFTSNEFGEFCRSHGISRQLTAAFTPQQNGVAERKNRTIMNAVRSMLSERQVPKMFWSE 626

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            A   +VH+ NR P+ ++   +P     G  P     RVFGC+ YVH+P Q+R+K   +S 
Sbjct: 627  ATKWSVHIQNRSPTAAVEGMTPEEAWSGRKPVVEYFRVFGCIGYVHIPDQKRSKLDDKSK 686

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSH 457
            +C FLG S   K +  YDP +++I +S++V+F E+K +     D+ +  +++     D  
Sbjct: 687  KCVFLGVSEESKAWRLYDPVMKKIVISKDVVFDEDKSWDWDQADVEAKEVTLECGDEDDE 746

Query: 458  SRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLR-RSSRESKPP---------- 506
               +  +P+         +      + ++A     P+P+  + +RE +PP          
Sbjct: 747  KNSEVVEPIAVASPNHVGSDNNVSSSPILAPSSPAPSPVAAKVTRERRPPGWMADYETGE 806

Query: 507  ----ERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVK 562
                E  ++ M   + +   P  +  A+++  W++A+E E+ ++ +N TW++   P    
Sbjct: 807  GEEIEENLSVMLLMMMTEADPIQFDDAVKDKIWREAMEHEIESIVKNNTWELTTLPKGFT 866

Query: 563  PLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAAS 622
            P+G K+V+  KL  DG +D+YKA LV  G  Q YG+DY E FAPVA++ TVRTILAI++ 
Sbjct: 867  PIGVKWVYKTKLNEDGEVDKYKARLVAKGYAQCYGIDYTEVFAPVARLDTVRTILAISSQ 926

Query: 623  QAWPLHQMDVKNAFLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEK 681
              W + Q+DVK+AFLHG+L+EEVY++ P G +       V KL+++LYGLKQAPR W+ +
Sbjct: 927  FNWEIFQLDVKSAFLHGELKEEVYVRQPEGFIREGEEEKVYKLRKALYGLKQAPRAWYSR 986

Query: 682  FRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFH 741
              +  L  EF +   + +LF +     ++++ +YVDD++ TGSD+      K  +   F 
Sbjct: 987  IEAYFLKEEFERCPSEHTLFTKTRVGNILIVSLYVDDLIFTGSDKAMCDEFKKSMMLEFE 1046

Query: 742  MKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEG 801
            M +LG++ +FLG+EV     G+F+ Q++Y ++++   G+  +  V  P+    K  +DE 
Sbjct: 1047 MSDLGKMKHFLGIEVKQSDGGIFICQRRYAREVLARFGMDESNAVKNPIVPGTKLTKDEN 1106

Query: 802  DHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLK 861
                D T +++LVGSL+Y+T+TRPD+ + V  +S+FM  PR  H  A ++I+RYL GT++
Sbjct: 1107 GEKVDETMFKQLVGSLMYLTVTRPDLMYGVCLISRFMSNPRMSHWLAAKRILRYLKGTVE 1166

Query: 862  RGLFF--PVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSST 919
             G+F+      S+KL A++D+D+AG  + R+ST+G+   + +  I W  KKQ  V+ S+T
Sbjct: 1167 LGIFYRRRKNRSLKLMAFTDSDYAGDLNDRRSTSGFVFLMASGAICWASKKQPVVALSTT 1226

Query: 920  EAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVD 979
            EAEY A +    + +WLR +L +LG  +   T ++ DN+S IQ++ +PV H ++KHIEV 
Sbjct: 1227 EAEYIAAAFCACQCVWLRKVLEKLGAEEKSATVINCDNSSTIQLSKHPVLHGKSKHIEVR 1286

Query: 980  CHSIREAYDRRIINLPHVSTSVQTADIFTKSL 1011
             H +R+  +  ++ L +  T  Q ADIFTK L
Sbjct: 1287 FHYLRDLVNGDVVKLEYCPTEDQVADIFTKPL 1318


>gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1352

 Score =  530 bits (1364), Expect = e-148
 Identities = 274/740 (37%), Positives = 435/740 (58%), Gaps = 11/740 (1%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GE+ S  F ++ + NGI  Q + P +P QNGVAERKNR +L++ R++L    +P   W E
Sbjct: 599  GEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAE 658

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            A++ AV+L+NR P+ S+  ++P     G     S LRVFG + + H+P ++R+K   +S 
Sbjct: 659  AVACAVYLLNRSPTKSVSGKTPQEAWSGRKSGVSHLRVFGSIAHAHVPDEKRSKLDDKSE 718

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIF-QENKYFFASHHDLVSSPISILPLFYDS 456
            +  F+GY  + KG+  Y+P+ ++  +SRN++F +E ++ + S+ +      +  P F + 
Sbjct: 719  KYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEE----DYNFFPHFEED 774

Query: 457  HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES-KPPERYINCMTA 515
                   +P          T      +S +     E  P  RS +E  +  E   N    
Sbjct: 775  EPEPTREEP----PSEEPTTPPTSPTSSQIEESSSERTPRFRSIQELYEVTENQENLTLF 830

Query: 516  TLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLR 575
             L +   P  +++A+E   W+ A++ E+ ++++N TW++   P+  K +G K+V+  K  
Sbjct: 831  CLFAECEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKTIGVKWVYKAKKN 890

Query: 576  SDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNA 635
            S G ++RYKA LV  G  Q  G+DYDE FAPVA++ TVR I+++AA   W +HQMDVK+A
Sbjct: 891  SKGEVERYKARLVAKGYIQRAGIDYDEVFAPVARLETVRLIISLAAQNKWKIHQMDVKSA 950

Query: 636  FLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQS 694
            FL+GDL+EEVYI+ P G +     + V +LK++LYGLKQAPR W  +        +F + 
Sbjct: 951  FLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAPRAWNTRIDKYFKEKDFIKC 1010

Query: 695  RYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGL 754
             Y+ +L+++   + +++  +YVDD++ TG++       K  +   F M ++G ++Y+LG+
Sbjct: 1011 PYEHALYIKIQKEDILIACLYVDDLIFTGNNPSMFEEFKKEMTKEFEMTDIGLMSYYLGI 1070

Query: 755  EVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLV 814
            EV     G+F+ Q+ Y +++++   + ++  V TPME  +K  + E     DPT ++ LV
Sbjct: 1071 EVKQEDNGIFITQEGYAKEVLKKFKMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLV 1130

Query: 815  GSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKL 874
            GSL Y+T TRPDI +AV  VS++M+ P   H  A ++I+RY+ GT+  GL +   S  KL
Sbjct: 1131 GSLRYLTCTRPDILYAVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKL 1190

Query: 875  QAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEII 934
              YSD+DW G  D RKST+G+  ++G+   +W  KKQ  V  S+ EAEY A ++     I
Sbjct: 1191 VGYSDSDWGGDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVVLSTCEAEYVAATSCVCHAI 1250

Query: 935  WLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINL 994
            WLR LL EL   Q++PT +  DN SAI +A NPV+H+R+KHI+   H IRE   ++ + L
Sbjct: 1251 WLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQL 1310

Query: 995  PHVSTSVQTADIFTKSLTRQ 1014
             +V T  Q ADIFTK L R+
Sbjct: 1311 EYVKTHDQVADIFTKPLKRE 1330


>gb|AAF16534.1| T26F17.17 [Arabidopsis thaliana]
          Length = 1291

 Score =  524 bits (1350), Expect = e-147
 Identities = 271/740 (36%), Positives = 432/740 (57%), Gaps = 11/740 (1%)

Query: 278  GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
            GE+ S  F ++ + NGI  Q + P +P QNGVAERKNR +L++ R++L    +P   W E
Sbjct: 538  GEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAE 597

Query: 338  ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
            A++ AV+L+NR P+ S+  ++P     G  P  S LRVFG + + H+P ++R+K   +S 
Sbjct: 598  AVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSE 657

Query: 398  ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIF-QENKYFFASHHDLVSSPISILPLFYDS 456
            +  F+GY  + KG+  Y+P+ ++  +SRN++F +E ++ + S+ +      +  P F + 
Sbjct: 658  KYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEE----DYNFFPHFEED 713

Query: 457  HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES-KPPERYINCMTA 515
                   +P          T      +S +     E  P  RS +E  +  E   N    
Sbjct: 714  EPEPTREEP----PSEEPTTRPTSLTSSQIEESSSERTPRFRSIQELYEVTENQENLTLF 769

Query: 516  TLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLR 575
             L +   P  +++A+E   W+ A++ E+ ++++N TW++   P+  K +G K+V+  K  
Sbjct: 770  CLFAECEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKN 829

Query: 576  SDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNA 635
            S G ++RYKA LV  G  Q  G+DYDE FAPVA++ TVR I+++AA   W +HQMD K A
Sbjct: 830  SKGEVERYKARLVAKGYSQRAGIDYDEVFAPVARLETVRLIISLAAQNKWKIHQMDFKLA 889

Query: 636  FLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQS 694
            FL+GD +EEVYI+ P G +     + V +LK++LYGLKQAPR W  +        +F + 
Sbjct: 890  FLNGDFEEEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAPRAWNTRIDKYFKEKDFIKC 949

Query: 695  RYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGL 754
             Y+ +L+++   + +++  +YVDD++ TG++       K  +   F M ++G ++Y+LG+
Sbjct: 950  PYEHALYIKIQKEDILIACLYVDDLIFTGNNPSMFEEFKKEMTKEFEMTDIGLMSYYLGI 1009

Query: 755  EVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLV 814
            EV      +F+ Q+ Y +++++   + ++  V TPME  +K  + E     DPT ++ LV
Sbjct: 1010 EVKQEDNRIFITQEGYAKEVLKKFKMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLV 1069

Query: 815  GSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKL 874
            GSL Y+T TRPDI +AV  VS++M+ P   H  A ++I+RY+ GT+  GL +   S  KL
Sbjct: 1070 GSLRYLTCTRPDILYAVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKL 1129

Query: 875  QAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEII 934
              YSD+DW    D RKST+G+  ++G+   +W  KKQ  V+ S+ EAEY A ++     I
Sbjct: 1130 VGYSDSDWGRDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAI 1189

Query: 935  WLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINL 994
            WLR LL EL   Q++PT +  DN SAI +A NPV+H+R+KHI+   H IRE   ++ + L
Sbjct: 1190 WLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQL 1249

Query: 995  PHVSTSVQTADIFTKSLTRQ 1014
             +V T  Q ADIFTK L R+
Sbjct: 1250 EYVKTHDQVADIFTKPLKRE 1269


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.339    0.147    0.496 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,663,389,574
Number of Sequences: 2540612
Number of extensions: 68974435
Number of successful extensions: 219574
Number of sequences better than 10.0: 1737
Number of HSP's better than 10.0 without gapping: 1679
Number of HSP's successfully gapped in prelim test: 58
Number of HSP's that attempted gapping in prelim test: 213369
Number of HSP's gapped (non-prelim): 2926
length of query: 1019
length of database: 863,360,394
effective HSP length: 138
effective length of query: 881
effective length of database: 512,755,938
effective search space: 451737981378
effective search space used: 451737981378
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 80 (35.4 bits)


Medicago: description of AC146527.6