Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC145330.14 - phase: 0 
         (1199 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAD41085.2| OSJNBb0011N17.2 [Oryza sativa (japonica cultivar...   774  0.0
gb|AAP54332.1| putative copia-like polyprotein [Oryza sativa (ja...   665  0.0
ref|XP_470422.1| putative polyprotein [Oryza sativa (japonica cu...   662  0.0
gb|AAL68641.1| polyprotein [Oryza sativa (japonica cultivar-group)]   658  0.0
ref|XP_470025.1| putative polyprotein [Oryza sativa (japonica cu...   639  0.0
ref|NP_918613.1| polyprotein [Oryza sativa (japonica cultivar-gr...   637  0.0
gb|AAT40550.1| putative receptor kinase [Solanum demissum]            593  e-168
emb|CAE05707.2| OSJNBb0065J09.3 [Oryza sativa (japonica cultivar...   555  e-156
emb|CAA36616.1| unnamed protein product [Solanum tuberosum] gi|4...   529  e-148
emb|CAB81200.1| putative retrotransposon polyprotein [Arabidopsi...   517  e-145
pir||F86470 probable retroelement polyprotein [imported] - Arabi...   513  e-143
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop...   510  e-143
gb|AAU89728.1| putative retroelement pol polyprotein-like [Solan...   509  e-142
pir||E96608 probable retroelement polyprotein F25P12.89 [importe...   509  e-142
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   506  e-141
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...   506  e-141
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi...   504  e-141
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...   503  e-140
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi...   500  e-139
gb|AAR24647.1| At2g23330 [Arabidopsis thaliana] gi|22655202|gb|A...   500  e-139

>emb|CAD41085.2| OSJNBb0011N17.2 [Oryza sativa (japonica cultivar-group)]
            gi|50925209|ref|XP_472906.1| OSJNBb0011N17.2 [Oryza
            sativa (japonica cultivar-group)]
          Length = 1262

 Score =  774 bits (1999), Expect = 0.0
 Identities = 483/1257 (38%), Positives = 676/1257 (53%), Gaps = 181/1257 (14%)

Query: 92   NILKFEVQ----RLDGKNYMEWSQTVRLILDGKGKLGFITGEIQMP-STTDPNYRFWKSE 146
            +ILK E+     +L  KNY+ WS+   LIL  KG  G++TGE++ P +T+   ++ W + 
Sbjct: 38   SILKIELMQNEIKLGVKNYLSWSRRALLILKTKGLEGYVTGEVKEPENTSSVEWKTWSTT 97

Query: 147  NSTIIAWLLSTMESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSD 206
            NS ++AWLL+++   I      + +A  +W  +   YS   N   + + + ++   +Q +
Sbjct: 98   NSLVVAWLLTSLIPAIATTVETISSASEMWKTLTKLYSGEGNVMLMVEAQEKISALRQGE 157

Query: 207  RDVTTYYNELMALWQELDLCYDDNWRCTEDSVLFLKRQ-ENDRVFVLLAGLNNCLDEVRG 265
            R V  Y  EL +LW +LD  YD       D +  +K+  E  RV   L GLN   +  R 
Sbjct: 158  RSVAEYVAELKSLWSDLDH-YDPLGLEHSDCIAKMKKWVERRRVIEFLKGLNPEFEGRRD 216

Query: 266  RILGRIPLPTLQETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDH 325
             +  +  LPTL E  + + +EE ++ V+   +      +  + +G E  +       C +
Sbjct: 217  AMFHQTTLPTLDEAIAAMAQEELKKKVLPSAAPCSPSPTYAIVQGKETRE-------CFN 269

Query: 326  CNRQWHTRDTCWK----LHGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQLD 381
            C    H    C       +G+     + G + GR     ++          +      L+
Sbjct: 270  CGEMGHLMRDCHAPRKPTYGRGRGVDRGGTRGGRGYAGRSNRGRGYGYRGDYKANAVTLE 329

Query: 382  QLYKMFGSQTPSCSIAQI-----GNFPNTALVSVKPS-PTWIIDSGATDHMTGESSLFAS 435
            +      S T   ++A       G+F N A +S+  S  +WI+DSGA+ H+TG S  F S
Sbjct: 330  E----GSSGTTPDNVANFAHSTSGSF-NQAFMSMNTSHSSWILDSGASRHVTGMSGEFTS 384

Query: 436  YSPCAGNHK--IKIADGSLSAIAGKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLD 493
            Y P +  HK  I+ ADG+        CQ           +  TGK +G      GL+YLD
Sbjct: 385  YKPYSFAHKETIQTADGT-------SCQ-----------ERRTGKKLGIGIMRDGLWYLD 426

Query: 494  ---------------------NGPDFKDQPQQIMLSQNSETNIGAPKENAQESIMELNPL 532
                                 NG   +     + ++++    +  PK    E++M    L
Sbjct: 427  RRGTNEDVCALMASTSKEVTENGVAERKNRHLLEIARSLMYTMNVPKFLWSEAVMTAAYL 486

Query: 533  VNEVPTNS-----------------------GATFSNQNHNEQILDIDSNEEEPEMPQQN 569
            +N  P+                         G T   ++H   I  +D    +      +
Sbjct: 487  INRTPSRILGMKTPYEMIFGKNEFVVPPRVFGCTCFVRDHRPSIGKLDPRAVKCIFIGYS 546

Query: 570  DSNKETK-------NRFNSSDPIWKGNV------------------YERRDHKRGDEGPI 604
             S K  K         F S D  ++ +V                    R DH +  EG I
Sbjct: 547  SSQKGYKCWSPSERRTFVSMDVTFRESVPFYGEKTDISSLFVDLDDLTRGDHDQQKEGEI 606

Query: 605  L----------------------QPCQESE---PRNDPNHHPNPGKSSIPKCRGKSSSIT 639
            L                       P QE E   P  + N      +  +P  +       
Sbjct: 607  LGLKENEQSKGKIVVGEIPCAIGDPVQEQEWRKPHEEENLQVYTRRMRLPTTQQVEVDDQ 666

Query: 640  TSDD-------------------PDLHIPIAIRKPVRSCTKHP----------------- 663
             SDD                    + ++PIAIRK +RS    P                 
Sbjct: 667  VSDDLTHVQVSSESGGEQIEIREEESNLPIAIRKGMRSNAGKPPQRYGFEIGDESGDEND 726

Query: 664  MAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIMT 723
            +A +VSY++LSS++ AF + L++  IPK+ +EA + P+W +A+L+E+ ALEKNKTW +++
Sbjct: 727  IANYVSYTSLSSTYKAFVASLNSAIIPKDWKEAKQDPRWHQAMLDELEALEKNKTWDLVS 786

Query: 724  LPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRVL 783
             P GK  V CKWV+ VK N D  VERYKARLVAKG++Q YGIDY ETFAPVAK++TVR +
Sbjct: 787  YPNGKKVVNCKWVYAVKQNPDGKVERYKARLVAKGYSQTYGIDYDETFAPVAKMSTVRTI 846

Query: 784  LSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFED-KFGLNVCKLQKSLYGLKQSP 842
            +S AVN DWPL+QLDVKNAFL+GDLQEEVYM+ PPGF   +    V +L+KSLYGLKQSP
Sbjct: 847  ISCAVNFDWPLHQLDVKNAFLHGDLQEEVYMEIPPGFATLQTKGKVLRLKKSLYGLKQSP 906

Query: 843  RAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKK 902
            RAWF++F  ++   GY Q   DHT+F   S D  I IL VYVDD+I+TG+D  E+ RLK+
Sbjct: 907  RAWFDRFRRAMCAMGYKQCNGDHTVFYHHSGD-HITILAVYVDDMIITGNDCSEITRLKQ 965

Query: 903  NLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNA 962
            NL+KEFE+KDLG LKYFLG+E+ARS +GIV+SQRKY LDLL +TGM GCRPA TP++ N 
Sbjct: 966  NLSKEFEVKDLGQLKYFLGIEIARSPRGIVLSQRKYALDLLSDTGMLGCRPASTPVDQNH 1025

Query: 963  KLWEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRY 1022
            KL  +   PV+  RYQRLVG+LIYL HTRPDI ++VS+VS++MH P   H++AVYRILRY
Sbjct: 1026 KLCAESGNPVNKERYQRLVGRLIYLCHTRPDITYAVSMVSRYMHDPRSGHMDAVYRILRY 1085

Query: 1023 LKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVA 1082
            LK +PGKGL+FKK    +V  + DADWA    DR+ST+GYC +V GNLV+WRSKKQ VV+
Sbjct: 1086 LKGSPGKGLWFKKNGHLEVEGYCDADWASCPDDRRSTSGYCVFVGGNLVSWRSKKQPVVS 1145

Query: 1083 RSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKH 1142
            RS+AEAE+RAM+  + ELLW++ LL EL L +D P+KL+CD+K+AISIA+NPVQHDRTKH
Sbjct: 1146 RSTAEAEYRAMSVSLSELLWLRNLLSELMLPVDTPMKLWCDNKSAISIANNPVQHDRTKH 1205

Query: 1143 IEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAPT 1199
            +E+DR FIKEK+  G + L +V S  Q AD  TK L          K+GM +IY P+
Sbjct: 1206 VELDRFFIKEKLDEGVLELEFVMSGGQVADCFTKGLGVKECNSSCDKMGMIDIYHPS 1262


>gb|AAP54332.1| putative copia-like polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|37535486|ref|NP_922045.1| putative
            copia-like polyprotein [Oryza sativa (japonica
            cultivar-group)] gi|22094347|gb|AAM91874.1| putative
            copia-like polyprotein [Oryza sativa (japonica
            cultivar-group)]
          Length = 894

 Score =  665 bits (1716), Expect = 0.0
 Identities = 329/573 (57%), Positives = 427/573 (74%), Gaps = 19/573 (3%)

Query: 645  DLHIPIAIRKPVRSCTKHP-----------------MAKFVSYSNLSSSFAAFTSQLSTV 687
            + ++PIAIRK VRS    P                 +A +VSY++L S++ AF + L++V
Sbjct: 323  ETNLPIAIRKGVRSNAGKPPQRYGFEAQGVNDDENNIANYVSYASLLSTYKAFVTSLNSV 382

Query: 688  EIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTV 747
            EIP + +EA + P+W +A+LEE+ ALEKNKTW ++  P GK  V CKWV+TVK N D  V
Sbjct: 383  EIPNDWREAKQDPRWHQAMLEELEALEKNKTWDLVPFPKGKKVVNCKWVYTVKQNPDENV 442

Query: 748  ERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGD 807
            ERYKARLVAKG++Q YGIDY ETFAPVAK++TVR L+S A N DWPL+QLDVKNAFL+GD
Sbjct: 443  ERYKARLVAKGYSQTYGIDYDETFAPVAKMSTVRTLISCAANFDWPLHQLDVKNAFLHGD 502

Query: 808  LQEEVYMDSPPGFE-DKFGLNVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHT 866
            LQEEVYM+ PPGF   +    V +L+KSLYGLKQSPRAWF++F  ++   GY Q   DHT
Sbjct: 503  LQEEVYMEIPPGFATSQTEGKVLRLKKSLYGLKQSPRAWFDRFRRAMCGMGYKQCNGDHT 562

Query: 867  LFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVAR 926
            +F R  N G   IL+VYVDD+I+TGDD +E+ RLK+NL+KEFE+KDLG LKYFLG+E+AR
Sbjct: 563  VFYRH-NRGLKTILVVYVDDMIITGDDCLEISRLKQNLSKEFEVKDLGQLKYFLGIEIAR 621

Query: 927  SRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKLIY 986
            S +GIV+SQRKY+LDLL +TGM GCRPA T +E N KL  +   PV+  RYQRLVG+LIY
Sbjct: 622  SPRGIVLSQRKYVLDLLSDTGMLGCRPASTLIEQNHKLCAESGDPVNKERYQRLVGRLIY 681

Query: 987  LAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTD 1046
            L HTRPDI ++VSVVS++MH P   H++ VYRILRYLK++PGKG++FKK    D+  + D
Sbjct: 682  LCHTRPDITYAVSVVSRYMHDPRSGHMDVVYRILRYLKASPGKGIWFKKNGHLDMEGYCD 741

Query: 1047 ADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKL 1106
            ADW   + DR+ST+GYC ++ GNLV+WRSKK+ VV+RS+AEAE+R+M+  + ELLW++ L
Sbjct: 742  ADWGSCLDDRRSTSGYCVFIGGNLVSWRSKKESVVSRSTAEAEYRSMSMSLSELLWLKNL 801

Query: 1107 LEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTS 1166
            L ELKL     +KL+CD+K+AI+IA+NPVQHDRTKH+EIDR FIKE++  GT+ L +V S
Sbjct: 802  LAELKLSTSTSMKLWCDNKSAINIANNPVQHDRTKHVEIDRFFIKERMDEGTLNLGFVNS 861

Query: 1167 NEQTADILTKSLARPNFERLIVKLGMTNIYAPT 1199
             EQ  D LTK+L          K+GM +IY P+
Sbjct: 862  GEQVVDSLTKALGARECTSSCSKMGMIDIYRPS 894


>ref|XP_470422.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|27573360|gb|AAO20078.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
          Length = 1299

 Score =  662 bits (1708), Expect = 0.0
 Identities = 332/570 (58%), Positives = 421/570 (73%), Gaps = 19/570 (3%)

Query: 648  IPIAIRKPVRSCTKHP-----------------MAKFVSYSNLSSSFAAFTSQLSTVEIP 690
            +PIAIRK VRS    P                 ++ +VSY +LSS++ AF + L +V+IP
Sbjct: 731  LPIAIRKSVRSNAGKPPLRYGFEAQDEGDDENNISNYVSYDSLSSTYKAFIASLDSVQIP 790

Query: 691  KNVQEALKIPKWKEAVLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERY 750
            K+ +EA + P+W +A+L+E+ ALEKNKTW ++  P GK  V CKWV+TVK N D  VERY
Sbjct: 791  KDWREAKQDPRWHQAMLDELEALEKNKTWDLVPFPKGKKIVNCKWVYTVKQNPDGKVERY 850

Query: 751  KARLVAKGFTQAYGIDYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQE 810
            KARLVAKG++Q YGIDY ETFAPVAK++TVR L+S A N DWPL+QLDVKNAFL+ DLQE
Sbjct: 851  KARLVAKGYSQTYGIDYDETFAPVAKMSTVRTLISCAANFDWPLHQLDVKNAFLHRDLQE 910

Query: 811  EVYMDSPPGFE-DKFGLNVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFM 869
            EVYMD PPGF   +    V +L+KSLYGLKQSPRAWF++F  ++    Y Q   DHT+F 
Sbjct: 911  EVYMDVPPGFATSQTKGKVLRLKKSLYGLKQSPRAWFDRFRRAMCAMDYKQCNGDHTVFY 970

Query: 870  RFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRK 929
              S D  I IL VYVDD+I+TG+D +E+ RLK+NL+KEFE+KDLG L+YFLG+E+ARS +
Sbjct: 971  HHSGD-HITILAVYVDDMIITGNDCLEITRLKRNLSKEFEVKDLGQLRYFLGIEIARSPR 1029

Query: 930  GIVVSQRKYILDLLEETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKLIYLAH 989
            GIV+SQRKY+LDLL ETGM GC P  TP++ N KL  +   PV+  RYQRLVG+LIYL H
Sbjct: 1030 GIVISQRKYVLDLLSETGMLGCCPVSTPIDQNHKLCAESGDPVNRERYQRLVGRLIYLCH 1089

Query: 990  TRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADW 1049
            TRPDI ++VS+VS++MH P   H+EAVYRILRYLK +PGKGL+FKK     +  + DADW
Sbjct: 1090 TRPDITYAVSMVSRYMHDPRSSHMEAVYRILRYLKGSPGKGLWFKKNGHLKIEGYCDADW 1149

Query: 1050 AGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEE 1109
            A  + DR+ST+GYC YV GNLV+WRSKKQ VV+RS+AEAE+RAMA  + ELLW++ LL E
Sbjct: 1150 ASCLDDRRSTSGYCVYVGGNLVSWRSKKQSVVSRSTAEAEYRAMAASLSELLWLRNLLVE 1209

Query: 1110 LKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQ 1169
            LK+  + P+KL CD+K+AI+IA+NPVQHDRTKH+EIDR FIKEK+  G + L +VTS  Q
Sbjct: 1210 LKILGNTPMKLLCDNKSAINIANNPVQHDRTKHVEIDRFFIKEKLDEGVLELGFVTSGGQ 1269

Query: 1170 TADILTKSLARPNFERLIVKLGMTNIYAPT 1199
             AD LTK L          K+GM +IY P+
Sbjct: 1270 VADCLTKGLGVKECNCSCDKMGMIDIYHPS 1299



 Score =  123 bits (309), Expect = 3e-26
 Identities = 104/425 (24%), Positives = 189/425 (44%), Gaps = 42/425 (9%)

Query: 100 RLDG-KNYMEWSQTVRLILDGKGKLGFITGEIQMPST-TDPNYRFWKSENSTIIAWLLST 157
           +L+G KNY+ WS+   LIL  KG  G++TGEI+ P   +   ++ W + NS ++AWLL++
Sbjct: 50  KLEGVKNYLSWSRRALLILKTKGLEGYVTGEIKEPENISSVEWKTWSTTNSLVVAWLLTS 109

Query: 158 MESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELM 217
           +   I      + +A  +W  +   YS   N   + + + ++   +Q +R V  Y  EL 
Sbjct: 110 LIPAIATTVETISSASEMWKTLTNLYSGEGNVMLMVEAQEKISVLRQGERSVAEYVAELK 169

Query: 218 ALWQELDLCYDDNWRCTEDSVLFLKRQ-ENDRVFVLLAGLNNCLDEVRGRILGRIPLPTL 276
            LW +LD  YD       D +  +++  E  RV   L GLN+  +  R  +  +  LP+L
Sbjct: 170 HLWSDLD-HYDPLGLEHPDCIAKMRKWIERRRVIEFLKGLNSEFEGRRDAMFHQTTLPSL 228

Query: 277 QETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWH-TRDT 335
            E  + + +EE ++ V+   + S    + +V +  E  +       C +C    H  RD 
Sbjct: 229 DEAIAAMAQEELKKKVLPSATPSSPSPTYVVAQSKETRE-------CFNCGEMGHLIRD- 280

Query: 336 CWKLHGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQLDQLYKMFGSQTPSCS 395
            ++   KP   + + G  G A +         +    + +  +    +  +  S + S +
Sbjct: 281 -YRAPRKPSYGRGRFGDRGGA-RGGRGYAGRGNRGRGYEYRSDHRANVVTLEESCSGSTN 338

Query: 396 I-------AQIGNFPNTALVSVKPS-PTWIIDSGATDHMTGESSLFASYSPCAGNHKIKI 447
           +       +  GN  N A +S+  S   WI+DSGA+ H+T                 + +
Sbjct: 339 VDVANLVHSSSGN-SNQAFMSINSSHSNWILDSGASRHVT-----------------VNL 380

Query: 448 ADGSLSAIAGKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLDNGPDFKDQPQQIML 507
              S S +   +C+ +    +C+ ++  TGK +G   +  GL+YLD     +D    +M 
Sbjct: 381 VSIS-SLVDHMNCRVSLDRENCLIQERETGKKLGIGVRRDGLWYLDRKETSEDVCLALMA 439

Query: 508 SQNSE 512
             + E
Sbjct: 440 PTSEE 444


>gb|AAL68641.1| polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1472

 Score =  658 bits (1698), Expect = 0.0
 Identities = 329/575 (57%), Positives = 425/575 (73%), Gaps = 19/575 (3%)

Query: 643  DPDLHIPIAIRKPVRSCTKHP-----------------MAKFVSYSNLSSSFAAFTSQLS 685
            + + ++PIAIRK +RS    P                 +A +VSY++LSS++ AF + L+
Sbjct: 899  EEESNLPIAIRKGMRSNAGKPPQRYGFEIGDESGDENDIANYVSYTSLSSTYRAFVASLN 958

Query: 686  TVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDN 745
            +  IPK+ +EA + P+W +A+L+E+ ALEKNKTW +++ P GK  V CKWV+ VK N D 
Sbjct: 959  SAIIPKDWKEAKQDPRWHQAMLDELEALEKNKTWDLVSYPNGKKVVNCKWVYAVKQNPDG 1018

Query: 746  TVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLN 805
             VERYKARLVAKG++Q YGIDY ETFAPVAK++TVR ++S AVN DWPL+QLDVKNAFL+
Sbjct: 1019 KVERYKARLVAKGYSQTYGIDYDETFAPVAKMSTVRTIISCAVNFDWPLHQLDVKNAFLH 1078

Query: 806  GDLQEEVYMDSPPGFED-KFGLNVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSD 864
            GDLQEEVYM+ PPGF   +    V +L+KSLYGLKQSPRAWF++F  ++   GY Q   D
Sbjct: 1079 GDLQEEVYMEIPPGFATLQTKGKVLRLKKSLYGLKQSPRAWFDRFRRAMCAMGYKQCNGD 1138

Query: 865  HTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEV 924
            HT+F   S D  I IL VYVDD+I+TG+D  E+ RLK+NL+KEFE+KDLG LKYFLG+E+
Sbjct: 1139 HTVFYHHSGD-HITILAVYVDDMIITGNDCSEITRLKQNLSKEFEVKDLGQLKYFLGIEI 1197

Query: 925  ARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKL 984
            ARS +GIV+SQRKY LDLL +TGM GCRPA TP++ N KL  +   PV+  RYQRLVG+L
Sbjct: 1198 ARSPRGIVLSQRKYALDLLSDTGMLGCRPASTPVDQNHKLCAESGNPVNKERYQRLVGRL 1257

Query: 985  IYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIF 1044
            IYL HTRPDI ++VS+VS++MH P   H++AVYRILRYLK +PGKGL+FKK    +V  +
Sbjct: 1258 IYLCHTRPDITYAVSMVSRYMHDPRSGHMDAVYRILRYLKGSPGKGLWFKKNGHLEVEGY 1317

Query: 1045 TDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQ 1104
             DA WA    DR+ST+GYC +V GNLV+WRSKKQ VV+RS+AEAE+RAM+  + ELLW++
Sbjct: 1318 CDAHWASCPDDRRSTSGYCVFVGGNLVSWRSKKQPVVSRSTAEAEYRAMSVSLSELLWLR 1377

Query: 1105 KLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYV 1164
             LL EL L +D P+KL+CD+K+AISIA+NPVQHDRTKH+E+DR FIKEK+  G + L +V
Sbjct: 1378 NLLSELMLPVDTPMKLWCDNKSAISIANNPVQHDRTKHVELDRFFIKEKLDEGVLELEFV 1437

Query: 1165 TSNEQTADILTKSLARPNFERLIVKLGMTNIYAPT 1199
             S  Q AD  TK L          K+GM +IY P+
Sbjct: 1438 MSGGQVADCFTKGLGVKECNSSCDKMGMIDIYHPS 1472



 Score =  147 bits (372), Expect = 2e-33
 Identities = 118/455 (25%), Positives = 201/455 (43%), Gaps = 66/455 (14%)

Query: 92  NILKFEVQ----RLDG-KNYMEWSQTVRLILDGKGKLGFITGEIQMP-STTDPNYRFWKS 145
           +ILK E+     +L+G KNY+ WS+   LIL  KG  G++TGE++ P +T+   ++ W +
Sbjct: 38  SILKIELMQNEIKLEGVKNYLSWSRRALLILKTKGLEGYVTGEVKEPENTSSVEWKTWST 97

Query: 146 ENSTIIAWLLSTMESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQS 205
            NS ++AWLL+++   I      + +A  +W  +   YS   N   + + + ++   +Q 
Sbjct: 98  TNSLVVAWLLTSLIPAIATTVETISSASEMWKTLTKLYSGEGNVMLMVEAQEKISALRQG 157

Query: 206 DRDVTTYYNELMALWQELDLCYDDNWRCTEDSVLFLKR-QENDRVFVLLAGLNNCLDEVR 264
           +R V  Y  EL +LW +LD  YD       D +  +K+  E  RV   L GLN   +  R
Sbjct: 158 ERSVAEYVAELKSLWSDLD-HYDPLGLEHSDCIAKMKKWVERRRVIEFLKGLNPEFEGRR 216

Query: 265 GRILGRIPLPTLQETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCD 324
             +  +  LPTL E  + + +EE ++ V+   +      +  + +G E  +       C 
Sbjct: 217 DAMFHQTTLPTLDEAIAAMAQEELKKKVLPSAAPCSPSPTYAIVQGKETRE-------CF 269

Query: 325 HCNRQWHTRDTCW----KLHGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQL 380
           +C    H    C       +G+     + G + GR     ++          +      L
Sbjct: 270 NCGEMGHLMRDCHAPRKPTYGRGRGVDRGGTRGGRGYAGRSNRGRGYGYRGDYKANAVTL 329

Query: 381 DQLYKMFGSQTPSCSIAQI-----GNFPNTALVSVKPS-PTWIIDSGATDHMTGESSLFA 434
           ++      S T   ++A       G+F N A +S+  S  +WI+DSGA+ H+TG S  F 
Sbjct: 330 EE----GSSGTTPDNVANFAHSTSGSF-NQAFMSMNTSHSSWILDSGASRHVTGMSGEFT 384

Query: 435 SYSPCAGNHK--IKIADGSLSAIAGK---------------------------------- 458
           SY P +  HK  I+ ADG+   + G+                                  
Sbjct: 385 SYKPYSFAHKETIQTADGTSCQVKGEGIVQCTPSITLSSVLYVHSFPVNLISISSLVDNM 444

Query: 459 DCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLD 493
           DC+ +    +C+ ++  TGK +G   +  GL+YLD
Sbjct: 445 DCRVSLDRENCLIQERRTGKKLGIGIRRDGLWYLD 479


>ref|XP_470025.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
            gi|30103001|gb|AAP21414.1| putative polyprotein [Oryza
            sativa (japonica cultivar-group)]
          Length = 1393

 Score =  639 bits (1648), Expect = 0.0
 Identities = 341/675 (50%), Positives = 449/675 (66%), Gaps = 37/675 (5%)

Query: 557  DSNEEEPEM-PQQNDSNK------------ETKNRFNSSDPIWKGNVYERRDH-KRGDEG 602
            D+ +E+ EM P + D  +            E   R    D +    VY+RR    +G++ 
Sbjct: 724  DTQDEDREMVPHEEDGEEGEVVVGTIPCPMEGAERVKQKDVL----VYQRRRFDSQGEKR 779

Query: 603  PILQPCQESEPRNDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLH---IPIAIRKPVRSC 659
              L   Q  E  +     P   +S  P     S     +  P L    +P+  R+  RS 
Sbjct: 780  KGLVQSQIEELPHPKCPVPESSQSLSPPASLASLETIGNTSPTLEHVELPLVQRRETRSN 839

Query: 660  T--------------KHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEA 705
                            H +A +++YS++S ++  F + L T+ IPK+ + A + PKWK+A
Sbjct: 840  AGRPPIRLGFEHLSFMHDIANYITYSHVSPAYKTFIASLQTMPIPKDWKCAKQDPKWKDA 899

Query: 706  VLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGI 765
            + EE+ AL KNKTW+++ LP  K  VGCKWVFTVK   +  V+RYKARLVAKG++Q YGI
Sbjct: 900  MKEELNALVKNKTWELVKLPPEKRAVGCKWVFTVKQTPEGKVDRYKARLVAKGYSQTYGI 959

Query: 766  DYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFG 825
            DY ETFAPVAK+ TVR L+S AVN  WPL+QLDVKNAFL+GDL EEVYM+ PPGF +   
Sbjct: 960  DYDETFAPVAKMGTVRALVSCAVNFGWPLHQLDVKNAFLHGDLHEEVYMEIPPGFGNSQT 1019

Query: 826  LN-VCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYV 884
            +  VCKL+KSLYGLKQSPRAWF++F  +V   GY Q   DHT+F +      I IL VYV
Sbjct: 1020 VGKVCKLKKSLYGLKQSPRAWFDRFRHAVCDMGYSQCNGDHTVFYKHRGT-HITILAVYV 1078

Query: 885  DDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLE 944
            DDI++TGDD+ E+  LK+ L K FE+KDLG L+YFLG+E+ARS KGIV+SQRKY+LDLL 
Sbjct: 1079 DDIVITGDDVEEIRCLKERLGKAFEVKDLGPLRYFLGIEIARSSKGIVLSQRKYVLDLLT 1138

Query: 945  ETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQF 1004
            +TGM GCR + TP++ N +L  +   PVD   YQRLVG+LIYL HTRPDI+++VSVVS++
Sbjct: 1139 DTGMLGCRASTTPIDRNHQLCAQSGDPVDKEAYQRLVGRLIYLCHTRPDISYAVSVVSRY 1198

Query: 1005 MHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCA 1064
            MH P   HL+ V++ILRYLK  PGKGL+F+K    +V  + DADWA S+ DR+ST+GYC 
Sbjct: 1199 MHDPRTGHLDVVHKILRYLKGTPGKGLWFRKNGHLNVEGYCDADWASSMDDRRSTSGYCV 1258

Query: 1065 YVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDS 1124
            +V GNLV+WRSKKQ VVARS+AEAE+RAMA  + E+LW++ LL EL++     + L CD+
Sbjct: 1259 FVGGNLVSWRSKKQAVVARSTAEAEYRAMALSLSEMLWMRSLLTELRVLRSDTVMLHCDN 1318

Query: 1125 KAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFE 1184
            K+AISIA+NPVQHDRTKH+EIDR FIKEKI SG + L Y+ S EQ AD LTK L     +
Sbjct: 1319 KSAISIANNPVQHDRTKHVEIDRFFIKEKIDSGVLRLEYIKSCEQLADCLTKGLGPSEIQ 1378

Query: 1185 RLIVKLGMTNIYAPT 1199
             +  K+GM +I+ P+
Sbjct: 1379 SICNKMGMIDIFCPS 1393



 Score =  138 bits (347), Expect = 1e-30
 Identities = 119/451 (26%), Positives = 195/451 (42%), Gaps = 68/451 (15%)

Query: 100 RLDG-KNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPN-YRFWKSENSTIIAWLLST 157
           RL+G KNY+ W +  +L+L  KG   F+    + PS  +   +R W + NST+++WL+++
Sbjct: 55  RLEGSKNYLSWCRRAQLMLRAKGVDHFLLESCEEPSDKESQAWRTWNTTNSTVVSWLMTS 114

Query: 158 MESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELM 217
           +  +I +    +  A  VW  +   YS   N   + + ++++   KQ  R V  Y +EL 
Sbjct: 115 VAPSIGRMIEAIQNAAVVWKTLSNMYSGEGNVMMMVEAQNKVENLKQEGRTVQEYASELQ 174

Query: 218 ALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQ 277
            LW +LD       +  +D V+  K  +  RV   L GLN   ++ R  +  +  LPT++
Sbjct: 175 QLWADLDHYDPLQLKHEDDIVIGNKWLQRRRVIHFLKGLNKEFEDRRAAMFHQATLPTME 234

Query: 278 ETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWHTRDTCW 337
           E  S + +EE R  +M G +      SA +   N E         C +C +  H    C 
Sbjct: 235 EAISAMVQEEMRLRLMRGTNPI---RSAYIAADNRE---------CYNCGQVGHVSYNCP 282

Query: 338 KL------------HG----------------KPPNWKKKGGKEGRALQATTSDQEH--Q 367
                         HG                +      +GG+ G   +     Q +  +
Sbjct: 283 TSRNIGGRGSIRGGHGGTRGGFRGDRGVFGGNRGGRGGDRGGRVGGRGRGRGVPQANAVK 342

Query: 368 SSSSSFPFTKEQLDQLYKMFGSQT--PSCSIAQIGNFPN----------TALVSVKPSP- 414
               +     EQ+ Q  +   ++T   S +    GNF N           AL S    P 
Sbjct: 343 EDGKAVTLIGEQVTQWEEWQKNKTNESSNTTTHFGNFANYAQVGEGTQAQALASTYRHPI 402

Query: 415 TWIIDSGATDHMTGESSLFASYSPCAGNHKIKIADGS-----------LSAIAGKDCQAN 463
            WIIDSGA+ H+TG  + F SY+P   +  I+IADG+            SAI    C   
Sbjct: 403 DWIIDSGASKHVTGLHNTFTSYTPYIHSETIQIADGTSKPIHVNLLSISSAIDQLKCIVV 462

Query: 464 FFHSHCIFKDLNTGKMIGSAKKSGGLYYLDN 494
           F  + C+F++  TG+ IG+  +  GL+Y+++
Sbjct: 463 FDENSCLFQEKGTGRRIGTGVRRDGLWYINH 493


>ref|NP_918613.1| polyprotein [Oryza sativa (japonica cultivar-group)]
          Length = 1554

 Score =  637 bits (1642), Expect = 0.0
 Identities = 341/672 (50%), Positives = 447/672 (65%), Gaps = 37/672 (5%)

Query: 557  DSNEEEPEM-PQQNDSNK------------ETKNRFNSSDPIWKGNVYERRDH-KRGDEG 602
            D+ +E+ EM P + D  +            E   R    D +    VY+RR    +G++ 
Sbjct: 885  DTQDEDREMVPHEEDGEEGEVVVGTIPCPMEGAERVKQKDVL----VYQRRRFDSQGEKR 940

Query: 603  PILQPCQESEPRNDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLH---IPIAIRKPVRS- 658
              L   Q  E  +     P   +S  P     S     +  P L    +P+A R+  RS 
Sbjct: 941  KGLVQSQIEELPHQKCPVPESSQSLSPPASLASLETIGNTSPTLEHVELPLAQRRETRSN 1000

Query: 659  -------------CTKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEA 705
                          + H +A +++YS++S ++  F + L TV IPK+ + A + PKWK+A
Sbjct: 1001 AGRPPIRLGFEHLSSMHDIANYITYSHVSPAYKTFIASLQTVPIPKDWKCAKQDPKWKDA 1060

Query: 706  VLEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGI 765
            + EE+ AL KNKTW+++ LP  K  VGCKWVFTVK   +  V+ YKARLVAKG++Q YGI
Sbjct: 1061 MKEELNALVKNKTWELVKLPPEKRAVGCKWVFTVKQTPEGKVDMYKARLVAKGYSQTYGI 1120

Query: 766  DYSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFG 825
            DY ETFAPVAK+ TVR L+S AVN  WPL+QLDVKNAFL+GDL EEVYM+ PPGF +   
Sbjct: 1121 DYDETFAPVAKMGTVRALVSCAVNFGWPLHQLDVKNAFLHGDLHEEVYMEIPPGFGNSQT 1180

Query: 826  LN-VCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYV 884
            +  VCKL+KSLYGLKQSPRAWF++F  +V   GY Q   DHT+F +      I IL VYV
Sbjct: 1181 VGKVCKLKKSLYGLKQSPRAWFDRFRHAVCDMGYSQCNGDHTVFYKHRGT-HITILAVYV 1239

Query: 885  DDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLE 944
            DDI++TGDD+ E+  LK+ L K FE+KDLG L+YFLG+E+ARS KGIV+SQRKY+LDLL 
Sbjct: 1240 DDIVITGDDVEEIRCLKERLGKAFEVKDLGPLRYFLGIEIARSSKGIVLSQRKYVLDLLT 1299

Query: 945  ETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQF 1004
            +TGM GCR + TP++ N +L  +   PVD   YQRLVG+LIYL HTRPDI+++VSVVS++
Sbjct: 1300 DTGMLGCRASTTPIDRNHQLCAQSGDPVDKEAYQRLVGRLIYLCHTRPDISYAVSVVSRY 1359

Query: 1005 MHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCA 1064
            MH P   HL+ V++ILRYLK  PGKGL+F+K    +V  + DADWA S+ DR+ST+GYC 
Sbjct: 1360 MHDPRTGHLDVVHKILRYLKGTPGKGLWFRKNGHLNVEGYCDADWASSMDDRRSTSGYCV 1419

Query: 1065 YVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDS 1124
            +V GNLV+WRSKKQ VVARS+AEAE+RAMA    E+LW++ LL EL++     + L CD+
Sbjct: 1420 FVGGNLVSWRSKKQAVVARSTAEAEYRAMALSFSEMLWMRSLLTELRVLRSDTVMLHCDN 1479

Query: 1125 KAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFE 1184
            K+AISIA+NPVQHDRTKH+EIDR FIKEKI SG + L Y+ S EQ AD LTK L     +
Sbjct: 1480 KSAISIANNPVQHDRTKHVEIDRFFIKEKIDSGVLRLEYIKSCEQLADCLTKGLGPSEIQ 1539

Query: 1185 RLIVKLGMTNIY 1196
             +  K+GM +I+
Sbjct: 1540 SVCNKMGMIDIF 1551



 Score =  126 bits (317), Expect = 4e-27
 Identities = 120/474 (25%), Positives = 195/474 (40%), Gaps = 91/474 (19%)

Query: 100 RLDG-KNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPN-YRFWKSENSTIIAWLLST 157
           RL+G KNY+ W +  +L+L  KG   F+    + PS  +   +R W + NST+++WL++ 
Sbjct: 55  RLEGSKNYLSWCRRAQLMLRAKGVDHFLLESCEEPSDKESQAWRTWNTTNSTVVSWLMTL 114

Query: 158 MESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELM 217
           +  +I +    +  A  VW  +   YS   N   + + ++++   KQ  R V  Y +EL 
Sbjct: 115 VAPSIGRMIEAIQNAAVVWKTLSNMYSGEGNVMMMVEAQNKVENLKQEGRTVQEYASELQ 174

Query: 218 ALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQ 277
            LW +LD       +  +D V+  K  +  RV   L GLN   ++ R  +  +  LPT++
Sbjct: 175 QLWADLDHYDPLLLKHEDDIVIGNKWLQRRRVIHFLKGLNKEFEDRRAAMFHQATLPTME 234

Query: 278 ETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWHTRDTCW 337
           E  S + +EE R  +M G +      SA +   N E         C +C +  H    C 
Sbjct: 235 EAISAMVQEEMRLRLMRGTNPI---RSAYIAADNRE---------CYNCGQVGHVSYNCP 282

Query: 338 KL------------HG----------------KPPNWKKKGGKEGRALQATTSDQEHQSS 369
                         HG                +      +GG+ G   +     Q + ++
Sbjct: 283 TSRNIGGRGSIRGGHGGTRGGFGGDRGGFGGNRGGRGGDRGGRVGGRGRGRGVPQANAAT 342

Query: 370 SSSFPFT--KEQLDQLYKMFGSQT--PSCSIAQIGNFPN----------TALVSVKPSP- 414
                 T   EQ+ Q  +   ++T   S +    GNF N           AL S    P 
Sbjct: 343 EDGKAVTLIGEQVTQWEEWQKNKTNESSNTTTHFGNFANYAQVGEGTQAQALASTYRHPI 402

Query: 415 TWIIDSGATDHMTGESSLFASYSPCAGNHKIKIADGS----------------------- 451
            WIIDSGA+ H+TG  + F SY+P   +  I+IADG+                       
Sbjct: 403 DWIIDSGASKHVTGLHNTFTSYTPYIHSETIQIADGTSKPIHGIGSVECTSSMNLSSVLH 462

Query: 452 -----------LSAIAGKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLDN 494
                       SAI    C   F  + C+F++  TG+ IG+  +  GL+Y+++
Sbjct: 463 VPSFPVNLLSVSSAIDQLKCIVVFDENSCLFQEKWTGRRIGTGVRRDGLWYINH 516


>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
          Length = 1358

 Score =  593 bits (1530), Expect = e-168
 Identities = 315/600 (52%), Positives = 421/600 (69%), Gaps = 19/600 (3%)

Query: 610  ESEPRNDPNHHPN-----PGKSSIPKCRGKSSSITTSDDPDLHIPIAIRKPVRSCTKHPM 664
            ES+P    ++HP+     P    +P      S++T++       P+ +  P+ +  + P 
Sbjct: 766  ESQPYYTSSNHPDVSMVLPIPQVLPVPTFVESTVTSTS------PVVV-PPLLTYHRRPR 818

Query: 665  AKFVSYSNLSSSFAAFTSQLSTVEIPKNVQ--EALKIPKWKEAVLEEMRALEKNKTWKIM 722
               V   +  +   A T+ L     P  +Q  EAL    W++A+++EM AL K+ TW+++
Sbjct: 819  PTLVPDDSCHAPDPAPTADLPPPSQPLALQKGEALSHSGWRQAMVDEMSALHKSGTWELV 878

Query: 723  TLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRV 782
            +LPAGK+TVGC+WV+ VK   D  V+R KARLVAKG+TQ +G+DYS+TFAPVAK+ +VR+
Sbjct: 879  SLPAGKSTVGCRWVYAVKIGPDGQVDRLKARLVAKGYTQIFGLDYSDTFAPVAKIASVRL 938

Query: 783  LLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGF--EDKFGLNVCKLQKSLYGLKQ 840
             LS+A    WPL+QLD+KNAFL+GDL+EEVYM+ PPGF  + +    VC+L++SLYGLKQ
Sbjct: 939  FLSMAAVRHWPLHQLDIKNAFLHGDLEEEVYMEQPPGFVAQGESSSLVCRLRRSLYGLKQ 998

Query: 841  SPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRL 900
            SPRAWF KF+  +++ G  ++ +DH++F R S   +   L+VYVDDI++TG+D   +  L
Sbjct: 999  SPRAWFGKFSTVIQEFGMTRSGADHSVFYRHSAPSRCIYLVVYVDDIVITGNDQDGITDL 1058

Query: 901  KKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMEL 960
            K++L K F+ KDLG LKYFLG+EVA+SR GIV+SQRKY LD+LEETGM GCRP DTPM+ 
Sbjct: 1059 KQHLFKHFQTKDLGRLKYFLGIEVAQSRSGIVISQRKYALDILEETGMMGCRPVDTPMDP 1118

Query: 961  NAKLWEKGNVPV-DIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRI 1019
            N KL      P+ +  RY+RLVGKL YL  TRPDI+F VSVVSQFM SP + H EAV RI
Sbjct: 1119 NVKLLPGQGEPLSNPERYRRLVGKLNYLTVTRPDISFPVSVVSQFMTSPCDSHWEAVVRI 1178

Query: 1020 LRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQG 1079
            LRY+KS PGKGL F+      +  +TDADWAGS  DR+ST+GYC  V GNLV+W+SKKQ 
Sbjct: 1179 LRYIKSAPGKGLLFEDQGHEHIIGYTDADWAGSPSDRRSTSGYCVLVGGNLVSWKSKKQN 1238

Query: 1080 VVARSSAEAEFRAMAQGICELLWIQKLLEELKL-KIDLPLKLFCDSKAAISIAHNPVQHD 1138
            VVARSSAE+E+RAMA   CEL+WI++LL ELK  K+D  ++L CD++AA+ IA NPV H+
Sbjct: 1239 VVARSSAESEYRAMATATCELVWIKQLLGELKFGKVD-KMELVCDNQAALHIASNPVFHE 1297

Query: 1139 RTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAP 1198
            RTKHIEID HF++EKI+SG I   +V SN+Q ADI TKSL  P    +  KLG  ++YAP
Sbjct: 1298 RTKHIEIDCHFVREKILSGDIVTKFVKSNDQLADIFTKSLTCPRINYICNKLGTYDLYAP 1357



 Score =  125 bits (315), Expect = 7e-27
 Identities = 131/469 (27%), Positives = 206/469 (42%), Gaps = 91/469 (19%)

Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITG---EIQMPSTTDPNYRFWKSENSTIIAWLLST 157
           L   NY+ W+ +V L   G+G    +T    E+ + + T       K++   + A L S 
Sbjct: 16  LGSSNYLSWASSVELWCKGQGVQDHLTNKAYEVDVKAKTSEEDAKAKAQWEKVDAQLCSL 75

Query: 158 MESTI--KKPYMFLP--TAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYY 213
           +  +I  K   +F P  T   VW+  +A Y++  + SR +D+ SRL   K+ + D++TY 
Sbjct: 76  LWRSIDFKLMPLFRPFQTCYTVWEKARALYTN--DISRFYDVISRLTNLKKQESDMSTYL 133

Query: 214 NELMALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVL---LAGLNNCLDEVRGRILGR 270
            ++ A+ +E D                 K+QE+ +   L   LAGL    D VR +IL  
Sbjct: 134 GQVQAVMEEFDTLMPVTTNVE-------KQQEHRQTLFLVLTLAGLPPDHDSVRDQILAS 186

Query: 271 IPLPTLQETFSEVRREEARQSVMMGKSASITESSALVTKGNEE-----------GKRDGK 319
             +PT+ E FS + R  A  S  +  S ++ +SS L ++  E+           G R GK
Sbjct: 187 PTVPTIDELFSRLLRLAAPPSHKVVSSPTV-DSSILASQTFEKRTYQSMENRRGGGRFGK 245

Query: 320 -KPFCDHCNRQWHTRDTCWKLHGKPPNWKKKGGKE------GRALQATT-----SDQEHQ 367
            +  C HC++  HTRD C+ LHG PP++     KE       RA + T+       Q +Q
Sbjct: 246 PRSKCSHCHKPGHTRDICYILHGPPPSYDPIVLKEYNEFLRNRASKQTSPPVAYGAQPNQ 305

Query: 368 SSSSSFPFTKEQLDQLYKMFGSQT---------PSCSIAQIGNFPNTALVSVKPS-PTWI 417
            S+++     E  + L      QT         P  S+A  GN  + A VS   +  TW+
Sbjct: 306 PSNNAHIAQTEYDEFLQYRANKQTSPQVVSVAQPDVSVA--GN--SFACVSQSSTLGTWV 361

Query: 418 IDSGATDHMTGESSLFASYSPCAGNHKIKIADG------------SLSAIA--------- 456
           +DSGA+DH++G  SL +          I +A+G             LS++          
Sbjct: 362 MDSGASDHISGNKSLLSDIVYSQSLPAITLANGIQTKPKGVGKAKPLSSVTLDSVLYVPG 421

Query: 457 -------------GKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYL 492
                           C   FF    + +D +TG+MIG+  +S GLYYL
Sbjct: 422 SPFNLASVSRLTKALHCSITFFDDFFLMQDRSTGQMIGTGHESQGLYYL 470


>emb|CAE05707.2| OSJNBb0065J09.3 [Oryza sativa (japonica cultivar-group)]
          Length = 1015

 Score =  555 bits (1429), Expect = e-156
 Identities = 286/503 (56%), Positives = 362/503 (71%), Gaps = 21/503 (4%)

Query: 615  NDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLHIPIAIRKPVRSCTKHP----------- 663
            ND + + +       +  G+ S I+  +    ++PIA RK +RS    P           
Sbjct: 400  NDQSSYQSDPIQENTETGGEESEISGEES---NLPIANRKGIRSTAGKPPIRYGFEEVEE 456

Query: 664  -----MAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKT 718
                 +A +VSYS+LS ++ AF +   ++ IPK+ +EA   PKW EA++EEM ALEKNKT
Sbjct: 457  ENGNDIANYVSYSSLSPAYRAFIASFQSIVIPKDWREAKNDPKWHEAMMEEMSALEKNKT 516

Query: 719  WKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLN 778
            W+++  P GK  V CKWV  VK +    VERYKARLVAKG++Q YGIDY ETFAPVAK++
Sbjct: 517  WELVPFPTGKKVVSCKWVNAVKQDPFGKVERYKARLVAKGYSQTYGIDYDETFAPVAKMS 576

Query: 779  TVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFE-DKFGLNVCKLQKSLYG 837
            TVR L+S A N DWPL QLDVKNAFL+GDLQEEVYM+ PPGF   +    V +L+KSLYG
Sbjct: 577  TVRTLISCAANFDWPLYQLDVKNAFLHGDLQEEVYMEIPPGFSTSQTKGKVLRLKKSLYG 636

Query: 838  LKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEM 897
            LKQSPRAWF++F  ++   GY Q   DHTLF R     KIAIL VYVDDII+TGDD  E+
Sbjct: 637  LKQSPRAWFDRFRRAMCGMGYKQCNGDHTLFYRHRGK-KIAILAVYVDDIIITGDDTQEI 695

Query: 898  DRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTP 957
             +LK+N++KEFE+KDLG LKYFLG+E+ARS +GIV+SQRKY+LDLL +TGM GCRPA TP
Sbjct: 696  AQLKENISKEFEVKDLGQLKYFLGIEIARSPRGIVLSQRKYVLDLLCDTGMLGCRPASTP 755

Query: 958  MELNAKLWEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVY 1017
            +E N KL  +   PV+  RYQRLVG+LIYL HTRPDI ++VSVVS++MH P   H++AVY
Sbjct: 756  IEQNHKLCAELGDPVNKERYQRLVGRLIYLCHTRPDITYAVSVVSRYMHDPRSGHMDAVY 815

Query: 1018 RILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKK 1077
            RILRYLK +PGKGL+FKK     V  + DADWA S+ DR+ST+GYC +V GNLV+WRSKK
Sbjct: 816  RILRYLKGSPGKGLWFKKNGHLGVEGYCDADWASSLDDRRSTSGYCVFVGGNLVSWRSKK 875

Query: 1078 QGVVARSSAEAEFRAMAQGICEL 1100
            Q VV+RS+AEAE+RAM+  IC +
Sbjct: 876  QPVVSRSTAEAEYRAMSGCICRI 898



 Score = 42.4 bits (98), Expect = 0.095
 Identities = 27/97 (27%), Positives = 43/97 (43%), Gaps = 20/97 (20%)

Query: 399 IGNFPNTA--LVSVKPSPTWIIDSGATDHMTGESSLFASYSPCAGNHKIKIADGSLSAIA 456
           +GN  N A  +    P  TWI+DSGA+ H+T                 + +   S S + 
Sbjct: 22  LGNLVNLAHPMQMQVPHSTWILDSGASRHVT-----------------VNLVSIS-SLVD 63

Query: 457 GKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLD 493
             DC       +C+ ++  TG+ +G   +  GL+YLD
Sbjct: 64  HMDCWVTLDRENCLIEERRTGRKLGIGIRQNGLWYLD 100


>emb|CAA36616.1| unnamed protein product [Solanum tuberosum] gi|421955|pir||S25787
            hypothetical protein 4 - potato transposon Tst1
          Length = 390

 Score =  529 bits (1363), Expect = e-148
 Identities = 248/356 (69%), Positives = 304/356 (84%), Gaps = 1/356 (0%)

Query: 799  VKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGY 858
            +KN FLNG L+EEVYMD PPGFE K+   +C+L++SLYGLKQSPRAWFE+FT  VK+QGY
Sbjct: 1    MKNVFLNGHLEEEVYMDPPPGFEGKYKSKICRLRRSLYGLKQSPRAWFERFTQFVKRQGY 60

Query: 859  MQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKY 918
            +Q Q+DHT+F R S +GK  +LIVYVDDIILTGDD+VE+  LK+ LA EFEIKDLG LKY
Sbjct: 61   VQGQADHTMFTRHSLEGKTTVLIVYVDDIILTGDDVVEIKNLKERLASEFEIKDLGPLKY 120

Query: 919  FLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNAKLWEKGNVPVDIGRYQ 978
            FLGMEVARS+KGI+VSQRKY+LDLL+ETGMSGCRP +TP++ N K  ++G + +D G+YQ
Sbjct: 121  FLGMEVARSKKGIIVSQRKYVLDLLKETGMSGCRPTETPIDPNLKFVKEGKL-IDKGQYQ 179

Query: 979  RLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTND 1038
            RLVGKLIYL+HTRPDI+F+VS+V QFMH P EEH EAVYRILRYLKS+PGKGL+FKK   
Sbjct: 180  RLVGKLIYLSHTRPDISFAVSLVIQFMHYPREEHQEAVYRILRYLKSSPGKGLFFKKNEQ 239

Query: 1039 RDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGIC 1098
            R +  +TDADWAGS IDR+ST+GYC +VWGNLVTWRSKKQ VVARSSAEAE+R+MA GIC
Sbjct: 240  RSLEAYTDADWAGSSIDRRSTSGYCTFVWGNLVTWRSKKQNVVARSSAEAEYRSMALGIC 299

Query: 1099 ELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRHFIKEKI 1154
            E+LW+++ LEEL+  +  P+KL+CD+KAAISIAHNPVQHDRTKH+E+    +K ++
Sbjct: 300  EILWLKRFLEELRRPVSFPMKLYCDNKAAISIAHNPVQHDRTKHVEVTDTSLKRRL 355


>emb|CAB81200.1| putative retrotransposon polyprotein [Arabidopsis thaliana]
            gi|4539373|emb|CAB40067.1| putative retrotransposon
            polyprotein [Arabidopsis thaliana] gi|7486142|pir||T04294
            hypothetical protein F25I24.200 - Arabidopsis thaliana
          Length = 1203

 Score =  517 bits (1332), Expect = e-145
 Identities = 262/584 (44%), Positives = 391/584 (66%), Gaps = 18/584 (3%)

Query: 603  PILQPCQESE-PRNDPNHHPNPGKSSIPKCRGKSSSITTS-DDPDLHIPIAIRKPVRSCT 660
            PI +P + ++ P     +H N    S+P     S + +TS + P   IP     P +  T
Sbjct: 445  PIARPKRNAKAPAYLSEYHCN----SVPFLSSLSPTTSTSIETPSSSIP-----PKKITT 495

Query: 661  KHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWK 720
             +PM+  +SY  L+  F ++    +    PK   +A+K  KW  A  EE+ ALE+NKTW 
Sbjct: 496  PYPMSTAISYDKLTPLFHSYICAYNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWI 555

Query: 721  IMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTV 780
            + +L  GKN VGCKWVFT+KYN D ++ERYKARLVA+GFTQ  GIDY ETF+PVAK  +V
Sbjct: 556  VESLTEGKNVVGCKWVFTIKYNPDGSIERYKARLVAQGFTQQEGIDYMETFSPVAKFGSV 615

Query: 781  RVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLN-----VCKLQKSL 835
            ++LL LA    W L Q+DV NAFL+G+L EE+YM  P G+    G++     VC+L KSL
Sbjct: 616  KLLLGLAAATGWSLTQMDVSNAFLHGELDEEIYMSLPQGYTPPTGISLPSKPVCRLLKSL 675

Query: 836  YGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIV 895
            YGLKQ+ R W+++ +       ++Q+ +D+T+F++ S    I +++VYVDD+++  +D  
Sbjct: 676  YGLKQASRQWYKRLSSVFLGANFIQSPADNTMFVKVSCTS-IIVVLVYVDDLMIASNDSS 734

Query: 896  EMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPAD 955
             ++ LK+ L  EF+IKDLG  ++FLG+E+ARS +GI V QRKY  +LLE+ G+SGC+P+ 
Sbjct: 735  AVENLKELLRSEFKIKDLGPARFFLGLEIARSSEGISVCQRKYAQNLLEDVGLSGCKPSS 794

Query: 956  TPMELNAKLW-EKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLE 1014
             PM+ N  L  E G +  +   Y+ LVG+L+YL  TRPDI F+V  +SQF+ +P + H++
Sbjct: 795  IPMDPNLHLTKEMGTLLPNATSYRELVGRLLYLCITRPDITFAVHTLSQFLSAPTDIHMQ 854

Query: 1015 AVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWR 1074
            A +++LRYLK NPG+GL +  +++  ++ F+DADW      R+S TG+C Y+  +L+TW+
Sbjct: 855  AAHKVLRYLKGNPGQGLMYSASSELCLNGFSDADWGTCKDSRRSVTGFCIYLGTSLITWK 914

Query: 1075 SKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNP 1134
            SKKQ VV+RSS E+E+R++AQ  CE++W+Q+LL++L + +  P KLFCD+K+A+ +A NP
Sbjct: 915  SKKQSVVSRSSTESEYRSLAQATCEIIWLQQLLKDLHVTMTCPAKLFCDNKSALHLATNP 974

Query: 1135 VQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSL 1178
            V H+RTKHIEID H ++++I +G +   +V +  Q ADILTK L
Sbjct: 975  VFHERTKHIEIDCHTVRDQIKAGKLKTLHVPTGNQLADILTKPL 1018


>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
            gi|9989049|gb|AAG10812.1| Putative retroelement
            polyprotein [Arabidopsis thaliana]
          Length = 1404

 Score =  513 bits (1321), Expect = e-143
 Identities = 263/539 (48%), Positives = 367/539 (67%), Gaps = 6/539 (1%)

Query: 662  HPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKI 721
            HP     S + +     AF S++S   IP+  +EA+++ +W++A+ +E+ A+++N TW  
Sbjct: 864  HPFQATCSLALVPLDHQAFLSKISEHWIPQTYEEAMEVKEWRDAIADEINAMKRNHTWDE 923

Query: 722  MTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVR 781
              LP GK TV  +WVFT+KY S+  +ERYK RLVA+GFTQ YG DY ETFAPVAKL+TVR
Sbjct: 924  DDLPKGKKTVSSRWVFTIKYKSNGDIERYKTRLVARGFTQTYGSDYMETFAPVAKLHTVR 983

Query: 782  VLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLN-VCKLQKSLYGLKQ 840
            V+L+LA NL W L Q+DVKNAFL G+L+++VYM  PPG ED    + V +L+K++YGLKQ
Sbjct: 984  VVLALATNLSWGLWQMDVKNAFLQGELEDDVYMTPPPGLEDTIPCDKVLRLRKAIYGLKQ 1043

Query: 841  SPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRL 900
            SPRAW+ K + ++K  G+ +++SDHTLF   S  G I ++++YVDD+I+TGD+   +D  
Sbjct: 1044 SPRAWYHKLSRTLKDHGFKKSESDHTLFTLQSPQG-IVVVLIYVDDLIITGDNKDGIDST 1102

Query: 901  KKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMEL 960
            K  L   F+IKDLG LKYFLG+EV RS  G+ +SQRKY LDLL ETG    +PA TP+E 
Sbjct: 1103 KTFLKSCFDIKDLGELKYFLGIEVCRSNAGLFLSQRKYTLDLLNETGFMDAKPARTPLED 1162

Query: 961  NAKLWEKGNVPV----DIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAV 1016
              K+  KG        D   Y++LVGKLIYL +TRPDI F+V+ VSQ M  P   H   V
Sbjct: 1163 GYKVNRKGEKEDEKFGDAPLYRKLVGKLIYLTNTRPDICFAVNQVSQHMKVPMVYHWNMV 1222

Query: 1017 YRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSK 1076
             RILRYLK + G+G++  K +  ++  + DAD+AG   DR+S TGYC ++ GNL TW++K
Sbjct: 1223 ERILRYLKGSSGQGIWMGKNSSTEIVGYCDADYAGDRGDRRSKTGYCTFIGGNLATWKTK 1282

Query: 1077 KQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQ 1136
            KQ VV+ SSAE+E+RAM +   EL W++ LL++L ++  +P+ + CD+KAAI IA N V 
Sbjct: 1283 KQKVVSCSSAESEYRAMRKLTNELTWLKALLKDLGIEQHMPITMHCDNKAAIYIASNSVF 1342

Query: 1137 HDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNI 1195
            H+RTKHIE+D H ++EKI+ G     Y  S +Q ADI TK+ +      +  KLG+ ++
Sbjct: 1343 HERTKHIEVDCHKVREKIIEGVTLPCYTRSEDQLADIFTKAASLKVCNFIHGKLGLVDL 1401



 Score =  167 bits (423), Expect = 2e-39
 Identities = 120/435 (27%), Positives = 195/435 (44%), Gaps = 51/435 (11%)

Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITG--------EIQMPSTTDPNYRFWKSENSTIIA 152
           L G NY+ WS+T + +L G+G    +          E +   T  P    W  E+  ++A
Sbjct: 13  LQGGNYLTWSRTTKTVLCGRGLWSHVISSQAPKEDKEEEETETISPEEEKWFQEDQAVLA 72

Query: 153 WLLSTMESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTY 212
            L +++E++I + Y +  TAK +WD +K  Y +  N +R+F++K  + +  Q D + T +
Sbjct: 73  LLQNSLETSILEGYSYCETAKELWDTLKNVYGNESNLTRVFEVKKAINELSQEDLEFTKH 132

Query: 213 YNELMALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIP 272
           + +  +LW EL          T D  +  +R+E D+VF LL  LN   +++   +L    
Sbjct: 133 FGKFRSLWSELKSLRPG----TLDPKILHERREQDKVFGLLLTLNPGYNDLIKHLLRSEK 188

Query: 273 LPTLQETFSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPF-CDHCNRQWH 331
           LP+L E  S++++E+    +  GKS  IT +   V       K + +K   CDHC ++ H
Sbjct: 189 LPSLDEVCSKIQKEQGSTGLFGGKSELITANKGEVVANKGVYKNEDRKLLTCDHCKKKGH 248

Query: 332 TRDTCWKLHGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQLDQLYKMFGSQT 391
           T+D CW LH   P+ K    K+ RA  +  + +E   + SS   T        +    + 
Sbjct: 249 TKDKCWLLH---PHLKPAKFKDSRAHFSQETHEEQSQAGSSKGETSTSFGDYVRKSDLEA 305

Query: 392 PSCSIAQIGNFPNTALVSVKPSPTWIIDSGATDHMTGESSLFASYSPCAGNHKIKIADGS 451
              SI  +         S   S + +IDSGA+ HM   S+L  +  P  G+  + IA+G 
Sbjct: 306 LIKSIVSLKE-SGITFSSQTSSGSIVIDSGASHHMISNSNLLDNIEPALGH--VIIANGD 362

Query: 452 LSAIAG--------KD------------------------CQANFFHSHCIFKDLNTGKM 479
              I G        KD                        C A F  +   F+D+ TGK+
Sbjct: 363 KVPIEGIGNLKLFNKDSKAFFMPKFTSNLLSVKRTTRDLNCYAIFGPNDVYFQDIETGKV 422

Query: 480 IGSAKKSGGLYYLDN 494
           IG     G LY L++
Sbjct: 423 IGEGGSKGELYVLED 437


>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
            (gb|U12626). [Arabidopsis thaliana]
            gi|25301690|pir||G96722 hypothetical protein F20P5.25
            [imported] - Arabidopsis thaliana
          Length = 1315

 Score =  510 bits (1314), Expect = e-143
 Identities = 260/613 (42%), Positives = 399/613 (64%), Gaps = 23/613 (3%)

Query: 603  PILQPCQESEPRNDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLHIPIAIRKP------- 655
            P L P    + ++  + +P+   SS+       S+  T++ P+  +  + RK        
Sbjct: 704  PDLNPTPPMQRQSSDHVNPSDSSSSVEIL---PSANPTNNVPEPSVQTSHRKAKKPAYLQ 760

Query: 656  ------VRSCTKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEE 709
                  V S T H + KF+SY  ++  +  F + L   + P N  EA K+  W++A+  E
Sbjct: 761  DYYCHSVVSSTPHEIRKFLSYDRINDPYLTFLACLDKTKEPSNYTEAEKLQVWRDAMGAE 820

Query: 710  MRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSE 769
               LE   TW++ +LPA K  +GC+W+F +KYNSD +VERYKARLVA+G+TQ  GIDY+E
Sbjct: 821  FDFLEGTHTWEVCSLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNE 880

Query: 770  TFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLN-- 827
            TF+PVAKLN+V++LL +A      L QLD+ NAFLNGDL EE+YM  P G+  + G +  
Sbjct: 881  TFSPVAKLNSVKLLLGVAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQGDSLP 940

Query: 828  ---VCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYV 884
               VC+L+KSLYGLKQ+ R W+ KF+ ++   G++Q+  DHT F++ S DG    ++VY+
Sbjct: 941  PNAVCRLKKSLYGLKQASRQWYLKFSSTLLGLGFIQSYCDHTCFLKIS-DGIFLCVLVYI 999

Query: 885  DDIILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLE 944
            DDII+  ++   +D LK  +   F+++DLG LKYFLG+E+ RS KGI +SQRKY LDLL+
Sbjct: 1000 DDIIIASNNDAAVDILKSQMKSFFKLRDLGELKYFLGLEIVRSDKGIHISQRKYALDLLD 1059

Query: 945  ETGMSGCRPADTPMELNAKL-WEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQ 1003
            ETG  GC+P+  PM+ +     + G   V++G Y+RL+G+L+YL  TRPDI F+V+ ++Q
Sbjct: 1060 ETGQLGCKPSSIPMDPSMVFAHDSGGDFVEVGPYRRLIGRLMYLNITRPDITFAVNKLAQ 1119

Query: 1004 FMHSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYC 1063
            F  +P + HL+AVY+IL+Y+K   G+GL++  T++  + ++ +AD+      R+ST+GYC
Sbjct: 1120 FSMAPRKAHLQAVYKILQYIKGTIGQGLFYSATSELQLKVYANADYNSCRDSRRSTSGYC 1179

Query: 1064 AYVWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCD 1123
             ++  +L+ W+S+KQ VV++SSAEAE+R+++    EL+W+   L+EL++ +  P  LFCD
Sbjct: 1180 MFLGDSLICWKSRKQDVVSKSSAEAEYRSLSVATDELVWLTNFLKELQVPLSKPTLLFCD 1239

Query: 1124 SKAAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNF 1183
            ++AAI IA+N V H+RTKHIE D H ++E+++ G   L ++ +  Q AD  TK L   +F
Sbjct: 1240 NEAAIHIANNHVFHERTKHIESDCHSVRERLLKGLFELYHINTELQIADPFTKPLYPSHF 1299

Query: 1184 ERLIVKLGMTNIY 1196
             RLI K+G+ NI+
Sbjct: 1300 HRLISKMGLLNIF 1312



 Score =  151 bits (381), Expect = 1e-34
 Identities = 112/402 (27%), Positives = 185/402 (45%), Gaps = 48/402 (11%)

Query: 117 LDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMESTIKKPYMFLPTAKNVW 176
           ++ K KLGF+ G I  P   DP  + W+  NS + +WLL+++   I    ++ PTA  +W
Sbjct: 5   IEAKNKLGFVDGSIPKPDDDDPYCKIWRRCNSMVKSWLLNSVSKEIYTSILYFPTAAAIW 64

Query: 177 DAVKATYSDIQNSS--RIFDLKSRLWQAKQSDRDVTTYYNELMALWQELDLCYDDNWRCT 234
              K  Y+    SS  R++ L+ ++   +Q + D+++Y+     LW+EL        R  
Sbjct: 65  ---KDLYTRFHKSSLPRLYKLRQQIHSLRQGNLDLSSYHTRTQTLWEEL-TSLQAVPRTV 120

Query: 235 EDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQETFSEVRREEARQSVMM 294
           ED    L  +E +RV   L GLN+C D VR +IL +  LP+L E F+ + ++E ++S  +
Sbjct: 121 ED---LLIERETNRVIDFLMGLNDCYDTVRSQILMKKTLPSLSEVFNMIDQDETQRSARI 177

Query: 295 GKSASITESSALVTKGNEEGKRDG------KKPFCDHCNRQWHTRDTCWKLHGKPPNWKK 348
             +  +T S   V+  + +   +G      ++P C +C+R  H  DTC+K HG P ++K 
Sbjct: 178 STTPGMTSSVFPVSNQSSQSALNGDTYQKKERPVCSYCSRPGHVEDTCYKKHGYPTSFKS 237

Query: 349 KGG--KEGRALQATTSDQE--HQSSSSSFPFTKEQLDQLYKMFGS--QTPSCSIAQIGNF 402
           K    K   +  A    +E  + +S S+   T  Q+ QL     S  Q PS  +      
Sbjct: 238 KQKFVKPSISANAAIGSEEVVNNTSVSTGDLTTSQIQQLVSFLSSKLQPPSTPVQ----- 292

Query: 403 PNTALVSVKPSPTWIIDSGATDHMTGESSLFASYSPCAGNHKI----------KIADGSL 452
           P    +SV   P+    S     ++G   L        G H I          K    S+
Sbjct: 293 PEVHSISVSSDPS---SSSTVCPISGSVHL--------GRHLILNDVLFIPQFKFNLLSV 341

Query: 453 SAIA-GKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLD 493
           S++     C+  F  + C+ +D     M+G  K+   LY +D
Sbjct: 342 SSLTKSMGCRIWFDETSCVLQDATRELMVGMGKQVANLYIVD 383


>gb|AAU89728.1| putative retroelement pol polyprotein-like [Solanum tuberosum]
          Length = 1476

 Score =  509 bits (1311), Expect = e-142
 Identities = 253/550 (46%), Positives = 368/550 (66%), Gaps = 14/550 (2%)

Query: 662  HPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKI 721
            +P++  + YS LSS++  + +  S    P+   +A    +W  A+ EE++ALE NKTW++
Sbjct: 909  YPISDNIDYSCLSSTYQCYIASSSVETEPQFYYQAANDCRWVHAMKEEIQALEDNKTWEV 968

Query: 722  MTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVR 781
            ++LP GK  +GCKWV+ +KY +   +ER+KARLVAKG+ Q  G+DY ETF+PV K+ T+R
Sbjct: 969  VSLPKGKKAIGCKWVYKIKYKASGEIERFKARLVAKGYNQKEGLDYQETFSPVVKMVTLR 1028

Query: 782  VLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFE-DKFG-LNVCKLQKSLYGLK 839
             +L+LAV+  W + Q+DV NAFL GDL EEVYM  P GF+ DK G   VC+L KSLYGLK
Sbjct: 1029 TVLTLAVSKGWDIQQMDVYNAFLQGDLIEEVYMQLPQGFQYDKTGDPKVCRLLKSLYGLK 1088

Query: 840  QSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDR 899
            Q+ R W  K T ++   G+ Q+  D++L ++ + DG I I+++YVDD+++TG  +  +D 
Sbjct: 1089 QASRQWNVKLTTALLAAGFQQSHLDYSLMLKRTADG-IVIVLIYVDDLLITGSSLQLIDD 1147

Query: 900  LKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPME 959
             K+ L   F+IKDLG L+YFLGME AR+  G+++ QRKY L+L+ + G+ G +P+ TP+E
Sbjct: 1148 AKQVLKANFKIKDLGTLRYFLGMEFARNASGMLMHQRKYALELISDLGLGGSKPSVTPVE 1207

Query: 960  LNAKLWEK-----------GNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSP 1008
            L+ KL  +            ++  D   YQRLVG+L+YL  TRPDI+F+V  +SQFMH+P
Sbjct: 1208 LHLKLTTREFDLHVGSSGADSLLADPTEYQRLVGRLLYLTITRPDISFAVQHLSQFMHAP 1267

Query: 1009 YEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWG 1068
               H+EA  R+++Y+K  PG GLY        +  + DADW   +  RKS TGY      
Sbjct: 1268 KVSHMEAAIRVVKYVKQAPGLGLYMAVQTADTLQAYCDADWGSCINTRKSITGYMIQFGS 1327

Query: 1069 NLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAI 1128
             L++W+SKKQ  ++RSSAEAE+R++A  + EL+W+  L +EL + + LP+ L+CDSKAAI
Sbjct: 1328 ALLSWKSKKQPTISRSSAEAEYRSLASTVAELVWLTGLFKELDMPLSLPVSLYCDSKAAI 1387

Query: 1129 SIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIV 1188
             IA NPV H+RTKHI+ID HFI+EK+ +G + + Y+ + EQ ADILTK L+      L+ 
Sbjct: 1388 QIAANPVFHERTKHIDIDCHFIREKVQAGLVMIHYLPTQEQPADILTKGLSSAQHSYLVS 1447

Query: 1189 KLGMTNIYAP 1198
            KLG+ NI+ P
Sbjct: 1448 KLGLKNIFIP 1457



 Score =  118 bits (295), Expect = 1e-24
 Identities = 121/511 (23%), Positives = 197/511 (37%), Gaps = 121/511 (23%)

Query: 98  VQRLDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDP-NYRFWKSENSTIIAWLLS 156
           +Q    +NY  WS+ ++L L  K K+GFI G ++     +    + W   N+ +++WL++
Sbjct: 19  IQLTGMENYSLWSRAMQLTLLTKNKMGFIDGSLRRDDFKEELEKKQWDRCNAMVLSWLMN 78

Query: 157 TMESTIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNEL 216
            + + +    +F   A  VW+ +K  +  + N SRIF L   +    Q    V+ YY++L
Sbjct: 79  NVSTDLVSGILFRSNATLVWNDLKERFDKV-NMSRIFHLHKAIVTHVQGVSPVSVYYSKL 137

Query: 217 MALWQELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTL 276
             LW E D          E SV +       ++   L GLN+   + R +IL   P P++
Sbjct: 138 KDLWDEYDSILPPPSCDCEKSVDYTDSMLRQKLLQFLMGLNDNYGQARSQILMMNPSPSV 197

Query: 277 QETFSEVRREEARQSVMMGKSASITESSALVT-------------KGNEEGKRDG----- 318
            + ++ + ++E+++S  +  S    + +AL T             +G+  G  +G     
Sbjct: 198 NQCYAMIVQDESQRS--LSGSGQTIDPTALFTHRPGGSGFGSQGSQGSGNGSSNGNSHRF 255

Query: 319 ----------------------KKPFCDHCNRQWHTRDTCWKLHGKPPNWKKKG------ 350
                                 K   C HCN Q HT+DTC++L G P ++K K       
Sbjct: 256 HKGGNIYCDFCNMKGHIRANCNKLKHCTHCNMQGHTKDTCYQLIGYPADYKGKKKANIVT 315

Query: 351 ---------------------------GKEGRALQATTSDQEHQSSS-------SSFP-F 375
                                      G     +Q T +   H S S        S P F
Sbjct: 316 APSLPQMQHNNFNNNLNYPMQYTGDGIGHFVSPMQFTGNTNGHSSGSIAGNFGPGSVPQF 375

Query: 376 TKEQLDQLYKMFGSQTPSCSIAQI-GNFPNTA-LVSVKPSPTWIIDSGATDHMTGESSLF 433
           T  Q + + +M      S S A + G F  ++   S   S  WI+DSGATDHM   ++L 
Sbjct: 376 TPSQYNNILQMLNKPMLSESSANVAGIFAGSSHCNSNTHSSAWIVDSGATDHMVSNTTLL 435

Query: 434 ASYSPCAGNHKIKIADG--------SLSAIAGKD-------------------------- 459
                 +   K+++  G          S + G D                          
Sbjct: 436 NHGLSVSHPGKVQLPTGDSAVVTHSGSSQLTGGDVVKNVLCVPTFQFNLLSVSKLTKELN 495

Query: 460 CQANFFHSHCIFKDLNTGKMIGSAKKSGGLY 490
           C   FF    I +DL TGK+    ++  GLY
Sbjct: 496 CCVIFFPDFFIIQDLFTGKVKEIGEEINGLY 526


>pir||E96608 probable retroelement polyprotein F25P12.89 [imported] - Arabidopsis
            thaliana gi|9954746|gb|AAG09097.1| Putative retroelement
            polyprotein [Arabidopsis thaliana]
          Length = 1486

 Score =  509 bits (1310), Expect = e-142
 Identities = 255/553 (46%), Positives = 370/553 (66%), Gaps = 2/553 (0%)

Query: 647  HIPIAIRKPVRSCTKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAV 706
            ++   + +P  S T +P+  ++S S  S ++ A+   +++   P+N  EA+    WK AV
Sbjct: 934  YVTTLLHQPFPSATPYPLDNYISSSRFSDNYQAYILAITSGNEPRNYNEAMLDDHWKGAV 993

Query: 707  LEEMRALEKNKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGID 766
              E+ +LE   TW +  LP GK  +GCKWVF +KY SD T+ER+KARLV  G  Q  G+D
Sbjct: 994  SHEIGSLENLGTWTVEDLPPGKKALGCKWVFRLKYKSDGTLERHKARLVVLGNNQTEGLD 1053

Query: 767  YSETFAPVAKLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGL 826
            Y+ETFAPVAK+ TVR  L   V+LDW ++Q+DV NAFL+GDL EEVYM  PPGF      
Sbjct: 1054 YTETFAPVAKMVTVRAFLQQVVSLDWEVHQMDVHNAFLHGDLDEEVYMQFPPGFRTGDKT 1113

Query: 827  NVCKLQKSLYGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDD 886
             VC+L+KSLYGLKQ+PR WF K T ++K  G++Q  SD++LF+   N  ++ +L VYVDD
Sbjct: 1114 KVCRLRKSLYGLKQAPRCWFAKLTSALKNYGFIQDISDYSLFIFHKNGVRLHVL-VYVDD 1172

Query: 887  IILTGDDIVEMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEET 946
            +I+TG  I  +   K  L+  F +KDLG L+YFLG+EVARS +GI + QRKY LD++ ET
Sbjct: 1173 LIITGTTIAVITEFKHYLSSCFYMKDLGILRYFLGIEVARSPEGIYLCQRKYALDIITET 1232

Query: 947  GMSGCRPADTPMELNAKL-WEKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFM 1005
            G+ G +PA  P++ N KL +  G    D  RY+RLVG++IYLA TRP++++ + ++SQFM
Sbjct: 1233 GLLGVKPASFPLDQNHKLAFATGETIDDPLRYRRLVGRIIYLATTRPELSYVIHILSQFM 1292

Query: 1006 HSPYEEHLEAVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAY 1065
            H+P   H EA  R++RYLKS+PG+G+  +      +S + D+D+       +S TG+   
Sbjct: 1293 HNPKPAHWEAALRVVRYLKSSPGQGILLRANTPLVLSAWCDSDFGACPHSDRSLTGWFIQ 1352

Query: 1066 VWGNLVTWRSKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSK 1125
            + G+ ++W+++KQ VV+RSSAEAE+RAMA+ + E++WI++LL  L +    P  L  DS 
Sbjct: 1353 LGGSPLSWKTQKQNVVSRSSAEAEYRAMAETVSEIIWIRELLPALGIPCTAPTTLHSDSL 1412

Query: 1126 AAISIAHNPVQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFER 1185
            +AIS+A NPV H RTKH+  D HFI++++V+GTI   +V++  Q ADILTK+L R  F  
Sbjct: 1413 SAISLAANPVYHARTKHVRRDVHFIRDELVNGTIATKHVSTTSQLADILTKALGRKEFAD 1472

Query: 1186 LIVKLGMTNIYAP 1198
             + KLG+ N++ P
Sbjct: 1473 FLAKLGICNLHIP 1485



 Score =  134 bits (338), Expect = 1e-29
 Identities = 110/452 (24%), Positives = 183/452 (40%), Gaps = 68/452 (15%)

Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMES 160
           L G NY EW+  +RL L  + K GF  G I  P  TDP++  W + N+ +++W+  T++ 
Sbjct: 42  LRGPNYDEWATNLRLALKARKKFGFADGSIPQPVETDPDFEDWTANNALVVSWMKLTIDE 101

Query: 161 TIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALW 220
           T+      L  +  +W  ++  +  ++N  R+  LK+ L   +Q    + TYY  L  LW
Sbjct: 102 TVSTSMSHLDDSHELWTHIQKRFG-VKNGQRVQRLKTELATCRQKGVAIETYYGRLSQLW 160

Query: 221 QELDLCYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNCL-DEVRGRILGRIPLPTLQET 279
           + L    D     T D V   K +E D++   L GL+  +   V+  +L R+PLP+L+E 
Sbjct: 161 RSL---ADYQQAKTMDDV--RKEREEDKLHQFLMGLDESVYGAVKSALLSRVPLPSLEEA 215

Query: 280 FSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWHTRDTCWKL 339
           ++ + ++E  +S+    +  + +  +   +     +   +   C +C R  H  + C+KL
Sbjct: 216 YNALTQDEESKSLSRLHNERV-DGVSFAVQTTSRPRDSSENRVCSNCGRVGHLAEQCFKL 274

Query: 340 HGKPP------NWKKKGGKEGRALQATTSDQEH-QSSSSSFPFTKEQLDQLYKMFGSQTP 392
            G PP        K         L +    Q H + SS +   +      +       +P
Sbjct: 275 IGYPPWLEEKLRLKNTASSSRGGLSSFKGKQSHGRGSSINHVASSGMAANVVTNSSLTSP 334

Query: 393 SCSIAQIG---------NFPNTALVSVKPS-----------PTWIIDSGATDHMTGESSL 432
             S  +IG             T L   K +            +WIIDSGAT+HMTG  + 
Sbjct: 335 LTSDDRIGLSGLNDSQWKILQTILEERKSTSNDHQSGKYFLESWIIDSGATNHMTGSLAF 394

Query: 433 FASYSPCA--------GNHKIKIADGSLSAIAGKDCQANFF----HSH------------ 468
             +             G        GS+   +  D Q   F    H H            
Sbjct: 395 LRNVCDMPPVLIKLPDGRFTTATKQGSVQLGSSLDLQDVLFVDGLHCHLISVSQLTRTRR 454

Query: 469 ---------CIFKDLNTGKMIGSAKKSGGLYY 491
                    CI +D  T  +IG+ ++  GLY+
Sbjct: 455 CIFQITDKVCIVQDRTTLMLIGAGRELNGLYF 486


>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301694|pir||E84535 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1454

 Score =  506 bits (1303), Expect = e-141
 Identities = 260/589 (44%), Positives = 387/589 (65%), Gaps = 9/589 (1%)

Query: 615  NDPNHHPNPGKSSIPKCRGKSSSITTSDDPDLHIPIAIRKPVRSCTKHPMAKFVSYSNLS 674
            +D  H P+   S I     + SS      P  H+       ++S  K+P++  +SYS +S
Sbjct: 866  SDTTHSPSSLPSQISDLPPQISSQRVRKPP-AHLNDYHCNTMQSDHKYPISSTISYSKIS 924

Query: 675  SSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIMTLPAGKNTVGCK 734
             S   + + ++ + IP N  EA    +W EAV  E+ A+EK  TW+I TLP GK  VGCK
Sbjct: 925  PSHMCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTWEITTLPKGKKAVGCK 984

Query: 735  WVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRVLLSLAVNLDWPL 794
            WVFT+K+ +D  +ERYKARLVAKG+TQ  G+DY++TF+PVAK+ T+++LL ++ +  W L
Sbjct: 985  WVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTTIKLLLKVSASKKWFL 1044

Query: 795  NQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGL-----NVCKLQKSLYGLKQSPRAWFEKF 849
             QLDV NAFLNG+L+EE++M  P G+ ++ G+      V +L++S+YGLKQ+ R WF+KF
Sbjct: 1045 KQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPSNVVLRLKRSIYGLKQASRQWFKKF 1104

Query: 850  TWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKKNLAKEFE 909
            + S+   G+ +   DHTLF++   DG+  I++VYVDDI++         +L + L + F+
Sbjct: 1105 SSSLLSLGFKKTHGDHTLFLKM-YDGEFVIVLVYVDDIVIASTSEAAAAQLTEELDQRFK 1163

Query: 910  IKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNAKL-WEKG 968
            ++DLG LKYFLG+EVAR+  GI + QRKY L+LL+ TGM  C+P   PM  N K+  + G
Sbjct: 1164 LRDLGDLKYFLGLEVARTTAGISICQRKYALELLQSTGMLACKPVSVPMIPNLKMRKDDG 1223

Query: 969  NVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILRYLKSNPG 1028
            ++  DI +Y+R+VGKL+YL  TRPDI F+V+ + QF  +P   HL A YR+L+Y+K   G
Sbjct: 1224 DLIEDIEQYRRIVGKLMYLTITRPDITFAVNKLCQFSSAPRTTHLTAAYRVLQYIKGTVG 1283

Query: 1029 KGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVVARSSAEA 1088
            +GL++  ++D  +  F D+DWA     R+STT +  +V  +L++WRSKKQ  V+RSSAEA
Sbjct: 1284 QGLFYSASSDLTLKGFADSDWASCQDSRRSTTSFTMFVGDSLISWRSKKQHTVSRSSAEA 1343

Query: 1089 EFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTKHIEIDRH 1148
            E+RA+A   CE++W+  LL  L+    +P+ L+ DS AAI IA NPV H+RTKHI++D H
Sbjct: 1344 EYRALALATCEMVWLFTLLVSLQASPPVPI-LYSDSTAAIYIATNPVFHERTKHIKLDCH 1402

Query: 1149 FIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYA 1197
             ++E++ +G + L +V + +Q ADILTK L    FE L  K+ + NI++
Sbjct: 1403 TVRERLDNGELKLLHVRTEDQVADILTKPLFPYQFEHLKSKMSILNIFS 1451



 Score =  158 bits (399), Expect = 1e-36
 Identities = 130/473 (27%), Positives = 211/473 (44%), Gaps = 84/473 (17%)

Query: 100 RLDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTME 159
           RLD  NY +WS  + + LD K K GFI G +  P  +D N+R W   NS + +WLL+++ 
Sbjct: 78  RLDETNYGDWSVAMLISLDAKNKTGFIDGTLSRPLESDLNFRLWSRCNSMVKSWLLNSVS 137

Query: 160 STIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMAL 219
             I +  + +  A ++W  + + + ++ N  R ++L   +   +Q    ++ YY  L  L
Sbjct: 138 PQIYRSILRMNDASDIWRDLNSRF-NVTNLPRTYNLTQEIQDFRQGTLSLSEYYTRLKTL 196

Query: 220 WQELDLCYDDNWRCTEDSVLFLKRQ-ENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQE 278
           W +LD     +  CT    + L+++ E  ++   LAGLN     VR +I+ +  LP+L E
Sbjct: 197 WDQLDSTEALDEPCTCGKAMRLQQKAEQAKIVKFLAGLNESYAIVRRQIIAKKALPSLGE 256

Query: 279 TFSEVRREEARQSVM-------------MGKSASITESSALVTKGNEEGKRDGKKPFCDH 325
            +  + ++ ++QS               + +S S+  +   V  G  +G     +P C  
Sbjct: 257 VYHILDQDNSQQSFSNVVAPPAAFQVSEITQSPSMDPTVCYVQNGPNKG-----RPICSF 311

Query: 326 CNRQWHTRDTCWKLHGKPPNWKKKGGKEGRALQ---------ATTSDQEHQSSSSSFPFT 376
            NR  H  + C+K HG PP +  K GK G  LQ         A +S+      S     +
Sbjct: 312 YNRVGHIAERCYKKHGFPPGFTPK-GKAGEKLQKPKPLAANVAESSEVNTSLESMVGNLS 370

Query: 377 KEQLDQLYKMFGSQ---TP-----SCSIAQIGNF-----PNT----ALVSVK----PSPT 415
           KEQL Q   MF SQ   TP     + S +Q  N      P+T     +++V      S T
Sbjct: 371 KEQLQQFIAMFSSQLQNTPPSTYATASTSQSDNLGICFSPSTYSFIGILTVARHTLSSAT 430

Query: 416 WIIDSGATDHMTGESSLFASYS---------------PCAGNHKIKIADG---------- 450
           W+IDSGAT H++ + SLF+S                   +G   +K+ D           
Sbjct: 431 WVIDSGATHHVSHDRSLFSSLDTSVLSAVNLPTGPTVKISGVGTLKLNDDILLKNVLFIP 490

Query: 451 -------SLSAIAGK-DCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYLDNG 495
                  S+S++      +  F  + C  +DL  G+M+G  ++   LY LD G
Sbjct: 491 EFRLNLISISSLTDDIGSRVIFDKNSCEIQDLIKGRMLGQGRRVANLYLLDVG 543


>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1491

 Score =  506 bits (1302), Expect = e-141
 Identities = 251/545 (46%), Positives = 369/545 (67%), Gaps = 2/545 (0%)

Query: 656  VRSCTKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEK 715
            ++  +++P+  ++     S+    F + ++  + PK+ +EA+K+  W +A+ +E+ ALE 
Sbjct: 948  IQGNSQYPLTDYIFDECFSAGHKVFLAAITANDEPKHFKEAVKVKVWNDAMYKEVDALEV 1007

Query: 716  NKTWKIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVA 775
            NKTW I+ LP GK  +G +WV+  K+N+D TVERYKARLV +G  Q  G DY+ETFAPV 
Sbjct: 1008 NKTWDIVDLPTGKVAIGSQWVYKTKFNADGTVERYKARLVVQGNNQIEGEDYTETFAPVV 1067

Query: 776  KLNTVRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSL 835
            K+ TVR LL L     W + Q+DV NAFL+GDL+EEVYM  PPGF       VC+L+KSL
Sbjct: 1068 KMTTVRTLLRLVAANQWEVYQMDVHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRLRKSL 1127

Query: 836  YGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIV 895
            YGLKQ+PR WF+K + ++K+ G++Q   D++ F  +S  G    ++VYVDD+I+ G+D  
Sbjct: 1128 YGLKQAPRCWFKKLSDALKRFGFIQGYEDYSFFS-YSCKGIELRVLVYVDDLIICGNDEY 1186

Query: 896  EMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPAD 955
             + + K+ L + F +KDLG LKYFLG+EV+R   GI +SQRKY LD++ ++G  G RPA 
Sbjct: 1187 MVQKFKEYLGRCFSMKDLGKLKYFLGIEVSRGPDGIFLSQRKYALDIISDSGTLGARPAY 1246

Query: 956  TPMELNAKLW-EKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLE 1014
            TP+E N  L  + G +  D   ++RLVG+L+YL HTRP++++SV V+SQFM +P E HLE
Sbjct: 1247 TPLEQNHHLASDDGPLLQDPKPFRRLVGRLLYLLHTRPELSYSVHVLSQFMQAPREAHLE 1306

Query: 1015 AVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWR 1074
            A  RI+RYLK +PG+G+      D  + ++ D+D+    + R+S + Y   + G+ ++W+
Sbjct: 1307 AAMRIVRYLKGSPGQGILLSSNKDLTLEVYCDSDFQSCPLTRRSLSAYVVLLGGSPISWK 1366

Query: 1075 SKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNP 1134
            +KKQ  V+ SSAEAE+RAM+  + E+ W+ KLL+EL + +  P +LFCDSKAAISIA NP
Sbjct: 1367 TKKQDTVSHSSAEAEYRAMSVALKEIKWLNKLLKELGITLAAPTRLFCDSKAAISIAANP 1426

Query: 1135 VQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTN 1194
            V H+RTKHIE D H +++ +  G I   +V ++EQ ADI TK+L R  F  L+ KLG+ N
Sbjct: 1427 VFHERTKHIERDCHSVRDAVRDGIITTHHVRTSEQLADIFTKALGRNQFIYLMSKLGIQN 1486

Query: 1195 IYAPT 1199
            ++ PT
Sbjct: 1487 LHTPT 1491



 Score =  140 bits (354), Expect = 2e-31
 Identities = 115/460 (25%), Positives = 194/460 (42%), Gaps = 78/460 (16%)

Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMES 160
           L G NY EWS  +   L  K K GFI G I  P   +P+Y  W++ NS I+ W+ +++E 
Sbjct: 44  LTGDNYNEWSTEMLNALQAKRKTGFINGSISKPPLDNPDYENWQAVNSMIVGWIRASIEP 103

Query: 161 TIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALW 220
            +K    F+  A  +W  +K  +S + N  R+  +K++L   +Q  + V  YY  L  LW
Sbjct: 104 KVKSTVTFISDAHQLWSELKQRFS-VGNKVRVHQIKAQLAACRQDGQPVIDYYGRLCKLW 162

Query: 221 QELDLCYDDN----WRCTEDSVLF-LKRQENDRVFVLLAGLNNC-LDEVRGRILGRIPLP 274
           +E  +           CT  + L   K +E +++   + GL++     +   ++   P P
Sbjct: 163 EEFQIYKPITVCKCGLCTCGATLEPSKEREEEKIHQFVLGLDDSRFGGLSATLIAMDPFP 222

Query: 275 TLQETFSEVRREEARQSVMMGKSAS------ITESSALVTKGNEEG---KRDGKKPFCDH 325
           +L E +S V REE R + +  +         +T  S +   G  +    K   +   C H
Sbjct: 223 SLGEIYSRVVREEQRLASVQIREQQQSAIGFLTRQSEVTADGRTDSSIIKSRDRSVLCSH 282

Query: 326 CNRQWHTRDTCWKLHGKPPNWKKK---GGK----------------EGRALQATTSDQEH 366
           C R  H +  CW++ G P  W ++   GG+                 GR     T+    
Sbjct: 283 CGRSGHEKKDCWQIVGFPDWWTERTNGGGRGSSSRGRGGRSSGSNNSGRGRGQVTAAHAT 342

Query: 367 QSSSSSFP-FTKEQLDQLYKMFGSQTPSCSIAQIGNFPNTALVSVKPSPTWIIDSGATDH 425
            S+ SSFP FT +QL  + +M  ++    S    G      +         I+D+GA+ H
Sbjct: 343 TSNLSSFPEFTPDQLRVITQMIQNKNNGTSDKLSGKMKLGDV---------ILDTGASHH 393

Query: 426 MTGESSLFA-----------------SYSPCAGNHK------------IKIADGSLSAIA 456
           MTG+ SL                   +++   G  K            +   + SL +++
Sbjct: 394 MTGQLSLLTNIVTIPSCSVGFADDRKTFAISMGTFKLSETVSLSNVLYVPALNCSLISVS 453

Query: 457 GK----DCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYL 492
                  C A F  + C+ +D  +  +IG+ ++  G+YYL
Sbjct: 454 KLVKQIKCLALFTDTICVLQDRFSRTLIGTGEERDGVYYL 493


>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|25301698|pir||C84512 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1501

 Score =  504 bits (1297), Expect = e-141
 Identities = 251/538 (46%), Positives = 365/538 (67%), Gaps = 2/538 (0%)

Query: 663  PMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWKIM 722
            P+  +VS +  SSS  A+ + ++    PK+ +EA++I  W +A+  E+ ALE NKTW I+
Sbjct: 965  PLTDYVSDAAFSSSHRAYLAAITDNVEPKHFKEAVQIKVWNDAMFTEVDALEINKTWDIV 1024

Query: 723  TLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTVRV 782
             LP GK  +G +WVF  KYNSD TVERYKARLV +G  Q  G DY ETFAPV ++ TVR 
Sbjct: 1025 DLPPGKVAIGSQWVFKTKYNSDGTVERYKARLVVQGNKQVEGEDYKETFAPVVRMTTVRT 1084

Query: 783  LLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSLYGLKQSP 842
            LL       W + Q+DV NAFL+GDL+EEVYM  PPGF       VC+L+KSLYGLKQ+P
Sbjct: 1085 LLRNVAANQWEVYQMDVHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRLRKSLYGLKQAP 1144

Query: 843  RAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDRLKK 902
            R WF+K + S+ + G++Q+  D++LF    N+ ++ +LI YVDD+++ G+D   + + K 
Sbjct: 1145 RCWFKKLSDSLLRFGFVQSYEDYSLFSYTRNNIELRVLI-YVDDLLICGNDGYMLQKFKD 1203

Query: 903  NLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPMELNA 962
             L++ F +KDLG LKYFLG+EV+R  +GI +SQRKY LD++ ++G  G RPA TP+E N 
Sbjct: 1204 YLSRCFSMKDLGKLKYFLGIEVSRGPEGIFLSQRKYALDVIADSGNLGSRPAHTPLEQNH 1263

Query: 963  KLW-EKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYRILR 1021
             L  + G +  D   Y+RLVG+L+YL HTRP++++SV V++QFM +P E H +A  R++R
Sbjct: 1264 HLASDDGPLLSDPKPYRRLVGRLLYLLHTRPELSYSVHVLAQFMQNPREAHFDAALRVVR 1323

Query: 1022 YLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQGVV 1081
            YLK +PG+G+      D  + ++ D+DW    + R+S + Y   + G+ ++W++KKQ  V
Sbjct: 1324 YLKGSPGQGILLNADPDLTLEVYCDSDWQSCPLTRRSISAYVVLLGGSPISWKTKKQDTV 1383

Query: 1082 ARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHDRTK 1141
            + SSAEAE+RAM+  + E+ W++KLL+EL ++   P +L+CDSKAAI IA NPV H+RTK
Sbjct: 1384 SHSSAEAEYRAMSYALKEIKWLRKLLKELGIEQSTPARLYCDSKAAIHIAANPVFHERTK 1443

Query: 1142 HIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAPT 1199
            HIE D H +++ +  G I   +V + EQ AD+ TK+L R  F  L+ KLG+ N++ PT
Sbjct: 1444 HIESDCHSVRDAVRDGIITTQHVRTTEQLADVFTKALGRNQFLYLMSKLGVQNLHTPT 1501



 Score =  139 bits (350), Expect = 6e-31
 Identities = 118/467 (25%), Positives = 193/467 (41%), Gaps = 80/467 (17%)

Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMES 160
           L+G NY +W+  +   L  K K GFI G I  P   DPNY  W + NS I+ W+ +++E 
Sbjct: 49  LNGDNYNQWATEMLNALQAKRKTGFINGTIPRPPPNDPNYENWTAVNSMIVGWIRTSIEP 108

Query: 161 TIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALW 220
            +K    F+  A  +W  +K  +S + N  RI  ++++L   +Q  + V  YY  L  LW
Sbjct: 109 KVKATVTFISDAHLLWKDLKQRFS-VGNKVRIHQIRAQLSSCRQDGQAVIEYYGRLSNLW 167

Query: 221 QELDL------CYDDNWRCTEDSVLFLKRQENDRVFVLLAGLNNC-LDEVRGRILGRIPL 273
           +E ++      C     RC   S    K +E +++   + GL+      +   ++   PL
Sbjct: 168 EEYNIYKPVTVCTCGLCRCGATSEP-TKEREEEKIHQFVLGLDESRFGGLCATLINMDPL 226

Query: 274 PTLQETFSEVRREEARQS---VMMGKSASI-----------------TESSALVTKGNEE 313
           P+L E +S V REE R +   V   K  ++                 + S +  T G+  
Sbjct: 227 PSLGEIYSRVIREEQRLASVHVREQKEEAVGFLARREQLDHHSRVDASSSRSEHTGGSRS 286

Query: 314 GKRDGKKPFCDHCNRQWHTRDTCWKLHGKPPNWKK--------------KGGKEGRALQA 359
                 +  C +C R  H +  CW++ G P  W +              +G   GR    
Sbjct: 287 NSIIKGRVTCSNCGRTGHEKKECWQIVGFPDWWSERNGGRGSNGRGRGGRGSNGGRGQGQ 346

Query: 360 TTSDQEHQSSSSSFP-FTKEQLDQLYKMFGSQTPSCSIAQIGNFPNTALVSVKPSPTWII 418
             +     S+SS FP FT+E +  L ++   ++ S S +   N  +  L         I+
Sbjct: 347 VMAAHATSSNSSVFPEFTEEHMRVLSQLVKEKSNSGSTS---NNNSDRLSGKTKLGDIIL 403

Query: 419 DSGATDHMTGESSLFASYSP--------CAGNHKIKIADGSLS----------------- 453
           DSGA+ HMTG  S   +  P          G+    ++ G L+                 
Sbjct: 404 DSGASHHMTGTLSSLTNVVPVPPCPVGFADGSKAFALSVGVLTLSNTVSLTNVLFVPSLN 463

Query: 454 --------AIAGKDCQANFFHSHCIFKDLNTGKMIGSAKKSGGLYYL 492
                    +    C A F  + C  +D ++  +IGS ++ GG+YYL
Sbjct: 464 CTLISVSKLLKQTQCLATFTDTLCFLQDRSSKTLIGSGEERGGVYYL 510


>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score:
            11.19) [Arabidopsis thaliana] gi|7486705|pir||T01879
            hypothetical protein F8M12.17 - Arabidopsis thaliana
          Length = 1633

 Score =  503 bits (1295), Expect = e-140
 Identities = 261/596 (43%), Positives = 386/596 (63%), Gaps = 34/596 (5%)

Query: 603  PILQPCQESE-PRNDPNHHPNPGKSSIPKCRGKSSSITTS-DDPDLHIPIAIRKPVRSCT 660
            PI +P + ++ P     +H N    S+P     S + +TS + P   IP     P +  T
Sbjct: 859  PIARPKRNAKAPAYLSEYHCN----SVPFLSSLSPTTSTSIETPSSSIP-----PKKITT 909

Query: 661  KHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTWK 720
             +PM+  +SY  L+  F ++    +    PK   +A+K  KW  A  EE+ ALE+NKTW 
Sbjct: 910  PYPMSTAISYDKLTPLFHSYICAYNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWI 969

Query: 721  IMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNTV 780
            + +L  GKN VGCKWVFT+KYN D ++ERYKARLVA+GFTQ  GIDY ETF+PVAK  +V
Sbjct: 970  VESLTEGKNVVGCKWVFTIKYNPDGSIERYKARLVAQGFTQQEGIDYMETFSPVAKFGSV 1029

Query: 781  RVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLN-----VCKLQKSL 835
            ++LL LA    W L Q+DV NAFL+G+L EE+YM  P G+    G++     VC+L KSL
Sbjct: 1030 KLLLGLAAATGWSLTQMDVSNAFLHGELDEEIYMSLPQGYTPPTGISLPSKPVCRLLKSL 1089

Query: 836  YGLKQSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIV 895
            YGLKQ+ R W+++ +       ++Q+ +D+T+F++ S    I +++VYVDD+++  +D  
Sbjct: 1090 YGLKQASRQWYKRLSSVFLGANFIQSPADNTMFVKVSCT-SIIVVLVYVDDLMIASNDSS 1148

Query: 896  EMDRLKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPAD 955
             ++ LK+ L  EF+IKDLG  ++FLG+E+ARS +GI V QRKY  +LLE+ G+SGC+P+ 
Sbjct: 1149 AVENLKELLRSEFKIKDLGPARFFLGLEIARSSEGISVCQRKYAQNLLEDVGLSGCKPSS 1208

Query: 956  TPMELNAKLW-EKGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLE 1014
             PM+ N  L  E G +  +   Y+ LVG+L+YL  TRPDI F+V  +SQF+ +P + H++
Sbjct: 1209 IPMDPNLHLTKEMGTLLPNATSYRELVGRLLYLCITRPDITFAVHTLSQFLSAPTDIHMQ 1268

Query: 1015 AVYRILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWR 1074
            A +++LRYLK NPG+                DADW      R+S TG+C Y+  +L+TW+
Sbjct: 1269 AAHKVLRYLKGNPGQ----------------DADWGTCKDSRRSVTGFCIYLGTSLITWK 1312

Query: 1075 SKKQGVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNP 1134
            SKKQ VV+RSS E+E+R++AQ  CE++W+Q+LL++L + +  P KLFCD+K+A+ +A NP
Sbjct: 1313 SKKQSVVSRSSTESEYRSLAQATCEIIWLQQLLKDLHVTMTCPAKLFCDNKSALHLATNP 1372

Query: 1135 VQHDRTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKL 1190
            V H+RTKHIEID H ++++I +G +   +V +  Q ADILTK L    F  L+ ++
Sbjct: 1373 VFHERTKHIEIDCHTVRDQIKAGKLKTLHVPTGNQLADILTKPLHPGPFHSLLKRI 1428



 Score =  141 bits (356), Expect = 1e-31
 Identities = 113/452 (25%), Positives = 196/452 (43%), Gaps = 82/452 (18%)

Query: 105 NYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMESTIKK 164
           ++  W +++ + L+ + KLGFI G I  P     +Y  W   N T+  WL++++   I +
Sbjct: 57  DFHSWRRSIWMALNVRNKLGFIDGTIVKPPLDHRDYGAWSRCNDTVSTWLMNSVSKKIGQ 116

Query: 165 PYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALWQE-- 222
             +F+PTA+ +W  + + +    ++ R++D++ RL + +Q   D++ YY EL  LW+E  
Sbjct: 117 SLLFIPTAEGIWKNMLSRFKQ-DDAPRVYDIEQRLSKIEQGSMDISAYYTELQTLWEEHK 175

Query: 223 --LDLCYDDNWRCTEDSVLFLKR-QENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQET 279
             +DL      RC  D+ +  +R Q+   V   L GLN   ++ R  IL   P+ T++E 
Sbjct: 176 NYVDLPVCTCGRCECDAAVKWERLQQRSHVTKFLMGLNESYEQTRRHILMLKPIRTIEEA 235

Query: 280 FSEVRREEARQSVMMGKSASITESSALVTKGNEEGKRDGKKPFCDHCNRQWHTRDTCWKL 339
           F+ V ++E ++++    +  +     L            K P C +C +  HT   C+K+
Sbjct: 236 FNIVTQDERQKAIR--PTPKVDNQDQL------------KLPLCTNCGKVGHTVQKCYKI 281

Query: 340 HGKPPNWKKKGGKEGRALQATTSDQEHQSSSSSFPFTKEQLDQLYKMFGSQ--------- 390
            G PP +K         +Q     Q  Q S    P  ++ +  L   F +Q         
Sbjct: 282 IGYPPGYKAATSYRQPQIQTQPRMQMPQQSQ---PRMQQPIQHLISQFNAQVRVQEPAAT 338

Query: 391 -----TPSCSIAQIG-----------NFPNT-----------------ALVSVKPSPTWI 417
                +P+ +I + G            FP+T                 +L +V  S  WI
Sbjct: 339 SIYTSSPTATITEHGLMAQTSTSGTIPFPSTSLKYENNNLTFQNHTLSSLQNVLSSDAWI 398

Query: 418 IDSGATDHMTGESSLFASYSPCAGNHKIKIADGSLSAI--AGKDCQANFFHSHCI----- 470
           IDSGA+ H+  + ++F      +G   + + +G+  AI   G  C  +    H +     
Sbjct: 399 IDSGASSHVCSDLTMFRELIHVSG-VTVTLPNGTRVAITHTGTICITSTLILHNVLLVPD 457

Query: 471 FK---------DLNTGKMIGSAKKSGGLYYLD 493
           FK         +L  G MIG  K    LY L+
Sbjct: 458 FKFNLISVCCLELTRGLMIGRGKTYNNLYILE 489


>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
            gi|7444418|pir||T00499 probable retroelement pol
            polyprotein [imported] - Arabidopsis thaliana
          Length = 1496

 Score =  500 bits (1288), Expect = e-139
 Identities = 249/541 (46%), Positives = 370/541 (68%), Gaps = 3/541 (0%)

Query: 660  TKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTW 719
            T +P++ F++ S  S++  AF + +     PK+ ++A+ I +W EA+ +E+ ALE N TW
Sbjct: 957  TLYPLSDFLTNSGYSANHIAFMAAILDSNEPKHFKDAILIKEWCEAMSKEIDALEANHTW 1016

Query: 720  KIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNT 779
             I  LP GK  +  KWV+ +KYNSD T+ER+KARLV  G  Q  G+D+ ETFAPVAKL T
Sbjct: 1017 DITDLPHGKKAISSKWVYKLKYNSDGTLERHKARLVVMGNHQKEGVDFKETFAPVAKLTT 1076

Query: 780  VRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSLYGLK 839
            VR +L++A   DW ++Q+DV NAFL+GDL+EEVYM  PPGF+      VC+L+KSLYGLK
Sbjct: 1077 VRTILAVAAAKDWEVHQMDVHNAFLHGDLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLK 1136

Query: 840  QSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDR 899
            Q+PR WF K + +++  G+ Q+  D++LF    N   I  ++VYVDD+I+ G+++  +DR
Sbjct: 1137 QAPRCWFSKLSTALRNIGFTQSYEDYSLF-SLKNGDTIIHVLVYVDDLIVAGNNLDAIDR 1195

Query: 900  LKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPME 959
             K  L K F +KDLG LKYFLG+EV+R   G  +SQRKY LD+++ETG+ GC+P+  P+ 
Sbjct: 1196 FKSQLHKCFHMKDLGKLKYFLGLEVSRGPDGFCLSQRKYALDIVKETGLLGCKPSAVPIA 1255

Query: 960  LNAKLWE-KGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYR 1018
            LN KL    G V  +  +Y+RLVG+ IYL  TRPD++++V ++SQFM +P   H EA  R
Sbjct: 1256 LNHKLASITGPVFTNPEQYRRLVGRFIYLTITRPDLSYAVHILSQFMQAPLVAHWEAALR 1315

Query: 1019 ILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQ 1078
            ++RYLK +P +G++ +  +   ++ + D+D+    + R+S + Y  Y+  + ++W++KKQ
Sbjct: 1316 LVRYLKGSPAQGIFLRSDSSLIINAYCDSDYNACPLTRRSLSAYVVYLGDSPISWKTKKQ 1375

Query: 1079 GVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHD 1138
              V+ SSAEAE+RAMA  + EL W++ LL++L +    P+KL CDS+AAI IA NPV H+
Sbjct: 1376 DTVSYSSAEAEYRAMAYTLKELKWLKALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHE 1435

Query: 1139 RTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAP 1198
            RTKHIE D H +++ ++   I   ++ + +Q AD+LTKSL RP FERL+  LG+T+ Y P
Sbjct: 1436 RTKHIESDCHKVRDAVLDKLITTEHIYTEDQVADLLTKSLPRPTFERLLSTLGVTD-YVP 1494

Query: 1199 T 1199
            +
Sbjct: 1495 S 1495



 Score =  151 bits (382), Expect = 1e-34
 Identities = 107/360 (29%), Positives = 164/360 (44%), Gaps = 39/360 (10%)

Query: 101 LDGKNYMEWSQTVRLILDGKGKLGFITGEIQMPSTTDPNYRFWKSENSTIIAWLLSTMES 160
           L   NY EWS+ ++  L  K KLGFI G I  P+  DP    W + NS I+ W+ ++++ 
Sbjct: 39  LKENNYAEWSEELQNFLRAKQKLGFIDGSIPKPAA-DPELSLWIAINSMIVGWIRTSIDP 97

Query: 161 TIKKPYMFLPTAKNVWDAVKATYSDIQNSSRIFDLKSRLWQAKQSDRDVTTYYNELMALW 220
           TI+    F+  A  +W+ ++  +S + N  R   LK  +    Q  + V  YY  L+ LW
Sbjct: 98  TIRSTVGFVSEASQLWENLRRRFS-VGNGVRKTLLKDEIAACTQDGQPVLAYYGRLIKLW 156

Query: 221 QELDLCYDDNWRCTEDSVLFL-KRQENDRVFVLLAGLNNCLDEVRGRILGRIPLPTLQET 279
           +EL   Y     C  ++   + K +E+DRV   L GL++    +R  I    PLP L + 
Sbjct: 157 EELQN-YKSGRECKCEAASDIEKEREDDRVHKFLLGLDSRFSSIRSSITDIEPLPDLYQV 215

Query: 280 FSEVRREEARQSVMMGKSASITESSALVTKGNEEGK-RDGKKPFCDHCNRQWHTRDTCWK 338
           +S V REE   +    K    TE+     + +   + RD    FC HCNR+ H    C+ 
Sbjct: 216 YSRVVREEQNLNASRTKDVVKTEAIGFSVQSSTTPRFRDKSTLFCTHCNRKGHEVTQCFL 275

Query: 339 LHGKPPNWKKKGGKE------GRALQATTSDQEHQSSSSSFPFTK--------------- 377
           +HG P  W ++  +E      GR      S      + SS P T+               
Sbjct: 276 VHGYPDWWLEQNPQENQPSTRGRGSNGRGSSSGRGGNRSSAPTTRGRGRANNAQAAAPTV 335

Query: 378 -----EQLDQLYKMFGSQTPSCSIAQIGNFPNTALVSVKPSPTWIIDSGATDHMTGESSL 432
                +Q+ QL  +  +Q PS S  ++    NT L         +ID+GA+ HMTG+ S+
Sbjct: 336 SGDGNDQIAQLISLLQAQRPSSSSERLSG--NTCLTD------GVIDTGASHHMTGDCSI 387


>gb|AAR24647.1| At2g23330 [Arabidopsis thaliana] gi|22655202|gb|AAM98191.1| unknown
            protein [Arabidopsis thaliana]
          Length = 776

 Score =  500 bits (1288), Expect = e-139
 Identities = 249/541 (46%), Positives = 370/541 (68%), Gaps = 3/541 (0%)

Query: 660  TKHPMAKFVSYSNLSSSFAAFTSQLSTVEIPKNVQEALKIPKWKEAVLEEMRALEKNKTW 719
            T +P++ F++ S  S++  AF + +     PK+ ++A+ I +W EA+ +E+ ALE N TW
Sbjct: 237  TLYPLSDFLTNSGYSANHIAFMAAILDSNEPKHFKDAILIKEWCEAMSKEIDALEANHTW 296

Query: 720  KIMTLPAGKNTVGCKWVFTVKYNSDNTVERYKARLVAKGFTQAYGIDYSETFAPVAKLNT 779
             I  LP GK  +  KWV+ +KYNSD T+ER+KARLV  G  Q  G+D+ ETFAPVAKL T
Sbjct: 297  DITDLPHGKKAISSKWVYKLKYNSDGTLERHKARLVVMGNHQKEGVDFKETFAPVAKLTT 356

Query: 780  VRVLLSLAVNLDWPLNQLDVKNAFLNGDLQEEVYMDSPPGFEDKFGLNVCKLQKSLYGLK 839
            VR +L++A   DW ++Q+DV NAFL+GDL+EEVYM  PPGF+      VC+L+KSLYGLK
Sbjct: 357  VRTILAVAAAKDWEVHQMDVHNAFLHGDLEEEVYMRLPPGFKCSDPSKVCRLRKSLYGLK 416

Query: 840  QSPRAWFEKFTWSVKKQGYMQAQSDHTLFMRFSNDGKIAILIVYVDDIILTGDDIVEMDR 899
            Q+PR WF K + +++  G+ Q+  D++LF    N   I  ++VYVDD+I+ G+++  +DR
Sbjct: 417  QAPRCWFSKLSTALRNIGFTQSYEDYSLF-SLKNGDTIIHVLVYVDDLIVAGNNLDAIDR 475

Query: 900  LKKNLAKEFEIKDLGALKYFLGMEVARSRKGIVVSQRKYILDLLEETGMSGCRPADTPME 959
             K  L K F +KDLG LKYFLG+EV+R   G  +SQRKY LD+++ETG+ GC+P+  P+ 
Sbjct: 476  FKSQLHKCFHMKDLGKLKYFLGLEVSRGPDGFCLSQRKYALDIVKETGLLGCKPSAVPIA 535

Query: 960  LNAKLWE-KGNVPVDIGRYQRLVGKLIYLAHTRPDIAFSVSVVSQFMHSPYEEHLEAVYR 1018
            LN KL    G V  +  +Y+RLVG+ IYL  TRPD++++V ++SQFM +P   H EA  R
Sbjct: 536  LNHKLASITGPVFTNPEQYRRLVGRFIYLTITRPDLSYAVHILSQFMQAPLVAHWEAALR 595

Query: 1019 ILRYLKSNPGKGLYFKKTNDRDVSIFTDADWAGSVIDRKSTTGYCAYVWGNLVTWRSKKQ 1078
            ++RYLK +P +G++ +  +   ++ + D+D+    + R+S + Y  Y+  + ++W++KKQ
Sbjct: 596  LVRYLKGSPAQGIFLRSDSSLIINAYCDSDYNACPLTRRSLSAYVVYLGDSPISWKTKKQ 655

Query: 1079 GVVARSSAEAEFRAMAQGICELLWIQKLLEELKLKIDLPLKLFCDSKAAISIAHNPVQHD 1138
              V+ SSAEAE+RAMA  + EL W++ LL++L +    P+KL CDS+AAI IA NPV H+
Sbjct: 656  DTVSYSSAEAEYRAMAYTLKELKWLKALLKDLGVHHSSPMKLHCDSEAAIHIAANPVFHE 715

Query: 1139 RTKHIEIDRHFIKEKIVSGTICLPYVTSNEQTADILTKSLARPNFERLIVKLGMTNIYAP 1198
            RTKHIE D H +++ ++   I   ++ + +Q AD+LTKSL RP FERL+  LG+T+ Y P
Sbjct: 716  RTKHIESDCHKVRDAVLDKLITTEHIYTEDQVADLLTKSLPRPTFERLLSTLGVTD-YVP 774

Query: 1199 T 1199
            +
Sbjct: 775  S 775


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.315    0.133    0.399 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,171,928,404
Number of Sequences: 2540612
Number of extensions: 102696103
Number of successful extensions: 993617
Number of sequences better than 10.0: 7121
Number of HSP's better than 10.0 without gapping: 5365
Number of HSP's successfully gapped in prelim test: 1998
Number of HSP's that attempted gapping in prelim test: 651344
Number of HSP's gapped (non-prelim): 97214
length of query: 1199
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1059
effective length of database: 507,674,714
effective search space: 537627522126
effective search space used: 537627522126
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 81 (35.8 bits)


Medicago: description of AC145330.14