
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146527.6 - phase: 0 /pseudo
(1019 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum] 792 0.0
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ... 699 0.0
emb|CAC95126.1| gag-pol polyprotein [Populus deltoides] 670 0.0
gb|AAT40550.1| putative receptor kinase [Solanum demissum] 620 e-176
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi... 597 e-169
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi... 589 e-166
gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsi... 586 e-165
pir||G86301 probable retroelement polyprotein [imported] - Arabi... 584 e-165
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop... 580 e-163
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja... 578 e-163
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha... 570 e-160
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas... 567 e-160
emb|CAB10526.1| retrotransposon like protein [Arabidopsis thalia... 566 e-160
gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum] 558 e-157
gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cult... 547 e-154
gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25... 531 e-149
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] gi... 531 e-149
emb|CAB75932.1| putative protein [Arabidopsis thaliana] gi|11278... 530 e-148
gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis tha... 530 e-148
gb|AAF16534.1| T26F17.17 [Arabidopsis thaliana] 524 e-147
>gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]
Length = 1212
Score = 792 bits (2045), Expect = 0.0
Identities = 398/635 (62%), Positives = 485/635 (75%), Gaps = 18/635 (2%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GEYMS+ F++FL GI+SQ SCP TP QNGVAERKNRHLLDV RTLL+ES VPS++W E
Sbjct: 577 GEYMSYEFKKFLLDKGIVSQHSCPYTPQQNGVAERKNRHLLDVTRTLLIESSVPSKYWVE 636
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
ALSTAV+LINR+PS + ESP+ RLY PNYS FGCVC+VHLPP + K + QS
Sbjct: 637 ALSTAVYLINRLPSKVLNLESPYFRLYHQNPNYSDFHTFGCVCFVHLPPSQCNKLSVQST 696
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSH 457
+CAF+GYS QKGF+CYDP + R+SRNV+F EN+YFF + DL SS +LP F D
Sbjct: 697 KCAFMGYSTSQKGFICYDPCSHKFRISRNVVFFENQYFFPTIVDL-SSVSPLLPTFEDLS 755
Query: 458 SRQQPSKPLLTYKRRSTA-----THGPPQ-------DNSLVAGPVEEPAPLRRSSRESKP 505
S + KP Y+RR T PP+ +NS +GP+E P RRS+R S+
Sbjct: 756 SSFKRFKPGFVYERRRPTLPYPNTDPPPETAPQLESENSSRSGPLE---PTRRSTRVSRT 812
Query: 506 PERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLG 565
P Y ++TLS+I +PS Y QA +++CWQKA+E ELLAL+EN TWDIV CPS+V+P+G
Sbjct: 813 PNWY--GFSSTLSNISVPSCYSQASKHECWQKAMEEELLALKENDTWDIVSCPSNVRPIG 870
Query: 566 SKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAW 625
K+V+SIKL SDG++DRYKA LVVLGN+QEYG+DY+ETFAPVAKMTTVRTI+AIAASQ W
Sbjct: 871 CKWVYSIKLHSDGTLDRYKARLVVLGNRQEYGVDYEETFAPVAKMTTVRTIIAIAASQNW 930
Query: 626 PLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRST 685
L+Q DVKNAFLHGDL+E++Y+K P + + + VCKLKRSLYGLKQAPR WF+KFRST
Sbjct: 931 SLYQKDVKNAFLHGDLKEDIYMKPPPDLFSSPTSDVCKLKRSLYGLKQAPRAWFDKFRST 990
Query: 686 LLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKEL 745
LL F F S+YD SLFL++T V+LLVYVDDI++TG+D I+ ++ L +FHMK+L
Sbjct: 991 LLQFSFELSKYDSSLFLRKTSTSCVLLLVYVDDIIITGTDSSLITCLQQQLKDSFHMKDL 1050
Query: 746 GRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLD 805
G LTYFLGLEVH GVFLNQ KY QDL+ LAGL ++ VDTP+E+NVKYRR+EGD L
Sbjct: 1051 GTLTYFLGLEVHNVASGVFLNQHKYTQDLISLAGLQVSSSVDTPLEMNVKYRREEGDLLP 1110
Query: 806 DPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLF 865
DPT +R+LVGSL Y+TITRPDISFAV VS+FMQAPRH HL AV IIRYLLGT RGLF
Sbjct: 1111 DPTIFRQLVGSLNYLTITRPDISFAVQQVSQFMQAPRHLHLVAVCHIIRYLLGTSTRGLF 1170
Query: 866 FPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLG 900
FP GS I+L A+SD+DWAGCPDTR+S +GWCMFLG
Sbjct: 1171 FPSGSPIRLNAFSDSDWAGCPDTRRSVSGWCMFLG 1205
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 699 bits (1804), Expect = 0.0
Identities = 373/800 (46%), Positives = 503/800 (62%), Gaps = 62/800 (7%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GEYMS++F+EFL S G + Q SCP QNGVAERK+RH+++ RTLL+ S VP+ FW E
Sbjct: 453 GEYMSNAFREFLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFWAE 512
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
A+STAV+LIN PS S+ SP L+G PP Y LRVFGC CYV L P+ERTK TAQSV
Sbjct: 513 AISTAVYLINMQPSSSLQGRSPGEVLFGSPPRYDHLRVFGCTCYVLLAPRERTKLTAQSV 572
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFY--- 454
EC FLGYS KG+ CYDP+ RRIR+SR+V F ENK FF S + SSP + + Y
Sbjct: 573 ECVFLGYSLEHKGYRCYDPSARRIRISRDVTFDENKPFFYSSTNQPSSPENSISFLYLPP 632
Query: 455 -------------DSHSRQQPSKPLLTY------KRRSTATHGPPQDNSLVAGPVEEPA- 494
S S PS P TY + PP + P P+
Sbjct: 633 IPSPESLPSSPITPSPSPIPPSVPSPTYVPPPPPSPSPSPVSPPPSHIPASSSPPHVPST 692
Query: 495 ------PLRRSSR-----ESKPPERYINCMTATLS-SIPIP------------------- 523
P S R ES+P + + T ++ S P P
Sbjct: 693 ITLDTFPFHYSRRPKIPNESQPSQPTLEDPTCSVDDSSPAPRYNLRARDALRAPNRDDFV 752
Query: 524 -------SSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLRS 576
S+Y++A+ W+ A+ EL ALE TWD+VP PS P+ K+V+ +K +S
Sbjct: 753 VGVVFEPSTYQEAIVLPHWKLAMSEELAALERTNTWDVVPLPSHAVPITCKWVYKVKTKS 812
Query: 577 DGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNAF 636
DG ++RYKA LV G +Q +G DYDETFAPVA MTTVRT++A+AA+++W + QMDVKNAF
Sbjct: 813 DGQVERYKARLVARGFQQAHGRDYDETFAPVAHMTTVRTLIAVAATRSWTISQMDVKNAF 872
Query: 637 LHGDLQEEVYIKLPNGMPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQSRY 696
LHGDL EEVY+ P G+ P P V +L+R+LYGLKQAPR WF +F S +L FS S +
Sbjct: 873 LHGDLHEEVYMHPPPGVEAP-PGHVFRLRRALYGLKQAPRAWFARFSSVVLAAGFSPSDH 931
Query: 697 DPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGLEV 756
DP+LF+ + +G +LL+YVDD+++TG D + I+ +K L F M +LG L+YFLG+EV
Sbjct: 932 DPALFIHTSSRGRTLLLLYVDDMLITGDDLEYIAFVKGKLSEQFMMSDLGPLSYFLGIEV 991
Query: 757 HYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLVGS 816
+G +L+Q +YI+DL+ +GLT++ TPME++V+ R +G LDDP++YR LVGS
Sbjct: 992 TSTVDGYYLSQHRYIEDLLAQSGLTDSRTTTTPMELHVRLRSTDGTPLDDPSRYRHLVGS 1051
Query: 817 LIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKLQA 876
L+Y+T+TRPDI++AVH +S+F+ AP H + +++RYL GT + LF+ S ++L+A
Sbjct: 1052 LVYLTVTRPDIAYAVHILSQFVSAPISVHYGHLLRVLRYLRGTTTQCLFYAASSPLQLRA 1111
Query: 877 YSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEIIWL 936
+SD+ WA P R+S TG+C+FLG + ++WK KKQ +VS+SSTEAE RA++ SEI+WL
Sbjct: 1112 FSDSTWASDPIDRRSVTGYCIFLGTSLLTWKSKKQTAVSRSSTEAELRALATTTSEIVWL 1171
Query: 937 RGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINLPH 996
R LL + G S D PTPL DNT AIQIA +P+ HE TKHI VD R + I L +
Sbjct: 1172 RWLLADFGVSCDVPTPLLCDNTGAIQIANDPIKHELTKHIGVDASFTRSHCQQSTIALHY 1231
Query: 997 VSTSVQTADIFTKSLTRQRH 1016
V + +Q AD FTK+ TR+ H
Sbjct: 1232 VPSELQVADFFTKAQTREHH 1251
>emb|CAC95126.1| gag-pol polyprotein [Populus deltoides]
Length = 1382
Score = 670 bits (1728), Expect = 0.0
Identities = 359/763 (47%), Positives = 494/763 (64%), Gaps = 22/763 (2%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GEY S+ F + L +G I Q SC TP QNGVAERK+RH+++ R+LLL + V S FW E
Sbjct: 609 GEYTSNKFCQMLALDGTIHQTSCTDTPEQNGVAERKHRHIVETARSLLLSAFVLSEFWGE 668
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
A+ TAV LIN +PS SPF +LYGH P+YS+ RVFGC +V P ER K +++S
Sbjct: 669 AVLTAVSLINTIPSSHSSGLSPFEKLYGHVPDYSSFRVFGCTYFVLHPHVERNKLSSRSA 728
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFA---SHHDLVSSP-ISILPLF 453
C FLGY +KG+ C+DP +++ VS +V+F E+ FF+ + H L S I I P
Sbjct: 729 ICVFLGYGEGKKGYRCFDPITQKLYVSHHVVFLEHIPFFSIPSTTHSLTKSDLIHIDPFS 788
Query: 454 YDSHSRQQPS-KPLLTYKRRSTAT--HGPPQDNSLVAGP-----VEEPAPLR--RSSRES 503
DS + P + + T+ T T G P+ + P + +P P + R + +
Sbjct: 789 EDSGNDTSPYVRSICTHNSAGTGTLLSGTPEASFSSTAPQASSEIVDPPPRQSIRIRKST 848
Query: 504 KPPERYINCMTATLSSIPI-------PSSYKQAMENDCWQKAIESELLALEENQTWDIVP 556
K P+ +C +++ +S PSSYK+A+ + Q+A++ EL AL + TWD+VP
Sbjct: 849 KLPDFAYSCYSSSFTSFLAYIHCLFEPSSYKEAILDPLGQQAMDEELSALHKTDTWDLVP 908
Query: 557 CPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTI 616
P +G ++V+ IK SDGSI+RYKA LV G Q+YG+DY+ETFAP+AKMTT+RT+
Sbjct: 909 LPPGKSVVGCRWVYKIKTNSDGSIERYKARLVAKGYSQQYGMDYEETFAPIAKMTTIRTL 968
Query: 617 LAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPSPNTVCKLKRSLYGLKQAPR 676
+A+A+ + W + Q+DVKNAFL+GDLQEEVY+ P G+ S VCKLK++LYGLKQAPR
Sbjct: 969 IAVASIRQWHISQLDVKNAFLNGDLQEEVYMAPPPGISHDS-GYVCKLKKALYGLKQAPR 1027
Query: 677 VWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLL 736
WFEKF + F S +D +LF++ T G ++L +YVDD+++TG D D IS +K L
Sbjct: 1028 AWFEKFSIVISSLGFVSSSHDSALFIKCTDAGRIILSLYVDDMIITGDDIDGISVLKTEL 1087
Query: 737 HSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKY 796
F MK+LG L YFLG+EV Y G L+Q KY+ ++++ A LT+ VDTP+EVN +Y
Sbjct: 1088 ARRFEMKDLGYLRYFLGIEVAYSPRGYLLSQSKYVANILERARLTDNKTVDTPIEVNARY 1147
Query: 797 RRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYL 856
+G L DPT YR +VGSL+Y+TIT PDI++AVH VS+F+ +P H +AV +I+RYL
Sbjct: 1148 SSSDGLPLIDPTLYRTIVGSLVYLTITHPDIAYAVHVVSQFVASPTTIHWAAVLRILRYL 1207
Query: 857 LGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSK 916
GT+ + L SS++L+AYSDAD P RKS TG+C+FLG++ ISWK KKQ VS+
Sbjct: 1208 RGTVFQSLLLSSTSSLELRAYSDADHGSDPTDRKSVTGFCIFLGDSLISWKSKKQSIVSQ 1267
Query: 917 SSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHI 976
SSTEAEY AM++ EI+W R LL ++G S TP++ DN S+IQIA N V+HERTKHI
Sbjct: 1268 SSTEAEYCAMASTTKEIVWSRWLLADMGISFSHLTPMYCDNQSSIQIAHNSVFHERTKHI 1327
Query: 977 EVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQRHNFL 1019
E+DCH R I LP V +S+Q AD FTK+ + R FL
Sbjct: 1328 EIDCHLTRHHLKHGTIALPFVPSSLQIADFFTKAHSISRFCFL 1370
>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
Length = 1358
Score = 620 bits (1598), Expect = e-176
Identities = 339/759 (44%), Positives = 477/759 (62%), Gaps = 44/759 (5%)
Query: 279 EYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEA 338
EY+S F+EF+ GII Q +CP TP QNGVAERKNRHL++ RTLLLES+VP RFW +A
Sbjct: 613 EYLSSQFREFMTHQGIIHQTTCPYTPQQNGVAERKNRHLIETARTLLLESNVPLRFWGDA 672
Query: 339 LSTAVHLINRMPSPSIGNESPFT------RLYGHPPNYSTLRVFGCVCYVHLPPQERTKF 392
+ T+ +LINRMPS SI N+ P + LY PP RVFG C+VH + K
Sbjct: 673 VLTSCYLINRMPSSSIQNQVPHSILFPQSHLYPIPP-----RVFGSTCFVHNLAPGKDKL 727
Query: 393 TAQSVECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENK-YFFASHHDLVSSPISI-- 449
++++C FLGYS QKG+ CY +L R +S +V F E++ Y+ +S+H VS + I
Sbjct: 728 APRALKCVFLGYSRVQKGYRCYSHDLHRYLMSADVTFFESQPYYTSSNHPDVSMVLPIPQ 787
Query: 450 ---LPLFYDSHSRQQPS---KPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES 503
+P F +S PLLTY RR T P D+S A +PAP + +
Sbjct: 788 VLPVPTFVESTVTSTSPVVVPPLLTYHRRPRPTLVP--DDSCHA---PDPAP----TADL 838
Query: 504 KPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKP 563
PP + P+ +A+ + W++A+ E+ AL ++ TW++V P+
Sbjct: 839 PPPSQ------------PLALQKGEALSHSGWRQAMVDEMSALHKSGTWELVSLPAGKST 886
Query: 564 LGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQ 623
+G ++V+++K+ DG +DR KA LV G Q +GLDY +TFAPVAK+ +VR L++AA +
Sbjct: 887 VGCRWVYAVKIGPDGQVDRLKARLVAKGYTQIFGLDYSDTFAPVAKIASVRLFLSMAAVR 946
Query: 624 AWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTP--SPNTVCKLKRSLYGLKQAPRVWFEK 681
WPLHQ+D+KNAFLHGDL+EEVY++ P G S + VC+L+RSLYGLKQ+PR WF K
Sbjct: 947 HWPLHQLDIKNAFLHGDLEEEVYMEQPPGFVAQGESSSLVCRLRRSLYGLKQSPRAWFGK 1006
Query: 682 FRSTLLGFEFSQSRYDPSLFLQRT-PKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTF 740
F + + F ++S D S+F + + P + L+VYVDDIV+TG+DQD I+ +K L F
Sbjct: 1007 FSTVIQEFGMTRSGADHSVFYRHSAPSRCIYLVVYVDDIVITGNDQDGITDLKQHLFKHF 1066
Query: 741 HMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDE 800
K+LGRL YFLG+EV G+ ++Q+KY D+++ G+ VDTPM+ NVK +
Sbjct: 1067 QTKDLGRLKYFLGIEVAQSRSGIVISQRKYALDILEETGMMGCRPVDTPMDPNVKLLPGQ 1126
Query: 801 GDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTL 860
G+ L +P +YR+LVG L Y+T+TRPDISF V VS+FM +P H AV +I+RY+
Sbjct: 1127 GEPLSNPERYRRLVGKLNYLTVTRPDISFPVSVVSQFMTSPCDSHWEAVVRILRYIKSAP 1186
Query: 861 KRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTE 920
+GL F + Y+DADWAG P R+ST+G+C+ +G +SWK KKQ+ V++SS E
Sbjct: 1187 GKGLLFEDQGHEHIIGYTDADWAGSPSDRRSTSGYCVLVGGNLVSWKSKKQNVVARSSAE 1246
Query: 921 AEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDC 980
+EYRAM+ A E++W++ LL EL F + L DN +A+ IA+NPV+HERTKHIE+DC
Sbjct: 1247 SEYRAMATATCELVWIKQLLGELKFGKVDKMELVCDNQAALHIASNPVFHERTKHIEIDC 1306
Query: 981 HSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQRHNFL 1019
H +RE I V ++ Q ADIFTKSLT R N++
Sbjct: 1307 HFVREKILSGDIVTKFVKSNDQLADIFTKSLTCPRINYI 1345
>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301701|pir||E84589 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1461
Score = 597 bits (1539), Expect = e-169
Identities = 318/745 (42%), Positives = 461/745 (61%), Gaps = 22/745 (2%)
Query: 284 SFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAV 343
+F EF ++ GI+S SCP TP QN V ERK++H+L+V R L+ +S++ +W + + TAV
Sbjct: 700 AFTEFYKAKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYWGDCVLTAV 759
Query: 344 HLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLG 403
LINR PS + N++PF L G P+YS L+ FGC+CY ++R KF +S C FLG
Sbjct: 760 FLINRTPSALLSNKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRACVFLG 819
Query: 404 YSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISIL----PLFYDSHSR 459
Y KG+ D + +SRNV F E + AS ++ + PL +
Sbjct: 820 YPFGFKGYKLLDLESNVVHISRNVEFHEELFPLASSQQSATTASDVFTPMDPLSSGNSIT 879
Query: 460 QQPSKPLLT-----YKRRSTATHGPPQDNSLVAGPVEEPAPLRRS---SRESKPPERYIN 511
P ++ KRR T QD ++ P+ S S+ S YIN
Sbjct: 880 SHLPSPQISPSTQISKRRITKFPAHLQDYHCYFVNKDDSHPISSSLSYSQISPSHMLYIN 939
Query: 512 CMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFS 571
+S IPIP SY +A ++ W AI+ E+ A+E TW+I P K +G K+VF+
Sbjct: 940 ----NISKIPIPQSYHEAKDSKEWCGAIDQEIGAMERTDTWEITSLPPGKKAVGCKWVFT 995
Query: 572 IKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMD 631
+K +DGS++R+KA +V G Q+ GLDY ETF+PVAKM TV+ +L ++AS+ W L+Q+D
Sbjct: 996 VKFHADGSLERFKARIVAKGYTQKEGLDYTETFSPVAKMATVKLLLKVSASKKWYLNQLD 1055
Query: 632 VKNAFLHGDLQEEVYIKLPNGMP-----TPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTL 686
+ NAFL+GDL+E +Y+KLP+G + PN VC+LK+S+YGLKQA R WF KF ++L
Sbjct: 1056 ISNAFLNGDLEETIYMKLPDGYADIKGTSLPPNVVCRLKKSIYGLKQASRQWFLKFSNSL 1115
Query: 687 LGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELG 746
L F + D +LF++ +VLLVYVDDIV+ + + A + L ++F ++ELG
Sbjct: 1116 LALGFEKQHGDHTLFVRCIGSEFIVLLVYVDDIVIASTTEQAAQSLTEALKASFKLRELG 1175
Query: 747 RLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDD 806
L YFLGLEV EG+ L+Q+KY +L+ A + + PM N++ +++G L+D
Sbjct: 1176 PLKYFLGLEVARTSEGISLSQRKYALELLTSADMLDCKPSSIPMTPNIRLSKNDGLLLED 1235
Query: 807 PTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFF 866
YR+LVG L+Y+TITRPDI+FAV+ + +F APR HL+AV ++++Y+ GT+ +GLF+
Sbjct: 1236 KEMYRRLVGKLMYLTITRPDITFAVNKLCQFSSAPRTAHLAAVYKVLQYIKGTVGQGLFY 1295
Query: 867 PVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAM 926
+ L+ Y+DADW CPD+R+STTG+ MF+G++ ISW+ KKQ +VS+SS EAEYRA+
Sbjct: 1296 SAEDDLTLKGYTDADWGTCPDSRRSTTGFTMFVGSSLISWRSKKQPTVSRSSAEAEYRAL 1355
Query: 927 SAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREA 986
+ A E+ WL LL L P L++D+T+A+ IA NPV+HERTKHIE+DCH++RE
Sbjct: 1356 ALASCEMAWLSTLLLALRVHSGVPI-LYSDSTAAVYIATNPVFHERTKHIEIDCHTVREK 1414
Query: 987 YDRRIINLPHVSTSVQTADIFTKSL 1011
D + L HV T Q ADI TK L
Sbjct: 1415 LDNGQLKLLHVKTKDQVADILTKPL 1439
>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301694|pir||E84535 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1454
Score = 589 bits (1519), Expect = e-166
Identities = 319/761 (41%), Positives = 462/761 (59%), Gaps = 52/761 (6%)
Query: 285 FQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAVH 344
F F GI+S SCP TP QN V ERK++H+L+V R L+ +S VP W + + TAV
Sbjct: 690 FTSFYAEKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSQVPLSLWGDCVLTAVF 749
Query: 345 LINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLGY 404
LINR PS + N++P+ L G P Y LR FGC+CY P++R KF +S C FLGY
Sbjct: 750 LINRTPSQLLMNKTPYEILTGTAPVYEQLRTFGCLCYSSTSPKQRHKFQPRSRACLFLGY 809
Query: 405 SPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSHSRQQPSK 464
KG+ D + +SRNV F E + A + SS L LF P
Sbjct: 810 PSGYKGYKLMDLESNTVFISRNVQFHEEVFPLAKNPGSESS----LKLF-------TPMV 858
Query: 465 PLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRESKPPERYIN------------- 511
P+ + T TH P S + + + P S R KPP +
Sbjct: 859 PVSSGIISDT-THSP----SSLPSQISDLPPQISSQRVRKPPAHLNDYHCNTMQSDHKYP 913
Query: 512 ---------------CMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVP 556
C ++ IPIP++Y +A + W +A+++E+ A+E+ TW+I
Sbjct: 914 ISSTISYSKISPSHMCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTWEITT 973
Query: 557 CPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTI 616
P K +G K+VF++K +DG+++RYKA LV G Q+ GLDY +TF+PVAKMTT++ +
Sbjct: 974 LPKGKKAVGCKWVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTTIKLL 1033
Query: 617 LAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNG------MPTPSPNTVCKLKRSLYG 670
L ++AS+ W L Q+DV NAFL+G+L+EE+++K+P G + PS N V +LKRS+YG
Sbjct: 1034 LKVSASKKWFLKQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPS-NVVLRLKRSIYG 1092
Query: 671 LKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAIS 730
LKQA R WF+KF S+LL F ++ D +LFL+ V++LVYVDDIV+ + + A +
Sbjct: 1093 LKQASRQWFKKFSSSLLSLGFKKTHGDHTLFLKMYDGEFVIVLVYVDDIVIASTSEAAAA 1152
Query: 731 RIKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPM 790
++ L F +++LG L YFLGLEV G+ + Q+KY +L+Q G+ V PM
Sbjct: 1153 QLTEELDQRFKLRDLGDLKYFLGLEVARTTAGISICQRKYALELLQSTGMLACKPVSVPM 1212
Query: 791 EVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQ 850
N+K R+D+GD ++D QYR++VG L+Y+TITRPDI+FAV+ + +F APR HL+A
Sbjct: 1213 IPNLKMRKDDGDLIEDIEQYRRIVGKLMYLTITRPDITFAVNKLCQFSSAPRTTHLTAAY 1272
Query: 851 QIIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKK 910
++++Y+ GT+ +GLF+ S + L+ ++D+DWA C D+R+STT + MF+G++ ISW+ KK
Sbjct: 1273 RVLQYIKGTVGQGLFYSASSDLTLKGFADSDWASCQDSRRSTTSFTMFVGDSLISWRSKK 1332
Query: 911 QDSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYH 970
Q +VS+SS EAEYRA++ A E++WL LL L S P L++D+T+AI IA NPV+H
Sbjct: 1333 QHTVSRSSAEAEYRALALATCEMVWLFTLLVSLQASPPVPI-LYSDSTAAIYIATNPVFH 1391
Query: 971 ERTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSL 1011
ERTKHI++DCH++RE D + L HV T Q ADI TK L
Sbjct: 1392 ERTKHIKLDCHTVRERLDNGELKLLHVRTEDQVADILTKPL 1432
>gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301700|pir||G84542 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 586 bits (1510), Expect = e-165
Identities = 327/791 (41%), Positives = 447/791 (56%), Gaps = 66/791 (8%)
Query: 287 EFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAVHLI 346
+F Q G+I +RSC +TP +N ERK+RHLL+V R L ++++P +FW E + TA +LI
Sbjct: 526 KFFQEQGVIHERSCVATPERNDRVERKHRHLLNVARALRFQANLPIQFWGECVLTAAYLI 585
Query: 347 NRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLGYSP 406
NR PS + + +P+ RL+ P + LRVFG +CY H + KF +S C F+GY
Sbjct: 586 NRTPSSVLNDSTPYERLHKKQPRFDHLRVFGSLCYAHNRNRGGDKFAERSRRCVFVGYPH 645
Query: 407 HQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHD------------------------- 441
QKG+ +D VSR+V+F E ++ F H+
Sbjct: 646 GQKGWRLFDLEQNEFFVSRDVVFSELEFPFRISHEQNVIEEEEEALWAPIVDGLIEEEVH 705
Query: 442 -----------LVSSPISILPLFYDSHSRQQPSKPLLTY-----KRRSTATHGPPQDNSL 485
VSSPIS P S S S PL T +T+ P +L
Sbjct: 706 LGQNAGPTPPICVSSPIS--PSATSSRSEHSTSSPLDTEVVPTPATSTTSASSPSSPTNL 763
Query: 486 --------------VAGPVEEPAPLRRSSRESKPP----ERYINCMTATLSSIPIPSSYK 527
P P P R+S+R PP + +N S + S
Sbjct: 764 QFLPLSRAKPTTAQAVAPPAVPPPRRQSTRNKAPPVTLKDFVVNTTVCQESPSKLNSILY 823
Query: 528 QAMENDCWQKAIESELL-----ALEENQTWDIVPCPSSVKPLGSKFVFSIKLRSDGSIDR 582
Q + D ++ S A EEN TW I P + +GS++V+ +K SDGS++R
Sbjct: 824 QLQKRDDTRRFSASHTTYVAIDAQEENHTWTIEDLPPGKRAIGSQWVYKVKHNSDGSVER 883
Query: 583 YKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNAFLHGDLQ 642
YKA LV LGNKQ+ G DY ETFAPVAKM TVR L +A + W +HQMDV NAFLHGDL+
Sbjct: 884 YKARLVALGNKQKEGEDYGETFAPVAKMATVRLFLDVAVKRNWEIHQMDVHNAFLHGDLR 943
Query: 643 EEVYIKLPNGMPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFL 702
EEVY+KLP G PN VC+L+++LYGLKQAPR WFEK + L + F QS D SLF
Sbjct: 944 EEVYMKLPPGFEASHPNKVCRLRKALYGLKQAPRCWFEKLTTALKRYGFQQSLADYSLFT 1003
Query: 703 QRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGLEVHYHHEG 762
+ +L+YVDD+++TG+ Q A + K L S FHMK+LG L YFLG+EV G
Sbjct: 1004 LVKGSVRIKILIYVDDLIITGNSQRATQQFKEYLASCFHMKDLGPLKYFLGIEVARSTTG 1063
Query: 763 VFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTI 822
+++ Q+KY D++ GL + P+E N K L DP +YR+LVG LIY+ +
Sbjct: 1064 IYICQRKYALDIISETGLLGVKPANFPLEQNHKLGLSTSPLLTDPQRYRRLVGRLIYLAV 1123
Query: 823 TRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKLQAYSDADW 882
TR D++F+VH +++FMQ PR H +A +++RYL +G+F ++ + D+DW
Sbjct: 1124 TRLDLAFSVHILARFMQEPREDHWAAALRVVRYLKADPGQGVFLRRSGDFQITGWCDSDW 1183
Query: 883 AGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEIIWLRGLLTE 942
AG P +R+S TG+ + G++PISWK KKQD+VSKSS EAEYRAMS SE++WL+ LL
Sbjct: 1184 AGDPMSRRSVTGYFVQFGDSPISWKTKKQDTVSKSSAEAEYRAMSFLASELLWLKQLLFS 1243
Query: 943 LGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINLPHVSTSVQ 1002
LG S QP + D+ SAI IA NPV+HERTKHIE+D H +R+ + + +I HV T+ Q
Sbjct: 1244 LGVSHVQPMIMCCDSKSAIYIATNPVFHERTKHIEIDYHFVRDEFVKGVITPRHVGTTSQ 1303
Query: 1003 TADIFTKSLTR 1013
ADIFTK L R
Sbjct: 1304 LADIFTKPLGR 1314
>pir||G86301 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989054|gb|AAG10817.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1413
Score = 584 bits (1506), Expect = e-165
Identities = 315/759 (41%), Positives = 451/759 (58%), Gaps = 50/759 (6%)
Query: 285 FQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAVH 344
F+E + GI++ SCP TP QN V ERK++H+L+V R LL +S +P +W + + TAV
Sbjct: 666 FEELYRRKGIVAYHSCPETPEQNSVVERKHQHILNVARALLFQSQIPLSYWGDCILTAVF 725
Query: 345 LINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLGY 404
+INR PSP I N++ F L P+Y+ L+ FGC+CY P++R KF ++ CAFLGY
Sbjct: 726 IINRTPSPVISNKTLFEMLTKKVPDYTHLKSFGCLCYASTSPKQRHKFEDRARTCAFLGY 785
Query: 405 SPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSHSRQQPSK 464
KG+ D I +SRNV+F E+ + F + P Y + PS+
Sbjct: 786 PSGYKGYKLLDLESHTIFISRNVVFYEDLFPFKTKPAENEESSVFFPHIYVDRNDSHPSQ 845
Query: 465 PLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRESKPP----ERYINCMTAT---- 516
PL P Q+ S P E +++SR S+PP + + N +T++
Sbjct: 846 PL------------PVQETSASNVPAE-----KQNSRVSRPPAYLKDYHCNSVTSSTDHP 888
Query: 517 --------------------LSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVP 556
++ IP P +Y QA + W A+ E+ ALE+N TW +
Sbjct: 889 ISEVLSYSSLSDPYMIFINAVNKIPEPHTYAQARQIKEWCDAMGMEITALEDNGTWVVCS 948
Query: 557 CPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTI 616
P K +G K+V+ IKL +DGS++RYKA LV G Q GLDY +TF+PVAK+TTV+ +
Sbjct: 949 LPVGKKAVGCKWVYKIKLNADGSLERYKARLVAKGYTQTEGLDYVDTFSPVAKLTTVKLL 1008
Query: 617 LAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-----PNTVCKLKRSLYGL 671
+A+AA++ W L Q+D+ NAFL+G L EE+Y+ LP G PN VC+LK+SLYGL
Sbjct: 1009 IAVAAAKGWSLSQLDISNAFLNGSLDEEIYMTLPPGYSPRQGDSFPPNAVCRLKKSLYGL 1068
Query: 672 KQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISR 731
KQA R W+ KF +L F+QS D +LF +++ + +LVYVDDI++ S
Sbjct: 1069 KQASRQWYLKFSESLKALGFTQSSGDHTLFTRKSKNSYMAVLVYVDDIIIASSCDRETEL 1128
Query: 732 IKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPME 791
+++ L + +++LG L YFLGLE+ + +G+ + Q+KY +L+ GL PME
Sbjct: 1129 LRDALQRSSKLRDLGTLRYFLGLEIARNTDGISICQRKYTLELLAETGLLGCKSSSVPME 1188
Query: 792 VNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQ 851
N K +++G+ +DD YRKLVG L+Y+T TRPDI++AVH + +F APR HL AV +
Sbjct: 1189 PNQKLSQEDGELIDDAEHYRKLVGKLMYLTFTRPDITYAVHRLCQFTSAPRVPHLKAVYK 1248
Query: 852 IIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQ 911
II YL GT+ +GLF+ +KL ++D+D++ C D+RK TTG+CMFLG + ++WK KKQ
Sbjct: 1249 IIYYLKGTVGQGLFYSANVDLKLSGFADSDFSSCSDSRKLTTGYCMFLGTSLVAWKSKKQ 1308
Query: 912 DSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHE 971
+ +S SS EAEY+AMS A E++WLR LL +L + + L+ DNT+AI IA NPV+HE
Sbjct: 1309 EVISMSSAEAEYKAMSMAVREMMWLRFLLEDLWIDVSEASVLYCDNTAAIHIANNPVFHE 1368
Query: 972 RTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKS 1010
RTKHIE D H IRE +I HV T Q ADI KS
Sbjct: 1369 RTKHIERDYHHIREKIILGLIRTLHVRTENQLADIPYKS 1407
>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
(gb|U12626). [Arabidopsis thaliana]
gi|25301690|pir||G96722 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana
Length = 1315
Score = 580 bits (1494), Expect = e-163
Identities = 307/761 (40%), Positives = 459/761 (59%), Gaps = 37/761 (4%)
Query: 284 SFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAV 343
+F +F S GI+ SCP TP QN V ERK++H+L+V R+L +SH+P +W + + TAV
Sbjct: 538 NFTQFYHSKGIVPYHSCPETPQQNSVVERKHQHILNVARSLFFQSHIPISYWGDCILTAV 597
Query: 344 HLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLG 403
+LINR+P+P + ++ PF L P Y ++VFGC+CY P++R KF+ ++ CAF+G
Sbjct: 598 YLINRLPAPILEDKCPFEVLTKTVPTYDHIKVFGCLCYASTSPKDRHKFSPRAKACAFIG 657
Query: 404 YSPHQKGFLCYDPNLRRIRVSRNVIF-------------QENKYFF------------AS 438
Y KG+ D I VSR+V+F QE + FF +S
Sbjct: 658 YPSGFKGYKLLDLETHSIIVSRHVVFHEELFPFLGSDLSQEEQNFFPDLNPTPPMQRQSS 717
Query: 439 HH---DLVSSPISILPLFYDSHSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAP 495
H SS + ILP +++ +PS K + A +S+V+ E
Sbjct: 718 DHVNPSDSSSSVEILPSANPTNNVPEPSVQTSHRKAKKPAYLQDYYCHSVVSSTPHEIRK 777
Query: 496 LRRSSRESKPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIV 555
R + P ++ C+ T PS+Y +A + W+ A+ +E LE TW++
Sbjct: 778 FLSYDRINDPYLTFLACLDKTKE----PSNYTEAEKLQVWRDAMGAEFDFLEGTHTWEVC 833
Query: 556 PCPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRT 615
P+ + +G +++F IK SDGS++RYKA LV G Q+ G+DY+ETF+PVAK+ +V+
Sbjct: 834 SLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNSVKL 893
Query: 616 ILAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-----PNTVCKLKRSLYG 670
+L +AA L Q+D+ NAFL+GDL EE+Y++LP G + PN VC+LK+SLYG
Sbjct: 894 LLGVAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQGDSLPPNAVCRLKKSLYG 953
Query: 671 LKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAIS 730
LKQA R W+ KF STLLG F QS D + FL+ + + +LVY+DDI++ ++ A+
Sbjct: 954 LKQASRQWYLKFSSTLLGLGFIQSYCDHTCFLKISDGIFLCVLVYIDDIIIASNNDAAVD 1013
Query: 731 RIKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPM 790
+K+ + S F +++LG L YFLGLE+ +G+ ++Q+KY DL+ G PM
Sbjct: 1014 ILKSQMKSFFKLRDLGELKYFLGLEIVRSDKGIHISQRKYALDLLDETGQLGCKPSSIPM 1073
Query: 791 EVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQ 850
+ ++ + D G + YR+L+G L+Y+ ITRPDI+FAV+ +++F APR HL AV
Sbjct: 1074 DPSMVFAHDSGGDFVEVGPYRRLIGRLMYLNITRPDITFAVNKLAQFSMAPRKAHLQAVY 1133
Query: 851 QIIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKK 910
+I++Y+ GT+ +GLF+ S ++L+ Y++AD+ C D+R+ST+G+CMFLG++ I WK +K
Sbjct: 1134 KILQYIKGTIGQGLFYSATSELQLKVYANADYNSCRDSRRSTSGYCMFLGDSLICWKSRK 1193
Query: 911 QDSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYH 970
QD VSKSS EAEYR++S A E++WL L EL +PT L DN +AI IA N V+H
Sbjct: 1194 QDVVSKSSAEAEYRSLSVATDELVWLTNFLKELQVPLSKPTLLFCDNEAAIHIANNHVFH 1253
Query: 971 ERTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSL 1011
ERTKHIE DCHS+RE + + L H++T +Q AD FTK L
Sbjct: 1254 ERTKHIESDCHSVRERLLKGLFELYHINTELQIADPFTKPL 1294
>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|37530764|ref|NP_919684.1| putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)]
Length = 1803
Score = 578 bits (1490), Expect = e-163
Identities = 315/770 (40%), Positives = 441/770 (56%), Gaps = 37/770 (4%)
Query: 279 EYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEA 338
EY S++ + L +G + + SCP + QNG AER R + D VRT+L+ S P FW EA
Sbjct: 627 EYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDCVRTMLVHSAAPLSFWAEA 686
Query: 339 LSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVE 398
L TA+HLINR P + G+ P+ L G PP Y LRVFGC+CY + K + +S+
Sbjct: 687 LQTAMHLINRRPCRATGSLKPYQLLLGAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLA 746
Query: 399 CAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFAS-------------HHD---- 441
C F+GY +G+ CYD RR+ SR+V F E+ + F H D
Sbjct: 747 CVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPFRDAPSPRPSAPPPPDHGDDTIV 806
Query: 442 -------LVSSPISILPLFYDSHSRQQPSKPLLTYKRRSTATH------GPPQDNSLVAG 488
V +P+ P +H P P + + H P + A
Sbjct: 807 LLPAPAQHVVTPVGTAP----AHDAASPPSPASSTPSSAAPAHDVAPPPSPETSSPASAS 862
Query: 489 PVEEPAPLRRSSRESKPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEE 548
P R + SKP RY T+TLS P PSS + A+ + W+ A+++E AL
Sbjct: 863 PPRHAMTTRARAGISKPNPRYAMTATSTLS--PTPSSVRVALRDPNWRAAMQAEFDALLA 920
Query: 549 NQTWDIVPCPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVA 608
N+TW +VP P + + K+VF KL +DGS+D+YKA VV G Q G+D+ ETF+PV
Sbjct: 921 NRTWTLVPRPPGARIITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPVV 980
Query: 609 KMTTVRTILAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-PNTVCKLKRS 667
K T+RT+L + +S+ WP HQ+DV NAFLHG LQE V + P G + P VC L RS
Sbjct: 981 KPATIRTVLTLISSKQWPAHQLDVSNAFLHGHLQERVLCQQPTGFEDAARPADVCLLSRS 1040
Query: 668 LYGLKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQD 727
LYGL+QAPR WF++F F QSR DPSLF+ R LL+YVDD++++ S
Sbjct: 1041 LYGLRQAPRAWFKRFADHATSLGFVQSRADPSLFVLRRGSDTAYLLLYVDDMILSASSSS 1100
Query: 728 AISRIKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVD 787
+ RI + L + F +K++G L YFLG+EV +G L+Q KY D+++ AG+ N V
Sbjct: 1101 LLQRIIDRLQAEFKVKDMGPLKYFLGIEVQRTADGFVLSQSKYATDVLERAGMANCKAVA 1160
Query: 788 TPMEVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLS 847
TP + K DEG D + YR + G+L Y+T+TRPDI++AV V M APR H++
Sbjct: 1161 TPADAKPKLSSDEGPLFQDSSWYRSIAGALQYLTLTRPDIAYAVQQVCLHMHAPREAHVT 1220
Query: 848 AVQQIIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWK 907
+++I+RY+ GT GL +S L A+SDADWAGCPDTR+ST+G+C+FLG++ ISW
Sbjct: 1221 LLKRILRYIKGTAAFGLHLRASTSPTLTAFSDADWAGCPDTRRSTSGFCIFLGDSLISWS 1280
Query: 908 CKKQDSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANP 967
K+Q +VS+SS EAEYR ++ A +E WLR LL EL Q T + DN S++ ++ NP
Sbjct: 1281 SKRQTTVSRSSAEAEYRGVANAVAECTWLRQLLGELHCRVPQATIAYCDNISSVYMSKNP 1340
Query: 968 VYHERTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQRHN 1017
V+H+RTKHIE+D H +RE + + + ++ Q AD+FTK L N
Sbjct: 1341 VHHKRTKHIELDIHFVREKVALGELRVLPIPSAHQFADVFTKGLPSSMFN 1390
>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
gi|7444456|pir||T01908 hypothetical protein T12H20.12 -
Arabidopsis thaliana
Length = 1392
Score = 570 bits (1468), Expect = e-160
Identities = 302/736 (41%), Positives = 450/736 (61%), Gaps = 41/736 (5%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GE++S+ F L S GI SCP TP QNG+AER++R+L ++ +L+ S VP + W E
Sbjct: 588 GEFVSYKFVAHLASCGIKQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHKLWVE 647
Query: 338 ALSTAVHLINRMPSPSIG-NESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQS 396
A T+ L N +PS ++ N+SP+ L+G PP Y+ LRVFG CY +L P + KF +S
Sbjct: 648 AFFTSNFLSNLLPSSTLSDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKFDPKS 707
Query: 397 VECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDS 456
+ C FLGY+ KG+ C P ++ + R+V+F E K+ ++ + + IS PLF
Sbjct: 708 LLCVFLGYNNKYKGYRCLHPPTGKVYICRHVLFDERKFPYSDIYSQFQT-ISGSPLF--- 763
Query: 457 HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRESKPPERYINCMTAT 516
+++ ++T SR +KP +Y + +
Sbjct: 764 ----------TAWQKGFSST---------------------ALSRITKPNPKY--ALFSV 790
Query: 517 LSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLRS 576
S+ P P S K+A++++ W A+ E+ + E TWD+VP + LG K+VF KL S
Sbjct: 791 KSNYPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWDLVPPEMVDRLLGCKWVFKTKLNS 850
Query: 577 DGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNAF 636
DGS+DR KA LV G +QE G+DY ET++PV + TVR+IL +A W L Q+DVKNAF
Sbjct: 851 DGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATVRSILHVATINKWSLKQLDVKNAF 910
Query: 637 LHGDLQEEVYIKLPNGMPTPS-PNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQSR 695
LH +L+E V++ P G PS P+ VCKLK+++Y LKQAPR WF+KF S LL + F S
Sbjct: 911 LHDELKETVFMTQPPGFEDPSRPDYVCKLKKAIYDLKQAPRAWFDKFSSYLLKYGFICSF 970
Query: 696 YDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGLE 755
DPSLF+ + ++ LL+YVDD+++TG++ + ++ N+L + F MK++G L YFLG++
Sbjct: 971 SDPSLFVYLKGRDVMFLLLYVDDMILTGNNDVLLQQLLNILSTEFRMKDMGALHYFLGIQ 1030
Query: 756 VHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLVG 815
HYH++G+FL+Q+KY DL+ AG+++ + + TP+++++ + +PT +R+L G
Sbjct: 1031 AHYHNDGLFLSQEKYTSDLLVNAGMSDCSSMPTPLQLDL--LQGNNKPFPEPTYFRRLAG 1088
Query: 816 SLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKLQ 875
L Y+T+TRPDI FAV+ V + M AP +++I+ YL GT+ G+ + L+
Sbjct: 1089 KLQYLTLTRPDIQFAVNFVCQKMHAPTMSDFHLLKRILHYLKGTMTMGINLSSNTDSVLR 1148
Query: 876 AYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEIIW 935
YSD+DWAGC DTR+ST G+C FLG ISW K+ +VSKSSTEAEYR +S A SE+ W
Sbjct: 1149 CYSDSDWAGCKDTRRSTGGFCTFLGYNIISWSAKRHPTVSKSSTEAEYRTLSFAASEVSW 1208
Query: 936 LRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINLP 995
+ LL E+G Q Q ++ DN SA+ ++ANP H R+KH +VD + +RE + +
Sbjct: 1209 IGFLLQEIGLPQQQIPEMYCDNLSAVYLSANPALHSRSKHFQVDYYYVRERVALGALTVK 1268
Query: 996 HVSTSVQTADIFTKSL 1011
H+ S Q ADIFTKSL
Sbjct: 1269 HIPASQQLADIFTKSL 1284
>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078
gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
Arabidopsis thaliana
Length = 1415
Score = 567 bits (1461), Expect = e-160
Identities = 312/771 (40%), Positives = 456/771 (58%), Gaps = 38/771 (4%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GE++S+ + L +GI + SCP TP QNG+AERK+RHL+++ ++L SH P +FW E
Sbjct: 584 GEFVSNKLKTHLSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVE 643
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
+ TA ++INR+PS + N SP+ L+G P+YS+LRVFG CY L P + KF +S+
Sbjct: 644 SFFTANYIINRLPSSVLKNLSPYEALFGEKPDYSSLRVFGSACYPCLRPLAQNKFDPRSL 703
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSH 457
+C FLGY+ KG+ C+ P ++ +SRNVIF E++ F + + S L H
Sbjct: 704 QCVFLGYNSQYKGYRCFYPPTGKVYISRNVIFNESELPFKEKYQSLVPQYSTPLLQAWQH 763
Query: 458 SR-----------QQPSKP--LLTYK-RRSTATHGPPQDNSLVAGPVEEPAPL------- 496
++ Q SKP L TY + T P+ S G EE P+
Sbjct: 764 NKISEISVPAAPVQLFSKPIDLNTYAGSQVTEQLTDPEPTSNNEGSDEEVNPVAEEIAAN 823
Query: 497 ------------RRSSRESKPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELL 544
R + KP RY +T+ +++ P + AM++ W +A+ E+
Sbjct: 824 QEQVINSHAMTTRSKAGIQKPNTRYA-LITSRMNTAE-PKTLASAMKHPGWNEAVHEEIN 881
Query: 545 ALEENQTWDIVPCPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETF 604
+ TW +VP + L SK+VF KL DGSID+ KA LV G QE G+DY ETF
Sbjct: 882 RVHMLHTWSLVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETF 941
Query: 605 APVAKMTTVRTILAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-PNTVCK 663
+PV + T+R +L ++ S+ WP+ Q+DV NAFLHG+LQE V++ P+G P P VC+
Sbjct: 942 SPVVRTATIRLVLDVSTSKGWPIKQLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTHVCR 1001
Query: 664 LKRSLYGLKQAPRVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTG 723
L +++YGLKQAPR WF+ F + LL + F S+ DPSLF+ ++ LL+YVDDI++TG
Sbjct: 1002 LTKAIYGLKQAPRAWFDTFSNFLLDYGFVCSKSDPSLFVCHQDGKILYLLLYVDDILLTG 1061
Query: 724 SDQDAISRIKNLLHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNA 783
SDQ + + L + F MK+LG YFLG+++ + G+FL+Q Y D++Q AG+++
Sbjct: 1062 SDQSLLEDLLQALKNRFSMKDLGPPRYFLGIQIEDYANGLFLHQTAYATDILQQAGMSDC 1121
Query: 784 TLVDTPMEVNVKYRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRH 843
+ TP+ + E +PT +R L G L Y+TITRPDI FAV+ + + M +P
Sbjct: 1122 NPMPTPLPQQLDNLNSE--LFAEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTT 1179
Query: 844 FHLSAVQQIIRYLLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAP 903
+++I+RY+ GT+ GL S++ L AYSD+D AGC +TR+STTG+C+ LG+
Sbjct: 1180 SDFGLLKRILRYIKGTIGMGLPIKRNSTLTLSAYSDSDHAGCKNTRRSTTGFCILLGSNL 1239
Query: 904 ISWKCKKQDSVSKSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQI 963
ISW K+Q +VS SSTEAEYRA++ A EI W+ LL +LG Q PT ++ DN SA+ +
Sbjct: 1240 ISWSAKRQPTVSNSSTEAEYRALTYAAREITWISFLLRDLGIPQYLPTQVYCDNLSAVYL 1299
Query: 964 AANPVYHERTKHIEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQ 1014
+ANP H R+KH + D H IRE +I H+S + Q AD+FTKSL R+
Sbjct: 1300 SANPALHNRSKHFDTDYHYIREQVALGLIETQHISATFQLADVFTKSLPRR 1350
>emb|CAB10526.1| retrotransposon like protein [Arabidopsis thaliana]
gi|7268497|emb|CAB78748.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444421|pir||A71444 probable
LTR retrotransposon - Arabidopsis thaliana
Length = 1433
Score = 567 bits (1460), Expect = e-160
Identities = 299/745 (40%), Positives = 446/745 (59%), Gaps = 28/745 (3%)
Query: 285 FQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCEALSTAVH 344
F + ++GI++ SCP TP QN V ERK++H+L+V R LL +S++P FW + + TAV
Sbjct: 676 FTDLFAAHGIVAYHSCPETPEQNSVVERKHQHILNVARALLFQSNIPLEFWGDCVLTAVF 735
Query: 345 LINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSVECAFLGY 404
LINR+P+P + N+SP+ +L PP Y +L+ FGC+CY P++R KF ++ C FLGY
Sbjct: 736 LINRLPTPVLNNKSPYEKLKNIPPAYESLKTFGCLCYSSTSPKQRHKFEPRARACVFLGY 795
Query: 405 SPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSHSRQQPSK 464
KG+ D + +SR+VIF E+ + F S + PL Q P++
Sbjct: 796 PLGYKGYKLLDIETHAVSISRHVIFHEDIFPFISS-TIKDDIKDFFPLL------QFPAR 848
Query: 465 PL-LTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRESKPPERY--INCMTAT----- 516
L ++ S P QD S V PL S R+ KPP+ +C T
Sbjct: 849 TDDLPLEQTSIIDTHPHQDVSSSKALVPFD-PL--SKRQKKPPKHLQDFHCYNNTTEPFH 905
Query: 517 -----LSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFS 571
+++ IP Y +A + W A++ E+ A+ TW +V P + K +G K+VF+
Sbjct: 906 AFINNITNAVIPQRYSEAKDFKAWCDAMKEEIGAMVRTNTWSVVSLPPNKKAIGCKWVFT 965
Query: 572 IKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMD 631
IK +DGSI+RYKA LV G QE GLDY+ETF+PVAK+T+VR +L +AA W +HQ+D
Sbjct: 966 IKHNADGSIERYKARLVAKGYTQEEGLDYEETFSPVAKLTSVRMMLLLAAKMKWSVHQLD 1025
Query: 632 VKNAFLHGDLQEEVYIKLPNGMP-----TPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTL 686
+ NAFL+GDL EE+Y+K+P G P+ +C+L +S+YGLKQA R W+ K +TL
Sbjct: 1026 ISNAFLNGDLDEEIYMKIPPGYADLVGEALPPHAICRLHKSIYGLKQASRQWYLKLSNTL 1085
Query: 687 LGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELG 746
G F +S D +LF++ ++ +LVYVDDI++ + DA+++ L S F +++LG
Sbjct: 1086 KGMGFQKSNADHTLFIKYANGVLMGVLVYVDDIMIVSNSDDAVAQFTAELKSYFKLRDLG 1145
Query: 747 RLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDD 806
YFLG+E+ +G+ + Q+KYI +L+ G + P++ +VK +++G L D
Sbjct: 1146 AAKYFLGIEIARSEKGISICQRKYILELLSTTGFLGSKPSSIPLDPSVKLNKEDGVPLTD 1205
Query: 807 PTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFF 866
T YRKLVG L+Y+ ITRPDI++AV+T+ +F AP HLSAV +++RYL GT+ +GLF+
Sbjct: 1206 STSYRKLVGKLMYLQITRPDIAYAVNTLCQFSHAPTSVHLSAVHKVLRYLKGTVGQGLFY 1265
Query: 867 PVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAM 926
L+ Y+D+D+ C D+R+ +CMF+G+ +SWK KKQD+VS S+ EAE+RAM
Sbjct: 1266 SADDKFDLRGYTDSDFGSCTDSRRCVAAYCMFIGDYLVSWKSKKQDTVSMSTAEAEFRAM 1325
Query: 927 SAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREA 986
S E+IWL L + P L+ DNT+A+ I N V+HERTK +E+DC+ REA
Sbjct: 1326 SQGTKEMIWLSRLFDDFKVPFIPPAYLYCDNTAALHIVNNSVFHERTKFVELDCYKTREA 1385
Query: 987 YDRRIINLPHVSTSVQTADIFTKSL 1011
+ + V T Q AD TK++
Sbjct: 1386 VESGFLKTMFVETGEQVADPLTKAI 1410
>gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum]
Length = 1333
Score = 558 bits (1438), Expect = e-157
Identities = 303/754 (40%), Positives = 459/754 (60%), Gaps = 25/754 (3%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GE++S+ F F + NGI + + P TP QNGVAERKNR ++++ R+ L +P FW E
Sbjct: 575 GEFLSNDFNLFCEENGIRRELTAPYTPEQNGVAERKNRTVVEMARSSLKAKGLPDYFWGE 634
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
A++T V+ +N P+ + N +P G P S LR+FGC+ Y + +K +S
Sbjct: 635 AVATVVYFLNISPTKDVWNTTPLEAWNGKKPRVSHLRIFGCIAYALV--NFHSKLDEKST 692
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDS- 456
+C F+GYS K + Y+P ++ +SRNV+F E+ + + +++S+ I +LP +S
Sbjct: 693 KCIFVGYSLQSKAYRLYNPISGKVIISRNVVFNEDVSWNFNSGNMMSN-IQLLPTDEESA 751
Query: 457 --HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVE---EPAPLRRSSRESKPPERYIN 511
S P+ S++ P ++ VA P E EP PLRRS+RE KP +Y N
Sbjct: 752 VDFGNSPNSSPV------SSSVSSPIAPSTTVA-PDESSVEPIPLRRSTREKKPNPKYSN 804
Query: 512 -----CMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGS 566
C A L S PI Y++A+E W+ A+ E+ A+E N TW++V P +G
Sbjct: 805 TVNTSCQFALLVSDPI--CYEEAVEQSEWKNAMIEEIQAIERNSTWELVDAPEGKNVIGL 862
Query: 567 KFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWP 626
K+VF K +DGSI ++KA LV G Q+ G+D+DETF+PVA+ TVR +LA+AA P
Sbjct: 863 KWVFRTKYNADGSIQKHKARLVAKGYSQQQGVDFDETFSPVARFETVRVVLALAAQLHLP 922
Query: 627 LHQMDVKNAFLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRST 685
++Q DVK+AFL+GDL+EEVY+ P G M T + N V KL+++LYGLKQAPR W+ K S
Sbjct: 923 VYQFDVKSAFLNGDLEEEVYVSQPQGFMITGNENKVYKLRKALYGLKQAPRAWYSKIDSF 982
Query: 686 LLGFEFSQSRYDPSLFLQRTPKGMVVLL-VYVDDIVVTGSDQDAISRIKNLLHSTFHMKE 744
G F +S +P+L+L++ +L+ +YVDD++ GS + ++ K+ + F M +
Sbjct: 983 FQGSGFRRSDNEPTLYLKKQGTDEFLLVCLYVDDMIYIGSSKSLVNDFKSNMMRNFEMSD 1042
Query: 745 LGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHL 804
LG L YFLGLEV +G+F++Q+KY +DL++ + N + TPM +N K +R +G
Sbjct: 1043 LGLLKYFLGLEVIQDKDGIFISQKKYAEDLLKKFQMMNCEVATTPMNINEKLQRADGTEK 1102
Query: 805 DDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGL 864
+P +R LVG L Y+T TRPDI+F+V VS+F+Q+P H A ++++RY+ GT G+
Sbjct: 1103 ANPKLFRSLVGGLNYLTHTRPDIAFSVSVVSRFLQSPTKQHFGAAKRVLRYVAGTTDFGI 1162
Query: 865 FFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYR 924
++ + +L ++D+D+AGC D RKST+G C G+ ++W KKQ++V+ S++EAEY
Sbjct: 1163 WYSKAPNFRLVGFTDSDYAGCLDDRKSTSGSCFSFGSGVVTWSSKKQETVALSTSEAEYT 1222
Query: 925 AMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIR 984
A S A + +WLR LL + + Q + T + +D+ SAI +A NP +H RTKHI+V H IR
Sbjct: 1223 AASLAARQALWLRKLLEDFSYEQKESTEIFSDSKSAIAMAKNPSFHGRTKHIDVQYHFIR 1282
Query: 985 EAYDRRIINLPHVSTSVQTADIFTKSLTRQRHNF 1018
I L ST+ Q ADIFTKSL + +H +
Sbjct: 1283 TLVADGRIVLKFCSTNEQAADIFTKSLPQAKHEY 1316
>gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|50919599|ref|XP_470160.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1335
Score = 547 bits (1410), Expect = e-154
Identities = 285/760 (37%), Positives = 457/760 (59%), Gaps = 44/760 (5%)
Query: 276 QWGEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFW 335
Q EY+S F+++ ++ GI Q + + QNGVAERKNR + D+ ++L + +P FW
Sbjct: 587 QGREYISKEFEKYCENAGIRRQLTAGYSAQQNGVAERKNRTINDMANSMLQDKGMPKSFW 646
Query: 336 CEALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQ 395
EA++TAV+++NR P+ ++ N +PF YG P +RVFGC+CY +P Q+R KF +
Sbjct: 647 AEAVNTAVYILNRSPTKAVTNRTPFEAWYGKKPVIGHMRVFGCICYAQVPAQKRVKFDNK 706
Query: 396 SVECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISIL----- 450
S C F+GY+ KG+ Y+ ++I +SR+ IF E+ + + S+P+
Sbjct: 707 SDRCIFVGYADGIKGYRLYNLEKKKIIISRDAIFDESATWNWKSPEASSTPLLPTTTITL 766
Query: 451 --PLFYDSHSRQ------QPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLR----- 497
P + +H + QPS P+ + S ++ P ++ P P +R
Sbjct: 767 GQPHMHGTHEVEDHTPSPQPSSPMSS---SSASSDSSPSSEEQISTPESAPRRVRSMVEL 823
Query: 498 -RSSRESKPPERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVP 556
S+ + + E++ C + + P S+++A ++D W KA+E E+ +E+N TW++V
Sbjct: 824 LESTSQQRGSEQHEFCNYSVVE----PQSFQEAEKHDNWIKAMEDEIHMIEKNNTWELVD 879
Query: 557 CPSSVKPLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTI 616
P + +G K+V+ KL DGS+ +YKA LV G KQ+ G+DY ET+APVA++ T+RTI
Sbjct: 880 RPRDREVIGVKWVYKTKLNPDGSVQKYKARLVAKGFKQKPGIDYYETYAPVARLETIRTI 939
Query: 617 LAIAASQAWPLHQMDVKNAFLHGDLQEEVYIKLPNGMPTPS-PNTVCKLKRSLYGLKQAP 675
+A+AA + W ++Q+DVK+AFL+G L EE+Y++ P G N V +LK++LYGLKQAP
Sbjct: 940 IALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQPEGFSVQGGENKVFRLKKALYGLKQAP 999
Query: 676 RVWFEKFRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNL 735
R W+ + + F++S +P+L++ +T ++++ +YVDD++ TG+ + + K
Sbjct: 1000 RAWYSQIDKYFIQKGFAKSISEPTLYVNKTGTDILIVSLYVDDLIYTGNSEKMMQDFKKD 1059
Query: 736 LHSTFHMKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVK 795
+ T+ M +LG L YFLG+EVH EG+F++Q+KY +++++ + N V TP+ N K
Sbjct: 1060 MMHTYEMSDLGLLHYFLGMEVHQSDEGIFISQRKYAENILKKFKMDNCKSVTTPLLPNEK 1119
Query: 796 YRRDEGDHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRY 855
+ +G DPT YR LVGSL+Y+T TRPDI FA +S++M +P + +A ++++RY
Sbjct: 1120 QKARDGADKADPTIYRSLVGSLLYLTATRPDIMFAASLLSRYMSSPSQLNFTAAKRVLRY 1179
Query: 856 LLGTLKRGLFFPVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVS 915
+ GT G+++ KL Y+D+DWAGC D KST+G+ LG+A
Sbjct: 1180 IKGTADYGIWYKPVKESKLIGYTDSDWAGCLDDMKSTSGYAFSLGSA------------- 1226
Query: 916 KSSTEAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKH 975
EAEY A S A S+++WLR ++ +LG Q QPT ++ D+ SAI I+ NPV H+RTKH
Sbjct: 1227 ----EAEYVAASKAVSQVVWLRRIMEDLGEKQYQPTTIYCDSKSAIAISENPVSHDRTKH 1282
Query: 976 IEVDCHSIREAYDRRIINLPHVSTSVQTADIFTKSLTRQR 1015
I + H IREA DR+ + L T Q ADIFTK+L++++
Sbjct: 1283 IAIKYHYIREAVDRQEVKLEFCRTDEQLADIFTKALSKEK 1322
>gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25301681|pir||F86246
hypothetical protein [imported] - Arabidopsis thaliana
Length = 1352
Score = 531 bits (1368), Expect = e-149
Identities = 275/745 (36%), Positives = 437/745 (57%), Gaps = 13/745 (1%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GE+ S F ++ + NGI Q + P +P QNGV ERKNR +L++ R++L +P W E
Sbjct: 599 GEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAE 658
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
A++ AV+L+NR P+ S+ ++P G P S LRVFG + + H+P ++R+K +S
Sbjct: 659 AVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSE 718
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIF-QENKYFFASHHDLVSSPISILPLFYDS 456
+ F+GY + KG+ Y+P+ ++ +SRN++F +E ++ + S+ + + P F +
Sbjct: 719 KYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEE----DYNFFPHFEED 774
Query: 457 HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES-KPPERYINCMTA 515
+P T +S + E P RS +E + E N
Sbjct: 775 EPEPTREEP----PSEEPTTPPTSPTSSQIEESSSERTPRFRSIQELYEVTENQENLTLF 830
Query: 516 TLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLR 575
L + P +++A+E W+ A++ E+ ++++N TW++ P+ K +G K+V+ K
Sbjct: 831 CLFAECEPMDFQKAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKN 890
Query: 576 SDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNA 635
S G ++RYKA LV G Q G+DYDE FAPVA++ TVR I+++AA W +HQMDVK+A
Sbjct: 891 SKGEVERYKARLVAKGYSQRVGIDYDEVFAPVARLETVRLIISLAAQNKWKIHQMDVKSA 950
Query: 636 FLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQS 694
FL+GDL+EEVYI+ P G + + V +LK+ LYGLKQAPR W + +F +
Sbjct: 951 FLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKVLYGLKQAPRAWNTRIDKYFKEKDFIKC 1010
Query: 695 RYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGL 754
Y+ +L+++ + +++ +YVDD++ TG++ K + F M ++G ++Y+LG+
Sbjct: 1011 PYEHALYIKIQKEDILIACLYVDDLIFTGNNPSIFEEFKKEMTKEFEMTDIGLMSYYLGI 1070
Query: 755 EVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLV 814
EV G+F+ Q+ Y +++++ + ++ V TPME +K + E DPT ++ LV
Sbjct: 1071 EVKQEDNGIFITQEGYAKEVLKKFKMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLV 1130
Query: 815 GSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKL 874
GSL Y+T TRPDI +AV VS++M+ P H A ++I+RY+ GT+ GL + S KL
Sbjct: 1131 GSLRYLTCTRPDILYAVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKL 1190
Query: 875 QAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEII 934
YSD+DW G D RKST+G+ ++G+ +W KKQ V+ S+ EAEY A ++ I
Sbjct: 1191 VGYSDSDWGGDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAI 1250
Query: 935 WLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINL 994
WLR LL EL Q++PT + DN SAI +A NPV+H+R+KHI+ H IRE ++ + L
Sbjct: 1251 WLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQL 1310
Query: 995 PHVSTSVQTADIFTKSLTRQRHNFL 1019
+V T Q AD FTK L +R NF+
Sbjct: 1311 EYVKTHDQVADFFTKPL--KRENFI 1333
>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] gi|11278364|pir||T47925
copia-type polyprotein - Arabidopsis thaliana
Length = 1352
Score = 531 bits (1368), Expect = e-149
Identities = 275/745 (36%), Positives = 437/745 (57%), Gaps = 13/745 (1%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GE+ S F ++ + NGI Q + P +P QNGV ERKNR +L++ R++L +P W E
Sbjct: 599 GEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARSMLKSKRLPKELWAE 658
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
A++ AV+L+NR P+ S+ ++P G P S LRVFG + + H+P ++R+K +S
Sbjct: 659 AVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSE 718
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIF-QENKYFFASHHDLVSSPISILPLFYDS 456
+ F+GY + KG+ Y+P+ ++ +SRN++F +E ++ + S+ + + P F +
Sbjct: 719 KYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEE----DYNFFPHFEED 774
Query: 457 HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES-KPPERYINCMTA 515
+P T +S + E P RS +E + E N
Sbjct: 775 EPEPTREEP----PSEEPTTPPTSPTSSQIEESSSERTPRFRSIQELYEVTENQENLTLF 830
Query: 516 TLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLR 575
L + P +++A+E W+ A++ E+ ++++N TW++ P+ K +G K+V+ K
Sbjct: 831 CLFAECEPMDFQKAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKN 890
Query: 576 SDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNA 635
S G ++RYKA LV G Q G+DYDE FAPVA++ TVR I+++AA W +HQMDVK+A
Sbjct: 891 SKGEVERYKARLVAKGYSQRVGIDYDEVFAPVARLETVRLIISLAAQNKWKIHQMDVKSA 950
Query: 636 FLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQS 694
FL+GDL+EEVYI+ P G + + V +LK+ LYGLKQAPR W + +F +
Sbjct: 951 FLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKVLYGLKQAPRAWNTRIDKYFKEKDFIKC 1010
Query: 695 RYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGL 754
Y+ +L+++ + +++ +YVDD++ TG++ K + F M ++G ++Y+LG+
Sbjct: 1011 PYEHALYIKIQKEDILIACLYVDDLIFTGNNPSIFEEFKKEMTKEFEMTDIGLMSYYLGI 1070
Query: 755 EVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLV 814
EV G+F+ Q+ Y +++++ + ++ V TPME +K + E DPT ++ LV
Sbjct: 1071 EVKQEDNGIFITQEGYAKEVLKKFKIDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLV 1130
Query: 815 GSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKL 874
GSL Y+T TRPDI +AV VS++M+ P H A ++I+RY+ GT+ GL + S KL
Sbjct: 1131 GSLRYLTCTRPDILYAVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKL 1190
Query: 875 QAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEII 934
YSD+DW G D RKST+G+ ++G+ +W KKQ V+ S+ EAEY A ++ I
Sbjct: 1191 VGYSDSDWGGDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAI 1250
Query: 935 WLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINL 994
WLR LL EL Q++PT + DN SAI +A NPV+H+R+KHI+ H IRE ++ + L
Sbjct: 1251 WLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQL 1310
Query: 995 PHVSTSVQTADIFTKSLTRQRHNFL 1019
+V T Q AD FTK L +R NF+
Sbjct: 1311 EYVKTHDQVADFFTKPL--KRENFI 1333
>emb|CAB75932.1| putative protein [Arabidopsis thaliana] gi|11278365|pir||T47841
hypothetical protein T2O9.150 - Arabidopsis thaliana
Length = 1339
Score = 530 bits (1365), Expect = e-148
Identities = 273/752 (36%), Positives = 443/752 (58%), Gaps = 18/752 (2%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GE+ S+ F EF +S+GI Q + TP QNGVAERKNR +++ VR++L E VP FW E
Sbjct: 567 GEFTSNEFGEFCRSHGISRQLTAAFTPQQNGVAERKNRTIMNAVRSMLSERQVPKMFWSE 626
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
A +VH+ NR P+ ++ +P G P RVFGC+ YVH+P Q+R+K +S
Sbjct: 627 ATKWSVHIQNRSPTAAVEGMTPEEAWSGRKPVVEYFRVFGCIGYVHIPDQKRSKLDDKSK 686
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIFQENKYFFASHHDLVSSPISILPLFYDSH 457
+C FLG S K + YDP +++I +S++V+F E+K + D+ + +++ D
Sbjct: 687 KCVFLGVSEESKAWRLYDPVMKKIVISKDVVFDEDKSWDWDQADVEAKEVTLECGDEDDE 746
Query: 458 SRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLR-RSSRESKPP---------- 506
+ +P+ + + ++A P+P+ + +RE +PP
Sbjct: 747 KNSEVVEPIAVASPNHVGSDNNVSSSPILAPSSPAPSPVAAKVTRERRPPGWMADYETGE 806
Query: 507 ----ERYINCMTATLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVK 562
E ++ M + + P + A+++ W++A+E E+ ++ +N TW++ P
Sbjct: 807 GEEIEENLSVMLLMMMTEADPIQFDDAVKDKIWREAMEHEIESIVKNNTWELTTLPKGFT 866
Query: 563 PLGSKFVFSIKLRSDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAAS 622
P+G K+V+ KL DG +D+YKA LV G Q YG+DY E FAPVA++ TVRTILAI++
Sbjct: 867 PIGVKWVYKTKLNEDGEVDKYKARLVAKGYAQCYGIDYTEVFAPVARLDTVRTILAISSQ 926
Query: 623 QAWPLHQMDVKNAFLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEK 681
W + Q+DVK+AFLHG+L+EEVY++ P G + V KL+++LYGLKQAPR W+ +
Sbjct: 927 FNWEIFQLDVKSAFLHGELKEEVYVRQPEGFIREGEEEKVYKLRKALYGLKQAPRAWYSR 986
Query: 682 FRSTLLGFEFSQSRYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFH 741
+ L EF + + +LF + ++++ +YVDD++ TGSD+ K + F
Sbjct: 987 IEAYFLKEEFERCPSEHTLFTKTRVGNILIVSLYVDDLIFTGSDKAMCDEFKKSMMLEFE 1046
Query: 742 MKELGRLTYFLGLEVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEG 801
M +LG++ +FLG+EV G+F+ Q++Y ++++ G+ + V P+ K +DE
Sbjct: 1047 MSDLGKMKHFLGIEVKQSDGGIFICQRRYAREVLARFGMDESNAVKNPIVPGTKLTKDEN 1106
Query: 802 DHLDDPTQYRKLVGSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLK 861
D T +++LVGSL+Y+T+TRPD+ + V +S+FM PR H A ++I+RYL GT++
Sbjct: 1107 GEKVDETMFKQLVGSLMYLTVTRPDLMYGVCLISRFMSNPRMSHWLAAKRILRYLKGTVE 1166
Query: 862 RGLFF--PVGSSIKLQAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSST 919
G+F+ S+KL A++D+D+AG + R+ST+G+ + + I W KKQ V+ S+T
Sbjct: 1167 LGIFYRRRKNRSLKLMAFTDSDYAGDLNDRRSTSGFVFLMASGAICWASKKQPVVALSTT 1226
Query: 920 EAEYRAMSAACSEIIWLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVD 979
EAEY A + + +WLR +L +LG + T ++ DN+S IQ++ +PV H ++KHIEV
Sbjct: 1227 EAEYIAAAFCACQCVWLRKVLEKLGAEEKSATVINCDNSSTIQLSKHPVLHGKSKHIEVR 1286
Query: 980 CHSIREAYDRRIINLPHVSTSVQTADIFTKSL 1011
H +R+ + ++ L + T Q ADIFTK L
Sbjct: 1287 FHYLRDLVNGDVVKLEYCPTEDQVADIFTKPL 1318
>gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis thaliana]
Length = 1352
Score = 530 bits (1364), Expect = e-148
Identities = 274/740 (37%), Positives = 435/740 (58%), Gaps = 11/740 (1%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GE+ S F ++ + NGI Q + P +P QNGVAERKNR +L++ R++L +P W E
Sbjct: 599 GEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAE 658
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
A++ AV+L+NR P+ S+ ++P G S LRVFG + + H+P ++R+K +S
Sbjct: 659 AVACAVYLLNRSPTKSVSGKTPQEAWSGRKSGVSHLRVFGSIAHAHVPDEKRSKLDDKSE 718
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIF-QENKYFFASHHDLVSSPISILPLFYDS 456
+ F+GY + KG+ Y+P+ ++ +SRN++F +E ++ + S+ + + P F +
Sbjct: 719 KYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEE----DYNFFPHFEED 774
Query: 457 HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES-KPPERYINCMTA 515
+P T +S + E P RS +E + E N
Sbjct: 775 EPEPTREEP----PSEEPTTPPTSPTSSQIEESSSERTPRFRSIQELYEVTENQENLTLF 830
Query: 516 TLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLR 575
L + P +++A+E W+ A++ E+ ++++N TW++ P+ K +G K+V+ K
Sbjct: 831 CLFAECEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKTIGVKWVYKAKKN 890
Query: 576 SDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNA 635
S G ++RYKA LV G Q G+DYDE FAPVA++ TVR I+++AA W +HQMDVK+A
Sbjct: 891 SKGEVERYKARLVAKGYIQRAGIDYDEVFAPVARLETVRLIISLAAQNKWKIHQMDVKSA 950
Query: 636 FLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQS 694
FL+GDL+EEVYI+ P G + + V +LK++LYGLKQAPR W + +F +
Sbjct: 951 FLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAPRAWNTRIDKYFKEKDFIKC 1010
Query: 695 RYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGL 754
Y+ +L+++ + +++ +YVDD++ TG++ K + F M ++G ++Y+LG+
Sbjct: 1011 PYEHALYIKIQKEDILIACLYVDDLIFTGNNPSMFEEFKKEMTKEFEMTDIGLMSYYLGI 1070
Query: 755 EVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLV 814
EV G+F+ Q+ Y +++++ + ++ V TPME +K + E DPT ++ LV
Sbjct: 1071 EVKQEDNGIFITQEGYAKEVLKKFKMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLV 1130
Query: 815 GSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKL 874
GSL Y+T TRPDI +AV VS++M+ P H A ++I+RY+ GT+ GL + S KL
Sbjct: 1131 GSLRYLTCTRPDILYAVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKL 1190
Query: 875 QAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEII 934
YSD+DW G D RKST+G+ ++G+ +W KKQ V S+ EAEY A ++ I
Sbjct: 1191 VGYSDSDWGGDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVVLSTCEAEYVAATSCVCHAI 1250
Query: 935 WLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINL 994
WLR LL EL Q++PT + DN SAI +A NPV+H+R+KHI+ H IRE ++ + L
Sbjct: 1251 WLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQL 1310
Query: 995 PHVSTSVQTADIFTKSLTRQ 1014
+V T Q ADIFTK L R+
Sbjct: 1311 EYVKTHDQVADIFTKPLKRE 1330
>gb|AAF16534.1| T26F17.17 [Arabidopsis thaliana]
Length = 1291
Score = 524 bits (1350), Expect = e-147
Identities = 271/740 (36%), Positives = 432/740 (57%), Gaps = 11/740 (1%)
Query: 278 GEYMSHSFQEFLQSNGIISQRSCPSTP*QNGVAERKNRHLLDVVRTLLLESHVPSRFWCE 337
GE+ S F ++ + NGI Q + P +P QNGVAERKNR +L++ R++L +P W E
Sbjct: 538 GEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKSKRLPKELWAE 597
Query: 338 ALSTAVHLINRMPSPSIGNESPFTRLYGHPPNYSTLRVFGCVCYVHLPPQERTKFTAQSV 397
A++ AV+L+NR P+ S+ ++P G P S LRVFG + + H+P ++R+K +S
Sbjct: 598 AVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDEKRSKLDDKSE 657
Query: 398 ECAFLGYSPHQKGFLCYDPNLRRIRVSRNVIF-QENKYFFASHHDLVSSPISILPLFYDS 456
+ F+GY + KG+ Y+P+ ++ +SRN++F +E ++ + S+ + + P F +
Sbjct: 658 KYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEE----DYNFFPHFEED 713
Query: 457 HSRQQPSKPLLTYKRRSTATHGPPQDNSLVAGPVEEPAPLRRSSRES-KPPERYINCMTA 515
+P T +S + E P RS +E + E N
Sbjct: 714 EPEPTREEP----PSEEPTTRPTSLTSSQIEESSSERTPRFRSIQELYEVTENQENLTLF 769
Query: 516 TLSSIPIPSSYKQAMENDCWQKAIESELLALEENQTWDIVPCPSSVKPLGSKFVFSIKLR 575
L + P +++A+E W+ A++ E+ ++++N TW++ P+ K +G K+V+ K
Sbjct: 770 CLFAECEPMDFQEAIEKKTWRNAMDEEIKSIQKNDTWELTSLPNGHKAIGVKWVYKAKKN 829
Query: 576 SDGSIDRYKAHLVVLGNKQEYGLDYDETFAPVAKMTTVRTILAIAASQAWPLHQMDVKNA 635
S G ++RYKA LV G Q G+DYDE FAPVA++ TVR I+++AA W +HQMD K A
Sbjct: 830 SKGEVERYKARLVAKGYSQRAGIDYDEVFAPVARLETVRLIISLAAQNKWKIHQMDFKLA 889
Query: 636 FLHGDLQEEVYIKLPNG-MPTPSPNTVCKLKRSLYGLKQAPRVWFEKFRSTLLGFEFSQS 694
FL+GD +EEVYI+ P G + + V +LK++LYGLKQAPR W + +F +
Sbjct: 890 FLNGDFEEEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAPRAWNTRIDKYFKEKDFIKC 949
Query: 695 RYDPSLFLQRTPKGMVVLLVYVDDIVVTGSDQDAISRIKNLLHSTFHMKELGRLTYFLGL 754
Y+ +L+++ + +++ +YVDD++ TG++ K + F M ++G ++Y+LG+
Sbjct: 950 PYEHALYIKIQKEDILIACLYVDDLIFTGNNPSMFEEFKKEMTKEFEMTDIGLMSYYLGI 1009
Query: 755 EVHYHHEGVFLNQQKYIQDLVQLAGLTNATLVDTPMEVNVKYRRDEGDHLDDPTQYRKLV 814
EV +F+ Q+ Y +++++ + ++ V TPME +K + E DPT ++ LV
Sbjct: 1010 EVKQEDNRIFITQEGYAKEVLKKFKMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLV 1069
Query: 815 GSLIYVTITRPDISFAVHTVSKFMQAPRHFHLSAVQQIIRYLLGTLKRGLFFPVGSSIKL 874
GSL Y+T TRPDI +AV VS++M+ P H A ++I+RY+ GT+ GL + S KL
Sbjct: 1070 GSLRYLTCTRPDILYAVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKL 1129
Query: 875 QAYSDADWAGCPDTRKSTTGWCMFLGNAPISWKCKKQDSVSKSSTEAEYRAMSAACSEII 934
YSD+DW D RKST+G+ ++G+ +W KKQ V+ S+ EAEY A ++ I
Sbjct: 1130 VGYSDSDWGRDVDDRKSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAI 1189
Query: 935 WLRGLLTELGFSQDQPTPLHADNTSAIQIAANPVYHERTKHIEVDCHSIREAYDRRIINL 994
WLR LL EL Q++PT + DN SAI +A NPV+H+R+KHI+ H IRE ++ + L
Sbjct: 1190 WLRNLLKELSLPQEEPTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQL 1249
Query: 995 PHVSTSVQTADIFTKSLTRQ 1014
+V T Q ADIFTK L R+
Sbjct: 1250 EYVKTHDQVADIFTKPLKRE 1269
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.339 0.147 0.496
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,663,389,574
Number of Sequences: 2540612
Number of extensions: 68974435
Number of successful extensions: 219574
Number of sequences better than 10.0: 1737
Number of HSP's better than 10.0 without gapping: 1679
Number of HSP's successfully gapped in prelim test: 58
Number of HSP's that attempted gapping in prelim test: 213369
Number of HSP's gapped (non-prelim): 2926
length of query: 1019
length of database: 863,360,394
effective HSP length: 138
effective length of query: 881
effective length of database: 512,755,938
effective search space: 451737981378
effective search space used: 451737981378
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 80 (35.4 bits)
Medicago: description of AC146527.6