
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0348.7
(778 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAB82754.1| retrofit [Oryza longistaminata] gi|7444451|pir||T... 690 0.0
gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein ... 689 0.0
gb|AAK43485.1| polyprotein, putative [Arabidopsis thaliana] 678 0.0
gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cult... 676 0.0
gb|AAF99727.1| F17L21.7 [Arabidopsis thaliana] 663 0.0
gb|AAC02664.1| polyprotein [Arabidopsis thaliana] 662 0.0
ref|XP_475911.1| putative polyprotein [Oryza sativa (japonica cu... 659 0.0
gb|AAC02666.1| polyprotein [Arabidopsis thaliana] 659 0.0
gb|AAC02669.1| polyprotein [Arabidopsis thaliana] 658 0.0
emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|32692... 634 e-180
gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir|... 624 e-177
gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis ... 617 e-175
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas... 608 e-172
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia... 607 e-172
ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (jap... 604 e-171
ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (jap... 598 e-169
gb|AAK51235.1| polyprotein [Arabidopsis thaliana] 591 e-167
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] 585 e-165
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ... 585 e-165
gb|AAD43604.1| T3P18.3 [Arabidopsis thaliana] gi|25301688|pir||H... 584 e-165
>gb|AAB82754.1| retrofit [Oryza longistaminata] gi|7444451|pir||T10728 probable
gag/pol polyprotein - long-staminate rice retrotransposon
retrofit
Length = 1445
Score = 690 bits (1781), Expect = 0.0
Identities = 381/832 (45%), Positives = 519/832 (61%), Gaps = 66/832 (7%)
Query: 1 SVERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHP 60
S ERKHRHIVE GL+LLS+ASMPL FWD AF+ ATYLINR+ + T+Q ++P KL+ P
Sbjct: 615 SAERKHRHIVEVGLSLLSYASMPLKFWDEAFVAATYLINRIPSKTIQNSTPLEKLFNQKP 674
Query: 61 DFKSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLD-QSGRIYVSKDVL 119
D+ SL++FG AC+P LRPYN++KL SK+CVFLG+S+ HKG+KCLD SGR+Y+S+DV+
Sbjct: 675 DYSSLRVFGCACWPHLRPYNTHKLQFRSKQCVFLGFSTHHKGFKCLDVSSGRVYISRDVV 734
Query: 120 FHEHRFPYTT--------------LFPSEPFSPPTSSA----EYFPLSTVPIISRSM--- 158
F E+ FP++T L PS + T+SA P++ P+ S ++
Sbjct: 735 FDENVFPFSTLHSNAGARLRSEILLLPSPLTNYNTASAGGTHVVAPVANTPLPSDNLISN 794
Query: 159 -----PQPSPAPISTELANPGPLSPQSEASDLQSQPS--PIPTGSGLASTSQPAEHASSE 211
+ A E+ N + +D+ + P+ S++ P + A +
Sbjct: 795 AADVTSGENSAAHEQEMENEQEIENVMHGNDVHGDAASGPVLDQPTADSSTAPDQGADTS 854
Query: 212 SAHQEMATSSG-----VHAASSASTVAVPVNAHPMQ------------------TRSKSG 248
A A+ +G + A ++ S A + P+Q TR +SG
Sbjct: 855 DAVSGAASDAGGDTATLGAGAANSAAAGGEESQPVQPDVTGTVLATVAPASRPHTRLRSG 914
Query: 249 IIKPRLNPTLLLTH------MEPTTVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSN 302
I K ++ + + EP K+A+ D W AM+ EYNAL+ N TW LVP
Sbjct: 915 IRKEKVYTDGTVKYGCFSSTGEPQNDKEALGDKNWRDAMETEYNALIKNDTWHLVPYEKG 974
Query: 303 RKAVGCKWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLA 362
+ +GCKW+Y++K DG++++YKARLVAKG+ Q G DY +TFSPVVK T+R+ILS+A
Sbjct: 975 QNIIGCKWVYKIKRKADGTLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRIILSIA 1034
Query: 363 ISRGWPLQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDK-TLVCKLHKALYGLKQAPRAWF 421
+SRGW L+Q+DV NAFL+G LEEEVYM QPPGFE K VCKL KALYGLKQAPRAW+
Sbjct: 1035 VSRGWSLRQLDVQNAFLHGFLEEEVYMQQPPGFESSSKPDYVCKLDKALYGLKQAPRAWY 1094
Query: 422 HRLKEVLLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSI 481
RL + L++ GF+ASK D SLF N +++L+YVDDII+ + L L+
Sbjct: 1095 SRLSKKLVELGFEASKADTSLFFLNKGGILMFVLVYVDDIIVASSTEKATTALLKDLNKE 1154
Query: 482 FALKQLGQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTK 541
FALK LG L YFLG++VT ++NG ++L Q KY NDLL +VNM++ P+STP+ KLT
Sbjct: 1155 FALKDLGDLHYFLGIEVTKVSNG-VILTQEKYANDLLKRVNMSNCKPVSTPLSVSEKLTL 1213
Query: 542 HGGTSL--HDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYL 599
+ G+ L +D +YRS+VGALQY T+TRP+I+Y+VNKVCQFL P HW AVKRILRYL
Sbjct: 1214 YEGSPLGPNDAIQYRSIVGALQYLTLTRPDIAYSVNKVCQFLHAPTTSHWIAVKRILRYL 1273
Query: 600 KGTITHGVLLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTL 659
+ G+ + + T + + DADW DDR+ST G VFLG NL+SW+A+KQ
Sbjct: 1274 NQCTSLGLHIHKSASTL---VHGYSDADWAGSIDDRKSTGGFAVFLGSNLVSWSARKQPT 1330
Query: 660 VARSSTEAEYRSLANTTAELLWVESLLTELKI-AFTVPTVLCDNMSTVLLTHNPILHTRT 718
V+RSSTEAEY+++ANTTAEL+WV++LL EL I + + CDN+ L+ NP+ H RT
Sbjct: 1331 VSRSSTEAEYKAVANTTAELIWVQTLLKELGIESPKAAKIWCDNLGAKYLSANPVFHART 1390
Query: 719 KHMEMDLFFVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
KH+E+D FVRE+V K L + VPS Q AD FTKALS + LN+
Sbjct: 1391 KHIEVDYHFVRERVSQKLLEIDFVPSGDQVADGFTKALSACLLENFKHNLNL 1442
>gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein [Zea mays]
gi|7444442|pir||T02087 gag/pol polyprotein - maize
retrotransposon Hopscotch
Length = 1439
Score = 689 bits (1778), Expect = 0.0
Identities = 391/811 (48%), Positives = 509/811 (62%), Gaps = 62/811 (7%)
Query: 1 SVERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHP 60
+ ERKHRHIVE GLALL+ +SMPL +WDHAFL A YLINR + T+ +P KL G P
Sbjct: 615 AAERKHRHIVEVGLALLAQSSMPLKYWDHAFLAAVYLINRTPSKTIAHDTPLHKLTGATP 674
Query: 61 DFKSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQS-GRIYVSKDVL 119
D+ SL+IFG AC+P LRPYN +KL S CVFLGYS+ HKG+KCLD S GRIY+S+DV+
Sbjct: 675 DYSSLRIFGCACWPNLRPYNQHKLQFRSTRCVFLGYSNMHKGFKCLDISTGRIYISRDVV 734
Query: 120 FHEHRFPYTTL-------FPSEPFSPP---------TSSAEYFPLSTVPI---ISRSMPQ 160
F EH FP+ +L + SE P T A P S+ P+ +
Sbjct: 735 FDEHVFPFASLNKNAGVKYTSEVLLLPHDSCGNNMLTDHANNLPGSSSPLPFLAQHFLQG 794
Query: 161 PSPAPISTELANPGPLSPQSEAS--------DLQSQPSPIPTGSGLASTSQPAEHASSES 212
S P S A P S +E S L SP PTG +++ ++PA A S S
Sbjct: 795 NSEVPTSNNTAMALPASGPNEVSVPPALVPSSLVPAASPAPTG--VSANAEPAPEADSLS 852
Query: 213 AHQEMATSS--GVHAA-----SSASTVA------VPVNAHPMQTRSKSGIIKPRL----- 254
+ +AT S GV A + S+VA P++A +TR + GI KP+
Sbjct: 853 SGPPVATESVTGVPDADPLLQAPGSSVAHQTPDSAPLSAAAPRTRLQHGISKPKQFTDGT 912
Query: 255 ----NPTLLLTHMEPTTVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGCKW 310
N +T EP++V +A+ D +W AM+ E+ AL N TW+LVP R + CKW
Sbjct: 913 VRYGNAAARIT--EPSSVSEALADPQWRAAMEAEFQALQKNNTWTLVPPDRTRNLIDCKW 970
Query: 311 IYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWPLQ 370
+++VK N DGSI++ KARLVAKG+ Q G DY +TFSPVVK T+RL+LSLA+S+ W L+
Sbjct: 971 VFKVKYNADGSIDRLKARLVAKGFKQQYGIDYDDTFSPVVKHSTIRLVLSLAVSQKWSLR 1030
Query: 371 QIDVNNAFLNGVLEEEVYMTQPPGF-EHKDKTLVCKLHKALYGLKQAPRAWFHRLKEVLL 429
Q+DV NAFL+G+LEE VYM QPPGF + C L K+LYGLKQ PRAW+ RL E L
Sbjct: 1031 QLDVQNAFLHGILEETVYMKQPPGFADTTHPNYHCHLQKSLYGLKQRPRAWYSRLSEKLQ 1090
Query: 430 QFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQLGQ 489
GF SK D SLF YN+ IY+L+YVDDII+TG S I + AKL FA+K LG
Sbjct: 1091 SLGFVPSKADVSLFIYNAHSTAIYILVYVDDIIITGSSPHAIDNVLAKLKDDFAIKDLGD 1150
Query: 490 LDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSL-- 547
L YFLG++V H LLL Q KY DLL +V M P+ TP+ KL+ GT L
Sbjct: 1151 LHYFLGIEV-HRKGDGLLLCQEKYARDLLKRVGMECCKPVHTPVATSEKLSASAGTLLSP 1209
Query: 548 HDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGV 607
+ T+YRSVVGALQY T+TRP++SYA+N+VCQFL P + HW AVKRILR ++ TI G+
Sbjct: 1210 EETTKYRSVVGALQYLTLTRPDLSYAINRVCQFLHAPTDLHWTAVKRILRNIQHTIGLGL 1269
Query: 608 LLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEA 667
++P + L L AF DADW PDDR+ST G +FLGPNLISW +KKQ+ V+RSSTEA
Sbjct: 1270 TIRP---SLSLMLSAFSDADWAGCPDDRKSTGGYALFLGPNLISWNSKKQSTVSRSSTEA 1326
Query: 668 EYRSLANTTAELLWVESLLTELKIAFT-VPTVLCDNMSTVLLTHNPILHTRTKHMEMDLF 726
EY+++AN TAE++W++SLL EL I T +P + CDN+ L+ PI + RTKH+E+D
Sbjct: 1327 EYKAMANATAEVIWLQSLLHELGIRLTGIPRLWCDNLGATYLSSKPIFNARTKHIEVDFH 1386
Query: 727 FVREKVQAKSLVVQHVPSEHQRADIFTKALS 757
FVR++V +K L ++ + + Q AD FTKAL+
Sbjct: 1387 FVRDRVLSKKLDIRLISTNDQVADGFTKALT 1417
>gb|AAK43485.1| polyprotein, putative [Arabidopsis thaliana]
Length = 1459
Score = 678 bits (1749), Expect = 0.0
Identities = 369/826 (44%), Positives = 497/826 (59%), Gaps = 62/826 (7%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRHIVETGL LL+ AS+P +W +AF TA YLINRM T L SP+ KL+G+ P++
Sbjct: 632 ERKHRHIVETGLTLLTQASVPREYWTYAFATAVYLINRMPTPVLCLQSPFQKLFGSSPNY 691
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLD-QSGRIYVSKDVLFH 121
+ L++FG CFP+LRPY NKL SK CVFLGYS + Y CLD + R+Y S+ V+F
Sbjct: 692 QRLRVFGCLCFPWLRPYTRNKLEERSKRCVFLGYSLTQTAYLCLDVDNNRLYTSRHVMFD 751
Query: 122 EHRFPYTTLF----PSEPFSPPTSSAEYFP-----------LSTVPIISRSMPQP----- 161
E +P+ S +PP SS+ P L + P S P P
Sbjct: 752 ESTYPFAASIREQSQSSLVTPPESSSSSSPANSGFPCSVLRLQSPPASSPETPSPPQQQN 811
Query: 162 ----------SPAPI------------STELANPGPLSPQSEASDLQSQPSPIPTGSGLA 199
SP P S ++N P +P + ++Q +P G
Sbjct: 812 DSPVSPRQTGSPTPSHHSQVRDSTLSPSPSVSNSEPTAPHENGPEPEAQSNPNSPFIGPL 871
Query: 200 STSQPAEHASS--------ESAHQEMATSSGVHAASSASTVAVPVNAHPMQTRSKSGIIK 251
P + SS +S + + AA+S S P N H M+TRSK+ I K
Sbjct: 872 PNPNPETNPSSSIEQRPVDKSTTTALPPNQTTIAATSNSRSQPPKNNHQMKTRSKNNITK 931
Query: 252 PRLNPTLLLT----HM-EPTTVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAV 306
P+ +L + H+ EP TV QA++D KW AM +E++A N TW LVP + V
Sbjct: 932 PKTKTSLTVALTQPHLSEPNTVTQALKDKKWRFAMSDEFDAQQRNHTWDLVPPNPTQHLV 991
Query: 307 GCKWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRG 366
GC+W++++K P+G I+KYKARLVAKG++Q G DY+ETFSPV+K T+R++L +A+ +
Sbjct: 992 GCRWVFKLKYLPNGLIDKYKARLVAKGFNQQYGVDYAETFSPVIKATTIRVVLDVAVKKN 1051
Query: 367 WPLQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDK-TLVCKLHKALYGLKQAPRAWFHRLK 425
WPL+Q+DVNNAFL G L EEVYM QPPGF KD+ + VC+L KA+YGLKQAPRAW+ LK
Sbjct: 1052 WPLKQLDVNNAFLQGTLTEEVYMAQPPGFVDKDRPSHVCRLRKAIYGLKQAPRAWYMELK 1111
Query: 426 EVLLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALK 485
+ LL GF S D SLF Y+ +Y+L+YVDDII+TG + + + L F++K
Sbjct: 1112 QHLLNIGFVNSLADTSLFIYSHGTTLLYLLVYVDDIIVTGSDHKSVSAVLSSLAERFSIK 1171
Query: 486 QLGQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGT 545
L YFLG++ T N L L Q KY+ DLL K NM DA P++TP+ KLT HGGT
Sbjct: 1172 DPTDLHYFLGIEATR-TNTGLHLMQRKYMTDLLAKHNMLDAKPVATPLPTSPKLTLHGGT 1230
Query: 546 SLHDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITH 605
L+D +EYRSVVG+LQY TRP+I++AVN++ QF+ P +HW+A KR+LRYL GT TH
Sbjct: 1231 KLNDASEYRSVVGSLQYLAFTRPDIAFAVNRLSQFMHQPTSDHWQAAKRVLRYLAGTTTH 1290
Query: 606 GVLLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSST 665
G+ L S P+ L AF DADW D D ST+ ++LG N ISW++KKQ V+RSST
Sbjct: 1291 GIFLNSSS---PIHLHAFSDADWAGDSADYVSTNAYVIYLGRNPISWSSKKQRGVSRSST 1347
Query: 666 EAEYRSLANTTAELLWVESLLTELKIAFT-VPTVLCDNMSTVLLTHNPILHTRTKHMEMD 724
E+EYR++AN +E+ W+ SLLTEL I PT+ CDN+ + NP+ H+R KH+ +D
Sbjct: 1348 ESEYRAVANAASEIRWLCSLLTELHIRLPHGPTIFCDNIGATYICANPVFHSRMKHIALD 1407
Query: 725 LFFVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
FVR +Q+++L V HV + Q AD TK+LS FL R K+ V
Sbjct: 1408 YHFVRGMIQSRALRVSHVSTNDQLADALTKSLSRPHFLSARSKIGV 1453
>gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1437
Score = 676 bits (1745), Expect = 0.0
Identities = 380/826 (46%), Positives = 501/826 (60%), Gaps = 66/826 (7%)
Query: 1 SVERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHP 60
S ERKHRHIVE GLALL+++SMPL FW AFL+A YLINR + L SP +L G+ P
Sbjct: 613 SAERKHRHIVEVGLALLAYSSMPLKFWGEAFLSAVYLINRTPSRVLHDVSPLERLLGHKP 672
Query: 61 DFKSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQS-GRIYVSKDVL 119
D+ +L++FG AC+P LRPYN +KL S C FLGYS+ HKG+KCLD S GR+Y+S+DV+
Sbjct: 673 DYNALRVFGCACWPNLRPYNKHKLQFRSTTCTFLGYSTLHKGFKCLDPSTGRVYISRDVV 732
Query: 120 FHEHRFPYTTLFPSEPFSPPTSSAEYFPLSTVPIISRSMPQ-----------PSPAPIST 168
F E +FP+T L P+ + ++ VP ++ S+P+ P A +S
Sbjct: 733 FDETQFPFTKLHPN------VGAKLRAEIALVPELAASLPRGLQQISSVINTPENANVSN 786
Query: 169 ELAN----------------PGPLSPQSEASDLQSQP---SPIPTGSGLASTSQPAEHAS 209
E P +S + A S P P G ++T+ PA
Sbjct: 787 ENMQQDSTYDNEPETETDGAPDTVSANAPAESSGSPPINEPASPFGESDSATASPASAPV 846
Query: 210 SESAHQEMATSSGVHAASSASTVAVPVNA----HPM-----------QTRSKSGIIKPRL 254
+ + H + A S S S P A HP +TR +SGI K ++
Sbjct: 847 NSAPHPDAAASGSSAPRGSTSQGGTPSVAIDDPHPATTVTGQEAQRPRTRLQSGIRKEKV 906
Query: 255 NPT------LLLTHMEPTTVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGC 308
+L + EP ++ A++++ W AM EY AL+ N TW LVP R + C
Sbjct: 907 YTDGTVKWGMLTSTGEPENLQDALQNNNWKCAMDAEYMALIKNNTWHLVPPQQGRNVIDC 966
Query: 309 KWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWP 368
KW+Y++K DGS+++YKARLVAKG+ Q G DY +TFSPVVK T+R+ILS+A+SRGW
Sbjct: 967 KWVYKIKRKQDGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRIILSIAVSRGWC 1026
Query: 369 LQQIDVNNAFLNGVLEEEVYMTQPPGFEH-KDKTLVCKLHKALYGLKQAPRAWFHRLKEV 427
L+Q+DV NAFL+GVLEEEVYM QPPG+E+ VCKL KALYGLKQAPRAW+ RL
Sbjct: 1027 LRQLDVQNAFLHGVLEEEVYMKQPPGYENPSTPDYVCKLDKALYGLKQAPRAWYSRLSGK 1086
Query: 428 LLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQL 487
L GFK SK D SLF YN I++LIYVDDII+ + L L FALK L
Sbjct: 1087 LHDLGFKGSKADTSLFFYNKGSLTIFLLIYVDDIIVVSSRKEAVSALLQDLQKEFALKDL 1146
Query: 488 GQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSL 547
G L YFLG++VT + G +L++Q KY +DLL +VNM+D ++TP+ KL GT L
Sbjct: 1147 GDLHYFLGIEVTKIP-GGILMSQEKYASDLLKRVNMSDCKSVATPLSASEKLIAGKGTIL 1205
Query: 548 --HDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITH 605
+D T+YRS+VGALQY T+TR +I+++VNKVCQFL +P EHW AVKRILRY+K
Sbjct: 1206 GPNDATQYRSIVGALQYLTLTRLDIAFSVNKVCQFLHNPTTEHWAAVKRILRYIKQCT-- 1263
Query: 606 GVLLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSST 665
G+ L+ C + + + + DADW DDRRST G V+LG NL+SW AKKQ V+RSST
Sbjct: 1264 GLGLRICKSSSMI-VSGYSDADWAGCLDDRRSTGGFAVYLGDNLVSWNAKKQATVSRSST 1322
Query: 666 EAEYRSLANTTAELLWVESLLTELKIAFTVPTVL-CDNMSTVLLTHNPILHTRTKHMEMD 724
EAEY++LAN TAE++WV++LL EL I L CDNM L+ NP+ H RTKH+E+D
Sbjct: 1323 EAEYKALANATAEIMWVQTLLQELNIVSPAMAQLWCDNMGAKYLSFNPVFHARTKHIEVD 1382
Query: 725 LFFVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
FVRE+V K L V +V + Q AD FTKAL + + LN+
Sbjct: 1383 YHFVRERVARKLLQVDYVSTNDQVADGFTKALPVKQLENFKYNLNL 1428
>gb|AAF99727.1| F17L21.7 [Arabidopsis thaliana]
Length = 1534
Score = 663 bits (1711), Expect = 0.0
Identities = 367/817 (44%), Positives = 500/817 (60%), Gaps = 56/817 (6%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRHI+ETGL LL+ AS+P +W +AF TA YLINR+ ++ L SPY KL+ P++
Sbjct: 719 ERKHRHILETGLTLLTQASIPTSYWTYAFGTAVYLINRLPSSVLNNESPYSKLFKTSPNY 778
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQS-GRIYVSKDVLFH 121
L++FG +CFP+LRPY ++KL S+ CVFLGYS + Y CLD+S GR+Y S+ V F
Sbjct: 779 LKLRVFGCSCFPWLRPYTNHKLERRSQPCVFLGYSLTQSAYLCLDRSSGRVYTSRHVQFV 838
Query: 122 EHRFPYTTLFPSEPFSPPTSSAE------YFPLSTVPIISRSMP---QPSPAPISTELAN 172
E +FP++ S+ S SS E + P S +PI S S P PS P + ++
Sbjct: 839 EDQFPFSI---SDTHSVSNSSPEEASPSCHQPPSRIPIQSSSPPLVQAPSSLPPLSSDSH 895
Query: 173 PGPLSPQSEASDLQSQPSPI---------------PTGSGLASTSQPAEHASSESAHQEM 217
P + S +S + + PT S A + + +SS E
Sbjct: 896 RRPNAETSSSSSSTNNDVVVSKDNTQVDNRNNFIGPTSSSSAQSQNNSNPSSSIQTQNEP 955
Query: 218 ATS-------SGVHAASSASTVAV----------PVNAHPMQTRSKSGIIKPRLNPTLLL 260
S S ++ S+ST A P N HPM+TR+K+ I KP+ +LL
Sbjct: 956 NPSPSPTPQNSSPESSPSSSTSATSTVPNPPPPPPTNNHPMRTRAKNHITKPKTKLSLLA 1015
Query: 261 THME-----PTTVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGCKWIYRVK 315
++ P TV QA+RD+KW AM EE NA + N T+ LVP N+ + KWI+ +K
Sbjct: 1016 KTVQTRPQIPNTVNQALRDEKWRNAMGEEINAQIRNNTFELVPPKPNQNVISTKWIFTLK 1075
Query: 316 ENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWPLQQIDVN 375
P+G++++YKARLVA+G+ Q G YSETFSPVVK +T+RL+L LA+SR W ++Q+DVN
Sbjct: 1076 YLPNGTLDRYKARLVARGFRQQYGLHYSETFSPVVKSLTIRLVLQLAVSRSWTIKQLDVN 1135
Query: 376 NAFLNGVLEEEVYMTQPPGFEHKDKT-LVCKLHKALYGLKQAPRAWFHRLKEVLLQFGFK 434
NAFL G L +EVY+TQPPGF D+ VC+L KALYGLKQAPRAW+ L+ + GF
Sbjct: 1136 NAFLQGTLTDEVYVTQPPGFIDPDRPHHVCRLKKALYGLKQAPRAWYQELRNFVCSLGFT 1195
Query: 435 ASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQLGQLDYFL 494
S D S+F Y + + +Y L+YVDDII+TG S L+ L F+LK L YFL
Sbjct: 1196 NSLADTSVFVYINDIQIVYCLVYVDDIIVTGSSDALVMAFITALSRRFSLKDPTDLVYFL 1255
Query: 495 GVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSLHDPTEYR 554
G++ T + G L L Q KY+ DLL+++ M DA P+STPM KL+ + G +L +P EYR
Sbjct: 1256 GIEATRTSQG-LHLMQHKYVYDLLSRMKMLDAKPVSTPMATHPKLSLYSGIALDEPGEYR 1314
Query: 555 SVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGVLLQPCSM 614
+V+G+LQY TRP+I+YAVN++ QF+ P + HW+A KR+LRYL GT THG+LL+ S
Sbjct: 1315 TVIGSLQYLAFTRPDIAYAVNRLSQFMHRPTDIHWQAAKRVLRYLAGTATHGILLRSNS- 1373
Query: 615 TQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEAEYRSLAN 674
PL L AF DADW D DD ST+ V+LG I+W++KKQ VARSSTEAEYR++AN
Sbjct: 1374 --PLSLHAFSDADWAGDNDDFVSTNAYIVYLGSTPIAWSSKKQKGVARSSTEAEYRAVAN 1431
Query: 675 TTAELLWVESLLTELKIAF-TVPTVLCDNMSTVLLTHNPILHTRTKHMEMDLFFVREKVQ 733
TT+E+ WV SLLTEL I +P + CDN+ L+ NP+ H+R KH+ +D F+R+ V
Sbjct: 1432 TTSEIRWVCSLLTELGITLPKMPVIYCDNVGATYLSANPVFHSRMKHLALDYHFIRDNVS 1491
Query: 734 AKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
A +L V H+ + Q AD TK L FL K+ V
Sbjct: 1492 AGALRVSHISTHDQLADALTKPLPRQHFLQFSSKIGV 1528
>gb|AAC02664.1| polyprotein [Arabidopsis thaliana]
Length = 1451
Score = 662 bits (1708), Expect = 0.0
Identities = 377/824 (45%), Positives = 497/824 (59%), Gaps = 62/824 (7%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRHIVETGL LLS ASMP +W +AF TA YLINRM T L SPY KL+G P++
Sbjct: 628 ERKHRHIVETGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLGNESPYVKLFGQPPNY 687
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQS-GRIYVSKDVLFH 121
L+IFG CFP+LRPY ++KL S CV LGYS S Y CLD++ GR+Y S+ V F
Sbjct: 688 LKLRIFGCLCFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFA 747
Query: 122 EHRFPYTTLFPS--EPFSPPTSSAEY--------FPLSTVPIISRSMPQPSPAPISTE-L 170
E FP++T PS P PP S PL+T P S S P +P +E L
Sbjct: 748 ESSFPFSTTSPSVTPPSDPPLSQDTRPVSVPLLARPLTTAPPSSPSCSAPHRSPSQSENL 807
Query: 171 ANPGPLSPQSEASDLQSQPSPI--------------PTGSGLASTSQP-----------A 205
+ P PL P S SP PTGS + QP +
Sbjct: 808 SPPAPLQPSLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLSPQPQRPQPQSPQSTS 867
Query: 206 EHASS---ESAHQEMATSSGVHAASSASTVAVPVNAHP------MQTRSKSGIIKPRLNP 256
H+SS S + + + S +S+ + + P N +P M+TRSK+ I+KP NP
Sbjct: 868 PHSSSPQPNSPNPQHSPRSLTPTLTSSPSPSPPPNPNPPPIQHTMRTRSKNNIVKP--NP 925
Query: 257 TLLLTHMEPT--------TVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGC 308
+PT TV +A+ D W QAM +E NA NGT+ LVP N+ VGC
Sbjct: 926 KFANLATKPTPLKPIIPKTVVEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVVGC 985
Query: 309 KWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWP 368
KW++ +K +G +++YKARLVAKG+ Q G D+ ETFSPV+K TVR +L +A+S+GW
Sbjct: 986 KWVFTLKYLSNGVLDRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSVLHIAVSKGWS 1045
Query: 369 LQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDKT-LVCKLHKALYGLKQAPRAWFHRLKEV 427
++QIDVNNAFL G L +EVY+TQPPGF KD VC+L+KALYGLKQAPRAW+ L+
Sbjct: 1046 IRQIDVNNAFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLYKALYGLKQAPRAWYQELRSY 1105
Query: 428 LLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQL 487
LL GF S D SLFT +Y+L+YVDD+++TG +I + A L + F+LK L
Sbjct: 1106 LLTQGFVNSVADTSLFTLRHERTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDL 1165
Query: 488 GQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSL 547
G++ YFLG++ T + G L L Q +Y+ DLL K NM A P+ TPM KL+ G L
Sbjct: 1166 GEMSYFLGIEATRTSKG-LHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGKPL 1224
Query: 548 HDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGV 607
P+EYR+V+G+LQY TRP+I+YAVN++ Q++ P + HW+A KRILRYL GT +HG+
Sbjct: 1225 DKPSEYRAVLGSLQYLLFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGI 1284
Query: 608 LLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEA 667
++ PL L A+ DADW D D+ ST+ ++LG N ISW++KKQ VARSSTEA
Sbjct: 1285 FIR---ADTPLTLHAYSDADWAGDIDNYNSTNAYILYLGSNPISWSSKKQKGVARSSTEA 1341
Query: 668 EYRSLANTTAELLWVESLLTELKIAFTVPTVL-CDNMSTVLLTHNPILHTRTKHMEMDLF 726
EYR++AN T+E+ WV SLLTEL I + P V+ CDN+ L+ NP+ H+R KH+ +D
Sbjct: 1342 EYRAVANATSEIRWVCSLLTELGITLSSPPVVYCDNVGATYLSANPVFHSRMKHIALDFH 1401
Query: 727 FVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
FVRE VQA +L V HV ++ Q AD TK L F + K+ V
Sbjct: 1402 FVRESVQAGALRVTHVSTKDQLADALTKPLPRQPFTTLISKIGV 1445
>ref|XP_475911.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|52353546|gb|AAU44112.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|50080247|gb|AAT69582.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1256
Score = 659 bits (1701), Expect = 0.0
Identities = 363/815 (44%), Positives = 498/815 (60%), Gaps = 50/815 (6%)
Query: 1 SVERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHP 60
S ERKHRHIVE GL+LL+HAS+PL FWD A+ + YLINRM T L +SP L+ P
Sbjct: 444 SAERKHRHIVEVGLSLLAHASLPLKFWDEAYQSGVYLINRMPTKVLGYSSPLECLFKQTP 503
Query: 61 DFKSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQS-GRIYVSKDVL 119
D+++L+ FG AC+P LRPYNS+K+ SK C FLGYS HKG++CLD S GRIY+S+DV+
Sbjct: 504 DYQALRTFGCACWPDLRPYNSHKMHFRSKRCTFLGYSPLHKGFRCLDSSTGRIYISRDVV 563
Query: 120 FHEHRFPYT--------------TLFPSEPFSPP----TSSAEYFPLSTVPIISRSMP-- 159
F E FP+ L P++ +P Y+P + +++ P
Sbjct: 564 FDESVFPFAELNPNAGTNLRKEVNLLPADMLNPGDVQLNDHVNYYPTAPANMVAAENPVE 623
Query: 160 --QPSPAPISTELANPGP--------LSPQSEAS-DLQSQPSPIPTG---SGLASTSQPA 205
+ + A + A+ G P +A+ D+ + P+ + S +++++ PA
Sbjct: 624 NTEENLASTRDDAADSGGSDTGTISNADPADDAAGDMTANPNLNDSSTHESSISASASPA 683
Query: 206 EHASSESAHQEMATSSGVHAASSASTVAVPVNAHPMQTRSKSGIIKPRLNPT------LL 259
+S +A + + H + S+ + P+ T + GI K ++ L
Sbjct: 684 SQSSVATAPEATLPNPQQHQQALRSSTPEGEASRPV-THLQKGIRKEKIYTDRTVKYGCL 742
Query: 260 LTHMEPTTVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGCKWIYRVKENPD 319
T EP + A+ D W AM ++ AL+ N TW LVP R + KW+Y++K D
Sbjct: 743 TTTGEPRDLHDALHDTNWKHAMDAKFTALLHNKTWHLVPPQKGRNIIDYKWVYKIKRKQD 802
Query: 320 GSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWPLQQIDVNNAFL 379
GS+++YKARLVAKG+ Q G DY +TFSPVVK T+R+ILS+A+SRGW L+Q+DV NAFL
Sbjct: 803 GSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRIILSIAVSRGWTLRQLDVQNAFL 862
Query: 380 NGVLEEEVYMTQPPGFEHK-DKTLVCKLHKALYGLKQAPRAWFHRLKEVLLQFGFKASKC 438
+G+LEEEVYM QPPG+E K VCKL KALYGLKQAPRAW+ +L + L GF+ SK
Sbjct: 863 HGILEEEVYMKQPPGYEDKVHPDYVCKLDKALYGLKQAPRAWYAKLSQKLQHLGFQGSKA 922
Query: 439 DPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQLGQLDYFLGVQV 498
D SLF YN I++L+YVDDII+ + L L FALK LG L YFLG++V
Sbjct: 923 DTSLFFYNKGGLIIFVLVYVDDIIVASSRQDAVPALLKDLQKDFALKDLGDLHYFLGIEV 982
Query: 499 THLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSL--HDPTEYRSV 556
++G ++L Q KY+ DLL +V M D P+STP+ KLT H G L +D + YRSV
Sbjct: 983 NKASSG-IVLTQEKYVTDLLRRVGMTDCKPVSTPLSTSEKLTLHEGDLLGPNDASNYRSV 1041
Query: 557 VGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGVLLQPCSMTQ 616
VGALQY T+TRP+I + VNKVCQFL P HW A+KRILRYLK G+ + S ++
Sbjct: 1042 VGALQYLTLTRPDIYFPVNKVCQFLHAPTIVHWAAMKRILRYLKQCTKLGLKI---SKSK 1098
Query: 617 PLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEAEYRSLANTT 676
+ + + DADW + DDRRST G VFLG NL+SW AKKQ V RSSTE+EY++LAN T
Sbjct: 1099 SMLVSGYSDADWAGNIDDRRSTGGFAVFLGDNLVSWNAKKQATVPRSSTESEYKALANAT 1158
Query: 677 AELLWVESLLTELKI-AFTVPTVLCDNMSTVLLTHNPILHTRTKHMEMDLFFVREKVQAK 735
AE++W+++LL EL + + + + CDN+ L+ NP+ H RTKH+E+D FVRE++Q K
Sbjct: 1159 AEIMWIQTLLEELSVPSPPMARLWCDNLGAKYLSSNPVFHARTKHIEVDYHFVRERMQRK 1218
Query: 736 SLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
L V+ +P+ Q AD FTKALS + + LNV
Sbjct: 1219 LLEVEFIPTGDQVADGFTKALSARQLENFKYNLNV 1253
>gb|AAC02666.1| polyprotein [Arabidopsis thaliana]
Length = 1451
Score = 659 bits (1699), Expect = 0.0
Identities = 376/824 (45%), Positives = 496/824 (59%), Gaps = 62/824 (7%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRHIVETGL LLS ASMP +W +AF TA YLINRM T L SPY KL+G P++
Sbjct: 628 ERKHRHIVETGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLGNESPYVKLFGQPPNY 687
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQS-GRIYVSKDVLFH 121
L+IFG CFP+LRPY ++KL S CV LGYS S Y CLD++ GR+Y S+ V F
Sbjct: 688 LKLRIFGCLCFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFA 747
Query: 122 EHRFPYTTLFPS--EPFSPPTSSAEY--------FPLSTVPIISRSMPQPSPAPISTE-L 170
E FP++T PS P PP S PL+T P S S P +P +E L
Sbjct: 748 ESSFPFSTTSPSVTPPSDPPLSQDTRPVSVPLLARPLTTAPPSSPSCSAPHRSPSQSENL 807
Query: 171 ANPGPLSPQSEASDLQSQPSPI--------------PTGSGLASTSQP-----------A 205
+ P PL P S SP PTGS + QP +
Sbjct: 808 SPPAPLQPSLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLSPQPQRPQPQSPQSTS 867
Query: 206 EHASS---ESAHQEMATSSGVHAASSASTVAVPVNAHP------MQTRSKSGIIKPRLNP 256
H+SS S + + + S +S+ + + P N +P M+TRSK+ I+KP NP
Sbjct: 868 PHSSSPQPNSPNPQHSPRSLTPTLTSSPSPSPPPNPNPPPIQHTMRTRSKNNIVKP--NP 925
Query: 257 TLLLTHMEPT--------TVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGC 308
+PT TV +A+ D W QAM +E NA NGT+ LVP N+ VGC
Sbjct: 926 KFANLATKPTPLKPIIPKTVVEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVVGC 985
Query: 309 KWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWP 368
KW++ +K +G +++YKARLVAKG+ Q G D+ ETFSPV+K TVR +L +A+S+GW
Sbjct: 986 KWVFTLKYLSNGVLDRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSVLHIAVSKGWS 1045
Query: 369 LQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDKT-LVCKLHKALYGLKQAPRAWFHRLKEV 427
++QIDVNNAFL G L +EVY+TQPPGF KD VC+L+KALYGLKQAPRAW+ L+
Sbjct: 1046 IRQIDVNNAFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLYKALYGLKQAPRAWYQELRSY 1105
Query: 428 LLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQL 487
LL GF S D SLFT +Y+L+YVDD+++TG +I + A L + F+LK L
Sbjct: 1106 LLTQGFVNSVADTSLFTLRHERTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDL 1165
Query: 488 GQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSL 547
G++ YFLG++ T + G L L Q +Y+ DLL K NM A P+ TPM KL+ G L
Sbjct: 1166 GEMSYFLGIEATRTSKG-LHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGKPL 1224
Query: 548 HDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGV 607
P+EYR+V+G+LQY TRP+I+YAVN++ Q++ P + HW+A KRILRYL GT +HG+
Sbjct: 1225 DKPSEYRAVLGSLQYLLFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGI 1284
Query: 608 LLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEA 667
++ PL L A+ DADW D D+ ST+ ++LG N ISW++KKQ VARSSTEA
Sbjct: 1285 FIR---ADTPLTLHAYSDADWAGDIDNYNSTNAYILYLGSNPISWSSKKQKGVARSSTEA 1341
Query: 668 EYRSLANTTAELLWVESLLTELKIAFTVPTVL-CDNMSTVLLTHNPILHTRTKHMEMDLF 726
EYR++AN T+E+ WV SLLTEL I + P V+ CDN+ L+ NP+ +R KH+ +D
Sbjct: 1342 EYRAVANATSEIRWVCSLLTELGITLSSPPVVYCDNVGATYLSANPVFDSRMKHIALDFH 1401
Query: 727 FVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
FVRE VQA +L V HV ++ Q AD TK L F + K+ V
Sbjct: 1402 FVRESVQAGALRVTHVSTKDQLADALTKPLPRQPFTTLISKIGV 1445
>gb|AAC02669.1| polyprotein [Arabidopsis thaliana]
Length = 1451
Score = 658 bits (1697), Expect = 0.0
Identities = 376/824 (45%), Positives = 496/824 (59%), Gaps = 62/824 (7%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRHIVETGL LLS ASMP +W +AF TA YLINRM T L SPY KL+G P++
Sbjct: 628 ERKHRHIVETGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLGNESPYVKLFGQPPNY 687
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQS-GRIYVSKDVLFH 121
L+IFG CFP+LRPY ++KL S CV LGYS S Y CLD++ GR+Y S+ V F
Sbjct: 688 LKLRIFGCLCFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFA 747
Query: 122 EHRFPYTTLFPS--EPFSPPTSSAEY--------FPLSTVPIISRSMPQPSPAPISTE-L 170
E FP++T PS P PP S PL+T P S S P +P +E L
Sbjct: 748 ESSFPFSTTSPSVTPPSDPPLSQDTRPVSVPLLARPLTTAPPSSPSCSAPHRSPSQSENL 807
Query: 171 ANPGPLSPQSEASDLQSQPSPI--------------PTGSGLASTSQP-----------A 205
+ P PL P S SP PTGS + QP +
Sbjct: 808 SPPAPLQPSLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLSPQPQRPQPQSPQSTS 867
Query: 206 EHASS---ESAHQEMATSSGVHAASSASTVAVPVNAHP------MQTRSKSGIIKPRLNP 256
H+SS S + + + S +S+ + + P N +P M+TRSK+ I+KP NP
Sbjct: 868 PHSSSPQPNSPNPQHSPRSLTPTLTSSPSPSPPPNPNPPPIQHTMRTRSKNNIVKP--NP 925
Query: 257 TLLLTHMEPT--------TVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGC 308
+PT TV +A+ D W QAM +E NA NGT+ LVP N+ VGC
Sbjct: 926 KFANLATKPTPLKPIIPKTVVEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVVGC 985
Query: 309 KWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWP 368
KW++ +K +G +++YKARLVAKG+ Q G D+ ETFSPV+K TVR +L +A+S+GW
Sbjct: 986 KWVFTLKYLSNGVLDRYKARLVAKGFHQQYGHDFKETFSPVIKLTTVRSVLHIAVSKGWS 1045
Query: 369 LQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDKT-LVCKLHKALYGLKQAPRAWFHRLKEV 427
++QIDVNNAFL G L +EVY+TQPPGF KD VC+L+KALYGLKQAPRAW+ L+
Sbjct: 1046 IRQIDVNNAFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLYKALYGLKQAPRAWYQELRSY 1105
Query: 428 LLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQL 487
LL GF S D SLFT +Y+L+YVDD+++TG +I + A L + F+LK L
Sbjct: 1106 LLTQGFVNSVADTSLFTLRHERTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDL 1165
Query: 488 GQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSL 547
G++ YFLG++ T + G L L Q +Y+ DLL K NM A P+ TPM KL+ G L
Sbjct: 1166 GEMSYFLGIEATRTSKG-LHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGKPL 1224
Query: 548 HDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGV 607
P+EYR+V+G+LQY TRP+I+YAVN++ Q++ P + HW+A KRILRYL GT +HG+
Sbjct: 1225 DKPSEYRAVLGSLQYLLFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGI 1284
Query: 608 LLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEA 667
++ PL L A+ DADW D D+ ST+ ++LG N ISW++KKQ VARSSTEA
Sbjct: 1285 FIR---ADTPLTLHAYSDADWAGDIDNYNSTNAYILYLGSNPISWSSKKQKGVARSSTEA 1341
Query: 668 EYRSLANTTAELLWVESLLTELKIAFTVPTVL-CDNMSTVLLTHNPILHTRTKHMEMDLF 726
EYR++AN T+E+ WV SLLTEL I + P V+ CDN+ L+ NP+ +R KH+ +D
Sbjct: 1342 EYRAVANATSEIRWVCSLLTELGITLSSPPVVYCDNVGATYLSANPVFDSRMKHIALDFH 1401
Query: 727 FVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
FVRE VQA +L V HV ++ Q AD TK L F + K+ V
Sbjct: 1402 FVRESVQAGALRVTHVSTKDQLADALTKPLPRQPFTTLISKIGV 1445
>emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|3269282|emb|CAA19715.1|
putative protein [Arabidopsis thaliana]
gi|7444417|pir||T05745 hypothetical protein M4I22.20 -
Arabidopsis thaliana
Length = 1318
Score = 634 bits (1635), Expect = e-180
Identities = 350/796 (43%), Positives = 473/796 (58%), Gaps = 38/796 (4%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTL-QGASPYFKLYGNHPD 61
ERKHRH+VE GL++L + +P FW AF TA +LIN + T+ L + SPY KLY PD
Sbjct: 432 ERKHRHLVELGLSMLFQSHVPHKFWVEAFFTANFLINLLPTSALKESISPYEKLYDKKPD 491
Query: 62 FKSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCL-DQSGRIYVSKDVLF 120
+ SL+ FGSACFP LR Y NK + S +CVFLGY+ +KGY+CL +GR+Y+S+ V+F
Sbjct: 492 YTSLRSFGSACFPTLRDYAENKFNPCSLKCVFLGYNEKYKGYRCLYPPTGRLYISRHVIF 551
Query: 121 HEHRFPYTTLFPS---EPFSP-----------PTSSAEYFPLSTVPIISRSMPQPSPAPI 166
E +P++ + +P +P P S P S P+ + + P P
Sbjct: 552 DESVYPFSHTYKHLHPQPRTPLLAAWLRSSDSPAPSTSTSPSSRSPLFTSADFPPLPQRK 611
Query: 167 STELANPGPLSPQSEASDLQSQPSP-------IPTGSGLASTSQPAEHASSESAHQEMAT 219
+ L P+S S AS++ +Q SP S S + A S+S
Sbjct: 612 TPLLPTLVPISSVSHASNITTQQSPDFDSERTTDFDSASIGDSSHSSQAGSDSEETIQQA 671
Query: 220 SSGVHAASSASTVAVPVNAHPMQTRSKSGIIKPRLNPTLL---LTHMEPTTVKQAMRDDK 276
S VH +++ N HPM TR+K GI KP L +++ EP TV A++
Sbjct: 672 SVNVHQTHAST------NVHPMVTRAKVGISKPNPRYVFLSHKVSYPEPKTVTAALKHPG 725
Query: 277 WLQAMKEEYNALMSNGTWSLVPLPSNRKAVGCKWIYRVKENPDGSINKYKARLVAKGYSQ 336
W AM EE TWSLVP S+ +G KW++R K + DG++NK KAR+VAKG+ Q
Sbjct: 726 WTGAMTEEIGNCSETQTWSLVPYKSDMHVLGSKWVFRTKLHADGTLNKLKARIVAKGFLQ 785
Query: 337 VQGFDYSETFSPVVKPITVRLILSLAISRGWPLQQIDVNNAFLNGVLEEEVYMTQPPGFE 396
+G DY ET+SPVV+ TVRL+L LA + W ++Q+DV NAFL+G L+E VYMTQP GF
Sbjct: 786 EEGIDYLETYSPVVRTPTVRLVLHLATALNWDIKQMDVKNAFLHGDLKETVYMTQPAGFV 845
Query: 397 HKDK-TLVCKLHKALYGLKQAPRAWFHRLKEVLLQFGFKASKCDPSLFTYNSPLGCIYML 455
K VC LHK++YGLKQ+PRAWF + LL+FGF SK DPSLF Y I +L
Sbjct: 846 DPSKPDHVCLLHKSIYGLKQSPRAWFDKFSTFLLEFGFFCSKSDPSLFIYAHNNNLILLL 905
Query: 456 IYVDDIILTGDSMVLIQQLTAKLHSIFALKQLGQLDYFLGVQVTHLANGSLLLNQTKYIN 515
+YVDD+++TG+S + L A L+ F + +GQL YFLG+QV NG L ++Q KY
Sbjct: 906 LYVDDMVITGNSSQTLTSLLAALNKEFRMTDMGQLHYFLGIQVQRQQNG-LFMSQQKYAE 964
Query: 516 DLLTKVNMADAAPISTPMQFGAKLTKHGGTSLHDPTEYRSVVGALQYATITRPEISYAVN 575
DLL +M P+ TP+ H DPT +RS+ G LQY T+TRP+I +AVN
Sbjct: 965 DLLIAASMEHCTPLPTPLPVQLDRVPHQEELFSDPTYFRSIAGKLQYLTLTRPDIQFAVN 1024
Query: 576 KVCQFLSDPHEEHWKAVKRILRYLKGTITHGVLLQPCSMTQPLPLLAFCDADWGSDPDDR 635
VCQ + P + +KRILRY+KGTIT G+ S P L A+ D+DWG+ R
Sbjct: 1025 FVCQKMHQPTISDFHLLKRILRYIKGTITMGI---SYSRDSPTLLQAYSDSDWGNCKQTR 1081
Query: 636 RSTSGSCVFLGPNLISWTAKKQTLVARSSTEAEYRSLANTTAELLWVESLLTELKIAF-T 694
RS G C F+G NL+SW++KK V+RSSTEAEY+SL++ +E+LW+ +LL EL+I
Sbjct: 1082 RSVGGLCTFMGTNLVSWSSKKHPTVSRSSTEAEYKSLSDAASEILWLSTLLRELRIPLPD 1141
Query: 695 VPTVLCDNMSTVLLTHNPILHTRTKHMEMDLFFVREKVQAKSLVVQHVPSEHQRADIFTK 754
P + CDN+S V LT NP H RTKH ++D FVRE+V K+LVV+H+P Q ADIFTK
Sbjct: 1142 TPELFCDNLSAVYLTANPAFHARTKHFDIDFHFVRERVALKALVVKHIPGSEQIADIFTK 1201
Query: 755 ALSPTRFLLMRDKLNV 770
+L F+ +R KL V
Sbjct: 1202 SLPYEAFIHLRGKLGV 1217
>gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir||T31353 polyprotein
- Arabidopsis arenosa Evelknievel retrotransposon
(fragment)
Length = 1390
Score = 624 bits (1609), Expect = e-177
Identities = 352/770 (45%), Positives = 470/770 (60%), Gaps = 60/770 (7%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRHIVETGL LLS ASM +W +AF TA YLINRM T L SPY KL+G P++
Sbjct: 628 ERKHRHIVETGLTLLSTASMSKEYWSYAFTTAVYLINRMLTPVLGNESPYMKLFGQPPNY 687
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQS-GRIYVSKDVLFH 121
L++FG CFP+LRPY ++KL S CV LGYS S Y CLD++ GR+Y S+ V F
Sbjct: 688 LKLRVFGCLCFPWLRPYTAHKLDNRSMPCVLLGYSLSQSAYLCLDRATGRVYTSRHVQFA 747
Query: 122 EHRFPYTTLFPS-EPFSPPTSSAEYFPLSTVPIISRSMPQPSPAPISTEL-----ANPGP 175
E FP++T PS P S P S + P+S VPI++R + P+ S + PG
Sbjct: 748 ESIFPFSTTSPSVTPPSDPPLSQDTRPIS-VPILARPLTTAPPSSPSCSAPHRSPSQPGI 806
Query: 176 LSPQS--EASDLQSQPSPI------------------PTGSGLASTSQPAEHASSE---- 211
LSP + + S S SPI PTGS + QP S+
Sbjct: 807 LSPSAPFQPSPPSSPTSPITSPSLSEESHVGHNQETGPTGSSPPVSPQPQSEQSTSPRST 866
Query: 212 -----SAHQEMATSSGVHAASSASTVAVPVNAHP-------MQTRSKSGIIKPRLNPTLL 259
S H + + S A + + + + P N +P M+TRSK+ I+KP NP
Sbjct: 867 SPQPNSPHTQHSPRSITPALTPSPSPSPPPNPNPPPPIQHTMRTRSKNNIVKP--NPKFA 924
Query: 260 LTHMEPT--------TVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGCKWI 311
+PT TV +A+ D W QAM +E NA NGT+ LVP N+ +GCKW+
Sbjct: 925 NLATKPTPLKPIIPKTVAEALLDPNWRQAMCDEINAQTRNGTFDLVPPAPNQNVIGCKWV 984
Query: 312 YRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWPLQQ 371
+ +K P+G +++YKARLVAKG+ Q G D+ ETFSPV+K TVR +L +A+S+GW ++Q
Sbjct: 985 FTLKYLPNGVLDRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSVLHVAVSKGWSIRQ 1044
Query: 372 IDVNNAFLNGVLEEEVYMTQPPGFEHKDKT-LVCKLHKALYGLKQAPRAWFHRLKEVLLQ 430
IDVNNAFL G L +EVY+ QPPGF KD VC+LHKALYGLKQAPRAW+ L+ LL
Sbjct: 1045 IDVNNAFLQGTLSDEVYVMQPPGFVDKDNPHHVCRLHKALYGLKQAPRAWYQELRSYLLT 1104
Query: 431 FGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQLGQL 490
GF S D SLFT +Y+L+YVDD+++TG +I + A L + F+LK LG++
Sbjct: 1105 QGFVNSIADTSLFTLRHKRTILYVLVYVDDMLITGSDTNIITRFIANLAARFSLKDLGEM 1164
Query: 491 DYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSLHDP 550
YFLG++ T + G L L Q +Y+ DLL K NM A P+ TPM KL+ GT L P
Sbjct: 1165 SYFLGIEATRTSKG-LHLMQKRYVLDLLEKTNMLAAHPVLTPMSPTPKLSLTSGTPLDKP 1223
Query: 551 TEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGVLLQ 610
+EYR+V+G+LQY + TRP+I+YAVN++ Q++ P + HW+A KRILRYL GT +HG+ ++
Sbjct: 1224 SEYRAVLGSLQYLSFTRPDIAYAVNRLSQYMHCPTDLHWQAAKRILRYLAGTPSHGIFIR 1283
Query: 611 PCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEAEYR 670
PL L A+ DADW D D+ ST+ ++LG ISW++KKQ VARSSTEAEYR
Sbjct: 1284 ---ADTPLKLHAYSDADWAGDTDNYNSTNAYILYLGSTPISWSSKKQNGVARSSTEAEYR 1340
Query: 671 SLANTTAELLWVESLLTELKIAFTVPTVL-CDNMSTVLLTHNPILHTRTK 719
++AN T+E+ WV SLLTEL I + P V+ CDN+ L+ NP+ +R K
Sbjct: 1341 AVANATSEIRWVCSLLTELGITLSSPPVVYCDNVGATYLSANPVFDSRMK 1390
>gb|AAF02855.1| Similar to retrotransposon proteins [Arabidopsis thaliana]
gi|25301689|pir||C96578 hypothetical protein T18A20.5
[imported] - Arabidopsis thaliana
Length = 1522
Score = 617 bits (1591), Expect = e-175
Identities = 343/810 (42%), Positives = 479/810 (58%), Gaps = 44/810 (5%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQG-ASPYFKLYGNHPD 61
ERKHRHIVE GL+++ + +PL +W +F TA ++IN + T++L SPY KLYG P+
Sbjct: 621 ERKHRHIVELGLSMIFQSKLPLKYWLESFFTANFVINLLPTSSLDNNESPYQKLYGKAPE 680
Query: 62 FKSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCL-DQSGRIYVSKDVLF 120
+ +L++FG AC+P LR Y S K S +CVFLGY+ +KGY+CL +GRIY+S+ V+F
Sbjct: 681 YSALRVFGCACYPTLRDYASTKFDPRSLKCVFLGYNEKYKGYRCLYPPTGRIYISRHVVF 740
Query: 121 HEHRFP----YTTLFPSEP-------------FSPPTSSAEYFPLSTVPIISRSMPQPSP 163
E+ P Y+ L P + +P +P+S++P + +P
Sbjct: 741 DENTHPFESIYSHLHPQDKTPLLEAWFKSFHHVTPTQPDQSRYPVSSIPQPETTDLSAAP 800
Query: 164 APISTELANPGPLSPQSEASDLQSQPSPIP---TGSGLASTSQPAEHASSESAHQEMATS 220
A ++ E A P S+ ++ S S P TG AS +++S+H A S
Sbjct: 801 ASVAAETAGPNASDDTSQDNETISVVSGSPERTTGLDSASIGDSYHSPTADSSHPSPARS 860
Query: 221 SGVHAASS-------ASTVAVPV-NAHPMQTRSKSGIIKPRLNPTLLLTHM----EPTTV 268
S + A V PV N H M TR K GI KP +LLTH EP TV
Sbjct: 861 SPASSPQGSPIQMAPAQQVQAPVTNEHAMVTRGKEGISKPNKR-YVLLTHKVSIPEPKTV 919
Query: 269 KQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGCKWIYRVKENPDGSINKYKAR 328
+A++ W AM+EE TW+LVP N +G W++R K + DGS++K KAR
Sbjct: 920 TEALKHPGWNNAMQEEMGNCKETETWTLVPYSPNMNVLGSMWVFRTKLHADGSLDKLKAR 979
Query: 329 LVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWPLQQIDVNNAFLNGVLEEEVY 388
LVAKG+ Q +G DY ET+SPVV+ TVRLIL +A W L+Q+DV NAFL+G L E VY
Sbjct: 980 LVAKGFKQEEGIDYLETYSPVVRTPTVRLILHVATVLKWELKQMDVKNAFLHGDLTETVY 1039
Query: 389 MTQPPGFEHKDK-TLVCKLHKALYGLKQAPRAWFHRLKEVLLQFGFKASKCDPSLFTYNS 447
M QP GF K K VC LHK+LYGLKQ+PRAWF R LL+FGF S DPSLF Y+S
Sbjct: 1040 MRQPAGFVDKSKPDHVCLLHKSLYGLKQSPRAWFDRFSNFLLEFGFICSLFDPSLFVYSS 1099
Query: 448 PLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQLGQLDYFLGVQVTHLANGSLL 507
I +L+YVDD+++TG++ + L A L+ F +K +GQ+ YFLG+Q+ +G L
Sbjct: 1100 NNDVILLLLYVDDMVITGNNSQSLTHLLAALNKEFRMKDMGQVHYFLGIQI-QTYDGGLF 1158
Query: 508 LNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSLHDPTEYRSVVGALQYATITR 567
++Q KY DLL +MA+ +P+ TP+ + DPT +RS+ G LQY T+TR
Sbjct: 1159 MSQQKYAEDLLITASMANCSPMPTPLPLQLDRVSNQDEVFSDPTYFRSLAGKLQYLTLTR 1218
Query: 568 PEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGVLLQPCSMT------QPLPLL 621
P+I +AVN VCQ + P + +KRILRY+KGT++ G+ S + L
Sbjct: 1219 PDIQFAVNFVCQKMHQPSVSDFNLLKRILRYIKGTVSMGIQYNSNSSSVVSAYESDYDLS 1278
Query: 622 AFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEAEYRSLANTTAELLW 681
A+ D+D+ + + RRS G C F+G N+ISW++KKQ V+RSSTEAEYRSL+ T +E+ W
Sbjct: 1279 AYSDSDYANCKETRRSVGGYCTFMGQNIISWSSKKQPTVSRSSTEAEYRSLSETASEIKW 1338
Query: 682 VESLLTELKIAF-TVPTVLCDNMSTVLLTHNPILHTRTKHMEMDLFFVREKVQAKSLVVQ 740
+ S+L E+ ++ P + CDN+S V LT NP H RTKH ++D ++RE+V K+LVV+
Sbjct: 1339 MSSILREIGVSLPDTPELFCDNLSAVYLTANPAFHARTKHFDVDHHYIRERVALKTLVVK 1398
Query: 741 HVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
H+P Q ADIFTK+L F +R KL V
Sbjct: 1399 HIPGHLQLADIFTKSLPFEAFTRLRFKLGV 1428
>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078
gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
Arabidopsis thaliana
Length = 1415
Score = 608 bits (1569), Expect = e-172
Identities = 330/777 (42%), Positives = 466/777 (59%), Gaps = 41/777 (5%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRH+VE GL++L H+ P FW +F TA Y+INR+ ++ L+ SPY L+G PD+
Sbjct: 617 ERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYIINRLPSSVLKNLSPYEALFGEKPDY 676
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCL-DQSGRIYVSKDVLFH 121
SL++FGSAC+P LRP NK S +CVFLGY+S +KGY+C +G++Y+S++V+F+
Sbjct: 677 SSLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQYKGYRCFYPPTGKVYISRNVIFN 736
Query: 122 EHRFPYTTLFPS--EPFSPPTSSA-EYFPLSTVPIISRSMPQPSPAPISTELANPGPLSP 178
E P+ + S +S P A ++ +S + + PA + P L+
Sbjct: 737 ESELPFKEKYQSLVPQYSTPLLQAWQHNKISEISV---------PAAPVQLFSKPIDLNT 787
Query: 179 QSEASDLQSQPSPIPTGSGLASTSQPAEHASSESAHQEMATSSGVHAASSASTVAVPVNA 238
+ + + P PT + S + A +A+QE +N+
Sbjct: 788 YAGSQVTEQLTDPEPTSNNEGSDEEVNPVAEEIAANQEQV-----------------INS 830
Query: 239 HPMQTRSKSGIIKPRLNPTLLLTHM---EPTTVKQAMRDDKWLQAMKEEYNALMSNGTWS 295
H M TRSK+GI KP L+ + M EP T+ AM+ W +A+ EE N + TWS
Sbjct: 831 HAMTTRSKAGIQKPNTRYALITSRMNTAEPKTLASAMKHPGWNEAVHEEINRVHMLHTWS 890
Query: 296 LVPLPSNRKAVGCKWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITV 355
LVP + + KW+++ K +PDGSI+K KARLVAKG+ Q +G DY ETFSPVV+ T+
Sbjct: 891 LVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATI 950
Query: 356 RLILSLAISRGWPLQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDK-TLVCKLHKALYGLK 414
RL+L ++ S+GWP++Q+DV+NAFL+G L+E V+M QP GF K T VC+L KA+YGLK
Sbjct: 951 RLVLDVSTSKGWPIKQLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTHVCRLTKAIYGLK 1010
Query: 415 QAPRAWFHRLKEVLLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQL 474
QAPRAWF LL +GF SK DPSLF + +Y+L+YVDDI+LTG L++ L
Sbjct: 1011 QAPRAWFDTFSNFLLDYGFVCSKSDPSLFVCHQDGKILYLLLYVDDILLTGSDQSLLEDL 1070
Query: 475 TAKLHSIFALKQLGQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQ 534
L + F++K LG YFLG+Q+ ANG L L+QT Y D+L + M+D P+ TP+
Sbjct: 1071 LQALKNRFSMKDLGPPRYFLGIQIEDYANG-LFLHQTAYATDILQQAGMSDCNPMPTPLP 1129
Query: 535 FGAKLTKHGGTSLHDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKR 594
+L +PT +RS+ G LQY TITRP+I +AVN +CQ + P + +KR
Sbjct: 1130 --QQLDNLNSELFAEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTTSDFGLLKR 1187
Query: 595 ILRYLKGTITHGVLLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTA 654
ILRY+KGTI G+ P L L A+ D+D + RRST+G C+ LG NLISW+A
Sbjct: 1188 ILRYIKGTIGMGL---PIKRNSTLTLSAYSDSDHAGCKNTRRSTTGFCILLGSNLISWSA 1244
Query: 655 KKQTLVARSSTEAEYRSLANTTAELLWVESLLTELKIAFTVPT-VLCDNMSTVLLTHNPI 713
K+Q V+ SSTEAEYR+L E+ W+ LL +L I +PT V CDN+S V L+ NP
Sbjct: 1245 KRQPTVSNSSTEAEYRALTYAAREITWISFLLRDLGIPQYLPTQVYCDNLSAVYLSANPA 1304
Query: 714 LHTRTKHMEMDLFFVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
LH R+KH + D ++RE+V + QH+ + Q AD+FTK+L F+ +R KL V
Sbjct: 1305 LHNRSKHFDTDYHYIREQVALGLIETQHISATFQLADVFTKSLPRRAFVDLRSKLGV 1361
>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
gi|4539447|emb|CAB40035.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444419|pir||T04204
hypothetical protein T4F9.150 - Arabidopsis thaliana
Length = 1515
Score = 607 bits (1564), Expect = e-172
Identities = 350/821 (42%), Positives = 483/821 (58%), Gaps = 68/821 (8%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQ-GASPYFKLYGNHPD 61
ER+HR++ E GL+L+ H+ +P W AF T+ +L N + ++TL SPY L+G P
Sbjct: 618 ERRHRYLTELGLSLMFHSKVPHKLWVEAFFTSNFLSNLLPSSTLSDNKSPYEMLHGTPPV 677
Query: 62 FKSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQ-SGRIYVSKDVLF 120
+ +L++FGSAC+P+LRPY NK S CVFLGY++ +KGY+CL +G++Y+ + VLF
Sbjct: 678 YTALRVFGSACYPYLRPYAKNKFDPKSLLCVFLGYNNKYKGYRCLHPPTGKVYICRHVLF 737
Query: 121 HEHRFPYTTL-------------------FPSEPFSPPTSSAEY----FPLSTVPIISRS 157
E +FPY+ + F S S T S FP +TV S S
Sbjct: 738 DERKFPYSDIYSQFQTISGSPLFTAWQKGFSSTALSRETPSTNVEDIIFPSATV---SSS 794
Query: 158 MPQPSPAPISTELANPGPLSPQSEASDLQSQPSPIPTGSGLASTSQPAEHASSE---SAH 214
+P AP E A P + A D+ PSPI + S +QP E S + S
Sbjct: 795 VPTGC-APNIAETAT-APDVDVAAAHDMVVPPSPITSTS---LPTQPEESTSDQNHYSTD 849
Query: 215 QEMATSSGVHAASS-----------------ASTVAVPVNAHPMQTRSKSGIIKPRLNPT 257
E A SS + S +ST A P +HPM TR+KSGI KP NP
Sbjct: 850 SETAISSAMTPQSINVSLFEDSDFPPLQSVISSTTAAPETSHPMITRAKSGITKP--NPK 907
Query: 258 LLL-----THMEPTTVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGCKWIY 312
L + EP +VK+A++D+ W AM EE + TW LVP + +GCKW++
Sbjct: 908 YALFSVKSNYPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWDLVPPEMVDRLLGCKWVF 967
Query: 313 RVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWPLQQI 372
+ K N DGS+++ KARLVA+GY Q +G DY ET+SPVV+ TVR IL +A W L+Q+
Sbjct: 968 KTKLNSDGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATVRSILHVATINKWSLKQL 1027
Query: 373 DVNNAFLNGVLEEEVYMTQPPGFEHKDK-TLVCKLHKALYGLKQAPRAWFHRLKEVLLQF 431
DV NAFL+ L+E V+MTQPPGFE + VCKL KA+Y LKQAPRAWF + LL++
Sbjct: 1028 DVKNAFLHDELKETVFMTQPPGFEDPSRPDYVCKLKKAIYDLKQAPRAWFDKFSSYLLKY 1087
Query: 432 GFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQLGQLD 491
GF S DPSLF Y +++L+YVDD+ILTG++ VL+QQL L + F +K +G L
Sbjct: 1088 GFICSFSDPSLFVYLKGRDVMFLLLYVDDMILTGNNDVLLQQLLNILSTEFRMKDMGALH 1147
Query: 492 YFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSLHDPT 551
YFLG+Q H N L L+Q KY +DLL M+D + + TP+Q L + +PT
Sbjct: 1148 YFLGIQ-AHYHNDGLFLSQEKYTSDLLVNAGMSDCSSMPTPLQL--DLLQGNNKPFPEPT 1204
Query: 552 EYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGVLLQP 611
+R + G LQY T+TRP+I +AVN VCQ + P + +KRIL YLKGT+T G+ L
Sbjct: 1205 YFRRLAGKLQYLTLTRPDIQFAVNFVCQKMHAPTMSDFHLLKRILHYLKGTMTMGINL-- 1262
Query: 612 CSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEAEYRS 671
S L + D+DW D RRST G C FLG N+ISW+AK+ V++SSTEAEYR+
Sbjct: 1263 -SSNTDSVLRCYSDSDWAGCKDTRRSTGGFCTFLGYNIISWSAKRHPTVSKSSTEAEYRT 1321
Query: 672 LANTTAELLWVESLLTELKI-AFTVPTVLCDNMSTVLLTHNPILHTRTKHMEMDLFFVRE 730
L+ +E+ W+ LL E+ + +P + CDN+S V L+ NP LH+R+KH ++D ++VRE
Sbjct: 1322 LSFAASEVSWIGFLLQEIGLPQQQIPEMYCDNLSAVYLSANPALHSRSKHFQVDYYYVRE 1381
Query: 731 KVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNVV 771
+V +L V+H+P+ Q ADIFTK+L F +R KL VV
Sbjct: 1382 RVALGALTVKHIPASQQLADIFTKSLPQAPFCDLRFKLGVV 1422
>ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1373
Score = 604 bits (1557), Expect = e-171
Identities = 342/826 (41%), Positives = 474/826 (56%), Gaps = 63/826 (7%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ER R + +LL A +P +W A TAT L+NR+ T TL ++PYF LY P +
Sbjct: 552 ERSLRTLNNILRSLLFQACLPPVYWVEALHTATLLVNRIPTKTLSSSTPYFHLYSTQPTY 611
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLD-QSGRIYVSKDVLFH 121
L++FG AC+P + +KL+ S CVFLGYSS HKGY+CL+ S RI S+ V+F
Sbjct: 612 DHLRVFGCACYPNMSSTAPHKLAPRSSLCVFLGYSSEHKGYRCLELGSNRIITSRHVVFD 671
Query: 122 EHRFPYTTLFPS--------------------------------------EPFSPPTSSA 143
E FP+ + S EP +PP + +
Sbjct: 672 ESFFPFADMSTSPMASSALDIFLDDNELTAQPPRAKFVHAGTSSAARGAVEPSTPPPAPS 731
Query: 144 EYFPLSTVPIISRSMPQPSPAPISTELANPGPLSPQSEASDLQS-------QPSPIPTGS 196
P S + +P + PG +SP A+ + +P S
Sbjct: 732 SIGPRSPATLAGPEAGPHGGSPAGAATSQPGAISPARTAAPSAATSTTRAVTSAPRAATS 791
Query: 197 GLASTSQPAEHASSESAHQEMATSSGVHAASSASTVAVPV----NAHPMQTRSKSGIIKP 252
G + P ++ E+A SS + +T V + NAH M+TR K+G+ +P
Sbjct: 792 GTTPSLSPLAGTAAPPPRAEVAASSTAATGRTLATRPVSIAPVDNAHSMRTRGKAGMAQP 851
Query: 253 --RLNPTLLLTHMEPTTVKQAMRDDKWLQAMKEEYNALMSNGTWSLVPLPSNRKAVGCKW 310
RLN P +V++A+ D W AM+ E++AL++N TWSLVP P V KW
Sbjct: 852 VDRLNLHAAPLSPVPRSVREALSDPNWRAAMQAEFDALLANDTWSLVPRPRGVNLVTGKW 911
Query: 311 IYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPITVRLILSLAISRGWPLQ 370
I+R K + DGS+++YKAR V +G++Q G DY ETFSPVVKP TVR++LSLA+S+ WP+
Sbjct: 912 IFRHKLHSDGSLDRYKARWVLRGFTQRPGVDYDETFSPVVKPATVRVVLSLALSQDWPIH 971
Query: 371 QIDVNNAFLNGVLEEEVYMTQPPGF---EHKDKTLVCKLHKALYGLKQAPRAWFHRLKEV 427
Q+DV NAFL+G L E VY QP GF H D LVC+L+K+LYGLKQAPRAW HR
Sbjct: 972 QLDVKNAFLHGTLSETVYCIQPTGFADPSHAD--LVCRLNKSLYGLKQAPRAWHHRFASH 1029
Query: 428 LLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQQLTAKLHSIFALKQL 487
L+ GF ++ D SLF + + +L+YVDDI+LT S L+QQ+ A L FA+ +
Sbjct: 1030 LISLGFIEAQSDSSLFIHRRGNDTVLLLLYVDDIVLTASSASLLQQVIAALQREFAMTDM 1089
Query: 488 GQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTPMQFGAKLTKHGGTSL 547
G L +FLG+ VT A+G L L+Q +Y D+L + M + P STP+ +KL+ G +
Sbjct: 1090 GPLHHFLGITVTRFASG-LFLSQRQYSQDILERAGMGECKPCSTPVDVHSKLSA-DGPPV 1147
Query: 548 HDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAVKRILRYLKGTITHGV 607
D T+YRS+ GALQY T TRP+I++AV +VC ++ DP E H A+KRILRY++GT++ G+
Sbjct: 1148 ADSTQYRSLAGALQYLTFTRPDIAFAVQQVCLYMHDPREPHLAALKRILRYIQGTLSLGL 1207
Query: 608 LLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISWTAKKQTLVARSSTEA 667
++ + P L+ + DADW PD RRSTSG VFLG NL+SW++K+Q V+RSS EA
Sbjct: 1208 TMR---RSPPTDLVVYTDADWAGCPDTRRSTSGYAVFLGDNLVSWSSKRQHTVSRSSAEA 1264
Query: 668 EYRSLANTTAELLWVESLLTEL-KIAFTVPTVLCDNMSTVLLTHNPILHTRTKHMEMDLF 726
EYR++AN AE W+ LL EL + T V CDN+S + L+ NP+ H RTKH+E+DL
Sbjct: 1265 EYRAVANGVAEATWLRQLLMELHRPPRTATVVYCDNVSAMYLSSNPVQHQRTKHVEIDLH 1324
Query: 727 FVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNVVD 772
FVREKV + V HVP+ Q AD+FTK L + F R L + D
Sbjct: 1325 FVREKVALGHVRVLHVPTTSQYADVFTKGLPTSLFQEFRTSLTISD 1370
>ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1090
Score = 598 bits (1541), Expect = e-169
Identities = 349/796 (43%), Positives = 464/796 (57%), Gaps = 40/796 (5%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ER R I + LL ASMP +W A TATYL+NR ++++ + P+ L+ PDF
Sbjct: 284 ERMLRTINNSIRTLLIQASMPPSYWAEALATATYLLNRRPSSSIHQSLPFQLLHRTIPDF 343
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQSG-RIYVSKDVLFH 121
L++FG C+P L +KLS S CVFLGY +SHKGY+CLD S RI +S+ V+F
Sbjct: 344 SHLRVFGCLCYPNLSATTPHKLSPRSTACVFLGYPTSHKGYRCLDLSTHRIIISRHVVFD 403
Query: 122 EHRFPYTTLFPSEPFSPPTSSAEYFPL-----STVPIISRSMPQPSPAPISTELANPGPL 176
E +FP+ +PP +S+ F L + P + P+P STE+ P
Sbjct: 404 ESQFPFAA-------TPPAASSFDFLLQGLSPADAPSLEVEQPRPLTVAPSTEVEQPYLP 456
Query: 177 SPQSE--------ASDLQSQPSP-IPTGSGLASTSQPAEHASS-ESAHQEMATSSGVHAA 226
P AS+ S +P + T S A+ A AS+ S + + T V
Sbjct: 457 LPSRRLSAGTVTVASEAPSAGAPLVGTSSADATPPGSATRASTIVSPFRHVYTRRPVTTV 516
Query: 227 SSASTVAVPVNA------HPMQTRSKSGIIKPRLNPTLLLTHME----PTTVKQAMRDDK 276
+S+ AV NA H M TRS+SG ++P T T P A+ D
Sbjct: 517 PPSSSTAV-TNAVAAPQPHSMVTRSQSGSLRPVDRLTYTATQAAASPVPANYHSALADPN 575
Query: 277 WLQAMKEEYNALMSNGTWSLVPLPSNRKAVGCKWIYRVKENPDGSINKYKARLVAKGYSQ 336
W AM +EY L+ NGTW LV P KWI++ K + DGS+ +YKAR V +GYSQ
Sbjct: 576 WRAAMADEYKELVDNGTWRLVSRPPRANIATGKWIFKHKFHSDGSLARYKARWVVRGYSQ 635
Query: 337 VQGFDYSETFSPVVKPITVRLILSLAISRGWPLQQIDVNNAFLNGVLEEEVYMTQPPGF- 395
G DY ETFSPVVK T+R++LS+A SR WP+ Q+DV NAFL+G L+E VY QP GF
Sbjct: 636 QHGIDYDETFSPVVKLATIRVVLSIAASRAWPIHQLDVKNAFLHGHLKETVYCQQPSGFV 695
Query: 396 EHKDKTLVCKLHKALYGLKQAPRAWFHRLKEVLLQFGFKASKCDPSLFTYNSPLGCIYML 455
+ VC L K+LYGLKQAPRAW+ R + Q GF S D SLF Y Y+L
Sbjct: 696 DPTAPDAVCLLQKSLYGLKQAPRAWYQRFATYIRQMGFMPSASDTSLFVYKDGDRIAYLL 755
Query: 456 IYVDDIILTGDSMVLIQQLTAKLHSIFALKQLGQLDYFLGVQVTHLANGSLLLNQTKYIN 515
+YVDDIILT + L+QQLTA+LHS FA+ LG L +FLG+ V +G L L+Q +Y
Sbjct: 756 LYVDDIILTASTTTLLQQLTARLHSEFAMTDLGDLHFFLGISVKRSPDG-LFLSQRQYAV 814
Query: 516 DLLTKVNMADAAPISTPMQFGAKLTKHGGTSLHDPTEYRSVVGALQYATITRPEISYAVN 575
DLL + MA+ STP+ AKL+ G + DP+ YRS+ GALQY T+TRP+++YAV
Sbjct: 815 DLLQRAGMAECHSTSTPVDTHAKLSATDGLPVADPSAYRSIAGALQYLTLTRPDLAYAVQ 874
Query: 576 KVCQFLSDPHEEHWKAVKRILRYLKGTITHGVLLQPCSMTQPLPLLAFCDADWGSDPDDR 635
+VC F+ DP E H VKRILRY+KG+++ G+ + + L A+ DADW P+ R
Sbjct: 875 QVCLFMHDPREPHLALVKRILRYVKGSLSIGLHIGSGPIQS---LTAYSDADWAGCPNSR 931
Query: 636 RSTSGSCVFLGPNLISWTAKKQTLVARSSTEAEYRSLANTTAELLWVESLLTELKIAFTV 695
RSTSG CV+LG NL+SW++K+QT V+RSS EAEYR++A+ AE W+ LL EL +
Sbjct: 932 RSTSGYCVYLGDNLVSWSSKRQTTVSRSSAEAEYRAVAHAVAECCWLRQLLQELHVPIAS 991
Query: 696 PTVL-CDNMSTVLLTHNPILHTRTKHMEMDLFFVREKVQAKSLVVQHVPSEHQRADIFTK 754
T++ CDN+S V +T NP+ H RTKH+E+D+ FVREKV + V +VPS HQ ADI TK
Sbjct: 992 ATIVYCDNVSAVYMTANPVHHRRTKHIEIDIHFVREKVALGQVRVLYVPSSHQFADIMTK 1051
Query: 755 ALSPTRFLLMRDKLNV 770
L F R L V
Sbjct: 1052 GLPVQLFTDFRSSLCV 1067
>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
Length = 1453
Score = 591 bits (1523), Expect = e-167
Identities = 327/781 (41%), Positives = 469/781 (59%), Gaps = 22/781 (2%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRH VE GL+++ H+ PL FW AF TA++L N + + +L SP L P++
Sbjct: 620 ERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASFLSNMLPSPSLGNVSPLEALLKQKPNY 679
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCL-DQSGRIYVSKDVLFH 121
L++FG+AC+P LRP +K S +CVFLGY+S +KGY+CL +GR+Y+S+ V+F
Sbjct: 680 AMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYNSQYKGYRCLYPPTGRVYISRHVIFD 739
Query: 122 EHRFPYTTLFPSEPFSPPTSSAEYFPL--STVPIISRSM-PQPSPAPISTELANPGPLSP 178
E FP+ + F P + S++P +S+ PQ I + LA P P
Sbjct: 740 EETFPFKQKYQ---FLVPQYESSLLSAWQSSIPQADQSLIPQAEEGKIES-LAKP-PSIQ 794
Query: 179 QSEASDLQSQPSPIPTGSGLASTSQPA-EHASSESAHQEMATSSGVHAASSASTVAV-PV 236
++ D +QP+ + G + + E +ES ++E T + + V P
Sbjct: 795 KNTIQDTTTQPAILTEGVLNEEEEEDSFEETETESLNEETHTQNDEAEVTVEEEVQQEPE 854
Query: 237 NAHPMQTRSKSGIIKPRLNPTLLLTHM---EPTTVKQAMRDDKWLQAMKEEYNALMSNGT 293
N HPM TRSK+GI K LL + EP ++ +A+ W A+ +E + T
Sbjct: 855 NTHPMTTRSKAGIHKSNTRYALLTSKFSVEEPKSIDEALNHPGWNNAVNDEMRTIHMLHT 914
Query: 294 WSLVPLPSNRKAVGCKWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPI 353
WSLV + +GC+W+++ K PDGS++K KARLVAKG+ Q +G DY ETFSPVV+
Sbjct: 915 WSLVQPTEDMNILGCRWVFKTKLKPDGSVDKLKARLVAKGFHQEEGLDYLETFSPVVRTA 974
Query: 354 TVRLILSLAISRGWPLQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDK-TLVCKLHKALYG 412
T+RL+L +A ++GW ++Q+DV+NAFL+G L+E VYM QPPGF ++K + VC+L KALYG
Sbjct: 975 TIRLVLDVATAKGWNIKQLDVSNAFLHGELKEPVYMLQPPGFVDQEKPSYVCRLTKALYG 1034
Query: 413 LKQAPRAWFHRLKEVLLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQ 472
LKQAPRAWF + LL FGF SK DPSLFTY+ + +L+YVDDI+LTG L+Q
Sbjct: 1035 LKQAPRAWFDTISNYLLDFGFSCSKSDPSLFTYHKNGKTLVLLLYVDDILLTGSDHNLLQ 1094
Query: 473 QLTAKLHSIFALKQLGQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTP 532
+L L+ F++K LG YFLGV++ G L L+QT Y D+L + M++ + TP
Sbjct: 1095 ELLMSLNKRFSMKDLGAPSYFLGVEIESSPEG-LFLHQTAYAKDILHQAAMSNCNSMPTP 1153
Query: 533 MQFGAKLTKHGGTSLHDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAV 592
+ + +PT +RS+ G LQY TITRP+I +AVN +CQ + P + +
Sbjct: 1154 LP--QHIENLNSDLFPEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTTADFGLL 1211
Query: 593 KRILRYLKGTITHGVLLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISW 652
KRILRY+KGTI G+ ++ Q L L+A+ D+DW + RRST+G C LG NLISW
Sbjct: 1212 KRILRYVKGTIHLGLHIK---KNQNLSLVAYSDSDWAGCKETRRSTTGFCTLLGCNLISW 1268
Query: 653 TAKKQTLVARSSTEAEYRSLANTTAELLWVESLLTELKIAFTVPT-VLCDNMSTVLLTHN 711
+AK+Q V++SSTEAEYR+L EL W+ LL ++ + T PT V CDN+S V L+ N
Sbjct: 1269 SAKRQETVSKSSTEAEYRALTAVAQELTWLSFLLRDIGVTQTHPTLVKCDNLSAVYLSAN 1328
Query: 712 PILHTRTKHMEMDLFFVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNVV 771
P LH R+KH + D ++RE+V + +H+ + Q ADIFTK L F+ +R KL V
Sbjct: 1329 PALHNRSKHFDTDYHYIREQVALGLVETKHISATLQLADIFTKPLPRRAFIDLRIKLGVA 1388
Query: 772 D 772
+
Sbjct: 1389 E 1389
>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 585 bits (1508), Expect = e-165
Identities = 325/779 (41%), Positives = 458/779 (58%), Gaps = 29/779 (3%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRH+VE GL++L H+ PL FW AF TA YL N + ++ L+ SPY L+ D+
Sbjct: 619 ERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSSVLKEISPYETLFQQKVDY 678
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCL-DQSGRIYVSKDVLFH 121
L++FG+AC+P LRP NK S +CVFLGY + +KGY+CL +G++Y+S+ V+F
Sbjct: 679 TPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPPTGKVYISRHVIFD 738
Query: 122 EHRFPYTTLFPSEPFSPPTSSAEYFPLS--TVPIISRSMPQP---SPAPISTELANPGPL 176
E +FP+ + S T+ + + + T P + S QP P++T P
Sbjct: 739 EAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPSVPSSQLQPLARQMTPMATSENQPMMN 798
Query: 177 SPQSEASDLQSQPSPIPTGSGLASTSQPAEHASSESAHQEMATSSGVHAASSASTVAVPV 236
EA ++ + TS E S++ E+A +A
Sbjct: 799 YETEEAVNVNME------------TSSDEETESNDEFDHEVAPVLNDQNEDNALGQGSLE 846
Query: 237 NAHPMQTRSKSGIIKPRLNPTLLLTHM---EPTTVKQAMRDDKWLQAMKEEYNALMSNGT 293
N HPM TRSK GI KP L+++ EP T+ AM+ W A+ +E + + T
Sbjct: 847 NLHPMITRSKDGIQKPNPRYALIVSKSSFDEPKTITTAMKHPSWNAAVMDEIDRIHMLNT 906
Query: 294 WSLVPLPSNRKAVGCKWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPI 353
WSLVP + + KW+++ K PDG+I+K KARLVAKG+ Q +G DY ETFSPVV+
Sbjct: 907 WSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTA 966
Query: 354 TVRLILSLAISRGWPLQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDK-TLVCKLHKALYG 412
T+RL+L A + WPL+Q+DV+NAFL+G L+E V+M QP GF +K VC+L KALYG
Sbjct: 967 TIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRLTKALYG 1026
Query: 413 LKQAPRAWFHRLKEVLLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQ 472
LKQAPRAWF LL FGF+ S DPSLF + + +L+YVDDI+LTG +L+
Sbjct: 1027 LKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVCHQNGQSLILLLYVDDILLTGSDQLLMD 1086
Query: 473 QLTAKLHSIFALKQLGQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTP 532
+L L++ F++K LG YFLG+++ NG L L+Q Y +D+L + M + P+ TP
Sbjct: 1087 KLLQALNNRFSMKDLGPPRYFLGIEIESYNNG-LFLHQHAYASDILHQAGMTECNPMPTP 1145
Query: 533 MQFGAKLTKHGGTSLHDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAV 592
+ L +PT +RS+ G LQY TITRP+I YAVN +CQ + P + +
Sbjct: 1146 LP--QHLEDLNSEPFEEPTYFRSLAGKLQYLTITRPDIQYAVNFICQRMHAPTNSDFGLL 1203
Query: 593 KRILRYLKGTITHGVLLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISW 652
KRILRY+KGTI G+ P L FCD+D+ D RRST+G C+ LG LISW
Sbjct: 1204 KRILRYVKGTINMGL---PIRKHHNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTLISW 1260
Query: 653 TAKKQTLVARSSTEAEYRSLANTTAELLWVESLLTELKIAFTVPT-VLCDNMSTVLLTHN 711
+AK+Q ++ SSTEAEYR+L++T E+ W+ SLL +L I+ PT V CDN+S V L+ N
Sbjct: 1261 SAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQPTRVFCDNLSAVYLSAN 1320
Query: 712 PILHTRTKHMEMDLFFVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
P LH R+KH + D ++RE+V + QH+P+ Q AD+FTK+L F+ +R KL V
Sbjct: 1321 PALHKRSKHFDKDFHYIRERVALGLIETQHIPATIQLADVFTKSLPRRPFITLRAKLGV 1379
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 585 bits (1507), Expect = e-165
Identities = 328/783 (41%), Positives = 455/783 (57%), Gaps = 23/783 (2%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRHI+ET LL + +P HFW A TA YLIN +++LQG SP L+G+ P +
Sbjct: 486 ERKHRHIIETARTLLIASFVPAHFWAEAISTAVYLINMQPSSSLQGRSPGEVLFGSPPRY 545
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCLDQSGR-IYVSKDVLFH 121
L++FG C+ L P KL+ S ECVFLGYS HKGY+C D S R I +S+DV F
Sbjct: 546 DHLRVFGCTCYVLLAPRERTKLTAQSVECVFLGYSLEHKGYRCYDPSARRIRISRDVTFD 605
Query: 122 EHR--FPYTTLFPSEP-------FSPPTSSAEYFPLSTVPI----ISRSMPQPSPAPIST 168
E++ F +T PS P + PP S E P S + I S+P P+ P
Sbjct: 606 ENKPFFYSSTNQPSSPENSISFLYLPPIPSPESLPSSPITPSPSPIPPSVPSPTYVPPPP 665
Query: 169 ELANPGPLSPQSEASDLQSQPSPIPTGSGLASTSQPAEHASSESAHQEMATSSGVHAASS 228
+P P+SP S P +P S + + P ++ E S +
Sbjct: 666 PSPSPSPVSPPPSHIPASSSPPHVP--STITLDTFPFHYSRRPKIPNESQPSQPTLEDPT 723
Query: 229 ASTVAVPVNAHPMQTRSKSGIIKPRLNPTLLLTHMEPTTVKQAMRDDKWLQAMKEEYNAL 288
S V A R++ + P + ++ EP+T ++A+ W AM EE AL
Sbjct: 724 CS-VDDSSPAPRYNLRARDALRAPNRDDFVVGVVFEPSTYQEAIVLPHWKLAMSEELAAL 782
Query: 289 MSNGTWSLVPLPSNRKAVGCKWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSP 348
TW +VPLPS+ + CKW+Y+VK DG + +YKARLVA+G+ Q G DY ETF+P
Sbjct: 783 ERTNTWDVVPLPSHAVPITCKWVYKVKTKSDGQVERYKARLVARGFQQAHGRDYDETFAP 842
Query: 349 VVKPITVRLILSLAISRGWPLQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDKTLVCKLHK 408
V TVR ++++A +R W + Q+DV NAFL+G L EEVYM PPG E V +L +
Sbjct: 843 VAHMTTVRTLIAVAATRSWTISQMDVKNAFLHGDLHEEVYMHPPPGVE-APPGHVFRLRR 901
Query: 409 ALYGLKQAPRAWFHRLKEVLLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSM 468
ALYGLKQAPRAWF R V+L GF S DP+LF + S G +L+YVDD+++TGD +
Sbjct: 902 ALYGLKQAPRAWFARFSSVVLAAGFSPSDHDPALFIHTSSRGRTLLLLYVDDMLITGDDL 961
Query: 469 VLIQQLTAKLHSIFALKQLGQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAP 528
I + KL F + LG L YFLG++VT +G L+Q +YI DLL + + D+
Sbjct: 962 EYIAFVKGKLSEQFMMSDLGPLSYFLGIEVTSTVDG-YYLSQHRYIEDLLAQSGLTDSRT 1020
Query: 529 ISTPMQFGAKLTKHGGTSLHDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEH 588
+TPM+ +L GT L DP+ YR +VG+L Y T+TRP+I+YAV+ + QF+S P H
Sbjct: 1021 TTTPMELHVRLRSTDGTPLDDPSRYRHLVGSLVYLTVTRPDIAYAVHILSQFVSAPISVH 1080
Query: 589 WKAVKRILRYLKGTITHGVLLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPN 648
+ + R+LRYL+GT T + + + PL L AF D+ W SDP DRRS +G C+FLG +
Sbjct: 1081 YGHLLRVLRYLRGTTTQCLFY---AASSPLQLRAFSDSTWASDPIDRRSVTGYCIFLGTS 1137
Query: 649 LISWTAKKQTLVARSSTEAEYRSLANTTAELLWVESLLTELKIAFTVPT-VLCDNMSTVL 707
L++W +KKQT V+RSSTEAE R+LA TT+E++W+ LL + ++ VPT +LCDN +
Sbjct: 1138 LLTWKSKKQTAVSRSSTEAELRALATTTSEIVWLRWLLADFGVSCDVPTPLLCDNTGAIQ 1197
Query: 708 LTHNPILHTRTKHMEMDLFFVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDK 767
+ ++PI H TKH+ +D F R Q ++ + +VPSE Q AD FTKA + L K
Sbjct: 1198 IANDPIKHELTKHIGVDASFTRSHCQQSTIALHYVPSELQVADFFTKAQTREHHRLHLLK 1257
Query: 768 LNV 770
LNV
Sbjct: 1258 LNV 1260
>gb|AAD43604.1| T3P18.3 [Arabidopsis thaliana] gi|25301688|pir||H96650 protein
T3P18.3 [imported] - Arabidopsis thaliana
Length = 1309
Score = 584 bits (1506), Expect = e-165
Identities = 325/779 (41%), Positives = 458/779 (58%), Gaps = 29/779 (3%)
Query: 3 ERKHRHIVETGLALLSHASMPLHFWDHAFLTATYLINRMSTTTLQGASPYFKLYGNHPDF 62
ERKHRH+VE GL++L H+ PL FW AF TA YL N + ++ L+ SPY L+ D+
Sbjct: 462 ERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSSVLKEISPYETLFQQKVDY 521
Query: 63 KSLKIFGSACFPFLRPYNSNKLSLHSKECVFLGYSSSHKGYKCL-DQSGRIYVSKDVLFH 121
L++FG+AC+P LRP NK S +CVFLGY + +KGY+CL +G++Y+S+ V+F
Sbjct: 522 TPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPPTGKVYISRHVIFD 581
Query: 122 EHRFPYTTLFPSEPFSPPTSSAEYFPLS--TVPIISRSMPQP---SPAPISTELANPGPL 176
E +FP+ + S T+ + + + T P + S QP P++T P
Sbjct: 582 EAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPSVPSSQLQPLARQVTPMATSENQPMMN 641
Query: 177 SPQSEASDLQSQPSPIPTGSGLASTSQPAEHASSESAHQEMATSSGVHAASSASTVAVPV 236
EA ++ + TS E S++ E+A +A
Sbjct: 642 YETEEAVNVNME------------TSSDEETESNDEFDHEVAPVLNDQNEDNALGQGSLE 689
Query: 237 NAHPMQTRSKSGIIKPRLNPTLLLTHM---EPTTVKQAMRDDKWLQAMKEEYNALMSNGT 293
N HPM TRSK GI KP L+++ EP T+ AM+ W A+ +E + + T
Sbjct: 690 NLHPMITRSKDGIQKPNPRYALIVSKSSFDEPKTITTAMKHPGWNAAVMDEIDRIHMLNT 749
Query: 294 WSLVPLPSNRKAVGCKWIYRVKENPDGSINKYKARLVAKGYSQVQGFDYSETFSPVVKPI 353
WSLVP + + KW+++ K PDG+I+K KARLVAKG+ Q +G DY ETFSPVV+
Sbjct: 750 WSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTA 809
Query: 354 TVRLILSLAISRGWPLQQIDVNNAFLNGVLEEEVYMTQPPGFEHKDK-TLVCKLHKALYG 412
T+RL+L A + WPL+Q+DV+NAFL+G L+E V+M QP GF +K VC+L KALYG
Sbjct: 810 TIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRLTKALYG 869
Query: 413 LKQAPRAWFHRLKEVLLQFGFKASKCDPSLFTYNSPLGCIYMLIYVDDIILTGDSMVLIQ 472
LKQAPRAWF LL FGF+ S DPSLF + + +L+YVDDI+LTG +L+
Sbjct: 870 LKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVCHQNGQSLILLLYVDDILLTGSDQLLMD 929
Query: 473 QLTAKLHSIFALKQLGQLDYFLGVQVTHLANGSLLLNQTKYINDLLTKVNMADAAPISTP 532
+L L++ F++K LG YFLG+++ NG L L+Q Y +D+L + M + P+ TP
Sbjct: 930 KLLQALNNRFSMKDLGPPRYFLGIEIESYNNG-LFLHQHAYASDILHQAGMTECNPMPTP 988
Query: 533 MQFGAKLTKHGGTSLHDPTEYRSVVGALQYATITRPEISYAVNKVCQFLSDPHEEHWKAV 592
+ L +PT +RS+ G LQY TITRP+I YAVN +CQ + P + +
Sbjct: 989 LP--QHLEDLNSEPFEEPTYFRSLAGKLQYLTITRPDIQYAVNFICQRMHAPTNSDFGLL 1046
Query: 593 KRILRYLKGTITHGVLLQPCSMTQPLPLLAFCDADWGSDPDDRRSTSGSCVFLGPNLISW 652
KRILRY+KGTI G+ P L FCD+D+ D RRST+G C+ LG LISW
Sbjct: 1047 KRILRYVKGTINMGL---PIRKHHNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTLISW 1103
Query: 653 TAKKQTLVARSSTEAEYRSLANTTAELLWVESLLTELKIAFTVPT-VLCDNMSTVLLTHN 711
+AK+Q ++ SSTEAEYR+L++T E+ W+ SLL +L I+ PT V CDN+S V L+ N
Sbjct: 1104 SAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQPTRVFCDNLSAVYLSAN 1163
Query: 712 PILHTRTKHMEMDLFFVREKVQAKSLVVQHVPSEHQRADIFTKALSPTRFLLMRDKLNV 770
P LH R+KH + D ++RE+V + QH+P+ Q AD+FTK+L F+ +R KL V
Sbjct: 1164 PALHKRSKHFDKDFHYIRERVALGLIETQHIPATIQLADVFTKSLPRRPFITLRAKLGV 1222
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.319 0.133 0.398
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,340,535,924
Number of Sequences: 2540612
Number of extensions: 58028346
Number of successful extensions: 240509
Number of sequences better than 10.0: 3059
Number of HSP's better than 10.0 without gapping: 1735
Number of HSP's successfully gapped in prelim test: 1417
Number of HSP's that attempted gapping in prelim test: 224010
Number of HSP's gapped (non-prelim): 8775
length of query: 778
length of database: 863,360,394
effective HSP length: 136
effective length of query: 642
effective length of database: 517,837,162
effective search space: 332451458004
effective search space used: 332451458004
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)
Lotus: description of TM0348.7