
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0019a.6
(1380 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] 910 0.0
gb|AAK51235.1| polyprotein [Arabidopsis thaliana] 875 0.0
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas... 870 0.0
gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein ... 845 0.0
emb|CAB81170.1| retrotransposon like protein [Arabidopsis thalia... 829 0.0
gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cult... 793 0.0
dbj|BAB10876.1| polyprotein [Arabidopsis thaliana] 790 0.0
gb|AAC02664.1| polyprotein [Arabidopsis thaliana] 787 0.0
gb|AAC02666.1| polyprotein [Arabidopsis thaliana] 783 0.0
gb|AAC02669.1| polyprotein [Arabidopsis thaliana] 783 0.0
gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsi... 775 0.0
emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|49720... 760 0.0
gb|AAK43485.1| polyprotein, putative [Arabidopsis thaliana] 758 0.0
gb|AAU43956.1| unknown protein [Oryza sativa (japonica cultivar-... 757 0.0
gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir|... 734 0.0
gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cult... 730 0.0
ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (jap... 667 0.0
gb|AAD14478.1| Strong similarity to gb|AF039376 Evelknievel retr... 542 e-152
ref|XP_507106.1| PREDICTED OJ1499_A07.20 gene product [Oryza sat... 506 e-141
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha... 447 e-124
>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 910 bits (2351), Expect = 0.0
Identities = 558/1416 (39%), Positives = 763/1416 (53%), Gaps = 108/1416 (7%)
Query: 17 ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
+T+KLN N+LLW Q ++L +Q L V+ +V PP++ L V PNP +
Sbjct: 17 VTLKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEV---PNPQY 73
Query: 77 VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
W D+ V S L +L+EE L + TT+R +W +L + + ARE LR LQL
Sbjct: 74 EDWFCTDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWISLAENFNKSSIAREFSLRRNLQL 133
Query: 137 MRRGSLSVSEYGR----------KIR*SVANDDKVHWFLRGLGPSYANFST---GQLDQV 183
+ + S+S Y R I V K+ FL GLG Y +T L ++
Sbjct: 134 LTKKDKSLSVYCRDFKIICDSLSSIGKPVEESMKIFGFLNGLGREYDPITTVIQSSLSKL 193
Query: 184 PLPLFTDILWKVESHAIFQASLEDYVTPPSAAFH-ARNPTRS-SGSQSSGGGHRGNSSSG 241
P P F D++ +V+ F + L+ Y S H A N RS SG+ RG SG
Sbjct: 194 PAPTFNDVISEVQG---FDSKLQSYDDTVSVNPHLAFNTERSNSGAPQYNSNSRGRGRSG 250
Query: 242 SRPRRDNGGSHCRG-------------SYTPRCQLCRKQGHYAAKCPVRWDRPSESANLT 288
R GG RG P CQ+C + GH A KC R+D +S T
Sbjct: 251 QN--RGRGGYSTRGRGFSQHQSASPSSGQRPVCQICGRIGHTAIKCYNRFDNNYQSEVPT 308
Query: 289 HSFAAGCSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIG 348
+F+A +++ + Y D+ AT+H+T S S L + Y GND VLVG+G L ITH+G
Sbjct: 309 QAFSA-LRVSDETGKEWYPDSAATAHITASTSGLQNATTYEGNDAVLVGDGTYLPITHVG 367
Query: 349 SRSASHS---VPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGK 405
S + S S +PL+ VLV P + +L+SVSKL D+ F+A+ I + T V+ K
Sbjct: 368 STTISSSKGTIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLTTQKVVSK 427
Query: 406 GPCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILP 465
GP + G Y+L+ S+ + S+ AS E WH RL H N I+++L V+
Sbjct: 428 GPRNNGLYMLEN-SEFVALYSNRQCAASMETWHHRLGHSNSKILQQLLTRKEIQVNKSRT 486
Query: 466 KPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQ 525
P+C CQM KS RL + + RA LD VHCDLWGPSPV S GF Y+ +FVDDFS+
Sbjct: 487 SPVC-EPCQMGKSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSR 545
Query: 526 FTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHR 585
F+WF+PL+ KS F V + ++ VENQ +K FQSD G EFT+NK++ F G+ HR
Sbjct: 546 FSWFFPLRMKSKFISVFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLKEHFREHGIHHR 605
Query: 586 FLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQ 645
CP+T QNG ERKHRH++ELGL+MLYHSH +WV AF TA Y+ N +PS VL +
Sbjct: 606 ISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSSVLKEI 665
Query: 646 IPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPT 705
P++ LFQ Y FG +PCLRP KNKF PR C+FLGY++ +KG++C P
Sbjct: 666 SPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPP 725
Query: 706 TSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPTG 765
T + Y+SRH FDE FP F E P Q +L + + T
Sbjct: 726 TGKVYISRHVIFDEAQFP---------------FKEKYHSLVPKYQTTLLQAWQHTDLTP 770
Query: 766 PLPCPSCVDPDV-QPVPVDDAPPSPVPH---NDAPPLPAPTS---PTPPATPLVVTQAPP 818
P S + P Q P+ + P+ + +A + TS T AP
Sbjct: 771 PSVPSSQLQPLARQMTPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPV 830
Query: 819 TSPPTTPPAVTQVPLAPVDARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGF 878
+ A+ Q L + TRS +GI KPNPRYAL+ + + PK
Sbjct: 831 LNDQNEDNALGQGSLENLHPM-ITRSKDGIQKPNPRYALI--------VSKSSFDEPKTI 881
Query: 879 KSAMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTC 938
+AMKHP W A+ DE+ +H TW+LVP N++ SK VF+TK DGT+++LK
Sbjct: 882 TTAMKHPSWNAAVMDEIDRIHMLNTWSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKAR 941
Query: 939 LVAQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVY 998
LVA+G Q G DY TFSPVV+ T+RL+L N+ L QLDV NAFLHG L E V+
Sbjct: 942 LVAKGFDQEEGVDYLETFSPVVRTATIRLVLDTATANEWPLKQLDVSNAFLHGELQEPVF 1001
Query: 999 MEQPHGFVD------------LVSRLMCVG*TRLSTASNRRLVLGF----SA*ALFSCVL 1042
M QP GFVD + L T SN L GF S +LF C
Sbjct: 1002 MFQPSGFVDPNKPNHVCRLTKALYGLKQAPRAWFDTFSNFLLDFGFECSTSDPSLFVC-- 1059
Query: 1043 VFRVVGLIHPCSFFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLG 1102
++ +L LL+YVDDI+LTGSD L+ + LN+ F++K LG
Sbjct: 1060 --------------HQNGQSLILLLYVDDILLTGSDQLLMDKLLQALNNRFSMKDLGPPR 1105
Query: 1103 YFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLVS-SGEGYSDPTH 1161
YFLG+EI +GLFL YA D+L A M E + + TPL HL + E + +PT+
Sbjct: 1106 YFLGIEIESYNNGLFLHQHAYASDILHQAGMTECNPMPTPLP--QHLEDLNSEPFEEPTY 1163
Query: 1162 YRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRT 1221
+RSL G LQYLTITRPD+ YAVN + Q + PT F +KRI+RYV + GL R+
Sbjct: 1164 FRSLAGKLQYLTITRPDIQYAVNFICQRMHAPTNSDFGLLKRILRYVKGTINMGLPIRKH 1223
Query: 1222 SSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANT 1281
+P + G+ D+D+A C D RRST G+ I LG L+SWSAK+QP+++HSS E+EYRA+++T
Sbjct: 1224 HNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTLISWSAKRQPTISHSSTEAEYRALSDT 1283
Query: 1282 ASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSS 1341
A E+ W+ +LL +L + P DN SA++++ NP HK +KH D D H++RE V+
Sbjct: 1284 AREITWISSLLRDLGISQHQPTRVFCDNLSAVYLSANPALHKRSKHFDKDFHYIRERVAL 1343
Query: 1342 GRLAVRHVPTSLQLADIFTKVLPRPLFDLFRSKLRV 1377
G + +H+P ++QLAD+FTK LPR F R+KL V
Sbjct: 1344 GLIETQHIPATIQLADVFTKSLPRRPFITLRAKLGV 1379
>gb|AAK51235.1| polyprotein [Arabidopsis thaliana]
Length = 1453
Score = 875 bits (2260), Expect = 0.0
Identities = 540/1428 (37%), Positives = 749/1428 (51%), Gaps = 124/1428 (8%)
Query: 17 ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
+T+KLN N+LLW Q ++L L V+ + PP + V NP +
Sbjct: 17 VTLKLNDSNYLLWKTQFESLLSCHKLIGFVNGGITPPPRTLNVVTGDTS---VDVANPQY 73
Query: 77 VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
W D+ + S L +L+EE L + T+RD+W +L + + ARE LR LQL
Sbjct: 74 ESWFCTDQLIRSWLFGTLSEEVLGYVHNLQTSRDIWISLAENFNKSSVAREFTLRRTLQL 133
Query: 137 MRRGSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFST---GQLDQV 183
+ + ++S Y R+ I V K+ FL GLG Y +T L ++
Sbjct: 134 LSKKDKTLSAYCREFIAVCDALSSIGKPVDESMKIFGFLNGLGREYDPITTVIQSSLSKI 193
Query: 184 PLPLFTDILWKVESHAIFQASLEDYVTP-PSAAFHARNPTRSSGSQSSGGGHRGNSSSGS 242
P F D++ +V+ + S E+ VT P AF N RS + + G+RG G
Sbjct: 194 SPPTFRDVISEVKGFDVKLQSYEESVTANPHMAF---NTQRSEYTDNYTSGNRGKGRGGY 250
Query: 243 RPRRDNGGSHCRG-------------SYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTH 289
R G RG P CQ+C + GH A KC R+D +S +
Sbjct: 251 GQNRGRSGYSTRGRGFSQHQTNSNNTGERPVCQICGRTGHTALKCYNRFDHNYQSVDTAQ 310
Query: 290 SFAAGCSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
+F++ +++S+ + D+ AT+H+T S + L + Y+G+D VLVG+GA L ITH+GS
Sbjct: 311 AFSS-LRVSDSSGKEWVPDSAATAHVTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGS 369
Query: 350 R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
S S ++PL+ VLV P + +L+SVSKL D+ F+A+ I + T V+ KG
Sbjct: 370 TTISSDSGTLPLNEVLVCPDIQKSLLSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKG 429
Query: 407 PCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLH--KLGCFNVSSIL 464
P G YVL+ + + S+ AS E+WH RL H N I+++L K FN S +
Sbjct: 430 PRSNGLYVLEN-QEFVAFYSNRQCAASEEIWHHRLGHSNSRILQQLKSSKEISFNKSRMS 488
Query: 465 PKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFS 524
P C CQM KS +L + ++ R +L +HCDLWGPSPV S GF Y+V+FVDD+S
Sbjct: 489 P---VCEPCQMGKSSKLQFFSSNSRELDLLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYS 545
Query: 525 QFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLH 584
+++WFYPLK KSDF+ V V F+ VENQF+ +KVFQSD G EFT+N ++ + G+ H
Sbjct: 546 RYSWFYPLKAKSDFFAVFVAFQNLVENQFNTKIKVFQSDGGGEFTSNLMKKHLTDCGIQH 605
Query: 585 RFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSD 644
R CP+T QNG ERKHRH +ELGL+M++HSH +WV AF TA ++ N +PS L +
Sbjct: 606 RISCPYTPQQNGIAERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASFLSNMLPSPSLGN 665
Query: 645 QIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDP 704
P + L + P YA FG +PCLRP ++KF PR C+FLGYNS +KG++C P
Sbjct: 666 VSPLEALLKQKPNYAMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYNSQYKGYRCLYP 725
Query: 705 TTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPA------AGSPPSLQPVILVVP 758
T R Y+SRH FDE FP K LV YE + + P + Q +I
Sbjct: 726 PTGRVYISRHVIFDEETFPF---KQKYQFLV--PQYESSLLSAWQSSIPQADQSLIPQAE 780
Query: 759 ESSPPTGPLPCPSCVDPDVQPVPVDDAPPSPV----------------PHNDAPPLPAPT 802
E + P P +Q + D P + L T
Sbjct: 781 EGKIESLAKP------PSIQKNTIQDTTTQPAILTEGVLNEEEEEDSFEETETESLNEET 834
Query: 803 SPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDARPRTRS*NGIFKPNPRYALVHAQP 862
+ V + P T P TRS GI K N RYA
Sbjct: 835 HTQNDEAEVTVEEEVQQEPENTHPMT-------------TRSKAGIHKSNTRYA------ 875
Query: 863 TGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VF 922
LLT+ +V PK A+ HP W A+ DE+ +H TW+LV N++G + VF
Sbjct: 876 --LLTSKFSVEEPKSIDEALNHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNILGCRWVF 933
Query: 923 RTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQL 982
+TK DG+V++LK LVA+G Q G DY TFSPVV+ T+RL+L + + QL
Sbjct: 934 KTKLKPDGSVDKLKARLVAKGFHQEEGLDYLETFSPVVRTATIRLVLDVATAKGWNIKQL 993
Query: 983 DVKNAFLHGHLTETVYMEQPHGFVDLVSRLMCVG*TR------------LSTASNRRLVL 1030
DV NAFLHG L E VYM QP GFVD T+ T SN L
Sbjct: 994 DVSNAFLHGELKEPVYMLQPPGFVDQEKPSYVCRLTKALYGLKQAPRAWFDTISNYLLDF 1053
Query: 1031 GFSA*ALFSCVLVFRVVGLIHPCSF-FYKGHITLYLLVYVDDIILTGSDPSLLT*FIACL 1089
GFS P F ++K TL LL+YVDDI+LTGSD +LL + L
Sbjct: 1054 GFSC-------------SKSDPSLFTYHKNGKTLVLLLYVDDILLTGSDHNLLQELLMSL 1100
Query: 1090 NDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHL 1149
N F++K LG YFLG+EI + +GLFL YA D+L A M + + TPL
Sbjct: 1101 NKRFSMKDLGAPSYFLGVEIESSPEGLFLHQTAYAKDILHQAAMSNCNSMPTPLPQHIEN 1160
Query: 1150 VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVC 1209
++S + + +PT++RSL G LQYLTITRPD+ +AVN + Q + +PT F +KRI+RYV
Sbjct: 1161 LNS-DLFPEPTYFRSLAGKLQYLTITRPDIQFAVNFICQRMHSPTTADFGLLKRILRYVK 1219
Query: 1210 AMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHS 1269
H GL ++ + +++ YSD+DWA C + RRST G+ LG NL+SWSAK+Q +V+ S
Sbjct: 1220 GTIHLGLHIKKNQNLSLVAYSDSDWAGCKETRRSTTGFCTLLGCNLISWSAKRQETVSKS 1279
Query: 1270 SCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHID 1329
S E+EYRA+ A EL WL LL ++ V + P L DN SA++++ NP H +KH D
Sbjct: 1280 STEAEYRALTAVAQELTWLSFLLRDIGVTQTHPTLVKCDNLSAVYLSANPALHNRSKHFD 1339
Query: 1330 LDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLFRSKLRV 1377
D H++RE V+ G + +H+ +LQLADIFTK LPR F R KL V
Sbjct: 1340 TDYHYIREQVALGLVETKHISATLQLADIFTKPLPRRAFIDLRIKLGV 1387
>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078
gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
Arabidopsis thaliana
Length = 1415
Score = 870 bits (2249), Expect = 0.0
Identities = 540/1408 (38%), Positives = 746/1408 (52%), Gaps = 110/1408 (7%)
Query: 17 ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
+T+KL N+LLW Q ++L +Q L V+ V PS+ L PNPL+
Sbjct: 17 VTLKLTDSNYLLWKTQFESLLSSQKLIGFVNGAVNAPSQSRLVVNGEVTSE---EPNPLY 73
Query: 77 VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
W D+ V S L +L+EE L + +T+R +W +L + + ARE LR LQL
Sbjct: 74 ESWFCTDQLVRSWLFGTLSEEVLGHVHNLSTSRQIWVSLAENFNKSSVAREFSLRQNLQL 133
Query: 137 MRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFST---GQLDQV 183
+ + S Y R+ + V K+ FL GLG Y +T L ++
Sbjct: 134 LSKKEKPFSVYCREFKTICDALSSIGKPVDESMKIFGFLNGLGRDYDPITTVIQSSLSKL 193
Query: 184 PLPLFTDILWKVESHAIFQASLEDYVTPPSAAFH-ARNPTRS-SGSQSSGGGHRGNSSSG 241
P P F D++ +V+ F + L+ Y S H A N RS SGS +G SG
Sbjct: 194 PTPTFNDVVSEVQG---FDSKLQSYEEAASVTPHLAFNIERSESGSPQYNPNQKGRGRSG 250
Query: 242 SRPRRDNGGSHCRG-----------SYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHS 290
R + RG P CQ+C + GH A KC R+D N
Sbjct: 251 QNKGRGGYSTRGRGFSQHQSSPQVSGPRPVCQICGRTGHTALKCYNRFDN-----NYQAE 305
Query: 291 FAAGCSLNNSNRSDK--YMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIG 348
A +L S+ + K + D+ AT+H+T S + L + Y G+D VLVG+G L ITH G
Sbjct: 306 IQAFSTLRVSDDTGKEWHPDSAATAHVTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTG 365
Query: 349 S---RSASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGK 405
S +S++ +PL+ VLVVP + +L+SVSKL D+ F+A+ I + QT V+
Sbjct: 366 STTIKSSNGKIPLNEVLVVPNIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLQTQKVVTT 425
Query: 406 GPCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILP 465
GP G YVL+ L ++ A+ E+WH RL H N ++ L ++
Sbjct: 426 GPRRNGLYVLENQEFVALYSNRQCA-ATEEVWHHRLGHANSKALQHLQNSKAIQINKSRT 484
Query: 466 KPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQ 525
P+C CQM KS RL + R LD +HCDLWGPSPV S G Y+ IFVDD+S+
Sbjct: 485 SPVC-EPCQMGKSSRLPFLISDSRVLHPLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSR 543
Query: 526 FTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHR 585
++WFYPL KS+F V + F+ VENQ + +KVFQSD G EF +NK++ S G+ HR
Sbjct: 544 YSWFYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQSDGGGEFVSNKLKTHLSEHGIHHR 603
Query: 586 FLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQ 645
CP+T QNG ERKHRH++ELGL+ML+HSH +WV +F TA YIINR+PS VL +
Sbjct: 604 ISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYIINRLPSSVLKNL 663
Query: 646 IPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPT 705
P++ LF P Y++ FG +PCLRP +NKF PR C+FLGYNS +KG++CF P
Sbjct: 664 SPYEALFGEKPDYSSLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQYKGYRCFYPP 723
Query: 706 TSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPTG 765
T + Y+SR+ F+E P K LV P +P + E S P
Sbjct: 724 TGKVYISRNVIFNESELPF---KEKYQSLV------PQYSTPLLQAWQHNKISEISVPAA 774
Query: 766 PLPCPSCVDPDVQPVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQAPPTSPPTTP 825
P+ S +P+ ++ S V P PTS + V +P
Sbjct: 775 PVQLFS------KPIDLNTYAGSQVTEQLTD--PEPTSNNEGSDEEV-------NPVAEE 819
Query: 826 PAVTQVPLAPVDARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHP 885
A Q + A TRS GI KPN RYAL+ T+ PK SAMKHP
Sbjct: 820 IAANQEQVINSHAM-TTRSKAGIQKPNTRYALI--------TSRMNTAEPKTLASAMKHP 870
Query: 886 HWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLT 945
W A+ +E++ +H TW+LVP N++ SK VF+TK H DG++++LK LVA+G
Sbjct: 871 GWNEAVHEEINRVHMLHTWSLVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFD 930
Query: 946 QIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGF 1005
Q G DY TFSPVV+ T+RL+L + + QLDV NAFLHG L E V+M QP GF
Sbjct: 931 QEEGVDYLETFSPVVRTATIRLVLDVSTSKGWPIKQLDVSNAFLHGELQEPVFMYQPSGF 990
Query: 1006 VD------------LVSRLMCVG*TRLSTASNRRLVLGF----SA*ALFSCVLVFRVVGL 1049
+D + L T SN L GF S +LF C
Sbjct: 991 IDPQKPTHVCRLTKAIYGLKQAPRAWFDTFSNFLLDYGFVCSKSDPSLFVC--------- 1041
Query: 1050 IHPCSFFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEI 1109
++ LYLL+YVDDI+LTGSD SLL + L + F++K LG YFLG++I
Sbjct: 1042 -------HQDGKILYLLLYVDDILLTGSDQSLLEDLLQALKNRFSMKDLGPPRYFLGIQI 1094
Query: 1110 TYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLVSSGEGYSDPTHYRSLVGAL 1169
A+GLFL YA D+L A M + + + TPL ++S E +++PT++RSL G L
Sbjct: 1095 EDYANGLFLHQTAYATDILQQAGMSDCNPMPTPLPQQLDNLNS-ELFAEPTYFRSLAGKL 1153
Query: 1170 QYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGY 1229
QYLTITRPD+ +AVN + Q + +PT F +KRI+RY+ GL +R S+ + Y
Sbjct: 1154 QYLTITRPDIQFAVNFICQRMHSPTTSDFGLLKRILRYIKGTIGMGLPIKRNSTLTLSAY 1213
Query: 1230 SDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLL 1289
SD+D A C + RRST G+ I LG NL+SWSAK+QP+V++SS E+EYRA+ A E+ W+
Sbjct: 1214 SDSDHAGCKNTRRSTTGFCILLGSNLISWSAKRQPTVSNSSTEAEYRALTYAAREITWIS 1273
Query: 1290 NLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHV 1349
LL +L + P DN SA++++ NP H +KH D D H++RE V+ G + +H+
Sbjct: 1274 FLLRDLGIPQYLPTQVYCDNLSAVYLSANPALHNRSKHFDTDYHYIREQVALGLIETQHI 1333
Query: 1350 PTSLQLADIFTKVLPRPLFDLFRSKLRV 1377
+ QLAD+FTK LPR F RSKL V
Sbjct: 1334 SATFQLADVFTKSLPRRAFVDLRSKLGV 1361
>gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein [Zea mays]
gi|7444442|pir||T02087 gag/pol polyprotein - maize
retrotransposon Hopscotch
Length = 1439
Score = 845 bits (2182), Expect = 0.0
Identities = 548/1441 (38%), Positives = 756/1441 (52%), Gaps = 120/1441 (8%)
Query: 17 ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPI-VAPPSEFMLQPVSSALHHVVPAPNPL 75
++ KL GN+LLW QV ++ L DI+ + + PP + +S A V NP
Sbjct: 20 VSEKLTKGNYLLWKAQVLPAIRAAQLDDILTGVEICPP-----KTISDASDRTVTVANPA 74
Query: 76 FVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQ 135
+ +W ARD+ V LLSSL+ E LS ++ +T+ VWT L Y+S ++AR++ R L
Sbjct: 75 YGRWIARDQAVLGYLLSSLSREVLSSVVNCSTSASVWTTLSEMYSSHSRARKVNTRIALA 134
Query: 136 LMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFSTGQLDQ--- 182
++G+ SV+EY K+R + +++ V + L GL + T + +
Sbjct: 135 TTKKGASSVAEYFAKMRGFADELGAAGKPLDDEEFVSFLLTGLDEDFNPLVTAVVARSDP 194
Query: 183 -VPLPLFTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGG-------- 233
P L+T +L E+ Q + ++ +AR+P R SGG
Sbjct: 195 ITPGDLYTQLL-SYENRMHLQTGSSSLM---QSSANARSPGRGMSWGRSGGRGFSRGRGR 250
Query: 234 ----HRGNSSSGSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTH 289
RG S R +G + S PRCQ+C + GH A C W R E+
Sbjct: 251 GRGPSRGGFQSFGRGNNYSGATDADTSSRPRCQVCSRVGHTALNC---WYRFDENYVPDQ 307
Query: 290 SFAAGCSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
A + N + Y DTGAT H+T L +LT Y+G D+++ NG G+ I++IG+
Sbjct: 308 RSANSAAHQNGSNVPWYTDTGATDHITGDLDRLTMHDKYTGTDQIIAANGTGMTISNIGN 367
Query: 350 R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
++S S+ L +VL VP NL+SV +LT D++V F + F+I++RQT AVL G
Sbjct: 368 AIVPTSSRSLHLRSVLHVPSTHKNLISVHRLTNDNDVFIEFHSSHFLIKDRQTKAVLLHG 427
Query: 407 PCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKL---HKLGCFNVSSI 463
C G Y L L + S R E WH RL H + DI+ ++ + L C + +S
Sbjct: 428 KCRDGLYPLPPHPDLRLKHNFSSTRVPLEHWHKRLGHPSRDIVHRVISNNNLPCLSNNST 487
Query: 464 LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
C +C AK+ +L + ++SA L L+ D++GP+ + S + Y+V F+DD+
Sbjct: 488 TS---VCDACLQAKAHQLPYTISMSQSSAPLMLIFSDVFGPA-IDSFGRYKYYVSFIDDY 543
Query: 524 SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
S+FTW Y L+ KSD Y F+ VE F R + FQSD G E+ K+ A F + G+
Sbjct: 544 SKFTWIYLLRHKSDVYKSFCEFQHLVERMFGRKIIAFQSDWGGEY--EKLNAHFKTIGIH 601
Query: 584 HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
H+ CPHT QNG ERKHRH++E+GLA+L S + YW HAF AVY+INR PSK ++
Sbjct: 602 HQVSCPHTHQQNGAAERKHRHIVEVGLALLAQSSMPLKYWDHAFLAAVYLINRTPSKTIA 661
Query: 644 DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
P L P Y++ FGC +P LRPY ++K R T C+FLGY++ HKGFKC D
Sbjct: 662 HDTPLHKLTGATPDYSSLRIFGCACWPNLRPYNQHKLQFRSTRCVFLGYSNMHKGFKCLD 721
Query: 704 PTTSRTYVSRHAQFDEFCFPLTGSKSS------------PND------LVVFTFYEPAAG 745
+T R Y+SR FDE FP + P+D L P +
Sbjct: 722 ISTGRIYISRDVVFDEHVFPFASLNKNAGVKYTSEVLLLPHDSCGNNMLTDHANNLPGSS 781
Query: 746 SP---------------PSLQPVILVVPESSPPTGPLP---CPSCVDPDVQPVPVD-DAP 786
SP P+ + +P S P +P PS + P P P A
Sbjct: 782 SPLPFLAQHFLQGNSEVPTSNNTAMALPASGPNEVSVPPALVPSSLVPAASPAPTGVSAN 841
Query: 787 PSPVPHNDA----PPLPAPT-SPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDARPR 841
P P D+ PP+ + + P A PL+ QAP +S P PL+ A PR
Sbjct: 842 AEPAPEADSLSSGPPVATESVTGVPDADPLL--QAPGSSVAHQTP--DSAPLSA--AAPR 895
Query: 842 TRS*NGIFKPNP------RYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDEL 895
TR +GI KP RY A +T P A+ P W AME E
Sbjct: 896 TRLQHGISKPKQFTDGTVRYG----------NAAARITEPSSVSEALADPQWRAAMEAEF 945
Query: 896 SALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLT 955
AL K TWTLVP T N++ K VF+ K+++DG+++RLK LVA+G Q G DY T
Sbjct: 946 QALQKNNTWTLVPPDRTRNLIDCKWVFKVKYNADGSIDRLKARLVAKGFKQQYGIDYDDT 1005
Query: 956 FSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDLVSRLMCV 1015
FSPVVK +T+RL+LSL V L QLDV+NAFLHG L ETVYM+QP GF D
Sbjct: 1006 FSPVVKHSTIRLVLSLAVSQKWSLRQLDVQNAFLHGILEETVYMKQPPGFADTTHPNYHC 1065
Query: 1016 G*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGHIT-LYLLVYVDDIIL 1074
+ +R +S + L F V F Y H T +Y+LVYVDDII+
Sbjct: 1066 HLQKSLYGLKQRPRAWYSRLSEKLQSLGF-VPSKADVSLFIYNAHSTAIYILVYVDDIII 1124
Query: 1075 TGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALML 1134
TGS P + +A L D+FAIK LG L YFLG+E+ DGL L KYA DLL M
Sbjct: 1125 TGSSPHAIDNVLAKLKDDFAIKDLGDLHYFLGIEVHRKGDGLLLCQEKYARDLLKRVGME 1184
Query: 1135 EASHVLTPLAAGSHLVSSGEGYSDP---THYRSLVGALQYLTITRPDLSYAVNTVSQFLQ 1191
V TP+A L +S P T YRS+VGALQYLT+TRPDLSYA+N V QFL
Sbjct: 1185 CCKPVHTPVATSEKLSASAGTLLSPEETTKYRSVVGALQYLTLTRPDLSYAINRVCQFLH 1244
Query: 1192 TPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFL 1251
PT H+ AVKRI+R + GLT R + S + +SDADWA C D R+ST GYA+FL
Sbjct: 1245 APTDLHWTAVKRILRNIQHTIGLGLTIRPSLSLMLSAFSDADWAGCPDDRKSTGGYALFL 1304
Query: 1252 GYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQS 1311
G NL+SW++KKQ +V+ SS E+EY+AMAN +E++WL +LLHEL +RL+ P DN
Sbjct: 1305 GPNLISWNSKKQSTVSRSSTEAEYKAMANATAEVIWLQSLLHELGIRLTGIPRLWCDNLG 1364
Query: 1312 ALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLF 1371
A +++ P+ + KHI++D HFVR+ V S +L +R + T+ Q+AD FTK L + F
Sbjct: 1365 ATYLSSKPIFNARTKHIEVDFHFVRDRVLSKKLDIRLISTNDQVADGFTKALTIGRLNEF 1424
Query: 1372 R 1372
R
Sbjct: 1425 R 1425
>emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana]
gi|4539447|emb|CAB40035.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444419|pir||T04204
hypothetical protein T4F9.150 - Arabidopsis thaliana
Length = 1515
Score = 829 bits (2142), Expect = 0.0
Identities = 516/1387 (37%), Positives = 735/1387 (52%), Gaps = 109/1387 (7%)
Query: 73 NPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRD 132
N F++W D+ V + + SL+EEAL + +A++VW L + + R+ L+
Sbjct: 66 NQEFLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQK 125
Query: 133 ELQLMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFST---GQ 179
L + ++ Y +++ V +K+ L GLG Y + +T
Sbjct: 126 RLGTCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHS 185
Query: 180 LDQVPLPLFTDILWKVESHAIFQASLEDYVT----PPSAAFHARNPTRSSGSQSSGGGHR 235
LD P P F D+++K+ + F L Y P AF+ S G+ +S GG
Sbjct: 186 LDVYPGPCFDDVVYKLTT---FDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNSRGGRY 242
Query: 236 GN-------SSSGSRPRRDNGGSHCRGSYT---PRCQLCRKQGHYAAKCPVRWDRPSESA 285
GN SS G + G GS P CQ+CRK GH A KC R++
Sbjct: 243 GNFRGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFKCYTRFEENYLPE 302
Query: 286 NLTHSFAA-GCSLNNSNRSDKYM-DTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLH 343
+L ++FAA S N S +++ D+ AT+H+T++ L S YSG+D V+VGNG L
Sbjct: 303 DLPNAFAAMRVSDQNQASSHEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLP 362
Query: 344 ITHIGS---RSASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTG 400
ITHIG+ + ++PL +VLV P +T +L+SVSKLT D+ F++DS VI++++T
Sbjct: 363 ITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQ 422
Query: 401 AVLGKGPCDKGFYVL-DQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFN 459
+L +G KG YVL D Q +T + E+WH RL H N ++++ L K
Sbjct: 423 QLLTQGNKHKGLYVLKDVPFQTYYSTRQQ--SSDDEVWHQRLGHPNKEVLQHLIKTKAIV 480
Query: 460 VSSILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIF 519
V+ C +CQM K RL + +S L+ +HCDLWGP+PV S GF Y+VIF
Sbjct: 481 VNKTSSN--MCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIF 538
Query: 520 VDDFSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSS 579
+D++S+FTWFYPLK KSDF+ V V F+ VENQ+ + +FQ D G EF + K A +S
Sbjct: 539 IDNYSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGGEFVSYKFVAHLAS 598
Query: 580 SGVLHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPS 639
G+ CPHT QNG ER+HR++ ELGL++++HS V WV AF T+ ++ N +PS
Sbjct: 599 CGIKQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHKLWVEAFFTSNFLSNLLPS 658
Query: 640 KVLSD-QIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKG 698
LSD + P+++L P Y FG +P LRPY KNKF P+ C+FLGYN+ +KG
Sbjct: 659 STLSDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKFDPKSLLCVFLGYNNKYKG 718
Query: 699 FKCFDPTTSRTYVSRHAQFDEFCFPLTGSKSSPNDL---VVFTFYEPAAGS------PPS 749
++C P T + Y+ RH FDE FP + S + +FT ++ S PS
Sbjct: 719 YRCLHPPTGKVYICRHVLFDERKFPYSDIYSQFQTISGSPLFTAWQKGFSSTALSRETPS 778
Query: 750 LQPVILVVP----ESSPPTGPLP--CPSCVDPDVQPVPVDD--APPSPVPHNDAPPLP-- 799
++ P SS PTG P + PDV D PPSP+ P P
Sbjct: 779 TNVEDIIFPSATVSSSVPTGCAPNIAETATAPDVDVAAAHDMVVPPSPITSTSLPTQPEE 838
Query: 800 -----------APTSPTPPATP--LVVTQAPPTSPPTTPPAVTQVPLAPVDARPR-TRS* 845
+ T+ + TP + V+ + P ++ AP + P TR+
Sbjct: 839 STSDQNHYSTDSETAISSAMTPQSINVSLFEDSDFPPLQSVISSTTAAPETSHPMITRAK 898
Query: 846 NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWT 905
+GI KPNP+YA L + PK K A+K W AM +E+ +H+ TW
Sbjct: 899 SGITKPNPKYA--------LFSVKSNYPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWD 950
Query: 906 LVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTV 965
LVP ++G K VF+TK +SDG+++RLK LVA+G Q G DY T+SPVV++ TV
Sbjct: 951 LVPPEMVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATV 1010
Query: 966 RLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVD------------LVSRLM 1013
R IL + +N L QLDVKNAFLH L ETV+M QP GF D + L
Sbjct: 1011 RSILHVATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFEDPSRPDYVCKLKKAIYDLK 1070
Query: 1014 CVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFY-KGHITLYLLVYVDDI 1072
S+ L GF + P F Y KG ++LL+YVDD+
Sbjct: 1071 QAPRAWFDKFSSYLLKYGF-------------ICSFSDPSLFVYLKGRDVMFLLLYVDDM 1117
Query: 1073 ILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHAL 1132
ILTG++ LL + L+ EF +K +G L YFLG++ Y DGLFL KY DLL +A
Sbjct: 1118 ILTGNNDVLLQQLLNILSTEFRMKDMGALHYFLGIQAHYHNDGLFLSQEKYTSDLLVNAG 1177
Query: 1133 MLEASHVLTPLAAGSHLVSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQT 1192
M + S + TPL L + + + +PT++R L G LQYLT+TRPD+ +AVN V Q +
Sbjct: 1178 MSDCSSMPTPLQL-DLLQGNNKPFPEPTYFRRLAGKLQYLTLTRPDIQFAVNFVCQKMHA 1236
Query: 1193 PTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLG 1252
PT+ F +KRI+ Y+ G+ + + YSD+DWA C D RRST G+ FLG
Sbjct: 1237 PTMSDFHLLKRILHYLKGTMTMGINLSSNTDSVLRCYSDSDWAGCKDTRRSTGGFCTFLG 1296
Query: 1253 YNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSA 1312
YN++SWSAK+ P+V+ SS E+EYR ++ ASE+ W+ LL E+ + P DN SA
Sbjct: 1297 YNIISWSAKRHPTVSKSSTEAEYRTLSFAASEVSWIGFLLQEIGLPQQQIPEMYCDNLSA 1356
Query: 1313 LFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLFR 1372
++++ NP H +KH +D ++VRE V+ G L V+H+P S QLADIFTK LP+ F R
Sbjct: 1357 VYLSANPALHSRSKHFQVDYYYVRERVALGALTVKHIPASQQLADIFTKSLPQAPFCDLR 1416
Query: 1373 SKLRVGL 1379
KL V L
Sbjct: 1417 FKLGVVL 1423
>gb|AAT85031.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1437
Score = 793 bits (2049), Expect = 0.0
Identities = 516/1446 (35%), Positives = 746/1446 (50%), Gaps = 118/1446 (8%)
Query: 17 ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
++ KL N +W Q+ A ++ L + PP+ + + V NP +
Sbjct: 18 VSEKLGKSNHAVWKAQILATIRGARLEGHLTGDDQPPAPILRRKEGEK---EVVVSNPEY 74
Query: 77 VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
+W A D+ V + LLSS+T++ L + + TA W+ + + S +AR + R L
Sbjct: 75 EEWVATDQQVLAYLLSSMTKDLLVQVATCRTAASAWSMIQGMFGSMTRARTINTRLSLST 134
Query: 137 MRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYA---NFSTGQLDQV 183
+++G ++++ Y K+R V +D+ + + GL + + G+ D V
Sbjct: 135 LQKGDMNITTYVGKMRALADDLMAVGKPVDDDELIGYIFAGLDDEFEPVISTIVGRPDPV 194
Query: 184 PLPLFTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGGHRGNSSSGSR 243
+ L E + S + + ++A +R + GS+S G +RG + +
Sbjct: 195 TIGETYAQLISFEQRLAHRRSGDQ--SSVNSASRSRGQPQRGGSRSGGDSNRGRGAPSNG 252
Query: 244 PRRDNGGSHCRGSYT---------PRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAG 294
R G + G P+CQLC K+GH C W R E+ FA G
Sbjct: 253 ANRGRGRGNPSGGRANVGGGTDNRPKCQLCYKRGHTVCDC---WYRYDENFVPDERFA-G 308
Query: 295 CSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS---RS 351
+++ ++ Y+DTGAT H+T L +LT Y GND+V +GAG+ I+HIG+ ++
Sbjct: 309 TAVSYGVDTNWYLDTGATDHVTGELDKLTVRDKYHGNDQVHTASGAGMEISHIGNSVVKT 368
Query: 352 ASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKG 411
S ++ L +VL VP+ NLVS KLT D+ F I++ L +G C KG
Sbjct: 369 PSRNLHLKDVLYVPKANKNLVSAYKLTSDNLAFIELYRKFFFIKDLAMRRTLLRGRCHKG 428
Query: 412 FYVLDQGSQALLATSS--SLPRASFELWHSRLDHVNFDIIKKLHK---LGCFNVSSILPK 466
Y L S + + SFE WHSRL H ++ +++K+ K L C +VS +
Sbjct: 429 LYALPSPSSHHHQVKQVYGVTKPSFERWHSRLGHPSYTVVEKVIKSQNLPCLDVSEQVS- 487
Query: 467 PICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQF 526
C +CQ AKS +L + + L+LV D+WGP+P SV Y+V F+DD+S+F
Sbjct: 488 --VCDACQKAKSHQLSFPKSTSESKYPLELVFSDVWGPAP-QSVGNNKYYVSFIDDYSKF 544
Query: 527 TWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRF 586
TW Y LK KS+ +D F++ VE F+R + Q+D G E+ K+ + F+ G+ H
Sbjct: 545 TWIYLLKYKSEVFDKFHEFQSLVERLFNRKIVAMQTDWGGEY--QKLHSFFNKVGITHHV 602
Query: 587 LCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQI 646
CPHT QNG ERKHRH++E+GLA+L +S + +W AF +AVY+INR PS+VL D
Sbjct: 603 SCPHTHQQNGSAERKHRHIVEVGLALLAYSSMPLKFWGEAFLSAVYLINRTPSRVLHDVS 662
Query: 647 PFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTT 706
P + L P Y FGC +P LRPY K+K R T C FLGY++ HKGFKC DP+T
Sbjct: 663 PLERLLGHKPDYNALRVFGCACWPNLRPYNKHKLQFRSTTCTFLGYSTLHKGFKCLDPST 722
Query: 707 SRTYVSRHAQFDEFCFPLTGSKSSPN-----DLVVFTFYEPAAGSPPSLQPVILVV--PE 759
R Y+SR FDE FP T K PN + E AA P LQ + V+ PE
Sbjct: 723 GRVYISRDVVFDETQFPFT--KLHPNVGAKLRAEIALVPELAASLPRGLQQISSVINTPE 780
Query: 760 SSPPTGPLPCPSCVDPDVQPVPVDDAPP-----SPVPHNDAPPLPAPTSP---------T 805
++ + + D AP +P + +PP+ P SP +
Sbjct: 781 NANVSNENMQQDSTYDNEPETETDGAPDTVSANAPAESSGSPPINEPASPFGESDSATAS 840
Query: 806 PPATPLVVTQAPP-----TSPPTTPPAVTQVPLAPVD-------------ARPRTRS*NG 847
P + P+ P +S P + P +D RPRTR +G
Sbjct: 841 PASAPVNSAPHPDAAASGSSAPRGSTSQGGTPSVAIDDPHPATTVTGQEAQRPRTRLQSG 900
Query: 848 IFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLV 907
I K V+ T L + P+ + A+++ +W AM+ E AL K TW LV
Sbjct: 901 IRKEK-----VYTDGTVKWGMLTSTGEPENLQDALQNNNWKCAMDAEYMALIKNNTWHLV 955
Query: 908 PRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRL 967
P NV+ K V++ K DG+++R K LVA+G Q G DY TFSPVVKA T+R+
Sbjct: 956 PPQQGRNVIDCKWVYKIKRKQDGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRI 1015
Query: 968 ILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGF-----VDLVSRL--MCVG*TRL 1020
ILS+ V L QLDV+NAFLHG L E VYM+QP G+ D V +L G +
Sbjct: 1016 ILSIAVSRGWCLRQLDVQNAFLHGVLEEEVYMKQPPGYENPSTPDYVCKLDKALYGLKQA 1075
Query: 1021 STASNRRLV-----LGFSA*ALFSCVLVFRVVGLIHPCSFFYKGHITLYLLVYVDDIILT 1075
A RL LGF + + F+ KG +T++LL+YVDDII+
Sbjct: 1076 PRAWYSRLSGKLHDLGFKGSKADTSLF------------FYNKGSLTIFLLIYVDDIIVV 1123
Query: 1076 GSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLE 1135
S ++ + L EFA+K LG L YFLG+E+T G+ + KYA DLL M +
Sbjct: 1124 SSRKEAVSALLQDLQKEFALKDLGDLHYFLGIEVTKIPGGILMSQEKYASDLLKRVNMSD 1183
Query: 1136 ASHVLTPLAAGSHLVSSGE---GYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQT 1192
V TPL+A L++ G +D T YRS+VGALQYLT+TR D++++VN V QFL
Sbjct: 1184 CKSVATPLSASEKLIAGKGTILGPNDATQYRSIVGALQYLTLTRLDIAFSVNKVCQFLHN 1243
Query: 1193 PTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLG 1252
PT +H+ AVKRI+RY+ GL ++SS V GYSDADWA C D RRST G+A++LG
Sbjct: 1244 PTTEHWAAVKRILRYIKQCTGLGLRICKSSSMIVSGYSDADWAGCLDDRRSTGGFAVYLG 1303
Query: 1253 YNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSA 1312
NL+SW+AKKQ +V+ SS E+EY+A+AN +E++W+ LL EL + A DN A
Sbjct: 1304 DNLVSWNAKKQATVSRSSTEAEYKALANATAEIMWVQTLLQELNIVSPAMAQLWCDNMGA 1363
Query: 1313 LFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLFR 1372
+++ NPV H KHI++D HFVRE V+ L V +V T+ Q+AD FTK LP + F+
Sbjct: 1364 KYLSFNPVFHARTKHIEVDYHFVRERVARKLLQVDYVSTNDQVADGFTKALPVKQLENFK 1423
Query: 1373 SKLRVG 1378
L +G
Sbjct: 1424 YNLNLG 1429
>dbj|BAB10876.1| polyprotein [Arabidopsis thaliana]
Length = 1429
Score = 790 bits (2041), Expect = 0.0
Identities = 511/1459 (35%), Positives = 752/1459 (51%), Gaps = 142/1459 (9%)
Query: 20 KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
KL S NFL+W +QV A+L +L VD + +P ++ H V +PNP + W
Sbjct: 6 KLTSTNFLMWRRQVHALLDGYDLAGYVDGSIE-------EPHTTVTVHGVTSPNPEYKLW 58
Query: 80 QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQLMRR 139
+ +D+ + S L+ +++ ATT+ +W L YA+ ++ + ++R++++ ++
Sbjct: 59 KRQDKLIYSGLIGAISVAVQPLLSQATTSAQIWRKLVDTYANPSRGHKQQIREQIKQWKK 118
Query: 140 GSLSVSEYG----------RKIR*SVANDDKVHWFLRGLGPSYANFSTGQLDQVP----L 185
GS S+ +Y + ++ ++D++ + L GL Y +DQ+
Sbjct: 119 GSRSIDDYVLGLTTRFDQLALLEEAIPHEDQIAYILGGLSDDYRRV----IDQIEGRDIS 174
Query: 186 PLFTDILWKVESHAI-FQASLEDYVTPPSA-AFHARNPTRSSGSQSSGGGHRGNS---SS 240
P T++ K+ + + QA + D TP +A A N ++SS G++ N +
Sbjct: 175 PSITELHEKLINFELKLQAMVPDSSTPVTANAASYNNNNNGRNNRSSSRGNQNNQWQQNQ 234
Query: 241 GSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGCSLNNS 300
+ R +N GS +G Y RCQ+C GH A +C +P + + S +G N
Sbjct: 235 TQQSRSNNRGSQGKG-YQGRCQICGVHGHSARRCSQF--QPYGGSGGSQSVPSGYPTNGY 291
Query: 301 NRSDK------------------YMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGL 342
+ S +D+GAT H+T L+ L+ Y+G + V + +G+GL
Sbjct: 292 SPSPMAPWQPRANIATAPPFNPWVLDSGATHHLTSDLANLSMHQPYTGGEEVTIADGSGL 351
Query: 343 HITHIGSR---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQT 399
I+H GS + S S+ L ++L VP ++ NL+SV +L + V F F +++ T
Sbjct: 352 PISHTGSALLPTPSRSLALKDILYVPNVSKNLISVYRLCNANQVSVEFFPAHFQVKDLNT 411
Query: 400 GAVLGKGPCDKGFYVLDQGSQAL-LATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCF 458
GA L +G Y +++ + T+S P+ WH RL H I+K +
Sbjct: 412 GARLLQGRTRNELYEWPVNQKSITILTASPSPKTDLSSWHQRLGHPALPILKDVVSHFHL 471
Query: 459 NVSSILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVI 518
+S+ +PK + C+ C + KS +L + N +S L+ ++ D+W SP SVD + Y+++
Sbjct: 472 PLSNTIPKQLPCSDCSINKSHKLPFFTNTIVSSQPLEYLYTDVW-TSPCISVDNYKYYLV 530
Query: 519 FVDDFSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFS 578
VD F+++TW YPLK+KS DV V FKA VEN+F ++ SDNG EF ++ +
Sbjct: 531 IVDHFTRYTWMYPLKQKSQVKDVFVAFKALVENRFQSRIRTLYSDNGGEFIG--LRPFLA 588
Query: 579 SSGVLHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVP 638
+ G+ H PHT NG ERKHRH++E GLA+L H+ + ++W +AF+TAVY+INR+P
Sbjct: 589 AHGISHLTSPPHTPEHNGLAERKHRHIVETGLALLTHASLPKTFWTYAFATAVYLINRMP 648
Query: 639 SKVLSDQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKG 698
++VL P+ LFQ++P Y FGC +P LRPY NK R T C+FLGY+
Sbjct: 649 TEVLQGTSPYVKLFQMSPNYLKLRVFGCLCYPWLRPYNTNKLEARSTMCVFLGYSLTQSA 708
Query: 699 FKCFDPTTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVP 758
+ C D T+R Y SRH QF E FP ++S D T +P + ++ P++ P
Sbjct: 709 YLCLDIATNRIYTSRHVQFVESSFPFASPRTSETDSTQ-TMSQP---TTTNVIPLLQRPP 764
Query: 759 ESSPPTGPLPCP----------SCVDPDVQPVPV-----------DD-----------AP 786
+PPT CP S P + VP+ DD P
Sbjct: 765 HIAPPTALPLCPIFHSPPHSPSSPASPPSEHVPLTAASSSSNAINDDNISSTGQVSVSGP 824
Query: 787 PSPVPH---NDAPPLPAPTSPTPPATPLVVTQAPPTSP--------PTTPPAVTQVPLAP 835
S PH + P SP P T PPTSP PT P PL P
Sbjct: 825 TSQSPHTTPTNQNTSPLSKSPNPTNTNQSQNSTPPTSPTTSVHQHSPTPSPLPQNPPLPP 884
Query: 836 V---DARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAME 892
D RTR+ N I KP ++ L T L ++ T+ P A+K P+W AM
Sbjct: 885 PPQNDHPMRTRAKNQITKPKTKFNLT----TSLTSSKPTI--PTTVAQALKDPNWRNAMS 938
Query: 893 DELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDY 952
+E++A K TW LV +V+ K +F K++ DG++ R K LVA+G Q G DY
Sbjct: 939 EEINAQMKNHTWDLVSPEEAKHVISCKWIFTLKYNVDGSIARYKARLVARGFNQQYGIDY 998
Query: 953 SLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL---- 1008
S TFSPV+K+TT+R +L + V + +HQ+D+ NAFL G L E VY+ QP GF+D
Sbjct: 999 SETFSPVIKSTTIRTVLEVAVKRNWSIHQVDINNAFLQGTLNEEVYVSQPPGFIDRDRPS 1058
Query: 1009 -VSRL--MCVG*TRLSTA---SNRRLVL--GFSA*ALFSCVLVFRVVGLIHPCSFFYKGH 1060
V RL G + A RR +L GF V L F Y H
Sbjct: 1059 HVCRLNKALYGLKQAPRAWYQELRRFLLQAGF-------------VNSLADASLFIYNRH 1105
Query: 1061 IT-LYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLG 1119
T +Y+LVYVDDII+ G + +L+ F A L F++K LG L YFLG+E T T+ GL L
Sbjct: 1106 NTFMYVLVYVDDIIIAGEN-ALVQAFNASLASRFSLKDLGPLSYFLGIEATRTSRGLHLM 1164
Query: 1120 HAKYAHDLLSHALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPD 1178
KY DLL ML+ V TP++ L + SG D T YR+++G+LQYL TRPD
Sbjct: 1165 QRKYITDLLKKHNMLDTKPVSTPMSPTPKLSLLSGTALDDATEYRTVLGSLQYLAFTRPD 1224
Query: 1179 LSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCT 1238
+++AVN +SQF+ PT +H+QA KRI+RY+ + G+ R + + +SDADW
Sbjct: 1225 IAFAVNRLSQFMHRPTNEHWQAAKRILRYLAGTKSHGIFLRSDTPLTIHAFSDADWGCDL 1284
Query: 1239 DHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVR 1298
D ST Y ++ G + +SWS+KKQ SVA SS E+EYRA+ANTASEL WL +LL E+ +
Sbjct: 1285 DAYLSTNAYIVYFGGSPVSWSSKKQRSVARSSTEAEYRAVANTASELRWLCSLLLEMGIS 1344
Query: 1299 LSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADI 1358
+ P+ DN A ++ NPV H KH+ LD HFVR + SG L V HV T QLAD
Sbjct: 1345 QTTVPVIYCDNIGATYLCANPVFHSRMKHVALDYHFVRGYIQSGALRVSHVSTKDQLADA 1404
Query: 1359 FTKVLPRPLFDLFRSKLRV 1377
TK LPRP F SK+ V
Sbjct: 1405 LTKPLPRPRFTELNSKIGV 1423
>gb|AAC02664.1| polyprotein [Arabidopsis thaliana]
Length = 1451
Score = 787 bits (2033), Expect = 0.0
Identities = 506/1449 (34%), Positives = 745/1449 (50%), Gaps = 112/1449 (7%)
Query: 14 VHMITV-KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAP 72
V+M V +L NF++WS+QV A+L +L VD + P+ P + VV
Sbjct: 24 VNMTNVTRLTDSNFVMWSRQVHALLDGYDLAGYVDGSIPIPT-----PTRTTADGVVTTN 78
Query: 73 NPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRD 132
N + W+ +D+ + S LL +++ A T+ ++W L S +A+ + A +LR
Sbjct: 79 ND-YTLWKRQDKLIYSALLGAISLSVQPLLSKANTSAEIWETLSSTFANPSWAHVQQLRQ 137
Query: 133 ELQLMRRGSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQ 182
+L+ +G+ S+ Y + + + ++++ L GL Y +
Sbjct: 138 QLKQWTKGTKSIVTYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIEGR 197
Query: 183 VPLPLFTDILWKVESHAIFQASLEDYVTPP---SAAFHARNPTRSSGSQSSGGGH-RGNS 238
P T++L K+ +H + A+ + + P +A + N ++ S+S+G + RGN+
Sbjct: 198 ENPPALTEVLEKLINHEVKLAAKAEATSVPVTANAVNYRGNNNNNNNSRSNGRNNSRGNT 257
Query: 239 SSGSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGC--- 295
S + N + Y +CQ+C GH A +CP A+ S A+
Sbjct: 258 SWQNSQSTSNRQQYTPRPYQGKCQICSVHGHSARRCPQLQQHAGSYASNQSSSASYAPWQ 317
Query: 296 ------SLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
S N + +D+GAT H+T +L+ L Y+G++ V + +G+GL I+H GS
Sbjct: 318 PRANMVSATPYNSGNWLLDSGATHHLTSNLNNLALHQPYNGDEEVTIADGSGLPISHSGS 377
Query: 350 R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
+ + S+ L +VL VP + NL+SV ++ + V F F +++ TGA L +G
Sbjct: 378 ALLPTPTRSLALKDVLYVPDIQKNLISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQG 437
Query: 407 PCDKGFYVLDQGSQALLATS---SSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSI 463
Y S +ATS S P+ WH+RL H + I+K L +S
Sbjct: 438 KTKNELYEWPVNSS--IATSMFASPTPKTDLPSWHARLGHPSLPILKALISKFSLPISHS 495
Query: 464 LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
L + C+ C + KS +L Y N +S L+ ++ D+W SP+ S+D + Y+++ VD +
Sbjct: 496 LQNQLLCSDCSINKSHKLPFYSNTIASSHPLEYLYTDVW-TSPITSIDNYKYYLVIVDHY 554
Query: 524 SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
+++TW YPL++KS ++ + F A VEN+F + SDNG EF +++ +S G+
Sbjct: 555 TRYTWLYPLRKKSQVREMFITFTALVENKFKFKIGTLYSDNGGEFI--AMRSFLASHGIS 612
Query: 584 HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
H PHT NG ERKHRH++E GL +L + + YW +AF+TAVY+INR+ + VL
Sbjct: 613 HMTTPPHTPELNGISERKHRHIVETGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLG 672
Query: 644 DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
++ P+ LF P Y FGC FP LRPY +K R PC+ LGY+ + C D
Sbjct: 673 NESPYVKLFGQPPNYLKLRIFGCLCFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLD 732
Query: 704 PTTSRTYVSRHAQFDEFCFPLTGSKSS---PND---------LVVFTFYEPAAGSPPS-- 749
T R Y SRH QF E FP + + S P+D + V P +PPS
Sbjct: 733 RATGRVYTSRHVQFAESSFPFSTTSPSVTPPSDPPLSQDTRPVSVPLLARPLTTAPPSSP 792
Query: 750 --------------LQPVILVVPE-SSPPTGPLPCPSCVD----------------PDVQ 778
L P + P S PT P+ PS + P +
Sbjct: 793 SCSAPHRSPSQSENLSPPAPLQPSLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLS 852
Query: 779 PVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDA 838
P P P SP + P P SP P +P +T +SP +PP P P+
Sbjct: 853 PQPQRPQPQSPQSTSPHSSSPQPNSPNPQHSPRSLTPTLTSSPSPSPPPNPNPP--PIQH 910
Query: 839 RPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSAL 898
RTRS N I KPNP++A + +PT L + PK A+ P+W AM DE++A
Sbjct: 911 TMRTRSKNNIVKPNPKFANLATKPTPLKPII-----PKTVVEALLDPNWRQAMCDEINAQ 965
Query: 899 HKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSP 958
+ T+ LVP NVVG K VF K+ S+G ++R K LVA+G Q G D+ TFSP
Sbjct: 966 TRNGTFDLVPPAPNQNVVGCKWVFTLKYLSNGVLDRYKARLVAKGFHQQYGHDFKETFSP 1025
Query: 959 VVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL-----VSRL- 1012
V+K+TTVR +L + V + Q+DV NAFL G L++ VY+ QP GFVD V RL
Sbjct: 1026 VIKSTTVRSVLHIAVSKGWSIRQIDVNNAFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLY 1085
Query: 1013 -MCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGH--ITLYLLVYV 1069
G + A + L S +L V + S F H LY+LVYV
Sbjct: 1086 KALYGLKQAPRAWYQE---------LRSYLLTQGFVNSVADTSLFTLRHERTILYVLVYV 1136
Query: 1070 DDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLS 1129
DD+++TGSD +++T FIA L F++K LG++ YFLG+E T T+ GL L +Y DLL
Sbjct: 1137 DDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATRTSKGLHLMQKRYVLDLLE 1196
Query: 1130 HALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQ 1188
ML A VLTP++ L ++SG+ P+ YR+++G+LQYL TRPD++YAVN +SQ
Sbjct: 1197 KTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAVLGSLQYLLFTRPDIAYAVNRLSQ 1256
Query: 1189 FLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYA 1248
++ PT H+QA KRI+RY+ G+ R + + YSDADWA D+ ST Y
Sbjct: 1257 YMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLTLHAYSDADWAGDIDNYNSTNAYI 1316
Query: 1249 IFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSD 1308
++LG N +SWS+KKQ VA SS E+EYRA+AN SE+ W+ +LL EL + LS+PP+ D
Sbjct: 1317 LYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEIRWVCSLLTELGITLSSPPVVYCD 1376
Query: 1309 NQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLF 1368
N A +++ NPV H KHI LD HFVRE V +G L V HV T QLAD TK LPR F
Sbjct: 1377 NVGATYLSANPVFHSRMKHIALDFHFVRESVQAGALRVTHVSTKDQLADALTKPLPRQPF 1436
Query: 1369 DLFRSKLRV 1377
SK+ V
Sbjct: 1437 TTLISKIGV 1445
>gb|AAC02666.1| polyprotein [Arabidopsis thaliana]
Length = 1451
Score = 783 bits (2022), Expect = 0.0
Identities = 504/1449 (34%), Positives = 743/1449 (50%), Gaps = 112/1449 (7%)
Query: 14 VHMITV-KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAP 72
V+M V +L NF++WS+QV A+L +L +D + P+ P + VV
Sbjct: 24 VNMTNVTRLTDSNFVMWSRQVHALLDGYDLAGYIDGSIPIPT-----PTRTTADGVVTTN 78
Query: 73 NPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRD 132
N + W+ +D+ + S LL +++ A T+ ++W L S +A+ + A +LR
Sbjct: 79 ND-YTLWKRQDKLIYSALLGAISLSVQPLLSKANTSAEIWETLSSTFANPSWAHVQQLRQ 137
Query: 133 ELQLMRRGSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQ 182
+L+ +G+ S+ Y + + + ++++ L GL Y +
Sbjct: 138 QLKQWTKGTKSIVTYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIEGR 197
Query: 183 VPLPLFTDILWKVESHAIFQASLEDYVTPP---SAAFHARNPTRSSGSQSSGGGH-RGNS 238
P T++L K+ +H + A+ + + P +A + N ++ S+S+G + RGN+
Sbjct: 198 ENPPALTEVLEKLINHEVKLAAKAEATSVPVTANAVNYRGNNNNNNNSRSNGRNNSRGNT 257
Query: 239 SSGSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGC--- 295
S + N + Y +CQ+C GH A +CP A+ S A+
Sbjct: 258 SWQNSQSTSNRQQYTPRPYQGKCQICSVHGHSARRCPQLQQHAGSYASNQSSSASYAPWQ 317
Query: 296 ------SLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
S N + +D+GAT H+T L+ L Y+G++ V + +G+GL I+H GS
Sbjct: 318 PRANMVSATPYNSGNWLLDSGATHHLTSDLNNLALHQPYNGDEEVTIADGSGLPISHSGS 377
Query: 350 R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
+ + S+ L +VL VP + NL+SV ++ + V F F +++ TGA L +G
Sbjct: 378 ALLPTPTRSLALKDVLYVPDIQKNLISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQG 437
Query: 407 PCDKGFYVLDQGSQALLATS---SSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSI 463
Y S +ATS S P+ WH+RL H + I+K L +S
Sbjct: 438 KTKNELYEWPVNSS--IATSMFASPTPKTDLPSWHARLGHPSLPILKALISKFSLPISHS 495
Query: 464 LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
L + C+ C + KS +L Y N +S L+ ++ D+W SP+ S+D + Y+++ VD +
Sbjct: 496 LQNQLLCSDCSINKSHKLPFYSNTIASSHPLEYLYTDVW-TSPITSIDNYKYYLVIVDHY 554
Query: 524 SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
+++TW YPL++KS ++ + F A VEN+F + SDNG EF +++ +S G+
Sbjct: 555 TRYTWLYPLRKKSQVREMFITFTALVENKFKFKIGTLYSDNGGEFI--AMRSFLASHGIS 612
Query: 584 HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
H PHT NG ERKHRH++E GL +L + + YW +AF+TAVY+INR+ + VL
Sbjct: 613 HMTTPPHTPELNGISERKHRHIVETGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLG 672
Query: 644 DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
++ P+ LF P Y FGC FP LRPY +K R PC+ LGY+ + C D
Sbjct: 673 NESPYVKLFGQPPNYLKLRIFGCLCFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLD 732
Query: 704 PTTSRTYVSRHAQFDEFCFPLTGSKSS---PND---------LVVFTFYEPAAGSPPS-- 749
T R Y SRH QF E FP + + S P+D + V P +PPS
Sbjct: 733 RATGRVYTSRHVQFAESSFPFSTTSPSVTPPSDPPLSQDTRPVSVPLLARPLTTAPPSSP 792
Query: 750 --------------LQPVILVVPE-SSPPTGPLPCPSCVD----------------PDVQ 778
L P + P S PT P+ PS + P +
Sbjct: 793 SCSAPHRSPSQSENLSPPAPLQPSLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLS 852
Query: 779 PVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDA 838
P P P SP + P P SP P +P +T +SP +PP P P+
Sbjct: 853 PQPQRPQPQSPQSTSPHSSSPQPNSPNPQHSPRSLTPTLTSSPSPSPPPNPNPP--PIQH 910
Query: 839 RPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSAL 898
RTRS N I KPNP++A + +PT L + PK A+ P+W AM DE++A
Sbjct: 911 TMRTRSKNNIVKPNPKFANLATKPTPLKPII-----PKTVVEALLDPNWRQAMCDEINAQ 965
Query: 899 HKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSP 958
+ T+ LVP NVVG K VF K+ S+G ++R K LVA+G Q G D+ TFSP
Sbjct: 966 TRNGTFDLVPPAPNQNVVGCKWVFTLKYLSNGVLDRYKARLVAKGFHQQYGHDFKETFSP 1025
Query: 959 VVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL-----VSRL- 1012
V+K+TTVR +L + V + Q+DV NAFL G L++ VY+ QP GFVD V RL
Sbjct: 1026 VIKSTTVRSVLHIAVSKGWSIRQIDVNNAFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLY 1085
Query: 1013 -MCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGH--ITLYLLVYV 1069
G + A + L S +L V + S F H LY+LVYV
Sbjct: 1086 KALYGLKQAPRAWYQE---------LRSYLLTQGFVNSVADTSLFTLRHERTILYVLVYV 1136
Query: 1070 DDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLS 1129
DD+++TGSD +++T FIA L F++K LG++ YFLG+E T T+ GL L +Y DLL
Sbjct: 1137 DDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATRTSKGLHLMQKRYVLDLLE 1196
Query: 1130 HALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQ 1188
ML A VLTP++ L ++SG+ P+ YR+++G+LQYL TRPD++YAVN +SQ
Sbjct: 1197 KTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAVLGSLQYLLFTRPDIAYAVNRLSQ 1256
Query: 1189 FLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYA 1248
++ PT H+QA KRI+RY+ G+ R + + YSDADWA D+ ST Y
Sbjct: 1257 YMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLTLHAYSDADWAGDIDNYNSTNAYI 1316
Query: 1249 IFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSD 1308
++LG N +SWS+KKQ VA SS E+EYRA+AN SE+ W+ +LL EL + LS+PP+ D
Sbjct: 1317 LYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEIRWVCSLLTELGITLSSPPVVYCD 1376
Query: 1309 NQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLF 1368
N A +++ NPV KHI LD HFVRE V +G L V HV T QLAD TK LPR F
Sbjct: 1377 NVGATYLSANPVFDSRMKHIALDFHFVRESVQAGALRVTHVSTKDQLADALTKPLPRQPF 1436
Query: 1369 DLFRSKLRV 1377
SK+ V
Sbjct: 1437 TTLISKIGV 1445
>gb|AAC02669.1| polyprotein [Arabidopsis thaliana]
Length = 1451
Score = 783 bits (2021), Expect = 0.0
Identities = 505/1449 (34%), Positives = 742/1449 (50%), Gaps = 112/1449 (7%)
Query: 14 VHMITV-KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAP 72
V+M V +L NF++WS+QV A+L +L VD + P+ P + VV
Sbjct: 24 VNMTNVTRLTDSNFVMWSRQVHALLDGYDLAGYVDGSIPIPT-----PTRTTADGVVTTN 78
Query: 73 NPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRD 132
N + W+ +D+ + S LL +++ A T+ ++W L S +A+ + A +LR
Sbjct: 79 ND-YTLWKRQDKLIYSALLGAISLSVQPLLSKANTSAEIWETLSSTFANPSWAHVQQLRQ 137
Query: 133 ELQLMRRGSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQ 182
+L+ +G+ S+ Y + + + ++++ L GL Y +
Sbjct: 138 QLKQWTKGTKSIVTYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIEGR 197
Query: 183 VPLPLFTDILWKVESHAIFQASLEDYVTPP---SAAFHARNPTRSSGSQSSGGGH-RGNS 238
P T++L K+ +H + A+ + + P +A + N ++ S+S+G + RGN+
Sbjct: 198 ENPPALTEVLEKLINHEVKLAAKAEATSVPVTANAVNYRGNNNNNNNSRSNGRNNSRGNT 257
Query: 239 SSGSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGC--- 295
S + N + Y +CQ+C GH A +CP A+ S A+
Sbjct: 258 SWQNSQSTSNRQQYTPRPYQGKCQICSVHGHSARRCPQLQQHAGSYASNQSSSASYAPWQ 317
Query: 296 ------SLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
S N + +D+GAT H+T L+ L Y+G++ V + +G+GL I+H GS
Sbjct: 318 PRANMVSATPYNSGNWLLDSGATHHLTSDLNNLALHQPYNGDEEVTIADGSGLPISHSGS 377
Query: 350 R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
+ + S+ L +VL VP + NL+SV ++ + V F F +++ TGA L +G
Sbjct: 378 ALLPTPTRSLALKDVLYVPDIQKNLISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQG 437
Query: 407 PCDKGFYVLDQGSQALLATS---SSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSI 463
Y S +ATS S P+ WH+RL H + I+K L +S
Sbjct: 438 KTKNELYEWPVNSS--IATSMFASPTPKTDLPSWHARLGHPSLPILKALISKFSLPISHS 495
Query: 464 LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
L + C+ C + KS +L Y N +S L+ ++ D+W SP+ S+D + Y+++ VD +
Sbjct: 496 LQNQLLCSDCSINKSHKLPFYSNTIASSHPLEYLYTDVW-TSPITSIDNYKYYLVIVDHY 554
Query: 524 SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
+++TW YPL++KS ++ + F A VEN+F + SDNG EF +++ +S G+
Sbjct: 555 TRYTWLYPLRKKSQVREMFITFTALVENKFKFKIGTLYSDNGGEFI--AMRSFLASHGIS 612
Query: 584 HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
H PHT NG ERKHRH++E GL +L + + YW +AF+TAVY+INR+ + VL
Sbjct: 613 HMTTPPHTPELNGISERKHRHIVETGLTLLSTASMPKEYWSYAFATAVYLINRMLTPVLG 672
Query: 644 DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
++ P+ LF P Y FGC FP LRPY +K R PC+ LGY+ + C D
Sbjct: 673 NESPYVKLFGQPPNYLKLRIFGCLCFPWLRPYTAHKLDNRSVPCVLLGYSLSQSAYLCLD 732
Query: 704 PTTSRTYVSRHAQFDEFCFPLTGSKSS---PND---------LVVFTFYEPAAGSPPS-- 749
T R Y SRH QF E FP + + S P+D + V P +PPS
Sbjct: 733 RATGRVYTSRHVQFAESSFPFSTTSPSVTPPSDPPLSQDTRPVSVPLLARPLTTAPPSSP 792
Query: 750 --------------LQPVILVVPE-SSPPTGPLPCPSCVD----------------PDVQ 778
L P + P S PT P+ PS + P +
Sbjct: 793 SCSAPHRSPSQSENLSPPAPLQPSLSLSPTSPITSPSLSEESLVGHNSETGPTGSSPPLS 852
Query: 779 PVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDA 838
P P P SP + P P SP P +P +T +SP +PP P P+
Sbjct: 853 PQPQRPQPQSPQSTSPHSSSPQPNSPNPQHSPRSLTPTLTSSPSPSPPPNPNPP--PIQH 910
Query: 839 RPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSAL 898
RTRS N I KPNP++A + +PT L + PK A+ P+W AM DE++A
Sbjct: 911 TMRTRSKNNIVKPNPKFANLATKPTPLKPII-----PKTVVEALLDPNWRQAMCDEINAQ 965
Query: 899 HKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSP 958
+ T+ LVP NVVG K VF K+ S+G ++R K LVA+G Q G D+ TFSP
Sbjct: 966 TRNGTFDLVPPAPNQNVVGCKWVFTLKYLSNGVLDRYKARLVAKGFHQQYGHDFKETFSP 1025
Query: 959 VVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL-----VSRL- 1012
V+K TTVR +L + V + Q+DV NAFL G L++ VY+ QP GFVD V RL
Sbjct: 1026 VIKLTTVRSVLHIAVSKGWSIRQIDVNNAFLQGTLSDEVYVTQPPGFVDKDNAHHVCRLY 1085
Query: 1013 -MCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGH--ITLYLLVYV 1069
G + A + L S +L V + S F H LY+LVYV
Sbjct: 1086 KALYGLKQAPRAWYQE---------LRSYLLTQGFVNSVADTSLFTLRHERTILYVLVYV 1136
Query: 1070 DDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLS 1129
DD+++TGSD +++T FIA L F++K LG++ YFLG+E T T+ GL L +Y DLL
Sbjct: 1137 DDMLITGSDTNIITRFIANLAARFSLKDLGEMSYFLGIEATRTSKGLHLMQKRYVLDLLE 1196
Query: 1130 HALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQ 1188
ML A VLTP++ L ++SG+ P+ YR+++G+LQYL TRPD++YAVN +SQ
Sbjct: 1197 KTNMLAAHPVLTPMSPTPKLSLTSGKPLDKPSEYRAVLGSLQYLLFTRPDIAYAVNRLSQ 1256
Query: 1189 FLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYA 1248
++ PT H+QA KRI+RY+ G+ R + + YSDADWA D+ ST Y
Sbjct: 1257 YMHCPTDLHWQAAKRILRYLAGTPSHGIFIRADTPLTLHAYSDADWAGDIDNYNSTNAYI 1316
Query: 1249 IFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSD 1308
++LG N +SWS+KKQ VA SS E+EYRA+AN SE+ W+ +LL EL + LS+PP+ D
Sbjct: 1317 LYLGSNPISWSSKKQKGVARSSTEAEYRAVANATSEIRWVCSLLTELGITLSSPPVVYCD 1376
Query: 1309 NQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLF 1368
N A +++ NPV KHI LD HFVRE V +G L V HV T QLAD TK LPR F
Sbjct: 1377 NVGATYLSANPVFDSRMKHIALDFHFVRESVQAGALRVTHVSTKDQLADALTKPLPRQPF 1436
Query: 1369 DLFRSKLRV 1377
SK+ V
Sbjct: 1437 TTLISKIGV 1445
>gb|AAC67200.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301693|pir||F84480 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1402
Score = 775 bits (2000), Expect = 0.0
Identities = 491/1401 (35%), Positives = 708/1401 (50%), Gaps = 115/1401 (8%)
Query: 6 YSFLAYTWVHMITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSAL 65
YS + + +TV L + N++LW Q + L Q L V + PS+ + VS
Sbjct: 4 YSVPSLNISNCVTVTLTAKNYILWKSQFESFLDGQGLLGFVTGSIPAPSQTSV--VSDID 61
Query: 66 HHVVPAPNPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKA 125
+PNP + W DR V S LL S E+ LS ++ T+ +VW ++ + + + +
Sbjct: 62 GSTSASPNPEYYTWFKTDRVVKSWLLGSFLEDILSVVVNCNTSHEVWISVANHFNRVSSS 121
Query: 126 REMRLRDELQLMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANF 175
R L+ LQ + + S+ EY + ++ V K+ L GLG Y
Sbjct: 122 RLFELQRRLQNVSKRDKSMDEYLKDLKTICDQLASVGSPVTEKMKIFAALNGLGREYEPI 181
Query: 176 ST---GQLDQVPLPLFTDILWKVESHAI-FQASLEDYVTPPSAAFHARNPTRSSGS---- 227
T +D +P P D++ K+ + Q LE+ P AF+ S+ S
Sbjct: 182 KTTIENSMDALPGPSLEDVIPKLTGYDDRLQGYLEETAVSPHVAFNITTSDDSNASGYFN 241
Query: 228 ---QSSGGGHRGNSSSGSRPR---------RDNGGSHCRGSYTPRCQLCRKQGHYAAKCP 275
+ G +RG +S +R R + GS G+ CQ+C K GH A KC
Sbjct: 242 AYNRGKGKSNRGRNSFSTRGRGFHQQISSTNSSSGSQSGGTSVV-CQICGKMGHPALKCW 300
Query: 276 VRWDRPSESANLTHSFAAG--CSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDR 333
R++ + L + AA + + + ++ D+ AT+H+T+S L S Y G+D
Sbjct: 301 HRFNNSYQYEELPRALAAMRITDITDQHGNEWLPDSAATAHVTNSPRSLQQSQPYHGSDA 360
Query: 334 VLVGNGAGLHITHIGSR---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEAD 390
V+V +G L ITH GS S+S +VPL++VLV P +T +L+SVSKLT+D+ F++D
Sbjct: 361 VMVADGNFLPITHTGSTNLASSSGNVPLTDVLVCPSITKSLLSVSKLTQDYPCTVEFDSD 420
Query: 391 SFVIQNRQTGAVLGKGPCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIK 450
I ++ T +L G G Y L SQ S+ AS E+WH RL H + +++
Sbjct: 421 GVRINDKATKKLLIMGSTCDGLYCLKDDSQFKAFFSTRQQSASDEVWHRRLGHPHPQVLQ 480
Query: 451 KLHKLGCFNVSSILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASV 510
+L K +++ C +CQ+ KS RL + ++ L+ VHCDLWGPSP+ SV
Sbjct: 481 QLVKTNSISINKTSKS--LCEACQLGKSTRLPFVSSSFTSNRPLERVHCDLWGPSPITSV 538
Query: 511 DGFSYFVIFVDDFSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTN 570
GF Y+ +F+D +S+F+W YPLK KSDFY++ V F VENQ + + VFQ D G EF N
Sbjct: 539 QGFRYYAVFIDHYSRFSWIYPLKLKSDFYNIFVAFHKLVENQLNHKISVFQCDGGGEFVN 598
Query: 571 NKVQALFSSSGVLHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTA 630
+K + G+ PHT QNG ERKHRH++ELGL+ML+ S V +WV AF TA
Sbjct: 599 HKFLQHLQNHGIQQHISYPHTPQQNGLAERKHRHLVELGLSMLFQSKVPLKFWVEAFFTA 658
Query: 631 VYIINRVPSKVLSDQI-PFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIF 689
++IN +P+ + D I P++ L Q P Y FGC FP +R Y NKF PR C+F
Sbjct: 659 NFLINLLPTSAVEDAISPYEKLHQTTPDYTALRSFGCACFPTMRDYAMNKFDPRSLKCVF 718
Query: 690 LGYNSHHKGFKCFDPTTSRTYVSRHAQFDEFCFP-------LTGSKSSPNDLVVFTFYEP 742
LGYN +KG++C P T R Y+SRH FDE +P L ++P F +E
Sbjct: 719 LGYNDKYKGYRCLYPPTGRVYISRHVIFDETAYPFSHHYKHLHSQPTTPLLAAWFKGFES 778
Query: 743 AAG-SPPSLQPVILVVPESSPPTGPLPCPSCVDPDVQPVPVDDAPPSP-----VPHNDAP 796
+ +PP + P +++ PT PL D PP P + N A
Sbjct: 779 SVSQAPPKVSPAQPPQRKATLPTPPL------------FTAADFPPLPRRSPQLSQNSAA 826
Query: 797 PL---PAPTSPTPPATPLVVTQA------------------------PPTSPPTTPPAVT 829
L P+ T+ P VV ++ P T
Sbjct: 827 ALVSQPSTTTINSTHPPAVVNESSERTINFDSASIGDSSHSSQLLVDDTVEDLMAAPVPT 886
Query: 830 QVPLAPVDARPR-TRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWL 888
Q P + P TR+ GI KPNPRY L+ T PK +A+KHP W
Sbjct: 887 QQAPPPTNTHPMITRAKVGITKPNPRYV--------FLSHKVTYPEPKTVTAALKHPGWT 938
Query: 889 PAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIP 948
AM +E+ + TW+LVP +V+GSK VFRTK H+DGT+ +LK +VA+ Q
Sbjct: 939 GAMTEEMGNCSETNTWSLVPYTPNMHVLGSKWVFRTKLHADGTLNKLKARIVAKCFLQEE 998
Query: 949 GFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL 1008
G Y T+SPVV+ TV+L+L L + +L Q+DVKNAFLHG L ETVYM QP GFVD
Sbjct: 999 GIGYLETYSPVVRTPTVQLVLHLATALNWELKQMDVKNAFLHGDLNETVYMTQPAGFVDK 1058
Query: 1009 VSRLMCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRV-----VGLIHPCSFFYKGHITL 1063
T + L S A F F + P F Y + L
Sbjct: 1059 SKP------THVCLLHKSIYGLKQSPRAWFDKFSTFLLEFGFFCSKSDPSLFIYAHNNNL 1112
Query: 1064 -YLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAK 1122
LL+YVDD+++TG+ L+ +A LN EF + +G+L YFLG+++ GLF+ K
Sbjct: 1113 ILLLLYVDDMVITGNSSQTLSSLLAALNKEFRMTDMGQLHYFLGIQVQRNQHGLFMSQQK 1172
Query: 1123 YAHDLLSHALMLEASHVLTPLAAG-SHLVSSGEGYSDPTHYRSLVGALQYLTITRPDLSY 1181
YA DLL + M + + TPL + E ++DPT++RS+ G LQYLT+TRPD+ +
Sbjct: 1173 YAEDLLVASAMENCTPLPTPLPVQLDRVPHQEEPFTDPTYFRSIAGKLQYLTLTRPDIHF 1232
Query: 1182 AVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHR 1241
AVN V Q + PT+ F +KRI+RY+ G+++ + S + YSD+DW C R
Sbjct: 1233 AVNFVCQKMHQPTMSDFHLLKRILRYIKGTITMGISYNQNSPTLLQAYSDSDWGNCKLTR 1292
Query: 1242 RSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSA 1301
RS G F+ NL+SWS+KK P+V+ SS E+EYR +++ ASE++WL LL EL + L
Sbjct: 1293 RSVGGLCTFMATNLVSWSSKKHPTVSRSSTEAEYRTLSDAASEILWLSTLLRELGIPLPD 1352
Query: 1302 PPLFLSDNQSALFMAQNPVAH 1322
P DN SA++ NP H
Sbjct: 1353 TPELFCDNLSAVYHTANPAFH 1373
>emb|CAB81478.1| putative protein [Arabidopsis thaliana] gi|4972079|emb|CAB43904.1|
putative protein [Arabidopsis thaliana]
gi|7444467|pir||T08945 hypothetical protein F25O24.20 -
Arabidopsis thaliana
Length = 1415
Score = 760 bits (1962), Expect = 0.0
Identities = 500/1420 (35%), Positives = 716/1420 (50%), Gaps = 154/1420 (10%)
Query: 10 AYTWVHMITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVV 69
A + H +T+KL++ N+LLW Q L NQ L V A P + + + V
Sbjct: 10 ALCFSHYVTLKLSTANYLLWKIQFETWLNNQRLLGFVTG--ANPCPNATRSIRNG-DQVT 66
Query: 70 PAPNPLFVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMR 129
A NP F+ W D+ + LL SL+E+AL T+R+VW +L Y + +R+
Sbjct: 67 EATNPDFLTWVQNDQKIMGWLLGSLSEDALRSVYGLHTSREVWFSLAKKYNRVSASRKSD 126
Query: 130 LRDELQLMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFST-- 177
L+ L + + S+ EY ++ V ++K+ L GLG Y ST
Sbjct: 127 LQRRLNPVSKNEKSMLEYLNCVKQICDQLDSIGCPVPENEKIFGVLNGLGQEYMLVSTMI 186
Query: 178 -GQLDQVPLPLFTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGGHRG 236
G +D P+ F D+++K+ + F L++ QS G R
Sbjct: 187 KGSMDTYPMS-FEDVVFKLIN---FDDKLQN-------------------GQSGGNRGRN 223
Query: 237 NSSSGSR--PRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAG 294
N ++ R P++ + GS P CQ+C K GH A KC R+D +S + + +FAA
Sbjct: 224 NYTTKGRGFPQQISSGSPSDSGTRPTCQICNKYGHSAYKCWKRFDHAFQSEDFSKAFAA- 282
Query: 295 CSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGSR---S 351
+++ + D+GATSH+T+S SQL + YSG D V+VGN L ITHIGS S
Sbjct: 283 MRVSDQKSNPWVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHIGSAVLTS 342
Query: 352 ASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKG 411
++PL +VLV P +T +L+SVSKLT D+ F++D +++++ T +L KG
Sbjct: 343 NQGNLPLRDVLVCPNITKSLLSVSKLTSDYPCVIEFDSDGVIVKDKLTKQLLTKGTRHND 402
Query: 412 FYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILPKPICCT 471
Y+L+ + + SS S E+WH RL H N D++++L + +S C
Sbjct: 403 LYLLEN-PKFMACYSSRQQATSDEVWHMRLGHPNQDVLQQLLRNKAIVISKTSHS--LCD 459
Query: 472 SCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQFTWFYP 531
+CQM K +L + +S +L+ VHCDLWGP+PV S GF Y+VIF+D++S+FTWFYP
Sbjct: 460 ACQMGKICKLPFASSDFVSSRLLERVHCDLWGPAPVVSSQGFRYYVIFIDNYSRFTWFYP 519
Query: 532 LKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRFLCPHT 591
L+ KSDF+ V + F+ VENQ + + FQ D G EF +N+ + + G+ CP+T
Sbjct: 520 LRLKSDFFSVFLTFQKMVENQCQQKIASFQCDGGGEFISNQFVSHLAECGIRQLISCPYT 579
Query: 592 QAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQ-IPFQL 650
QNG ERKHRH+ ELG +M++ V WV AF T+ ++ N +PS VL DQ P+++
Sbjct: 580 PQQNGIAERKHRHITELGSSMMFQGKVPQFLWVEAFYTSNFLCNLLPSSVLKDQKSPYEV 639
Query: 651 LFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTTSRTY 710
L AP Y + FGC +P LRPY NKF P+ C+F GYN +KG+KCF P T + Y
Sbjct: 640 LMGKAPVYTSLRVFGCACYPNLRPYASNKFDPKSLLCVFTGYNEKYKGYKCFHPPTGKIY 699
Query: 711 VSRHAQFDE----FCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVP-------- 758
++RH FDE F + S N +V + P +L +
Sbjct: 700 INRHVLFDESKFLFSDIYSDKVSGTNSTLVSAWQSNFLPKSIPATPEVLDISNTAASFSD 759
Query: 759 ---ESSPPTGPLPCPSCVDPDVQPVPVDDAPPSPVPHNDAPPLPAPTSPTPPATPLVVTQ 815
E S G C D D P+ + P SPV ++P P S ++
Sbjct: 760 EQGEFSGAVGGGGCGCTADLDSVPIG-NSLPSSPVTQQNSPQPETPISSAGSGNDAEDSE 818
Query: 816 APPTSPPTT----PPAVTQVPLAP---VDARPR-TRS*NGIFKPNPRYALVHAQPTGLLT 867
S + A T+ A + P TRS +GIFKPNP+YA + T
Sbjct: 819 LSENSENSESSVFSEATTETEAADNTNDQSHPMITRSKSGIFKPNPKYA--------MFT 870
Query: 868 ALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFH 927
PK K+A+K P W AM +E + + TW LVP +G + VF+TK
Sbjct: 871 VKSNYPVPKTVKTALKDPGWTDAMGEEYDSFEETHTWDLVPPDSFITPLGCRWVFKTKLK 930
Query: 928 SDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNA 987
+DGT++RLK LVA+G Q G DY T+SPVV+ TVR IL + +N ++ QLDVKNA
Sbjct: 931 ADGTLDRLKARLVAKGYEQEEGVDYMETYSPVVRTATVRTILHVATINKWEIKQLDVKNA 990
Query: 988 FLHGHLTETVYMEQPHGF-----VDLVSRLMCVG*TRLSTASNRRLVLGFSA*ALFSCVL 1042
FLHG L ETVYM QP GF D V +L + + F + F
Sbjct: 991 FLHGDLKETVYMYQPPGFENQDRPDYVCKL-----NKAIYGLKQAPRAWFDKFSTFLLEF 1045
Query: 1043 VFRVVGLIHPCSF-FYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKL 1101
F + P F F KG ++LL+Y+DD++LTG+
Sbjct: 1046 GF-ICTYSDPSLFVFLKGRDLMFLLLYMDDMLLTGN------------------------ 1080
Query: 1102 GYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLV-SSGEGYSDPT 1160
+ KYA DLL A M + + + TPL V E ++DPT
Sbjct: 1081 ------------------NKKYAMDLLVAAGMADCAPMPTPLPLQLDKVPGQQESFADPT 1122
Query: 1161 HYRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRR 1220
++RSL AVN V Q + +PTV F +KR++RY+ GL
Sbjct: 1123 YFRSL----------------AVNLVCQKMHSPTVADFNLLKRVLRYLKGKVQMGLNLHN 1166
Query: 1221 TSSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMAN 1280
+ + YSD+DWA C + RRS G+ FLG N++SWSAK+ P+V+ SS E+EYR ++
Sbjct: 1167 NTDITLRAYSDSDWANCKETRRSVGGFCTFLGTNIISWSAKRHPTVSRSSTEAEYRTLSI 1226
Query: 1281 TASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVS 1340
A+E+ W+ +LL E+ + APP DN SA+++ NP H +K D+D H+VRE V+
Sbjct: 1227 AATEVKWISSLLREIGIYQPAPPELYCDNLSAVYLTANPAMHNRSKAFDVDFHYVRERVA 1286
Query: 1341 SGRLAVRHVPTSLQLADIFTKVLP-RPLFDLFRSKLRVGL 1379
G L V+HVP S QLADIFTK LP RP FDL R KL V L
Sbjct: 1287 LGALVVKHVPASHQLADIFTKSLPQRPFFDL-RYKLGVVL 1325
>gb|AAK43485.1| polyprotein, putative [Arabidopsis thaliana]
Length = 1459
Score = 758 bits (1956), Expect = 0.0
Identities = 508/1459 (34%), Positives = 730/1459 (49%), Gaps = 138/1459 (9%)
Query: 20 KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
KL S N+L+WS Q+ A+L +L +D V P P ++ ++ VV A NP F W
Sbjct: 32 KLTSTNYLMWSIQIHALLDGYDLAGYLDNSVVIP------PETTTINSVVSA-NPSFTLW 84
Query: 80 QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQLMRR 139
+ +D+ + S L+ +++ S AT + +W+ L++ YA + +LR ++Q + +
Sbjct: 85 KRQDKLIFSALIGAISPAVQSLVSRATNSSQIWSTLNNTYAKPSYGHIKQLRQQIQRLTK 144
Query: 140 GSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQVPLPLFT 189
G+ ++ EY + + + ++++V L+GL Y + P T
Sbjct: 145 GTKTIDEYVQSHTTRLDQLAILGKPMEHEEQVEHILKGLPEEYKTVVDQIEGKDNTPTIT 204
Query: 190 DILWKVESHAIFQASLEDYVTPPSAAF-------HARNPTRSSGSQS-----SGGGHRGN 237
+I ++ +H ++ L PPS++F RN + G H N
Sbjct: 205 EIHERLINH---ESKLLSDEVPPSSSFPMSANAVQQRNFNNNCNQNQHKNRYQGNTHNNN 261
Query: 238 SSSGSRPRRDN-GGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSF----- 291
+++ S+P N G Y +CQ+C QGH A +CP +++ HS
Sbjct: 262 TNTNSQPSTYNKSGQRTFKPYLGKCQICSVQGHSARRCPQLQAMQLPASSSAHSPFTPWQ 321
Query: 292 -AAGCSLNNSNRSDKYM-DTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS 349
A ++ + ++ ++ D+GAT H+T L+ L+ Y+G + V++ +G GL I GS
Sbjct: 322 PRANLAIGSPYAANPWLLDSGATHHITSDLNALSLHQPYNGGEYVMIADGTGLTIKQTGS 381
Query: 350 R---SASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKG 406
S + + L VL VP + NL+SV +L + V F SF +++ TG +L +G
Sbjct: 382 TFLPSQNRDLALHKVLYVPDIRKNLISVYRLCNTNQVSVEFFPASFQVKDLNTGTLLLQG 441
Query: 407 PCDKGFY---VLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSI 463
Y V + + AL + S P+ + WHSRL H + I+ L VS
Sbjct: 442 RTKDDLYEWPVTNPPATALFTSPS--PKTTLSSWHSRLGHPSASILNTLLSKFSLPVSVA 499
Query: 464 LPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDF 523
C+ C + KS +L + +S+ L+ + D+W SP+ S D + Y+++ VD +
Sbjct: 500 SSNKTSCSDCLINKSHKLPFATSSIHSSSPLEYIFTDVW-TSPIISHDNYKYYLVLVDHY 558
Query: 524 SQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVL 583
+++TW YPL++KS + FKA VEN+F ++ SDNG EF ++ S+G+
Sbjct: 559 TRYTWLYPLQQKSQVKATFIAFKALVENRFQAKIRTLYSDNGGEFI--ALRDFLVSNGIS 616
Query: 584 HRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLS 643
H PHT NG ERKHRH++E GL +L + V YW +AF+TAVY+INR+P+ VL
Sbjct: 617 HLTSPPHTPEHNGLSERKHRHIVETGLTLLTQASVPREYWTYAFATAVYLINRMPTPVLC 676
Query: 644 DQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFD 703
Q PFQ LF +P Y FGC FP LRPY +NK R C+FLGY+ + C D
Sbjct: 677 LQSPFQKLFGSSPNYQRLRVFGCLCFPWLRPYTRNKLEERSKRCVFLGYSLTQTAYLCLD 736
Query: 704 PTTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPS-----LQPVILVVP 758
+R Y SRH FDE +P S + + T E ++ S P+ + L P
Sbjct: 737 VDNNRLYTSRHVMFDESTYPFAASIREQSQSSLVTPPESSSSSSPANSGFPCSVLRLQSP 796
Query: 759 ESSPPTGPLPCPSCVDPDVQPVPVDDAPPS---------------------PVPHNDAP- 796
+S P P P D V P PS PH + P
Sbjct: 797 PASSPETPSPPQQQNDSPVSPRQTGSPTPSHHSQVRDSTLSPSPSVSNSEPTAPHENGPE 856
Query: 797 -------------PLPAPTSPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPV------- 836
PLP P T P++ + Q P TT Q +A
Sbjct: 857 PEAQSNPNSPFIGPLPNPNPETNPSSS--IEQRPVDKSTTTALPPNQTTIAATSNSRSQP 914
Query: 837 ---DARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMED 893
+ + +TRS N I KP + +L A L+ +TVT A+K W AM D
Sbjct: 915 PKNNHQMKTRSKNNITKPKTKTSLTVALTQPHLSEPNTVT------QALKDKKWRFAMSD 968
Query: 894 ELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYS 953
E A + TW LVP T ++VG + VF+ K+ +G +++ K LVA+G Q G DY+
Sbjct: 969 EFDAQQRNHTWDLVPPNPTQHLVGCRWVFKLKYLPNGLIDKYKARLVAKGFNQQYGVDYA 1028
Query: 954 LTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL----- 1008
TFSPV+KATT+R++L + V + L QLDV NAFL G LTE VYM QP GFVD
Sbjct: 1029 ETFSPVIKATTIRVVLDVAVKKNWPLKQLDVNNAFLQGTLTEEVYMAQPPGFVDKDRPSH 1088
Query: 1009 VSRLMCVG*TRLSTASNRRLVLGFSA------*ALFSCVLVFRVVGLIHPCSFFYKGHIT 1062
V RL R+ + G L +L V + S F H T
Sbjct: 1089 VCRL-------------RKAIYGLKQAPRAWYMELKQHLLNIGFVNSLADTSLFIYSHGT 1135
Query: 1063 --LYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGH 1120
LYLLVYVDDII+TGSD ++ ++ L + F+IK L YFLG+E T T GL L
Sbjct: 1136 TLLYLLVYVDDIIVTGSDHKSVSAVLSSLAERFSIKDPTDLHYFLGIEATRTNTGLHLMQ 1195
Query: 1121 AKYAHDLLSHALMLEASHVLTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDL 1179
KY DLL+ ML+A V TPL L + G +D + YRS+VG+LQYL TRPD+
Sbjct: 1196 RKYMTDLLAKHNMLDAKPVATPLPTSPKLTLHGGTKLNDASEYRSVVGSLQYLAFTRPDI 1255
Query: 1180 SYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVL-GYSDADWARCT 1238
++AVN +SQF+ PT DH+QA KR++RY+ G+ F +SSP L +SDADWA +
Sbjct: 1256 AFAVNRLSQFMHQPTSDHWQAAKRVLRYLAGTTTHGI-FLNSSSPIHLHAFSDADWAGDS 1314
Query: 1239 DHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVR 1298
ST Y I+LG N +SWS+KKQ V+ SS ESEYRA+AN ASE+ WL +LL EL +R
Sbjct: 1315 ADYVSTNAYVIYLGRNPISWSSKKQRGVSRSSTESEYRAVANAASEIRWLCSLLTELHIR 1374
Query: 1299 LSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADI 1358
L P DN A ++ NPV H KHI LD HFVR ++ S L V HV T+ QLAD
Sbjct: 1375 LPHGPTIFCDNIGATYICANPVFHSRMKHIALDYHFVRGMIQSRALRVSHVSTNDQLADA 1434
Query: 1359 FTKVLPRPLFDLFRSKLRV 1377
TK L RP F RSK+ V
Sbjct: 1435 LTKSLSRPHFLSARSKIGV 1453
>gb|AAU43956.1| unknown protein [Oryza sativa (japonica cultivar-group)]
gi|52353503|gb|AAU44069.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1447
Score = 757 bits (1954), Expect = 0.0
Identities = 509/1467 (34%), Positives = 721/1467 (48%), Gaps = 144/1467 (9%)
Query: 17 ITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLF 76
I+ KL+ N LW QV A ++ L + P+ + V NP F
Sbjct: 18 ISEKLSKSNHALWKAQVMAAVRGARLEGHLTGATKTPNALITTTAGDKGEKEVTVRNPEF 77
Query: 77 VQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQL 136
W A D+ V LLS+L + L++ + TA W L Y+S +AR + R L
Sbjct: 78 DDWVATDQQVLGFLLSTLARDVLAQVATCGTAAAAWQMLEEMYSSVTRARFINTRIALSN 137
Query: 137 MRRGSLSVSEYGRKIR*S----------VANDDKVHWFLRGLGPSYANFSTGQL--DQVP 184
++G+LS++EY K++ V +DD + + + GL +Y + + D +
Sbjct: 138 TKKGTLSINEYVSKMKALADEMTAAGKIVDDDDLISYIIAGLDDTYEPVISTIVGKDTMT 197
Query: 185 LPLFTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGGHRGNSSSGS-- 242
L L E + + V + + G+ +GG RG +++G+
Sbjct: 198 LGEAYSQLLSFEQRLALRHGGDSSVNLANRGRGGGGGQQRGGNTGNGGRGRGGNNNGANR 257
Query: 243 -RPRRDNGGSHCRGSYT--PRCQLCRKQGHYAAKCPVRWDR---PSESANLTHSFAAGCS 296
R R +NGG+ G P+CQLC K+GH C R+D P E AG +
Sbjct: 258 GRGRGNNGGARPPGGVDNRPKCQLCYKRGHTVINCWYRYDEDFVPDEKY-------AGSA 310
Query: 297 LNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGS---RSAS 353
+ ++ Y+DT AT H+T L +LT Y G D+V +GAG+ I+HIG R+ +
Sbjct: 311 TSYGIDTNWYVDTSATDHVTGELDKLTVRDRYKGQDQVHTASGAGMEISHIGHSTVRTPN 370
Query: 354 HSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKGFY 413
+ L N+L VP NLVS ++L D++ + F +++ T +L +GPC Y
Sbjct: 371 RDIHLRNILYVPNANKNLVSANRLVSDNSAYMELYSKYFNLKDLATKKLLFRGPCRGRLY 430
Query: 414 VLDQGSQ----ALLATSSSLPRASFELWHSRLDHVNFDIIKKL---HKLGCFNVSSILPK 466
L S L + + SFE WHSRL H I++K+ + L C S+ K
Sbjct: 431 ALPSSSPHERPRPLKEAFGAIKPSFERWHSRLGHPASPIVEKVISKNNLPCLAESN---K 487
Query: 467 PICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQF 526
C +CQ KS +L + +S L+L++ D+WGP+ + SV G Y+V F+ D+S+F
Sbjct: 488 QSVCDACQQGKSHQLPYSRSSSMSSHPLELIYSDVWGPA-LTSVGGKQYYVSFIGDYSKF 546
Query: 527 TWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRF 586
TW Y +K KS+ F+A VE F+R + QSD G E+ K+ + F+ G+ H
Sbjct: 547 TWLYLIKHKSEVIQKFHEFQALVERLFNRKIIAMQSDWGGEY--EKLHSFFTKIGITHHV 604
Query: 587 LCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQI 646
CPHT QNG ERKHRH++E+GL +L +S + +W AF AVY+INR P+K+L
Sbjct: 605 SCPHTHQQNGSAERKHRHIVEVGLTLLAYSSMPLKFWDEAFQAAVYLINRTPTKLLQFLT 664
Query: 647 PFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTT 706
P + LF P Y++ FGC +P LRPY +K R C FLGY++ HKGFKC DP+T
Sbjct: 665 PLEHLFNQTPDYSSLRVFGCACWPHLRPYNTHKLQFRSKQCTFLGYSTLHKGFKCLDPST 724
Query: 707 SRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPTGP 766
R Y+SR FDE FP + T S L P L+ P S+P
Sbjct: 725 GRVYISRDVIFDETNFPFA---------KLHTNAGARLRSEILLLPSHLLNPTSNPGEQQ 775
Query: 767 L----------PCPSCVDPDVQPVPVD--------------------DAPPSPVPHNDAP 796
L P +VQPV + D SP + A
Sbjct: 776 LDDNMANIPVNPANQIFGSNVQPVSAENDEADDSSGATENLAEQIHQDTAASPSASDTAA 835
Query: 797 PLPAPTSPTP----PATPLVVT-QAPPTSPPTTPPAVTQVPLAPVD-------------- 837
P + PA+P V+T + +S P+ P T P + D
Sbjct: 836 SQPGAATSLDVVHFPASPDVMTHHSADSSSPSQPSHATATPASNDDVPSPSHISASEPFA 895
Query: 838 ---------ARPRTRS*NGIFKPNPRYALVHAQPTGLL--TALHTVT*PKGFKSAMKHPH 886
+RP TR GI K V+ T T L PK A+++ +
Sbjct: 896 TGEEVTQAPSRPTTRLQRGIRKEK-----VYTDGTVKYKNTFLTVTGEPKNLTDALQNTN 950
Query: 887 WLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQ 946
W AM+ E AL TW LVP NV+ K V++ K DG+++R K LVA+G Q
Sbjct: 951 WKKAMDIEYEALMNNKTWHLVPPKQGRNVIDCKWVYKIKRKQDGSLDRYKARLVAKGFKQ 1010
Query: 947 IPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFV 1006
G DY TFSPVVKA T+R++LS+ V + QLDV+NAFLHG L E VYM+QP G+
Sbjct: 1011 RYGIDYEDTFSPVVKAATIRIVLSIAVSRGWCMRQLDVQNAFLHGFLEEEVYMKQPPGYE 1070
Query: 1007 D------------LVSRLMCVG*TRLSTASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCS 1054
D + L S S + LGF + +
Sbjct: 1071 DESFPGYVCKLDKALYGLKQAPRAWYSRLSKKLYDLGFQGSKGDTSLF------------ 1118
Query: 1055 FFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTAD 1114
F+ KG + +++L+YVDDII+T S ++ + L EFA+K LG L YFLG+E+ D
Sbjct: 1119 FYNKGGLIIFVLIYVDDIIVTSSRQEAVSALLQDLKKEFALKDLGDLHYFLGIEVNKVTD 1178
Query: 1115 GLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHL-VSSGE--GYSDPTHYRSLVGALQY 1171
+ L KYA DLL M + V TPL+ L G+ G D T+YRS+VGALQY
Sbjct: 1179 EIILTQDKYACDLLRRVNMFDCKPVSTPLSTSEKLSAHEGDLLGPLDATNYRSVVGALQY 1238
Query: 1172 LTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSD 1231
LT+TRPD+++ VN V QFL PT H+ AVKRI+RY+ GL ++ S V YSD
Sbjct: 1239 LTLTRPDIAFPVNKVCQFLHAPTTVHWAAVKRILRYLKQCTKLGLKLCKSKSMLVSAYSD 1298
Query: 1232 ADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNL 1291
ADWA D RRST G+A+FLG NL+SW A+KQ +V+ SS ESEY+A+AN +E++W+ L
Sbjct: 1299 ADWAGSLDDRRSTGGFAVFLGDNLVSWCARKQATVSRSSTESEYKALANATAEIMWVQTL 1358
Query: 1292 LHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPT 1351
L EL+V+ DN A +++ NPV H KHI++D HFVRE VS L + VPT
Sbjct: 1359 LTELQVQSPPMAKLWCDNLGAKYLSSNPVFHARTKHIEVDYHFVRERVSQKLLEIDFVPT 1418
Query: 1352 SLQLADIFTKVLPRPLFDLFRSKLRVG 1378
Q+AD FTK LP + F+ L +G
Sbjct: 1419 GDQVADGFTKALPVRQLENFKHNLNLG 1445
>gb|AAC02672.1| polyprotein [Arabidopsis arenosa] gi|7522104|pir||T31353 polyprotein
- Arabidopsis arenosa Evelknievel retrotransposon
(fragment)
Length = 1390
Score = 734 bits (1894), Expect = 0.0
Identities = 482/1388 (34%), Positives = 714/1388 (50%), Gaps = 109/1388 (7%)
Query: 20 KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
+L NF++WS+QV A+L +L VD V P P + V N + W
Sbjct: 31 RLTDSNFVMWSRQVHALLDGYDLAGYVDGSVPIP------PPTRTTDDGVVTTNNDYTLW 84
Query: 80 QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQLMRR 139
+ +D+ V S LL +++ A T+ +VW L S +A+ + A +LR +L+ +
Sbjct: 85 KRQDKLVYSALLGAISLSVQPLLSKANTSAEVWETLSSTFANPSWAHVQQLRQQLKQWTK 144
Query: 140 GSLSVSEYGRK----------IR*SVANDDKVHWFLRGLGPSYANFSTGQLDQVPLPLFT 189
G+ SV Y + + + ++++ L GL Y + P T
Sbjct: 145 GTKSVVTYFQGFTTRFDHLALLGKAPEREEQIELILGGLPEDYKTVVDQIESRENPPALT 204
Query: 190 DILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSSGGGHRGNSSSGSRPRRDNG 249
++L K+ +H + A+ + + +A N ++ + +S R N+S G+ ++N
Sbjct: 205 EVLEKLINHEVKLAAKAEATSSVPITANAVNYRGNNNNNNSRSNGR-NNSRGNTSWQNNQ 263
Query: 250 GSHCRGSYTPR-----CQLCRKQGHYAAKCP-VRWDRPSESANLTHSFAAG--------C 295
+ R YTPR CQ+C GH A +CP ++ S ++N + S +
Sbjct: 264 STTNRQQYTPRPYQGKCQICSVHGHSARRCPQLQQHAGSYASNQSSSSSYAPWQPRANMV 323
Query: 296 SLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGSR---SA 352
S N + +D+GAT H+T L+ L Y+G + V + +G+GL I+H GS +
Sbjct: 324 SATPYNSGNWLLDSGATHHLTSDLNNLALHQPYNGGEEVTIADGSGLPISHSGSALLPTP 383
Query: 353 SHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKGF 412
+ S+ L +VL VP + NL+SV ++ + V F F +++ TGA L +G
Sbjct: 384 TRSLDLKDVLYVPDIQKNLISVYRMCNTNGVSVEFFPAHFQVKDLSTGARLLQGKTKNEL 443
Query: 413 YVLDQGSQALLATS---SSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILPKPIC 469
Y S +ATS S P+ WH+RL H + I+K L +S L +
Sbjct: 444 YEWPVNSS--IATSMFASPTPKTDLPSWHARLGHPSLPILKTLISKFSLPISHSLQNQLL 501
Query: 470 CTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQFTWF 529
C+ C + KS +L Y N +S L+ ++ D+W SP+ S+D + Y+++ VD ++++TW
Sbjct: 502 CSDCSINKSHKLPFYSNTIASSHPLEYLYTDVW-TSPITSIDNYKYYLVIVDHYTRYTWL 560
Query: 530 YPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRFLCP 589
YPL++KS + + F A VEN+F + SDNG EF +++ +S G+ H P
Sbjct: 561 YPLRQKSQVRETFITFTALVENKFKSKIGTLYSDNGGEFI--ALRSFLASHGISHMTTPP 618
Query: 590 HTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQIPFQ 649
HT NG ERKHRH++E GL +L + +S YW +AF+TAVY+INR+ + VL ++ P+
Sbjct: 619 HTPELNGISERKHRHIVETGLTLLSTASMSKEYWSYAFTTAVYLINRMLTPVLGNESPYM 678
Query: 650 LLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTTSRT 709
LF P Y FGC FP LRPY +K R PC+ LGY+ + C D T R
Sbjct: 679 KLFGQPPNYLKLRVFGCLCFPWLRPYTAHKLDNRSMPCVLLGYSLSQSAYLCLDRATGRV 738
Query: 710 YVSRHAQFDEFCFPLTGSKSS---PND---------LVVFTFYEPAAGSPPS-------- 749
Y SRH QF E FP + + S P+D + V P +PPS
Sbjct: 739 YTSRHVQFAESIFPFSTTSPSVTPPSDPPLSQDTRPISVPILARPLTTAPPSSPSCSAPH 798
Query: 750 ---LQPVIL-------VVPESSPPTGPLPCPSC-----VDPDVQPVPVDDAPP-SPVPHN 793
QP IL P SS PT P+ PS V + + P +PP SP P +
Sbjct: 799 RSPSQPGILSPSAPFQPSPPSS-PTSPITSPSLSEESHVGHNQETGPTGSSPPVSPQPQS 857
Query: 794 DAPPLPAPTSPTPPA-----TPLVVTQAPPTSPPTTPPAVTQVPLAPVDARPRTRS*NGI 848
+ P TSP P + +P +T A SP +PP P P+ RTRS N I
Sbjct: 858 EQSTSPRSTSPQPNSPHTQHSPRSITPALTPSPSPSPPPNPNPP-PPIQHTMRTRSKNNI 916
Query: 849 FKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLVP 908
KPNP++A + +PT L + PK A+ P+W AM DE++A + T+ LVP
Sbjct: 917 VKPNPKFANLATKPTPLKPII-----PKTVAEALLDPNWRQAMCDEINAQTRNGTFDLVP 971
Query: 909 RPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRLI 968
NV+G K VF K+ +G ++R K LVA+G Q G D+ TFSPV+K+TTVR +
Sbjct: 972 PAPNQNVIGCKWVFTLKYLPNGVLDRYKARLVAKGFHQQYGHDFKETFSPVIKSTTVRSV 1031
Query: 969 LSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDL-----VSRL--MCVG*TRLS 1021
L + V + Q+DV NAFL G L++ VY+ QP GFVD V RL G +
Sbjct: 1032 LHVAVSKGWSIRQIDVNNAFLQGTLSDEVYVMQPPGFVDKDNPHHVCRLHKALYGLKQAP 1091
Query: 1022 TASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGH--ITLYLLVYVDDIILTGSDP 1079
A + L S +L V I S F H LY+LVYVDD+++TGSD
Sbjct: 1092 RAWYQE---------LRSYLLTQGFVNSIADTSLFTLRHKRTILYVLVYVDDMLITGSDT 1142
Query: 1080 SLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHV 1139
+++T FIA L F++K LG++ YFLG+E T T+ GL L +Y DLL ML A V
Sbjct: 1143 NIITRFIANLAARFSLKDLGEMSYFLGIEATRTSKGLHLMQKRYVLDLLEKTNMLAAHPV 1202
Query: 1140 LTPLAAGSHL-VSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHF 1198
LTP++ L ++SG P+ YR+++G+LQYL+ TRPD++YAVN +SQ++ PT H+
Sbjct: 1203 LTPMSPTPKLSLTSGTPLDKPSEYRAVLGSLQYLSFTRPDIAYAVNRLSQYMHCPTDLHW 1262
Query: 1199 QAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSW 1258
QA KRI+RY+ G+ R + + YSDADWA TD+ ST Y ++LG +SW
Sbjct: 1263 QAAKRILRYLAGTPSHGIFIRADTPLKLHAYSDADWAGDTDNYNSTNAYILYLGSTPISW 1322
Query: 1259 SAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQN 1318
S+KKQ VA SS E+EYRA+AN SE+ W+ +LL EL + LS+PP+ DN A +++ N
Sbjct: 1323 SSKKQNGVARSSTEAEYRAVANATSEIRWVCSLLTELGITLSSPPVVYCDNVGATYLSAN 1382
Query: 1319 PVAHKHAK 1326
PV K
Sbjct: 1383 PVFDSRMK 1390
>gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
gi|51854440|gb|AAU10819.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1679
Score = 730 bits (1884), Expect = 0.0
Identities = 453/1169 (38%), Positives = 629/1169 (53%), Gaps = 91/1169 (7%)
Query: 275 PVRWDRPSESANLTHSFAAGCSLNNSNRSDKYMDTGATSHMTHSLSQLTYSHAYSGND-- 332
P +WD+ A+L SF +L+ +D YMDTGAT+HMT L+ SH + N
Sbjct: 535 PSQWDQ----ASLAGSFNT-TTLHQPATNDWYMDTGATAHMTSDTGILSLSHPPNPNSPS 589
Query: 333 RVLVGNGAGLHITHIGSRSASH---SVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEA 389
++VGNG+ + +T IG H S L ++L P + NL+SV + D+ F+
Sbjct: 590 HIVVGNGSTIPVTSIGHSKICHPNCSFTLRDILCSPAIIKNLISVRRFVIDNWCSVEFDP 649
Query: 390 DSFVIQNRQTGAVLGKGPCDKGFYVLDQGSQALLATSSSLPRASFEL-WHSRLDHVNFDI 448
F +++ +T V+ + Y L A ++ L +S +L WH RL H+ D
Sbjct: 650 FGFSVKDLRTRTVIARFNSSGPLYSLHHALPPPPAATALLANSSTDLLWHRRLGHLGHDA 709
Query: 449 IKKLHKLGCFNVSSILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVA 508
+ +L + + C +CQ+ + RL + RAS +L HCDLW SPV
Sbjct: 710 LNRLAAVVPMTRGDLTG---VCHACQLGRHVRLPFASSTSRASTNFELFHCDLW-TSPVV 765
Query: 509 SVDGFSYFVIFVDDFSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEF 568
S GF Y+++ +DD S + W +PL+ KSD + L F A+V+ QF+ +++ Q DNG EF
Sbjct: 766 SASGFKYYLVILDDCSHYVWTFPLRFKSDTFTTLSHFFAYVKTQFATNIRSIQCDNGREF 825
Query: 569 TNNKVQALFSSSGVLHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFS 628
N+ + F ++GV R CPHT QNG+ ER R + + +ML+ + + S+WV A
Sbjct: 826 DNSAARTFFLTNGVHLRMSCPHTSPQNGKAERILRSLNNIVRSMLFQAKLPGSFWVEALH 885
Query: 629 TAVYIINRVPSKVLSDQIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCI 688
TA ++INR P+K L P L+ P+Y++ FGC+ +P L +K +PR T C+
Sbjct: 886 TATHLINRHPTKTLDRHTPHFALYGTHPSYSHLRVFGCKCYPNLSATTPHKLAPRSTMCV 945
Query: 689 FLGYNSHHKGFKCFDPTTSRTYVSRHAQFDEFCFP---LTGSKSSPNDLVVFTFYEPAAG 745
FLGY +HKG++CFDP ++R +SRH FDE FP LT S+ DL + A
Sbjct: 946 FLGYPLYHKGYRCFDPLSNRVIISRHVVFDEHSFPFTELTNGVSNATDLDFLEDFTAPAQ 1005
Query: 746 SPPSLQPVILVVPESSPPTGPL--------PC-----------PSCVDPDVQPVPVDDAP 786
+P V P + + P+ PC PS D + P A
Sbjct: 1006 APIGATRRPAVAPTTQTASSPMVHGLERPPPCSPTRPVSTPGGPSSPDSRLGPPSPTPAL 1065
Query: 787 PSPVPHNDAPPL--PAPTSPTPPATPLVVTQAPPTSPPTTP-------PAVTQVP----- 832
P + PP PAP++ T PA+ T A P +PP P P VP
Sbjct: 1066 IGPASTSPGPPSAGPAPSASTCPASSTWETVARPPTPPGLPRLDGPHLPPAPHVPRRLRS 1125
Query: 833 --------------LAPV--DARPRTRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PK 876
++PV D TR+ +G KP R L HA P L+ PK
Sbjct: 1126 VRATGAPTPLSGLEISPVVNDHVMTTRAKSGHHKPVHRLNL-HAAPLSLV--------PK 1176
Query: 877 GFKSAMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLK 936
+++A+ P W AME+E +AL TW LVPRP NVV K +F+ KFH+DG+++R K
Sbjct: 1177 TYRAALADPLWRAAMEEEYNALLANRTWDLVPRPAGVNVVTGKWIFKHKFHADGSLDRYK 1236
Query: 937 TCLVAQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTET 996
V +G TQ PG D+ TFSPVVK TVR +LSL V D +HQLDVKNAFLHG L ET
Sbjct: 1237 ARWVLRGFTQRPGVDFDETFSPVVKPATVRTVLSLAVSRDWPVHQLDVKNAFLHGTLQET 1296
Query: 997 VYMEQPHGFVDLVSRLMCVG*TRLSTASNRRLV-LGFSA*ALFSCVLVF-RVVGLIHPCS 1054
VY QP GFVD M N+ L L + A +S F + +G + S
Sbjct: 1297 VYCTQPPGFVDSAKPDMV-------CCLNKSLYGLKQAPRAWYSRFTTFLQSIGFVEAKS 1349
Query: 1055 -----FFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEI 1109
++G+ T+YLL+YVDDI+LT S +LL I+ L EF++K LG L +FLG+ +
Sbjct: 1350 DTSLFILHRGNDTVYLLLYVDDIVLTASSRTLLHWTISALQGEFSMKDLGALHHFLGVSV 1409
Query: 1110 TYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLVSS-GEGYSDPTHYRSLVGA 1168
T + GL L +Y D+L A M + TP+ + L SS G +DPT +RSL GA
Sbjct: 1410 TRNSAGLVLSQRQYCIDILERAGMADCKPCNTPVDTTAKLSSSDGPPVADPTDFRSLAGA 1469
Query: 1169 LQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLG 1228
LQYLT TRPD+SYAV V + P H A+KRI+ Y+ GL +R+S+ +
Sbjct: 1470 LQYLTFTRPDISYAVQQVCLHMHDPREPHLAALKRILHYIRGSVDLGLHIQRSSACDLAV 1529
Query: 1229 YSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWL 1288
YSDADWA C D RRST GYA+FLG NL+SWS+K+Q +V+ SS E+EYRA+AN +E+ WL
Sbjct: 1530 YSDADWAGCPDTRRSTSGYAVFLGDNLVSWSSKRQHTVSRSSAEAEYRAVANAVAEVTWL 1589
Query: 1289 LNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRH 1348
LL EL S L DN SA++++ NPV H+ KH+++D HFVRE V+ G + V H
Sbjct: 1590 RQLLQELHSPPSRATLVYCDNVSAVYLSSNPVQHQRTKHVEIDLHFVRERVAVGAVRVLH 1649
Query: 1349 VPTSLQLADIFTKVLPRPLFDLFRSKLRV 1377
VPT+ Q ADIFTK LP P+F FRS L V
Sbjct: 1650 VPTTSQYADIFTKGLPTPVFTEFRSSLNV 1678
Score = 53.1 bits (126), Expect = 6e-05
Identities = 48/191 (25%), Positives = 80/191 (41%), Gaps = 20/191 (10%)
Query: 79 WQARDRNVNSLLLSSLTEEALSETLSA-TTARDVWTALHSAYASRNKAREMRLRDELQLM 137
W D V + L +++ + L + L+ TTAR VW L + ++ R + L E
Sbjct: 215 WVQMDCVVLAWLFGTISFDLLQDVLATDTTARLVWRGLEYQFLGNSEQRALNLTTEFHTF 274
Query: 138 RRGSLSVSEYGRKIR----------*SVANDDKVHWFLRGLGPSYANFSTGQLDQVPLPL 187
++G LSV EY RK++ V + V L GL + N + Q P P
Sbjct: 275 QQGDLSVDEYCRKMKTFADSLGDVGEPVRDRTLVLNTLNGLSEKFNNLRSLVPMQRPFPT 334
Query: 188 FTDILWKVESHAIFQASLEDYVTPPSAAFHARNPTRSSGSQSS-------GGGHRGNSSS 240
F ++ + + + + + SA F A T + G ++ G G++G
Sbjct: 335 FAELRSLLRLEELSKPN--HAASASSAVFLATGSTTNGGKGANPAHGTGYGAGNQGGGGK 392
Query: 241 GSRPRRDNGGS 251
G+ RR GG+
Sbjct: 393 GNSNRRRRGGN 403
>ref|XP_462785.1| putative gag/pol polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1373
Score = 667 bits (1722), Expect = 0.0
Identities = 402/1028 (39%), Positives = 546/1028 (53%), Gaps = 95/1028 (9%)
Query: 412 FYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILPKPICCT 471
FY + ALLA +SL WH RL H+ + + KL + + + P C
Sbjct: 374 FYPPATSTHALLAAPTSL-------WHRRLGHLGREALSKLIRSSVISCTKD-DLPHLCH 425
Query: 472 SCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQFTWFYP 531
+CQ+ RL + RAS DL+HCDLW SP+ SV G+ Y+++ +DD S + W +P
Sbjct: 426 ACQLGHHTRLPFSSSSSRASNNFDLIHCDLW-TSPIVSVSGYKYYLVILDDCSHYIWTFP 484
Query: 532 LKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRFLCPHT 591
L+ KSD + + F A V+ QF ++K Q DNG EF N+ + F S GV R CP+T
Sbjct: 485 LRLKSDTFSTIANFFAHVKTQFGTTIKSVQCDNGREFDNSPARTFFLSHGVAFRMSCPYT 544
Query: 592 QAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQIPFQLL 651
QNGR ER R + + ++L+ + + YWV A TA ++NR+P+K LS P+ L
Sbjct: 545 SQQNGRAERSLRTLNNILRSLLFQACLPPVYWVEALHTATLLVNRIPTKTLSSSTPYFHL 604
Query: 652 FQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTTSRTYV 711
+ PTY + FGC +P + +K +PR + C+FLGY+S HKG++C + ++R
Sbjct: 605 YSTQPTYDHLRVFGCACYPNMSSTAPHKLAPRSSLCVFLGYSSEHKGYRCLELGSNRIIT 664
Query: 712 SRHAQFDEFCFPLTGSKSSP---NDLVVFTFYEPAAGSPPSLQPVILVVPESS----PPT 764
SRH FDE FP +SP + L +F PP + V ++ P+
Sbjct: 665 SRHVVFDESFFPFADMSTSPMASSALDIFLDDNELTAQPPRAKFVHAGTSSAARGAVEPS 724
Query: 765 GPLPCPSCVDPDVQPVPVDDAPPSPVPHNDAP---------------------------- 796
P P PS + P P A P PH +P
Sbjct: 725 TPPPAPSSIGPRS---PATLAGPEAGPHGGSPAGAATSQPGAISPARTAAPSAATSTTRA 781
Query: 797 ----PLPAPTSPTPPATPLVVTQAPP---------TSPPTTPPAVTQVPLAPVDARP--R 841
P A + TP +PL T APP T+ A V +APVD R
Sbjct: 782 VTSAPRAATSGTTPSLSPLAGTAAPPPRAEVAASSTAATGRTLATRPVSIAPVDNAHSMR 841
Query: 842 TRS*NGIFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKK 901
TR G+ +P R L HA P + P+ + A+ P+W AM+ E AL
Sbjct: 842 TRGKAGMAQPVDRLNL-HAAPLSPV--------PRSVREALSDPNWRAAMQAEFDALLAN 892
Query: 902 FTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVK 961
TW+LVPRP N+V K +FR K HSDG+++R K V +G TQ PG DY TFSPVVK
Sbjct: 893 DTWSLVPRPRGVNLVTGKWIFRHKLHSDGSLDRYKARWVLRGFTQRPGVDYDETFSPVVK 952
Query: 962 ATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVD-----LVSRL--MC 1014
TVR++LSL + D +HQLDVKNAFLHG L+ETVY QP GF D LV RL
Sbjct: 953 PATVRVVLSLALSQDWPIHQLDVKNAFLHGTLSETVYCIQPTGFADPSHADLVCRLNKSL 1012
Query: 1015 VG*TRLSTASNRRLV-----LGFSA*ALFSCVLVFRVVGLIHPCSFFYKGHITLYLLVYV 1069
G + A + R LGF S + + R +G+ T+ LL+YV
Sbjct: 1013 YGLKQAPRAWHHRFASHLISLGFIEAQSDSSLFIHR------------RGNDTVLLLLYV 1060
Query: 1070 DDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLS 1129
DDI+LT S SLL IA L EFA+ +G L +FLG+ +T A GLFL +Y+ D+L
Sbjct: 1061 DDIVLTASSASLLQQVIAALQREFAMTDMGPLHHFLGITVTRFASGLFLSQRQYSQDILE 1120
Query: 1130 HALMLEASHVLTPLAAGSHLVSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQF 1189
A M E TP+ S L + G +D T YRSL GALQYLT TRPD+++AV V +
Sbjct: 1121 RAGMGECKPCSTPVDVHSKLSADGPPVADSTQYRSLAGALQYLTFTRPDIAFAVQQVCLY 1180
Query: 1190 LQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAI 1249
+ P H A+KRI+RY+ GLT RR+ ++ Y+DADWA C D RRST GYA+
Sbjct: 1181 MHDPREPHLAALKRILRYIQGTLSLGLTMRRSPPTDLVVYTDADWAGCPDTRRSTSGYAV 1240
Query: 1250 FLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDN 1309
FLG NL+SWS+K+Q +V+ SS E+EYRA+AN +E WL LL EL + DN
Sbjct: 1241 FLGDNLVSWSSKRQHTVSRSSAEAEYRAVANGVAEATWLRQLLMELHRPPRTATVVYCDN 1300
Query: 1310 QSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFD 1369
SA++++ NPV H+ KH+++D HFVRE V+ G + V HVPT+ Q AD+FTK LP LF
Sbjct: 1301 VSAMYLSSNPVQHQRTKHVEIDLHFVREKVALGHVRVLHVPTTSQYADVFTKGLPTSLFQ 1360
Query: 1370 LFRSKLRV 1377
FR+ L +
Sbjct: 1361 EFRTSLTI 1368
>gb|AAD14478.1| Strong similarity to gb|AF039376 Evelknievel retrotransposon
polyprotein from Arabidopsis arenosa. [Arabidopsis
thaliana] gi|25301711|pir||E96624 hypothetical protein
T2K10.7 [imported] - Arabidopsis thaliana
Length = 1194
Score = 542 bits (1396), Expect = e-152
Identities = 378/1230 (30%), Positives = 592/1230 (47%), Gaps = 145/1230 (11%)
Query: 20 KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
KL N+++W++QV A+L +L +D V PSE + V A NP + W
Sbjct: 30 KLTPTNYIMWNRQVHALLDGYDLAGYIDGSVTAPSEMITTAG-------VSAANPAYKFW 82
Query: 80 QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQLMRR 139
+ +D+ + S +L ++T + TA ++W L S YA+ + ++R ++ +
Sbjct: 83 KRQDKLIYSAILGTITTTIQPLLSRSNTAAEIWEKLKSIYATPSWGHIQQMRQHIKQWSK 142
Query: 140 GSLSVSEY--GRKIR*S--------VANDDKVHWFLRGLGPSYANFSTGQLDQVPL---- 185
G+ +++EY G R + + +++ + L GL Y + +DQ +
Sbjct: 143 GTKTITEYFQGHTTRFDELALLGKPLEHAEQIEFLLGGLSEDYKSV----VDQTEIRDKP 198
Query: 186 PLFTDILWKV---ESHAIFQASLEDYVTPPS--AAFHARNPTRSSGSQSSGGGHRGNSSS 240
P T++L K+ E+ + A+ T PS A HA N +S + +R N S
Sbjct: 199 PTLTELLEKLLNREAKLMCAAA-----TTPSLPATAHAANYKGNSNNNQYNNNNRNNKSH 253
Query: 241 GSRPRRDNGGSHCRGSYTPRCQLCRKQGHYAAKCPVRWDRPSESANLTHSFAAGCSLNNS 300
GS S +TP W +P +A + +
Sbjct: 254 GSSYNSQQ--SIPASPFTP------------------W-QPRANAAIASPY--------- 283
Query: 301 NRSDKYMDTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITHIGSR---SASHSVP 357
N ++ +D+GAT H+T L+ L+ Y+G + V + +G+GL I+H GS + S S+
Sbjct: 284 NANNWLLDSGATHHITSDLNNLSLHQPYTGGEDVTIADGSGLSISHTGSALISTPSRSLA 343
Query: 358 LSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVLGKGPCDKGFYVLDQ 417
L++VL VP + NL+SV ++ + V F F +++ +TG L +G Y
Sbjct: 344 LTDVLYVPNIHKNLISVYRMCNANKVSVEFFPAHFQVKDLKTGVQLLQGRTKDELYEWPV 403
Query: 418 GSQALLAT-SSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSSILPKPICCTSCQMA 476
+ +++ P+ WHSRL H + +K + VS+ L K C+ C +
Sbjct: 404 NPPKPSSHFTTTTPKTDLTSWHSRLGHPSLSTLKVVVSQFSLPVSNSLQKQFNCSDCLLN 463
Query: 477 KSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFSQFTWFYPLKRKS 536
K+ +L + N ++ L+ ++ DLW SP+ S+D F Y+++ VD +++++WFYP+K+KS
Sbjct: 464 KTHKLPFHTNTITSTQPLEYLYIDLW-TSPIVSIDNFKYYLVIVDHYTRYSWFYPIKQKS 522
Query: 537 DFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLHRFLCPHTQAQNG 596
DV + FKA V N+F R + SDNG EF +++ SS+G+ H PHT NG
Sbjct: 523 HVKDVFMTFKALVANKFQRKIIHLYSDNGGEFI--ALRSFLSSNGITHLTTPPHTPEHNG 580
Query: 597 RVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSDQIPFQLLFQVAP 656
ERKHRH++E GL +L + + SYW +AF+ A+Y+INR+ S V+ P++ LF AP
Sbjct: 581 ISERKHRHIVETGLTLLGQASMPKSYWSYAFTIAIYLINRMSSDVIGGISPYKRLFGQAP 640
Query: 657 TYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDPTTSRTYVSRHAQ 716
Y FGC FP LRPY +K R PC+FLGY+ + C + TT R Y SRH Q
Sbjct: 641 NYLKLRVFGCLCFPWLRPYTTHKLDDRPAPCVFLGYSQTQSAYLCLNRTTGRVYTSRHVQ 700
Query: 717 FDEFCF--------PLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPTGPLP 768
F E + P T + S N + T P PS+ P P PP+ P P
Sbjct: 701 FVENTYPFTKPTLDPFTNLEESNNHSITTTVPSPPFVQLPSVPPPTR-DPHQPPPSQPAP 759
Query: 769 CPSCVDPDVQPVPVDDAPPS---------------------PVPHNDAPPLPAPT----- 802
PS + P PV + P P N P+ +PT
Sbjct: 760 SPSPLSPPSMSSPVMTSSPQFSSNRDSTTLHGDYSHVDYGLSSPSNPPGPITSPTTSKSP 819
Query: 803 -------------SPTPPATPLVVTQAPPTSPPTTPPAVTQVPLAPVDARP-RTRS*NGI 848
+ TPP +P + +P P +P + P P + RTR+ N I
Sbjct: 820 SEPTSSPSHSNQPNKTPPNSPSSSSSSPTPIPSPSPQSSNSPPPPPQNQHSMRTRAKNNI 879
Query: 849 FKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLVP 908
KP + L A P G TV A++ P+W AM +E +A + T+ LVP
Sbjct: 880 TKPIKKLTLA-ATPKGKSKIPTTVA------EALRDPNWRNAMSEEFNAGLRNSTYDLVP 932
Query: 909 RPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRLI 968
N VG++ +F K++ DG++ R K +A+G Q G DYS TFSPV+K+TTV+ +
Sbjct: 933 PKPHQNFVGTRWIFTIKYNPDGSINRYKARFLAKGFHQQHGLDYSNTFSPVIKSTTVQTV 992
Query: 969 LSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVD------LVSRLMCVG*TRLST 1022
L + V + QLD+ NAFL G LTE VY+ QP GF++ + + + +
Sbjct: 993 LDIAVSRSWDIRQLDINNAFLQGRLTEDVYVAQPPGFINPDRPNYVCHLKKALHGLKQAP 1052
Query: 1023 ASNRRLVLGFSA*ALFSCVLVFRVVGLIHPCSFFYKGHIT--LYLLVYVDDIILTGSDPS 1080
+ + + GF L +C V S F + H +Y+LVYVDD ++TGS+ +
Sbjct: 1053 RAWYQELRGF----LLTCGFTNSVAN----TSLFIRQHNKDYIYILVYVDDFLITGSNSN 1104
Query: 1081 LLT*FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVL 1140
L+ FI CL + F++K LG+L YFL +E T T GL L +Y DLL+ ML+A V
Sbjct: 1105 LIAQFITCLANRFSLKDLGQLSYFLEIEATRTKAGLHLMQRRYVLDLLTKTKMLDAKTVS 1164
Query: 1141 TPLAAGSHL-VSSGEGYSDPTHYRSLVGAL 1169
TP++ L ++SG +P YR ++G+L
Sbjct: 1165 TPMSPTPKLTLTSGTPIDNPGGYRQILGSL 1194
>ref|XP_507106.1| PREDICTED OJ1499_A07.20 gene product [Oryza sativa (japonica
cultivar-group)]
Length = 1427
Score = 506 bits (1303), Expect = e-141
Identities = 420/1458 (28%), Positives = 638/1458 (42%), Gaps = 189/1458 (12%)
Query: 20 KLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPLFVQW 79
+L + N+ W +V A++++Q ++D ++P A +P
Sbjct: 25 QLTATNYTSWCIRVQAMMEDQGVWDAIEPAAGV------------------AVDP----- 61
Query: 80 QARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMR---LRDELQL 136
RD+ S LL SL E+ L + +A++VW L + + ++ RE R L+ E
Sbjct: 62 -RRDKKSKSHLLQSLPEDLLMQVAKKRSAKEVWDCLKTRFVGADRVREARLQTLKGEFGA 120
Query: 137 M----------RRGSLSVSEYGRKIR*SVANDDKVHWFLRGLGPS-YANFSTG-----QL 180
M G ++ + S +D + L P + + G ++
Sbjct: 121 MVMEPGETLDQYAGRITAMSVRHSVLGSTLSDSAIVKKLFDTVPEKFISLVAGIEQFYEI 180
Query: 181 DQVP-------LPLFTD------------------ILWKVESHAIFQASLEDYVTPPSAA 215
D +P L + + +L + E A F+ + +P
Sbjct: 181 DNMPFEEAVGRLKAYEERVRKKKAAAGGVTADGQVLLTQAEWEARFRKDGSESSSPQKNK 240
Query: 216 FHARNPTRSSGSQSSGGGHRGNSSSGSRPRRDNGGSHCRGSYTP---RCQLCRKQGHYAA 272
+ R+ G + G GH G GS PR G G +C C + GHY+
Sbjct: 241 PPSDRGNRAQGGRGRGRGHGGGGGRGSAPRNSGAGGSGGGGRDKSHIKCYNCEEFGHYST 300
Query: 273 KCPVRWDRPSESANLTHSFAAGCSLNNSNRSDK--------------------------- 305
+CP + E A+L + A +L + D+
Sbjct: 301 QCPHPKKKKVE-AHLAQTDDANPALLLAVTEDEPASGLVVHEERVWPQLLLADSGAATGD 359
Query: 306 --YMDTGATSHMTHSLSQ-------LTYSHAYSGNDRVLV-GNGAGLHITHIGSRSASHS 355
++D GA++HMT ++ +T S + V + G G+ L G +
Sbjct: 360 IWFLDNGASNHMTGDRAKFRDLDVSITGSVKFGDASTVKIQGKGSILFSCKNGDQWL--- 416
Query: 356 VPLSNVLVVPRLTNNLVSVSKLTR-------DHNVRAIFEADSF--VIQNRQTGAVLGKG 406
L +V +P L N+VS+ +LT D +V +F+ V++ R+T L +
Sbjct: 417 --LQDVFYIPSLCCNMVSLGQLTETGHRVVMDEDVLEVFDKSPLRLVMRVRRTPNRLYR- 473
Query: 407 PCDKGFYVLDQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFN-VSSILP 465
L + L T P LWH+RL HVNF +K L G + +I
Sbjct: 474 ------IELKLATPVCLLTRMDEPAW---LWHARLGHVNFQAMKLLADKGMAGGIPAITH 524
Query: 466 KPICCTSCQMAKSKRLVCYDNHK-RASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDDFS 524
C +C +AK R RA L+L+H DL GP ++ G YF++ VDDFS
Sbjct: 525 PNQLCQACLVAKQIRQPFPATANFRAEEPLELLHIDLCGPITPTTMAGNRYFMLIVDDFS 584
Query: 525 QFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGVLH 584
++ W + +K K + +FK EN R +K +SD G EF + + L +G+
Sbjct: 585 RWMWMFVIKTKDQALEAFTKFKPLAENTAGRRIKTLRSDRGGEFLSGEFAQLCEQAGIQR 644
Query: 585 RFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVLSD 644
P++ QNG VER++R V+ + +++ V +W A AVY++NR+P+K + D
Sbjct: 645 HLTAPYSPQQNGVVERRNRSVMAMARSLMKGMSVPGRFWGEAVRHAVYLLNRLPTKAMGD 704
Query: 645 QIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKCFDP 704
+ PF+ P + FGC + + K R P ++LG K + FDP
Sbjct: 705 RTPFEAWTGRKPQLGHLRVFGCIAHAKITTPNQKKLDDRSAPYVYLGVEEGSKAHRLFDP 764
Query: 705 TTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSPPSLQPVILVVPESSPPT 764
R +VSR F+E + + FT E +PP + P
Sbjct: 765 RCGRIHVSRDVIFEENVPWQWSVVAGEQNSTEFTVEEDGVDAPP-----------AGAPA 813
Query: 765 GPLPCPSCVDPDVQPVPVDDAPPSPVPHNDAPPLPAPTSP--TPPATPLVVTQAPPTSPP 822
P+P P V P P SPV + + P +SP TPP+TP + P SP
Sbjct: 814 YPVPRYRAPSPAVPQSP----PASPVGASSSLPTSPQSSPSSTPPSTPATGSAGPVASPG 869
Query: 823 TTPPAVTQVPLAPVDARPRTRS*NGIFKPNPRYALVHAQPTG--LLTALHTVT*PKGFKS 880
+ L + R RS I + PR LV + G LL + P ++
Sbjct: 870 SGG------DLRSDEGPVRFRSLEDIMREAPRVDLVEDEHDGDALLAEMEE---PSSYRE 920
Query: 881 AMKHPHWLPAMEDELSALHKKFTWTLVPRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLV 940
A P W AM EL A+ K TW L P +G K V++ K ++ G V + K LV
Sbjct: 921 AAGQPAWENAMAQELQAIEKNSTWALTALPAGHKPIGLKWVYKLKKNTAGEVIKHKARLV 980
Query: 941 AQGLTQIPGFDYSLTFSPVVKATTVRLILSLVVLND*QLHQLDVKNAFLHGHLTETVYME 1000
A+G Q G D+ F+PV + TVR+IL++ ++H LDVK+AFL+G L E VY+
Sbjct: 981 AKGYVQRQGVDFEEVFAPVARLDTVRVILAIAADRRWEVHHLDVKSAFLNGDLEEEVYVA 1040
Query: 1001 QPHGFV-----DLVSRL--MCVG*TRLSTASNRRL-----VLGFSA*ALFSCVLVFRVVG 1048
QP GFV LV RL G + A N RL LGF+ V
Sbjct: 1041 QPEGFVKRGEEHLVLRLSKALYGLRQAPRAWNTRLDKCLKELGFARCTQEQAVYTRG--- 1097
Query: 1049 LIHPCSFFYKGHITLYLLVYVDDIILTGSDPSLLT*FIACLNDEFAIKYLGKLGYFLGLE 1108
KG + + VYVDD+I+TG +P + F + EF + LG L Y+LG+E
Sbjct: 1098 ---------KGQAGVIVGVYVDDLIVTGENPHEIAMFKQQMMGEFEMSDLGLLSYYLGIE 1148
Query: 1109 ITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLAAGSHLVSSGEGYS-DPTHYRSLVG 1167
+ +G+ + A YA +LS M + P+ S L +G D T YR ++G
Sbjct: 1149 VIQGENGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPRSLLHKDADGNPIDATEYRRVIG 1208
Query: 1168 ALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRIIRYVCAMQHFGLTFRRTS-SPAV 1226
L+YL TRPDLSYAV S+F++ PT H +AVK I+RY+ GL F S S +
Sbjct: 1209 CLRYLLHTRPDLSYAVGVASRFMERPTTMHLKAVKMILRYLKGTLDSGLVFASGSGSLDI 1268
Query: 1227 LGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQPSVAHSSCESEYRAMANTASELV 1286
G++D+D A D RRST G A ++ +L+SW ++KQ +VA SSCE+E+ A A +
Sbjct: 1269 TGFTDSDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQKTVALSSCEAEFMAATAAACHAL 1328
Query: 1287 WLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKHAKHIDLDCHFVRELVSSGRLAV 1346
WL LL E+ + DN+SA+ + +NPV H +KHID HF+RE V SG++ +
Sbjct: 1329 WLRALLSEMMGTEAKRVKLFVDNKSAIALMKNPVFHGRSKHIDTRYHFIRECVESGQILI 1388
Query: 1347 RHVPTSLQLADIFTKVLP 1364
V + Q AD TK LP
Sbjct: 1389 EFVRSEEQRADAMTKGLP 1406
>gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana]
gi|7444456|pir||T01908 hypothetical protein T12H20.12 -
Arabidopsis thaliana
Length = 1392
Score = 447 bits (1151), Expect = e-124
Identities = 270/766 (35%), Positives = 403/766 (52%), Gaps = 53/766 (6%)
Query: 16 MITVKLNSGNFLLWSKQVTAILQNQNLFDIVDPIVAPPSEFMLQPVSSALHHVVPAPNPL 75
++T+KL N+LLW Q + L + L V P+ ++ N
Sbjct: 15 VVTLKLTPTNYLLWKTQFESYLSSHLLLGFVTGATPRPASTIIVTKDDIQSEEA---NQE 71
Query: 76 FVQWQARDRNVNSLLLSSLTEEALSETLSATTARDVWTALHSAYASRNKAREMRLRDELQ 135
F++W D+ V + + SL+EEAL + +A++VW L + + R+ L+ L
Sbjct: 72 FLKWTRIDQLVKAWIFGSLSEEALKVVIGLNSAQEVWLGLARRFNRFSTTRKYDLQKRLG 131
Query: 136 LMRRGSLSVSEYGRKIR*----------SVANDDKVHWFLRGLGPSYANFST---GQLDQ 182
+ ++ Y +++ V +K+ L GLG Y + +T LD
Sbjct: 132 TCSKAGKTMDAYLSEVKNICDQLDSIGFPVTEQEKIFGVLNGLGKEYESIATVIEHSLDV 191
Query: 183 VPLPLFTDILWKVESHAIFQASLEDYVT----PPSAAFHARNPTRSSGSQSSGGGHRGN- 237
P P F D+++K+ + F L Y P AF+ S G+ +S GG GN
Sbjct: 192 YPGPCFDDVVYKLTT---FDDKLSTYTANSEVTPHLAFYTDKSYSSRGNNNSRGGRYGNF 248
Query: 238 ------SSSGSRPRRDNGGSHCRGSYT---PRCQLCRKQGHYAAKCPVRWDRPSESANLT 288
SS G + G GS P CQ+CRK GH A KC R++ +L
Sbjct: 249 RGRGSYSSRGRGFHQQFGSGSNNGSGNGSKPTCQICRKYGHSAFKCYTRFEENYLPEDLP 308
Query: 289 HSFAA-GCSLNNSNRSDKYM-DTGATSHMTHSLSQLTYSHAYSGNDRVLVGNGAGLHITH 346
++FAA S N S +++ D+ AT+H+T++ L S YSG+D V+VGNG L ITH
Sbjct: 309 NAFAAMRVSDQNQASSHEWLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITH 368
Query: 347 IGS---RSASHSVPLSNVLVVPRLTNNLVSVSKLTRDHNVRAIFEADSFVIQNRQTGAVL 403
IG+ + ++PL +VLV P +T +L+SVSKLT D+ F++DS VI++++T +L
Sbjct: 369 IGTIPLNISQGTLPLEDVLVCPGITKSLLSVSKLTDDYPCSFTFDSDSVVIKDKRTQQLL 428
Query: 404 GKGPCDKGFYVL-DQGSQALLATSSSLPRASFELWHSRLDHVNFDIIKKLHKLGCFNVSS 462
+G KG YVL D Q +T + E+WH RL H N ++++ L K V+
Sbjct: 429 TQGNKHKGLYVLKDVPFQTYYSTRQQ--SSDDEVWHQRLGHPNKEVLQHLIKTKAIVVNK 486
Query: 463 ILPKPICCTSCQMAKSKRLVCYDNHKRASAVLDLVHCDLWGPSPVASVDGFSYFVIFVDD 522
C +CQM K RL + +S L+ +HCDLWGP+PV S GF Y+VIF+D+
Sbjct: 487 TSSN--MCEACQMGKVCRLPFVASEFVSSRPLERIHCDLWGPAPVTSAQGFQYYVIFIDN 544
Query: 523 FSQFTWFYPLKRKSDFYDVLVRFKAFVENQFSRSLKVFQSDNGTEFTNNKVQALFSSSGV 582
+S+FTWFYPLK KSDF+ V V F+ VENQ+ + +FQ D G EF + K A +S G+
Sbjct: 545 YSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQCDGGGEFVSYKFVAHLASCGI 604
Query: 583 LHRFLCPHTQAQNGRVERKHRHVLELGLAMLYHSHVSTSYWVHAFSTAVYIINRVPSKVL 642
CPHT QNG ER+HR++ ELGL++++HS V WV AF T+ ++ N +PS L
Sbjct: 605 KQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHKLWVEAFFTSNFLSNLLPSSTL 664
Query: 643 SD-QIPFQLLFQVAPTYANFHPFGCRVFPCLRPYMKNKFSPRGTPCIFLGYNSHHKGFKC 701
SD + P+++L P Y FG +P LRPY KNKF P+ C+FLGYN+ +KG++C
Sbjct: 665 SDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKFDPKSLLCVFLGYNNKYKGYRC 724
Query: 702 FDPTTSRTYVSRHAQFDEFCFPLTGSKSSPNDLVVFTFYEPAAGSP 747
P T + Y+ RH FDE FP + +++ ++ +GSP
Sbjct: 725 LHPPTGKVYICRHVLFDERKFPYSD---------IYSQFQTISGSP 761
Score = 391 bits (1004), Expect = e-106
Identities = 228/535 (42%), Positives = 314/535 (58%), Gaps = 15/535 (2%)
Query: 848 IFKPNPRYALVHAQPTGLLTALHTVT*PKGFKSAMKHPHWLPAMEDELSALHKKFTWTLV 907
I KPNP+YAL + PK K A+K W AM +E+ +H+ TW LV
Sbjct: 778 ITKPNPKYALFSVKSN--------YPEPKSVKEALKDEGWTNAMGEEMGTMHETDTWDLV 829
Query: 908 PRPYTTNVVGSK*VFRTKFHSDGTVERLKTCLVAQGLTQIPGFDYSLTFSPVVKATTVRL 967
P ++G K VF+TK +SDG+++RLK LVA+G Q G DY T+SPVV++ TVR
Sbjct: 830 PPEMVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATVRS 889
Query: 968 ILSLVVLND*QLHQLDVKNAFLHGHLTETVYMEQPHGFVDLVSRLMCVG*TRLSTASNRR 1027
IL + +N L QLDVKNAFLH L ETV+M QP GF D SR V + + ++
Sbjct: 890 ILHVATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFED-PSRPDYVCKLKKAIYDLKQ 948
Query: 1028 LVLGFSA*ALFSCVLVFR--VVGLIHPCSFFY-KGHITLYLLVYVDDIILTGSDPSLLT* 1084
+ FS L+ + P F Y KG ++LL+YVDD+ILTG++ LL
Sbjct: 949 APRAWFD--KFSSYLLKYGFICSFSDPSLFVYLKGRDVMFLLLYVDDMILTGNNDVLLQQ 1006
Query: 1085 FIACLNDEFAIKYLGKLGYFLGLEITYTADGLFLGHAKYAHDLLSHALMLEASHVLTPLA 1144
+ L+ EF +K +G L YFLG++ Y DGLFL KY DLL +A M + S + TPL
Sbjct: 1007 LLNILSTEFRMKDMGALHYFLGIQAHYHNDGLFLSQEKYTSDLLVNAGMSDCSSMPTPLQ 1066
Query: 1145 AGSHLVSSGEGYSDPTHYRSLVGALQYLTITRPDLSYAVNTVSQFLQTPTVDHFQAVKRI 1204
L + + + +PT++R L G LQYLT+TRPD+ +AVN V Q + PT+ F +KRI
Sbjct: 1067 LDL-LQGNNKPFPEPTYFRRLAGKLQYLTLTRPDIQFAVNFVCQKMHAPTMSDFHLLKRI 1125
Query: 1205 IRYVCAMQHFGLTFRRTSSPAVLGYSDADWARCTDHRRSTYGYAIFLGYNLLSWSAKKQP 1264
+ Y+ G+ + + YSD+DWA C D RRST G+ FLGYN++SWSAK+ P
Sbjct: 1126 LHYLKGTMTMGINLSSNTDSVLRCYSDSDWAGCKDTRRSTGGFCTFLGYNIISWSAKRHP 1185
Query: 1265 SVAHSSCESEYRAMANTASELVWLLNLLHELRVRLSAPPLFLSDNQSALFMAQNPVAHKH 1324
+V+ SS E+EYR ++ ASE+ W+ LL E+ + P DN SA++++ NP H
Sbjct: 1186 TVSKSSTEAEYRTLSFAASEVSWIGFLLQEIGLPQQQIPEMYCDNLSAVYLSANPALHSR 1245
Query: 1325 AKHIDLDCHFVRELVSSGRLAVRHVPTSLQLADIFTKVLPRPLFDLFRSKLRVGL 1379
+KH +D ++VRE V+ G L V+H+P S QLADIFTK LP+ F R KL V L
Sbjct: 1246 SKHFQVDYYYVRERVALGALTVKHIPASQQLADIFTKSLPQAPFCDLRFKLGVVL 1300
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.325 0.138 0.433
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,434,904,960
Number of Sequences: 2540612
Number of extensions: 112450920
Number of successful extensions: 870012
Number of sequences better than 10.0: 9431
Number of HSP's better than 10.0 without gapping: 3055
Number of HSP's successfully gapped in prelim test: 6999
Number of HSP's that attempted gapping in prelim test: 634233
Number of HSP's gapped (non-prelim): 73260
length of query: 1380
length of database: 863,360,394
effective HSP length: 141
effective length of query: 1239
effective length of database: 505,134,102
effective search space: 625861152378
effective search space used: 625861152378
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 82 (36.2 bits)
Lotus: description of TM0019a.6