
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144760.12 + phase: 0 /pseudo
(1302 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1... 1226 0.0
pir||T02206 hypothetical protein - common tobacco retrotransposo... 1219 0.0
gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas] 1152 0.0
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas] 1125 0.0
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense] 1121 0.0
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-... 1030 0.0
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu... 1008 0.0
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-... 1007 0.0
gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult... 1004 0.0
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi... 984 0.0
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi... 954 0.0
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi... 953 0.0
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu... 953 0.0
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi... 949 0.0
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi... 948 0.0
gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis ... 940 0.0
ref|XP_476137.1| putative polyprotein [Oryza sativa (japonica cu... 913 0.0
emb|CAB79135.1| putative transposable element [Arabidopsis thali... 868 0.0
emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir|... 867 0.0
gb|AAG50765.1| copia-type polyprotein, putative [Arabidopsis tha... 795 0.0
>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease]
Length = 1328
Score = 1226 bits (3171), Expect = 0.0
Identities = 646/1329 (48%), Positives = 884/1329 (65%), Gaps = 51/1329 (3%)
Query: 7 KIEKFDGAD-FGFWKMQIEDYLYQKKLHQPLT--EKKPDSMKDDEWSLLDRQALGVVRLS 63
++ KF+G + F W+ ++ D L Q+ LH+ L KKPD+MK ++W+ LD +A +RL
Sbjct: 7 EVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAIRLH 66
Query: 64 LSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELN 123
LS +V NI E T G+ L S+Y +NK++L ++L+ L M+EG + H+N N
Sbjct: 67 LSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFN 126
Query: 124 IVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLVLSEEI 183
+ TQL+++G++ ++E +A++LL+SLP S+ + T + + ++K D L+L+E++
Sbjct: 127 GLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELK-DVTSALLLNEKM 185
Query: 184 RRRELGESSSSSVLHTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKTG 243
R++ + L TE RGR+ R + R KSKN S + C+NC + G
Sbjct: 186 RKKP---ENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKS---RVRNCYNCNQPG 239
Query: 244 HFKNQCRLPTKNQEE----KDEANVASTSGGGDALICSLESKEE---------SWVLDSG 290
HFK C P K + E K++ N A+ D ++ + +EE WV+D+
Sbjct: 240 HFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTA 299
Query: 291 ASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIKLN-GSVWELKNVRHIPNL 349
AS HA+ ++ F YV G+ G V +GN K+ G G++ IK N G LK+VRH+P+L
Sbjct: 300 ASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDL 359
Query: 350 TKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKT-AGACH--LIAVATN 406
NLIS L +GY + F W+++KG++ IA+G GTLY+T A C L A
Sbjct: 360 RMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGELNAAQDE 419
Query: 407 ENPNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKK 466
+ +LWHKR+GHMSEKG++++ K + + + C+ C+ GKQ RVSFQTS K
Sbjct: 420 ISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSER-KL 478
Query: 467 EKLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVEN 526
L+LV+SDV GP + S+GG YFVTFIDD SRK+WVY LK K +VF+ F+++ A+VE
Sbjct: 479 NILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVER 538
Query: 527 ETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARS 586
ET K+K+LR+DNGGEY +F+++C HGIR E+TVPGTPQHNGVAERMNRT+ E+ RS
Sbjct: 539 ETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRS 598
Query: 587 LRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVH 646
+ + LPK FW EAV T+ YLINR PSVPL +IPE VW+ KEV SHL+VFGC A+ H
Sbjct: 599 MLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAH 658
Query: 647 ISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKDKHNTTT 706
+ + R KLD KS CIFIGYG++EFGYRLWD KK++RS+DV+F E + +
Sbjct: 659 VPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEK 718
Query: 707 NDSGLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLV-------------HPTP 753
+G+ P +V + +PT + E++E QP + + HPT
Sbjct: 719 VKNGII-PNFVTIPSTSNNPTSAESTTDEVSEQG-EQPGEVIEQGEQLDEGVEEVEHPTQ 776
Query: 754 VPV----LRRSSRPHAPNRRY--IDYMLLTDGGEPEDYDEACQTTDASKWELAMKEEMKS 807
LRRS RP +RRY +Y+L++D EPE E + ++ AM+EEM+S
Sbjct: 777 GEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEEMES 836
Query: 808 LISNQTWELAKLPIGKKALHNKWVYRVKEDHDGSK-RYKARLVVKGFRQKEGIDYTEIFA 866
L N T++L +LP GK+ L KWV+++K+D D RYKARLVVKGF QK+GID+ EIF+
Sbjct: 837 LQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFS 896
Query: 867 PVVKLNTIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCML 926
PVVK+ +IR++LS+ AS +L +EQLDVKTAFLHGDL EEIYM QPEGF GK++MVC L
Sbjct: 897 PVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKL 956
Query: 927 KKSLYGLKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRY-KSSYIILLLYVDDMLVAG 985
KSLYGLKQAPRQWYMKF+SFM + + K +D C +FKR+ ++++IILLLYVDDML+ G
Sbjct: 957 NKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVG 1016
Query: 986 SNIDEIKNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMG 1045
+ I LK LSK FDMKDLGPA++ILGM+I R++ L LS +YI RVL+RFNM
Sbjct: 1017 KDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMK 1076
Query: 1046 DAKLVSTPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVV 1105
+AK VSTPLA H +LS++ P T EEK MAK+PY+SA+GSLMYAMVCTRPDI HAVGVV
Sbjct: 1077 NAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVV 1136
Query: 1106 SRFMSNPGKAHWEAVKWILRYLRGTTEKCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGY 1165
SRF+ NPGK HWEAVKWILRYLRGTT CL FG + ++GY DAD AG++D+R+S+TGY
Sbjct: 1137 SRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGY 1196
Query: 1166 IFTVGTRSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSD 1225
+FT ++SW S++QK VALSTTE EY+A TE KE+IWL+ L ELG Q++ +Y D
Sbjct: 1197 LFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYVVYCD 1256
Query: 1226 SQSAIHLAKNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKL 1285
SQSAI L+KNS +H+RTKHI +RYH+IR +++DE L ++KI ++NPADMLTKVV +K
Sbjct: 1257 SQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKF 1316
Query: 1286 KLCSTLVGL 1294
+LC LVG+
Sbjct: 1317 ELCKELVGM 1325
>pir||T02206 hypothetical protein - common tobacco retrotransposon Tto1
gi|1167523|dbj|BAA11674.1| ORF(AA 1-1338) [Nicotiana
tabacum]
Length = 1338
Score = 1219 bits (3154), Expect = 0.0
Identities = 638/1332 (47%), Positives = 884/1332 (65%), Gaps = 52/1332 (3%)
Query: 7 KIEKFDGADFGFWKMQIEDYLYQKKLHQPL-TEKKPDSMKDDEWSLLDRQALGVVRLSLS 65
K+ +G ++ W+ +++D L+ K+H P+ + +KP+ D++W Q G +R +
Sbjct: 7 KMVNLNGTNYHLWRNKMKDLLFVTKMHLPVFSSQKPEDKSDEDWEFEHNQVCGYIRQFVE 66
Query: 66 RNVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELNIV 125
NV +I+ L L +Y +NK+ + +L ++ EG +VA H+NE+ +
Sbjct: 67 DNVYNHISGVTHARSLWDKLEELYASKTGNNKLFYLTKLMQVKYVEGTTVADHLNEIQGI 126
Query: 126 TTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLVLSEEIRR 185
QLS +GI+FDDEV AL++L++LP+SW + ++++S+ + + + V+ +L+EE+RR
Sbjct: 127 VDQLSGMGIKFDDEVLALMVLATLPESWETLKVSITNSAPNGVVNMETVKSGILNEEMRR 186
Query: 186 RELGESSSSSVLHTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKTGHF 245
R G SSS S + + TRG + KS++ R KS+ S+ ++EC C K GH
Sbjct: 187 RSQGTSSSQSEVLAVT-----TRGRSQNKSQSNRDKSRG--KSNKFANVECHYCKKKGHI 239
Query: 246 KNQCRLPTKNQEEK-------DEANVASTSGGG------DALICSLESKEESWVLDSGAS 292
K CR +Q++ +E++ T+ G D I +L ++E +WV+DSGA+
Sbjct: 240 KRFCRQFQNDQKKNKGKKVKPEESSDDETNSFGEFNVVYDDDIINLTTQEMTWVIDSGAT 299
Query: 293 FHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIK-LNGSVWELKNVRHIPNLTK 351
HA+ ++E F +Y G+ G+V +GN VVGKG+V ++ +NG L++VRH+P++
Sbjct: 300 IHATPRRELFSSYTLGDFGRVKMGNANFSTVVGKGDVCLETMNGMKLLLRDVRHVPDMRL 359
Query: 352 NLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKTAGAC--HLIAVATNE-N 408
NLISV +L +EGY FH WK++KG++ +ARG K LY T + +I VA N+ N
Sbjct: 360 NLISVDKLDEEGYCNTFHNGQWKLTKGSLMVARGTKQSKLYVTQASISQQVINVAENDSN 419
Query: 409 PNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKKEK 468
LWH+RLGHMSEK M + K LP L I++ C DC+ GKQ RVSF+ + ++
Sbjct: 420 IKLWHRRLGHMSEKSMARLVKKNALPGLNQIQLKKCADCLAGKQNRVSFKRFPPSRRQNV 479
Query: 469 LELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVENET 528
L+LVHSDV GP S+GG YFVTFIDDHSRK WVY LK K +VF+ FK++ +VE ET
Sbjct: 480 LDLVHSDVCGPFK-KSLGGARYFVTFIDDHSRKTWVYTLKTKDQVFQVFKQFLTLVERET 538
Query: 529 DLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLR 588
K+K +RTDNGGEY+ +F +C EHGIR + T P TPQ NG+AERMNRTL ER R L
Sbjct: 539 GKKLKCIRTDNGGEYQG-QFDAYCKEHGIRHQFTPPKTPQLNGLAERMNRTLIERTRCLL 597
Query: 589 VQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHIS 648
S LPK FW EA+ T+AY++N P VPL++K PE++W G+++ LRVFGC AYVH+
Sbjct: 598 SHSKLPKAFWGEALVTAAYVLNHSPCVPLQYKAPEKIWLGRDISYDQLRVFGCKAYVHVP 657
Query: 649 DQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKD--KHNTTT 706
R+KLD K+++C+FIGYG+D GY+ +D KK+VRS+DV+F E +D K +T
Sbjct: 658 KDERSKLDVKTRECVFIGYGQDMLGYKFYDPVEKKLVRSRDVVFVEDQTIEDIDKVEKST 717
Query: 707 NDSG------------LSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLVHPTPV 754
+DS + + V + PG P + E E + +D P P
Sbjct: 718 DDSAEFELPPTVVPRQVGDDVQDNQPEAPGLPNEDELADTEGNEDNGDDDADEEDQPQP- 776
Query: 755 PVLR------RSSRPHAPNRRYI--DYMLLTDGGEPEDYDEACQTTDASKWELAMKEEMK 806
P+L RS R + RY +Y+LLTDGGEP+ ++EA KW AM++E+K
Sbjct: 777 PILNNPPYHTRSGRVVQQSTRYSPHEYVLLTDGGEPDSFEEAIDDEHKEKWIEAMQDEIK 836
Query: 807 SLISNQTWELAKLPIGKKALHNKWVYRVKED-HDGSKRYKARLVVKGFRQKEGIDYTEIF 865
SL N+T+EL KLP GK+AL NKWV+++K D H+ R+KARLVVKGF Q++GID+ EIF
Sbjct: 837 SLHENKTFELVKLPKGKRALKNKWVFKMKHDEHNSLPRFKARLVVKGFNQRKGIDFDEIF 896
Query: 866 APVVKLNTIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCM 925
+PVVK+ +IR+VL + AS NL +EQ+DVKTAFLHGDL EEIYM QP+GF ++GKE+ VC
Sbjct: 897 SPVVKMTSIRTVLGLAASLNLEVEQMDVKTAFLHGDLEEEIYMEQPDGFQQKGKEDYVCR 956
Query: 926 LKKSLYGLKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRYKSS-YIILLLYVDDMLVA 984
L+KSLYGLKQAPRQWY KFES M + G++K +DHC F +++ +IILLLYVDDML+
Sbjct: 957 LRKSLYGLKQAPRQWYKKFESVMGQHGYKKTTSDHCVFAQKFSDDDFIILLLYVDDMLIV 1016
Query: 985 GSNIDEIKNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNM 1044
G N+ I +LK QLSK F MKDLGPAK+ILGM+I RD++ L LS +YI +VLQRFNM
Sbjct: 1017 GRNVSRINSLKEQLSKFFAMKDLGPAKQILGMRIMRDREAKKLWLSQEKYIEKVLQRFNM 1076
Query: 1045 GDAKLVSTPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGV 1104
K VS PLA+HFRLS +QSP T++E+ M +IPYASA+GSLMYAMVCTRPDI HAVGV
Sbjct: 1077 EKTKAVSCPLANHFRLSTKQSPSTDDERRKMERIPYASAVGSLMYAMVCTRPDIAHAVGV 1136
Query: 1105 VSRFMSNPGKAHWEAVKWILRYLRGTTEKCLYFGKGEIKVEGYVDADFAGEVDHRRSTTG 1164
VSRF+SNPGK HW+AVKWILRYLRGT++ CL FG+ + GY DAD AG+VD R+ST+G
Sbjct: 1137 VSRFLSNPGKEHWDAVKWILRYLRGTSKLCLCFGEDNPVLVGYTDADMAGDVDSRKSTSG 1196
Query: 1165 YIFTVGTRSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYS 1224
Y+ +VSW S++QK VALSTTE E++A TEA KELIW++ LTELGF Q+ L+
Sbjct: 1197 YLINFSGGAVSWQSKLQKCVALSTTEAEFIAATEACKELIWMKKFLTELGFSQDGYQLFC 1256
Query: 1225 DSQSAIHLAKNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDK 1284
DSQSAIHLAKN++FHSR+KHI +RY++IR +LE ++L L KI +N +DMLTK + K
Sbjct: 1257 DSQSAIHLAKNASFHSRSKHIDVRYNWIRDVLEKKMLRLEKIHTDENGSDMLTKTLPKGK 1316
Query: 1285 LKLCSTLVGLLE 1296
+ C G+++
Sbjct: 1317 FEFCREAAGIVD 1328
>gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]
Length = 1415
Score = 1152 bits (2980), Expect = 0.0
Identities = 615/1315 (46%), Positives = 848/1315 (63%), Gaps = 69/1315 (5%)
Query: 10 KFDGADFGFWKMQIEDYLYQKKLHQPL-TEKKPDSMKDDEWSLLDRQALGVVRLSLSRNV 68
+ +G ++ WK +++D L+ KKLH P+ KP++M D+EW +Q G +R + NV
Sbjct: 10 RLNGRNYHIWKAKMKDLLFVKKLHLPVFASAKPENMSDEEWDFEHQQVCGYIRQWVEDNV 69
Query: 69 AFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELNIVTTQ 128
+I E L L ++Y +NK+ L++++ +R EG + H+N+ V Q
Sbjct: 70 LNHIINETHARSLWNKLETLYASKTGNNKLFLLKQMMNIRYREGTLINDHVNDFQGVLDQ 129
Query: 129 LSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLVLSEEIRRREL 188
LS +GI+F+DEV L LL++LPDSW ++++S+ + + + V+ +L+EE RRR
Sbjct: 130 LSGMGIKFEDEVLGLWLLNTLPDSWETFRVSLTNSAPNGVVTMEYVKSGILNEEARRRSQ 189
Query: 189 GESSSSS-VLHTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKTGHFKN 247
S+S S +L T+ RGRN +G RG+ K+R SKS++ K IEC CGK H K
Sbjct: 190 DTSTSQSDILVTDDRGRNKQKGQ-RGRDKSR-SKSRSRY-----KDIECHYCGKKSHIKK 242
Query: 248 QC----RLPTKNQEEKDEANVASTSGGGDALIC-----SLESKEESWVLDSGASFHASSQ 298
R ++ ++ D N +T + C ++ E +W++DSGA++H + +
Sbjct: 243 YSFKWKREKKQDNKDGDTGNQVATVRADLLVACDDNVINVACHETTWIVDSGAAYHVTPR 302
Query: 299 KEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIKL-NGSVWELKNVRHIPNLTKNLISVG 357
KEFF +Y PG+ G++ +GN+ KV G G V ++ NG+ LKNV+H P++ NLIS G
Sbjct: 303 KEFFTSYTPGDFGELRMGNDGQVKVTGTGTVCLETSNGTKLVLKNVKHAPDIRLNLISTG 362
Query: 358 QLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKTAGACHLIAVATNENP---NLWHK 414
+L D+G+ F WKI+KG++ +ARG KS LY + +V E LWHK
Sbjct: 363 KLDDDGFCCFFGDGHWKITKGSLVVARGNKSSNLYSLQSSVSDDSVNVVEKECASELWHK 422
Query: 415 RLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKKEKLELVHS 474
RLGHMS KG+ + K KL ++ ++D C C+ GKQ+RVSF + T K E L+L+HS
Sbjct: 423 RLGHMSVKGIDYLAKKSKLSGVKEAKLDKCVHCLAGKQRRVSFMSHPPTRKSEPLDLIHS 482
Query: 475 DVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVENETDLKIKK 534
DV GP V S+GG YFVTFIDD+SRK+WVY LKHKS+V FK + A+VE +T K+K
Sbjct: 483 DVCGPMKVRSLGGASYFVTFIDDYSRKLWVYTLKHKSDVLGVFKEFHALVERQTGKKLKC 542
Query: 535 LRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQSGLP 594
+RTDNGGEY F ++C +GIR ++T P PQ NG+AERMNRT+ ER R + + LP
Sbjct: 543 IRTDNGGEYCGP-FDEYCRRYGIRHQKTPPKIPQLNGLAERMNRTIMERVRCMLDDAKLP 601
Query: 595 KKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNK 654
FWAEAV+T+ ++IN P + L++++P++VW GK+V HLRVFGC A+VH+ R+K
Sbjct: 602 SSFWAEAVSTAVHVINLSPVIALKNEVPDKVWCGKDVSYDHLRVFGCKAFVHVPRDERSK 661
Query: 655 LDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKDKHNTTTNDSGLSEP 714
LD K+++CIFIGYG DEFGYRL+D KK+VRS+DV+F E +D +S S
Sbjct: 662 LDSKTRQCIFIGYGFDEFGYRLYDPVEKKLVRSRDVVFFENQTIEDIDKVKQPESRDSGS 721
Query: 715 VYVEMDDVPGSPTDKSPQSGELAESSIRQP------------SDTLVH-----PTPVPV- 756
+ V+++ V TD + E ++ P +D +VH P+ VPV
Sbjct: 722 L-VDIEPVSRRYTDDVDEVQENVQNGDPVPDYQGDTVDVDGHADDVVHQEQEVPSQVPVD 780
Query: 757 -LRRSSRPHAPNRRYI--DYMLLTDGGEPEDYDEACQTTDASKWELAMKEEMKSLISNQT 813
RRS R P+ RY Y+LLTDGGEPE Y+EA ++ +W AM+EEM SL N T
Sbjct: 781 LPRRSDRERRPSTRYSPSQYVLLTDGGEPESYEEAMESDQKRQWFEAMQEEMNSLYVNDT 840
Query: 814 WELAKLPIGKKALHNKWVYRVKEDHDGS-KRYKARLVVKGFRQKEGIDYTEIFAPVVKLN 872
+EL K P +KAL N+WVYRVK + S R+KARLVVKGF QK+GID+ EIF+PVVK +
Sbjct: 841 FELVKAPKNRKALKNRWVYRVKHEEGTSVPRFKARLVVKGFSQKKGIDFDEIFSPVVKFS 900
Query: 873 TIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYG 932
+IR VL + A ++ +EQ+DVKTAFLHGDL EEIYM QPEGF +GKE+ VC LKKSLYG
Sbjct: 901 SIRVVLGLAARLDIEIEQMDVKTAFLHGDLDEEIYMEQPEGFKVKGKEDYVCRLKKSLYG 960
Query: 933 LKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRYKSS-YIILLLYVDDMLVAGSNIDEI 991
LKQAPRQWY KF S M K G++K ++DHC F RY ++ILLLYVDDML+ G N I
Sbjct: 961 LKQAPRQWYKKFTSVMSKHGYKKTSSDHCVFVNRYSDDDFVILLLYVDDMLIVGRNASRI 1020
Query: 992 KNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVS 1051
+ LK +LSK F MKD+GPAK+ILGM+I RD+Q L LS +YI +VL+RF+M +AK VS
Sbjct: 1021 QELKQELSKSFSMKDMGPAKQILGMKIIRDRQNKKLWLSQEKYIEKVLERFHMNEAKPVS 1080
Query: 1052 TPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSN 1111
TPL HF+L ++Q P +E+EKE M ++PY+SA+GSLMYAMVCTRPDI HAVGVVSRF+SN
Sbjct: 1081 TPLDMHFKLCKKQCPSSEKEKEEMQRVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLSN 1140
Query: 1112 PGKAHWEAVKWILRYLRGTTEKCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGYIFTVGT 1171
PG+ HW+AVKWILRYLRGT+ L FG G+ + GY D+D AG++D R+ST+GY+ T
Sbjct: 1141 PGREHWDAVKWILRYLRGTSSLSLCFGTGKPILTGYTDSDMAGDIDTRKSTSGYLITYAG 1200
Query: 1172 RSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIH 1231
+VSW SR+QK V LSTTE E++A EASKE++W++ L ELGF+Q+
Sbjct: 1201 GAVSWQSRLQKCVDLSTTEAEFIASVEASKEMLWMKKFLQELGFVQD------------- 1247
Query: 1232 LAKNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLK 1286
R+KHI RYH+IR +LE ++L L KI N +DM+TK + K +
Sbjct: 1248 ---------RSKHIDTRYHWIRDILECKMLELEKIHTDDNGSDMMTKALPRGKFE 1293
>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
Length = 1298
Score = 1125 bits (2910), Expect = 0.0
Identities = 595/1318 (45%), Positives = 849/1318 (64%), Gaps = 54/1318 (4%)
Query: 7 KIEKFDGADFGFWKMQIEDYLYQKKLHQPLTEKKPDSMKDDEWSLLDRQALGVVRLSLSR 66
+IEKF+G +F WK++++ L + ++E+ D D +WS ++ A+ + LS++
Sbjct: 6 EIEKFNGKNFSLWKLKVKAILRKDNCLAAISERPVDFTDDKKWSEMNEDAMADLYLSIAD 65
Query: 67 NVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELNIVT 126
V +I ++KT + L+ +YE NK+ L R+L+TLRM+E SV +H+N LN +
Sbjct: 66 GVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTEHLNTLNTLF 125
Query: 127 TQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLVLSEEIRRR 186
+QL+S+ + + + RA +LL SLPDS+ ++ ++++ + + FDDV VL EE RR+
Sbjct: 126 SQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDYLVFDDVAAAVLEEESRRK 185
Query: 187 ELGESS---SSSVLHTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKTG 243
+ + T RGR++ RG G+ +++ SK K++ C+NCGK G
Sbjct: 186 NKEDRQVNLQQAEALTVMRGRSTERGQSSGRGRSKSSK----------KNLTCYNCGKKG 235
Query: 244 HFKNQCRLPTKNQEEKDEANVASTSGGGDALICSLESKEES-------WVLDSGASFHAS 296
H K C +N + NVASTS G AL C E W++DSGA++H +
Sbjct: 236 HLKKDCWNLAQNSNP--QGNVASTSDDGSALCCEASIAREGRKRFADIWLIDSGATYHMT 293
Query: 297 SQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIKL-NGSVWELKNVRHIPNLTKNLIS 355
S+KE+F +Y P + G VY ++ + +++G G +K+K+ +G+V +++VRH+ L KNL+S
Sbjct: 294 SRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGLKKNLLS 353
Query: 356 VGQLADEGYTTVFHGDDWKISKGAMTIARGRK-SGTLYKTAG-----ACHLIAVATNENP 409
G L + KI +GA+ + +G K + LY G A +A + ++
Sbjct: 354 YGILDNSATQIETQKGVMKIFQGALVVMKGEKIAANLYMLKGETLQEAEASVAACSPDST 413
Query: 410 NLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKKEKL 469
LWH++LGHMS++GMK++ + +P L + + +CE CI KQ R+ F TS + K L
Sbjct: 414 LLWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFSTSN-SRGKVVL 472
Query: 470 ELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVENETD 529
ELVHSDVW VPS+GG YFV+FIDD+SR+ WVY +K KS+VF FK +KA VE ++
Sbjct: 473 ELVHSDVW-QAPVPSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAFKARVELDSG 531
Query: 530 LKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRV 589
KIK RTDNGGEY +F FC + GI+ + TV TPQ NGVAERMNRTL ER R++
Sbjct: 532 KKIKCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYTPQQNGVAERMNRTLLERTRAMLR 591
Query: 590 QSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISD 649
+GL K FWAEAVNT+ YL+NR PS +E K P E+W+GK V S+L +FG + Y +
Sbjct: 592 AAGLEKSFWAEAVNTACYLVNRAPSTAIELKTPMEMWTGKPVDYSNLHIFGSIVYAMYNA 651
Query: 650 QGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNE-RVMYKDKHNTTTND 708
Q KLDPKS+KC F+GY + GYRLWD K+V S+DVIF E R+ + ++T +
Sbjct: 652 QEITKLDPKSRKCRFLGYADGVKGYRLWDPTAHKVVISRDVIFVEDRLQRGEVDDSTEKE 711
Query: 709 SGLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLVHPTPVPVLRRSSRPHAPNR 768
+ + VE ++ Q AE + +P + P R+S R
Sbjct: 712 KPETTQIQVE---------EEFEQDSSEAEPAHEEPEP---ESSGAPTTRQSDREKRRPT 759
Query: 769 RYIDYM--------LLTDGGEPEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLP 820
+ DY+ LLT+ GEP + EA ++D S+W AM+EE+++L N TW+L LP
Sbjct: 760 WHSDYVMEGNVAYCLLTEDGEPSTFQEAINSSDVSQWTAAMQEEIEALHKNNTWDLVPLP 819
Query: 821 IGKKALHNKWVYRVKEDHDGS-KRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLS 879
G+K + NKWV+++K + D +RY+ARLVVKG+ QKEGID+ EIF+PVV+L T+R VL+
Sbjct: 820 QGRKPIGNKWVFKIKRNGDDQVERYRARLVVKGYAQKEGIDFNEIFSPVVRLTTVRVVLA 879
Query: 880 IVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQ 939
+ A+ NL+LEQLDVKTAFLHGDL EEIYM QPEGF ++ +N+VC L KSLYGLKQAPR
Sbjct: 880 MCATFNLHLEQLDVKTAFLHGDLEEEIYMLQPEGFEDKENQNLVCRLNKSLYGLKQAPRC 939
Query: 940 WYMKFESFMHKEGFQKCNADHCCFFKRY-KSSYIILLLYVDDMLVAGSNIDEIKNLKIQL 998
WY +F+SF+ G+ + NAD C +FKR+ K +++ILLLYVDDMLVAG N D I LK QL
Sbjct: 940 WYKRFDSFIMCLGYNRLNADPCAYFKRFGKDNFVILLLYVDDMLVAGPNKDHIDELKAQL 999
Query: 999 SKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHF 1058
++EF+MKDLGPA KILGMQI RD+ + LS Y+ ++L RF+M D K +STPL +
Sbjct: 1000 AREFEMKDLGPANKILGMQIHRDRGNRKIWLSQKNYLKKILSRFSMQDCKSISTPLPINL 1059
Query: 1059 RLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWE 1118
++S SP EE + M+++PYASA+GSLM+AM+CTRPDI AVGVVSR+M+NPG+ HW
Sbjct: 1060 KVSSSMSPSNEEGRMEMSRVPYASAVGSLMFAMICTRPDIAQAVGVVSRYMANPGREHWN 1119
Query: 1119 AVKWILRYLRGTTEKCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMS 1178
VK ILRY++GT++ L +G + + GYVD+D+AG++D +STTGY+F V +VSW+S
Sbjct: 1120 CVKRILRYIKGTSDVALCYGGSDFIINGYVDSDYAGDLDKSKSTTGYVFKVAGGAVSWVS 1179
Query: 1179 RIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAF 1238
++Q +VA STTE EYVA T+ASKE IWL+ LL ELG QE +L+ DSQSA+HLA+N AF
Sbjct: 1180 KLQAVVATSTTEAEYVAATQASKEAIWLKMLLEELGHKQEFVSLFCDSQSALHLARNPAF 1239
Query: 1239 HSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKLCSTLVGLLE 1296
HSRTKHI ++YHFIR +++ + L KI + N AD LTK++ +DK C + GL+E
Sbjct: 1240 HSRTKHIRVQYHFIREKVKEGTVDLQKIHTADNVADFLTKIINVDKFTWCRSSCGLIE 1297
>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
Length = 1328
Score = 1121 bits (2899), Expect = 0.0
Identities = 613/1335 (45%), Positives = 852/1335 (62%), Gaps = 63/1335 (4%)
Query: 7 KIEKFDGAD--FGFWKMQIEDYLYQKKLHQPL--TEKKPDSMKDDEWSLLDRQALGVVRL 62
++ KF+G F W+ +++D L Q+ LH+ L KKP+SMK ++W LD +A +RL
Sbjct: 7 EVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAASAIRL 66
Query: 63 SLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINEL 122
L+ +V NI E++ G+ L ++Y +NK++L ++L+TL M EG + H+N L
Sbjct: 67 HLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSHLNVL 126
Query: 123 NIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLVLSEE 182
N + TQL+++G++ ++E + ++LL+SLP S+ + T + S ++K D L+L+E+
Sbjct: 127 NGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLK-DVTSALLLNEK 185
Query: 183 IRRRELGESSSSSVLHTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKT 242
+R++ + V TESRGR+ R + R KSK S K+ C+NC +
Sbjct: 186 MRKKP---ENHGQVFITESRGRSYQRSSSNYGRSGARGKSKVRSKS---KARNCYNCDQP 239
Query: 243 GHFKNQCRLPTKNQEE----KDEANVASTSGGGDALICSLESKEE---------SWVLDS 289
GHFK C P + + E K++ N A+ D ++ + +EE WV+D+
Sbjct: 240 GHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTESEWVVDT 299
Query: 290 GASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIKLN-GSVWELKNVRHIPN 348
AS+HA+ ++ F YV G+ G V +GN K+ G G++ K N G LK+VRH+P+
Sbjct: 300 AASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLKDVRHVPD 359
Query: 349 LTKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKT-AGACH--LIAVAT 405
L NLIS L +GY F W+++KGA+ IA+G GTLY+T A C L A
Sbjct: 360 LRMNLISGIALDQDGYENYFANQKWRLTKGALVIAKGVARGTLYRTNAEICQGELNAAHE 419
Query: 406 NENPNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPK 465
+ +LWHKR+GH SEKG++++ K + + I C + GKQ RVSFQTS K
Sbjct: 420 ENSADLWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHRVSFQTSSER-K 478
Query: 466 KEKLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVE 525
L+LV+SDV GP + S+GG YFVTFIDD SRK+WVY + K +VF+ F+++ A+VE
Sbjct: 479 SNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQVFQKFHALVE 538
Query: 526 NETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERAR 585
ET K K+LRTDNGGEY +F+++C HGIR E+TVPGTPQHNGVAERMNRT+ E+ R
Sbjct: 539 RETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGTPQHNGVAERMNRTIVEKVR 598
Query: 586 SLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYV 645
S+ + LPK FW EAV T+ YLINR PSVPLE IPE VW+ KE+ SHL+VFGC A+
Sbjct: 599 SMLRMAKLPKTFWGEAVRTACYLINRSPSVPLEFDIPERVWTNKEMSYSHLKVFGCKAFA 658
Query: 646 HISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVM-------- 697
H+ + R KLD KS CIFIGYG++EFGYRLWD KK++RS+DVIF E +
Sbjct: 659 HVPKEQRTKLDDKSVPCIFIGYGDEEFGYRLWDLVKKKVIRSRDVIFRESEVGTAADLSE 718
Query: 698 -YKDKH----NTTTNDSGLSEPV--------YVEMDDVPGSPTDKSPQSGELAESSIRQP 744
K K+ N T S + P VE ++ P ++ Q G+ E + P
Sbjct: 719 KAKKKNGIIPNLVTIPSSSNHPTSAESTIDEVVEQEEQPDEIVEQGEQLGDNTE-QMEYP 777
Query: 745 SDTLVHPTPVPVLRRSSRPHAPNRRY--IDYMLLTDGGEPEDYDEACQTTDASKWELAMK 802
+ P LRRS R + +Y +Y+L+ GEPE+ E + S+W AM
Sbjct: 778 EEEQSQP-----LRRSERQRVESTKYPSSEYVLIKYEGEPENLKEVLSHPEKSQWMKAMH 832
Query: 803 EEMKSLISNQTWELAKLPIGKKALHNKWVYRVKEDHDGS-KRYKARLVVKGFRQKEGIDY 861
EEM SL N T++L +LP GK+ L KWV+++K+D +G RYKARLVVKGF QK+GID+
Sbjct: 833 EEMGSLQKNGTYQLVELPKGKRPLKCKWVFKLKKDGNGKLVRYKARLVVKGFEQKKGIDF 892
Query: 862 TEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKEN 921
EIF+PVVK+ +IR++LSI AS +L +EQLDVKTAFLHGDL EEIYM Q EGF GK++
Sbjct: 893 DEIFSPVVKMTSIRTILSIAASLDLEVEQLDVKTAFLHGDLEEEIYMEQGEGFEVSGKKH 952
Query: 922 MVCMLKKSLYGLKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRYK-SSYIILLLYVDD 980
MVC L KSLYGLKQAPRQWY KF+SFM + ++ + C +FKR+ ++IILLLY D
Sbjct: 953 MVCKLNKSLYGLKQAPRQWYKKFDSFMKSQTYRNTYSHPCVYFKRFSDKNFIILLLYTDY 1012
Query: 981 MLVAGSNIDEIKNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQ 1040
ML+ G + + I L+ SK FDMKDLGPAK+ILGM+I R++QK L LS +YI RVL+
Sbjct: 1013 MLIVGKDKELIAKLRKDFSKSFDMKDLGPAKQILGMKIAREEQK-KLGLSHEKYIERVLE 1071
Query: 1041 RFNMGDAKLVSTPLASHFRLSQEQSPQTEE-EKELMAKIPYASAIGSLMYAMVCTRPDIG 1099
RFNM AK +STPL S+ +L+++ P ++ EK MAK+PY+SA+GS MYAMVCTRP+I
Sbjct: 1072 RFNMKSAKPISTPLVSYLKLTKQMFPTKKKGEKGDMAKVPYSSAVGSFMYAMVCTRPNI- 1130
Query: 1100 HAVGVVSRFMSNPGKAHWEAVKWILRYLRGTTEKCLYFGKGEIKVEGYVDADFAGEVDHR 1159
AV VVSRF+ PGK H EAVKWILRYLR TT F + +GY + D G++D+R
Sbjct: 1131 VAVCVVSRFLEIPGKEHLEAVKWILRYLRRTTRDYFCFEGSDPISKGYTNVDMEGDLDNR 1190
Query: 1160 RSTTGYIFTVGTRSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEK 1219
+STT Y+FT +SW S++QK VALSTTE +Y+A TE KE++WL+ L E G Q++
Sbjct: 1191 KSTTCYLFTFSGGDISWQSKLQKYVALSTTEAKYIAGTEVCKEMLWLKRFLQEHGLHQKE 1250
Query: 1220 SALYSDSQSAIHLAKNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKV 1279
+Y +SQSA+ L+K + +H+ TKHI +RYH+IR +++D L ++KI S+NPADM+TKV
Sbjct: 1251 YVVYCESQSAMDLSKKAMYHATTKHIDMRYHWIREMVDDGSLQVVKIPTSENPADMVTKV 1310
Query: 1280 VTIDKLKLCSTLVGL 1294
V +K +L LVG+
Sbjct: 1311 VQNEKFELWKELVGM 1325
>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
(japonica cultivar-group)]
Length = 2340
Score = 1030 bits (2664), Expect = 0.0
Identities = 580/1332 (43%), Positives = 804/1332 (59%), Gaps = 54/1332 (4%)
Query: 16 FGFWKMQIEDYLYQKKLHQPLT--EKKPDSMKDDEWSLLDRQALGVVRLSLSRNVAFNIA 73
F W++++ L Q+ L L+ +K+ +DE DR+A+ + L LS N+ +
Sbjct: 224 FSLWQVKMRAVLAQQDLDDALSGFDKRTHDWSNDE-KKRDRKAMSYIHLHLSNNILQEVL 282
Query: 74 KEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELNIVTTQLSSVG 133
KE+ AGL L + ++K+HL + LF ++ + SV H++ + L S+
Sbjct: 283 KEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSAFKEIIADLESME 342
Query: 134 IEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRD-LVLSEEIRRRELGESS 192
+++D+E LILL SLP S++ + S + +K +V D L + E++++ E S
Sbjct: 343 VKYDEEDLGLILLCSLPSSYANFRDTILYSRDTLTLK--EVYDALHVKEKMKKMVPSEGS 400
Query: 193 SSSV-----------LHTESRGRNSTRGNGRGKSKAR-RSKSKNHRSSHNSKSIECWNCG 240
+S +T+++ R+ + + RG+SK+R R KS + ECW
Sbjct: 401 NSQAEGLIVWGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYCKRDGHDIFECWKLH 460
Query: 241 KTGHFKNQCRLPTKNQEEKDEANVASTSGGGDALIC--SLESKEESWVLDSGASFHASSQ 298
+ K +EE A V + L+ + W+L++ +H
Sbjct: 461 DKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTSDQWILNTACIYHMCPN 520
Query: 299 KEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIKL-NGSVWELKNVRHIPNLTKNLISVG 357
+++F Y +G V +G++ C+V G G V+IK+ +G + L +VRHIPNL ++LIS+
Sbjct: 521 RDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRHIPNLKRSLISLC 580
Query: 358 QLADEGYTTVFHGDDW--KISKGAMTIARGR-KSGTLYKTAGACHLIAVAT-------NE 407
L +GY + G D K++KG++ + + KS LY G L VA ++
Sbjct: 581 TLDRKGYK--YSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDSLSNSD 638
Query: 408 NPNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKKE 467
NLWH RLGHM+E G+ + +G L ++ CE CI GK KRV F TS T +
Sbjct: 639 ATNLWHMRLGHMTEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGI 698
Query: 468 KLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVENE 527
L+ VHSD+WGP S GG Y +T +DD+SRKVW YFLKHK + F+ FK WK MVE +
Sbjct: 699 -LDYVHSDLWGPARKTSFGGTRYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQ 757
Query: 528 TDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSL 587
T+ K+K LRTDNG E+ FK +C GI TVP TPQ NGVAERMNRT+ +AR L
Sbjct: 758 TERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCL 817
Query: 588 RVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHI 647
+GLPK+FWAEAV+T+ YLINR PS ++ K P EVWSG S LRVFGC AY H+
Sbjct: 818 LSNAGLPKQFWAEAVSTACYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTAYAHV 877
Query: 648 SDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKDKHNTTTN 707
+ KL+P++ KCIF+GY GY+LW E KK+V S++V+F+E VM DK +T
Sbjct: 878 DN---GKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHESVMLHDKPSTNVP 934
Query: 708 -DSGLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLVHPTPVPVLR--RSSRPH 764
+S V VE G +K + E I + ++V +P + R R
Sbjct: 935 VESQEKASVQVEHLISSGHAPEKEDVAINQDEPVIEDSNSSIVQQSPKRSIAKDRPKRNI 994
Query: 765 APNRRYID------YML-----LTDGGEPEDYDEACQTTDASKWELAMKEEMKSLISNQT 813
P +RYI+ Y L + EP Y EA + D ++W AM +EM+SL N T
Sbjct: 995 KPPQRYIEEANIVAYALSVAEEIEGNAEPSTYSEAIVSDDCNRWITAMHDEMESLEKNHT 1054
Query: 814 WELAKLPIGKKALHNKWVYRVKEDHDGSK--RYKARLVVKGFRQKEGIDYTEIFAPVVKL 871
WEL KLP KK + KW+++ KE S RYKARL+ KG+ Q GID+ ++F+PVVK
Sbjct: 1055 WELVKLPKEKKPIRCKWIFKRKEGISSSDEARYKARLIAKGYSQIPGIDFNDVFSPVVKH 1114
Query: 872 NTIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLY 931
++IR++LSIVA + LEQ+DVKTAFLHG+L E+IYM QP+GF+ GKEN+VC LKKSLY
Sbjct: 1115 SSIRTLLSIVAMHDYELEQMDVKTAFLHGELEEDIYMEQPKGFVVPGKENLVCRLKKSLY 1174
Query: 932 GLKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEI 991
GLKQ+PRQWY +F+SFM + F++ N D C + K S I LLLYVDDML+A + EI
Sbjct: 1175 GLKQSPRQWYKRFDSFMLSQKFRRSNYDSCVYLKVVDGSAIYLLLYVDDMLIAAKDKSEI 1234
Query: 992 KNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVS 1051
LK QLS EF+MKDLG AKKILGM+ITR + L LS YI +VL+RFNM DAK VS
Sbjct: 1235 AKLKAQLSSEFEMKDLGAAKKILGMEITRKRHSFKLYLSQKGYIEKVLRRFNMHDAKPVS 1294
Query: 1052 TPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSN 1111
TPLA+HFRLS + PQ++ + E M+++PY+SA+GSLMYAMVC+RPD+ HA+ VVSR+M+N
Sbjct: 1295 TPLAAHFRLSSDLCPQSDYDIEYMSRVPYSSAVGSLMYAMVCSRPDLSHALSVVSRYMAN 1354
Query: 1112 PGKAHWEAVKWILRYLRGTTEKCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGYIFTVGT 1171
PGK HW+AV+WI RYLRGT+ CL FG+ + GYVD+DFAG++D RS GY+FT+G
Sbjct: 1355 PGKEHWKAVQWIFRYLRGTSSACLQFGRSRDGLVGYVDSDFAGDLDRGRSLAGYVFTIGG 1414
Query: 1172 RSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIH 1231
+VSW + +Q VALSTTE EY+A++EA KE IWL+GL T L + ++ DSQSAI
Sbjct: 1415 CAVSWKASLQATVALSTTEAEYMAISEACKEAIWLRGLYTVLCAVTSCINIFCDSQSAIC 1474
Query: 1232 LAKNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKLCSTL 1291
L K+ FH RTKHI +RYHFIR L+ + + + KI NPADM+TK V K +LCS+L
Sbjct: 1475 LTKDQMFHERTKHIDVRYHFIRGLIAEGDVKICKISIHDNPADMMTKPVPATKFELCSSL 1534
Query: 1292 VGLLE*EA-GEG 1302
V L + GEG
Sbjct: 1535 VILSDANVPGEG 1546
>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|53370655|gb|AAU89150.1| integrase core domain
containing protein [Oryza sativa (japonica
cultivar-group)] gi|40538906|gb|AAR87163.1| putative
polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1322
Score = 1008 bits (2605), Expect = 0.0
Identities = 573/1322 (43%), Positives = 793/1322 (59%), Gaps = 59/1322 (4%)
Query: 16 FGFWKMQIEDYLYQKK-LHQPLTE--KKPDSMKDDEWSLLDRQALGVVRLSLSRNVAFNI 72
F W++++ L Q L + L KK + E DR+AL +++L LS ++ +
Sbjct: 17 FSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLIQLHLSNDILQEV 76
Query: 73 AKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELNIVTTQLSSV 132
++KT A L L S+ ++K+H+ +LF+ ++ E SV HI+ + L S+
Sbjct: 77 LQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHISVFKEIVADLVSM 136
Query: 133 GIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLVLSEEIRRRELGESS 192
++FDDE L+LL SLP S++ + S + + L E+++ +S
Sbjct: 137 EVQFDDEDLGLLLLCSLPSSYANFRHTILLSRDELTLA-EVYEALQNREKMKGMVQSYAS 195
Query: 193 SSSVLHTESRGRNSTR----GNGRGKSKAR-RSKSKNHRSSHNSKSIECWNCGKTGHFKN 247
SS + RGR+ R N KS++R RSKS+ + C C K HF
Sbjct: 196 SSKGEALQVRGRSEQRTYNDSNDHDKSQSRGRSKSRGKKF--------CKYCKKKNHFIE 247
Query: 248 QC-RLPTKNQEEKD-EANV---ASTSGGGDALI--CSLESKEESWVLDSGASFHASSQKE 300
+C +L K + + D +A+V A S GD L+ + + W+LD+ SFH ++
Sbjct: 248 ECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASHDEWILDTACSFHICINRD 307
Query: 301 FFKNYVP-GNLGKVYLGNEQSCKVVGKGEVKIKLN-GSVWELKNVRHIPNLTKNLISVGQ 358
+F +Y N V +G++ ++VG G V+IK + G LK+VRHIP + +NLIS+
Sbjct: 308 WFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMARNLISLST 367
Query: 359 LADEGYTTVFHGDDWKISKGAMTIARG-RKSGTLYKTAGACH----LIAVATNENP---N 410
L EGY G K+SKG++ G S LY G+ A T + P N
Sbjct: 368 LDAEGYKYSGSGGVVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAAVTKDEPSKTN 427
Query: 411 LWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKKEKLE 470
LWH RLGHMSE GM + + L + CE C+ GK KRV F TS K L+
Sbjct: 428 LWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTSVHRTKGI-LD 486
Query: 471 LVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVENETDL 530
VH+D+WGP+ PS+GG Y +T IDD+SRK W YFLKHK + F AFK K M+E +T+
Sbjct: 487 YVHADLWGPSRKPSLGGARYMLTIIDDYSRKEWPYFLKHKDDTFAAFKERKVMIERQTEK 546
Query: 531 KIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQ 590
++K L TDNGGE+ F +C + GI T+P TPQ NGVAERMNRT+ +AR +
Sbjct: 547 EVKVLCTDNGGEFCSDAFDDYCRKEGIVRHHTIPYTPQQNGVAERMNRTIISKARCMLSN 606
Query: 591 SGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISDQ 650
+ + K+FWAEA NT+ YLINR PS+PL K P E+WSG S LRVFGC AY H+ +
Sbjct: 607 ARMNKRFWAEAANTACYLINRSPSIPLNKKTPIEIWSGMPADYSQLRVFGCTAYAHVDN- 665
Query: 651 GRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKDKHNTTTNDSG 710
KL+P++ KC+F+GYG GY+LW+ E K S++VIFNE VM+ D T G
Sbjct: 666 --GKLEPRAIKCLFLGYGSGVKGYKLWNPETNKTFMSRNVIFNEFVMFNDSLPTDVIPGG 723
Query: 711 LSEP---VYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLVHPTPVPVL-RRSSRPHAP 766
E V V+++ V T+ G ++ Q S +++ P P+ RR+ R
Sbjct: 724 SDEEQQYVSVQVEHVDDQETEIV---GNDVNDTV-QHSPSVLQPQDEPIAHRRTKRSCGA 779
Query: 767 NRRYIDYMLLT-----------DGGEPEDYDEACQTTDASKWELAMKEEMKSLISNQTWE 815
R I+ + + EP Y EA + D KW A++EEM+SL N TWE
Sbjct: 780 PVRLIEECDMVYYAFSYAEQVENTLEPATYTEAVVSGDREKWISAIQEEMQSLEKNGTWE 839
Query: 816 LAKLPIGKKALHNKWVYRVKEDHDGSK--RYKARLVVKGFRQKEGIDYTEIFAPVVKLNT 873
L LP KK + KW+++ KE S+ R+K RLV KGF Q G+DY ++F+PVVK ++
Sbjct: 840 LVHLPKQKKPVRCKWIFKRKEGLSPSEPPRFKVRLVAKGFSQIAGVDYNDVFSPVVKHSS 899
Query: 874 IRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGL 933
IR+ SIV +L LEQLDVKT FLHG+L EEIYM QPEGF+ GKE+ VC LK+SLYGL
Sbjct: 900 IRTFFSIVTMHDLELEQLDVKTTFLHGELEEEIYMDQPEGFIVPGKEDYVCKLKRSLYGL 959
Query: 934 KQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKN 993
KQ+PRQWY +F+SFM GF++ D C + K S I LLLYVDDML+A + ++I
Sbjct: 960 KQSPRQWYKRFDSFMLSHGFKRSEFDSCVYIKFVNGSPIYLLLYVDDMLIAAKSKEQITT 1019
Query: 994 LKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTP 1053
LK QLS EFDMKDLG AKKILGM+ITRD+ G+L LS YI +VLQRFNM DAK VSTP
Sbjct: 1020 LKKQLSSEFDMKDLGAAKKILGMEITRDRNSGLLFLSQQSYIKKVLQRFNMHDAKPVSTP 1079
Query: 1054 LASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPG 1113
+A HF+LS Q T+E+ E M+++PY+SA+GSLMYAMVC+ PD+ HA+ +VSR+M+NPG
Sbjct: 1080 IAPHFKLSALQCASTDEDVEYMSRVPYSSAVGSLMYAMVCSWPDLSHAMSLVSRYMANPG 1139
Query: 1114 KAHWEAVKWILRYLRGTTEKCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRS 1173
K HW+AV+WI RYLRGT + CL FG+ + + GYVD+DFA ++D RRS TGY+FT+G+ +
Sbjct: 1140 KEHWKAVQWIFRYLRGTADACLKFGRIDKGLVGYVDSDFAADLDKRRSLTGYVFTIGSCA 1199
Query: 1174 VSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLA 1233
VSW + +Q +VA STTE EY+A+ EA KE +WL+GL EL + L+ DSQSAI L
Sbjct: 1200 VSWKATLQPVVAQSTTEAEYMAIAEACKESVWLKGLFAELCGVDSCINLFCDSQSAICLT 1259
Query: 1234 KNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKLCSTLVG 1293
K+ FH RTKHI ++YH++R ++ L + KI NPADM+TK + + K +LCS+LVG
Sbjct: 1260 KDQMFHERTKHIDIKYHYVRDIVAQGKLKVCKISIHDNPADMMTKPIPVAKFELCSSLVG 1319
Query: 1294 LL 1295
++
Sbjct: 1320 IV 1321
>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
(japonica cultivar-group)]
Length = 1373
Score = 1007 bits (2604), Expect = 0.0
Identities = 566/1305 (43%), Positives = 784/1305 (59%), Gaps = 83/1305 (6%)
Query: 16 FGFWKMQIEDYLYQKKLHQPLTEKKPDSMKDDEWSL----LDRQALGVVRLSLSRNVAFN 71
F W++++ L Q + + + EW+ D++AL +++L L ++
Sbjct: 14 FSLWQVKMRGILAQTHDYDEALDNF--GKRRAEWTAEEIRKDQKALALIQLHLHNDILQE 71
Query: 72 IAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELNIVTTQLSS 131
EKT+A L L S+ ++K+ + +LFTL+M E SV H+ E + L S
Sbjct: 72 CLTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMAEFKKIVADLVS 131
Query: 132 VGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMK--FDDVRDLVLSEEIRRRELG 189
+ +++DDE L+LL SLP+S++ + S +K +D +++ + + + +
Sbjct: 132 MEVKYDDEDLGLLLLCSLPNSYANFRDTILLSRDELTLKEVYDALQNKEKMKIMVQNDGS 191
Query: 190 ESSSSSVLHTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKTGHFKNQC 249
SS LH R N T N + + RSKSK N K C C H ++C
Sbjct: 192 SSSKGEALHVRGRTENRT-SNEKNYDRRGRSKSK---PPGNKKF--CVYCKLKNHNIDEC 245
Query: 250 R-LPTKNQEEKDEANV------ASTSGGGDALI--CSLESKEESWVLDSGASFHASSQKE 300
+ + K ++ K + V AS GD L+ + + W+LDS SFH +++
Sbjct: 246 KKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGCVAGHDEWILDSACSFHICTKRN 305
Query: 301 FFKNYVPGNLGKVY-LGNEQSCKVVGKGEVKIKLN-GSVWELKNVRHIPNLTKNLISVGQ 358
+F +Y P G V +G++ C +VG G V+IK + G LKNVR+IP +++NLIS+
Sbjct: 306 WFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYIPGMSRNLISLST 365
Query: 359 LADEGYTTVFHGDDW--KISKGAMTIARGR-KSGTLYKTAGACHLI------AVATNENP 409
L EGY + G D K+SKG++ +G S LY G C L A TN+ P
Sbjct: 366 LDAEGYK--YSGSDGVLKVSKGSLVCLKGDVNSAKLYVLRG-CTLTGSDSAAAAITNDEP 422
Query: 410 ---NLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKK 466
NLWH RLGHMS GM + + L S +I CE CI GK KRV F TS T K
Sbjct: 423 SKTNLWHMRLGHMSHLGMTELMKRNLLKGCTSSKIKFCEHCIFGKHKRVQFNTSVHTTKG 482
Query: 467 EKLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVEN 526
L+ VH+D+WGP+ PS+GG Y +T IDD+SRKVW YFLKHK + F AFK WK M+E
Sbjct: 483 T-LDYVHADLWGPSKKPSLGGARYMLTIIDDYSRKVWPYFLKHKDDTFTAFKNWKVMIER 541
Query: 527 ETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARS 586
+T+ K+K LRTDNGGE+ F +C + GI T+P TPQ NGVAERMNRT+ RAR
Sbjct: 542 QTERKVKLLRTDNGGEFCSHAFNDYCRQEGIVRHHTIPHTPQQNGVAERMNRTIISRARC 601
Query: 587 LRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVH 646
+ + + K+FWAEA +T+ YLINR PS+PL K P EVWSG S L+VFGC AY H
Sbjct: 602 MLSHARMNKRFWAEAASTACYLINRSPSIPLNKKTPIEVWSGTPADYSQLKVFGCTAYAH 661
Query: 647 ISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKDKHNT-- 704
+ + KL+P++ KC+F+GYG GY+LW+ E K S+ V+FNE VM+ + +
Sbjct: 662 VDN---GKLEPRAVKCLFLGYGSGVKGYKLWNPETGKTFMSRSVVFNESVMFTNSLPSEH 718
Query: 705 ---------------TTNDSGLS-EPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTL 748
+D+G+ EPV+ E DD D A ++Q L
Sbjct: 719 VPEKELQRMHMQVEHVDDDTGVQVEPVH-EQDDHNNDVADDD------AHDDVQQTPPIL 771
Query: 749 VHPTPVPVL-RRSSRPHAPNRRYIDYMLLT-----------DGGEPEDYDEACQTTDASK 796
+P+ R+S R P +R I+ L+ + EP Y EA + D+
Sbjct: 772 QLEEELPIAQRKSKRTTKPPKRLIEECNLSYYALSCAEQVENVHEPATYKEAVRCGDSEN 831
Query: 797 WELAMKEEMKSLISNQTWELAKLPIGKKALHNKWVYRVKEDHDGSK--RYKARLVVKGFR 854
W AM EEM+SL N TWE+ LP KK + KW+++ KE S+ +YKARLV +G+
Sbjct: 832 WISAMHEEMQSLEKNSTWEVVPLPKKKKTISCKWIFKRKEGLSSSEPPKYKARLVARGYS 891
Query: 855 QKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGF 914
Q G+DY ++F+PVVK ++IR+ LSIVAS +L LEQLDVKTAFLHG+L E+IYM QPEGF
Sbjct: 892 QIPGVDYNDVFSPVVKHSSIRTFLSIVASHDLELEQLDVKTAFLHGELEEDIYMDQPEGF 951
Query: 915 LEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRYKSSYIIL 974
+ GKE VC LK+SLYGLKQ+PRQW +F+SFM F++ D C + K S I L
Sbjct: 952 IVPGKEKYVCKLKRSLYGLKQSPRQWNKRFDSFMLSHSFKRSKYDSCVYIKHVNGSPIYL 1011
Query: 975 LLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEY 1034
LLYVDDML+A + EI LK LS EFDMKDLG AKKILGM+I+RD++ G+L LS Y
Sbjct: 1012 LLYVDDMLIAAKSKIEITKLKKLLSSEFDMKDLGSAKKILGMEISRDRKSGLLFLSQHNY 1071
Query: 1035 INRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCT 1094
I +VLQRFNM +AK VSTP+A HF+LS Q P + E E M+++PY+SA+GSLMYAMVC+
Sbjct: 1072 IKKVLQRFNMQNAKAVSTPIAPHFKLSAAQCPSIDAEIEYMSRVPYSSAVGSLMYAMVCS 1131
Query: 1095 RPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTTEKCLYFGKGEIKVEGYVDADFAG 1154
RPD+ +A+ +VSR+MSNPGK HW AV+WI RYLRGTT CL FG+ + + GYVD+D+A
Sbjct: 1132 RPDLSYAMSLVSRYMSNPGKEHWRAVQWIFRYLRGTTYSCLKFGRTDKGLIGYVDSDYAA 1191
Query: 1155 EVDHRRSTTGYIFTVGTRSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELG 1214
++D RRS TGY+FT+G+ +VSW + +Q +VALSTTE EY+A+ EA KELIWL+GL EL
Sbjct: 1192 DLDRRRSLTGYVFTIGSCAVSWRATLQSVVALSTTEAEYMAICEACKELIWLKGLYAELS 1251
Query: 1215 FMQEKSALYSDSQSAIHLAKNSAFHSRTKHIGLRYHFIRSLLEDE 1259
++ +L+ DS+SAI+L K+ FH RTKHI ++YHF+R ++E++
Sbjct: 1252 GVESCISLHCDSESAIYLTKDQMFHERTKHIDIKYHFVRDVIEED 1296
>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1241
Score = 1004 bits (2595), Expect = 0.0
Identities = 558/1246 (44%), Positives = 770/1246 (61%), Gaps = 58/1246 (4%)
Query: 95 SNKVHLMRRLFTLRMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDSWS 154
++K+HL ++LF ++ + SV H++ + L S+ +++D+E LILL SLP S++
Sbjct: 6 TSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSLPSSYA 65
Query: 155 AIVTAVSSSSGSKKMKFDDVRD-LVLSEEIRRRELGESSSSSVLHTESRGRNSTRGNGRG 213
+ S + +K +V D L E++++ E S+S RGR + N
Sbjct: 66 NFRDTILYSRDTLTLK--EVYDALHAKEKMKKMVPSEGSNSQAEGLVVRGRQQEK-NTNN 122
Query: 214 KSKARRSKSKNHRSSHNSKSIECWNCGKTGHFKNQC------------RLPTKNQEEKDE 261
KS+ + S RS + C C + GH ++C +P +EE+ +
Sbjct: 123 KSRDKSSSIYRGRSKSRGRYKSCKYCKRDGHDISECWKLQDKDKRTRKYIPKGKKEEEGK 182
Query: 262 ANVASTSGGGDALICSLES---KEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLGNE 318
A V + L+ + + W+LD+ ++H +++F Y G V +G++
Sbjct: 183 AAVVTDEKSDAELLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEAVQGGTVLMGDD 242
Query: 319 QSCKVVGKGEVKIKL-NGSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDW--KI 375
C+V G G V+IK+ +G + L +VRHIPNL ++LIS+ L +GY + G D K+
Sbjct: 243 TPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYK--YSGGDGILKV 300
Query: 376 SKGAMTIARGR-KSGTLYKTAGACHLIAVAT-------NENPNLWHKRLGHMSEKGMKVM 427
+KG++ + + K LY G L VA ++ NLWH RLGHMSE G+ +
Sbjct: 301 TKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGLAEL 360
Query: 428 HSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWGPTTVPSIGG 487
+G L ++ CE CI GK KRV F TS T + L+ VHSD+WGP S GG
Sbjct: 361 SKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGI-LDYVHSDLWGPARKTSFGG 419
Query: 488 KHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTK 547
Y +T +DD+SRKVW YFLKHK + F+ FK WK MVE +T+ K+K LRTDNG E
Sbjct: 420 ARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKILRTDNGMELCSKI 479
Query: 548 FKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAY 607
FK +C GI TVP TPQ NGVAERMNRT+ +AR + + LPK+FWAEAV+T+ Y
Sbjct: 480 FKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCMLSNASLPKQFWAEAVSTACY 539
Query: 608 LINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGY 667
LINR PS ++ K P EVWSG S LRVFGC AY H+ + KL+P+ KCIF+GY
Sbjct: 540 LINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTAYAHVDN---GKLEPRVIKCIFLGY 596
Query: 668 GEDEFGYRLWDDENKKMVRSKDVIFNERVMYKDKHNTTTNDSGLSEPVYVEMDDVPGSPT 727
GY+LW E KK+V S++V+F+E +M DK +T E V V+++ + S
Sbjct: 597 LSGVKGYKLWCPETKKVVISRNVVFHESIMLHDKPSTNVPVES-QEKVSVQVEHLISS-- 653
Query: 728 DKSPQSGELA---ESSIRQPSDT-LVHPTPVPVLR--RSSRPHAPNRRYID------YML 775
+P+ ++A ++ + + SD+ +V +P + R R P RRYI+ Y L
Sbjct: 654 GHAPEKEDVAINQDAPVIEDSDSSIVQQSPKRSIAKDRPKRNTKPPRRYIEEANIVAYAL 713
Query: 776 -----LTDGGEPEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHNKW 830
+ EP Y +A + D ++W AM +EM+SL N +WEL KLP KK + KW
Sbjct: 714 SVAEEIEGNAEPSTYSDAIVSDDCNRWITAMHDEMESLEKNHSWELEKLPKEKKPIRCKW 773
Query: 831 VYRVKEDHDGSK--RYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYL 888
+++ KE S RYKARLV KG+ Q GID+ ++F+PVVK ++IR++LSIVA + L
Sbjct: 774 IFKRKEGMSPSDEARYKARLVAKGYSQIPGIDFNDVFSPVVKHSSIRTLLSIVAMHDYEL 833
Query: 889 EQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFM 948
EQ+DVKTAFLHG+L E+IYM QPEGF+ GKEN+VC LKKSLYGLKQ+PRQWY +F+SFM
Sbjct: 834 EQMDVKTAFLHGELEEDIYMEQPEGFVVPGKENLVCRLKKSLYGLKQSPRQWYKRFDSFM 893
Query: 949 HKEGFQKCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLG 1008
+ F++ N D C + K S I LLLYVDDML+A + EI LK QLS EF+MKDLG
Sbjct: 894 LSQKFRRSNYDSCVYLKVVDGSAIYLLLYVDDMLIAAKDKSEIAKLKAQLSSEFEMKDLG 953
Query: 1009 PAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQT 1068
AKKILGM+ITR++ G L LS YI +VL RFNM DAKLVST LA+HFRLS + PQ+
Sbjct: 954 AAKKILGMEITRERHSGKLYLSQKCYIEKVLHRFNMHDAKLVSTLLAAHFRLSSDLCPQS 1013
Query: 1069 EEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLR 1128
+ E M+++PY+SA+ SLMYAMVC+RPD+ HA+ VVSR+M+NPGK HW+AV+WI RYLR
Sbjct: 1014 AYDIEYMSRVPYSSAVSSLMYAMVCSRPDLSHALSVVSRYMANPGKEHWKAVQWIFRYLR 1073
Query: 1129 GTTEKCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIVALST 1188
GT+ CL FG+ + GYVD+DFAG++D RRS TGY+FTVG +VSW + +Q VALST
Sbjct: 1074 GTSSACLQFGRSSDGLVGYVDSDFAGDLDRRRSLTGYVFTVGGCAVSWKASLQATVALST 1133
Query: 1189 TEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAFHSRTKHIGLR 1248
TE EY+A++EA KE+IWL+GL TEL + ++ DSQSAI L K+ FH RTKHI LR
Sbjct: 1134 TEAEYMAISEACKEVIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQMFHERTKHIDLR 1193
Query: 1249 YHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKLCSTLVGL 1294
YHFIR ++ + + + KI NP DM+TK V K +LCS+LVG+
Sbjct: 1194 YHFIRGVIAEGDVKVCKISTHDNPVDMMTKPVPATKFELCSSLVGV 1239
>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301697|pir||B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1335
Score = 984 bits (2545), Expect = 0.0
Identities = 569/1319 (43%), Positives = 799/1319 (60%), Gaps = 79/1319 (5%)
Query: 34 QPLTEKK---PDSMKD---DEWSLLDR--QALGVVRLSLSRNVAFNIAKEKTTAGLMKAL 85
+PLTE++ P+ K DE + L+R +A V+ L+++ V I KT A + L
Sbjct: 19 KPLTEEEEEDPEKRKKRDADEVARLERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETL 78
Query: 86 SSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALIL 145
++ ++V+ +T +M E + ++I++ + L+ + I+ DEV+A++L
Sbjct: 79 DRLFMIRSLPHRVYTQLSFYTFKMQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILL 138
Query: 146 LSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLVLSEEIRRRELGESSSSSVLHTESRGRN 205
LSSLP + +V + S+ +K++ DDV +++ + REL +++ V +RGR
Sbjct: 139 LSSLPARYDGLVETMKYSNSREKLRLDDV---MVAARDKERELSQNNRPVVEGHFARGRP 195
Query: 206 STRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKTGHFKNQC-----RLPTKNQ-EEK 259
+ N +G RS+SK S + K + CW CGK GHFK QC R +K Q +
Sbjct: 196 DGKNNNQGNKGKNRSRSK----SADGKRV-CWICGKEGHFKKQCYKWIERNKSKQQGSDN 250
Query: 260 DEANVASTSGGGDALICSLESKE---------ESWVLDSGASFHASSQKEFFKNYVPGNL 310
E+++A ++ + + L + E WVLD+G SFH + +K++FK++ +
Sbjct: 251 GESSLAKSTEAFNPAMVLLATDETLVVTDSIANEWVLDTGCSFHMTPRKDWFKDFKELSS 310
Query: 311 GKVYLGNEQSCKVVGKGEVKIK-LNGSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFH 369
G V +GN+ V G G +KI+ +GS L +VR++PN+T+NLIS+G L D G
Sbjct: 311 GYVKMGNDTYSPVKGIGSIKIRNSDGSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQ 370
Query: 370 GDDWKISKGAMTIARGRKSGTLY-----KTAGACHLIAVATNENPNLWHKRLGHMSEKGM 424
KI KG TI +G+K TLY G H A +E LWH RLGHMS+KGM
Sbjct: 371 DGILKIVKGCSTILKGQKRDTLYILDGVTEEGESHSSAEVKDETA-LWHSRLGHMSQKGM 429
Query: 425 KVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWG-PTTVP 483
+++ KG L E++ CEDC+ GKQ RVSF + K EKL VHSD+WG P
Sbjct: 430 EILVKKGCLRREVIKELEFCEDCVYGKQHRVSFAPAQHVTK-EKLAYVHSDLWGSPHNPA 488
Query: 484 SIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEY 543
S+G YF++F+DD+SRKVW+YFL+ K E FE F WK MVEN++D K+KKLRTDNG EY
Sbjct: 489 SLGNSQYFISFVDDYSRKVWIYFLRKKDEAFEKFVEWKKMVENQSDRKVKKLRTDNGLEY 548
Query: 544 EDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVN 603
+ F+KFC E GI +T TPQ NG+AER+NRT+ ++ RS+ +SG+ KKFWAEA +
Sbjct: 549 CNHYFEKFCKEEGIVRHKTCAYTPQQNGIAERLNRTIMDKVRSMLSRSGMEKKFWAEAAS 608
Query: 604 TSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCI 663
T+ YLINR PS + +PEE W+G LS LR FGC+AY+H +DQG KL+P+SKK I
Sbjct: 609 TAVYLINRSPSTAINFDLPEEKWTGALPDLSSLRKFGCLAYIH-ADQG--KLNPRSKKGI 665
Query: 664 FIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKD---KHNTTTNDSGLSE-PVYVEM 719
F Y E GY++W E+KK V S++VIF E+VM+KD T ++S L + V +M
Sbjct: 666 FTSYPEGVKGYKVWVLEDKKCVISRNVIFREQVMFKDLKGDSQNTISESDLEDLRVNPDM 725
Query: 720 DD-------------------------VPGSPTDKSPQSGELAESSIRQPSDT--LVHPT 752
+D V SPT + E +S + T LV
Sbjct: 726 NDQEFTDQGGATQDNSNPSEATTSHNPVLNSPTHSQDEESEEEDSDAVEDLSTYQLVRDR 785
Query: 753 PVPVLRRSSRPHAPNRRYIDYMLLTDG-GEPEDYDEACQTTDASKWELAMKEEMKSLISN 811
++ + + + N Y DG EP+ Y EA D KW AMKEEM S+ N
Sbjct: 786 VRRTIKANPKYNESNMVGFAYYSEDDGKPEPKSYQEALLDPDWEKWNAAMKEEMVSMSKN 845
Query: 812 QTWELAKLPIGKKALHNKWVYRVKEDHDG--SKRYKARLVVKGFRQKEGIDYTEIFAPVV 869
TW+L P K + +WV+ K G + R+ ARLV KGF QKEG+DY EIF+PVV
Sbjct: 846 HTWDLVTKPEKVKLIGCRWVFTRKAGIPGVEAPRFIARLVAKGFTQKEGVDYNEIFSPVV 905
Query: 870 KLNTIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKS 929
K +IR +LS+V N+ L+Q+DVKTAFLHG L EEIYM QPEGF + N VC+LK+S
Sbjct: 906 KHVSIRYLLSMVVHYNMELQQMDVKTAFLHGFLEEEIYMAQPEGFEIKRGSNKVCLLKRS 965
Query: 930 LYGLKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRYK-SSYIILLLYVDDMLVAGSNI 988
LYGLKQ+PRQW ++F+ FM + + D C +FK+ +YI LLLYVDDML+A +N
Sbjct: 966 LYGLKQSPRQWNLRFDEFMRGIKYTRSAYDSCVYFKKCNGDTYIYLLLYVDDMLIASANK 1025
Query: 989 DEIKNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAK 1048
E+ LK LS+EF+MKDLG AKKILGM+I+RD+ G+L LS Y+ +VL+ F M +AK
Sbjct: 1026 SEVNELKQLLSREFEMKDLGDAKKILGMEISRDRDAGLLTLSQEGYVKKVLRSFQMDNAK 1085
Query: 1049 LVSTPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRF 1108
VSTPL HF+L + EE+ E M +PYA+ IGS+MY+M+ TRPD+ +++GV+SRF
Sbjct: 1086 PVSTPLGIHFKLKAATDKEYEEQFERMKIVPYANTIGSIMYSMIGTRPDLAYSLGVISRF 1145
Query: 1109 MSNPGKAHWEAVKWILRYLRGTTEKCLYFGKGE-IKVEGYVDADFAGEVDHRRSTTGYIF 1167
MS P K HW+AVKW+LRY+RGT +K L F K E + GY D+D+ D RRS TGY+F
Sbjct: 1146 MSKPLKDHWQAVKWVLRYMRGTEKKKLCFRKQEDFLLRGYCDSDYGSNFDTRRSITGYVF 1205
Query: 1168 TVGTRSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQ 1227
TVG ++SW S++QK+VA+S+TE EY+A+TEA KE +WL+G ELG Q+ ++SDSQ
Sbjct: 1206 TVGGNTISWKSKLQKVVAISSTEAEYMALTEAVKEALWLKGFAAELGHSQDYVEVHSDSQ 1265
Query: 1228 SAIHLAKNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLK 1286
SAI LAKNS H RTKHI +R HFIR ++ ++ ++KI NPA++ TK V + K +
Sbjct: 1266 SAITLAKNSVHHERTKHIDIRLHFIRDIICAGLIKVVKIATECNPANIFTKTVPLAKFE 1324
>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi|11278366|pir||T47492
copia-like polyprotein - Arabidopsis thaliana
Length = 1363
Score = 954 bits (2465), Expect = 0.0
Identities = 545/1366 (39%), Positives = 792/1366 (57%), Gaps = 104/1366 (7%)
Query: 6 VKIEKFDG-ADFGFWKMQIEDYLYQKKLHQPLTE------KKPDSMKDDE---------- 48
+++EKFDG D+ WK ++ ++ L L E K+ DS K DE
Sbjct: 6 IEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEEREKME 65
Query: 49 -WSLLDRQALGVVRLSLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTL 107
+ R+A + LS+S V I KE + A +++AL +Y N+++L ++L++
Sbjct: 66 AFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQKLYSF 125
Query: 108 RMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSK 167
+M+E +S+ +I+E + L ++ + DE +A++LL SLP + + + SSG
Sbjct: 126 KMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYSSGKT 185
Query: 168 KMKFDDVRDLVLSEEIRRRELGESSSSSV--LHTESRGRNSTRGNGRGKSKARRSKSKNH 225
+ D+V + S E+ + +S L+ + + N R + K K +RSKSK+
Sbjct: 186 VLSLDEVAAAIYSRELEFGSVKKSIKGQAEGLYVKDKAENRGRSEQKDKGKGKRSKSKSK 245
Query: 226 RSSHNSKSIECWNCGKTGHFKNQCRLPTKNQEEKDEANVASTSGG--------------- 270
R CW CG+ GH K+ C K Q + +N +SGG
Sbjct: 246 RG--------CWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKGNLVEGSVNFVESA 297
Query: 271 ----GDALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGK 326
+AL + E+ W++D+G +H + ++E+ +++ G V +GN+ +V G
Sbjct: 298 GMFVSEALSSTDIHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGSVRMGNKSISRVKGV 357
Query: 327 GEVKI-KLNGSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARG 385
G V+I NG L+NVR+IP++ +NL+S+G G+ +I G + G
Sbjct: 358 GTVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFEKAGHKFESENGMLRIKSGNQVLLEG 417
Query: 386 RKSGTLY----KTAGACHLIAVATNENPNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEI 441
R+ TLY K A L N++ LWH+RL HMS+K M ++ KG L + +
Sbjct: 418 RRYDTLYILHGKPATDESLAVARANDDTVLWHRRLCHMSQKNMSLLIKKGFLDKKKVSML 477
Query: 442 DICEDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWGPTTVP-SIGGKHYFVTFIDDHSR 500
D CEDCI G+ K++ F + KK KLE VHSD+WG TVP S+G YF++FIDD++R
Sbjct: 478 DTCEDCIYGRAKKIGFNLAQHDTKK-KLEYVHSDLWGAPTVPMSLGNCQYFISFIDDYTR 536
Query: 501 KVWVYFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRME 560
KVWVYFLK K E FE F W ++VEN++ ++K LRTDNG E+ + F FC E G +
Sbjct: 537 KVWVYFLKTKDEAFEKFVSWISLVENQSGERVKTLRTDNGLEFCNRMFDGFCEEKGFQRH 596
Query: 561 RTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHK 620
RT TPQ NGV ERMNRT+ E+ RS+ SGLPK+FWAEA +T+ LIN+ P + +
Sbjct: 597 RTCAYTPQQNGVVERMNRTIMEKVRSMLCDSGLPKRFWAEATHTAVLLINKTPCSAINFE 656
Query: 621 IPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDE 680
P++ WSGK S+LR +GCV +VH +D G KL+ ++KK + IGY GY++W E
Sbjct: 657 FPDKRWSGKAPIYSYLRRYGCVTFVH-TDGG--KLNLRAKKGVLIGYPSGVKGYKVWLIE 713
Query: 681 NKKMVRSKDVIFNERVMYKDKHNTTTNDSGLSEPVYVEMDDVPGSPTD-------KSPQS 733
KK V S++V F E +YKD E V E DD GS D +
Sbjct: 714 EKKCVVSRNVSFQENAVYKDLMQR-------KEQVSCEEDDHAGSYIDLDLEADKDNSSG 766
Query: 734 GELAESSIRQPS-----------------DTLVHPTPVP---VLRRSSRP-HAPNR---- 768
GE +++ + + +T VH +P+ V R R AP R
Sbjct: 767 GEQSQAQVTPATRGAVTSTPPRYETDDIEETDVHQSPLSYHLVRDRERREIRAPRRFDDE 826
Query: 769 -RYIDYMLLTDGG---EPEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKK 824
Y + + T+ G EP DY EA + + KW LAM EE++S + N TW P ++
Sbjct: 827 DYYAEALYTTEDGDAVEPADYKEAVRDENWDKWRLAMNEEIESQLKNDTWTTVTRPEKQR 886
Query: 825 ALHNKWVYRVKEDHDG--SKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVA 882
+ ++W+Y+ K+ G R+KARLV KG+ Q+EG+DY EIFAPVVK +IR +LSIVA
Sbjct: 887 IIGSRWIYKYKQGIPGVEEPRFKARLVAKGYAQREGVDYHEIFAPVVKHVSIRILLSIVA 946
Query: 883 SENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYM 942
ENL LEQLDVKTAFLHG+L E+IYM PEG KEN VC+L KSLYGLKQAPRQW
Sbjct: 947 QENLELEQLDVKTAFLHGELKEKIYMMPPEGCESLFKENEVCLLNKSLYGLKQAPRQWNE 1006
Query: 943 KFESFMHKEGFQKCNADHCCFFKRYK-SSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKE 1001
KF +M + GF++ + D C + K+ S + LL YVDDMLVA +N+ I LK +LS +
Sbjct: 1007 KFNHYMTEIGFKRSDYDSCAYTKKLSDDSTMYLLFYVDDMLVAANNMQAIDALKKELSIK 1066
Query: 1002 FDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLS 1061
F+MKDLG AKKILG++I D++ GVL LS Y+N+VL+ FNM ++K TPL +H ++
Sbjct: 1067 FEMKDLGAAKKILGIEIIIDREAGVLWLSQESYLNKVLKTFNMLESKPALTPLGAHLKMK 1126
Query: 1062 QEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVK 1121
+ E+E M +PY+SA+GS+MYAM+ TRPD+ + VGVVSRFMS P K HW VK
Sbjct: 1127 SATEEKLSTEEEYMNSVPYSSAVGSIMYAMIGTRPDLAYPVGVVSRFMSQPAKEHWLGVK 1186
Query: 1122 WILRYLRGTTE-KCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRI 1180
W+LRY++GT + + Y + + GY DAD+A ++D RRS TG +FT+G ++SW S +
Sbjct: 1187 WVLRYIKGTVDTRLCYKRNSDFSICGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGL 1246
Query: 1181 QKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAFHS 1240
Q++VA S+TE EY+++TEA KE IWL+GLL + G+ Q+ ++ DSQSAI L+KN+ H
Sbjct: 1247 QRVVAQSSTECEYMSLTEAVKEAIWLKGLLKDFGYEQKNVEIFCDSQSAIALSKNNVHHE 1306
Query: 1241 RTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLK 1286
RTKHI +++HFIR ++ D + + KI KNPAD+ TKV+ ++K +
Sbjct: 1307 RTKHIDVKFHFIREIIADGKVEVSKISTEKNPADIFTKVLPVNKFQ 1352
>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301696|pir||F84486 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1356
Score = 953 bits (2464), Expect = 0.0
Identities = 544/1361 (39%), Positives = 792/1361 (57%), Gaps = 89/1361 (6%)
Query: 6 VKIEKFDG-ADFGFWKMQIEDYLYQKKLHQPLTEKKP---------DSMKDDEWSLLDRQ 55
+++EKFDG D+ WK ++ ++ L+ L E + +S +D E L +
Sbjct: 6 IEVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEKFE 65
Query: 56 AL--------GVVRLSLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTL 107
AL + LS++ V I KE T A ++ AL +Y N+++ ++L++
Sbjct: 66 ALEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQKLYSF 125
Query: 108 RMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSK 167
+M+E +SV +I+E + T L ++ + DE +A++LL++LP ++ + + SSG
Sbjct: 126 KMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKYSSGKS 185
Query: 168 KMKFDDVRDLVLSEEIRRRELGESSSS-----SVLHTESRGRNSTRGNGRGKSKARRSKS 222
+ D+V + S+E+ ELG S L+ + + N +G +GK K ++ KS
Sbjct: 186 ILTLDEVAAAIYSKEL---ELGSVKKSIKVQAEGLYVKDKNENKGKGEQKGKGKGKKGKS 242
Query: 223 KNHRSSHNSKSIECWNCGKTGHFKNQCRLPTKNQEEKDEANVASTSGG------------ 270
K K CW CG+ GHF++ C K Q ++ + +SGG
Sbjct: 243 K--------KKPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAAGYYV 294
Query: 271 GDALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVK 330
+AL + E+ W+LD+G S+H + ++E+F + G V +GN+ +V G G ++
Sbjct: 295 SEALSSTEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGTIR 354
Query: 331 IK-LNGSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSG 389
+K +G L NVR+IP++ +NL+S+G GY +I G + GR+
Sbjct: 355 VKNSDGLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDGILRIKAGNQVLLTGRRYD 414
Query: 390 TLY----KTAGACHLIAVATNENPNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICE 445
TLY K + L V ++ LWH+RL HMS+K M+++ KG L + +D+CE
Sbjct: 415 TLYLLNWKPVASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDKKKVSSLDVCE 474
Query: 446 DCILGKQKRVSFQTSGRTPKKEKLELVHSDVWGPTTVP-SIGGKHYFVTFIDDHSRKVWV 504
DCI GK KR SF + K EKLE +HSD+WG VP S+G YF++ IDD +RKVWV
Sbjct: 475 DCIYGKAKRKSFSLAHHDTK-EKLEYIHSDLWGAPFVPLSLGKCQYFMSIIDDFTRKVWV 533
Query: 505 YFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVP 564
YF+K K E FE F W +VEN+TD ++K LRTDNG E+ + F FC GI RT
Sbjct: 534 YFMKTKDEAFEKFVEWVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCESIGIHRHRTCA 593
Query: 565 GTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEE 624
TPQ NGVAERMNRT+ E+ RS+ SGLPK+FWAEA +T+ LIN+ PS L +IP++
Sbjct: 594 YTPQQNGVAERMNRTIMEKVRSMLSDSGLPKRFWAEATHTTVLLINKTPSSALNFEIPDK 653
Query: 625 VWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKM 684
WSG S+LR +GCVA+VH D KL+P++KK + IGY GY++W + +K
Sbjct: 654 KWSGNPPVYSYLRRYGCVAFVHTDD---GKLEPRAKKGVLIGYPVGVKGYKVWILDERKC 710
Query: 685 VRSKDVIFNERVMYKD----KHNTTTND-----SGLSEPVYVEMDDVPGS--------PT 727
V S+++IF E +YKD + N +T + S L + E D + G P
Sbjct: 711 VVSRNIIFQENAVYKDLMQRQENVSTEEDDQTGSYLEFDLEAERDVISGGDQEMVNTIPA 770
Query: 728 DKSPQSGELAESSIRQPSDTLVHPTPVP---VLRRSSRP-HAPNR-----RYIDYMLLTD 778
+SP D+ V+ +P+ V R R AP R Y + + T+
Sbjct: 771 PESPVVSTPTTQDTNDDEDSDVNQSPLSYHLVRDRDKREIRAPRRFDDEDYYAEALYTTE 830
Query: 779 GG---EPEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHNKWVYRVK 835
G EPE+Y +A + KW+LAM EE+ S N TW + P ++ + +W+++ K
Sbjct: 831 DGEAVEPENYRKAKLDANFDKWKLAMDEEIDSQEKNNTWTIVTRPENQRIIGCRWIFKYK 890
Query: 836 EDHDG--SKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDV 893
G R+KARLV KG+ QKEGIDY EIFAPVVK +IR +LSIVA E+L LEQLDV
Sbjct: 891 LGILGVEEPRFKARLVAKGYAQKEGIDYHEIFAPVVKHVSIRVLLSIVAQEDLELEQLDV 950
Query: 894 KTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKEGF 953
KTAFLHG+L E+IYM PEG+ K N VC+L K+LYGLKQAP+QW KF++FM + F
Sbjct: 951 KTAFLHGELKEKIYMSPPEGYESMFKANEVCLLNKALYGLKQAPKQWNEKFDNFMKEICF 1010
Query: 954 QKCNADHCCFFKRY-KSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPAKK 1012
K D C + K S + LL+YVDD+LVA N + I LK L F+MKDLG AKK
Sbjct: 1011 VKSAYDSCAYTKVLPDGSVMYLLIYVDDILVASKNKEAITALKANLGMRFEMKDLGAAKK 1070
Query: 1013 ILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQTEEEK 1072
ILGM+I RD+ GVL LS Y+N++L+ +NM +AK TPL +HF+ + ++
Sbjct: 1071 ILGMEIIRDRTLGVLWLSQEGYLNKILETYNMAEAKPAMTPLGAHFKFQAATEQKLIRDE 1130
Query: 1073 ELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTTE 1132
+ M +PY+SA+GS+MYAM+ TRPD+ + VG++SRFMS P K HW VKW+LRY++GT +
Sbjct: 1131 DFMKSVPYSSAVGSIMYAMLGTRPDLAYPVGIISRFMSQPIKEHWLGVKWVLRYIKGTLK 1190
Query: 1133 KCLYFGK-GEIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIVALSTTEV 1191
L + K + GY DAD+A ++D RRS TG +FT+G ++SW S +Q++VA STTE
Sbjct: 1191 TRLCYKKSSSFSIVGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGLQRVVAQSTTES 1250
Query: 1192 EYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAFHSRTKHIGLRYHF 1251
EY+++TEA KE IWL+GLL + G+ Q+ ++ DSQSAI L+KN+ H RTKHI ++YHF
Sbjct: 1251 EYMSLTEAVKEAIWLKGLLKDFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVKYHF 1310
Query: 1252 IRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKLCSTLV 1292
IR ++ D + ++KI KNPAD+ TKV+ + K + L+
Sbjct: 1311 IREIISDGTVEVLKISTEKNPADIFTKVLAVSKFQAALNLL 1351
>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1243
Score = 953 bits (2464), Expect = 0.0
Identities = 535/1249 (42%), Positives = 752/1249 (59%), Gaps = 51/1249 (4%)
Query: 16 FGFWKMQIEDYLYQKKLHQPLT--EKKPDSMKDDEWSLLDRQALGVVRLSLSRNVAFNIA 73
F W++++ L Q+ L L+ +K+ +DE DR+A+ + L LS N+ +
Sbjct: 17 FSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDE-KKRDRKAISYIHLHLSNNILQEVL 75
Query: 74 KEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTLRMAEGMSVAQHINELNIVTTQLSSVG 133
KE+T AGL L + ++K+HL ++LF ++ + SV H++ + L S+
Sbjct: 76 KEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSAFKEIVADLESME 135
Query: 134 IEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSKKMKFDDVRDLV-LSEEIRRRELGESS 192
+++D++ LILL SLP S++ + S + +K +V D E++++ E S
Sbjct: 136 VKYDEDDLGLILLCSLPSSYANFRGTILYSRDTLTLK--EVYDAFHAKEKMKKMVTSEGS 193
Query: 193 SSSVLHTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKSIECWNCGKTGHFKNQC-RL 251
+S RGR + N + +S+ + S S R+ + C C + GH ++C +L
Sbjct: 194 NSQAEGLVVRGRQQKK-NTKNQSRDKSSSSYRGRTKSRGRYKSCKYCKRDGHDISECWKL 252
Query: 252 PTKNQEEKDEANVASTSGGGDALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLG 311
K++ G A + + + K ++ +L + A +S +++F Y G
Sbjct: 253 QDKDKRTGKYIPKGKKEEEGKAAVVT-DEKSDAELLVAYAGCAQTSDQDWFATYEALQGG 311
Query: 312 KVYLGNEQSCKVVGKGEVKIKL-NGSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHG 370
V +G++ C+V G G V+IK+ +G + L +V+HIPNL ++LIS+ +
Sbjct: 312 TVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNLKRSLISLYGIL---------- 361
Query: 371 DDWKISKGAMTIAR-GRKSGTLYKTAGACHLIAVAT-------NENPNLWHKRLGHMSEK 422
K++KG++ + + KS LY G L VA ++ NLWH RLGHMSE
Sbjct: 362 ---KVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAVFDSLSNSDATNLWHMRLGHMSEI 418
Query: 423 GMKVMHSKGKLPSLRSIEIDICEDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWGPTTV 482
G+ + +G L ++ CE CI GK KRV F TS T + L+ VHSD+WGP
Sbjct: 419 GLAELSKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNTSTHTTEGI-LDYVHSDLWGPAHK 477
Query: 483 PSIGGKHYFVTFIDDHSRKVWVYFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGE 542
S GG Y +T +DD+SRKVW YFLKHK + F+ FK WK MVE +T+ K+K LRTDNG E
Sbjct: 478 TSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEWKTMVERQTERKVKILRTDNGME 537
Query: 543 YEDTKFKKFCYEHGIRMERTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAV 602
+ FK +C GI T P TPQ N VAERMNRT+ +AR + +GLPK+FWAEAV
Sbjct: 538 FCSKIFKSYCKSEGIVCHYTAPHTPQQNDVAERMNRTIISKARCMLSNAGLPKQFWAEAV 597
Query: 603 NTSAYLINRGPSVPLEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKC 662
+T+ YLINR P ++ K P EVWSG S LRVFGC AY H+ + KL+P++ KC
Sbjct: 598 STACYLINRSPGYAIDKKTPIEVWSGSPTNYSDLRVFGCTAYAHVDN---GKLEPRAIKC 654
Query: 663 IFIGYGEDEFGYRLWDDENKKMVRSKDVIFNERVMYKDKHNTTTN-DSGLSEPVYVEMDD 721
IF+GY GY+LW E KK+V S++V+F+E V+ DK +T +S V VE
Sbjct: 655 IFLGYASGVKGYKLWCPETKKVVISRNVVFHESVILHDKPSTNVPVESQEKASVQVEHLI 714
Query: 722 VPGSPTDKSPQSGELAESSIRQPSDTLVHPTPVPVLRRSS--RPHAPNRRYID------Y 773
G +K + I ++VH +P + + R P RRYI+ Y
Sbjct: 715 SSGHAPEKEDVAINQDAPVIEDSDSSIVHQSPKRSIAKDKPKRNIKPPRRYIEEAKIVAY 774
Query: 774 ML-----LTDGGEPEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHN 828
L + EP Y EA + D ++W AM +EM+SL N TWEL KLP KK +
Sbjct: 775 ALSVAEKIEGNAEPSTYSEAIVSDDCNRWITAMHDEMESLEKNHTWELVKLPKEKKPIRC 834
Query: 829 KWVYRVKEDHDGSK--RYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENL 886
KW+++ KE S RYKARLV KG+ Q GID+ ++F+PVVK ++IR++L IVA +
Sbjct: 835 KWIFKRKEGMSPSDEARYKARLVAKGYSQIPGIDFNDVFSPVVKHSSIRTLLGIVAMHDY 894
Query: 887 YLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFES 946
LEQ++VKTAFLHG+L E+IYM QPEGF+ GKEN+VC LKKSLYGLKQ+PRQWY +F+S
Sbjct: 895 ELEQMNVKTAFLHGELEEDIYMEQPEGFVVPGKENLVCRLKKSLYGLKQSPRQWYKRFDS 954
Query: 947 FMHKEGFQKCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKD 1006
FM + F+ N D C + K S I LLLYVDDML+A + EI+ LK QLS EF+MKD
Sbjct: 955 FMLSQKFRISNYDSCVYLKVVDGSVIYLLLYVDDMLIAAKDKSEIEKLKAQLSSEFEMKD 1014
Query: 1007 LGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQSP 1066
LG AKKILGM+ITR++ G L LS YI +VL+RFNM DAK VSTPLA+HFRLS + P
Sbjct: 1015 LGAAKKILGMEITRERHSGKLYLSQKGYIEKVLRRFNMHDAKPVSTPLAAHFRLSSDLCP 1074
Query: 1067 QTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRY 1126
++ + E M+++PY+SA+GSLMYAMVC RPD+ HA+ VV+R+M+NPGK HW+AV+WI RY
Sbjct: 1075 LSDYDIEYMSRVPYSSAVGSLMYAMVCCRPDLSHALSVVNRYMANPGKEHWKAVQWIFRY 1134
Query: 1127 LRGTTEKCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIVAL 1186
LRGT+ CL F + + GYVD+DFAG++D RRS TGY+FT+G +VSW + +Q VAL
Sbjct: 1135 LRGTSSACLQFERSRDGLVGYVDSDFAGDLDRRRSITGYVFTIGGCAVSWKASLQATVAL 1194
Query: 1187 STTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKN 1235
STTE EY+A+ EA KE IWL+GL TEL + ++ DSQSAI+L K+
Sbjct: 1195 STTEAEYMAIFEACKEAIWLRGLYTELCGVTSCINIFCDSQSAIYLTKD 1243
>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301702|pir||E84601 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 949 bits (2453), Expect = 0.0
Identities = 545/1352 (40%), Positives = 799/1352 (58%), Gaps = 94/1352 (6%)
Query: 6 VKIEKFDG-ADFGFWKMQI---------------EDYLYQKKLHQPLTEK--KPDSMKDD 47
+++EKFDG D+ WK ++ ED L +K LTE+ K + ++ +
Sbjct: 6 IEVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLRRE 65
Query: 48 EWSLLDRQALGVVRLSLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHLMRRLFTL 107
R+A + LS++ V I KE++ A ++ L +Y N+++ ++L++
Sbjct: 66 LLEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLYSF 125
Query: 108 RMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSSSSGSK 167
+M+E +S+ +I+E + L + + DE +A++LL SLP + + + G
Sbjct: 126 KMSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLGRV 185
Query: 168 KMKFDDVRDLVLSEEIRRRELGESSSSSVLH---------TESRGRNSTRGNGRGKSKAR 218
+ D+V + S+E+ ELG + S TE+RGR RGN K+R
Sbjct: 186 TLSLDEVVAAIYSKEL---ELGSNKKSIKGQAEGLFVKEKTETRGRTEQRGNNNNNKKSR 242
Query: 219 RSKSKNHRSSHNSKSIECWNCGKTGHFKNQCRLPTKNQEEKDEANVASTSGGGDALICSL 278
SKS++ + CW CG++ + EAN S +AL +
Sbjct: 243 -SKSRSKKG--------CWICGESSN----------GSSNYSEANGLYVS---EALSSTD 280
Query: 279 ESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIKLN-GSV 337
E+ WV+D+G S+H + ++E+F++ G V +GN+ KV G G +++K G V
Sbjct: 281 IHLEDEWVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTVSKVRGIGTIRVKNEAGMV 340
Query: 338 WELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLY----K 393
L NVR+IP + +NL+S+G GY+ I G + R+ TLY +
Sbjct: 341 VRLTNVRYIPEMDRNLLSLGTFEKSGYSFKLENGTLSIIAGDSVLLTVRRCYTLYLLQWR 400
Query: 394 TAGACHLIAVATNENPNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDICEDCILGKQK 453
L V ++ LWH+RLGHMS+K M ++ KG L + +++ CEDCI GK K
Sbjct: 401 PVTEESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKKKVSKLETCEDCIYGKAK 460
Query: 454 RVSFQTSGRTPKKEKLELVHSDVWGPTTVP-SIGGKHYFVTFIDDHSRKVWVYFLKHKSE 512
R+ F + + +EKLE VHSD+WG +VP S+G YF++FIDD++RKV +YFLK K E
Sbjct: 461 RIGFNLA-QHDTREKLEYVHSDLWGAPSVPFSLGKCQYFISFIDDYTRKVRIYFLKTKDE 519
Query: 513 VFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGV 572
F+ F W +VEN+TD +IK LRTDNG E+ + F +FC + GI RT TPQ NGV
Sbjct: 520 AFDKFVEWANLVENQTDKRIKTLRTDNGLEFCNRSFDEFCSQKGILWHRTCAYTPQQNGV 579
Query: 573 AERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVK 632
AERMNRTL E+ RS+ SGLPKKFWAEA +T+A LIN+ PS L +++P++ WSGK
Sbjct: 580 AERMNRTLMEKVRSMLSDSGLPKKFWAEATHTTAILINKTPSSALNYEVPDKRWSGKSPI 639
Query: 633 LSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIF 692
S+LR FGC+A+VH D KL+P++KK I +GY GY++W E KK V S++VIF
Sbjct: 640 YSYLRRFGCIAFVHTDD---GKLNPRAKKGILVGYPIGVKGYKIWLLEEKKCVVSRNVIF 696
Query: 693 NERVMYKDKHNTTTNDSGLSE---PVYVEMD-----------DVP----GSPTDKSPQSG 734
E YKD + + +E Y+++D D P SP + SP +
Sbjct: 697 QENASYKDMMQSKDAEKDENEAPPSSYLDLDLDHEEVITSGGDDPIVEAQSPFNPSPATT 756
Query: 735 ELAESSIRQPSDTLVHPTPVPVLRRSSRP--HAPNR----RYIDYMLLT--DGG--EPED 784
+ + +D + P ++R R AP R Y+ L T D G EP D
Sbjct: 757 QTYSEGVNSETDIIQSPLSYQLVRDRDRRTIRAPVRFDDEDYLAEALYTTEDSGEIEPAD 816
Query: 785 YDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHNKWVYRVKEDHDGSK-- 842
Y EA ++ + +KW+LAM EEM+S I N TW + K P +K + ++W+Y+ K G +
Sbjct: 817 YSEAKRSMNWNKWKLAMNEEMESQIKNHTWTVVKRPQHQKVIGSRWIYKFKLGIPGVEEG 876
Query: 843 RYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKTAFLHGDL 902
R+KARLV KG+ Q++GIDY EIFAPVVK +IR ++SIVA E+L LEQLDVKTAFLHG+L
Sbjct: 877 RFKARLVAKGYAQRKGIDYHEIFAPVVKHVSIRILMSIVAQEDLELEQLDVKTAFLHGEL 936
Query: 903 VEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKEGFQKCNADHCC 962
E+IYM PEG+ E KE+ VC+L KSLYGLKQAP+QW KF ++M + GF + D C
Sbjct: 937 KEKIYMVPPEGYEEMFKEDEVCLLNKSLYGLKQAPKQWNEKFNAYMSEIGFIRSLYDSCA 996
Query: 963 FFKRYK-SSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPAKKILGMQITRD 1021
+ K S + LLLYVDDMLVA N ++I LK +LS+ FDMKDLG AK+ILGM+I R+
Sbjct: 997 YIKELSDGSRVYLLLYVDDMLVAAKNKEDISQLKEELSQRFDMKDLGAAKRILGMEIIRN 1056
Query: 1022 KQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQTEEEKELMAKIPYA 1081
+++ L LS Y+N++L+ +NM ++K V TPL +H ++ + E++++ M IPY+
Sbjct: 1057 REENTLWLSQNGYLNKILETYNMAESKHVVTPLGAHLKMRAATVEKQEQDEDYMKSIPYS 1116
Query: 1082 SAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTT-EKCLYFGKG 1140
SA+GS+MYAM+ TRPD+ + VG++SR+MS P + HW VKW+LRY++G+ K Y
Sbjct: 1117 SAVGSIMYAMIGTRPDLAYPVGIISRYMSQPAREHWLGVKWVLRYIKGSLGTKLQYKRSS 1176
Query: 1141 EIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIVALSTTEVEYVAVTEAS 1200
+ KV GY DAD A D RRS TG +FT+G ++SW S Q++VALSTTE EY+++TEA
Sbjct: 1177 DFKVVGYCDADHAACKDRRRSITGLVFTLGGSTISWKSGQQRVVALSTTEAEYMSLTEAV 1236
Query: 1201 KELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAFHSRTKHIGLRYHFIRSLLEDEV 1260
KE +W++GLL E G+ Q+ ++ DSQSAI L+KN+ H RTKHI +RY +IR ++ +
Sbjct: 1237 KEAVWMKGLLKEFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVRYQYIRDIIANGD 1296
Query: 1261 LTLIKIQGSKNPADMLTKVVTIDKLKLCSTLV 1292
++KI KNPAD+ TK+V ++K + TL+
Sbjct: 1297 GDVVKIDTEKNPADIFTKIVPVNKFQAALTLL 1328
>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 948 bits (2451), Expect = 0.0
Identities = 547/1362 (40%), Positives = 810/1362 (59%), Gaps = 113/1362 (8%)
Query: 4 GEVKIEKFDG-ADFGFWKMQIEDYLYQKKLHQPLTEKKPDSMKDDEWSLLDR-------- 54
G ++EKFDG D+ WK ++ ++ L + L E++ ++D + D
Sbjct: 4 GRAEVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETA 63
Query: 55 --------------QALGVVRLSLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNKVHL 100
+A + LSL NV + K+KT AG++K L ++ N+++L
Sbjct: 64 TSKLEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYL 123
Query: 101 MRRLFTLRMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAV 160
+RL+ +M+E M++ +++N+ + + L +V + DE +A++LL SLP + + +
Sbjct: 124 KQRLYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETL 183
Query: 161 SSSSGSKKMKFDDVRDLVLSEEIRRRELGES-----SSSSVLHTESRGRNSTRGNGRGKS 215
+ + +++ + S+ + ELG S ++S L + RGR+ TRG G K+
Sbjct: 184 KYCKTT--LHLEEITSAIRSKIL---ELGASGKLLKNNSDGLFVQDRGRSETRGKGPNKN 238
Query: 216 KARRSKSKNHRSSHNSKSIECWNCGKTGHFKNQC-----RLPTKNQEEKDEANV------ 264
K+R SKSK + CW CGK GHFK QC R + E+ EA+
Sbjct: 239 KSR-SKSKGAGKT-------CWICGKEGHFKKQCYVWKERNKQGSTSERGEASTVTARVT 290
Query: 265 -ASTSGGGDALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKV 323
A+ AL+ E ++W+LD+G SFH + +K++ ++ GKV +GN+ +V
Sbjct: 291 DAAALVVSRALLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEV 350
Query: 324 VGKGEVKIKL-NGSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKISKGAMTI 382
G G+V+IK +GS L +VR+IP ++KNLIS+G L D+G I K +T+
Sbjct: 351 KGIGDVRIKNEDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKGILTIFKNDLTV 410
Query: 383 ARGRKSGTLY-----KTAGACHLIAVATNENPNLWHKRLGHMSEKGMKVMHSKGKLPSLR 437
G+K TLY AG ++I +E +LWH RLGH+ KG++V+ SKG L +
Sbjct: 411 LTGKKESTLYFLQGTTLAGEANVIDKEKDET-SLWHSRLGHIGAKGLQVLVSKGHLD--K 467
Query: 438 SIEIDICEDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWGPTTVP-SIGGKHYFVTFID 496
+I I G K V+ K+KL+ VHSD+WG T VP SIG YF+TFID
Sbjct: 468 NIMISF------GAAKHVT---------KDKLDYVHSDLWGSTNVPFSIGKCQYFITFID 512
Query: 497 DHSRKVWVYFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHG 556
D +R+ W+YF++ K E F F WK +EN+ D K+K L TDNG E+ + +F FC + G
Sbjct: 513 DFTRRTWIYFIRTKDEAFSKFVEWKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKEG 572
Query: 557 IRMERTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVP 616
+ RT TPQ NGVAERMNRT+ + R + +SGL K+FWAEA +T+ +LIN+ PS
Sbjct: 573 VIRHRTCAYTPQQNGVAERMNRTIMNKVRCMLSESGLGKQFWAEAASTAVFLINKSPSSS 632
Query: 617 LEHKIPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRL 676
+E IPEE W+G L+ FG VAY+H SDQG KL+P++KK IF+GY + +++
Sbjct: 633 IEFDIPEEKWTGHPPDYKILKKFGSVAYIH-SDQG--KLNPRAKKGIFLGYPDGVKRFKV 689
Query: 677 WDDENKKMVRSKDVIFNERVMYKD--KHNTTTNDSGLSEP--VYVEMDDVPGSPTDKSP- 731
W E++K V S+D++F E MYK+ K++ + D L+E +E+ ++ ++S
Sbjct: 690 WLLEDRKCVVSRDIVFQENQMYKELQKNDMSEEDKQLTEVERTLIELKNLSADDENQSEG 749
Query: 732 ---QSGELAESSIRQPSDTLVHPTP---------VPVLRRSSRPHAPNRRYIDY------ 773
+ E A ++ D V T + R R +R+++
Sbjct: 750 GDNSNQEQASTTRSASKDKQVEETDSDDDCLENYLLARDRIRRQIRAPQRFVEEDDSLVG 809
Query: 774 --MLLTDGGE---PEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHN 828
+ +T+ GE PE Y+EA ++ + KW+ A EEM S+ N TW++ P GK+ +
Sbjct: 810 FALTMTEDGEVYEPETYEEAMRSPECEKWKQATIEEMDSMKKNDTWDVIDKPEGKRVIGC 869
Query: 829 KWVYRVKEDHDGSK--RYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENL 886
KW+++ K G + RYKARLV KGF Q+EGIDY EIF+PVVK +IR +LSIV ++
Sbjct: 870 KWIFKRKAGIPGVEPPRYKARLVAKGFSQREGIDYQEIFSPVVKHVSIRYLLSIVVQFDM 929
Query: 887 YLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFES 946
LEQLDVKTAFLHG+L E I M QPEG+ +E VC+LKKSLYGLKQ+PRQW +F+S
Sbjct: 930 ELEQLDVKTAFLHGNLDEYILMSQPEGYEDEDSTEKVCLLKKSLYGLKQSPRQWNQRFDS 989
Query: 947 FMHKEGFQKCNADHCCFFKRYKS-SYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMK 1005
FM G+Q+ + C + ++ SYI LLLYVDDML+A N D+I+ LK L++EF+MK
Sbjct: 990 FMINSGYQRSKYNPCVYTQQLNDGSYIYLLLYVDDMLIASQNKDQIQKLKESLNREFEMK 1049
Query: 1006 DLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQS 1065
DLGPA+KILGM+ITR++++G+L LS +EY+ VL+ F M +K+ TPL +HF+L
Sbjct: 1050 DLGPARKILGMEITRNREQGILDLSQSEYVAGVLRAFGMDQSKVSQTPLGAHFKLRAANE 1109
Query: 1066 PQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILR 1125
+ E M +PY +AIGS+MY+M+ +RPD+ + VGVVSRFMS P K HW+AVKW++R
Sbjct: 1110 KTLARDAEYMKLVPYPNAIGSIMYSMIGSRPDLAYPVGVVSRFMSKPSKEHWQAVKWVMR 1169
Query: 1126 YLRGTTEKCLYFGKGE-IKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIV 1184
Y++GT + CL F K + ++ GY D+D+A ++D RRS TG++FT G ++SW S +Q++V
Sbjct: 1170 YMKGTQDTCLRFKKDDKFEIRGYCDSDYATDLDRRRSITGFVFTAGGNTISWKSGLQRVV 1229
Query: 1185 ALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAFHSRTKH 1244
ALSTTE EY+A+ EA KE IWL+GL E+GF Q+ + DSQSAI L+KNS H RTKH
Sbjct: 1230 ALSTTEAEYMALAEAVKEAIWLRGLAAEMGFEQDAVEVMCDSQSAIALSKNSVHHERTKH 1289
Query: 1245 IGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLK 1286
I +RYHFIR + D + ++KI + NPAD+ TK V + KL+
Sbjct: 1290 IDVRYHFIREKIADGEIQVVKISTTWNPADIFTKTVPVSKLQ 1331
>gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
gi|25301707|pir||E86490 hypothetical protein F28L22.3 -
Arabidopsis thaliana
Length = 1356
Score = 940 bits (2429), Expect = 0.0
Identities = 534/1372 (38%), Positives = 808/1372 (57%), Gaps = 106/1372 (7%)
Query: 6 VKIEKFDG-ADFGFWKMQIEDYLYQKKLHQPLTE-------------------------- 38
V+I+ F+G DF WK++I+ L L LT+
Sbjct: 8 VEIKVFNGDRDFSLWKIRIQAQLGVLGLKDTLTDFSLTKTVPLTKSEAKQESGDGESSGT 67
Query: 39 -KKPDSMKDDEWSLLDRQALGVVRLSLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNK 97
+ PD +K ++ QA ++ +S V + TTA L L+ Y + N+
Sbjct: 68 KEVPDPVKIEQ----SEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNR 123
Query: 98 VHLMRRLFTLRMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIV 157
++ +L++ +M M++ Q+++E + +L S+ I+ D+EV+A+++L+SLP S +
Sbjct: 124 IYTQLKLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLK 183
Query: 158 TAVSSSSGSKKMKFDDVRDLVLSEEIRRRELGES-----SSSSVLHTESRGRNSTRGNGR 212
+ G+K + DV S E REL E+ ++VL+T RGR R N +
Sbjct: 184 HTLKY--GNKTLTVQDVTSSAKSLE---RELAEAVDLDKGQAAVLYTTERGRPLVRNNQK 238
Query: 213 GKSKARRSKSKNHRSSHNSKS-IECWNCGKTGHFKNQCRLPTKNQEEKDEANVASTSGG- 270
G RS+S NSK+ + CW C K GH K C K E + + +
Sbjct: 239 GGQGKGRSRS-------NSKTKVPCWYCKKEGHVKKDCYSRKKKMESEGQGEAGVITEKL 291
Query: 271 --GDALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGE 328
+AL + + ++ W+LDSG + H +S++++F ++ + LG++ S + G+G
Sbjct: 292 VFSEALSVNEQMVKDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHSVESQGQGT 351
Query: 329 VKIKLNGSVWE-LKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKIS--KGAMTIARG 385
++I +G + L+NV+++P+L +NLIS G L GY G + K+ K T RG
Sbjct: 352 IRIDTHGGTIKILENVKYVPHLRRNLISTGTLDKLGYR--HEGGEGKVRYFKNNKTALRG 409
Query: 386 RKSGTLYKTAGACHLIAVATNENPN----LWHKRLGHMSEKGMKVMHSKGKLPSLRSIEI 441
S LY G+ + + E LWH RLGHMS +KV+ KG + E+
Sbjct: 410 SLSNGLYVLDGSTVMSELCNAETDKVKTALWHSRLGHMSMNNLKVLAGKGLIDRKEINEL 469
Query: 442 DICEDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWG-PTTVPSIGGKHYFVTFIDDHSR 500
+ CE C++GK K+VSF G+ ++ L VH+D+WG P PSI GK YF++ IDD +R
Sbjct: 470 EFCEHCVMGKSKKVSFNV-GKHTSEDALSYVHADLWGSPNVTPSISGKQYFLSIIDDKTR 528
Query: 501 KVWVYFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRME 560
KVW+YFLK K E F+ F WK++VEN+ + K+K LRTDNG E+ +++F +C EHGI
Sbjct: 529 KVWLYFLKSKDETFDKFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERH 588
Query: 561 RTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHK 620
RT TPQ NGVAERMNRT+ E+ R L +SG+ + FWAEA T+AYLINR P+ + H
Sbjct: 589 RTCTYTPQQNGVAERMNRTIMEKVRCLLNKSGVEEVFWAEAAATAAYLINRSPASAINHN 648
Query: 621 IPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDE 680
+PEE+W ++ HLR FG +AYVH DQG KL P++ K F+GY GY++W E
Sbjct: 649 VPEEMWLNRKPGYKHLRKFGSIAYVH-QDQG--KLKPRALKGFFLGYPAGTKGYKVWLLE 705
Query: 681 NKKMVRSKDVIFNERVMYKD------------KHNTTTND---------SGLSEPVYVEM 719
+K V S++V+F E V+Y+D + TT+++ SG + ++
Sbjct: 706 EEKCVISRNVVFQESVVYRDLKVKEDDTDNLNQKETTSSEVEQNKFAEASGSGGVIQLQS 765
Query: 720 DDVPGSPTDKSPQSGELAESSIR-QPSDTLVHPTPVPVLR-RSSRPHAPNRRYIDYMLLT 777
D P + ++S S E E S + Q + T + R R R P R+ + +T
Sbjct: 766 DSEPITEGEQSSDSEEEVEYSEKTQETPKRTGLTTYKLARDRVRRNINPPTRFTEESSVT 825
Query: 778 DG---------GEPEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHN 828
EP+ Y EA ++ D KW++A +EM SL+ N TW+L P +K +
Sbjct: 826 FALVVVENCIVQEPQSYQEAMESQDCEKWDMATHDEMDSLMKNGTWDLVDKPKDRKIIGC 885
Query: 829 KWVYRVKEDHDGSK--RYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENL 886
+W++++K G + R+KARLV KG+ Q+EG+DY EIFAPVVK +IR ++S+V ++L
Sbjct: 886 RWLFKLKSGIPGVEPTRFKARLVAKGYTQREGVDYQEIFAPVVKHVSIRILMSLVVDKDL 945
Query: 887 YLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFES 946
LEQ+DVKT FLHGDL EE+YM QPEGF+ + EN VC LKKSLYGLKQ+PRQW +F+
Sbjct: 946 ELEQMDVKTTFLHGDLEEELYMEQPEGFVSDSSENKVCRLKKSLYGLKQSPRQWNKRFDR 1005
Query: 947 FMHKEGFQKCNADHCCFFKRY-KSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMK 1005
FM + F + D C + K + +I LLLYVDDML+AG++ EI +K QLS EF+MK
Sbjct: 1006 FMSSQQFIRSEHDACVYVKHVSEHDFIYLLLYVDDMLIAGASKAEINRVKEQLSTEFEMK 1065
Query: 1006 DLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQS 1065
D+G A +ILG+ I RD++ GVL+LS YI +VL RFNM AK+ + P+ +HF+L+ +
Sbjct: 1066 DMGGASRILGIDIYRDRKGGVLKLSQEIYIRKVLDRFNMSGAKMTNAPVGAHFKLA---A 1122
Query: 1066 PQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILR 1125
+ E+E +PY+SA+GS+MYAM+ TRPD+ +A+ ++SR+MS PG HWEAVKW++R
Sbjct: 1123 VREEDECVDTDVVPYSSAVGSIMYAMLGTRPDLAYAICLISRYMSKPGSMHWEAVKWVMR 1182
Query: 1126 YLRGTTEKCLYFGK-GEIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIV 1184
YL+G + L F K + V GY D+++A ++D RRS +GY+FT+G +VSW + +Q +V
Sbjct: 1183 YLKGAQDLNLVFTKEKDFTVTGYCDSNYAADLDRRRSISGYVFTIGGNTVSWKASLQPVV 1242
Query: 1185 ALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAFHSRTKH 1244
A+STTE EY+A+ EA+KE +W++GLL ++G Q+K ++ DSQSAI L+KNS +H RTKH
Sbjct: 1243 AMSTTEAEYIALAEAAKEAMWIKGLLQDMGMQQDKVKIWCDSQSAICLSKNSVYHERTKH 1302
Query: 1245 IGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKLCSTLVGLLE 1296
I +R+++IR ++E + ++KI S+NP D LTK + ++K K ++ L++
Sbjct: 1303 IDVRFNYIRDVVESGDVDVLKIHTSRNPVDALTKCIPVNKFKSALGVLKLMK 1354
>ref|XP_476137.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475101|gb|AAT44170.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|46576026|gb|AAT01387.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1175
Score = 913 bits (2360), Expect = 0.0
Identities = 522/1227 (42%), Positives = 712/1227 (57%), Gaps = 88/1227 (7%)
Query: 103 RLFTLRMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIVTAVSS 162
+LF+ ++ E S+ HI+ + L S+ ++FDDE L+LL SLP S++ +
Sbjct: 2 KLFSHKLQESGSILNHISVFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRDTILL 61
Query: 163 SSGSKKMKFDDVRDLVLSEEIRRRELGESSSSSVLHTESRGRNSTR----GNGRGKSKAR 218
S + + L E+++ ++SSS + RGR+ R N R K+++R
Sbjct: 62 SRSELTLA-EVYEALQNREKMKGMVQSDASSSKGEALQVRGRSEQRTYNDSNDRDKNQSR 120
Query: 219 -RSKSKNHRSSHNSKSIECWNCGKTGHFKNQC-RLPTKNQEEKD-EANV---ASTSGGGD 272
RSKS+ + C C K HF +C +L K + + D +A+V A S GD
Sbjct: 121 GRSKSRGKKF--------CKYCKKKNHFIEECWKLQNKEKRKSDGKASVVTSADNSDSGD 172
Query: 273 ALICSLE--SKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVY-LGNEQSCKVVGKGEV 329
L+ + S + W+LD+ SFH +++F +Y G V +G++ ++VG G V
Sbjct: 173 CLVVFVVCVSSHDEWILDTTCSFHICINRDWFSSYKSVQNGDVVRMGDDNPREIVGIGSV 232
Query: 330 KIKLN-GSVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARG-RK 387
+IK + G LK+VRHIP + +NLIS+ L EGY G K+SKG++ G
Sbjct: 233 QIKTHDGMTRTLKDVRHIPRMARNLISLSTLDAEGYKYSGSGGVVKVSKGSLVYMIGDMN 292
Query: 388 SGTLYKTAGACH----LIAVATNENP---NLWHKRLGHMSEKGMKVMHSKGKLPSLRSIE 440
S LY G+ AV + + P N+WH RLGHMSE GM + + L
Sbjct: 293 SANLYVLRGSTLHGYVTAAVVSKDEPSKTNMWHMRLGHMSELGMAELMKRNLLDGCTQGN 352
Query: 441 IDICEDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSR 500
+ CE C+ GK KRV F TS K L+ VH+D+WGP+ PS+GG Y +T IDD+SR
Sbjct: 353 MKFCEHCVFGKHKRVKFNTSVHRTKGI-LDYVHADLWGPSRKPSLGGARYMLTIIDDYSR 411
Query: 501 KVWVYFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRME 560
KVW YFLKHK + F AFK WK M++ +T+ ++K LRTDNGG + F +C + GI M
Sbjct: 412 KVWPYFLKHKDDTFAAFKEWKVMIKRQTEKEVKVLRTDNGGGFCSDAFDDYCRKEGIVMH 471
Query: 561 RTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHK 620
T+P TPQ NGVAERMNRT+ +AR + + + K+FWAEA T+ YLINR PS+ L K
Sbjct: 472 HTIPYTPQQNGVAERMNRTIISKARCMLSNARMNKRFWAEAAKTACYLINRSPSISLNKK 531
Query: 621 IPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDE 680
P EVWSG S LRVFGC AY H+++ KL+P++ KC+F+GYG GY+LW+ E
Sbjct: 532 TPIEVWSGMPANYSQLRVFGCTAYAHVNN---GKLEPRAIKCLFLGYGSGVKGYKLWNPE 588
Query: 681 NKKMVRSKDVIFNERVMYKDKHNTTTNDSGLSEPVYVEMDDVPGSPTDKSPQSGELAESS 740
K S+ V+FNE VM+ D T G E V K+ G +
Sbjct: 589 TNKTFMSRSVVFNESVMFNDSLPTDVIPGGFDEEQQYVSVQVEHVDDQKTEIVGNDVNDT 648
Query: 741 IRQPSDTLVHPTPVPVL-RRSSRPHAPNRRYID-----YMLLTDGGEPED------YDEA 788
+ Q S ++ P P+ RR+ R R I+ Y + E+ Y E
Sbjct: 649 V-QHSPPVLQPQDEPIAHRRTKRSCGAPVRLIEECDMVYYAFICAEQVENTLELATYTEV 707
Query: 789 CQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHNKWVYRVKEDHDGSKRYKARL 848
+ D KW AM+EEM+SL N TWEL LP KK +H
Sbjct: 708 VVSGDREKWISAMQEEMQSLEKNDTWELVHLPKQKKPVH--------------------- 746
Query: 849 VVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYM 908
VK ++IR+ SIVA +L LEQLDVKTAFLHG+L EEIYM
Sbjct: 747 -------------------FVKHSSIRTFFSIVAMHDLELEQLDVKTAFLHGELEEEIYM 787
Query: 909 HQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRYK 968
QPEGF+ GKE+ VC LK+SLYGLKQ+PRQWY +F+SFM GF++ D C + K
Sbjct: 788 DQPEGFIVPGKEDYVCKLKRSLYGLKQSPRQWYKRFDSFMLSHGFKRSEFDSCVYIKFVN 847
Query: 969 SSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQ 1028
S I LLLYVDDML+A + ++I LK QLS EFDMKDL AKKILGM+ITRD+ G L
Sbjct: 848 VSPIYLLLYVDDMLIAAKSKEQITTLKKQLSSEFDMKDLDAAKKILGMEITRDRNSGWLF 907
Query: 1029 LS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLM 1088
LS YI +VLQRFNM D K VST +A HF+LS Q T+E+ E M+++PY+S +GSLM
Sbjct: 908 LSQQSYIKKVLQRFNMHDTKPVSTHIAPHFKLSALQCASTDEDVEYMSRVPYSSVVGSLM 967
Query: 1089 YAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTTEKCLYFGKGEIKVEGYV 1148
YAMVC+R D+ HA+ +VSR+M+NPGK HW+A++WI RYLR T CL FG+ + GYV
Sbjct: 968 YAMVCSRLDLSHAMSLVSRYMANPGKEHWKAIQWIFRYLRDTANACLKFGRTNKGLIGYV 1027
Query: 1149 DADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQG 1208
D+DFA ++D RRS TGY+FT+G+ +VSW + ++ +VA STTE EY+A+ EA KE +WL+G
Sbjct: 1028 DSDFAADLDKRRSLTGYVFTIGSCAVSWKATLRHVVAQSTTEAEYMAIAEACKESVWLKG 1087
Query: 1209 LLTELGFMQEKSALYSDSQSAIHLAKNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQG 1268
L EL + L+ DSQSAI L K+ FH RTKHI ++YH++R ++ L + KI
Sbjct: 1088 LFAELCGVYSCINLFCDSQSAICLTKDQMFHERTKHIDIKYHYVRDVVAQGKLKVCKIST 1147
Query: 1269 SKNPADMLTKVVTIDKLKLCSTLVGLL 1295
NP DM+TK V + K +LCS+LVG++
Sbjct: 1148 HDNPVDMMTKHVPVAKFELCSSLVGIV 1174
>emb|CAB79135.1| putative transposable element [Arabidopsis thaliana]
gi|3402755|emb|CAA20201.1| putative transposable element
[Arabidopsis thaliana] gi|7444415|pir||T05178
hypothetical protein T6K22.90 - Arabidopsis thaliana
Length = 1308
Score = 868 bits (2242), Expect = 0.0
Identities = 508/1354 (37%), Positives = 769/1354 (56%), Gaps = 133/1354 (9%)
Query: 5 EVKIEKFDG-ADFGFWKMQIE-------------DYLYQKKLHQPLTEKKPDSMKDDEWS 50
+V+I+ F+G DF WK++IE D+ K + +EKK +DDE
Sbjct: 6 KVEIKTFNGDRDFSLWKIRIEAQLGVLGLKPALSDFTLTKTILVVKSEKKESESEDDETD 65
Query: 51 LLDR-------------QALGVVRLSLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSNK 97
QA + ++ V + T A L L+ ++ + N+
Sbjct: 66 SKKTEEVPDPIKFEQSDQAKNFIINHITDTVLLKVQHCVTAAELWATLNKLFMETSLPNR 125
Query: 98 VHLMRRLFTLRMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAIV 157
++ RL++ +M + +S+ Q+ +E + +L S+ I+ +EV+A+++L+SLP S+ +
Sbjct: 126 IYTQLRLYSFKMVDNLSIDQNTDEFLRIVAELGSLQIQVGEEVQAILILNSLPPSYIQLK 185
Query: 158 TAVSSSSGSKKMKFDDVRDLVLSEEIRRRELGESS------SSSVLHTESRGRNSTRGN- 210
+ G+K + V+D+V S + REL E +S+ L+T RGR T+
Sbjct: 186 HTLKY--GNKTLS---VQDVVSSAKSLERELSEQKETIRAPASTALYTAERGRPQTKNTQ 240
Query: 211 GRGKSKARRSKSKNHRSSHNSKS-IECWNCGKTGHFKNQCRLPTKNQEEKDEANVASTSG 269
G+GK + R NSKS + CW C K GH K C + E + + +
Sbjct: 241 GQGKGRGRS----------NSKSRLTCWFCKKEGHVKKDCYAGKRKLENEGQGKAGVITE 290
Query: 270 G---GDALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGK 326
+AL + ++ WV+DSG ++H +S+ ++F + + LG++ + + G
Sbjct: 291 KLVYSEALSMYDQEAKDKWVIDSGCTYHMTSRMDWFSEFNENETTMILLGDDHTVESKGS 350
Query: 327 GEVKIKLNG-SVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKIS--KGAMTIA 383
G VK+ +G S+ LKNVR +PNL +NLIS G L GY G D K+ K T
Sbjct: 351 GTVKVNTHGGSIRVLKNVRFVPNLRRNLISTGTLDKLGYK--HEGGDGKVRFYKENKTAL 408
Query: 384 RGRKSGTLYKTAGACHLIA------VATNENPNLWHKRLGHMSEKGMKVMHSKGKLPSLR 437
G LY G H + +NE LWH RLGHMS MK++ KG L
Sbjct: 409 CGNLVNGLYVLDG--HTVVNENCNVEGSNEKTELWHCRLGHMSLNNMKILAEKGLLEKKD 466
Query: 438 SIEIDICEDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWGPTTVPSIGGKHYFVTFIDD 497
E+ CE+C++GK K++SF G+ E L +H+D+WG K YF++ IDD
Sbjct: 467 IKELSFCENCVMGKSKKLSFNV-GKHITDEVLGYIHADLWG---------KQYFLSIIDD 516
Query: 498 HSRKVWVYFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGI 557
SRKVW+ FLK K E FE F WK +VEN+ + K+K LRTDNG E+ + KF +FC ++GI
Sbjct: 517 KSRKVWLMFLKTKDETFERFCEWKELVENQVNKKVKILRTDNGLEFCNLKFDEFCKQNGI 576
Query: 558 RMERTVPGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPL 617
RT TPQ NGVA+RMNRTL E+ R L +SGL + FWAEA T+AYL+NR P+ +
Sbjct: 577 ERHRTCTYTPQQNGVAKRMNRTLMEKVRCLLNESGLEEVFWAEAAATAAYLVNRSPASAV 636
Query: 618 EHKIPEEVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLW 677
+H +PEE+W K+ HLR FGC+AYVH+ DQG KL P++ K +F+GY + GY++W
Sbjct: 637 DHNVPEELWLDKKPGYKHLRRFGCIAYVHL-DQG--KLKPRALKGVFLGYPQGTKGYKVW 693
Query: 678 DDENKKMVRSKDVIFNERVMYKD-KHNTTTNDSGLSEPVYVEMDDVPGSPTDKSPQSGEL 736
+ +K V S++++FNE +YKD + ++ + +S+ V + ++G +
Sbjct: 694 LLDEEKCVISRNIVFNENQVYKDIRESSEQSVKDISDLEGYNEFQVSVKEHGECSKTGGV 753
Query: 737 AESSIRQPSDTLVHPTPVPVLR------------RSSRPHAPNRRYIDYMLLT------- 777
I Q SD+ T P++ R R P ++ DY
Sbjct: 754 TIEEIDQESDSENSVTQEPLIASIDLSNYQSARDRERRAPNPPQKLADYTHFALALVMAE 813
Query: 778 --DGGEPEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHNKWVYRVK 835
+ EP+ Y +A + KW MKEE+ SL+ N TW++ + P +K + +W++++K
Sbjct: 814 EIESEEPQCYHDAKKDKHWIKWNGGMKEEIDSLLKNGTWDIVEWPKEQKVISCRWLFKLK 873
Query: 836 EDHDG--SKRYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLDV 893
G ++RYKARLV +GF Q++GIDY E+FAPVVK +IR ++S V +++ LEQ+DV
Sbjct: 874 PGIPGVEAQRYKARLVARGFTQQKGIDYEEVFAPVVKHISIRILMSAVVKDDMELEQMDV 933
Query: 894 KTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKEGF 953
KT FLHG+L + +YM QPEGF +++ VC+LKKSLYGLKQAPRQW KF +FM F
Sbjct: 934 KTTFLHGELDQVLYMEQPEGFEVNPEKDQVCLLKKSLYGLKQAPRQWNKKFHAFMLSLQF 993
Query: 954 QKCNADHCCFFKRYK-SSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPAKK 1012
+ D C + K ++ LLLYVDDML+A + EI LK LS +F+MKD+G A +
Sbjct: 994 ARSEHDSCVYVKEVNPGEFVYLLLYVDDMLLAAKSKSEISKLKEALSLKFEMKDMGAASR 1053
Query: 1013 ILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQTEEEK 1072
ILG+ I R++++G L+LS Y+++V+QRF M DAK+VSTP+ +HF+L+ +
Sbjct: 1054 ILGIDIIRNRKEGTLRLSQTRYVDKVIQRFRMADAKVVSTPMGAHFKLTSLIDEIGSVDP 1113
Query: 1073 ELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTTE 1132
E+ +PY+SA+GS+MYAM+ T PD+ +A+G+VSRFMS PG
Sbjct: 1114 EV---VPYSSAVGSVMYAMIGTIPDVAYAMGLVSRFMSRPG------------------- 1151
Query: 1133 KCLYFGKGEIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIVALSTTEVE 1192
++V+GY D+D A ++D RRS +GY+FTVG +VSW S +Q +VALS+T+ E
Sbjct: 1152 -------ANLEVQGYCDSDHAADLDKRRSISGYVFTVGGNTVSWKSSLQHVVALSSTQAE 1204
Query: 1193 YVAVTEASKELIWLQGLLTELGFMQEKSALYSDSQSAIHLAKNSAFHSRTKHIGLRYHFI 1252
++A+TEA KE IW++GLL ++G + + ++ DSQSAI L+KN+AFH RTKH+ ++++FI
Sbjct: 1205 FIALTEAVKEAIWIRGLLEDMGLQPKPATVWCDSQSAICLSKNNAFHDRTKHVEVKFYFI 1264
Query: 1253 RSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLK 1286
R ++E + + KI S NPADMLTK + + K +
Sbjct: 1265 RDIIEAGEVKVRKIHTSVNPADMLTKCIPVKKFE 1298
>emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir||S05465
retrovirus-related polyprotein - Arabidopsis thaliana
retrotransposon Ta1-3
Length = 1291
Score = 867 bits (2239), Expect = 0.0
Identities = 490/1236 (39%), Positives = 726/1236 (58%), Gaps = 65/1236 (5%)
Query: 37 TEKKPDSMKDDEWSLLDRQALGVVRLSLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSSN 96
T+ PD +K ++ A+ ++ + V I K+ A + + L+ Y + N
Sbjct: 75 TKIVPDLVKIEK----SENAMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQYMETSLPN 130
Query: 97 KVHLMRRLFTLRMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDSWSAI 156
++++ + ++ +M + S+ +++NE + +LSS+ I +EVRA++ L+ L +S +
Sbjct: 131 RIYVQLKFYSFKMNDTKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQL 190
Query: 157 VTAVSSSSGSKKMKFDDVRDLV--LSEEIRRRELGESSSSSVLHTESRGRNSTRGNGRGK 214
+ G+K + DV L E+ ++ + ++S+VL+T R R TR K
Sbjct: 191 KHTLKY--GNKALSLKDVISAARSLERELNEQKETDKNTSTVLYTNERSRPQTRNQNHNK 248
Query: 215 SKARRSKSKNHRSSHNSKSIECWNCGKTGHFKNQ--CRLPTKNQEEKDEANVASTS-GGG 271
R +SK S+++ + CW C K GH K R E EA V +
Sbjct: 249 GGQGRGRSK----SNSNAKLTCWYCKKEGHVKKDYFARKRKLESENPGEAGVITEKLVFS 304
Query: 272 DALICSLESKEESWVLDSGASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKI 331
+AL + + + WVLDSG + H S+++++F ++ + LG++ S K G+G +KI
Sbjct: 305 EALSVNDLAVRDIWVLDSGCTSHMSARRDWFCSFREDGGPTILLGDDHSVKSQGQGSIKI 364
Query: 332 KLNG-SVWELKNVRHIPNLTKNLISVGQLADEGYTTVFHGDDWKIS--KGAMTIARGRKS 388
+ +G ++ L+NV+++P L +NLIS G L GY G D K+ K T RG
Sbjct: 365 ETHGGTIIGLENVKYVPELRRNLISTGTLDKRGYK--HEGGDGKVRYFKNQKTALRGELV 422
Query: 389 GTLYKTAGACHL----IAVATNENPNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIEIDIC 444
LY G L +A + LWH RLGH+ MKV+ KG + +D C
Sbjct: 423 NGLYILDGNTVLSETCVAEGSKGKTELWHSRLGHIGLNNMKVLAGKGLVSKEEIRVLDFC 482
Query: 445 EDCILGKQKRVSFQTSGRTPKKEKLELVHSDVWGPTTV-PSIGGKHYFVTFIDDHSRKVW 503
E+C++GK K+VSF G+ ++ L VH+D+WG T V PS+ G YF++ IDD +RKVW
Sbjct: 483 ENCVMGKAKKVSFNV-GKHNSEDVLRYVHADLWGSTNVTPSLSGNKYFLSIIDDKTRKVW 541
Query: 504 VYFLKHKSEVFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTV 563
+YFL+ K E F+ F WK +VEN+ + K+K LRTDNG E+ + KF +C EHGI +T
Sbjct: 542 LYFLRSKDETFDRFCEWKELVENQQNKKVKCLRTDNGLEFCNLKFDAYCKEHGIERHKTC 601
Query: 564 PGTPQHNGVAERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPE 623
TPQ NGVAERMNRT+ E+ R + +SGL ++FWAEA T+AYLINR P+ ++H +PE
Sbjct: 602 TYTPQQNGVAERMNRTIMEKVRCMLNESGLGEEFWAEAAATAAYLINRSPASAIDHNVPE 661
Query: 624 EVWSGKEVKLSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKK 683
E+W K+ HLR FG +AYVHI DQG KL P++ K IFIGY GY++W E K
Sbjct: 662 ELWLNKKPGYKHLRRFGSIAYVHI-DQG--KLKPRALKGIFIGYPAGTKGYKIWLLEEHK 718
Query: 684 MVRSKDVIFNERVMYKDKHNTTTNDSGLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQ 743
V S++V+F+E +YKD +EP + T + SGE+ + S +
Sbjct: 719 CVISRNVLFHEESVYKDTMKKERVVESEAEPASHSKSTLIKVKTPGNLNSGEVIQVSDEE 778
Query: 744 PSDTLVHPTPVPVLR--------------------RSSRPHAPNRRY-----IDYMLLT- 777
SD V P + R R P R+ + + L+T
Sbjct: 779 ESDESVEEEQEPETQVELPETQTTSSLANYQLARDRERRQIHPPARFTEESGVAFALVTV 838
Query: 778 ---DGGEPEDYDEACQTTDASKWELAMKEEMKSLISNQTWELAKLPIGKKALHNKWVYRV 834
EP+ Y EA + KW+LA EEM SLI N TW L P +K + +W++++
Sbjct: 839 ETLSMEEPQSYQEATSDKEWKKWKLATHEEMDSLIKNGTWVLVDKPQNRKIIGCRWLFKL 898
Query: 835 KEDHDGSK--RYKARLVVKGFRQKEGIDYTEIFAPVVKLNTIRSVLSIVASENLYLEQLD 892
K G + RYKA+LV KG+ +EG+DY EIFA VVK +IR ++S+V ++L LEQ+D
Sbjct: 899 KSGSPGVEPVRYKAQLVAKGYTHREGVDYQEIFALVVKHTSIRILMSVVVDQDLELEQMD 958
Query: 893 VKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLYGLKQAPRQWYMKFESFMHKEG 952
VKTAFLHG+L EE+YM QPEG + E EN VC+LKKSLYGLKQ+PRQW +F FM +
Sbjct: 959 VKTAFLHGELEEELYMEQPEGCISEDGENKVCLLKKSLYGLKQSPRQWNKRFNRFMIDQN 1018
Query: 953 FQKCNADHCCFFKRY-KSSYIILLLYVDDMLVAGSNIDEIKNLKIQLSKEFDMKDLGPAK 1011
F + D C + K+ + ++ LLLYVDDML+AG + EI +K QLS EF+MKD+GPA
Sbjct: 1019 FIRSEHDACVYVKQVSEQEHLYLLLYVDDMLIAGKSKSEINKVKEQLSMEFEMKDMGPAS 1078
Query: 1012 KILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVSTPLASHFRLSQEQSPQTEEE 1071
+ILG+ I RD + GVL++S A YI+ V+QRFNM +AK+ +P+ +HF+L+ + + ++E
Sbjct: 1079 RILGIDIIRDMKNGVLRMSQASYIHNVVQRFNMAEAKVTRSPIGAHFKLA---AVRDDDE 1135
Query: 1072 KELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSNPGKAHWEAVKWILRYLRGTT 1131
+PYASA+GS+MYAM+ RPD+ + + +VSR+M+ PG HWEAVKWILRY+RG+
Sbjct: 1136 CIDNNAVPYASAVGSIMYAMIGIRPDLAYVICLVSRYMARPGSIHWEAVKWILRYMRGSQ 1195
Query: 1132 EKCLYFGK-GEIKVEGYVDADFAGEVDHRRSTTGYIFTVGTRSVSWMSRIQKIVALSTTE 1190
+ L F K E +V GY D+D+A ++D RRS +GY+FTVG +VSW + +Q + ALSTTE
Sbjct: 1196 DLNLVFTKEKEFRVTGYCDSDYAADLDRRRSVSGYVFTVGGNTVSWKANLQSVTALSTTE 1255
Query: 1191 VEYVAVTEASKELIWLQGLLTELGFMQEKSALYSDS 1226
E++A+TEA+KE +W++GL+ +LG Q+K L+ DS
Sbjct: 1256 AEFMALTEAAKEALWIKGLMKDLGLEQDKVTLWCDS 1291
>gb|AAG50765.1| copia-type polyprotein, putative [Arabidopsis thaliana]
gi|12321254|gb|AAG50698.1| copia-type polyprotein,
putative [Arabidopsis thaliana] gi|25301687|pir||F96614
probable copia-type polyprotein T18I24.5 [imported] -
Arabidopsis thaliana
Length = 1320
Score = 795 bits (2054), Expect = 0.0
Identities = 479/1325 (36%), Positives = 732/1325 (55%), Gaps = 132/1325 (9%)
Query: 36 LTEKKPDSMKDDEWSLLDRQALGVVRLSLSRNVAFNIAKEKTTAGLMKALSSMYEKPPSS 95
L++ + D ++D D++AL ++ L + + + + + L + Y+
Sbjct: 51 LSQTQKDGLRDSRKR--DKKALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQV 108
Query: 96 NKVHL--MRRLF-TLRMAEGMSVAQHINELNIVTTQLSSVGIEFDDEVRALILLSSLPDS 152
KV L +R F L+M EG V+ + + + VT L G + DD +L SL
Sbjct: 109 KKVRLQTLRGEFEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLK 168
Query: 153 WSAIVTAVSSSSG-----------------SKKMKFDDVRDLVLSEEIRRRELGES---- 191
+ IVT + + KK K +D+ + VL+ +I + E G+S
Sbjct: 169 FEHIVTVIEETKDLEAMTIEQLLGSLQAYEEKKKKKEDIVEQVLNMQITKEENGQSYQRR 228
Query: 192 SSSSVL------------------HTESRGRNSTRGNGRGKSKARRSKSKNHRSSHNSKS 233
V +T RG NS+RG G+G K+R KS S
Sbjct: 229 GGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKS----------S 278
Query: 234 IECWNCGKTGHFKNQCRLPTKNQEEKDEANVASTSGGGDALICSLESKEE-----SWVLD 288
++C+NCGK GH+ ++C+ P+ + E+ V D L+ + K+E W LD
Sbjct: 279 VKCYNCGKFGHYASECKAPSNKKFEEKANYVEEKIQEEDMLLMASYKKDEQEENHKWYLD 338
Query: 289 SGASFHASSQKEFFKNYVPGNLGKVYLGNEQSCKVVGKGEVKIKL-NGSVWELKNVRHIP 347
SGAS H +K F G V LG+E +V GKG + I+L NG + NV +IP
Sbjct: 339 SGASNHMCGRKSMFAELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIP 398
Query: 348 NLTKNLISVGQLADEGYTTVFHGDDWKISKGAMTIARGRKSGTLYKTA------------ 395
++ N++S+GQL ++GY D ++ ++I R ++S + K
Sbjct: 399 SMKTNILSLGQLLEKGY-------DIRLKDNNLSI-RDQESNLITKVPMSKNRMFVLNIR 450
Query: 396 -GACHLIAVATNENPNLWHKRLGHMSEKGMKVMHSKGKLPSLRSIE--IDICEDCILGKQ 452
+ + E LWH R GH++ G++++ K + L I +CE C+LGKQ
Sbjct: 451 NDIAQCLKMCYKEESWLWHLRFGHLNFGGLELLSRKEMVRGLPCINHPNQVCEGCLLGKQ 510
Query: 453 KRVSFQTSGRTPKKEKLELVHSDVWGPTTVPSIGGKHYFVTFIDDHSRKVWVYFLKHKSE 512
++SF + ++ LEL+H+DV GP S+G +YF+ FIDD SRK WVYFLK KSE
Sbjct: 511 FKMSFPKESSSRAQKPLELIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSE 570
Query: 513 VFEAFKRWKAMVENETDLKIKKLRTDNGGEYEDTKFKKFCYEHGIRMERTVPGTPQHNGV 572
VFE FK++KA VE E+ L IK +R+D GGE+ +F K+C ++GIR + TVP +PQ NGV
Sbjct: 571 VFEIFKKFKAHVEKESGLVIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGV 630
Query: 573 AERMNRTLTERARSLRVQSGLPKKFWAEAVNTSAYLINRGPSVPLEHKIPEEVWSGKEVK 632
AER NRT+ E ARS+ LPK+ WAEAV + YL+NR P+ + K P+E WSG++
Sbjct: 631 AERKNRTILEMARSMLKSKRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPG 690
Query: 633 LSHLRVFGCVAYVHISDQGRNKLDPKSKKCIFIGYGEDEFGYRLWDDENKKMVRSKDVIF 692
+SHLRVFG +A+ H+ D+ R+KLD KS+K IFIGY + GY+L++ + KK + S++++F
Sbjct: 691 VSHLRVFGSIAHAHVPDEKRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVF 750
Query: 693 NERVMYKDKHNTTTNDSGLSEPVYVEMDDVPGSPTDKSPQSGELAESSIRQPSDTLVHPT 752
+E + + + +N+ + + E +D P PT + P S E PT
Sbjct: 751 DE----EGEWDWNSNEEDYNFFPHFE-EDKP-EPTREEPPSEE---------------PT 789
Query: 753 PVPVLRRSSRPHAPNRRYIDYMLLTDGGEPEDYDEACQTTDASKWELAMKEEMKSLISNQ 812
P SS+ + + EP D+ EA + W AM EE+KS+ N
Sbjct: 790 TPPTSPTSSQ-------------IEEKCEPMDFQEA---IEKKTWRNAMDEEIKSIQKND 833
Query: 813 TWELAKLPIGKKALHNKWVYRVKEDHDGS-KRYKARLVVKGFRQKEGIDYTEIFAPVVKL 871
TWEL LP G KA+ KWVY+ K++ G +RYKARLV KG+ Q+ GIDY E+FAPV +L
Sbjct: 834 TWELTSLPNGHKAIGVKWVYKAKKNSKGEVERYKARLVAKGYSQRAGIDYDEVFAPVARL 893
Query: 872 NTIRSVLSIVASENLYLEQLDVKTAFLHGDLVEEIYMHQPEGFLEEGKENMVCMLKKSLY 931
T+R ++S+ A + Q+DVK+AFL+GDL EE+Y+ QP+G++ +G+E+ V LKK+LY
Sbjct: 894 ETVRLIISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKALY 953
Query: 932 GLKQAPRQWYMKFESFMHKEGFQKCNADHCCFFKRYKSSYIILLLYVDDMLVAGSNIDEI 991
GLKQAPR W + + + ++ F KC +H + K K +I LYVDD++ G+N
Sbjct: 954 GLKQAPRAWNTRIDKYFKEKDFIKCPYEHALYIKIQKEDILIACLYVDDLIFTGNNPSMF 1013
Query: 992 KNLKIQLSKEFDMKDLGPAKKILGMQITRDKQKGVLQLS*AEYINRVLQRFNMGDAKLVS 1051
+ K +++KEF+M D+G LG+++ + + G+ ++ Y VL++F M D+ V
Sbjct: 1014 EEFKKEMTKEFEMTDIGLMSYYLGIEV-KQEDNGIF-ITQEGYAKEVLKKFKMDDSNPVC 1071
Query: 1052 TPLASHFRLSQEQSPQTEEEKELMAKIPYASAIGSLMYAMVCTRPDIGHAVGVVSRFMSN 1111
TP+ +LS+ +EE E + + S +GSL Y + CTRPDI +AVGVVSR+M +
Sbjct: 1072 TPMECGIKLSK------KEEGEGVDPTTFKSLVGSLRY-LTCTRPDILYAVGVVSRYMEH 1124
Query: 1112 PGKAHWEAVKWILRYLRGTTEKCLYFG-KGEIKVEGYVDADFAGEVDHRRSTTGYIFTVG 1170
P H++A K ILRY++GT L++ + K+ GY D+D+ G+VD R+ST+G++F +G
Sbjct: 1125 PTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDRKSTSGFVFYIG 1184
Query: 1171 TRSVSWMSRIQKIVALSTTEVEYVAVTEASKELIWLQGLLTELGFMQEK-SALYSDSQSA 1229
+ +WMS+ Q IV LST E EYVA T IWL+ LL EL QE+ + ++ D++SA
Sbjct: 1185 DTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSA 1244
Query: 1230 IHLAKNSAFHSRTKHIGLRYHFIRSLLEDEVLTLIKIQGSKNPADMLTKVVTIDKLKLCS 1289
I LAKN FH R+KHI RYH+IR + + + L ++ AD+ TK + +
Sbjct: 1245 IALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADIFTKPLKREDFIKMR 1304
Query: 1290 TLVGL 1294
+L+G+
Sbjct: 1305 SLLGV 1309
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.317 0.133 0.392
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,234,033,819
Number of Sequences: 2540612
Number of extensions: 97215271
Number of successful extensions: 279630
Number of sequences better than 10.0: 6033
Number of HSP's better than 10.0 without gapping: 4829
Number of HSP's successfully gapped in prelim test: 1208
Number of HSP's that attempted gapping in prelim test: 262606
Number of HSP's gapped (non-prelim): 11485
length of query: 1302
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1162
effective length of database: 507,674,714
effective search space: 589918017668
effective search space used: 589918017668
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 81 (35.8 bits)
Medicago: description of AC144760.12