
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0065.18
(799 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1... 835 0.0
pir||T02206 hypothetical protein - common tobacco retrotransposo... 828 0.0
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas] 775 0.0
gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas] 762 0.0
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-... 754 0.0
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense] 744 0.0
gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult... 742 0.0
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu... 717 0.0
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-... 717 0.0
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu... 687 0.0
gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis ... 685 0.0
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi... 684 0.0
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi... 679 0.0
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi... 679 0.0
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi... 676 0.0
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi... 666 0.0
gb|AAW22873.1| putative polyprotein [Lycopersicon esculentum] 641 0.0
emb|CAB79135.1| putative transposable element [Arabidopsis thali... 632 e-179
emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir|... 628 e-178
ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cu... 622 e-176
>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease]
Length = 1328
Score = 835 bits (2157), Expect = 0.0
Identities = 430/793 (54%), Positives = 559/793 (70%), Gaps = 16/793 (2%)
Query: 6 VENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRTLNER 65
VE +T K+K L+SDNGGEY S+EF+++CS +GIR KT+ GTP+ NGVAERMNRT+ E+
Sbjct: 536 VERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEK 595
Query: 66 ARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVFGCVS 125
R M + LPK FW +A+ TA YLINR PS+PL +++PE VW+ KEVS SHLKVFGC +
Sbjct: 596 VRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRA 655
Query: 126 YVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYKDRSS 185
+ + ++R KLD K+I C FIGYG + +GYR WD KK+IRSR+V F ES +
Sbjct: 656 FAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADM 715
Query: 186 AESMS-----------SSKQLKLSERVALEEISESDVVKRNQINPENEDDEVEVELEQEP 234
+E + S+ S +E+SE I + DE E+E
Sbjct: 716 SEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHP- 774
Query: 235 TTVIETPITQVRRSTRTPKAPQRYSPSLNYLLLTDAGEPEYFGEAMQGNDSIKWELAMKD 294
T E +RRS R P+ R PS Y+L++D EPE E + + + AM++
Sbjct: 775 -TQGEEQHQPLRRSER-PRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQE 832
Query: 295 EMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR-RYKARLVVKGFQQKQGIDFT 353
EM SLQKNGT+ L +LP+GK+ L+ +WV++LK++ D RYKARLVVKGF+QK+GIDF
Sbjct: 833 EMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFD 892
Query: 354 EIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGFEVLGTKNL 413
EIFSPVVKMT+IR ILS+ A+ +L +EQLDVK AFLHGDLEEEIYM QPEGFEV G K++
Sbjct: 893 EIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHM 952
Query: 414 VCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFADS-YIILALYVDDM 472
VCKL+KSLYGLKQAPRQWY KF+ FM + + + D C + K+F+++ +IIL LYVDDM
Sbjct: 953 VCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDM 1012
Query: 473 LIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEKYVEKLLDR 532
LI G + I +LK +S++F+MKDLGPA+QILGM+I R R+ L LSQEKY+E++L+R
Sbjct: 1013 LIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLER 1072
Query: 533 FNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLMYAMVCTRPDIAHA 592
FN+ +A STPL HLK SKK P T EE+ M+ VPY+SAVGSLMYAMVCTRPDIAHA
Sbjct: 1073 FNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHA 1132
Query: 593 VGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADLGGDSDGGKS 652
VGVVSRF+ NPGKEHWE VKWILRYL+G++ CLCF ++ L+ ++DAD+ GD D KS
Sbjct: 1133 VGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKS 1192
Query: 653 TTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKELGKEQDVPP 712
+TGY+FT G A+SW+SKLQ VALSTTE+EY+A +E KEMIWLK FL+ELG Q
Sbjct: 1193 STGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYV 1252
Query: 713 LFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILGSENPTDMLTKTVT 772
++ DSQS I L+KN ++H+R KHI ++YH+IRE++ DE L +LKI +ENP DMLTK V
Sbjct: 1253 VYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVP 1312
Query: 773 ADNLRLCIASAGL 785
+ LC G+
Sbjct: 1313 RNKFELCKELVGM 1325
>pir||T02206 hypothetical protein - common tobacco retrotransposon Tto1
gi|1167523|dbj|BAA11674.1| ORF(AA 1-1338) [Nicotiana
tabacum]
Length = 1338
Score = 828 bits (2139), Expect = 0.0
Identities = 425/799 (53%), Positives = 547/799 (68%), Gaps = 21/799 (2%)
Query: 4 TEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRTLN 63
T VE +T K+K +++DNGGEY Q F +C E+GIR T TP+ NG+AERMNRTL
Sbjct: 532 TLVERETGKKLKCIRTDNGGEYQGQ-FDAYCKEHGIRHQFTPPKTPQLNGLAERMNRTLI 590
Query: 64 ERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVFGC 123
ER RC+ S LPK FW +A+ TAAY++N P +PL Y+ PE++W G+++S L+VFGC
Sbjct: 591 ERTRCLLSHSKLPKAFWGEALVTAAYVLNHSPCVPLQYKAPEKIWLGRDISYDQLRVFGC 650
Query: 124 VSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYKDR 183
+YV + D R KLD K +C FIGYG DM GY+F+D KK++RSR+V F E +D
Sbjct: 651 KAYVHVPKDERSKLDVKTRECVFIGYGQDMLGYKFYDPVEKKLVRSRDVVFVEDQTIEDI 710
Query: 184 SSAE-SMSSSKQLKLSERVALEEISES--------------DVVKRNQINPENEDDEVEV 228
E S S + +L V ++ + D + + N +N DD+ +
Sbjct: 711 DKVEKSTDDSAEFELPPTVVPRQVGDDVQDNQPEAPGLPNEDELADTEGNEDNGDDDADE 770
Query: 229 ELEQEPTTVIETPITQVRRSTRTPKAPQRYSPSLNYLLLTDAGEPEYFGEAMQGNDSIKW 288
E + +P + P RS R + RYSP Y+LLTD GEP+ F EA+ KW
Sbjct: 771 EDQPQPPILNNPPYHT--RSGRVVQQSTRYSPH-EYVLLTDGGEPDSFEEAIDDEHKEKW 827
Query: 289 ELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR-RYKARLVVKGFQQK 347
AM+DE+ SL +N T+ L KLP+GK+AL+N+WV+++K + S R+KARLVVKGF Q+
Sbjct: 828 IEAMQDEIKSLHENKTFELVKLPKGKRALKNKWVFKMKHDEHNSLPRFKARLVVKGFNQR 887
Query: 348 QGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGFEV 407
+GIDF EIFSPVVKMT+IR +L + A+ NL +EQ+DVK AFLHGDLEEEIYM QP+GF+
Sbjct: 888 KGIDFDEIFSPVVKMTSIRTVLGLAASLNLEVEQMDVKTAFLHGDLEEEIYMEQPDGFQQ 947
Query: 408 LGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFADS-YIILA 466
G ++ VC+L KSLYGLKQAPRQWYKKF M G+ + DHC F +KF+D +IIL
Sbjct: 948 KGKEDYVCRLRKSLYGLKQAPRQWYKKFESVMGQHGYKKTTSDHCVFAQKFSDDDFIILL 1007
Query: 467 LYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEKYV 526
LYVDDMLI G N++ IN LK+Q+S+ F MKDLGPAKQILGMRI R+R L LSQEKY+
Sbjct: 1008 LYVDDMLIVGRNVSRINSLKEQLSKFFAMKDLGPAKQILGMRIMRDREAKKLWLSQEKYI 1067
Query: 527 EKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLMYAMVCTR 586
EK+L RFN+ S PL NH + S KQSP TD+E M +PYASAVGSLMYAMVCTR
Sbjct: 1068 EKVLQRFNMEKTKAVSCPLANHFRLSTKQSPSTDDERRKMERIPYASAVGSLMYAMVCTR 1127
Query: 587 PDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADLGGD 646
PDIAHAVGVVSRF+SNPGKEHW+ VKWILRYL+G+S++CLCF +N L ++DAD+ GD
Sbjct: 1128 PDIAHAVGVVSRFLSNPGKEHWDAVKWILRYLRGTSKLCLCFGEDNPVLVGYTDADMAGD 1187
Query: 647 SDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKELGK 706
D KST+GY+ G AVSW+SKLQ VALSTTE+E++A +EA KE+IW+K FL ELG
Sbjct: 1188 VDSRKSTSGYLINFSGGAVSWQSKLQKCVALSTTEAEFIAATEACKELIWMKKFLTELGF 1247
Query: 707 EQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILGSENPTDM 766
QD LF DSQS I LAKN FHSR KHI ++Y++IR+++ + L L KI EN +DM
Sbjct: 1248 SQDGYQLFCDSQSAIHLAKNASFHSRSKHIDVRYNWIRDVLEKKMLRLEKIHTDENGSDM 1307
Query: 767 LTKTVTADNLRLCIASAGL 785
LTKT+ C +AG+
Sbjct: 1308 LTKTLPKGKFEFCREAAGI 1326
>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
Length = 1298
Score = 775 bits (2001), Expect = 0.0
Identities = 399/791 (50%), Positives = 533/791 (66%), Gaps = 24/791 (3%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
+K VE + KIK ++DNGGEY S+EF FC + GI+ T+ TP+QNGVAERMNRT
Sbjct: 522 FKARVELDSGKKIKCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYTPQQNGVAERMNRT 581
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
L ER R M +GL K FW +A+NTA YL+NR PS ++ + P E+W+GK V S+L +F
Sbjct: 582 LLERTRAMLRAAGLEKSFWAEAVNTACYLVNRAPSTAIELKTPMEMWTGKPVDYSNLHIF 641
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
G + Y + ++ KLDPK+ KC F+GY + GYR WD K++ SR+V F E L +
Sbjct: 642 GSIVYAMYNAQEITKLDPKSRKCRFLGYADGVKGYRLWDPTAHKVVISRDVIFVEDRLQR 701
Query: 182 DRSSAESMSSSKQLKLSERVALEEISESDVVKRNQINPENEDDEVEVELEQEPTTVIETP 241
S+ K+ + ++ +EE E D ++ P +E E E E PTT
Sbjct: 702 GEVDD---STEKEKPETTQIQVEEEFEQD---SSEAEPAHE--EPEPESSGAPTT----- 748
Query: 242 ITQVRRSTRTPKAPQRYSP-----SLNYLLLTDAGEPEYFGEAMQGNDSIKWELAMKDEM 296
R+S R + P +S ++ Y LLT+ GEP F EA+ +D +W AM++E+
Sbjct: 749 ----RQSDREKRRPTWHSDYVMEGNVAYCLLTEDGEPSTFQEAINSSDVSQWTAAMQEEI 804
Query: 297 TSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGS-RRYKARLVVKGFQQKQGIDFTEI 355
+L KN TW L LP+G+K + N+WV+++K D RY+ARLVVKG+ QK+GIDF EI
Sbjct: 805 EALHKNNTWDLVPLPQGRKPIGNKWVFKIKRNGDDQVERYRARLVVKGYAQKEGIDFNEI 864
Query: 356 FSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGFEVLGTKNLVC 415
FSPVV++TT+RV+L++ A NLHLEQLDVK AFLHGDLEEEIYM QPEGFE +NLVC
Sbjct: 865 FSPVVRLTTVRVVLAMCATFNLHLEQLDVKTAFLHGDLEEEIYMLQPEGFEDKENQNLVC 924
Query: 416 KLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFA-DSYIILALYVDDMLI 474
+L+KSLYGLKQAPR WYK+F+ F+ G+NR + D C + K+F D+++IL LYVDDML+
Sbjct: 925 RLNKSLYGLKQAPRCWYKRFDSFIMCLGYNRLNADPCAYFKRFGKDNFVILLLYVDDMLV 984
Query: 475 AGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEKYVEKLLDRFN 534
AG N I+ LK Q++ FEMKDLGPA +ILGM+I R+R + LSQ+ Y++K+L RF+
Sbjct: 985 AGPNKDHIDELKAQLAREFEMKDLGPANKILGMQIHRDRGNRKIWLSQKNYLKKILSRFS 1044
Query: 535 VGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLMYAMVCTRPDIAHAVG 594
+ D + STPL +LK S SP +E MS VPYASAVGSLM+AM+CTRPDIA AVG
Sbjct: 1045 MQDCKSISTPLPINLKVSSSMSPSNEEGRMEMSRVPYASAVGSLMFAMICTRPDIAQAVG 1104
Query: 595 VVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADLGGDSDGGKSTT 654
VVSR+M+NPG+EHW CVK ILRY+KG+S + LC+ ++ + + D+D GD D KSTT
Sbjct: 1105 VVSRYMANPGREHWNCVKRILRYIKGTSDVALCYGGSDFIINGYVDSDYAGDLDKSKSTT 1164
Query: 655 GYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKELGKEQDVPPLF 714
GY+F + G AVSW SKLQ VA STTE+EYVA ++A+KE IWLK L+ELG +Q+ LF
Sbjct: 1165 GYVFKVAGGAVSWVSKLQAVVATSTTEAEYVAATQASKEAIWLKMLLEELGHKQEFVSLF 1224
Query: 715 SDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILGSENPTDMLTKTVTAD 774
DSQS + LA+NP FHSR KHI+++YHFIRE + + + L KI ++N D LTK + D
Sbjct: 1225 CDSQSALHLARNPAFHSRTKHIRVQYHFIREKVKEGTVDLQKIHTADNVADFLTKIINVD 1284
Query: 775 NLRLCIASAGL 785
C +S GL
Sbjct: 1285 KFTWCRSSCGL 1295
>gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]
Length = 1415
Score = 762 bits (1967), Expect = 0.0
Identities = 397/780 (50%), Positives = 525/780 (66%), Gaps = 38/780 (4%)
Query: 6 VENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRTLNER 65
VE QT K+K +++DNGGEY F ++C GIR KT P+ NG+AERMNRT+ ER
Sbjct: 532 VERQTGKKLKCIRTDNGGEYCGP-FDEYCRRYGIRHQKTPPKIPQLNGLAERMNRTIMER 590
Query: 66 ARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVFGCVS 125
RCM + LP FW +A++TA ++IN P I L ++P++VW GK+VS HL+VFGC +
Sbjct: 591 VRCMLDDAKLPSSFWAEAVSTAVHVINLSPVIALKNEVPDKVWCGKDVSYDHLRVFGCKA 650
Query: 126 YVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYKDRSS 185
+V + D R KLD K +C FIGYGFD +GYR +D KK++RSR+V F E+ +D
Sbjct: 651 FVHVPRDERSKLDSKTRQCIFIGYGFDEFGYRLYDPVEKKLVRSRDVVFFENQTIEDIDK 710
Query: 186 AESMSS--SKQLKLSERVALEEISESDVVKRNQIN----PENEDDEVEVE------LEQE 233
+ S S L E V+ + D V+ N N P+ + D V+V+ + QE
Sbjct: 711 VKQPESRDSGSLVDIEPVSRRYTDDVDEVQENVQNGDPVPDYQGDTVDVDGHADDVVHQE 770
Query: 234 PTTVIETPITQVRRSTRTPKAPQRYSPSLNYLLLTDAGEPEYFGEAMQGNDSIKWELAMK 293
+ P+ RRS R + RYSPS Y+LLTD GEPE + EAM+ + +W AM+
Sbjct: 771 QEVPSQVPVDLPRRSDRERRPSTRYSPS-QYVLLTDGGEPESYEEAMESDQKRQWFEAMQ 829
Query: 294 DEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLK-EESDGSRRYKARLVVKGFQQKQGIDF 352
+EM SL N T+ L K P+ +KAL+NRWVYR+K EE R+KARLVVKGF QK+GIDF
Sbjct: 830 EEMNSLYVNDTFELVKAPKNRKALKNRWVYRVKHEEGTSVPRFKARLVVKGFSQKKGIDF 889
Query: 353 TEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGFEVLGTKN 412
EIFSPVVK ++IRV+L + A ++ +EQ+DVK AFLHGDL+EEIYM QPEGF+V G ++
Sbjct: 890 DEIFSPVVKFSSIRVVLGLAARLDIEIEQMDVKTAFLHGDLDEEIYMEQPEGFKVKGKED 949
Query: 413 LVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFADS-YIILALYVDD 471
VC+L KSLYGLKQAPRQWYKKF MS G+ + DHC FV +++D ++IL LYVDD
Sbjct: 950 YVCRLKKSLYGLKQAPRQWYKKFTSVMSKHGYKKTSSDHCVFVNRYSDDDFVILLLYVDD 1009
Query: 472 MLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEKYVEKLLD 531
MLI G N + I LKQ++S++F MKD+GPAKQILGM+I R+R L LSQEKY+EK+L+
Sbjct: 1010 MLIVGRNASRIQELKQELSKSFSMKDMGPAKQILGMKIIRDRQNKKLWLSQEKYIEKVLE 1069
Query: 532 RFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLMYAMVCTRPDIAH 591
RF++ +A STPL H K KKQ P +++E+ M VPY+SAVGSLMYAMVCTRPDIAH
Sbjct: 1070 RFHMNEAKPVSTPLDMHFKLCKKQCPSSEKEKEEMQRVPYSSAVGSLMYAMVCTRPDIAH 1129
Query: 592 AVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADLGGDSDGGK 651
AVGVVSRF+SNPG+EHW+ VKWILRYL+G+S + LCF L ++D+D+ GD D K
Sbjct: 1130 AVGVVSRFLSNPGREHWDAVKWILRYLRGTSSLSLCFGTGKPILTGYTDSDMAGDIDTRK 1189
Query: 652 STTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKELGKEQDVP 711
ST+GY+ T G AVSW+S+LQ V LSTTE+E++A EA+KEM+W+K FL+ELG QD
Sbjct: 1190 STSGYLITYAGGAVSWQSRLQKCVDLSTTEAEFIASVEASKEMLWMKKFLQELGFVQD-- 1247
Query: 712 PLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILGSENPTDMLTKTV 771
R KHI +YH+IR+++ + L L KI +N +DM+TK +
Sbjct: 1248 --------------------RSKHIDTRYHWIRDILECKMLELEKIHTDDNGSDMMTKAL 1287
>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
(japonica cultivar-group)]
Length = 2340
Score = 754 bits (1946), Expect = 0.0
Identities = 392/791 (49%), Positives = 517/791 (64%), Gaps = 22/791 (2%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
WKT VE QT K+K L++DNG E+ S+ FK +C GI T+ TP+QNGVAERMNRT
Sbjct: 750 WKTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRT 809
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ +ARC+ +GLPK FW +A++TA YLINR PS +D + P EVWSG + S L+VF
Sbjct: 810 IISKARCLLSNAGLPKQFWAEAVSTACYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVF 869
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
GC +Y +D+ KL+P+AIKC F+GY + GY+ W + KK++ SRNV F+ESV+
Sbjct: 870 GCTAYAHVDNG---KLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHESVMLH 926
Query: 182 DRSSAESMSSSKQ---LKLSERVALEEISESDVVKRNQINPENEDDEVEVELEQEPTTVI 238
D+ S S++ +++ ++ E + V NQ P ED + ++Q P I
Sbjct: 927 DKPSTNVPVESQEKASVQVEHLISSGHAPEKEDVAINQDEPVIEDSNSSI-VQQSPKRSI 985
Query: 239 ETPITQVRRSTRTPKAPQRYSPSLNYLL--------LTDAGEPEYFGEAMQGNDSIKWEL 290
R R K PQRY N + + EP + EA+ +D +W
Sbjct: 986 AKD-----RPKRNIKPPQRYIEEANIVAYALSVAEEIEGNAEPSTYSEAIVSDDCNRWIT 1040
Query: 291 AMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKE--ESDGSRRYKARLVVKGFQQKQ 348
AM DEM SL+KN TW L KLP+ KK ++ +W+++ KE S RYKARL+ KG+ Q
Sbjct: 1041 AMHDEMESLEKNHTWELVKLPKEKKPIRCKWIFKRKEGISSSDEARYKARLIAKGYSQIP 1100
Query: 349 GIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGFEVL 408
GIDF ++FSPVVK ++IR +LSIVA + LEQ+DVK AFLHG+LEE+IYM QP+GF V
Sbjct: 1101 GIDFNDVFSPVVKHSSIRTLLSIVAMHDYELEQMDVKTAFLHGELEEDIYMEQPKGFVVP 1160
Query: 409 GTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFADSYIILALY 468
G +NLVC+L KSLYGLKQ+PRQWYK+F+ FM + F R + D C ++K S I L LY
Sbjct: 1161 GKENLVCRLKKSLYGLKQSPRQWYKRFDSFMLSQKFRRSNYDSCVYLKVVDGSAIYLLLY 1220
Query: 469 VDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEKYVEK 528
VDDMLIA + +EI +LK Q+S FEMKDLG AK+ILGM I+R R L LSQ+ Y+EK
Sbjct: 1221 VDDMLIAAKDKSEIAKLKAQLSSEFEMKDLGAAKKILGMEITRKRHSFKLYLSQKGYIEK 1280
Query: 529 LLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLMYAMVCTRPD 588
+L RFN+ DA STPL H + S PQ+D + YMS VPY+SAVGSLMYAMVC+RPD
Sbjct: 1281 VLRRFNMHDAKPVSTPLAAHFRLSSDLCPQSDYDIEYMSRVPYSSAVGSLMYAMVCSRPD 1340
Query: 589 IAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADLGGDSD 648
++HA+ VVSR+M+NPGKEHW+ V+WI RYL+G+S CL F R+ L + D+D GD D
Sbjct: 1341 LSHALSVVSRYMANPGKEHWKAVQWIFRYLRGTSSACLQFGRSRDGLVGYVDSDFAGDLD 1400
Query: 649 GGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKELGKEQ 708
G+S GY+FT+GG AVSWK+ LQ VALSTTE+EY+AISEA KE IWL+ L
Sbjct: 1401 RGRSLAGYVFTIGGCAVSWKASLQATVALSTTEAEYMAISEACKEAIWLRGLYTVLCAVT 1460
Query: 709 DVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILGSENPTDMLT 768
+F DSQS I L K+ +FH R KHI ++YHFIR LI++ ++ + KI +NP DM+T
Sbjct: 1461 SCINIFCDSQSAICLTKDQMFHERTKHIDVRYHFIRGLIAEGDVKICKISIHDNPADMMT 1520
Query: 769 KTVTADNLRLC 779
K V A LC
Sbjct: 1521 KPVPATKFELC 1531
>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
Length = 1328
Score = 744 bits (1921), Expect = 0.0
Identities = 397/795 (49%), Positives = 536/795 (66%), Gaps = 35/795 (4%)
Query: 6 VENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRTLNER 65
VE +T K K L++DNGGEY S+EF+++CS +GIR KT+ GTP+ NGVAERMNRT+ E+
Sbjct: 537 VERETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGTPQHNGVAERMNRTIVEK 596
Query: 66 ARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVFGCVS 125
R M + LPK FW +A+ TA YLINR PS+PL++ +PE VW+ KE+S SHLKVFGC +
Sbjct: 597 VRSMLRMAKLPKTFWGEAVRTACYLINRSPSVPLEFDIPERVWTNKEMSYSHLKVFGCKA 656
Query: 126 YVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVL--YKDR 183
+ + ++R KLD K++ C FIGYG + +GYR WD KK+IRSR+V F ES + D
Sbjct: 657 FAHVPKEQRTKLDDKSVPCIFIGYGDEEFGYRLWDLVKKKVIRSRDVIFRESEVGTAADL 716
Query: 184 S-----------------SAESMSSSKQLKLSERVALEEISESDVVKRNQINPENEDDEV 226
S S+ + +S + + E V EE + V + Q+ E E
Sbjct: 717 SEKAKKKNGIIPNLVTIPSSSNHPTSAESTIDEVVEQEEQPDEIVEQGEQLGDNTEQMEY 776
Query: 227 EVELEQEPTTVIETPITQVRRSTRTPKAPQRYSPSLNYLLLTDAGEPEYFGEAMQGNDSI 286
E + +P +RRS R +Y PS Y+L+ GEPE E + +
Sbjct: 777 PEEEQSQP----------LRRSERQRVESTKY-PSSEYVLIKYEGEPENLKEVLSHPEKS 825
Query: 287 KWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR-RYKARLVVKGFQ 345
+W AM +EM SLQKNGT+ L +LP+GK+ L+ +WV++LK++ +G RYKARLVVKGF+
Sbjct: 826 QWMKAMHEEMGSLQKNGTYQLVELPKGKRPLKCKWVFKLKKDGNGKLVRYKARLVVKGFE 885
Query: 346 QKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGF 405
QK+GIDF EIFSPVVKMT+IR ILSI A+ +L +EQLDVK AFLHGDLEEEIYM Q EGF
Sbjct: 886 QKKGIDFDEIFSPVVKMTSIRTILSIAASLDLEVEQLDVKTAFLHGDLEEEIYMEQGEGF 945
Query: 406 EVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFAD-SYII 464
EV G K++VCKL+KSLYGLKQAPRQWYKKF+ FM + + C + K+F+D ++II
Sbjct: 946 EVSGKKHMVCKLNKSLYGLKQAPRQWYKKFDSFMKSQTYRNTYSHPCVYFKRFSDKNFII 1005
Query: 465 LALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEK 524
L LY D MLI G + I +L++ S++F+MKDLGPAKQILGM+I+R + L LS EK
Sbjct: 1006 LLLYTDYMLIVGKDKELIAKLRKDFSKSFDMKDLGPAKQILGMKIAREEQK-KLGLSHEK 1064
Query: 525 YVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDE-EESYMSTVPYASAVGSLMYAMV 583
Y+E++L+RFN+ A STPL ++LK +K+ P + E+ M+ VPY+SAVGS MYAMV
Sbjct: 1065 YIERVLERFNMKSAKPISTPLVSYLKLTKQMFPTKKKGEKGDMAKVPYSSAVGSFMYAMV 1124
Query: 584 CTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADL 643
CTRP+I AV VVSRF+ PGKEH E VKWILRYL+ ++R CF ++ + +++ D+
Sbjct: 1125 CTRPNIV-AVCVVSRFLEIPGKEHLEAVKWILRYLRRTTRDYFCFEGSDPISKGYTNVDM 1183
Query: 644 GGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKE 703
GD D KSTT Y+FT G +SW+SKLQ VALSTTE++Y+A +E KEM+WLK FL+E
Sbjct: 1184 EGDLDNRKSTTCYLFTFSGGDISWQSKLQKYVALSTTEAKYIAGTEVCKEMLWLKRFLQE 1243
Query: 704 LGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILGSENP 763
G Q ++ +SQS + L+K ++H+ KHI M+YH+IRE++ D L ++KI SENP
Sbjct: 1244 HGLHQKEYVVYCESQSAMDLSKKAMYHATTKHIDMRYHWIREMVDDGSLQVVKIPTSENP 1303
Query: 764 TDMLTKTVTADNLRL 778
DM+TK V + L
Sbjct: 1304 ADMVTKVVQNEKFEL 1318
>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1241
Score = 742 bits (1916), Expect = 0.0
Identities = 388/797 (48%), Positives = 519/797 (64%), Gaps = 22/797 (2%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
WKT VE QT K+K L++DNG E S+ FK +C GI T+ TP+QNGVAERMNRT
Sbjct: 452 WKTMVERQTERKVKILRTDNGMELCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRT 511
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ +ARCM + LPK FW +A++TA YLINR PS +D + P EVWSG + S L+VF
Sbjct: 512 IISKARCMLSNASLPKQFWAEAVSTACYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVF 571
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
GC +Y +D+ KL+P+ IKC F+GY + GY+ W + KK++ SRNV F+ES++
Sbjct: 572 GCTAYAHVDNG---KLEPRVIKCIFLGYLSGVKGYKLWCPETKKVVISRNVVFHESIMLH 628
Query: 182 DRSSAESMSSSKQ---LKLSERVALEEISESDVVKRNQINPENEDDEVEVELEQEPTTVI 238
D+ S S++ +++ ++ E + V NQ P ED + + ++Q P I
Sbjct: 629 DKPSTNVPVESQEKVSVQVEHLISSGHAPEKEDVAINQDAPVIEDSDSSI-VQQSPKRSI 687
Query: 239 ETPITQVRRSTRTPKAPQRYSPSLNYLL--------LTDAGEPEYFGEAMQGNDSIKWEL 290
R R K P+RY N + + EP + +A+ +D +W
Sbjct: 688 AKD-----RPKRNTKPPRRYIEEANIVAYALSVAEEIEGNAEPSTYSDAIVSDDCNRWIT 742
Query: 291 AMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGS--RRYKARLVVKGFQQKQ 348
AM DEM SL+KN +W L KLP+ KK ++ +W+++ KE S RYKARLV KG+ Q
Sbjct: 743 AMHDEMESLEKNHSWELEKLPKEKKPIRCKWIFKRKEGMSPSDEARYKARLVAKGYSQIP 802
Query: 349 GIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGFEVL 408
GIDF ++FSPVVK ++IR +LSIVA + LEQ+DVK AFLHG+LEE+IYM QPEGF V
Sbjct: 803 GIDFNDVFSPVVKHSSIRTLLSIVAMHDYELEQMDVKTAFLHGELEEDIYMEQPEGFVVP 862
Query: 409 GTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFADSYIILALY 468
G +NLVC+L KSLYGLKQ+PRQWYK+F+ FM + F R + D C ++K S I L LY
Sbjct: 863 GKENLVCRLKKSLYGLKQSPRQWYKRFDSFMLSQKFRRSNYDSCVYLKVVDGSAIYLLLY 922
Query: 469 VDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEKYVEK 528
VDDMLIA + +EI +LK Q+S FEMKDLG AK+ILGM I+R R G L LSQ+ Y+EK
Sbjct: 923 VDDMLIAAKDKSEIAKLKAQLSSEFEMKDLGAAKKILGMEITRERHSGKLYLSQKCYIEK 982
Query: 529 LLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLMYAMVCTRPD 588
+L RFN+ DA ST L H + S PQ+ + YMS VPY+SAV SLMYAMVC+RPD
Sbjct: 983 VLHRFNMHDAKLVSTLLAAHFRLSSDLCPQSAYDIEYMSRVPYSSAVSSLMYAMVCSRPD 1042
Query: 589 IAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADLGGDSD 648
++HA+ VVSR+M+NPGKEHW+ V+WI RYL+G+S CL F R++ L + D+D GD D
Sbjct: 1043 LSHALSVVSRYMANPGKEHWKAVQWIFRYLRGTSSACLQFGRSSDGLVGYVDSDFAGDLD 1102
Query: 649 GGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKELGKEQ 708
+S TGY+FT+GG AVSWK+ LQ VALSTTE+EY+AISEA KE+IWL+ EL
Sbjct: 1103 RRRSLTGYVFTVGGCAVSWKASLQATVALSTTEAEYMAISEACKEVIWLRGLYTELCGVT 1162
Query: 709 DVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILGSENPTDMLT 768
+F DSQS I L K+ +FH R KHI ++YHFIR +I++ ++ + KI +NP DM+T
Sbjct: 1163 SCINIFCDSQSAICLTKDQMFHERTKHIDLRYHFIRGVIAEGDVKVCKISTHDNPVDMMT 1222
Query: 769 KTVTADNLRLCIASAGL 785
K V A LC + G+
Sbjct: 1223 KPVPATKFELCSSLVGV 1239
>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|53370655|gb|AAU89150.1| integrase core domain
containing protein [Oryza sativa (japonica
cultivar-group)] gi|40538906|gb|AAR87163.1| putative
polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1322
Score = 717 bits (1852), Expect = 0.0
Identities = 373/796 (46%), Positives = 511/796 (63%), Gaps = 25/796 (3%)
Query: 3 KTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRTL 62
K +E QT ++K L +DNGGE+ S F +C + GI TI TP+QNGVAERMNRT+
Sbjct: 537 KVMIERQTEKEVKVLCTDNGGEFCSDAFDDYCRKEGIVRHHTIPYTPQQNGVAERMNRTI 596
Query: 63 NERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVFG 122
+ARCM + + K FW +A NTA YLINR PSIPL+ + P E+WSG S L+VFG
Sbjct: 597 ISKARCMLSNARMNKRFWAEAANTACYLINRSPSIPLNKKTPIEIWSGMPADYSQLRVFG 656
Query: 123 CVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYKD 182
C +Y +D+ KL+P+AIKC F+GYG + GY+ W+ + K SRNV FNE V++ D
Sbjct: 657 CTAYAHVDNG---KLEPRAIKCLFLGYGSGVKGYKLWNPETNKTFMSRNVIFNEFVMFND 713
Query: 183 RSSAESM---SSSKQLKLSERVALEEISESDVVKRNQINPENEDDEVEVELEQEPTTVIE 239
+ + S +Q +S +V + E+++V N +N + ++ + EP
Sbjct: 714 SLPTDVIPGGSDEEQQYVSVQVEHVDDQETEIVG-NDVNDTVQHSPSVLQPQDEPIAH-- 770
Query: 240 TPITQVRRSTRTPKAPQRYSPSLNYLL--------LTDAGEPEYFGEAMQGNDSIKWELA 291
RR+ R+ AP R + + + + EP + EA+ D KW A
Sbjct: 771 ------RRTKRSCGAPVRLIEECDMVYYAFSYAEQVENTLEPATYTEAVVSGDREKWISA 824
Query: 292 MKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR--RYKARLVVKGFQQKQG 349
+++EM SL+KNGTW L LP+ KK ++ +W+++ KE S R+K RLV KGF Q G
Sbjct: 825 IQEEMQSLEKNGTWELVHLPKQKKPVRCKWIFKRKEGLSPSEPPRFKVRLVAKGFSQIAG 884
Query: 350 IDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGFEVLG 409
+D+ ++FSPVVK ++IR SIV +L LEQLDVK FLHG+LEEEIYM QPEGF V G
Sbjct: 885 VDYNDVFSPVVKHSSIRTFFSIVTMHDLELEQLDVKTTFLHGELEEEIYMDQPEGFIVPG 944
Query: 410 TKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFADSYIILALYV 469
++ VCKL +SLYGLKQ+PRQWYK+F+ FM + GF R + D C ++K S I L LYV
Sbjct: 945 KEDYVCKLKRSLYGLKQSPRQWYKRFDSFMLSHGFKRSEFDSCVYIKFVNGSPIYLLLYV 1004
Query: 470 DDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEKYVEKL 529
DDMLIA + +I LK+Q+S F+MKDLG AK+ILGM I+R+R+ G+L LSQ+ Y++K+
Sbjct: 1005 DDMLIAAKSKEQITTLKKQLSSEFDMKDLGAAKKILGMEITRDRNSGLLFLSQQSYIKKV 1064
Query: 530 LDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLMYAMVCTRPDI 589
L RFN+ DA STP+ H K S Q TDE+ YMS VPY+SAVGSLMYAMVC+ PD+
Sbjct: 1065 LQRFNMHDAKPVSTPIAPHFKLSALQCASTDEDVEYMSRVPYSSAVGSLMYAMVCSWPDL 1124
Query: 590 AHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADLGGDSDG 649
+HA+ +VSR+M+NPGKEHW+ V+WI RYL+G++ CL F R + L + D+D D D
Sbjct: 1125 SHAMSLVSRYMANPGKEHWKAVQWIFRYLRGTADACLKFGRIDKGLVGYVDSDFAADLDK 1184
Query: 650 GKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKELGKEQD 709
+S TGY+FT+G AVSWK+ LQ VA STTE+EY+AI+EA KE +WLK EL
Sbjct: 1185 RRSLTGYVFTIGSCAVSWKATLQPVVAQSTTEAEYMAIAEACKESVWLKGLFAELCGVDS 1244
Query: 710 VPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILGSENPTDMLTK 769
LF DSQS I L K+ +FH R KHI +KYH++R++++ +L + KI +NP DM+TK
Sbjct: 1245 CINLFCDSQSAICLTKDQMFHERTKHIDIKYHYVRDIVAQGKLKVCKISIHDNPADMMTK 1304
Query: 770 TVTADNLRLCIASAGL 785
+ LC + G+
Sbjct: 1305 PIPVAKFELCSSLVGI 1320
>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
(japonica cultivar-group)]
Length = 1373
Score = 717 bits (1850), Expect = 0.0
Identities = 370/769 (48%), Positives = 507/769 (65%), Gaps = 25/769 (3%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
WK +E QT K+K L++DNGGE+ S F +C + GI TI TP+QNGVAERMNRT
Sbjct: 535 WKVMIERQTERKVKLLRTDNGGEFCSHAFNDYCRQEGIVRHHTIPHTPQQNGVAERMNRT 594
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ RARCM + + K FW +A +TA YLINR PSIPL+ + P EVWSG S LKVF
Sbjct: 595 IISRARCMLSHARMNKRFWAEAASTACYLINRSPSIPLNKKTPIEVWSGTPADYSQLKVF 654
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
GC +Y +D+ KL+P+A+KC F+GYG + GY+ W+ + K SR+V FNESV++
Sbjct: 655 GCTAYAHVDNG---KLEPRAVKCLFLGYGSGVKGYKLWNPETGKTFMSRSVVFNESVMFT 711
Query: 182 DRSSAESMSSSKQLKLSERVALEEISESDVVKRNQINPENE------DDEVEVELEQEPT 235
+ +E + + ++ +V E + + V+ ++ +++ DD+ +++Q P
Sbjct: 712 NSLPSEHVPEKELQRMHMQV--EHVDDDTGVQVEPVHEQDDHNNDVADDDAHDDVQQTPP 769
Query: 236 TVI---ETPITQVRRSTRTPKAPQRYSP--SLNYLLLTDAG------EPEYFGEAMQGND 284
+ E PI Q R+S RT K P+R +L+Y L+ A EP + EA++ D
Sbjct: 770 ILQLEEELPIAQ-RKSKRTTKPPKRLIEECNLSYYALSCAEQVENVHEPATYKEAVRCGD 828
Query: 285 SIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR--RYKARLVVK 342
S W AM +EM SL+KN TW + LP+ KK + +W+++ KE S +YKARLV +
Sbjct: 829 SENWISAMHEEMQSLEKNSTWEVVPLPKKKKTISCKWIFKRKEGLSSSEPPKYKARLVAR 888
Query: 343 GFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQP 402
G+ Q G+D+ ++FSPVVK ++IR LSIVA+ +L LEQLDVK AFLHG+LEE+IYM QP
Sbjct: 889 GYSQIPGVDYNDVFSPVVKHSSIRTFLSIVASHDLELEQLDVKTAFLHGELEEDIYMDQP 948
Query: 403 EGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFADSY 462
EGF V G + VCKL +SLYGLKQ+PRQW K+F+ FM + F R D C ++K S
Sbjct: 949 EGFIVPGKEKYVCKLKRSLYGLKQSPRQWNKRFDSFMLSHSFKRSKYDSCVYIKHVNGSP 1008
Query: 463 IILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQ 522
I L LYVDDMLIA + EI +LK+ +S F+MKDLG AK+ILGM ISR+R G+L LSQ
Sbjct: 1009 IYLLLYVDDMLIAAKSKIEITKLKKLLSSEFDMKDLGSAKKILGMEISRDRKSGLLFLSQ 1068
Query: 523 EKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLMYAM 582
Y++K+L RFN+ +A STP+ H K S Q P D E YMS VPY+SAVGSLMYAM
Sbjct: 1069 HNYIKKVLQRFNMQNAKAVSTPIAPHFKLSAAQCPSIDAEIEYMSRVPYSSAVGSLMYAM 1128
Query: 583 VCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDAD 642
VC+RPD+++A+ +VSR+MSNPGKEHW V+WI RYL+G++ CL F R + L + D+D
Sbjct: 1129 VCSRPDLSYAMSLVSRYMSNPGKEHWRAVQWIFRYLRGTTYSCLKFGRTDKGLIGYVDSD 1188
Query: 643 LGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLK 702
D D +S TGY+FT+G AVSW++ LQ+ VALSTTE+EY+AI EA KE+IWLK
Sbjct: 1189 YAADLDRRRSLTGYVFTIGSCAVSWRATLQSVVALSTTEAEYMAICEACKELIWLKGLYA 1248
Query: 703 ELGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEE 751
EL + L DS+S I+L K+ +FH R KHI +KYHF+R++I +++
Sbjct: 1249 ELSGVESCISLHCDSESAIYLTKDQMFHERTKHIDIKYHFVRDVIEEDD 1297
>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1243
Score = 687 bits (1773), Expect = 0.0
Identities = 363/738 (49%), Positives = 479/738 (64%), Gaps = 22/738 (2%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
WKT VE QT K+K L++DNG E+ S+ FK +C GI T TP+QN VAERMNRT
Sbjct: 515 WKTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVCHYTAPHTPQQNDVAERMNRT 574
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ +ARCM +GLPK FW +A++TA YLINR P +D + P EVWSG + S L+VF
Sbjct: 575 IISKARCMLSNAGLPKQFWAEAVSTACYLINRSPGYAIDKKTPIEVWSGSPTNYSDLRVF 634
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
GC +Y +D+ KL+P+AIKC F+GY + GY+ W + KK++ SRNV F+ESV+
Sbjct: 635 GCTAYAHVDNG---KLEPRAIKCIFLGYASGVKGYKLWCPETKKVVISRNVVFHESVILH 691
Query: 182 DRSSAESMSSSKQ---LKLSERVALEEISESDVVKRNQINPENEDDEVEVELEQEPTTVI 238
D+ S S++ +++ ++ E + V NQ P ED + + + Q P I
Sbjct: 692 DKPSTNVPVESQEKASVQVEHLISSGHAPEKEDVAINQDAPVIEDSDSSI-VHQSPKRSI 750
Query: 239 ETPITQVRRSTRTPKAPQRY---SPSLNYLL-----LTDAGEPEYFGEAMQGNDSIKWEL 290
+ R K P+RY + + Y L + EP + EA+ +D +W
Sbjct: 751 AKD-----KPKRNIKPPRRYIEEAKIVAYALSVAEKIEGNAEPSTYSEAIVSDDCNRWIT 805
Query: 291 AMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGS--RRYKARLVVKGFQQKQ 348
AM DEM SL+KN TW L KLP+ KK ++ +W+++ KE S RYKARLV KG+ Q
Sbjct: 806 AMHDEMESLEKNHTWELVKLPKEKKPIRCKWIFKRKEGMSPSDEARYKARLVAKGYSQIP 865
Query: 349 GIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGFEVL 408
GIDF ++FSPVVK ++IR +L IVA + LEQ++VK AFLHG+LEE+IYM QPEGF V
Sbjct: 866 GIDFNDVFSPVVKHSSIRTLLGIVAMHDYELEQMNVKTAFLHGELEEDIYMEQPEGFVVP 925
Query: 409 GTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFADSYIILALY 468
G +NLVC+L KSLYGLKQ+PRQWYK+F+ FM + F + D C ++K S I L LY
Sbjct: 926 GKENLVCRLKKSLYGLKQSPRQWYKRFDSFMLSQKFRISNYDSCVYLKVVDGSVIYLLLY 985
Query: 469 VDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEKYVEK 528
VDDMLIA + +EI +LK Q+S FEMKDLG AK+ILGM I+R R G L LSQ+ Y+EK
Sbjct: 986 VDDMLIAAKDKSEIEKLKAQLSSEFEMKDLGAAKKILGMEITRERHSGKLYLSQKGYIEK 1045
Query: 529 LLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLMYAMVCTRPD 588
+L RFN+ DA STPL H + S P +D + YMS VPY+SAVGSLMYAMVC RPD
Sbjct: 1046 VLRRFNMHDAKPVSTPLAAHFRLSSDLCPLSDYDIEYMSRVPYSSAVGSLMYAMVCCRPD 1105
Query: 589 IAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADLGGDSD 648
++HA+ VV+R+M+NPGKEHW+ V+WI RYL+G+S CL F R+ L + D+D GD D
Sbjct: 1106 LSHALSVVNRYMANPGKEHWKAVQWIFRYLRGTSSACLQFERSRDGLVGYVDSDFAGDLD 1165
Query: 649 GGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKELGKEQ 708
+S TGY+FT+GG AVSWK+ LQ VALSTTE+EY+AI EA KE IWL+ EL
Sbjct: 1166 RRRSITGYVFTIGGCAVSWKASLQATVALSTTEAEYMAIFEACKEAIWLRGLYTELCGVT 1225
Query: 709 DVPPLFSDSQSVIFLAKN 726
+F DSQS I+L K+
Sbjct: 1226 SCINIFCDSQSAIYLTKD 1243
>gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
gi|25301707|pir||E86490 hypothetical protein F28L22.3 -
Arabidopsis thaliana
Length = 1356
Score = 685 bits (1767), Expect = 0.0
Identities = 361/805 (44%), Positives = 525/805 (64%), Gaps = 37/805 (4%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
WK+ VENQ K+K L++DNG E+ + F +C E+GI +T TP+QNGVAERMNRT
Sbjct: 548 WKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERHRTCTYTPQQNGVAERMNRT 607
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ E+ RC+ +SG+ ++FW +A TAAYLINR P+ +++ +PEE+W ++ HL+ F
Sbjct: 608 IMEKVRCLLNKSGVEEVFWAEAAATAAYLINRSPASAINHNVPEEMWLNRKPGYKHLRKF 667
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
G ++YV D + KL P+A+K FF+GY GY+ W + +K + SRNV F ESV+Y+
Sbjct: 668 GSIAYVHQD---QGKLKPRALKGFFLGYPAGTKGYKVWLLEEEKCVISRNVVFQESVVYR 724
Query: 182 DRSSAESMSSSKQLKLSERVALE-----EISESDVVKRNQINPE-------NEDDEVEVE 229
D E + + K + +E E S S V + Q + E + D E EVE
Sbjct: 725 DLKVKEDDTDNLNQKETTSSEVEQNKFAEASGSGGVIQLQSDSEPITEGEQSSDSEEEVE 784
Query: 230 L-EQEPTTVIETPITQVR----RSTRTPKAPQRYSP--SLNYLLLTDAG----EPEYFGE 278
E+ T T +T + R R P R++ S+ + L+ EP+ + E
Sbjct: 785 YSEKTQETPKRTGLTTYKLARDRVRRNINPPTRFTEESSVTFALVVVENCIVQEPQSYQE 844
Query: 279 AMQGNDSIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR--RYK 336
AM+ D KW++A DEM SL KNGTW L P+ +K + RW+++LK G R+K
Sbjct: 845 AMESQDCEKWDMATHDEMDSLMKNGTWDLVDKPKDRKIIGCRWLFKLKSGIPGVEPTRFK 904
Query: 337 ARLVVKGFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEE 396
ARLV KG+ Q++G+D+ EIF+PVVK +IR+++S+V ++L LEQ+DVK FLHGDLEEE
Sbjct: 905 ARLVAKGYTQREGVDYQEIFAPVVKHVSIRILMSLVVDKDLELEQMDVKTTFLHGDLEEE 964
Query: 397 IYMTQPEGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVK 456
+YM QPEGF ++N VC+L KSLYGLKQ+PRQW K+F+ FMS+ F R + D C +VK
Sbjct: 965 LYMEQPEGFVSDSSENKVCRLKKSLYGLKQSPRQWNKRFDRFMSSQQFIRSEHDACVYVK 1024
Query: 457 KFAD-SYIILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSE 515
++ +I L LYVDDMLIAG++ EINR+K+Q+S FEMKD+G A +ILG+ I R+R
Sbjct: 1025 HVSEHDFIYLLLYVDDMLIAGASKAEINRVKEQLSTEFEMKDMGGASRILGIDIYRDRKG 1084
Query: 516 GVLKLSQEKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMST--VPYAS 573
GVLKLSQE Y+ K+LDRFN+ A + P+G H K + + EE+ + T VPY+S
Sbjct: 1085 GVLKLSQEIYIRKVLDRFNMSGAKMTNAPVGAHFKLAAVR-----EEDECVDTDVVPYSS 1139
Query: 574 AVGSLMYAMVCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCF-RRNN 632
AVGS+MYAM+ TRPD+A+A+ ++SR+MS PG HWE VKW++RYLKG+ + L F + +
Sbjct: 1140 AVGSIMYAMLGTRPDLAYAICLISRYMSKPGSMHWEAVKWVMRYLKGAQDLNLVFTKEKD 1199
Query: 633 LTLQEFSDADLGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAK 692
T+ + D++ D D +S +GY+FT+GG VSWK+ LQ VA+STTE+EY+A++EAAK
Sbjct: 1200 FTVTGYCDSNYAADLDRRRSISGYVFTIGGNTVSWKASLQPVVAMSTTEAEYIALAEAAK 1259
Query: 693 EMIWLKSFLKELGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEEL 752
E +W+K L+++G +QD ++ DSQS I L+KN V+H R KHI +++++IR+++ ++
Sbjct: 1260 EAMWIKGLLQDMGMQQDKVKIWCDSQSAICLSKNSVYHERTKHIDVRFNYIRDVVESGDV 1319
Query: 753 SLLKILGSENPTDMLTKTVTADNLR 777
+LKI S NP D LTK + + +
Sbjct: 1320 DVLKIHTSRNPVDALTKCIPVNKFK 1344
>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301697|pir||B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1335
Score = 684 bits (1765), Expect = 0.0
Identities = 370/797 (46%), Positives = 507/797 (63%), Gaps = 30/797 (3%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
WK VENQ+ K+K L++DNG EY + F+KFC E GI KT TP+QNG+AER+NRT
Sbjct: 525 WKKMVENQSDRKVKKLRTDNGLEYCNHYFEKFCKEEGIVRHKTCAYTPQQNGIAERLNRT 584
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ ++ R M +SG+ K FW +A +TA YLINR PS +++ LPEE W+G LS L+ F
Sbjct: 585 IMDKVRSMLSRSGMEKKFWAEAASTAVYLINRSPSTAINFDLPEEKWTGALPDLSSLRKF 644
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
GC++Y+ D + KL+P++ K F Y + GY+ W ++KK + SRNV F E V++K
Sbjct: 645 GCLAYIHAD---QGKLNPRSKKGIFTSYPEGVKGYKVWVLEDKKCVISRNVIFREQVMFK 701
Query: 182 D----RSSAESMSSSKQLKLSERVALEEISESDVVKRNQINPE-----------NEDDEV 226
D + S S + L+++ + +E ++ ++ NP +
Sbjct: 702 DLKGDSQNTISESDLEDLRVNPDMNDQEFTDQGGATQDNSNPSEATTSHNPVLNSPTHSQ 761
Query: 227 EVELEQEPTTVIETPITQ--VR-RSTRTPKAPQRYSPS----LNYLLLTDAG-EPEYFGE 278
+ E E+E + +E T VR R RT KA +Y+ S Y D EP+ + E
Sbjct: 762 DEESEEEDSDAVEDLSTYQLVRDRVRRTIKANPKYNESNMVGFAYYSEDDGKPEPKSYQE 821
Query: 279 AMQGNDSIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDG--SRRYK 336
A+ D KW AMK+EM S+ KN TW L PE K + RWV+ K G + R+
Sbjct: 822 ALLDPDWEKWNAAMKEEMVSMSKNHTWDLVTKPEKVKLIGCRWVFTRKAGIPGVEAPRFI 881
Query: 337 ARLVVKGFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEE 396
ARLV KGF QK+G+D+ EIFSPVVK +IR +LS+V N+ L+Q+DVK AFLHG LEEE
Sbjct: 882 ARLVAKGFTQKEGVDYNEIFSPVVKHVSIRYLLSMVVHYNMELQQMDVKTAFLHGFLEEE 941
Query: 397 IYMTQPEGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVK 456
IYM QPEGFE+ N VC L +SLYGLKQ+PRQW +F+EFM + R D C + K
Sbjct: 942 IYMAQPEGFEIKRGSNKVCLLKRSLYGLKQSPRQWNLRFDEFMRGIKYTRSAYDSCVYFK 1001
Query: 457 KF-ADSYIILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSE 515
K D+YI L LYVDDMLIA +N +E+N LKQ +S FEMKDLG AK+ILGM ISR+R
Sbjct: 1002 KCNGDTYIYLLLYVDDMLIASANKSEVNELKQLLSREFEMKDLGDAKKILGMEISRDRDA 1061
Query: 516 GVLKLSQEKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAV 575
G+L LSQE YV+K+L F + +A STPLG H K + +E+ M VPYA+ +
Sbjct: 1062 GLLTLSQEGYVKKVLRSFQMDNAKPVSTPLGIHFKLKAATDKEYEEQFERMKIVPYANTI 1121
Query: 576 GSLMYAMVCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRR-NNLT 634
GS+MY+M+ TRPD+A+++GV+SRFMS P K+HW+ VKW+LRY++G+ + LCFR+ +
Sbjct: 1122 GSIMYSMIGTRPDLAYSLGVISRFMSKPLKDHWQAVKWVLRYMRGTEKKKLCFRKQEDFL 1181
Query: 635 LQEFSDADLGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEM 694
L+ + D+D G + D +S TGY+FT+GG +SWKSKLQ VA+S+TE+EY+A++EA KE
Sbjct: 1182 LRGYCDSDYGSNFDTRRSITGYVFTVGGNTISWKSKLQKVVAISSTEAEYMALTEAVKEA 1241
Query: 695 IWLKSFLKELGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSL 754
+WLK F ELG QD + SDSQS I LAKN V H R KHI ++ HFIR++I + +
Sbjct: 1242 LWLKGFAAELGHSQDYVEVHSDSQSAITLAKNSVHHERTKHIDIRLHFIRDIICAGLIKV 1301
Query: 755 LKILGSENPTDMLTKTV 771
+KI NP ++ TKTV
Sbjct: 1302 VKIATECNPANIFTKTV 1318
>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301702|pir||E84601 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 679 bits (1752), Expect = 0.0
Identities = 361/803 (44%), Positives = 520/803 (63%), Gaps = 28/803 (3%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
W VENQT +IK+L++DNG E+ ++ F +FCS+ GI +T TP+QNGVAERMNRT
Sbjct: 527 WANLVENQTDKRIKTLRTDNGLEFCNRSFDEFCSQKGILWHRTCAYTPQQNGVAERMNRT 586
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
L E+ R M SGLPK FW +A +T A LIN+ PS L+Y++P++ WSGK S+L+ F
Sbjct: 587 LMEKVRSMLSDSGLPKKFWAEATHTTAILINKTPSSALNYEVPDKRWSGKSPIYSYLRRF 646
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
GC+++V D KL+P+A K +GY + GY+ W + KK + SRNV F E+ YK
Sbjct: 647 GCIAFVHTDDG---KLNPRAKKGILVGYPIGVKGYKIWLLEEKKCVVSRNVIFQENASYK 703
Query: 182 DRSSAESM---------SSSKQLKLSERVALEEISESDVVK-RNQINPENEDDEVEVELE 231
D ++ SS L L + + +V+ ++ NP + E
Sbjct: 704 DMMQSKDAEKDENEAPPSSYLDLDLDHEEVITSGGDDPIVEAQSPFNPSPATTQTYSEGV 763
Query: 232 QEPTTVIETPITQ--VR-RSTRTPKAPQRYSPSLNYLL-----LTDAGE--PEYFGEAMQ 281
T +I++P++ VR R RT +AP R+ +YL D+GE P + EA +
Sbjct: 764 NSETDIIQSPLSYQLVRDRDRRTIRAPVRFDDE-DYLAEALYTTEDSGEIEPADYSEAKR 822
Query: 282 GNDSIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR--RYKARL 339
+ KW+LAM +EM S KN TW++ K P+ +K + +RW+Y+ K G R+KARL
Sbjct: 823 SMNWNKWKLAMNEEMESQIKNHTWTVVKRPQHQKVIGSRWIYKFKLGIPGVEEGRFKARL 882
Query: 340 VVKGFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYM 399
V KG+ Q++GID+ EIF+PVVK +IR+++SIVA E+L LEQLDVK AFLHG+L+E+IYM
Sbjct: 883 VAKGYAQRKGIDYHEIFAPVVKHVSIRILMSIVAQEDLELEQLDVKTAFLHGELKEKIYM 942
Query: 400 TQPEGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFA 459
PEG+E + ++ VC L+KSLYGLKQAP+QW +KFN +MS GF R D C ++K+ +
Sbjct: 943 VPPEGYEEMFKEDEVCLLNKSLYGLKQAPKQWNEKFNAYMSEIGFIRSLYDSCAYIKELS 1002
Query: 460 D-SYIILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVL 518
D S + L LYVDDML+A N +I++LK+++S+ F+MKDLG AK+ILGM I RNR E L
Sbjct: 1003 DGSRVYLLLYVDDMLVAAKNKEDISQLKEELSQRFDMKDLGAAKRILGMEIIRNREENTL 1062
Query: 519 KLSQEKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSL 578
LSQ Y+ K+L+ +N+ ++ TPLG HLK + +++E YM ++PY+SAVGS+
Sbjct: 1063 WLSQNGYLNKILETYNMAESKHVVTPLGAHLKMRAATVEKQEQDEDYMKSIPYSSAVGSI 1122
Query: 579 MYAMVCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRR-NNLTLQE 637
MYAM+ TRPD+A+ VG++SR+MS P +EHW VKW+LRY+KGS L ++R ++ +
Sbjct: 1123 MYAMIGTRPDLAYPVGIISRYMSQPAREHWLGVKWVLRYIKGSLGTKLQYKRSSDFKVVG 1182
Query: 638 FSDADLGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWL 697
+ DAD D +S TG +FTLGG+ +SWKS Q VALSTTE+EY++++EA KE +W+
Sbjct: 1183 YCDADHAACKDRRRSITGLVFTLGGSTISWKSGQQRVVALSTTEAEYMSLTEAVKEAVWM 1242
Query: 698 KSFLKELGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKI 757
K LKE G EQ +F DSQS I L+KN V H R KHI ++Y +IR++I++ + ++KI
Sbjct: 1243 KGLLKEFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVRYQYIRDIIANGDGDVVKI 1302
Query: 758 LGSENPTDMLTKTVTADNLRLCI 780
+NP D+ TK V + + +
Sbjct: 1303 DTEKNPADIFTKIVPVNKFQAAL 1325
>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 679 bits (1751), Expect = 0.0
Identities = 355/799 (44%), Positives = 505/799 (62%), Gaps = 26/799 (3%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
WKT++ENQ K+K L +DNG E+ +QEF FC + G+ +T TP+QNGVAERMNRT
Sbjct: 536 WKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKEGVIRHRTCAYTPQQNGVAERMNRT 595
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ + RCM +SGL K FW +A +TA +LIN+ PS +++ +PEE W+G LK F
Sbjct: 596 IMNKVRCMLSESGLGKQFWAEAASTAVFLINKSPSSSIEFDIPEEKWTGHPPDYKILKKF 655
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
G V+Y+ D + KL+P+A K F+GY + ++ W +++K + SR++ F E+ +YK
Sbjct: 656 GSVAYIHSD---QGKLNPRAKKGIFLGYPDGVKRFKVWLLEDRKCVVSRDIVFQENQMYK 712
Query: 182 DRSSAESMSSSKQLKLSER--VALEEISESDVVKRNQINPENEDD---------EVEVEL 230
+ + KQL ER + L+ +S D + + N++ + +VE
Sbjct: 713 ELQKNDMSEEDKQLTEVERTLIELKNLSADDENQSEGGDNSNQEQASTTRSASKDKQVEE 772
Query: 231 EQEPTTVIETPITQVRRSTRTPKAPQRYSPSLNYLL-----LTDAGE---PEYFGEAMQG 282
+E + R R +APQR+ + L+ +T+ GE PE + EAM+
Sbjct: 773 TDSDDDCLENYLLARDRIRRQIRAPQRFVEEDDSLVGFALTMTEDGEVYEPETYEEAMRS 832
Query: 283 NDSIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR--RYKARLV 340
+ KW+ A +EM S++KN TW + PEGK+ + +W+++ K G RYKARLV
Sbjct: 833 PECEKWKQATIEEMDSMKKNDTWDVIDKPEGKRVIGCKWIFKRKAGIPGVEPPRYKARLV 892
Query: 341 VKGFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMT 400
KGF Q++GID+ EIFSPVVK +IR +LSIV ++ LEQLDVK AFLHG+L+E I M+
Sbjct: 893 AKGFSQREGIDYQEIFSPVVKHVSIRYLLSIVVQFDMELEQLDVKTAFLHGNLDEYILMS 952
Query: 401 QPEGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFAD 460
QPEG+E + VC L KSLYGLKQ+PRQW ++F+ FM NSG+ R + C + ++ D
Sbjct: 953 QPEGYEDEDSTEKVCLLKKSLYGLKQSPRQWNQRFDSFMINSGYQRSKYNPCVYTQQLND 1012
Query: 461 -SYIILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLK 519
SYI L LYVDDMLIA N +I +LK+ ++ FEMKDLGPA++ILGM I+RNR +G+L
Sbjct: 1013 GSYIYLLLYVDDMLIASQNKDQIQKLKESLNREFEMKDLGPARKILGMEITRNREQGILD 1072
Query: 520 LSQEKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLM 579
LSQ +YV +L F + + TPLG H K + YM VPY +A+GS+M
Sbjct: 1073 LSQSEYVAGVLRAFGMDQSKVSQTPLGAHFKLRAANEKTLARDAEYMKLVPYPNAIGSIM 1132
Query: 580 YAMVCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNN-LTLQEF 638
Y+M+ +RPD+A+ VGVVSRFMS P KEHW+ VKW++RY+KG+ CL F++++ ++ +
Sbjct: 1133 YSMIGSRPDLAYPVGVVSRFMSKPSKEHWQAVKWVMRYMKGTQDTCLRFKKDDKFEIRGY 1192
Query: 639 SDADLGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLK 698
D+D D D +S TG++FT GG +SWKS LQ VALSTTE+EY+A++EA KE IWL+
Sbjct: 1193 CDSDYATDLDRRRSITGFVFTAGGNTISWKSGLQRVVALSTTEAEYMALAEAVKEAIWLR 1252
Query: 699 SFLKELGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKIL 758
E+G EQD + DSQS I L+KN V H R KHI ++YHFIRE I+D E+ ++KI
Sbjct: 1253 GLAAEMGFEQDAVEVMCDSQSAIALSKNSVHHERTKHIDVRYHFIREKIADGEIQVVKIS 1312
Query: 759 GSENPTDMLTKTVTADNLR 777
+ NP D+ TKTV L+
Sbjct: 1313 TTWNPADIFTKTVPVSKLQ 1331
>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi|11278366|pir||T47492
copia-like polyprotein - Arabidopsis thaliana
Length = 1363
Score = 676 bits (1744), Expect = 0.0
Identities = 367/803 (45%), Positives = 508/803 (62%), Gaps = 27/803 (3%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
W + VENQ+ ++K+L++DNG E+ ++ F FC E G + +T TP+QNGV ERMNRT
Sbjct: 556 WISLVENQSGERVKTLRTDNGLEFCNRMFDGFCEEKGFQRHRTCAYTPQQNGVVERMNRT 615
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ E+ R M SGLPK FW +A +TA LIN+ P ++++ P++ WSGK S+L+ +
Sbjct: 616 IMEKVRSMLCDSGLPKRFWAEATHTAVLLINKTPCSAINFEFPDKRWSGKAPIYSYLRRY 675
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
GCV++V D KL+ +A K IGY + GY+ W + KK + SRNV+F E+ +YK
Sbjct: 676 GCVTFVHTDGG---KLNLRAKKGVLIGYPSGVKGYKVWLIEEKKCVVSRNVSFQENAVYK 732
Query: 182 D-RSSAESMSSSKQLKLSERVALEEISESD-----VVKRNQINPENEDDEVEVELEQEPT 235
D E +S + + L+ ++ D + Q+ P E
Sbjct: 733 DLMQRKEQVSCEEDDHAGSYIDLDLEADKDNSSGGEQSQAQVTPATRGAVTSTPPRYETD 792
Query: 236 TVIETPITQ-------VR-RSTRTPKAPQRYSPSLNYL--LLT----DAGEPEYFGEAMQ 281
+ ET + Q VR R R +AP+R+ Y L T DA EP + EA++
Sbjct: 793 DIEETDVHQSPLSYHLVRDRERREIRAPRRFDDEDYYAEALYTTEDGDAVEPADYKEAVR 852
Query: 282 GNDSIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR--RYKARL 339
+ KW LAM +E+ S KN TW+ PE ++ + +RW+Y+ K+ G R+KARL
Sbjct: 853 DENWDKWRLAMNEEIESQLKNDTWTTVTRPEKQRIIGSRWIYKYKQGIPGVEEPRFKARL 912
Query: 340 VVKGFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYM 399
V KG+ Q++G+D+ EIF+PVVK +IR++LSIVA ENL LEQLDVK AFLHG+L+E+IYM
Sbjct: 913 VAKGYAQREGVDYHEIFAPVVKHVSIRILLSIVAQENLELEQLDVKTAFLHGELKEKIYM 972
Query: 400 TQPEGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFA 459
PEG E L +N VC L+KSLYGLKQAPRQW +KFN +M+ GF R D D C + KK +
Sbjct: 973 MPPEGCESLFKENEVCLLNKSLYGLKQAPRQWNEKFNHYMTEIGFKRSDYDSCAYTKKLS 1032
Query: 460 D-SYIILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVL 518
D S + L YVDDML+A +NM I+ LK+++S FEMKDLG AK+ILG+ I +R GVL
Sbjct: 1033 DDSTMYLLFYVDDMLVAANNMQAIDALKKELSIKFEMKDLGAAKKILGIEIIIDREAGVL 1092
Query: 519 KLSQEKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSL 578
LSQE Y+ K+L FN+ ++ TPLG HLK + EE YM++VPY+SAVGS+
Sbjct: 1093 WLSQESYLNKVLKTFNMLESKPALTPLGAHLKMKSATEEKLSTEEEYMNSVPYSSAVGSI 1152
Query: 579 MYAMVCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRN-NLTLQE 637
MYAM+ TRPD+A+ VGVVSRFMS P KEHW VKW+LRY+KG+ LC++RN + ++
Sbjct: 1153 MYAMIGTRPDLAYPVGVVSRFMSQPAKEHWLGVKWVLRYIKGTVDTRLCYKRNSDFSICG 1212
Query: 638 FSDADLGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWL 697
+ DAD D D +S TG +FTLGG +SWKS LQ VA S+TE EY++++EA KE IWL
Sbjct: 1213 YCDADYAADLDKRRSITGLVFTLGGNTISWKSGLQRVVAQSSTECEYMSLTEAVKEAIWL 1272
Query: 698 KSFLKELGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKI 757
K LK+ G EQ +F DSQS I L+KN V H R KHI +K+HFIRE+I+D ++ + KI
Sbjct: 1273 KGLLKDFGYEQKNVEIFCDSQSAIALSKNNVHHERTKHIDVKFHFIREIIADGKVEVSKI 1332
Query: 758 LGSENPTDMLTKTVTADNLRLCI 780
+NP D+ TK + + + +
Sbjct: 1333 STEKNPADIFTKVLPVNKFQTAL 1355
>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301696|pir||F84486 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1356
Score = 666 bits (1719), Expect = 0.0
Identities = 361/803 (44%), Positives = 506/803 (62%), Gaps = 27/803 (3%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
W VENQT ++K+L++DNG E+ ++ F FC GI +T TP+QNGVAERMNRT
Sbjct: 549 WVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCESIGIHRHRTCAYTPQQNGVAERMNRT 608
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ E+ R M SGLPK FW +A +T LIN+ PS L++++P++ WSG S+L+ +
Sbjct: 609 IMEKVRSMLSDSGLPKRFWAEATHTTVLLINKTPSSALNFEIPDKKWSGNPPVYSYLRRY 668
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
GCV++V D KL+P+A K IGY + GY+ W +K + SRN+ F E+ +YK
Sbjct: 669 GCVAFVHTDDG---KLEPRAKKGVLIGYPVGVKGYKVWILDERKCVVSRNIIFQENAVYK 725
Query: 182 D-RSSAESMSSSKQLKLSERVALEEISESDVVKRNQINPENEDDEVEVELEQEPTT---- 236
D E++S+ + + + + +E DV+ N E + PTT
Sbjct: 726 DLMQRQENVSTEEDDQTGSYLEFDLEAERDVISGGDQEMVNTIPAPESPVVSTPTTQDTN 785
Query: 237 ------VIETPITQ--VR-RSTRTPKAPQRYSPSLNY---LLLTDAGE---PEYFGEAMQ 281
V ++P++ VR R R +AP+R+ Y L T+ GE PE + +A
Sbjct: 786 DDEDSDVNQSPLSYHLVRDRDKREIRAPRRFDDEDYYAEALYTTEDGEAVEPENYRKAKL 845
Query: 282 GNDSIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR--RYKARL 339
+ KW+LAM +E+ S +KN TW++ PE ++ + RW+++ K G R+KARL
Sbjct: 846 DANFDKWKLAMDEEIDSQEKNNTWTIVTRPENQRIIGCRWIFKYKLGILGVEEPRFKARL 905
Query: 340 VVKGFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYM 399
V KG+ QK+GID+ EIF+PVVK +IRV+LSIVA E+L LEQLDVK AFLHG+L+E+IYM
Sbjct: 906 VAKGYAQKEGIDYHEIFAPVVKHVSIRVLLSIVAQEDLELEQLDVKTAFLHGELKEKIYM 965
Query: 400 TQPEGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFA 459
+ PEG+E + N VC L+K+LYGLKQAP+QW +KF+ FM F + D C + K
Sbjct: 966 SPPEGYESMFKANEVCLLNKALYGLKQAPKQWNEKFDNFMKEICFVKSAYDSCAYTKVLP 1025
Query: 460 D-SYIILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVL 518
D S + L +YVDD+L+A N I LK + FEMKDLG AK+ILGM I R+R+ GVL
Sbjct: 1026 DGSVMYLLIYVDDILVASKNKEAITALKANLGMRFEMKDLGAAKKILGMEIIRDRTLGVL 1085
Query: 519 KLSQEKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSL 578
LSQE Y+ K+L+ +N+ +A TPLG H KF + +E +M +VPY+SAVGS+
Sbjct: 1086 WLSQEGYLNKILETYNMAEAKPAMTPLGAHFKFQAATEQKLIRDEDFMKSVPYSSAVGSI 1145
Query: 579 MYAMVCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRR-NNLTLQE 637
MYAM+ TRPD+A+ VG++SRFMS P KEHW VKW+LRY+KG+ + LC+++ ++ ++
Sbjct: 1146 MYAMLGTRPDLAYPVGIISRFMSQPIKEHWLGVKWVLRYIKGTLKTRLCYKKSSSFSIVG 1205
Query: 638 FSDADLGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWL 697
+ DAD D D +S TG +FTLGG +SWKS LQ VA STTESEY++++EA KE IWL
Sbjct: 1206 YCDADYAADLDKRRSITGLVFTLGGNTISWKSGLQRVVAQSTTESEYMSLTEAVKEAIWL 1265
Query: 698 KSFLKELGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKI 757
K LK+ G EQ +F DSQS I L+KN V H R KHI +KYHFIRE+ISD + +LKI
Sbjct: 1266 KGLLKDFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVKYHFIREIISDGTVEVLKI 1325
Query: 758 LGSENPTDMLTKTVTADNLRLCI 780
+NP D+ TK + + +
Sbjct: 1326 STEKNPADIFTKVLAVSKFQAAL 1348
>gb|AAW22873.1| putative polyprotein [Lycopersicon esculentum]
Length = 687
Score = 641 bits (1654), Expect = 0.0
Identities = 339/686 (49%), Positives = 461/686 (66%), Gaps = 32/686 (4%)
Query: 129 IDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNE-----SVLYKDR 183
+ + R KLD K C F+GYG + +GYR +D +K++RSR+V F E +L D+
Sbjct: 3 VPKEHRLKLDFKTSPCVFVGYGDEEFGYRLYDPAKQKVVRSRDVVFYEHEMSFHLLGADK 62
Query: 184 SSAESMS-------------SSKQLKLSERVALEEISES----DVVKRNQINPENEDDEV 226
+ + S S QL EI+ + V+ + + P+ +D+ V
Sbjct: 63 TYYSNFSHDVIDMPMPHVSASDDQLTGDAPEDGHEIAHEHDHIEEVQPDVVVPQPDDEAV 122
Query: 227 EVE------LEQEPTTVIETPITQVRRSTRTPKAPQRYSPSLNYLLLTDAGEPEYFGEAM 280
+V+ ++ + +E P +R+STR + P R PS Y+L+TD GEPE E +
Sbjct: 123 DVQHGESSNQGEKSSPHVEEP--TLRKSTRV-RQPSRLYPSSEYILITDEGEPESLQEVL 179
Query: 281 QGNDSIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSRRYKARLV 340
+D W AM+++M SL+KN T+ L K P+GKK L+NRW+++ K++ + + KARLV
Sbjct: 180 SHSDKDHWLKAMQEDMDSLKKNETYDLVKPPKGKKVLKNRWLFKNKKDGNKLVKRKARLV 239
Query: 341 VKGFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMT 400
VKG QK+GIDF EIF+PVVKMT+IR+IL + NL LEQLDVK AFLHGDL EEIYM
Sbjct: 240 VKGCHQKKGIDFDEIFAPVVKMTSIRMILGLATCLNLELEQLDVKTAFLHGDLHEEIYME 299
Query: 401 QPEGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKFAD 460
QPEGFEV G +N VCKL KSLYGLKQAPRQWY KF+ FMSN+ + R D C + +KF++
Sbjct: 300 QPEGFEVKGKENFVCKLKKSLYGLKQAPRQWYHKFDSFMSNNEYKRTTADPCVYFRKFSE 359
Query: 461 -SYIILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLK 519
++IIL LYVDDMLI G ++ I RLK+ +S++F+MKDLGPAKQILGM I+R+R G L
Sbjct: 360 GNFIILCLYVDDMLIVGQDVEMICRLKEDLSKSFDMKDLGPAKQILGMEIARDRKAGKLW 419
Query: 520 LSQEKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYASAVGSLM 579
LSQE Y+E++L+RFN+ +A +TPL H K SK+ P T++E+ MS +PY+S VGSLM
Sbjct: 420 LSQENYIERVLERFNMKNAKPVNTPLAAHFKLSKRCCPTTEKEKESMSHIPYSSVVGSLM 479
Query: 580 YAMVCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFS 639
YAMVCTRPDIAHAVG+VSR+++NP K HWE VKWILRYL+G+S + LCF L+ F+
Sbjct: 480 YAMVCTRPDIAHAVGLVSRYLANPSKVHWEAVKWILRYLRGTSNLSLCFGGGEPILEGFT 539
Query: 640 DADLGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKS 699
DAD+ GD D KST+GY+F A+SW+SKLQ VALSTTE EY+A EA+KEM+WLK
Sbjct: 540 DADMAGDLDNRKSTSGYLFKFARGAISWQSKLQKCVALSTTEVEYIAAVEASKEMLWLKR 599
Query: 700 FLKELGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILG 759
FL+ELG +Q +F DS+S + L+KN ++H+R KHI ++YH++R +I + + L KI
Sbjct: 600 FLQELGLKQSEYVVFCDSKSAMDLSKNIMYHARTKHIDVRYHWLRVVIKERLMKLKKIHT 659
Query: 760 SENPTDMLTKTVTADNLRLCIASAGL 785
++N DMLTK V L C AG+
Sbjct: 660 NKNGADMLTKVVPGSKLEFCSKLAGM 685
>emb|CAB79135.1| putative transposable element [Arabidopsis thaliana]
gi|3402755|emb|CAA20201.1| putative transposable element
[Arabidopsis thaliana] gi|7444415|pir||T05178
hypothetical protein T6K22.90 - Arabidopsis thaliana
Length = 1308
Score = 632 bits (1631), Expect = e-179
Identities = 337/788 (42%), Positives = 493/788 (61%), Gaps = 52/788 (6%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
WK VENQ K+K L++DNG E+ + +F +FC +NGI +T TP+QNGVA+RMNRT
Sbjct: 539 WKELVENQVNKKVKILRTDNGLEFCNLKFDEFCKQNGIERHRTCTYTPQQNGVAKRMNRT 598
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
L E+ RC+ +SGL ++FW +A TAAYL+NR P+ +D+ +PEE+W K+ HL+ F
Sbjct: 599 LMEKVRCLLNESGLEEVFWAEAAATAAYLVNRSPASAVDHNVPEELWLDKKPGYKHLRRF 658
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
GC++YV +D + KL P+A+K F+GY GY+ W +K + SRN+ FNE+ +YK
Sbjct: 659 GCIAYVHLD---QGKLKPRALKGVFLGYPQGTKGYKVWLLDEEKCVISRNIVFNENQVYK 715
Query: 182 D--RSSAESMSSSKQLKLSERVALEEISESDVVKRNQINPENEDDEVEVE--LEQEPTTV 237
D SS +S+ L+ + + K + E D E + E + QEP
Sbjct: 716 DIRESSEQSVKDISDLEGYNEFQVSVKEHGECSKTGGVTIEEIDQESDSENSVTQEPLIA 775
Query: 238 ---IETPITQVRRSTRTPKAPQRYSPSLNYLLLT------DAGEPEYFGEAMQGNDSIKW 288
+ + R R P PQ+ + ++ L ++ EP+ + +A + IKW
Sbjct: 776 SIDLSNYQSARDRERRAPNPPQKLADYTHFALALVMAEEIESEEPQCYHDAKKDKHWIKW 835
Query: 289 ELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDG--SRRYKARLVVKGFQQ 346
MK+E+ SL KNGTW + + P+ +K + RW+++LK G ++RYKARLV +GF Q
Sbjct: 836 NGGMKEEIDSLLKNGTWDIVEWPKEQKVISCRWLFKLKPGIPGVEAQRYKARLVARGFTQ 895
Query: 347 KQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLEEEIYMTQPEGFE 406
++GID+ E+F+PVVK +IR+++S V +++ LEQ+DVK FLHG+L++ +YM QPEGFE
Sbjct: 896 QKGIDYEEVFAPVVKHISIRILMSAVVKDDMELEQMDVKTTFLHGELDQVLYMEQPEGFE 955
Query: 407 VLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCFVKKF-ADSYIIL 465
V K+ VC L KSLYGLKQAPRQW KKF+ FM + F R + D C +VK+ ++ L
Sbjct: 956 VNPEKDQVCLLKKSLYGLKQAPRQWNKKFHAFMLSLQFARSEHDSCVYVKEVNPGEFVYL 1015
Query: 466 ALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNRSEGVLKLSQEKY 525
LYVDDML+A + +EI++LK+ +S FEMKD+G A +ILG+ I RNR EG L+LSQ +Y
Sbjct: 1016 LLYVDDMLLAAKSKSEISKLKEALSLKFEMKDMGAASRILGIDIIRNRKEGTLRLSQTRY 1075
Query: 526 VEKLLDRFNVGDANTRSTPLGNHLKFSK--KQSPQTDEEESYMSTVPYASAVGSLMYAMV 583
V+K++ RF + DA STP+G H K + + D E VPY+SAVGS+MYAM+
Sbjct: 1076 VDKVIQRFRMADAKVVSTPMGAHFKLTSLIDEIGSVDPE-----VVPYSSAVGSVMYAMI 1130
Query: 584 CTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCFRRNNLTLQEFSDADL 643
T PD+A+A+G+VSRFMS PG NL +Q + D+D
Sbjct: 1131 GTIPDVAYAMGLVSRFMSRPGA--------------------------NLEVQGYCDSDH 1164
Query: 644 GGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAKEMIWLKSFLKE 703
D D +S +GY+FT+GG VSWKS LQ+ VALS+T++E++A++EA KE IW++ L++
Sbjct: 1165 AADLDKRRSISGYVFTVGGNTVSWKSSLQHVVALSSTQAEFIALTEAVKEAIWIRGLLED 1224
Query: 704 LGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQMKYHFIRELISDEELSLLKILGSENP 763
+G + ++ DSQS I L+KN FH R KH+++K++FIR++I E+ + KI S NP
Sbjct: 1225 MGLQPKPATVWCDSQSAICLSKNNAFHDRTKHVEVKFYFIRDIIEAGEVKVRKIHTSVNP 1284
Query: 764 TDMLTKTV 771
DMLTK +
Sbjct: 1285 ADMLTKCI 1292
>emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir||S05465
retrovirus-related polyprotein - Arabidopsis thaliana
retrotransposon Ta1-3
Length = 1291
Score = 628 bits (1619), Expect = e-178
Identities = 337/745 (45%), Positives = 476/745 (63%), Gaps = 40/745 (5%)
Query: 2 WKTEVENQTCLKIKSLKSDNGGEYDSQEFKKFCSENGIRMIKTILGTPEQNGVAERMNRT 61
WK VENQ K+K L++DNG E+ + +F +C E+GI KT TP+QNGVAERMNRT
Sbjct: 558 WKELVENQQNKKVKCLRTDNGLEFCNLKFDAYCKEHGIERHKTCTYTPQQNGVAERMNRT 617
Query: 62 LNERARCMRIQSGLPKMFWVDAINTAAYLINRGPSIPLDYQLPEEVWSGKEVSLSHLKVF 121
+ E+ RCM +SGL + FW +A TAAYLINR P+ +D+ +PEE+W K+ HL+ F
Sbjct: 618 IMEKVRCMLNESGLGEEFWAEAAATAAYLINRSPASAIDHNVPEELWLNKKPGYKHLRRF 677
Query: 122 GCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRFWDEQNKKIIRSRNVTFNESVLYK 181
G ++YV ID + KL P+A+K FIGY GY+ W + K + SRNV F+E +YK
Sbjct: 678 GSIAYVHID---QGKLKPRALKGIFIGYPAGTKGYKIWLLEEHKCVISRNVLFHEESVYK 734
Query: 182 D--------RSSAESMSSSKQLKLSERVALEEISESDVVKRNQINPENEDDEVEVELEQE 233
D S AE S SK + + ++ +V+ Q++ E E DE VE EQE
Sbjct: 735 DTMKKERVVESEAEPASHSKSTLIKVKTP-GNLNSGEVI---QVSDEEESDE-SVEEEQE 789
Query: 234 PTTVIETPITQVR-----------RSTRTPKAPQRYSPS--LNYLLLT----DAGEPEYF 276
P T +E P TQ R R P R++ + + L+T EP+ +
Sbjct: 790 PETQVELPETQTTSSLANYQLARDRERRQIHPPARFTEESGVAFALVTVETLSMEEPQSY 849
Query: 277 GEAMQGNDSIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNRWVYRLKEESDGSR--R 334
EA + KW+LA +EM SL KNGTW L P+ +K + RW+++LK S G R
Sbjct: 850 QEATSDKEWKKWKLATHEEMDSLIKNGTWVLVDKPQNRKIIGCRWLFKLKSGSPGVEPVR 909
Query: 335 YKARLVVKGFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLHLEQLDVKIAFLHGDLE 394
YKA+LV KG+ ++G+D+ EIF+ VVK T+IR+++S+V ++L LEQ+DVK AFLHG+LE
Sbjct: 910 YKAQLVAKGYTHREGVDYQEIFALVVKHTSIRILMSVVVDQDLELEQMDVKTAFLHGELE 969
Query: 395 EEIYMTQPEGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEFMSNSGFNRCDMDHCCF 454
EE+YM QPEG +N VC L KSLYGLKQ+PRQW K+FN FM + F R + D C +
Sbjct: 970 EELYMEQPEGCISEDGENKVCLLKKSLYGLKQSPRQWNKRFNRFMIDQNFIRSEHDACVY 1029
Query: 455 VKKFAD-SYIILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDLGPAKQILGMRISRNR 513
VK+ ++ ++ L LYVDDMLIAG + +EIN++K+Q+S FEMKD+GPA +ILG+ I R+
Sbjct: 1030 VKQVSEQEHLYLLLYVDDMLIAGKSKSEINKVKEQLSMEFEMKDMGPASRILGIDIIRDM 1089
Query: 514 SEGVLKLSQEKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQTDEEESYMSTVPYAS 573
GVL++SQ Y+ ++ RFN+ +A +P+G H K + + + D+E + VPYAS
Sbjct: 1090 KNGVLRMSQASYIHNVVQRFNMAEAKVTRSPIGAHFKLA---AVRDDDECIDNNAVPYAS 1146
Query: 574 AVGSLMYAMVCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYLKGSSRMCLCF-RRNN 632
AVGS+MYAM+ RPD+A+ + +VSR+M+ PG HWE VKWILRY++GS + L F +
Sbjct: 1147 AVGSIMYAMIGIRPDLAYVICLVSRYMARPGSIHWEAVKWILRYMRGSQDLNLVFTKEKE 1206
Query: 633 LTLQEFSDADLGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALSTTESEYVAISEAAK 692
+ + D+D D D +S +GY+FT+GG VSWK+ LQ+ ALSTTE+E++A++EAAK
Sbjct: 1207 FRVTGYCDSDYAADLDRRRSVSGYVFTVGGNTVSWKANLQSVTALSTTEAEFMALTEAAK 1266
Query: 693 EMIWLKSFLKELGKEQDVPPLFSDS 717
E +W+K +K+LG EQD L+ DS
Sbjct: 1267 EALWIKGLMKDLGLEQDKVTLWCDS 1291
>ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475188|gb|AAT44257.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1211
Score = 622 bits (1604), Expect = e-176
Identities = 328/707 (46%), Positives = 448/707 (62%), Gaps = 36/707 (5%)
Query: 99 LDYQLPEEVWSGKEVSLSHLKVFGCVSYVLIDSDRRDKLDPKAIKCFFIGYGFDMYGYRF 158
+D + P EVWSG + S L+VFGC +Y +D+ KL+P+AIKC F+GY + Y+
Sbjct: 519 IDKKTPIEVWSGSPANYSDLRVFGCTAYAHVDNG---KLEPRAIKCIFLGYPSGVKDYKL 575
Query: 159 WDEQNKKIIRSRNVTFNESVLYKDRSSAESMSSSKQLKLSERVALEEISESDVVKRNQIN 218
W + KK++ SRNV F+ESV+ D+ S V +E ++ V + I+
Sbjct: 576 WCPKTKKVVISRNVVFHESVMLHDKPSTN-------------VPVESQEKASVQVEHLIS 622
Query: 219 PENEDDEVEVELEQEPTTVIETPITQVRRST----------RTPKAPQRYSPSLNYLL-- 266
+ ++ +V + Q+ + + ++ + V++S R K P+RY N +
Sbjct: 623 SGHAPEKEDVAINQDASVIEDSDSSAVQQSPKRSIAKDKPKRNIKPPRRYIEEANIVAYA 682
Query: 267 ------LTDAGEPEYFGEAMQGNDSIKWELAMKDEMTSLQKNGTWSLTKLPEGKKALQNR 320
+ EP + EA+ + +W AM DEM SL+KN TW L KLP+ KK ++ +
Sbjct: 683 LSVAEEIEGNAEPSTYSEAIVSDGCNRWITAMHDEMESLEKNHTWELVKLPKEKKPIRCK 742
Query: 321 WVYRLKEESDGS--RRYKARLVVKGFQQKQGIDFTEIFSPVVKMTTIRVILSIVAAENLH 378
W+++ K+ S RYKARLV KG+ Q GIDF ++FSPV+K ++IR +LSIVA +
Sbjct: 743 WIFKRKKGMSPSDEARYKARLVAKGYSQIPGIDFNDVFSPVLKHSSIRTLLSIVAVHDYE 802
Query: 379 LEQLDVKIAFLHGDLEEEIYMTQPEGFEVLGTKNLVCKLHKSLYGLKQAPRQWYKKFNEF 438
LEQ+DVK AFLHG+LEE+IYM QPEGF V G +NLV +L KSLYGLKQ+PRQWYK+F+ F
Sbjct: 803 LEQMDVKTAFLHGELEEDIYMEQPEGFVVPGKENLVYRLKKSLYGLKQSPRQWYKRFDSF 862
Query: 439 MSNSGFNRCDMDHCCFVKKFADSYIILALYVDDMLIAGSNMTEINRLKQQMSENFEMKDL 498
M + F R + D C ++K S I L LYVDDMLIA + +EI +LK Q+S FEMKDL
Sbjct: 863 MFSQKFRRSNYDSCVYLKVVDGSSIYLLLYVDDMLIAAKDKSEIEKLKAQLSSEFEMKDL 922
Query: 499 GPAKQILGMRISRNRSEGVLKLSQEKYVEKLLDRFNVGDANTRSTPLGNHLKFSKKQSPQ 558
G AK+ILGM I+R R G L LSQ+ Y+EK+L RFN+ DA STPL H + S PQ
Sbjct: 923 GAAKKILGMEITRERHSGKLYLSQKGYIEKVLRRFNMHDAKPVSTPLAAHFRLSSDLCPQ 982
Query: 559 TDEEESYMSTVPYASAVGSLMYAMVCTRPDIAHAVGVVSRFMSNPGKEHWECVKWILRYL 618
+D + YMS VPY S VGSLMYAMVC+R D++HA+ VVSR+M+NPGKEHW+ V+WI +YL
Sbjct: 983 SDYDIEYMSRVPYLSVVGSLMYAMVCSRLDLSHALSVVSRYMANPGKEHWKVVQWIFKYL 1042
Query: 619 KGSSRMCLCFRRNNLTLQEFSDADLGGDSDGGKSTTGYIFTLGGTAVSWKSKLQNRVALS 678
+G+S CL F R+ L + D+D GD D +S TGY+FT+GG AVSWK+ LQ VALS
Sbjct: 1043 RGTSSACLQFGRSRDGLVGYVDSDFAGDLDRRRSLTGYVFTIGGCAVSWKASLQATVALS 1102
Query: 679 TTESEYVAISEAAKEMIWLKSFLKELGKEQDVPPLFSDSQSVIFLAKNPVFHSRCKHIQM 738
TTE EY+AISEA KE IWL+ EL +F DSQS I K+ +FH R KHI +
Sbjct: 1103 TTEVEYMAISEACKEAIWLRGLYTELCGVTSCINIFCDSQSAICFTKDQMFHERTKHIDV 1162
Query: 739 KYHFIRELISDEELSLLKILGSENPTDMLTKTVTADNLRLCIASAGL 785
+YHFIR +I++ ++ + KI +NP DM+TK V LC + G+
Sbjct: 1163 RYHFIRGVIAEGDVKVCKISTHDNPADMMTKPVPTTKFELCSSLVGV 1209
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.318 0.134 0.392
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,329,802,313
Number of Sequences: 2540612
Number of extensions: 56631214
Number of successful extensions: 168389
Number of sequences better than 10.0: 2189
Number of HSP's better than 10.0 without gapping: 2034
Number of HSP's successfully gapped in prelim test: 155
Number of HSP's that attempted gapping in prelim test: 161260
Number of HSP's gapped (non-prelim): 3308
length of query: 799
length of database: 863,360,394
effective HSP length: 136
effective length of query: 663
effective length of database: 517,837,162
effective search space: 343326038406
effective search space used: 343326038406
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)
Lotus: description of TM0065.18