
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0003.6
(1307 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cult... 970 0.0
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu... 960 0.0
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-... 952 0.0
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-... 920 0.0
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi... 888 0.0
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi... 881 0.0
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi... 875 0.0
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult... 869 0.0
dbj|BAD34493.1| Gag-Pol [Ipomoea batatas] 865 0.0
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu... 863 0.0
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi... 857 0.0
gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi... 855 0.0
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1... 853 0.0
gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis ... 845 0.0
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense] 811 0.0
emb|CAB79135.1| putative transposable element [Arabidopsis thali... 795 0.0
pir||T02206 hypothetical protein - common tobacco retrotransposo... 782 0.0
gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas] 762 0.0
emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir|... 746 0.0
emb|CAA37924.1| unnamed protein product [Arabidopsis thaliana] g... 690 0.0
>gb|AAT85194.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1241
Score = 970 bits (2507), Expect = 0.0
Identities = 554/1249 (44%), Positives = 770/1249 (61%), Gaps = 60/1249 (4%)
Query: 91 MSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSL 150
M+K +K+ KQ+L+ K+Q+ G + H+ AF I+AD+ + V D+ED +ILLCSL
Sbjct: 1 MTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSL 60
Query: 151 PGSYDHLVTTLTYGKDSITLDSISSTLLPHAQRRQSV-EEGGGSSGEGLFVKGGQDRGRG 209
P SY + T+ Y +D++TL + L + ++ V EG S EGL V+G Q
Sbjct: 61 PSSYANFRDTILYSRDTLTLKEVYDALHAKEKMKKMVPSEGSNSQAEGLVVRGRQQEKNT 120
Query: 210 KGKAVDSGK---KKRSKSKDRKTTECYSCKQIGH-----WK-RDCPNRSGK-----SGNN 255
K+ D + RSKS+ R + C CK+ GH WK +D R+ K
Sbjct: 121 NNKSRDKSSSIYRGRSKSRGRYKS-CKYCKRDGHDISECWKLQDKDKRTRKYIPKGKKEE 179
Query: 256 SSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYL 315
A VV + S +E + + +D W+LD+ C+YHM +R+WF ++++ G V +
Sbjct: 180 EGKAAVVTDEKSDAELLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEAVQGGTVLM 239
Query: 316 GDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDI 375
GDD PC + G+ V+I + DG +RTL VR++ + ++LISL TL GY + + I
Sbjct: 240 GDDTPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYKYSGGDG--I 297
Query: 376 LRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVE---TDDDATKLWHMRLGHLSERGM 432
L+V+KG++ VM+A N+Y L G TI+G+VA+V ++ DAT LWHMRLGH+SE G+
Sbjct: 298 LKVTKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGL 357
Query: 433 MELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV 492
EL KR +L G + C++C+ GK RV+F T H T+GILDYVHSD+ GP ++ S
Sbjct: 358 AELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPARKTSF 417
Query: 493 *GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTD 552
G RY +T DD+SRKVW YF+K+K + F FK WK VE QT RK+K LR+DNG E
Sbjct: 418 GGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKILRTDNGMELCS 477
Query: 553 KNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMA 612
K F +C+ GI RH++V TPQQNGVAERMNRT+ KARC+ NA L K WA ++ A
Sbjct: 478 KIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCMLSNASLPKQFWAEAVSTA 537
Query: 613 CYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIII 672
CYL+NRSP ++D K EVW+G+P + S+LR+F AY H+ D KL+P+ CI +
Sbjct: 538 CYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTAYAHV---DNGKLEPRVIKCIFL 594
Query: 673 GYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVPDTEVENSSQDKIQVDIEE 732
GY GVKGYKLW P KKV++SR+VVF E ML P T V SQ+K+ V +E
Sbjct: 595 GYLSGVKGYKLWCPETKKVVISRNVVFHESIMLHDK-----PSTNVPVESQEKVSVQVEH 649
Query: 733 TPVS----PRQIVAQQQSEP---GSDSGEVQD---YTLVRDREPSRITPPVRYGFE-DLA 781
S ++ VA Q P SDS VQ ++ +DR PP RY E ++
Sbjct: 650 LISSGHAPEKEDVAINQDAPVIEDSDSSIVQQSPKRSIAKDRPKRNTKPPRRYIEEANIV 709
Query: 782 AYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVI 836
AYAL + + +PSTY +A+ S + ++W++AM +EMESL+KN + L +LP K+ I
Sbjct: 710 AYALSVAEEIEGNAEPSTYSDAIVSDDCNRWITAMHDEMESLEKNHSWELEKLPKEKKPI 769
Query: 837 GCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASM 896
CKW++K+K ++ + ++KA LV KGYSQ GID++++FSPVV+H+SIR +L++VA
Sbjct: 770 RCKWIFKRKEGMSPSDEARYKARLVAKGYSQIPGIDFNDVFSPVVKHSSIRTLLSIVAMH 829
Query: 897 DMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRF 956
D LEQMDVKT FLHG LEE IY+EQPEGF G LVC+LK+SLYGLKQSPRQWYKRF
Sbjct: 830 DYELEQMDVKTAFLHGELEEDIYMEQPEGFVVPGKENLVCRLKKSLYGLKQSPRQWYKRF 889
Query: 957 DSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFD 1016
DS+ML +RR +YD CVY + + DGS I+LLLYVDDMLIAA ++ +LK +L EF+
Sbjct: 890 DSFMLSQKFRRSNYDSCVY-LKVVDGSAIYLLLYVDDMLIAAKDKSEIAKLKAQLSSEFE 948
Query: 1017 MKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLE 1076
MKDLGAAKKILGMEI ++R + KL+LSQK Y+E VL RF+M A VST L HF+LS +
Sbjct: 949 MKDLGAAKKILGMEITRERHSGKLYLSQKCYIEKVLHRFNMHDAKLVSTLLAAHFRLSSD 1008
Query: 1077 QSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQV 1134
P+ +IE MS++PY AV ++ M+ D + H V + + + Q
Sbjct: 1009 LCPQSAYDIEYMSRVPYSSAVSSLMYAMVCSRPD-LSHALSVVSRYMANPGKEHWKAVQW 1067
Query: 1135 DPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICL----YSCGG-PICW 1189
+ L G + + + S G+ G + R L ++ GG + W
Sbjct: 1068 IFRYLRGTSSACLQFGRS------SDGLVGYVDSDFAGDLDRRRSLTGYVFTVGGCAVSW 1121
Query: 1190 KSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQ 1249
K+S+Q+TVA+STTEAEYMA++EA KE +WL GL EL + + CDSQSAI LT +Q
Sbjct: 1122 KASLQATVALSTTEAEYMAISEACKEVIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQ 1181
Query: 1250 VYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
++H RTKHID+R+H IR ++A + + KI T +N D +TKPV + KF
Sbjct: 1182 MFHERTKHIDLRYHFIRGVIAEGDVKVCKISTHDNPVDMMTKPVPATKF 1230
>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|53370655|gb|AAU89150.1| integrase core domain
containing protein [Oryza sativa (japonica
cultivar-group)] gi|40538906|gb|AAR87163.1| putative
polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1322
Score = 960 bits (2482), Expect = 0.0
Identities = 557/1334 (41%), Positives = 780/1334 (57%), Gaps = 67/1334 (5%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKS-LQKALRD--EKPADIATVDWNEMKEKAAGLI 61
K+++ D F LWQ + +LAQ S L +AL +K T + KA LI
Sbjct: 5 KYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 64
Query: 62 TLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVY 121
L +S+D++ +L T ++W KLES+ MSK +K+ K +L+S K+ E G + H+
Sbjct: 65 QLHLSNDILQEVLQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHIS 124
Query: 122 AFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPHA 181
F I+AD+ + V DDED ++LLCSLP SY + T+ +D +TL + L
Sbjct: 125 VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSRDELTLAEVYEALQNRE 184
Query: 182 QRRQSVEEGGGSS-GEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQIGH 240
+ + V+ SS GE L V+G ++ + R +SK R C CK+ H
Sbjct: 185 KMKGMVQSYASSSKGEALQVRGRSEQRTYNDSNDHDKSQSRGRSKSRGKKFCKYCKKKNH 244
Query: 241 WKRDCPNRSGKSGNNSSS-ANVVQSDGSCSEEDLLCVSS--VKCTDAWVLDSGCSYHMTQ 297
+ +C K S A+VV S + D L V + V D W+LD+ CS+H+
Sbjct: 245 FIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASHDEWILDTACSFHICI 304
Query: 298 HREWFNSFKS-GDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLIS 356
+R+WF+S+KS + V +GDD P I G+ V+I DG RTL VR++ + +NLIS
Sbjct: 305 NRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMARNLIS 364
Query: 357 LGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDV--ASVETDD 414
L TL GY + +++VSKG++ M + N+Y L G T+ G V A+V D+
Sbjct: 365 LSTLDAEGYKYSGSGG--VVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAAVTKDE 422
Query: 415 DA-TKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTK 473
+ T LWHMRLGH+SE GM EL KRN+L G + C++CV GK RV+F T H+TK
Sbjct: 423 PSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTSVHRTK 482
Query: 474 GILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVEN 533
GILDYVH+D+ GP+++PS+ G RY +T DD+SRK W YF+K+K + FA FK K +E
Sbjct: 483 GILDYVHADLWGPSRKPSLGGARYMLTIIDDYSRKEWPYFLKHKDDTFAAFKERKVMIER 542
Query: 534 QTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARC 593
QT +++K L +DNG E+ F +C + GI RH ++ TPQQNGVAERMNRT+ KARC
Sbjct: 543 QTEKEVKVLCTDNGGEFCSDAFDDYCRKEGIVRHHTIPYTPQQNGVAERMNRTIISKARC 602
Query: 594 LRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVH 653
+ NA ++K WA N ACYL+NRSP L+ K E+W+G P D S LR+F AY H
Sbjct: 603 MLSNARMNKRFWAEAANTACYLINRSPSIPLNKKTPIEIWSGMPADYSQLRVFGCTAYAH 662
Query: 654 ISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVT-V 712
+ D KL+P++ C+ +GY GVKGYKLW+P K +SR+V+F+E M S T V
Sbjct: 663 V---DNGKLEPRAIKCLFLGYGSGVKGYKLWNPETNKTFMSRNVIFNEFVMFNDSLPTDV 719
Query: 713 VPDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSGEV---QDYTLVRDREPSRI 769
+P E Q + V +E +IV ++ S V QD + R
Sbjct: 720 IPGGSDEE--QQYVSVQVEHVDDQETEIVGNDVNDTVQHSPSVLQPQDEPIAHRRTKRSC 777
Query: 770 TPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET 823
PVR E D+ YA + ++ +P+TY EA+ S +++KW+SA+ EEM+SL+KN T
Sbjct: 778 GAPVRLIEECDMVYYAFSYAEQVENTLEPATYTEAVVSGDREKWISAIQEEMQSLEKNGT 837
Query: 824 *NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRH 883
LV LP K+ + CKW++K+K ++ E +FK LV KG+SQ G+DY+++FSPVV+H
Sbjct: 838 WELVHLPKQKKPVRCKWIFKRKEGLSPSEPPRFKVRLVAKGFSQIAGVDYNDVFSPVVKH 897
Query: 884 TSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLY 943
+SIR ++V D+ LEQ+DVKTTFLHG LEE+IY++QPEGF G VCKLKRSLY
Sbjct: 898 SSIRTFFSIVTMHDLELEQLDVKTTFLHGELEEEIYMDQPEGFIVPGKEDYVCKLKRSLY 957
Query: 944 GLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHD 1003
GLKQSPRQWYKRFDS+ML G++R ++D CVY+ ++ GS I+LLLYVDDMLIAA
Sbjct: 958 GLKQSPRQWYKRFDSFMLSHGFKRSEFDSCVYIKFVN-GSPIYLLLYVDDMLIAAKSKEQ 1016
Query: 1004 VNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHV 1063
+ LK +L EFDMKDLGAAKKILGMEI +DR + L+LSQ+SY++ VL RF+M A V
Sbjct: 1017 ITTLKKQLSSEFDMKDLGAAKKILGMEITRDRNSGLLFLSQQSYIKKVLQRFNMHDAKPV 1076
Query: 1064 STPLTNHFKLSLEQSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVH 1121
STP+ HFKLS Q D ++E MS++PY AV ++ M+ D LV +
Sbjct: 1077 STPIAPHFKLSALQCASTDEDVEYMSRVPYSSAVGSLMYAMVCSWPDLSHAMSLVSRYMA 1136
Query: 1122 VQARE-----------------AALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICG 1164
+E A L ++D K L GY S + R + G
Sbjct: 1137 NPGKEHWKAVQWIFRYLRGTADACLKFGRID-KGLVGYVDSDFAADLDKRR-----SLTG 1190
Query: 1165 LCR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVK 1224
+ SC + WK+++Q VA STTEAEYMA+AEA KE++WL GL
Sbjct: 1191 YV-----------FTIGSCA--VSWKATLQPVVAQSTTEAEYMAIAEACKESVWLKGLFA 1237
Query: 1225 ELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSEN 1284
EL + L CDSQSAI LT +Q++H RTKHID+++H +R+++A ++ + KI +N
Sbjct: 1238 ELCGVDSCINLFCDSQSAICLTKDQMFHERTKHIDIKYHYVRDIVAQGKLKVCKISIHDN 1297
Query: 1285 TTDKLTKPVTSDKF 1298
D +TKP+ KF
Sbjct: 1298 PADMMTKPIPVAKF 1311
>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
(japonica cultivar-group)]
Length = 1373
Score = 952 bits (2462), Expect = 0.0
Identities = 543/1309 (41%), Positives = 767/1309 (58%), Gaps = 61/1309 (4%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDWNEMK--EKAAGLIT 62
KF++ + F LWQ + +LAQ D A E++ +KA LI
Sbjct: 2 KFDLPLLNYDTRFSLWQVKMRGILAQTHDYDEALDNFGKRRAEWTAEEIRKDQKALALIQ 61
Query: 63 LCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYA 122
L + +D++ L T ++W KLES+ MSK +K+ K +L++LKM+E + H+
Sbjct: 62 LHLHNDILQECLTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMAE 121
Query: 123 FNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPHAQ 182
F I+AD+ + V DDED ++LLCSLP SY + T+ +D +TL + L +
Sbjct: 122 FKKIVADLVSMEVKYDDEDLGLLLLCSLPNSYANFRDTILLSRDELTLKEVYDALQNKEK 181
Query: 183 RRQSVEEGGGSS--GEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQIGH 240
+ V+ G SS GE L V+G + K D + +SK K C CK H
Sbjct: 182 MKIMVQNDGSSSSKGEALHVRGRTENRTSNEKNYDRRGRSKSKPPGNKKF-CVYCKLKNH 240
Query: 241 WKRDCPNRSGKSGNNSSSANVVQSDGSCSEED-----LLCVSSVKCTDAWVLDSGCSYHM 295
+C K N V + + S++D ++ V D W+LDS CS+H+
Sbjct: 241 NIDECKKVQAKERKNKKDGKVSVASAAASDDDSGDCLVVFAGCVAGHDEWILDSACSFHI 300
Query: 296 TQHREWFNSFKSGDLG-YVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNL 354
R WF+S+K G V +GDD PC I G+ V+I DDG RTL VRY+ + +NL
Sbjct: 301 CTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYIPGMSRNL 360
Query: 355 ISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGD--VASVET 412
ISL TL GY + + +L+VSKG++ ++ + +Y L G T+ G A+ T
Sbjct: 361 ISLSTLDAEGYKYSGSDG--VLKVSKGSLVCLKGDVNSAKLYVLRGCTLTGSDSAAAAIT 418
Query: 413 DDDATK--LWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHH 470
+D+ +K LWHMRLGH+S GM EL KRN+LKG S I C++C+ GK RV+F T H
Sbjct: 419 NDEPSKTNLWHMRLGHMSHLGMTELMKRNLLKGCTSSKIKFCEHCIFGKHKRVQFNTSVH 478
Query: 471 KTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAE 530
TKG LDYVH+D+ GP+K+PS+ G RY +T DD+SRKVW YF+K+K + F FK WK
Sbjct: 479 TTKGTLDYVHADLWGPSKKPSLGGARYMLTIIDDYSRKVWPYFLKHKDDTFTAFKNWKVM 538
Query: 531 VENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEK 590
+E QT RK+K LR+DNG E+ F +C + GI RH ++ TPQQNGVAERMNRT+ +
Sbjct: 539 IERQTERKVKLLRTDNGGEFCSHAFNDYCRQEGIVRHHTIPHTPQQNGVAERMNRTIISR 598
Query: 591 ARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPA 650
ARC+ +A ++K WA + ACYL+NRSP L+ K EVW+G P D S L++F A
Sbjct: 599 ARCMLSHARMNKRFWAEAASTACYLINRSPSIPLNKKTPIEVWSGTPADYSQLKVFGCTA 658
Query: 651 YVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQS-D 709
Y H+ D KL+P++ C+ +GY GVKGYKLW+P K +SR VVF+E M S
Sbjct: 659 YAHV---DNGKLEPRAVKCLFLGYGSGVKGYKLWNPETGKTFMSRSVVFNESVMFTNSLP 715
Query: 710 VTVVPDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSG---------------E 754
VP+ E++ VD ++T V + +Q + +D +
Sbjct: 716 SEHVPEKELQRMHMQVEHVD-DDTGVQVEPV--HEQDDHNNDVADDDAHDDVQQTPPILQ 772
Query: 755 VQDYTLVRDREPSRITPPVRYGFED--LAAYAL-----LTSSGDPSTYHEAMAS*EKDKW 807
+++ + R+ R T P + E+ L+ YAL + + +P+TY EA+ + + W
Sbjct: 773 LEEELPIAQRKSKRTTKPPKRLIEECNLSYYALSCAEQVENVHEPATYKEAVRCGDSENW 832
Query: 808 MSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQ 867
+SAM EEM+SL+KN T +V LP K+ I CKW++K+K ++ E K+KA LV +GYSQ
Sbjct: 833 ISAMHEEMQSLEKNSTWEVVPLPKKKKTISCKWIFKRKEGLSSSEPPKYKARLVARGYSQ 892
Query: 868 HKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFS 927
G+DY+++FSPVV+H+SIR L++VAS D+ LEQ+DVKT FLHG LEE IY++QPEGF
Sbjct: 893 IPGVDYNDVFSPVVKHSSIRTFLSIVASHDLELEQLDVKTAFLHGELEEDIYMDQPEGFI 952
Query: 928 ETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFL 987
G + VCKLKRSLYGLKQSPRQW KRFDS+ML ++R YD CVY+ + +GS I+L
Sbjct: 953 VPGKEKYVCKLKRSLYGLKQSPRQWNKRFDSFMLSHSFKRSKYDSCVYIKHV-NGSPIYL 1011
Query: 988 LLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSY 1047
LLYVDDMLIAA ++ +LK L EFDMKDLG+AKKILGMEI +DR + L+LSQ +Y
Sbjct: 1012 LLYVDDMLIAAKSKIEITKLKKLLSSEFDMKDLGSAKKILGMEISRDRKSGLLFLSQHNY 1071
Query: 1048 VEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPY--AVQLVV*CMLWF 1105
++ VL RF+M A VSTP+ HFKLS Q P ID+EIE MS++PY AV ++ M+
Sbjct: 1072 IKKVLQRFNMQNAKAVSTPIAPHFKLSAAQCPSIDAEIEYMSRVPYSSAVGSLMYAMVCS 1131
Query: 1106 ALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGL 1165
D + LV + +E + I G+ Y + R G+ G
Sbjct: 1132 RPDLSYAMSLVSRYMSNPGKE----HWRAVQWIFRYLRGTTYSCLKFGR---TDKGLIGY 1184
Query: 1166 CR*SR**KVYNRICL----YSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLT 1220
+ R L ++ G + W++++QS VA+STTEAEYMA+ EA KE +WL
Sbjct: 1185 VDSDYAADLDRRRSLTGYVFTIGSCAVSWRATLQSVVALSTTEAEYMAICEACKELIWLK 1244
Query: 1221 GLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELL 1269
GL EL + + LHCDS+SAIYLT +Q++H RTKHID+++H +R+++
Sbjct: 1245 GLYAELSGVESCISLHCDSESAIYLTKDQMFHERTKHIDIKYHFVRDVI 1293
>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza sativa
(japonica cultivar-group)]
Length = 2340
Score = 920 bits (2378), Expect = 0.0
Identities = 511/1116 (45%), Positives = 705/1116 (62%), Gaps = 53/1116 (4%)
Query: 17 FGLWQRMAKDLLAQKSLQKALRDEKPADIATVDW-NEMKE---KAAGLITLCVSDDVMNH 72
F LWQ + +LAQ+ L AL D T DW N+ K+ KA I L +S++++
Sbjct: 224 FSLWQVKMRAVLAQQDLDDALSG---FDKRTHDWSNDEKKRDRKAMSYIHLHLSNNILQE 280
Query: 73 ILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTR 132
+L +W KLE + M+K +K+ KQ L+ K+Q+ G + H+ AF I+AD+
Sbjct: 281 VLKEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSAFKEIIADLES 340
Query: 133 LGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPHAQRRQSV-EEGG 191
+ V D+ED +ILLCSLP SY + T+ Y +D++TL + L + ++ V EG
Sbjct: 341 MEVKYDEEDLGLILLCSLPSSYANFRDTILYSRDTLTLKEVYDALHVKEKMKKMVPSEGS 400
Query: 192 GSSGEGLFVKGGQDRGRGKGKAVD---SGKKKRSKSKDRKTTECYSCKQIGH-----WK- 242
S EGL V G Q K ++ D S + RSKS+ R + C CK+ GH WK
Sbjct: 401 NSQAEGLIVWGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKS-CKYCKRDGHDIFECWKL 459
Query: 243 RDCPNRSGK-----SGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQ 297
D R+GK A VV + S +E + + +D W+L++ C YHM
Sbjct: 460 HDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTSDQWILNTACIYHMCP 519
Query: 298 HREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISL 357
+R+WF ++++ +G V +GDD PC + G+ V+I + DG +RTLS VR++ + ++LISL
Sbjct: 520 NRDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVRHIPNLKRSLISL 579
Query: 358 GTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVE---TDD 414
TL GY + + IL+V+KG++ VM+A + N+Y L G TI+G+VA+V ++
Sbjct: 580 CTLDRKGYKYSGGDG--ILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDSLSNS 637
Query: 415 DATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKG 474
DAT LWHMRLGH++E G+ EL KR +L G + C++C+ GK RV+F T H T+G
Sbjct: 638 DATNLWHMRLGHMTEIGLAELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEG 697
Query: 475 ILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQ 534
ILDYVHSD+ GP ++ S G RY +T DD+SRKVW YF+K+K + F FK WK VE Q
Sbjct: 698 ILDYVHSDLWGPARKTSFGGTRYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQ 757
Query: 535 TGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCL 594
T RK+K LR+DNG E+ K F +C+ GI RH++V TPQQNGVAERMNRT+ KARCL
Sbjct: 758 TERKVKILRTDNGMEFCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCL 817
Query: 595 RLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHI 654
NA L K WA ++ ACYL+NRSP ++D K EVW+G+P + S+LR+F AY H+
Sbjct: 818 LSNAGLPKQFWAEAVSTACYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTAYAHV 877
Query: 655 SSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVP 714
D KL+P++ CI +GY GVKGYKLW P KKV++SR+VVF E ML P
Sbjct: 878 ---DNGKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHESVMLHDK-----P 929
Query: 715 DTEVENSSQDKIQVDIEETPVS----PRQIVAQQQSEP---GSDSGEVQD---YTLVRDR 764
T V SQ+K V +E S ++ VA Q EP S+S VQ ++ +DR
Sbjct: 930 STNVPVESQEKASVQVEHLISSGHAPEKEDVAINQDEPVIEDSNSSIVQQSPKRSIAKDR 989
Query: 765 EPSRITPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESL 818
I PP RY E ++ AYAL + + +PSTY EA+ S + ++W++AM +EMESL
Sbjct: 990 PKRNIKPPQRYIEEANIVAYALSVAEEIEGNAEPSTYSEAIVSDDCNRWITAMHDEMESL 1049
Query: 819 KKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFS 878
+KN T LV+LP K+ I CKW++K+K ++ + ++KA L+ KGYSQ GID++++FS
Sbjct: 1050 EKNHTWELVKLPKEKKPIRCKWIFKRKEGISSSDEARYKARLIAKGYSQIPGIDFNDVFS 1109
Query: 879 PVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKL 938
PVV+H+SIR +L++VA D LEQMDVKT FLHG LEE IY+EQP+GF G LVC+L
Sbjct: 1110 PVVKHSSIRTLLSIVAMHDYELEQMDVKTAFLHGELEEDIYMEQPKGFVVPGKENLVCRL 1169
Query: 939 KRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAA 998
K+SLYGLKQSPRQWYKRFDS+ML +RR +YD CVY + + DGS I+LLLYVDDMLIAA
Sbjct: 1170 KKSLYGLKQSPRQWYKRFDSFMLSQKFRRSNYDSCVY-LKVVDGSAIYLLLYVDDMLIAA 1228
Query: 999 NHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMS 1058
++ +LK +L EF+MKDLGAAKKILGMEI + R + KL+LSQK Y+E VL RF+M
Sbjct: 1229 KDKSEIAKLKAQLSSEFEMKDLGAAKKILGMEITRKRHSFKLYLSQKGYIEKVLRRFNMH 1288
Query: 1059 KANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
A VSTPL HF+LS + P+ D +IE MS++PY+
Sbjct: 1289 DAKPVSTPLAAHFRLSSDLCPQSDYDIEYMSRVPYS 1324
Score = 126 bits (316), Expect = 6e-27
Identities = 62/120 (51%), Positives = 85/120 (70%), Gaps = 1/120 (0%)
Query: 1180 LYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCD 1238
+++ GG + WK+S+Q+TVA+STTEAEYMA++EA KEA+WL GL L + + CD
Sbjct: 1409 VFTIGGCAVSWKASLQATVALSTTEAEYMAISEACKEAIWLRGLYTVLCAVTSCINIFCD 1468
Query: 1239 SQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
SQSAI LT +Q++H RTKHIDVR+H IR L+A + + KI +N D +TKPV + KF
Sbjct: 1469 SQSAICLTKDQMFHERTKHIDVRYHFIRGLIAEGDVKICKISIHDNPADMMTKPVPATKF 1528
>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 888 bits (2294), Expect = 0.0
Identities = 537/1381 (38%), Positives = 768/1381 (54%), Gaps = 148/1381 (10%)
Query: 7 EVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADI--ATVDWN------------- 51
EV +FDG G++ LW+ + L + L +E+ A + +T + +
Sbjct: 7 EVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETATSK 66
Query: 52 -------EMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQR 104
E + KA I L + ++V+ ++ T + L+ L+M+K+ N+++ KQR
Sbjct: 67 LEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYLKQR 126
Query: 105 LYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYG 164
LY KM E ++ +V F +++D+ + V V DED+AI+LL SLP +D L TL Y
Sbjct: 127 LYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLKYC 186
Query: 165 KDSITLDSISSTLLPHAQRRQSVEEGGGS------SGEGLFVKGGQDRGRGKGKAVDSGK 218
K ++ L+ I+S + R + E G S + +GLFV QDRGR + + K
Sbjct: 187 KTTLHLEEITSAI------RSKILELGASGKLLKNNSDGLFV---QDRGRSETRGKGPNK 237
Query: 219 KK-RSKSKDRKTTECYSCKQIGHWKRDC-----PNRSGKSGNNSSSANVVQ--SDGSCSE 270
K RSKSK T C+ C + GH+K+ C N+ G + ++ V +D +
Sbjct: 238 NKSRSKSKGAGKT-CWICGKEGHFKKQCYVWKERNKQGSTSERGEASTVTARVTDAAALV 296
Query: 271 EDLLCVSSVKCT-DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQV 329
+ + T D W+LD+GCS+HMT ++W FK G V +G+D +KG+ V
Sbjct: 297 VSRALLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEVKGIGDV 356
Query: 330 KIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAK 389
+I +DG L+ VRY+ E+ KNLISLGTL + G F+S++ IL + K +TV+ K
Sbjct: 357 RIKNEDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKG--ILTIFKNDLTVLTGK 414
Query: 390 RTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCII 449
+ + +Y L G T+ G+ ++ + D T LWH RLGH+ +G+ L + L
Sbjct: 415 KES-TLYFLQGTTLAGEANVIDKEKDETSLWHSRLGHIGAKGLQVLVSKGHLD------- 466
Query: 450 GLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEP-SV*GFRYFVTFTDDFSRK 508
K + F H TK LDYVHSD+ G T P S+ +YF+TF DDF+R+
Sbjct: 467 ---------KNIMISFGAAKHVTKDKLDYVHSDLWGSTNVPFSIGKCQYFITFIDDFTRR 517
Query: 509 VWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHF 568
W+YF++ K E F+KF WK ++ENQ +K+K L +DNG E+ ++ F FC + G+ RH
Sbjct: 518 TWIYFIRTKDEAFSKFVEWKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKEGVIRHR 577
Query: 569 SVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKV 628
+ TPQQNGVAERMNRT+ K RC+ + L K WA + A +L+N+SP +S++ +
Sbjct: 578 TCAYTPQQNGVAERMNRTIMNKVRCMLSESGLGKQFWAEAASTAVFLINKSPSSSIEFDI 637
Query: 629 AEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVK 688
EE WTG+P D L+ F AY+H D+ KL+P++K I +GY GVK +K+W
Sbjct: 638 PEEKWTGHPPDYKILKKFGSVAYIH---SDQGKLNPRAKKGIFLGYPDGVKRFKVWLLED 694
Query: 689 KKVIVSRDVVFDE*SMLKQSDVTVVPDTEVENSSQDKIQVDIEETPVSPRQIVA--QQQS 746
+K +VSRD+VF E M K+ + S +DK ++E T + + + A + QS
Sbjct: 695 RKCVVSRDIVFQENQMYKELQKN-------DMSEEDKQLTEVERTLIELKNLSADDENQS 747
Query: 747 EPGSDSGEVQ------------------------DYTLVRDREPSRITPPVRYGFED--L 780
E G +S + Q +Y L RDR +I P R+ ED L
Sbjct: 748 EGGDNSNQEQASTTRSASKDKQVEETDSDDDCLENYLLARDRIRRQIRAPQRFVEEDDSL 807
Query: 781 AAYAL-LTSSGD---PSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVI 836
+AL +T G+ P TY EAM S E +KW A +EEM+S+KKN+T +++ P GKRVI
Sbjct: 808 VGFALTMTEDGEVYEPETYEEAMRSPECEKWKQATIEEMDSMKKNDTWDVIDKPEGKRVI 867
Query: 837 GCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASM 896
GCKW++K+K + E ++KA LV KG+SQ +GIDY EIFSPVV+H SIR +L++V
Sbjct: 868 GCKWIFKRKAGIPGVEPPRYKARLVAKGFSQREGIDYQEIFSPVVKHVSIRYLLSIVVQF 927
Query: 897 DMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRF 956
DM LEQ+DVKT FLHGNL+E I + QPEG+ + VC LK+SLYGLKQSPRQW +RF
Sbjct: 928 DMELEQLDVKTAFLHGNLDEYILMSQPEGYEDEDSTEKVCLLKKSLYGLKQSPRQWNQRF 987
Query: 957 DSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFD 1016
DS+M+ GY+R Y+ CVY L+DGS+I+LLLYVDDMLIA+ + + +LK L +EF+
Sbjct: 988 DSFMINSGYQRSKYNPCVYTQQLNDGSYIYLLLYVDDMLIASQNKDQIQKLKESLNREFE 1047
Query: 1017 MKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLE 1076
MKDLG A+KILGMEI ++R L LSQ YV GVL F M ++ TPL HFKL
Sbjct: 1048 MKDLGPARKILGMEITRNREQGILDLSQSEYVAGVLRAFGMDQSKVSQTPLGAHFKLRAA 1107
Query: 1077 QSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIW---------------HKQLVKCQ 1119
+ + E M +PY A+ ++ M+ D + H Q VK
Sbjct: 1108 NEKTLARDAEYMKLVPYPNAIGSIMYSMIGSRPDLAYPVGVVSRFMSKPSKEHWQAVKWV 1167
Query: 1120 VHVQ--AREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNR 1177
+ ++ L + D + GY S Y A I G
Sbjct: 1168 MRYMKGTQDTCLRFKKDDKFEIRGYCDSDY-----ATDLDRRRSITGF------------ 1210
Query: 1178 ICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLH 1236
+++ GG I WKS +Q VA+STTEAEYMA+AEA KEA+WL GL E+G EQ V++
Sbjct: 1211 --VFTAGGNTISWKSGLQRVVALSTTEAEYMALAEAVKEAIWLRGLAAEMGFEQDAVEVM 1268
Query: 1237 CDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSD 1296
CDSQSAI L+ N V+H RTKHIDVR+H IRE +A +I + KI T+ N D TK V
Sbjct: 1269 CDSQSAIALSKNSVHHERTKHIDVRYHFIREKIADGEIQVVKISTTWNPADIFTKTVPVS 1328
Query: 1297 K 1297
K
Sbjct: 1329 K 1329
>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi|11278366|pir||T47492
copia-like polyprotein - Arabidopsis thaliana
Length = 1363
Score = 881 bits (2276), Expect = 0.0
Identities = 526/1365 (38%), Positives = 770/1365 (55%), Gaps = 85/1365 (6%)
Query: 3 GAKFEVTRFDGTGNFGLWQRMAK---DLLA--------------QKSLQKALRDEKPADI 45
GA+ EV +FDG G++ +W+ D+L ++ +K+ DEK
Sbjct: 3 GARIEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEERE 62
Query: 46 ATVDWNEMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRL 105
+ E K KA I L VSD V+ I T+ + + L+ LYMSK N+++ KQ+L
Sbjct: 63 KMEAFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQKL 122
Query: 106 YSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTY-- 163
YS KM E ++ ++ F +I+AD+ L V V DED+AI+LL SLP +D L TL Y
Sbjct: 123 YSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYSS 182
Query: 164 GKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSK 223
GK ++LD +++ + SV++ EGL+VK + RG+ + D GK KRSK
Sbjct: 183 GKTVLSLDEVAAAIYSRELEFGSVKKSIKGQAEGLYVKDKAEN-RGRSEQKDKGKGKRSK 241
Query: 224 SKDRKTTECYSCKQIGHWKRDCPNRS----------------GKSGNNSSSANVVQSDGS 267
SK ++ C+ C + GH K CPN++ GK S N V+S G
Sbjct: 242 SKSKRG--CWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKGNLVEGSVNFVESAGM 299
Query: 268 CSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMR 327
E L + + D W++D+GC YHMT REW F G V +G+ +KG+
Sbjct: 300 FVSE-ALSSTDIHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGSVRMGNKSISRVKGVG 358
Query: 328 QVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMR 387
V+I D+G TL VRY+ ++ +NL+SLGT + G+ F+SE +LR+ G ++
Sbjct: 359 TVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFEKAGHKFESENG--MLRIKSGNQVLLE 416
Query: 388 AKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSC 447
+R +Y L G + +V +D T LWH RL H+S++ M L K+ L +
Sbjct: 417 GRRY-DTLYILHGKPATDESLAVARANDDTVLWHRRLCHMSQKNMSLLIKKGFLDKKKVS 475
Query: 448 IIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRG-PTKEPSV*GFRYFVTFTDDFS 506
++ C+ C+ G+ ++ F H TK L+YVHSD+ G PT S+ +YF++F DD++
Sbjct: 476 MLDTCEDCIYGRAKKIGFNLAQHDTKKKLEYVHSDLWGAPTVPMSLGNCQYFISFIDDYT 535
Query: 507 RKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQR 566
RKVWVYF+K K E F KF W + VENQ+G ++K LR+DNG E+ ++ F FCEE G QR
Sbjct: 536 RKVWVYFLKTKDEAFEKFVSWISLVENQSGERVKTLRTDNGLEFCNRMFDGFCEEKGFQR 595
Query: 567 HFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDG 626
H + TPQQNGV ERMNRT+ EK R + ++ L K WA + A L+N++P ++++
Sbjct: 596 HRTCAYTPQQNGVVERMNRTIMEKVRSMLCDSGLPKRFWAEATHTAVLLINKTPCSAINF 655
Query: 627 KVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDP 686
+ ++ W+G S LR + +VH D KL+ ++K ++IGY GVKGYK+W
Sbjct: 656 EFPDKRWSGKAPIYSYLRRYGCVTFVHT---DGGKLNLRAKKGVLIGYPSGVKGYKVWLI 712
Query: 687 VKKKVIVSRDVVFDE*SMLKQ-SDVTVVPDTEVENSSQDKIQVDIE------------ET 733
+KK +VSR+V F E ++ K E ++ + I +D+E +
Sbjct: 713 EEKKCVVSRNVSFQENAVYKDLMQRKEQVSCEEDDHAGSYIDLDLEADKDNSSGGEQSQA 772
Query: 734 PVSPRQIVAQQQSEPGSDSGEVQD---------YTLVRDREPSRITPPVRYGFEDLAAYA 784
V+P A + P ++ ++++ Y LVRDRE I P R+ ED A A
Sbjct: 773 QVTPATRGAVTSTPPRYETDDIEETDVHQSPLSYHLVRDRERREIRAPRRFDDEDYYAEA 832
Query: 785 LLTSSG----DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKW 840
L T+ +P+ Y EA+ DKW AM EE+ES KN+T V P +R+IG +W
Sbjct: 833 LYTTEDGDAVEPADYKEAVRDENWDKWRLAMNEEIESQLKNDTWTTVTRPEKQRIIGSRW 892
Query: 841 VYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHL 900
+YK K + E +FKA LV KGY+Q +G+DY EIF+PVV+H SIR++L++VA ++ L
Sbjct: 893 IYKYKQGIPGVEEPRFKARLVAKGYAQREGVDYHEIFAPVVKHVSIRILLSIVAQENLEL 952
Query: 901 EQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYM 960
EQ+DVKT FLHG L+E+IY+ PEG VC L +SLYGLKQ+PRQW ++F+ YM
Sbjct: 953 EQLDVKTAFLHGELKEKIYMMPPEGCESLFKENEVCLLNKSLYGLKQAPRQWNEKFNHYM 1012
Query: 961 LRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDL 1020
IG++R DYD C Y L D S ++LL YVDDML+AAN++ ++ LK +L +F+MKDL
Sbjct: 1013 TEIGFKRSDYDSCAYTKKLSDDSTMYLLFYVDDMLVAANNMQAIDALKKELSIKFEMKDL 1072
Query: 1021 GAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPK 1080
GAAKKILG+EI DR A LWLSQ+SY+ VL F+M ++ TPL H K+ K
Sbjct: 1073 GAAKKILGIEIIIDREAGVLWLSQESYLNKVLKTFNMLESKPALTPLGAHLKMKSATEEK 1132
Query: 1081 IDSEIEGMSKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKI 1138
+ +E E M+ +PY AV ++ M+ D + +V + A+E LG V +
Sbjct: 1133 LSTEEEYMNSVPYSSAVGSIMYAMIGTRPDLAYPVGVVSRFMSQPAKEHWLGVKWV-LRY 1191
Query: 1139 LEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICL----YSCGG-PICWKSSV 1193
++G +R + + ICG C + R + ++ GG I WKS +
Sbjct: 1192 IKGTVDTRLCYKRNS-----DFSICGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGL 1246
Query: 1194 QSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHA 1253
Q VA S+TE EYM++ EA KEA+WL GL+K+ G EQ V++ CDSQSAI L+ N V+H
Sbjct: 1247 QRVVAQSSTECEYMSLTEAVKEAIWLKGLLKDFGYEQKNVEIFCDSQSAIALSKNNVHHE 1306
Query: 1254 RTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
RTKHIDV+FH IRE++A ++ + KI T +N D TK + +KF
Sbjct: 1307 RTKHIDVKFHFIREIIADGKVEVSKISTEKNPADIFTKVLPVNKF 1351
>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301696|pir||F84486 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1356
Score = 875 bits (2261), Expect = 0.0
Identities = 516/1360 (37%), Positives = 758/1360 (54%), Gaps = 84/1360 (6%)
Query: 4 AKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPA-------DIATVDWNEM--- 53
A+ EV +FDG G++ +W+ + L AL++ + D + D+ E
Sbjct: 4 ARIEVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEK 63
Query: 54 -------KEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLY 106
K+KA I L V+D V+ I +T + L+ LYMSK N+++ KQ+LY
Sbjct: 64 FEALEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQKLY 123
Query: 107 SLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTY--G 164
S KM E ++ ++ F I+ D+ + V + DED+AI+LL +LP ++D L TL Y G
Sbjct: 124 SFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKYSSG 183
Query: 165 KDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKS 224
K +TLD +++ + SV++ EGL+VK D+ KGK GK K K
Sbjct: 184 KSILTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVK---DKNENKGKGEQKGKGKGKKG 240
Query: 225 KDRKTTECYSCKQIGHWKRDCPNRSG---------KSGNNSSSANVVQSDGSCSEEDLLC 275
K +K C++C + GH++ CPN++ K ++ N+ ++ G E L
Sbjct: 241 KSKKKPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAAGYYVSE-ALS 299
Query: 276 VSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDD 335
+ V D W+LD+GCSYHMT REWF+ F G V +G+ ++G+ +++ D
Sbjct: 300 STEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGTIRVKNSD 359
Query: 336 GGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNI 395
G L+ VRY+ ++ +NL+SLGT + GY F+SE+ ILR+ G ++ +R +
Sbjct: 360 GLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDG--ILRIKAGNQVLLTGRRY-DTL 416
Query: 396 YKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYC 455
Y L + + +V D T LWH RL H+S++ M L ++ L + + +C+ C
Sbjct: 417 YLLNWKPVASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDKKKVSSLDVCEDC 476
Query: 456 VLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEP-SV*GFRYFVTFTDDFSRKVWVYFM 514
+ GK R F HH TK L+Y+HSD+ G P S+ +YF++ DDF+RKVWVYFM
Sbjct: 477 IYGKAKRKSFSLAHHDTKEKLEYIHSDLWGAPFVPLSLGKCQYFMSIIDDFTRKVWVYFM 536
Query: 515 KYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTP 574
K K E F KF W VENQT R++K LR+DNG E+ +K F FCE GI RH + TP
Sbjct: 537 KTKDEAFEKFVEWVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCESIGIHRHRTCAYTP 596
Query: 575 QQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWT 634
QQNGVAERMNRT+ EK R + ++ L K WA + L+N++P ++L+ ++ ++ W+
Sbjct: 597 QQNGVAERMNRTIMEKVRSMLSDSGLPKRFWAEATHTTVLLINKTPSSALNFEIPDKKWS 656
Query: 635 GNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVS 694
GNP S LR + A+VH D KL+P++K ++IGY GVKGYK+W ++K +VS
Sbjct: 657 GNPPVYSYLRRYGCVAFVHT---DDGKLEPRAKKGVLIGYPVGVKGYKVWILDERKCVVS 713
Query: 695 RDVVFDE*SMLK-----QSDVTVVPDTEVEN-------------SSQDKIQVDIEETPVS 736
R+++F E ++ K Q +V+ D + + S D+ V+ P S
Sbjct: 714 RNIIFQENAVYKDLMQRQENVSTEEDDQTGSYLEFDLEAERDVISGGDQEMVNTIPAPES 773
Query: 737 PRQIVAQQQSEPGSDSGEVQD----YTLVRDREPSRITPPVRYGFEDLAAYALLTSSG-- 790
P Q + +V Y LVRDR+ I P R+ ED A AL T+
Sbjct: 774 PVVSTPTTQDTNDDEDSDVNQSPLSYHLVRDRDKREIRAPRRFDDEDYYAEALYTTEDGE 833
Query: 791 --DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAV 848
+P Y +A DKW AM EE++S +KN T +V P +R+IGC+W++K KL +
Sbjct: 834 AVEPENYRKAKLDANFDKWKLAMDEEIDSQEKNNTWTIVTRPENQRIIGCRWIFKYKLGI 893
Query: 849 T*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTT 908
E +FKA LV KGY+Q +GIDY EIF+PVV+H SIRV+L++VA D+ LEQ+DVKT
Sbjct: 894 LGVEEPRFKARLVAKGYAQKEGIDYHEIFAPVVKHVSIRVLLSIVAQEDLELEQLDVKTA 953
Query: 909 FLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRC 968
FLHG L+E+IY+ PEG+ VC L ++LYGLKQ+P+QW ++FD++M I + +
Sbjct: 954 FLHGELKEKIYMSPPEGYESMFKANEVCLLNKALYGLKQAPKQWNEKFDNFMKEICFVKS 1013
Query: 969 DYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILG 1028
YD C Y L DGS ++LL+YVDD+L+A+ + + LK LG F+MKDLGAAKKILG
Sbjct: 1014 AYDSCAYTKVLPDGSVMYLLIYVDDILVASKNKEAITALKANLGMRFEMKDLGAAKKILG 1073
Query: 1029 MEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGM 1088
MEI +DR LWLSQ+ Y+ +L ++M++A TPL HFK K+ + + M
Sbjct: 1074 MEIIRDRTLGVLWLSQEGYLNKILETYNMAEAKPAMTPLGAHFKFQAATEQKLIRDEDFM 1133
Query: 1089 SKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSR 1146
+PY AV ++ ML D + ++ + +E LG V L G+
Sbjct: 1134 KSVPYSSAVGSIMYAMLGTRPDLAYPVGIISRFMSQPIKEHWLGVKWV----LRYIKGT- 1188
Query: 1147 YHV*QGARCC---SISCGICGLCR*SR**KVYNRICL----YSCGG-PICWKSSVQSTVA 1198
R C S S I G C + R + ++ GG I WKS +Q VA
Sbjct: 1189 ----LKTRLCYKKSSSFSIVGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGLQRVVA 1244
Query: 1199 MSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHI 1258
STTE+EYM++ EA KEA+WL GL+K+ G EQ V++ CDSQSAI L+ N V+H RTKHI
Sbjct: 1245 QSTTESEYMSLTEAVKEAIWLKGLLKDFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHI 1304
Query: 1259 DVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
DV++H IRE+++ + + KI T +N D TK + KF
Sbjct: 1305 DVKYHFIREIISDGTVEVLKISTEKNPADIFTKVLAVSKF 1344
>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1280
Score = 869 bits (2246), Expect = 0.0
Identities = 486/1091 (44%), Positives = 676/1091 (61%), Gaps = 53/1091 (4%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDW-NEMKEK---AAGL 60
K+++ D F LWQ + +LAQ+ L AL D T DW N+ K+K A
Sbjct: 40 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSG---FDKRTQDWSNDEKKKDRKAMSY 96
Query: 61 ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
I L +S++++ +L T +W KLE + M+K +K+ KQ+L+ K+Q+ G + H+
Sbjct: 97 IHLHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHL 156
Query: 121 YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPH 180
F I+AD+ + V D+ED +ILLCSLP SY + T+ Y D++ L + L
Sbjct: 157 STFKEIVADLESIEVKYDEEDLGLILLCSLPSSYANFRDTILYSHDTLILKEVYDALHAK 216
Query: 181 AQRRQSV-EEGGGSSGEGLFVKGGQDRGRGKGKAVD---SGKKKRSKSKDRKTTECYSCK 236
+ ++ V EG S EGL V+G Q K ++ D S + RSKS+ R + C CK
Sbjct: 217 EKMKKMVPSEGSNSQAEGLVVRGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKS-CKYCK 275
Query: 237 QIGH-----WK-RDCPNRSGK-----SGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAW 285
+ GH WK +D R+GK A VV + S +E + + +D W
Sbjct: 276 RDGHDISECWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDTELLVAYAGCAQTSDQW 335
Query: 286 VLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVR 345
+LD+ +YHM +R+WF ++++ G V +GDD PC + G+ V+I + DG +RTLS VR
Sbjct: 336 ILDTAWTYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLSDVR 395
Query: 346 YVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMG 405
++ + ++LISL TL GY + + IL+V+KG++ VM+A + N+Y L G TI+G
Sbjct: 396 HIPNLKRSLISLCTLDRKGYKYSGGDG--ILKVTKGSLVVMKADIKSANLYHLRGTTILG 453
Query: 406 DVASVE---TDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCR 462
+VA+V ++ DAT LWHMRLGH+SE G+ EL KR +L G + C++C+ GK R
Sbjct: 454 NVAAVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRELLDGQSIGKLKFCEHCIFGKHKR 513
Query: 463 VRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFA 522
V+F T H T+GILDYVHSD+ GP + S G RY +T DD+SRKVW YF+K+K + F
Sbjct: 514 VKFNTSTHTTEGILDYVHSDLWGPACKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFD 573
Query: 523 KFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAER 582
FK WK VE QT +K+K LR+DNG E+ K F +C+ GI H++V TPQQNGVAER
Sbjct: 574 VFKEWKTMVERQTEKKVKILRTDNGMEFCSKIFKSYCKSEGIVHHYTVPHTPQQNGVAER 633
Query: 583 MNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSN 642
MN + KARC+ NA L K WA ++ CYL+NRSP + D K EVW+G+P + S+
Sbjct: 634 MNMAIISKARCMLSNADLPKQFWAEAVSTTCYLINRSPSYATDKKTPIEVWSGSPANYSD 693
Query: 643 LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE* 702
LR+F AY H+ D KL+P++ CI +GY GVKGYKLW P KKV++SR+VVF E
Sbjct: 694 LRVFGCTAYAHV---DNGKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHES 750
Query: 703 SMLKQSDVTVVPDTEVENSSQDKIQVDIEETPVS----PRQIVAQQQSEP---GSDSGEV 755
+L P T V SQ+K V +E S ++ VA Q P SDS V
Sbjct: 751 VILHDK-----PSTNVPVESQEKASVQVEHLISSGHAPEKENVAINQDAPVIEDSDSSIV 805
Query: 756 QD---YTLVRDREPSRITPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDK 806
Q ++ +D+ I PP RY E ++ AYAL + + +PSTY EA+ S + ++
Sbjct: 806 QQSSKRSIAKDKPKRNIKPPRRYIEEANIVAYALSVAEEIEGNAEPSTYSEAIVSDDCNR 865
Query: 807 WMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYS 866
W++AM +EMESLKKN T V+LP K+ I CKW++K+K ++ + ++KA LV KGYS
Sbjct: 866 WITAMHDEMESLKKNHTWEFVKLPKEKKPIRCKWIFKRKEGMSPSDEARYKARLVAKGYS 925
Query: 867 QHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF 926
Q GID++++FSP+++H+SIR +L +VA D LEQMDVKT FLHG LEE IY+EQPEGF
Sbjct: 926 QIPGIDFNDVFSPIMKHSSIRTLLGIVAMHDYELEQMDVKTAFLHGELEEDIYMEQPEGF 985
Query: 927 SETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIF 986
G LVC+LK+SLYGLKQSPRQWYKRFDS+ML +RR +YD CVY + + DGS I+
Sbjct: 986 VVLGKENLVCRLKKSLYGLKQSPRQWYKRFDSFMLSQKFRRSNYDSCVY-LKVVDGSAIY 1044
Query: 987 LLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKS 1046
LLLYVDDMLIAA ++ +LK +L EF MKDLGAAKKILGMEI ++R + KL+LSQK
Sbjct: 1045 LLLYVDDMLIAAKDKSEIAKLKAQLSSEFGMKDLGAAKKILGMEITRERHSGKLYLSQKG 1104
Query: 1047 YVEGVLSRFDM 1057
Y++ VL RF+M
Sbjct: 1105 YIKKVLRRFNM 1115
Score = 127 bits (320), Expect = 2e-27
Identities = 62/120 (51%), Positives = 87/120 (71%), Gaps = 1/120 (0%)
Query: 1180 LYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCD 1238
+++ GG + WK+S+Q+TVA+STTEAEYMA++EA KEA+WL GL EL + + CD
Sbjct: 1150 VFTIGGCDVSWKASLQATVALSTTEAEYMAISEACKEAIWLRGLYTELCGVTSCINIFCD 1209
Query: 1239 SQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
SQSAI LT +Q++H RTKHIDVR+H IR ++ + + KI T +N D +TKPV++ KF
Sbjct: 1210 SQSAICLTKDQMFHERTKHIDVRYHIIRGVIVEGDVKVCKISTHDNPADMMTKPVSATKF 1269
>dbj|BAD34493.1| Gag-Pol [Ipomoea batatas]
Length = 1298
Score = 865 bits (2236), Expect = 0.0
Identities = 469/1100 (42%), Positives = 686/1100 (61%), Gaps = 29/1100 (2%)
Query: 4 AKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIAT-VDWNEMKEKAAGLIT 62
AKFE+ +F+G NF LW+ K +L + + A+ E+P D W+EM E A +
Sbjct: 3 AKFEIEKFNGK-NFSLWKLKVKAILRKDNCLAAI-SERPVDFTDDKKWSEMNEDAMADLY 60
Query: 63 LCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYA 122
L ++D V++ I + T ++WD L LY +K+ NK+F K++LY+L+M E + H+
Sbjct: 61 LSIADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTEHLNT 120
Query: 123 FNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYG--KDSITLDSISSTLLPH 180
N + + +T L ++ +++A +LL SLP SYD L+ LT D + D +++ +L
Sbjct: 121 LNTLFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDYLVFDDVAAAVLEE 180
Query: 181 AQRRQSVEEGGGS--SGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQI 238
RR++ E+ + E L V G+ RG+ S + RSKS + T CY+C +
Sbjct: 181 ESRRKNKEDRQVNLQQAEALTVMRGRSTERGQ-----SSGRGRSKSSKKNLT-CYNCGKK 234
Query: 239 GHWKRDCPNRSGKSGNNSSSANVVQSDGS--CSEEDLLCVSSVKCTDAWVLDSGCSYHMT 296
GH K+DC N + S + A+ DGS C E + + D W++DSG +YHMT
Sbjct: 235 GHLKKDCWNLAQNSNPQGNVAST-SDDGSALCCEASIAREGRKRFADIWLIDSGATYHMT 293
Query: 297 QHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLIS 356
+EWF+ ++ G VY DD I G+ +K+ + DG V+T+ VR+V + KNL+S
Sbjct: 294 SRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGLKKNLLS 353
Query: 357 LGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVET-DDD 415
G L + ++++ ++++ +GA+ VM+ ++ A N+Y L G T+ ASV D
Sbjct: 354 YGILDNSATQIETQKG--VMKIFQGALVVMKGEKIAANLYMLKGETLQEAEASVAACSPD 411
Query: 416 ATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGI 475
+T LWH +LGH+S++GM L ++ ++ G+ + LC++C+ KQ R++F T + + K +
Sbjct: 412 STLLWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFSTSNSRGKVV 471
Query: 476 LDYVHSDV-RGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQ 534
L+ VHSDV + P PS+ G +YFV+F DD+SR+ WVY +K KS+VFA FK +KA VE
Sbjct: 472 LELVHSDVWQAPV--PSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAFKARVELD 529
Query: 535 TGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCL 594
+G+KIK R+DNG EYT + F FC++ GI+R F+V TPQQNGVAERMNRTL E+ R +
Sbjct: 530 SGKKIKCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYTPQQNGVAERMNRTLLERTRAM 589
Query: 595 RLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHI 654
A L K WA +N ACYLVNR+P +++ K E+WTG P+D SNL IF Y
Sbjct: 590 LRAAGLEKSFWAEAVNTACYLVNRAPSTAIELKTPMEMWTGKPVDYSNLHIFGSIVYAMY 649
Query: 655 SSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVP 714
++++ +KLDPKS+ C +GY GVKGY+LWDP KV++SRDV+F E L++ +V
Sbjct: 650 NAQEITKLDPKSRKCRFLGYADGVKGYRLWDPTAHKVVISRDVIFVE-DRLQRGEVD--D 706
Query: 715 DTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSGEVQDYTLVRDREPSRITPPVR 774
TE E +IQV+ E S A ++ EP S T DRE R T
Sbjct: 707 STEKEKPETTQIQVEEEFEQDSSEAEPAHEEPEPESSGAPT---TRQSDREKRRPTWHSD 763
Query: 775 YGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKR 834
Y E AY LLT G+PST+ EA+ S + +W +AM EE+E+L KN T +LV LP G++
Sbjct: 764 YVMEGNVAYCLLTEDGEPSTFQEAINSSDVSQWTAAMQEEIEALHKNNTWDLVPLPQGRK 823
Query: 835 VIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVA 894
IG KWV+K K + E+++A LV KGY+Q +GID++EIFSPVVR T++RVVLA+ A
Sbjct: 824 PIGNKWVFKIKRNGD-DQVERYRARLVVKGYAQKEGIDFNEIFSPVVRLTTVRVVLAMCA 882
Query: 895 SMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYK 954
+ ++HLEQ+DVKT FLHG+LEE+IY+ QPEGF + + LVC+L +SLYGLKQ+PR WYK
Sbjct: 883 TFNLHLEQLDVKTAFLHGDLEEEIYMLQPEGFEDKENQNLVCRLNKSLYGLKQAPRCWYK 942
Query: 955 RFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKE 1014
RFDS+++ +GY R + D C Y +F+ LLLYVDDML+A + ++ELK +L +E
Sbjct: 943 RFDSFIMCLGYNRLNADPCAYFKRFGKDNFVILLLYVDDMLVAGPNKDHIDELKAQLARE 1002
Query: 1015 FDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLS 1074
F+MKDLG A KILGM+IH+DRG +K+WLSQK+Y++ +LSRF M +STPL + K+S
Sbjct: 1003 FEMKDLGPANKILGMQIHRDRGNRKIWLSQKNYLKKILSRFSMQDCKSISTPLPINLKVS 1062
Query: 1075 LEQSPKIDSEIEGMSKIPYA 1094
SP + MS++PYA
Sbjct: 1063 SSMSPSNEEGRMEMSRVPYA 1082
Score = 125 bits (315), Expect = 7e-27
Identities = 61/115 (53%), Positives = 82/115 (71%)
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
GG + W S +Q+ VA STTEAEY+A +A+KEA+WL L++ELG +Q V L CDSQSA+
Sbjct: 1172 GGAVSWVSKLQAVVATSTTEAEYVAATQASKEAIWLKMLLEELGHKQEFVSLFCDSQSAL 1231
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
+L N +H+RTKHI V++H IRE + + LQKIHT++N D LTK + DKF
Sbjct: 1232 HLARNPAFHSRTKHIRVQYHFIREKVKEGTVDLQKIHTADNVADFLTKIINVDKF 1286
>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1243
Score = 863 bits (2230), Expect = 0.0
Identities = 493/1123 (43%), Positives = 688/1123 (60%), Gaps = 71/1123 (6%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDW-NEMKE---KAAGL 60
K+++ D F LWQ + +LAQ+ L AL D T DW N+ K+ KA
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSG---FDKRTQDWSNDEKKRDRKAISY 61
Query: 61 ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
I L +S++++ +L T +W KLE + M+K +K+ KQ+L+ K+Q+ + H+
Sbjct: 62 IHLHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHL 121
Query: 121 YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPH 180
AF I+AD+ + V D++D +ILLCSLP SY + T+ Y +D++TL +
Sbjct: 122 SAFKEIVADLESMEVKYDEDDLGLILLCSLPSSYANFRGTILYSRDTLTLKEVYDAFHAK 181
Query: 181 AQRRQSV-EEGGGSSGEGLFVKGGQDRGRGKGKAVD---SGKKKRSKSKDRKTTECYSCK 236
+ ++ V EG S EGL V+G Q + K ++ D S + R+KS+ R + C CK
Sbjct: 182 EKMKKMVTSEGSNSQAEGLVVRGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYKS-CKYCK 240
Query: 237 QIGH-----WK-RDCPNRSGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSG 290
+ GH WK +D R+GK G EE V + + +DA +L +
Sbjct: 241 RDGHDISECWKLQDKDKRTGK----------YIPKGKKEEEGKAAVVTDEKSDAELLVAY 290
Query: 291 CSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEV 350
T ++WF ++++ G V +GDD PC + G+ V+I + DG +RTLS V+++ +
Sbjct: 291 AGCAQTSDQDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNL 350
Query: 351 MKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASV 410
++LISL IL+V+KG++ VM+ + N+Y L G TI+G+VA+V
Sbjct: 351 KRSLISL---------------YGILKVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAV 395
Query: 411 E---TDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKT 467
++ DAT LWHMRLGH+SE G+ EL KR +L G + C++C+ GK RV+F T
Sbjct: 396 FDSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNT 455
Query: 468 GHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLW 527
H T+GILDYVHSD+ GP + S G RY +T DD+SRKVW YF+K+K + F FK W
Sbjct: 456 STHTTEGILDYVHSDLWGPAHKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEW 515
Query: 528 KAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTL 587
K VE QT RK+K LR+DNG E+ K F +C+ GI H++ TPQQN VAERMNRT+
Sbjct: 516 KTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVCHYTAPHTPQQNDVAERMNRTI 575
Query: 588 TEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFW 647
KARC+ NA L K WA ++ ACYL+NRSP ++D K EVW+G+P + S+LR+F
Sbjct: 576 ISKARCMLSNAGLPKQFWAEAVSTACYLINRSPGYAIDKKTPIEVWSGSPTNYSDLRVFG 635
Query: 648 RPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQ 707
AY H+ D KL+P++ CI +GY GVKGYKLW P KKV++SR+VVF E +L
Sbjct: 636 CTAYAHV---DNGKLEPRAIKCIFLGYASGVKGYKLWCPETKKVVISRNVVFHESVILHD 692
Query: 708 SDVTVVPDTEVENSSQDKIQVDIEETPVS----PRQIVAQQQSEP---GSDSGEVQD--- 757
P T V SQ+K V +E S ++ VA Q P SDS V
Sbjct: 693 K-----PSTNVPVESQEKASVQVEHLISSGHAPEKEDVAINQDAPVIEDSDSSIVHQSPK 747
Query: 758 YTLVRDREPSRITPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAM 811
++ +D+ I PP RY E + AYAL + + +PSTY EA+ S + ++W++AM
Sbjct: 748 RSIAKDKPKRNIKPPRRYIEEAKIVAYALSVAEKIEGNAEPSTYSEAIVSDDCNRWITAM 807
Query: 812 VEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGI 871
+EMESL+KN T LV+LP K+ I CKW++K+K ++ + ++KA LV KGYSQ GI
Sbjct: 808 HDEMESLEKNHTWELVKLPKEKKPIRCKWIFKRKEGMSPSDEARYKARLVAKGYSQIPGI 867
Query: 872 DYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGD 931
D++++FSPVV+H+SIR +L +VA D LEQM+VKT FLHG LEE IY+EQPEGF G
Sbjct: 868 DFNDVFSPVVKHSSIRTLLGIVAMHDYELEQMNVKTAFLHGELEEDIYMEQPEGFVVPGK 927
Query: 932 GRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYV 991
LVC+LK+SLYGLKQSPRQWYKRFDS+ML +R +YD CVY + + DGS I+LLLYV
Sbjct: 928 ENLVCRLKKSLYGLKQSPRQWYKRFDSFMLSQKFRISNYDSCVY-LKVVDGSVIYLLLYV 986
Query: 992 DDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGV 1051
DDMLIAA ++ +LK +L EF+MKDLGAAKKILGMEI ++R + KL+LSQK Y+E V
Sbjct: 987 DDMLIAAKDKSEIEKLKAQLSSEFEMKDLGAAKKILGMEITRERHSGKLYLSQKGYIEKV 1046
Query: 1052 LSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
L RF+M A VSTPL HF+LS + P D +IE MS++PY+
Sbjct: 1047 LRRFNMHDAKPVSTPLAAHFRLSSDLCPLSDYDIEYMSRVPYS 1089
Score = 79.3 bits (194), Expect = 8e-13
Identities = 39/70 (55%), Positives = 52/70 (73%), Gaps = 1/70 (1%)
Query: 1180 LYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCD 1238
+++ GG + WK+S+Q+TVA+STTEAEYMA+ EA KEA+WL GL EL + + CD
Sbjct: 1174 VFTIGGCAVSWKASLQATVALSTTEAEYMAIFEACKEAIWLRGLYTELCGVTSCINIFCD 1233
Query: 1239 SQSAIYLTNN 1248
SQSAIYLT +
Sbjct: 1234 SQSAIYLTKD 1243
>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301702|pir||E84601 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 857 bits (2213), Expect = 0.0
Identities = 526/1347 (39%), Positives = 760/1347 (56%), Gaps = 81/1347 (6%)
Query: 4 AKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDE-----KPADIATVDWNEMKE--- 55
A+ EV +FDG G++ +W+ L L AL++E K A++ + E +E
Sbjct: 4 ARIEVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLR 63
Query: 56 ---------KAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLY 106
KA I L V+D V+ I + + L+ LYMSK N+++ KQ+LY
Sbjct: 64 RELLEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLY 123
Query: 107 SLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKD 166
S KM E ++ ++ F I+AD+ V V DED+AI+LL SLP +D L TL YG
Sbjct: 124 SFKMSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLG 183
Query: 167 SITL--DSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQD-RGRGKGKAVDSGKKKRSK 223
+TL D + + + S ++ EGLFVK + RGR + + ++ KK S+
Sbjct: 184 RVTLSLDEVVAAIYSKELELGSNKKSIKGQAEGLFVKEKTETRGRTEQRGNNNNNKK-SR 242
Query: 224 SKDRKTTECYSCKQIGHWKRDCPNRSGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTD 283
SK R C+ C G+S N SS N +++G E L + + D
Sbjct: 243 SKSRSKKGCWIC--------------GESSNGSS--NYSEANGLYVSE-ALSSTDIHLED 285
Query: 284 AWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQ 343
WV+D+GCSYHMT REWF G V +G+ ++G+ +++ + G V L+
Sbjct: 286 EWVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTVSKVRGIGTIRVKNEAGMVVRLTN 345
Query: 344 VRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTI 403
VRY+ E+ +NL+SLGT ++GYSFK E L + G ++ +R +Y L +
Sbjct: 346 VRYIPEMDRNLLSLGTFEKSGYSFKLENGT--LSIIAGDSVLLTVRRCY-TLYLLQWRPV 402
Query: 404 MGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRV 463
+ SV D T LWH RLGH+S++ M L K+ +L + + C+ C+ GK R+
Sbjct: 403 TEESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKKKVSKLETCEDCIYGKAKRI 462
Query: 464 RFKTGHHKTKGILDYVHSDVRGPTKEP-SV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFA 522
F H T+ L+YVHSD+ G P S+ +YF++F DD++RKV +YF+K K E F
Sbjct: 463 GFNLAQHDTREKLEYVHSDLWGAPSVPFSLGKCQYFISFIDDYTRKVRIYFLKTKDEAFD 522
Query: 523 KFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAER 582
KF W VENQT ++IK LR+DNG E+ +++F FC + GI H + TPQQNGVAER
Sbjct: 523 KFVEWANLVENQTDKRIKTLRTDNGLEFCNRSFDEFCSQKGILWHRTCAYTPQQNGVAER 582
Query: 583 MNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSN 642
MNRTL EK R + ++ L K WA + L+N++P ++L+ +V ++ W+G S
Sbjct: 583 MNRTLMEKVRSMLSDSGLPKKFWAEATHTTAILINKTPSSALNYEVPDKRWSGKSPIYSY 642
Query: 643 LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE* 702
LR F A+VH D KL+P++K I++GY GVKGYK+W +KK +VSR+V+F E
Sbjct: 643 LRRFGCIAFVHT---DDGKLNPRAKKGILVGYPIGVKGYKIWLLEEKKCVVSRNVIFQEN 699
Query: 703 SMLK---QSDVTVVPDTEVENSSQDKIQVDIEE--------------TPVSPRQIVAQQQ 745
+ K QS + E SS + +D EE +P +P Q
Sbjct: 700 ASYKDMMQSKDAEKDENEAPPSSYLDLDLDHEEVITSGGDDPIVEAQSPFNPSPATTQTY 759
Query: 746 SEP-GSDSGEVQD---YTLVRDREPSRITPPVRYGFEDLAAYALLTSSG----DPSTYHE 797
SE S++ +Q Y LVRDR+ I PVR+ ED A AL T+ +P+ Y E
Sbjct: 760 SEGVNSETDIIQSPLSYQLVRDRDRRTIRAPVRFDDEDYLAEALYTTEDSGEIEPADYSE 819
Query: 798 AMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFK 857
A S +KW AM EEMES KN T +V+ P ++VIG +W+YK KL + E +FK
Sbjct: 820 AKRSMNWNKWKLAMNEEMESQIKNHTWTVVKRPQHQKVIGSRWIYKFKLGIPGVEEGRFK 879
Query: 858 AHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQ 917
A LV KGY+Q KGIDY EIF+PVV+H SIR+++++VA D+ LEQ+DVKT FLHG L+E+
Sbjct: 880 ARLVAKGYAQRKGIDYHEIFAPVVKHVSIRILMSIVAQEDLELEQLDVKTAFLHGELKEK 939
Query: 918 IYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVM 977
IY+ PEG+ E VC L +SLYGLKQ+P+QW ++F++YM IG+ R YD C Y+
Sbjct: 940 IYMVPPEGYEEMFKEDEVCLLNKSLYGLKQAPKQWNEKFNAYMSEIGFIRSLYDSCAYIK 999
Query: 978 SLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGA 1037
L DGS ++LLLYVDDML+AA + D+++LK +L + FDMKDLGAAK+ILGMEI ++R
Sbjct: 1000 ELSDGSRVYLLLYVDDMLVAAKNKEDISQLKEELSQRFDMKDLGAAKRILGMEIIRNREE 1059
Query: 1038 KKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPY--AV 1095
LWLSQ Y+ +L ++M+++ HV TPL H K+ K + + + M IPY AV
Sbjct: 1060 NTLWLSQNGYLNKILETYNMAESKHVVTPLGAHLKMRAATVEKQEQDEDYMKSIPYSSAV 1119
Query: 1096 QLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARC 1155
++ M+ D + ++ + ARE LG V + ++G G++ + +
Sbjct: 1120 GSIMYAMIGTRPDLAYPVGIISRYMSQPAREHWLGVKWV-LRYIKGSLGTKLQYKRSSDF 1178
Query: 1156 CSISCGIC----GLCR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAE 1211
+ G C C+ R + + G I WKS Q VA+STTEAEYM++ E
Sbjct: 1179 KVV--GYCDADHAACKDRR--RSITGLVFTLGGSTISWKSGQQRVVALSTTEAEYMSLTE 1234
Query: 1212 AAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLAS 1271
A KEA+W+ GL+KE G EQ V++ CDSQSAI L+ N V+H RTKHIDVR+ IR+++A+
Sbjct: 1235 AVKEAVWMKGLLKEFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVRYQYIRDIIAN 1294
Query: 1272 RQILLQKIHTSENTTDKLTKPVTSDKF 1298
+ KI T +N D TK V +KF
Sbjct: 1295 GDGDVVKIDTEKNPADIFTKIVPVNKF 1321
>gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301697|pir||B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1335
Score = 855 bits (2208), Expect = 0.0
Identities = 522/1323 (39%), Positives = 736/1323 (55%), Gaps = 97/1323 (7%)
Query: 41 KPADIATVDWNEMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLF 100
K D V E +KA +I L V+D V+ I T + W+ L+ L+M ++ ++++
Sbjct: 33 KKRDADEVARLERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETLDRLFMIRSLPHRVY 92
Query: 101 AKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTT 160
+ Y+ KMQE + ++ F I+AD+ L + V DE +AI+LL SLP YD LV T
Sbjct: 93 TQLSFYTFKMQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILLLSSLPARYDGLVET 152
Query: 161 LTYG--KDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGK 218
+ Y ++ + LD + ++ + + + + EG F +G D G+ +
Sbjct: 153 MKYSNSREKLRLDDV---MVAARDKERELSQNNRPVVEGHFARGRPD-GKNNNQGNKGKN 208
Query: 219 KKRSKSKDRKTTECYSCKQIGHWKRDC-----PNRSGKSGNNSSSANVVQSDGS------ 267
+ RSKS D K C+ C + GH+K+ C N+S + G+++ +++ +S +
Sbjct: 209 RSRSKSADGKRV-CWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAKSTEAFNPAMV 267
Query: 268 --CSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKG 325
++E L+ S+ + WVLD+GCS+HMT ++WF FK GYV +G+D +KG
Sbjct: 268 LLATDETLVVTDSI--ANEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYSPVKG 325
Query: 326 MRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTV 385
+ +KI DG L+ VRY+ + +NLISLGTL + G FKS++ IL++ KG T+
Sbjct: 326 IGSIKIRNSDGSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDG--ILKIVKGCSTI 383
Query: 386 MRAKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVR 445
++ ++ +Y L G T G+ S D T LWH RLGH+S++GM L K+ L+
Sbjct: 384 LKGQKR-DTLYILDGVTEEGESHSSAEVKDETALWHSRLGHMSQKGMEILVKKGCLRREV 442
Query: 446 SCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV*G-FRYFVTFTDD 504
+ C+ CV GKQ RV F H TK L YVHSD+ G P+ G +YF++F DD
Sbjct: 443 IKELEFCEDCVYGKQHRVSFAPAQHVTKEKLAYVHSDLWGSPHNPASLGNSQYFISFVDD 502
Query: 505 FSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGI 564
+SRKVW+YF++ K E F KF WK VENQ+ RK+K LR+DNG EY + F FC+E GI
Sbjct: 503 YSRKVWIYFLRKKDEAFEKFVEWKKMVENQSDRKVKKLRTDNGLEYCNHYFEKFCKEEGI 562
Query: 565 QRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASL 624
RH + TPQQNG+AER+NRT+ +K R + + + K WA + A YL+NRSP ++
Sbjct: 563 VRHKTCAYTPQQNGIAERLNRTIMDKVRSMLSRSGMEKKFWAEAASTAVYLINRSPSTAI 622
Query: 625 DGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLW 684
+ + EE WTG DLS+LR F AY+H D+ KL+P+SK I Y +GVKGYK+W
Sbjct: 623 NFDLPEEKWTGALPDLSSLRKFGCLAYIHA---DQGKLNPRSKKGIFTSYPEGVKGYKVW 679
Query: 685 DPVKKKVIVSRDVVFDE*SMLKQ--------------SDVTVVPD------TEVENSSQD 724
KK ++SR+V+F E M K D+ V PD T+ ++QD
Sbjct: 680 VLEDKKCVISRNVIFREQVMFKDLKGDSQNTISESDLEDLRVNPDMNDQEFTDQGGATQD 739
Query: 725 KI---QVDIEETPVSPRQIVAQQQSEPGSDSGEVQD---YTLVRDREPSRITPPVRYGFE 778
+ PV +Q + DS V+D Y LVRDR I +Y
Sbjct: 740 NSNPSEATTSHNPVLNSPTHSQDEESEEEDSDAVEDLSTYQLVRDRVRRTIKANPKYNES 799
Query: 779 DLAAYALLTSSG---DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRV 835
++ +A + +P +Y EA+ + +KW +AM EEM S+ KN T +LV P ++
Sbjct: 800 NMVGFAYYSEDDGKPEPKSYQEALLDPDWEKWNAAMKEEMVSMSKNHTWDLVTKPEKVKL 859
Query: 836 IGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVAS 895
IGC+WV+ +K + E +F A LV KG++Q +G+DY+EIFSPVV+H SIR +L++V
Sbjct: 860 IGCRWVFTRKAGIPGVEAPRFIARLVAKGFTQKEGVDYNEIFSPVVKHVSIRYLLSMVVH 919
Query: 896 MDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKR 955
+M L+QMDVKT FLHG LEE+IY+ QPEGF VC LKRSLYGLKQSPRQW R
Sbjct: 920 YNMELQQMDVKTAFLHGFLEEEIYMAQPEGFEIKRGSNKVCLLKRSLYGLKQSPRQWNLR 979
Query: 956 FDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEF 1015
FD +M I Y R YD CVY + ++I+LLLYVDDMLIA+ + +VNELK L +EF
Sbjct: 980 FDEFMRGIKYTRSAYDSCVYFKKCNGDTYIYLLLYVDDMLIASANKSEVNELKQLLSREF 1039
Query: 1016 DMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSL 1075
+MKDLG AKKILGMEI +DR A L LSQ+ YV+ VL F M A VSTPL HFKL
Sbjct: 1040 EMKDLGDAKKILGMEISRDRDAGLLTLSQEGYVKKVLRSFQMDNAKPVSTPLGIHFKLKA 1099
Query: 1076 EQSPKIDSEIEGMSKIPYA--VQLVV*CMLWFALDQIW---------------HKQLVKC 1118
+ + + E M +PYA + ++ M+ D + H Q VK
Sbjct: 1100 ATDKEYEEQFERMKIVPYANTIGSIMYSMIGTRPDLAYSLGVISRFMSKPLKDHWQAVKW 1159
Query: 1119 QVHVQ--AREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYN 1176
+ + L + + +L GY S Y R I G
Sbjct: 1160 VLRYMRGTEKKKLCFRKQEDFLLRGYCDSDYGSNFDTR-----RSITGY----------- 1203
Query: 1177 RICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQL 1235
+++ GG I WKS +Q VA+S+TEAEYMA+ EA KEALWL G ELG Q V++
Sbjct: 1204 ---VFTVGGNTISWKSKLQKVVAISSTEAEYMALTEAVKEALWLKGFAAELGHSQDYVEV 1260
Query: 1236 HCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTS 1295
H DSQSAI L N V+H RTKHID+R H IR+++ + I + KI T N + TK V
Sbjct: 1261 HSDSQSAITLAKNSVHHERTKHIDIRLHFIRDIICAGLIKVVKIATECNPANIFTKTVPL 1320
Query: 1296 DKF 1298
KF
Sbjct: 1321 AKF 1323
>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease]
Length = 1328
Score = 853 bits (2204), Expect = 0.0
Identities = 478/1129 (42%), Positives = 679/1129 (59%), Gaps = 56/1129 (4%)
Query: 3 GAKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALR--DEKPADIATVDWNEMKEKAAGL 60
G K+EV +F+G F WQR +DLL Q+ L K L +KP + DW ++ E+AA
Sbjct: 3 GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASA 62
Query: 61 ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
I L +SDDV+N+I+D T + +W +LESLYMSKT NKL+ K++LY+L M EG + +H+
Sbjct: 63 IRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHL 122
Query: 121 YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPH 180
FN ++ + LGV +++EDKAI+LL SLP SYD+L TT+ +GK +I L ++S LL +
Sbjct: 123 NVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLLN 182
Query: 181 AQRRQSVEEGGGSSGEGLFVKG-GQDRGRGKGKAVDSGKKKRSKSKDR-KTTECYSCKQI 238
+ R+ E + G+ L +G G+ R SG + +SK++ + + CY+C Q
Sbjct: 183 EKMRKKPE----NQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQP 238
Query: 239 GHWKRDCPN-RSGK---SG--NNSSSANVVQSDGSC-----SEEDLLCVSSVKCTDAWVL 287
GH+KRDCPN R GK SG N+ ++A +VQ++ + EE+ + +S + WV+
Sbjct: 239 GHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPE--SEWVV 296
Query: 288 DSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYV 347
D+ S+H T R+ F + +GD G V +G+ I G+ + I + G L VR+V
Sbjct: 297 DTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHV 356
Query: 348 LEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDV 407
++ NLIS L +GY + R++KG++ + + G +Y+ G++
Sbjct: 357 PDLRMNLISGIALDRDGYESYFANQK--WRLTKGSLVIAKGV-ARGTLYRTNAEICQGEL 413
Query: 408 ASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKT 467
+ + D+ + LWH R+GH+SE+G+ L K++++ + + C YC+ GKQ RV F+T
Sbjct: 414 NAAQ-DEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQT 472
Query: 468 GHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLW 527
+ ILD V+SDV GP + S+ G +YFVTF DD SRK+WVY +K K +VF F+ +
Sbjct: 473 SSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKF 532
Query: 528 KAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTL 587
A VE +TGRK+K LRSDNG EYT + F +C +GI+ +V TPQ NGVAERMNRT+
Sbjct: 533 HALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTI 592
Query: 588 TEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFW 647
EK R + A L K W + ACYL+NRSP L ++ E VWT + S+L++F
Sbjct: 593 VEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFG 652
Query: 648 RPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQ 707
A+ H+ E R+KLD KS CI IGY GY+LWDPVKKKVI SRDVVF E +
Sbjct: 653 CRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTA 712
Query: 708 SDVT------------VVPDTEVENSSQDKIQVDIEETPVSPRQIVAQ-QQSEPGSD--- 751
+D++ +P T +S + ++ E P +++ Q +Q + G +
Sbjct: 713 ADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVE 772
Query: 752 ---SGEVQDYTLVRDREP---SRITPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKD 805
GE Q L R P SR P Y L++ +P + E ++ EK+
Sbjct: 773 HPTQGEEQHQPLRRSERPRVESRRYPSTE--------YVLISDDREPESLKEVLSHPEKN 824
Query: 806 KWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGY 865
+ M AM EEMESL+KN T LV+LP GKR + CKWV+K K K ++KA LV KG+
Sbjct: 825 QLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCK-LVRYKARLVVKGF 883
Query: 866 SQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEG 925
Q KGID+DEIFSPVV+ TSIR +L+L AS+D+ +EQ+DVKT FLHG+LEE+IY+EQPEG
Sbjct: 884 EQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEG 943
Query: 926 FSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFI 985
F G +VCKL +SLYGLKQ+PRQWY +FDS+M Y + D CVY + +FI
Sbjct: 944 FEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFI 1003
Query: 986 FLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQK 1045
LLLYVDDMLI + +LK L K FDMKDLG A++ILGM+I ++R ++KLWLSQ+
Sbjct: 1004 ILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQE 1063
Query: 1046 SYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
Y+E VL RF+M A VSTPL H KLS + P E M+K+PY+
Sbjct: 1064 KYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYS 1112
Score = 125 bits (315), Expect = 7e-27
Identities = 61/115 (53%), Positives = 80/115 (69%)
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
GG I W+S +Q VA+STTEAEY+A E KE +WL ++ELG+ Q ++CDSQSAI
Sbjct: 1202 GGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAI 1261
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
L+ N +YHARTKHIDVR+H IRE++ + + KI T+EN D LTK V +KF
Sbjct: 1262 DLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKF 1316
>gb|AAF19226.1| Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana]
gi|25301707|pir||E86490 hypothetical protein F28L22.3 -
Arabidopsis thaliana
Length = 1356
Score = 845 bits (2182), Expect = 0.0
Identities = 516/1382 (37%), Positives = 754/1382 (54%), Gaps = 133/1382 (9%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDWNEMK---------- 54
+ E+ F+G +F LW+ + L L+ L D + +E K
Sbjct: 7 RVEIKVFNGDRDFSLWKIRIQAQLGVLGLKDTLTDFSLTKTVPLTKSEAKQESGDGESSG 66
Query: 55 -------------EKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFA 101
E+A +I +SD V+ + T D+W L YM + N+++
Sbjct: 67 TKEVPDPVKIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIYT 126
Query: 102 KQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTL 161
+ +LYS KM + +V F I+A++ L + VD+E +AI++L SLP S+ L TL
Sbjct: 127 QLKLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTL 186
Query: 162 TYGKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGL----------FVKGGQDRGRGKG 211
YG ++T+ ++S+ + R+ E G+ V+ Q G+GKG
Sbjct: 187 KYGNKTLTVQDVTSSA--KSLERELAEAVDLDKGQAAVLYTTERGRPLVRNNQKGGQGKG 244
Query: 212 KAVDSGKKKRSKSKDRKTTECYSCKQIGHWKRDCPNRSGK-SGNNSSSANVVQSDGSCSE 270
RS+S + C+ CK+ GH K+DC +R K A V+ SE
Sbjct: 245 ---------RSRSNSKTKVPCWYCKKEGHVKKDCYSRKKKMESEGQGEAGVITEKLVFSE 295
Query: 271 EDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVK 330
L V+ D W+LDSGC+ HMT R+WF SF+ + LGDD +G ++
Sbjct: 296 A--LSVNEQMVKDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHSVESQGQGTIR 353
Query: 331 IALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKR 390
I G ++ L V+YV + +NLIS GTL + GY + E + +R K T +R
Sbjct: 354 IDTHGGTIKILENVKYVPHLRRNLISTGTLDKLGYRHEGGEGK--VRYFKNNKTALRGSL 411
Query: 391 TAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIG 450
+ G +Y L G T+M ++ + ETD T LWH RLGH+S + L + ++ +
Sbjct: 412 SNG-LYVLDGSTVMSELCNAETDKVKTALWHSRLGHMSMNNLKVLAGKGLIDRKEINELE 470
Query: 451 LCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRG-PTKEPSV*GFRYFVTFTDDFSRKV 509
C++CV+GK +V F G H ++ L YVH+D+ G P PS+ G +YF++ DD +RKV
Sbjct: 471 FCEHCVMGKSKKVSFNVGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLSIIDDKTRKV 530
Query: 510 WVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFS 569
W+YF+K K E F KF WK+ VENQ +K+K LR+DNG E+ + F +C+E+GI+RH +
Sbjct: 531 WLYFLKSKDETFDKFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERHRT 590
Query: 570 VRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVA 629
TPQQNGVAERMNRT+ EK RCL + + + WA A YL+NRSP ++++ V
Sbjct: 591 CTYTPQQNGVAERMNRTIMEKVRCLLNKSGVEEVFWAEAAATAAYLINRSPASAINHNVP 650
Query: 630 EEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKK 689
EE+W +LR F AYVH +D+ KL P++ +GY G KGYK+W ++
Sbjct: 651 EEMWLNRKPGYKHLRKFGSIAYVH---QDQGKLKPRALKGFFLGYPAGTKGYKVWLLEEE 707
Query: 690 KVIVSRDVVFDE*SMLKQSDVTVVPD-----TEVENSSQDKIQVDIEETPVSPRQIVAQQ 744
K ++SR+VVF E + + D+ V D + E +S + Q E S I Q
Sbjct: 708 KCVISRNVVFQESVVYR--DLKVKEDDTDNLNQKETTSSEVEQNKFAEASGSGGVIQLQS 765
Query: 745 QSEP------GSDSGEVQDYT----------------LVRDREPSRITPPVRYGFEDLAA 782
SEP SDS E +Y+ L RDR I PP R+ E
Sbjct: 766 DSEPITEGEQSSDSEEEVEYSEKTQETPKRTGLTTYKLARDRVRRNINPPTRFTEESSVT 825
Query: 783 YALLTSSG----DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGC 838
+AL+ +P +Y EAM S + +KW A +EM+SL KN T +LV P +++IGC
Sbjct: 826 FALVVVENCIVQEPQSYQEAMESQDCEKWDMATHDEMDSLMKNGTWDLVDKPKDRKIIGC 885
Query: 839 KWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDM 898
+W++K K + E +FKA LV KGY+Q +G+DY EIF+PVV+H SIR++++LV D+
Sbjct: 886 RWLFKLKSGIPGVEPTRFKARLVAKGYTQREGVDYQEIFAPVVKHVSIRILMSLVVDKDL 945
Query: 899 HLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDS 958
LEQMDVKTTFLHG+LEE++Y+EQPEGF VC+LK+SLYGLKQSPRQW KRFD
Sbjct: 946 ELEQMDVKTTFLHGDLEEELYMEQPEGFVSDSSENKVCRLKKSLYGLKQSPRQWNKRFDR 1005
Query: 959 YMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMK 1018
+M + R ++D CVYV + + FI+LLLYVDDMLIA ++N +K +L EF+MK
Sbjct: 1006 FMSSQQFIRSEHDACVYVKHVSEHDFIYLLLYVDDMLIAGASKAEINRVKEQLSTEFEMK 1065
Query: 1019 DLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSL--E 1076
D+G A +ILG++I++DR L LSQ+ Y+ VL RF+MS A + P+ HFKL+ E
Sbjct: 1066 DMGGASRILGIDIYRDRKGGVLKLSQEIYIRKVLDRFNMSGAKMTNAPVGAHFKLAAVRE 1125
Query: 1077 QSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIW---------------HKQLVKCQ 1119
+ +D+++ +PY AV ++ ML D + H + VK
Sbjct: 1126 EDECVDTDV-----VPYSSAVGSIMYAMLGTRPDLAYAICLISRYMSKPGSMHWEAVKWV 1180
Query: 1120 VHV--QAREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNR 1177
+ A++ L ++ + GY S Y A I G
Sbjct: 1181 MRYLKGAQDLNLVFTKEKDFTVTGYCDSNY-----AADLDRRRSISGY------------ 1223
Query: 1178 ICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLH 1236
+++ GG + WK+S+Q VAMSTTEAEY+A+AEAAKEA+W+ GL++++G++Q V++
Sbjct: 1224 --VFTIGGNTVSWKASLQPVVAMSTTEAEYIALAEAAKEAMWIKGLLQDMGMQQDKVKIW 1281
Query: 1237 CDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSD 1296
CDSQSAI L+ N VYH RTKHIDVRF+ IR+++ S + + KIHTS N D LTK + +
Sbjct: 1282 CDSQSAICLSKNSVYHERTKHIDVRFNYIRDVVESGDVDVLKIHTSRNPVDALTKCIPVN 1341
Query: 1297 KF 1298
KF
Sbjct: 1342 KF 1343
>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
Length = 1328
Score = 811 bits (2096), Expect = 0.0
Identities = 468/1129 (41%), Positives = 655/1129 (57%), Gaps = 55/1129 (4%)
Query: 3 GAKFEVTRFDGTGN-FGLWQRMAKDLLAQKSLQKAL--RDEKPADIATVDWNEMKEKAAG 59
G K+EV +F+G F +WQR KDLL Q+ L KAL + +KP + DW E+ EKAA
Sbjct: 3 GVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAAS 62
Query: 60 LITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAH 119
I L ++DDV+N+I+D + +W KLE+LYMSKT NKL+ K++LY+L M EG + +H
Sbjct: 63 AIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSH 122
Query: 120 VYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLP 179
+ N ++ + LGV +++EDK I+LL SLP SYD L TT+ +GKDSI L ++S LL
Sbjct: 123 LNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLKDVTSALLL 182
Query: 180 HAQRRQSVEEGG--------GSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTE 231
+ + R+ E G G S + G+ RGK K RSKSK R
Sbjct: 183 NEKMRKKPENHGQVFITESRGRSYQRSSSNYGRSGARGKSKV-------RSKSKARN--- 232
Query: 232 CYSCKQIGHWKRDCPN-------RSGKSGNNSSSANVVQSDGSCS--EEDLLCVSSVKCT 282
CY+C Q GH+KRDCPN SG+ +++++A V +D E+ C+
Sbjct: 233 CYNCDQPGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTE 292
Query: 283 DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLS 342
WV+D+ SYH T R+ F + +GD G V +G+ I G+ + + G L
Sbjct: 293 SEWVVDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLK 352
Query: 343 QVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGT 402
VR+V ++ NLIS L ++GY + R++KGA+ + + G +Y+
Sbjct: 353 DVRHVPDLRMNLISGIALDQDGYENYFANQK--WRLTKGALVIAKGV-ARGTLYRTNAEI 409
Query: 403 IMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCR 462
G++ + ++ A LWH R+GH SE+G+ L K++++ + I C Y + GKQ R
Sbjct: 410 CQGELNAAHEENSAD-LWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHR 468
Query: 463 VRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFA 522
V F+T + ILD V+SDV GP + S+ G +YFVTF DD SRK+WVY + K +VF
Sbjct: 469 VSFQTSSERKSNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQ 528
Query: 523 KFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAER 582
F+ + A VE +TGRK K LR+DNG EYT + F +C +GI+ +V TPQ NGVAER
Sbjct: 529 VFQKFHALVERETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGTPQHNGVAER 588
Query: 583 MNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSN 642
MNRT+ EK R + A L K W + ACYL+NRSP L+ + E VWT + S+
Sbjct: 589 MNRTIVEKVRSMLRMAKLPKTFWGEAVRTACYLINRSPSVPLEFDIPERVWTNKEMSYSH 648
Query: 643 LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE* 702
L++F A+ H+ E R+KLD KS CI IGY GY+LWD VKKKVI SRDV+F E
Sbjct: 649 LKVFGCKAFAHVPKEQRTKLDDKSVPCIFIGYGDEEFGYRLWDLVKKKVIRSRDVIFRES 708
Query: 703 SMLKQSDVT--------------VVPDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEP 748
+ +D++ +P + +S + ++ E P +IV +Q E
Sbjct: 709 EVGTAADLSEKAKKKNGIIPNLVTIPSSSNHPTSAESTIDEVVEQEEQPDEIV--EQGEQ 766
Query: 749 GSDSGEVQDYTLVRDREPSRITPPVRYGFEDL--AAYALLTSSGDPSTYHEAMAS*EKDK 806
D+ E +Y +P R + R + Y L+ G+P E ++ EK +
Sbjct: 767 LGDNTEQMEYPEEEQSQPLRRSERQRVESTKYPSSEYVLIKYEGEPENLKEVLSHPEKSQ 826
Query: 807 WMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYS 866
WM AM EEM SL+KN T LV+LP GKR + CKWV+K K K ++KA LV KG+
Sbjct: 827 WMKAMHEEMGSLQKNGTYQLVELPKGKRPLKCKWVFKLKKDGNGK-LVRYKARLVVKGFE 885
Query: 867 QHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF 926
Q KGID+DEIFSPVV+ TSIR +L++ AS+D+ +EQ+DVKT FLHG+LEE+IY+EQ EGF
Sbjct: 886 QKKGIDFDEIFSPVVKMTSIRTILSIAASLDLEVEQLDVKTAFLHGDLEEEIYMEQGEGF 945
Query: 927 SETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIF 986
+G +VCKL +SLYGLKQ+PRQWYK+FDS+M YR CVY D +FI
Sbjct: 946 EVSGKKHMVCKLNKSLYGLKQAPRQWYKKFDSFMKSQTYRNTYSHPCVYFKRFSDKNFII 1005
Query: 987 LLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKS 1046
LLLY D MLI + +L+ K FDMKDLG AK+ILGM+I ++ KKL LS +
Sbjct: 1006 LLLYTDYMLIVGKDKELIAKLRKDFSKSFDMKDLGPAKQILGMKIAREE-QKKLGLSHEK 1064
Query: 1047 YVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSP-KIDSEIEGMSKIPYA 1094
Y+E VL RF+M A +STPL ++ KL+ + P K E M+K+PY+
Sbjct: 1065 YIERVLERFNMKSAKPISTPLVSYLKLTKQMFPTKKKGEKGDMAKVPYS 1113
Score = 112 bits (279), Expect = 1e-22
Identities = 55/115 (47%), Positives = 78/115 (67%)
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
GG I W+S +Q VA+STTEA+Y+A E KE LWL ++E G+ Q ++C+SQSA+
Sbjct: 1202 GGDISWQSKLQKYVALSTTEAKYIAGTEVCKEMLWLKRFLQEHGLHQKEYVVYCESQSAM 1261
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
L+ +YHA TKHID+R+H IRE++ + + KI TSEN D +TK V ++KF
Sbjct: 1262 DLSKKAMYHATTKHIDMRYHWIREMVDDGSLQVVKIPTSENPADMVTKVVQNEKF 1316
>emb|CAB79135.1| putative transposable element [Arabidopsis thaliana]
gi|3402755|emb|CAA20201.1| putative transposable element
[Arabidopsis thaliana] gi|7444415|pir||T05178
hypothetical protein T6K22.90 - Arabidopsis thaliana
Length = 1308
Score = 795 bits (2054), Expect = 0.0
Identities = 488/1347 (36%), Positives = 741/1347 (54%), Gaps = 108/1347 (8%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRD------------EKPADIATVDWNE 52
K E+ F+G +F LW+ + L L+ AL D EK + D +
Sbjct: 6 KVEIKTFNGDRDFSLWKIRIEAQLGVLGLKPALSDFTLTKTILVVKSEKKESESEDDETD 65
Query: 53 MK--EKAAGLITLCVSDD----VMNHILDLTTLK--------DVWDKLESLYMSKTPMNK 98
K E+ I SD ++NHI D LK ++W L L+M + N+
Sbjct: 66 SKKTEEVPDPIKFEQSDQAKNFIINHITDTVLLKVQHCVTAAELWATLNKLFMETSLPNR 125
Query: 99 LFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLV 158
++ + RLYS KM + + + F I+A++ L + V +E +AI++L SLP SY L
Sbjct: 126 IYTQLRLYSFKMVDNLSIDQNTDEFLRIVAELGSLQIQVGEEVQAILILNSLPPSYIQLK 185
Query: 159 TTLTYGKDSITL-DSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSG 217
TL YG ++++ D +SS + + E + L+ +RGR + K
Sbjct: 186 HTLKYGNKTLSVQDVVSSAKSLERELSEQKETIRAPASTALYTA---ERGRPQTKNTQGQ 242
Query: 218 KKKRSKSKDRKTTECYSCKQIGHWKRDCPNRSGKSGNNSSS-ANVVQSDGSCSEEDLLCV 276
K R +S + C+ CK+ GH K+DC K N A V+ SE L +
Sbjct: 243 GKGRGRSNSKSRLTCWFCKKEGHVKKDCYAGKRKLENEGQGKAGVITEKLVYSEA--LSM 300
Query: 277 SSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDG 336
+ D WV+DSGC+YHMT +WF+ F + + LGDD KG VK+ G
Sbjct: 301 YDQEAKDKWVIDSGCTYHMTSRMDWFSEFNENETTMILLGDDHTVESKGSGTVKVNTHGG 360
Query: 337 GVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIY 396
+R L VR+V + +NLIS GTL + GY + + + +R K T + G +Y
Sbjct: 361 SIRVLKNVRFVPNLRRNLISTGTLDKLGYKHEGGDGK--VRFYKENKTALCGNLVNG-LY 417
Query: 397 KLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCV 456
L G T++ + +VE ++ T+LWH RLGH+S M L ++ +L+ + C+ CV
Sbjct: 418 VLDGHTVVNENCNVEGSNEKTELWHCRLGHMSLNNMKILAEKGLLEKKDIKELSFCENCV 477
Query: 457 LGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKY 516
+GK ++ F G H T +L Y+H+D+ G +YF++ DD SRKVW+ F+K
Sbjct: 478 MGKSKKLSFNVGKHITDEVLGYIHADLWGK---------QYFLSIIDDKSRKVWLMFLKT 528
Query: 517 KSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQ 576
K E F +F WK VENQ +K+K LR+DNG E+ + F FC++NGI+RH + TPQQ
Sbjct: 529 KDETFERFCEWKELVENQVNKKVKILRTDNGLEFCNLKFDEFCKQNGIERHRTCTYTPQQ 588
Query: 577 NGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGN 636
NGVA+RMNRTL EK RCL + L + WA A YLVNRSP +++D V EE+W
Sbjct: 589 NGVAKRMNRTLMEKVRCLLNESGLEEVFWAEAAATAAYLVNRSPASAVDHNVPEELWLDK 648
Query: 637 PIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRD 696
+LR F AYVH+ D+ KL P++ + +GY +G KGYK+W ++K ++SR+
Sbjct: 649 KPGYKHLRRFGCIAYVHL---DQGKLKPRALKGVFLGYPQGTKGYKVWLLDEEKCVISRN 705
Query: 697 VVFDE*SMLK------QSDVTVVPDTEVENSSQDKIQ----------VDIEETPVSPRQI 740
+VF+E + K + V + D E N Q ++ V IEE
Sbjct: 706 IVFNENQVYKDIRESSEQSVKDISDLEGYNEFQVSVKEHGECSKTGGVTIEEIDQESDSE 765
Query: 741 VAQQQSEPGSDSGEVQDYTLVRDREPSRITPPVRYGFEDLAAYALLTS----SGDPSTYH 796
+ Q EP S ++ +Y RDRE PP + A AL+ + S +P YH
Sbjct: 766 NSVTQ-EPLIASIDLSNYQSARDRERRAPNPPQKLADYTHFALALVMAEEIESEEPQCYH 824
Query: 797 EAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKF 856
+A KW M EE++SL KN T ++V+ P ++VI C+W++K K + E +++
Sbjct: 825 DAKKDKHWIKWNGGMKEEIDSLLKNGTWDIVEWPKEQKVISCRWLFKLKPGIPGVEAQRY 884
Query: 857 KAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEE 916
KA LV +G++Q KGIDY+E+F+PVV+H SIR++++ V DM LEQMDVKTTFLHG L++
Sbjct: 885 KARLVARGFTQQKGIDYEEVFAPVVKHISIRILMSAVVKDDMELEQMDVKTTFLHGELDQ 944
Query: 917 QIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYV 976
+Y+EQPEGF + VC LK+SLYGLKQ+PRQW K+F ++ML + + R ++D CVYV
Sbjct: 945 VLYMEQPEGFEVNPEKDQVCLLKKSLYGLKQAPRQWNKKFHAFMLSLQFARSEHDSCVYV 1004
Query: 977 MSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRG 1036
++ G F++LLLYVDDML+AA ++++LK L +F+MKD+GAA +ILG++I ++R
Sbjct: 1005 KEVNPGEFVYLLLYVDDMLLAAKSKSEISKLKEALSLKFEMKDMGAASRILGIDIIRNRK 1064
Query: 1037 AKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLS--LEQSPKIDSEIEGMSKIPY- 1093
L LSQ YV+ V+ RF M+ A VSTP+ HFKL+ +++ +D E+ +PY
Sbjct: 1065 EGTLRLSQTRYVDKVIQRFRMADAKVVSTPMGAHFKLTSLIDEIGSVDPEV-----VPYS 1119
Query: 1094 -AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QG 1152
AV V+ M+ D + LV +R + + ++ ++GY S +
Sbjct: 1120 SAVGSVMYAMIGTIPDVAYAMGLV-------SRFMSRPGANLE---VQGYCDSDH----- 1164
Query: 1153 ARCCSISCGICGLCR*SR**KVYNRICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAE 1211
A I G +++ GG + WKSS+Q VA+S+T+AE++A+ E
Sbjct: 1165 AADLDKRRSISGY--------------VFTVGGNTVSWKSSLQHVVALSSTQAEFIALTE 1210
Query: 1212 AAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLAS 1271
A KEA+W+ GL++++G++ + CDSQSAI L+ N +H RTKH++V+F+ IR+++ +
Sbjct: 1211 AVKEAIWIRGLLEDMGLQPKPATVWCDSQSAICLSKNNAFHDRTKHVEVKFYFIRDIIEA 1270
Query: 1272 RQILLQKIHTSENTTDKLTKPVTSDKF 1298
++ ++KIHTS N D LTK + KF
Sbjct: 1271 GEVKVRKIHTSVNPADMLTKCIPVKKF 1297
>pir||T02206 hypothetical protein - common tobacco retrotransposon Tto1
gi|1167523|dbj|BAA11674.1| ORF(AA 1-1338) [Nicotiana
tabacum]
Length = 1338
Score = 782 bits (2020), Expect = 0.0
Identities = 463/1132 (40%), Positives = 652/1132 (56%), Gaps = 69/1132 (6%)
Query: 7 EVTRFDGTGNFGLWQRMAKDLL-AQKSLQKALRDEKPADIATVDWNEMKEKAAGLITLCV 65
++ +GT N+ LW+ KDLL K +KP D + DW + G I V
Sbjct: 7 KMVNLNGT-NYHLWRNKMKDLLFVTKMHLPVFSSQKPEDKSDEDWEFEHNQVCGYIRQFV 65
Query: 66 SDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNN 125
D+V NHI +T + +WDKLE LY SKT NKLF +L +K EG + H+
Sbjct: 66 EDNVYNHISGVTHARSLWDKLEELYASKTGNNKLFYLTKLMQVKYVEGTTVADHLNEIQG 125
Query: 126 ILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDS--ITLDSISSTLLPHAQR 183
I+ ++ +G+ DDE A+++L +LP S++ L ++T + + ++++ S +L R
Sbjct: 126 IVDQLSGMGIKFDDEVLALMVLATLPESWETLKVSITNSAPNGVVNMETVKSGILNEEMR 185
Query: 184 RQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQIGHWKR 243
R+S G+S V RGR + K+ + K R KS EC+ CK+ GH KR
Sbjct: 186 RRSQ----GTSSSQSEVLAVTTRGRSQNKSQSNRDKSRGKSNKFANVECHYCKKKGHIKR 241
Query: 244 DC--------PNRSGKSGNNSSSANVVQSDGSCS---EEDLLCVSSVKCTDAWVLDSGCS 292
C N+ K SS + S G + ++D++ +++ + T WV+DSG +
Sbjct: 242 FCRQFQNDQKKNKGKKVKPEESSDDETNSFGEFNVVYDDDIINLTTQEMT--WVIDSGAT 299
Query: 293 YHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMK 352
H T RE F+S+ GD G V +G+ + G V + +G L VR+V ++
Sbjct: 300 IHATPRRELFSSYTLGDFGRVKMGNANFSTVVGKGDVCLETMNGMKLLLRDVRHVPDMRL 359
Query: 353 NLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVET 412
NLIS+ L E GY + +++KG++ V R + + +Y + +I V +V
Sbjct: 360 NLISVDKLDEEGYCNTFHNGQ--WKLTKGSLMVARGTKQS-KLY-VTQASISQQVINVAE 415
Query: 413 DDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKT-GHHK 471
+D KLWH RLGH+SE+ M L K+N L G+ + C C+ GKQ RV FK +
Sbjct: 416 NDSNIKLWHRRLGHMSEKSMARLVKKNALPGLNQIQLKKCADCLAGKQNRVSFKRFPPSR 475
Query: 472 TKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEV 531
+ +LD VHSDV GP K+ S+ G RYFVTF DD SRK WVY +K K +VF FK + V
Sbjct: 476 RQNVLDLVHSDVCGPFKK-SLGGARYFVTFIDDHSRKTWVYTLKTKDQVFQVFKQFLTLV 534
Query: 532 ENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKA 591
E +TG+K+K +R+DNG EY + F +C+E+GI+ F+ KTPQ NG+AERMNRTL E+
Sbjct: 535 ERETGKKLKCIRTDNGGEYQGQ-FDAYCKEHGIRHQFTPPKTPQLNGLAERMNRTLIERT 593
Query: 592 RCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAY 651
RCL ++ L K W + A Y++N SP L K E++W G I LR+F AY
Sbjct: 594 RCLLSHSKLPKAFWGEALVTAAYVLNHSPCVPLQYKAPEKIWLGRDISYDQLRVFGCKAY 653
Query: 652 VHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVT 711
VH+ ++RSKLD K++ C+ IGY + + GYK +DPV+KK++ SRDVVF E D T
Sbjct: 654 VHVPKDERSKLDVKTRECVFIGYGQDMLGYKFYDPVEKKLVRSRDVVFVE-------DQT 706
Query: 712 VVPDTEVENSSQDKIQVDIEETPVSPRQI---VAQQQSE-PG-------------SDSGE 754
+ +VE S+ D + ++ T V PRQ+ V Q E PG D+G+
Sbjct: 707 IEDIDKVEKSTDDSAEFELPPTVV-PRQVGDDVQDNQPEAPGLPNEDELADTEGNEDNGD 765
Query: 755 VQDYTLVRDREPSRITPP------------VRYGFEDLAAYALLTSSGDPSTYHEAMAS* 802
+ + P PP RY + Y LLT G+P ++ EA+
Sbjct: 766 DDADEEDQPQPPILNNPPYHTRSGRVVQQSTRYSPHE---YVLLTDGGEPDSFEEAIDDE 822
Query: 803 EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVT 862
K+KW+ AM +E++SL +N+T LV+LP GKR + KWV+K K +FKA LV
Sbjct: 823 HKEKWIEAMQDEIKSLHENKTFELVKLPKGKRALKNKWVFKMKHDEH-NSLPRFKARLVV 881
Query: 863 KGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQ 922
KG++Q KGID+DEIFSPVV+ TSIR VL L AS+++ +EQMDVKT FLHG+LEE+IY+EQ
Sbjct: 882 KGFNQRKGIDFDEIFSPVVKMTSIRTVLGLAASLNLEVEQMDVKTAFLHGDLEEEIYMEQ 941
Query: 923 PEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDG 982
P+GF + G VC+L++SLYGLKQ+PRQWYK+F+S M + GY++ D CV+ D
Sbjct: 942 PDGFQQKGKEDYVCRLRKSLYGLKQAPRQWYKKFESVMGQHGYKKTTSDHCVFAQKFSDD 1001
Query: 983 SFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWL 1042
FI LLLYVDDMLI ++ +N LK +L K F MKDLG AK+ILGM I +DR AKKLWL
Sbjct: 1002 DFIILLLYVDDMLIVGRNVSRINSLKEQLSKFFAMKDLGPAKQILGMRIMRDREAKKLWL 1061
Query: 1043 SQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
SQ+ Y+E VL RF+M K VS PL NHF+LS +QSP D E M +IPYA
Sbjct: 1062 SQEKYIEKVLQRFNMEKTKAVSCPLANHFRLSTKQSPSTDDERRKMERIPYA 1113
Score = 129 bits (324), Expect = 7e-28
Identities = 60/117 (51%), Positives = 82/117 (69%)
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
GG + W+S +Q VA+STTEAE++A EA KE +W+ + ELG Q G QL CDSQSAI
Sbjct: 1203 GGAVSWQSKLQKCVALSTTEAEFIAATEACKELIWMKKFLTELGFSQDGYQLFCDSQSAI 1262
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKFMF 1300
+L N +H+R+KHIDVR++ IR++L + + L+KIHT EN +D LTK + KF F
Sbjct: 1263 HLAKNASFHSRSKHIDVRYNWIRDVLEKKMLRLEKIHTDENGSDMLTKTLPKGKFEF 1319
>gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]
Length = 1415
Score = 762 bits (1968), Expect = 0.0
Identities = 439/1113 (39%), Positives = 646/1113 (57%), Gaps = 51/1113 (4%)
Query: 16 NFGLWQRMAKDLLAQKSLQ-KALRDEKPADIATVDWNEMKEKAAGLITLCVSDDVMNHIL 74
N+ +W+ KDLL K L KP +++ +W+ ++ G I V D+V+NHI+
Sbjct: 15 NYHIWKAKMKDLLFVKKLHLPVFASAKPENMSDEEWDFEHQQVCGYIRQWVEDNVLNHII 74
Query: 75 DLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLG 134
+ T + +W+KLE+LY SKT NKLF +++ +++ +EG + HV F +L ++ +G
Sbjct: 75 NETHARSLWNKLETLYASKTGNNKLFLLKQMMNIRYREGTLINDHVNDFQGVLDQLSGMG 134
Query: 135 VTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDS--ITLDSISSTLLPHAQRRQSVEEGGG 192
+ +DE + LL +LP S++ +LT + +T++ + S +L RR+S +
Sbjct: 135 IKFEDEVLGLWLLNTLPDSWETFRVSLTNSAPNGVVTMEYVKSGILNEEARRRSQDTSTS 194
Query: 193 SSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDR-KTTECYSCKQIGH-------WKRD 244
S + DRGR K K K RSKS+ R K EC+ C + H WKR+
Sbjct: 195 QSD----ILVTDDRGRNKQKGQRGRDKSRSKSRSRYKDIECHYCGKKSHIKKYSFKWKRE 250
Query: 245 CPNRSGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDA-WVLDSGCSYHMTQHREWFN 303
+ K G+ + V++D + +D V +V C + W++DSG +YH+T +E+F
Sbjct: 251 -KKQDNKDGDTGNQVATVRADLLVACDDN--VINVACHETTWIVDSGAAYHVTPRKEFFT 307
Query: 304 SFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHEN 363
S+ GD G + +G+D + G V + +G L V++ ++ NLIS G L ++
Sbjct: 308 SYTPGDFGELRMGNDGQVKVTGTGTVCLETSNGTKLVLKNVKHAPDIRLNLISTGKLDDD 367
Query: 364 GYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMR 423
G+ + +++KG++ V R +++ N+Y L ++ D +V + A++LWH R
Sbjct: 368 GFCCFFGDGH--WKITKGSLVVARGNKSS-NLYSLQS-SVSDDSVNVVEKECASELWHKR 423
Query: 424 LGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGI--LDYVHS 481
LGH+S +G+ L K++ L GV+ + C +C+ GKQ RV F + H T+ LD +HS
Sbjct: 424 LGHMSVKGIDYLAKKSKLSGVKEAKLDKCVHCLAGKQRRVSFMS-HPPTRKSEPLDLIHS 482
Query: 482 DVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKY 541
DV GP K S+ G YFVTF DD+SRK+WVY +K+KS+V FK + A VE QTG+K+K
Sbjct: 483 DVCGPMKVRSLGGASYFVTFIDDYSRKLWVYTLKHKSDVLGVFKEFHALVERQTGKKLKC 542
Query: 542 LRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLS 601
+R+DNG EY F +C GI+ + K PQ NG+AERMNRT+ E+ RC+ +A L
Sbjct: 543 IRTDNGGEYCGP-FDEYCRRYGIRHQKTPPKIPQLNGLAERMNRTIMERVRCMLDDAKLP 601
Query: 602 KCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSK 661
WA ++ A +++N SP +L +V ++VW G + +LR+F A+VH+ ++RSK
Sbjct: 602 SSFWAEAVSTAVHVINLSPVIALKNEVPDKVWCGKDVSYDHLRVFGCKAFVHVPRDERSK 661
Query: 662 LDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVPDTEVENS 721
LD K++ CI IGY GY+L+DPV+KK++ SRDVVF E ++ D P++ S
Sbjct: 662 LDSKTRQCIFIGYGFDEFGYRLYDPVEKKLVRSRDVVFFENQTIEDIDKVKQPESRDSGS 721
Query: 722 SQDKIQV------DIEETP--VSPRQIVAQQQSEPGSDSGEVQDYTLVRDREPSRIT--- 770
D V D++E V V Q + G D PS++
Sbjct: 722 LVDIEPVSRRYTDDVDEVQENVQNGDPVPDYQGDTVDVDGHADDVVHQEQEVPSQVPVDL 781
Query: 771 ---------PPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKN 821
P RY + Y LLT G+P +Y EAM S +K +W AM EEM SL N
Sbjct: 782 PRRSDRERRPSTRYS---PSQYVLLTDGGEPESYEEAMESDQKRQWFEAMQEEMNSLYVN 838
Query: 822 ET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVV 881
+T LV+ P ++ + +WVY+ K +FKA LV KG+SQ KGID+DEIFSPVV
Sbjct: 839 DTFELVKAPKNRKALKNRWVYRVKHEEG-TSVPRFKARLVVKGFSQKKGIDFDEIFSPVV 897
Query: 882 RHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRS 941
+ +SIRVVL L A +D+ +EQMDVKT FLHG+L+E+IY+EQPEGF G VC+LK+S
Sbjct: 898 KFSSIRVVLGLAARLDIEIEQMDVKTAFLHGDLDEEIYMEQPEGFKVKGKEDYVCRLKKS 957
Query: 942 LYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHL 1001
LYGLKQ+PRQWYK+F S M + GY++ D CV+V D F+ LLLYVDDMLI +
Sbjct: 958 LYGLKQAPRQWYKKFTSVMSKHGYKKTSSDHCVFVNRYSDDDFVILLLYVDDMLIVGRNA 1017
Query: 1002 HDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKAN 1061
+ ELK +L K F MKD+G AK+ILGM+I +DR KKLWLSQ+ Y+E VL RF M++A
Sbjct: 1018 SRIQELKQELSKSFSMKDMGPAKQILGMKIIRDRQNKKLWLSQEKYIEKVLERFHMNEAK 1077
Query: 1062 HVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
VSTPL HFKL +Q P + E E M ++PY+
Sbjct: 1078 PVSTPLDMHFKLCKKQCPSSEKEKEEMQRVPYS 1110
Score = 84.3 bits (207), Expect = 2e-14
Identities = 44/115 (38%), Positives = 65/115 (56%), Gaps = 22/115 (19%)
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
GG + W+S +Q V +STTEAE++A EA+KE LW+ ++ELG Q
Sbjct: 1200 GGAVSWQSRLQKCVDLSTTEAEFIASVEASKEMLWMKKFLQELGFVQD------------ 1247
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
R+KHID R+H IR++L + + L+KIHT +N +D +TK + KF
Sbjct: 1248 ----------RSKHIDTRYHWIRDILECKMLELEKIHTDDNGSDMMTKALPRGKF 1292
>emb|CAA31653.1| polyprotein [Arabidopsis thaliana] gi|99721|pir||S05465
retrovirus-related polyprotein - Arabidopsis thaliana
retrotransposon Ta1-3
Length = 1291
Score = 746 bits (1925), Expect = 0.0
Identities = 461/1243 (37%), Positives = 680/1243 (54%), Gaps = 91/1243 (7%)
Query: 52 EMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQ 111
E E A +I V D V+ I + ++W+ L YM + N+++ + + YS KM
Sbjct: 85 EKSENAMNIIIAHVGDAVLRKIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKFYSFKMN 144
Query: 112 EGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLD 171
+ + +V F I+A+++ L + V +E +AI+ L L Y L TL YG +++L
Sbjct: 145 DTKSINENVNEFLKIVAELSSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGNKALSLK 204
Query: 172 SISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGK----KKRSKSKDR 227
+ S + +E ++ L+ +R R + + + K + RSKS
Sbjct: 205 DVISAARSLERELNEQKETDKNTSTVLYTN---ERSRPQTRNQNHNKGGQGRGRSKSNSN 261
Query: 228 KTTECYSCKQIGHWKRDCPNRSGK-SGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAWV 286
C+ CK+ GH K+D R K N A V+ SE L V+ + D WV
Sbjct: 262 AKLTCWYCKKEGHVKKDYFARKRKLESENPGEAGVITEKLVFSEA--LSVNDLAVRDIWV 319
Query: 287 LDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRY 346
LDSGC+ HM+ R+WF SF+ + LGDD +G +KI G + L V+Y
Sbjct: 320 LDSGCTSHMSARRDWFCSFREDGGPTILLGDDHSVKSQGQGSIKIETHGGTIIGLENVKY 379
Query: 347 VLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGD 406
V E+ +NLIS GTL + GY + + + +R K T +R + G +Y L G T++ +
Sbjct: 380 VPELRRNLISTGTLDKRGYKHEGGDGK--VRYFKNQKTALRGELVNG-LYILDGNTVLSE 436
Query: 407 VASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFK 466
E T+LWH RLGH+ M L + ++ ++ C+ CV+GK +V F
Sbjct: 437 TCVAEGSKGKTELWHSRLGHIGLNNMKVLAGKGLVSKEEIRVLDFCENCVMGKAKKVSFN 496
Query: 467 TGHHKTKGILDYVHSDVRGPTK-EPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFK 525
G H ++ +L YVH+D+ G T PS+ G +YF++ DD +RKVW+YF++ K E F +F
Sbjct: 497 VGKHNSEDVLRYVHADLWGSTNVTPSLSGNKYFLSIIDDKTRKVWLYFLRSKDETFDRFC 556
Query: 526 LWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNR 585
WK VENQ +K+K LR+DNG E+ + F +C+E+GI+RH + TPQQNGVAERMNR
Sbjct: 557 EWKELVENQQNKKVKCLRTDNGLEFCNLKFDAYCKEHGIERHKTCTYTPQQNGVAERMNR 616
Query: 586 TLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRI 645
T+ EK RC+ + L + WA A YL+NRSP +++D V EE+W +LR
Sbjct: 617 TIMEKVRCMLNESGLGEEFWAEAAATAAYLINRSPASAIDHNVPEELWLNKKPGYKHLRR 676
Query: 646 FWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SML 705
F AYVHI D+ KL P++ I IGY G KGYK+W + K ++SR+V+F E S+
Sbjct: 677 FGSIAYVHI---DQGKLKPRALKGIFIGYPAGTKGYKIWLLEEHKCVISRNVLFHEESVY 733
Query: 706 KQS-DVTVVPDTEVENSSQDK-----------------IQVDIEETPVSPRQIVAQQQSE 747
K + V ++E E +S K IQV EE S + +Q+ E
Sbjct: 734 KDTMKKERVVESEAEPASHSKSTLIKVKTPGNLNSGEVIQVSDEEE--SDESVEEEQEPE 791
Query: 748 PGSDSGEVQ------DYTLVRDREPSRITPPVRYGFEDLAAYALLT----SSGDPSTYHE 797
+ E Q +Y L RDRE +I PP R+ E A+AL+T S +P +Y E
Sbjct: 792 TQVELPETQTTSSLANYQLARDRERRQIHPPARFTEESGVAFALVTVETLSMEEPQSYQE 851
Query: 798 AMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFK 857
A + E KW A EEM+SL KN T LV P +++IGC+W++K K E ++K
Sbjct: 852 ATSDKEWKKWKLATHEEMDSLIKNGTWVLVDKPQNRKIIGCRWLFKLKSGSPGVEPVRYK 911
Query: 858 AHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQ 917
A LV KGY+ +G+DY EIF+ VV+HTSIR+++++V D+ LEQMDVKT FLHG LEE+
Sbjct: 912 AQLVAKGYTHREGVDYQEIFALVVKHTSIRILMSVVVDQDLELEQMDVKTAFLHGELEEE 971
Query: 918 IYIEQPEG-FSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYV 976
+Y+EQPEG SE G+ + VC LK+SLYGLKQSPRQW KRF+ +M+ + R ++D CVYV
Sbjct: 972 LYMEQPEGCISEDGENK-VCLLKKSLYGLKQSPRQWNKRFNRFMIDQNFIRSEHDACVYV 1030
Query: 977 MSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRG 1036
+ + ++LLLYVDDMLIA ++N++K +L EF+MKD+G A +ILG++I +D
Sbjct: 1031 KQVSEQEHLYLLLYVDDMLIAGKSKSEINKVKEQLSMEFEMKDMGPASRILGIDIIRDMK 1090
Query: 1037 AKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPY--A 1094
L +SQ SY+ V+ RF+M++A +P+ HFKL+ + D E + +PY A
Sbjct: 1091 NGVLRMSQASYIHNVVQRFNMAEAKVTRSPIGAHFKLAAVRD---DDECIDNNAVPYASA 1147
Query: 1095 VQLVV*CMLWFALDQIWHKQLVKC------QVHVQAREAAL----GSSQVDPKILE---- 1140
V ++ M+ D + LV +H +A + L GS ++ +
Sbjct: 1148 VGSIMYAMIGIRPDLAYVICLVSRYMARPGSIHWEAVKWILRYMRGSQDLNLVFTKEKEF 1207
Query: 1141 ---GYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICLYSCGG-PICWKSSVQST 1196
GY S Y R + G +++ GG + WK+++QS
Sbjct: 1208 RVTGYCDSDYAADLDRRR-----SVSGY--------------VFTVGGNTVSWKANLQSV 1248
Query: 1197 VAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDS 1239
A+STTEAE+MA+ EAAKEALW+ GL+K+LG+EQ V L CDS
Sbjct: 1249 TALSTTEAEFMALTEAAKEALWIKGLMKDLGLEQDKVTLWCDS 1291
>emb|CAA37924.1| unnamed protein product [Arabidopsis thaliana] gi|99719|pir||S23319
hypothetical protein 2 - Arabidopsis thaliana
retrotransposon Ta1-2 (strain Landsberg) (fragment)
Length = 1084
Score = 690 bits (1780), Expect = 0.0
Identities = 399/1049 (38%), Positives = 598/1049 (56%), Gaps = 53/1049 (5%)
Query: 81 DVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDE 140
++W+ L YM + N+++ + + YS KM + + +V F I+A+++ L + V +E
Sbjct: 14 EMWETLNKQYMETSLPNRIYVQLKFYSFKMNDSKSINENVNEFLKIVAELSSLEINVVEE 73
Query: 141 DKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFV 200
+AI+ L L Y L TL YG +++L + S+ + +E ++ L+
Sbjct: 74 VRAILFLNGLSSRYSQLKHTLKYGNKALSLQDVISSARSLERELDEQKETDKNTSTVLYT 133
Query: 201 KGGQDRGRGKGKAVDSGKKKR----SKSKDRKTTECYSCKQIGHWKRDCPNRSGK-SGNN 255
+RGR + + + K+ + SKS C+ CK+ GH K+DC R K N
Sbjct: 134 N---ERGRPQTRNQNQNKEGQGRGISKSNSNAKLTCWYCKKEGHVKKDCFARKRKLESEN 190
Query: 256 SSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYL 315
A V+ SE L V+ + D WVLDSGC+ HM+ ++WF +F+ + L
Sbjct: 191 PGEAGVITEKLVFSEA--LSVNDLAVRDIWVLDSGCTSHMSARKDWFCNFRKDGGTTILL 248
Query: 316 GDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDI 375
GDD +G +KI G + L V+YV E+ +NLIS GTL + GY + + +
Sbjct: 249 GDDHSVKSQGQGSIKIDTHGGTITVLENVKYVPELRRNLISTGTLDKRGYKHEGGDGK-- 306
Query: 376 LRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMEL 435
+R K T +R + G +Y L G TI+ + E T+LWH RLGH+ M L
Sbjct: 307 VRYFKNQKTALRGEIVNG-LYILDGNTILSETCVAEGSKGKTELWHSRLGHMGLNNMKVL 365
Query: 436 HKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTK-EPSV*G 494
+ ++ + C+ CV+GK +V F G H ++ +L YVH+D+ G T PS+ G
Sbjct: 366 AGKGLVSKEEIRELDFCENCVMGKAKKVSFNMGKHNSEYVLSYVHADLWGSTNVTPSLSG 425
Query: 495 FRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKN 554
+YF++ DD +RKVW+YF++ K E F +F K VENQ +K+K LR+DNG E+ +
Sbjct: 426 NKYFLSIIDDKTRKVWLYFLRSKDETFDRFCERKELVENQQNKKVKCLRTDNGLEFCNLK 485
Query: 555 FMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACY 614
F +C+++GI+RH + TPQQNGVA+RMNRT+ EK RC+ + L + WA A Y
Sbjct: 486 FDAYCKDHGIERHMTCTYTPQQNGVADRMNRTIMEKVRCMLNESGLGEEFWAEAAATAAY 545
Query: 615 LVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGY 674
L++RSP +++D V EE+W +LR F AYVHI D+ KL P++ I IGY
Sbjct: 546 LISRSPASAIDHNVPEELWLNKKPGYKHLRRFGSIAYVHI---DQGKLKPRALKGIFIGY 602
Query: 675 NKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQS-DVTVVPDTEVENSSQDK-------- 725
G KGYK+W ++K ++SR+V+F E + K + + V ++E E +S K
Sbjct: 603 PSGTKGYKIWLLEEQKCVISRNVLFHEELVYKDTIEKERVVESEAEPASHSKSTLIKMKT 662
Query: 726 ---------IQVDIEETPVSPRQIVAQQQSEPGSDSGEVQ------DYTLVRDREPSRIT 770
IQV EE S + +Q+ EP + E Q +Y LVRDRE +I
Sbjct: 663 PGNLNSGGVIQVSDEEE--SDESVDEEQEPEPQVELPETQTTSSLANYQLVRDRERRQIH 720
Query: 771 PPVRYGFEDLAAYALLT----SSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NL 826
PP R+ E A+AL+T S +P +Y EA + E KW A EE SL KN T L
Sbjct: 721 PPARFTEESGVAFALVTVETLSMEEPQSYQEATSDKEWKKWKLATHEEKNSLIKNGTWVL 780
Query: 827 VQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSI 886
V P +++IGC+W++K K + E ++KA LV KGY+Q +G+DY EIF+PV +HTSI
Sbjct: 781 VDKPKDRKIIGCRWLFKMKSGIPGVEPVRYKARLVAKGYTQREGVDYQEIFAPVAKHTSI 840
Query: 887 RVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF-SETGDGRLVCKLKRSLYGL 945
R+++++V D+ LEQMDVKTTFLHG LEE++Y+EQP+GF SE G+ + VC LK+SLYGL
Sbjct: 841 RILMSVVVDQDLELEQMDVKTTFLHGELEEELYMEQPDGFISEDGENK-VCLLKKSLYGL 899
Query: 946 KQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVN 1005
KQSPRQW KRF +M+ + R ++D CVYV + ++LLLYVDDMLIA ++N
Sbjct: 900 KQSPRQWNKRFSRFMIDQNFIRSEHDACVYVKQAGEQDHLYLLLYVDDMLIAGKSKSEIN 959
Query: 1006 ELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVST 1065
+++ +L EF+MKD+G A +ILG++I +DR L +SQ Y+ V+ RF+M++A +
Sbjct: 960 KVE-QLSMEFEMKDMGPASRILGIDIIRDRKNGVLRMSQARYIHNVVQRFNMAEAKVTRS 1018
Query: 1066 PLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
P+ HFKL+ + + D E + +PYA
Sbjct: 1019 PIGAHFKLA---AVRDDDECIDNNDVPYA 1044
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.323 0.138 0.419
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,170,957,551
Number of Sequences: 2540612
Number of extensions: 90774210
Number of successful extensions: 252065
Number of sequences better than 10.0: 4486
Number of HSP's better than 10.0 without gapping: 3740
Number of HSP's successfully gapped in prelim test: 750
Number of HSP's that attempted gapping in prelim test: 238968
Number of HSP's gapped (non-prelim): 8642
length of query: 1307
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1167
effective length of database: 507,674,714
effective search space: 592456391238
effective search space used: 592456391238
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 81 (35.8 bits)
Lotus: description of TM0003.6