
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0003.6
(1307 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q6AUC7 Putative polyprotein [Oryza sativa] 970 0.0
UniRef100_Q75HA9 Putative polyprotein [Oryza sativa] 960 0.0
UniRef100_Q9FFM0 Copia-like retrotransposable element [Arabidops... 888 0.0
UniRef100_Q9M1F5 Copia-like polyprotein [Arabidopsis thaliana] 881 0.0
UniRef100_Q9SH77 Putative retroelement pol polyprotein [Arabidop... 875 0.0
UniRef100_Q8LNW7 Putative polyprotein [Oryza sativa] 869 0.0
UniRef100_Q6BCY1 Gag-Pol [Ipomoea batatas] 865 0.0
UniRef100_Q6L4V3 Putative polyprotein [Oryza sativa] 863 0.0
UniRef100_Q9SJT2 Putative retroelement pol polyprotein [Arabidop... 857 0.0
UniRef100_Q9ZPU5 Putative retroelement pol polyprotein [Arabidop... 855 0.0
UniRef100_P10978 Retrovirus-related Pol polyprotein from transpo... 853 0.0
UniRef100_Q9SHR5 F28L22.3 protein [Arabidopsis thaliana] 845 0.0
UniRef100_Q9AU17 Polyprotein-like [Lycopersicon chilense] 811 0.0
UniRef100_O81903 Putative transposable element [Arabidopsis thal... 795 0.0
UniRef100_Q9ZRJ0 ORF [Nicotiana tabacum] 782 0.0
UniRef100_Q7XFG2 Putative retroelement [Oryza sativa] 664 0.0
UniRef100_Q6L3N8 Putative gag-pol polyprotein [Solanum demissum] 655 0.0
UniRef100_Q9C739 Copia-type polyprotein, putative [Arabidopsis t... 645 0.0
UniRef100_Q9SXB2 T28P6.8 protein [Arabidopsis thaliana] 642 0.0
UniRef100_Q9AUZ1 Polyprotein, putative [Arabidopsis thaliana] 604 e-171
>UniRef100_Q6AUC7 Putative polyprotein [Oryza sativa]
Length = 1241
Score = 970 bits (2507), Expect = 0.0
Identities = 554/1249 (44%), Positives = 770/1249 (61%), Gaps = 60/1249 (4%)
Query: 91 MSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSL 150
M+K +K+ KQ+L+ K+Q+ G + H+ AF I+AD+ + V D+ED +ILLCSL
Sbjct: 1 MTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSAFKEIVADLESMEVKYDEEDLGLILLCSL 60
Query: 151 PGSYDHLVTTLTYGKDSITLDSISSTLLPHAQRRQSV-EEGGGSSGEGLFVKGGQDRGRG 209
P SY + T+ Y +D++TL + L + ++ V EG S EGL V+G Q
Sbjct: 61 PSSYANFRDTILYSRDTLTLKEVYDALHAKEKMKKMVPSEGSNSQAEGLVVRGRQQEKNT 120
Query: 210 KGKAVDSGK---KKRSKSKDRKTTECYSCKQIGH-----WK-RDCPNRSGK-----SGNN 255
K+ D + RSKS+ R + C CK+ GH WK +D R+ K
Sbjct: 121 NNKSRDKSSSIYRGRSKSRGRYKS-CKYCKRDGHDISECWKLQDKDKRTRKYIPKGKKEE 179
Query: 256 SSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYL 315
A VV + S +E + + +D W+LD+ C+YHM +R+WF ++++ G V +
Sbjct: 180 EGKAAVVTDEKSDAELLVAYAGCAQTSDQWILDTACTYHMCPNRDWFATYEAVQGGTVLM 239
Query: 316 GDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDI 375
GDD PC + G+ V+I + DG +RTL VR++ + ++LISL TL GY + + I
Sbjct: 240 GDDTPCEVAGIGTVQIKMFDGCIRTLLDVRHIPNLKRSLISLCTLDRKGYKYSGGDG--I 297
Query: 376 LRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVE---TDDDATKLWHMRLGHLSERGM 432
L+V+KG++ VM+A N+Y L G TI+G+VA+V ++ DAT LWHMRLGH+SE G+
Sbjct: 298 LKVTKGSLVVMKADIKYANLYHLRGTTILGNVAAVSDSLSNSDATNLWHMRLGHMSEIGL 357
Query: 433 MELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV 492
EL KR +L G + C++C+ GK RV+F T H T+GILDYVHSD+ GP ++ S
Sbjct: 358 AELSKRGLLDGQSIGKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSDLWGPARKTSF 417
Query: 493 *GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTD 552
G RY +T DD+SRKVW YF+K+K + F FK WK VE QT RK+K LR+DNG E
Sbjct: 418 GGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKILRTDNGMELCS 477
Query: 553 KNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMA 612
K F +C+ GI RH++V TPQQNGVAERMNRT+ KARC+ NA L K WA ++ A
Sbjct: 478 KIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCMLSNASLPKQFWAEAVSTA 537
Query: 613 CYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIII 672
CYL+NRSP ++D K EVW+G+P + S+LR+F AY H+ D KL+P+ CI +
Sbjct: 538 CYLINRSPSYAIDKKTPIEVWSGSPANYSDLRVFGCTAYAHV---DNGKLEPRVIKCIFL 594
Query: 673 GYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVPDTEVENSSQDKIQVDIEE 732
GY GVKGYKLW P KKV++SR+VVF E ML P T V SQ+K+ V +E
Sbjct: 595 GYLSGVKGYKLWCPETKKVVISRNVVFHESIMLHDK-----PSTNVPVESQEKVSVQVEH 649
Query: 733 TPVS----PRQIVAQQQSEP---GSDSGEVQD---YTLVRDREPSRITPPVRYGFE-DLA 781
S ++ VA Q P SDS VQ ++ +DR PP RY E ++
Sbjct: 650 LISSGHAPEKEDVAINQDAPVIEDSDSSIVQQSPKRSIAKDRPKRNTKPPRRYIEEANIV 709
Query: 782 AYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVI 836
AYAL + + +PSTY +A+ S + ++W++AM +EMESL+KN + L +LP K+ I
Sbjct: 710 AYALSVAEEIEGNAEPSTYSDAIVSDDCNRWITAMHDEMESLEKNHSWELEKLPKEKKPI 769
Query: 837 GCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASM 896
CKW++K+K ++ + ++KA LV KGYSQ GID++++FSPVV+H+SIR +L++VA
Sbjct: 770 RCKWIFKRKEGMSPSDEARYKARLVAKGYSQIPGIDFNDVFSPVVKHSSIRTLLSIVAMH 829
Query: 897 DMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRF 956
D LEQMDVKT FLHG LEE IY+EQPEGF G LVC+LK+SLYGLKQSPRQWYKRF
Sbjct: 830 DYELEQMDVKTAFLHGELEEDIYMEQPEGFVVPGKENLVCRLKKSLYGLKQSPRQWYKRF 889
Query: 957 DSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFD 1016
DS+ML +RR +YD CVY + + DGS I+LLLYVDDMLIAA ++ +LK +L EF+
Sbjct: 890 DSFMLSQKFRRSNYDSCVY-LKVVDGSAIYLLLYVDDMLIAAKDKSEIAKLKAQLSSEFE 948
Query: 1017 MKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLE 1076
MKDLGAAKKILGMEI ++R + KL+LSQK Y+E VL RF+M A VST L HF+LS +
Sbjct: 949 MKDLGAAKKILGMEITRERHSGKLYLSQKCYIEKVLHRFNMHDAKLVSTLLAAHFRLSSD 1008
Query: 1077 QSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQV 1134
P+ +IE MS++PY AV ++ M+ D + H V + + + Q
Sbjct: 1009 LCPQSAYDIEYMSRVPYSSAVSSLMYAMVCSRPD-LSHALSVVSRYMANPGKEHWKAVQW 1067
Query: 1135 DPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICL----YSCGG-PICW 1189
+ L G + + + S G+ G + R L ++ GG + W
Sbjct: 1068 IFRYLRGTSSACLQFGRS------SDGLVGYVDSDFAGDLDRRRSLTGYVFTVGGCAVSW 1121
Query: 1190 KSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQ 1249
K+S+Q+TVA+STTEAEYMA++EA KE +WL GL EL + + CDSQSAI LT +Q
Sbjct: 1122 KASLQATVALSTTEAEYMAISEACKEVIWLRGLYTELCGVTSCINIFCDSQSAICLTKDQ 1181
Query: 1250 VYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
++H RTKHID+R+H IR ++A + + KI T +N D +TKPV + KF
Sbjct: 1182 MFHERTKHIDLRYHFIRGVIAEGDVKVCKISTHDNPVDMMTKPVPATKF 1230
>UniRef100_Q75HA9 Putative polyprotein [Oryza sativa]
Length = 1322
Score = 960 bits (2482), Expect = 0.0
Identities = 557/1334 (41%), Positives = 780/1334 (57%), Gaps = 67/1334 (5%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKS-LQKALRD--EKPADIATVDWNEMKEKAAGLI 61
K+++ D F LWQ + +LAQ S L +AL +K T + KA LI
Sbjct: 5 KYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 64
Query: 62 TLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVY 121
L +S+D++ +L T ++W KLES+ MSK +K+ K +L+S K+ E G + H+
Sbjct: 65 QLHLSNDILQEVLQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHIS 124
Query: 122 AFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPHA 181
F I+AD+ + V DDED ++LLCSLP SY + T+ +D +TL + L
Sbjct: 125 VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSRDELTLAEVYEALQNRE 184
Query: 182 QRRQSVEEGGGSS-GEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQIGH 240
+ + V+ SS GE L V+G ++ + R +SK R C CK+ H
Sbjct: 185 KMKGMVQSYASSSKGEALQVRGRSEQRTYNDSNDHDKSQSRGRSKSRGKKFCKYCKKKNH 244
Query: 241 WKRDCPNRSGKSGNNSSS-ANVVQSDGSCSEEDLLCVSS--VKCTDAWVLDSGCSYHMTQ 297
+ +C K S A+VV S + D L V + V D W+LD+ CS+H+
Sbjct: 245 FIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASHDEWILDTACSFHICI 304
Query: 298 HREWFNSFKS-GDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLIS 356
+R+WF+S+KS + V +GDD P I G+ V+I DG RTL VR++ + +NLIS
Sbjct: 305 NRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMARNLIS 364
Query: 357 LGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDV--ASVETDD 414
L TL GY + +++VSKG++ M + N+Y L G T+ G V A+V D+
Sbjct: 365 LSTLDAEGYKYSGSGG--VVKVSKGSLVYMIGDMNSANLYVLRGSTLHGSVTAAAVTKDE 422
Query: 415 DA-TKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTK 473
+ T LWHMRLGH+SE GM EL KRN+L G + C++CV GK RV+F T H+TK
Sbjct: 423 PSKTNLWHMRLGHMSELGMAELMKRNLLDGCTQGNMKFCEHCVFGKHKRVKFNTSVHRTK 482
Query: 474 GILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVEN 533
GILDYVH+D+ GP+++PS+ G RY +T DD+SRK W YF+K+K + FA FK K +E
Sbjct: 483 GILDYVHADLWGPSRKPSLGGARYMLTIIDDYSRKEWPYFLKHKDDTFAAFKERKVMIER 542
Query: 534 QTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARC 593
QT +++K L +DNG E+ F +C + GI RH ++ TPQQNGVAERMNRT+ KARC
Sbjct: 543 QTEKEVKVLCTDNGGEFCSDAFDDYCRKEGIVRHHTIPYTPQQNGVAERMNRTIISKARC 602
Query: 594 LRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVH 653
+ NA ++K WA N ACYL+NRSP L+ K E+W+G P D S LR+F AY H
Sbjct: 603 MLSNARMNKRFWAEAANTACYLINRSPSIPLNKKTPIEIWSGMPADYSQLRVFGCTAYAH 662
Query: 654 ISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVT-V 712
+ D KL+P++ C+ +GY GVKGYKLW+P K +SR+V+F+E M S T V
Sbjct: 663 V---DNGKLEPRAIKCLFLGYGSGVKGYKLWNPETNKTFMSRNVIFNEFVMFNDSLPTDV 719
Query: 713 VPDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSGEV---QDYTLVRDREPSRI 769
+P E Q + V +E +IV ++ S V QD + R
Sbjct: 720 IPGGSDEE--QQYVSVQVEHVDDQETEIVGNDVNDTVQHSPSVLQPQDEPIAHRRTKRSC 777
Query: 770 TPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET 823
PVR E D+ YA + ++ +P+TY EA+ S +++KW+SA+ EEM+SL+KN T
Sbjct: 778 GAPVRLIEECDMVYYAFSYAEQVENTLEPATYTEAVVSGDREKWISAIQEEMQSLEKNGT 837
Query: 824 *NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRH 883
LV LP K+ + CKW++K+K ++ E +FK LV KG+SQ G+DY+++FSPVV+H
Sbjct: 838 WELVHLPKQKKPVRCKWIFKRKEGLSPSEPPRFKVRLVAKGFSQIAGVDYNDVFSPVVKH 897
Query: 884 TSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLY 943
+SIR ++V D+ LEQ+DVKTTFLHG LEE+IY++QPEGF G VCKLKRSLY
Sbjct: 898 SSIRTFFSIVTMHDLELEQLDVKTTFLHGELEEEIYMDQPEGFIVPGKEDYVCKLKRSLY 957
Query: 944 GLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHD 1003
GLKQSPRQWYKRFDS+ML G++R ++D CVY+ ++ GS I+LLLYVDDMLIAA
Sbjct: 958 GLKQSPRQWYKRFDSFMLSHGFKRSEFDSCVYIKFVN-GSPIYLLLYVDDMLIAAKSKEQ 1016
Query: 1004 VNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHV 1063
+ LK +L EFDMKDLGAAKKILGMEI +DR + L+LSQ+SY++ VL RF+M A V
Sbjct: 1017 ITTLKKQLSSEFDMKDLGAAKKILGMEITRDRNSGLLFLSQQSYIKKVLQRFNMHDAKPV 1076
Query: 1064 STPLTNHFKLSLEQSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVH 1121
STP+ HFKLS Q D ++E MS++PY AV ++ M+ D LV +
Sbjct: 1077 STPIAPHFKLSALQCASTDEDVEYMSRVPYSSAVGSLMYAMVCSWPDLSHAMSLVSRYMA 1136
Query: 1122 VQARE-----------------AALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICG 1164
+E A L ++D K L GY S + R + G
Sbjct: 1137 NPGKEHWKAVQWIFRYLRGTADACLKFGRID-KGLVGYVDSDFAADLDKRR-----SLTG 1190
Query: 1165 LCR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVK 1224
+ SC + WK+++Q VA STTEAEYMA+AEA KE++WL GL
Sbjct: 1191 YV-----------FTIGSCA--VSWKATLQPVVAQSTTEAEYMAIAEACKESVWLKGLFA 1237
Query: 1225 ELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSEN 1284
EL + L CDSQSAI LT +Q++H RTKHID+++H +R+++A ++ + KI +N
Sbjct: 1238 ELCGVDSCINLFCDSQSAICLTKDQMFHERTKHIDIKYHYVRDIVAQGKLKVCKISIHDN 1297
Query: 1285 TTDKLTKPVTSDKF 1298
D +TKP+ KF
Sbjct: 1298 PADMMTKPIPVAKF 1311
>UniRef100_Q9FFM0 Copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 888 bits (2294), Expect = 0.0
Identities = 537/1381 (38%), Positives = 768/1381 (54%), Gaps = 148/1381 (10%)
Query: 7 EVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADI--ATVDWN------------- 51
EV +FDG G++ LW+ + L + L +E+ A + +T + +
Sbjct: 7 EVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETATSK 66
Query: 52 -------EMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQR 104
E + KA I L + ++V+ ++ T + L+ L+M+K+ N+++ KQR
Sbjct: 67 LEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYLKQR 126
Query: 105 LYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYG 164
LY KM E ++ +V F +++D+ + V V DED+AI+LL SLP +D L TL Y
Sbjct: 127 LYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLKYC 186
Query: 165 KDSITLDSISSTLLPHAQRRQSVEEGGGS------SGEGLFVKGGQDRGRGKGKAVDSGK 218
K ++ L+ I+S + R + E G S + +GLFV QDRGR + + K
Sbjct: 187 KTTLHLEEITSAI------RSKILELGASGKLLKNNSDGLFV---QDRGRSETRGKGPNK 237
Query: 219 KK-RSKSKDRKTTECYSCKQIGHWKRDC-----PNRSGKSGNNSSSANVVQ--SDGSCSE 270
K RSKSK T C+ C + GH+K+ C N+ G + ++ V +D +
Sbjct: 238 NKSRSKSKGAGKT-CWICGKEGHFKKQCYVWKERNKQGSTSERGEASTVTARVTDAAALV 296
Query: 271 EDLLCVSSVKCT-DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQV 329
+ + T D W+LD+GCS+HMT ++W FK G V +G+D +KG+ V
Sbjct: 297 VSRALLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEVKGIGDV 356
Query: 330 KIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAK 389
+I +DG L+ VRY+ E+ KNLISLGTL + G F+S++ IL + K +TV+ K
Sbjct: 357 RIKNEDGSTILLTDVRYIPEMSKNLISLGTLEDKGCWFESKKG--ILTIFKNDLTVLTGK 414
Query: 390 RTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCII 449
+ + +Y L G T+ G+ ++ + D T LWH RLGH+ +G+ L + L
Sbjct: 415 KES-TLYFLQGTTLAGEANVIDKEKDETSLWHSRLGHIGAKGLQVLVSKGHLD------- 466
Query: 450 GLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEP-SV*GFRYFVTFTDDFSRK 508
K + F H TK LDYVHSD+ G T P S+ +YF+TF DDF+R+
Sbjct: 467 ---------KNIMISFGAAKHVTKDKLDYVHSDLWGSTNVPFSIGKCQYFITFIDDFTRR 517
Query: 509 VWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHF 568
W+YF++ K E F+KF WK ++ENQ +K+K L +DNG E+ ++ F FC + G+ RH
Sbjct: 518 TWIYFIRTKDEAFSKFVEWKTQIENQQDKKLKILITDNGLEFCNQEFDSFCRKEGVIRHR 577
Query: 569 SVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKV 628
+ TPQQNGVAERMNRT+ K RC+ + L K WA + A +L+N+SP +S++ +
Sbjct: 578 TCAYTPQQNGVAERMNRTIMNKVRCMLSESGLGKQFWAEAASTAVFLINKSPSSSIEFDI 637
Query: 629 AEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVK 688
EE WTG+P D L+ F AY+H D+ KL+P++K I +GY GVK +K+W
Sbjct: 638 PEEKWTGHPPDYKILKKFGSVAYIH---SDQGKLNPRAKKGIFLGYPDGVKRFKVWLLED 694
Query: 689 KKVIVSRDVVFDE*SMLKQSDVTVVPDTEVENSSQDKIQVDIEETPVSPRQIVA--QQQS 746
+K +VSRD+VF E M K+ + S +DK ++E T + + + A + QS
Sbjct: 695 RKCVVSRDIVFQENQMYKELQKN-------DMSEEDKQLTEVERTLIELKNLSADDENQS 747
Query: 747 EPGSDSGEVQ------------------------DYTLVRDREPSRITPPVRYGFED--L 780
E G +S + Q +Y L RDR +I P R+ ED L
Sbjct: 748 EGGDNSNQEQASTTRSASKDKQVEETDSDDDCLENYLLARDRIRRQIRAPQRFVEEDDSL 807
Query: 781 AAYAL-LTSSGD---PSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVI 836
+AL +T G+ P TY EAM S E +KW A +EEM+S+KKN+T +++ P GKRVI
Sbjct: 808 VGFALTMTEDGEVYEPETYEEAMRSPECEKWKQATIEEMDSMKKNDTWDVIDKPEGKRVI 867
Query: 837 GCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASM 896
GCKW++K+K + E ++KA LV KG+SQ +GIDY EIFSPVV+H SIR +L++V
Sbjct: 868 GCKWIFKRKAGIPGVEPPRYKARLVAKGFSQREGIDYQEIFSPVVKHVSIRYLLSIVVQF 927
Query: 897 DMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRF 956
DM LEQ+DVKT FLHGNL+E I + QPEG+ + VC LK+SLYGLKQSPRQW +RF
Sbjct: 928 DMELEQLDVKTAFLHGNLDEYILMSQPEGYEDEDSTEKVCLLKKSLYGLKQSPRQWNQRF 987
Query: 957 DSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFD 1016
DS+M+ GY+R Y+ CVY L+DGS+I+LLLYVDDMLIA+ + + +LK L +EF+
Sbjct: 988 DSFMINSGYQRSKYNPCVYTQQLNDGSYIYLLLYVDDMLIASQNKDQIQKLKESLNREFE 1047
Query: 1017 MKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLE 1076
MKDLG A+KILGMEI ++R L LSQ YV GVL F M ++ TPL HFKL
Sbjct: 1048 MKDLGPARKILGMEITRNREQGILDLSQSEYVAGVLRAFGMDQSKVSQTPLGAHFKLRAA 1107
Query: 1077 QSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIW---------------HKQLVKCQ 1119
+ + E M +PY A+ ++ M+ D + H Q VK
Sbjct: 1108 NEKTLARDAEYMKLVPYPNAIGSIMYSMIGSRPDLAYPVGVVSRFMSKPSKEHWQAVKWV 1167
Query: 1120 VHVQ--AREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNR 1177
+ ++ L + D + GY S Y A I G
Sbjct: 1168 MRYMKGTQDTCLRFKKDDKFEIRGYCDSDY-----ATDLDRRRSITGF------------ 1210
Query: 1178 ICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLH 1236
+++ GG I WKS +Q VA+STTEAEYMA+AEA KEA+WL GL E+G EQ V++
Sbjct: 1211 --VFTAGGNTISWKSGLQRVVALSTTEAEYMALAEAVKEAIWLRGLAAEMGFEQDAVEVM 1268
Query: 1237 CDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSD 1296
CDSQSAI L+ N V+H RTKHIDVR+H IRE +A +I + KI T+ N D TK V
Sbjct: 1269 CDSQSAIALSKNSVHHERTKHIDVRYHFIREKIADGEIQVVKISTTWNPADIFTKTVPVS 1328
Query: 1297 K 1297
K
Sbjct: 1329 K 1329
>UniRef100_Q9M1F5 Copia-like polyprotein [Arabidopsis thaliana]
Length = 1363
Score = 881 bits (2276), Expect = 0.0
Identities = 526/1365 (38%), Positives = 770/1365 (55%), Gaps = 85/1365 (6%)
Query: 3 GAKFEVTRFDGTGNFGLWQRMAK---DLLA--------------QKSLQKALRDEKPADI 45
GA+ EV +FDG G++ +W+ D+L ++ +K+ DEK
Sbjct: 3 GARIEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEERE 62
Query: 46 ATVDWNEMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRL 105
+ E K KA I L VSD V+ I T+ + + L+ LYMSK N+++ KQ+L
Sbjct: 63 KMEAFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQKL 122
Query: 106 YSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTY-- 163
YS KM E ++ ++ F +I+AD+ L V V DED+AI+LL SLP +D L TL Y
Sbjct: 123 YSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYSS 182
Query: 164 GKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSK 223
GK ++LD +++ + SV++ EGL+VK + RG+ + D GK KRSK
Sbjct: 183 GKTVLSLDEVAAAIYSRELEFGSVKKSIKGQAEGLYVKDKAEN-RGRSEQKDKGKGKRSK 241
Query: 224 SKDRKTTECYSCKQIGHWKRDCPNRS----------------GKSGNNSSSANVVQSDGS 267
SK ++ C+ C + GH K CPN++ GK S N V+S G
Sbjct: 242 SKSKRG--CWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKGNLVEGSVNFVESAGM 299
Query: 268 CSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMR 327
E L + + D W++D+GC YHMT REW F G V +G+ +KG+
Sbjct: 300 FVSE-ALSSTDIHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGSVRMGNKSISRVKGVG 358
Query: 328 QVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMR 387
V+I D+G TL VRY+ ++ +NL+SLGT + G+ F+SE +LR+ G ++
Sbjct: 359 TVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFEKAGHKFESENG--MLRIKSGNQVLLE 416
Query: 388 AKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSC 447
+R +Y L G + +V +D T LWH RL H+S++ M L K+ L +
Sbjct: 417 GRRY-DTLYILHGKPATDESLAVARANDDTVLWHRRLCHMSQKNMSLLIKKGFLDKKKVS 475
Query: 448 IIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRG-PTKEPSV*GFRYFVTFTDDFS 506
++ C+ C+ G+ ++ F H TK L+YVHSD+ G PT S+ +YF++F DD++
Sbjct: 476 MLDTCEDCIYGRAKKIGFNLAQHDTKKKLEYVHSDLWGAPTVPMSLGNCQYFISFIDDYT 535
Query: 507 RKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQR 566
RKVWVYF+K K E F KF W + VENQ+G ++K LR+DNG E+ ++ F FCEE G QR
Sbjct: 536 RKVWVYFLKTKDEAFEKFVSWISLVENQSGERVKTLRTDNGLEFCNRMFDGFCEEKGFQR 595
Query: 567 HFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDG 626
H + TPQQNGV ERMNRT+ EK R + ++ L K WA + A L+N++P ++++
Sbjct: 596 HRTCAYTPQQNGVVERMNRTIMEKVRSMLCDSGLPKRFWAEATHTAVLLINKTPCSAINF 655
Query: 627 KVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDP 686
+ ++ W+G S LR + +VH D KL+ ++K ++IGY GVKGYK+W
Sbjct: 656 EFPDKRWSGKAPIYSYLRRYGCVTFVHT---DGGKLNLRAKKGVLIGYPSGVKGYKVWLI 712
Query: 687 VKKKVIVSRDVVFDE*SMLKQ-SDVTVVPDTEVENSSQDKIQVDIE------------ET 733
+KK +VSR+V F E ++ K E ++ + I +D+E +
Sbjct: 713 EEKKCVVSRNVSFQENAVYKDLMQRKEQVSCEEDDHAGSYIDLDLEADKDNSSGGEQSQA 772
Query: 734 PVSPRQIVAQQQSEPGSDSGEVQD---------YTLVRDREPSRITPPVRYGFEDLAAYA 784
V+P A + P ++ ++++ Y LVRDRE I P R+ ED A A
Sbjct: 773 QVTPATRGAVTSTPPRYETDDIEETDVHQSPLSYHLVRDRERREIRAPRRFDDEDYYAEA 832
Query: 785 LLTSSG----DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKW 840
L T+ +P+ Y EA+ DKW AM EE+ES KN+T V P +R+IG +W
Sbjct: 833 LYTTEDGDAVEPADYKEAVRDENWDKWRLAMNEEIESQLKNDTWTTVTRPEKQRIIGSRW 892
Query: 841 VYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHL 900
+YK K + E +FKA LV KGY+Q +G+DY EIF+PVV+H SIR++L++VA ++ L
Sbjct: 893 IYKYKQGIPGVEEPRFKARLVAKGYAQREGVDYHEIFAPVVKHVSIRILLSIVAQENLEL 952
Query: 901 EQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYM 960
EQ+DVKT FLHG L+E+IY+ PEG VC L +SLYGLKQ+PRQW ++F+ YM
Sbjct: 953 EQLDVKTAFLHGELKEKIYMMPPEGCESLFKENEVCLLNKSLYGLKQAPRQWNEKFNHYM 1012
Query: 961 LRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDL 1020
IG++R DYD C Y L D S ++LL YVDDML+AAN++ ++ LK +L +F+MKDL
Sbjct: 1013 TEIGFKRSDYDSCAYTKKLSDDSTMYLLFYVDDMLVAANNMQAIDALKKELSIKFEMKDL 1072
Query: 1021 GAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPK 1080
GAAKKILG+EI DR A LWLSQ+SY+ VL F+M ++ TPL H K+ K
Sbjct: 1073 GAAKKILGIEIIIDREAGVLWLSQESYLNKVLKTFNMLESKPALTPLGAHLKMKSATEEK 1132
Query: 1081 IDSEIEGMSKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKI 1138
+ +E E M+ +PY AV ++ M+ D + +V + A+E LG V +
Sbjct: 1133 LSTEEEYMNSVPYSSAVGSIMYAMIGTRPDLAYPVGVVSRFMSQPAKEHWLGVKWV-LRY 1191
Query: 1139 LEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICL----YSCGG-PICWKSSV 1193
++G +R + + ICG C + R + ++ GG I WKS +
Sbjct: 1192 IKGTVDTRLCYKRNS-----DFSICGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGL 1246
Query: 1194 QSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHA 1253
Q VA S+TE EYM++ EA KEA+WL GL+K+ G EQ V++ CDSQSAI L+ N V+H
Sbjct: 1247 QRVVAQSSTECEYMSLTEAVKEAIWLKGLLKDFGYEQKNVEIFCDSQSAIALSKNNVHHE 1306
Query: 1254 RTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
RTKHIDV+FH IRE++A ++ + KI T +N D TK + +KF
Sbjct: 1307 RTKHIDVKFHFIREIIADGKVEVSKISTEKNPADIFTKVLPVNKF 1351
>UniRef100_Q9SH77 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1356
Score = 875 bits (2261), Expect = 0.0
Identities = 516/1360 (37%), Positives = 758/1360 (54%), Gaps = 84/1360 (6%)
Query: 4 AKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPA-------DIATVDWNEM--- 53
A+ EV +FDG G++ +W+ + L AL++ + D + D+ E
Sbjct: 4 ARIEVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEK 63
Query: 54 -------KEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLY 106
K+KA I L V+D V+ I +T + L+ LYMSK N+++ KQ+LY
Sbjct: 64 FEALEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQKLY 123
Query: 107 SLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTY--G 164
S KM E ++ ++ F I+ D+ + V + DED+AI+LL +LP ++D L TL Y G
Sbjct: 124 SFKMSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKYSSG 183
Query: 165 KDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKS 224
K +TLD +++ + SV++ EGL+VK D+ KGK GK K K
Sbjct: 184 KSILTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVK---DKNENKGKGEQKGKGKGKKG 240
Query: 225 KDRKTTECYSCKQIGHWKRDCPNRSG---------KSGNNSSSANVVQSDGSCSEEDLLC 275
K +K C++C + GH++ CPN++ K ++ N+ ++ G E L
Sbjct: 241 KSKKKPGCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAAGYYVSE-ALS 299
Query: 276 VSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDD 335
+ V D W+LD+GCSYHMT REWF+ F G V +G+ ++G+ +++ D
Sbjct: 300 STEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGTIRVKNSD 359
Query: 336 GGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNI 395
G L+ VRY+ ++ +NL+SLGT + GY F+SE+ ILR+ G ++ +R +
Sbjct: 360 GLTIVLTNVRYIPDMDRNLLSLGTFEKAGYKFESEDG--ILRIKAGNQVLLTGRRY-DTL 416
Query: 396 YKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYC 455
Y L + + +V D T LWH RL H+S++ M L ++ L + + +C+ C
Sbjct: 417 YLLNWKPVASESLAVVKRADDTVLWHQRLCHMSQKNMEILVRKGFLDKKKVSSLDVCEDC 476
Query: 456 VLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEP-SV*GFRYFVTFTDDFSRKVWVYFM 514
+ GK R F HH TK L+Y+HSD+ G P S+ +YF++ DDF+RKVWVYFM
Sbjct: 477 IYGKAKRKSFSLAHHDTKEKLEYIHSDLWGAPFVPLSLGKCQYFMSIIDDFTRKVWVYFM 536
Query: 515 KYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTP 574
K K E F KF W VENQT R++K LR+DNG E+ +K F FCE GI RH + TP
Sbjct: 537 KTKDEAFEKFVEWVNLVENQTDRRVKTLRTDNGLEFCNKLFDGFCESIGIHRHRTCAYTP 596
Query: 575 QQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWT 634
QQNGVAERMNRT+ EK R + ++ L K WA + L+N++P ++L+ ++ ++ W+
Sbjct: 597 QQNGVAERMNRTIMEKVRSMLSDSGLPKRFWAEATHTTVLLINKTPSSALNFEIPDKKWS 656
Query: 635 GNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVS 694
GNP S LR + A+VH D KL+P++K ++IGY GVKGYK+W ++K +VS
Sbjct: 657 GNPPVYSYLRRYGCVAFVHT---DDGKLEPRAKKGVLIGYPVGVKGYKVWILDERKCVVS 713
Query: 695 RDVVFDE*SMLK-----QSDVTVVPDTEVEN-------------SSQDKIQVDIEETPVS 736
R+++F E ++ K Q +V+ D + + S D+ V+ P S
Sbjct: 714 RNIIFQENAVYKDLMQRQENVSTEEDDQTGSYLEFDLEAERDVISGGDQEMVNTIPAPES 773
Query: 737 PRQIVAQQQSEPGSDSGEVQD----YTLVRDREPSRITPPVRYGFEDLAAYALLTSSG-- 790
P Q + +V Y LVRDR+ I P R+ ED A AL T+
Sbjct: 774 PVVSTPTTQDTNDDEDSDVNQSPLSYHLVRDRDKREIRAPRRFDDEDYYAEALYTTEDGE 833
Query: 791 --DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAV 848
+P Y +A DKW AM EE++S +KN T +V P +R+IGC+W++K KL +
Sbjct: 834 AVEPENYRKAKLDANFDKWKLAMDEEIDSQEKNNTWTIVTRPENQRIIGCRWIFKYKLGI 893
Query: 849 T*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTT 908
E +FKA LV KGY+Q +GIDY EIF+PVV+H SIRV+L++VA D+ LEQ+DVKT
Sbjct: 894 LGVEEPRFKARLVAKGYAQKEGIDYHEIFAPVVKHVSIRVLLSIVAQEDLELEQLDVKTA 953
Query: 909 FLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRC 968
FLHG L+E+IY+ PEG+ VC L ++LYGLKQ+P+QW ++FD++M I + +
Sbjct: 954 FLHGELKEKIYMSPPEGYESMFKANEVCLLNKALYGLKQAPKQWNEKFDNFMKEICFVKS 1013
Query: 969 DYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILG 1028
YD C Y L DGS ++LL+YVDD+L+A+ + + LK LG F+MKDLGAAKKILG
Sbjct: 1014 AYDSCAYTKVLPDGSVMYLLIYVDDILVASKNKEAITALKANLGMRFEMKDLGAAKKILG 1073
Query: 1029 MEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGM 1088
MEI +DR LWLSQ+ Y+ +L ++M++A TPL HFK K+ + + M
Sbjct: 1074 MEIIRDRTLGVLWLSQEGYLNKILETYNMAEAKPAMTPLGAHFKFQAATEQKLIRDEDFM 1133
Query: 1089 SKIPY--AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSR 1146
+PY AV ++ ML D + ++ + +E LG V L G+
Sbjct: 1134 KSVPYSSAVGSIMYAMLGTRPDLAYPVGIISRFMSQPIKEHWLGVKWV----LRYIKGT- 1188
Query: 1147 YHV*QGARCC---SISCGICGLCR*SR**KVYNRICL----YSCGG-PICWKSSVQSTVA 1198
R C S S I G C + R + ++ GG I WKS +Q VA
Sbjct: 1189 ----LKTRLCYKKSSSFSIVGYCDADYAADLDKRRSITGLVFTLGGNTISWKSGLQRVVA 1244
Query: 1199 MSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHI 1258
STTE+EYM++ EA KEA+WL GL+K+ G EQ V++ CDSQSAI L+ N V+H RTKHI
Sbjct: 1245 QSTTESEYMSLTEAVKEAIWLKGLLKDFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHI 1304
Query: 1259 DVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
DV++H IRE+++ + + KI T +N D TK + KF
Sbjct: 1305 DVKYHFIREIISDGTVEVLKISTEKNPADIFTKVLAVSKF 1344
>UniRef100_Q8LNW7 Putative polyprotein [Oryza sativa]
Length = 1280
Score = 869 bits (2246), Expect = 0.0
Identities = 486/1091 (44%), Positives = 676/1091 (61%), Gaps = 53/1091 (4%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDW-NEMKEK---AAGL 60
K+++ D F LWQ + +LAQ+ L AL D T DW N+ K+K A
Sbjct: 40 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSG---FDKRTQDWSNDEKKKDRKAMSY 96
Query: 61 ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
I L +S++++ +L T +W KLE + M+K +K+ KQ+L+ K+Q+ G + H+
Sbjct: 97 IHLHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHL 156
Query: 121 YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPH 180
F I+AD+ + V D+ED +ILLCSLP SY + T+ Y D++ L + L
Sbjct: 157 STFKEIVADLESIEVKYDEEDLGLILLCSLPSSYANFRDTILYSHDTLILKEVYDALHAK 216
Query: 181 AQRRQSV-EEGGGSSGEGLFVKGGQDRGRGKGKAVD---SGKKKRSKSKDRKTTECYSCK 236
+ ++ V EG S EGL V+G Q K ++ D S + RSKS+ R + C CK
Sbjct: 217 EKMKKMVPSEGSNSQAEGLVVRGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKS-CKYCK 275
Query: 237 QIGH-----WK-RDCPNRSGK-----SGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAW 285
+ GH WK +D R+GK A VV + S +E + + +D W
Sbjct: 276 RDGHDISECWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDTELLVAYAGCAQTSDQW 335
Query: 286 VLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVR 345
+LD+ +YHM +R+WF ++++ G V +GDD PC + G+ V+I + DG +RTLS VR
Sbjct: 336 ILDTAWTYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLSDVR 395
Query: 346 YVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMG 405
++ + ++LISL TL GY + + IL+V+KG++ VM+A + N+Y L G TI+G
Sbjct: 396 HIPNLKRSLISLCTLDRKGYKYSGGDG--ILKVTKGSLVVMKADIKSANLYHLRGTTILG 453
Query: 406 DVASVE---TDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCR 462
+VA+V ++ DAT LWHMRLGH+SE G+ EL KR +L G + C++C+ GK R
Sbjct: 454 NVAAVSDSLSNSDATNLWHMRLGHMSEIGLAELSKRELLDGQSIGKLKFCEHCIFGKHKR 513
Query: 463 VRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFA 522
V+F T H T+GILDYVHSD+ GP + S G RY +T DD+SRKVW YF+K+K + F
Sbjct: 514 VKFNTSTHTTEGILDYVHSDLWGPACKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFD 573
Query: 523 KFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAER 582
FK WK VE QT +K+K LR+DNG E+ K F +C+ GI H++V TPQQNGVAER
Sbjct: 574 VFKEWKTMVERQTEKKVKILRTDNGMEFCSKIFKSYCKSEGIVHHYTVPHTPQQNGVAER 633
Query: 583 MNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSN 642
MN + KARC+ NA L K WA ++ CYL+NRSP + D K EVW+G+P + S+
Sbjct: 634 MNMAIISKARCMLSNADLPKQFWAEAVSTTCYLINRSPSYATDKKTPIEVWSGSPANYSD 693
Query: 643 LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE* 702
LR+F AY H+ D KL+P++ CI +GY GVKGYKLW P KKV++SR+VVF E
Sbjct: 694 LRVFGCTAYAHV---DNGKLEPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHES 750
Query: 703 SMLKQSDVTVVPDTEVENSSQDKIQVDIEETPVS----PRQIVAQQQSEP---GSDSGEV 755
+L P T V SQ+K V +E S ++ VA Q P SDS V
Sbjct: 751 VILHDK-----PSTNVPVESQEKASVQVEHLISSGHAPEKENVAINQDAPVIEDSDSSIV 805
Query: 756 QD---YTLVRDREPSRITPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDK 806
Q ++ +D+ I PP RY E ++ AYAL + + +PSTY EA+ S + ++
Sbjct: 806 QQSSKRSIAKDKPKRNIKPPRRYIEEANIVAYALSVAEEIEGNAEPSTYSEAIVSDDCNR 865
Query: 807 WMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYS 866
W++AM +EMESLKKN T V+LP K+ I CKW++K+K ++ + ++KA LV KGYS
Sbjct: 866 WITAMHDEMESLKKNHTWEFVKLPKEKKPIRCKWIFKRKEGMSPSDEARYKARLVAKGYS 925
Query: 867 QHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF 926
Q GID++++FSP+++H+SIR +L +VA D LEQMDVKT FLHG LEE IY+EQPEGF
Sbjct: 926 QIPGIDFNDVFSPIMKHSSIRTLLGIVAMHDYELEQMDVKTAFLHGELEEDIYMEQPEGF 985
Query: 927 SETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIF 986
G LVC+LK+SLYGLKQSPRQWYKRFDS+ML +RR +YD CVY + + DGS I+
Sbjct: 986 VVLGKENLVCRLKKSLYGLKQSPRQWYKRFDSFMLSQKFRRSNYDSCVY-LKVVDGSAIY 1044
Query: 987 LLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKS 1046
LLLYVDDMLIAA ++ +LK +L EF MKDLGAAKKILGMEI ++R + KL+LSQK
Sbjct: 1045 LLLYVDDMLIAAKDKSEIAKLKAQLSSEFGMKDLGAAKKILGMEITRERHSGKLYLSQKG 1104
Query: 1047 YVEGVLSRFDM 1057
Y++ VL RF+M
Sbjct: 1105 YIKKVLRRFNM 1115
Score = 127 bits (320), Expect = 2e-27
Identities = 62/120 (51%), Positives = 87/120 (71%), Gaps = 1/120 (0%)
Query: 1180 LYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCD 1238
+++ GG + WK+S+Q+TVA+STTEAEYMA++EA KEA+WL GL EL + + CD
Sbjct: 1150 VFTIGGCDVSWKASLQATVALSTTEAEYMAISEACKEAIWLRGLYTELCGVTSCINIFCD 1209
Query: 1239 SQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
SQSAI LT +Q++H RTKHIDVR+H IR ++ + + KI T +N D +TKPV++ KF
Sbjct: 1210 SQSAICLTKDQMFHERTKHIDVRYHIIRGVIVEGDVKVCKISTHDNPADMMTKPVSATKF 1269
>UniRef100_Q6BCY1 Gag-Pol [Ipomoea batatas]
Length = 1298
Score = 865 bits (2236), Expect = 0.0
Identities = 469/1100 (42%), Positives = 686/1100 (61%), Gaps = 29/1100 (2%)
Query: 4 AKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIAT-VDWNEMKEKAAGLIT 62
AKFE+ +F+G NF LW+ K +L + + A+ E+P D W+EM E A +
Sbjct: 3 AKFEIEKFNGK-NFSLWKLKVKAILRKDNCLAAI-SERPVDFTDDKKWSEMNEDAMADLY 60
Query: 63 LCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYA 122
L ++D V++ I + T ++WD L LY +K+ NK+F K++LY+L+M E + H+
Sbjct: 61 LSIADGVLSSIEEKKTANEIWDHLNRLYEAKSLHNKIFLKRKLYTLRMSESTSVTEHLNT 120
Query: 123 FNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYG--KDSITLDSISSTLLPH 180
N + + +T L ++ +++A +LL SLP SYD L+ LT D + D +++ +L
Sbjct: 121 LNTLFSQLTSLSCKIEPQERAELLLQSLPDSYDQLIINLTNNILTDYLVFDDVAAAVLEE 180
Query: 181 AQRRQSVEEGGGS--SGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQI 238
RR++ E+ + E L V G+ RG+ S + RSKS + T CY+C +
Sbjct: 181 ESRRKNKEDRQVNLQQAEALTVMRGRSTERGQ-----SSGRGRSKSSKKNLT-CYNCGKK 234
Query: 239 GHWKRDCPNRSGKSGNNSSSANVVQSDGS--CSEEDLLCVSSVKCTDAWVLDSGCSYHMT 296
GH K+DC N + S + A+ DGS C E + + D W++DSG +YHMT
Sbjct: 235 GHLKKDCWNLAQNSNPQGNVAST-SDDGSALCCEASIAREGRKRFADIWLIDSGATYHMT 293
Query: 297 QHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLIS 356
+EWF+ ++ G VY DD I G+ +K+ + DG V+T+ VR+V + KNL+S
Sbjct: 294 SRKEWFHHYEPISGGSVYSCDDHALEIIGIGTIKLKMYDGTVQTVQDVRHVKGLKKNLLS 353
Query: 357 LGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVET-DDD 415
G L + ++++ ++++ +GA+ VM+ ++ A N+Y L G T+ ASV D
Sbjct: 354 YGILDNSATQIETQKG--VMKIFQGALVVMKGEKIAANLYMLKGETLQEAEASVAACSPD 411
Query: 416 ATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGI 475
+T LWH +LGH+S++GM L ++ ++ G+ + LC++C+ KQ R++F T + + K +
Sbjct: 412 STLLWHQKLGHMSDQGMKILVEQKLIPGLTKVSLPLCEHCITSKQHRLKFSTSNSRGKVV 471
Query: 476 LDYVHSDV-RGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQ 534
L+ VHSDV + P PS+ G +YFV+F DD+SR+ WVY +K KS+VFA FK +KA VE
Sbjct: 472 LELVHSDVWQAPV--PSLGGAKYFVSFIDDYSRRCWVYPIKKKSDVFATFKAFKARVELD 529
Query: 535 TGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCL 594
+G+KIK R+DNG EYT + F FC++ GI+R F+V TPQQNGVAERMNRTL E+ R +
Sbjct: 530 SGKKIKCFRTDNGGEYTSEEFDDFCKKEGIKRQFTVAYTPQQNGVAERMNRTLLERTRAM 589
Query: 595 RLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHI 654
A L K WA +N ACYLVNR+P +++ K E+WTG P+D SNL IF Y
Sbjct: 590 LRAAGLEKSFWAEAVNTACYLVNRAPSTAIELKTPMEMWTGKPVDYSNLHIFGSIVYAMY 649
Query: 655 SSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVP 714
++++ +KLDPKS+ C +GY GVKGY+LWDP KV++SRDV+F E L++ +V
Sbjct: 650 NAQEITKLDPKSRKCRFLGYADGVKGYRLWDPTAHKVVISRDVIFVE-DRLQRGEVD--D 706
Query: 715 DTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSGEVQDYTLVRDREPSRITPPVR 774
TE E +IQV+ E S A ++ EP S T DRE R T
Sbjct: 707 STEKEKPETTQIQVEEEFEQDSSEAEPAHEEPEPESSGAPT---TRQSDREKRRPTWHSD 763
Query: 775 YGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKR 834
Y E AY LLT G+PST+ EA+ S + +W +AM EE+E+L KN T +LV LP G++
Sbjct: 764 YVMEGNVAYCLLTEDGEPSTFQEAINSSDVSQWTAAMQEEIEALHKNNTWDLVPLPQGRK 823
Query: 835 VIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVA 894
IG KWV+K K + E+++A LV KGY+Q +GID++EIFSPVVR T++RVVLA+ A
Sbjct: 824 PIGNKWVFKIKRNGD-DQVERYRARLVVKGYAQKEGIDFNEIFSPVVRLTTVRVVLAMCA 882
Query: 895 SMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYK 954
+ ++HLEQ+DVKT FLHG+LEE+IY+ QPEGF + + LVC+L +SLYGLKQ+PR WYK
Sbjct: 883 TFNLHLEQLDVKTAFLHGDLEEEIYMLQPEGFEDKENQNLVCRLNKSLYGLKQAPRCWYK 942
Query: 955 RFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKE 1014
RFDS+++ +GY R + D C Y +F+ LLLYVDDML+A + ++ELK +L +E
Sbjct: 943 RFDSFIMCLGYNRLNADPCAYFKRFGKDNFVILLLYVDDMLVAGPNKDHIDELKAQLARE 1002
Query: 1015 FDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLS 1074
F+MKDLG A KILGM+IH+DRG +K+WLSQK+Y++ +LSRF M +STPL + K+S
Sbjct: 1003 FEMKDLGPANKILGMQIHRDRGNRKIWLSQKNYLKKILSRFSMQDCKSISTPLPINLKVS 1062
Query: 1075 LEQSPKIDSEIEGMSKIPYA 1094
SP + MS++PYA
Sbjct: 1063 SSMSPSNEEGRMEMSRVPYA 1082
Score = 125 bits (315), Expect = 7e-27
Identities = 61/115 (53%), Positives = 82/115 (71%)
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
GG + W S +Q+ VA STTEAEY+A +A+KEA+WL L++ELG +Q V L CDSQSA+
Sbjct: 1172 GGAVSWVSKLQAVVATSTTEAEYVAATQASKEAIWLKMLLEELGHKQEFVSLFCDSQSAL 1231
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
+L N +H+RTKHI V++H IRE + + LQKIHT++N D LTK + DKF
Sbjct: 1232 HLARNPAFHSRTKHIRVQYHFIREKVKEGTVDLQKIHTADNVADFLTKIINVDKF 1286
>UniRef100_Q6L4V3 Putative polyprotein [Oryza sativa]
Length = 1243
Score = 863 bits (2230), Expect = 0.0
Identities = 493/1123 (43%), Positives = 688/1123 (60%), Gaps = 71/1123 (6%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDW-NEMKE---KAAGL 60
K+++ D F LWQ + +LAQ+ L AL D T DW N+ K+ KA
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSG---FDKRTQDWSNDEKKRDRKAISY 61
Query: 61 ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
I L +S++++ +L T +W KLE + M+K +K+ KQ+L+ K+Q+ + H+
Sbjct: 62 IHLHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHL 121
Query: 121 YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPH 180
AF I+AD+ + V D++D +ILLCSLP SY + T+ Y +D++TL +
Sbjct: 122 SAFKEIVADLESMEVKYDEDDLGLILLCSLPSSYANFRGTILYSRDTLTLKEVYDAFHAK 181
Query: 181 AQRRQSV-EEGGGSSGEGLFVKGGQDRGRGKGKAVD---SGKKKRSKSKDRKTTECYSCK 236
+ ++ V EG S EGL V+G Q + K ++ D S + R+KS+ R + C CK
Sbjct: 182 EKMKKMVTSEGSNSQAEGLVVRGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYKS-CKYCK 240
Query: 237 QIGH-----WK-RDCPNRSGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSG 290
+ GH WK +D R+GK G EE V + + +DA +L +
Sbjct: 241 RDGHDISECWKLQDKDKRTGK----------YIPKGKKEEEGKAAVVTDEKSDAELLVAY 290
Query: 291 CSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEV 350
T ++WF ++++ G V +GDD PC + G+ V+I + DG +RTLS V+++ +
Sbjct: 291 AGCAQTSDQDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNL 350
Query: 351 MKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASV 410
++LISL IL+V+KG++ VM+ + N+Y L G TI+G+VA+V
Sbjct: 351 KRSLISL---------------YGILKVTKGSLVVMKVDIKSANLYHLRGTTILGNVAAV 395
Query: 411 E---TDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKT 467
++ DAT LWHMRLGH+SE G+ EL KR +L G + C++C+ GK RV+F T
Sbjct: 396 FDSLSNSDATNLWHMRLGHMSEIGLAELSKRGLLDGQSIRKLKFCEHCIFGKHKRVKFNT 455
Query: 468 GHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLW 527
H T+GILDYVHSD+ GP + S G RY +T DD+SRKVW YF+K+K + F FK W
Sbjct: 456 STHTTEGILDYVHSDLWGPAHKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDGFKEW 515
Query: 528 KAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTL 587
K VE QT RK+K LR+DNG E+ K F +C+ GI H++ TPQQN VAERMNRT+
Sbjct: 516 KTMVERQTERKVKILRTDNGMEFCSKIFKSYCKSEGIVCHYTAPHTPQQNDVAERMNRTI 575
Query: 588 TEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFW 647
KARC+ NA L K WA ++ ACYL+NRSP ++D K EVW+G+P + S+LR+F
Sbjct: 576 ISKARCMLSNAGLPKQFWAEAVSTACYLINRSPGYAIDKKTPIEVWSGSPTNYSDLRVFG 635
Query: 648 RPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQ 707
AY H+ D KL+P++ CI +GY GVKGYKLW P KKV++SR+VVF E +L
Sbjct: 636 CTAYAHV---DNGKLEPRAIKCIFLGYASGVKGYKLWCPETKKVVISRNVVFHESVILHD 692
Query: 708 SDVTVVPDTEVENSSQDKIQVDIEETPVS----PRQIVAQQQSEP---GSDSGEVQD--- 757
P T V SQ+K V +E S ++ VA Q P SDS V
Sbjct: 693 K-----PSTNVPVESQEKASVQVEHLISSGHAPEKEDVAINQDAPVIEDSDSSIVHQSPK 747
Query: 758 YTLVRDREPSRITPPVRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAM 811
++ +D+ I PP RY E + AYAL + + +PSTY EA+ S + ++W++AM
Sbjct: 748 RSIAKDKPKRNIKPPRRYIEEAKIVAYALSVAEKIEGNAEPSTYSEAIVSDDCNRWITAM 807
Query: 812 VEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGI 871
+EMESL+KN T LV+LP K+ I CKW++K+K ++ + ++KA LV KGYSQ GI
Sbjct: 808 HDEMESLEKNHTWELVKLPKEKKPIRCKWIFKRKEGMSPSDEARYKARLVAKGYSQIPGI 867
Query: 872 DYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGD 931
D++++FSPVV+H+SIR +L +VA D LEQM+VKT FLHG LEE IY+EQPEGF G
Sbjct: 868 DFNDVFSPVVKHSSIRTLLGIVAMHDYELEQMNVKTAFLHGELEEDIYMEQPEGFVVPGK 927
Query: 932 GRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYV 991
LVC+LK+SLYGLKQSPRQWYKRFDS+ML +R +YD CVY + + DGS I+LLLYV
Sbjct: 928 ENLVCRLKKSLYGLKQSPRQWYKRFDSFMLSQKFRISNYDSCVY-LKVVDGSVIYLLLYV 986
Query: 992 DDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGV 1051
DDMLIAA ++ +LK +L EF+MKDLGAAKKILGMEI ++R + KL+LSQK Y+E V
Sbjct: 987 DDMLIAAKDKSEIEKLKAQLSSEFEMKDLGAAKKILGMEITRERHSGKLYLSQKGYIEKV 1046
Query: 1052 LSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
L RF+M A VSTPL HF+LS + P D +IE MS++PY+
Sbjct: 1047 LRRFNMHDAKPVSTPLAAHFRLSSDLCPLSDYDIEYMSRVPYS 1089
Score = 79.3 bits (194), Expect = 7e-13
Identities = 39/70 (55%), Positives = 52/70 (73%), Gaps = 1/70 (1%)
Query: 1180 LYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCD 1238
+++ GG + WK+S+Q+TVA+STTEAEYMA+ EA KEA+WL GL EL + + CD
Sbjct: 1174 VFTIGGCAVSWKASLQATVALSTTEAEYMAIFEACKEAIWLRGLYTELCGVTSCINIFCD 1233
Query: 1239 SQSAIYLTNN 1248
SQSAIYLT +
Sbjct: 1234 SQSAIYLTKD 1243
>UniRef100_Q9SJT2 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1333
Score = 857 bits (2213), Expect = 0.0
Identities = 526/1347 (39%), Positives = 760/1347 (56%), Gaps = 81/1347 (6%)
Query: 4 AKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDE-----KPADIATVDWNEMKE--- 55
A+ EV +FDG G++ +W+ L L AL++E K A++ + E +E
Sbjct: 4 ARIEVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLR 63
Query: 56 ---------KAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLY 106
KA I L V+D V+ I + + L+ LYMSK N+++ KQ+LY
Sbjct: 64 RELLEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLY 123
Query: 107 SLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKD 166
S KM E ++ ++ F I+AD+ V V DED+AI+LL SLP +D L TL YG
Sbjct: 124 SFKMSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLG 183
Query: 167 SITL--DSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQD-RGRGKGKAVDSGKKKRSK 223
+TL D + + + S ++ EGLFVK + RGR + + ++ KK S+
Sbjct: 184 RVTLSLDEVVAAIYSKELELGSNKKSIKGQAEGLFVKEKTETRGRTEQRGNNNNNKK-SR 242
Query: 224 SKDRKTTECYSCKQIGHWKRDCPNRSGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTD 283
SK R C+ C G+S N SS N +++G E L + + D
Sbjct: 243 SKSRSKKGCWIC--------------GESSNGSS--NYSEANGLYVSE-ALSSTDIHLED 285
Query: 284 AWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQ 343
WV+D+GCSYHMT REWF G V +G+ ++G+ +++ + G V L+
Sbjct: 286 EWVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTVSKVRGIGTIRVKNEAGMVVRLTN 345
Query: 344 VRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTI 403
VRY+ E+ +NL+SLGT ++GYSFK E L + G ++ +R +Y L +
Sbjct: 346 VRYIPEMDRNLLSLGTFEKSGYSFKLENGT--LSIIAGDSVLLTVRRCY-TLYLLQWRPV 402
Query: 404 MGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRV 463
+ SV D T LWH RLGH+S++ M L K+ +L + + C+ C+ GK R+
Sbjct: 403 TEESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKKKVSKLETCEDCIYGKAKRI 462
Query: 464 RFKTGHHKTKGILDYVHSDVRGPTKEP-SV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFA 522
F H T+ L+YVHSD+ G P S+ +YF++F DD++RKV +YF+K K E F
Sbjct: 463 GFNLAQHDTREKLEYVHSDLWGAPSVPFSLGKCQYFISFIDDYTRKVRIYFLKTKDEAFD 522
Query: 523 KFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAER 582
KF W VENQT ++IK LR+DNG E+ +++F FC + GI H + TPQQNGVAER
Sbjct: 523 KFVEWANLVENQTDKRIKTLRTDNGLEFCNRSFDEFCSQKGILWHRTCAYTPQQNGVAER 582
Query: 583 MNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSN 642
MNRTL EK R + ++ L K WA + L+N++P ++L+ +V ++ W+G S
Sbjct: 583 MNRTLMEKVRSMLSDSGLPKKFWAEATHTTAILINKTPSSALNYEVPDKRWSGKSPIYSY 642
Query: 643 LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE* 702
LR F A+VH D KL+P++K I++GY GVKGYK+W +KK +VSR+V+F E
Sbjct: 643 LRRFGCIAFVHT---DDGKLNPRAKKGILVGYPIGVKGYKIWLLEEKKCVVSRNVIFQEN 699
Query: 703 SMLK---QSDVTVVPDTEVENSSQDKIQVDIEE--------------TPVSPRQIVAQQQ 745
+ K QS + E SS + +D EE +P +P Q
Sbjct: 700 ASYKDMMQSKDAEKDENEAPPSSYLDLDLDHEEVITSGGDDPIVEAQSPFNPSPATTQTY 759
Query: 746 SEP-GSDSGEVQD---YTLVRDREPSRITPPVRYGFEDLAAYALLTSSG----DPSTYHE 797
SE S++ +Q Y LVRDR+ I PVR+ ED A AL T+ +P+ Y E
Sbjct: 760 SEGVNSETDIIQSPLSYQLVRDRDRRTIRAPVRFDDEDYLAEALYTTEDSGEIEPADYSE 819
Query: 798 AMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFK 857
A S +KW AM EEMES KN T +V+ P ++VIG +W+YK KL + E +FK
Sbjct: 820 AKRSMNWNKWKLAMNEEMESQIKNHTWTVVKRPQHQKVIGSRWIYKFKLGIPGVEEGRFK 879
Query: 858 AHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQ 917
A LV KGY+Q KGIDY EIF+PVV+H SIR+++++VA D+ LEQ+DVKT FLHG L+E+
Sbjct: 880 ARLVAKGYAQRKGIDYHEIFAPVVKHVSIRILMSIVAQEDLELEQLDVKTAFLHGELKEK 939
Query: 918 IYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVM 977
IY+ PEG+ E VC L +SLYGLKQ+P+QW ++F++YM IG+ R YD C Y+
Sbjct: 940 IYMVPPEGYEEMFKEDEVCLLNKSLYGLKQAPKQWNEKFNAYMSEIGFIRSLYDSCAYIK 999
Query: 978 SLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGA 1037
L DGS ++LLLYVDDML+AA + D+++LK +L + FDMKDLGAAK+ILGMEI ++R
Sbjct: 1000 ELSDGSRVYLLLYVDDMLVAAKNKEDISQLKEELSQRFDMKDLGAAKRILGMEIIRNREE 1059
Query: 1038 KKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPY--AV 1095
LWLSQ Y+ +L ++M+++ HV TPL H K+ K + + + M IPY AV
Sbjct: 1060 NTLWLSQNGYLNKILETYNMAESKHVVTPLGAHLKMRAATVEKQEQDEDYMKSIPYSSAV 1119
Query: 1096 QLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARC 1155
++ M+ D + ++ + ARE LG V + ++G G++ + +
Sbjct: 1120 GSIMYAMIGTRPDLAYPVGIISRYMSQPAREHWLGVKWV-LRYIKGSLGTKLQYKRSSDF 1178
Query: 1156 CSISCGIC----GLCR*SR**KVYNRICLYSCGGPICWKSSVQSTVAMSTTEAEYMAVAE 1211
+ G C C+ R + + G I WKS Q VA+STTEAEYM++ E
Sbjct: 1179 KVV--GYCDADHAACKDRR--RSITGLVFTLGGSTISWKSGQQRVVALSTTEAEYMSLTE 1234
Query: 1212 AAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLAS 1271
A KEA+W+ GL+KE G EQ V++ CDSQSAI L+ N V+H RTKHIDVR+ IR+++A+
Sbjct: 1235 AVKEAVWMKGLLKEFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVRYQYIRDIIAN 1294
Query: 1272 RQILLQKIHTSENTTDKLTKPVTSDKF 1298
+ KI T +N D TK V +KF
Sbjct: 1295 GDGDVVKIDTEKNPADIFTKIVPVNKF 1321
>UniRef100_Q9ZPU5 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1335
Score = 855 bits (2208), Expect = 0.0
Identities = 522/1323 (39%), Positives = 736/1323 (55%), Gaps = 97/1323 (7%)
Query: 41 KPADIATVDWNEMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLF 100
K D V E +KA +I L V+D V+ I T + W+ L+ L+M ++ ++++
Sbjct: 33 KKRDADEVARLERCDKAKNVIFLNVADKVLRKIELCKTAAEAWETLDRLFMIRSLPHRVY 92
Query: 101 AKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTT 160
+ Y+ KMQE + ++ F I+AD+ L + V DE +AI+LL SLP YD LV T
Sbjct: 93 TQLSFYTFKMQENKKIDENIDDFLKIVADLNHLQIDVTDEVQAILLLSSLPARYDGLVET 152
Query: 161 LTYG--KDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGK 218
+ Y ++ + LD + ++ + + + + EG F +G D G+ +
Sbjct: 153 MKYSNSREKLRLDDV---MVAARDKERELSQNNRPVVEGHFARGRPD-GKNNNQGNKGKN 208
Query: 219 KKRSKSKDRKTTECYSCKQIGHWKRDC-----PNRSGKSGNNSSSANVVQSDGS------ 267
+ RSKS D K C+ C + GH+K+ C N+S + G+++ +++ +S +
Sbjct: 209 RSRSKSADGKRV-CWICGKEGHFKKQCYKWIERNKSKQQGSDNGESSLAKSTEAFNPAMV 267
Query: 268 --CSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKG 325
++E L+ S+ + WVLD+GCS+HMT ++WF FK GYV +G+D +KG
Sbjct: 268 LLATDETLVVTDSI--ANEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYSPVKG 325
Query: 326 MRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTV 385
+ +KI DG L+ VRY+ + +NLISLGTL + G FKS++ IL++ KG T+
Sbjct: 326 IGSIKIRNSDGSQVILTDVRYMPNMTRNLISLGTLEDRGCWFKSQDG--ILKIVKGCSTI 383
Query: 386 MRAKRTAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVR 445
++ ++ +Y L G T G+ S D T LWH RLGH+S++GM L K+ L+
Sbjct: 384 LKGQKR-DTLYILDGVTEEGESHSSAEVKDETALWHSRLGHMSQKGMEILVKKGCLRREV 442
Query: 446 SCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV*G-FRYFVTFTDD 504
+ C+ CV GKQ RV F H TK L YVHSD+ G P+ G +YF++F DD
Sbjct: 443 IKELEFCEDCVYGKQHRVSFAPAQHVTKEKLAYVHSDLWGSPHNPASLGNSQYFISFVDD 502
Query: 505 FSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGI 564
+SRKVW+YF++ K E F KF WK VENQ+ RK+K LR+DNG EY + F FC+E GI
Sbjct: 503 YSRKVWIYFLRKKDEAFEKFVEWKKMVENQSDRKVKKLRTDNGLEYCNHYFEKFCKEEGI 562
Query: 565 QRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASL 624
RH + TPQQNG+AER+NRT+ +K R + + + K WA + A YL+NRSP ++
Sbjct: 563 VRHKTCAYTPQQNGIAERLNRTIMDKVRSMLSRSGMEKKFWAEAASTAVYLINRSPSTAI 622
Query: 625 DGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLW 684
+ + EE WTG DLS+LR F AY+H D+ KL+P+SK I Y +GVKGYK+W
Sbjct: 623 NFDLPEEKWTGALPDLSSLRKFGCLAYIHA---DQGKLNPRSKKGIFTSYPEGVKGYKVW 679
Query: 685 DPVKKKVIVSRDVVFDE*SMLKQ--------------SDVTVVPD------TEVENSSQD 724
KK ++SR+V+F E M K D+ V PD T+ ++QD
Sbjct: 680 VLEDKKCVISRNVIFREQVMFKDLKGDSQNTISESDLEDLRVNPDMNDQEFTDQGGATQD 739
Query: 725 KI---QVDIEETPVSPRQIVAQQQSEPGSDSGEVQD---YTLVRDREPSRITPPVRYGFE 778
+ PV +Q + DS V+D Y LVRDR I +Y
Sbjct: 740 NSNPSEATTSHNPVLNSPTHSQDEESEEEDSDAVEDLSTYQLVRDRVRRTIKANPKYNES 799
Query: 779 DLAAYALLTSSG---DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRV 835
++ +A + +P +Y EA+ + +KW +AM EEM S+ KN T +LV P ++
Sbjct: 800 NMVGFAYYSEDDGKPEPKSYQEALLDPDWEKWNAAMKEEMVSMSKNHTWDLVTKPEKVKL 859
Query: 836 IGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVAS 895
IGC+WV+ +K + E +F A LV KG++Q +G+DY+EIFSPVV+H SIR +L++V
Sbjct: 860 IGCRWVFTRKAGIPGVEAPRFIARLVAKGFTQKEGVDYNEIFSPVVKHVSIRYLLSMVVH 919
Query: 896 MDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKR 955
+M L+QMDVKT FLHG LEE+IY+ QPEGF VC LKRSLYGLKQSPRQW R
Sbjct: 920 YNMELQQMDVKTAFLHGFLEEEIYMAQPEGFEIKRGSNKVCLLKRSLYGLKQSPRQWNLR 979
Query: 956 FDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEF 1015
FD +M I Y R YD CVY + ++I+LLLYVDDMLIA+ + +VNELK L +EF
Sbjct: 980 FDEFMRGIKYTRSAYDSCVYFKKCNGDTYIYLLLYVDDMLIASANKSEVNELKQLLSREF 1039
Query: 1016 DMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSL 1075
+MKDLG AKKILGMEI +DR A L LSQ+ YV+ VL F M A VSTPL HFKL
Sbjct: 1040 EMKDLGDAKKILGMEISRDRDAGLLTLSQEGYVKKVLRSFQMDNAKPVSTPLGIHFKLKA 1099
Query: 1076 EQSPKIDSEIEGMSKIPYA--VQLVV*CMLWFALDQIW---------------HKQLVKC 1118
+ + + E M +PYA + ++ M+ D + H Q VK
Sbjct: 1100 ATDKEYEEQFERMKIVPYANTIGSIMYSMIGTRPDLAYSLGVISRFMSKPLKDHWQAVKW 1159
Query: 1119 QVHVQ--AREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYN 1176
+ + L + + +L GY S Y R I G
Sbjct: 1160 VLRYMRGTEKKKLCFRKQEDFLLRGYCDSDYGSNFDTR-----RSITGY----------- 1203
Query: 1177 RICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQL 1235
+++ GG I WKS +Q VA+S+TEAEYMA+ EA KEALWL G ELG Q V++
Sbjct: 1204 ---VFTVGGNTISWKSKLQKVVAISSTEAEYMALTEAVKEALWLKGFAAELGHSQDYVEV 1260
Query: 1236 HCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTS 1295
H DSQSAI L N V+H RTKHID+R H IR+++ + I + KI T N + TK V
Sbjct: 1261 HSDSQSAITLAKNSVHHERTKHIDIRLHFIRDIICAGLIKVVKIATECNPANIFTKTVPL 1320
Query: 1296 DKF 1298
KF
Sbjct: 1321 AKF 1323
>UniRef100_P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94
[Contains: Protease (EC 3.4.23.-); Reverse transcriptase
(EC 2.7.7.49); Endonuclease] [Nicotiana tabacum]
Length = 1328
Score = 853 bits (2204), Expect = 0.0
Identities = 478/1129 (42%), Positives = 679/1129 (59%), Gaps = 56/1129 (4%)
Query: 3 GAKFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALR--DEKPADIATVDWNEMKEKAAGL 60
G K+EV +F+G F WQR +DLL Q+ L K L +KP + DW ++ E+AA
Sbjct: 3 GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASA 62
Query: 61 ITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHV 120
I L +SDDV+N+I+D T + +W +LESLYMSKT NKL+ K++LY+L M EG + +H+
Sbjct: 63 IRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHL 122
Query: 121 YAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLPH 180
FN ++ + LGV +++EDKAI+LL SLP SYD+L TT+ +GK +I L ++S LL +
Sbjct: 123 NVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLLN 182
Query: 181 AQRRQSVEEGGGSSGEGLFVKG-GQDRGRGKGKAVDSGKKKRSKSKDR-KTTECYSCKQI 238
+ R+ E + G+ L +G G+ R SG + +SK++ + + CY+C Q
Sbjct: 183 EKMRKKPE----NQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQP 238
Query: 239 GHWKRDCPN-RSGK---SG--NNSSSANVVQSDGSC-----SEEDLLCVSSVKCTDAWVL 287
GH+KRDCPN R GK SG N+ ++A +VQ++ + EE+ + +S + WV+
Sbjct: 239 GHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPE--SEWVV 296
Query: 288 DSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYV 347
D+ S+H T R+ F + +GD G V +G+ I G+ + I + G L VR+V
Sbjct: 297 DTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHV 356
Query: 348 LEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDV 407
++ NLIS L +GY + R++KG++ + + G +Y+ G++
Sbjct: 357 PDLRMNLISGIALDRDGYESYFANQK--WRLTKGSLVIAKGV-ARGTLYRTNAEICQGEL 413
Query: 408 ASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKT 467
+ + D+ + LWH R+GH+SE+G+ L K++++ + + C YC+ GKQ RV F+T
Sbjct: 414 NAAQ-DEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQT 472
Query: 468 GHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLW 527
+ ILD V+SDV GP + S+ G +YFVTF DD SRK+WVY +K K +VF F+ +
Sbjct: 473 SSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKF 532
Query: 528 KAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTL 587
A VE +TGRK+K LRSDNG EYT + F +C +GI+ +V TPQ NGVAERMNRT+
Sbjct: 533 HALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTI 592
Query: 588 TEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFW 647
EK R + A L K W + ACYL+NRSP L ++ E VWT + S+L++F
Sbjct: 593 VEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFG 652
Query: 648 RPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQ 707
A+ H+ E R+KLD KS CI IGY GY+LWDPVKKKVI SRDVVF E +
Sbjct: 653 CRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTA 712
Query: 708 SDVT------------VVPDTEVENSSQDKIQVDIEETPVSPRQIVAQ-QQSEPGSD--- 751
+D++ +P T +S + ++ E P +++ Q +Q + G +
Sbjct: 713 ADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVE 772
Query: 752 ---SGEVQDYTLVRDREP---SRITPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKD 805
GE Q L R P SR P Y L++ +P + E ++ EK+
Sbjct: 773 HPTQGEEQHQPLRRSERPRVESRRYPSTE--------YVLISDDREPESLKEVLSHPEKN 824
Query: 806 KWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGY 865
+ M AM EEMESL+KN T LV+LP GKR + CKWV+K K K ++KA LV KG+
Sbjct: 825 QLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCK-LVRYKARLVVKGF 883
Query: 866 SQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEG 925
Q KGID+DEIFSPVV+ TSIR +L+L AS+D+ +EQ+DVKT FLHG+LEE+IY+EQPEG
Sbjct: 884 EQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEG 943
Query: 926 FSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFI 985
F G +VCKL +SLYGLKQ+PRQWY +FDS+M Y + D CVY + +FI
Sbjct: 944 FEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFI 1003
Query: 986 FLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQK 1045
LLLYVDDMLI + +LK L K FDMKDLG A++ILGM+I ++R ++KLWLSQ+
Sbjct: 1004 ILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQE 1063
Query: 1046 SYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
Y+E VL RF+M A VSTPL H KLS + P E M+K+PY+
Sbjct: 1064 KYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYS 1112
Score = 125 bits (315), Expect = 7e-27
Identities = 61/115 (53%), Positives = 80/115 (69%)
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
GG I W+S +Q VA+STTEAEY+A E KE +WL ++ELG+ Q ++CDSQSAI
Sbjct: 1202 GGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAI 1261
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
L+ N +YHARTKHIDVR+H IRE++ + + KI T+EN D LTK V +KF
Sbjct: 1262 DLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKF 1316
>UniRef100_Q9SHR5 F28L22.3 protein [Arabidopsis thaliana]
Length = 1356
Score = 845 bits (2182), Expect = 0.0
Identities = 516/1382 (37%), Positives = 754/1382 (54%), Gaps = 133/1382 (9%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDWNEMK---------- 54
+ E+ F+G +F LW+ + L L+ L D + +E K
Sbjct: 7 RVEIKVFNGDRDFSLWKIRIQAQLGVLGLKDTLTDFSLTKTVPLTKSEAKQESGDGESSG 66
Query: 55 -------------EKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFA 101
E+A +I +SD V+ + T D+W L YM + N+++
Sbjct: 67 TKEVPDPVKIEQSEQAKNIIINHISDVVLLKVNHYATTADLWATLNKKYMETSLPNRIYT 126
Query: 102 KQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTL 161
+ +LYS KM + +V F I+A++ L + VD+E +AI++L SLP S+ L TL
Sbjct: 127 QLKLYSFKMVSTMTIDQNVDEFLRIVAELGSLEIQVDEEVQAILILNSLPASHIQLKHTL 186
Query: 162 TYGKDSITLDSISSTLLPHAQRRQSVEEGGGSSGEGL----------FVKGGQDRGRGKG 211
YG ++T+ ++S+ + R+ E G+ V+ Q G+GKG
Sbjct: 187 KYGNKTLTVQDVTSSA--KSLERELAEAVDLDKGQAAVLYTTERGRPLVRNNQKGGQGKG 244
Query: 212 KAVDSGKKKRSKSKDRKTTECYSCKQIGHWKRDCPNRSGK-SGNNSSSANVVQSDGSCSE 270
RS+S + C+ CK+ GH K+DC +R K A V+ SE
Sbjct: 245 ---------RSRSNSKTKVPCWYCKKEGHVKKDCYSRKKKMESEGQGEAGVITEKLVFSE 295
Query: 271 EDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVK 330
L V+ D W+LDSGC+ HMT R+WF SF+ + LGDD +G ++
Sbjct: 296 A--LSVNEQMVKDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHSVESQGQGTIR 353
Query: 331 IALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKR 390
I G ++ L V+YV + +NLIS GTL + GY + E + +R K T +R
Sbjct: 354 IDTHGGTIKILENVKYVPHLRRNLISTGTLDKLGYRHEGGEGK--VRYFKNNKTALRGSL 411
Query: 391 TAGNIYKLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIG 450
+ G +Y L G T+M ++ + ETD T LWH RLGH+S + L + ++ +
Sbjct: 412 SNG-LYVLDGSTVMSELCNAETDKVKTALWHSRLGHMSMNNLKVLAGKGLIDRKEINELE 470
Query: 451 LCKYCVLGKQCRVRFKTGHHKTKGILDYVHSDVRG-PTKEPSV*GFRYFVTFTDDFSRKV 509
C++CV+GK +V F G H ++ L YVH+D+ G P PS+ G +YF++ DD +RKV
Sbjct: 471 FCEHCVMGKSKKVSFNVGKHTSEDALSYVHADLWGSPNVTPSISGKQYFLSIIDDKTRKV 530
Query: 510 WVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFS 569
W+YF+K K E F KF WK+ VENQ +K+K LR+DNG E+ + F +C+E+GI+RH +
Sbjct: 531 WLYFLKSKDETFDKFCEWKSLVENQVNKKVKCLRTDNGLEFCNSRFDSYCKEHGIERHRT 590
Query: 570 VRKTPQQNGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVA 629
TPQQNGVAERMNRT+ EK RCL + + + WA A YL+NRSP ++++ V
Sbjct: 591 CTYTPQQNGVAERMNRTIMEKVRCLLNKSGVEEVFWAEAAATAAYLINRSPASAINHNVP 650
Query: 630 EEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKK 689
EE+W +LR F AYVH +D+ KL P++ +GY G KGYK+W ++
Sbjct: 651 EEMWLNRKPGYKHLRKFGSIAYVH---QDQGKLKPRALKGFFLGYPAGTKGYKVWLLEEE 707
Query: 690 KVIVSRDVVFDE*SMLKQSDVTVVPD-----TEVENSSQDKIQVDIEETPVSPRQIVAQQ 744
K ++SR+VVF E + + D+ V D + E +S + Q E S I Q
Sbjct: 708 KCVISRNVVFQESVVYR--DLKVKEDDTDNLNQKETTSSEVEQNKFAEASGSGGVIQLQS 765
Query: 745 QSEP------GSDSGEVQDYT----------------LVRDREPSRITPPVRYGFEDLAA 782
SEP SDS E +Y+ L RDR I PP R+ E
Sbjct: 766 DSEPITEGEQSSDSEEEVEYSEKTQETPKRTGLTTYKLARDRVRRNINPPTRFTEESSVT 825
Query: 783 YALLTSSG----DPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGC 838
+AL+ +P +Y EAM S + +KW A +EM+SL KN T +LV P +++IGC
Sbjct: 826 FALVVVENCIVQEPQSYQEAMESQDCEKWDMATHDEMDSLMKNGTWDLVDKPKDRKIIGC 885
Query: 839 KWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDM 898
+W++K K + E +FKA LV KGY+Q +G+DY EIF+PVV+H SIR++++LV D+
Sbjct: 886 RWLFKLKSGIPGVEPTRFKARLVAKGYTQREGVDYQEIFAPVVKHVSIRILMSLVVDKDL 945
Query: 899 HLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDS 958
LEQMDVKTTFLHG+LEE++Y+EQPEGF VC+LK+SLYGLKQSPRQW KRFD
Sbjct: 946 ELEQMDVKTTFLHGDLEEELYMEQPEGFVSDSSENKVCRLKKSLYGLKQSPRQWNKRFDR 1005
Query: 959 YMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMK 1018
+M + R ++D CVYV + + FI+LLLYVDDMLIA ++N +K +L EF+MK
Sbjct: 1006 FMSSQQFIRSEHDACVYVKHVSEHDFIYLLLYVDDMLIAGASKAEINRVKEQLSTEFEMK 1065
Query: 1019 DLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSL--E 1076
D+G A +ILG++I++DR L LSQ+ Y+ VL RF+MS A + P+ HFKL+ E
Sbjct: 1066 DMGGASRILGIDIYRDRKGGVLKLSQEIYIRKVLDRFNMSGAKMTNAPVGAHFKLAAVRE 1125
Query: 1077 QSPKIDSEIEGMSKIPY--AVQLVV*CMLWFALDQIW---------------HKQLVKCQ 1119
+ +D+++ +PY AV ++ ML D + H + VK
Sbjct: 1126 EDECVDTDV-----VPYSSAVGSIMYAMLGTRPDLAYAICLISRYMSKPGSMHWEAVKWV 1180
Query: 1120 VHV--QAREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGICGLCR*SR**KVYNR 1177
+ A++ L ++ + GY S Y A I G
Sbjct: 1181 MRYLKGAQDLNLVFTKEKDFTVTGYCDSNY-----AADLDRRRSISGY------------ 1223
Query: 1178 ICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLH 1236
+++ GG + WK+S+Q VAMSTTEAEY+A+AEAAKEA+W+ GL++++G++Q V++
Sbjct: 1224 --VFTIGGNTVSWKASLQPVVAMSTTEAEYIALAEAAKEAMWIKGLLQDMGMQQDKVKIW 1281
Query: 1237 CDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSD 1296
CDSQSAI L+ N VYH RTKHIDVRF+ IR+++ S + + KIHTS N D LTK + +
Sbjct: 1282 CDSQSAICLSKNSVYHERTKHIDVRFNYIRDVVESGDVDVLKIHTSRNPVDALTKCIPVN 1341
Query: 1297 KF 1298
KF
Sbjct: 1342 KF 1343
>UniRef100_Q9AU17 Polyprotein-like [Lycopersicon chilense]
Length = 1328
Score = 811 bits (2096), Expect = 0.0
Identities = 468/1129 (41%), Positives = 655/1129 (57%), Gaps = 55/1129 (4%)
Query: 3 GAKFEVTRFDGTGN-FGLWQRMAKDLLAQKSLQKAL--RDEKPADIATVDWNEMKEKAAG 59
G K+EV +F+G F +WQR KDLL Q+ L KAL + +KP + DW E+ EKAA
Sbjct: 3 GVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAAS 62
Query: 60 LITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAH 119
I L ++DDV+N+I+D + +W KLE+LYMSKT NKL+ K++LY+L M EG + +H
Sbjct: 63 AIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSH 122
Query: 120 VYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSITLDSISSTLLP 179
+ N ++ + LGV +++EDK I+LL SLP SYD L TT+ +GKDSI L ++S LL
Sbjct: 123 LNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKDSIQLKDVTSALLL 182
Query: 180 HAQRRQSVEEGG--------GSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTE 231
+ + R+ E G G S + G+ RGK K RSKSK R
Sbjct: 183 NEKMRKKPENHGQVFITESRGRSYQRSSSNYGRSGARGKSKV-------RSKSKARN--- 232
Query: 232 CYSCKQIGHWKRDCPN-------RSGKSGNNSSSANVVQSDGSCS--EEDLLCVSSVKCT 282
CY+C Q GH+KRDCPN SG+ +++++A V +D E+ C+
Sbjct: 233 CYNCDQPGHFKRDCPNPKRGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTE 292
Query: 283 DAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLS 342
WV+D+ SYH T R+ F + +GD G V +G+ I G+ + + G L
Sbjct: 293 SEWVVDTAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLK 352
Query: 343 QVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGT 402
VR+V ++ NLIS L ++GY + R++KGA+ + + G +Y+
Sbjct: 353 DVRHVPDLRMNLISGIALDQDGYENYFANQK--WRLTKGALVIAKGV-ARGTLYRTNAEI 409
Query: 403 IMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCR 462
G++ + ++ A LWH R+GH SE+G+ L K++++ + I C Y + GKQ R
Sbjct: 410 CQGELNAAHEENSAD-LWHKRMGHTSEKGLQILSKKSLISFTKGTTIKPCNYWLFGKQHR 468
Query: 463 VRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFA 522
V F+T + ILD V+SDV GP + S+ G +YFVTF DD SRK+WVY + K +VF
Sbjct: 469 VSFQTSSERKSNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIFRAKDQVFQ 528
Query: 523 KFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAER 582
F+ + A VE +TGRK K LR+DNG EYT + F +C +GI+ +V TPQ NGVAER
Sbjct: 529 VFQKFHALVERETGRKRKRLRTDNGGEYTSREFEEYCSNHGIRHEKTVPGTPQHNGVAER 588
Query: 583 MNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSN 642
MNRT+ EK R + A L K W + ACYL+NRSP L+ + E VWT + S+
Sbjct: 589 MNRTIVEKVRSMLRMAKLPKTFWGEAVRTACYLINRSPSVPLEFDIPERVWTNKEMSYSH 648
Query: 643 LRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE* 702
L++F A+ H+ E R+KLD KS CI IGY GY+LWD VKKKVI SRDV+F E
Sbjct: 649 LKVFGCKAFAHVPKEQRTKLDDKSVPCIFIGYGDEEFGYRLWDLVKKKVIRSRDVIFRES 708
Query: 703 SMLKQSDVT--------------VVPDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEP 748
+ +D++ +P + +S + ++ E P +IV +Q E
Sbjct: 709 EVGTAADLSEKAKKKNGIIPNLVTIPSSSNHPTSAESTIDEVVEQEEQPDEIV--EQGEQ 766
Query: 749 GSDSGEVQDYTLVRDREPSRITPPVRYGFEDL--AAYALLTSSGDPSTYHEAMAS*EKDK 806
D+ E +Y +P R + R + Y L+ G+P E ++ EK +
Sbjct: 767 LGDNTEQMEYPEEEQSQPLRRSERQRVESTKYPSSEYVLIKYEGEPENLKEVLSHPEKSQ 826
Query: 807 WMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYS 866
WM AM EEM SL+KN T LV+LP GKR + CKWV+K K K ++KA LV KG+
Sbjct: 827 WMKAMHEEMGSLQKNGTYQLVELPKGKRPLKCKWVFKLKKDGNGK-LVRYKARLVVKGFE 885
Query: 867 QHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGF 926
Q KGID+DEIFSPVV+ TSIR +L++ AS+D+ +EQ+DVKT FLHG+LEE+IY+EQ EGF
Sbjct: 886 QKKGIDFDEIFSPVVKMTSIRTILSIAASLDLEVEQLDVKTAFLHGDLEEEIYMEQGEGF 945
Query: 927 SETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIF 986
+G +VCKL +SLYGLKQ+PRQWYK+FDS+M YR CVY D +FI
Sbjct: 946 EVSGKKHMVCKLNKSLYGLKQAPRQWYKKFDSFMKSQTYRNTYSHPCVYFKRFSDKNFII 1005
Query: 987 LLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKS 1046
LLLY D MLI + +L+ K FDMKDLG AK+ILGM+I ++ KKL LS +
Sbjct: 1006 LLLYTDYMLIVGKDKELIAKLRKDFSKSFDMKDLGPAKQILGMKIAREE-QKKLGLSHEK 1064
Query: 1047 YVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSP-KIDSEIEGMSKIPYA 1094
Y+E VL RF+M A +STPL ++ KL+ + P K E M+K+PY+
Sbjct: 1065 YIERVLERFNMKSAKPISTPLVSYLKLTKQMFPTKKKGEKGDMAKVPYS 1113
Score = 112 bits (279), Expect = 1e-22
Identities = 55/115 (47%), Positives = 78/115 (67%)
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
GG I W+S +Q VA+STTEA+Y+A E KE LWL ++E G+ Q ++C+SQSA+
Sbjct: 1202 GGDISWQSKLQKYVALSTTEAKYIAGTEVCKEMLWLKRFLQEHGLHQKEYVVYCESQSAM 1261
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKF 1298
L+ +YHA TKHID+R+H IRE++ + + KI TSEN D +TK V ++KF
Sbjct: 1262 DLSKKAMYHATTKHIDMRYHWIREMVDDGSLQVVKIPTSENPADMVTKVVQNEKF 1316
>UniRef100_O81903 Putative transposable element [Arabidopsis thaliana]
Length = 1308
Score = 795 bits (2054), Expect = 0.0
Identities = 488/1347 (36%), Positives = 741/1347 (54%), Gaps = 108/1347 (8%)
Query: 5 KFEVTRFDGTGNFGLWQRMAKDLLAQKSLQKALRD------------EKPADIATVDWNE 52
K E+ F+G +F LW+ + L L+ AL D EK + D +
Sbjct: 6 KVEIKTFNGDRDFSLWKIRIEAQLGVLGLKPALSDFTLTKTILVVKSEKKESESEDDETD 65
Query: 53 MK--EKAAGLITLCVSDD----VMNHILDLTTLK--------DVWDKLESLYMSKTPMNK 98
K E+ I SD ++NHI D LK ++W L L+M + N+
Sbjct: 66 SKKTEEVPDPIKFEQSDQAKNFIINHITDTVLLKVQHCVTAAELWATLNKLFMETSLPNR 125
Query: 99 LFAKQRLYSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLV 158
++ + RLYS KM + + + F I+A++ L + V +E +AI++L SLP SY L
Sbjct: 126 IYTQLRLYSFKMVDNLSIDQNTDEFLRIVAELGSLQIQVGEEVQAILILNSLPPSYIQLK 185
Query: 159 TTLTYGKDSITL-DSISSTLLPHAQRRQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSG 217
TL YG ++++ D +SS + + E + L+ +RGR + K
Sbjct: 186 HTLKYGNKTLSVQDVVSSAKSLERELSEQKETIRAPASTALYTA---ERGRPQTKNTQGQ 242
Query: 218 KKKRSKSKDRKTTECYSCKQIGHWKRDCPNRSGKSGNNSSS-ANVVQSDGSCSEEDLLCV 276
K R +S + C+ CK+ GH K+DC K N A V+ SE L +
Sbjct: 243 GKGRGRSNSKSRLTCWFCKKEGHVKKDCYAGKRKLENEGQGKAGVITEKLVYSEA--LSM 300
Query: 277 SSVKCTDAWVLDSGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDG 336
+ D WV+DSGC+YHMT +WF+ F + + LGDD KG VK+ G
Sbjct: 301 YDQEAKDKWVIDSGCTYHMTSRMDWFSEFNENETTMILLGDDHTVESKGSGTVKVNTHGG 360
Query: 337 GVRTLSQVRYVLEVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIY 396
+R L VR+V + +NLIS GTL + GY + + + +R K T + G +Y
Sbjct: 361 SIRVLKNVRFVPNLRRNLISTGTLDKLGYKHEGGDGK--VRFYKENKTALCGNLVNG-LY 417
Query: 397 KLLGGTIMGDVASVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCV 456
L G T++ + +VE ++ T+LWH RLGH+S M L ++ +L+ + C+ CV
Sbjct: 418 VLDGHTVVNENCNVEGSNEKTELWHCRLGHMSLNNMKILAEKGLLEKKDIKELSFCENCV 477
Query: 457 LGKQCRVRFKTGHHKTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKY 516
+GK ++ F G H T +L Y+H+D+ G +YF++ DD SRKVW+ F+K
Sbjct: 478 MGKSKKLSFNVGKHITDEVLGYIHADLWGK---------QYFLSIIDDKSRKVWLMFLKT 528
Query: 517 KSEVFAKFKLWKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQ 576
K E F +F WK VENQ +K+K LR+DNG E+ + F FC++NGI+RH + TPQQ
Sbjct: 529 KDETFERFCEWKELVENQVNKKVKILRTDNGLEFCNLKFDEFCKQNGIERHRTCTYTPQQ 588
Query: 577 NGVAERMNRTLTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGN 636
NGVA+RMNRTL EK RCL + L + WA A YLVNRSP +++D V EE+W
Sbjct: 589 NGVAKRMNRTLMEKVRCLLNESGLEEVFWAEAAATAAYLVNRSPASAVDHNVPEELWLDK 648
Query: 637 PIDLSNLRIFWRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRD 696
+LR F AYVH+ D+ KL P++ + +GY +G KGYK+W ++K ++SR+
Sbjct: 649 KPGYKHLRRFGCIAYVHL---DQGKLKPRALKGVFLGYPQGTKGYKVWLLDEEKCVISRN 705
Query: 697 VVFDE*SMLK------QSDVTVVPDTEVENSSQDKIQ----------VDIEETPVSPRQI 740
+VF+E + K + V + D E N Q ++ V IEE
Sbjct: 706 IVFNENQVYKDIRESSEQSVKDISDLEGYNEFQVSVKEHGECSKTGGVTIEEIDQESDSE 765
Query: 741 VAQQQSEPGSDSGEVQDYTLVRDREPSRITPPVRYGFEDLAAYALLTS----SGDPSTYH 796
+ Q EP S ++ +Y RDRE PP + A AL+ + S +P YH
Sbjct: 766 NSVTQ-EPLIASIDLSNYQSARDRERRAPNPPQKLADYTHFALALVMAEEIESEEPQCYH 824
Query: 797 EAMAS*EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKF 856
+A KW M EE++SL KN T ++V+ P ++VI C+W++K K + E +++
Sbjct: 825 DAKKDKHWIKWNGGMKEEIDSLLKNGTWDIVEWPKEQKVISCRWLFKLKPGIPGVEAQRY 884
Query: 857 KAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEE 916
KA LV +G++Q KGIDY+E+F+PVV+H SIR++++ V DM LEQMDVKTTFLHG L++
Sbjct: 885 KARLVARGFTQQKGIDYEEVFAPVVKHISIRILMSAVVKDDMELEQMDVKTTFLHGELDQ 944
Query: 917 QIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYV 976
+Y+EQPEGF + VC LK+SLYGLKQ+PRQW K+F ++ML + + R ++D CVYV
Sbjct: 945 VLYMEQPEGFEVNPEKDQVCLLKKSLYGLKQAPRQWNKKFHAFMLSLQFARSEHDSCVYV 1004
Query: 977 MSLDDGSFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRG 1036
++ G F++LLLYVDDML+AA ++++LK L +F+MKD+GAA +ILG++I ++R
Sbjct: 1005 KEVNPGEFVYLLLYVDDMLLAAKSKSEISKLKEALSLKFEMKDMGAASRILGIDIIRNRK 1064
Query: 1037 AKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTNHFKLS--LEQSPKIDSEIEGMSKIPY- 1093
L LSQ YV+ V+ RF M+ A VSTP+ HFKL+ +++ +D E+ +PY
Sbjct: 1065 EGTLRLSQTRYVDKVIQRFRMADAKVVSTPMGAHFKLTSLIDEIGSVDPEV-----VPYS 1119
Query: 1094 -AVQLVV*CMLWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QG 1152
AV V+ M+ D + LV +R + + ++ ++GY S +
Sbjct: 1120 SAVGSVMYAMIGTIPDVAYAMGLV-------SRFMSRPGANLE---VQGYCDSDH----- 1164
Query: 1153 ARCCSISCGICGLCR*SR**KVYNRICLYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAE 1211
A I G +++ GG + WKSS+Q VA+S+T+AE++A+ E
Sbjct: 1165 AADLDKRRSISGY--------------VFTVGGNTVSWKSSLQHVVALSSTQAEFIALTE 1210
Query: 1212 AAKEALWLTGLVKELGVEQGGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLAS 1271
A KEA+W+ GL++++G++ + CDSQSAI L+ N +H RTKH++V+F+ IR+++ +
Sbjct: 1211 AVKEAIWIRGLLEDMGLQPKPATVWCDSQSAICLSKNNAFHDRTKHVEVKFYFIRDIIEA 1270
Query: 1272 RQILLQKIHTSENTTDKLTKPVTSDKF 1298
++ ++KIHTS N D LTK + KF
Sbjct: 1271 GEVKVRKIHTSVNPADMLTKCIPVKKF 1297
>UniRef100_Q9ZRJ0 ORF [Nicotiana tabacum]
Length = 1338
Score = 782 bits (2020), Expect = 0.0
Identities = 463/1132 (40%), Positives = 652/1132 (56%), Gaps = 69/1132 (6%)
Query: 7 EVTRFDGTGNFGLWQRMAKDLL-AQKSLQKALRDEKPADIATVDWNEMKEKAAGLITLCV 65
++ +GT N+ LW+ KDLL K +KP D + DW + G I V
Sbjct: 7 KMVNLNGT-NYHLWRNKMKDLLFVTKMHLPVFSSQKPEDKSDEDWEFEHNQVCGYIRQFV 65
Query: 66 SDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKLFAKQRLYSLKMQEGGDLQAHVYAFNN 125
D+V NHI +T + +WDKLE LY SKT NKLF +L +K EG + H+
Sbjct: 66 EDNVYNHISGVTHARSLWDKLEELYASKTGNNKLFYLTKLMQVKYVEGTTVADHLNEIQG 125
Query: 126 ILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDS--ITLDSISSTLLPHAQR 183
I+ ++ +G+ DDE A+++L +LP S++ L ++T + + ++++ S +L R
Sbjct: 126 IVDQLSGMGIKFDDEVLALMVLATLPESWETLKVSITNSAPNGVVNMETVKSGILNEEMR 185
Query: 184 RQSVEEGGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTECYSCKQIGHWKR 243
R+S G+S V RGR + K+ + K R KS EC+ CK+ GH KR
Sbjct: 186 RRSQ----GTSSSQSEVLAVTTRGRSQNKSQSNRDKSRGKSNKFANVECHYCKKKGHIKR 241
Query: 244 DC--------PNRSGKSGNNSSSANVVQSDGSCS---EEDLLCVSSVKCTDAWVLDSGCS 292
C N+ K SS + S G + ++D++ +++ + T WV+DSG +
Sbjct: 242 FCRQFQNDQKKNKGKKVKPEESSDDETNSFGEFNVVYDDDIINLTTQEMT--WVIDSGAT 299
Query: 293 YHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMK 352
H T RE F+S+ GD G V +G+ + G V + +G L VR+V ++
Sbjct: 300 IHATPRRELFSSYTLGDFGRVKMGNANFSTVVGKGDVCLETMNGMKLLLRDVRHVPDMRL 359
Query: 353 NLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVET 412
NLIS+ L E GY + +++KG++ V R + + +Y + +I V +V
Sbjct: 360 NLISVDKLDEEGYCNTFHNGQ--WKLTKGSLMVARGTKQS-KLY-VTQASISQQVINVAE 415
Query: 413 DDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKT-GHHK 471
+D KLWH RLGH+SE+ M L K+N L G+ + C C+ GKQ RV FK +
Sbjct: 416 NDSNIKLWHRRLGHMSEKSMARLVKKNALPGLNQIQLKKCADCLAGKQNRVSFKRFPPSR 475
Query: 472 TKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEV 531
+ +LD VHSDV GP K+ S+ G RYFVTF DD SRK WVY +K K +VF FK + V
Sbjct: 476 RQNVLDLVHSDVCGPFKK-SLGGARYFVTFIDDHSRKTWVYTLKTKDQVFQVFKQFLTLV 534
Query: 532 ENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKA 591
E +TG+K+K +R+DNG EY + F +C+E+GI+ F+ KTPQ NG+AERMNRTL E+
Sbjct: 535 ERETGKKLKCIRTDNGGEYQGQ-FDAYCKEHGIRHQFTPPKTPQLNGLAERMNRTLIERT 593
Query: 592 RCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAY 651
RCL ++ L K W + A Y++N SP L K E++W G I LR+F AY
Sbjct: 594 RCLLSHSKLPKAFWGEALVTAAYVLNHSPCVPLQYKAPEKIWLGRDISYDQLRVFGCKAY 653
Query: 652 VHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVT 711
VH+ ++RSKLD K++ C+ IGY + + GYK +DPV+KK++ SRDVVF E D T
Sbjct: 654 VHVPKDERSKLDVKTRECVFIGYGQDMLGYKFYDPVEKKLVRSRDVVFVE-------DQT 706
Query: 712 VVPDTEVENSSQDKIQVDIEETPVSPRQI---VAQQQSE-PG-------------SDSGE 754
+ +VE S+ D + ++ T V PRQ+ V Q E PG D+G+
Sbjct: 707 IEDIDKVEKSTDDSAEFELPPTVV-PRQVGDDVQDNQPEAPGLPNEDELADTEGNEDNGD 765
Query: 755 VQDYTLVRDREPSRITPP------------VRYGFEDLAAYALLTSSGDPSTYHEAMAS* 802
+ + P PP RY + Y LLT G+P ++ EA+
Sbjct: 766 DDADEEDQPQPPILNNPPYHTRSGRVVQQSTRYSPHE---YVLLTDGGEPDSFEEAIDDE 822
Query: 803 EKDKWMSAMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVT 862
K+KW+ AM +E++SL +N+T LV+LP GKR + KWV+K K +FKA LV
Sbjct: 823 HKEKWIEAMQDEIKSLHENKTFELVKLPKGKRALKNKWVFKMKHDEH-NSLPRFKARLVV 881
Query: 863 KGYSQHKGIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQ 922
KG++Q KGID+DEIFSPVV+ TSIR VL L AS+++ +EQMDVKT FLHG+LEE+IY+EQ
Sbjct: 882 KGFNQRKGIDFDEIFSPVVKMTSIRTVLGLAASLNLEVEQMDVKTAFLHGDLEEEIYMEQ 941
Query: 923 PEGFSETGDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDG 982
P+GF + G VC+L++SLYGLKQ+PRQWYK+F+S M + GY++ D CV+ D
Sbjct: 942 PDGFQQKGKEDYVCRLRKSLYGLKQAPRQWYKKFESVMGQHGYKKTTSDHCVFAQKFSDD 1001
Query: 983 SFIFLLLYVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWL 1042
FI LLLYVDDMLI ++ +N LK +L K F MKDLG AK+ILGM I +DR AKKLWL
Sbjct: 1002 DFIILLLYVDDMLIVGRNVSRINSLKEQLSKFFAMKDLGPAKQILGMRIMRDREAKKLWL 1061
Query: 1043 SQKSYVEGVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
SQ+ Y+E VL RF+M K VS PL NHF+LS +QSP D E M +IPYA
Sbjct: 1062 SQEKYIEKVLQRFNMEKTKAVSCPLANHFRLSTKQSPSTDDERRKMERIPYA 1113
Score = 129 bits (324), Expect = 6e-28
Identities = 60/117 (51%), Positives = 82/117 (69%)
Query: 1184 GGPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCDSQSAI 1243
GG + W+S +Q VA+STTEAE++A EA KE +W+ + ELG Q G QL CDSQSAI
Sbjct: 1203 GGAVSWQSKLQKCVALSTTEAEFIAATEACKELIWMKKFLTELGFSQDGYQLFCDSQSAI 1262
Query: 1244 YLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKFMF 1300
+L N +H+R+KHIDVR++ IR++L + + L+KIHT EN +D LTK + KF F
Sbjct: 1263 HLAKNASFHSRSKHIDVRYNWIRDVLEKKMLRLEKIHTDENGSDMLTKTLPKGKFEF 1319
>UniRef100_Q7XFG2 Putative retroelement [Oryza sativa]
Length = 1225
Score = 664 bits (1713), Expect = 0.0
Identities = 388/928 (41%), Positives = 537/928 (57%), Gaps = 142/928 (15%)
Query: 204 QDRGRGKGKAVDSGKKKRS------KSKDR-KTTECYSCKQIGH-----WK-RDCPNRSG 250
+DR R + K +S + +S +SK R + C CK+ G+ WK +D R+G
Sbjct: 201 RDRNRQQEKNTNSKSRDKSSSSYHGRSKSRGRYKSCKYCKRDGYDISKCWKLQDKDKRTG 260
Query: 251 K-----SGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDAWVLDSGCSYHMTQHREWFNSF 305
K A VV + S ++ + + +D W+LD+ C+YHM +R+WF ++
Sbjct: 261 KYVPKGKKEEEGKAAVVTDEKSDAKLLVAYAGCAQTSDQWILDTACTYHMCSNRDWFATY 320
Query: 306 KSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHENGY 365
++ G V +GDD PC + G
Sbjct: 321 EAVQGGTVLMGDDTPCEVAG---------------------------------------- 340
Query: 366 SFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVE---TDDDATKLWHM 422
+K IL+V+KG++ VM+A + N+Y L G TI+G+VA+V ++ DAT LWHM
Sbjct: 341 -YKYSGGDGILKVTKGSLVVMKADIKSANLYHLRGTTILGNVAAVSDSLSNSDATNLWHM 399
Query: 423 RLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKGILDYVHSD 482
RLGH+SE G+ EL KR +L G + C++C+ GK RV+F T H T+GILDYVHSD
Sbjct: 400 RLGHMSEIGLAELSKRGLLDGQSIKKLKFCEHCIFGKHKRVKFNTSTHTTEGILDYVHSD 459
Query: 483 VRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGRKIKYL 542
+ GP ++ S G RY +T DD+SRKVW YF+K+K + F FK WK VE QT RK+K L
Sbjct: 460 LWGPARKTSFGGARYMMTIVDDYSRKVWPYFLKHKYQAFDVFKEWKTMVERQTERKVKIL 519
Query: 543 RSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLNACLSK 602
R+DNG ++ K F +C+ GI RH++V TPQQNGVAERMNRT+ KARC+ NA L K
Sbjct: 520 RTDNGMDFCSKIFKSYCKSEGIVRHYTVPHTPQQNGVAERMNRTIISKARCMLSNAGLPK 579
Query: 603 CLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSEDRSKL 662
WA ++ ACYL+NRSP ++D K +VW+G+P + S+LR+F AY H+ D SKL
Sbjct: 580 QFWAEAVSTACYLINRSPSYAIDKKTPIKVWSGSPANYSDLRVFGCIAYAHV---DNSKL 636
Query: 663 DPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SMLKQSDVTVVPDTEVENSS 722
+P++ CI +GY GVKGYKLW P KKV++SR+VVF E ML P T V S
Sbjct: 637 EPRAIKCIFLGYPSGVKGYKLWCPETKKVVISRNVVFHESVMLHDK-----PSTNVPVES 691
Query: 723 QDKIQVDIEETPVS----PRQIVAQQQSEP---GSDSGEVQD---YTLVRDREPSRITPP 772
Q+K V +E S ++ VA P SDS VQ +++ +D+ I PP
Sbjct: 692 QEKASVQVEHLISSGHAPEKENVAINLDAPVIEDSDSSIVQQSPKHSIAKDKPKRNIKPP 751
Query: 773 VRYGFE-DLAAYAL-----LTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NL 826
RY E ++ AYAL + + +PSTY +A+ S + ++W++AM +EMESL+KN T L
Sbjct: 752 RRYIEEANIVAYALSVAEEIEGNVEPSTYSDAIVSDDCNRWITAMHDEMESLEKNHTWEL 811
Query: 827 VQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSI 886
V+LP K+ I CKW++K+K ++ + ++KA +SI
Sbjct: 812 VKLPKEKKPIRCKWIFKRKEGMSPSDEARYKA-------------------------SSI 846
Query: 887 RVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLK 946
R +L++VA D LEQMDVKT FLHG LEE IY+EQPEGF G LVC+LK+SLYGLK
Sbjct: 847 RTLLSIVAMYDYELEQMDVKTAFLHGELEEDIYMEQPEGFVVPGKENLVCRLKKSLYGLK 906
Query: 947 QSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNE 1006
QSPRQWYKRFDS+ML +RR +YD CVY + + DGS I+LLLYV+DMLIAA ++ +
Sbjct: 907 QSPRQWYKRFDSFMLSQKFRRSNYDSCVY-LKVVDGSAIYLLLYVNDMLIAAKDKLEIAK 965
Query: 1007 LKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTP 1066
LK +L EF+MKDLGAAKKILGMEI ++R + KL+LSQK
Sbjct: 966 LKAQLSSEFEMKDLGAAKKILGMEITRERRSGKLYLSQK--------------------- 1004
Query: 1067 LTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
+ P+ D +IE MS++PY+
Sbjct: 1005 ---------DLCPQSDYDIEYMSRVPYS 1023
Score = 104 bits (259), Expect = 2e-20
Identities = 50/96 (52%), Positives = 72/96 (74%), Gaps = 1/96 (1%)
Query: 1180 LYSCGG-PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGGVQLHCD 1238
+++ GG + WK+S+Q+TVA+STT+AEYMA++EA KEA+WL GL EL + + CD
Sbjct: 1063 VFTIGGCAVSWKASLQATVALSTTKAEYMAISEACKEAIWLRGLYTELCGVTSCINIFCD 1122
Query: 1239 SQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQI 1274
SQSAI LT +Q++H RTK+IDVR+H IR ++A +
Sbjct: 1123 SQSAICLTKDQMFHERTKYIDVRYHFIRGVIAEGDV 1158
>UniRef100_Q6L3N8 Putative gag-pol polyprotein [Solanum demissum]
Length = 1333
Score = 655 bits (1689), Expect = 0.0
Identities = 448/1337 (33%), Positives = 671/1337 (49%), Gaps = 99/1337 (7%)
Query: 16 NFGLWQRMAKDLLAQKSLQKALRDEKPADIATVDWNEMKEKAAGLITL--CVSDDVMNHI 73
N+ W K L + L + P A K + L T+ + D++ I
Sbjct: 21 NYQFWSLKMKTLFKSQELWDIVETGIPEGNANQMREHRKRDSKALFTIQQALDDEIFPRI 80
Query: 74 LDLTTLKDVWDKLESLYMS--KTPMNKLFAKQRLY-SLKMQEGGDLQAHVYAFNNILADM 130
+ T K W+ L+ Y K KL +R + +L M E +Q ++ + I+ M
Sbjct: 81 SAVETSKQAWEILKQEYFGDDKVITVKLQTLRRDFETLFMNENESVQGYLSRTSAIVNRM 140
Query: 131 TRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGKDSIT--LDSISSTLLPHAQR----R 184
G +D++ +L SL ++H+VT + KD T D + S+LL H R R
Sbjct: 141 RSYGEKIDNQIVVSKVLRSLTTKFEHVVTAIEESKDLSTYSFDELMSSLLAHEDRLNRSR 200
Query: 185 QSVEE------------GGGSSGEGLFVKGGQDRGRGKGKAVDSGKKKRSKSKDRKTTEC 232
+ V+E G + G G RGRG+G + + + + +C
Sbjct: 201 EKVQEKAFQVKGEFSYKGKAENSAGRGHGRGNFRGRGRGGSGRGRNQVGEFRQYKSNIQC 260
Query: 233 YSCKQIGHWKRDCPNRSGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDA----WVLD 288
CK+ GH + DC K + AN Q+ EE L ++S + T++ W +D
Sbjct: 261 RYCKKFGHKEVDCWT---KQKDEQKDANFTQN---VEEESKLFMASSQITESANAVWFID 314
Query: 289 SGCSYHMTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVL 348
SGCS HM+ + F V LGDDK I+G V+I G V+ L V+YV
Sbjct: 315 SGCSNHMSSSKSLFRDLDESQKSEVRLGDDKQVHIEGKGTVEIKTVQGNVKFLYDVQYVP 374
Query: 349 EVMKNLISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVA 408
+ NL+S+G L +GYS +N ++ + T+ R T ++ L + G+ A
Sbjct: 375 TLAHNLLSVGQLMTSGYSVVFYDNACDIKDKESGRTIARVPMTQNKMFPLDISNV-GNSA 433
Query: 409 SVETDDDATKLWHMRLGHLSERGMMELHKRNMLKGVRSCI-IGLCKYCVLGKQCRVRFKT 467
V + + T LWH+R GHL+ + L +++M+ G+ + + LC+ C+ GKQ R F
Sbjct: 434 LVVKEKNETNLWHLRYGHLNVNWLKLLVQKDMVIGLPNIKELDLCEGCIYGKQTRKSFPV 493
Query: 468 GHH-KTKGILDYVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKL 526
G + L+ VH+D+ GP K S+ G RYF+ FTDD+SR WVYF+K+KSE F FK
Sbjct: 494 GKSWRATTCLELVHADLCGPMKMESLGGSRYFLMFTDDYSRFSWVYFLKFKSETFETFKK 553
Query: 527 WKAEVENQTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRT 586
+KA VENQ+G KIK LR+D G E+ +F FCEENGI+R + TP+QNGVAER NRT
Sbjct: 554 FKAFVENQSGNKIKSLRTDRGGEFLSNDFNLFCEENGIRRELTAPYTPEQNGVAERKNRT 613
Query: 587 LTEKARCLRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIF 646
+ E AR L W + Y +N SP + E W G +S+LRIF
Sbjct: 614 VVEMARSSLKAKGLPDYFWGEAVATVVYFLNISPTKDVWNTTPLEAWNGKKPRVSHLRIF 673
Query: 647 WRPAYVHISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*---- 702
AY ++ SKLD KS CI +GY+ K Y+L++P+ KVI+SR+VVF+E
Sbjct: 674 GCIAYALVNFH--SKLDEKSTKCIFVGYSLQSKAYRLYNPISGKVIISRNVVFNEDVSWN 731
Query: 703 --SMLKQSDVTVVPDTEVE-----NSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSGEV 755
S S++ ++P E NS +P++P VA +S V
Sbjct: 732 FNSGNMMSNIQLLPTDEESAVDFGNSPNSSPVSSSVSSPIAPSTTVAPDESS-------V 784
Query: 756 QDYTLVRDREPSRITPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMSAMVEEM 815
+ L R + P +ALL S DP Y EA+ E+ +W +AM+EE+
Sbjct: 785 EPIPLRRSTREKKPNPKYSNTVNTSCQFALLVS--DPICYEEAV---EQSEWKNAMIEEI 839
Query: 816 ESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDE 875
+++++N T LV P GK VIG KWV++ K +K KA LV KGYSQ +G+D+DE
Sbjct: 840 QAIERNSTWELVDAPEGKNVIGLKWVFRTKYNAD-GSIQKHKARLVAKGYSQQQGVDFDE 898
Query: 876 IFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLV 935
FSPV R ++RVVLAL A + + + Q DVK+ FL+G+LEE++Y+ QP+GF TG+ V
Sbjct: 899 TFSPVARFETVRVVLALAAQLHLPVYQFDVKSAFLNGDLEEEVYVSQPQGFMITGNENKV 958
Query: 936 CKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDML 995
KL+++LYGLKQ+PR WY + DS+ G+RR D + +Y+ F+ + LYVDDM+
Sbjct: 959 YKLRKALYGLKQAPRAWYSKIDSFFQGSGFRRSDNEPTLYLKKQGTDEFLLVCLYVDDMI 1018
Query: 996 IAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRF 1055
+ VN+ K+ + + F+M DLG K LG+E+ +D+ +++SQK Y E +L +F
Sbjct: 1019 YIGSSKSLVNDFKSNMMRNFEMSDLGLLKYFLGLEVIQDKDG--IFISQKKYAEDLLKKF 1076
Query: 1056 DMSKANHVSTPLTNHFKL-----SLEQSPKIDSEIEG--------MSKIPYAVQLVV*CM 1102
M +TP+ + KL + + +PK+ + G I ++V VV
Sbjct: 1077 QMMNCEVATTPMNINEKLQRADGTEKANPKLFRSLVGGLNYLTHTRPDIAFSVS-VVSRF 1135
Query: 1103 LWFALDQIWHKQLVKCQVHVQAREAALGSSQVDPKILEGYNGSRYHV*QGARCCSISCGI 1162
L Q + + + + S+ L G+ S Y A C
Sbjct: 1136 LQSPTKQHFGAAKRVLRYVAGTTDFGIWYSKAPNFRLVGFTDSDY-----AGCLDDRKST 1190
Query: 1163 CGLCR*SR**KVYNRICLYSCG-GPICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTG 1221
G C +S G G + W S Q TVA+ST+EAEY A + AA++ALWL
Sbjct: 1191 SGSC--------------FSFGSGVVTWSSKKQETVALSTSEAEYTAASLAARQALWLRK 1236
Query: 1222 LVKELGVEQ-GGVQLHCDSQSAIYLTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIH 1280
L+++ EQ ++ DS+SAI + N +H RTKHIDV++H IR L+A +I+L+
Sbjct: 1237 LLEDFSYEQKESTEIFSDSKSAIAMAKNPSFHGRTKHIDVQYHFIRTLVADGRIVLKFCS 1296
Query: 1281 TSENTTDKLTKPVTSDK 1297
T+E D TK + K
Sbjct: 1297 TNEQAADIFTKSLPQAK 1313
>UniRef100_Q9C739 Copia-type polyprotein, putative [Arabidopsis thaliana]
Length = 1352
Score = 645 bits (1663), Expect = 0.0
Identities = 425/1315 (32%), Positives = 670/1315 (50%), Gaps = 106/1315 (8%)
Query: 49 DWNEMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKL---FAKQRL 105
D + +KA LI + +D +++ T+ K+ W+KL + Y + K+ +
Sbjct: 61 DSRKRDKKALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGEF 120
Query: 106 YSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGK 165
+L+M+EG + + + ++ R G +DD +L SL ++H+VT + K
Sbjct: 121 EALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEETK 180
Query: 166 D--SITLDSISSTLLPHAQRRQSVEE-----------------------GGGSSGEGLFV 200
D ++T++ + +L + ++++ E+ GG G G
Sbjct: 181 DLEAMTIEQLLGSLQAYEEKKKKKEDIIEQVLNMQITKEENGQSYQRRGGGQVRGRG--- 237
Query: 201 KGGQDRGRG----------KGKAVDSGKKK-RSKSK-DRKTTECYSCKQIGHWKRDCPNR 248
+GG GRG +G+ G+ K KS+ D+ + +CY+C + GH+ +C
Sbjct: 238 RGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNCGKFGHYASECKAP 297
Query: 249 SGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDA-----WVLDSGCSYHMTQHREWFN 303
S K AN V+ EED+L ++S K + W LDSG S HM + F
Sbjct: 298 SNKKFEEK--ANYVEE--KIQEEDMLLMASYKKDEQEENHKWYLDSGASNHMCGRKSMFA 353
Query: 304 SFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHEN 363
G V LGD+ +KG + I L +G + +S V Y+ + N++SLG L E
Sbjct: 354 ELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEK 413
Query: 364 GYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASV--ETDDDATKLWH 421
GY + ++N +R + + + + + ++ L I D+A + + LWH
Sbjct: 414 GYDIRLKDNNLSIRDQESNL-ITKVPMSKNRMFVL---NIRNDIAQCLKMCYKEESWLWH 469
Query: 422 MRLGHLSERGMMELHKRNMLKGVRSCIIG---LCKYCVLGKQCRVRF-KTGHHKTKGILD 477
+R GHL+ G+ L ++ M++G+ CI +C+ C+LGKQ ++ F K + + L+
Sbjct: 470 LRFGHLNFGGLELLSRKEMVRGL-PCINHPNQVCEGCLLGKQFKMSFPKESSSRAQKSLE 528
Query: 478 YVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGR 537
+H+DV GP K S+ YF+ F DDFSRK WVYF+K KSEVF FK +KA VE ++G
Sbjct: 529 LIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGL 588
Query: 538 KIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLN 597
IK +RSD G E+T K F+ +CE+NGI+R +V ++PQQNGVAER NRT+ E AR + +
Sbjct: 589 VIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVAERKNRTILEMARSMLKS 648
Query: 598 ACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSE 657
L K LWA + A YL+NRSP S+ GK +E W+G +S+LR+F A+ H+ E
Sbjct: 649 KRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKSGVSHLRVFGSIAHAHVPDE 708
Query: 658 DRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SML----KQSDVTVV 713
RSKLD KS+ I IGY+ KGYKL++P KK I+SR++VFDE + D
Sbjct: 709 KRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFF 768
Query: 714 PDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSG----EVQDYTLVRDREPSRI 769
P E + + + EE P + Q E S+ +Q+ V + +
Sbjct: 769 PHFEEDEPEPTREEPPSEEPTTPPTSPTSSQIEESSSERTPRFRSIQELYEVTENQ---- 824
Query: 770 TPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQL 829
E+L + L +P + EA+ EK W +AM EE++S++KN+T L L
Sbjct: 825 --------ENLTLFCLFAEC-EPMDFQEAI---EKKTWRNAMDEEIKSIQKNDTWELTSL 872
Query: 830 PHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVV 889
P+G + IG KWVYK K E E++KA LV KGY Q GIDYDE+F+PV R ++R++
Sbjct: 873 PNGHKTIGVKWVYKAKKNSK-GEVERYKARLVAKGYIQRAGIDYDEVFAPVARLETVRLI 931
Query: 890 LALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSP 949
++L A + QMDVK+ FL+G+LEE++YIEQP+G+ G+ V +LK++LYGLKQ+P
Sbjct: 932 ISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAP 991
Query: 950 RQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKT 1009
R W R D Y + +C Y+ +Y+ + + LYVDD++ N+ E K
Sbjct: 992 RAWNTRIDKYFKEKDFIKCPYEHALYI-KIQKEDILIACLYVDDLIFTGNNPSMFEEFKK 1050
Query: 1010 KLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTN 1069
++ KEF+M D+G LG+E+ ++ ++++Q+ Y + VL +F M +N V TP+
Sbjct: 1051 EMTKEFEMTDIGLMSYYLGIEVKQEDNG--IFITQEGYAKEVLKKFKMDDSNPVCTPM-- 1106
Query: 1070 HFKLSLEQSPKIDSEIEGMSKIPYAVQLVV*CMLWFAL---DQIWHKQLVKCQVHVQARE 1126
E K+ + EG P + +V + + D ++ +V +
Sbjct: 1107 ------ECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHPTTT 1160
Query: 1127 AALGSSQVDPKILEGYN-GSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICLYSCGG 1185
+ ++ I N G Y + S G R K + Y
Sbjct: 1161 HFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDR--KSTSGFVFYIGDT 1218
Query: 1186 PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGG-VQLHCDSQSAIY 1244
W S Q V +ST EAEY+A A+WL L+KEL + Q ++ D++SAI
Sbjct: 1219 AFTWMSKKQPIVVLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAIA 1278
Query: 1245 LTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKFM 1299
L N V+H R+KHID R+H IRE ++ + + L+ + T + D TKP+ + F+
Sbjct: 1279 LAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADIFTKPLKREDFI 1333
>UniRef100_Q9SXB2 T28P6.8 protein [Arabidopsis thaliana]
Length = 1352
Score = 642 bits (1656), Expect = 0.0
Identities = 424/1315 (32%), Positives = 669/1315 (50%), Gaps = 106/1315 (8%)
Query: 49 DWNEMKEKAAGLITLCVSDDVMNHILDLTTLKDVWDKLESLYMSKTPMNKL---FAKQRL 105
D + +KA LI + +D +++ T+ K+ W+KL + Y + K+ +
Sbjct: 61 DSRKRDKKALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGEF 120
Query: 106 YSLKMQEGGDLQAHVYAFNNILADMTRLGVTVDDEDKAIILLCSLPGSYDHLVTTLTYGK 165
+L+M+EG + + + ++ R G +DD +L SL ++H+VT + K
Sbjct: 121 EALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEETK 180
Query: 166 D--SITLDSISSTLLPHAQRRQSVEE-----------------------GGGSSGEGLFV 200
D ++T++ + +L + ++++ E+ GG G G
Sbjct: 181 DLEAMTIEQLLGSLQAYEEKKKKKEDIVEQVLNMQITKEENGQSYQRRGGGQVRGRG--- 237
Query: 201 KGGQDRGRG----------KGKAVDSGKKK-RSKSK-DRKTTECYSCKQIGHWKRDCPNR 248
+GG GRG +G+ G+ K KS+ D+ + +CY+C + GH+ +C
Sbjct: 238 RGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNCGKFGHYASECKAP 297
Query: 249 SGKSGNNSSSANVVQSDGSCSEEDLLCVSSVKCTDA-----WVLDSGCSYHMTQHREWFN 303
S K AN V+ EED+L ++S K + W LDSG S HM + F
Sbjct: 298 SNKKFEEK--ANYVEE--KIQEEDMLLMASYKKDEQKENHKWYLDSGASNHMCGRKSMFA 353
Query: 304 SFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNLISLGTLHEN 363
G V LGD+ +KG + I L +G + +S V Y+ + N++SLG L E
Sbjct: 354 ELDESVRGNVALGDESKMEVKGKGNILIRLKNGDHQFISNVYYIPSMKTNILSLGQLLEK 413
Query: 364 GYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASV--ETDDDATKLWH 421
GY + ++N +R + + + + + ++ L I D+A + + LWH
Sbjct: 414 GYDIRLKDNNLSIRDQESNL-ITKVPMSKNRMFVL---NIRNDIAQCLKMCYKEESWLWH 469
Query: 422 MRLGHLSERGMMELHKRNMLKGVRSCIIG---LCKYCVLGKQCRVRF-KTGHHKTKGILD 477
+R GHL+ G+ L ++ M++G+ CI +C+ C+LGKQ ++ F K + + L+
Sbjct: 470 LRFGHLNFGGLELLSRKEMVRGL-PCINHPNQVCEGCLLGKQFKMSFPKESSSRAQKPLE 528
Query: 478 YVHSDVRGPTKEPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVENQTGR 537
+H+DV GP K S+ YF+ F DDFSRK WVYF+K KSEVF FK +KA VE ++G
Sbjct: 529 LIHTDVCGPIKPKSLGKSNYFLLFIDDFSRKTWVYFLKEKSEVFEIFKKFKAHVEKESGL 588
Query: 538 KIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARCLRLN 597
IK +RSD G E+T K F+ +CE+NGI+R +V ++PQQNGV ER NRT+ E AR + +
Sbjct: 589 VIKTMRSDRGGEFTSKEFLKYCEDNGIRRQLTVPRSPQQNGVVERKNRTILEMARSMLKS 648
Query: 598 ACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVHISSE 657
L K LWA + A YL+NRSP S+ GK +E W+G +S+LR+F A+ H+ E
Sbjct: 649 KRLPKELWAEAVACAVYLLNRSPTKSVSGKTPQEAWSGRKPGVSHLRVFGSIAHAHVPDE 708
Query: 658 DRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVFDE*SML----KQSDVTVV 713
RSKLD KS+ I IGY+ KGYKL++P KK I+SR++VFDE + D
Sbjct: 709 KRSKLDDKSEKYIFIGYDNNSKGYKLYNPDTKKTIISRNIVFDEEGEWDWNSNEEDYNFF 768
Query: 714 PDTEVENSSQDKIQVDIEETPVSPRQIVAQQQSEPGSDSG----EVQDYTLVRDREPSRI 769
P E + + + EE P + Q E S+ +Q+ V + +
Sbjct: 769 PHFEEDEPEPTREEPPSEEPTTPPTSPTSSQIEESSSERTPRFRSIQELYEVTENQ---- 824
Query: 770 TPPVRYGFEDLAAYALLTSSGDPSTYHEAMAS*EKDKWMSAMVEEMESLKKNET*NLVQL 829
E+L + L +P + +A+ EK W +AM EE++S++KN+T L L
Sbjct: 825 --------ENLTLFCLFAEC-EPMDFQKAI---EKKTWRNAMDEEIKSIQKNDTWELTSL 872
Query: 830 PHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHKGIDYDEIFSPVVRHTSIRVV 889
P+G + IG KWVYK K E E++KA LV KGYSQ GIDYDE+F+PV R ++R++
Sbjct: 873 PNGHKAIGVKWVYKAKKNSK-GEVERYKARLVAKGYSQRVGIDYDEVFAPVARLETVRLI 931
Query: 890 LALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSETGDGRLVCKLKRSLYGLKQSP 949
++L A + QMDVK+ FL+G+LEE++YIEQP+G+ G+ V +LK+ LYGLKQ+P
Sbjct: 932 ISLAAQNKWKIHQMDVKSAFLNGDLEEEVYIEQPQGYIVKGEEDKVLRLKKVLYGLKQAP 991
Query: 950 RQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLLYVDDMLIAANHLHDVNELKT 1009
R W R D Y + +C Y+ +Y+ + + LYVDD++ N+ E K
Sbjct: 992 RAWNTRIDKYFKEKDFIKCPYEHALYI-KIQKEDILIACLYVDDLIFTGNNPSIFEEFKK 1050
Query: 1010 KLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVEGVLSRFDMSKANHVSTPLTN 1069
++ KEF+M D+G LG+E+ ++ ++++Q+ Y + VL +F M +N V TP+
Sbjct: 1051 EMTKEFEMTDIGLMSYYLGIEVKQEDNG--IFITQEGYAKEVLKKFKMDDSNPVCTPM-- 1106
Query: 1070 HFKLSLEQSPKIDSEIEGMSKIPYAVQLVV*CMLWFAL---DQIWHKQLVKCQVHVQARE 1126
E K+ + EG P + +V + + D ++ +V +
Sbjct: 1107 ------ECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILYAVGVVSRYMEHPTTT 1160
Query: 1127 AALGSSQVDPKILEGYN-GSRYHV*QGARCCSISCGICGLCR*SR**KVYNRICLYSCGG 1185
+ ++ I N G Y + S G R K + Y
Sbjct: 1161 HFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDR--KSTSGFVFYIGDT 1218
Query: 1186 PICWKSSVQSTVAMSTTEAEYMAVAEAAKEALWLTGLVKELGVEQGG-VQLHCDSQSAIY 1244
W S Q V +ST EAEY+A A+WL L+KEL + Q ++ D++SAI
Sbjct: 1219 AFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEEPTKIFVDNKSAIA 1278
Query: 1245 LTNNQVYHARTKHIDVRFHKIRELLASRQILLQKIHTSENTTDKLTKPVTSDKFM 1299
L N V+H R+KHID R+H IRE ++ + + L+ + T + D TKP+ + F+
Sbjct: 1279 LAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADFFTKPLKRENFI 1333
>UniRef100_Q9AUZ1 Polyprotein, putative [Arabidopsis thaliana]
Length = 855
Score = 604 bits (1557), Expect = e-171
Identities = 335/825 (40%), Positives = 484/825 (58%), Gaps = 35/825 (4%)
Query: 295 MTQHREWFNSFKSGDLGYVYLGDDKPCIIKGMRQVKIALDDGGVRTLSQVRYVLEVMKNL 354
MT R+WF SF+ D + LGDD +G +++ G + L V+YV + +NL
Sbjct: 1 MTSRRDWFCSFQEKDTTKILLGDDHSVESQGQGSIRLDTHGGTITILENVKYVPNLRRNL 60
Query: 355 ISLGTLHENGYSFKSEENRDILRVSKGAMTVMRAKRTAGNIYKLLGGTIMGDVASVETDD 414
IS GTL GY + + + +R K T +R +G +Y L G T++ + E
Sbjct: 61 ISTGTLDRLGYKHEGGDGQ--VRYYKNNKTALRGS-LSGGLYVLDGNTVIAESCIAERSK 117
Query: 415 DATKLWHMRLGHLSERGMMELHKRNMLKGVRSCIIGLCKYCVLGKQCRVRFKTGHHKTKG 474
+ T LWH RLGH+ M L + ++K + + ++CV+GK +V F G H ++
Sbjct: 118 ELTTLWHSRLGHMGGNNMKILAGKGLIKPSEATSLEFYEHCVMGKAKKVSFNIGKHNSEE 177
Query: 475 ILDYVHSDVRGPTK-EPSV*GFRYFVTFTDDFSRKVWVYFMKYKSEVFAKFKLWKAEVEN 533
IL YVH+D+ G PS+ G +YF++ DD SRKVW+YF++ K E F KF WK VEN
Sbjct: 178 ILSYVHADLWGSQNVTPSMSGNKYFLSIIDDKSRKVWLYFLRSKDETFDKFCEWKELVEN 237
Query: 534 QTGRKIKYLRSDNGTEYTDKNFMHFCEENGIQRHFSVRKTPQQNGVAERMNRTLTEKARC 593
QT +++K LR+DNG E+ + F +C++ GI+RH + TPQQNGVAERMNRT+ EK RC
Sbjct: 238 QTDKRVKCLRTDNGLEFCNIKFDSYCKKYGIERHKTCTYTPQQNGVAERMNRTVMEKVRC 297
Query: 594 LRLNACLSKCLWAATINMACYLVNRSPRASLDGKVAEEVWTGNPIDLSNLRIFWRPAYVH 653
L + L + WA A Y++NRSP A++D V EE+W +LR AYVH
Sbjct: 298 LLNESGLEEEFWAEVATTAVYIINRSPSAAIDHNVPEELWLNRKPGYKHLRRLRAVAYVH 357
Query: 654 ISSEDRSKLDPKSK*CIIIGYNKGVKGYKLWDPVKKKVIVSRDVVF----------DE*S 703
+ D+ KL P++ I IGY G KGYK+W ++K ++SR+V+F D+ +
Sbjct: 358 V---DQGKLKPRAIKGIFIGYPSGTKGYKVWLLEEQKCVISRNVIFQEEVVYKDLNDKET 414
Query: 704 MLKQSDVTVVPDTEVENSSQDKIQ-----VDIEETPVSPRQIVAQQQSEPGSD-----SG 753
++K+ D+ D + S ++ IEE S + ++ +D G
Sbjct: 415 VVKKEDIRTQTDNHLVISKTKEVSDQGGVTHIEECEESDENDEQEPETVNETDPTVESEG 474
Query: 754 EVQDYTLVRDREPSRITPPVRYGFEDLAAYALLT----SSGDPSTYHEAMAS*EKDKWMS 809
+ +Y L +DR +I PP R+ E A+AL+ S +P +Y EA E KW +
Sbjct: 475 SLANYQLAKDRVRRQINPPARFTEESGVAFALVVVESLSLEEPESYQEATQDKEWLKWKN 534
Query: 810 AMVEEMESLKKNET*NLVQLPHGKRVIGCKWVYKKKLAVT*KEREKFKAHLVTKGYSQHK 869
A EEM+SL KN T +LV P +++IGC+W++K K + E +FKA LV KGY+Q +
Sbjct: 535 ATHEEMDSLIKNGTWDLVDKPTNRKIIGCRWLFKLKSGIPGVEPVRFKARLVAKGYTQRE 594
Query: 870 GIDYDEIFSPVVRHTSIRVVLALVASMDMHLEQMDVKTTFLHGNLEEQIYIEQPEGFSET 929
G+DY EIF+PVV+HTSIRV++++V DM LEQMDVKT FLHG+LEE++Y+EQPEGF
Sbjct: 595 GVDYQEIFAPVVKHTSIRVLMSVVVDQDMELEQMDVKTAFLHGDLEEELYMEQPEGFITD 654
Query: 930 GDGRLVCKLKRSLYGLKQSPRQWYKRFDSYMLRIGYRRCDYDCCVYVMSLDDGSFIFLLL 989
VC LK+SLYGLKQSPRQW KRF +M+ + R +D CVYV ++ G F++LLL
Sbjct: 655 KTKDKVCLLKKSLYGLKQSPRQWNKRFGRFMMEQKFIRSAHDACVYVKQVEQG-FVYLLL 713
Query: 990 YVDDMLIAANHLHDVNELKTKLGKEFDMKDLGAAKKILGMEIHKDRGAKKLWLSQKSYVE 1049
YVDDMLIA +VN +K +L EF+MKD+G A +ILG +I +DR L LSQ Y+
Sbjct: 714 YVDDMLIAGKSKAEVNMIKEQLSVEFEMKDMGPASRILGKDITRDRKKGILRLSQAPYIR 773
Query: 1050 GVLSRFDMSKANHVSTPLTNHFKLSLEQSPKIDSEIEGMSKIPYA 1094
++ RF++ +A V TP+ HFKL + + D E ++PY+
Sbjct: 774 KIVQRFNLDEARVVRTPIGAHFKL---PAVREDDECIDTEQVPYS 815
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.323 0.138 0.419
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,139,062,076
Number of Sequences: 2790947
Number of extensions: 89466004
Number of successful extensions: 244373
Number of sequences better than 10.0: 3994
Number of HSP's better than 10.0 without gapping: 3236
Number of HSP's successfully gapped in prelim test: 759
Number of HSP's that attempted gapping in prelim test: 232900
Number of HSP's gapped (non-prelim): 7606
length of query: 1307
length of database: 848,049,833
effective HSP length: 139
effective length of query: 1168
effective length of database: 460,108,200
effective search space: 537406377600
effective search space used: 537406377600
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 81 (35.8 bits)
Lotus: description of TM0003.6