
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144645.3 + phase: 0 /pseudo
(993 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
UniRef100_Q6ATH4 Putative polyprotein [Oryza sativa] 671 0.0
UniRef100_Q7XCM1 Putative gag-pol polyprotein [Oryza sativa] 661 0.0
UniRef100_Q8S1E5 Putative gag/pol polyprotein [Oryza sativa] 660 0.0
UniRef100_Q8S805 Putative copia-type polyprotein [Oryza sativa] 655 0.0
UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana] 503 e-141
UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana] 498 e-139
UniRef100_Q8LM18 Putative copia-like retrotransposon polyprotein... 495 e-138
UniRef100_Q9T0C5 Retrotransposon like protein [Arabidopsis thali... 488 e-136
UniRef100_Q94KV0 Polyprotein [Arabidopsis thaliana] 486 e-135
UniRef100_Q5XWK9 Gag-pol polyprotein-like [Solanum tuberosum] 461 e-128
UniRef100_Q9FRJ2 Putative copia-like retrotransposon polyprotein... 458 e-127
UniRef100_O82331 Putative retroelement pol polyprotein [Arabidop... 450 e-125
UniRef100_O81824 Hypothetical protein AT4g27210 [Arabidopsis tha... 446 e-123
UniRef100_Q9FWZ5 Putative retroelement polyprotein [Arabidopsis ... 437 e-120
UniRef100_Q6L3M9 Putative late blight resistance protein [Solanu... 419 e-115
UniRef100_Q710T7 Gag-pol polyprotein [Populus deltoides] 414 e-114
UniRef100_O23741 SLG-Sc and SLA-Sc genes and Melmoth retrotransp... 407 e-111
UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana] 398 e-109
UniRef100_Q9SIM3 Putative retroelement pol polyprotein [Arabidop... 397 e-108
UniRef100_Q7Y141 Putative polyprotein [Oryza sativa] 389 e-106
>UniRef100_Q6ATH4 Putative polyprotein [Oryza sativa]
Length = 1480
Score = 671 bits (1731), Expect = 0.0
Identities = 371/820 (45%), Positives = 491/820 (59%), Gaps = 84/820 (10%)
Query: 20 IVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFDPF 79
I VG+G IP+ G +++ +L N+L AP L++NL+FVR+FT DN + EFD F
Sbjct: 386 ITVGNGHTIPVICRGTSFLPIGTTRFALKNILVAPSLVRNLLFVRQFTRDNKCSFEFDEF 445
Query: 80 SFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPGANVLS 139
FSV+D+ T ++RC+S GDLY L TT + + +F A S TLWH+RLGHP +
Sbjct: 446 GFSVKDLPTRRVILRCNSRGDLYTLPTTVP--AITAHSFLAKSSTLWHHRLGHPSPAAVQ 503
Query: 140 FLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLWTSPILSSAG 199
L+K + C + S+ +C +C GKH +L FS S+S+TS PF+++H D+WTSP+LS +G
Sbjct: 504 TLHKLAILSCTR-SNNKLCHACHLGKHTRLSFSKSSSSTSSPFELVHCDVWTSPVLSLSG 562
Query: 200 HKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEFNNEY 259
KYYL LDD+T+F WTFP+ KS V F A++KTQF I+CFQ DNGT+F N
Sbjct: 563 FKYYLVVLDDFTHFCWTFPLRHKSDVHQHLLEFVAYVKTQFSLPIRCFQADNGTKFVNHA 622
Query: 260 FTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQITTY 319
T F G+V R SCP+TSPQNGKAER +R IN IRT L +SMPPS+W AL TY
Sbjct: 623 TTSFFASRGIVLRLSCPYTSPQNGKAERVLRTINKSIRTLLIQASMPPSYWAEALATATY 682
Query: 320 LQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCAFLGY 379
L N PS + + P Q L+++ P Y++L+VFGCLCYP + +KL RS P FLGY
Sbjct: 683 LLNRRPSTSVRNSIPYQLLHNKLPDYSNLQVFGCLCYPNLSAMTSHKLSPRSAPYVFLGY 742
Query: 380 PQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLNDSLHPLLHYHLQN 439
+H+G++C D+S++++ ISRHV+FDE FPFA ++ +++FL L + +
Sbjct: 743 SASHKGFRCLDISTRRLYISRHVVFDEKTFPFAAIPQDAS-SFDFL------LQGFSIAV 795
Query: 440 DPKQDEPEPR--------KIESPQP--------------------ATTP--ASPI----- 464
P + PR ++E P P A P SP+
Sbjct: 796 APSSEVERPRFSSMTPSPEVEQPIPDDDTSGTELFQLLPGLRSSAAGRPLAGSPVDARLP 855
Query: 465 --------NVTNQSIL-----PPSPMSINQLPHPLVSTELTSP-THT--------PQQIH 502
N ++ S L PP+ + P +T L SP HT P IH
Sbjct: 856 GGCANDAANGSSSSNLSPVMDPPAASVVRPAPSEGPTTSLISPYRHTYLRRSQPAPTAIH 915
Query: 503 ----------------QEPPRTIATHSMHGIHKPKIQFNLT-TSITSSPLPHNPKAALSD 545
Q+ T+ T S G +P +F T T SP+P N ++AL+D
Sbjct: 916 RPIRASRAFHSATDQQQQTGHTMVTRSQTGHLRPIQRFTYTATHDVVSPVPSNYRSALAD 975
Query: 546 SNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGR 605
NW+AA +E+ AL+ N TW LVPRP N + WIF+HK SDG+ R+KAR V G
Sbjct: 976 PNWRAATANEYKALVDNNTWRLVPRPPGANVVTGKWIFKHKFHSDGTLARHKARWVVRGY 1035
Query: 606 SQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMG 665
SQ G+D DETFSPVVKP TI +VL+IA S+SWPIHQLDVKN FLHG L+ETVY QP G
Sbjct: 1036 SQQHGIDYDETFSPVVKPATIHVVLSIAASRSWPIHQLDVKNAFLHGNLEETVYYQQPSG 1095
Query: 666 FRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTY 725
F DP P+ VCLL+KSLYGLKQAPRAWYQRFA + +GF+ S S+ SLF+Y+ G+++ Y
Sbjct: 1096 FVDPSAPNAVCLLQKSLYGLKQAPRAWYQRFATYIRQLGFTSSASNTSLFVYKDGDNIAY 1155
Query: 726 ILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
+LLYVDDIILTASS L I + L SEFAM DLG L +F
Sbjct: 1156 LLLYVDDIILTASSATLLHHITARLHSEFAMTDLGDLHFF 1195
>UniRef100_Q7XCM1 Putative gag-pol polyprotein [Oryza sativa]
Length = 1417
Score = 661 bits (1706), Expect = 0.0
Identities = 352/773 (45%), Positives = 472/773 (60%), Gaps = 34/773 (4%)
Query: 1 MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
M+++ G L+ L + I VG+G ++P+ T+I L L+NVL +P LIKNL
Sbjct: 378 MSSTPGILAHPRPLPFSSCITVGNGAKLPVTHTASTHIPTSSTDLHLHNVLVSPPLIKNL 437
Query: 61 IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
I V++ T DNNV+IEFDP FS++D+QT + +RCDS GDLYPL + H A A
Sbjct: 438 ISVKKLTRDNNVSIEFDPTGFSIKDLQTQVVKLRCDSPGDLYPLRLPSPH-----ALSAT 492
Query: 121 LSPTL--WHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTT 178
SP++ WH RLGHPG+ LS + + +C + S+P C +C G +V+LPF S+S T
Sbjct: 493 SSPSVEHWHLRLGHPGSASLSKVLGSFDFQCNK-SAPHHCSACHVGTNVRLPFHSSSSQT 551
Query: 179 SKPFDIIHSDLWTSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKT 238
PF ++H+D+WTSPI S++G+KYY+ FLDD+T+++WTFP+ KS+V SF A+ T
Sbjct: 552 LFPFQLVHTDVWTSPIYSNSGYKYYVVFLDDFTHYIWTFPVRNKSEVFHTVRSFFAYAHT 611
Query: 239 QFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRT 298
QFG + Q DNG E+++ +G V R SCP++S QNGKAER +R IN+++RT
Sbjct: 612 QFGLPVLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDYVRT 671
Query: 299 SLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPL 358
L HS+ P SFW ALQ T+L N P + +P Q L P+Y HLRVFGCLCYP
Sbjct: 672 MLVHSAAPLSFWAEALQTATHLINRRPCRATGSLTPYQLLLGAPPTYDHLRVFGCLCYPN 731
Query: 359 FPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSS 418
+TA +KL RS C F+GYP +HRGY+CYD+ S+++ SRHV F E FPF
Sbjct: 732 TIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPF------- 784
Query: 419 THTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVT--NQSILPPSP 476
D+ P P D + + P PA +P+ + + PPSP
Sbjct: 785 -------RDAPSP----RPSAPPPPDHGDDTIVLLPAPAQHVVTPVGTAPAHDAASPPSP 833
Query: 477 MSINQLPHPLVSTELTSPTHTPQQ---IHQEPPR-TIATHSMHGIHKPKIQFNLTTSITS 532
S P +P +P+ PPR + T + GI KP ++ +T + T
Sbjct: 834 AS--STPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKPNPRYAMTATSTL 891
Query: 533 SPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGS 592
SP P + +AAL D NW+AAM EFDAL+ N+TW LVPRP I W+F+ K +DGS
Sbjct: 892 SPTPSSVRAALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADGS 951
Query: 593 FERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHG 652
++YKAR V G +Q GVD ETFSPVVKP TIR VLT+ SK WP HQLDV N FLHG
Sbjct: 952 LDKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLHG 1011
Query: 653 ELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDH 712
LQE V QP GF D P VCLL +SLYGL+QAPRAW++RFAD A ++GF S++D
Sbjct: 1012 HLQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRADP 1071
Query: 713 SLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
SLF+ R+G+D Y+LLYVDD+IL+ASS L + I+ L +EF +KD+G L YF
Sbjct: 1072 SLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYF 1124
>UniRef100_Q8S1E5 Putative gag/pol polyprotein [Oryza sativa]
Length = 1090
Score = 660 bits (1704), Expect = 0.0
Identities = 353/746 (47%), Positives = 456/746 (60%), Gaps = 47/746 (6%)
Query: 57 IKNLIFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPA 116
++NL+ VR+FT DN +IEFD F FSV+D+QT ++RC+S G+LY L T S++
Sbjct: 58 VRNLLSVRQFTRDNKCSIEFDEFGFSVKDLQTRRVILRCNSRGELYTLPAATP--SSAAH 115
Query: 117 TFAALSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNS 176
A S TLWH RLGHPG + L I C +I + S+C +C GKH +LPF S+S
Sbjct: 116 GLLATSSTLWHCRLGHPGPAAIHGLRNIASISCNKIDT-SLCHACQLGKHTRLPFHNSSS 174
Query: 177 TTSKPFDIIHSDLWTSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFI 236
TS PF+++H D+WTSP++S++G KYYL LDD+++F WTF + KS V F ++
Sbjct: 175 RTSVPFELVHCDVWTSPVMSTSGFKYYLVVLDDFSHFCWTFLLRLKSDVHRHIVEFVEYV 234
Query: 237 KTQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFI 296
TQFG +K FQ DNG EF N T F G R SCP+TSPQNGKAER +R INN I
Sbjct: 235 STQFGLPLKSFQADNGREFVNTAITTFLASRGTQLRLSCPYTSPQNGKAERMLRTINNSI 294
Query: 297 RTSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCY 356
RT L +SMPPS+W AL TYL N PS + P Q L+ P ++HLRVFGCLCY
Sbjct: 295 RTLLIQASMPPSYWAEALATATYLLNRRPSSSIHQSLPFQLLHRTIPDFSHLRVFGCLCY 354
Query: 357 PLFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHT 416
P +T +KL RST C FLGYP +H+GY+C DLS+ +IIISRHV+FDE+QFPFA T
Sbjct: 355 PNLSATTPHKLSPRSTACVFLGYPTSHKGYRCLDLSTHRIIISRHVVFDESQFPFAATPP 414
Query: 417 SSTHTYEFLNDSLHPLLHYHLQNDPKQDEPEPR--------KIESP-------------- 454
+++ +++FL L P + P + +PR ++E P
Sbjct: 415 AAS-SFDFLLQGLSP------ADAPSLEVEQPRPLTVAPSTEVEQPYLPLPSRRLSAGTV 467
Query: 455 ---QPATTPASPINVTNQSILPP-----------SPMSINQLPHPLVSTELTSPTHTPQQ 500
A + +P+ T+ + P SP P+ + +S T
Sbjct: 468 TVASEAPSAGAPLVGTSSADATPPGSATRASTIVSPFRHVYTRRPVTTVPPSSSTAVTNA 527
Query: 501 IHQEPPRTIATHSMHGIHKPKIQFNLT-TSITSSPLPHNPKAALSDSNWKAAMLDEFDAL 559
+ P ++ T S G +P + T T +SP+P N +AL+D NW+AAM DE+ L
Sbjct: 528 VAAPQPHSMVTRSQSGSLRPVDRLTYTATQAAASPVPANYHSALADPNWRAAMADEYKEL 587
Query: 560 IKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSP 619
+ N TW LV RP N WIF+HK SDGS RYKAR V G SQ G+D DETFSP
Sbjct: 588 VDNGTWRLVSRPPRANIATGKWIFKHKFHSDGSLARYKARWVVRGYSQQHGIDYDETFSP 647
Query: 620 VVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLK 679
VVK TIR+VL+IA S++WPIHQLDVKN FLHG L+ETVY QP GF DP PD VCLL+
Sbjct: 648 VVKLATIRVVLSIAASRAWPIHQLDVKNAFLHGHLKETVYCQQPSGFVDPTAPDAVCLLQ 707
Query: 680 KSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASS 739
KSLYGLKQAPRAWYQRFA + +GF S SD SLF+Y+ G+ + Y+LLYVDDIILTAS+
Sbjct: 708 KSLYGLKQAPRAWYQRFATYIRQMGFMPSASDTSLFVYKDGDRIAYLLLYVDDIILTAST 767
Query: 740 DVLRRSIMSLLASEFAMKDLGTLSYF 765
L + + + L SEFAM DLG L +F
Sbjct: 768 TTLLQQLTARLHSEFAMTDLGDLHFF 793
>UniRef100_Q8S805 Putative copia-type polyprotein [Oryza sativa]
Length = 1803
Score = 655 bits (1690), Expect = 0.0
Identities = 350/773 (45%), Positives = 468/773 (60%), Gaps = 34/773 (4%)
Query: 1 MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
M+++ G L+ L + I VG+G ++P+ T+I L L+NVL +P LIKNL
Sbjct: 378 MSSTPGILAHPRPLPFSSCITVGNGAKLPVTHTASTHIPTSSTDLHLHNVLVSPPLIKNL 437
Query: 61 IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
I V++ T DNNV+IEFDP FS++D+QT + +RCDS GDLYPL + H A A
Sbjct: 438 ISVKKLTRDNNVSIEFDPTGFSIKDLQTQVVKLRCDSPGDLYPLRLPSPH-----ALSAT 492
Query: 121 LSPTL--WHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTT 178
SP++ WH RLGHPG+ LS + + +C + S+P C +C G +V+LPF S+S T
Sbjct: 493 SSPSVEHWHLRLGHPGSASLSKVLGSFDFQCNK-SAPHHCSACHVGTNVRLPFHSSSSQT 551
Query: 179 SKPFDIIHSDLWTSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKT 238
PF ++H+D+WTSPI S++G+KYY+ FLDD+T+++WTFP+ KS+V SF A+ T
Sbjct: 552 LFPFQLVHTDVWTSPIYSNSGYKYYVVFLDDFTHYIWTFPVRNKSEVFHTVRSFFAYAHT 611
Query: 239 QFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRT 298
QFG + Q DNG E+++ +G V R SCP++S QNGKAER +R IN+ +RT
Sbjct: 612 QFGLPVLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDCVRT 671
Query: 299 SLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPL 358
L HS+ P SFW ALQ +L N P + P Q L P+Y HLRVFGCLCYP
Sbjct: 672 MLVHSAAPLSFWAEALQTAMHLINRRPCRATGSLKPYQLLLGAPPTYDHLRVFGCLCYPN 731
Query: 359 FPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSS 418
+TA +KL RS C F+GYP +HRGY+CYD+ S+++ SRHV F E FPF
Sbjct: 732 TIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPF------- 784
Query: 419 THTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVT--NQSILPPSP 476
D+ P P D + + P PA +P+ + + PPSP
Sbjct: 785 -------RDAPSP----RPSAPPPPDHGDDTIVLLPAPAQHVVTPVGTAPAHDAASPPSP 833
Query: 477 MSINQLPHPLVSTELTSPTHTPQQ---IHQEPPR-TIATHSMHGIHKPKIQFNLTTSITS 532
S P +P +P+ PPR + T + GI KP ++ +T + T
Sbjct: 834 AS--STPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKPNPRYAMTATSTL 891
Query: 533 SPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGS 592
SP P + + AL D NW+AAM EFDAL+ N+TW LVPRP I W+F+ K +DGS
Sbjct: 892 SPTPSSVRVALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADGS 951
Query: 593 FERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHG 652
++YKAR V G +Q GVD ETFSPVVKP TIR VLT+ SK WP HQLDV N FLHG
Sbjct: 952 LDKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLHG 1011
Query: 653 ELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDH 712
LQE V QP GF D P VCLL +SLYGL+QAPRAW++RFAD A ++GF S++D
Sbjct: 1012 HLQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRADP 1071
Query: 713 SLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
SLF+ R+G+D Y+LLYVDD+IL+ASS L + I+ L +EF +KD+G L YF
Sbjct: 1072 SLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYF 1124
>UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 503 bits (1296), Expect = e-141
Identities = 293/777 (37%), Positives = 406/777 (51%), Gaps = 15/777 (1%)
Query: 1 MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
+TAS L + N ++VG G +PI G T IS + LN VL P + K+L
Sbjct: 334 ITASTSGLQNATTYEGNDAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVLVCPAIQKSL 393
Query: 61 IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
+ V + D + FD + D+ T + + LY L + S AA
Sbjct: 394 LSVSKLCDDYPCGVYFDANKVCIIDLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQCAA 453
Query: 121 LSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSK 180
T WH+RLGH + +L L K I+ + + +C+ C GK +L F S+ K
Sbjct: 454 SMET-WHHRLGHSNSKILQQLLTRKEIQVNKSRTSPVCEPCQMGKSTRLQFFSSDFRALK 512
Query: 181 PFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQ 239
P D +H DLW SP++S+ G KYY F+DD++ F W FP+ KS+ S+F ++ ++ Q
Sbjct: 513 PLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLRMKSKFISVFIAYQKLVENQ 572
Query: 240 FGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTS 299
GT IK FQ D G EF + + ++G+ R SCP+T QNG AERK R + +
Sbjct: 573 LGTKIKEFQSDGGGEFTSNKLKEHFREHGIHHRISCPYTPQQNGVAERKHRHLVELGLSM 632
Query: 300 LAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLF 359
L HS P FW A YL N+LPS +L SP + L+ + YT LRVFG CYP
Sbjct: 633 LYHSHTPLKFWVEAFFTANYLSNLLPSSVLKEISPYETLFQQKVDYTPLRVFGTACYPCL 692
Query: 360 PSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSST 419
A NK RS C FLGY ++GY+C + K+ ISRHVIFDE QFPF + + S
Sbjct: 693 RPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPPTGKVYISRHVIFDEAQFPFKEKYHSLV 752
Query: 420 HTYE------FLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATT--PASPINVTNQSI 471
Y+ + + L P Q P + P QP +NV ++
Sbjct: 753 PKYQTTLLQAWQHTDLTPPSVPSSQLQPLARQMTPMATSENQPMMNYETEEAVNVNMETS 812
Query: 472 LPPSPMSINQLPH---PLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNLTT 528
S ++ H P+++ + + + Q E + T S GI KP ++ L
Sbjct: 813 SDEETESNDEFDHEVAPVLNDQ--NEDNALGQGSLENLHPMITRSKDGIQKPNPRYALIV 870
Query: 529 SITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKK 588
S +S P A+ +W AA++DE D + TW LVP +++N + S W+F+ K K
Sbjct: 871 SKSSFDEPKTITTAMKHPSWNAAVMDEIDRIHMLNTWSLVPATEDMNILTSKWVFKTKLK 930
Query: 589 SDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNV 648
DG+ ++ KARLV G Q GVD ETFSPVV+ TIR+VL A + WP+ QLDV N
Sbjct: 931 PDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDTATANEWPLKQLDVSNA 990
Query: 649 FLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHS 708
FLHGELQE V+M QP GF DP P++VC L K+LYGLKQAPRAW+ F++F GF S
Sbjct: 991 FLHGELQEPVFMFQPSGFVDPNKPNHVCRLTKALYGLKQAPRAWFDTFSNFLLDFGFECS 1050
Query: 709 KSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
SD SLF+ + +LLYVDDI+LT S +L ++ L + F+MKDLG YF
Sbjct: 1051 TSDPSLFVCHQNGQSLILLLYVDDILLTGSDQLLMDKLLQALNNRFSMKDLGPPRYF 1107
>UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana]
Length = 1415
Score = 498 bits (1283), Expect = e-139
Identities = 289/784 (36%), Positives = 410/784 (51%), Gaps = 45/784 (5%)
Query: 1 MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
+T+S L + + + ++VG G +PI G T I + LN VL P + K+L
Sbjct: 332 VTSSTNGLQSATEYEGDDAVLVGDGTYLPITHTGSTTIKSSNGKIPLNEVLVVPNIQKSL 391
Query: 61 IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
+ V + D + FD + D+QT + LY L S AA
Sbjct: 392 LSVSKLCDDYPCGVYFDANKVCIIDLQTQKVVTTGPRRNGLYVLENQEFVALYSNRQCAA 451
Query: 121 LSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSK 180
+ +WH+RLGH + L L +K I+ + + +C+ C GK +LPF IS+S
Sbjct: 452 -TEEVWHHRLGHANSKALQHLQNSKAIQINKSRTSPVCEPCQMGKSSRLPFLISDSRVLH 510
Query: 181 PFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQ 239
P D IH DLW SP++S+ G KYY F+DDY+ + W +P+ KS+ S+F SF ++ Q
Sbjct: 511 PLDRIHCDLWGPSPVVSNQGLKYYAIFVDDYSRYSWFYPLHNKSEFLSVFISFQKLVENQ 570
Query: 240 FGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTS 299
T IK FQ D G EF + ++G+ R SCP+T QNG AERK R + +
Sbjct: 571 LNTKIKVFQSDGGGEFVSNKLKTHLSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSM 630
Query: 300 LAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLF 359
L HS P FW + Y+ N LPS +L + SP + L+ P Y+ LRVFG CYP
Sbjct: 631 LFHSHTPQKFWVESFFTANYIINRLPSSVLKNLSPYEALFGEKPDYSSLRVFGSACYPCL 690
Query: 360 PSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSST 419
A NK RS C FLGY ++GY+C+ + K+ ISR+VIF+E++ PF + + S
Sbjct: 691 RPLAQNKFDPRSLQCVFLGYNSQYKGYRCFYPPTGKVYISRNVIFNESELPFKEKYQSLV 750
Query: 420 HTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVTNQSILPPSPMSI 479
Y PLL N + + PA+P+ + ++ P+ +
Sbjct: 751 PQYST------PLLQAWQHNKISE-------------ISVPAAPVQLFSK------PIDL 785
Query: 480 NQLPHPLVSTELTSPTHTP-------------QQIHQEPPRTIATHSMH-----GIHKPK 521
N V+ +LT P T ++I + I +H+M GI KP
Sbjct: 786 NTYAGSQVTEQLTDPEPTSNNEGSDEEVNPVAEEIAANQEQVINSHAMTTRSKAGIQKPN 845
Query: 522 IQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMW 581
++ L TS ++ P +A+ W A+ +E + + TW LVP ++N + S W
Sbjct: 846 TRYALITSRMNTAEPKTLASAMKHPGWNEAVHEEINRVHMLHTWSLVPPTDDMNILSSKW 905
Query: 582 IFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIH 641
+F+ K DGS ++ KARLV G Q GVD ETFSPVV+ TIR+VL ++ SK WPI
Sbjct: 906 VFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDVSTSKGWPIK 965
Query: 642 QLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAF 701
QLDV N FLHGELQE V+M+QP GF DP P +VC L K++YGLKQAPRAW+ F++F
Sbjct: 966 QLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTHVCRLTKAIYGLKQAPRAWFDTFSNFLL 1025
Query: 702 TIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGT 761
GF SKSD SLF+ + + Y+LLYVDDI+LT S L ++ L + F+MKDLG
Sbjct: 1026 DYGFVCSKSDPSLFVCHQDGKILYLLLYVDDILLTGSDQSLLEDLLQALKNRFSMKDLGP 1085
Query: 762 LSYF 765
YF
Sbjct: 1086 PRYF 1089
>UniRef100_Q8LM18 Putative copia-like retrotransposon polyprotein [Oryza sativa]
Length = 1042
Score = 495 bits (1275), Expect = e-138
Identities = 293/785 (37%), Positives = 416/785 (52%), Gaps = 36/785 (4%)
Query: 1 MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
+T+ L+T + I SG + I+ G + P PL LNNVLH P+ KNL
Sbjct: 229 ITSQLEKLNTREVYKGHDQIHTASGAGMKIKHIGHAIVHTPTRPLHLNNVLHVPQAAKNL 288
Query: 61 IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
I + DN+V +E F ++D T +++ LYPL +T+S T A A
Sbjct: 289 ISATKLASDNSVFVEIHSKYFLIKDRTTRSTVLKGPRRHGLYPLPSTSS---TKQAFAVA 345
Query: 121 LSPTLWHNRLGHPGAN-VLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTS 179
S WH+RLGHP V+ ++ NK ++ + S+C +C K +LP+S S S ++
Sbjct: 346 PSLERWHSRLGHPSIPIVMKVISSNKLPCLRESNKESVCDACQKAKSHQLPYSNSMSVSN 405
Query: 180 KPFDIIHSDLWTSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQ 239
KP ++I+SD+W S G K+Y+ F+D Y F W + + KS V F F ++
Sbjct: 406 KPLELIYSDVWGPASTSFGGKKFYVSFIDSYRKFSWIYFLKHKSDVFEKFHDFQQLVERL 465
Query: 240 FGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTS 299
F I Q D G E+ F K G+ SCPHT QNG AERK R I
Sbjct: 466 FDRKIIAMQTDWGGEYQK--LNSFFEKIGISHHVSCPHTHQQNGSAERKHRLIVEVGLAL 523
Query: 300 LAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLF 359
LA++SMP +W A T++ N +PS+IL + +P + L++ Y+ R+FGC C+P
Sbjct: 524 LAYASMPLKYWDEAFLAATHIINRIPSRILQYDTPLECLFNHKLDYSSFRIFGCACWPNL 583
Query: 360 PSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSS- 418
+KLQ RS C FLG H GYKC D+++ +I I R V+FDE FP +K H+++
Sbjct: 584 RPYNAHKLQFRSMQCVFLGPSHTHNGYKCLDIATGRIYICRDVVFDENVFPLSKFHSNAG 643
Query: 419 ---------------THTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASP 463
+HT + + +L ++ + + DE + TT +
Sbjct: 644 SRLRSEIALLPSHLLSHTSHQGGEHNNHMLDFYNVSSDQTDE----NADIDGGNTTDTTN 699
Query: 464 INVTNQSILPPSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQ 523
++ NQ L S+ Q H E + Q + PRT GI K K+
Sbjct: 700 DDLGNQ--LHELRSSVMQDMH--FGGEAATHATEDQSMVAAKPRT---RLQSGIRKEKVY 752
Query: 524 FNLTTS---ITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSM 580
+ T TSS P N AL+D NWK AM E+ AL+KNKTW LVP + N I
Sbjct: 753 TDGTVKYSCFTSSGEPQNLHEALNDKNWKHAMDSEYTALMKNKTWHLVPAKSDRNVIDCK 812
Query: 581 WIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPI 640
W+++ K+K+DGS +RYKARLV G Q G+D ++TFSPVVK TIR++L+IA+S+ W +
Sbjct: 813 WVYKIKRKADGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATIRVILSIAVSRGWSL 872
Query: 641 HQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFA 700
QLDV N FLHG L+E VYM QP+G+ P++VC L K+LYGLKQAPR WY R +
Sbjct: 873 RQLDVSNAFLHGILEEEVYMRQPLGYEVSSLPNHVCKLDKALYGLKQAPRVWYSRLSTKL 932
Query: 701 FTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLG 760
+GF SK+D SLF Y KG ++L+YVDDI + +S +++ L EFA+KDLG
Sbjct: 933 QELGFQASKADTSLFFYNKGVVSMFVLVYVDDIFVASSMQSATAALLQDLNKEFALKDLG 992
Query: 761 TLSYF 765
L YF
Sbjct: 993 DLHYF 997
>UniRef100_Q9T0C5 Retrotransposon like protein [Arabidopsis thaliana]
Length = 1515
Score = 488 bits (1256), Expect = e-136
Identities = 293/815 (35%), Positives = 419/815 (50%), Gaps = 70/815 (8%)
Query: 10 TYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTID 69
TYS + +++VG+G +PI G ++ L L +VL P + K+L+ V + T D
Sbjct: 346 TYSG---DDSVIVGNGDFLPITHIGTIPLNISQGTLPLEDVLVCPGITKSLLSVSKLTDD 402
Query: 70 NNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNR 129
+ FD S ++D +T L + + LY L Q+ + +WH R
Sbjct: 403 YPCSFTFDSDSVVIKDKRTQQLLTQGNKHKGLYVLKDVP-FQTYYSTRQQSSDDEVWHQR 461
Query: 130 LGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDL 189
LGHP VL L K K I + SS ++C++C GK +LPF S +S+P + IH DL
Sbjct: 462 LGHPNKEVLQHLIKTKAIVVNKTSS-NMCEACQMGKVCRLPFVASEFVSSRPLERIHCDL 520
Query: 190 W-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQ 248
W +P+ S+ G +YY+ F+D+Y+ F W +P+ KS S+F F ++ Q+ I FQ
Sbjct: 521 WGPAPVTSAQGFQYYVIFIDNYSRFTWFYPLKLKSDFFSVFVLFQQLVENQYQHKIAMFQ 580
Query: 249 CDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPS 308
CD G EF + F G+ SCPHT QNG AER+ R + + + HS +P
Sbjct: 581 CDGGGEFVSYKFVAHLASCGIKQLISCPHTPQQNGIAERRHRYLTELGLSLMFHSKVPHK 640
Query: 309 FWHHALQITTYLQNILPSKILSHH-SPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKL 367
W A + +L N+LPS LS + SP + L+ P YT LRVFG CYP A NK
Sbjct: 641 LWVEAFFTSNFLSNLLPSSTLSDNKSPYEMLHGTPPVYTALRVFGSACYPYLRPYAKNKF 700
Query: 368 QARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLND 427
+S C FLGY ++GY+C + K+ I RHV+FDE +FP++ ++ +F
Sbjct: 701 DPKSLLCVFLGYNNKYKGYRCLHPPTGKVYICRHVLFDERKFPYSDIYS------QFQTI 754
Query: 428 SLHPLL---HYHLQNDPKQDEPEPRKIES--------------------PQPATTPASPI 464
S PL + E +E + AT P +
Sbjct: 755 SGSPLFTAWQKGFSSTALSRETPSTNVEDIIFPSATVSSSVPTGCAPNIAETATAPDVDV 814
Query: 465 NVTNQSILPPSPMSINQLP-HPLVSTE-----------LTSPTHTPQQIH---------- 502
+ ++PPSP++ LP P ST S TPQ I+
Sbjct: 815 AAAHDMVVPPSPITSTSLPTQPEESTSDQNHYSTDSETAISSAMTPQSINVSLFEDSDFP 874
Query: 503 ------------QEPPRTIATHSMHGIHKPKIQFNLTTSITSSPLPHNPKAALSDSNWKA 550
E + T + GI KP ++ L + ++ P P + K AL D W
Sbjct: 875 PLQSVISSTTAAPETSHPMITRAKSGITKPNPKYALFSVKSNYPEPKSVKEALKDEGWTN 934
Query: 551 AMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VG 610
AM +E + + TW+LVP + W+F+ K SDGS +R KARLV G Q G
Sbjct: 935 AMGEEMGTMHETDTWDLVPPEMVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGYEQEEG 994
Query: 611 VDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFRDPI 670
VD ET+SPVV+ T+R +L +A W + QLDVKN FLH EL+ETV+M QP GF DP
Sbjct: 995 VDYVETYSPVVRSATVRSILHVATINKWSLKQLDVKNAFLHDELKETVFMTQPPGFEDPS 1054
Query: 671 HPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYV 730
PDYVC LKK++Y LKQAPRAW+ +F+ + GF S SD SLF+Y KG D+ ++LLYV
Sbjct: 1055 RPDYVCKLKKAIYDLKQAPRAWFDKFSSYLLKYGFICSFSDPSLFVYLKGRDVMFLLLYV 1114
Query: 731 DDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
DD+ILT ++DVL + ++++L++EF MKD+G L YF
Sbjct: 1115 DDMILTGNNDVLLQQLLNILSTEFRMKDMGALHYF 1149
>UniRef100_Q94KV0 Polyprotein [Arabidopsis thaliana]
Length = 1453
Score = 486 bits (1252), Expect = e-135
Identities = 300/788 (38%), Positives = 405/788 (51%), Gaps = 30/788 (3%)
Query: 1 MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
+T+S L S + + ++VG G +PI G T IS L LN VL P + K+L
Sbjct: 335 VTSSTNNLQAASPYNGSDTVLVGDGAYLPITHVGSTTISSDSGTLPLNEVLVCPDIQKSL 394
Query: 61 IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
+ V + D + FD + DI T + + + LY L S AA
Sbjct: 395 LSVSKLCDDYPCGVYFDANKVCIIDINTQKVVSKGPRSNGLYVLENQEFVAFYSNRQCAA 454
Query: 121 LSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSK 180
S +WH+RLGH + +L L +K I + +C+ C GK KL F SNS
Sbjct: 455 -SEEIWHHRLGHSNSRILQQLKSSKEISFNKSRMSPVCEPCQMGKSSKLQFFSSNSRELD 513
Query: 181 PFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQ 239
IH DLW SP++S G KYY+ F+DDY+ + W +P+ KS ++F +F ++ Q
Sbjct: 514 LLGRIHCDLWGPSPVVSKQGFKYYVVFVDDYSRYSWFYPLKAKSDFFAVFVAFQNLVENQ 573
Query: 240 FGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTS 299
F T IK FQ D G EF + + G+ R SCP+T QNG AERK R +
Sbjct: 574 FNTKIKVFQSDGGGEFTSNLMKKHLTDCGIQHRISCPYTPQQNGIAERKHRHFVELGLSM 633
Query: 300 LAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLF 359
+ HS P FW A ++L N+LPS L + SP + L + P+Y LRVFG CYP
Sbjct: 634 MFHSHTPLQFWVEAFFTASFLSNMLPSPSLGNVSPLEALLKQKPNYAMLRVFGTACYPCL 693
Query: 360 PSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSST 419
+K + RS C FLGY ++GY+C + ++ ISRHVIFDE FPF + +
Sbjct: 694 RPLGEHKFEPRSLQCVFLGYNSQYKGYRCLYPPTGRVYISRHVIFDEETFPFKQKYQFLV 753
Query: 420 HTYEFLNDSLHPLLHYHLQNDPKQDEP-----EPRKIES-PQPATTPASPIN--VTNQSI 471
YE LL + P+ D+ E KIES +P + + I T +I
Sbjct: 754 PQYE------SSLLSAWQSSIPQADQSLIPQAEEGKIESLAKPPSIQKNTIQDTTTQPAI 807
Query: 472 LPPSPMSINQLPHPLVSTE---LTSPTHTP---------QQIHQEPPRT--IATHSMHGI 517
L ++ + TE L THT +++ QEP T + T S GI
Sbjct: 808 LTEGVLNEEEEEDSFEETETESLNEETHTQNDEAEVTVEEEVQQEPENTHPMTTRSKAGI 867
Query: 518 HKPKIQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFI 577
HK ++ L TS S P + AL+ W A+ DE + TW LV +++N +
Sbjct: 868 HKSNTRYALLTSKFSVEEPKSIDEALNHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNIL 927
Query: 578 RSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKS 637
W+F+ K K DGS ++ KARLV G Q G+D ETFSPVV+ TIR+VL +A +K
Sbjct: 928 GCRWVFKTKLKPDGSVDKLKARLVAKGFHQEEGLDYLETFSPVVRTATIRLVLDVATAKG 987
Query: 638 WPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFA 697
W I QLDV N FLHGEL+E VYM QP GF D P YVC L K+LYGLKQAPRAW+ +
Sbjct: 988 WNIKQLDVSNAFLHGELKEPVYMLQPPGFVDQEKPSYVCRLTKALYGLKQAPRAWFDTIS 1047
Query: 698 DFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMK 757
++ GFS SKSD SLF Y K +LLYVDDI+LT S L + ++ L F+MK
Sbjct: 1048 NYLLDFGFSCSKSDPSLFTYHKNGKTLVLLLYVDDILLTGSDHNLLQELLMSLNKRFSMK 1107
Query: 758 DLGTLSYF 765
DLG SYF
Sbjct: 1108 DLGAPSYF 1115
>UniRef100_Q5XWK9 Gag-pol polyprotein-like [Solanum tuberosum]
Length = 1212
Score = 461 bits (1186), Expect = e-128
Identities = 293/781 (37%), Positives = 409/781 (51%), Gaps = 70/781 (8%)
Query: 1 MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
MT S L I + +G +PI G I+P + NV +PKL +L
Sbjct: 330 MTNSTSILKNVRKYQGPSQIQIANGSNLPITKVGD--ITPTF-----KNVFVSPKLSTSL 382
Query: 61 IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
I V + +DNN + F V+D +G + + G L+P+ + + T A
Sbjct: 383 ISVGQL-VDNNCDVNFSRNGCLVQDQVSGTIIAKGPKVGRLFPIHFSIPPVLSFACTSTA 441
Query: 121 LSPTLWHNRLGHPGANVLSFL-------NKNKFIECKQISSPSI-CQSCIYGKHVKLPFS 172
+WH RLGHP + VLS + NKNKF S SI C +C GK LPF
Sbjct: 442 SKTEVWHKRLGHPNSVVLSHISNSGLLGNKNKF------SVASIDCSTCKLGKSKTLPFP 495
Query: 173 ISNSTTSKPFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSS 231
S +K FD+IHSD+W SPI+S A KY++ F+DDY+ F W + + KS+V S+F +
Sbjct: 496 NFGSRATKCFDVIHSDVWGISPIISHAHFKYFMTFIDDYSRFTWVYFLRSKSEVFSMFKT 555
Query: 232 FHAFIKTQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRA 291
F A+I+TQF T IK + D+G E+ + F +F G+V + SCP+T QNG AERK R
Sbjct: 556 FLAYIETQFSTCIKLLRSDSGGEYMSYEFKKFLLDKGIVSQHSCPYTPQQNGVAERKNRH 615
Query: 292 INNFIRTSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVF 351
+ + RT L SS+P +W AL YL N LPSK+L+ SP LYH++P+Y+ F
Sbjct: 616 LLDVTRTLLIESSVPSKYWVEALSTAVYLINRLPSKVLNLESPYFRLYHQNPNYSDFHTF 675
Query: 352 GCLCYPLFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPF 411
GC+C+ P + NKL +ST CAF+GY + +G+ CYD S K ISR+V+F E Q+ F
Sbjct: 676 GCVCFVHLPPSQCNKLSVQSTKCAFMGYSTSQKGFICYDPCSHKFRISRNVVFFENQYFF 735
Query: 412 AKTHTSSTHTYEFLNDSLHPLL--HYHLQNDPKQDEP----EPRKIESPQPATTPASPIN 465
S S+ PLL L + K+ +P E R+ P P T P
Sbjct: 736 PTIVDLS---------SVSPLLPTFEDLSSSFKRFKPGFVYERRRPTLPYPNTDP----- 781
Query: 466 VTNQSILPPSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFN 525
PP P + +E S P + + R T + +G
Sbjct: 782 -------PPETA-------PQLESE-NSSRSGPLEPTRRSTRVSRTPNWYG--------- 817
Query: 526 LTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRH 585
++++++ +P A W+ AM +E AL +N TW++V P NV I W++
Sbjct: 818 FSSTLSNISVPSCYSQASKHECWQKAMEEELLALKENDTWDIVSCPSNVRPIGCKWVYSI 877
Query: 586 KKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDV 645
K SDG+ +RYKARLV G Q GVD +ETF+PV K TT+R ++ IA S++W ++Q DV
Sbjct: 878 KLHSDGTLDRYKARLVVLGNRQEYGVDYEETFAPVAKMTTVRTIIAIAASQNWSLYQKDV 937
Query: 646 KNVFLHGELQETVYMHQPMG-FRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIG 704
KN FLHG+L+E +YM P F P VC LK+SLYGLKQAPRAW+ +F
Sbjct: 938 KNAFLHGDLKEDIYMKPPPDLFSSPTSD--VCKLKRSLYGLKQAPRAWFDKFRSTLLQFS 995
Query: 705 FSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSY 764
F SK D SLF+ + +L+YVDDII+T + L + L F MKDLGTL+Y
Sbjct: 996 FELSKYDSSLFLRKTSTSCVLLLVYVDDIIITGTDSSLITCLQQQLKDSFHMKDLGTLTY 1055
Query: 765 F 765
F
Sbjct: 1056 F 1056
>UniRef100_Q9FRJ2 Putative copia-like retrotransposon polyprotein [Oryza sativa]
Length = 1302
Score = 458 bits (1179), Expect = e-127
Identities = 244/534 (45%), Positives = 318/534 (58%), Gaps = 26/534 (4%)
Query: 238 TQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIR 297
++FG + Q DNG E+++ +G V R SCP++S QNGKAER +R IN+++R
Sbjct: 513 SKFGLPVLALQTDNGKEYDSYALRSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDYVR 572
Query: 298 TSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYP 357
T L HS+ P SFW ALQ T+L N P + +P Q L P+Y HLRVFGCLCYP
Sbjct: 573 TMLVHSAAPLSFWAEALQTATHLINRRPCRATGSLTPYQLLLGAPPTYDHLRVFGCLCYP 632
Query: 358 LFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTS 417
+TA +KL RS C F+GYP +HRGY+CYD+ S+++ SRHV F E FPF
Sbjct: 633 NTIATAPHKLSPRSLACVFIGYPADHRGYRCYDMVSRRVFTSRHVTFVEDVFPF------ 686
Query: 418 STHTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVT--NQSILPPS 475
D+ P P D + + P PA +P+ + + PPS
Sbjct: 687 --------RDAPSP----RPSAPPPPDHGDDTIVLLPAPAQHVVTPVGTAPAHDAASPPS 734
Query: 476 PMSINQLPHPLVSTELTSPTHTPQQ---IHQEPPR-TIATHSMHGIHKPKIQFNLTTSIT 531
P S P +P +P+ PPR + T + GI KP ++ +T + T
Sbjct: 735 PAS--STPSSAAPAHDVAPPPSPETSSPASASPPRHAMTTRARAGISKPNPRYAMTATST 792
Query: 532 SSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDG 591
SP P + +AAL D NW+AAM EFDAL+ N+TW LVPRP I W+F+ K +DG
Sbjct: 793 LSPTPSSVRAALRDPNWRAAMQAEFDALLANRTWTLVPRPPGARIITGKWVFKTKLHADG 852
Query: 592 SFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLH 651
S ++YKAR V G +Q GVD ETFSPVVKP TIR VLT+ SK WP HQLDV N FLH
Sbjct: 853 SLDKYKARWVVRGFNQRPGVDFGETFSPVVKPATIRTVLTLISSKQWPAHQLDVSNAFLH 912
Query: 652 GELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSD 711
G LQE V QP GF D P VCLL +SLYGL+QAPRAW++RFAD A ++GF S++D
Sbjct: 913 GHLQERVLCQQPTGFEDAARPADVCLLSRSLYGLRQAPRAWFKRFADHATSLGFVQSRAD 972
Query: 712 HSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
SLF+ R+G+D Y+LLYVDD+IL+ASS L + I+ L +EF +KD+G L YF
Sbjct: 973 PSLFVLRRGSDTAYLLLYVDDMILSASSSSLLQRIIDRLQAEFKVKDMGPLKYF 1026
Score = 115 bits (289), Expect = 5e-24
Identities = 64/141 (45%), Positives = 88/141 (62%), Gaps = 7/141 (4%)
Query: 1 MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
M+++ G L+ L + I VG+G ++P+ T+I L L+NVL +P LIKNL
Sbjct: 378 MSSTPGILAHPRPLPFSSCITVGNGAKLPVTHTASTHIPTSSTDLHLHNVLVSPPLIKNL 437
Query: 61 IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAA 120
I V++ T DNNV+IEFDP FS++D+QT + +RCDS GDLYPL + H A A
Sbjct: 438 ISVKKLTRDNNVSIEFDPTGFSIKDLQTQVVKLRCDSPGDLYPLRLPSPH-----ALSAT 492
Query: 121 LSPTL--WHNRLGHPGANVLS 139
SP++ WH RLGHPG+ LS
Sbjct: 493 SSPSVEHWHLRLGHPGSASLS 513
>UniRef100_O82331 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1149
Score = 450 bits (1158), Expect = e-125
Identities = 278/764 (36%), Positives = 392/764 (50%), Gaps = 35/764 (4%)
Query: 17 NKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEF 76
N ++ G +PI G + L L +VL P + K+L+ V + T D + F
Sbjct: 340 NDTVMASDGNFLPITHIGSANLPSTSGNLPLKDVLVCPNIAKSLLSVSKLTKDYPCSFTF 399
Query: 77 DPFSFSVEDIQTGIPLMRCDSTGD-LYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPGA 135
D V+D T L + ST + LY L S A + +WH RLGHP
Sbjct: 400 DADGVLVKDKATCKVLTKGSSTSEGLYKLENPKFQMFYSTRQVKA-TDEVWHMRLGHPNP 458
Query: 136 NVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLW-TSPI 194
VL L K I+ + S+ +C+SC GK +LPF S+ S+P + +H DLW +P+
Sbjct: 459 QVLQLLANKKAIQINK-STSKMCESCRLGKSSRLPFIASDFIASRPLERVHCDLWGPAPV 517
Query: 195 LSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTE 254
S G +YY+ F+D+ + F W +P+ KS S+F F +F++ T I FQ D G E
Sbjct: 518 SSIQGFQYYVIFIDNRSRFCWFYPLKHKSDFCSLFMKFQSFVENLLQTKIGTFQSDGGGE 577
Query: 255 FNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHAL 314
F + F Q ++G+ SCPHT QNG AERK R + T + S P FW A
Sbjct: 578 FTSNRFLQHLQESGIQHYISCPHTPQQNGLAERKHRQLTERGLTLMFQSKAPQRFWVEAF 637
Query: 315 QITTYLQNILPSKIL-SHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTP 373
+L N+LP+ L S +P Q L+ + P Y+ LR FGC C+P + A NK RS
Sbjct: 638 FTANFLSNLLPTSALDSSTTPYQVLFGKAPDYSALRTFGCACFPTLRAYARNKFDPRSLK 697
Query: 374 CAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLNDSLHPLL 433
C FLGY + ++GY+C+ + ++ +SRHV+FDE+ FPF T+TS H S P+
Sbjct: 698 CIFLGYTEKYKGYRCFFPPTNRVYLSRHVLFDESSFPFIDTYTSLQHP------SPTPMF 751
Query: 434 HYHLQNDPKQDEPEPRKIESPQPA--TTPASPINVTNQSILPPSPMSINQLPHPLVSTEL 491
L++ P P +E+ Q A + AS +T Q P +S+ P+ L+
Sbjct: 752 DAWLKSFPSSSSP----LENDQTAGFNSGASVPVITAQQTQPI--LSLKDGPNILLPEGE 805
Query: 492 TSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNL----------TTSITSSPLPHNPKA 541
+ + Q I EP ++ K L T S P+ +N
Sbjct: 806 ITVSSNNQDIEDEPICVTPLQTLSSEDNAKSSETLSMGSEECSECTASFDLDPIGNN--- 862
Query: 542 ALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLV 601
ALS S + T + R + + R K DGS ++YKARLV
Sbjct: 863 ALSSSPRHDQLTSSIPRAATESTHPMTTRLKKGIIKLNQ---RVKLNVDGSLDKYKARLV 919
Query: 602 GDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMH 661
G Q G+D ET+SPVV+ T+R VL ++ +W + Q+DVKN FLHG+L ETVYM
Sbjct: 920 AQGFKQEEGIDYLETYSPVVRSATVRAVLHLSTIMNWELKQMDVKNGFLHGDLTETVYMK 979
Query: 662 QPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGN 721
QP GF D HPD+VCLL K+LYGLKQAPRAW+ +F+ F + GF S SD SLF+ K
Sbjct: 980 QPAGFIDKAHPDHVCLLHKALYGLKQAPRAWFDKFSKFLLSFGFVCSMSDPSLFVCVKNK 1039
Query: 722 DMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
D+ +LLYVDD+++T +S L S++S L +F MKDLG LSYF
Sbjct: 1040 DVIMLLLYVDDMVITGNSSKLLSSLLSELNKQFKMKDLGRLSYF 1083
>UniRef100_O81824 Hypothetical protein AT4g27210 [Arabidopsis thaliana]
Length = 1318
Score = 446 bits (1146), Expect = e-123
Identities = 274/789 (34%), Positives = 386/789 (48%), Gaps = 94/789 (11%)
Query: 20 IVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFDPF 79
I+V G +PI G T ++ + L +VL P + K+L+ + + T D T+EF+
Sbjct: 206 IMVDDGNYLPITHTGSTNLASSSGTVPLTDVLVCPSITKSLLSMSKLTQDFPCTVEFEYD 265
Query: 80 SFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPGANVLS 139
V D T L+ + LY L Q+ + S +WH RLGHP +L
Sbjct: 266 GVRVNDKATKKLLLMGSNRDGLYCLKDDKQFQAFFSTRQRSASDEVWHRRLGHPHPQIL- 324
Query: 140 FLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLW-TSPILSSA 198
+P + +H DLW + I S
Sbjct: 325 ----------------------------------------QPLERVHCDLWGPTTITSVQ 344
Query: 199 GHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEFNNE 258
G +YY F+D Y+ F W +P+ KS +IF +FH ++ Q I FQCD G EF +
Sbjct: 345 GFRYYAVFIDHYSRFSWIYPLKLKSDFYNIFLAFHKLVENQLSQKISVFQCDGGGEFVSH 404
Query: 259 YFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQITT 318
F Q +G+ + SCPHT QNG AERK R + + L S +P FW A
Sbjct: 405 KFLQHLQSHGIQQQLSCPHTPQQNGLAERKHRHLVELGLSMLFQSHVPHKFWVEAFFTAN 464
Query: 319 YLQNILPSKILSHH-SPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCAFL 377
+L N+LP+ L SP + LY + P YT LR FG C+P A NK S C FL
Sbjct: 465 FLINLLPTSALKESISPYEKLYDKKPDYTSLRSFGSACFPTLRDYAENKFNPCSLKCVFL 524
Query: 378 GYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLNDSLH-PLLHYH 436
GY + ++GY+C + ++ ISRHVIFDE+ +PF+ HTY+ L+ PLL
Sbjct: 525 GYNEKYKGYRCLYPPTGRLYISRHVIFDESVYPFS-------HTYKHLHPQPRTPLLAAW 577
Query: 437 LQNDPKQDEPEPRKIESPQPAT---------------TPASPINVTNQSILPPSPMSINQ 481
L++ D P P SP + TP P V S+ S ++ Q
Sbjct: 578 LRSS---DSPAPSTSTSPSSRSPLFTSADFPPLPQRKTPLLPTLVPISSVSHASNITTQQ 634
Query: 482 LPH-------PLVSTELTSPTHTPQ--------------QIHQEPPRT----IATHSMHG 516
P S + +H+ Q +HQ T + T + G
Sbjct: 635 SPDFDSERTTDFDSASIGDSSHSSQAGSDSEETIQQASVNVHQTHASTNVHPMVTRAKVG 694
Query: 517 IHKPKIQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNF 576
I KP ++ + S P P AAL W AM +E + +TW LVP +++
Sbjct: 695 ISKPNPRYVFLSHKVSYPEPKTVTAALKHPGWTGAMTEEIGNCSETQTWSLVPYKSDMHV 754
Query: 577 IRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSK 636
+ S W+FR K +DG+ + KAR+V G Q G+D ET+SPVV+ T+R+VL +A +
Sbjct: 755 LGSKWVFRTKLHADGTLNKLKARIVAKGFLQEEGIDYLETYSPVVRTPTVRLVLHLATAL 814
Query: 637 SWPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRF 696
+W I Q+DVKN FLHG+L+ETVYM QP GF DP PD+VCLL KS+YGLKQ+PRAW+ +F
Sbjct: 815 NWDIKQMDVKNAFLHGDLKETVYMTQPAGFVDPSKPDHVCLLHKSIYGLKQSPRAWFDKF 874
Query: 697 ADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAM 756
+ F GF SKSD SLFIY N++ +LLYVDD+++T +S S+++ L EF M
Sbjct: 875 STFLLEFGFFCSKSDPSLFIYAHNNNLILLLLYVDDMVITGNSSQTLTSLLAALNKEFRM 934
Query: 757 KDLGTLSYF 765
D+G L YF
Sbjct: 935 TDMGQLHYF 943
>UniRef100_Q9FWZ5 Putative retroelement polyprotein [Arabidopsis thaliana]
Length = 1404
Score = 437 bits (1123), Expect = e-120
Identities = 268/788 (34%), Positives = 397/788 (50%), Gaps = 62/788 (7%)
Query: 19 NIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFDP 78
++++ +G ++PI G G + + PK NL+ V+R T D N F P
Sbjct: 355 HVIIANGDKVPIEGIGNLKLFNKD-----SKAFFMPKFTSNLLSVKRTTRDLNCYAIFGP 409
Query: 79 FSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALS---PTLWHNRLGHPGA 135
+DI+TG + S G+LY L + + S+ ++ + L TLWH RLGHP
Sbjct: 410 NDVYFQDIETGKVIGEGGSKGELYVLEDLSPNSSSCFSSKSHLGISFNTLWHARLGHPHT 469
Query: 136 NVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLWTSPIL 195
L + N + + C++CI GKH K F S + K FD++HSD+WTSP +
Sbjct: 470 RALKLMLPNISFD------HTSCEACILGKHCKSVFPKSLTIYEKCFDLVHSDVWTSPCV 523
Query: 196 SSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEF 255
S +KY++ F+++ + + W + K +V F++F ++ QF IK F+ DNG E+
Sbjct: 524 SRDNNKYFVTFINEKSKYTWITLLPSKDRVFEAFTNFETYVTNQFNAKIKVFRTDNGGEY 583
Query: 256 NNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQ 315
++ F K G++ + SCP+T QNG AERK R + R+ + H+S+P FW A+
Sbjct: 584 TSQKFRDHLAKRGIIHQTSCPYTPQQNGVAERKNRHLMEVARSMMFHTSVPKRFWGDAVL 643
Query: 316 ITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCA 375
YL N P+K+LS SP + L + P HLRVFGC+C+ L P +KL A+ST C
Sbjct: 644 TACYLINRTPTKVLSDLSPFEVLNNTKPFIDHLRVFGCVCFVLIPGEQRSKLDAKSTKCM 703
Query: 376 FLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAK---------THTSS--THTYEF 424
FLGY +GYKC+D + + ISR V F E Q K TH++S T +F
Sbjct: 704 FLGYSTTQKGYKCFDPTKNRTFISRDVKFLENQDYNNKKDWENLKDLTHSTSDRVETLKF 763
Query: 425 LNDSLHPLLHYHLQND-PKQDEPEPRKIESPQPATTPASPINVTNQSIL------PPSPM 477
L D HL ND + +P + + +++ +Q L PP+
Sbjct: 764 LLD--------HLGNDSTSTTQHQPEMTQDQEDLNQENEEVSLQHQENLTHVQEDPPNTQ 815
Query: 478 SINQLPHPLVSTELTSPTHTPQQIHQEPP------RTIATHSMHGIHKPKIQFNLTTSIT 531
++ H + +S P Q+ PP R + F T S+
Sbjct: 816 EHSE--HVQEIQDDSSEDEEPTQVLPPPPPLRRSTRIRRKKEFFNSNAVAHPFQATCSLA 873
Query: 532 SSPLPHNP--------------KAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFI 577
PL H + A+ W+ A+ DE +A+ +N TW+ P+ +
Sbjct: 874 LVPLDHQAFLSKISEHWIPQTYEEAMEVKEWRDAIADEINAMKRNHTWDEDDLPKGKKTV 933
Query: 578 RSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKS 637
S W+F K KS+G ERYK RLV G +Q G D ETF+PV K T+R+VL +A + S
Sbjct: 934 SSRWVFTIKYKSNGDIERYKTRLVARGFTQTYGSDYMETFAPVAKLHTVRVVLALATNLS 993
Query: 638 WPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFA 697
W + Q+DVKN FL GEL++ VYM P G D I D V L+K++YGLKQ+PRAWY + +
Sbjct: 994 WGLWQMDVKNAFLQGELEDDVYMTPPPGLEDTIPCDKVLRLRKAIYGLKQSPRAWYHKLS 1053
Query: 698 DFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMK 757
GF S+SDH+LF + + +L+YVDD+I+T + S + L S F +K
Sbjct: 1054 RTLKDHGFKKSESDHTLFTLQSPQGIVVVLIYVDDLIITGDNKDGIDSTKTFLKSCFDIK 1113
Query: 758 DLGTLSYF 765
DLG L YF
Sbjct: 1114 DLGELKYF 1121
>UniRef100_Q6L3M9 Putative late blight resistance protein [Solanum demissum]
Length = 1630
Score = 419 bits (1078), Expect = e-115
Identities = 268/800 (33%), Positives = 382/800 (47%), Gaps = 97/800 (12%)
Query: 18 KNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFD 77
+ I +G G IPI G T +S L N L + + NL+ V +F DN+ +IEF
Sbjct: 313 EEIAMGDGNTIPISHTGNTNLSASNQQFKLLNTLCSHSIKNNLLSVSKFCRDNHTSIEFF 372
Query: 78 PFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPGANV 137
PFS+ V+D+ TG PL R + LY ++H + P + LWH RLGHP
Sbjct: 373 PFSYCVKDLSTGAPLFRGQNRDGLYEWPLGSAHHT--PQCNVVVPLHLWHRRLGHPNHRT 430
Query: 138 LSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLW-TSPILS 196
L+ + + + SIC SC K +LPFS ++ + +P II++DLW SP+LS
Sbjct: 431 LNMIFHQFSLPVSHSRTASICNSCYSNKMHRLPFSENSLQSQRPLQIIYTDLWGPSPVLS 490
Query: 197 SAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEFN 256
+YY F+D Y+ ++ F I K +V +F + H ++ +F T I D G EF
Sbjct: 491 IDNKRYYALFVDQYSKYMCLFTIKSKKEVLDVFQALHPLLERRFQTKIMSLYTDGGGEFQ 550
Query: 257 NEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQI 316
+ + G+ + P+T + ER+ + + +T L +S+P SFW A
Sbjct: 551 G--LSSYLKIQGIEHLVTPPYTPQRVASVERRHKHVVETAKTLLHQASLPSSFWSFACHQ 608
Query: 317 TTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCAF 376
YL N L + L + P + L+H P Y LRVFGCLCYP A NKL+ +STPC +
Sbjct: 609 AVYLINRLTTPNLQNKCPYEILFHEAPKYESLRVFGCLCYPWLKPYAKNKLEPKSTPCVY 668
Query: 377 LGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKT-----HTSSTHTYEFLND---- 427
LG+ H ++C+D K+ +SR V F E +PF + ST ++E D
Sbjct: 669 LGFSTKHYCHQCFDPVKNKLYLSRDVQFLEDTYPFHNIFLNLKNQQSTDSWEICYDVLPV 728
Query: 428 --------SLHPL-----LHYHLQND-PKQDEPEPRKIESPQPATTPASPINVT------ 467
S H L ++ L N P + E + Q ++P+ P +T
Sbjct: 729 TNKPSSFDSCHTLPDALPVYSLLPNSMPARSEGVSIASGNSQTLSSPSLPHTITPPPDYT 788
Query: 468 ----------------NQSILP-PSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIA 510
+ S+LP PSP+ LP + + PT P T +
Sbjct: 789 QPQPLITYQRKNHQQPSTSVLPLPSPIPPTNLPSQSSANNSSQPTLALAPSDPSPVVTTS 848
Query: 511 THSMHGIHK-----PKIQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTW 565
+H M K PK QF++ ++SS +PH K A +W+ AM EFDAL++N TW
Sbjct: 849 SHPMVTRSKTNSLQPK-QFSVNVQLSSSFVPHTYKQACPHPHWREAMHAEFDALVRNWTW 907
Query: 566 ELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTT 625
+LVP ++N + PVVKP T
Sbjct: 908 DLVPVTHSMNVV----------------------------------------DPVVKPIT 927
Query: 626 IRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGL 685
IR+VLTI +WPIHQ+DV N FL G L+E VYM QP GF D +VC L K +YGL
Sbjct: 928 IRLVLTIVTQYNWPIHQIDVNNAFLQGSLEEEVYMRQPPGFEDQSLSTHVCKLNKVIYGL 987
Query: 686 KQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRS 745
KQAPRAWY + T+GF S+SD SLFI Y+L+YVD II+T + R
Sbjct: 988 KQAPRAWYNELKSYLLTVGFVKSQSDSSLFILHNFGFTVYVLIYVDAIIITGNQIHGVRH 1047
Query: 746 IMSLLASEFAMKDLGTLSYF 765
I+ L + F++KDLG L YF
Sbjct: 1048 IIDGLFTRFSLKDLGQLHYF 1067
>UniRef100_Q710T7 Gag-pol polyprotein [Populus deltoides]
Length = 1382
Score = 414 bits (1065), Expect = e-114
Identities = 280/804 (34%), Positives = 405/804 (49%), Gaps = 53/804 (6%)
Query: 1 MTASQGTLSTYSNLSINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNL 60
M+ + ++ S LS + ++ G +P+ G G LSL NV PKL NL
Sbjct: 352 MSPDSSSFTSVSPLS-SIPVMTADGTPMPLAGVGSVVTLH----LSLPNVYLIPKLKLNL 406
Query: 61 IFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPL---------TTTTSHQ 111
+ + + + F V+D+Q+ + LY L TT
Sbjct: 407 ASIGQICDSGDYLVMFSGSFCCVQDLQSQKLIGTGRRENGLYILDELKVPVVVAATTVDL 466
Query: 112 STSPATFAALSPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPF 171
S + ++ S LWH+RLGH ++ L FL + + S C C K LPF
Sbjct: 467 SFFRLSLSSSSFYLWHSRLGHVSSSRLRFLASTGALGNLKTCDISDCSGCKLAKFSALPF 526
Query: 172 SISNSTTSKPFDIIHSDLW-TSPILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFS 230
+ S S +S PFD+IHSD+W SP+ + G +YY+ F+DD+T + W + + +S+ I++
Sbjct: 527 NRSTSVSSSPFDLIHSDVWGPSPVSTKGGSRYYVSFIDDHTRYCWVYLMKHRSEFFEIYA 586
Query: 231 SFHAFIKTQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIR 290
+F A IKTQ IKCF+CD G E+ + F Q +G + + SC T QNG AERK R
Sbjct: 587 AFRALIKTQHSAVIKCFRCDLGGEYTSNKFCQMLALDGTIHQTSCTDTPEQNGVAERKHR 646
Query: 291 AINNFIRTSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRV 350
I R+ L + + FW A+ L N +PS S SP + LY P Y+ RV
Sbjct: 647 HIVETARSLLLSAFVLSEFWGEAVLTAVSLINTIPSSHSSGLSPFEKLYGHVPDYSSFRV 706
Query: 351 FGCLCYPLFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFP 410
FGC + L P NKL +RS C FLGY + +GY+C+D ++K+ +S HV+F E P
Sbjct: 707 FGCTYFVLHPHVERNKLSSRSAICVFLGYGEGKKGYRCFDPITQKLYVSHHVVFLE-HIP 765
Query: 411 FAKTHTSSTHTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVTNQS 470
F S+TH+ +D +H + ++ P R I + A T +
Sbjct: 766 FFSI-PSTTHSLT-KSDLIH--IDPFSEDSGNDTSPYVRSICTHNSAGT---------GT 812
Query: 471 ILPPSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNLTTSI 530
+L +P + P S+E+ P PPR + P ++ +S
Sbjct: 813 LLSGTPEASFSSTAPQASSEIVDP----------PPRQ-SIRIRKSTKLPDFAYSCYSSS 861
Query: 531 TSSPL--------PHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWI 582
+S L P + K A+ D + AM +E AL K TW+LVP P + + W+
Sbjct: 862 FTSFLAYIHCLFEPSSYKEAILDPLGQQAMDEELSALHKTDTWDLVPLPPGKSVVGCRWV 921
Query: 583 FRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQ 642
++ K SDGS ERYKARLV G SQ G+D +ETF+P+ K TTIR ++ +A + W I Q
Sbjct: 922 YKIKTNSDGSIERYKARLVAKGYSQQYGMDYEETFAPIAKMTTIRTLIAVASIRQWHISQ 981
Query: 643 LDVKNVFLHGELQETVYMHQPMGFRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFT 702
LDVKN FL+G+LQE VYM P G YVC LKK+LYGLKQAPRAW+++F+ +
Sbjct: 982 LDVKNAFLNGDLQEEVYMAPPPGISH--DSGYVCKLKKALYGLKQAPRAWFEKFSIVISS 1039
Query: 703 IGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIMSL-LASEFAMKDLGT 761
+GF S D +LFI + LYVDD+I+T D+ S++ LA F MKDLG
Sbjct: 1040 LGFVSSSHDSALFIKCTDAGRIILSLYVDDMIIT-GDDIDGISVLKTELARRFEMKDLGY 1098
Query: 762 LSYFFR-PCSYTSCRWLVS*SKKI 784
L YF +Y+ +L+S SK +
Sbjct: 1099 LRYFLGIEVAYSPRGYLLSQSKYV 1122
>UniRef100_O23741 SLG-Sc and SLA-Sc genes and Melmoth retrotransposon sequence
[Brassica oleracea]
Length = 1131
Score = 407 bits (1045), Expect = e-111
Identities = 252/769 (32%), Positives = 381/769 (48%), Gaps = 59/769 (7%)
Query: 9 STYSNLSIN--KNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRF 66
S Y+++ I N+ + +G + I G G ++ ++L+NVL+ P+ NL+ +
Sbjct: 406 SMYTSIDITTTSNVNLPNGMIVKISGVGIVQLNEH---ITLHNVLYIPEFRLNLLSISSL 462
Query: 67 TIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLW 126
T D + FD S +++D G + + +LY L +S + A + +LW
Sbjct: 463 TSDIGSQVIFDVSSCAIQDPTKGWTIGQGRRVANLYVLDVKSSPMKIN----AVVDISLW 518
Query: 127 HNRLGHPGANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIH 186
H RLGHP L +++ + + C C K KL +S N + F ++H
Sbjct: 519 HKRLGHPSYTRLDKISEALGTTKHKNKGDAHCHVCHLAKQKKLSYSSQNHICTASFQLLH 578
Query: 187 SDLWTS-PILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIK 245
D+W + + G+KY+L +DD++ W + + KS V IF +F I+TQ+ T IK
Sbjct: 579 VDVWGPFSVETLEGYKYFLTIVDDHSRATWIYLLQSKSDVLHIFPTFVNQIETQYNTKIK 638
Query: 246 CFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSM 305
+ DN E + FT+ + G+V SCP T QN ERK + + N R + S +
Sbjct: 639 SVRRDNAPELS---FTELFKEKGIVSYHSCPETLEQNSVLERKHQHLLNVARALMFQSQV 695
Query: 306 PPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAIN 365
P +W + +L N PS +L++ SP + L + P Y LR FGCLCY +
Sbjct: 696 PLQYWGDCVLTAAFLINRTPSPLLANKSPYEVLMGKAPQYDQLRTFGCLCYGSTSPKQRH 755
Query: 366 KLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFL 425
K RS C FLGYP ++GYK DL S KI ISR+V F E FP AK + F
Sbjct: 756 KFMPRSRACVFLGYPSGYKGYKLLDLESNKIYISRNVTFHEDIFPMAKHQKMDESSLHFF 815
Query: 426 NDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVTNQSILPPSPMSINQLPHP 485
P T P++P N S P S +S P
Sbjct: 816 ----------------------------PPKVTVPSAPS--PNISSSPFSTLS------P 839
Query: 486 LVST-ELTSPTHTPQ----QIHQEPPRTIATHSMHGIHKPKIQFNLTTSITSSPLPHNPK 540
+S + T P H +H +T S I + + SIT+ P+P +
Sbjct: 840 QISKRQRTVPAHLKDFHCYSVHDSAYPISSTLSYSQISSHHLAY--INSITNIPIPQSYA 897
Query: 541 AALSDSNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARL 600
W + E DA+ +N TW++VP P+ I W+ K +DG+ ER K+RL
Sbjct: 898 EVRQSKEWTESADKELDAMEENDTWDVVPLPKGKKAIGCRWVHTLKFNADGTLERRKSRL 957
Query: 601 VGDGRSQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYM 660
VG G +Q G+D ETFSPV K T++++L + SK W +HQLD+ N FL+GEL E +YM
Sbjct: 958 VGKGYTQKEGLDYIETFSPVAKMATVKLLLKVGASKKWFLHQLDISNAFLNGELDEEIYM 1017
Query: 661 HQPMGF---RDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIY 717
P G+ + + P+ VC LKKS+YGLKQA R W+++F+ F +GF + DH+LF+
Sbjct: 1018 KLPEGYAERKGDLPPNAVCKLKKSIYGLKQASRQWFKKFSTSLFQLGFQKAHGDHTLFVR 1077
Query: 718 RKGNDMTYILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYFF 766
+ ND +L+YVDDI++ ++ D + + S L S F ++DLG+L YFF
Sbjct: 1078 QTENDFVAVLVYVDDIVIASTDDAVAVKLKSDLKSFFKLRDLGSLKYFF 1126
>UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana]
Length = 1315
Score = 398 bits (1022), Expect = e-109
Identities = 252/738 (34%), Positives = 363/738 (49%), Gaps = 43/738 (5%)
Query: 45 LSLNNVLHAPKLIKNLIFVRRFTIDNNVTIEFDPFSFSVEDIQTGIPLMRCDSTGDLYPL 104
L LN+VL P+ NL+ V T I FD S ++D + + +LY +
Sbjct: 323 LILNDVLFIPQFKFNLLSVSSLTKSMGCRIWFDETSCVLQDATRELMVGMGKQVANLYIV 382
Query: 105 TTTT-SHQST-SPATFAAL-SPTLWHNRLGHPGANVLSFLNKNKFIECKQISSPSICQSC 161
+ SH T S T A++ S LWH RLGHP L ++ ++ ++ C+ C
Sbjct: 383 DLDSLSHPGTDSSITVASVTSHDLWHKRLGHPSVQKLQPMSSLLSFPKQKNNTDFHCRVC 442
Query: 162 IYGKHVKLPFSISNSTTSKPFDIIHSDLWTS-PILSSAGHKYYLFFLDDYTNFVWTFPIG 220
K LPF N+ +S+PFD+IH D W + + G++Y+L +DDY+ W + +
Sbjct: 443 HISKQKHLPFVSHNNKSSRPFDLIHIDTWGPFSVQTHDGYRYFLTIVDDYSRATWVYLLR 502
Query: 221 RKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSP 280
KS V ++ +F ++ QF TTIK + DN E N FTQF H G+V SCP T
Sbjct: 503 NKSDVLTVIPTFVTMVENQFETTIKGVRSDNAPELN---FTQFYHSKGIVPYHSCPETPQ 559
Query: 281 QNGKAERKIRAINNFIRTSLAHSSMPPSFWHHALQITTYLQNILPSKILSHHSPTQYLYH 340
QN ERK + I N R+ S +P S+W + YL N LP+ IL P + L
Sbjct: 560 QNSVVERKHQHILNVARSLFFQSHIPISYWGDCILTAVYLINRLPAPILEDKCPFEVLTK 619
Query: 341 RDPSYTHLRVFGCLCYPLFPSTAINKLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISR 400
P+Y H++VFGCLCY +K R+ CAF+GYP +GYK DL + II+SR
Sbjct: 620 TVPTYDHIKVFGCLCYASTSPKDRHKFSPRAKACAFIGYPSGFKGYKLLDLETHSIIVSR 679
Query: 401 HVIFDETQFPFAKTHTSSTHTYEFLNDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTP 460
HV+F E FPF + S F P + P + +S +
Sbjct: 680 HVVFHEELFPFLGSDLSQEEQNFF----------------PDLNPTPPMQRQS----SDH 719
Query: 461 ASPINVTNQSILPPSPMSINQLPHPLVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHK- 519
+P + ++ + PS N +P P V T P + ++ + + H I K
Sbjct: 720 VNPSDSSSSVEILPSANPTNNVPEPSVQTS-HRKAKKPAYLQDYYCHSVVSSTPHEIRKF 778
Query: 520 --------PKIQFNLTTSITSSPLPHNPKAALSDSNWKAAMLDEFDALIKNKTWELVPRP 571
P + F T P + L W+ AM EFD L TWE+ P
Sbjct: 779 LSYDRINDPYLTFLACLDKTKEPSNYTEAEKL--QVWRDAMGAEFDFLEGTHTWEVCSLP 836
Query: 572 QNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDCDETFSPVVKPTTIRIVLT 631
+ I WIF+ K SDGS ERYKARLV G +Q G+D +ETFSPV K +++++L
Sbjct: 837 ADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNSVKLLLG 896
Query: 632 IALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFR----DPIHPDYVCLLKKSLYGLKQ 687
+A + QLD+ N FL+G+L E +YM P G+ D + P+ VC LKKSLYGLKQ
Sbjct: 897 VAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQGDSLPPNAVCRLKKSLYGLKQ 956
Query: 688 APRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLYVDDIILTASSDVLRRSIM 747
A R WY +F+ +GF S DH+ F+ +L+Y+DDII+ +++D +
Sbjct: 957 ASRQWYLKFSSTLLGLGFIQSYCDHTCFLKISDGIFLCVLVYIDDIIIASNNDAAVDILK 1016
Query: 748 SLLASEFAMKDLGTLSYF 765
S + S F ++DLG L YF
Sbjct: 1017 SQMKSFFKLRDLGELKYF 1034
>UniRef100_Q9SIM3 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1461
Score = 397 bits (1019), Expect = e-108
Identities = 238/756 (31%), Positives = 376/756 (49%), Gaps = 45/756 (5%)
Query: 15 SINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTI 74
SI + + +G + I G G I+ + L NVL P+ NLI + T D +
Sbjct: 465 SIVSFVNLPTGPNVRISGVGTVLINKD---IILQNVLFIPEFRLNLISISSLTTDLGTRV 521
Query: 75 EFDPFSFSVEDIQTGIPLMRCDSTGDLYPLTTTTSHQSTSPATFAALSPTLWHNRLGHPG 134
FDP ++D+ G+ L G+LY L T QS + + A + ++WH RLGHP
Sbjct: 522 IFDPSCCQIQDLTKGLTLGEGKRIGNLYVLDT----QSPAISVNAVVDVSVWHKRLGHPS 577
Query: 135 ANVLSFLNKNKFIECKQISSPSICQSCIYGKHVKLPFSISNSTTSKPFDIIHSDLWTS-P 193
+ L L++ + + C C K KL F +N+ + F+++H D+W
Sbjct: 578 FSRLDSLSEVLGTTRHKNKKSAYCHVCHLAKQKKLSFPSANNICNSTFELLHIDVWGPFS 637
Query: 194 ILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIKCFQCDNGT 253
+ + G+KY+L +DD++ W + + KS V ++F +F ++ Q+ T +K + DN
Sbjct: 638 VETVEGYKYFLTIVDDHSRATWIYLLKSKSDVLTVFPAFIDLVENQYDTRVKSVRSDNAK 697
Query: 254 EFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSMPPSFWHHA 313
E FT+F G+V SCP T QN ERK + I N R + S+M +W
Sbjct: 698 ELA---FTEFYKAKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYWGDC 754
Query: 314 LQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAINKLQARSTP 373
+ +L N PS +LS+ +P + L + P Y+ L+ FGCLCY S +K RS
Sbjct: 755 VLTAVFLINRTPSALLSNKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRA 814
Query: 374 CAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFLNDSLHPLL 433
C FLGYP +GYK DL S + ISR+V F E FP A + S+T +D P+
Sbjct: 815 CVFLGYPFGFKGYKLLDLESNVVHISRNVEFHEELFPLASSQQSATTA----SDVFTPMD 870
Query: 434 HYHLQNDPKQDEPEPRKIESPQPATTPASPINVTNQSILPPSPMSINQLPHPLVSTELTS 493
N P P+ SP ++ + P + V+ + +
Sbjct: 871 PLSSGNSITSHLPSPQ-----------ISPSTQISKRRITKFPAHLQDYHCYFVNKDDSH 919
Query: 494 PTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNLTTSITSSPLPHNPKAALSDSNWKAAML 553
P + Q P +H ++ +I+ P+P + A W A+
Sbjct: 920 PISSSLSYSQISP----SHMLY-----------INNISKIPIPQSYHEAKDSKEWCGAID 964
Query: 554 DEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGRSQ*VGVDC 613
E A+ + TWE+ P + W+F K +DGS ER+KAR+V G +Q G+D
Sbjct: 965 QEIGAMERTDTWEITSLPPGKKAVGCKWVFTVKFHADGSLERFKARIVAKGYTQKEGLDY 1024
Query: 614 DETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMGFRD----P 669
ETFSPV K T++++L ++ SK W ++QLD+ N FL+G+L+ET+YM P G+ D
Sbjct: 1025 TETFSPVAKMATVKLLLKVSASKKWYLNQLDISNAFLNGDLEETIYMKLPDGYADIKGTS 1084
Query: 670 IHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTYILLY 729
+ P+ VC LKKS+YGLKQA R W+ +F++ +GF DH+LF+ G++ +L+Y
Sbjct: 1085 LPPNVVCRLKKSIYGLKQASRQWFLKFSNSLLALGFEKQHGDHTLFVRCIGSEFIVLLVY 1144
Query: 730 VDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
VDDI++ ++++ +S+ L + F +++LG L YF
Sbjct: 1145 VDDIVIASTTEQAAQSLTEALKASFKLRELGPLKYF 1180
>UniRef100_Q7Y141 Putative polyprotein [Oryza sativa]
Length = 1335
Score = 389 bits (999), Expect = e-106
Identities = 238/760 (31%), Positives = 379/760 (49%), Gaps = 42/760 (5%)
Query: 15 SINKNIVVGSGQEIPIRGYGQTYISPPYPPLSLNNVLHAPKLIKNLIFVRRFTIDNNVTI 74
S + I +G+G G G + P + +VL P L +NL+ + + +++ +
Sbjct: 349 SYHAKIHMGNGSIAQSEGKGTVAVQTADGPKFIKDVLLVPDLKQNLLSIGQL-LEHGYAV 407
Query: 75 EFDPFSFSVEDIQTGIPLMRCDSTGD---LYPLTTTTSHQSTSPATFAALSPTLWHNRLG 131
F+ FS + D + + + + + L + TT S + LWH R+G
Sbjct: 408 YFEDFSCKILDRKNNRLVAKINMEKNRNFLLRMNHTTQMALRSEVDIS----DLWHKRMG 463
Query: 132 HPGANVLSFLNKNKFIECKQI----SSPSICQSCIYGKHVKLPFSISNS-TTSKPFDIIH 186
H L L ++ S P C+ C++GK ++ F S + S P +++H
Sbjct: 464 HLNYRALKLLRTKGMVQGLPFITLKSDP--CEGCVFGKQIRASFPHSGAWRASAPLELVH 521
Query: 187 SDLWTS-PILSSAGHKYYLFFLDDYTNFVWTFPIGRKSQVPSIFSSFHAFIKTQFGTTIK 245
+D+ P +S G+ Y++ F+DDYT +W + + KS IF F A ++ Q IK
Sbjct: 522 ADIVGKVPTISEGGNWYFITFIDDYTRMIWVYFLKEKSAALEIFKKFKAMVENQSNRKIK 581
Query: 246 CFQCDNGTEFNNEYFTQFCHKNGMVFRFSCPHTSPQNGKAERKIRAINNFIRTSLAHSSM 305
+ D G E+ ++ F ++C G+ + + +++ QNG AERK R IN+ + L M
Sbjct: 582 VLRSDQGREYISKEFEKYCENAGIRRQLTAGYSAQQNGVAERKNRTINDMANSMLQDKGM 641
Query: 306 PPSFWHHALQITTYLQNILPSKILSHHSPTQYLYHRDPSYTHLRVFGCLCYPLFPSTAIN 365
P SFW A+ Y+ N P+K +++ +P + Y + P H+RVFGC+CY P+
Sbjct: 642 PKSFWAEAVNTAVYILNRSPTKAVTNRTPFEAWYGKKPVIGHMRVFGCICYAQVPAQKRV 701
Query: 366 KLQARSTPCAFLGYPQNHRGYKCYDLSSKKIIISRHVIFDETQFPFAKTHTSSTHTYEFL 425
K +S C F+GY +GY+ Y+L KKIIISR IFDE S+T ++
Sbjct: 702 KFDNKSDRCIFVGYADGIKGYRLYNLEKKKIIISRDAIFDE----------SATWNWKSP 751
Query: 426 NDSLHPLLHYHLQNDPKQDEPEPRKIESPQPATTPASPINVTNQSILPPSPMSINQLPHP 485
S PLL + ++E P+ P+SP++ ++ S SP S Q+
Sbjct: 752 EASSTPLLPTTTITLGQPHMHGTHEVEDHTPSPQPSSPMSSSSASS-DSSPSSEEQI--- 807
Query: 486 LVSTELTSPTHTPQQIHQEPPRTIATHSMHGIHKPKIQFNLTTSITSSPLPHNPKAALSD 545
++P P+++ +T G + + S+ P + + A
Sbjct: 808 ------STPESAPRRVRSMVELLESTSQQRGSEQHEF---CNYSVVE---PQSFQEAEKH 855
Query: 546 SNWKAAMLDEFDALIKNKTWELVPRPQNVNFIRSMWIFRHKKKSDGSFERYKARLVGDGR 605
NW AM DE + KN TWELV RP++ I W+++ K DGS ++YKARLV G
Sbjct: 856 DNWIKAMEDEIHMIEKNNTWELVDRPRDREVIGVKWVYKTKLNPDGSVQKYKARLVAKGF 915
Query: 606 SQ*VGVDCDETFSPVVKPTTIRIVLTIALSKSWPIHQLDVKNVFLHGELQETVYMHQPMG 665
Q G+D ET++PV + TIR ++ +A K W I+QLDVK+ FL+G L E +Y+ QP G
Sbjct: 916 KQKPGIDYYETYAPVARLETIRTIIALAAQKRWKIYQLDVKSAFLNGYLDEEIYVEQPEG 975
Query: 666 FRDPIHPDYVCLLKKSLYGLKQAPRAWYQRFADFAFTIGFSHSKSDHSLFIYRKGNDMTY 725
F + V LKK+LYGLKQAPRAWY + + GF+ S S+ +L++ + G D+
Sbjct: 976 FSVQGGENKVFRLKKALYGLKQAPRAWYSQIDKYFIQKGFAKSISEPTLYVNKTGTDILI 1035
Query: 726 ILLYVDDIILTASSDVLRRSIMSLLASEFAMKDLGTLSYF 765
+ LYVDD+I T +S+ + + + + M DLG L YF
Sbjct: 1036 VSLYVDDLIYTGNSEKMMQDFKKDMMHTYEMSDLGLLHYF 1075
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.332 0.142 0.470
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,705,036,690
Number of Sequences: 2790947
Number of extensions: 73959978
Number of successful extensions: 241433
Number of sequences better than 10.0: 2137
Number of HSP's better than 10.0 without gapping: 1534
Number of HSP's successfully gapped in prelim test: 625
Number of HSP's that attempted gapping in prelim test: 235152
Number of HSP's gapped (non-prelim): 4021
length of query: 993
length of database: 848,049,833
effective HSP length: 137
effective length of query: 856
effective length of database: 465,690,094
effective search space: 398630720464
effective search space used: 398630720464
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 80 (35.4 bits)
Medicago: description of AC144645.3
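
The footer figures above are linked by simple arithmetic, and the following is a minimal sketch for re-checking them, not part of the BLAST output. It assumes the usual Karlin-Altschul conversions: drop-off values convert as Lambda * raw / ln 2, and score thresholds as (Lambda * raw - ln K) / ln 2. It further assumes that X1 and S1 use the ungapped Lambda/K pair while X2, X3 and S2 use the gapped pair, which is what reproduces the bit values printed here. The names `bits_dropoff` and `bits_score` are hypothetical helpers.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 993
EFFECTIVE_HSP_LEN = 137
EFFECTIVE_DB_LEN = 465_690_094
UNGAPPED_LAMBDA, UNGAPPED_K = 0.332, 0.142
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410


def bits_dropoff(raw, lam):
    """Convert a raw X drop-off value to bits (no K term)."""
    return lam * raw / math.log(2)


def bits_score(raw, lam, k):
    """Convert a raw score to a bit score using Lambda and K."""
    return (lam * raw - math.log(k)) / math.log(2)


# Effective lengths: the effective HSP length is subtracted from the query.
effective_query_len = QUERY_LEN - EFFECTIVE_HSP_LEN
search_space = effective_query_len * EFFECTIVE_DB_LEN

# Assumption: X1/S1 are ungapped parameters, X2/X3/S2 are gapped ones.
x1 = bits_dropoff(15, UNGAPPED_LAMBDA)
x2 = bits_dropoff(38, GAPPED_LAMBDA)
x3 = bits_dropoff(64, GAPPED_LAMBDA)
s1 = bits_score(40, UNGAPPED_LAMBDA, UNGAPPED_K)
s2 = bits_score(80, GAPPED_LAMBDA, GAPPED_K)

print("effective length of query:", effective_query_len)
print("effective search space:", search_space)
for name, value in [("X1", x1), ("X2", x2), ("X3", x3), ("S1", s1), ("S2", s2)]:
    print(f"{name}: {value:.1f} bits")
```

Because Lambda and K are printed to only three significant digits, per-alignment bit scores recomputed this way can differ from the reported ones in the last digit.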