
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC145449.7 + phase: 0 /pseudo
(965 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q6L3H0 Putative receptor kinase [Solanum demissum] 604 e-171
UniRef100_Q6L3Q0 Putative polyprotein [Solanum demissum] 522 e-146
UniRef100_Q7XE85 Putative pol polyprotein [Oryza sativa] 399 e-109
UniRef100_Q5XWK9 Gag-pol polyprotein-like [Solanum tuberosum] 399 e-109
UniRef100_Q9FWZ5 Putative retroelement polyprotein [Arabidopsis ... 389 e-106
UniRef100_Q710T7 Gag-pol polyprotein [Populus deltoides] 385 e-105
UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidop... 367 e-100
UniRef100_Q9ZQK0 Putative retroelement pol polyprotein [Arabidop... 365 3e-99
UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana] 364 8e-99
UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidop... 362 2e-98
UniRef100_Q9FIC5 Retroelement pol polyprotein-like [Arabidopsis ... 358 4e-97
UniRef100_Q94KV0 Polyprotein [Arabidopsis thaliana] 355 4e-96
UniRef100_Q9C692 Polyprotein, putative [Arabidopsis thaliana] 352 2e-95
UniRef100_O81617 F8M12.17 protein [Arabidopsis thaliana] 349 3e-94
UniRef100_Q9FXB7 Putative retroelement polyprotein [Arabidopsis ... 344 6e-93
UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana] 342 3e-92
UniRef100_Q9FLA4 Polyprotein [Arabidopsis thaliana] 342 3e-92
UniRef100_Q9SSB1 T18A20.5 protein [Arabidopsis thaliana] 338 6e-91
UniRef100_O23302 Retrovirus-related like polyprotein [Arabidopsi... 332 3e-89
UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana] 331 7e-89
>UniRef100_Q6L3H0 Putative receptor kinase [Solanum demissum]
Length = 1358
Score = 604 bits (1558), Expect = e-171
Identities = 319/599 (53%), Positives = 396/599 (65%), Gaps = 70/599 (11%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQSS-NICGVSASPDMIHRRLGHPSFDKLKVLVPQLSHL 59
D +TG+ IGT SQGLYYL S +S C ++ SPD+IH+RLGH S KL+ +VP LS L
Sbjct: 451 DRSTGQMIGTGHESQGLYYLTSSNSLAACSITDSPDLIHKRLGHSSLSKLQKMVPSLSSL 510
Query: 60 KSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFS 119
+LDCESCQLGKH RA+F S RS+S F +VHSD+WGPSRV STLG RY+V+FID +S
Sbjct: 511 STLDCESCQLGKHTRATFSRSTEGRSESIFSLVHSDIWGPSRVSSTLGFRYFVSFIDDYS 570
Query: 120 RCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIH 179
+CTW+ L+KDRS+LF F +F +EI+NQFG IR RSDNA EY + F FM GIIH
Sbjct: 571 KCTWVFLMKDRSELFSIFKSFFAEIQNQFGVSIRTFRSDNALEYLSSQFREFMTHQGIIH 630
Query: 180 QSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLDN 239
Q++CP+TPQQNGVAERK+ HL++T RTLL+ ++ P +FWGDA+LT+CYLINRMPSS + N
Sbjct: 631 QTTCPYTPQQNGVAERKNRHLIETARTLLLESNVPLRFWGDAVLTSCYLINRMPSSSIQN 690
Query: 240 EIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCYS 299
++P S+LFP+ LY + RV+GSTCFVH+L PG+DKL+ RA+KCVFLGYSR QKGYRCYS
Sbjct: 691 QVPHSILFPQSHLYPIPPRVFGSTCFVHNLAPGKDKLAPRALKCVFLGYSRVQKGYRCYS 750
Query: 300 PSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTPLFHPIFEPPVSTQSSPQL 359
R+ +SADVTFFE P++ T+S+ DV+ IP L P F T +SP +
Sbjct: 751 HDLHRYLMSADVTFFESQPYY----TSSNHPDVSMVLPIPQVLPVPTFVESTVTSTSPVV 806
Query: 360 QSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSDSATAPISSPVVVPPEPSN 419
P+ + P+ T DS AP +P P PS
Sbjct: 807 ----------------------VPPLLTYHRRPRP-TLVPDDSCHAPDPAPTADLPPPSQ 843
Query: 420 DLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQA 479
P+AL KG EALSH W+QA
Sbjct: 844 --PLALQKG----------------------------------------EALSHSGWRQA 861
Query: 480 MIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQ 539
M+DEM AL + TWELVS GKS VGCR V+ VK+G DGQVDRLKARLVA GYTQ++G
Sbjct: 862 MVDEMSALHKSGTWELVSLPAGKSTVGCRWVYAVKIGPDGQVDRLKARLVAKGYTQIFGL 921
Query: 540 DYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGFVAWG 598
DY+DTF+PVAK+ SVRLF++M A++ PL QLDIKNAFLHGDLEEE+YMEQP GFVA G
Sbjct: 922 DYSDTFAPVAKIASVRLFLSMAAVRHWPLHQLDIKNAFLHGDLEEEVYMEQPPGFVAQG 980
Score = 36.2 bits (82), Expect = 4.9
Identities = 14/22 (63%), Positives = 17/22 (76%)
Query: 944 SLRSPRITYICDKMDAYDMYAP 965
SL PRI YIC+K+ YD+YAP
Sbjct: 1336 SLTCPRINYICNKLGTYDLYAP 1357
>UniRef100_Q6L3Q0 Putative polyprotein [Solanum demissum]
Length = 1336
Score = 522 bits (1344), Expect = e-146
Identities = 284/605 (46%), Positives = 365/605 (59%), Gaps = 50/605 (8%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQS--SNICGVSASPDMIHRRLGHPSFDKLKVLVPQLSH 58
D T + IG +S GLY L + S C SP H RLGHPS LK L PQ +
Sbjct: 456 DLMTKQIIGKRHVSDGLYILDEWTPPSVACSSIVSPFEAHCRLGHPSLPVLKKLCPQFHN 515
Query: 59 LKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGF 118
+ S+DCESC KH R S NKR+ F++VHSDVWGP V+S +G RY+VTF+D F
Sbjct: 516 VPSIDCESCHFAKHHRISLSPRNNKRANFAFELVHSDVWGPCPVVSKVGFRYFVTFMDDF 575
Query: 119 SRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGII 178
SR TWI +K+RS++F F FC+EIK QF + ILRSDNA+E+ A F ++M GI+
Sbjct: 576 SRMTWIYFMKNRSEVFSHFSNFCAEIKTQFNASVHILRSDNAREFMSASFQNYMNQYGIL 635
Query: 179 HQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLD 238
HQSSC TP QNGVAERK+ HL++T R LL P +FW D + TA +LINRMPS+VL+
Sbjct: 636 HQSSCVDTPSQNGVAERKNRHLLETARVLLFQMKVPKQFWADTVSTASFLINRMPSTVLN 695
Query: 239 NEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCY 298
+IP +LFP PL+ ++ +V+GSTC+V D+ P KL +A+KCVFLGYSR QKGYRCY
Sbjct: 696 GDIPYGVLFPNKPLFPLEPKVFGSTCYVRDVRPHITKLDPKALKCVFLGYSRLQKGYRCY 755
Query: 299 SPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTPLFHPIFE-PPVSTQSSP 357
SP+ R+ +S DV F E FF+SP T + D + + I+ P ++
Sbjct: 756 SPTLNRYMVSIDVVFSESISFFSSPDTFPTQGQQEDEEWL-------IYRTTPSRSEQHK 808
Query: 358 QLQSNPEFRRYGNIYERRHVEAP--ETSPIDSSDSAPKTVTTDSSDSATAPISSPVVVPP 415
++ + E E +AP +T P + + VT D+ + T S P+ V P
Sbjct: 809 EVPGSVE-----QSMENVSSDAPLAQTKPPIVQVYSRRQVTNDTCPAPTLSSSDPLPVNP 863
Query: 416 EPSN--DLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSH 473
P+ D+PIAL K S+ +PKTV EAL+H
Sbjct: 864 SPTENLDIPIALRK-------------------------------DSIFVPKTVREALNH 892
Query: 474 QEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGY 533
W AM+DE+ AL+ NHTW LV GK VGC+ VF +KV DG + RLKARLVA GY
Sbjct: 893 PGWYDAMLDEIHALDDNHTWNLVDLPKGKKAVGCKWVFTIKVNPDGSMARLKARLVAKGY 952
Query: 534 TQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSG 593
Q YG DY+DTFSPVAK+TSVRLFI++ A + PL QL IKNAFLHGDL+EE+YMEQP G
Sbjct: 953 AQTYGVDYSDTFSPVAKLTSVRLFISLAASQNWPLHQLAIKNAFLHGDLQEEVYMEQPPG 1012
Query: 594 FVAWG 598
FVA G
Sbjct: 1013 FVAQG 1017
>UniRef100_Q7XE85 Putative pol polyprotein [Oryza sativa]
Length = 1688
Score = 399 bits (1025), Expect = e-109
Identities = 266/637 (41%), Positives = 344/637 (53%), Gaps = 47/637 (7%)
Query: 1 DFNTGKTIGT*---SISQGLYYLHSQS------------SNICGVSA-SPDMIHRRLGHP 44
D +TG IGT S GLY L S S S +C + S H RLGH
Sbjct: 273 DRHTGAVIGTGHRQKRSCGLYILDSLSLPSSSTNTPSVYSPMCSTACKSFPQWHHRLGHL 332
Query: 45 SFDKLKVLVPQLSHLKSLD------CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWG 98
+L L+ Q L S+ C+ C+LGK V+ +PSS + RS PFD+VHSDVWG
Sbjct: 333 CGSRLATLINQ-GVLGSVPVDTTFVCKGCKLGKQVQLPYPSSTS-RSSRPFDLVHSDVWG 390
Query: 99 PSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSD 158
S S G YYV F+D +SR TWI +K RSQL + +F I QF IRI RSD
Sbjct: 391 KSPFPSKGGHNYYVIFVDDYSRYTWIYFMKHRSQLISIYQSFAQMIHTQFSSAIRIFRSD 450
Query: 159 NAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFW 218
+ EY F F+ S G + Q SCP QNGVAERKH H+++T RTLLI + P FW
Sbjct: 451 SGGEYMSNAFREFLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFW 510
Query: 219 GDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSA 278
+AI TA YLIN PSS L P +LF P Y LRV+G TC+V R KL+A
Sbjct: 511 AEAISTAVYLINMQPSSSLQGRSPGEVLFGSPPRYD-HLRVFGCTCYVLLAPRERTKLTA 569
Query: 279 RAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVI 338
++V+CVFLGYS KGYRCY PS RR IS DVTF E+ PFF S T S+ + + S +
Sbjct: 570 QSVECVFLGYSLEHKGYRCYDPSARRIRISRDVTFDENKPFFYSSTNQPSSPENSISFLY 629
Query: 339 PTPLFHPIFEP--PVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSP-------IDSSD 389
P+ P P P++ SP S P Y +P SP I +S
Sbjct: 630 LPPIPSPESLPSSPITPSPSPIPPSVP-----SPTYVPPPPPSPSPSPVSPPPSHIPASS 684
Query: 390 SAPKTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRST--ANPHPVYNFLSYHRL 447
S P +T + D+ S +P E P L S ++P P YN + L
Sbjct: 685 SPPHVPSTITLDTFPFHYSRRPKIPNESQPSQP-TLEDPTCSVDDSSPAPRYNLRARDAL 743
Query: 448 -SPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSI-V 505
+P+ FV + V P T EA+ WK AM +E+ ALE +TW++V P P ++ +
Sbjct: 744 RAPNRDDFV--VGVVFEPSTYQEAIVLPHWKLAMSEELAALERTNTWDVV-PLPSHAVPI 800
Query: 506 GCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR 565
C+ V+ VK DGQV+R KARLVA G+ Q +G+DY++TF+PVA MT+VR IA+ A +
Sbjct: 801 TCKWVYKVKTKSDGQVERYKARLVARGFQQAHGRDYDETFAPVAHMTTVRTLIAVAATRS 860
Query: 566 *PLFQLDIKNAFLHGDLEEEIYMEQPSGFVAWGGVVW 602
+ Q+D+KNAFLHGDL EE+YM P G A G V+
Sbjct: 861 WTISQMDVKNAFLHGDLHEEVYMHPPPGVEAPPGHVF 897
>UniRef100_Q5XWK9 Gag-pol polyprotein-like [Solanum tuberosum]
Length = 1212
Score = 399 bits (1025), Expect = e-109
Identities = 228/575 (39%), Positives = 339/575 (58%), Gaps = 74/575 (12%)
Query: 31 SASPDMIHRRLGHPSFDKLKVLVPQLSH-----------LKSLDCESCQLGKHVRASFPS 79
++ ++ H+RLGHP+ V++ +S+ + S+DC +C+LGK FP+
Sbjct: 441 ASKTEVWHKRLGHPN----SVVLSHISNSGLLGNKNKFSVASIDCSTCKLGKSKTLPFPN 496
Query: 80 SPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLT 139
++ +K FD++HSDVWG S ++S +Y++TFID +SR TW+ L+ +S++F F T
Sbjct: 497 FGSRATKC-FDVIHSDVWGISPIISHAHFKYFMTFIDDYSRFTWVYFLRSKSEVFSMFKT 555
Query: 140 FCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCH 199
F + I+ QF I++LRSD+ EY F F+ GI+ Q SCP+TPQQNGVAERK+ H
Sbjct: 556 FLAYIETQFSTCIKLLRSDSGGEYMSYEFKKFLLDKGIVSQHSCPYTPQQNGVAERKNRH 615
Query: 200 LVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRV 259
L+D TRTLLI + P K+W +A+ TA YLINR+PS VL+ E P L+ ++P Y
Sbjct: 616 LLDVTRTLLIESSVPSKYWVEALSTAVYLINRLPSKVLNLESPYFRLYHQNPNYS-DFHT 674
Query: 260 YGSTCFVHDLTPGR-DKLSARAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTP 318
+G CFVH L P + +KLS ++ KC F+GYS +QKG+ CY P + +F IS +V FFE+
Sbjct: 675 FGCVCFVH-LPPSQCNKLSVQSTKCAFMGYSTSQKGFICYDPCSHKFRISRNVVFFENQY 733
Query: 319 FFASPTTTSSTTDVTDSQVIPTPLFHPIFEPPVSTQSSPQLQSNPEFRRY--GNIYERRH 376
FF + SS +PL P FE S+ F+R+ G +YERR
Sbjct: 734 FFPTIVDLSSV----------SPLL-PTFEDLSSS-----------FKRFKPGFVYERRR 771
Query: 377 VEAPETSPIDSSDSAPKTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRSTANPH 436
P + ++AP+ + +SS S P EP+ +RST
Sbjct: 772 PTLPYPNTDPPPETAPQLESENSSRSG----------PLEPT----------RRSTRVSR 811
Query: 437 PVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELV 496
+P+++ F S LS++S+P +A H+ W++AM +E++AL+ N TW++V
Sbjct: 812 -----------TPNWYGFSSTLSNISVPSCYSQASKHECWQKAMEEELLALKENDTWDIV 860
Query: 497 SPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRL 556
S +GC+ V+++K+ DG +DR KARLV +G Q YG DY +TF+PVAKMT+VR
Sbjct: 861 SCPSNVRPIGCKWVYSIKLHSDGTLDRYKARLVVLGNRQEYGVDYEETFAPVAKMTTVRT 920
Query: 557 FIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQP 591
IA+ A + L+Q D+KNAFLHGDL+E+IYM+ P
Sbjct: 921 IIAIAASQNWSLYQKDVKNAFLHGDLKEDIYMKPP 955
>UniRef100_Q9FWZ5 Putative retroelement polyprotein [Arabidopsis thaliana]
Length = 1404
Score = 389 bits (1000), Expect = e-106
Identities = 237/616 (38%), Positives = 332/616 (53%), Gaps = 33/616 (5%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPDMI--------HRRLGHPSFDKLKVL 52
D TGK IG LY L S N +S + H RLGHP LK++
Sbjct: 416 DIETGKVIGEGGSKGELYVLEDLSPNSSSCFSSKSHLGISFNTLWHARLGHPHTRALKLM 475
Query: 53 VPQLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYV 112
+P +S CE+C LGKH ++ FP S K FD+VHSDVW S +S +Y+V
Sbjct: 476 LPNIS-FDHTSCEACILGKHCKSVFPKSLTIYEKC-FDLVHSDVW-TSPCVSRDNNKYFV 532
Query: 113 TFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFM 172
TFI+ S+ TWI LL + ++F AF F + + NQF I++ R+DN EY F +
Sbjct: 533 TFINEKSKYTWITLLPSKDRVFEAFTNFETYVTNQFNAKIKVFRTDNGGEYTSQKFRDHL 592
Query: 173 ASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRM 232
A GIIHQ+SCP+TPQQNGVAERK+ HL++ R+++ + P +FWGDA+LTACYLINR
Sbjct: 593 AKRGIIHQTSCPYTPQQNGVAERKNRHLMEVARSMMFHTSVPKRFWGDAVLTACYLINRT 652
Query: 233 PSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPG--RDKLSARAVKCVFLGYSR 290
P+ VL + P +L P + LRV+G CFV L PG R KL A++ KC+FLGYS
Sbjct: 653 PTKVLSDLSPFEVLNNTKP-FIDHLRVFGCVCFV--LIPGEQRSKLDAKSTKCMFLGYST 709
Query: 291 TQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDS-----QVIPTPLFHP 345
TQKGY+C+ P+ R +IS DV F E+ + + + D+T S + + L H
Sbjct: 710 TQKGYKCFDPTKNRTFISRDVKFLENQD-YNNKKDWENLKDLTHSTSDRVETLKFLLDHL 768
Query: 346 IFEPPVSTQSSPQLQS-----NPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSS 400
+ +TQ P++ N E ++ E P S D S
Sbjct: 769 GNDSTSTTQHQPEMTQDQEDLNQENEEVSLQHQENLTHVQEDPPNTQEHSEHVQEIQDDS 828
Query: 401 DSATAPISSPVVVPPEPSNDLPIALHKGK---RSTANPHPVYNFLSYHRLSPSYFAFVSA 457
P V+PP P + + K S A HP S + + AF+S
Sbjct: 829 SEDEEPTQ---VLPPPPPLRRSTRIRRKKEFFNSNAVAHPFQATCSLALVPLDHQAFLSK 885
Query: 458 LSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGL 517
+S IP+T EA+ +EW+ A+ DE+ A++ NHTW+ GK V R VF +K
Sbjct: 886 ISEHWIPQTYEEAMEVKEWRDAIADEINAMKRNHTWDEDDLPKGKKTVSSRWVFTIKYKS 945
Query: 518 DGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAF 577
+G ++R K RLVA G+TQ YG DY +TF+PVAK+ +VR+ +A+ L+Q+D+KNAF
Sbjct: 946 NGDIERYKTRLVARGFTQTYGSDYMETFAPVAKLHTVRVVLALATNLSWGLWQMDVKNAF 1005
Query: 578 LHGDLEEEIYMEQPSG 593
L G+LE+++YM P G
Sbjct: 1006 LQGELEDDVYMTPPPG 1021
>UniRef100_Q710T7 Gag-pol polyprotein [Populus deltoides]
Length = 1382
Score = 385 bits (988), Expect = e-105
Identities = 240/620 (38%), Positives = 333/620 (53%), Gaps = 57/620 (9%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPDMI--------------HRRLGHPSF 46
D + K IGT GLY L + + + D+ H RLGH S
Sbjct: 431 DLQSQKLIGTGRRENGLYILDELKVPVVVAATTVDLSFFRLSLSSSSFYLWHSRLGHVSS 490
Query: 47 DKLKVLVPQ--LSHLKSLD---CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSR 101
+L+ L L +LK+ D C C+L K F S + S SPFD++HSDVWGPS
Sbjct: 491 SRLRFLASTGALGNLKTCDISDCSGCKLAKFSALPFNRSTSV-SSSPFDLIHSDVWGPSP 549
Query: 102 VMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAK 161
V + G RYYV+FID +R W+ L+K RS+ F + F + IK Q I+ R D
Sbjct: 550 VSTKGGSRYYVSFIDDHTRYCWVYLMKHRSEFFEIYAAFRALIKTQHSAVIKCFRCDLGG 609
Query: 162 EYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDA 221
EY F +A G IHQ+SC TP+QNGVAERKH H+V+T R+LL++A +FWG+A
Sbjct: 610 EYTSNKFCQMLALDGTIHQTSCTDTPEQNGVAERKHRHIVETARSLLLSAFVLSEFWGEA 669
Query: 222 ILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAV 281
+LTA LIN +PSS P L+ P Y RV+G T FV R+KLS+R+
Sbjct: 670 VLTAVSLINTIPSSHSSGLSPFEKLYGHVPDYS-SFRVFGCTYFVLHPHVERNKLSSRSA 728
Query: 282 KCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTP 341
CVFLGY +KGYRC+ P T++ Y+S V F E PFF+ P+TT S T + P
Sbjct: 729 ICVFLGYGEGKKGYRCFDPITQKLYVSHHVVFLEHIPFFSIPSTTHSLTKSDLIHIDP-- 786
Query: 342 LFHPIFEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSD 401
F +SP ++S G T + S T ++S
Sbjct: 787 -----FSEDSGNDTSPYVRSICTHNSAG------------TGTLLSG-------TPEASF 822
Query: 402 SATAPISSPVVVPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSV 461
S+TAP +S +V P P + I ++ST P +Y S S+ +F++ + +
Sbjct: 823 SSTAPQASSEIVDPPPRQSIRI-----RKSTKLPD-----FAYSCYSSSFTSFLAYIHCL 872
Query: 462 SIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQV 521
P + EA+ +QAM +E+ AL TW+LV PGKS+VGCR V+ +K DG +
Sbjct: 873 FEPSSYKEAILDPLGQQAMDEELSALHKTDTWDLVPLPPGKSVVGCRWVYKIKTNSDGSI 932
Query: 522 DRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGD 581
+R KARLVA GY+Q YG DY +TF+P+AKMT++R IA+ ++++ + QLD+KNAFL+GD
Sbjct: 933 ERYKARLVAKGYSQQYGMDYEETFAPIAKMTTIRTLIAVASIRQWHISQLDVKNAFLNGD 992
Query: 582 LEEEIYMEQPSGFVAWGGVV 601
L+EE+YM P G G V
Sbjct: 993 LQEEVYMAPPPGISHDSGYV 1012
>UniRef100_Q9ZPU4 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1501
Score = 367 bits (942), Expect = e-100
Identities = 229/653 (35%), Positives = 336/653 (51%), Gaps = 80/653 (12%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQSS---NICGVSASPDMIHRRLGHPSFDKLKVLV---P 54
D ++ IG+ G+YYL + + V + + H+RLGHPSF L L
Sbjct: 491 DRSSKTLIGSGEERGGVYYLTDVTPAKIHTANVDSDQALWHQRLGHPSFSVLSSLPLFSK 550
Query: 55 QLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTF 114
S + S C+ C K R FP S NK + F ++H DVWGP RV ++ G Y++T
Sbjct: 551 TSSTVTSHSCDVCFRAKQTREVFPESINKTEEC-FSLIHCDVWGPYRVPASCGAVYFLTI 609
Query: 115 IDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMAS 174
+D +SR W LL ++S++ F + QFGK ++++RSDN E F +S+
Sbjct: 610 VDDYSRAVWTYLLLEKSEVRQVLTNFLKYAEKQFGKTVKMVRSDNGTE--FMCLSSYFRE 667
Query: 175 LGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPS 234
GIIHQ+SC TPQQNG ERKH H+++ R LL A P KFWG++ILTA YLINR PS
Sbjct: 668 NGIIHQTSCVGTPQQNGRVERKHRHILNVARALLFQASLPIKFWGESILTAAYLINRTPS 727
Query: 235 SVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKG 294
S+L P +L P+Y QLRV+GS C+VH +T +DK R+ C+F+GY +KG
Sbjct: 728 SILSGRTPYEVLHGSKPVYS-QLRVFGSACYVHRVTRDKDKFGQRSRSCIFVGYPFGKKG 786
Query: 295 YRCYSPSTRRFYISADVTFFEDT-PFFASPTTTSSTT----------------------D 331
++ Y F +S DV F E+ P+ ++T ++T D
Sbjct: 787 WKVYDIERNEFLVSRDVIFREEVFPYAGVNSSTLASTSLPTVSEDDDWAIPPLEVRGSID 846
Query: 332 VTDSQVIPTPLFHPIFEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDS- 390
+++ + + + VS P + P+ + P +SP+ S S
Sbjct: 847 SVETERVVCTTDEVVLDTSVSDSEIPNQEFVPD-------------DTPPSSPLSVSPSG 893
Query: 391 APKTVTTDSSDSATAPISSPVVV-------------PPEPSNDL--------PIALHK-- 427
+P T TT P++SP+ V PP ND P ++H
Sbjct: 894 SPNTPTTP----IVVPVASPIPVSPPKQRKSKRATHPPPKLNDYVLYNAMYTPSSIHALP 949
Query: 428 --GKRSTANP----HPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMI 481
+S+ P P+ +++S S S+ A+++A++ PK EA+ + W AM
Sbjct: 950 ADPSQSSTVPGKSLFPLTDYVSDAAFSSSHRAYLAAITDNVEPKHFKEAVQIKVWNDAMF 1009
Query: 482 DEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDY 541
E+ ALE N TW++V PGK +G + VF K DG V+R KARLV G QV G+DY
Sbjct: 1010 TEVDALEINKTWDIVDLPPGKVAIGSQWVFKTKYNSDGTVERYKARLVVQGNKQVEGEDY 1069
Query: 542 NDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGF 594
+TF+PV +MT+VR + A + ++Q+D+ NAFLHGDLEEE+YM+ P GF
Sbjct: 1070 KETFAPVVRMTTVRTLLRNVAANQWEVYQMDVHNAFLHGDLEEEVYMKLPPGF 1122
>UniRef100_Q9ZQK0 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1664
Score = 365 bits (938), Expect = 3e-99
Identities = 231/636 (36%), Positives = 335/636 (52%), Gaps = 57/636 (8%)
Query: 1 DFNTGKTIGT*SISQGLYYLH---------SQSSNICGVSASPDMIHRRLGHPSFDKLKV 51
D T + +G GLY L S S+I G +A+ + H RLGHP LK+
Sbjct: 400 DIETSRVLGQGVTKDGLYVLEDTKPSVPLSSHFSSILG-NANSESWHARLGHPHSRALKL 458
Query: 52 LVPQLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYY 111
L+P S K+ +CE+C LGKH ++ FP S K FD++HSDVW S +S +Y+
Sbjct: 459 LLPSTS-FKNDECEACILGKHCKSVFPKSSTIYEKC-FDLIHSDVW-TSPCLSRENHKYF 515
Query: 112 VTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSF 171
VTFID S+ TW LL + ++ AF F + + N + I+ILRSDN EY F
Sbjct: 516 VTFIDEKSKFTWFTLLPSKDRVLEAFTNFQTYVTNHYDAKIKILRSDNRGEYTSHAFKQH 575
Query: 172 MASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINR 231
+ GIIHQ+SCP+TPQQNGVAERK+ HL++ R ++ + + P FW D +++ACYLIN+
Sbjct: 576 LNKHGIIHQTSCPYTPQQNGVAERKNRHLMEVRRVMMFHTNVPKHFWIDGVVSACYLINQ 635
Query: 232 MPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRT 291
P+ +L + P +L P LRV+G CFV R+KL ++ K +F+GYS
Sbjct: 636 TPTKILLDSSPFEVLNKVKPFIN-HLRVFGCVCFVLISGEQRNKLQPKSTKGMFIGYSIN 694
Query: 292 QKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTPLFHPIFE--- 348
QKGY+CY TR+ IS DV F E ++ D+TDS I E
Sbjct: 695 QKGYKCYVLETRKVLISRDVKFLESKSYY-DKKNWEDIQDLTDSPSDRATNLRIILERLG 753
Query: 349 -PPVSTQSSPQLQSNPEF------------------RRYGNIYERRHVEAPETSPIDSSD 389
+ TQ++P+ SNPE + G E +E E+S + D
Sbjct: 754 VSNIQTQTTPRT-SNPETITQPENMEEEEEEEEEEEEKQGKEQELITLEETESSKVQEKD 812
Query: 390 SA---PKTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRSTAN---------PHP 437
++ T++ + + P + P S L K KR N HP
Sbjct: 813 TSLLNDDNGHTNNQEEDSNSREEPRI--PRRSEHL-----KDKRVYYNNQVYFDNVVEHP 865
Query: 438 VYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVS 497
+ + L + F + IP+T EA++HQ W+ A+ E A+E+NHTW+
Sbjct: 866 IQVVCTLAHLPEEHQVFFGKVDQHWIPQTYEEAITHQVWRDAIAAEKQAMENNHTWDEDE 925
Query: 498 PSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLF 557
GK +V + VF +K DG+++R KARLVA G+TQ YG+DY DTF+PVAK+ +VR+
Sbjct: 926 LPRGKKVVTSKWVFAIKYKSDGEIERYKARLVARGFTQTYGEDYLDTFAPVAKLHTVRVV 985
Query: 558 IAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSG 593
+++T L+Q+D+KNAFL G+LEE++YM+ P G
Sbjct: 986 LSLTTNLEWDLWQMDVKNAFLQGELEEKVYMKPPPG 1021
>UniRef100_O04543 F20P5.25 protein [Arabidopsis thaliana]
Length = 1315
Score = 364 bits (934), Expect = 8e-99
Identities = 220/585 (37%), Positives = 301/585 (50%), Gaps = 69/585 (11%)
Query: 33 SPDMIHRRLGHPSFDKLKVLVPQLSHLKSLD-----CESCQLGKHVRASFPSSPNKRSKS 87
S D+ H+RLGHPS KL+ + LS K + C C + K F S NK S+
Sbjct: 403 SHDLWHKRLGHPSVQKLQPMSSLLSFPKQKNNTDFHCRVCHISKQKHLPFVSHNNKSSR- 461
Query: 88 PFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQ 147
PFD++H D WGP V + G RY++T +D +SR TW+ LL+++S + TF + ++NQ
Sbjct: 462 PFDLIHIDTWGPFSVQTHDGYRYFLTIVDDYSRATWVYLLRNKSDVLTVIPTFVTMVENQ 521
Query: 148 FGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTL 207
F I+ +RSDNA E F F S GI+ SCP TPQQN V ERKH H+++ R+L
Sbjct: 522 FETTIKGVRSDNAPEL---NFTQFYHSKGIVPYHSCPETPQQNSVVERKHQHILNVARSL 578
Query: 208 LINAHAPFKFWGDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVH 267
+H P +WGD ILTA YLINR+P+ +L+++ P +L P Y ++V+G C+
Sbjct: 579 FFQSHIPISYWGDCILTAVYLINRLPAPILEDKCPFEVLTKTVPTYD-HIKVFGCLCYAS 637
Query: 268 DLTPGRDKLSARAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDT-PFFASPTTT 326
R K S RA C F+GY KGY+ T +S V F E+ PF S
Sbjct: 638 TSPKDRHKFSPRAKACAFIGYPSGFKGYKLLDLETHSIIVSRHVVFHEELFPFLGS---- 693
Query: 327 SSTTDVTDSQVIPTPLFHPIFEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPID 386
D++ + P +P PP+ QSS + +P D
Sbjct: 694 ----DLSQEEQNFFPDLNP--TPPMQRQSSDHV-----------------------NPSD 724
Query: 387 SSDSAPKTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRS------------TAN 434
SS S P ++P PEPS + + K K+ ++
Sbjct: 725 SSSSV-----------EILPSANPTNNVPEPS--VQTSHRKAKKPAYLQDYYCHSVVSST 771
Query: 435 PHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWE 494
PH + FLSY R++ Y F++ L P EA Q W+ AM E LE HTWE
Sbjct: 772 PHEIRKFLSYDRINDPYLTFLACLDKTKEPSNYTEAEKLQVWRDAMGAEFDFLEGTHTWE 831
Query: 495 LVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSV 554
+ S K +GCR +F +K DG V+R KARLVA GYTQ G DYN+TFSPVAK+ SV
Sbjct: 832 VCSLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGIDYNETFSPVAKLNSV 891
Query: 555 RLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGFVAWGG 599
+L + + A + L QLDI NAFL+GDL+EEIYM P G+ + G
Sbjct: 892 KLLLGVAARFKLSLTQLDISNAFLNGDLDEEIYMRLPQGYASRQG 936
>UniRef100_Q9XII7 Putative retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1454
Score = 362 bits (930), Expect = 2e-98
Identities = 224/613 (36%), Positives = 324/613 (52%), Gaps = 60/613 (9%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPD--MIHRRLGHPSFDKLKVLVPQLSH 58
D G+ +G LY L +I V+A D M HRRLGH S +L + L
Sbjct: 521 DLIKGRMLGQGRRVANLYLLDVGDQSI-SVNAVVDISMWHRRLGHASLQRLDAISDSLGT 579
Query: 59 LKSLD-----CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVT 113
+ + C C L K + SFP+S NK K FD++H DVWGP V + G +Y++T
Sbjct: 580 TRHKNKGSDFCHVCHLAKQRKLSFPTS-NKVCKEIFDLLHIDVWGPFSVETVEGYKYFLT 638
Query: 114 FIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMA 173
+D SR TW+ LLK +S++ F F +++NQ+ ++ +RSDNA E F SF A
Sbjct: 639 IVDDHSRATWMYLLKTKSEVLTVFPAFIQQVENQYKVKVKAVRSDNAPEL---KFTSFYA 695
Query: 174 SLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMP 233
GI+ SCP TP+QN V ERKH H+++ R L+ + P WGD +LTA +LINR P
Sbjct: 696 EKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSQVPLSLWGDCVLTAVFLINRTP 755
Query: 234 SSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQK 293
S +L N+ P +L P+Y QLR +G C+ R K R+ C+FLGY K
Sbjct: 756 SQLLMNKTPYEILTGTAPVYE-QLRTFGCLCYSSTSPKQRHKFQPRSRACLFLGYPSGYK 814
Query: 294 GYRCYSPSTRRFYISADVTFFEDT-PFFASPTTTSSTTDVTDSQVIPTPLFHPIFEPPVS 352
GY+ + +IS +V F E+ P +P + SS LF P+ PVS
Sbjct: 815 GYKLMDLESNTVFISRNVQFHEEVFPLAKNPGSESS-----------LKLFTPMV--PVS 861
Query: 353 TQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSDSATAPISSPVV 412
+ G I + H +P + P SD P+ S V
Sbjct: 862 S---------------GIISDTTH--SPSSLPSQISDLPPQI------------SSQRVR 892
Query: 413 VPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALS 472
PP ND H + + +P+ + +SY ++SPS+ +++ ++ + IP EA
Sbjct: 893 KPPAHLND----YHCNTMQSDHKYPISSTISYSKISPSHMCYINNITKIPIPTNYAEAQD 948
Query: 473 HQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIG 532
+EW +A+ E+ A+E +TWE+ + GK VGC+ VF +K DG ++R KARLVA G
Sbjct: 949 TKEWCEAVDAEIGAMEKTNTWEITTLPKGKKAVGCKWVFTLKFLADGNLERYKARLVAKG 1008
Query: 533 YTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPS 592
YTQ G DY DTFSPVAKMT+++L + ++A K+ L QLD+ NAFL+G+LEEEI+M+ P
Sbjct: 1009 YTQKEGLDYTDTFSPVAKMTTIKLLLKVSASKKWFLKQLDVSNAFLNGELEEEIFMKIPE 1068
Query: 593 GFVAWGGVVWYAN 605
G+ G+V +N
Sbjct: 1069 GYAERKGIVLPSN 1081
>UniRef100_Q9FIC5 Retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1462
Score = 358 bits (919), Expect = 4e-97
Identities = 234/641 (36%), Positives = 336/641 (51%), Gaps = 65/641 (10%)
Query: 8 IGT*SISQGLYYLHSQSS--NICGVSASPDMIHRRLGHPSFDKLKVLVPQLSHLKSLD-- 63
IG GLY+ + ++ + +S + H RLGHPS LK+L S + D
Sbjct: 440 IGAGKQQNGLYFFRGTETVASMTRMDSSSQLWHCRLGHPSSKVLKLLSFSDSTGHAFDSK 499
Query: 64 -CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCT 122
CE C K R FP S NK S SPF++VH D+WGP R S G Y++T +D ++R
Sbjct: 500 TCEICIKAKQTRDPFPLSNNKTS-SPFEMVHCDLWGPYRTTSICGSNYFLTLVDNYTRAV 558
Query: 123 WIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSS 182
W+ LL + F S ++ QF I+ +RSDN E F +SF GIIH++S
Sbjct: 559 WLYLLPSKQTAPMHLKNFISLVERQFSTKIKTIRSDNGTE--FVCLSSFFVDHGIIHETS 616
Query: 183 CPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLDNEIP 242
C TPQQNG ERKH H+++ R L A P +FW LTA YLINR P+ +L + P
Sbjct: 617 CVGTPQQNGRVERKHRHILNVARALRFQARLPIEFWSYCALTAAYLINRTPTPLLQGKTP 676
Query: 243 QSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCYSPST 302
LL+ + P +RV+G C+VH+ G DK +R+ K +FLGY +KG+R Y+ T
Sbjct: 677 FELLYNRPPPVN-HIRVFGCICYVHNQKHGGDKFESRSNKSIFLGYPFAKKGWRVYNFET 735
Query: 303 RRFYISADVTFFE-DTPFFAS-----PTTTSSTTDVTDSQVIPTPLFHPIFEPPVS---- 352
+S DV F E + PF AS P + S ++ S +P+ L P PVS
Sbjct: 736 GVISVSRDVVFRETEFPFPASVFDSTPDSQLSPSNADQSFFLPSELQAPT---PVSITTT 792
Query: 353 ---TQSSPQLQSNPEFRRY--------GNIYERRHVEAP---ETSPIDSSDS-----APK 393
TQSS N + R + + + +P E+SP S S +P
Sbjct: 793 LELTQSSSSTNLNDDNFRIPSDESSSVNEMSDNEDLNSPTTNESSPFLSPASPSLPLSPA 852
Query: 394 TVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRS--------------------TA 433
+++ S +A +P S P + PEP +L L KGKR +
Sbjct: 853 SLSLPLSPAAPSP-SLPKIAEPEPEPEL---LGKGKRKKTQPVRLADYATTLLHQPHPSV 908
Query: 434 NPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTW 493
P+P+ N++S + S +Y A+V A+S PK+ EA+ + W+ A+ DE+V+LE+ TW
Sbjct: 909 TPYPLDNYVSSSQFSAAYQAYVFAISLGIEPKSYKEAILDENWRCAVSDEIVSLENLGTW 968
Query: 494 ELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTS 553
+ PGK +GC+ VF +K DG ++R KARLV +G Q G DY++TF+PVAKM +
Sbjct: 969 TVEDLPPGKKALGCKWVFRLKYKSDGTLERHKARLVVLGNKQTEGIDYSETFAPVAKMVT 1028
Query: 554 VRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGF 594
VR F+ A + Q+D+ NAFLHGDL+EE+Y++ P GF
Sbjct: 1029 VRAFLQQVASLDWEVHQMDVHNAFLHGDLDEEVYIKFPPGF 1069
>UniRef100_Q94KV0 Polyprotein [Arabidopsis thaliana]
Length = 1453
Score = 355 bits (911), Expect = 4e-96
Identities = 228/623 (36%), Positives = 321/623 (50%), Gaps = 52/623 (8%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQ------SSNICGVSASPDMIHRRLGHPSFDKLKVLVP 54
D NT K + S GLY L +Q S+ C +AS ++ H RLGH + L+ L
Sbjct: 419 DINTQKVVSKGPRSNGLYVLENQEFVAFYSNRQC--AASEEIWHHRLGHSNSRILQQLKS 476
Query: 55 --QLSHLKSLD---CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*R 109
++S KS CE CQ+GK + F SS N R +H D+WGPS V+S G +
Sbjct: 477 SKEISFNKSRMSPVCEPCQMGKSSKLQFFSS-NSRELDLLGRIHCDLWGPSPVVSKQGFK 535
Query: 110 YYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFN 169
YYV F+D +SR +W LK +S F F+ F + ++NQF I++ +SD E+
Sbjct: 536 YYVVFVDDYSRYSWFYPLKAKSDFFAVFVAFQNLVENQFNTKIKVFQSDGGGEFTSNLMK 595
Query: 170 SFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLI 229
+ GI H+ SCP+TPQQNG+AERKH H V+ +++ ++H P +FW +A TA +L
Sbjct: 596 KHLTDCGIQHRISCPYTPQQNGIAERKHRHFVELGLSMMFHSHTPLQFWVEAFFTASFLS 655
Query: 230 NRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYS 289
N +PS L N P L + P Y LRV+G+ C+ G K R+++CVFLGY+
Sbjct: 656 NMLPSPSLGNVSPLEALLKQKPNY-AMLRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYN 714
Query: 290 RTQKGYRCYSPSTRRFYISADVTFFEDT-PF-----FASPTTTSSTTDV-------TDSQ 336
KGYRC P T R YIS V F E+T PF F P SS D
Sbjct: 715 SQYKGYRCLYPPTGRVYISRHVIFDEETFPFKQKYQFLVPQYESSLLSAWQSSIPQADQS 774
Query: 337 VIPTP---LFHPIFEPP-VSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAP 392
+IP + +PP + + + P G + E ++ E + +S +
Sbjct: 775 LIPQAEEGKIESLAKPPSIQKNTIQDTTTQPAILTEGVLNEEEEEDSFEETETESLNEET 834
Query: 393 KTVTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYF 452
T +D A + V EP N P+ RS A H S + +
Sbjct: 835 HT----QNDEAEVTVEEE--VQQEPENTHPMT----TRSKAGIHK----------SNTRY 874
Query: 453 AFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFN 512
A +++ SV PK++ EAL+H W A+ DEM + HTW LV P+ +I+GCR VF
Sbjct: 875 ALLTSKFSVEEPKSIDEALNHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMNILGCRWVFK 934
Query: 513 VKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLD 572
K+ DG VD+LKARLVA G+ Q G DY +TFSPV + ++RL + + K + QLD
Sbjct: 935 TKLKPDGSVDKLKARLVAKGFHQEEGLDYLETFSPVVRTATIRLVLDVATAKGWNIKQLD 994
Query: 573 IKNAFLHGDLEEEIYMEQPSGFV 595
+ NAFLHG+L+E +YM QP GFV
Sbjct: 995 VSNAFLHGELKEPVYMLQPPGFV 1017
>UniRef100_Q9C692 Polyprotein, putative [Arabidopsis thaliana]
Length = 1468
Score = 352 bits (904), Expect = 2e-95
Identities = 203/590 (34%), Positives = 322/590 (54%), Gaps = 34/590 (5%)
Query: 30 VSASPDMIHRRLGHPSFDKLKVLVPQ--LSHLKSL---DCESCQLGKHVRASFPSSPNKR 84
V A D+ HRRLGH S DK+ L+P+ LS K + C++C K R +FP S N R
Sbjct: 509 VKAPFDLWHRRLGHAS-DKIVNLLPRELLSSGKEILENVCDTCMRAKQTRDTFPLSDN-R 566
Query: 85 SKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEI 144
S F ++H DVWGP R S G RY++T +D +SR W+ L+ D+S+ F + +
Sbjct: 567 SMDSFQLIHCDVWGPYRAPSYSGARYFLTIVDDYSRGVWVYLMTDKSETQKHLKDFIALV 626
Query: 145 KNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTT 204
+ QF I+I+RSDN E F + GI H++SC TP QNG ERKH H+++
Sbjct: 627 ERQFDTEIKIVRSDNGTE--FLCMREYFLHKGIAHETSCVGTPHQNGRVERKHRHILNIA 684
Query: 205 RTLLINAHAPFKFWGDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTC 264
R L ++ P +FWG+ IL+A YLINR PS +L + P +L+ P Y LRV+GS C
Sbjct: 685 RALRFQSYLPIQFWGECILSAAYLINRTPSMLLQGKSPYEMLYKTAPKYS-HLRVFGSLC 743
Query: 265 FVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPT 324
+ H+ DK +AR+ +CVF+GY QKG+R + ++F++S DV F++T F S
Sbjct: 744 YAHNQNHKGDKFAARSRRCVFVGYPHGQKGWRLFDLEEQKFFVSRDV-IFQETEFPYSKM 802
Query: 325 TTSSTTDVTDSQVIPTPLFHPIFEP--------------------PVSTQSSPQLQSNPE 364
+ + + + P P P+ + + + S E
Sbjct: 803 SCNEEDERVLVDCVGPPFIEEAIGPRTIIGRNIGEATVGPNVATGPIIPEINQESSSPSE 862
Query: 365 FRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSDSATAPISSPVVVPPEPSNDLPIA 424
F ++ +T+ + S + P + S T P+ + +N + +
Sbjct: 863 FVSLSSLDPFLASSTVQTADLPLSSTTPAPIQLRRSSRQT---QKPMKLKNFVTNTVSVE 919
Query: 425 LHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEM 484
+ S+++ +P+ ++ HR + S+ AF++A+++ P T +EA+ + W++AM E+
Sbjct: 920 SISPEASSSSLYPIEKYVDCHRFTSSHKAFLAAVTAGMEPTTYNEAMVDKAWREAMSAEI 979
Query: 485 VALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDT 544
+L N T+ +V+ PGK +G + V+ +K DG ++R KARLV +G Q G DY++T
Sbjct: 980 ESLRVNQTFSIVNLPPGKRALGNKWVYKIKYRSDGAIERYKARLVVLGNCQKEGVDYDET 1039
Query: 545 FSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGF 594
F+PVAKM++VRLF+ + A + + Q+D+ NAFLHGDL+EE+YM+ P GF
Sbjct: 1040 FAPVAKMSTVRLFLGVAAARDWHVHQMDVHNAFLHGDLKEEVYMKLPQGF 1089
>UniRef100_O81617 F8M12.17 protein [Arabidopsis thaliana]
Length = 1633
Score = 349 bits (895), Expect = 3e-94
Identities = 212/623 (34%), Positives = 320/623 (51%), Gaps = 39/623 (6%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPDMIHRRLGHPSFDKLKVLVPQLSHLK 60
+ G IG LY L +Q + S SP + HPS L+ LV + LK
Sbjct: 469 ELTRGLMIGRGKTYNNLYILETQRT-----SFSPSLPAATSRHPSLPALQKLVSSIPSLK 523
Query: 61 SLD-----CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFI 115
S+ C L K R ++ S N S SPFD++H D+WGP + S G RY++T +
Sbjct: 524 SVSSTASHCRISPLAKQKRLAYVSHNNLAS-SPFDLIHLDIWGPFSIESVDGFRYFLTLV 582
Query: 116 DGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASL 175
D +R TW+ ++K++S++ F F I Q+ I+ +RSDN KE F F+
Sbjct: 583 DDCTRTTWVYMMKNKSEVSNIFPVFVKLIFTQYNAKIKAIRSDNVKEL---AFTKFVKEQ 639
Query: 176 GIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSS 235
G+IHQ SC +TPQQN V ERKH HL++ R+LL ++ P ++W D +LTA YLINR+PS
Sbjct: 640 GMIHQFSCAYTPQQNSVVERKHQHLLNIARSLLFQSNVPLQYWSDCVLTAAYLINRLPSP 699
Query: 236 VLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGY 295
+LDN+ P LL K P Y + + C+ R+K S RA CVFLGY KGY
Sbjct: 700 LLDNKTPFELLLKKIPDYTL---LKSCLCYASTNVHDRNKFSPRARPCVFLGYPSGYKGY 756
Query: 296 RCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPTPL-FHPIFEPPVSTQ 354
+ + I+ +V F E F + + D+ + ++P P H + P+
Sbjct: 757 KVLDLESHSISITRNVVFHETKFPFKTSKFLKESVDMFPNSILPLPAPLHFVESMPLDDD 816
Query: 355 SSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDS------SDSATAPI- 407
L+++ N P S +++ ++ + T+S +A AP
Sbjct: 817 ----LRADDNNASTSNSASSASSIPPLPSTVNTQNTDALDIDTNSVPIARPKRNAKAPAY 872
Query: 408 -------SSPVVVPPEPSNDLPIALHKGK---RSTANPHPVYNFLSYHRLSPSYFAFVSA 457
S P + P+ I + P+P+ +SY +L+P + +++ A
Sbjct: 873 LSEYHCNSVPFLSSLSPTTSTSIETPSSSIPPKKITTPYPMSTAISYDKLTPLFHSYICA 932
Query: 458 LSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGL 517
+ + PK +A+ ++W +A +E+ ALE N TW + S + GK++VGC+ VF +K
Sbjct: 933 YNVETEPKAFTQAMKSEKWTRAANEELHALEQNKTWIVESLTEGKNVVGCKWVFTIKYNP 992
Query: 518 DGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAF 577
DG ++R KARLVA G+TQ G DY +TFSPVAK SV+L + + A L Q+D+ NAF
Sbjct: 993 DGSIERYKARLVAQGFTQQEGIDYMETFSPVAKFGSVKLLLGLAAATGWSLTQMDVSNAF 1052
Query: 578 LHGDLEEEIYMEQPSGFVAWGGV 600
LHG+L+EEIYM P G+ G+
Sbjct: 1053 LHGELDEEIYMSLPQGYTPPTGI 1075
>UniRef100_Q9FXB7 Putative retroelement polyprotein [Arabidopsis thaliana]
Length = 1486
Score = 344 bits (883), Expect = 6e-93
Identities = 220/644 (34%), Positives = 326/644 (50%), Gaps = 54/644 (8%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQSSNICGVSA---SPDMIHRRLGHPSFDKLKVLV---P 54
D T IG GLY+ + S S + H+RLGHPS L +L
Sbjct: 468 DRTTLMLIGAGRELNGLYFFRGVETAAAVTSKALPSSQLWHQRLGHPSSKALHLLPFSDV 527
Query: 55 QLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTF 114
S S CE C K R FP S NK S + F++VH D+WGP R S G RY++T
Sbjct: 528 TSSTFDSKTCEICIQAKQTRDPFPLSSNKTSFA-FELVHCDLWGPYRTTSICGSRYFLTL 586
Query: 115 IDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMAS 174
+D +SR W+ LL + + F + ++ Q+ I+++RSDN E F + F A
Sbjct: 587 VDDYSRAVWLYLLPSKQEAPKHLKNFIALVERQYTTNIKMIRSDNGSE--FICLSDFFAQ 644
Query: 175 LGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPS 234
GIIH++SC TPQQNG ERKH H+++ R L + P +FW LTA YLINR P+
Sbjct: 645 KGIIHETSCVGTPQQNGRVERKHRHILNVARALRFQSGLPIEFWSYCALTAAYLINRTPT 704
Query: 235 SVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKG 294
+L + P L++ + P + +R++G C+VH+L G DK ++R+ K +FLGY +KG
Sbjct: 705 PLLKGKTPFELIYNRPPPLQ-HIRIFGCICYVHNLKHGGDKFASRSNKSIFLGYPFAKKG 763
Query: 295 YRCYSPSTRRFYISADVTFFE-DTPFFASPTTTSSTTD---VTDSQVIPTPLFHPIFEPP 350
+R Y+ T +S DV F E + F S +S + D V S++ + P+
Sbjct: 764 WRVYNIETGVVSVSRDVVFRETEFHFPISVMDSSPSLDPVLVDSSELEEISMTPPVTPSS 823
Query: 351 VSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAP-----KTVTTDSSDSATA 405
+T SSP S+P + +P+ S+ ++ + +TTD DS +
Sbjct: 824 PATPSSPVTPSSPVTPSSPVSPSSPVTPSSPVTPVSSTTTSAAIDTIEDITTDLEDSTSM 883
Query: 406 PI------------------SSPVVVPPEPSNDLPIALHKGKRS---------------- 431
SS V PP +L H+ KR
Sbjct: 884 DFFPDDEDEFSPTATESPASSSSPVHPPAVQLELLGKGHRPKRPPVKLADYVTTLLHQPF 943
Query: 432 -TANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESN 490
+A P+P+ N++S R S +Y A++ A++S + P+ +EA+ WK A+ E+ +LE+
Sbjct: 944 PSATPYPLDNYISSSRFSDNYQAYILAITSGNEPRNYNEAMLDDHWKGAVSHEIGSLENL 1003
Query: 491 HTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAK 550
TW + PGK +GC+ VF +K DG ++R KARLV +G Q G DY +TF+PVAK
Sbjct: 1004 GTWTVEDLPPGKKALGCKWVFRLKYKSDGTLERHKARLVVLGNNQTEGLDYTETFAPVAK 1063
Query: 551 MTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGF 594
M +VR F+ + Q+D+ NAFLHGDL+EE+YM+ P GF
Sbjct: 1064 MVTVRAFLQQVVSLDWEVHQMDVHNAFLHGDLDEEVYMQFPPGF 1107
>UniRef100_Q9SA17 F28K20.17 protein [Arabidopsis thaliana]
Length = 1415
Score = 342 bits (877), Expect = 3e-92
Identities = 214/614 (34%), Positives = 322/614 (51%), Gaps = 57/614 (9%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQ------SSNICGVSASPDMIHRRLGHPSFDKLKVL-- 52
D T K + T GLY L +Q S+ C +A+ ++ H RLGH + L+ L
Sbjct: 416 DLQTQKVVTTGPRRNGLYVLENQEFVALYSNRQC--AATEEVWHHRLGHANSKALQHLQN 473
Query: 53 --VPQLSHLKSLD-CESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*R 109
Q++ ++ CE CQ+GK R F S + R P D +H D+WGPS V+S G +
Sbjct: 474 SKAIQINKSRTSPVCEPCQMGKSSRLPFLIS-DSRVLHPLDRIHCDLWGPSPVVSNQGLK 532
Query: 110 YYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFN 169
YY F+D +SR +W L ++S+ F++F ++NQ I++ +SD E+
Sbjct: 533 YYAIFVDDYSRYSWFYPLHNKSEFLSVFISFQKLVENQLNTKIKVFQSDGGGEFVSNKLK 592
Query: 170 SFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLI 229
+ ++ GI H+ SCP+TPQQNG+AERKH HLV+ ++L ++H P KFW ++ TA Y+I
Sbjct: 593 THLSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVESFFTANYII 652
Query: 230 NRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYS 289
NR+PSSVL N P LF + P Y LRV+GS C+ ++K R+++CVFLGY+
Sbjct: 653 NRLPSSVLKNLSPYEALFGEKPDYS-SLRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYN 711
Query: 290 RTQKGYRCYSPSTRRFYISADVTFFE-DTPFFASPTTTSSTTDVTDSQVIP---TPLFHP 345
KGYRC+ P T + YIS +V F E + PF ++P TPL
Sbjct: 712 SQYKGYRCFYPPTGKVYISRNVIFNESELPF-----------KEKYQSLVPQYSTPLLQA 760
Query: 346 IFEPPVSTQSSP----QLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKTVTTDSSD 401
+S S P QL S P N Y V T P +S++ + SD
Sbjct: 761 WQHNKISEISVPAAPVQLFSKPIDL---NTYAGSQVTEQLTDPEPTSNN-------EGSD 810
Query: 402 SATAPISSPVVVPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSV 461
P++ + E + + K P+ Y A +++ +
Sbjct: 811 EEVNPVAEEIAANQEQVINSHAMTTRSKAGIQKPNTRY-------------ALITSRMNT 857
Query: 462 SIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQV 521
+ PKT+ A+ H W +A+ +E+ + HTW LV P+ +I+ + VF K+ DG +
Sbjct: 858 AEPKTLASAMKHPGWNEAVHEEINRVHMLHTWSLVPPTDDMNILSSKWVFKTKLHPDGSI 917
Query: 522 DRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGD 581
D+LKARLVA G+ Q G DY +TFSPV + ++RL + ++ K P+ QLD+ NAFLHG+
Sbjct: 918 DKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDVSTSKGWPIKQLDVSNAFLHGE 977
Query: 582 LEEEIYMEQPSGFV 595
L+E ++M QPSGF+
Sbjct: 978 LQEPVFMYQPSGFI 991
>UniRef100_Q9FLA4 Polyprotein [Arabidopsis thaliana]
Length = 1429
Score = 342 bits (877), Expect = 3e-92
Identities = 234/663 (35%), Positives = 325/663 (48%), Gaps = 86/663 (12%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQSSNICGVSASPD------MIHRRLGHPSFDKLKVLVP 54
D NTG + LY +I ++ASP H+RLGHP+ LK +V
Sbjct: 408 DLNTGARLLQGRTRNELYEWPVNQKSITILTASPSPKTDLSSWHQRLGHPALPILKDVVS 467
Query: 55 QLSHL-------KSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG 107
HL K L C C + K + F ++ S+ P + +++DVW S +S
Sbjct: 468 HF-HLPLSNTIPKQLPCSDCSINKSHKLPFFTNTIVSSQ-PLEYLYTDVW-TSPCISVDN 524
Query: 108 *RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAP 167
+YY+ +D F+R TW+ LK +SQ+ F+ F + ++N+F IR L SDN E F
Sbjct: 525 YKYYLVIVDHFTRYTWMYPLKQKSQVKDVFVAFKALVENRFQSRIRTLYSDNGGE--FIG 582
Query: 168 FNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACY 227
F+A+ GI H +S PHTP+ NG+AERKH H+V+T LL +A P FW A TA Y
Sbjct: 583 LRPFLAAHGISHLTSPPHTPEHNGLAERKHRHIVETGLALLTHASLPKTFWTYAFATAVY 642
Query: 228 LINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLG 287
LINRMP+ VL P LF P Y ++LRV+G C+ +KL AR+ CVFLG
Sbjct: 643 LINRMPTEVLQGTSPYVKLFQMSPNY-LKLRVFGCLCYPWLRPYNTNKLEARSTMCVFLG 701
Query: 288 YSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQVIPT---PLFH 344
YS TQ Y C +T R Y S V F E + FASP T+ + + T SQ T PL
Sbjct: 702 YSLTQSAYLCLDIATNRIYTSRHVQFVESSFPFASPRTSETDSTQTMSQPTTTNVIPLLQ 761
Query: 345 --------------PIFEPPVSTQSSPQ-----------LQSNPEFRRYGNIYERRHV-- 377
PIF P + SSP S+ NI V
Sbjct: 762 RPPHIAPPTALPLCPIFHSPPHSPSSPASPPSEHVPLTAASSSSNAINDDNISSTGQVSV 821
Query: 378 -----EAPETSPIDSSDS----APKTVTTDSSDSATAPISSPVVV--------------- 413
++P T+P + + S +P T+ S ++T P S V
Sbjct: 822 SGPTSQSPHTTPTNQNTSPLSKSPNPTNTNQSQNSTPPTSPTTSVHQHSPTPSPLPQNPP 881
Query: 414 -PPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALS 472
PP P ND P+ + K P +N L+ S + S +IP TV +AL
Sbjct: 882 LPPPPQNDHPMRT-RAKNQITKPKTKFN------LTTSLTS-----SKPTIPTTVAQALK 929
Query: 473 HQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIG 532
W+ AM +E+ A NHTW+LVSP K ++ C+ +F +K +DG + R KARLVA G
Sbjct: 930 DPNWRNAMSEEINAQMKNHTWDLVSPEEAKHVISCKWIFTLKYNVDGSIARYKARLVARG 989
Query: 533 YTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPS 592
+ Q YG DY++TFSPV K T++R + + + + Q+DI NAFL G L EE+Y+ QP
Sbjct: 990 FNQQYGIDYSETFSPVIKSTTIRTVLEVAVKRNWSIHQVDINNAFLQGTLNEEVYVSQPP 1049
Query: 593 GFV 595
GF+
Sbjct: 1050 GFI 1052
>UniRef100_Q9SSB1 T18A20.5 protein [Arabidopsis thaliana]
Length = 1522
Score = 338 bits (866), Expect = 6e-91
Identities = 223/624 (35%), Positives = 307/624 (48%), Gaps = 65/624 (10%)
Query: 17 LYYLHSQSSNICGVSASPDMIHRRLGHPSFDKLKVLVPQ-----LSHLKSLDCESCQLGK 71
L L+S N SAS ++ HRRLGH + + L L ++ + CE+C LGK
Sbjct: 444 LQVLYSTRQN----SASSEVWHRRLGHANAEVLHQLASSKSIIIINKVVKTVCEACHLGK 499
Query: 72 HVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRS 131
R F S S+ P + +H D+WGPS S G RYYV FID +SR TW LK +S
Sbjct: 500 STRLPFMLSTFNASR-PLERIHCDLWGPSPTSSVQGFRYYVVFIDHYSRFTWFYPLKLKS 558
Query: 132 QLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNG 191
F F+ F ++NQ G I+I + D E+ + F + GI SCP+TPQQNG
Sbjct: 559 DFFSTFVMFQKLVENQLGHKIKIFQCDGGGEFISSQFLKHLQDHGIQQNMSCPYTPQQNG 618
Query: 192 VAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLINRMPSSVLD-NEIPQSLLFPKD 250
+AERKH H+V+ +++ + P K+W ++ TA ++IN +P+S LD NE P L+ K
Sbjct: 619 MAERKHRHIVELGLSMIFQSKLPLKYWLESFFTANFVINLLPTSSLDNNESPYQKLYGKA 678
Query: 251 PLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYSRTQKGYRCYSPSTRRFYISAD 310
P Y LRV+G C+ K R++KCVFLGY+ KGYRC P T R YIS
Sbjct: 679 PEYSA-LRVFGCACYPTLRDYASTKFDPRSLKCVFLGYNEKYKGYRCLYPPTGRIYISRH 737
Query: 311 VTFFEDTPFFASPTTTSSTTDVTDSQVIPTPLFHPIFEPPVSTQSSPQLQSNPEFRRYGN 370
V F E+T F S + D T FH + P QS + S P+
Sbjct: 738 VVFDENTHPFESIYSHLHPQDKTPLLEAWFKSFHHV-TPTQPDQSRYPVSSIPQPETTDL 796
Query: 371 IYERRHVEAPETSPIDSSD-------------SAPKTVTTDS------------------ 399
V A P S D S +T DS
Sbjct: 797 SAAPASVAAETAGPNASDDTSQDNETISVVSGSPERTTGLDSASIGDSYHSPTADSSHPS 856
Query: 400 ---SDSATAPISSPVVVPPEPSNDLPIA-----LHKGKRSTANPHPVYNFLSYHRLSPSY 451
S A++P SP+ + P P+ + +GK + P+ Y L+ H++
Sbjct: 857 PARSSPASSPQGSPIQMAPAQQVQAPVTNEHAMVTRGKEGISKPNKRYVLLT-HKV---- 911
Query: 452 FAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VF 511
S+ PKTV EAL H W AM +EM + TW LV SP +++G VF
Sbjct: 912 --------SIPEPKTVTEALKHPGWNNAMQEEMGNCKETETWTLVPYSPNMNVLGSMWVF 963
Query: 512 NVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQL 571
K+ DG +D+LKARLVA G+ Q G DY +T+SPV + +VRL + + + + L Q+
Sbjct: 964 RTKLHADGSLDKLKARLVAKGFKQEEGIDYLETYSPVVRTPTVRLILHVATVLKWELKQM 1023
Query: 572 DIKNAFLHGDLEEEIYMEQPSGFV 595
D+KNAFLHGDL E +YM QP+GFV
Sbjct: 1024 DVKNAFLHGDLTETVYMRQPAGFV 1047
>UniRef100_O23302 Retrovirus-related like polyprotein [Arabidopsis thaliana]
Length = 1489
Score = 332 bits (851), Expect = 3e-89
Identities = 197/592 (33%), Positives = 307/592 (51%), Gaps = 48/592 (8%)
Query: 38 HRRLGHPSFDKLKVLVPQLSHLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVW 97
H+RLGHPS V++ +L L + S N + +PFD+VH D+W
Sbjct: 603 HQRLGHPS----SVVLQKLKRLAYI-----------------SHNNLASNPFDLVHLDIW 641
Query: 98 GPSRVMSTLG*RYYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRS 157
GP + S G RY++T +D +R TW+ +L+++ + F F + QF I+ +RS
Sbjct: 642 GPFSIESIEGFRYFLTVVDDCTRTTWVYMLRNKKDVSSVFPEFIKLVSTQFNAKIKAIRS 701
Query: 158 DNAKEYFFAPFNSFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKF 217
DNA E F + G++H SC +TPQQN V ERKH H+++ R LL ++ P ++
Sbjct: 702 DNAPEL---GFTEIVKEHGMLHHFSCAYTPQQNSVVERKHQHILNVARALLFQSNIPMQY 758
Query: 218 WGDAILTACYLINRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLS 277
W D + TA +LINR+PS +L+N+ P L+ K P Y + L+ +G CFV R K +
Sbjct: 759 WSDCVTTAVFLINRLPSPLLNNKSPYELILNKQPDYSL-LKNFGCLCFVSTNAHERTKFT 817
Query: 278 ARAVKCVFLGYSRTQKGYRCYSPSTRRFYISADVTFFEDTPFFASPTTTSSTTDVTDSQV 337
RA CVFLGY KGY+ + +S +V F E F + + D+ + +
Sbjct: 818 PRARACVFLGYPSGYKGYKVLDLESHSVTVSRNVVFKEHVFPFKTSELLNKAVDMFPNSI 877
Query: 338 IPTPL-FHPIFEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAP---------ETSPIDS 387
+P P H + P+ + S + + + R N P ET IDS
Sbjct: 878 LPLPAPLHFVETMPLIDEDS-LIPTTTDSRTADNHASSSSSALPSIIPPSSNTETQDIDS 936
Query: 388 S----DSAPKTVTTDS--SDSATAPISSPVVVPPE----PSNDLPIALHKGKRSTANPHP 437
+ + +T S S+ + + S +PP P + LP P+P
Sbjct: 937 NAVPITRSKRTTRAPSYLSEYHCSLVPSISTLPPTDSSIPIHPLPEIFTASSPKKTTPYP 996
Query: 438 VYNFLSYHRLSPSYFAFVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVS 497
+ +SY + +P +++ A ++ + PKT +A+ ++W + ++E+ A+E N TW + S
Sbjct: 997 ISTVVSYDKYTPLCQSYIFAYNTETEPKTFSQAMKSEKWIRVAVEELQAMELNKTWSVES 1056
Query: 498 PSPGKSIVGCR*VFNVKVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLF 557
P K++VGC+ VF +K DG V+R KARLVA G+TQ G D+ DTFSPVAK+TS ++
Sbjct: 1057 LPPDKNVVGCKWVFTIKYNPDGTVERYKARLVAQGFTQQEGIDFLDTFSPVAKLTSAKMM 1116
Query: 558 IAMTAMKR*PLFQLDIKNAFLHGDLEEEIYMEQPSGFVAWGGVVWYAN--CR 607
+ + A+ L Q+D+ +AFLHGDL+EEI+M P G+ G + N CR
Sbjct: 1117 LGLAAITGWTLTQMDVSDAFLHGDLDEEIFMSLPQGYTPPAGTILPPNPVCR 1168
>UniRef100_Q94IU9 Copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 331 bits (848), Expect = 7e-89
Identities = 221/622 (35%), Positives = 312/622 (49%), Gaps = 57/622 (9%)
Query: 1 DFNTGKTIGT*SISQGLYYLHSQ------SSNICGVSASPDMIHRRLGHPSFDKLKVLVP 54
D T K + + GLY L + S+ C +AS + H RLGH + L+ L+
Sbjct: 418 DLTTQKVVSKGPRNNGLYMLENSEFVALYSNRQC--AASMETWHHRLGHSNSKILQQLLT 475
Query: 55 QLS-----HLKSLDCESCQLGKHVRASFPSSPNKRSKSPFDIVHSDVWGPSRVMSTLG*R 109
+ S CE CQ+GK R F SS + R+ P D VH D+WGPS V+S G +
Sbjct: 476 RKEIQVNKSRTSPVCEPCQMGKSTRLQFFSS-DFRALKPLDRVHCDLWGPSPVVSNQGFK 534
Query: 110 YYVTFIDGFSRCTWIILLKDRSQLFGAFLTFCSEIKNQFGKGIRILRSDNAKEYFFAPFN 169
YY F+D FSR +W L+ +S+ F+ + ++NQ G I+ +SD E+
Sbjct: 535 YYAVFVDDFSRFSWFFPLRMKSKFISVFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLK 594
Query: 170 SFMASLGIIHQSSCPHTPQQNGVAERKHCHLVDTTRTLLINAHAPFKFWGDAILTACYLI 229
GI H+ SCP+TPQQNGVAERKH HLV+ ++L ++H P KFW +A TA YL
Sbjct: 595 EHFREHGIHHRISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLS 654
Query: 230 NRMPSSVLDNEIPQSLLFPKDPLYRVQLRVYGSTCFVHDLTPGRDKLSARAVKCVFLGYS 289
N +PSSVL P LF + Y LRV+G+ C+ ++K R+++CVFLGY
Sbjct: 655 NLLPSSVLKEISPYETLFQQKVDY-TPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYH 713
Query: 290 RTQKGYRCYSPSTRRFYISADVTFFE-DTPF---FASPTTTSST--------TDVTDSQV 337
KGYRC P T + YIS V F E PF + S T TD+T V
Sbjct: 714 NQYKGYRCLYPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPSV 773
Query: 338 IPTPLFHPI---FEPPVSTQSSPQLQSNPEFRRYGNIYERRHVEAPETSPIDSSDSAPKT 394
P+ P+ P ++++ P + E EA + SSD +T
Sbjct: 774 -PSSQLQPLARQMTPMATSENQPMMNYETE-------------EAVNVNMETSSDE--ET 817
Query: 395 VTTDSSDSATAPISSPVVVPPEPSNDLPIALHKGKRSTANPHPVYNFLSYHRLSPS-YFA 453
+ D D AP+ ND G+ S N HP+ P+ +A
Sbjct: 818 ESNDEFDHEVAPV----------LNDQNEDNALGQGSLENLHPMITRSKDGIQKPNPRYA 867
Query: 454 FVSALSSVSIPKTVHEALSHQEWKQAMIDEMVALESNHTWELVSPSPGKSIVGCR*VFNV 513
+ + SS PKT+ A+ H W A++DE+ + +TW LV + +I+ + VF
Sbjct: 868 LIVSKSSFDEPKTITTAMKHPSWNAAVMDEIDRIHMLNTWSLVPATEDMNILTSKWVFKT 927
Query: 514 KVGLDGQVDRLKARLVAIGYTQVYGQDYNDTFSPVAKMTSVRLFIAMTAMKR*PLFQLDI 573
K+ DG +D+LKARLVA G+ Q G DY +TFSPV + ++RL + PL QLD+
Sbjct: 928 KLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDTATANEWPLKQLDV 987
Query: 574 KNAFLHGDLEEEIYMEQPSGFV 595
NAFLHG+L+E ++M QPSGFV
Sbjct: 988 SNAFLHGELQEPVFMFQPSGFV 1009
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.339 0.146 0.489
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,512,022,912
Number of Sequences: 2790947
Number of extensions: 61071305
Number of successful extensions: 232333
Number of sequences better than 10.0: 1696
Number of HSP's better than 10.0 without gapping: 1230
Number of HSP's successfully gapped in prelim test: 477
Number of HSP's that attempted gapping in prelim test: 227166
Number of HSP's gapped (non-prelim): 3277
length of query: 965
length of database: 848,049,833
effective HSP length: 137
effective length of query: 828
effective length of database: 465,690,094
effective search space: 385591397832
effective search space used: 385591397832
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 80 (35.4 bits)
Medicago: description of AC145449.7