
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC122166.6 - phase: 0 /pseudo
(1091 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_O22148 Putative non-LTR retroelement reverse transcrip... 431 e-119
UniRef100_Q7XNY0 OSJNBb0015N08.11 protein [Oryza sativa] 408 e-112
UniRef100_Q7XJS2 Putative non-LTR retroelement reverse transcrip... 402 e-110
UniRef100_Q75GM6 Putative non-LTR retroelement reverse transcrip... 402 e-110
UniRef100_Q9FW98 Putative non-LTR retroelement reverse transcrip... 396 e-108
UniRef100_Q7X970 BZIP-like protein [Oryza sativa] 394 e-107
UniRef100_Q9FXJ5 F5A9.24 [Arabidopsis thaliana] 389 e-106
UniRef100_Q8LJX1 Putative reverse transcriptase [Sorghum bicolor] 382 e-104
UniRef100_Q9FYK1 F21J9.30 [Arabidopsis thaliana] 377 e-103
UniRef100_O82276 Putative non-LTR retroelement reverse transcrip... 377 e-102
UniRef100_Q9SKJ4 Putative non-LTR retroelement reverse transcrip... 376 e-102
UniRef100_Q9C6L3 Hypothetical protein F2J7.11 [Arabidopsis thali... 373 e-101
UniRef100_Q9SJ38 Putative non-LTR retroelement reverse transcrip... 370 e-100
UniRef100_Q9SZ87 RNA-directed DNA polymerase-like protein [Arabi... 369 e-100
UniRef100_O23757 Reverse transcriptase [Beta vulgaris subsp. vul... 363 1e-98
UniRef100_Q7X5Z1 OSJNBa0006A01.14 protein [Oryza sativa] 363 1e-98
UniRef100_Q65WR1 Putative polyprotein [Oryza sativa] 363 2e-98
UniRef100_O81630 F8M12.22 protein [Arabidopsis thaliana] 363 2e-98
UniRef100_O81517 T24M8.8 protein [Arabidopsis thaliana] 363 2e-98
UniRef100_Q9SN65 Hypothetical protein F25I24.40 [Arabidopsis tha... 362 3e-98
>UniRef100_O22148 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1374
Score = 431 bits (1108), Expect = e-119
Identities = 319/1098 (29%), Positives = 506/1098 (46%), Gaps = 76/1098 (6%)
Query: 1 VGASRGMVTIW-DRVEVEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLW 59
+G S G+ +W D V+++V + + LI + ++EFY+ +Y + + LW
Sbjct: 62 IGKSGGLALMWKDSVQIKVLQSDKR----LIDALLIWQDKEFYLTCIYGEPVQAERGELW 117
Query: 60 NSLSSRLLSLGSSKVCVCGDFNTVRSVDERRSVRG-SQVVDDCAPFNDFIEDRVLINLPL 118
L+ LS S + GDFN + VD + G ++ C F + L +
Sbjct: 118 ERLTRLGLSR-SGPWMLTGDFNEL--VDPSEKIGGPARKESSCLEFRQMLNSCGLWEVNH 174
Query: 119 CGRKFTWYNG--DGRSMSTLDRFLLSEEWCMVWPTCLQVAQLRGLSDHCPIVLTVDEKNW 176
G +F+WY D LDR + ++ W ++P + SDH P++ + NW
Sbjct: 175 SGYQFSWYGNRNDELVQCRLDRTVANQAWMELFPQAKATYLQKICSDHSPLINNLVGDNW 234
Query: 177 GP-RPVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVANTLN 235
+ K W G+ D + W Q S ++ EK+ + + +W ++
Sbjct: 235 RKWAGFKYDKRWVQREGFKDLLCNFWS--QQSTKTNALMMEKIASCRREISKW---KRVS 289
Query: 236 LPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQ-SRLNWL 294
PS V ++ + LD+ ++ EL L S ++ N WQ+ SR+ W+
Sbjct: 290 KPSSAVRIQELQFKLDAATKQIPFDRRELARLKKELSQEYN----NEEQFWQEKSRIMWM 345
Query: 295 RDGDANSKFFHSVLAARRRINSLSSILVDGVVVEGVQPVRQAVFTHSKNHFLAPIMTRPS 354
R+GD N+K+FH+ RR N + ++ + EG + + +
Sbjct: 346 RNGDRNTKYFHAATKNRRAQNRIQKLIDE----EGREWTSDEDLGRVAEAYFKKLFASED 401
Query: 355 VGHL--QFRKL----SSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFWH 408
VG+ + L S L+ P +EE++ A + + +K PGPDG+N + FW
Sbjct: 402 VGYTVEELENLTPLVSDQMNNNLLAPITKEEVQRATFSINPHKCPGPDGMNGFLYQQFWE 461
Query: 409 EMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKIL 468
M +I V F R+ + +G+N I LIPK+ + + +FRPISL +YK++ K++
Sbjct: 462 TMGDQITEMVQAFFRSGSIEEGMNKTNICLIPKILKAEKMTDFRPISLCNVIYKVIGKLM 521
Query: 469 ANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEV---VDEARKSKKDLLLFKVDFEKAY 525
ANRL++++ S++SE Q+AFVK R I D ILI +E+ + K ++ + K D KAY
Sbjct: 522 ANRLKKILPSLISETQAAFVKGRLISDNILIAHELLHALSSNNKCSEEFIAIKTDISKAY 581
Query: 526 DSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLSP 585
D V+W +L++ M + F W + I ECV + VL+NG P E RGLRQGDPLSP
Sbjct: 582 DRVEWPFLEKAMRGLGFADHWIRLIMECVKSVRYQVLINGTPHGEIIPSRGLRQGDPLSP 641
Query: 586 FLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRALR 645
+LF++ E L ++++ G V R P +SHL FADD++ N AL
Sbjct: 642 YLFVICTEMLVKMLQSAEQKNQITGLKVARGAP-PISHLLFADDSMFYCK---VNDEALG 697
Query: 646 AGLILLE---AMSGLKVNFHKSSL-VGVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGG 701
+ ++E SG +VN+ KSS+ G +I+ LG + +YLGL
Sbjct: 698 QIIRIIEEYSLASGQRVNYLKSSIYFGKHISEERRCLVKRKLGIEREGGEGVYLGLPESF 757
Query: 702 DPRRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISS 761
++ + +R+ ++ GWQS FLS GG+ +LLK V ALP Y +S FK P I
Sbjct: 758 QGSKVATLSYLKDRLGKKVLGWQSNFLSPGGKEILLKAVAMALPTYTMSCFKIPKTICQQ 817
Query: 762 IESLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEK 821
IES+ +F+W ++ R + W W LS K GGLG + FN+ALLGK WR++ EK
Sbjct: 818 IESVMAEFWWKNKKEGRGLHWKAWCHLSRPKAVGGLGFKEIEAFNIALLGKQLWRMITEK 877
Query: 822 EALWRKVLVARYGVADGGLEDG-GRSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRVGDG 880
++L KV +RY L G S W+ I + I +G +R +G+G
Sbjct: 878 DSLMAKVFKSRYFSKSDPLNAPLGSRPSFAWKSIYEAQVLIKQG--------IRAVIGNG 929
Query: 881 AETDFWWDCWCG-----DVSLCERFSRLYDLTVNKLISVRNMILLGVDVGGEALRWRRRL 935
+ W D W G +R + N + V++++L G W
Sbjct: 930 ETINVWTDPWIGAKPAKAAQAVKRSHLVSQYAANSIHVVKDLLL----PDGRDWNWNLVS 985
Query: 936 WAWEEELVEEFXALLLTVSLQESVTDRWIWLPTQDDGYSVRGAYDMLTS--------QE- 986
+ + E AL + DR+ W ++ YSV+ Y ++T QE
Sbjct: 986 LLFPDNTQENILALR---PGGKETRDRFTWEYSRSGHYSVKSGYWVMTEIINQRNNPQEV 1042
Query: 987 -QPHLHQNLELIWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILPLEARMCVSSCGNEE 1045
QP L + IW VP K+ WR + + L NLA R + + CV + E
Sbjct: 1043 LQPSLDPIFQQIWKLDVPPKIHHFLWRCVNNCLSVASNLAYRHL--AREKSCVRCPSHGE 1100
Query: 1046 DVNHLFLSCATFSALWPL 1063
VNHL C W +
Sbjct: 1101 TVNHLLFKCPFARLTWAI 1118
>UniRef100_Q7XNY0 OSJNBb0015N08.11 protein [Oryza sativa]
Length = 1026
Score = 408 bits (1048), Expect = e-112
Identities = 252/770 (32%), Positives = 403/770 (51%), Gaps = 45/770 (5%)
Query: 320 ILVDGVVVEGVQPVRQAVFTHSKNHFLAPIMTRPSVGHLQFRKLSSVERGGLIKPFHEEE 379
++ DGV +EG + + + KN F S+ K++ + GLI P EE
Sbjct: 184 LVQDGVCIEGQRDLMMYITNFYKNLFGHSQEVSISLSLDNVMKVTEEDAIGLITPVTMEE 243
Query: 380 IKAAVWDCDSYKIPGPDGINLGFIKSFWHEMKAEIVRFVTEFHRNRKLSKGINSRFIALI 439
++ V+D K PGPDG+ F FW +K +++ + +F RN +N I LI
Sbjct: 244 LRNVVFDLKKNKSPGPDGMPGEFYIHFWDLIKHDLLDLINDFQRNSVNIDRLNFGVITLI 303
Query: 440 PKVASPQSLNEFRPISLVGSLYKILAKILANRLRQVIGSVVSEVQSAFVKNRQILDGILI 499
PK + +FRPI L+ +KI+ K+L NRL + ++S+ Q+AF+KNR I++G++I
Sbjct: 304 PKTKDASQIQKFRPICLLNVSFKIITKVLMNRLNGTMAYIISKQQTAFLKNRFIMEGVVI 363
Query: 500 TNEVVDEARKSKKDLLLFKVDFEKAYDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATA 559
+EV++ + K+ +LFKVDFEKAYD V+W ++ ++ + FP W WI + V
Sbjct: 364 LHEVLNSIHQKKQSGILFKVDFEKAYDKVNWVFIYRMLKAKGFPDQWCDWIMKVVMGGKV 423
Query: 560 SVLVNGCPTDEFFLERGLRQGDPLSPFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPV 619
+V VN F +GLRQGDPLSP LF LAAE L +L++ + L +G + +
Sbjct: 424 AVKVNDQIGSFFKTHKGLRQGDPLSPLLFNLAAEALTLLVQRAEENSLIEGLGTNGDNKI 483
Query: 620 VVSHLQFADDTLLIGNKSWANVRALRAGLILLEAMSGLKVNFHKSSLVGVNITGSWLSEA 679
+ LQ+ADDT+ + N + + L+ L L E +SGLK+NF+KS +
Sbjct: 484 AI--LQYADDTIFLINDKLDHAKNLKYILCLFEQLSGLKINFNKSEVFCFGEAKEKQDLY 541
Query: 680 ASVLGCKVGKIPFLYLGLSIGGDPRRLL--FWEPVVNRIKNRLSGWQSRFLSFGGRLVLL 737
+++ CKVG +P YLG+ I D +R+L W+ N+++++L WQ R S GGRL+LL
Sbjct: 542 SNIFTCKVGSLPLKYLGIPI--DQKRILNKDWKLAENKMEHKLGCWQGRLQSIGGRLILL 599
Query: 738 KFVLTALPVYALSFFKAPSGIISSIESLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGL 797
L+++P+Y +SF++ P G+ I+ +F W + RK + W + ++ GGL
Sbjct: 600 NSTLSSVPMYMISFYRLPKGVQERIDYFRKRFLWQEDQGIRKYHLVNWPLVCSPRDQGGL 659
Query: 798 GVTRLREFNLALLGKWCWRLLLEKEALWRKVLVARYGVADGGLEDGGR---SCSSWWREI 854
GV L N A+LGKW WRL E E W++++ A+Y +D L G R S +W+ +
Sbjct: 660 GVLDLEAMNKAMLGKWIWRLENE-EGWWQEIIYAKY-CSDKPL-SGLRLKAGSSHFWQGV 716
Query: 855 VRIRDGIGEGGEGWFGSCVRRRVGDGAETDFWWDCWCGDVSLCERFSRLYDLTVNKLISV 914
+ ++D F S + VG+G +T FW D W G L +F LY + + K I++
Sbjct: 717 MEVKDD--------FFSFCTKIVGNGEKTLFWEDSWLGGKPLAIQFPSLYGIVITKRITI 768
Query: 915 RNMILLGV-------DVGGEALR-WRRRLWAWEEELVEEFXALLLTVSLQESVTDRWIWL 966
++ G+ D+ G+ LR WR+ + +WE ++L E+ D+ W
Sbjct: 769 ADLNRKGIDCMKFRRDLHGDKLRDWRKIVNSWE------------GLNLVENCKDKLWWT 816
Query: 967 PTQDDGYSVRGAYDMLTSQEQPHLHQNLELIWHTQVPLKVSILAWRLLRDRLPTKDNLAN 1026
++D ++VR Y L Q+ ++ IW +VPLK+ I W ++++ TKDNL
Sbjct: 817 LSKDGKFTVRSFYRALKLQQTSFPNKK---IWKFRVPLKIRIFIWFFTKNKILTKDNLLK 873
Query: 1027 RGILPLEARMCVSSCGNEEDVNHLFLSCATFSALWPLVQAWLGVVGVDSQ 1076
RG + + C E V HLF C +W ++ L V V S+
Sbjct: 874 RGWRKGDNK--CQFCDKVETVQHLFFDCPLARLIWNIIACALNVKPVLSR 921
>UniRef100_Q7XJS2 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1344
Score = 402 bits (1033), Expect = e-110
Identities = 326/1096 (29%), Positives = 504/1096 (45%), Gaps = 82/1096 (7%)
Query: 2 GASRGMVTIW-DRVEVEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLWN 60
G S G+ W +E+E + + L + R N+ ++I +Y + LW
Sbjct: 41 GRSGGLAIFWKSHLEIEFLYADKNLMDLQVSSR----NKVWFISCVYGLPVTHMRPKLWE 96
Query: 61 SLSSRLLSLGSSKVCVCGDFNTVRSVDERRS-VRGSQVVDDCAPFNDFIEDRVLINLPLC 119
L+S L + C+ GDFN +RS DE+ R S C F + + + L
Sbjct: 97 HLNSIGLKRAEAW-CLIGDFNDIRSNDEKLGGPRRSPSSFQC--FEHMLLNCSMHELGST 153
Query: 120 GRKFTW--YNGDGRSMSTLDRFLLSEEWCMVWPTCLQVAQLRGLSDHCPIVLTVDEKNWG 177
G FTW D LDR + W ++P Q + SDH P+++ N
Sbjct: 154 GNSFTWGGNRNDQWVQCKLDRCFGNPAWFSIFPNAHQWFLEKFGSDHRPVLVKFTNDNEL 213
Query: 178 PR-PVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVANTLNL 236
R R K D P + + W S G L + A+ W ++ N
Sbjct: 214 FRGQFRYDKRLDDDPYCIEVIHRSWNSAMSQGTHSSFFS--LIECRRAISVWKHSSDTNA 271
Query: 237 PSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQ-SRLNWLR 295
S+I K R LD+ E+ + + I + SL+ + + W+Q SR WL
Sbjct: 272 QSRI---KRLRKDLDA---EKSIQIPCWPRIEYIKDQL-SLAYGDEELFWRQKSRQKWLA 324
Query: 296 DGDANSKFFHSVLAARRRINSLSSILVDGVVVEGVQPVRQAVFTHSKNHFLAPIMTRPSV 355
GD N+ FFH+ + + R N LS +L + + R + + F + T +
Sbjct: 325 GGDKNTGFFHATVHSERLKNELSFLLDEN----DQEFTRNSDKGKIASSFFENLFTSTYI 380
Query: 356 ----GHLQF--RKLSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFWHE 409
HL+ K++S LI+ E E+ AV+ + PGPDG F + W
Sbjct: 381 LTHNNHLEGLQAKVTSEMNHNLIQEVTELEVYNAVFSINKESAPGPDGFTALFFQQHWDL 440
Query: 410 MKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKILA 469
+K +I+ + F L + N I LIPK+ SPQ +++ RPISL LYKI++KIL
Sbjct: 441 VKHQILTEIFGFFETGVLPQDWNHTHICLIPKITSPQRMSDLRPISLCSVLYKIISKILT 500
Query: 470 NRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARKS---KKDLLLFKVDFEKAYD 526
RL++ + ++VS QSAFV R I D IL+ +E++ R + K+ + FK D KAYD
Sbjct: 501 QRLKKHLPAIVSTTQSAFVPQRLISDNILVAHEMIHSLRTNDRISKEHMAFKTDMSKAYD 560
Query: 527 SVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLSPF 586
V+W +L+ +M ++ F W WI CV + + SVL+NG P RG+RQGDPLSP
Sbjct: 561 RVEWPFLETMMTALGFNNKWISWIMNCVTSVSYSVLINGQPYGHIIPTRGIRQGDPLSPA 620
Query: 587 LFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRALRA 646
LF+L E L ++ AG G + V V+HL FADDTLL+ + L
Sbjct: 621 LFVLCTEALIHILNKAEQAGKITGIQF-QDKKVSVNHLLFADDTLLMCKATKQECEELMQ 679
Query: 647 GLILLEAMSGLKVNFHKSSLV-GVNIT---GSWLSEAASV-LGCKVGKIPFLYLGLS--I 699
L +SG +N +KS++ G N+ W+ + + L GK YLGL +
Sbjct: 680 CLSQYGQLSGQMINLNKSAITFGKNVDIQIKDWIKSRSGISLEGGTGK----YLGLPECL 735
Query: 700 GGDPRRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGII 759
G R L + + ++++RL+GW ++ LS GG+ VLLK + ALPVY +S FK P +
Sbjct: 736 SGSKRDLFGF--IKEKLQSRLTGWYAKTLSQGGKEVLLKSIALALPVYVMSCFKLPKNLC 793
Query: 760 SSIESLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLL 819
+ ++ F+W + KRKI W+ W L+L K+ GG G L+ FN ALL K WR+L
Sbjct: 794 QKLTTVMMDFWWNSMQQKRKIHWLSWQRLTLPKDQGGFGFKDLQCFNQALLAKQAWRVLQ 853
Query: 820 EKEALWRKVLVARY-GVADGGLEDGGRSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRVG 878
EK +L+ +V +RY +D G S WR I+ R+ + +G +R +G
Sbjct: 854 EKGSLFSRVFQSRYFSNSDFLSATRGSRPSYAWRSILFGRELLMQG--------LRTVIG 905
Query: 879 DGAETDFWWDCWCGDVSLCERFSRLYDLTVNKLISVRNMILLGVDVGGEALRWRRRLWAW 938
+G +T W D W D S +R + V+ +S L+ L R L+ W
Sbjct: 906 NGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLKVSQ----LIDPTSRNWNLNMLRDLFPW 961
Query: 939 EE-ELVEEFXALLLTVSLQESVTDRWIWLPTQDDGYSVRGAYDMLTSQEQPHLHQNLEL- 996
++ E++ + L D + WL + + YSV+ Y+ L+ Q L+Q ++
Sbjct: 962 KDVEIILKQRPLFFK-------EDSFCWLHSHNGLYSVKTGYEFLSKQVHHRLYQEAKVK 1014
Query: 997 ---------IWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILPLEARMCVSSCGNEEDV 1047
IW+ K+ I W+ L +P +D L RGI + C+ E +
Sbjct: 1015 PSVNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRSDDG--CLMCDTENETI 1072
Query: 1048 NHLFLSCATFSALWPL 1063
NH+ C +W +
Sbjct: 1073 NHILFECPLARQVWAI 1088
>UniRef100_Q75GM6 Putative non-LTR retroelement reverse transcriptase [Oryza sativa]
Length = 1614
Score = 402 bits (1033), Expect = e-110
Identities = 264/905 (29%), Positives = 446/905 (49%), Gaps = 41/905 (4%)
Query: 16 VEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLWNSLSSRLLSLGSSKVC 75
V+V T G +F+ K ++ + ++Y P D K V L + S V
Sbjct: 698 VDVTETHVGSYFINCVVVNRKDKFQWEVISVYGPVDNSLKNVFLEELRWHI-SGREHPVM 756
Query: 76 VCGDFNTVRSVDERRSVRGSQVVDDCAP-FNDFIEDRVLINLPLCGRKFTWYNGDG-RSM 133
+ GDFN + E+ + S + C FN FI D L + + G KFTW N +
Sbjct: 757 LGGDFNLYKFASEKSN---SNIDVRCMDMFNKFISDLDLREVHIIGPKFTWTNKQYCPTQ 813
Query: 134 STLDRFLLSEEWCMVWPTCLQVAQLRGLSDHCPIVLTVDEKNWGP-RPVRLLKCWQDMPG 192
LDR L+S++W +P L + LR SDH P++L E R R W + G
Sbjct: 814 EVLDRILVSDDWDDRYPNSLISSVLRVGSDHTPLLLDTYELVVEKCRYFRFESAWLAVEG 873
Query: 193 YSDFVREKW----GSFQVSGWGGFVLKEKLKLMKAALKEWHVANTLNLPSKIVSLKNRRA 248
+ D V +K+ GS+ + W +K M+ L W + +SL+ +
Sbjct: 874 FKDLVGKKFPDRDGSYILEFWN-----KKQVEMRRFLTGWGINKQSEDRRAKLSLQKKLE 928
Query: 249 VLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQSRLNWLRDGDANSKFFHSVL 308
+DSK + EL+ +E + + + + L ++ WL +GD N++++H +
Sbjct: 929 EIDSKAVDAELNADEWRHRYELEDALEHIYELEELYWHKRCGEQWLLEGDRNTEYYHRIA 988
Query: 309 AARRRINSLSSILVDGVVVEGVQPVRQAVFTHSKNHFLAPIMTRPSVGH---LQFRKLSS 365
R++ +++S++ + + ++ + + K+ F + + + + L
Sbjct: 989 NGRKKRCTITSLMDGDRELTTKEDLKHHIVEYYKSLFRSESSSSIHLSQGVWVDSLCLKE 1048
Query: 366 VERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFWHEMKAEIVRFVTEFHRNR 425
++ L++PF EE+ + + GPDG ++ F ++FW +++ ++ + +
Sbjct: 1049 EDKNTLMRPFTIEELYKVLREAKPNTAFGPDGFSIPFYRAFWPQLRPDLFEMLLMLYNEE 1108
Query: 426 KLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKILANRLRQVIGSVVSEVQS 485
K +N I+LIPK ++P + +FRPI ++ +K ++K + NRL ++ V+S Q+
Sbjct: 1109 LDLKRLNFGVISLIPKNSNPTDIKQFRPICVLNDCFKFISKCVCNRLTEIARDVISPTQT 1168
Query: 486 AFVKNRQILDGILITNEVVDEARKSKKDLLLFKVDFEKAYDSVDWGYLDEVMGSMTFPTV 545
AF+ R IL+G +I +EV+ E + + ++ K+DFEKAYD V W +L EVM FP+
Sbjct: 1169 AFIPGRFILEGCVIIHEVLHEMNRKNLEGIILKIDFEKAYDKVSWDFLIEVMVRKGFPSK 1228
Query: 546 W*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLSPFLFLLAAEGLNVLMKALVDA 605
W WIK CV + +NG TD F RGLRQGDPLSP LF L ++ L ++ +
Sbjct: 1229 WVNWIKTCVMGGRVCININGERTDFFRTFRGLRQGDPLSPLLFNLISDALAAMLDSAKRE 1288
Query: 606 GLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRALRAGLILLEAMSGLKVNFHKSS 665
G+ G V P ++HLQ+ADDT+L + A + L E M+ LKVN+HKS
Sbjct: 1289 GVLSGL-VPDIFPGGITHLQYADDTVLFVANDDKQIVATKFILYCFEEMAVLKVNYHKSE 1347
Query: 666 LVGVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGGDPRRLLFWEPVVNRIKNRLSGWQS 725
+ + ++ + + A + C VG+ P YLGL IG D L ++ + +++ RL+ W +
Sbjct: 1348 IFTLGLSDNDTNRVAMMFNCPVGQFPMKYLGLPIGPDKILNLGFDFLGQKLEKRLNSWGN 1407
Query: 726 RFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSIESLFNKFFWGGGEDKRKISWIRW 785
LS GR V + L+++P YA+ F++ P G+ S+ +++W K K ++W
Sbjct: 1408 N-LSHAGRAVQINTCLSSIPSYAMCFYQLPEGVHQKFGSIRGRYYWARNRLKGKYHMVKW 1466
Query: 786 DTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKEALWRKVLVARYGVADGGLEDGG- 844
+ L+ K+YGGLG T R N ALL KW ++ E ++L ++L +Y L+DGG
Sbjct: 1467 EDLAFPKDYGGLGFTETRRMNTALLAKWIMKIESEDDSLCIELLRRKY------LQDGGF 1520
Query: 845 -----RSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRVGDGAETDFWWDCWCGDVSLCER 899
R S +W+ ++ IR + G W +VGDG+ FW D W G L
Sbjct: 1521 FQCKERYASQFWKGLLNIRRWLSLGSV-W-------QVGDGSHISFWRDVWWGQCPLRTL 1572
Query: 900 FSRLY 904
F ++
Sbjct: 1573 FPAIF 1577
>UniRef100_Q9FW98 Putative non-LTR retroelement reverse transcriptase [Oryza sativa]
Length = 1382
Score = 396 bits (1018), Expect = e-108
Identities = 326/1110 (29%), Positives = 498/1110 (44%), Gaps = 100/1110 (9%)
Query: 2 GASRGMVTIWDRVEVEVWSTVRGV--HFLLIHGRFLKSNEE---FYIFNMYAPCDRRAKQ 56
G S G+ W ++RG HF+ + L S EE + I +Y R +
Sbjct: 68 GLSGGLALFWTTAYTV---SLRGFNSHFIDV----LVSTEELPPWRISFVYGEPKRELRH 120
Query: 57 VLWNSLSSRLLSLGSSKVCVCGDFNTVRSVDERRSVRGSQVVDDCAPFNDFIEDRVLINL 116
WN L RL CGDFN V +DE +R + F ++D LI+L
Sbjct: 121 FFWNLLR-RLHDQWRGPWLCCGDFNEVLCLDEHLGMR-ERSEPHMQHFRSCLDDCGLIDL 178
Query: 117 PLCGRKFTWYN---GDGRSMSTLDRFLLSEEWCMVWPTCLQVAQLRGLSDHCPIVLTVDE 173
G KFTW N + S LDR + + E+ + CL + SDH I + +
Sbjct: 179 GFVGPKFTWSNKQDANSNSKVRLDRAVANGEFSRYFEDCLVENVITTSSDHYAISIDLSR 238
Query: 174 KNWGPRPV------RLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKE 227
+N G R + R W Y + V W G + L+ + +LK+
Sbjct: 239 RNHGQRRIPIQQGFRFEAAWLRAEDYREVVENSWRISSAGCVGLRGVWSVLQQVAVSLKD 298
Query: 228 WHVANTLNLPSKIVSLKNR-RAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICW 286
W A+ ++ KI+ ++ + +++ S + + EE+L I + L +
Sbjct: 299 WSKASFGSVRRKILKMERKLKSLRQSPVNDVVIQEEKL-----IEQQLCELFEKEEIMAR 353
Query: 287 QQSRLNWLRDGDANSKFFHSVLAARRRINSLSSILVDG----VVVEGVQPVRQAVFTHSK 342
Q+SR++WLR+GD N+ FFH+ +ARRR N + ++ D + EG++ + + + +
Sbjct: 354 QRSRVDWLREGDRNTAFFHARASARRRTNRIKELVRDDGSRCISQEGIKRMAEVFY---E 410
Query: 343 NHFLA-PIMTRPSVGHLQFRKLSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLG 401
N F + P + V K+ G L K + EEIK A++ S K PGPDG
Sbjct: 411 NLFSSEPCDSMEEVLDAIPNKVGDFINGELGKQYTNEEIKTALFQMGSTKAPGPDGFPAL 470
Query: 402 FIKSFWHEMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLY 461
F ++ W ++ I V F ++ +G+ + LIPKV + L++FRPISL LY
Sbjct: 471 FYQTHWGILEEHICNAVRGFLLGEEIPEGLCDSVVVLIPKVNNASHLSKFRPISLCNVLY 530
Query: 462 KILAKILANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARK--SKKDLLLFKV 519
KI +K+LANRL+ + +VSE QSAFV R I D L+ E + RK +K K+
Sbjct: 531 KIASKVLANRLKPFLPDIVSEFQSAFVPGRLITDSALVAYECLHTIRKQHNKNPFFALKI 590
Query: 520 DFEKAYDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQ 579
D KAYD V+W YL + + F W + CV + +V +NG T RG+RQ
Sbjct: 591 DMMKAYDRVEWAYLSGCLSKLGFSQDWINTVMRCVSSVRYAVKINGELTKPVVPSRGIRQ 650
Query: 580 GDPLSPFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWA 639
GDP+SP+LFLL EGL+ L+ AG +G GR P +SHL FADD++
Sbjct: 651 GDPISPYLFLLCTEGLSCLLHKKEVAGELQGIKNGRHGP-PISHLLFADDSIFFAKADSR 709
Query: 640 NVRALRAGLILLEAMSGLKVNFHKSSL-VGVNITGSWLSEAASVLGCKVGKIPFLYLGLS 698
NV+AL+ L + SG K+N HKSS+ G + S L + YLG+
Sbjct: 710 NVQALKNTLRSYCSASGQKINLHKSSIFFGKRCPDAVKISVKSCLQVDNEVLQDSYLGMP 769
Query: 699 IGGDPRRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGI 758
F++ + RI R++GW R LS G +LK V A+P Y +S F+ P I
Sbjct: 770 TEIGLATTNFFKFLPERIWKRVNGWTDRPLSRAGMETMLKAVAQAIPNYVMSCFRIPVSI 829
Query: 759 ISSIESLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLL 818
+++ +WG + K+K+ W W LS K GG+G FN A+LG+ CWRLL
Sbjct: 830 CEKMKTCIADHWWGFEDGKKKMHWKSWSWLSTPKFLGGMGFREFTTFNQAMLGRQCWRLL 889
Query: 819 LEKEALWRKVLVARYGVADGGLEDG-GRSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRV 877
+ ++L +VL RY E +S S WR ++ R+ + +G VR V
Sbjct: 890 TDPDSLCSRVLKGRYFPNSSFWEAAQPKSPSFTWRSLLFGRELLAKG--------VRWGV 941
Query: 878 GDGAETDFWWDCWCGD-----VSLCERFSRLYDLTVNKLISVRNMILLGVDVGGEALRWR 932
GDG + D W V+ F D TV+ L++ E R
Sbjct: 942 GDGKTIKIFSDNWIPGFRPQLVTTLSPFPT--DATVSCLMN-------------EDAR-- 984
Query: 933 RRLWAWEEELVEEFXALLLTVSL------QESVTDRWIWLPTQDDGYSVRGAYDMLTSQ- 985
W+ +L+ + + + + D W + YSVR AY++ S+
Sbjct: 985 ----CWDGDLIRSLFPVDIAKEILQIPISRHGDADFASWPHDKLGLYSVRSAYNLARSEA 1040
Query: 986 ---EQPHLHQNL-----------ELIWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILP 1031
+Q + + + + +W P K+ I WR + L T L R I
Sbjct: 1041 FFADQSNSGRGMASRLLESQKDWKGLWKINAPGKMKITLWRAAHECLATGFQLRRRHIPS 1100
Query: 1032 LEARMCVSSCGNEEDVNHLFLSCATFSALW 1061
+ CV C ++ V H+FL C + +W
Sbjct: 1101 TDG--CV-FCNRDDTVEHVFLFCPFAAQIW 1127
>UniRef100_Q7X970 BZIP-like protein [Oryza sativa]
Length = 2367
Score = 394 bits (1011), Expect = e-107
Identities = 328/1120 (29%), Positives = 500/1120 (44%), Gaps = 98/1120 (8%)
Query: 2 GASRGMVTIWDRVEVEVWSTVRGVHFLLIHGRFLKSNEE--FYIFNMYAPCDRRAKQVLW 59
G S G+ WD V V+ ++ I S EE +++ +Y + +W
Sbjct: 792 GMSGGLALYWDE---SVSVDVKDINKRYIDAYVQLSPEEPQWHVTFVYGEPRVENRHRMW 848
Query: 60 NSLSSRLLSLGSSKVCVCGDFNTVRSVDER--RSVRGSQVVDDCAPFNDFIEDRVLINLP 117
SL + S V GDFN E R+ RG + D F D ++D L +L
Sbjct: 849 -SLLRTIHQSSSLPWAVIGDFNETMWQFEHFSRTPRGEPQMQD---FRDVLQDCELHDLG 904
Query: 118 LCGRKFTWYN---GDGRSMSTLDRFLLSEEWCMVWPTCLQVAQLRGLSDHCPIVLTVDEK 174
G T+ N G LDR + ++W ++ T V + SDHCPI+L + K
Sbjct: 905 FKGVPHTYDNKREGWRNVKVRLDRVVADDKWRDIYSTAQVVHLVSPCSDHCPILLNLVVK 964
Query: 175 NWGPRPVRLLKC------WQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEW 228
+ P +R KC W+ P + + E W G + K K+M AL+ W
Sbjct: 965 D--PHQLRQ-KCLHYEIVWEREPEATQVIEEAWVVAGEKADLGDINKALAKVM-TALRSW 1020
Query: 229 HVANTLNLPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQ 288
A N+ ++ + + A L + + + + ++ L + Q+
Sbjct: 1021 SRAKVKNVGRELEKARKKLAELIESNADRTV-------IRNATDHMNELLYREEMLWLQR 1073
Query: 289 SRLNWLRDGDANSKFFHSVLAARRRINSLSSIL-VDGVVVEGVQPVRQAVFTHSKNHFLA 347
SR+NWL+D D N+KFFHS R + N +S + + V + + ++ + A
Sbjct: 1074 SRVNWLKDEDRNTKFFHSRAVWRAKKNKISKLRDANETVHSSTMKLESMATEYFQDVYTA 1133
Query: 348 -PIMTRPSVGHLQFRKLSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSF 406
P + +V L K++ + L + F E+EI A++ K PGPDG F +
Sbjct: 1134 DPNLNPETVTRLIQEKVTDIMNEKLCEDFTEDEISQAIFQIGPLKSPGPDGFPARFYQRN 1193
Query: 407 WHEMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAK 466
W +KA+I+ V F + + +G+N I LIPK P L +FRPISL +YK+++K
Sbjct: 1194 WGTIKADIIGAVRRFFQTGLMPEGVNDTAIVLIPKKEQPVDLRDFRPISLCNVVYKVVSK 1253
Query: 467 ILANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARKSKK---DLLLFKVDFEK 523
L NRLR ++ +VS QSAFV+ R I D L+ E +K+KK +K+D K
Sbjct: 1254 CLVNRLRPILDDLVSVEQSAFVQGRMITDNALLAFECFHAMQKNKKANHAACAYKLDLSK 1313
Query: 524 AYDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPL 583
AYD VDW +L+ M + F W WI +CV + V NG F RGLRQGDPL
Sbjct: 1314 AYDRVDWRFLEMAMNKLGFARRWVNWIMKCVTSVRYMVKFNGTLLQSFAPTRGLRQGDPL 1373
Query: 584 SPFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRA 643
PFLFL A+GL++L+K V + V RA P +SHL FADDTLL
Sbjct: 1374 LPFLFLFVADGLSLLLKEKVAQNSLTPFKVCRAAP-GISHLLFADDTLLFFKAHQREAEV 1432
Query: 644 LRAGLILLEAMSGLKVNFHKSSLVGVNITGSWLSEAAS-VLGCKVGKIPFLYLGLSIGGD 702
++ L +G +N K S++ + +SEA S +L + + YLG
Sbjct: 1433 VKEVLSSYAMGTGQLINPAKCSILMGGASTPAVSEAISEILQVERDRFEDRYLGFPTPEG 1492
Query: 703 PRRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSI 762
++ + +I R+ W LS GG+ VL+K V+ A+PVY + FK P +I +
Sbjct: 1493 RMHKGRFQSLQAKIWKRVIQWGENHLSTGGKEVLIKAVIQAIPVYVMGIFKLPESVIDDL 1552
Query: 763 ESLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKE 822
L F+W +RK W WD+L+ K GGLG R FN ALL + WRL+ +
Sbjct: 1553 TKLTKNFWWDSMNGQRKTHWKAWDSLTKPKSLGGLGFRDYRLFNQALLARQAWRLITYPD 1612
Query: 823 ALWRKVLVARYGVADGGLEDG--GRSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRVGDG 880
+L +VL A+Y G L D G + S WR I D + +G + RVG+G
Sbjct: 1613 SLCARVLKAKY-FPHGSLIDTSFGSNSSPAWRSIEYGLDLLKKG--------IIWRVGNG 1663
Query: 881 AETDFWWDCWCGDVSLCERFSRLYDLTVNKLISVRNMILLGVDVGGEA---LRWRRRL-- 935
W D W L SR + G+A L+W L
Sbjct: 1664 NSIRIWRDSW-----LPRDHSRR-------------------PITGKANCRLKWVSDLIT 1699
Query: 936 --WAWEEELVEEF-----XALLLTVSLQESVTDRWI-WLPTQDDGYSVRGAYDML----- 982
+W+ + ++ ++L + + + +I W P ++ +SVR AY +
Sbjct: 1700 EDGSWDVPKIHQYFHNLDAEVILNICISSRSEEDFIAWHPDKNGMFSVRSAYRLAAQLVN 1759
Query: 983 ----TSQEQPHLHQNLELIWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILPLEARMCV 1038
+S ++++ E+IW +VP KV I AWR+ + L T N R + ++ MC
Sbjct: 1760 IEESSSSGTNNINKAWEMIWKCKVPQKVKIFAWRVASNCLATMVNKKKRKL--EQSDMCQ 1817
Query: 1039 SSCGNEEDVNHLFLSCATFSALWPLVQAWLGVVGVDSQST 1078
ED H C S LW + G V VD +++
Sbjct: 1818 ICDRENEDDAHALCRCIQASQLWSCMHK-SGSVSVDIKAS 1856
>UniRef100_Q9FXJ5 F5A9.24 [Arabidopsis thaliana]
Length = 1254
Score = 389 bits (1000), Expect = e-106
Identities = 312/1064 (29%), Positives = 484/1064 (45%), Gaps = 82/1064 (7%)
Query: 1 VGASRGMVTIWDRVEVEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLWN 60
VG G+ +W + V ++ V L+ + F + +Y DR + W
Sbjct: 28 VGKCGGLALLW---KSSVQVDLKFVDKNLMDAQVQFGAVNFCVSCVYGDPDRSKRSQAWE 84
Query: 61 SLSSRLLSLGS-SKVCVCGDFNTVRSVDERRSVRGSQVVD-DCAPFNDFIEDRVLINLPL 118
+S + +G K C+ GDFN + E+ G + D DC FN+ I+ L+ +P
Sbjct: 85 RISR--IGVGRRDKWCMFGDFNDILHNGEKNG--GPRRSDLDCKAFNEMIKGCDLVEMPA 140
Query: 119 CGRKFTWYN--GDGRSMSTLDRFLLSEEWCMVWPTCLQV-AQLRGLSDHCPIVLTVDEKN 175
G FTW GD LDR ++EW +P Q RG SDH P+++ +
Sbjct: 141 HGNGFTWAGRRGDHWIQCRLDRAFGNKEWFCFFPVSNQTFLDFRG-SDHRPVLIKLMSSQ 199
Query: 176 WGPR-PVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVANTL 234
R R K + + + W + + ++L+ + +L W N L
Sbjct: 200 DSYRGQFRFDKRFLFKEDVKEAIIRTWSRGKHGT--NISVADRLRACRKSLSSWKKQNNL 257
Query: 235 NLPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQSRLNWL 294
N KI L+ A L+ +E+ L + + D+ R + Q+SR WL
Sbjct: 258 NSLDKINQLE---AALE---KEQSLVWPIFQRVSVLKKDLAKAYREEEAYWKQKSRQKWL 311
Query: 295 RDGDANSKFFHSVLAA---RRRINSLSSILVDGVVVEGVQPVRQAVFTHSKNHFLAPIMT 351
R G+ NSK+FH+ + R+RI L V+G + + + N F + +
Sbjct: 312 RSGNRNSKYFHAAVKQNRQRKRIEKLKD--VNGNMQTSEAAKGEVAAAYFGNLFKS---S 366
Query: 352 RPSVGHLQFR----KLSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFW 407
PS F ++S V L+ +EIK AV+ PGPDG++ F + +W
Sbjct: 367 NPSGFTDWFSGLVPRVSEVMNESLVGEVSAQEIKEAVFSIKPASAPGPDGMSALFFQHYW 426
Query: 408 HEMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKI 467
+ ++ V +F + + N + LIPK P + + RPISL LYKI++KI
Sbjct: 427 STVGNQVTSEVKKFFADGIMPAEWNYTHLCLIPKTQHPTEMVDLRPISLCSVLYKIISKI 486
Query: 468 LANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEAR---KSKKDLLLFKVDFEKA 524
+A RL+ + +VS+ QSAFV R I D IL+ +E+V + + + + K D KA
Sbjct: 487 MAKRLQPWLPEIVSDTQSAFVSERLITDNILVAHELVHSLKVHPRISSEFMAVKSDMSKA 546
Query: 525 YDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLS 584
YD V+W YL ++ S+ F W WI CV + T SVL+N CP L+RGLRQGDPLS
Sbjct: 547 YDRVEWSYLRSLLLSLGFHLKWVNWIMVCVSSVTYSVLINDCPFGLIILQRGLRQGDPLS 606
Query: 585 PFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRAL 644
PFLF+L EGL L+ G +G P+V HL FADD+L + S L
Sbjct: 607 PFLFVLCTEGLTHLLNKAQWEGALEGIQFSENGPMV-HHLLFADDSLFLCKASREQSLVL 665
Query: 645 RAGLILLEAMSGLKVNFHKSSLV-GVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGGDP 703
+ L + +G +N +KSS+ G + + LG YLGL
Sbjct: 666 QKILKVYGNATGQTINLNKSSITFGEKVDEQLKGTIRTCLGIFTEGGAGTYLGLPECFSG 725
Query: 704 RRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSIE 763
++ + +R+K +L W +R LS GG+ VLLK V A+PV+A+S FK P ++E
Sbjct: 726 SKVDMLHYLKDRLKEKLDVWFTRCLSQGGKEVLLKSVALAMPVFAMSCFKLPITTCENLE 785
Query: 764 SLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKEA 823
S F+W + RKI W W+ L L K+ GGLG ++ FN ALL K WRLL +
Sbjct: 786 SAMASFWWDSCDHSRKIHWQSWERLCLPKDSGGLGFRDIQSFNQALLAKQAWRLLHFPDC 845
Query: 824 LWRKVLVARYGVADGGLEDG-GRSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRVGDGAE 882
L ++L +RY A L+ + S WR I+ R+ + +G +++RVGDGA
Sbjct: 846 LLSRLLKSRYFDATDFLDAALSQRPSFGWRSILFGRELLSKG--------LQKRVGDGAS 897
Query: 883 TDFWWDCWCGDVSLCE--RFSRLYDLTVNKLISVRNMILLGVDVGGEALRWRRRLWAWEE 940
W D W D R + +YD+T + V+ ++ R W+E
Sbjct: 898 LFVWIDPWIDDNGFRAPWRKNLIYDVT----LKVKALL-------------NPRTGFWDE 940
Query: 941 ELVE-----EFXALLLTVSLQESVTDRWIWLPTQDDGYSVRGAYDMLTSQEQPHLHQNLE 995
E++ E + + S D ++W + +SV+ AY + + +L +
Sbjct: 941 EVLHDLFLPEDILRIKAIKPVISQADFFVWKLNKSGDFSVKSAYWLAYQTKSQNLRSEVS 1000
Query: 996 L----------IWHTQVPLKVSILAWRLLRDRLPTKDNLANRGI 1029
+ +W+ Q K+ I W++L LP +NL RG+
Sbjct: 1001 MQPSTLGLKTQVWNLQTDPKIKIFLWKVLSGILPVAENLNGRGM 1044
>UniRef100_Q8LJX1 Putative reverse transcriptase [Sorghum bicolor]
Length = 1998
Score = 382 bits (980), Expect = e-104
Identities = 302/1071 (28%), Positives = 484/1071 (44%), Gaps = 37/1071 (3%)
Query: 1 VGASRGMVTIWDRVEVEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLWN 60
VGAS G++T W + + I + +N+ + + N+YAPC K+ N
Sbjct: 871 VGASGGILTAWKSKLFSGELIFSNEYAISIQMSSMHNNDSWLLTNVYAPCTDEGKRQFVN 930
Query: 61 SLSSRLLSLGSSKVCVCGDFNTVRSVDERRSVRGSQVVDDCAPFNDFIEDRVLINLPLCG 120
+ + + V GDFN +R ++R G + FN I L + L G
Sbjct: 931 WFKN-IQTPHEVDWLVVGDFNLMRKHEDRNKEGGDS--SEMFLFNSAISSLGLNEIILQG 987
Query: 121 RKFTWYNGDGRSM-STLDRFLLSEEWCMVWPTCLQVAQLRGLSDHCPIVLTVDEKNWGPR 179
RK+TW N + +D S W + + SDH P ++ V K +
Sbjct: 988 RKYTWSNMQSNPLLQKIDWTFTSNSWNIKFLDTSVKGYDMTPSDHAPCLVKVSTKIPKTK 1047
Query: 180 PVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVANTLNLPSK 239
R CW +PG+ + + W + K K ++ ALKE A+ +L +
Sbjct: 1048 VFRFENCWLRLPGFQQIISDVWNVPTHHLDQAKSITAKFKKLRKALKEKQ-ASMAHLKTV 1106
Query: 240 IVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQSRLNWLRDGDA 299
I +K L+ E +LS E + + L Q+ + W++ GDA
Sbjct: 1107 ISKVKLVIQFLELLEEYRDLSMLEWNFRAILKKKLTELLEQQKIYWRQRGAIKWIKLGDA 1166
Query: 300 NSKFFHSVLAARRRINSLSSILV-DGVVVEGVQPVRQAVFTHSKNHFLAPIMTRPSVGHL 358
+KFFH+ + R N ++ + DG ++ + Q ++ K TR +
Sbjct: 1167 TTKFFHANATIKMRGNLVNQLQKQDGSILTAHKDKEQHIWEEFKERLGTKEETRFGINPD 1226
Query: 359 QFRKLSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFWHEMKAEIVRFV 418
+ S+ + L +PF E EI++ V + K PGPDG + F+K+ W +K +I
Sbjct: 1227 TILQASN-DLETLEEPFTETEIESVVKLLPNDKSPGPDGFSNEFMKASWQTVKVDIFNLC 1285
Query: 419 TEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKILANRLRQVIGS 478
FH + + IN+ FI LIPK +P S+N++RPISL+ + K++ K+LANRL+ I
Sbjct: 1286 HAFHNHNVCLRSINTSFITLIPKSHNPVSVNDYRPISLLNTSMKLITKLLANRLQMKITD 1345
Query: 479 VVSEVQSAFVKNRQILDGILITNEVVDEARKSKKDLLLFKVDFEKAYDSVDWGYLDEVMG 538
+V + Q F++ R I D + + E + S+K +++ K+DFEKA+D ++ + +M
Sbjct: 1346 LVHKNQYGFIRQRTIQDCLAWSFEYLHLCHHSRKAIVIIKLDFEKAFDKMNHKAMLTIMK 1405
Query: 539 SMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLSPFLFLLAAEGLNVL 598
+ F W W+K + T++VL+NG P RG+RQGDPLSP LF+LAA+ L L
Sbjct: 1406 AKGFGQKWLNWMKSIFTSGTSNVLLNGTPGKTIHCLRGVRQGDPLSPLLFVLAADFLQSL 1465
Query: 599 MKALVDAGLFK---GYSVGRADPVVVSHLQFADDTLLIGNKSWANVRALRAGLILLEAMS 655
+ L K + P++ Q+ADDTL+I + L++ + +
Sbjct: 1466 INKAKCMDLLKLPIPMQTNQDFPII----QYADDTLIIAEGDTRQLLILKSIINTFSEAT 1521
Query: 656 GLKVNFHKSSLVGVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGGDPRRLLFWEPVVNR 715
GLKVN KS ++ +N+ L A GC G PF YLGL +G + ++P++N+
Sbjct: 1522 GLKVNLQKSMMLPINMDEERLDTLARTFGCSKGSFPFTYLGLPLGITKPTIQDYQPLINK 1581
Query: 716 IKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSIESLFNKFFW-GGG 774
+ RL G + FLS GRL L V TALP + + P +I I+ W G G
Sbjct: 1582 CEARL-GSVATFLSEAGRLELTNAVFTALPTFYMCTLAIPKSVIHKIDKFRKHCLWRGNG 1640
Query: 775 EDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKEALWRKVLVARYG 834
+ RK W + K GGLGV + + N ALL K + +K+ W +++ ++
Sbjct: 1641 INARKPPKAAWKLVCKPKNEGGLGVIDIEKQNEALLMKNLDKFFNKKDTPWVQMIWDKH- 1699
Query: 835 VADGGLEDGGRSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRVGDGAETDFWWDCWCGDV 894
A G L R S WWR+I+++ D E S ++ + G+ FW D W G
Sbjct: 1700 YAKGKLPGQIRKGSFWWRDILKLLDPYKE------MSLIQMKGGESIV--FWQDKW-GMQ 1750
Query: 895 SLCERFSRLYDLTVNKLISVRNMILLGVDVGGEALRWRRRLWAWEEELVEEFXALLLTVS 954
L L+ +K +S +V G+ R E + L +
Sbjct: 1751 KLRLEMPELFSFVQHKFLS-------AAEVLGQEDFTRLLQLPISETAFHQLQQLTTRID 1803
Query: 955 LQESVTDRWIWLPTQDDGYSVRGAYDMLTSQEQPHLHQNLELIWHTQVPLKVSILAWRLL 1014
E ++ W + +S AY L +Q +H+ + IW + K + W L+
Sbjct: 1804 NLERTQEQDQWKYKWGNNFSATKAYSELMGHQQ--IHEVHKWIWKSFCQPKHKVFFWLLI 1861
Query: 1015 RDRLPTKDNLANRGILPLEARMCVSSCGN-EEDVNHLFLSCATFSALWPLV 1064
+DRL T N+ R + L++ CV N EE +HLFL CA W L+
Sbjct: 1862 KDRLST-CNILKRKNMQLQSYNCVLCLQNSEETTSHLFLHCAYARQCWQLL 1911
>UniRef100_Q9FYK1 F21J9.30 [Arabidopsis thaliana]
Length = 1270
Score = 377 bits (969), Expect = e-103
Identities = 305/1048 (29%), Positives = 475/1048 (45%), Gaps = 82/1048 (7%)
Query: 1 VGASRGMVTIWDRVEVEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLWN 60
VG G+ +W + V ++ V L+ + F + +Y DR + W
Sbjct: 25 VGKCGGLALLW---KSSVQVDLKFVDKNLMDAQVQFGAVNFCVSCVYGDPDRSKRSQAWE 81
Query: 61 SLSSRLLSLGS-SKVCVCGDFNTVRSVDERRSVRGSQVVD-DCAPFNDFIEDRVLINLPL 118
+S + +G K C+ GDFN + E+ G + D DC FN+ I+ L+ +P
Sbjct: 82 RISR--IGVGRRDKWCMFGDFNDILHNGEKNG--GPRRSDLDCKAFNEMIKGCDLVEMPA 137
Query: 119 CGRKFTWYN--GDGRSMSTLDRFLLSEEWCMVWPTCLQV-AQLRGLSDHCPIVLTVDEKN 175
G FTW GD LDR ++EW +P Q RG SDH P+++ +
Sbjct: 138 HGNGFTWAGRRGDHWIQCRLDRAFGNKEWFCFFPVSNQTFLDFRG-SDHRPVLIKLMSSQ 196
Query: 176 WGPR-PVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVANTL 234
R R K + + + W + + ++L+ + +L W N L
Sbjct: 197 DSYRGQFRFDKRFLFKEDVKEAIIRTWSRGKHGT--NISVADRLRACRKSLSSWKKQNNL 254
Query: 235 NLPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQSRLNWL 294
N KI L+ A L+ +E+ L + + D+ R + Q+SR WL
Sbjct: 255 NSLDKINQLE---AALE---KEQSLVWPIFQRVSVLKKDLAKAYREEEAYWKQKSRQKWL 308
Query: 295 RDGDANSKFFHSVLAA---RRRINSLSSILVDGVVVEGVQPVRQAVFTHSKNHFLAPIMT 351
R G+ NSK+FH+ + R+RI L V+G + + + N F + +
Sbjct: 309 RSGNRNSKYFHAAVKQNRQRKRIEKLKD--VNGNMQTSEAAKGEVAAAYFGNLFKS---S 363
Query: 352 RPSVGHLQFR----KLSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFW 407
PS F ++S V L+ +EIK AV+ PGPDG++ F + +W
Sbjct: 364 NPSGFTDWFSGLVPRVSEVMNESLVGEVSAQEIKEAVFSIKPASAPGPDGMSALFFQHYW 423
Query: 408 HEMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKI 467
+ ++ V +F + + N + LIPK P + + RPISL LYKI++KI
Sbjct: 424 STVGNQVTSEVKKFFADGIMPAEWNYTHLCLIPKTQHPTEMVDLRPISLCSVLYKIISKI 483
Query: 468 LANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEAR---KSKKDLLLFKVDFEKA 524
+A RL+ + +VS+ QSAFV R I D IL+ +E+V + + + + K D KA
Sbjct: 484 MAKRLQPWLPEIVSDTQSAFVSERLITDNILVAHELVHSLKVHPRISSEFMAVKSDMSKA 543
Query: 525 YDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLS 584
YD V+W YL ++ S+ F W WI CV + T SVL+N CP L+RGLRQGDPLS
Sbjct: 544 YDRVEWSYLRSLLLSLGFHLKWVNWIMVCVSSVTYSVLINDCPFGLIILQRGLRQGDPLS 603
Query: 585 PFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRAL 644
PFLF+L EGL L+ G +G P+V HL FADD+L + S L
Sbjct: 604 PFLFVLCTEGLTHLLNKAQWEGALEGIQFSENGPMV-HHLLFADDSLFLCKASREQSLVL 662
Query: 645 RAGLILLEAMSGLKVNFHKSSLV-GVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGGDP 703
+ L + +G +N +KSS+ G + + LG YLGL
Sbjct: 663 QKILKVYGNATGQTINLNKSSITFGEKVDEQLKGTIRTCLGIFTEGGAGTYLGLPECFSG 722
Query: 704 RRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSIE 763
++ + +R+K +L W +R LS GG+ VLLK V A+PV+A+S FK P ++E
Sbjct: 723 SKVDMLHYLKDRLKEKLDVWFTRCLSQGGKEVLLKSVALAMPVFAMSCFKLPITTCENLE 782
Query: 764 SLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKEA 823
S F+W + RKI W W+ L L K+ GGLG ++ FN ALL K WRLL +
Sbjct: 783 SAMASFWWDSCDHSRKIHWQSWERLCLPKDSGGLGFRDIQSFNQALLAKQAWRLLHFPDC 842
Query: 824 LWRKVLVARYGVADGGLEDG-GRSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRVGDGAE 882
L ++L +RY A L+ + S WR I+ R+ + +G +++RVGDGA
Sbjct: 843 LLSRLLKSRYFDATDFLDAALSQRPSFGWRSILFGRELLSKG--------LQKRVGDGAS 894
Query: 883 TDFWWDCWCGDVSLCE--RFSRLYDLTVNKLISVRNMILLGVDVGGEALRWRRRLWAWEE 940
W D W D R + +YD+T + V+ ++ R W+E
Sbjct: 895 LFVWIDPWIDDNGFRAPWRKNLIYDVT----LKVKALL-------------NPRTGFWDE 937
Query: 941 ELVE-----EFXALLLTVSLQESVTDRWIWLPTQDDGYSVRGAYDMLTSQEQPHLHQNLE 995
E++ E + + S D ++W + +SV+ AY + + +L +
Sbjct: 938 EVLHDLFLPEDILRIKAIKPVISQADFFVWKLNKSGDFSVKSAYWLAYQTKSQNLRSEVS 997
Query: 996 L----------IWHTQVPLKVSILAWRL 1013
+ +W+ Q K+ I W++
Sbjct: 998 MQPSTLGLKTQVWNLQTDPKIKIFLWKV 1025
>UniRef100_O82276 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1231
Score = 377 bits (968), Expect = e-102
Identities = 305/1054 (28%), Positives = 471/1054 (43%), Gaps = 121/1054 (11%)
Query: 42 YIFNMYAPCDRRAKQVLWNSLSSRLLSLGSSKVCVCGDFNTVRSVDERRSVRGSQVVDDC 101
++ +YA + LW L + L + + GDFNT+ VDER G D
Sbjct: 2 HLIVVYAAPSVSRRSGLWGELKDVVNGL-EGPLLIGGDFNTILWVDERMGGNGRLSPDSL 60
Query: 102 APFNDFIEDRVLINLPLCGRKFTWYNGDGRSM---STLDRFLLSEEWCMVWPTCLQVAQL 158
A F D+I + LI+L G KFTW G S LDR + + W +
Sbjct: 61 A-FGDWINELSLIDLGFKGNKFTWRRGRQESTVVAKRLDRVFVCAHARLKWQEAVVSHLP 119
Query: 159 RGLSDHCPIVLTVDEKNWGPRPVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKL 218
SDH P+ + ++ P R L+ W RE +G V KEKL
Sbjct: 120 FMASDHAPLYVQLE-----PLQQRKLRKWN---------REVFGDIHVR-------KEKL 158
Query: 219 KLMKAALKEWHVANTLNLPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLS 278
+ +K + +L ++ L++EE+ + ++ +
Sbjct: 159 ---------------------VADIKEVQDLLGVVLSDDLLAKEEV-----LLKEMDLVL 192
Query: 279 RLNTSICWQQSRLNWLRDGDANSKFFHSVLAARRRINSLSSILVDGV--VVEGVQPVRQA 336
++ +Q+SR ++ GD N+ FFH+ RRR N + S+ D V + V+ A
Sbjct: 193 EQEETLWFQKSREKYIELGDRNTTFFHTSTIIRRRRNRIESLKGDDDRWVTDKVELEAMA 252
Query: 337 VFTHSKNHFLAPIM-TRPSVGHLQFRKLSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGP 395
+ + + + L + R + F +S E+ L++ F + E+ +AV +K PGP
Sbjct: 253 LTYYKRLYSLEDVSEVRNMLPTGGFASISEAEKAALLQAFTKAEVVSAVKSMGRFKAPGP 312
Query: 396 DGINLGFIKSFWHEMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPIS 455
DG F + W + + RFV EF L N + LI KVA P+ + +FRP+S
Sbjct: 313 DGYQPVFYQQCWETVGPSVTRFVLEFFETGVLPASTNDALLVLIAKVAKPERIQQFRPVS 372
Query: 456 LVGSLYKILAKILANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEAR--KSKKD 513
L L+KI+ K++ RL+ VI ++ Q++F+ R +D I++ E V R K +K
Sbjct: 373 LCNVLFKIITKMMVTRLKNVISKLIGPAQASFIPGRLSIDNIVLVQEAVHSMRRKKGRKG 432
Query: 514 LLLFKVDFEKAYDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFL 573
+L K+D EKAYD V W +L E + + W I V + SVL NG TD F
Sbjct: 433 WMLLKLDLEKAYDRVRWDFLQETLEAAGLSEGWTSRIMAGVTDPSMSVLWNGERTDSFVP 492
Query: 574 ERGLRQGDPLSPFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLI 633
RGLRQGDPLSP+LF+L E L L++A V +K +V +SH+ FADD +L
Sbjct: 493 ARGLRQGDPLSPYLFVLCLERLCHLIEASVGKREWKPIAVS-CGGSKLSHVCFADDLILF 551
Query: 634 GNKSWANVRALRAGLILLEAMSGLKVNFHKSSLV---GVNITGSWLSEAASVLGC--KVG 688
S A +R +R L SG KV+ KS + V+ L S +GC ++G
Sbjct: 552 AEASVAQIRIIRRVLERFCEASGQKVSLEKSKIFFSHNVSREMEQLISEESGIGCTKELG 611
Query: 689 KIPFLYLGLSIGGDPRRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYA 748
K YLG+ I + V+ R+ RL+GW+ R LS GR+ L K VL+++PV+
Sbjct: 612 K----YLGMPILQKRMNKETFGEVLERVSARLAGWKGRSLSLAGRITLTKAVLSSIPVHV 667
Query: 749 LSFFKAPSGIISSIESLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLA 808
+S P + +++ F WG +K+K + W + K GG+G+ R+ N A
Sbjct: 668 MSAILLPVSTLDTLDRYSRTFLWGSTMEKKKQHLLSWRKICKPKAEGGIGLRSARDMNKA 727
Query: 809 LLGKWCWRLLLEKEALWRKVLVARYGVADGGLEDGG------RSCSSWWREIVRIRDGIG 862
L+ K WRLL +KE+LW +V+ +Y V GG++D R S+W V +R+ +
Sbjct: 728 LVAKVGWRLLQDKESLWARVVRKKYKV--GGVQDTSWLKPQPRWSSTWRSVAVGLREVVV 785
Query: 863 EGGEGWFGSCVRRRVGDGAETDFWWDCWCGDVSLCERFSRLYDLTVNKLISVRNMILLGV 922
+ G GW GDG FW D W L E LG
Sbjct: 786 K-GVGWV-------PGDGCTIRFWLDRWLLQEPLVE---------------------LGT 816
Query: 923 DV--GGEALRWRRRLW----AWEEELV-----EEFXALLLTVSLQESV--TDRWIWLPTQ 969
D+ GE ++ W W E++ E LL+V +Q + D W TQ
Sbjct: 817 DMIPEGERIKVAADYWLPGSGWNLEILGLYLPETVKRRLLSVVVQVFLGNGDEISWKGTQ 876
Query: 970 DDGYSVRGAYDMLTSQ--EQPHLHQNLELIWHTQVPLKVSILAWRLLRDRLPTKDNLANR 1027
D ++VR AY +L ++P++ IW P +V + W + ++ + T R
Sbjct: 877 DGAFTVRSAYSLLQGDVGDRPNMGSFFNRIWKLITPERVRVFIWLVSQNVIMTNVERVRR 936
Query: 1028 GILPLEARMCVSSCGNEEDVNHLFLSCATFSALW 1061
+ E +C G EE + H+ C +W
Sbjct: 937 HL--SENAICSVCNGAEETILHVLRDCPAMEPIW 968
>UniRef100_Q9SKJ4 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1750
Score = 376 bits (966), Expect = e-102
Identities = 306/1126 (27%), Positives = 516/1126 (45%), Gaps = 91/1126 (8%)
Query: 2 GASRGMVTIWDRVEVEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLWNS 61
G S G+ +W V ++ LI +N+ FY+ +Y + + LW +
Sbjct: 444 GRSGGLALMWKN---NVSLSLISQDERLIDSHVTFNNKSFYLSCVYGHPTQSERHQLWQT 500
Query: 62 LSSRLLSLGSSKVCVCGDFNTVRSVDERRSVRGSQVVDDCA--PFNDFIEDRVLINLPLC 119
L + +++ + GDFN + S E+ G + ++ F + + + ++
Sbjct: 501 LE-HISDNRNAEWLLVGDFNEILSNAEKI---GGPMREEWTFRNFRNMVSHCDIEDMRSK 556
Query: 120 GRKFTWYNGDGRSMST---LDRFLLSEEWCMVWPTC-LQVAQLRGLSDHCPIVLTVDEKN 175
G +F+W G+ + + LDR ++ W +P ++ G SDH P+++ +E
Sbjct: 557 GDRFSWV-GERHTHTVKCCLDRVFINSAWTATFPYAEIEFLDFTG-SDHKPVLVHFNESF 614
Query: 176 WGPRPVRLLKCWQ---DMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVAN 232
PR +L + D+P + V+ W + + S + E++ + A+ A+
Sbjct: 615 --PRRSKLFRFDNRLIDIPTFKRIVQTSWRTNRNSR--STPITERISSCRQAMARLKHAS 670
Query: 233 TLNLPSKIVSLKN--RRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQ-S 289
LN +I L++ RA+ ++ + +L + L SD I W+Q S
Sbjct: 671 NLNSEQRIKKLQSSLNRAMESTRRVDRQLIPQLQESLAKAFSD--------EEIYWKQKS 722
Query: 290 RLNWLRDGDANSKFFHSVLAARRRINSLSSILVD-GVVVEGVQPVRQAVFTHSKNHFLAP 348
R W+++GD N+ +FH+ R N +++I+ D G + G + + N F
Sbjct: 723 RNQWMKEGDQNTGYFHACTKTRYSQNRVNTIMDDQGRMFTGDKEIGNHAQDFFTNIFSTN 782
Query: 349 IMTRPSVGHLQFRK-LSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFW 407
+ + F+ +++ L K F + EI A+ K PGPDG+ F K+ W
Sbjct: 783 GIKVSPIDFADFKSTVTNTVNLDLTKEFSDTEIYDAICQIGDDKAPGPDGLTARFYKNCW 842
Query: 408 HEMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKI 467
+ +++ V +F + IN I +IPK+ +P +L+++RPI+L LYK+++K
Sbjct: 843 DIVGYDVILEVKKFFETSFMKPSINHTNICMIPKITNPTTLSDYRPIALCNVLYKVISKC 902
Query: 468 LANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARKSK---KDLLLFKVDFEKA 524
L NRL+ + S+VS+ Q+AF+ R I D ++I +EV+ + K K + K D KA
Sbjct: 903 LVNRLKSHLNSIVSDSQAAFIPGRIINDNVMIAHEVMHSLKVRKRVSKTYMAVKTDVSKA 962
Query: 525 YDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLS 584
YD V+W +L+ M F W WI V + SVL+NG P RG+RQGDPLS
Sbjct: 963 YDRVEWDFLETTMRLFGFCNKWIGWIMAAVKSVHYSVLINGSPHGYITPTRGIRQGDPLS 1022
Query: 585 PFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRAL 644
P+LF+L + L+ L+ +G +G +G P + +HLQFADD+L + N +AL
Sbjct: 1023 PYLFILCGDILSHLINGRASSGDLRGVRIGNGAPAI-THLQFADDSLFFCQANVRNCQAL 1081
Query: 645 RAGLILLEAMSGLKVNFHKSSLV-GVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGGDP 703
+ + E SG K+N KS + G + GS S +L YLGL
Sbjct: 1082 KDVFDVYEYYSGQKINVQKSMITFGSRVYGSTQSRLKQILEIPNQGGGGKYLGLPEQFGR 1141
Query: 704 RRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSIE 763
++ +E +++R+K R S W +RFLS G+ ++LK V A+PVYA+S FK P GI+S IE
Sbjct: 1142 KKKEMFEYIIDRVKKRTSTWSARFLSPAGKEIMLKSVALAMPVYAMSCFKLPKGIVSEIE 1201
Query: 764 SLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKEA 823
SL F+W ++R I W+ W L K+ GGLG L +FN ALL K WRL+ +
Sbjct: 1202 SLLMNFWWEKASNQRGIPWVAWKRLQYSKKEGGLGFRDLAKFNDALLAKQAWRLIQYPNS 1261
Query: 824 LWRKVLVARYGVADGGLEDGGRSCSSW-WREIVRIRDGIGEGGEGWFGSCVRRRVGDGAE 882
L+ +V+ ARY L+ R S+ W ++ DGI +G R +GDG
Sbjct: 1262 LFARVMKARYFKDVSILDAKVRKQQSYGWASLL---DGIALLKKG-----TRHLIGDGQN 1313
Query: 883 TDFWWDCWCGD-----VSLCERFSRLYDLTVNKLISVRNMILLGVDVGGEALRWRRRLWA 937
D ++ E + ++T+N L + +
Sbjct: 1314 IRIGLDNIVDSHPPRPLNTEETYK---EMTINNLFERKG-----------------SYYF 1353
Query: 938 WEEELVEEF-----XALLLTVSLQES-VTDRWIWLPTQDDGYSVRGAYDMLTSQEQPHL- 990
W++ + +F + + L +S D+ IW Y+VR Y +LT ++
Sbjct: 1354 WDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIP 1413
Query: 991 -----HQNLEL---IWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILPLEARMCVSSCG 1042
H +++L IW+ + K+ WR L L T + L RG+ + C
Sbjct: 1414 AINPPHGSIDLKTRIWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPS--CPRCHR 1471
Query: 1043 NEEDVNHLFLSCATFSALWPLVQAWLGVVGVDSQSTSDHLVQFINY 1088
E +NH +C + W L + L + S +++ +N+
Sbjct: 1472 ENESINHALFTCPFATMAWRLSDSSLIRNQLMSNDFEENISNILNF 1517
>UniRef100_Q9C6L3 Hypothetical protein F2J7.11 [Arabidopsis thaliana]
Length = 1213
Score = 373 bits (958), Expect = e-101
Identities = 293/1040 (28%), Positives = 493/1040 (47%), Gaps = 49/1040 (4%)
Query: 46 MYAPCDRRAKQVLWNSLSSRLLS--LGSSKVCVCGDFNTVRSVDERRSVRGSQVVDDCAP 103
+YA + +++ LW + + ++S +G V GDFN V + E + V +
Sbjct: 109 VYAANEVASRKELWIEIVNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRD 168
Query: 104 FNDFIEDRVLINLPLCGRKFTWYNGDGRS--MSTLDRFLLSEEWCMVWPTCLQVAQLRGL 161
F D + L +L G FTW+N + +DR L+++ W ++P+ L +
Sbjct: 169 FRDCLLAAELSDLRYKGNTFTWWNKSHTTPVAKKIDRILVNDSWNALFPSSLGIFGSLDF 228
Query: 162 SDHCPIVLTVDEKNW-GPRPVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKL 220
SDH + ++E + RP + + + VR+ W + V G F + +KLK
Sbjct: 229 SDHVSCGVVLEETSIKAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKA 288
Query: 221 MKAALKEWHVANTLNLPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRL 280
+K +K++ N L + + + + EL H L+
Sbjct: 289 LKKPIKDFSRLNYSELEKRTKEAHDFLIGCQDRTLADPTPINASFELEA-ERKWHILTAA 347
Query: 281 NTSICWQQSRLNWLRDGDANSKFFHSVLAARRRINSLSSI------LVDGVVVEGVQPVR 334
S Q+SR++W +GD N+K+FH + AR NS+S++ LVD EG+ +
Sbjct: 348 EESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQ--EGILDLC 405
Query: 335 QAVFTHSKNHFLAP-IMTRPSVGHLQFRKLSSVERGGLIKPFHEEEIKAAVWDCDSYKIP 393
+ F + P +M + + L + S + L F E+I+AA++ K
Sbjct: 406 ASYFGSLLGDEVDPYLMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSC 465
Query: 394 GPDGINLGFIKSFWHEMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRP 453
GPDG F W + AE+ + EF + L K N+ I LIPK+ +P ++FRP
Sbjct: 466 GPDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRP 525
Query: 454 ISLVGSLYKILAKILANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARKSK-K 512
IS + +LYK++A++L +RL++++ V+S QSAF+ R + + +L+ ++V S
Sbjct: 526 ISCLNTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNIS 585
Query: 513 DLLLFKVDFEKAYDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFF 572
+ KVD +KA+DSV W ++ + ++ P + WI +C+ T T +V +NG F
Sbjct: 586 PRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFK 645
Query: 573 LERGLRQGDPLSPFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLL 632
+GLRQGDPLSP+LF+LA E + L+ + ++GL + +A + +SHL FADD ++
Sbjct: 646 STKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHP--KASNLSISHLMFADDVMI 703
Query: 633 IGNKSWANVRALRAGLILLEAMSGLKVNFHKSSL--VGVNITGSWLSEAASVLGCKVGKI 690
+ ++ + L + SGLKVN KS L G+N S A + G +G +
Sbjct: 704 FFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLN---QLESNANAAYGFPIGTL 760
Query: 691 PFLYLGLSIGGDPRRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALS 750
P YLGL + R+ +EP++ +I R W ++ LSF GR+ L+ V+ + +S
Sbjct: 761 PIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMS 820
Query: 751 FFKAPSGIISSIESLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALL 810
F P G I IESL ++F W G ++ K + W L L K GGLG+ RL E+N L
Sbjct: 821 TFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLS 880
Query: 811 GKWCWRLLLEKEALWRKVLVARYGVADGGL--EDGGRSCSSWWREIVRIRDGIGEGGEGW 868
+ WRL + K++LW + ++ G +GG+S S W+ ++ +R +
Sbjct: 881 MRLIWRLFVAKDSLWADWQHLHH-LSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQ----- 934
Query: 869 FGSCVRRRVGDGAETDFWWDCWCGDVSLCERFSRLYDLTVNKLISVRNMILLGVDVGGEA 928
F C +VG+G + D+W+D W SL F + D+ + S+R +L V
Sbjct: 935 FLVC---KVGNGLKADYWYDNW---TSLGPLFRIIGDIGPS---SLRVPLLAKVASAFSE 985
Query: 929 LRWRRRL--WAWEEELVEEFXALLLTVSLQESVTDRWIWLPT--QDDGYSVRGAYDMLTS 984
WR + A + + + + + + QE V DR+ W G+S ++ +
Sbjct: 986 DGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDV-DRYEWSVNGFLCQGFSAAKTWEAI-- 1042
Query: 985 QEQPHLHQNLELIWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILPLEARMCVSSCGNE 1044
+ + + IW K + W +RL T+ LA+ G + +A CV
Sbjct: 1043 RPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDA--CVLCSFAS 1100
Query: 1045 EDVNHLFLSCATFSALWPLV 1064
E +HL L C + +W LV
Sbjct: 1101 ESRDHLLLICEFSAQVWRLV 1120
>UniRef100_Q9SJ38 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1229
Score = 370 bits (950), Expect = e-100
Identities = 288/986 (29%), Positives = 442/986 (44%), Gaps = 72/986 (7%)
Query: 104 FNDFIEDRVLINLPLCGRKFTW--YNGDGRSMSTLDRFLLSEEWCMVWPTCLQVAQLRGL 161
F FI L +L G F+W D LDR + + W +P+
Sbjct: 58 FRSFISKNGLWDLKYSGNPFSWRGMRYDWFVRQRLDRAMSNNSWLESFPSGRSEYLRFEG 117
Query: 162 SDHCPIVLTVDEKNWGPR-PVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKL 220
SDH P+V+ VDE R R +D + ++E W + G + K+
Sbjct: 118 SDHRPLVVFVDEARVKRRGQFRFDNRLRDNDVVNALIQETW-----TNAGDASVLTKMNQ 172
Query: 221 MKAALKEWHVANTLNLPSKIVSLKNRRAVLDSKGEEEELSEEELGE--LHGISSDIHSLS 278
+ + W LN I + K EE L+ + + +++ +
Sbjct: 173 CRREIINWTRLQNLNSAELIEKTQ--------KALEEALTADPPNPTTIGALTATLEHAY 224
Query: 279 RLNTSICWQQSRLNWLRDGDANSKFFHSVLAARRRINSLSSILVDGVVVEGVQPVRQAVF 338
+L Q+SR+ WL GD N+ +FH+V RR N L+ ++ D + GV +
Sbjct: 225 KLEEQFWKQRSRVLWLHSGDRNTGYFHAVTRNRRTQNRLT-VMED---INGVAQHEEHQI 280
Query: 339 THSKNHFLAPIMTRPSVGHLQF------RKLSSVERGGLIKPFHEEEIKAAVWDCDSYKI 392
+ + + I T S G +S + L + ++EE+K AV+ ++ K
Sbjct: 281 SQIISGYFQQIFTSESDGDFSVVDEAIEPMVSQGDNDFLTRIPNDEEVKDAVFSINASKA 340
Query: 393 PGPDGINLGFIKSFWHEMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFR 452
PGPDG GF S+WH + ++ R + F ++ + +N I LIPK P+ + ++R
Sbjct: 341 PGPDGFTAGFYHSYWHIISTDVGREIRLFFTSKNFPRRMNETHIRLIPKDLGPRKVADYR 400
Query: 453 PISLVGSLYKILAKILANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARKS-- 510
PI+L YKI+AKI+ R++ ++ ++SE QSAFV R I D +LIT+EV+ R S
Sbjct: 401 PIALCNIFYKIVAKIMTKRMQLILPKLISENQSAFVPGRVISDNVLITHEVLHFLRTSSA 460
Query: 511 -KKDLLLFKVDFEKAYDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTD 569
K + K D KAYD V+W +L +V+ F ++W W+ ECV + + S L+NG P
Sbjct: 461 KKHCSMAVKTDMSKAYDRVEWDFLKKVLQRFGFHSIWIDWVLECVTSVSYSFLINGTPQG 520
Query: 570 EFFLERGLRQGDPLSPFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADD 629
+ RGLRQGDPLSP LF+L E L+ L G V P V+HL FADD
Sbjct: 521 KVVPTRGLRQGDPLSPCLFILCTEVLSGLCTRAQRLRQLPGVRVSINGP-RVNHLLFADD 579
Query: 630 TLLIGNKSWANVRALRAGLILLEAMSGLKVNFHKSSLVGVNIT-GSWLSEAASVLGCKVG 688
T+ + L L SG +NFHKSS+ + T S + +L +
Sbjct: 580 TMFFSKSDPESCNKLSEILSRYGKASGQSINFHKSSVTFSSKTPRSVKGQVKRILKIRKE 639
Query: 689 KIPFLYLGLSIGGDPRRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYA 748
YLGL R+ + ++++I+ + W SRFLS G+ V+LK VL ++P+Y+
Sbjct: 640 GGTGKYLGLPEHFGRRKRDIFGAIIDKIRQKSHSWASRFLSQAGKQVMLKAVLASMPLYS 699
Query: 749 LSFFKAPSGIISSIESLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLA 808
+S FK PS + I+SL +F+W D RK SW+ W L+ K GGLG + N +
Sbjct: 700 MSCFKLPSALCRKIQSLLTRFWWDTKPDVRKTSWVAWSKLTNPKNAGGLGFRDIERCNDS 759
Query: 809 LLGKWCWRLLLEKEALWRKVLVARYGVADGGLEDGGRS-CSSWWREIVRIRDGIGEGGEG 867
LL K WRLL E+L ++L+ +Y + +E S S WR I+ R+ + E G G
Sbjct: 760 LLAKLGWRLLNSPESLLSRILLGKYCHSSSFMECKLPSQPSHGWRSIIAGREILKE-GLG 818
Query: 868 WFGSCVRRRVGDGAETDFWWDCWCGD----VSLCERFSRLYDLTVNKLISVRNMILLGVD 923
W + +G + W D W V + DL V+ LI+ +
Sbjct: 819 WL-------ITNGEKVSIWNDPWLSISKPLVPIGPALREHQDLRVSALINQNTL------ 865
Query: 924 VGGEALRWRRRLWAWEE--ELVEEFXALLLTVSLQES-VTDRWIWLPTQDDGYSVRGAYD 980
W W + ++ + L+ + S D+ WLP + Y+ R Y
Sbjct: 866 -----------QWDWNKIAVILPNYENLIKQLPAPSSRGVDKLAWLPVKSGQYTSRSGYG 914
Query: 981 M--LTSQEQPHLHQNLEL-IWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILPLEARMC 1037
+ + S P N + +W Q K+ L W+ + LP L R I P A
Sbjct: 915 IASVASIPIPQTQFNWQSNLWKLQTLPKIKHLMWKAAMEALPVGIQLVRRHISPSAA--- 971
Query: 1038 VSSCGNEEDVNHLFLSCATFSALWPL 1063
CG E HLF C + +W L
Sbjct: 972 CHRCGAPESTTHLFFHCEFAAQVWEL 997
>UniRef100_Q9SZ87 RNA-directed DNA polymerase-like protein [Arabidopsis thaliana]
Length = 1274
Score = 369 bits (946), Expect = e-100
Identities = 307/1057 (29%), Positives = 480/1057 (45%), Gaps = 83/1057 (7%)
Query: 34 FLKSNEEFYIFNMYAPCDRRAKQVLWNSLSSRLLSLGSSKVCVCGDFNTVRSVDERRS-- 91
+ K N E I AP + V W+ +SS L + SS + GDFN + E++
Sbjct: 37 YWKENVEVEILEA-APNFIDNRSVFWDKISS-LGAQRSSAWLLTGDFNDILDNSEKQGGP 94
Query: 92 VRGSQVVDDCAPFNDFIEDRVLINLPLCGRKFTWYNGDGRS---MSTLDRFLLSEEWCMV 148
+R F F+ L ++ G +W G S S LDR L + W +
Sbjct: 95 LRWEGFF---LAFRSFVSQNGLWDINHTGNSLSW-RGTRYSHFIKSRLDRALGNCSWSEL 150
Query: 149 WPTC-LQVAQLRGLSDHCPIVLTVDEKNWG-PRPVRLLKCWQDMPGYSDFVREKWGSFQV 206
+P + + G SDH P+V +P R + ++ V+E W +
Sbjct: 151 FPMSKCEYLRFEG-SDHRPLVTYFGAPPLKRSKPFRFDRRLREKEEIRALVKEVWELARQ 209
Query: 207 SGWGGFVLKEKLKLMKAALKEWHVANTLNLPSKIVSLKNRRAVLDSKGEEEELSEEELGE 266
+ K+ + ++ +W N I K + L+S + +G
Sbjct: 210 DS-----VLYKISRCRQSIIKWTKEQNSNSAKAI---KKAQQALESALSADIPDPSLIGS 261
Query: 267 LHGISSDIHSLSRLNTSICWQQSRLNWLRDGDANSKFFHSVLAARRRINSLSSILVDGV- 325
I+ ++ + R Q SR+ WL GD N +FH+ RR +N+LS ++ DG
Sbjct: 262 ---ITQELEAAYRQEELFWKQWSRVQWLNSGDRNKGYFHATTRTRRMLNNLS-VIEDGSG 317
Query: 326 --------VVEGVQPVRQAVFTHSKNHFLAPIMTRPSVGHLQFRKLSSVERGGLIKPFHE 377
+ + Q +FT S N L + S +SS LIK
Sbjct: 318 QEFHEEEQIASTISSYFQNIFTTSNNSDLQVVQEALSP------IISSHCNEELIKISSL 371
Query: 378 EEIKAAVWDCDSYKIPGPDGINLGFIKSFWHEMKAEIVRFVTEFHRNRKLSKGINSRFIA 437
EIK A++ + K PGPDG + F ++W ++A++ R + F + LS +N +
Sbjct: 372 LEIKEALFSISADKAPGPDGFSASFFHAYWDIIEADVSRDIRSFFVDSCLSPRLNETHVT 431
Query: 438 LIPKVASPQSLNEFRPISLVGSLYKILAKILANRLRQVIGSVVSEVQSAFVKNRQILDGI 497
LIPK+++P+ ++++RPI+L YKI+AKIL RL+ + ++S QSAFV R I D +
Sbjct: 432 LIPKISAPRKVSDYRPIALCNVQYKIVAKILTRRLQPWLSELISLHQSAFVPGRAIADNV 491
Query: 498 LITNEVVDEARKS---KKDLLLFKVDFEKAYDSVDWGYLDEVMGSMTFPTVW*KWIKECV 554
LIT+E++ R S K + K D KAYD + W +L EV+ + F W +W+ +CV
Sbjct: 492 LITHEILHFLRVSGAKKYCSMAIKTDMSKAYDRIKWNFLQEVLMRLGFHDKWIRWVMQCV 551
Query: 555 GTATASVLVNGCPTDEFFLERGLRQGDPLSPFLFLLAAEGLNVLMKALVDAGLFKGYSVG 614
T + S L+NG P RGLRQGDPLSP+LF+L E L+ L + + G+ G V
Sbjct: 552 CTVSYSFLINGSPQGSVVPSRGLRQGDPLSPYLFILCTEVLSGLCRKAQEKGVMVGIRVA 611
Query: 615 RADPVVVSHLQFADDTLLIGNKSWANVRALRAGLILLEAMSGLKVNFHKSSLVGVNITGS 674
R P V+HL FADDT+ + AL L E SG +N KS++ + T
Sbjct: 612 RGSP-QVNHLLFADDTMFFCKTNPTCCGALSNILKKYELASGQSINLAKSAITFSSKTPQ 670
Query: 675 WLSEAASVLGCKVGKIPFL--YLGLSIGGDPRRLLFWEPVVNRIKNRLSGWQSRFLSFGG 732
+ L ++ + YLGL R+ + +V+RI+ R W RFLS G
Sbjct: 671 DIKRRVK-LSLRIDNEGGIGKYLGLPEHFGRRKRDIFSSIVDRIRQRSHSWSIRFLSSAG 729
Query: 733 RLVLLKFVLTALPVYALSFFKAPSGIISSIESLFNKFFWGGGEDKRKISWIRWDTLSLRK 792
+ +LLK VL+++P YA+ FK P+ + I+S+ +F+W DKRK++W+ WD L+L
Sbjct: 730 KQILLKAVLSSMPSYAMMCFKLPASLCKQIQSVLTRFWWDSKPDKRKMAWVSWDKLTLPI 789
Query: 793 EYGGLGVTRLREFNLALLGKWCWRLLLEKEALWRKVLVARYGVADGGLEDGGRS--CSSW 850
GGLG + K WR+L E +L +VL+ +Y ++ S
Sbjct: 790 NEGGLGFREIE-------AKLSWRILKEPHSLLSRVLLGKYCNTSSFMDCSASPSFASHG 842
Query: 851 WREIVRIRDGIGEGGEGWFGSCVRRRVGDGAETDFWWDCWCGDVSLCERFSRLYDLTVNK 910
WR I+ RD + + G GW +G G + W + W S NK
Sbjct: 843 WRGILAGRDLLRK-GLGW-------SIGQGDSINVWTEAWLSPSSPQTPIGP--PTETNK 892
Query: 911 LISVRNMILLGVDVGG-EALRWRRRLWAWEEELVEEFXALLLTVSLQESVTDRWIWLPTQ 969
+SV ++I V EA+ R+ L +E+++ + + + LQ+S+ +WLP +
Sbjct: 893 DLSVHDLICHDVKSWNVEAI--RKHLPQYEDQIRK---ITINALPLQDSL----VWLPVK 943
Query: 970 DDGYSVRGAYDM--LTSQEQPHLHQNLEL-IWHTQVPLKVSILAWRLLRDRLPTKDNLAN 1026
Y+ + Y + L S L N + IW KV W+ ++ LP + L+
Sbjct: 944 SGEYTTKTGYALAKLNSFPASQLDFNWQKNIWKIHTSPKVKHFLWKAMKGALPVGEALSR 1003
Query: 1027 RGILPLEARMCVSSCGNEEDVNHLFLSCATFSALWPL 1063
R I EA + CG E HL L C +W L
Sbjct: 1004 RNI---EAEVTCKRCGQTESSLHLMLLCPYAKKVWEL 1037
>UniRef100_O23757 Reverse transcriptase [Beta vulgaris subsp. vulgaris]
Length = 507
Score = 363 bits (933), Expect = 1e-98
Identities = 200/514 (38%), Positives = 284/514 (54%), Gaps = 11/514 (2%)
Query: 72 SKVCVCGDFNTVRSVDERRS-VRGSQVVDDCAPFNDFIEDRVLINLPLCGRKFTWYNGDG 130
S + GD+N ER S + SQ D F +FI + LI + FTW+ G
Sbjct: 4 SPCLIMGDYNETLIPAERGSHLISSQGARD---FQNFINNMGLIEISPIKGFFTWFRG-- 58
Query: 131 RSMSTLDRFLLSEEWCMVWPTCLQVAQLRGLSDHCPIVLTVDEKNWGPRPVRLLKCWQDM 190
+S S LDR + EW +P+ R SDHCP++ +NWGP+P R L CW
Sbjct: 59 QSKSKLDRLFVQPEWLTSFPSLKLNILKRSPSDHCPLLAFSSHQNWGPKPFRFLNCWLTH 118
Query: 191 PGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVANTLNLPSKIVSLKNRRAVL 250
P + E W S + +KL+ +K ALK+W+ + I + + L
Sbjct: 119 PQCMKIISEIWTKNHNSS-----VPDKLRTLKEALKQWNQKEFGIIDDNIKHCEEKIHFL 173
Query: 251 DSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQSRLNWLRDGDANSKFFHSVLAA 310
D+ L EL E ++ + + + S SR W+++GD+N+++FH++
Sbjct: 174 DNIANSRLLCTNELQERDDAQLNLWTWLKRSESYWALNSRARWIKEGDSNTRYFHTLATI 233
Query: 311 RRRINSLSSILVDGVVVEGVQPVRQAVFTHSKNHFLAPIMTRPSVGHLQFRKLSSVERGG 370
RR N L I +G + +++ + ++ F TRP L F KLS E
Sbjct: 234 RRHKNHLQYIKTEGKNISKPGEIKEEAVAYFQHIFAEEFSTRPVFNGLNFNKLSESECSS 293
Query: 371 LIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFWHEMKAEIVRFVTEFHRNRKLSKG 430
LI PF EEI AV CD+ K PGPDG N FIKS W +K +I + V +F + L G
Sbjct: 294 LIAPFTHEEIDEAVDSCDAQKAPGPDGFNFKFIKSSWEVIKKDIYKIVEDFWASGSLPHG 353
Query: 431 INSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKILANRLRQVIGSVVSEVQSAFVKN 490
NS FIALIPK P+ NEFRPIS+VG LYKI+AKILA RL++V+ ++ QS+F+K
Sbjct: 354 CNSAFIALIPKTQHPRGFNEFRPISMVGCLYKIIAKILARRLQRVMDHLIGPYQSSFIKG 413
Query: 491 RQILDGILITNEVVDEARKSKKDLLLFKVDFEKAYDSVDWGYLDEVMGSMTFPTVW*KWI 550
RQILDG+LI +E++D R+ K + ++ K+DF KA+DS+ W YL+ V+ M FP VW WI
Sbjct: 414 RQILDGVLIASELIDSCRRMKNEAVVLKLDFHKAFDSISWNYLEWVLCQMNFPLVWRSWI 473
Query: 551 KECVGTATASVLVNGCPTDEFFLERGLRQGDPLS 584
+ CV +A+A+VL+NG P+ F L+RGLRQGDPLS
Sbjct: 474 RSCVMSASAAVLINGSPSSVFKLQRGLRQGDPLS 507
>UniRef100_Q7X5Z1 OSJNBa0006A01.14 protein [Oryza sativa]
Length = 1189
Score = 363 bits (933), Expect = 1e-98
Identities = 299/1095 (27%), Positives = 466/1095 (42%), Gaps = 91/1095 (8%)
Query: 2 GASRGMVTIWDRVEVEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLWNS 61
G G++ +W + +V G + ++S F + +Y P K
Sbjct: 69 GTRGGILLLWSSGDFDVQEVRIGRFSISAQITDIRSGVTFLLTGVYGPTRHHLKDSFLRH 128
Query: 62 LSSRLLSLGSSKVCVCGDFNTVRSVDER--------RSVRGSQVVDDCAPFNDFIEDRVL 113
+ S G + + GDFN + ++ R R V+D C L
Sbjct: 129 IRRLRPSPGEGWL-ILGDFNMIYRARDKNNSNLNLARMRRFRAVIDRCE----------L 177
Query: 114 INLPLCGRKFTWYNGDGRS-MSTLDRFLLSEEWCMVWPTCLQVAQLRGLSDHCPIVLTVD 172
+PL RKFTW N ++ + LDR +EEW + +P + A G SDH P+VL+
Sbjct: 178 HEIPLQNRKFTWSNERRKATLVKLDRCFCNEEWDLAFPHHVLHALPTGPSDHYPLVLSNP 237
Query: 173 EKNWGPRPVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVAN 232
+ PR + W +PG+ D V++ W L +KL+ L++W
Sbjct: 238 QGPHRPRTFKFENFWTRVPGFKDKVKDIWSQPSSHTEPMHRLNQKLQHTAVGLRDWTKGI 297
Query: 233 TLNLPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQSRLN 292
+ N LD E EL+ E + + L+ + + Q SR+
Sbjct: 298 CSEAKLQFHMALNVIQRLDVAQEHRELTPLETRLRAALKRKVLGLASIERARKKQASRVT 357
Query: 293 WLRDGDANSKFFHSVLAARRRINSLSSILVDG--VVVEGVQPVRQAVFTHSKNHFLAPIM 350
+R+GDAN+KFFH + ARRR N++ + D V G + + + H + F+
Sbjct: 358 HIREGDANTKFFHLKVNARRRRNTIQRLKKDSGWAVTHGDKEL--TIHDHFAS-FMGRPG 414
Query: 351 TRPSVGHLQFRKLSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFWHEM 410
RP+ + + ++ ++ L +PF EEE+ A+ + + K PGPDG F K W +
Sbjct: 415 PRPNELNWETLDITPIDLATLGEPFTEEEVHKAIKEMPADKAPGPDGFTGAFFKVCWEII 474
Query: 411 KAEIVRFVTEFHRNRKLSKGI-NSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKILA 469
K +I+ + + R + NS I LIPK +S++++RPISL+ S+ KI AK+LA
Sbjct: 475 KDDILLVFSSIYNLRCAHLNLLNSANIVLIPKKDGAESVSDYRPISLIHSIAKIFAKMLA 534
Query: 470 NRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARKSKKDLLLFKVDFEKAYDSVD 529
RLR + ++S QSAF+K R I D L + + ++ +LLFK+D KA+DSV
Sbjct: 535 LRLRPHMHELISVNQSAFIKGRSIHDNYLFVRNTIRRYHRQRRAMLLFKLDITKAFDSVR 594
Query: 530 WGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLSPFLFL 589
W YL ++ FPT W W+ + ++ + VL+NG P RGLRQGDPLSP LF+
Sbjct: 595 WDYLLALLQRKGFPTRWIDWLGALLSSSISQVLLNGSPGQRIKHGRGLRQGDPLSPLLFI 654
Query: 590 LAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRALRAGLI 649
LA + L ++ + G GR + +S +ADD ++ N + +V L
Sbjct: 655 LAIDPLQRILSKATELGAISKLR-GRTTRLRIS--MYADDDVIFINPTQNDVATSANILH 711
Query: 650 LLEAMSGLKVNFHKSSLVGVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGGDPRRLLFW 709
GL N KS + + L E + K P YLGL + R
Sbjct: 712 RFGTAIGLVTNLQKSQVAAIRCGNIDLEEVLQGVPAKRANFPLKYLGLPLALGRLRKTDL 771
Query: 710 EPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSIESLFNKF 769
+PV ++I R++ W+ + ++ GR L+K VL+A P+Y L+ K + ++ +F
Sbjct: 772 QPVFDKISGRVASWRGKNMAAAGRTTLVKSVLSAQPIYLLTALKVMKESLEQLDKQRRRF 831
Query: 770 FWGGGED----KRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKEALW 825
W G D K K++W + L GGLGV L +F AL +W W + W
Sbjct: 832 LWAGTGDITGGKCKVNWAK---TCLPTSQGGLGVLSLDKFTRALRLRWLWNEWRDPSKPW 888
Query: 826 RKVLVARYGVADGGLEDGGRSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRVGDGAETDF 885
GLE C DG+ F + + VGDG T F
Sbjct: 889 V------------GLET---PC-----------DGVDRN---LFAASTKITVGDGNTTRF 919
Query: 886 WWDCWCGDVSLCERFSRLYDLTVNKLISVRN-----------MILLGVDVGGEALRWRRR 934
W W + +Y+++ N+ S+R + G+ + L R
Sbjct: 920 WDSAWINGRRPKDLMPLVYEISKNRKKSLRQGKEDDSWVHDLTLDAGLSITINLLDQLVR 979
Query: 935 LWAWEEELVEEFXALLLTVSLQESVTDRWIWLPTQDDGYSVRGAYDMLTSQEQPHLHQNL 994
LW + V L D+ +W T Y+ AY P+ + N
Sbjct: 980 LWE-----------AVRNVHLDSEEPDQIVWKFTSSGHYTASSAYHA-QCLGAPNTNFN- 1026
Query: 995 ELIWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILPLEARMCVSSCGNEEDVNHLFLSC 1054
LIW P K AW ++++R+ T D LA RG C + E HL C
Sbjct: 1027 SLIWKVWAPGKCKFHAWLIIQNRVWTSDRLATRGW--RNNGHCPLCRCDTETAFHLVAMC 1084
Query: 1055 ATFSALWPLVQAWLG 1069
+W LV W G
Sbjct: 1085 RYTKRIWRLVATWAG 1099
>UniRef100_Q65WR1 Putative polyprotein [Oryza sativa]
Length = 1023
Score = 363 bits (931), Expect = 2e-98
Identities = 282/974 (28%), Positives = 434/974 (43%), Gaps = 68/974 (6%)
Query: 113 LINLPLCGRKFTWYNGDGRS-MSTLDRFLLSEEWCMVWPTCLQVAQLRGLSDHCPIVLTV 171
L +PL RKFTW N ++ + LDR +EEW + +P + A G SDHCP+VL+
Sbjct: 11 LHEIPLQNRKFTWSNERRKATLVKLDRCFCNEEWDLAFPHHVLHAFPTGPSDHCPLVLSN 70
Query: 172 DEKNWGPRPVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVA 231
+ PR + W +PG+ D V++ W L +KL+ L++W
Sbjct: 71 PQGPHRPRTFKFENFWTRVPGFKDKVKDIWSQPSSHTEPMHHLNQKLQRTAVGLRDWTKG 130
Query: 232 NTLNLPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQSRL 291
+ N LD E ELS E + + L+ + + Q SR+
Sbjct: 131 ICSEAKLQFHMALNVIQRLDVAQEHRELSPPETRLRAALKRKVLGLASIERARKKQASRV 190
Query: 292 NWLRDGDANSKFFHSVLAARRRINSLSSILVDG--VVVEGVQPVRQAVFTHSKNHFLAPI 349
+R+GDAN+KFFH + ARRR N++ + D V G + + + H + F+
Sbjct: 191 THIREGDANTKFFHLRVNARRRRNTIQRLKKDSGWAVTHGEKEL--TIHDHFAS-FMGRP 247
Query: 350 MTRPSVGHLQFRKLSSVERGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFWHE 409
RP+ + + ++ ++ L +PF EEE+ A+ + + K PGPDG F K W
Sbjct: 248 GPRPNELNWETLDITPIDLATLGEPFTEEEVHRAIKEMPADKAPGPDGFTGAFFKVCWEI 307
Query: 410 MKAEIVRFVTEFHRNRKLSKGI-NSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKIL 468
+K +I+ + + R + NS I LIPK +S++++RPISL+ S+ KI AK+L
Sbjct: 308 IKDDILLVFSSIYNLRCAHLNLLNSANIVLIPKKDGAESVSDYRPISLIHSIAKIFAKML 367
Query: 469 ANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARKSKKDLLLFKVDFEKAYDSV 528
A RLR + ++S QSAF+K R I D L ++ + ++ +LLFK+D KA+DSV
Sbjct: 368 ALRLRPHMHELISVNQSAFIKGRSIHDNYLFVRNMIRRYHRLRRAMLLFKLDITKAFDSV 427
Query: 529 DWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLSPFLF 588
W YL ++ FPT W W+ + ++T+ VL+NG P RGLRQGDPLSP LF
Sbjct: 428 RWDYLLALLQRKGFPTRWIDWLGALLSSSTSQVLLNGSPGQRIKHGRGLRQGDPLSPLLF 487
Query: 589 LLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRALRAGL 648
+LA + L ++ + G GR + +S +ADD ++ N + +V L
Sbjct: 488 ILAIDPLQRILSKATELGAISKLR-GRTTRLRIS--MYADDAVIFINPTQNDVATSANIL 544
Query: 649 ILLEAMSGLKVNFHKSSLVGVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGGDPRRLLF 708
+GL N KS + + L E + K P YLGL + R
Sbjct: 545 HRFGTATGLVTNLQKSQVAAIRCGNIDLEEVLQGVPAKRANFPLKYLGLPLVLGRLRKTD 604
Query: 709 WEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSIESLFNK 768
+PV ++I R++ W+ + ++ GR L+K VL+A P+Y L+ K + ++ +
Sbjct: 605 LQPVFDKISGRVASWRGKNMAAAGRTTLVKSVLSAQPIYLLTALKVMKESLEQLDKQRRR 664
Query: 769 FFWGGGED----KRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKEAL 824
F W G D K K++W + L GGLGV L +F AL +W W +
Sbjct: 665 FLWAGTGDITGGKCKVNWAK---TCLPTSQGGLGVLSLDKFTRALRLRWLWNEWRDPSKP 721
Query: 825 WRKVLVARYGVADGGLEDGGRSCSSWWREIVRIRDGIGEGGEGWFGSCVRRRVGDGAETD 884
W GLE C R++ F + + VGDG T
Sbjct: 722 WV------------GLET---PCDGVDRDL--------------FAASTKITVGDGNTTR 752
Query: 885 FWWDCWCGDVSLCERFSRLYDLTVNKLISVRNMILLGVDVGGEALRWRRRLWAWE----- 939
FW W + +Y+++ N+ S+R G E W L
Sbjct: 753 FWDSAWINGRRPKDLMPLVYEISKNRKKSLRQ--------GKEDDSWVHDLTLDAGSSIT 804
Query: 940 ----EELVEEFXALLLTVSLQESVTDRWIWLPTQDDGYSVRGAYDMLTSQEQPHLHQNLE 995
++LV + A+ V L D+ +W T Y+ AY P+ + N
Sbjct: 805 INLLDQLVRLWEAVR-NVHLDSEEPDQIVWKFTSSGHYTASSAYHA-QCLGAPNTNFN-S 861
Query: 996 LIWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILPLEARMCVSSCGNEEDVNHLFLSCA 1055
LIW P K AW ++++R+ T D LA RG C + E HL C
Sbjct: 862 LIWKVWAPGKCKFHAWLIIQNRVWTSDRLATRGW--RNNGHCPLCRCDTETALHLVAMCR 919
Query: 1056 TFSALWPLVQAWLG 1069
+W LV W G
Sbjct: 920 YTKRIWRLVATWAG 933
>UniRef100_O81630 F8M12.22 protein [Arabidopsis thaliana]
Length = 1662
Score = 363 bits (931), Expect = 2e-98
Identities = 265/891 (29%), Positives = 444/891 (49%), Gaps = 40/891 (4%)
Query: 2 GASRGMVTIWDRVEVEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLWNS 61
G S G+ +W + V + + + + +H +N FY+ +Y + + LW
Sbjct: 444 GHSGGLALLW-KDSVRLSNLYQDDRHIDVHISI--NNINFYLSRVYGHPCQSERHSLWTH 500
Query: 62 LSSRLLSLGSSKVCVCGDFNTVRSVDERRSVRGSQVVD-DCAPFNDFIEDRVLINLPLCG 120
+ L + + GDFN + S +E+ + G Q + F + + L ++ G
Sbjct: 501 FEN-LSKTRNDPWILIGDFNEILSNNEK--IGGPQRDEWTFRGFRNMVSTCDLKDIRSIG 557
Query: 121 RKFTWYNGDGRSMST---LDRFLLSEEWCMVWPTC-LQVAQLRGLSDHCPIVLTVDEKNW 176
+F+W G+ S + LDR ++ E ++P L+ + G SDH P+ L++++
Sbjct: 558 DRFSWV-GERHSHTVKCCLDRAFINSEGAFLFPFAELEFLEFTG-SDHKPLFLSLEKTET 615
Query: 177 GP-RPVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVANTLN 235
RP R K ++P + +V+ W ++G L ++++ + A+ + + LN
Sbjct: 616 RKMRPFRFDKRLLEVPHFKTYVKAGWNK-AINGQRKH-LPDQVRTCRQAMAKLKHKSNLN 673
Query: 236 LPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQSRLNWLR 295
+I L+ A LD ++ E + I ++ R Q+SR W++
Sbjct: 674 SRIRINQLQ---AALDKA--MSSVNRTERRTISHIQRELTVAYRDEERYWQQKSRNQWMK 728
Query: 296 DGDANSKFFHSVLAARRRINSLSSIL-VDGVVVEGVQPV---RQAVFTHSKNHFLAPIMT 351
+GD N++FFH+ R +N L +I +G++ G + + Q FT P+
Sbjct: 729 EGDRNTEFFHACTKTRFSVNRLVTIKDEEGMIYRGDKEIGVHAQEFFTKVYESNGRPVSI 788
Query: 352 RPSVGHLQFRKLSSVE-RGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFWHEM 410
G F+ + + + L K + EI A+ K PGPDG+ F KS W +
Sbjct: 789 IDFAG---FKPIVTEQINDDLTKDLSDLEIYNAICHIGDDKAPGPDGLTARFYKSCWEIV 845
Query: 411 KAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKILAN 470
++++ V F R + + IN I +IPK+ +P++L+++RPI+L LYKI++K L
Sbjct: 846 GPDVIKEVKIFFRTSYMKQSINHTNICMIPKITNPETLSDYRPIALCNVLYKIISKCLVE 905
Query: 471 RLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARKSKK---DLLLFKVDFEKAYDS 527
RL+ + ++VS+ Q+AF+ R + D ++I +E++ + K+ + K D KAYD
Sbjct: 906 RLKGHLDAIVSDSQAAFIPGRLVNDNVMIAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDR 965
Query: 528 VDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLSPFL 587
V+W +L+ M F W KWI V + SVLVNG P +RG+RQGDPLSP+L
Sbjct: 966 VEWNFLETTMRLFGFSETWIKWIMGAVKSVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYL 1025
Query: 588 FLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRALRAG 647
F+L A+ LN L+K V G +G +G P V +HLQFADD+L + N +AL+
Sbjct: 1026 FILCADILNHLIKNRVAEGDIRGIRIGNGVPGV-THLQFADDSLFFCQSNVRNCQALKDV 1084
Query: 648 LILLEAMSGLKVNFHKSSLV-GVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGGDPRRL 706
+ E SG K+N KS + G + G+ + ++LG + YLGL ++
Sbjct: 1085 FDVYEYYSGQKINMSKSMITFGSRVHGTTQNRLKNILGIQSHGGGGKYLGLPEQFGRKKR 1144
Query: 707 LFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSIESLF 766
+ ++ R+K R S W +++LS G+ ++LK V ++PVYA+S FK P I+S IE+L
Sbjct: 1145 DMFNYIIERVKKRTSSWSAKYLSPAGKEIMLKSVAMSMPVYAMSCFKLPLNIVSEIEALL 1204
Query: 767 NKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKEALWR 826
F+W KR+I WI W L K+ GGLG L +FN ALL K WR++ +L+
Sbjct: 1205 MNFWWEKNAKKREIPWIAWKRLQYSKKEGGLGFRDLAKFNDALLAKQVWRMINNPNSLFA 1264
Query: 827 KVLVARYGVADGGLEDGGRSCSSW-WREIVRIRDGIGEG-----GEGWFGS 871
+++ ARY D L+ + S+ W ++ D I +G G+G GS
Sbjct: 1265 RIMKARYFREDSILDAKRQRYQSYGWTSMLAGLDVIKKGSRFIVGDGKTGS 1315
Score = 40.0 bits (92), Expect = 0.39
Identities = 25/92 (27%), Positives = 43/92 (46%), Gaps = 4/92 (4%)
Query: 997 IWHTQVPLKVSILAWRLLRDRLPTKDNLANRGILPLEARMCVSSCGNEEDVNHLFLSCAT 1056
+W + K+ + WR + LPT+ L RG + ++ C EE +NH+ +C
Sbjct: 1362 LWKLPIIPKIKYMLWRTISKALPTRSRLLTRG-MDIDPH-CPRCPTEEETINHVLFTCPY 1419
Query: 1057 FSALWPLVQ-AWLGVVGVDSQSTSDHLVQFIN 1087
+++W L WL SQ T +++ IN
Sbjct: 1420 AASIWGLSNFPWL-PGHTFSQDTEENISFLIN 1450
>UniRef100_O81517 T24M8.8 protein [Arabidopsis thaliana]
Length = 1164
Score = 363 bits (931), Expect = 2e-98
Identities = 284/1050 (27%), Positives = 479/1050 (45%), Gaps = 89/1050 (8%)
Query: 46 MYAPCDRRAKQVLWNSLS--SRLLSLGSSKVCVCGDFNTVRSVDERRSVRGSQVVDDCAP 103
+YA D +Q+LWN + S + V GDFN + E + G V
Sbjct: 6 VYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILHPSEHSTSDGFNVDRPTRI 65
Query: 104 FNDFIEDRVLINLPLCGRKFTWYNGDGRS--MSTLDRFLLSEEWCMVWPTCLQVAQLRGL 161
F + I L +L G FTW+N R+ LDR L++++W +P+ L +
Sbjct: 66 FRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTTFPSSLGLFGEPDF 125
Query: 162 SDHCPIVLTVDEKN-WGPRPVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKL 220
SDH L++ + +P R + + KW S V+G + + KLK
Sbjct: 126 SDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKWFSTSVTGSAMYRVSVKLKA 185
Query: 221 MKAALKEWHVANTLNLPSKIVS-----LKNRRAVLDSKGEEEELSEEELGELHGISSDIH 275
+K ++++ N ++ + L + +L S E E I
Sbjct: 186 LKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAIEAETQRKWRI----- 240
Query: 276 SLSRLNTSICWQQSRLNWLRDGDANSKFFHSVLAARRRINSLSSILVD--GVVVEGVQPV 333
L+ S +Q+SR+NWLR+GD NS +FH + +AR+ +N + L D G +EG Q +
Sbjct: 241 -LAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIH-FLSDPVGDRIEGQQNL 298
Query: 334 RQAVFTHSKNHFLA----PIMTRPSVGHLQFRKLSSVERGGLIKPFHEEEIKAAVWDCDS 389
+ +++ + P+ + + +L + S ++ L PF E+IK A +
Sbjct: 299 ENHCVEYFQSNLGSEQGLPLFEQADISNLLSYRCSPAQQVSLDTPFSSEQIKNAFFSLPR 358
Query: 390 YKIPGPDGINLGFIKSFWHEMKAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLN 449
K GPDG + F + W + E+ + EF + KL K N+ + LIPK+ + S++
Sbjct: 359 NKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKITNASSMS 418
Query: 450 EFRPISLVGSLYKILAKILANRLRQVIGSVVSEVQSAFVKNRQILDGILITNEVV-DEAR 508
+FRPIS + ++YK+++K+L +RL+ + + +S QSAF+ R L+ +L+ E+V +
Sbjct: 419 DFRPISCLNTVYKVISKLLTDRLKDFLPAAISHSQSAFMPGRLFLENVLLATELVHGYNK 478
Query: 509 KSKKDLLLFKVDFEKAYDSVDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPT 568
K+ + KVD KA+DSV W ++ + ++ P + WI EC+ TA+ SV++NG
Sbjct: 479 KNIAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFSVILNGHSA 538
Query: 569 DEFFLERGLRQGDPLSPFLFLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFAD 628
F+ +GLRQGDP+SP+LF+LA E + L+++ +G + + + +SHL FAD
Sbjct: 539 GHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHP--KTSQLEISHLMFAD 596
Query: 629 DTLLIGNKSWANVRALRAGLILLEAMSGLKVNFHKSSLVGVNITGSWLSEAASVLGCKVG 688
D ++ + +++ + L SGL +N +K+ L ++ S S++ + G K+G
Sbjct: 597 DVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQS-ESDSMASYGFKLG 655
Query: 689 KIPFLYLGLSIGGDPRRLLFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYA 748
+P YLGL + + + P++ +I R + W R LSF GR+ LL V++ + +
Sbjct: 656 SLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFW 715
Query: 749 LSFFKAPSGIISSIESLFNKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLA 808
+S F P G I IESL ++F W DK+ I+ + W + L K GG+G+ R N
Sbjct: 716 ISSFILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNRT 775
Query: 809 LLGKWCWRLLLEKEALWRKVLVARYGVADGGLEDGGRSCSSW-----------WREIVRI 857
L + W L +LW VA G+S S W W+ ++R+
Sbjct: 776 LYLRMIWLLFSNSGSLW---------VAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLRL 826
Query: 858 RDGIGEGGEGWFGSCVRRRVGDGAETDFWWDCWCGDVSLCERFSR--LYDLTVNKLISVR 915
R + E +R VG+G + FW+D W L + DL V+ +
Sbjct: 827 R-VVAE-------RFIRCNVGNGRDASFWFDNWTPFGPLIKFLGNEGPRDLRVHLNAKIS 878
Query: 916 NMILLGVDVGGEALRWRRRLWAWEEELVEEFXAL---LLTVSLQESVTD----RWIWLPT 968
++ W+ + ++ +L L +S+ D W+
Sbjct: 879 DVC-------------TSEGWSIADPRSDQALSLHTHLTNISMPSDAQDLDSYDWVVDNK 925
Query: 969 QDDGYSVRGAYDMLTSQEQPHLHQNLELIWHTQVPLKVSILAWRLLRDRLPTKDNLANRG 1028
G+S + L P +W K + W DRLPTK LA+ G
Sbjct: 926 VCQGFSAAATWSALRPSSAP--VPWARAVWFKGATPKHAFHLWTAHLDRLPTKVRLASWG 983
Query: 1029 ILPLEARMCVSSCG----NEEDVNHLFLSC 1054
+ ++CG + E +HLFLSC
Sbjct: 984 M------QIDTTCGLCSLHPETRDHLFLSC 1007
>UniRef100_Q9SN65 Hypothetical protein F25I24.40 [Arabidopsis thaliana]
Length = 1294
Score = 362 bits (930), Expect = 3e-98
Identities = 266/895 (29%), Positives = 444/895 (48%), Gaps = 43/895 (4%)
Query: 2 GASRGMVTIWDRVEVEVWSTVRGVHFLLIHGRFLKSNEEFYIFNMYAPCDRRAKQVLWNS 61
G S G+ +W + V + + + + +H +N FY+ +Y + + LW
Sbjct: 424 GHSGGLALLW-KDSVRLSNLYQDDRHIDVHISI--NNINFYLSRVYGHPCQSERHSLWTH 480
Query: 62 LSSRLLSLGSSKVCVCGDFNTVRSVDERRSVRGSQVVD-DCAPFNDFIEDRVLINLPLCG 120
+ L + + GDFN + S +E+ + G Q + F + + L ++ G
Sbjct: 481 FEN-LSKTRNDPWILIGDFNEILSNNEK--IGGPQRDEWTFRGFRNMVSTCDLKDIRSIG 537
Query: 121 RKFTWYNGDGRSMST---LDRFLLSEEWCMVWPTC-LQVAQLRGLSDHCPIVLTVDEKNW 176
+F+W G+ S + LDR ++ E ++P L+ + G SDH P+ L++++
Sbjct: 538 DRFSWV-GERHSHTVKCCLDRAFINSEGAFLFPFAELEFLEFTG-SDHKPLFLSLEKTET 595
Query: 177 GP-RPVRLLKCWQDMPGYSDFVREKWGSFQVSGWGGFVLKEKLKLMKAALKEWHVANTLN 235
RP R K ++P + +V+ W ++G L ++++ + A+ + + LN
Sbjct: 596 RKMRPFRFDKRLLEVPHFKTYVKAGWNK-AINGQRKH-LPDQVRTCRQAMAKLKHKSNLN 653
Query: 236 LPSKIVSLKNRRAVLDSKGEEEELSEEELGELHGISSDIHSLSRLNTSICWQQSRLNWLR 295
+I L+ A LD ++ E + I ++ R Q+SR W++
Sbjct: 654 SRIRINQLQ---AALDKA--MSSVNRTERRTISHIQRELTVAYRDEERYWQQKSRNQWMK 708
Query: 296 DGDANSKFFHSVLAARRRINSLSSIL-VDGVVVEGVQPV---RQAVFTHSKNHFLAPIMT 351
+GD N++FFH+ R +N L +I +G++ G + + Q FT P+
Sbjct: 709 EGDRNTEFFHACTKTRFSVNRLVTIKDEEGMIYRGDKEIGVHAQEFFTKVYESNGRPVSI 768
Query: 352 RPSVGHLQFRKLSSVE-RGGLIKPFHEEEIKAAVWDCDSYKIPGPDGINLGFIKSFWHEM 410
G F+ + + + L K + EI A+ K PGPDG+ F KS W +
Sbjct: 769 IDFAG---FKPIVTEQINDDLTKDLSDLEIYNAICHIGDDKAPGPDGLTARFYKSCWEIV 825
Query: 411 KAEIVRFVTEFHRNRKLSKGINSRFIALIPKVASPQSLNEFRPISLVGSLYKILAKILAN 470
++++ V F R + + IN I +IPK+ +P++L+++RPI+L LYKI++K L
Sbjct: 826 GPDVIKEVKIFFRTSYMKQSINHTNICMIPKITNPETLSDYRPIALCNVLYKIISKCLVE 885
Query: 471 RLRQVIGSVVSEVQSAFVKNRQILDGILITNEVVDEARKSKK---DLLLFKVDFEKAYDS 527
RL+ + ++VS+ Q+AF+ R + D ++I +E++ + K+ + K D KAYD
Sbjct: 886 RLKGHLDAIVSDSQAAFIPGRLVNDNVMIAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDR 945
Query: 528 VDWGYLDEVMGSMTFPTVW*KWIKECVGTATASVLVNGCPTDEFFLERGLRQGDPLSPFL 587
V+W +L+ M F W KWI V + SVLVNG P +RG+RQGDPLSP+L
Sbjct: 946 VEWNFLETTMRLFGFSETWIKWIMGAVKSVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYL 1005
Query: 588 FLLAAEGLNVLMKALVDAGLFKGYSVGRADPVVVSHLQFADDTLLIGNKSWANVRALRAG 647
F+L A+ LN L+K V G +G +G P V +HLQFADD+L + N +AL+
Sbjct: 1006 FILCADILNHLIKNRVAEGDIRGIRIGNGVPGV-THLQFADDSLFFCQSNVRNCQALKDV 1064
Query: 648 LILLEAMSGLKVNFHKSSLV-GVNITGSWLSEAASVLGCKVGKIPFLYLGLSIGGDPRRL 706
+ E SG K+N KS + G + G+ + ++LG + YLGL ++
Sbjct: 1065 FDVYEYYSGQKINMSKSMITFGSRVHGTTQNRLKNILGIQSHGGGGKYLGLPEQFGRKKR 1124
Query: 707 LFWEPVVNRIKNRLSGWQSRFLSFGGRLVLLKFVLTALPVYALSFFKAPSGIISSIESLF 766
+ ++ R+K R S W +++LS G+ ++LK V ++PVYA+S FK P I+S IE+L
Sbjct: 1125 DMFNYIIERVKKRTSSWSAKYLSPAGKEIMLKSVAMSMPVYAMSCFKLPLNIVSEIEALL 1184
Query: 767 NKFFWGGGEDKRKISWIRWDTLSLRKEYGGLGVTRLREFNLALLGKWCWRLLLEKEALWR 826
F+W KR+I WI W L K+ GGLG L +FN ALL K WR++ +L+
Sbjct: 1185 MNFWWEKNAKKREIPWIAWKRLQYSKKEGGLGFRDLAKFNDALLAKQVWRMINNPNSLFA 1244
Query: 827 KVLVARYGVADGGLEDGGRSCSSW-WREIVRIRDGIGEGGEGWFGSCVRRRVGDG 880
+++ ARY D L+ + S+ W ++ D I +G R VGDG
Sbjct: 1245 RIMKARYFREDSILDAKRQRYQSYGWTSMLAGLDVIKKGS--------RFIVGDG 1291
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.323 0.140 0.445
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,855,608,303
Number of Sequences: 2790947
Number of extensions: 80332074
Number of successful extensions: 166678
Number of sequences better than 10.0: 1122
Number of HSP's better than 10.0 without gapping: 722
Number of HSP's successfully gapped in prelim test: 400
Number of HSP's that attempted gapping in prelim test: 163581
Number of HSP's gapped (non-prelim): 1762
length of query: 1091
length of database: 848,049,833
effective HSP length: 138
effective length of query: 953
effective length of database: 462,899,147
effective search space: 441142887091
effective search space used: 441142887091
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 80 (35.4 bits)
Medicago: description of AC122166.6