
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0110a.13
(1389 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_Q8H992 Oryza sativa (japonica cultivar-group) retrotra... 360 2e-97
UniRef100_Q9LVQ1 Non-LTR retroelement reverse transcriptase-like... 359 4e-97
UniRef100_Q8LJX1 Putative reverse transcriptase [Sorghum bicolor] 354 1e-95
UniRef100_Q9C6L3 Hypothetical protein F2J7.11 [Arabidopsis thali... 352 4e-95
UniRef100_O22148 Putative non-LTR retroelement reverse transcrip... 352 4e-95
UniRef100_Q9SN65 Hypothetical protein F25I24.40 [Arabidopsis tha... 343 2e-92
UniRef100_O81630 F8M12.22 protein [Arabidopsis thaliana] 343 2e-92
UniRef100_Q7DLW4 Non-ltr retrotransposon reverse transcriptase-l... 343 3e-92
UniRef100_Q9LRU8 Non-LTR retroelement reverse transcriptase-like... 342 4e-92
UniRef100_Q9SZ87 RNA-directed DNA polymerase-like protein [Arabi... 340 1e-91
UniRef100_Q9SKJ4 Putative non-LTR retroelement reverse transcrip... 335 6e-90
UniRef100_Q9SIQ5 Putative non-LTR retroelement reverse transcrip... 335 6e-90
UniRef100_Q75GM6 Putative non-LTR retroelement reverse transcrip... 333 2e-89
UniRef100_Q9FW98 Putative non-LTR retroelement reverse transcrip... 326 3e-87
UniRef100_O81517 T24M8.8 protein [Arabidopsis thaliana] 326 3e-87
UniRef100_O65244 F21E10.5 protein [Arabidopsis thaliana] 325 8e-87
UniRef100_Q9SIS9 Putative non-LTR retroelement reverse transcrip... 323 2e-86
UniRef100_Q9FYK1 F21J9.30 [Arabidopsis thaliana] 323 2e-86
UniRef100_Q9FXJ5 F5A9.24 [Arabidopsis thaliana] 323 2e-86
UniRef100_O23757 Reverse transcriptase [Beta vulgaris subsp. vul... 323 2e-86
>UniRef100_Q8H992 Oryza sativa (japonica cultivar-group) retrotransposon Karma DNA,
complete sequence [Oryza sativa]
Length = 1197
Score = 360 bits (924), Expect = 2e-97
Identities = 256/839 (30%), Positives = 414/839 (48%), Gaps = 28/839 (3%)
Query: 6 WNVRGLGRAEKRRAVVEALSGLKPEIVLVQESKLDSSRVKTVEAFANAVKMDYVSVDAVG 65
WNVRGLG K V ++S P VL+QE+KL + + + +Y V + G
Sbjct: 10 WNVRGLGDKNKCDIVKNSISDCNPSFVLLQETKLQNIPPEKAKTILPPNLANYDFVGSYG 69
Query: 66 SAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMNVYGPNVDSERYDFFNLL 125
S GG+++ WN A + S + + + + + N+YGP ++ DF + L
Sbjct: 70 SRGGMLTAWN-ADFTKTSFISRHFSLTISFTSTFTDLSFSLTNIYGPAELDQKQDFLHEL 128
Query: 126 SEALSLNNCFCKIMGGDFNAILNEGERIG--IQMGVESSFSDFVADSNLIDLPLQSDKFT 183
E L +++ GDFN + ++ G+ ++F+ + +L +LPL +T
Sbjct: 129 LEIDPLIQG-PRLLIGDFNLTRSPSDKNSQNFSTGLAAAFNQTINSLSLFELPLPDRLYT 187
Query: 184 WYSSRAGGLWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSLGAINFGPKPFLF 243
W + R + +R+DR + N S S SD P++L+ K F F
Sbjct: 188 WSNKREVPVLARLDRAFFNNAWNNTLPNTSLSSCTRTTSDHVPLLLTASTSIPKSKTFRF 247
Query: 244 YNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVSRVRINTI 303
NH LLD++F V W + A L KLK ++ IK W + +++
Sbjct: 248 ENHLLLDRDFLPSVMHRWPQQLNMNDPAACLAAKLKQTRSIIKVWMK-----EHRKRSSL 302
Query: 304 EADLQVTMDRLASEEPGIELRKN----RLTILEGLWQEYRKEEYRWRQKARVRWLREGDR 359
D + T+D L E L R + E L+Q + + W+Q+ +V+ L+ G
Sbjct: 303 NEDCKFTIDLLDYIEELRPLNDGERCLRNLVQEKLFQAIQSKVAYWKQRGKVKRLKIGID 362
Query: 360 NTSFF--HSISKIRKSKKLISQLVVGGVSIEDPNMIKDAIFSHFQSFFTSENRLRPKIRC 417
N++FF H+ R S+ I + G I D N + S FQ F S+ +
Sbjct: 363 NSAFFKAHASKNYRNSR--IRSIKHNGAEIADHNGKALLLHSFFQHFLGSDCQPLWDFDL 420
Query: 418 ANLSRLSQANKMSLEKQ---FSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWGIVKEDV 474
+ L + + L Q FS EI +K + N APG DGF +F+ W +KED+
Sbjct: 421 STLYSEDERDYNLLASQGLQFSTEEIITSIKGMNSNSAPGPDGFGPSFYNSAWNSIKEDI 480
Query: 475 VKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVLASRLQV 534
+K F S ++ +N I LIPK +TV +RPISL + K++SKVLA+RLQ
Sbjct: 481 IKLAASFCSNSVQLERINRAHIVLIPKPGKENTVDGYRPISLQNCSVKILSKVLANRLQK 540
Query: 535 VTPLLISENQFAFAQGRQISDCILIANEVVDFLNKKQGGGFLLKLDFAKDYDNIDWGFLF 594
V +I +Q F +GR IS+ ++ A E++ + ++ ++KLDFAK +D++ W LF
Sbjct: 541 VLTRMIHLDQTGFLKGRCISENLIYATELIQACHARKCKTLIIKLDFAKAFDSVLWTSLF 600
Query: 595 DILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLSPFLFNLCVNG 654
IL F + WI+WI++ + T+ ++L+NG + +KGLRQGDPLSP+LF L +
Sbjct: 601 QILAVRGFPNNWISWIKSLLQTSKSAVLLNGIPGKWINCKKGLRQGDPLSPYLFILVADV 660
Query: 655 LSCLLNQLLDDSNFSGVR--IGPGLALNHLQFADDTLLFCENDENQIDLLCNTLLSFLFT 712
L LL + NF +R I +Q+ADDTL+ C +E+ + L +TLL F
Sbjct: 661 LQRLL-----EKNFQ-IRHPIYQDRPCATIQYADDTLVICRAEEDDVLALRSTLLQFSKA 714
Query: 713 SGLKVNLHKSTIIGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLGGNPRRASFWEPMLD 772
+GL++N KST+I ++D + ++ + + LP+ YLG PL + + +P++
Sbjct: 715 TGLQINFAKSTMISLHIDRSKESSLSELLQCKLESLPMSYLGLPLSLHKLTNNDLQPIVV 774
Query: 773 KLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVGEVEKILRSFLQGG 831
K+ W + +S + RL L+ A L SVP++ M+ FK P KV+ ++K R+F G
Sbjct: 775 KVDSFLTGWEASLLSQAERLILVNAVLSSVPVYAMSAFKLPPKVIEAIDKRRRAFFWTG 833
>UniRef100_Q9LVQ1 Non-LTR retroelement reverse transcriptase-like [Arabidopsis
thaliana]
Length = 1072
Score = 359 bits (921), Expect = 4e-97
Identities = 236/716 (32%), Positives = 372/716 (50%), Gaps = 28/716 (3%)
Query: 139 MGGDFNAILNEGERIGI-QMGVESSFSDF---VADSNLIDLPLQSDKFTWYS-SRAGGLW 193
M GDFN +L E + ++ DF +++ L DL + + FTW++ S +
Sbjct: 1 MLGDFNQVLLPQEHSNPPSLNIDRRMRDFGSCLSEMELSDLVFKGNSFTWWNKSSIRPIA 60
Query: 194 SRIDRWLVSEDAINKFDGISQSVENWGLSDRRP--IILSLGAINFGPKPFLFYNHWLLDK 251
++DR L ++ N + N SD ++L I+ +PF F+N L ++
Sbjct: 61 KKLDRILANDSWCNLYPSSHGLFGNLDFSDHVSCGVVLEANGIS-AKRPFKFFNFLLKNE 119
Query: 252 EFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVSRVRINTIEA-DLQVT 310
+F +V D W S VG S + + +KLK +K IK++ + S + + T EA +L +T
Sbjct: 120 DFLNVVMDNWFSTNVVGSSMYRVSKKLKAMKKPIKDFSRL--NYSGIELRTKEAHELLIT 177
Query: 311 MDRLASEEPGI-----ELRKNRLTILEGLWQEYRKEEYRWRQKARVRWLREGDRNTSFFH 365
L P + EL R +L EE + Q++RV W EGD NT +FH
Sbjct: 178 CQNLTLANPSVSNAALELEAQRKWVLLSC-----AEESFFHQRSRVSWFAEGDSNTHYFH 232
Query: 366 SISKIRKSKKLISQLV-VGGVSIEDPNMIKDAIFSHFQSFFTS-ENRLRPKIRCANLS-- 421
+ RKS I+ LV G+ I+ I D ++++ S E+ + NL
Sbjct: 233 RMVDSRKSFNTINSLVDSNGLLIDSQQGILDHCVTYYERLLGSIESPFSMEQEDMNLLLT 292
Query: 422 -RLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWGIVKEDVVKFFED 480
R SQ LEK F++ EI KS N+ G DG+++ FF+ W I+ +V+ +
Sbjct: 293 YRCSQDQCSELEKSFTDDEIKAAFKSLPRNKTSGPDGYSVEFFRDTWSIIGPEVLAAIHE 352
Query: 481 FHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVLASRLQVVTPLLI 540
F +G+L+K NAT + LIPK + T+S+FRPIS + + YK+ISK+L SRLQ + +I
Sbjct: 353 FFDSGQLLKQWNATTLVLIPKTSNACTISEFRPISCLNTLYKVISKLLTSRLQGLLSAVI 412
Query: 541 SENQFAFAQGRQISDCILIANEVVDFLNKKQ-GGGFLLKLDFAKDYDNIDWGFLFDILKE 599
+Q AF GR +++ +L+A E+V N+ +LK+D K +D++ W F+ L+
Sbjct: 413 GHSQSAFLPGRSLAENVLLATEMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRA 472
Query: 600 MNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLSPFLFNLCVNGLSCLL 659
+ +R+INWI C++T + +I VNG++ GFF KGLRQGDPLSP+LF L + S LL
Sbjct: 473 LAIPERYINWIHQCITTPSFTISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLL 532
Query: 660 NQLLDDSNFSGVRIGPGLALNHLQFADDTLLFCENDENQIDLLCNTLLSFLFTSGLKVNL 719
D L+++HL FADD ++F + + + +C TL F SGLKVN
Sbjct: 533 YSRYDSGYIHYHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNK 592
Query: 720 HKSTIIGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLGGNPRRASFWEPMLDKLRRKRR 779
KS + +D+ E + + YG+ G PI YLG PL R + + P+L+KL + R
Sbjct: 593 DKSQLFQAGLDLSE-RITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLR 651
Query: 780 TWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVGEVEKILRSFLQGGKLHG 835
+W SK +S +GR L+ + + + F M+ F P + ++E + FL G + G
Sbjct: 652 SWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDG 707
>UniRef100_Q8LJX1 Putative reverse transcriptase [Sorghum bicolor]
Length = 1998
Score = 354 bits (908), Expect = 1e-95
Identities = 239/837 (28%), Positives = 425/837 (50%), Gaps = 22/837 (2%)
Query: 6 WNVRGLGRAEKRRAVVEALSGLKPEIVLVQESKLDSSRVKTVEAFANAVKMDYVSVDAVG 65
WNVRGL EK + + L +I+ +QE+K DS + + + + VG
Sbjct: 813 WNVRGLNSKEKWNPIRNKIVDLAVDIICLQETKKDSFDIAFLRNICPPDFDRFEYLPLVG 872
Query: 66 SAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMNVYGPNVDSERYDFFNLL 125
++GG+++ W ++ E N I + + ++ ++ + NVY P D + F N
Sbjct: 873 ASGGILTAWKSKLFSGELIFSNEYAISIQMSSMHNNDSWLLTNVYAPCTDEGKRQFVNWF 932
Query: 126 SEALSLNNCFCKIMGGDFNAILNEGERIGIQMGVESS----FSDFVADSNLIDLPLQSDK 181
+ + ++G DFN + +R + G +SS F+ ++ L ++ LQ K
Sbjct: 933 KNIQTPHEVDWLVVG-DFNLMRKHEDRN--KEGGDSSEMFLFNSAISSLGLNEIILQGRK 989
Query: 182 FTWYSSRAGGLWSRIDRWLVSEDAIN-KFDGISQSVENWGL--SDRRPIILSLGAINFGP 238
+TW + ++ L +ID W + ++ N KF + SV+ + + SD P ++ +
Sbjct: 990 YTWSNMQSNPLLQKID-WTFTSNSWNIKF--LDTSVKGYDMTPSDHAPCLVKVSTKIPKT 1046
Query: 239 KPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVSRV 298
K F F N WL F++++SD W T A + K K L+ +KE Q +S++ +
Sbjct: 1047 KVFRFENCWLRLPGFQQIISDVWNVPTHHLDQAKSITAKFKKLRKALKEKQ---ASMAHL 1103
Query: 299 RINTIEADLQVTMDRLASEEPGIELRK--NRLTILEGLWQEYRKEEYRWRQKARVRWLRE 356
+ + L + L E + + + R + + L + +++ WRQ+ ++W++
Sbjct: 1104 KTVISKVKLVIQFLELLEEYRDLSMLEWNFRAILKKKLTELLEQQKIYWRQRGAIKWIKL 1163
Query: 357 GDRNTSFFHSISKIRKSKKLISQLVVGGVSIEDPNMIKDA-IFSHFQSFFTSENRLRPKI 415
GD T FFH+ + I+ L++QL SI + K+ I+ F+ ++ R I
Sbjct: 1164 GDATTKFFHANATIKMRGNLVNQLQKQDGSILTAHKDKEQHIWEEFKERLGTKEETRFGI 1223
Query: 416 RCANLSRLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWGIVKEDVV 475
+ + S + +LE+ F+E EI V+K +++PG DGF+ F K W VK D+
Sbjct: 1224 NPDTILQASN-DLETLEEPFTETEIESVVKLLPNDKSPGPDGFSNEFMKASWQTVKVDIF 1282
Query: 476 KFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVLASRLQVV 535
FH+ ++ +N +FI+LIPK + P +V+D+RPISL+ ++ KLI+K+LA+RLQ+
Sbjct: 1283 NLCHAFHNHNVCLRSINTSFITLIPKSHNPVSVNDYRPISLLNTSMKLITKLLANRLQMK 1342
Query: 536 TPLLISENQFAFAQGRQISDCILIANEVVDFLNKKQGGGFLLKLDFAKDYDNIDWGFLFD 595
L+ +NQ+ F + R I DC+ + E + + + ++KLDF K +D ++ +
Sbjct: 1343 ITDLVHKNQYGFIRQRTIQDCLAWSFEYLHLCHHSRKAIVIIKLDFEKAFDKMNHKAMLT 1402
Query: 596 ILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLSPFLFNLCVNGL 655
I+K FG +W+NW+++ ++ T ++L+NG+ +G+RQGDPLSP LF L + L
Sbjct: 1403 IMKAKGFGQKWLNWMKSIFTSGTSNVLLNGTPGKTIHCLRGVRQGDPLSPLLFVLAADFL 1462
Query: 656 SCLLNQL-LDDSNFSGVRIGPGLALNHLQFADDTLLFCENDENQIDLLCNTLLSFLFTSG 714
L+N+ D + + +Q+ADDTL+ E D Q+ +L + + +F +G
Sbjct: 1463 QSLINKAKCMDLLKLPIPMQTNQDFPIIQYADDTLIIAEGDTRQLLILKSIINTFSEATG 1522
Query: 715 LKVNLHKSTIIGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLGGNPRRASFWEPMLDKL 774
LKVNL KS ++ N+D +A +G S G P YLG PLG ++P+++K
Sbjct: 1523 LKVNLQKSMMLPINMDEERLDTLARTFGCSKGSFPFTYLGLPLGITKPTIQDYQPLINKC 1582
Query: 775 RRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVGEVEKILRSFLQGG 831
+ + + F+S +GRL L A ++P F M P V+ +++K + L G
Sbjct: 1583 EARLGS-VATFLSEAGRLELTNAVFTALPTFYMCTLAIPKSVIHKIDKFRKHCLWRG 1638
>UniRef100_Q9C6L3 Hypothetical protein F2J7.11 [Arabidopsis thaliana]
Length = 1213
Score = 352 bits (904), Expect = 4e-95
Identities = 272/862 (31%), Positives = 416/862 (47%), Gaps = 48/862 (5%)
Query: 6 WNVRGLGRAEKRRAVVEALSGLKPEIVLVQESKLDSSRVKTVEAFANAVKMDYVSVD--A 63
WN+RG R + + KP V E+ + + + F NA+ + V+ A
Sbjct: 8 WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRK---FINALLPGWSFVENYA 64
Query: 64 VGSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMNVYGPNVDSERYDFFN 123
G + +W+P+ + A K+++ I + S + + VY N + R + +
Sbjct: 65 FSDLGKIWVMWDPSVQVVVVA-KSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKELWI 123
Query: 124 LLSEALSLNNCFCKIMG-------GDFNAILNEGERIG-IQMGVESSFSDF---VADSNL 172
+ +N I+G GDFN +LN E + + V+ + DF + + L
Sbjct: 124 EI-----VNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAEL 178
Query: 173 IDLPLQSDKFTWYS-SRAGGLWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRP--IIL 229
DL + + FTW++ S + +IDR LV++ F + SD ++L
Sbjct: 179 SDLRYKGNTFTWWNKSHTTPVAKKIDRILVNDSWNALFPSSLGIFGSLDFSDHVSCGVVL 238
Query: 230 SLGAINFGPKPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQ 289
+I +PF F+N+ L + +F LV D W + VG S F + +KLK LK IK++
Sbjct: 239 EETSIK-AKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFS 297
Query: 290 GYASSVSRVRINTIEADLQVTMDR-LASEEP---GIELRKNRLTILEGLWQEYRK-EEYR 344
S R L DR LA P EL R W EE
Sbjct: 298 RLNYSELEKRTKEAHDFLIGCQDRTLADPTPINASFELEAERK------WHILTAAEESF 351
Query: 345 WRQKARVRWLREGDRNTSFFHSISKIRKSKKLISQLVVG-GVSIEDPNMIKDAIFSHFQS 403
+RQK+R+ W EGD NT +FH ++ R S IS L G G ++ I D S+F S
Sbjct: 352 FRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGS 411
Query: 404 FFTSENRLRPKIRCAN-----LS-RLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDG 457
E + P + N LS R S A LE FS +I L S N++ G DG
Sbjct: 412 LLGDE--VDPYLMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDG 469
Query: 458 FNLNFFKQFWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLI 517
F FF W IV +V ++F S+G L+K NAT I LIPK P+ SDFRPIS +
Sbjct: 470 FTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKIVNPTCTSDFRPISCL 529
Query: 518 GSAYKLISKVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFLN-KKQGGGFL 576
+ YK+I+++L RLQ + +IS Q AF GR +++ +L+A ++V N +
Sbjct: 530 NTLYKVIARLLTDRLQRLLSGVISSAQSAFLPGRSLAENVLLATDLVHGYNWSNISPRGM 589
Query: 577 LKLDFAKDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKG 636
LK+D K +D++ W F+ L+ + +++INWI C+ST T ++ +NG + GFF+ KG
Sbjct: 590 LKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFTVSINGGNGGFFKSTKG 649
Query: 637 LRQGDPLSPFLFNLCVNGLSCLLNQLLDDSNFSGVRIGPGLALNHLQFADDTLLFCENDE 696
LRQGDPLSP+LF L + S LL+ + L+++HL FADD ++F +
Sbjct: 650 LRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSISHLMFADDVMIFFDGGS 709
Query: 697 NQIDLLCNTLLSFLFTSGLKVNLHKSTIIGCNVDVRETQRVAGMYGWSIGQLPILYLGAP 756
+ +C TL F SGLKVN KS + ++ E+ A YG+ IG LPI YLG P
Sbjct: 710 FSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLESNANAA-YGFPIGTLPIRYLGLP 768
Query: 757 LGGNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKV 816
L R + +EP+L+K+ + R+W +K +S +GR+ L+ + + F M+ F P
Sbjct: 769 LMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGC 828
Query: 817 VGEVEKILRSFLQGGKLHGYHG 838
+ +E + FL G + G
Sbjct: 829 IKRIESLCSRFLWSGNIEQAKG 850
>UniRef100_O22148 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1374
Score = 352 bits (904), Expect = 4e-95
Identities = 243/849 (28%), Positives = 420/849 (48%), Gaps = 32/849 (3%)
Query: 2 RWVSWNVRGLGRAEKRRAVVEALSGLKPEIVLVQESKLDSSRVKTVEAFANAVKMDYVSV 61
R +SWN +G+G R + E PE++ + E+K + ++ V D +V
Sbjct: 2 RILSWNCQGVGNTPTVRHLREIRGLYFPEVIFLCETKKRRNYLENVVGHLGF--FDLHTV 59
Query: 62 DAVGSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMNVYGPNVDSERYDF 121
+ +G +GGL +W S ++ + R I LL + ++ +YG V +ER +
Sbjct: 60 EPIGKSGGLALMWKD-SVQIKVLQSDKRLIDALL--IWQDKEFYLTCIYGEPVQAERGEL 116
Query: 122 FNLLSEALSLNNCFCKIMGGDFNAILNEGERIGIQMGVESS---FSDFVADSNLIDLPLQ 178
+ L+ L L+ ++ GDFN +++ E+IG ESS F + L ++
Sbjct: 117 WERLTR-LGLSRSGPWMLTGDFNELVDPSEKIGGPARKESSCLEFRQMLNSCGLWEVNHS 175
Query: 179 SDKFTWYSSRAGGLWS-RIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSLGAINFG 237
+F+WY +R L R+DR + ++ + F + SD P+I +L N+
Sbjct: 176 GYQFSWYGNRNDELVQCRLDRTVANQAWMELFPQAKATYLQKICSDHSPLINNLVGDNWR 235
Query: 238 P-KPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVS 296
F + W+ + F+ L+ ++W +T + ++ +K+ + I +W+ + S
Sbjct: 236 KWAGFKYDKRWVQREGFKDLLCNFWSQQSTK--TNALMMEKIASCRREISKWKRVSKPSS 293
Query: 297 RVRINTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEYRKEEYRWRQKARVRWLRE 356
VRI ++ L ++ + + K L+ QEY EE W++K+R+ W+R
Sbjct: 294 AVRIQELQFKLDAATKQIPFDRRELARLKKELS------QEYNNEEQFWQEKSRIMWMRN 347
Query: 357 GDRNTSFFHSISKIRKSKKLISQLVVGG----VSIEDPNMIKDAIFSHFQSFFTSENRLR 412
GDRNT +FH+ +K R+++ I +L+ S ED + +A +F+ F SE+
Sbjct: 348 GDRNTKYFHAATKNRRAQNRIQKLIDEEGREWTSDEDLGRVAEA---YFKKLFASEDVGY 404
Query: 413 PKIRCANLSRL-SQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWGIVK 471
NL+ L S +L ++ E+ S + ++ PG DG N ++QFW +
Sbjct: 405 TVEELENLTPLVSDQMNNNLLAPITKEEVQRATFSINPHKCPGPDGMNGFLYQQFWETMG 464
Query: 472 EDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVLASR 531
+ + + + F +G + +G+N T I LIPK ++DFRPISL YK+I K++A+R
Sbjct: 465 DQITEMVQAFFRSGSIEEGMNKTNICLIPKILKAEKMTDFRPISLCNVIYKVIGKLMANR 524
Query: 532 LQVVTPLLISENQFAFAQGRQISDCILIANEVVDFL--NKKQGGGFL-LKLDFAKDYDNI 588
L+ + P LISE Q AF +GR ISD ILIA+E++ L N K F+ +K D +K YD +
Sbjct: 525 LKKILPSLISETQAAFVKGRLISDNILIAHELLHALSSNNKCSEEFIAIKTDISKAYDRV 584
Query: 589 DWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLSPFLF 648
+W FL ++ + F D WI I CV + +L+NG+ G +GLRQGDPLSP+LF
Sbjct: 585 EWPFLEKAMRGLGFADHWIRLIMECVKSVRYQVLINGTPHGEIIPSRGLRQGDPLSPYLF 644
Query: 649 NLCVNGLSCLLNQLLDDSNFSGVRIGPGL-ALNHLQFADDTLLFCENDENQIDLLCNTLL 707
+C L +L + +G+++ G ++HL FADD++ +C+ ++ + + +
Sbjct: 645 VICTEMLVKMLQSAEQKNQITGLKVARGAPPISHLLFADDSMFYCKVNDEALGQIIRIIE 704
Query: 708 SFLFTSGLKVNLHKSTI-IGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLGGNPRRASF 766
+ SG +VN KS+I G ++ V G +YLG P + +
Sbjct: 705 EYSLASGQRVNYLKSSIYFGKHISEERRCLVKRKLGIEREGGEGVYLGLPESFQGSKVAT 764
Query: 767 WEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVGEVEKILRS 826
+ D+L +K W S F+S G+ L+KA ++P + M+ FK P + ++E ++
Sbjct: 765 LSYLKDRLGKKVLGWQSNFLSPGGKEILLKAVAMALPTYTMSCFKIPKTICQQIESVMAE 824
Query: 827 FLQGGKLHG 835
F K G
Sbjct: 825 FWWKNKKEG 833
>UniRef100_Q9SN65 Hypothetical protein F25I24.40 [Arabidopsis thaliana]
Length = 1294
Score = 343 bits (881), Expect = 2e-92
Identities = 242/781 (30%), Positives = 401/781 (50%), Gaps = 35/781 (4%)
Query: 65 GSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMNVYGPNVDSERYDFFNL 124
G +GGL LW S + + ++ R I + + +++ ++ VYG SER+ +
Sbjct: 424 GHSGGLALLWKD-SVRLSNLYQDDRHIDVHIS--INNINFYLSRVYGHPCQSERHSLWTH 480
Query: 125 LSEALSLNNCFCKIMGGDFNAILNEGERIGIQMGVESSFSDF---VADSNLIDLPLQSDK 181
E LS I+ GDFN IL+ E+IG E +F F V+ +L D+ D+
Sbjct: 481 F-ENLSKTRNDPWILIGDFNEILSNNEKIGGPQRDEWTFRGFRNMVSTCDLKDIRSIGDR 539
Query: 182 FTWYSSR-AGGLWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSLGAINFGP-K 239
F+W R + + +DR ++ + F + SD +P+ LSL +
Sbjct: 540 FSWVGERHSHTVKCCLDRAFINSEGAFLFPFAELEFLEFTGSDHKPLFLSLEKTETRKMR 599
Query: 240 PFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVSRVR 299
PF F L F+ V W A + L +++ + + + + ++ SR+R
Sbjct: 600 PFRFDKRLLEVPHFKTYVKAGWNKA--INGQRKHLPDQVRTCRQAMAKLKHKSNLNSRIR 657
Query: 300 INTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEYRKEEYRWRQKARVRWLREGDR 359
IN ++A L M + E R+ I L YR EE W+QK+R +W++EGDR
Sbjct: 658 INQLQAALDKAMSSVNRTE-----RRTISHIQRELTVAYRDEERYWQQKSRNQWMKEGDR 712
Query: 360 NTSFFHSISKIRKSKKLISQLVVGGVSIEDPNMIKD--AIFSHFQSFFTS--ENRLRPK- 414
NT FFH+ +K R S +++LV + E+ + + I H Q FFT E+ RP
Sbjct: 713 NTEFFHACTKTRFS---VNRLVT--IKDEEGMIYRGDKEIGVHAQEFFTKVYESNGRPVS 767
Query: 415 -IRCANLSRL--SQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWGIVK 471
I A + Q N L K S+ EI+ + ++APG DG F+K W IV
Sbjct: 768 IIDFAGFKPIVTEQIND-DLTKDLSDLEIYNAICHIGDDKAPGPDGLTARFYKSCWEIVG 826
Query: 472 EDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVLASR 531
DV+K + F T + + +N T I +IPK P T+SD+RPI+L YK+ISK L R
Sbjct: 827 PDVIKEVKIFFRTSYMKQSINHTNICMIPKITNPETLSDYRPIALCNVLYKIISKCLVER 886
Query: 532 LQVVTPLLISENQFAFAQGRQISDCILIANEVVDFL--NKKQGGGFL-LKLDFAKDYDNI 588
L+ ++S++Q AF GR ++D ++IA+E++ L K+ ++ +K D +K YD +
Sbjct: 887 LKGHLDAIVSDSQAAFIPGRLVNDNVMIAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDRV 946
Query: 589 DWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLSPFLF 648
+W FL ++ F + WI WI V + S+LVNG G + ++G+RQGDPLSP+LF
Sbjct: 947 EWNFLETTMRLFGFSETWIKWIMGAVKSVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLF 1006
Query: 649 NLCVNGLSCLLNQLLDDSNFSGVRIGPGL-ALNHLQFADDTLLFCENDENQIDLLCNTLL 707
LC + L+ L+ + + + G+RIG G+ + HLQFADD+L FC+++ L +
Sbjct: 1007 ILCADILNHLIKNRVAEGDIRGIRIGNGVPGVTHLQFADDSLFFCQSNVRNCQALKDVFD 1066
Query: 708 SFLFTSGLKVNLHKSTI-IGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLGGNPRRASF 766
+ + SG K+N+ KS I G V R+ + G YLG P ++
Sbjct: 1067 VYEYYSGQKINMSKSMITFGSRVHGTTQNRLKNILGIQSHGGGGKYLGLPEQFGRKKRDM 1126
Query: 767 WEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVGEVEKILRS 826
+ ++++++++ +W++K++S +G+ ++K+ S+P++ M+ FK PL +V E+E +L +
Sbjct: 1127 FNYIIERVKKRTSSWSAKYLSPAGKEIMLKSVAMSMPVYAMSCFKLPLNIVSEIEALLMN 1186
Query: 827 F 827
F
Sbjct: 1187 F 1187
>UniRef100_O81630 F8M12.22 protein [Arabidopsis thaliana]
Length = 1662
Score = 343 bits (881), Expect = 2e-92
Identities = 242/781 (30%), Positives = 401/781 (50%), Gaps = 35/781 (4%)
Query: 65 GSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMNVYGPNVDSERYDFFNL 124
G +GGL LW S + + ++ R I + + +++ ++ VYG SER+ +
Sbjct: 444 GHSGGLALLWKD-SVRLSNLYQDDRHIDVHIS--INNINFYLSRVYGHPCQSERHSLWTH 500
Query: 125 LSEALSLNNCFCKIMGGDFNAILNEGERIGIQMGVESSFSDF---VADSNLIDLPLQSDK 181
E LS I+ GDFN IL+ E+IG E +F F V+ +L D+ D+
Sbjct: 501 F-ENLSKTRNDPWILIGDFNEILSNNEKIGGPQRDEWTFRGFRNMVSTCDLKDIRSIGDR 559
Query: 182 FTWYSSR-AGGLWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSLGAINFGP-K 239
F+W R + + +DR ++ + F + SD +P+ LSL +
Sbjct: 560 FSWVGERHSHTVKCCLDRAFINSEGAFLFPFAELEFLEFTGSDHKPLFLSLEKTETRKMR 619
Query: 240 PFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVSRVR 299
PF F L F+ V W A + L +++ + + + + ++ SR+R
Sbjct: 620 PFRFDKRLLEVPHFKTYVKAGWNKA--INGQRKHLPDQVRTCRQAMAKLKHKSNLNSRIR 677
Query: 300 INTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEYRKEEYRWRQKARVRWLREGDR 359
IN ++A L M + E R+ I L YR EE W+QK+R +W++EGDR
Sbjct: 678 INQLQAALDKAMSSVNRTE-----RRTISHIQRELTVAYRDEERYWQQKSRNQWMKEGDR 732
Query: 360 NTSFFHSISKIRKSKKLISQLVVGGVSIEDPNMIKD--AIFSHFQSFFTS--ENRLRPK- 414
NT FFH+ +K R S +++LV + E+ + + I H Q FFT E+ RP
Sbjct: 733 NTEFFHACTKTRFS---VNRLVT--IKDEEGMIYRGDKEIGVHAQEFFTKVYESNGRPVS 787
Query: 415 -IRCANLSRL--SQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWGIVK 471
I A + Q N L K S+ EI+ + ++APG DG F+K W IV
Sbjct: 788 IIDFAGFKPIVTEQIND-DLTKDLSDLEIYNAICHIGDDKAPGPDGLTARFYKSCWEIVG 846
Query: 472 EDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVLASR 531
DV+K + F T + + +N T I +IPK P T+SD+RPI+L YK+ISK L R
Sbjct: 847 PDVIKEVKIFFRTSYMKQSINHTNICMIPKITNPETLSDYRPIALCNVLYKIISKCLVER 906
Query: 532 LQVVTPLLISENQFAFAQGRQISDCILIANEVVDFL--NKKQGGGFL-LKLDFAKDYDNI 588
L+ ++S++Q AF GR ++D ++IA+E++ L K+ ++ +K D +K YD +
Sbjct: 907 LKGHLDAIVSDSQAAFIPGRLVNDNVMIAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDRV 966
Query: 589 DWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLSPFLF 648
+W FL ++ F + WI WI V + S+LVNG G + ++G+RQGDPLSP+LF
Sbjct: 967 EWNFLETTMRLFGFSETWIKWIMGAVKSVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLF 1026
Query: 649 NLCVNGLSCLLNQLLDDSNFSGVRIGPGL-ALNHLQFADDTLLFCENDENQIDLLCNTLL 707
LC + L+ L+ + + + G+RIG G+ + HLQFADD+L FC+++ L +
Sbjct: 1027 ILCADILNHLIKNRVAEGDIRGIRIGNGVPGVTHLQFADDSLFFCQSNVRNCQALKDVFD 1086
Query: 708 SFLFTSGLKVNLHKSTI-IGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLGGNPRRASF 766
+ + SG K+N+ KS I G V R+ + G YLG P ++
Sbjct: 1087 VYEYYSGQKINMSKSMITFGSRVHGTTQNRLKNILGIQSHGGGGKYLGLPEQFGRKKRDM 1146
Query: 767 WEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVGEVEKILRS 826
+ ++++++++ +W++K++S +G+ ++K+ S+P++ M+ FK PL +V E+E +L +
Sbjct: 1147 FNYIIERVKKRTSSWSAKYLSPAGKEIMLKSVAMSMPVYAMSCFKLPLNIVSEIEALLMN 1206
Query: 827 F 827
F
Sbjct: 1207 F 1207
>UniRef100_Q7DLW4 Non-ltr retrotransposon reverse transcriptase-like protein
[Arabidopsis thaliana]
Length = 893
Score = 343 bits (879), Expect = 3e-92
Identities = 273/865 (31%), Positives = 427/865 (48%), Gaps = 58/865 (6%)
Query: 6 WNVRGLGRAEKRRAVVEALSGLKPEIVLVQESKLDSSRVKTVEAFANAVK-MDYVSVDAV 64
WNVRG + RR + KP + E+ + + K + +N + +V
Sbjct: 8 WNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEK--KFISNLLPGWSFVENYEF 65
Query: 65 GSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMN-VYGPNVDSERYDFFN 123
G + LW+P S + ++++ I L L DS F+++ VY N + R + +N
Sbjct: 66 SVLGKIWVLWDP-SVKVVVIGRSLQMITCEL-LLPDSPSWFVVSIVYASNEEGTRKELWN 123
Query: 124 LLSEALSLNNCFCK---IMGGDFNAILNEGERIGIQMGVE-SSFSDFVADSNLIDLPLQS 179
L + L+L+ I+ GDFN ILN I +G + +F + DS+L DL +
Sbjct: 124 ELVQ-LALSPVVVGRSWIVLGDFNQILNPESAINANIGRKIRAFRSCLLDSDLYDLVYKG 182
Query: 180 DKFTWYSSRAGG-LWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSLG-AINFG 237
+TW++ + L +IDR LV++ F + SD + L A+
Sbjct: 183 SSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEVVLDPAVLKA 242
Query: 238 PKPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVSR 297
+PF F+N++L + +F +L+ + W S G + + + +KLK LK I SR
Sbjct: 243 KRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPI-------CCFSR 295
Query: 298 VRINTIEADLQVTMDRLASEEPGIELRKNRLTI-----------LEGL--WQEYRK-EEY 343
+ IE + SE I L + R+T+ LE WQ K EE
Sbjct: 296 ENYSDIE--------KRVSEAHAIVLHRQRITLTNPSVVHATLELEATRKWQILAKAEES 347
Query: 344 RWRQKARVRWLREGDRNTSFFHSISKIRKSKKLISQLVVG-GVSIEDPNMIKDAIFSHFQ 402
+ QK+ + WL EGD NT++FH ++ +RKS I+ L+ G IE IK+ I H
Sbjct: 348 FFCQKSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSC 407
Query: 403 SFFTS-------ENRLRPKIRCANLS-RLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPG 454
+FF S EN L LS R S LE+ FS+ +I E S N+A G
Sbjct: 408 NFFESLLCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASG 467
Query: 455 TDGFNLNFFKQFWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPI 514
DG++ FFK W +V +V + ++F +G+L+K NAT + LIPK S ++DFRPI
Sbjct: 468 PDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPI 527
Query: 515 SLIGSAYKLISKVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFLNKKQ-GG 573
S + + YK+I+K+L SRL+ + +IS +Q AF GR +S+ +L+A E+V N K
Sbjct: 528 SCLNTLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISS 587
Query: 574 GFLLKLDFAKDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEM 633
+LK+D K +D++ W F+ + + ++++ WI C+ST S++VNGSS+GFF+
Sbjct: 588 RGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKS 647
Query: 634 EKGLRQGDPLSPFLFNLCVNGLSCLLNQLLDDSNFSGVRIGPGLALNHLQFADDTLLFCE 693
KGLRQGDPLSP+LF L + S LL D L+++HL FADD ++F +
Sbjct: 648 NKGLRQGDPLSPYLFVLAMEVFSSLLKARFDAGYIQYHPKTADLSISHLMFADDVMVFFD 707
Query: 694 NDENQIDLLCNTLLSFLFTSGLKVNLHKSTIIGCNVDVRETQRVAGMYGWSIGQLPILYL 753
+ + + L F SGL VN K+ + D E ++ YG+ I LPI YL
Sbjct: 708 GGSSSLHGISEALDDFASWSGLHVNKDKTNLYLAGTDEVEALAIS-HYGFPISTLPIRYL 766
Query: 754 GAPLGGNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAP 813
G PL + S +E L ++ R+W K +S +GR+ L+ + + + F M+ F
Sbjct: 767 GLPLMSRKLKISEYE-----LVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLL 821
Query: 814 LKVVGEVEKILRSFLQGGKLHGYHG 838
L V ++E + FL G + G
Sbjct: 822 LGCVKKIESLCSRFLWSGSIDASKG 846
>UniRef100_Q9LRU8 Non-LTR retroelement reverse transcriptase-like protein
[Arabidopsis thaliana]
Length = 893
Score = 342 bits (878), Expect = 4e-92
Identities = 273/865 (31%), Positives = 427/865 (48%), Gaps = 58/865 (6%)
Query: 6 WNVRGLGRAEKRRAVVEALSGLKPEIVLVQESKLDSSRVKTVEAFANAVK-MDYVSVDAV 64
WNVRG + RR + KP + E+ + + K + +N + +V
Sbjct: 8 WNVRGFNISSHRRGFKKWFLLNKPLFGGLIETHVKQPKEK--KFISNLLPGWSFVENYEF 65
Query: 65 GSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMN-VYGPNVDSERYDFFN 123
G + LW+P S + ++++ I L L DS F+++ VY N + R + +N
Sbjct: 66 SVLGKIWVLWDP-SVKVVVIGRSLQMITCEL-LLPDSPSWFVVSIVYASNEEGTRKELWN 123
Query: 124 LLSEALSLNNCFCK---IMGGDFNAILNEGERIGIQMGVE-SSFSDFVADSNLIDLPLQS 179
L + L+L+ I+ GDFN ILN I +G + +F + DS+L DL +
Sbjct: 124 ELVQ-LALSPVVVGRSWIVLGDFNQILNPESAINANIGRKIRAFRSCLLDSDLYDLVYKG 182
Query: 180 DKFTWYSSRAGG-LWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSLG-AINFG 237
+TW++ + L +IDR LV++ F + SD + L A+
Sbjct: 183 SSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEVVLDPAVLKA 242
Query: 238 PKPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVSR 297
+PF F+N++L + +F +L+ + W S G + + + +KLK LK I SR
Sbjct: 243 KRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPI-------CCFSR 295
Query: 298 VRINTIEADLQVTMDRLASEEPGIELRKNRLTI-----------LEGL--WQEYRK-EEY 343
+ IE + SE I L + R+T+ LE WQ K EE
Sbjct: 296 ENYSDIE--------KRVSEAHAIVLHRQRITLTNPSVVHATLELEATRKWQILAKAEES 347
Query: 344 RWRQKARVRWLREGDRNTSFFHSISKIRKSKKLISQLVVG-GVSIEDPNMIKDAIFSHFQ 402
+ QK+ + WL EGD NT++FH ++ +RKS I+ L+ G IE IK+ I H
Sbjct: 348 FFCQKSSISWLYEGDNNTAYFHKMADMRKSINTINFLIDDFGERIETQQGIKEGIKEHSC 407
Query: 403 SFFTS-------ENRLRPKIRCANLS-RLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPG 454
+FF S EN L LS R S LE+ FS+ +I E S N+A G
Sbjct: 408 NFFESLLCGVEGENSLAQSDMNLLLSFRCSVDQINDLERSFSDLDIQEAFFSLPRNKASG 467
Query: 455 TDGFNLNFFKQFWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPI 514
DG++ FFK W +V +V + ++F +G+L+K NAT + LIPK S ++DFRPI
Sbjct: 468 PDGYSSEFFKGVWFVVGPEVTEAVQEFFRSGQLLKQWNATTLVLIPKITNSSKMTDFRPI 527
Query: 515 SLIGSAYKLISKVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFLNKKQ-GG 573
S + + YK+I+K+L SRL+ + +IS +Q AF GR +S+ +L+A E+V N K
Sbjct: 528 SCLNTLYKVIAKLLTSRLKKLLNEVISPSQSAFLPGRLLSENVLLATEIVHGYNTKNISS 587
Query: 574 GFLLKLDFAKDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEM 633
+LK+D K +D++ W F+ + + ++++ WI C+ST S++VNGSS+GFF+
Sbjct: 588 RGMLKVDLRKAFDSVRWDFIISAFRALAVPEKFVCWINQCISTPYFSVMVNGSSSGFFKS 647
Query: 634 EKGLRQGDPLSPFLFNLCVNGLSCLLNQLLDDSNFSGVRIGPGLALNHLQFADDTLLFCE 693
KGLRQGDPLSP+LF L + S LL D L+++HL FADD ++F +
Sbjct: 648 NKGLRQGDPLSPYLFVLAMEVFSSLLKARFDAGYIHYHPKTADLSISHLMFADDVMVFFD 707
Query: 694 NDENQIDLLCNTLLSFLFTSGLKVNLHKSTIIGCNVDVRETQRVAGMYGWSIGQLPILYL 753
+ + + L F SGL VN K+ + D E ++ YG+ I LPI YL
Sbjct: 708 GGSSSLHGISEALDDFASWSGLHVNKDKTNLYLAGTDEVEALAIS-HYGFPISTLPIRYL 766
Query: 754 GAPLGGNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAP 813
G PL + S +E L ++ R+W K +S +GR+ L+ + + + F M+ F
Sbjct: 767 GLPLMSRKLKISEYE-----LVKRFRSWAVKSLSFAGRVQLITSVITGLVNFWMSTFVLL 821
Query: 814 LKVVGEVEKILRSFLQGGKLHGYHG 838
L V ++E + FL G + G
Sbjct: 822 LGCVKKIESLCSRFLWSGSIDASKG 846
>UniRef100_Q9SZ87 RNA-directed DNA polymerase-like protein [Arabidopsis thaliana]
Length = 1274
Score = 340 bits (873), Expect = 1e-91
Identities = 235/732 (32%), Positives = 364/732 (49%), Gaps = 32/732 (4%)
Query: 112 PNVDSERYDFFNLLSEALSLNNCFCKIMGGDFNAILNEGERIGIQMGVES---SFSDFVA 168
PN R F++ +S +L ++ GDFN IL+ E+ G + E +F FV+
Sbjct: 51 PNFIDNRSVFWDKIS-SLGAQRSSAWLLTGDFNDILDNSEKQGGPLRWEGFFLAFRSFVS 109
Query: 169 DSNLIDLPLQSDKFTWYSSRAGG-LWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRPI 227
+ L D+ + +W +R + SR+DR L + F + SD RP+
Sbjct: 110 QNGLWDINHTGNSLSWRGTRYSHFIKSRLDRALGNCSWSELFPMSKCEYLRFEGSDHRPL 169
Query: 228 ILSLGAINFG-PKPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIK 286
+ GA KPF F +E LV + W A + K+ + I
Sbjct: 170 VTYFGAPPLKRSKPFRFDRRLREKEEIRALVKEVWELARQDS-----VLYKISRCRQSII 224
Query: 287 EWQGYASSVSRVRINTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEYRKEEYRWR 346
+W +S S I + L+ L+++ P L + I + L YR+EE W+
Sbjct: 225 KWTKEQNSNSAKAIKKAQQALE---SALSADIPDPSLIGS---ITQELEAAYRQEELFWK 278
Query: 347 QKARVRWLREGDRNTSFFHSISKIRKSKKLISQLVVG-GVSIEDPNMIKDAIFSHFQSFF 405
Q +RV+WL GDRN +FH+ ++ R+ +S + G G + I I S+FQ+ F
Sbjct: 279 QWSRVQWLNSGDRNKGYFHATTRTRRMLNNLSVIEDGSGQEFHEEEQIASTISSYFQNIF 338
Query: 406 TSENRLRPKIRCANLSRLSQAN-KMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFK 464
T+ N ++ LS + ++ L K S EI E L S ++APG DGF+ +FF
Sbjct: 339 TTSNNSDLQVVQEALSPIISSHCNEELIKISSLLEIKEALFSISADKAPGPDGFSASFFH 398
Query: 465 QFWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLI 524
+W I++ DV + F L LN T ++LIPK + P VSD+RPI+L YK++
Sbjct: 399 AYWDIIEADVSRDIRSFFVDSCLSPRLNETHVTLIPKISAPRKVSDYRPIALCNVQYKIV 458
Query: 525 SKVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFL---NKKQGGGFLLKLDF 581
+K+L RLQ LIS +Q AF GR I+D +LI +E++ FL K+ +K D
Sbjct: 459 AKILTRRLQPWLSELISLHQSAFVPGRAIADNVLITHEILHFLRVSGAKKYCSMAIKTDM 518
Query: 582 AKDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGD 641
+K YD I W FL ++L + F D+WI W+ CV T + S L+NGS G +GLRQGD
Sbjct: 519 SKAYDRIKWNFLQEVLMRLGFHDKWIRWVMQCVCTVSYSFLINGSPQGSVVPSRGLRQGD 578
Query: 642 PLSPFLFNLCVNGLSCLLNQLLDDSNFSGVRIGPGL-ALNHLQFADDTLLFCENDENQID 700
PLSP+LF LC LS L + + G+R+ G +NHL FADDT+ FC+ +
Sbjct: 579 PLSPYLFILCTEVLSGLCRKAQEKGVMVGIRVARGSPQVNHLLFADDTMFFCKTNPTCCG 638
Query: 701 LLCNTLLSFLFTSGLKVNLHKSTIIGCNVDVRETQRVAGM-----YGWSIGQLPILYLGA 755
L N L + SG +NL KS I + ++ +R + IG+ YLG
Sbjct: 639 ALSNILKKYELASGQSINLAKSAITFSSKTPQDIKRRVKLSLRIDNEGGIGK----YLGL 694
Query: 756 PLGGNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLK 815
P R+ + ++D++R++ +W+ +F+S +G+ L+KA L S+P + M FK P
Sbjct: 695 PEHFGRRKRDIFSSIVDRIRQRSHSWSIRFLSSAGKQILLKAVLSSMPSYAMMCFKLPAS 754
Query: 816 VVGEVEKILRSF 827
+ +++ +L F
Sbjct: 755 LCKQIQSVLTRF 766
>UniRef100_Q9SKJ4 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1750
Score = 335 bits (859), Expect = 6e-90
Identities = 237/792 (29%), Positives = 406/792 (50%), Gaps = 57/792 (7%)
Query: 65 GSAGGLISLW-NPASYAMESAVKNVRFIGLLLRALVDSSLLF------IMNVYGPNVDSE 117
G +GGL +W N S ++ S + L+DS + F + VYG SE
Sbjct: 444 GRSGGLALMWKNNVSLSLISQDER----------LIDSHVTFNNKSFYLSCVYGHPTQSE 493
Query: 118 RYDFFNLLSEALSLNNCFCKIMGGDFNAILNEGERIGIQMGVESSFSDF---VADSNLID 174
R+ + L E +S N ++ GDFN IL+ E+IG M E +F +F V+ ++ D
Sbjct: 494 RHQLWQTL-EHISDNRNAEWLLVGDFNEILSNAEKIGGPMREEWTFRNFRNMVSHCDIED 552
Query: 175 LPLQSDKFTWYSSR-AGGLWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSLG- 232
+ + D+F+W R + +DR ++ F ++ SD +P+++
Sbjct: 553 MRSKGDRFSWVGERHTHTVKCCLDRVFINSAWTATFPYAEIEFLDFTGSDHKPVLVHFNE 612
Query: 233 AINFGPKPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYA 292
+ K F F N + F+++V W T + + +++ + + + +
Sbjct: 613 SFPRRSKLFRFDNRLIDIPTFKRIVQTSW--RTNRNSRSTPITERISSCRQAMARLKHAS 670
Query: 293 SSVSRVRINTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEYRKEEYRWRQKARVR 352
+ S RI +++ L M+ + R+ + E L + + EE W+QK+R +
Sbjct: 671 NLNSEQRIKKLQSSLNRAMESTRRVD-----RQLIPQLQESLAKAFSDEEIYWKQKSRNQ 725
Query: 353 WLREGDRNTSFFHSISKIRKSKKLISQLV--VGGVSIEDPNMIKDAIFSHFQSFFT---S 407
W++EGD+NT +FH+ +K R S+ ++ ++ G + D I +H Q FFT S
Sbjct: 726 WMKEGDQNTGYFHACTKTRYSQNRVNTIMDDQGRMFTGDKE-----IGNHAQDFFTNIFS 780
Query: 408 ENRLRPK-IRCANL-SRLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQ 465
N ++ I A+ S ++ + L K+FS+ EI++ + ++APG DG F+K
Sbjct: 781 TNGIKVSPIDFADFKSTVTNTVNLDLTKEFSDTEIYDAICQIGDDKAPGPDGLTARFYKN 840
Query: 466 FWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLIS 525
W IV DV+ + F T + +N T I +IPK P+T+SD+RPI+L YK+IS
Sbjct: 841 CWDIVGYDVILEVKKFFETSFMKPSINHTNICMIPKITNPTTLSDYRPIALCNVLYKVIS 900
Query: 526 KVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFLNKKQGGG---FLLKLDFA 582
K L +RL+ ++S++Q AF GR I+D ++IA+EV+ L ++ +K D +
Sbjct: 901 KCLVNRLKSHLNSIVSDSQAAFIPGRIINDNVMIAHEVMHSLKVRKRVSKTYMAVKTDVS 960
Query: 583 KDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDP 642
K YD ++W FL ++ F ++WI WI V + S+L+NGS G+ +G+RQGDP
Sbjct: 961 KAYDRVEWDFLETTMRLFGFCNKWIGWIMAAVKSVHYSVLINGSPHGYITPTRGIRQGDP 1020
Query: 643 LSPFLFNLCVNGLSCLLNQLLDDSNFSGVRIGPGL-ALNHLQFADDTLLFCENDENQIDL 701
LSP+LF LC + LS L+N + GVRIG G A+ HLQFADD+L FC+ +
Sbjct: 1021 LSPYLFILCGDILSHLINGRASSGDLRGVRIGNGAPAITHLQFADDSLFFCQANVRNCQA 1080
Query: 702 LCNTLLSFLFTSGLKVNLHKSTI-IGCNVDVRETQRVAGMYGWSIGQLPI-----LYLGA 755
L + + + SG K+N+ KS I G V R+ I ++P YLG
Sbjct: 1081 LKDVFDVYEYYSGQKINVQKSMITFGSRVYGSTQSRLK-----QILEIPNQGGGGKYLGL 1135
Query: 756 PLGGNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLK 815
P ++ +E ++D+++++ TW+++F+S +G+ ++K+ ++P++ M+ FK P
Sbjct: 1136 PEQFGRKKKEMFEYIIDRVKKRTSTWSARFLSPAGKEIMLKSVALAMPVYAMSCFKLPKG 1195
Query: 816 VVGEVEKILRSF 827
+V E+E +L +F
Sbjct: 1196 IVSEIESLLMNF 1207
>UniRef100_Q9SIQ5 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1524
Score = 335 bits (859), Expect = 6e-90
Identities = 235/793 (29%), Positives = 405/793 (50%), Gaps = 59/793 (7%)
Query: 65 GSAGGLISLW-NPASYAMESAVKNVRFIGLLLRALVDSSLLF------IMNVYGPNVDSE 117
G +GGL +W N S ++ S + L+DS + F + VYG SE
Sbjct: 218 GRSGGLALMWKNNVSLSLISQDER----------LIDSHVTFNNKSFYLSCVYGHPTQSE 267
Query: 118 RYDFFNLLSEALSLNNCFCKIMGGDFNAILNEGERIGIQMGVESSFSDF---VADSNLID 174
R+ + L E +S N ++ GDFN IL+ E+IG M E +F +F V+ ++ D
Sbjct: 268 RHQLWQTL-EHISDNRNAEWLLVGDFNEILSNAEKIGGPMREEWTFRNFRNMVSHCDIED 326
Query: 175 LPLQSDKFTWYSSR-AGGLWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSLG- 232
+ + D+F+W R + +DR ++ F ++ SD +P+++
Sbjct: 327 MRSKGDRFSWVGERHTHTVKCCLDRVFINSAWTATFPYAETEFLDFTGSDHKPVLVHFNE 386
Query: 233 AINFGPKPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYA 292
+ K F F N + F+++V W T + + +++ + + + +
Sbjct: 387 SFPRRSKLFRFDNRLIDIPTFKRIVQTSW--RTNRNSRSTPITERISSCRQAMARLKHAS 444
Query: 293 SSVSRVRINTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEYRKEEYRWRQKARVR 352
+ S RI +++ L M+ + R+ + E L + + EE W+QK+R +
Sbjct: 445 NLNSEQRIKKLQSSLNRAMESTRRVD-----RQLIPQLQESLAKAFSDEEIYWKQKSRNQ 499
Query: 353 WLREGDRNTSFFHSISKIRKSKKLISQLV--VGGVSIEDPNMIKDAIFSHFQSFFT---S 407
W++EGD+NT +FH+ +K R S+ ++ ++ G + D I +H Q FFT S
Sbjct: 500 WMKEGDQNTGYFHACTKTRYSQNRVNTIMDDQGRMFTGDKE-----IGNHAQDFFTNIFS 554
Query: 408 ENRLRPK-IRCANL-SRLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQ 465
N ++ I A+ S ++ + L K+FS+ EI++ + ++APG DG F+K
Sbjct: 555 TNGIKVSPIDFADFKSTVTNTVNLDLTKEFSDTEIYDAICQIGDDKAPGPDGLTARFYKN 614
Query: 466 FWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLIS 525
W IV DV+ + F T + +N T I +IPK P+T+SD+RPI+L YK+IS
Sbjct: 615 CWDIVGYDVILEVKKFFETSFMKPSINHTNICMIPKITNPTTLSDYRPIALCNVLYKVIS 674
Query: 526 KVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFLNKKQGGG---FLLKLDFA 582
K L +RL+ ++S++Q AF GR I+D ++IA+EV+ L ++ +K D +
Sbjct: 675 KCLVNRLKSHLNSIVSDSQAAFIPGRIINDNVMIAHEVMHSLKVRKRVSKTYMAVKTDVS 734
Query: 583 KDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDP 642
K YD ++W FL ++ F ++WI WI V + S+L+NGS G+ +G+RQGDP
Sbjct: 735 KAYDRVEWDFLETTMRLFGFCNKWIGWIMAAVKSVHYSVLINGSPHGYITPTRGIRQGDP 794
Query: 643 LSPFLFNLCVNGLSCLLNQLLDDSNFSGVRIGPGL-ALNHLQFADDTLLFCENDENQIDL 701
LSP+LF LC + LS L+N + GVRIG G A+ HLQFADD+L FC+ +
Sbjct: 795 LSPYLFILCGDILSHLINGRASSGDLRGVRIGNGAPAITHLQFADDSLFFCQANVRNCQA 854
Query: 702 LCNTLLSFLFTSGLKVNLHKSTI-IGCNV------DVRETQRVAGMYGWSIGQLPILYLG 754
L + + + SG K+N+ KS I G V +++ + G YLG
Sbjct: 855 LKDVFDVYEYYSGQKINVQKSMITFGSRVYGSTQSKLKQILEIPNQGGGG------KYLG 908
Query: 755 APLGGNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPL 814
P ++ +E ++D+++++ TW+++F+S +G+ ++K+ ++P++ M+ FK P
Sbjct: 909 LPEQFGRKKKEMFEYIIDRVKKRTSTWSARFLSPAGKEIMLKSVALAMPVYAMSCFKLPK 968
Query: 815 KVVGEVEKILRSF 827
+V E+E +L +F
Sbjct: 969 GIVSEIESLLMNF 981
>UniRef100_Q75GM6 Putative non-LTR retroelement reverse transcriptase [Oryza sativa]
Length = 1614
Score = 333 bits (854), Expect = 2e-89
Identities = 222/721 (30%), Positives = 372/721 (50%), Gaps = 22/721 (3%)
Query: 106 IMNVYGPNVDSERYDFFNLLSEALSLNNCFCKIMGGDFNA--ILNEGERIGIQMGVESSF 163
+++VYGP +S + F L +S ++GGDFN +E I + F
Sbjct: 725 VISVYGPVDNSLKNVFLEELRWHISGRE-HPVMLGGDFNLYKFASEKSNSNIDVRCMDMF 783
Query: 164 SDFVADSNLIDLPLQSDKFTWYSSRAGGLWSRIDRWLVSEDAINKF-DGISQSVENWGLS 222
+ F++D +L ++ + KFTW + + +DR LVS+D +++ + + SV G S
Sbjct: 784 NKFISDLDLREVHIIGPKFTWTNKQYCPTQEVLDRILVSDDWDDRYPNSLISSVLRVG-S 842
Query: 223 DRRPIILSLGAINFGP-KPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQ---QKL 278
D P++L + + F F + WL + F+ LV + +++L+ +K
Sbjct: 843 DHTPLLLDTYELVVEKCRYFRFESAWLAVEGFKDLVGKKFPDRD----GSYILEFWNKKQ 898
Query: 279 KGLKNRIKEWQGYASSVSRVRINTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEY 338
++ + W S R +++ L+ + E + ++R + + L Y
Sbjct: 899 VEMRRFLTGWGINKQSEDRRAKLSLQKKLEEIDSKAVDAELNADEWRHRYELEDALEHIY 958
Query: 339 RKEEYRWRQKARVRWLREGDRNTSFFHSISKIRKSKKLISQLVVGGVSIEDPNMIKDAIF 398
EE W ++ +WL EGDRNT ++H I+ RK + I+ L+ G + +K I
Sbjct: 959 ELEELYWHKRCGEQWLLEGDRNTEYYHRIANGRKKRCTITSLMDGDRELTTKEDLKHHIV 1018
Query: 399 SHFQSFFTSEN----RLRPKIRCANLSRLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPG 454
+++S F SE+ L + +L L + +K +L + F+ E+++VL+ N A G
Sbjct: 1019 EYYKSLFRSESSSSIHLSQGVWVDSLC-LKEEDKNTLMRPFTIEELYKVLREAKPNTAFG 1077
Query: 455 TDGFNLNFFKQFWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPI 514
DGF++ F++ FW ++ D+ + ++ +K LN ISLIPK++ P+ + FRPI
Sbjct: 1078 PDGFSIPFYRAFWPQLRPDLFEMLLMLYNEELDLKRLNFGVISLIPKNSNPTDIKQFRPI 1137
Query: 515 SLIGSAYKLISKVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFLNKKQGGG 574
++ +K ISK + +RL + +IS Q AF GR I + +I +EV+ +N+K G
Sbjct: 1138 CVLNDCFKFISKCVCNRLTEIARDVISPTQTAFIPGRFILEGCVIIHEVLHEMNRKNLEG 1197
Query: 575 FLLKLDFAKDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEME 634
+LK+DF K YD + W FL +++ F +W+NWI+TCV + I +NG T FF
Sbjct: 1198 IILKIDFEKAYDKVSWDFLIEVMVRKGFPSKWVNWIKTCVMGGRVCININGERTDFFRTF 1257
Query: 635 KGLRQGDPLSPFLFNLCVNGLSCLLNQLLDDSNFSGV--RIGPGLALNHLQFADDTLLFC 692
+GLRQGDPLSP LFNL + L+ +L+ + SG+ I PG + HLQ+ADDT+LF
Sbjct: 1258 RGLRQGDPLSPLLFNLISDALAAMLDSAKREGVLSGLVPDIFPG-GITHLQYADDTVLFV 1316
Query: 693 ENDENQIDLLCNTLLSFLFTSGLKVNLHKSTIIGCNVDVRETQRVAGMYGWSIGQLPILY 752
ND+ QI L F + LKVN HKS I + +T RVA M+ +GQ P+ Y
Sbjct: 1317 ANDDKQIVATKFILYCFEEMAVLKVNYHKSEIFTLGLSDNDTNRVAMMFNCPVGQFPMKY 1376
Query: 753 LGAPLGGNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKA 812
LG P+G + ++ + KL ++ +W + +S +GR + L S+P + M ++
Sbjct: 1377 LGLPIGPDKILNLGFDFLGQKLEKRLNSWGNN-LSHAGRAVQINTCLSSIPSYAMCFYQL 1435
Query: 813 P 813
P
Sbjct: 1436 P 1436
>UniRef100_Q9FW98 Putative non-LTR retroelement reverse transcriptase [Oryza sativa]
Length = 1382
Score = 326 bits (836), Expect = 3e-87
Identities = 261/856 (30%), Positives = 414/856 (47%), Gaps = 76/856 (8%)
Query: 4 VSW--NVRGLGRAEKRRAVVEALSGLKPEIVLVQESKLDSSRVKTVE---AFANAVKMDY 58
V+W N RGLG A + + L+P +V + E+K+ + + + F+ +
Sbjct: 7 VTWGRNCRGLGSAATVGELRWLVKSLRPSLVFLSETKMRDKQARNLMWSLGFSGSF---- 62
Query: 59 VSVDAVGSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMNVYGPNVDSER 118
+V G +GGL W A Y + N FI +L+ + + I VYG R
Sbjct: 63 -AVSCEGLSGGLALFWTTA-YTVSLRGFNSHFIDVLV-STEELPPWRISFVYGEPKRELR 119
Query: 119 YDFFNLLSEALSLNNCFCK--IMGGDFNAILNEGERIGIQMGVESSFSDF---VADSNLI 173
+ F+NLL L++ + + GDFN +L E +G++ E F + D LI
Sbjct: 120 HFFWNLLRR---LHDQWRGPWLCCGDFNEVLCLDEHLGMRERSEPHMQHFRSCLDDCGLI 176
Query: 174 DLPLQSDKFTWYSSRAGGLWS--RIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSL 231
DL KFTW + + S R+DR + + + F+ SD I + L
Sbjct: 177 DLGFVGPKFTWSNKQDANSNSKVRLDRAVANGEFSRYFEDCLVENVITTSSDHYAISIDL 236
Query: 232 GAINFGPKP------FLFYNHWLLDKEFEKLVSDWW--CSATTVG----WSAFVLQQKLK 279
N G + F F WL +++ ++V + W SA VG WS VLQQ
Sbjct: 237 SRRNHGQRRIPIQQGFRFEAAWLRAEDYREVVENSWRISSAGCVGLRGVWS--VLQQVAV 294
Query: 280 GLKNRIKEWQGYASSVSRVRINTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEYR 339
LK +W + R +I +E L+ ++ + +++ +L I + L + +
Sbjct: 295 SLK----DWSKASFGSVRRKILKMERKLKSLRQSPVND---VVIQEEKL-IEQQLCELFE 346
Query: 340 KEEYRWRQKARVRWLREGDRNTSFFHSISKIRKSKKLISQLVVG-GVSIEDPNMIKDAIF 398
KEE RQ++RV WLREGDRNT+FFH+ + R+ I +LV G IK
Sbjct: 347 KEEIMARQRSRVDWLREGDRNTAFFHARASARRRTNRIKELVRDDGSRCISQEGIKRMAE 406
Query: 399 SHFQSFFTSENRLRPKIRCANLSRLSQA--NKMS------LEKQFSEGEIWEVLKSCDGN 450
+++ F+SE C ++ + A NK+ L KQ++ EI L
Sbjct: 407 VFYENLFSSEP-------CDSMEEVLDAIPNKVGDFINGELGKQYTNEEIKTALFQMGST 459
Query: 451 RAPGTDGFNLNFFKQFWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSD 510
+APG DGF F++ WGI++E + F ++ +GL + + LIPK N S +S
Sbjct: 460 KAPGPDGFPALFYQTHWGILEEHICNAVRGFLLGEEIPEGLCDSVVVLIPKVNNASHLSK 519
Query: 511 FRPISLIGSAYKLISKVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFLNKK 570
FRPISL YK+ SKVLA+RL+ P ++SE Q AF GR I+D L+A E + + K+
Sbjct: 520 FRPISLCNVLYKIASKVLANRLKPFLPDIVSEFQSAFVPGRLITDSALVAYECLHTIRKQ 579
Query: 571 QGGG--FLLKLDFAKDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSST 628
F LK+D K YD ++W +L L ++ F WIN + CVS+ ++ +NG T
Sbjct: 580 HNKNPFFALKIDMMKAYDRVEWAYLSGCLSKLGFSQDWINTVMRCVSSVRYAVKINGELT 639
Query: 629 GFFEMEKGLRQGDPLSPFLFNLCVNGLSCLLNQLLDDSNFSGVRIG-PGLALNHLQFADD 687
+G+RQGDP+SP+LF LC GLSCLL++ G++ G G ++HL FADD
Sbjct: 640 KPVVPSRGIRQGDPISPYLFLLCTEGLSCLLHKKEVAGELQGIKNGRHGPPISHLLFADD 699
Query: 688 TLLFCENDENQIDLLCNTLLSFLFTSGLKVNLHKSTII-------GCNVDVRETQRVAGM 740
++ F + D + L NTL S+ SG K+NLHKS+I + V+ +V
Sbjct: 700 SIFFAKADSRNVQALKNTLRSYCSASGQKINLHKSSIFFGKRCPDAVKISVKSCLQVDNE 759
Query: 741 YGWSIGQLPILYLGAPLGGNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLC 800
L YLG P +F++ + +++ ++ W + +S +G T++KA
Sbjct: 760 V------LQDSYLGMPTEIGLATTNFFKFLPERIWKRVNGWTDRPLSRAGMETMLKAVAQ 813
Query: 801 SVPLFLMAIFKAPLKV 816
++P ++M+ F+ P+ +
Sbjct: 814 AIPNYVMSCFRIPVSI 829
>UniRef100_O81517 T24M8.8 protein [Arabidopsis thaliana]
Length = 1164
Score = 326 bits (836), Expect = 3e-87
Identities = 228/713 (31%), Positives = 364/713 (50%), Gaps = 30/713 (4%)
Query: 141 GDFNAILNEGERI---GIQMGVESS-FSDFVADSNLIDLPLQSDKFTWYSSRAGG-LWSR 195
GDFN IL+ E G + + F + + ++L DL + + FTW++ R+ + +
Sbjct: 40 GDFNQILHPSEHSTSDGFNVDRPTRIFRETILLASLTDLSFRGNTFTWWNKRSRAPVAKK 99
Query: 196 IDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSL-GAINFGPKPFLFYNHWLLDKEFE 254
+DR LV++ F SD LSL A KPF F N L D+ F
Sbjct: 100 LDRILVNDKWTTTFPSSLGLFGEPDFSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFL 159
Query: 255 KLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVSRVRINTIEADLQVTMDR- 313
L+ W S + G + + + KLK LK I+++ + S + T EA + + +
Sbjct: 160 SLICLKWFSTSVTGSAMYRVSVKLKALKKVIRDFS--RDNYSDIEKRTKEAHDALLLAQS 217
Query: 314 --LASEEPG---IELRKNRLTILEGLWQEYRKEEYRW-RQKARVRWLREGDRNTSFFHSI 367
LAS P IE R W+ + E + Q++RV WLREGD N+S+FH +
Sbjct: 218 VLLASPCPSNAAIEAETQRK------WRILAEAEASFFYQRSRVNWLREGDMNSSYFHKM 271
Query: 368 SKIRKSKKLISQLVVG-GVSIEDPNMIKDAIFSHFQSFFTSENRLRPKIRCANLSRL--- 423
+ R+S I L G IE +++ +FQS SE L P A++S L
Sbjct: 272 ASARQSLNHIHFLSDPVGDRIEGQQNLENHCVEYFQSNLGSEQGL-PLFEQADISNLLSY 330
Query: 424 --SQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWGIVKEDVVKFFEDF 481
S A ++SL+ FS +I S N+A G DGF+ FF W I+ +V + +F
Sbjct: 331 RCSPAQQVSLDTPFSSEQIKNAFFSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEF 390
Query: 482 HSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVLASRLQVVTPLLIS 541
++GKL+K NAT + LIPK S++SDFRPIS + + YK+ISK+L RL+ P IS
Sbjct: 391 FTSGKLLKQWNATNLVLIPKITNASSMSDFRPISCLNTVYKVISKLLTDRLKDFLPAAIS 450
Query: 542 ENQFAFAQGRQISDCILIANEVVDFLNKKQ-GGGFLLKLDFAKDYDNIDWGFLFDILKEM 600
+Q AF GR + +L+A E+V NKK +LK+D K +D++ W F+ L+ +
Sbjct: 451 HSQSAFMPGRLFLENVLLATELVHGYNKKNIAPSSMLKVDLRKAFDSVRWDFIVSALRAL 510
Query: 601 NFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLSPFLFNLCVNGLSCLLN 660
N +++ WI C+STA+ S+++NG S G F KGLRQGDP+SP+LF L + S LL
Sbjct: 511 NVPEKFTCWILECLSTASFSVILNGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQ 570
Query: 661 QLLDDSNFSGVRIGPGLALNHLQFADDTLLFCENDENQIDLLCNTLLSFLFTSGLKVNLH 720
+ L ++HL FADD ++F + + + + +L F SGL +N +
Sbjct: 571 SRYTSGYIAYHPKTSQLEISHLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTN 630
Query: 721 KSTIIGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLGGNPRRASFWEPMLDKLRRKRRT 780
K+ + + E+ +A YG+ +G LP+ YLG PL + + P+++K+ + +
Sbjct: 631 KTQLYHAGLSQSESDSMAS-YGFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNS 689
Query: 781 WNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVGEVEKILRSFLQGGKL 833
W + +S +GR+ L+ + + + F ++ F PL + ++E + FL ++
Sbjct: 690 WVVRLLSFAGRVQLLASVISGIVNFWISSFILPLGCIKKIESLCSRFLWSSRI 742
>UniRef100_O65244 F21E10.5 protein [Arabidopsis thaliana]
Length = 928
Score = 325 bits (832), Expect = 8e-87
Identities = 221/719 (30%), Positives = 352/719 (48%), Gaps = 31/719 (4%)
Query: 138 IMGGDFNAILNEGERIGIQMG--VESSFSDF---VADSNLIDLPLQSDKFTWYSSRAGGL 192
I+ GDFN IL+ E + + DF V ++ DL FTW + R L
Sbjct: 26 IIFGDFNEILDMEEHSNSRENPVTTTGMRDFQMAVNHCSITDLAYHGPLFTWSNKRENDL 85
Query: 193 WSR-IDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSL----GAINFGPKPFLFYNHW 247
++ +DR LV++ + F E G SD ++L GA+ G +PF F N
Sbjct: 86 IAKKLDRVLVNDVWLQSFPRSYSVFEAGGCSDHLRCRINLNVGAGAVVKGKRPFKFVNVI 145
Query: 248 LLDKEFEKLVSDWWCSATTVGWSA---FVLQQKLKGLK----NRIKEWQGYASSVSRVRI 300
+ F V +W + S F +KLKGLK N KE G ++
Sbjct: 146 TEMEHFIPTVESYWNETEAIFMSTSSLFRFSKKLKGLKPLLRNLGKERLGNLVKQTKEAF 205
Query: 301 NTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEYRKEEYRW-RQKARVRWLREGDR 359
T+ Q ++A+ P +N W E ++ +Q++++ WL GDR
Sbjct: 206 ETL---CQKQAMKMANPSPSSMQEENEAY---AKWDHIAVLEEKFLKQRSKLHWLDIGDR 259
Query: 360 NTSFFHSISKIRKSKKLISQLVVGGVSI-EDPNMIKDAIFSHFQSFFTSENRLRPKIRCA 418
N FH R+++ I +++ S+ IK HF+ F I
Sbjct: 260 NNKAFHRAVVAREAQNSIREIICHDGSVASQEEKIKTEAEHHFREFLQLIPNDFEGIAVE 319
Query: 419 NLS-----RLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWGIVKED 473
L R S ++K L S EI +V+ S +++PG DG+ F+K W I+ +
Sbjct: 320 ELQDLLPYRCSDSDKEMLTNHVSAEEIHKVVFSMPNDKSPGPDGYTAEFYKGAWNIIGAE 379
Query: 474 VVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVLASRLQ 533
+ + F + G L KG+N+T ++LIPK + D+RPIS YK+ISK++A+RL+
Sbjct: 380 FILAIQSFFAKGFLPKGINSTILALIPKKKEAKEMKDYRPISCCNVLYKVISKIIANRLK 439
Query: 534 VVTPLLISENQFAFAQGRQISDCILIANEVV-DFLNKKQGGGFLLKLDFAKDYDNIDWGF 592
+V P I NQ AF + R + + +L+A E+V D+ LK+D +K +D++ W F
Sbjct: 440 LVLPKFIVGNQSAFVKDRLLIENVLLATEIVKDYHKDSVSSRCALKIDISKAFDSVQWKF 499
Query: 593 LFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLSPFLFNLCV 652
L ++L+ MNF + +WI C++TA+ S+ VNG G F + LRQG LSP+LF + +
Sbjct: 500 LINVLEAMNFPPEFTHWITLCITTASFSVQVNGELAGVFSSARELRQGCSLSPYLFVISM 559
Query: 653 NGLSCLLNQLLDDSNFSGVRIGPGLALNHLQFADDTLLFCENDENQIDLLCNTLLSFLFT 712
+ LS +L++ + F + L HL FADD ++ + ID + L F
Sbjct: 560 DVLSKMLDKAVGARQFGYHPKCRAIGLTHLSFADDLMILSDGKVRSIDGIVKVLYEFAKW 619
Query: 713 SGLKVNLHKSTIIGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLGGNPRRASFWEPMLD 772
SGLK+++ KST+ V Q + + + +G+LP+ YLG PL AS P+++
Sbjct: 620 SGLKISMEKSTMYLAGVQASVYQEIVQKFSFDVGKLPVRYLGLPLVSKRLTASDCLPLIE 679
Query: 773 KLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVGEVEKILRSFLQGG 831
+LR+K W S+F+S +GRL L+ +TL S+ F MA F+ P + E++K+ +FL G
Sbjct: 680 QLRKKIEAWTSRFLSFAGRLNLISSTLWSICNFWMAAFRLPRACIREIDKLCSAFLWSG 738
>UniRef100_Q9SIS9 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1715
Score = 323 bits (829), Expect = 2e-86
Identities = 233/852 (27%), Positives = 408/852 (47%), Gaps = 46/852 (5%)
Query: 6 WNVRGLGRAEKRRAVVEALSGLKPEIVLVQESKLDSSRVKTVEAFANAVKM---DYVSVD 62
WN +GLG+ R + E +++ + E+K + + + VKM D +
Sbjct: 368 WNCQGLGQPLTVRRLEEVQRVYFLDMLFLIETKQQDNYTRDL-----GVKMGFEDMCIIS 422
Query: 63 AVGSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLFIMNVYGPNVDSERYDFF 122
G +GGL+ W +++ +VR + L + + ++ +YG + SER+ +
Sbjct: 423 PRGLSGGLVVYWKK-HLSIQVISHDVRLVDLYVE--YKNFNFYLSCIYGHPIPSERHHLW 479
Query: 123 NLLSEALSLNNCFCKIMGGDFNAILNEGERIGIQ---MGVESSFSDFVADSNLIDLPLQS 179
L + +S + +M GDFN ILN E+ G + +G +F++ + N+ DL +
Sbjct: 480 EKL-QRVSAHRSGPWMMCGDFNEILNLNEKKGGRRRSIGSLQNFTNMINCCNMKDLKSKG 538
Query: 180 DKFTWYSSRAGG-LWSRIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSL------- 231
+ ++W R + S +DR ++ D F SD P+I+ +
Sbjct: 539 NPYSWVGKRQNETIESCLDRVFINSDWQASFPAFETEFLPIAGSDHAPVIIDIAEEVCTK 598
Query: 232 -GAINFGPKPFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQG 290
G + + F F ++F V W + + +KL + + +W+
Sbjct: 599 RGQFRYDRRHFQF-------EDFVDSVQRGWNRGRSDSHGGYY--EKLHCCRQELAKWKR 649
Query: 291 YASSVSRVRINTIEADLQVTMDRLASEEPGIELRKNRLTILEGLWQEYRKEEYRWRQKAR 350
+ + +I T++ + A+E + L + + L Q YR EE W K+R
Sbjct: 650 RTKTNTAEKIETLKYRVD------AAERDHTLPHQTILRLRQDLNQAYRDEELYWHLKSR 703
Query: 351 VRWLREGDRNTSFFHSISKIRKSKKLISQLV--VGGVSIEDPNMIKDAIFSHFQSFFTSE 408
RW+ GDRNT FF++ +K+RKS+ I + G + D + K A F T++
Sbjct: 704 NRWMLLGDRNTMFFYASTKLRKSRNRIKAITDAQGIENFRDDTIGKVAENYFADLFTTTQ 763
Query: 409 NRLRPKIRCANLSRLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWG 468
+I ++++ L + ++ E+ + + + +RAPG DGF F+ W
Sbjct: 764 TSDWEEIISGIAPKVTEQMNHELLQSVTDQEVRDAVFAIGADRAPGFDGFTAAFYHHLWD 823
Query: 469 IVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVL 528
++ DV F + + +N T I LIPK P +SD+RPISL ++YK+ISK+L
Sbjct: 824 LIGNDVCLMVRHFFESDVMDNQINQTQICLIPKIIDPKHMSDYRPISLCTASYKIISKIL 883
Query: 529 ASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFLNKK---QGGGFLLKLDFAKDY 585
RL+ +IS++Q AF G+ ISD +L+A+E++ L + Q G +K D +K Y
Sbjct: 884 IKRLKQCLGDVISDSQAAFVPGQNISDNVLVAHELLHSLKSRRECQSGYVAVKTDISKAY 943
Query: 586 DNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLSP 645
D ++W FL ++ ++ F RW+ WI TCV++ + +L+NGS G +G+RQGDPLSP
Sbjct: 944 DRVEWNFLEKVMIQLGFAPRWVKWIMTCVTSVSYEVLINGSPYGKIFPSRGIRQGDPLSP 1003
Query: 646 FLFNLCVNGLSCLLNQLLDDSNFSGVRIGPG-LALNHLQFADDTLLFCENDENQIDLLCN 704
+LF C LS +L + + G++I LA++HL FADD+L FC I+ L
Sbjct: 1004 YLFLFCAEVLSNMLRKAEVNKQIHGMKITKDCLAISHLLFADDSLFFCRASNQNIEQLAL 1063
Query: 705 TLLSFLFTSGLKVNLHKSTII-GCNVDVRETQRVAGMYGWSIGQLPILYLGAPLGGNPRR 763
+ SG K+N KS+II G + QR+ + G + YLG P R+
Sbjct: 1064 IFKKYEEASGQKINYAKSSIIFGQKIPTMRRQRLHRLLGIDNVRGGGKYLGLPEQLGRRK 1123
Query: 764 ASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVGEVEKI 823
+E ++ K++ + W ++S +G+ ++KA ++P++ M F P + E+ +
Sbjct: 1124 VELFEYIVTKVKERTEGWAYNYLSPAGKEIVIKAIAMALPVYSMNCFLLPTLICNEINSL 1183
Query: 824 LRSFLQGGKLHG 835
+ +F G + G
Sbjct: 1184 ITAFWWGKENEG 1195
>UniRef100_Q9FYK1 F21J9.30 [Arabidopsis thaliana]
Length = 1270
Score = 323 bits (829), Expect = 2e-86
Identities = 237/789 (30%), Positives = 398/789 (50%), Gaps = 42/789 (5%)
Query: 60 SVDAVGSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLF------IMNVYGPN 113
+V+ VG GGL LW S +++F+ + L+D+ + F + VYG
Sbjct: 21 TVEPVGKCGGLALLWK------SSVQVDLKFVD---KNLMDAQVQFGAVNFCVSCVYGDP 71
Query: 114 VDSERYDFFNLLSE-ALSLNNCFCKIMGGDFNAILNEGERIGIQMGVE---SSFSDFVAD 169
S+R + +S + + +C M GDFN IL+ GE+ G + +F++ +
Sbjct: 72 DRSKRSQAWERISRIGVGRRDKWC--MFGDFNDILHNGEKNGGPRRSDLDCKAFNEMIKG 129
Query: 170 SNLIDLPLQSDKFTWYSSRAGGLW--SRIDRWLVSEDAINKFDGISQSVENWGLSDRRPI 227
+L+++P + FTW + R G W R+DR +++ F +Q+ ++ SD RP+
Sbjct: 130 CDLVEMPAHGNGFTW-AGRRGDHWIQCRLDRAFGNKEWFCFFPVSNQTFLDFRGSDHRPV 188
Query: 228 ILSLGAINFGPK-PFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIK 286
++ L + + F F +L ++ ++ + W S G + V +L+ + +
Sbjct: 189 LIKLMSSQDSYRGQFRFDKRFLFKEDVKEAIIRTW-SRGKHGTNISVAD-RLRACRKSLS 246
Query: 287 EWQGYASSVSRVRINTIEADLQVTMDRLASEEPGIELRKNRLTILE-GLWQEYRKEEYRW 345
W+ + S +IN +EA L+ E+ + R+++L+ L + YR+EE W
Sbjct: 247 SWKKQNNLNSLDKINQLEAALE-------KEQSLVWPIFQRVSVLKKDLAKAYREEEAYW 299
Query: 346 RQKARVRWLREGDRNTSFFHSISKIRKSKKLISQLVVGGVSIEDPNMIKDAIFS-HFQSF 404
+QK+R +WLR G+RN+ +FH+ K + +K I +L +++ K + + +F +
Sbjct: 300 KQKSRQKWLRSGNRNSKYFHAAVKQNRQRKRIEKLKDVNGNMQTSEAAKGEVAAAYFGNL 359
Query: 405 FTSENRLRPKIRCANL-SRLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFF 463
F S N + L R+S+ SL + S EI E + S APG DG + FF
Sbjct: 360 FKSSNPSGFTDWFSGLVPRVSEVMNESLVGEVSAQEIKEAVFSIKPASAPGPDGMSALFF 419
Query: 464 KQFWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKL 523
+ +W V V + F + G + N T + LIPK P+ + D RPISL YK+
Sbjct: 420 QHYWSTVGNQVTSEVKKFFADGIMPAEWNYTHLCLIPKTQHPTEMVDLRPISLCSVLYKI 479
Query: 524 ISKVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFL--NKKQGGGFL-LKLD 580
ISK++A RLQ P ++S+ Q AF R I+D IL+A+E+V L + + F+ +K D
Sbjct: 480 ISKIMAKRLQPWLPEIVSDTQSAFVSERLITDNILVAHELVHSLKVHPRISSEFMAVKSD 539
Query: 581 FAKDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQG 640
+K YD ++W +L +L + F +W+NWI CVS+ T S+L+N G +++GLRQG
Sbjct: 540 MSKAYDRVEWSYLRSLLLSLGFHLKWVNWIMVCVSSVTYSVLINDCPFGLIILQRGLRQG 599
Query: 641 DPLSPFLFNLCVNGLSCLLNQLLDDSNFSGVRIGP-GLALNHLQFADDTLLFCENDENQI 699
DPLSPFLF LC GL+ LLN+ + G++ G ++HL FADD+L C+ Q
Sbjct: 600 DPLSPFLFVLCTEGLTHLLNKAQWEGALEGIQFSENGPMVHHLLFADDSLFLCKASREQS 659
Query: 700 DLLCNTLLSFLFTSGLKVNLHKSTI-IGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLG 758
+L L + +G +NL+KS+I G VD + + G YLG P
Sbjct: 660 LVLQKILKVYGNATGQTINLNKSSITFGEKVDEQLKGTIRTCLGIFTEGGAGTYLGLPEC 719
Query: 759 GNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVG 818
+ + + D+L+ K W ++ +S G+ L+K+ ++P+F M+ FK P+
Sbjct: 720 FSGSKVDMLHYLKDRLKEKLDVWFTRCLSQGGKEVLLKSVALAMPVFAMSCFKLPITTCE 779
Query: 819 EVEKILRSF 827
+E + SF
Sbjct: 780 NLESAMASF 788
>UniRef100_Q9FXJ5 F5A9.24 [Arabidopsis thaliana]
Length = 1254
Score = 323 bits (829), Expect = 2e-86
Identities = 237/789 (30%), Positives = 398/789 (50%), Gaps = 42/789 (5%)
Query: 60 SVDAVGSAGGLISLWNPASYAMESAVKNVRFIGLLLRALVDSSLLF------IMNVYGPN 113
+V+ VG GGL LW S +++F+ + L+D+ + F + VYG
Sbjct: 24 TVEPVGKCGGLALLWK------SSVQVDLKFVD---KNLMDAQVQFGAVNFCVSCVYGDP 74
Query: 114 VDSERYDFFNLLSE-ALSLNNCFCKIMGGDFNAILNEGERIGIQMGVE---SSFSDFVAD 169
S+R + +S + + +C M GDFN IL+ GE+ G + +F++ +
Sbjct: 75 DRSKRSQAWERISRIGVGRRDKWC--MFGDFNDILHNGEKNGGPRRSDLDCKAFNEMIKG 132
Query: 170 SNLIDLPLQSDKFTWYSSRAGGLW--SRIDRWLVSEDAINKFDGISQSVENWGLSDRRPI 227
+L+++P + FTW + R G W R+DR +++ F +Q+ ++ SD RP+
Sbjct: 133 CDLVEMPAHGNGFTW-AGRRGDHWIQCRLDRAFGNKEWFCFFPVSNQTFLDFRGSDHRPV 191
Query: 228 ILSLGAINFGPK-PFLFYNHWLLDKEFEKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIK 286
++ L + + F F +L ++ ++ + W S G + V +L+ + +
Sbjct: 192 LIKLMSSQDSYRGQFRFDKRFLFKEDVKEAIIRTW-SRGKHGTNISVAD-RLRACRKSLS 249
Query: 287 EWQGYASSVSRVRINTIEADLQVTMDRLASEEPGIELRKNRLTILE-GLWQEYRKEEYRW 345
W+ + S +IN +EA L+ E+ + R+++L+ L + YR+EE W
Sbjct: 250 SWKKQNNLNSLDKINQLEAALE-------KEQSLVWPIFQRVSVLKKDLAKAYREEEAYW 302
Query: 346 RQKARVRWLREGDRNTSFFHSISKIRKSKKLISQLVVGGVSIEDPNMIKDAIFS-HFQSF 404
+QK+R +WLR G+RN+ +FH+ K + +K I +L +++ K + + +F +
Sbjct: 303 KQKSRQKWLRSGNRNSKYFHAAVKQNRQRKRIEKLKDVNGNMQTSEAAKGEVAAAYFGNL 362
Query: 405 FTSENRLRPKIRCANL-SRLSQANKMSLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFF 463
F S N + L R+S+ SL + S EI E + S APG DG + FF
Sbjct: 363 FKSSNPSGFTDWFSGLVPRVSEVMNESLVGEVSAQEIKEAVFSIKPASAPGPDGMSALFF 422
Query: 464 KQFWGIVKEDVVKFFEDFHSTGKLVKGLNATFISLIPKHNCPSTVSDFRPISLIGSAYKL 523
+ +W V V + F + G + N T + LIPK P+ + D RPISL YK+
Sbjct: 423 QHYWSTVGNQVTSEVKKFFADGIMPAEWNYTHLCLIPKTQHPTEMVDLRPISLCSVLYKI 482
Query: 524 ISKVLASRLQVVTPLLISENQFAFAQGRQISDCILIANEVVDFL--NKKQGGGFL-LKLD 580
ISK++A RLQ P ++S+ Q AF R I+D IL+A+E+V L + + F+ +K D
Sbjct: 483 ISKIMAKRLQPWLPEIVSDTQSAFVSERLITDNILVAHELVHSLKVHPRISSEFMAVKSD 542
Query: 581 FAKDYDNIDWGFLFDILKEMNFGDRWINWIRTCVSTATLSILVNGSSTGFFEMEKGLRQG 640
+K YD ++W +L +L + F +W+NWI CVS+ T S+L+N G +++GLRQG
Sbjct: 543 MSKAYDRVEWSYLRSLLLSLGFHLKWVNWIMVCVSSVTYSVLINDCPFGLIILQRGLRQG 602
Query: 641 DPLSPFLFNLCVNGLSCLLNQLLDDSNFSGVRIGP-GLALNHLQFADDTLLFCENDENQI 699
DPLSPFLF LC GL+ LLN+ + G++ G ++HL FADD+L C+ Q
Sbjct: 603 DPLSPFLFVLCTEGLTHLLNKAQWEGALEGIQFSENGPMVHHLLFADDSLFLCKASREQS 662
Query: 700 DLLCNTLLSFLFTSGLKVNLHKSTI-IGCNVDVRETQRVAGMYGWSIGQLPILYLGAPLG 758
+L L + +G +NL+KS+I G VD + + G YLG P
Sbjct: 663 LVLQKILKVYGNATGQTINLNKSSITFGEKVDEQLKGTIRTCLGIFTEGGAGTYLGLPEC 722
Query: 759 GNPRRASFWEPMLDKLRRKRRTWNSKFISLSGRLTLMKATLCSVPLFLMAIFKAPLKVVG 818
+ + + D+L+ K W ++ +S G+ L+K+ ++P+F M+ FK P+
Sbjct: 723 FSGSKVDMLHYLKDRLKEKLDVWFTRCLSQGGKEVLLKSVALAMPVFAMSCFKLPITTCE 782
Query: 819 EVEKILRSF 827
+E + SF
Sbjct: 783 NLESAMASF 791
>UniRef100_O23757 Reverse transcriptase [Beta vulgaris subsp. vulgaris]
Length = 507
Score = 323 bits (828), Expect = 2e-86
Identities = 182/515 (35%), Positives = 282/515 (54%), Gaps = 19/515 (3%)
Query: 136 CKIMGGDFNAILNEGERIGIQMGVESS--FSDFVADSNLIDLPLQSDKFTWYSSRAGGLW 193
C IMG D+N L ER + + + F +F+ + LI++ FTW+ ++
Sbjct: 6 CLIMG-DYNETLIPAERGSHLISSQGARDFQNFINNMGLIEISPIKGFFTWFRGQSK--- 61
Query: 194 SRIDRWLVSEDAINKFDGISQSVENWGLSDRRPIILSLGAINFGPKPFLFYNHWLLDKEF 253
S++DR V + + F + ++ SD P++ N+GPKPF F N WL +
Sbjct: 62 SKLDRLFVQPEWLTSFPSLKLNILKRSPSDHCPLLAFSSHQNWGPKPFRFLNCWLTHPQC 121
Query: 254 EKLVSDWWCSATTVGWSAFVLQQKLKGLKNRIKEWQGYASSVSRVRINTIEADLQVTMDR 313
K++S+ W + KL+ LK +K+W + I E + +D
Sbjct: 122 MKIISEIWTKNHNSS-----VPDKLRTLKEALKQWNQKEFGIIDDNIKHCEEKIHF-LDN 175
Query: 314 LASEEPGIELRKNRLTILE----GLWQEYRKEEYRWRQKARVRWLREGDRNTSFFHSISK 369
+A+ L N L + LW ++ E W +R RW++EGD NT +FH+++
Sbjct: 176 IANSRL---LCTNELQERDDAQLNLWTWLKRSESYWALNSRARWIKEGDSNTRYFHTLAT 232
Query: 370 IRKSKKLISQLVVGGVSIEDPNMIKDAIFSHFQSFFTSENRLRPKIRCANLSRLSQANKM 429
IR+ K + + G +I P IK+ ++FQ F E RP N ++LS++
Sbjct: 233 IRRHKNHLQYIKTEGKNISKPGEIKEEAVAYFQHIFAEEFSTRPVFNGLNFNKLSESECS 292
Query: 430 SLEKQFSEGEIWEVLKSCDGNRAPGTDGFNLNFFKQFWGIVKEDVVKFFEDFHSTGKLVK 489
SL F+ EI E + SCD +APG DGFN F K W ++K+D+ K EDF ++G L
Sbjct: 293 SLIAPFTHEEIDEAVDSCDAQKAPGPDGFNFKFIKSSWEVIKKDIYKIVEDFWASGSLPH 352
Query: 490 GLNATFISLIPKHNCPSTVSDFRPISLIGSAYKLISKVLASRLQVVTPLLISENQFAFAQ 549
G N+ FI+LIPK P ++FRPIS++G YK+I+K+LA RLQ V LI Q +F +
Sbjct: 353 GCNSAFIALIPKTQHPRGFNEFRPISMVGCLYKIIAKILARRLQRVMDHLIGPYQSSFIK 412
Query: 550 GRQISDCILIANEVVDFLNKKQGGGFLLKLDFAKDYDNIDWGFLFDILKEMNFGDRWINW 609
GRQI D +LIA+E++D + + +LKLDF K +D+I W +L +L +MNF W +W
Sbjct: 413 GRQILDGVLIASELIDSCRRMKNEAVVLKLDFHKAFDSISWNYLEWVLCQMNFPLVWRSW 472
Query: 610 IRTCVSTATLSILVNGSSTGFFEMEKGLRQGDPLS 644
IR+CV +A+ ++L+NGS + F++++GLRQGDPLS
Sbjct: 473 IRSCVMSASAAVLINGSPSSVFKLQRGLRQGDPLS 507
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.334 0.147 0.477
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,219,791,623
Number of Sequences: 2790947
Number of extensions: 91741261
Number of successful extensions: 312169
Number of sequences better than 10.0: 1179
Number of HSP's better than 10.0 without gapping: 718
Number of HSP's successfully gapped in prelim test: 461
Number of HSP's that attempted gapping in prelim test: 309149
Number of HSP's gapped (non-prelim): 1737
length of query: 1389
length of database: 848,049,833
effective HSP length: 140
effective length of query: 1249
effective length of database: 457,317,253
effective search space: 571189248997
effective search space used: 571189248997
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.6 bits)
S2: 81 (35.8 bits)
Lotus: description of TM0110a.13