
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0299.5
(547 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef100_O82276 Putative non-LTR retroelement reverse transcrip... 162 2e-38
UniRef100_Q9FG26 Non-LTR retroelement reverse transcriptase-like... 146 2e-33
UniRef100_O80790 Putative non-LTR retroelement reverse transcrip... 130 1e-28
UniRef100_O22148 Putative non-LTR retroelement reverse transcrip... 127 1e-27
UniRef100_Q9SHY0 F1E22.12 [Arabidopsis thaliana] 125 4e-27
UniRef100_Q9SZD8 Hypothetical protein F19B15.120 [Arabidopsis th... 112 2e-23
UniRef100_Q7XJS2 Putative non-LTR retroelement reverse transcrip... 110 1e-22
UniRef100_Q6L3U7 Putative RNase H domain containing protein [Sol... 108 5e-22
UniRef100_Q9SIS9 Putative non-LTR retroelement reverse transcrip... 107 7e-22
UniRef100_Q9SKJ4 Putative non-LTR retroelement reverse transcrip... 104 8e-21
UniRef100_Q9SK98 Very similar to retrotransposon reverse transcr... 103 2e-20
UniRef100_Q9SZ87 RNA-directed DNA polymerase-like protein [Arabi... 101 5e-20
UniRef100_Q9SIQ5 Putative non-LTR retroelement reverse transcrip... 101 5e-20
UniRef100_Q9SHP2 Putative non-LTR retroelement reverse transcrip... 99 3e-19
UniRef100_Q9SF52 Putative non-LTR reverse transcriptase [Arabido... 98 5e-19
UniRef100_Q75M12 Hypothetical protein P0676G05.2 [Oryza sativa] 92 4e-17
UniRef100_Q9ZUY7 Putative reverse transcriptase [Arabidopsis tha... 88 7e-16
UniRef100_Q8H0A5 Putative reverse transcriptase [Oryza sativa] 87 1e-15
UniRef100_Q7XS38 OSJNBa0081G05.2 protein [Oryza sativa] 83 2e-14
UniRef100_Q9SJ38 Putative non-LTR retroelement reverse transcrip... 82 5e-14
>UniRef100_O82276 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1231
Score = 162 bits (410), Expect = 2e-38
Identities = 128/473 (27%), Positives = 211/473 (44%), Gaps = 35/473 (7%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKY----LRDTSIHKVQAHQHDSPIWKGI-LWARDMID 119
+V K ++++ + LW RV+ KY ++DTS K Q S W+ + + R+++
Sbjct: 728 LVAKVGWRLLQDKESLWARVVRKKYKVGGVQDTSWLKPQPRW--SSTWRSVAVGLREVVV 785
Query: 120 QRFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVHISD--------VNLTLCDLIQDNKWN 170
+ + G G T W W Q P V + + + + + WN
Sbjct: 786 KGVGWVPGDGCTIRFWLDRW------LLQEPLVELGTDMIPEGERIKVAADYWLPGSGWN 839
Query: 171 LQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWIN-HLAHNPIE 229
L+ L LP +++++ L+V Q+ + D WK G ++VR AY + + P
Sbjct: 840 LEILGLYLPETVKRRLLSVVVQVFLGNGDEISWKGTQDGAFTVRSAYSLLQGDVGDRPNM 899
Query: 230 DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLR 289
N +WKL PE++R+F W V N I N VR HL+ +A C+ C E LH LR
Sbjct: 900 GSFFNRIWKLITPERVRVFIWLVSQNVIMTNVERVRRHLSENAICSVCNGAEETILHVLR 959
Query: 290 DCSFS*DLWRRMGAI-NWRNFRYNNIISW-FSSM--ARGVHGIQFLAGVWGAWKWRCNWL 345
DC +WRR+ + F +++ W F++M +G+ F G+W AWKWRC +
Sbjct: 960 DCPAMEPIWRRLLPLRRHHEFFSQSLLEWLFTNMDPVKGIWPTLFGMGIWWAWKWRCCDV 1019
Query: 346 LDSQRWP------IEVVWRRIAHDHDDWAWCAPSN-DLLLCHPWSPPPPDTVKCNSDGSF 398
++ I+ + + H P+ + W P VK +DG+
Sbjct: 1020 FGERKICRDRLKFIKDMAEEVRRVHVGAVGNRPNGVRVERMIRWQVPSDGWVKITTDGAS 1079
Query: 399 REDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDV 458
R + G IR+ QG W+ G L + A AE L +AW++G+ R+ D+
Sbjct: 1080 RGNHGLAAAGGAIRNGQGEWLGGFALNIGSCAAPLAELWGAYYGLLIAWDKGFRRVELDL 1139
Query: 459 NCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHL 511
+C LV + H ++ RDW VR++ V+R++N +AD L
Sbjct: 1140 DC-KLVVGFLSTGVSNAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGL 1191
>UniRef100_Q9FG26 Non-LTR retroelement reverse transcriptase-like [Arabidopsis
thaliana]
Length = 676
Score = 146 bits (368), Expect = 2e-33
Identities = 127/488 (26%), Positives = 219/488 (44%), Gaps = 34/488 (6%)
Query: 80 LWVRVLEHKYLRDTSIHK---VQAHQHDSPIWKGI-LWARDMIDQRFEFRIGKGDTSVWY 135
LW RVL KY + IH + S +W+ + + R+++++ + +G G ++
Sbjct: 184 LWARVLRSKY-KIGDIHDSAWMTPKGTWSALWRSVNVGLREVVNRGIGWVLGDGKIIRFW 242
Query: 136 QD-W----SGIGIIANQIPFVHISDVNLTLCDL-IQDNKWNLQRLYTNLPHSLQQQFLAV 189
QD W + +++Q+P + + + D I+ W+++R+ LP ++Q+ LAV
Sbjct: 243 QDRWLLSTPLLEWVSDQLP---VEERGQRVADYWIEGVGWDMERIAVFLPEFMRQRLLAV 299
Query: 190 QPQICMNREDAWIWKDGSSGRYSVRDAY--EWINHLAHNPIEDRKLNWVWKLRVPEKIRM 247
C ED W +GR++V AY + ++ ++ + R + VW++ VPE+ R+
Sbjct: 300 VIGGCYGVEDKMSWVGTENGRFTVSSAYLIQSVDEISKQCMS-RFFDRVWRVMVPERARI 358
Query: 248 FTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRR-MGAINW 306
F W V + + N VR H+A C C E +H LRDC +W R + +
Sbjct: 359 FLWLVGNQVVLTNAERVRRHMADSDVCPLCKGASESLIHVLRDCPAMMGIWMRVVPVMEQ 418
Query: 307 RNFRYNNIISWF-------SSMARGVHGIQFLAGVWGAWKWRCNWLL-DSQRWPIEVVWR 358
R F +++ W S R F VW WKWRC ++ + R V +
Sbjct: 419 RRFFETSLLEWMYGNLKERSDSERRSWPTLFALTVWWGWKWRCGYVFGEDSRCRDRVKFL 478
Query: 359 RIAHDHDDWAWCAPSND------LLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIR 412
+ A + A A + D + W P V N+DG+ + + GVIR
Sbjct: 479 KSAVAEVEAAHLAANGDAREDVLVERMIAWRKPAEGWVTMNTDGASHGNPGQATAGGVIR 538
Query: 413 DHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEA 472
D G W+ G L +A AE + L +AW RG+ R+ +V+ LV +++
Sbjct: 539 DEHGSWLVGFALNIGVCSAPLAELWGVYYGLVVAWERGWRRVRLEVD-SALVVGFLQSGI 597
Query: 473 VQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHLVRRGAAAMSSES*IIQSQDHD 532
H ++ +++DW VRI V+R++N +AD L A + ++ S
Sbjct: 598 GDSHPLAFLVRLCHGFISKDWIVRITHVYREANRLADGLANY-AFTLPFGFLLLDSCPEH 656
Query: 533 VEYLLLKD 540
V +LL+D
Sbjct: 657 VSSILLED 664
>UniRef100_O80790 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 970
Score = 130 bits (326), Expect = 1e-28
Identities = 95/327 (29%), Positives = 151/327 (46%), Gaps = 19/327 (5%)
Query: 199 DAWIWKDGSSGRYSVRDAYEWINHLAHN-PIEDRKLNWVWKLRVPEKIRMFTWQVLHNAI 257
D WK +G ++VR AYE + A P+ L +WKL PE++R+F W V H I
Sbjct: 609 DELSWKGTQNGDFTVRSAYELLKPEAEERPLIGSFLKQIWKLVAPERVRVFIWLVSHMVI 668
Query: 258 PVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRRMGAINWRNFRYNNIISW 317
N VR HL+ ATC+ C E LH LRDC +W+R+ +N ++
Sbjct: 669 MTNVERVRRHLSDIATCSVCNGADESILHVLRDCPAMTPIWQRLLPQRRQNEFFSQFEWL 728
Query: 318 FSSM--ARGVHGIQFLAGVWGAWKWRCNWLLDSQRWP------IEVVWRRIAHDH----D 365
F+++ A+G F G+W AWKWRC + ++ I+ + + H +
Sbjct: 729 FTNLDPAKGDWPTLFSMGIWWAWKWRCGDVFGERKLCRDRLKFIKDIAEEVRKAHVGTLN 788
Query: 366 DWAWCAPSNDLLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLG 425
+ A ++ W P VK +DG+ R G I + QG W+ G L
Sbjct: 789 NHVKRARVERMI---RWKAPSDRWVKLTTDGASRGHQGLAAASGAILNLQGEWLGGFALN 845
Query: 426 EAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDN-LVTILVEAEAVQMHSEFHVLHS 484
+ +A AE L +AW++G+ R+ ++N D+ LV + + H ++
Sbjct: 846 IGSCDAPLAELWGAYYGLLIAWDKGFRRV--ELNLDSELVVGFLSTGISKAHPLSFLVRL 903
Query: 485 ITQLLARDWHVRINSVHRDSNAVADHL 511
RDW VR++ V+R++N +AD L
Sbjct: 904 CQGFFTRDWLVRVSHVYREANRLADGL 930
>UniRef100_O22148 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1374
Score = 127 bits (318), Expect = 1e-27
Identities = 128/499 (25%), Positives = 202/499 (39%), Gaps = 42/499 (8%)
Query: 59 E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMI 118
E F ++GK + +MI + D L +V + +Y + S WK I A+ +I
Sbjct: 859 EAFNIALLGKQLWRMITEKDSLMAKVFKSRYFSKSDPLNAPLGSRPSFAWKSIYEAQVLI 918
Query: 119 DQRFEFRIGKGDT-SVWYQDWSGI--GIIANQIPFVHI------SDVNLTLCDLIQDNK- 168
Q IG G+T +VW W G A + H+ + +++ L+ D +
Sbjct: 919 KQGIRAVIGNGETINVWTDPWIGAKPAKAAQAVKRSHLVSQYAANSIHVVKDLLLPDGRD 978
Query: 169 WNLQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAY----EWINHLA 224
WN + P + Q+ LA++P R D + W+ SG YSV+ Y E IN
Sbjct: 979 WNWNLVSLLFPDNTQENILALRPGGKETR-DRFTWEYSRSGHYSVKSGYWVMTEIINQ-R 1036
Query: 225 HNPIE------DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCG 278
+NP E D +WKL VP KI F W+ ++N + V HLA + +C RC
Sbjct: 1037 NNPQEVLQPSLDPIFQQIWKLDVPPKIHHFLWRCVNNCLSVASNLAYRHLAREKSCVRCP 1096
Query: 279 NVVEDGLHCLRDCSFS*DLWR-----RMGAINWRNFRYNN---IISWFSSM-ARGVHGIQ 329
+ E H L C F+ W W + N ++S S H
Sbjct: 1097 SHGETVNHLLFKCPFARLTWAISPLPAPPGGEWAESLFRNMHHVLSVHKSQPEESDHHAL 1156
Query: 330 FLAGVWGAWKWRCNWLLDSQRWPIEVVWRRIAHDHDDW------AWCAPSNDLLLCHPWS 383
+W WK R + + + + V + D D W S+ C W
Sbjct: 1157 IPWILWRLWKNRNDLVFKGREFTAPQVILKATEDMDAWNNRKEPQPQVTSSTRDRCVKWQ 1216
Query: 384 PPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGR--WVAGCYLGEAAGNAFRAEAKALLD 441
PP VKCN+DG++ +D+ G V+R+H GR W+ G + + E +AL
Sbjct: 1217 PPSHGWVKCNTDGAWSKDLGNCGVGWVLRNHTGRLLWL-GLRALPSQQSVLETEVEALRW 1275
Query: 442 VLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVH 501
+ Y R+I + + LV+++ + + S + I LL V+
Sbjct: 1276 AVLSLSRFNYRRVIFESDSQYLVSLI--QNEMDIPSLAPRIQDIRNLLRHFEEVKFQFTR 1333
Query: 502 RDSNAVADHLVRRGAAAMS 520
R+ N VAD R + M+
Sbjct: 1334 REGNNVADRTARESLSLMN 1352
>UniRef100_Q9SHY0 F1E22.12 [Arabidopsis thaliana]
Length = 1055
Score = 125 bits (313), Expect = 4e-27
Identities = 122/486 (25%), Positives = 198/486 (40%), Gaps = 57/486 (11%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKY----LRDTSIHKVQAHQHDSPIWKGI-LWARDMID 119
++ K +++++ + LW VL+ KY +RD+ + S W+ I + RD++
Sbjct: 222 LISKVGWRLLQEKNSLWTLVLQKKYHVGEIRDSRWLIPKGSW--SSTWRSIAIGLRDVVS 279
Query: 120 QRFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVHISDVNL-TLCDL-------IQDNKWN 170
+ G G W W + P + + + T CD I W+
Sbjct: 280 HGVGWIPGDGQQIRFWTDRW------VSGKPLLELDNGERPTDCDTVVAKDLWIPGRGWD 333
Query: 171 LQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWIN-HLAHNPIE 229
++ ++ + + AV + D WK G++SVR AYE + P
Sbjct: 334 FAKIDPYTTNNTRLELRAVVLDLVTGARDRLSWKFSQDGQFSVRSAYEMLTVDEVPRPNM 393
Query: 230 DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLR 289
N +WK+RVPE+++ F W V + A+ E R HL++ C C VE LH LR
Sbjct: 394 ASFFNCLWKVRVPERVKTFLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLR 453
Query: 290 DCSFS*DLW-RRMGAINWRNFRYNNIISWFSSMARGVHGIQ-------FLAGVWGAWKWR 341
DC +W R + + F ++ W G + F +W WKWR
Sbjct: 454 DCPAQLGIWVRVVPQRRQQGFFSKSLFEWLYDNLGDRSGCEDIPWSTIFAVIIWWGWKWR 513
Query: 342 CNWLLDS-----------QRWPIEVVWRRIAHDHDDWAWCA-PSNDLLLCHPWSPPPPDT 389
C + + W +EV AH + P + ++ W P
Sbjct: 514 CGNIFGENTKCRDRVKFVKEWAVEVY---RAHSGNVLVGITQPRVERMI--GWVSPCVGW 568
Query: 390 VKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNR 449
VK N+DG+ R + GV+RD G W G L +A +AE + L AW +
Sbjct: 569 VKVNTDGASRGNPGLASAGGVLRDCTGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEK 628
Query: 450 GYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQL----LARDWHVRINSVHRDSN 505
R+ +V+ + +V L S+ H L + +L L +DW VRI V+R++N
Sbjct: 629 KVPRVELEVDSEVIVGFLKTG-----ISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREAN 683
Query: 506 AVADHL 511
+AD L
Sbjct: 684 RLADGL 689
>UniRef100_Q9SZD8 Hypothetical protein F19B15.120 [Arabidopsis thaliana]
Length = 575
Score = 112 bits (281), Expect = 2e-23
Identities = 110/505 (21%), Positives = 214/505 (41%), Gaps = 52/505 (10%)
Query: 59 E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMI 118
E F ++GK + +M+ + + L +V + +Y + S +WK I +++++
Sbjct: 62 EAFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEIL 121
Query: 119 DQRFEFRIGKG-DTSVWYQDW-----SGIGIIANQIPFVHISDVN--LTLCDLIQDN--K 168
Q +G G D +W W + + ++P + V+ L + DLI ++ +
Sbjct: 122 RQGARAVVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILKVSDLIDESGRE 181
Query: 169 WN---LQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAH 225
W ++ L+ + L + +I D++ W SSG Y+V+ Y + + +
Sbjct: 182 WRKDVIEMLFPEVERKLIGELRPGGRRIL----DSYTWDYTSSGDYTVKSGYWVLTQIIN 237
Query: 226 N-----PIEDRKLN----WVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCAR 276
+ + LN +WK + KI+ F W+ L N++PV HL+ ++ C R
Sbjct: 238 KRSSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYRHLSKESACIR 297
Query: 277 CGNVVEDGLHCLRDCSFS*DLWR------RMGAINWRNFRYNNIISWFSSMARGVHGIQF 330
C + E H L C+F+ W +G W + Y N+ W ++ G +
Sbjct: 298 CPSCKETVNHLLFKCTFARLTWAISSIPIPLGG-EWADSIYVNLY-WVFNLGNGNPQWEK 355
Query: 331 LAG-----VWGAWKWRCNWLLDSQRWPIEVVWRRIAHDHDDW--------AWCAPSNDLL 377
+ +W WK R + + + + V RR D ++W P +
Sbjct: 356 ASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVNRS 415
Query: 378 LCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQG--RWVAGCYLGEAAGNAFRAE 435
C W PPP VKCN+D ++ D +R G V+R+ +G +W+ L + + AE
Sbjct: 416 SCGRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLK-SVLEAE 474
Query: 436 AKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHV 495
+A+ + Y+ +I + + L+ IL E S + + +LL++ V
Sbjct: 475 LEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNNDEI--WPSLKPTIQDLQRLLSQFTEV 532
Query: 496 RINSVHRDSNAVADHLVRRGAAAMS 520
+ + R+ N +A+ + R + ++
Sbjct: 533 KFVFIPREGNTLAERVARESLSFLN 557
>UniRef100_Q7XJS2 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1344
Score = 110 bits (275), Expect = 1e-22
Identities = 93/384 (24%), Positives = 162/384 (41%), Gaps = 32/384 (8%)
Query: 61 F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
F ++ K +++++ L+ RV + +Y ++ S W+ IL+ R+++ Q
Sbjct: 839 FNQALLAKQAWRVLQEKGSLFSRVFQSRYFSNSDFLSATRGSRPSYAWRSILFGRELLMQ 898
Query: 121 RFEFRIGKGD-TSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQ--DNKWNLQRLYTN 177
IG G T VW W G + +V+L + LI WNL L
Sbjct: 899 GLRTVIGNGQKTFVWTDKWLHDGSNRRPLNRRRFINVDLKVSQLIDPTSRNWNLNMLRDL 958
Query: 178 LPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHN---------PI 228
P + L +P +ED++ W +G YSV+ YE+++ H+ P
Sbjct: 959 FPWKDVEIILKQRPLFF--KEDSFCWLHSHNGLYSVKTGYEFLSKQVHHRLYQEAKVKPS 1016
Query: 229 EDRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCL 288
+ + +W L KIR+F W+ LH AIPV + + SD C C E H L
Sbjct: 1017 VNSLFDKIWNLHTAPKIRIFLWKALHGAIPVEDRLRTRGIRSDDGCLMCDTENETINHIL 1076
Query: 289 RDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGV---------HGIQFLAG--VWGA 337
+C + +W + ++ ++N S +++M+R + H ++F++ +W
Sbjct: 1077 FECPLARQVW-AITHLSSAGSEFSN--SVYTNMSRLIDLTQQNDLPHHLRFVSPWILWFL 1133
Query: 338 WKWRCNWLLDSQRWPIEVVWRRIAHDHDDW--AWCAPSND--LLLCHPWSPPPPDTVKCN 393
WK R L + + + + + +W A ND L W PP P +KCN
Sbjct: 1134 WKNRNALLFEGKGSITTTLVDKAYEAYHEWFSAQTHMQNDEKHLKITKWCPPLPGELKCN 1193
Query: 394 SDGSFREDVQRMGGVGVIRDHQGR 417
++ + G V+RD QG+
Sbjct: 1194 IGFAWSKQHHFSGASWVVRDSQGK 1217
>UniRef100_Q6L3U7 Putative RNase H domain containing protein [Solanum demissum]
Length = 722
Score = 108 bits (269), Expect = 5e-22
Identities = 108/494 (21%), Positives = 207/494 (41%), Gaps = 57/494 (11%)
Query: 69 AVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHD--SPIWKGILWARDMIDQRFEFRI 126
A C +++ D LW + K + + +H V + S W +L R ++ + I
Sbjct: 26 AKCTDLERKD-LWASLEATKRIYCSRVHPVAKAKSSKQSHTWSKMLKIRHSVENNILWII 84
Query: 127 GKGDTSVWYQDWSGIGIIANQI-PFVHISDVNLTLCDLIQDNKWNLQRLYTNLPHSLQQQ 185
G+ S+W+ +W G G ++N + P H + N+ D I +W+ +L LP + Q
Sbjct: 85 YAGNVSMWWDNWMGNGALSNILPPPSHYNKDNVK--DFIHKREWDFDKLSDILPPQVVNQ 142
Query: 186 FLAVQPQICMNREDAWIWKDGSSGRYSVRDAY-EWINHLAHNPIEDRKLNWVWKLRVPEK 244
+++ P N+ D IW +G ++ + AY + N N + N +W + P K
Sbjct: 143 IVSI-PIGDPNQSDYAIWIPSENGHFTTKSAYVDCSNTREKNDMR----NKIWHGKFPFK 197
Query: 245 IRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGL-HCLRDCSFS*DLWRRMGA 303
+ TW+++ N +P + + D+ C C N+ + + H + + LW++ G
Sbjct: 198 MSFLTWRLVQNKLPFYDTVGKFVDNIDSNCVCCKNMKTETINHVFLNSDVASYLWKKFGG 257
Query: 304 INWRNFRYNNIIS-----WFSSMARGVHGIQF----LAGVWGAWKWRCNWLLDSQRWPIE 354
+ R ++ I+ W +H + + W WK RC Q+
Sbjct: 258 TLGIDTRASSTINLLKTWWNVQTHNSIHNVIIHTLPILIFWEIWKRRCACKYGDQK---- 313
Query: 355 VVWRRIAHDHDDW-----------------AWCAPSNDLLLCHP--------WSPPPPDT 389
+W R +H W +W N + P W+ P +
Sbjct: 314 KMWYRTMENHVWWNLKMSLRMTFPSFEIGNSWRDLLNKVESLRPYPKWKIVHWNTPNINC 373
Query: 390 VKCNSDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNR 449
VK N+DGSF +G ++RDH R + + + + AEA A + +
Sbjct: 374 VKINTDGSFSSGNAGLG--WIVRDHTRRMIMAFSIPSSCSSNNLAEALAARFGILWCLQQ 431
Query: 450 GYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVAD 509
G+ +++ +V ++ +A + + V+ I Q++A+ + +N +R++N VAD
Sbjct: 432 GFHNCYLELDSKLVVDMVRNGQATNLKIK-GVVEDIIQVVAK-MNCEVNHCYREANQVAD 489
Query: 510 HLVRRGAAAMSSES 523
L + A +S+E+
Sbjct: 490 ALAKH--AVISNEA 501
>UniRef100_Q9SIS9 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1715
Score = 107 bits (268), Expect = 7e-22
Identities = 120/523 (22%), Positives = 217/523 (40%), Gaps = 70/523 (13%)
Query: 51 WRLRHQGD------E*F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHD 104
W ++GD F ++ K +++ L R+ + Y +T+ + H
Sbjct: 1189 WGKENEGDLGFKDLHQFNRALLAKQAWRILTNPQSLLARLYKGLYYPNTTYLRANKGGHA 1248
Query: 105 SPIWKGILWARDMIDQRFEFRIGKGDTS-VWYQDWSGIGIIANQIPFVHISDVNLTLCDL 163
S W I + ++ Q R+G G T+ +W W + + + I D ++ + DL
Sbjct: 1249 SYGWNSIQEGKLLLQQGLRVRLGDGQTTKIWEDPW--LPTLPPRPARGPILDEDMKVADL 1306
Query: 164 IQDNK--WNLQRLYTNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWIN 221
++NK W+ ++ + + QQ D++ W + +Y+VR Y
Sbjct: 1307 WRENKREWD-PVIFEGVLNPEDQQLAKSLYLSNYAARDSYKWAYTRNTQYTVRSGYWVAT 1365
Query: 222 HL------AHNPIE-DRKLNW-VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDAT 273
H+ NP+E D L +W+L++ KI+ F W+ L A+ ++ +D T
Sbjct: 1366 HVNLTEEEIINPLEGDVPLKQEIWRLKITPKIKHFIWRCLSGALSTTTQLRNRNIPADPT 1425
Query: 274 CARCGNVVEDGLHCLRDCSFS*DLWRRMGAINWRNFRYNNIISWFSSMARGV-------- 325
C RC N E H + CS++ +WR NF +N + + ++ +
Sbjct: 1426 CQRCCNADETINHIIFTCSYAQVVWRS------ANFSGSNRLCFTDNLEENIRLILQGKK 1479
Query: 326 -------HGIQFLAGVWGAWKWRCNWLLDS-QRWPIEVVWRRIAHDHDDWAW-------- 369
+G+ +W WK R +L R+P +V ++ + +W
Sbjct: 1480 NQNLPILNGLMPFWIMWRLWKSRNEYLFQQLDRFPWKVA-QKAEQEATEWVETMVNDTAI 1538
Query: 370 ---CAPSND--LLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGRWV-AGCY 423
A SND L WS PP +KCN D + + ++RD GR + +GC
Sbjct: 1539 SHNTAQSNDRPLSRSKQWSSPPEGFLKCNFDSGYVQGRDYTSTGWILRDCNGRVLHSGCA 1598
Query: 424 LGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFHVLH 483
+ + +A +AEA L L++ W RGY + + + L ++ + E + H+L
Sbjct: 1599 KLQQSYSALQAEALGFLHALQMVWIRGYCYVWFEGDNLELTNLINKTE------DHHLLE 1652
Query: 484 SITQLLARDWHVR-----INSVHRDSNAVADHLVRRGAAAMSS 521
++ + R W + I V+R+ N AD L + A +MSS
Sbjct: 1653 TLLYDI-RFWMTKLPFSSIGYVNRERNLAADKLTKY-ANSMSS 1693
>UniRef100_Q9SKJ4 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1750
Score = 104 bits (259), Expect = 8e-21
Identities = 119/493 (24%), Positives = 197/493 (39%), Gaps = 57/493 (11%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQRFEF 124
++ K ++I+ + L+ RV++ +Y +D SI + + S W +L ++ +
Sbjct: 1247 LLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASLLDGIALLKKGTRH 1306
Query: 125 RIGKGDTSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDNKWNLQRLYTNLPHSLQQ 184
IG G G+ I + P ++ T ++ +N + + Y S
Sbjct: 1307 LIGDGQNIR-----IGLDNIVDSHPPRPLNTEE-TYKEMTINNLFERKGSYYFWDDSKIS 1360
Query: 185 QFLAVQPQICMNR--------EDAWIWKDGSSGRYSVRDAYEWINH------LAHNPIE- 229
QF+ ++R D IW ++G Y+VR Y + H A NP
Sbjct: 1361 QFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHG 1420
Query: 230 --DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHC 287
D K +W L + K++ F W+ L A+ E + D +C RC E H
Sbjct: 1421 SIDLKTR-IWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHA 1479
Query: 288 LRDCSFS*DLWRRMGAINWRN------FRYN--NIISWFSSMARG-VHGIQFLAGVWGAW 338
L C F+ WR + RN F N NI+++ H + + +W W
Sbjct: 1480 LFTCPFATMAWRLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIW 1539
Query: 339 KWRCNWLLDSQR-WPIEVVWRRIAHDHDDWAWC--------APSNDLLLCH---PWSPPP 386
K R N + + R P + V A HD W PS + W PP
Sbjct: 1540 KARNNVVFNKFRESPSKTVLSAKAETHD---WLNATQSHKKTPSPTRQIAENKIEWRNPP 1596
Query: 387 PDTVKCNSDGSFREDVQRMGGVG--VIRDHQGRWVA-GCYLGEAAGNAFRAEAKALLDVL 443
VKCN D F DVQ++ G +IR+H G ++ G N AE KALL L
Sbjct: 1597 ATYVKCNFDAGF--DVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAAL 1654
Query: 444 ELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEF-HVLHSITQLLARDWHVRINSVHR 502
+ W RGY+++ + +C L+ ++ + HS + L I+ + ++ + +
Sbjct: 1655 QQTWIRGYTQVFMEGDCQTLINLI---NGISFHSSLANHLEDISFWANKFASIQFGFIRK 1711
Query: 503 DSNAVADHLVRRG 515
N +A L + G
Sbjct: 1712 KGNKLAHVLAKYG 1724
>UniRef100_Q9SK98 Very similar to retrotransposon reverse transcriptase [Arabidopsis
thaliana]
Length = 1231
Score = 103 bits (256), Expect = 2e-20
Identities = 102/403 (25%), Positives = 166/403 (40%), Gaps = 57/403 (14%)
Query: 61 F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
F ++ K ++++ D L+ R+++ +Y S W+ IL RD++ +
Sbjct: 709 FNQALLAKQAWRLLQFPDCLFARLIKSRYFPVGEFLDSDVGSRPSFGWRSILHGRDLLCR 768
Query: 121 RFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVH--ISDVNLTLCDLIQDNK--WNLQRLY 175
R+G G + VW W + + P++ I +V+L + DLI K W L +L
Sbjct: 769 GLVKRVGNGKSIRVWIDYWLDDNGL--RAPWIKNPIINVDLLVSDLIDYEKRDWRLDKLE 826
Query: 176 TNLPHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWI------NHLAHNPIE 229
+ +P + + +D WIWK SG YSV+ Y W+ +A +
Sbjct: 827 EQFFPDDVVKIRENRPVVSL--DDFWIWKHNKSGDYSVKLGY-WLASNQNLGQVAIEAMM 883
Query: 230 DRKLN----WVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGL 285
LN VWKL+ KI++F W+VL AIPV +L + D+ C CG E
Sbjct: 884 QPSLNDLKTQVWKLQTEPKIKVFLWKVLSGAIPVVDLLSYRGMKLDSRCQTCGCEGESIQ 943
Query: 286 HCLRDCSFS*DLWR-----------RMGAI--NWRNFRYN-NIISWFSSMARGVHGIQFL 331
H L CSF +W G++ N +F N + + W + R I
Sbjct: 944 HVLFSCSFPRQVWAMSNIHVPLLGFECGSVYANLYHFLINRDNLKWPVELRRSFPWI--- 1000
Query: 332 AGVWGAWKWRCNWLLDSQRWPIEVVWRRIAHDHDDWAWC----------APSNDLLLCHP 381
+W WK R + + +R+ + ++ D +DW +D + P
Sbjct: 1001 --IWRIWKNRNLFFFEGKRFTVLETILKVRKDVEDWFAAQVVEKERRAEVGQSDQQVFSP 1058
Query: 382 --------WSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQG 416
W PPP D VKCN S+ + G V+R+ +G
Sbjct: 1059 RNVSPVVRWLPPPTDWVKCNVGLSWSRRNRLAGVAWVLRNDRG 1101
>UniRef100_Q9SZ87 RNA-directed DNA polymerase-like protein [Arabidopsis thaliana]
Length = 1274
Score = 101 bits (252), Expect = 5e-20
Identities = 114/491 (23%), Positives = 204/491 (41%), Gaps = 51/491 (10%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAH-QHDSPIWKGILWARDMIDQRFE 123
+ K +++K+ L RVL KY +S A S W+GIL RD++ +
Sbjct: 799 IEAKLSWRILKEPHSLLSRVLLGKYCNTSSFMDCSASPSFASHGWRGILAGRDLLRKGLG 858
Query: 124 FRIGKGDT-SVWYQDWSGIGIIANQIPFVHISDVN--LTLCDLIQDN--KWNLQRLYTNL 178
+ IG+GD+ +VW + W + + Q P ++ N L++ DLI + WN++ + +L
Sbjct: 859 WSIGQGDSINVWTEAW--LSPSSPQTPIGPPTETNKDLSVHDLICHDVKSWNVEAIRKHL 916
Query: 179 PHSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHNPIEDRKLNW--- 235
P + Q + + +D+ +W SG Y+ + Y + L P NW
Sbjct: 917 PQ-YEDQIRKITIN-ALPLQDSLVWLPVKSGEYTTKTGYA-LAKLNSFPASQLDFNWQKN 973
Query: 236 VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS* 295
+WK+ K++ F W+ + A+PV E R ++ ++ TC RCG E LH + C ++
Sbjct: 974 IWKIHTSPKVKHFLWKAMKGALPVGEALSRRNIEAEVTCKRCGQ-TESSLHLMLLCPYAK 1032
Query: 296 DLWRRMGAINWRNFRYNNIISWFSSMARGVHGIQFLAG---------------VWGAWKW 340
+W + +N + SS+A + + + +W WK
Sbjct: 1033 KVWELAPVL------FNPSEATHSSVALLLVDAKRMVALPPTGLGSAPLYPWLLWHLWKA 1086
Query: 341 RCNWLLDSQRWPIEVVWRRIAHDHDDWAWCAPSNDLLLCHP-----WSPPPPD--TVKCN 393
R + D+ E + + D W LL+ HP + P P+ C
Sbjct: 1087 RNRLIFDNHSCSEEGLVLKAILDARAWM----EAQLLIHHPSPISDYPSPTPNLKVTSCF 1142
Query: 394 SDGSFREDVQRMGGVGVIRDHQGRWVAGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSR 453
D ++ G + ++ + G+A AE A+ L A + G +
Sbjct: 1143 VDAAWTTSGYCGMGWFLQDPYKVKIKENQSSSSFVGSALMAETLAVHLALVDALSTGVRQ 1202
Query: 454 LICDVNCDNLVTILVEAEA-VQMHSEFHVLHSITQLLARDWHVRINSVHRDSNAVADHLV 512
L +C L+++L ++ V++ +LH I +L H+ + R SN VAD L
Sbjct: 1203 LNVFSDCKELISLLNSGKSIVELRG---LLHDIRELSVSFTHLCFFFIPRLSNVVADSLA 1259
Query: 513 RRGAAAMSSES 523
+ + + S S
Sbjct: 1260 KSALSVILSSS 1270
>UniRef100_Q9SIQ5 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1524
Score = 101 bits (252), Expect = 5e-20
Identities = 119/493 (24%), Positives = 195/493 (39%), Gaps = 57/493 (11%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQRFEF 124
++ K ++I+ + L+ RV++ +Y +D SI + + S W +L ++ +
Sbjct: 1021 LLAKQAWRLIQYPNSLFARVMKARYFKDVSILDAKVRKQQSYGWASLLDGIALLKKGTRH 1080
Query: 125 RIGKGDTSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDNKWNLQRLYTNLPHSLQQ 184
IG G G+ I + P ++ T ++ +N + + Y S
Sbjct: 1081 LIGDGQNIR-----IGLDNIVDSHPPRPLNTEE-TYKEMTINNLFERKGSYYFWDDSKIS 1134
Query: 185 QFLAVQPQICMNR--------EDAWIWKDGSSGRYSVRDAYEWINH------LAHNPIE- 229
QF+ ++R D IW ++G Y+VR Y + H A NP
Sbjct: 1135 QFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHG 1194
Query: 230 --DRKLNWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHC 287
D K +W L + K++ F W+ L A+ E + D C RC E H
Sbjct: 1195 SIDLKTR-IWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPICPRCHRENESINHA 1253
Query: 288 LRDCSFS*DLWRRMGAINWRN------FRYN--NIISWFSSMARG-VHGIQFLAGVWGAW 338
L C F+ W + RN F N NI+++ H + + +W W
Sbjct: 1254 LFTCPFATMAWWLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIW 1313
Query: 339 KWRCNWLLDSQR-WPIEVVWRRIAHDHDDWAWC--------APSNDLLLCH---PWSPPP 386
K R N + + R P + V A HD W PS + W PP
Sbjct: 1314 KARNNVVFNKFRESPSKTVLSAKAETHD---WLNATQSHKKTPSPTRQIAENKIEWRNPP 1370
Query: 387 PDTVKCNSDGSFREDVQRMGGVG--VIRDHQGRWVA-GCYLGEAAGNAFRAEAKALLDVL 443
VKCN D F DVQ++ G +IR+H G ++ G N AE KALL L
Sbjct: 1371 ATYVKCNFDAGF--DVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAAL 1428
Query: 444 ELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEF-HVLHSITQLLARDWHVRINSVHR 502
+ W RGY+++ + +C L+ ++ + HS + L I+ + ++ + R
Sbjct: 1429 QQTWIRGYTQVFMEGDCQTLINLI---NGISFHSSLANHLEDISFWANKFASIQFGFIRR 1485
Query: 503 DSNAVADHLVRRG 515
N +A L + G
Sbjct: 1486 KGNKLAHVLAKYG 1498
>UniRef100_Q9SHP2 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 773
Score = 99.0 bits (245), Expect = 3e-19
Identities = 78/327 (23%), Positives = 132/327 (39%), Gaps = 23/327 (7%)
Query: 61 F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
F ++ K +++++ L RV + KY + +A S WK IL +I +
Sbjct: 316 FNIALLAKQSWRILQQPFSLMARVFKAKYFPKERLLDAKATSQSSYAWKSILHGTKLISR 375
Query: 121 RFEFRIGKGDT-SVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDNKWNLQRLYTNLP 179
++ G G+ +W +W + + L + DL+ + +WN L +
Sbjct: 376 GLKYIAGNGNNIQLWKDNWLPLNPPRPPVGTCDSIYSQLKVSDLLIEGRWNEDLLCKLIH 435
Query: 180 HSLQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAH---------NPIED 230
+ A++P I DA W G YSV+ Y + L+ N +
Sbjct: 436 QNDIPHIRAIRPSIT-GANDAITWIYTHDGNYSVKSGYHLLRKLSQQQHASLPSPNEVSA 494
Query: 231 RKL-NWVWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLR 289
+ + +WK P KI+ F W+ HNA+P R L +D TC RCG ED H L
Sbjct: 495 QTVFTNIWKQNAPPKIKHFWWRSAHNALPTAGNLKRRRLITDDTCQRCGEASEDVNHLLF 554
Query: 290 DCSFS*DLWRRM-------GAINWRNFRYN--NIISWFSSMARGVHGIQFLAGVWGAWKW 340
C S ++W + ++ +F N +I S + V F+ W WK
Sbjct: 555 QCRVSKEIWEQAHIKLCPGDSLMSNSFNQNLESIQKLNQSARKDVSLFPFIG--WRIWKM 612
Query: 341 RCNWLLDSQRWPIEVVWRRIAHDHDDW 367
R + + +++RW I ++ D W
Sbjct: 613 RNDLIFNNKRWSIPDSIQKALIDQQQW 639
>UniRef100_Q9SF52 Putative non-LTR reverse transcriptase [Arabidopsis thaliana]
Length = 484
Score = 98.2 bits (243), Expect = 5e-19
Identities = 95/353 (26%), Positives = 144/353 (39%), Gaps = 43/353 (12%)
Query: 197 REDAWIWKDGSSGRYSVRDAYEWINH------LAHNPIE---DRKLNWVWKLRVPEKIRM 247
+ D IW ++G Y+VR Y + H A NP D K +W L + K++
Sbjct: 115 KPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTR-IWNLPIMPKLKH 173
Query: 248 FTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRRMGAINWR 307
F W+ L A+ E + D +C RC E H L C F+ WR + R
Sbjct: 174 FLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSLIR 233
Query: 308 N------FRYN--NIISWFSSMARG-VHGIQFLAGVWGAWKWRCNWLLDSQR-WPIEVVW 357
N F N NI+++ H + + +W WK R N + + R P + V
Sbjct: 234 NQLMSNDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVL 293
Query: 358 RRIAHDHDDWAWC--------APSNDLLLCH---PWSPPPPDTVKCNSDGSFREDVQRMG 406
A HD W PS + W PP VKCN D F DVQ++
Sbjct: 294 SAKAETHD---WLNATQSHKKTPSPTRQIAENKIEWRNPPATYVKCNFDAGF--DVQKLE 348
Query: 407 GVG--VIRDHQGRWVA-GCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNL 463
G +IR+H G ++ G N AE KALL L+ W RGY+++ + +C L
Sbjct: 349 ATGGWIIRNHYGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTL 408
Query: 464 VTILVEAEAVQMHSEF-HVLHSITQLLARDWHVRINSVHRDSNAVADHLVRRG 515
+ ++ + HS + L I+ + ++ + R N +A L + G
Sbjct: 409 INLI---NGISFHSSLANHLEDISFWANKFASIQFGFIRRKGNKLAHVLAKYG 458
>UniRef100_Q75M12 Hypothetical protein P0676G05.2 [Oryza sativa]
Length = 1936
Score = 92.0 bits (227), Expect = 4e-17
Identities = 109/472 (23%), Positives = 191/472 (40%), Gaps = 97/472 (20%)
Query: 72 QMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQRFEFRIGKGDT 131
++I+ + L +VL+ KY S+ + SP W GI + D++ + +RIG G++
Sbjct: 1512 RLIEFPNSLCAQVLKAKYFPHGSLTDTTFSANASPTWHGIEYGLDLLKKGIIWRIGNGNS 1571
Query: 132 -SVWYQDWSGIGIIANQIPFVHISDVNLTL---CDLI-QDNKWNLQRLYTNLPHSLQQQF 186
+W W + + S N L DLI +D W+ ++ Q F
Sbjct: 1572 VRIWRDPWIPRDLSRRPVS----SKANCRLKWVSDLIAEDGTWDSAKI--------NQYF 1619
Query: 187 LAVQP----QICMN---REDAWIWKDGSSGRYSVRDAYEWINHLAH----NPIEDRKLN- 234
L + +IC++ ED W +GR+SVR AY+ LA + +LN
Sbjct: 1620 LKIDADIIQKICISARLEEDFIAWHPDKTGRFSVRSAYKLALQLADMNNCSSSSSSRLNK 1679
Query: 235 -W--VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDC 291
W +WK VP+K+R+F W+V N++ E + +L C C ED H L C
Sbjct: 1680 SWELIWKCNVPQKVRIFAWRVASNSLATMENKKKRNLERFDVCGICDREKEDAGHALCRC 1739
Query: 292 SFS*DLWRRMGAINWRNFRYNNIISWFSSMARGVHGIQFLAGVWGAWKWRCNWLLDSQRW 351
+ LW +N +Y +++ + A I+ G +++RW
Sbjct: 1740 VHANSLW-----VNLEKGKYVVSVNFPKTTA-----IKHTPGA------------ENRRW 1777
Query: 352 PIEVVWRRIAHDHDDWAWCAPSNDLLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVG-V 410
P +K N DGSF + ++ GG+G +
Sbjct: 1778 -------------------------------ERPRNGWMKLNVDGSFDINSEK-GGIGMI 1805
Query: 411 IRDHQGRWV-AGCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNLVTILVE 469
+R+ G + + C ++ AE A ++ L LA + + + +C +++ +L
Sbjct: 1806 LRNCLGNVIFSSCRSLDSCSGPLEAELHACVEGLHLALHWTLLPIQVETDCSSVIQLLNH 1865
Query: 470 AEAVQMHSEFHVLHSITQ----LLARDWHVRINSVHRDSNAVADHLVRRGAA 517
+ + VL +I Q L+A D + I+ V R N ++ L + A
Sbjct: 1866 PD-----KDRSVLANIAQEAKSLMAGDRQIAISKVQRSQNVISHFLANKARA 1912
>UniRef100_Q9ZUY7 Putative reverse transcriptase [Arabidopsis thaliana]
Length = 314
Score = 87.8 bits (216), Expect = 7e-16
Identities = 73/271 (26%), Positives = 117/271 (42%), Gaps = 33/271 (12%)
Query: 265 RCHLASDATCARCGNVVEDGLHCLRDCSFS*DLWRRM-GAINWRNFRYNNIISW-FSSMA 322
R HL+ C C + +H LRDC +W R+ A R F +++ W F+++
Sbjct: 13 RRHLSDSDICQICKGAEKTIIHILRDCPAMEGIWIRLVPAGKRREFFTQSLLEWLFANLG 72
Query: 323 ------RGVHGIQFLAGVWGAWKWRCN--------------WLLDSQRWP--IEVVWRRI 360
F +W AWKWRC +L D R V+ R +
Sbjct: 73 DRRKTCESTWSTLFALSIWWAWKWRCGNIFGVQDKCRDRVRFLKDLARETSMAHVIVRTL 132
Query: 361 AHDHDDWAWCAPSNDLLLCHPWSPPPPDTVKCNSDGSFREDVQRMGGVGVIRDHQGRWVA 420
+ H + + L+ WS P K N+DG+ R + GV+RD +G W
Sbjct: 133 SGGHGERV------ERLIA--WSKPEEGWWKLNTDGASRGNPGLASAGGVLRDEEGAWRG 184
Query: 421 GCYLGEAAGNAFRAEAKALLDVLELAWNRGYSRLICDVNCDNLVTILVEAEAVQMHSEFH 480
G L +A AE + L +AW R +RL +V+ + +V L + ++H
Sbjct: 185 GFALNIGVCSAPLAELWGVYYGLYIAWERRVTRLEIEVDSEIVVGFL-KIGINEVHPLSF 243
Query: 481 VLHSITQLLARDWHVRINSVHRDSNAVADHL 511
++ ++RDW VRI+ V+R++N +AD L
Sbjct: 244 LVRLCHDFISRDWRVRISHVYREANRLADGL 274
>UniRef100_Q8H0A5 Putative reverse transcriptase [Oryza sativa]
Length = 1557
Score = 87.0 bits (214), Expect = 1e-15
Identities = 80/302 (26%), Positives = 129/302 (42%), Gaps = 34/302 (11%)
Query: 61 F*YCVVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQ 120
F ++ + ++I D L RVL+ KY + SI + SP W+ I +++ +
Sbjct: 1230 FNQALLARQAWRLIDNPDSLCARVLKAKYYPNGSIVDTSFGGNASPGWQAIEHGLELVKK 1289
Query: 121 RFEFRIGKG-DTSVWYQDWSGIGIIANQIPFVHISDVNLT-LCDLIQDN-KWNLQRLYTN 177
+RIG G VW W + ++ P ++ + + DL+ DN W+ ++
Sbjct: 1290 GIIWRIGNGRSVRVWQDPWLPRDL--SRRPITPKNNCRIKWVADLMLDNGMWDANKI--- 1344
Query: 178 LPHSLQQQFLAVQPQICM-------NREDAWIWKDGSSGRYSVRDAYE----WINHLAHN 226
Q FL V +I + + ED W G +SVR AY W A +
Sbjct: 1345 -----NQIFLPVDVEIILKLRTSSRDEEDFIAWHPDKLGNFSVRTAYRLAENWAKEEASS 1399
Query: 227 PIEDRKLN--W--VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVE 282
D + W +WK VP K+++FTW+ N +P + + +L TC CG E
Sbjct: 1400 SSSDVNIRKAWELLWKCNVPSKVKIFTWRATSNCLPTWDNKKKRNLEISDTCVICGMEKE 1459
Query: 283 DGLHCLRDCSFS*DLWRRMGAINWRNFRYNNII---SW-FSSMARGVHGIQ--FLAGVWG 336
D +H L C + LW M N + R ++ + SW F+ +A Q FL +W
Sbjct: 1460 DTMHALCRCPQAKHLWLAMKESNDLSLRMDDHLLGPSWLFNRLALLPDHEQPMFLMVLWR 1519
Query: 337 AW 338
W
Sbjct: 1520 IW 1521
>UniRef100_Q7XS38 OSJNBa0081G05.2 protein [Oryza sativa]
Length = 1711
Score = 83.2 bits (204), Expect = 2e-14
Identities = 83/319 (26%), Positives = 133/319 (41%), Gaps = 46/319 (14%)
Query: 72 QMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQRFEFRIGKGDT 131
++I+ + L +VL+ KY S+ + SP W GI + D++ + +RIG G++
Sbjct: 1356 RLIEFPNSLCAQVLKAKYFPHGSLTDTTFSANASPTWHGIEYGLDLLKKGIIWRIGNGNS 1415
Query: 132 -SVWYQDWSGIGIIANQIPFVHISDVNLTL---CDLI-QDNKWNLQRLYTNLPHSLQQQF 186
+W W + + S N L DLI +D W+ ++ Q F
Sbjct: 1416 VRIWRDPWIPKDLSRRPVS----SKANCRLKWVSDLIAEDGTWDSAKI--------NQYF 1463
Query: 187 LAVQP----QICMN---REDAWIWKDGSSGRYSVRDAYEWINHLAH----NPIEDRKLN- 234
L + +IC++ ED W +GR+SVR AY+ LA + +LN
Sbjct: 1464 LKIDADIIQKICISARLEEDFIAWHPDKTGRFSVRSAYKLALQLADMNNCSSSSSSRLNK 1523
Query: 235 -W--VWKLRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDC 291
W +WK VP+K+R+F W+V N++ E + +L C C ED H L C
Sbjct: 1524 SWELIWKCNVPQKVRIFAWRVASNSLATMENKKKRNLERFDVCGICDREKEDAGHALCCC 1583
Query: 292 SFS*DLWRRM----------GAINWRNFRYNNIISWFSSMARGVHGIQFLAGVWGAWKWR 341
+ LW M +I + NF + + S R + L +W W R
Sbjct: 1584 VHANSLWVSMHKSGSISLDVKSIQFDNFWLFDCLEKLSEYERAL----VLMIIWRNWFVR 1639
Query: 342 CNWLLDSQRWPIEVVWRRI 360
+ + P+EV R I
Sbjct: 1640 NEVMHNKPAPPMEVSKRFI 1658
>UniRef100_Q9SJ38 Putative non-LTR retroelement reverse transcriptase [Arabidopsis
thaliana]
Length = 1229
Score = 81.6 bits (200), Expect = 5e-14
Identities = 61/240 (25%), Positives = 102/240 (42%), Gaps = 10/240 (4%)
Query: 65 VVGKAVCQMIKKSDKLWVRVLEHKYLRDTSIHKVQAHQHDSPIWKGILWARDMIDQRFEF 124
++ K +++ + L R+L KY +S + + S W+ I+ R+++ + +
Sbjct: 760 LLAKLGWRLLNSPESLLSRILLGKYCHSSSFMECKLPSQPSHGWRSIIAGREILKEGLGW 819
Query: 125 RIGKGD-TSVWYQDWSGIGIIANQIPFVHISDVNLTLCDLIQDN--KWNLQRLYTNLPHS 181
I G+ S+W W I I +L + LI N +W+ ++ LP+
Sbjct: 820 LITNGEKVSIWNDPWLSISKPLVPIGPALREHQDLRVSALINQNTLQWDWNKIAVILPN- 878
Query: 182 LQQQFLAVQPQICMNREDAWIWKDGSSGRYSVRDAYEWINHLAHNPIEDRKLNW---VWK 238
+ + P D W SG+Y+ R Y I +A PI + NW +WK
Sbjct: 879 -YENLIKQLPAPSSRGVDKLAWLPVKSGQYTSRSGYG-IASVASIPIPQTQFNWQSNLWK 936
Query: 239 LRVPEKIRMFTWQVLHNAIPVNELWVRCHLASDATCARCGNVVEDGLHCLRDCSFS*DLW 298
L+ KI+ W+ A+PV VR H++ A C RCG E H C F+ +W
Sbjct: 937 LQTLPKIKHLMWKAAMEALPVGIQLVRRHISPSAACHRCG-APESTTHLFFHCEFAAQVW 995
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda K H
0.332 0.143 0.506
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 955,211,275
Number of Sequences: 2790947
Number of extensions: 40421451
Number of successful extensions: 93501
Number of sequences better than 10.0: 231
Number of HSP's better than 10.0 without gapping: 87
Number of HSP's successfully gapped in prelim test: 144
Number of HSP's that attempted gapping in prelim test: 93091
Number of HSP's gapped (non-prelim): 291
length of query: 547
length of database: 848,049,833
effective HSP length: 132
effective length of query: 415
effective length of database: 479,644,829
effective search space: 199052604035
effective search space used: 199052604035
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 77 (34.3 bits)
Lotus: description of TM0299.5