Miyakogusa Predicted Gene
- Lj2g3v0914630.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj2g3v0914630.1 CUFF.35705.1
(342 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G78970.1 | Symbols: LUP1, ATLUP1 | lupeol synthase 1 | chr1:2... 465 e-131
AT1G78970.2 | Symbols: LUP1, ATLUP1 | lupeol synthase 1 | chr1:2... 465 e-131
AT2G07050.1 | Symbols: CAS1 | cycloartenol synthase 1 | chr2:292... 457 e-129
AT1G78955.1 | Symbols: CAMS1 | camelliol C synthase 1 | chr1:296... 452 e-127
AT1G78960.1 | Symbols: ATLUP2, LUP2 | lupeol synthase 2 | chr1:2... 446 e-126
AT1G78950.1 | Symbols: | Terpenoid cyclases family protein | ch... 431 e-121
AT1G66960.1 | Symbols: | Terpenoid cyclases family protein | ch... 419 e-117
AT3G45130.1 | Symbols: LAS1 | lanosterol synthase 1 | chr3:16512... 405 e-113
AT1G78500.1 | Symbols: | Terpenoid cyclases family protein | ch... 377 e-105
AT5G36150.1 | Symbols: ATPEN3, PEN3 | putative pentacyclic trite... 369 e-102
AT5G48010.2 | Symbols: THAS, THAS1 | thalianol synthase 1 | chr5... 368 e-102
AT4G15340.1 | Symbols: ATPEN1, 04C11, PEN1 | pentacyclic triterp... 360 e-100
AT3G29255.1 | Symbols: | catalytics;intramolecular transferases... 357 7e-99
AT5G42600.1 | Symbols: MRN1 | marneral synthase | chr5:17053566-... 355 2e-98
AT4G15370.1 | Symbols: BARS1, PEN2 | baruol synthase 1 | chr4:87... 354 6e-98
AT5G48010.1 | Symbols: THAS, THAS1 | thalianol synthase 1 | chr5... 347 9e-96
>AT1G78970.1 | Symbols: LUP1, ATLUP1 | lupeol synthase 1 |
chr1:29703414-29707715 FORWARD LENGTH=757
Length = 757
Score = 465 bits (1196), Expect = e-131, Method: Compositional matrix adjust.
Identities = 206/340 (60%), Positives = 265/340 (77%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+WD FAIQA+L+ N+ +E LK+ H+++KASQVRENPSGDF++MYRHISKGAWTFS
Sbjct: 415 LWDTGFAIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSD 474
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
DHGWQVSDCTAE LK LLLS MS D+VG K++ EQ YD+VN++LSLQS NGG AWEP
Sbjct: 475 RDHGWQVSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEP 534
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
RAY+WLE NPTEF T++ERE+VECT S +QAL LFRKLYP HR+KEI+R I KA++
Sbjct: 535 SRAYKWLELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQ 594
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
+I++ Q PDGSWYG WG+C+ Y TWFA+ GL A G+ + + + +R FLL+ Q +GG
Sbjct: 595 FIQDNQTPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGG 654
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGESYLS ++ Y EG+R+NLVQ+SWA+++L+ GQAE D P+HR +L+INSQ+++
Sbjct: 655 WGESYLSCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLEN 714
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRRRVL 340
GDFPQQEI G FM C L+Y++YRN FP+WAL EYR+ V
Sbjct: 715 GDFPQQEIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVF 754
>AT1G78970.2 | Symbols: LUP1, ATLUP1 | lupeol synthase 1 |
chr1:29703414-29707715 FORWARD LENGTH=757
Length = 757
Score = 465 bits (1196), Expect = e-131, Method: Compositional matrix adjust.
Identities = 206/340 (60%), Positives = 265/340 (77%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+WD FAIQA+L+ N+ +E LK+ H+++KASQVRENPSGDF++MYRHISKGAWTFS
Sbjct: 415 LWDTGFAIQALLASNLPDETDDALKRGHNYIKASQVRENPSGDFRSMYRHISKGAWTFSD 474
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
DHGWQVSDCTAE LK LLLS MS D+VG K++ EQ YD+VN++LSLQS NGG AWEP
Sbjct: 475 RDHGWQVSDCTAEALKCCLLLSMMSADIVGQKIDDEQLYDSVNLLLSLQSGNGGVNAWEP 534
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
RAY+WLE NPTEF T++ERE+VECT S +QAL LFRKLYP HR+KEI+R I KA++
Sbjct: 535 SRAYKWLELLNPTEFMANTMVEREFVECTSSVIQALDLFRKLYPDHRKKEINRSIEKAVQ 594
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
+I++ Q PDGSWYG WG+C+ Y TWFA+ GL A G+ + + + +R FLL+ Q +GG
Sbjct: 595 FIQDNQTPDGSWYGNWGVCFIYATWFALGGLAAAGETYNDCLAMRNGVHFLLTTQRDDGG 654
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGESYLS ++ Y EG+R+NLVQ+SWA+++L+ GQAE D P+HR +L+INSQ+++
Sbjct: 655 WGESYLSCSEQRYIPSEGERSNLVQTSWAMMALIHTGQAERDLIPLHRAAKLIINSQLEN 714
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRRRVL 340
GDFPQQEI G FM C L+Y++YRN FP+WAL EYR+ V
Sbjct: 715 GDFPQQEIVGAFMNTCMLHYATYRNTFPLWALAEYRKVVF 754
>AT2G07050.1 | Symbols: CAS1 | cycloartenol synthase 1 |
chr2:2924629-2930295 FORWARD LENGTH=759
Length = 759
Score = 457 bits (1177), Expect = e-129, Method: Compositional matrix adjust.
Identities = 207/340 (60%), Positives = 266/340 (78%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+WD FAIQAIL+ N+ EEYGP L+KAH FVK SQV E+ GD YRHISKGAW FS
Sbjct: 415 LWDTGFAIQAILATNLVEEYGPVLEKAHSFVKNSQVLEDCPGDLNYWYRHISKGAWPFST 474
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
DHGW +SDCTAEGLK ALLLS++ +VG ++ ++ Y+AVNVI+SLQ+++GG +E
Sbjct: 475 ADHGWPISDCTAEGLKAALLLSKVPKAIVGEPIDAKRLYEAVNVIISLQNADGGLATYEL 534
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
R+Y WLE NP E F + +I+ YVECT +A+QAL FRKLYP HR+KE+D CI KA++
Sbjct: 535 TRSYPWLELINPAETFGDIVIDYPYVECTSAAIQALISFRKLYPGHRKKEVDECIEKAVK 594
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
+IE+ Q DGSWYG W +C+TYGTWF V+GL A GK +NS + +AC+FLLSKQ P+GG
Sbjct: 595 FIESIQAADGSWYGSWAVCFTYGTWFGVKGLVAVGKTLKNSPHVAKACEFLLSKQQPSGG 654
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGESYLS QDKVY+N++G R+++V ++WA+L+L+ AGQAE+D P+HR R LIN+QM++
Sbjct: 655 WGESYLSCQDKVYSNLDGNRSHVVNTAWAMLALIGAGQAEVDRKPLHRAARYLINAQMEN 714
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRRRVL 340
GDFPQQEI GVF RNC + Y++YRNIFPIWALGEYR +VL
Sbjct: 715 GDFPQQEIMGVFNRNCMITYAAYRNIFPIWALGEYRCQVL 754
>AT1G78955.1 | Symbols: CAMS1 | camelliol C synthase 1 |
chr1:29689153-29694255 REVERSE LENGTH=769
Length = 769
Score = 452 bits (1162), Expect = e-127, Method: Compositional matrix adjust.
Identities = 203/339 (59%), Positives = 258/339 (76%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+WD+ FA+QA+++ N+ E L++ + F+K SQVRENPSGDF MYRHISKG+WTFS
Sbjct: 418 LWDSGFALQALVASNLVNEIPDVLRRGYDFLKNSQVRENPSGDFTNMYRHISKGSWTFSD 477
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
DHGWQ SDCTAE K LLLS + D+VG KM+ EQ Y+AV ++LSLQS NGG AWEP
Sbjct: 478 RDHGWQASDCTAESFKCCLLLSMIPPDIVGPKMDPEQLYEAVTILLSLQSKNGGVTAWEP 537
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
R +WLE NPTE F + ++E EY ECT SA+QAL LF++LYP HR +EI+ I KA++
Sbjct: 538 ARGQEWLELLNPTEVFADIVVEHEYNECTSSAIQALILFKQLYPNHRTEEINTSIKKAVQ 597
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
YIE+ Q DGSWYG WG+C+TY TWF + GL A GK + N + +R+ FLL+ Q NGG
Sbjct: 598 YIESIQMLDGSWYGSWGVCFTYSTWFGLGGLAAAGKTYNNCLAMRKGVHFLLTTQKDNGG 657
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGESYLS K Y EG+R+NLVQ+SWA++ L+ AGQAE DP+P+HR +LLINSQ+++
Sbjct: 658 WGESYLSCPKKRYIPSEGERSNLVQTSWAMMGLLHAGQAERDPSPLHRAAKLLINSQLEN 717
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRRRV 339
GDFPQQEITG FM+NC L+Y++YRNIFP+WAL EYRRRV
Sbjct: 718 GDFPQQEITGAFMKNCLLHYAAYRNIFPVWALAEYRRRV 756
>AT1G78960.1 | Symbols: ATLUP2, LUP2 | lupeol synthase 2 |
chr1:29696722-29701024 FORWARD LENGTH=763
Length = 763
Score = 446 bits (1148), Expect = e-126, Method: Compositional matrix adjust.
Identities = 198/337 (58%), Positives = 259/337 (76%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+WD FAIQA+L+ ++S+E L+K H F+K SQVRENPSGDFK+MYRHISKGAWT S
Sbjct: 418 LWDTVFAIQALLACDLSDETDDVLRKGHSFIKKSQVRENPSGDFKSMYRHISKGAWTLSD 477
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
DHGWQVSDCTAE LK +LLS M ++VG K++ EQ YD+VN++LSLQ GG AWEP
Sbjct: 478 RDHGWQVSDCTAEALKCCMLLSMMPAEVVGQKIDPEQLYDSVNLLLSLQGEKGGLTAWEP 537
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
RA +WLE NPT+FF + EREYVECT + +QAL LF++LYP HR KEI + I K ++
Sbjct: 538 VRAQEWLELLNPTDFFTCVMAEREYVECTSAVIQALVLFKQLYPDHRTKEIIKSIEKGVQ 597
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
+IE+ Q PDGSW+G WGIC+ Y TWFA+ GL A GK +++ + +R+ FLL+ Q +GG
Sbjct: 598 FIESKQTPDGSWHGNWGICFIYATWFALSGLAAAGKTYKSCLAVRKGVDFLLAIQEEDGG 657
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGES+LS ++ Y +EG R+NLVQ++WA++ L+ AGQAE DPTP+HR +L+I SQ+++
Sbjct: 658 WGESHLSCPEQRYIPLEGNRSNLVQTAWAMMGLIHAGQAERDPTPLHRAAKLIITSQLEN 717
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRR 337
GDFPQQEI GVFM C L+Y++YRNIFP+WAL EYR+
Sbjct: 718 GDFPQQEILGVFMNTCMLHYATYRNIFPLWALAEYRK 754
>AT1G78950.1 | Symbols: | Terpenoid cyclases family protein |
chr1:29684558-29688673 REVERSE LENGTH=759
Length = 759
Score = 431 bits (1109), Expect = e-121, Method: Compositional matrix adjust.
Identities = 201/339 (59%), Positives = 256/339 (75%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+WD FA+QA+L+ N+S E L++ H F+K SQV ENPSGD+K+MYRHISKGAWTFS
Sbjct: 418 LWDTGFAMQALLASNLSSEISDVLRRGHEFIKNSQVGENPSGDYKSMYRHISKGAWTFSD 477
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
DHGWQVSDCTA GLK LL S ++ D+VG K + E+ +D+VN++LSLQS NGG AWEP
Sbjct: 478 RDHGWQVSDCTAHGLKCCLLFSMLAPDIVGPKQDPERLHDSVNILLSLQSKNGGMTAWEP 537
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
A +WLE NPTE F + +IE EY ECT SA+QAL+LF++LYP HR EI I KA
Sbjct: 538 AGAPKWLELLNPTEMFSDIVIEHEYSECTSSAIQALSLFKQLYPDHRTTEITAFIKKAAE 597
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
Y+EN Q DGSWYG WGIC+TYGTWFA+ GL A GK F + +R+ +FLL+ Q NGG
Sbjct: 598 YLENMQTRDGSWYGNWGICFTYGTWFALAGLAAAGKTFNDCEAIRKGVQFLLAAQKDNGG 657
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGESYLS K+Y G+ +N+VQ++WAL+ L+ +GQAE DP P+HR +L+INSQ++
Sbjct: 658 WGESYLSCSKKIYIAQVGEISNVVQTAWALMGLIHSGQAERDPIPLHRAAKLIINSQLES 717
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRRRV 339
GDFPQQ+ TGVF++NCTL+Y++YRNI P+WAL EYR RV
Sbjct: 718 GDFPQQQATGVFLKNCTLHYAAYRNIHPLWALAEYRARV 756
>AT1G66960.1 | Symbols: | Terpenoid cyclases family protein |
chr1:24985155-24989664 REVERSE LENGTH=763
Length = 763
Score = 419 bits (1078), Expect = e-117, Method: Compositional matrix adjust.
Identities = 185/337 (54%), Positives = 249/337 (73%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+W FA+QA+L+ + +E L++AH ++K SQVR+NPSGDFK+MYRHISKG WT S
Sbjct: 418 LWMTGFAVQALLASDPRDETYDVLRRAHDYIKKSQVRDNPSGDFKSMYRHISKGGWTLSD 477
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
DHGWQVSDCTAE K +LLS M D+ G K+ EQ YD+VN++LSLQS NGGF AWEP
Sbjct: 478 RDHGWQVSDCTAEAAKCCMLLSTMPTDITGEKINLEQLYDSVNLMLSLQSENGGFTAWEP 537
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
RAY+W+E NPT+ F + EREY ECT + +QAL +F +LYP HR KEI + I KA++
Sbjct: 538 VRAYKWMELMNPTDLFANAMTEREYTECTSAVLQALVIFNQLYPDHRTKEITKSIEKAVQ 597
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
+IE+ Q DGSWYG WGIC+TYGTWFA+ GL A GK + N +++R FLL+ Q +GG
Sbjct: 598 FIESKQLRDGSWYGSWGICFTYGTWFALCGLAAIGKTYNNCLSMRDGVHFLLNIQNEDGG 657
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGESY+S ++ Y +EG R+N+VQ++WA+++L+ AGQA+ D P+H + +I SQ+++
Sbjct: 658 WGESYMSCPEQRYIPLEGNRSNVVQTAWAMMALIHAGQAKRDLIPLHSAAKFIITSQLEN 717
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRR 337
GDFPQQE+ G M C L+YS+Y++IFP WAL EYR+
Sbjct: 718 GDFPQQELLGASMSTCMLHYSTYKDIFPPWALAEYRK 754
>AT3G45130.1 | Symbols: LAS1 | lanosterol synthase 1 |
chr3:16512552-16517522 REVERSE LENGTH=756
Length = 756
Score = 405 bits (1042), Expect = e-113, Method: Compositional matrix adjust.
Identities = 196/340 (57%), Positives = 257/340 (75%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+WD A+QAIL+ N+ ++YG LKKAH+++K +Q+R++ SGD YRH KG W FS
Sbjct: 415 LWDVTLAVQAILATNLVDDYGLMLKKAHNYIKNTQIRKDTSGDPGLWYRHPCKGGWGFST 474
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
D+ W VSDCTAE LK ALLLS+M +LVG M E DAVN ILSLQ+ NGGF ++E
Sbjct: 475 GDNPWPVSDCTAEALKAALLLSQMPVNLVGEPMPEEHLVDAVNFILSLQNKNGGFASYEL 534
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
R+Y LE NP+E F + +I+ +YVECT +A+Q L LF L ++RKEI I+KA+
Sbjct: 535 TRSYPELEVINPSETFGDIIIDYQYVECTSAAIQGLVLFTTLNSSYKRKEIVGSINKAVE 594
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
+IE TQ PDGSWYG WG+C+TY TWF ++G+ A GK +++S+ +R+AC FLLSKQL GG
Sbjct: 595 FIEKTQLPDGSWYGSWGVCFTYATWFGIKGMLASGKTYESSLCIRKACGFLLSKQLCCGG 654
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGESYLS Q+KVYTN+ G ++++V +SWALL+L+ AGQA DP P+HRG + LINSQM+D
Sbjct: 655 WGESYLSCQNKVYTNLPGNKSHIVNTSWALLALIEAGQASRDPMPLHRGAKSLINSQMED 714
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRRRVL 340
GD+PQQEI GVF RNC ++YS+YRNIFPIWALGEYR+ +L
Sbjct: 715 GDYPQQEILGVFNRNCMISYSAYRNIFPIWALGEYRKLML 754
>AT1G78500.1 | Symbols: | Terpenoid cyclases family protein |
chr1:29531646-29535177 FORWARD LENGTH=767
Length = 767
Score = 377 bits (969), Expect = e-105, Method: Compositional matrix adjust.
Identities = 174/341 (51%), Positives = 236/341 (69%), Gaps = 4/341 (1%)
Query: 1 MWDAAFAIQAILS----GNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAW 56
+WD AF++Q +L+ + +E TL K + F+ SQ+ +NP GD + M + I+KG W
Sbjct: 420 LWDTAFSLQVMLAYQDVDDDDDEIRSTLIKGYSFLNKSQLTQNPPGDHRKMLKDIAKGGW 479
Query: 57 TFSMHDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFP 116
TFS D GW VSDCTAE L+ L+ M +L+G KM+ E+ YDAVN++L QS NGG
Sbjct: 480 TFSDQDQGWPVSDCTAESLECCLVFGSMPSELIGEKMDVERLYDAVNLLLYFQSKNGGIT 539
Query: 117 AWEPQRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCIS 176
WE R WLE +P EF E+T++E EYVECTGSA+ ALA F K +P+HRR+E+++ I
Sbjct: 540 VWEAARGRTWLEWLSPVEFMEDTIVEHEYVECTGSAIVALARFLKEFPEHRREEVEKFIK 599
Query: 177 KAIRYIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQL 236
A++YIE+ Q PDGSWYG WG+C+ YGT+FAV GL A GK +QN +R+A +F+L Q
Sbjct: 600 NAVKYIESFQMPDGSWYGNWGVCFMYGTFFAVRGLVAAGKTYQNCEPIRKAVQFILETQN 659
Query: 237 PNGGWGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINS 296
GGWGESYLS +K YT +EG R N+V + AL+ L+ GQ E DP P+HR ++LINS
Sbjct: 660 VEGGWGESYLSCPNKKYTLLEGNRTNVVNTGQALMVLIMGGQMERDPLPVHRAAKVLINS 719
Query: 297 QMDDGDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRR 337
Q+D+GDFPQ+EI GVF N ++Y++YRNIF +WAL Y +
Sbjct: 720 QLDNGDFPQEEIMGVFKMNVMVHYATYRNIFTLWALTYYTK 760
>AT5G36150.1 | Symbols: ATPEN3, PEN3 | putative pentacyclic
triterpene synthase 3 | chr5:14220737-14225422 REVERSE
LENGTH=760
Length = 760
Score = 369 bits (946), Expect = e-102, Method: Compositional matrix adjust.
Identities = 166/339 (48%), Positives = 229/339 (67%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+WD AF +Q +L+ +V +E PTL K + +++ SQ ENP GD+ M+R ISKG W +S
Sbjct: 418 IWDTAFVLQVMLAADVDDEIRPTLIKGYSYLRKSQFTENPPGDYINMFRDISKGGWGYSD 477
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
D GW VSDC +E L+ L+ MS + +G KME E+ YDAVN++L +QS NGG WE
Sbjct: 478 KDQGWPVSDCISESLECCLIFESMSSEFIGEKMEVERLYDAVNMLLYMQSRNGGISIWEA 537
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
+WLE +P EF E+T++E EY+ECTGSA+ LA F K +P HR +E+ + I+K ++
Sbjct: 538 ASGKKWLEWLSPIEFIEDTILEHEYLECTGSAIVVLARFMKQFPGHRTEEVKKFITKGVK 597
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
YIE+ Q DGSWYG WGIC+ YGT+FAV GL A G + N +RRA +FLL Q GG
Sbjct: 598 YIESLQIADGSWYGNWGICFIYGTFFAVRGLVAAGNTYDNCEAIRRAVRFLLDIQNGEGG 657
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGES+LS +K Y +EG + ++V + AL+ L+ GQ + DP P+HR ++LINSQMD+
Sbjct: 658 WGESFLSCPNKNYIPLEGNKTDVVNTGQALMVLIMGGQMDRDPLPVHRAAKVLINSQMDN 717
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRRRV 339
GDFPQQEI GV+ N LN+ ++RN F +WAL Y + +
Sbjct: 718 GDFPQQEIRGVYKMNVMLNFPTFRNSFTLWALTHYTKAI 756
>AT5G48010.2 | Symbols: THAS, THAS1 | thalianol synthase 1 |
chr5:19457001-19461538 FORWARD LENGTH=766
Length = 766
Score = 368 bits (945), Expect = e-102, Method: Compositional matrix adjust.
Identities = 174/347 (50%), Positives = 229/347 (65%), Gaps = 7/347 (2%)
Query: 1 MWDAAFAIQAILSGNVSE----EYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAW 56
+WD A ++ A+L G E TL K + ++K SQ+ ENP GD M+RH +KG W
Sbjct: 419 LWDTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGW 478
Query: 57 TFSMHDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFP 116
TFS D GW VSDCTAE L+ L M +L+G KM+ E+ YDAV+ +L LQS NGG
Sbjct: 479 TFSDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIA 538
Query: 117 AWEPQRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCIS 176
AW+P WLE +P EF E+T++E EYVECTGSA+ AL F K +P ++ E+ R I+
Sbjct: 539 AWQPVEGKAWLEWLSPVEFLEDTIVEYEYVECTGSAIAALTQFNKQFPGYKNVEVKRFIT 598
Query: 177 KAIRYIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQL 236
KA +YIE+ Q DGSWYG WG+C+ YGT+FAV GL A GK + N +R+A +FLL Q
Sbjct: 599 KAAKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQN 658
Query: 237 PNGGWGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINS 296
P GGWGES+LS K YT ++G N+VQ++ AL+ L+ Q E DP P+HR ++LINS
Sbjct: 659 PEGGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINS 718
Query: 297 QMDDGDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYR---RRVL 340
Q+D+GDFPQQEI G FMR L++ +YRN F +WAL Y RR+L
Sbjct: 719 QLDNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLL 765
>AT4G15340.1 | Symbols: ATPEN1, 04C11, PEN1 | pentacyclic triterpene
synthase 1 | chr4:8754670-8760589 REVERSE LENGTH=766
Length = 766
Score = 360 bits (925), Expect = e-100, Method: Compositional matrix adjust.
Identities = 172/338 (50%), Positives = 222/338 (65%), Gaps = 3/338 (0%)
Query: 1 MWDAAFAIQAILSGNVSEEYG---PTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWT 57
+WD ++ +L G + TL K + ++K SQV ENP D M+RHISKG WT
Sbjct: 420 LWDTVMSLHFLLDGVEDDVDDEIRSTLVKGYDYLKKSQVTENPPSDHIKMFRHISKGGWT 479
Query: 58 FSMHDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPA 117
FS D GW VSDCTAE LK LL M + VG KM+ E+ +DAV+ +L LQS NGG A
Sbjct: 480 FSDKDQGWPVSDCTAESLKCCLLFERMPSEFVGQKMDVEKLFDAVDFLLYLQSDNGGITA 539
Query: 118 WEPQRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISK 177
WEP WLE F+P EF ++T+IE EYVECTGSA+ AL F K +P+ R+KE++R I+
Sbjct: 540 WEPADGKTWLEWFSPVEFVQDTVIEHEYVECTGSAIVALTQFSKQFPEFRKKEVERFITN 599
Query: 178 AIRYIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLP 237
++YIE+ Q DGSW G WG+C+ YGT FAV GL A GK F N +RRA +FLL Q
Sbjct: 600 GVKYIEDLQMKDGSWCGNWGVCFIYGTLFAVRGLVAAGKTFHNCEPIRRAVRFLLDTQNQ 659
Query: 238 NGGWGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQ 297
GGWGESYLS K YT + G + N+V + AL+ L+ GQ E DP P+HR +++IN Q
Sbjct: 660 EGGWGESYLSCLRKKYTPLAGNKTNIVSTGQALMVLIMGGQMERDPLPVHRAAKVVINLQ 719
Query: 298 MDDGDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEY 335
+D+GDFPQQE+ GVF N L+Y +YRNI+ +WAL Y
Sbjct: 720 LDNGDFPQQEVMGVFNMNVLLHYPTYRNIYSLWALTLY 757
>AT3G29255.1 | Symbols: | catalytics;intramolecular transferases |
chr3:11209586-11213909 FORWARD LENGTH=706
Length = 706
Score = 357 bits (916), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 163/337 (48%), Positives = 226/337 (67%)
Query: 1 MWDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSM 60
+WD ++ +L+ ++ +E L K + F++ SQ+ ENP G + M+R ISKG W FS
Sbjct: 364 IWDTVLLLKVMLAADIDDEIRSMLIKGYSFLRKSQLIENPPGYYIKMFRDISKGGWGFSD 423
Query: 61 HDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEP 120
D GW SDCT+E L+ L+ M + + KM+ E+ YDAVN++L LQS NGG WE
Sbjct: 424 KDQGWPASDCTSESLECCLIFESMPSNFIDEKMDVERLYDAVNMLLYLQSENGGKAVWER 483
Query: 121 QRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIR 180
+WLE +P EF EET++E EYVECTGSA+ L F K +P+HR KEI+ I+KA++
Sbjct: 484 ASGKKWLEWLSPIEFMEETILEHEYVECTGSAVVVLTRFMKQFPRHRTKEIETFIAKAVK 543
Query: 181 YIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGG 240
YIE+ Q DGSWYG WG+C+ Y T+FAV GL A GK +Q+ +RRA +FLL Q GG
Sbjct: 544 YIESLQMADGSWYGNWGVCFIYATFFAVRGLVAAGKTYQSYEPIRRAVQFLLKIQNDEGG 603
Query: 241 WGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDD 300
WGES+LS K Y ++EG + N+V + A++ L+ +GQ E DP P+HR ++LINSQM++
Sbjct: 604 WGESFLSCPGKKYISLEGNKTNVVNTGQAMMVLIMSGQMERDPLPVHRAAKVLINSQMEN 663
Query: 301 GDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRR 337
GDFPQQE+ GV+ N L+Y +YRNIF +WAL Y +
Sbjct: 664 GDFPQQELRGVYKMNVLLHYPTYRNIFSLWALTYYTK 700
>AT5G42600.1 | Symbols: MRN1 | marneral synthase |
chr5:17053566-17057975 FORWARD LENGTH=761
Length = 761
Score = 355 bits (912), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 163/334 (48%), Positives = 224/334 (67%), Gaps = 1/334 (0%)
Query: 2 WDAAFAIQAILSGNVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWTFSMH 61
W+AA ++Q +L+ N+ +E TL K + F+K SQ+ ENP GD M+R I+KG WTF
Sbjct: 420 WNAALSLQVMLAANMDDEIRSTLIKGYDFLKQSQISENPQGDHLKMFRDITKGGWTFQDR 479
Query: 62 DHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPAWEPQ 121
+ G +SD TAE ++ + M + +G KM+ E+ YDAVN ++ LQS NGG P WEP
Sbjct: 480 EQGLPISDGTAESIECCIHFHRMPSEFIGEKMDVEKLYDAVNFLIYLQSDNGGMPVWEPA 539
Query: 122 RAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISKAIRY 181
+WLE +P E E T++E+EY+ECTGS + L F+K +P HR KEI++ I K ++Y
Sbjct: 540 PGKKWLEWLSPVEHVENTVVEQEYLECTGSVIAGLVCFKKEFPDHRPKEIEKLIKKGLKY 599
Query: 182 IENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLPNGGW 241
IE+ Q PDGSWYG WG+C+TYGT FAV GL A GK F NS +RRA +F+L+ Q GGW
Sbjct: 600 IEDLQMPDGSWYGNWGVCFTYGTLFAVRGLAAAGKTFGNSEAIRRAVQFILNTQNAEGGW 659
Query: 242 GESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQMDDG 301
GES LS +K Y +G N+V + A++ L+ GQ E DP+P+HR ++LINSQ+D G
Sbjct: 660 GESALSCPNKKYIPSKGNVTNVVNTGQAMMVLLIGGQMERDPSPVHRAAKVLINSQLDIG 719
Query: 302 DFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEY 335
DFPQQE G++M N L+Y +YRN+F +WAL Y
Sbjct: 720 DFPQQERRGIYM-NMLLHYPTYRNMFSLWALALY 752
>AT4G15370.1 | Symbols: BARS1, PEN2 | baruol synthase 1 |
chr4:8773786-8779685 REVERSE LENGTH=759
Length = 759
Score = 354 bits (908), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 224/340 (65%), Gaps = 10/340 (2%)
Query: 1 MWDAAFAIQAILSG---NVSEEYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAWT 57
+WD A ++ + G +V EE TL K + +++ SQV ENP GD+ M+RH++KG WT
Sbjct: 422 VWDTALSLHVFIDGFDDDVDEEIRSTLLKGYDYLEKSQVTENPPGDYMKMFRHMAKGGWT 481
Query: 58 FSMHDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFPA 117
FS D GW VSDCTAE L+ L MS + +G KM+ E+ YDAV+ +L LQS NGG A
Sbjct: 482 FSDQDQGWPVSDCTAESLECCLFFESMSSEFIGKKMDVEKLYDAVDFLLYLQSDNGGITA 541
Query: 118 WEPQRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCISK 177
W+P EF E+ ++E EYVECTGSA+ ALA F K +P ++++E++R I+K
Sbjct: 542 WQPADG-------KLVEFIEDAVVEHEYVECTGSAIVALAQFNKQFPGYKKEEVERFITK 594
Query: 178 AIRYIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQLP 237
++YIE+ Q DGSWYG WG+C+ YGT+FAV GL A GK + N +RRA +F+L Q
Sbjct: 595 GVKYIEDLQMVDGSWYGNWGVCFIYGTFFAVRGLVAAGKCYNNCEAIRRAVRFILDTQNT 654
Query: 238 NGGWGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINSQ 297
GGWGESYLS K Y + G + N+V + AL+ L+ Q + DP P+HR ++LINSQ
Sbjct: 655 EGGWGESYLSCPRKKYIPLIGNKTNVVNTGQALMVLIMGNQMKRDPLPVHRAAKVLINSQ 714
Query: 298 MDDGDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYRR 337
MD+GDFPQQEI GVF N L++ +YRN+F +WAL Y +
Sbjct: 715 MDNGDFPQQEIMGVFKMNVMLHFPTYRNMFTLWALTHYTK 754
>AT5G48010.1 | Symbols: THAS, THAS1 | thalianol synthase 1 |
chr5:19457001-19461538 FORWARD LENGTH=758
Length = 758
Score = 347 bits (889), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 169/347 (48%), Positives = 220/347 (63%), Gaps = 15/347 (4%)
Query: 1 MWDAAFAIQAILSGNVSE----EYGPTLKKAHHFVKASQVRENPSGDFKAMYRHISKGAW 56
+WD A ++ A+L G E TL K + ++K SQ+ ENP GD M+RH +KG W
Sbjct: 419 LWDTALSLHALLDGIDDHDVDDEIKTTLVKGYDYLKKSQITENPRGDHFKMFRHKTKGGW 478
Query: 57 TFSMHDHGWQVSDCTAEGLKVALLLSEMSDDLVGAKMETEQFYDAVNVILSLQSSNGGFP 116
TFS D GW VSDCTAE L+ L M +L+G KM+ E+ YDAV+ +L LQS NGG
Sbjct: 479 TFSDQDQGWPVSDCTAESLECCLFFESMPSELIGKKMDVEKLYDAVDYLLYLQSDNGGIA 538
Query: 117 AWEPQRAYQWLEKFNPTEFFEETLIEREYVECTGSAMQALALFRKLYPKHRRKEIDRCIS 176
AW+P WLE N F YVECTGSA+ AL F K +P ++ E+ R I+
Sbjct: 539 AWQPVEGKAWLELLNIMIF--------RYVECTGSAIAALTQFNKQFPGYKNVEVKRFIT 590
Query: 177 KAIRYIENTQNPDGSWYGCWGICYTYGTWFAVEGLTACGKNFQNSVTLRRACKFLLSKQL 236
KA +YIE+ Q DGSWYG WG+C+ YGT+FAV GL A GK + N +R+A +FLL Q
Sbjct: 591 KAAKYIEDMQTVDGSWYGNWGVCFIYGTFFAVRGLVAAGKTYSNCEAIRKAVRFLLDTQN 650
Query: 237 PNGGWGESYLSSQDKVYTNIEGKRANLVQSSWALLSLMRAGQAEIDPTPIHRGIRLLINS 296
P GGWGES+LS K YT ++G N+VQ++ AL+ L+ Q E DP P+HR ++LINS
Sbjct: 651 PEGGWGESFLSCPSKKYTPLKGNSTNVVQTAQALMVLIMGDQMERDPLPVHRAAQVLINS 710
Query: 297 QMDDGDFPQQEITGVFMRNCTLNYSSYRNIFPIWALGEYR---RRVL 340
Q+D+GDFPQQEI G FMR L++ +YRN F +WAL Y RR+L
Sbjct: 711 QLDNGDFPQQEIMGTFMRTVMLHFPTYRNTFSLWALTHYTHALRRLL 757