Miyakogusa Predicted Gene
- Lj1g3v3329370.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3329370.1 Non Chatacterized Hit- tr|I3T028|I3T028_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2
SV=1,98.84,0,DUF914,Protein of unknown function DUF914, eukaryotic;
Multidrug resistance efflux transporter EmrE,,CUFF.30390.1
(346 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G59310.1 | Symbols: | Eukaryotic protein of unknown function... 448 e-126
AT3G59340.1 | Symbols: | Eukaryotic protein of unknown function... 445 e-125
AT3G59310.2 | Symbols: | Eukaryotic protein of unknown function... 433 e-122
AT3G59320.1 | Symbols: | Eukaryotic protein of unknown function... 387 e-108
AT3G59320.2 | Symbols: | Eukaryotic protein of unknown function... 267 9e-72
AT3G59330.1 | Symbols: | Eukaryotic protein of unknown function... 266 1e-71
AT3G07080.1 | Symbols: | EamA-like transporter family | chr3:22... 52 4e-07
>AT3G59310.1 | Symbols: | Eukaryotic protein of unknown function
(DUF914) | chr3:21922316-21924313 REVERSE LENGTH=332
Length = 332
Score = 448 bits (1153), Expect = e-126, Method: Compositional matrix adjust.
Identities = 222/323 (68%), Positives = 262/323 (81%), Gaps = 2/323 (0%)
Query: 1 MAMDFRKFWTRNMLIGLGLGQFLSLLITSTGFTSSELAKKGINAPTSQSFLNYVFLVIVY 60
M D ++ T+ LIGLGLGQ LSLL TS GFTSSELA+KGIN PTSQ FLNYV L IVY
Sbjct: 1 MGFDLKEIRTKKTLIGLGLGQLLSLLATSNGFTSSELARKGINVPTSQCFLNYVLLAIVY 60
Query: 61 GSILLYRRKPLKAKWYYYIILGLVDVEANFLVVKAYQYTSLTSVMLLDCWSIPCVMLLTW 120
GSI+LYRR +KAKWYYY +L VDVEANFLVVKAYQYTSLTSVMLLDCW+IPCV++LTW
Sbjct: 61 GSIMLYRRSDIKAKWYYYFLLAFVDVEANFLVVKAYQYTSLTSVMLLDCWAIPCVLVLTW 120
Query: 121 IFLKTKYRFLKITGVIVCIAGLVLVVFSDVHAGDRAGGSNPRKGDAIVFAGATLYAISNV 180
+LKTKYR +KI+GV +CI G+ +VVFSDVHAGDRAGGSNP KGD +V AGATLYA+SN
Sbjct: 121 FYLKTKYRLMKISGVFICIVGVFMVVFSDVHAGDRAGGSNPVKGDFLVLAGATLYAVSNT 180
Query: 181 SEEFLIKNADRVELMGMLGLFGGIVSAIQISVLERNELKSIHWSAGAALPFVGFSVAMFM 240
SEEFL+KNAD VELM LG FG I+SAIQ+S+LER+ELK+IHWS GA PF+ F++ MF+
Sbjct: 181 SEEFLVKNADTVELMTFLGFFGAIISAIQVSILERDELKAIHWSTGAVFPFLRFTLTMFL 240
Query: 241 FYSLVPVLLKINGSTMLNLSLLTSDMWAVVIRIFAYHEKVDWMYFVAFGAVTVGILIYSG 300
FY LVPVLLK NG+TM NLSLLTSDMWAV+IR F YHEKVDW+YF+AF G++IYS
Sbjct: 241 FYPLVPVLLKTNGATMFNLSLLTSDMWAVLIRTFGYHEKVDWLYFLAFATTATGLIIYSM 300
Query: 301 GDKDEDEDQHPLNSAEDPPRIKQ 323
+KD++E H D +++
Sbjct: 301 KEKDQEE--HRFEEVGDEAAMQR 321
>AT3G59340.1 | Symbols: | Eukaryotic protein of unknown function
(DUF914) | chr3:21928971-21931030 REVERSE LENGTH=333
Length = 333
Score = 445 bits (1144), Expect = e-125, Method: Compositional matrix adjust.
Identities = 205/300 (68%), Positives = 252/300 (84%), Gaps = 1/300 (0%)
Query: 1 MAMDFRKF-WTRNMLIGLGLGQFLSLLITSTGFTSSELAKKGINAPTSQSFLNYVFLVIV 59
M DF + T+ LIGLGLGQ LSLL TS FTSSELA+KGINAPTSQ+FL+Y L +V
Sbjct: 3 MGFDFNEMNKTKKTLIGLGLGQILSLLCTSNAFTSSELARKGINAPTSQTFLSYTLLAVV 62
Query: 60 YGSILLYRRKPLKAKWYYYIILGLVDVEANFLVVKAYQYTSLTSVMLLDCWSIPCVMLLT 119
YG I+LYRR +K KWY+Y +L LVDVE NFLVVKA QYTS+TS+MLLDCW+IPCV++LT
Sbjct: 63 YGGIMLYRRPTIKGKWYHYFLLALVDVEGNFLVVKANQYTSITSIMLLDCWAIPCVLVLT 122
Query: 120 WIFLKTKYRFLKITGVIVCIAGLVLVVFSDVHAGDRAGGSNPRKGDAIVFAGATLYAISN 179
W+FLKTKYR +KI+GV +CIAG+V+V+FSDVHAG RAGGSNP KGD +V AGATLYA+SN
Sbjct: 123 WMFLKTKYRLMKISGVFICIAGVVMVLFSDVHAGSRAGGSNPVKGDFLVLAGATLYAVSN 182
Query: 180 VSEEFLIKNADRVELMGMLGLFGGIVSAIQISVLERNELKSIHWSAGAALPFVGFSVAMF 239
+EEFL+KNAD VELM +GLFG I+SAIQ+++ E+ ELK+IHWSA A PF+ F++ MF
Sbjct: 183 TTEEFLVKNADTVELMTFMGLFGAIISAIQVAIFEQGELKAIHWSADAVFPFLRFAITMF 242
Query: 240 MFYSLVPVLLKINGSTMLNLSLLTSDMWAVVIRIFAYHEKVDWMYFVAFGAVTVGILIYS 299
+FYSL+P+LL+ NGSTM LSLLTSDMWAV+IRIFAYHEKVDW+Y++AF +G++IYS
Sbjct: 243 LFYSLLPILLRTNGSTMFTLSLLTSDMWAVLIRIFAYHEKVDWLYYLAFATTAIGLIIYS 302
>AT3G59310.2 | Symbols: | Eukaryotic protein of unknown function
(DUF914) | chr3:21922316-21924313 REVERSE LENGTH=363
Length = 363
Score = 433 bits (1114), Expect = e-122, Method: Compositional matrix adjust.
Identities = 222/354 (62%), Positives = 262/354 (74%), Gaps = 33/354 (9%)
Query: 1 MAMDFRKFWTRNMLIGLGLGQFLSLLITSTGFTSSELAKKGINAPTSQSFLNYVFLVIVY 60
M D ++ T+ LIGLGLGQ LSLL TS GFTSSELA+KGIN PTSQ FLNYV L IVY
Sbjct: 1 MGFDLKEIRTKKTLIGLGLGQLLSLLATSNGFTSSELARKGINVPTSQCFLNYVLLAIVY 60
Query: 61 GSILLYRRKPLKAKWYYYIILGLVDVEANFLVVKAYQYTSLTSVMLLDCWSIPCVMLLTW 120
GSI+LYRR +KAKWYYY +L VDVEANFLVVKAYQYTSLTSVMLLDCW+IPCV++LTW
Sbjct: 61 GSIMLYRRSDIKAKWYYYFLLAFVDVEANFLVVKAYQYTSLTSVMLLDCWAIPCVLVLTW 120
Query: 121 IFLKTKYRFLKITGVIVCIAGLVLVVFSDVHAGDRAGGSNPRKGDAIVFAGATLYAISNV 180
+LKTKYR +KI+GV +CI G+ +VVFSDVHAGDRAGGSNP KGD +V AGATLYA+SN
Sbjct: 121 FYLKTKYRLMKISGVFICIVGVFMVVFSDVHAGDRAGGSNPVKGDFLVLAGATLYAVSNT 180
Query: 181 SEEFLIKNADRVELMGMLGLFGGIVSAIQISVLERNELKSIHWSAGA------------- 227
SEEFL+KNAD VELM LG FG I+SAIQ+S+LER+ELK+IHWS GA
Sbjct: 181 SEEFLVKNADTVELMTFLGFFGAIISAIQVSILERDELKAIHWSTGAVGFLAMAISILTS 240
Query: 228 ------------------ALPFVGFSVAMFMFYSLVPVLLKINGSTMLNLSLLTSDMWAV 269
PF+ F++ MF+FY LVPVLLK NG+TM NLSLLTSDMWAV
Sbjct: 241 ANQRRHILVYLLHFSRFQVFPFLRFTLTMFLFYPLVPVLLKTNGATMFNLSLLTSDMWAV 300
Query: 270 VIRIFAYHEKVDWMYFVAFGAVTVGILIYSGGDKDEDEDQHPLNSAEDPPRIKQ 323
+IR F YHEKVDW+YF+AF G++IYS +KD++E H D +++
Sbjct: 301 LIRTFGYHEKVDWLYFLAFATTATGLIIYSMKEKDQEE--HRFEEVGDEAAMQR 352
>AT3G59320.1 | Symbols: | Eukaryotic protein of unknown function
(DUF914) | chr3:21924705-21926840 REVERSE LENGTH=339
Length = 339
Score = 387 bits (994), Expect = e-108, Method: Compositional matrix adjust.
Identities = 198/333 (59%), Positives = 262/333 (78%), Gaps = 1/333 (0%)
Query: 1 MAMDFRKFWTRNMLIGLGLGQFLSLLITSTGFTSSELAKKGINAPTSQSFLNYVFLVIVY 60
+ +F + T+ LIGLGLGQ +SLL T +SE+A+KGI+APTSQ+FL YV L IVY
Sbjct: 2 VCFNFNEMKTKKTLIGLGLGQIISLLSTLIASIASEIARKGISAPTSQTFLGYVSLAIVY 61
Query: 61 GSILLYRRKPLKAKWYYYIILGLVDVEANFLVVKAYQYTSLTSVMLLDCWSIPCVMLLTW 120
G I+LYRR +K KWY+Y +L +VDVEANFLVVKA+Q TS+TS+MLLDCW+IPCV++LTW
Sbjct: 62 GGIMLYRRSAIKVKWYHYFLLAVVDVEANFLVVKAFQNTSMTSIMLLDCWAIPCVLVLTW 121
Query: 121 IFLKTKYRFLKITGVIVCIAGLVLVVFSDVHAGDRAGGSNPRKGDAIVFAGATLYAISNV 180
+FLKT+YR +KI+GV++CI G+V+VVFSDVHAGDRAGGSNP KGD +V AGATLYA+SNV
Sbjct: 122 VFLKTRYRLMKISGVVICIVGVVMVVFSDVHAGDRAGGSNPVKGDFLVIAGATLYAVSNV 181
Query: 181 SEEFLIKNADRVELMGMLGLFGGIVSAIQISVLERNELKSIHWSAGAALPFVGFSVAMFM 240
+EEFL+KNAD ELM LGLFG I++AIQIS+ ER +++I WS A L ++G ++ +F+
Sbjct: 182 TEEFLVKNADVTELMAFLGLFGAIIAAIQISIFERGAVRAIQWSTEAILLYIGGALGLFL 241
Query: 241 FYSLVPVLLKINGSTMLNLSLLTSDMWAVVIRIFAYHEKVDWMYFVAFGAVTVGILIYSG 300
FY+L+ +L+K NGSTM NLSLLTSDMWA++IR F YHEKVDW+YF+AF G++IYS
Sbjct: 242 FYTLITILIKNNGSTMFNLSLLTSDMWAILIRTFGYHEKVDWLYFLAFATTATGLIIYSM 301
Query: 301 GDKDEDEDQHPLNSAEDPPRIKQDEEAKSGNSS 333
+KDE E+Q + ++ +E+ S +S+
Sbjct: 302 KEKDE-EEQRSGEVVSERRKLFDEEDGDSLHSN 333
>AT3G59320.2 | Symbols: | Eukaryotic protein of unknown function
(DUF914) | chr3:21924705-21926109 REVERSE LENGTH=238
Length = 238
Score = 267 bits (682), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 138/233 (59%), Positives = 186/233 (79%), Gaps = 1/233 (0%)
Query: 101 LTSVMLLDCWSIPCVMLLTWIFLKTKYRFLKITGVIVCIAGLVLVVFSDVHAGDRAGGSN 160
+TS+MLLDCW+IPCV++LTW+FLKT+YR +KI+GV++CI G+V+VVFSDVHAGDRAGGSN
Sbjct: 1 MTSIMLLDCWAIPCVLVLTWVFLKTRYRLMKISGVVICIVGVVMVVFSDVHAGDRAGGSN 60
Query: 161 PRKGDAIVFAGATLYAISNVSEEFLIKNADRVELMGMLGLFGGIVSAIQISVLERNELKS 220
P KGD +V AGATLYA+SNV+EEFL+KNAD ELM LGLFG I++AIQIS+ ER +++
Sbjct: 61 PVKGDFLVIAGATLYAVSNVTEEFLVKNADVTELMAFLGLFGAIIAAIQISIFERGAVRA 120
Query: 221 IHWSAGAALPFVGFSVAMFMFYSLVPVLLKINGSTMLNLSLLTSDMWAVVIRIFAYHEKV 280
I WS A L ++G ++ +F+FY+L+ +L+K NGSTM NLSLLTSDMWA++IR F YHEKV
Sbjct: 121 IQWSTEAILLYIGGALGLFLFYTLITILIKNNGSTMFNLSLLTSDMWAILIRTFGYHEKV 180
Query: 281 DWMYFVAFGAVTVGILIYSGGDKDEDEDQHPLNSAEDPPRIKQDEEAKSGNSS 333
DW+YF+AF G++IYS +KDE E+Q + ++ +E+ S +S+
Sbjct: 181 DWLYFLAFATTATGLIIYSMKEKDE-EEQRSGEVVSERRKLFDEEDGDSLHSN 232
>AT3G59330.1 | Symbols: | Eukaryotic protein of unknown function
(DUF914) | chr3:21926984-21928332 REVERSE LENGTH=248
Length = 248
Score = 266 bits (681), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 145/236 (61%), Positives = 172/236 (72%), Gaps = 37/236 (15%)
Query: 4 DFRKFWTRNMLIGLGLGQFLSLLITSTGFTSSELAKK----------------------- 40
+F + T+ LIG GLGQ LSLL TS FTSSE+A+K
Sbjct: 5 NFNRIKTKKTLIGFGLGQILSLLSTSLSFTSSEIARKDFSFFFFSNHPIFKHSHELTKFE 64
Query: 41 ---GINAPTSQSFLNYVFLVIVYGSILLYRRKPLKAKWYYYIILGLVDVEANFLVVKAYQ 97
GINAPTSQSFL YV L IVYG AKWY+Y++L VDVEANFLVVKAYQ
Sbjct: 65 IFIGINAPTSQSFLGYVLLAIVYG-----------AKWYHYLLLAFVDVEANFLVVKAYQ 113
Query: 98 YTSLTSVMLLDCWSIPCVMLLTWIFLKTKYRFLKITGVIVCIAGLVLVVFSDVHAGDRAG 157
TS+TSVMLLDCW+IPCV++ TW+FLKTKYR +KI+GV++C G+V+VVFSDVHAGDRAG
Sbjct: 114 NTSMTSVMLLDCWAIPCVLVFTWVFLKTKYRLMKISGVVICNVGVVMVVFSDVHAGDRAG 173
Query: 158 GSNPRKGDAIVFAGATLYAISNVSEEFLIKNADRVELMGMLGLFGGIVSAIQISVL 213
GSNP KGD +V AGATLYA+SNVS+EFL+KNADRV+LM +LGLFG I+ AIQIS+
Sbjct: 174 GSNPIKGDFLVIAGATLYAVSNVSQEFLVKNADRVQLMSLLGLFGAIIGAIQISIF 229
>AT3G07080.1 | Symbols: | EamA-like transporter family |
chr3:2241360-2242934 FORWARD LENGTH=438
Length = 438
Score = 52.4 bits (124), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 59/114 (51%), Gaps = 7/114 (6%)
Query: 95 AYQYTSLTSVMLLDCWSIPCVMLLTWIFLKTKYRFLKITGVIVCIAGLVLVVFSDVHAGD 154
+ +YT++TS +L S L++ IFL K+ +LK+ V++C++G ++V D +
Sbjct: 184 SLKYTTVTSNTILSSASSLFTFLVSLIFLGEKFTWLKLFSVLLCMSGTIIVSMGDSESKS 243
Query: 155 RAGGSNPRKGDAIVFAGATLYAIS-NVSEEFLIKNADR------VELMGMLGLF 201
A NP GD + A LYA+ + + L + +R + +G LGLF
Sbjct: 244 NAVAKNPLLGDILSLVSAALYAVYITLIRKKLPDDDERNGRVSMAQFLGFLGLF 297