Miyakogusa Predicted Gene
- Lj0g3v0317119.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0317119.1 Non Chatacterized Hit- tr|I1LUN6|I1LUN6_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,85.36,0,no
description,RmlC-like jelly roll fold; UNCHARACTERIZED,NULL;
DUF1637,Cysteamine dioxygenase; RmlC,CUFF.21480.1
(239 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G42670.1 | Symbols: | Protein of unknown function (DUF1637) ... 350 3e-97
AT2G42670.2 | Symbols: | Protein of unknown function (DUF1637) ... 347 4e-96
AT3G58670.3 | Symbols: | Protein of unknown function (DUF1637) ... 340 5e-94
AT3G58670.2 | Symbols: | Protein of unknown function (DUF1637) ... 340 5e-94
AT3G58670.1 | Symbols: | Protein of unknown function (DUF1637) ... 340 5e-94
AT5G15120.1 | Symbols: | Protein of unknown function (DUF1637) ... 201 4e-52
AT1G18490.1 | Symbols: | Protein of unknown function (DUF1637) ... 193 8e-50
AT5G39890.1 | Symbols: | Protein of unknown function (DUF1637) ... 182 2e-46
>AT2G42670.1 | Symbols: | Protein of unknown function (DUF1637) |
chr2:17772004-17773700 REVERSE LENGTH=241
Length = 241
Score = 350 bits (899), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 167/241 (69%), Positives = 198/241 (82%), Gaps = 2/241 (0%)
Query: 1 MPFYLQRLYRLCKASFSPDGPVSEEAIAKVQEKLEKIKPIDVGLEQEAQVVRNWSASALE 60
MP++ QRLY CKASFS DGP++E+A+ KV+ LEKIKP DVG+EQ+AQ+ R+ S E
Sbjct: 1 MPYFAQRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60
Query: 61 CNGNHQPPAPIKYLHLHECDSFSIGIFCMPPSSVIPLHNHPGMTVLSKLLYGSVHVKSYD 120
NG++Q P IKYLHLHECDSFSIGIFCMPPSS+IPLHNHPGMTVLSKL+YGS+HVKSYD
Sbjct: 61 RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WVE--FPGPTDPSEARAAKLVKDKEMTAPTRTTVLYPTLGGNIHCFRAITPCAIFDILSP 178
W+E P DPS+AR AKLVKD EMTA + T LYP GGNIHCF+AIT CAI DIL+P
Sbjct: 121 WLEPQLTEPEDPSQARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDILAP 180
Query: 179 PYSSDNGRHCSYFQKSSRKDLPETLEMNGVTVSDVTWLEEFQPPDEFAIRRGLYRGPVIK 238
PYSS++ RHC+YF+KS R+DLP LE++G V+DVTWLEEFQPPD+F IRR YRGPVI+
Sbjct: 181 PYSSEHDRHCTYFRKSRREDLPGELEVDGEVVTDVTWLEEFQPPDDFVIRRIPYRGPVIR 240
Query: 239 T 239
T
Sbjct: 241 T 241
>AT2G42670.2 | Symbols: | Protein of unknown function (DUF1637) |
chr2:17772004-17773700 REVERSE LENGTH=242
Length = 242
Score = 347 bits (891), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 168/242 (69%), Positives = 198/242 (81%), Gaps = 3/242 (1%)
Query: 1 MPFYLQRLYRLCKASFSPDGPVSEEAIAKVQEKLEKIKPIDVGLEQEAQVVRNWSASALE 60
MP++ QRLY CKASFS DGP++E+A+ KV+ LEKIKP DVG+EQ+AQ+ R+ S E
Sbjct: 1 MPYFAQRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60
Query: 61 CNGNHQPPAPIKYLHLHECDSFSIGIFCMPPSSVIPLHNHPGMTVLSKLLYGSVHVKSYD 120
NG++Q P IKYLHLHECDSFSIGIFCMPPSS+IPLHNHPGMTVLSKL+YGS+HVKSYD
Sbjct: 61 RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WVE--FPGPTDPS-EARAAKLVKDKEMTAPTRTTVLYPTLGGNIHCFRAITPCAIFDILS 177
W+E P DPS EAR AKLVKD EMTA + T LYP GGNIHCF+AIT CAI DIL+
Sbjct: 121 WLEPQLTEPEDPSQEARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDILA 180
Query: 178 PPYSSDNGRHCSYFQKSSRKDLPETLEMNGVTVSDVTWLEEFQPPDEFAIRRGLYRGPVI 237
PPYSS++ RHC+YF+KS R+DLP LE++G V+DVTWLEEFQPPD+F IRR YRGPVI
Sbjct: 181 PPYSSEHDRHCTYFRKSRREDLPGELEVDGEVVTDVTWLEEFQPPDDFVIRRIPYRGPVI 240
Query: 238 KT 239
+T
Sbjct: 241 RT 242
>AT3G58670.3 | Symbols: | Protein of unknown function (DUF1637) |
chr3:21703693-21705314 REVERSE LENGTH=242
Length = 242
Score = 340 bits (872), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 167/242 (69%), Positives = 193/242 (79%), Gaps = 5/242 (2%)
Query: 1 MPFYLQRLYRLCKASFSPDGPVSEEAIAKVQEKLEKIKPIDVGLEQEAQVVRNWSASALE 60
MP+++QRL+ CK+S SP+GPVSEEA+ KV+ LEKIKP DVGLEQEAQ+VRNW E
Sbjct: 1 MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNE 60
Query: 61 CNGNHQPPAPIKYLHLHECDSFSIGIFCMPPSSVIPLHNHPGMTVLSKLLYGSVHVKSYD 120
NGNH IKYL LHECDSFSIGIFCMPP S+IPLHNHPGMTVLSKL+YGS+HVKSYD
Sbjct: 61 RNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WVEFPGPT---DPSEARAAKLVKDKEMTAPTRTTVLYPTLGGNIHCFRAITPCAIFDILS 177
W E P + DP +AR AKLVKD +MT+P+ T LYPT GGNIHCF+AIT CAIFDILS
Sbjct: 121 WAE-PDQSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILS 179
Query: 178 PPYSSDNGRHCSYFQKSSRKDLPETLE-MNGVTVSDVTWLEEFQPPDEFAIRRGLYRGPV 236
PPYSS +GRHC+YF+KS DLP +E MNG +S+VTWLEE+QPPD F I R YRGPV
Sbjct: 180 PPYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGPV 239
Query: 237 IK 238
I+
Sbjct: 240 IR 241
>AT3G58670.2 | Symbols: | Protein of unknown function (DUF1637) |
chr3:21703693-21705314 REVERSE LENGTH=242
Length = 242
Score = 340 bits (872), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 167/242 (69%), Positives = 193/242 (79%), Gaps = 5/242 (2%)
Query: 1 MPFYLQRLYRLCKASFSPDGPVSEEAIAKVQEKLEKIKPIDVGLEQEAQVVRNWSASALE 60
MP+++QRL+ CK+S SP+GPVSEEA+ KV+ LEKIKP DVGLEQEAQ+VRNW E
Sbjct: 1 MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNE 60
Query: 61 CNGNHQPPAPIKYLHLHECDSFSIGIFCMPPSSVIPLHNHPGMTVLSKLLYGSVHVKSYD 120
NGNH IKYL LHECDSFSIGIFCMPP S+IPLHNHPGMTVLSKL+YGS+HVKSYD
Sbjct: 61 RNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WVEFPGPT---DPSEARAAKLVKDKEMTAPTRTTVLYPTLGGNIHCFRAITPCAIFDILS 177
W E P + DP +AR AKLVKD +MT+P+ T LYPT GGNIHCF+AIT CAIFDILS
Sbjct: 121 WAE-PDQSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILS 179
Query: 178 PPYSSDNGRHCSYFQKSSRKDLPETLE-MNGVTVSDVTWLEEFQPPDEFAIRRGLYRGPV 236
PPYSS +GRHC+YF+KS DLP +E MNG +S+VTWLEE+QPPD F I R YRGPV
Sbjct: 180 PPYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGPV 239
Query: 237 IK 238
I+
Sbjct: 240 IR 241
>AT3G58670.1 | Symbols: | Protein of unknown function (DUF1637) |
chr3:21703693-21705314 REVERSE LENGTH=242
Length = 242
Score = 340 bits (872), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 167/242 (69%), Positives = 193/242 (79%), Gaps = 5/242 (2%)
Query: 1 MPFYLQRLYRLCKASFSPDGPVSEEAIAKVQEKLEKIKPIDVGLEQEAQVVRNWSASALE 60
MP+++QRL+ CK+S SP+GPVSEEA+ KV+ LEKIKP DVGLEQEAQ+VRNW E
Sbjct: 1 MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNE 60
Query: 61 CNGNHQPPAPIKYLHLHECDSFSIGIFCMPPSSVIPLHNHPGMTVLSKLLYGSVHVKSYD 120
NGNH IKYL LHECDSFSIGIFCMPP S+IPLHNHPGMTVLSKL+YGS+HVKSYD
Sbjct: 61 RNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WVEFPGPT---DPSEARAAKLVKDKEMTAPTRTTVLYPTLGGNIHCFRAITPCAIFDILS 177
W E P + DP +AR AKLVKD +MT+P+ T LYPT GGNIHCF+AIT CAIFDILS
Sbjct: 121 WAE-PDQSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILS 179
Query: 178 PPYSSDNGRHCSYFQKSSRKDLPETLE-MNGVTVSDVTWLEEFQPPDEFAIRRGLYRGPV 236
PPYSS +GRHC+YF+KS DLP +E MNG +S+VTWLEE+QPPD F I R YRGPV
Sbjct: 180 PPYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGPV 239
Query: 237 IK 238
I+
Sbjct: 240 IR 241
>AT5G15120.1 | Symbols: | Protein of unknown function (DUF1637) |
chr5:4898814-4900351 FORWARD LENGTH=293
Length = 293
Score = 201 bits (510), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 113/244 (46%), Positives = 146/244 (59%), Gaps = 19/244 (7%)
Query: 5 LQRLYRLCKASFSPDGPV---SEEAIAKVQEKLEKIKPIDVGLEQEAQVVRNWSASALEC 61
++RL+ CK FS GP SE+ I +++E L+ +KP DVGL R S +E
Sbjct: 58 VRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFR--PNSGVEA 115
Query: 62 NGNHQPPAPIKYLHLHECDSFSIGIFCMPPSSVIPLHNHPGMTVLSKLLYGSVHVKSYDW 121
+ PI YLHLH+CD FSIGIFC+PPS VIPLHNHPGMTV SKLL+G++H+KSYDW
Sbjct: 116 RSS----PPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDW 171
Query: 122 VEFPGPTDPSEARAAKLVKDKEMTAPTRTTVLYPTLGGNIHCFRAITPCAIFDILSPPYS 181
V P S+ R AKL D TAP ++LYP GGN+H F AIT CA+ D+L PPY
Sbjct: 172 V-VDAPMRDSKTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYC 230
Query: 182 SDNGRHCSYFQKSSRKDLPETLEMNGVTVSD-----VTWLEEF--QPPDEFAIRRGLYRG 234
+ GRHC+YF + L + E + V S+ WL+E P D + LYRG
Sbjct: 231 NPEGRHCTYFLEFPLDKL--SSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRG 288
Query: 235 PVIK 238
P ++
Sbjct: 289 PKVE 292
>AT1G18490.1 | Symbols: | Protein of unknown function (DUF1637) |
chr1:6367116-6368640 FORWARD LENGTH=282
Length = 282
Score = 193 bits (491), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 112/253 (44%), Positives = 145/253 (57%), Gaps = 23/253 (9%)
Query: 5 LQRLYRLCKASFSPDGPV-SEEAIAKVQEKLEKIKPIDVGLEQEAQVV-RNWSASAL-EC 61
+Q LY LCK +F+ P + AI K+ L+ + P DVGLE+ +Q R + S +
Sbjct: 35 VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94
Query: 62 NGNHQPPAPIKYLHLHECDSFSIGIFCMPPSSVIPLHNHPGMTVLSKLLYGSVHVKSYDW 121
N + PI +L +HECD+F++ IFC P SSVIPLH+HP M V SK+LYGS+HVK+YDW
Sbjct: 95 NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154
Query: 122 VEFP-------GPTDPSEARAAKLVKDKEMTAPTRTTVLYPTLGGNIHCFRAITPCAIFD 174
VE P G AR AKLV DK +T + LYP GGN+HCF A+TPCA+ D
Sbjct: 155 VEPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLD 214
Query: 175 ILSPPYSSDNGRHCSYFQKSSRKDLP-ETLEM-NGVTVSD------VTWLEEFQPPDEFA 226
ILSPPY GR CSY+ D P T + NG+ D WL + PD+
Sbjct: 215 ILSPPYKESVGRSCSYYM-----DYPFSTFALENGMKKVDEGKEDEYAWLVQIDTPDDLH 269
Query: 227 IRRGLYRGPVIKT 239
+R G Y GP I+
Sbjct: 270 MRPGSYTGPTIRV 282
>AT5G39890.1 | Symbols: | Protein of unknown function (DUF1637) |
chr5:15974543-15976013 FORWARD LENGTH=276
Length = 276
Score = 182 bits (462), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 103/246 (41%), Positives = 148/246 (60%), Gaps = 29/246 (11%)
Query: 5 LQRLYRLCKASFSPDGPV----SEEAIAKVQEKLEKIKPIDVGLEQEAQVVRNWSASALE 60
+Q+L+ CK F+ DG S+E I ++ L++IKP DVG+ + R+
Sbjct: 47 VQKLFDTCKKVFA-DGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYFRSTVTG--- 102
Query: 61 CNGNHQPPAPIKYLHLHECDSFSIGIFCMPPSSVIPLHNHPGMTVLSKLLYGSVHVKSYD 120
+ P + YLH++ C FSI IFC+PPS VIPLHNHP MTV SKLL+G++H+KSYD
Sbjct: 103 -----RSPL-VTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYD 156
Query: 121 WV-EFPGPTDPSEARAAKLVKDKEMTAPTRTTVLYPTLGGNIHCFRAITPCAIFDILSPP 179
WV + P P+ S+ R AK+ D + TAP T++LYP GGN+HCF A T CA+ D++ PP
Sbjct: 157 WVPDSPQPS--SDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPP 214
Query: 180 YSSDNGRHCSYFQKSSRKDLP-ETLEMNGVTVSD-----VTWLEEF-QPPDEFAIRRGLY 232
YS GRHC+Y+ D P + ++GV V++ WL+E + P++ + +Y
Sbjct: 215 YSDPAGRHCTYYF-----DYPFSSFSVDGVVVAEEEKEGYAWLKEREEKPEDLTVTALMY 269
Query: 233 RGPVIK 238
GP IK
Sbjct: 270 SGPTIK 275