Miyakogusa Predicted Gene
- Lj0g3v0257439.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0257439.2 Non Characterized Hit- tr|C6TMW7|C6TMW7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.37638
PE,87.45,0,DUF1637,Cysteamine dioxygenase; no description,RmlC-like
jelly roll fold; RmlC-like cupins,RmlC-like,CUFF.16932.2
(239 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits)  Value
AT2G42670.1 | Symbols: | Protein of unknown function (DUF1637) ...   374   e-104
AT2G42670.2 | Symbols: | Protein of unknown function (DUF1637) ...   369   e-103
AT3G58670.3 | Symbols: | Protein of unknown function (DUF1637) ...   353   5e-98
AT3G58670.2 | Symbols: | Protein of unknown function (DUF1637) ...   353   5e-98
AT3G58670.1 | Symbols: | Protein of unknown function (DUF1637) ...   353   5e-98
AT5G15120.1 | Symbols: | Protein of unknown function (DUF1637) ...   203   7e-53
AT1G18490.1 | Symbols: | Protein of unknown function (DUF1637) ...   192   2e-49
AT5G39890.1 | Symbols: | Protein of unknown function (DUF1637) ...   184   4e-47
>AT2G42670.1 | Symbols: | Protein of unknown function (DUF1637) |
chr2:17772004-17773700 REVERSE LENGTH=241
Length = 241
Score = 374 bits (960), Expect = e-104, Method: Compositional matrix adjust.
Identities = 177/241 (73%), Positives = 209/241 (86%), Gaps = 2/241 (0%)
Query: 1 MPYYVQRLYRLCKASFSPDGPVSQEAIEKVCEKLEKIKPSDVGLEQEAQVVRNWSGPVLE 60
MPY+ QRLY CKASFS DGP++++A+EKV LEKIKPSDVG+EQ+AQ+ R+ SGP+ E
Sbjct: 1 MPYFAQRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60
Query: 61 RHGSHKSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSLYVKSYD 120
R+GS++S P IKYLHLHECDSFSIGIFCMPPSS+IPLHNHPGMTVLSKL+YGS++VKSYD
Sbjct: 61 RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WID--VPGSADPSQARPAKLVKDTEMTAPSPTTVLYPSSGGNIHSFQAITPCAIFDILSP 178
W++ + DPSQARPAKLVKDTEMTA SP T LYP SGGNIH F+AIT CAI DIL+P
Sbjct: 121 WLEPQLTEPEDPSQARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDILAP 180
Query: 179 PYSSEHERHCTYFRRSQKKDLPVNLQLDGVTASEVTWLEEFQPPDDFVIRRGLYRGPVIR 238
PYSSEH+RHCTYFR+S+++DLP L++DG ++VTWLEEFQPPDDFVIRR YRGPVIR
Sbjct: 181 PYSSEHDRHCTYFRKSRREDLPGELEVDGEVVTDVTWLEEFQPPDDFVIRRIPYRGPVIR 240
Query: 239 T 239
T
Sbjct: 241 T 241
>AT2G42670.2 | Symbols: | Protein of unknown function (DUF1637) |
chr2:17772004-17773700 REVERSE LENGTH=242
Length = 242
Score = 369 bits (948), Expect = e-103, Method: Compositional matrix adjust.
Identities = 177/242 (73%), Positives = 209/242 (86%), Gaps = 3/242 (1%)
Query: 1 MPYYVQRLYRLCKASFSPDGPVSQEAIEKVCEKLEKIKPSDVGLEQEAQVVRNWSGPVLE 60
MPY+ QRLY CKASFS DGP++++A+EKV LEKIKPSDVG+EQ+AQ+ R+ SGP+ E
Sbjct: 1 MPYFAQRLYNTCKASFSSDGPITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60
Query: 61 RHGSHKSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSLYVKSYD 120
R+GS++S P IKYLHLHECDSFSIGIFCMPPSS+IPLHNHPGMTVLSKL+YGS++VKSYD
Sbjct: 61 RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WID--VPGSADPSQ-ARPAKLVKDTEMTAPSPTTVLYPSSGGNIHSFQAITPCAIFDILS 177
W++ + DPSQ ARPAKLVKDTEMTA SP T LYP SGGNIH F+AIT CAI DIL+
Sbjct: 121 WLEPQLTEPEDPSQEARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDILA 180
Query: 178 PPYSSEHERHCTYFRRSQKKDLPVNLQLDGVTASEVTWLEEFQPPDDFVIRRGLYRGPVI 237
PPYSSEH+RHCTYFR+S+++DLP L++DG ++VTWLEEFQPPDDFVIRR YRGPVI
Sbjct: 181 PPYSSEHDRHCTYFRKSRREDLPGELEVDGEVVTDVTWLEEFQPPDDFVIRRIPYRGPVI 240
Query: 238 RT 239
RT
Sbjct: 241 RT 242
>AT3G58670.3 | Symbols: | Protein of unknown function (DUF1637) |
chr3:21703693-21705314 REVERSE LENGTH=242
Length = 242
Score = 353 bits (907), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 172/241 (71%), Positives = 199/241 (82%), Gaps = 3/241 (1%)
Query: 1 MPYYVQRLYRLCKASFSPDGPVSQEAIEKVCEKLEKIKPSDVGLEQEAQVVRNWSGPVLE 60
MPY++QRL+ CK+S SP+GPVS+EA++KV LEKIKPSDVGLEQEAQ+VRNW GP E
Sbjct: 1 MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNE 60
Query: 61 RHGSHKSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSLYVKSYD 120
R+G+H SLP IKYL LHECDSFSIGIFCMPP SIIPLHNHPGMTVLSKL+YGS++VKSYD
Sbjct: 61 RNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WIDVPGSA--DPSQARPAKLVKDTEMTAPSPTTVLYPSSGGNIHSFQAITPCAIFDILSP 178
W + S DP QARPAKLVKD +MT+PSP T LYP++GGNIH F+AIT CAIFDILSP
Sbjct: 121 WAEPDQSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILSP 180
Query: 179 PYSSEHERHCTYFRRSQKKDLPVNLQ-LDGVTASEVTWLEEFQPPDDFVIRRGLYRGPVI 237
PYSS H RHC YFR+S DLP ++ ++G S VTWLEE+QPPD+FVI R YRGPVI
Sbjct: 181 PYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGPVI 240
Query: 238 R 238
R
Sbjct: 241 R 241
>AT3G58670.2 | Symbols: | Protein of unknown function (DUF1637) |
chr3:21703693-21705314 REVERSE LENGTH=242
Length = 242
Score = 353 bits (907), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 172/241 (71%), Positives = 199/241 (82%), Gaps = 3/241 (1%)
Query: 1 MPYYVQRLYRLCKASFSPDGPVSQEAIEKVCEKLEKIKPSDVGLEQEAQVVRNWSGPVLE 60
MPY++QRL+ CK+S SP+GPVS+EA++KV LEKIKPSDVGLEQEAQ+VRNW GP E
Sbjct: 1 MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNE 60
Query: 61 RHGSHKSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSLYVKSYD 120
R+G+H SLP IKYL LHECDSFSIGIFCMPP SIIPLHNHPGMTVLSKL+YGS++VKSYD
Sbjct: 61 RNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WIDVPGSA--DPSQARPAKLVKDTEMTAPSPTTVLYPSSGGNIHSFQAITPCAIFDILSP 178
W + S DP QARPAKLVKD +MT+PSP T LYP++GGNIH F+AIT CAIFDILSP
Sbjct: 121 WAEPDQSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILSP 180
Query: 179 PYSSEHERHCTYFRRSQKKDLPVNLQ-LDGVTASEVTWLEEFQPPDDFVIRRGLYRGPVI 237
PYSS H RHC YFR+S DLP ++ ++G S VTWLEE+QPPD+FVI R YRGPVI
Sbjct: 181 PYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGPVI 240
Query: 238 R 238
R
Sbjct: 241 R 241
>AT3G58670.1 | Symbols: | Protein of unknown function (DUF1637) |
chr3:21703693-21705314 REVERSE LENGTH=242
Length = 242
Score = 353 bits (907), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 172/241 (71%), Positives = 199/241 (82%), Gaps = 3/241 (1%)
Query: 1 MPYYVQRLYRLCKASFSPDGPVSQEAIEKVCEKLEKIKPSDVGLEQEAQVVRNWSGPVLE 60
MPY++QRL+ CK+S SP+GPVS+EA++KV LEKIKPSDVGLEQEAQ+VRNW GP E
Sbjct: 1 MPYFIQRLFNTCKSSLSPNGPVSEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNE 60
Query: 61 RHGSHKSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSLYVKSYD 120
R+G+H SLP IKYL LHECDSFSIGIFCMPP SIIPLHNHPGMTVLSKL+YGS++VKSYD
Sbjct: 61 RNGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYD 120
Query: 121 WIDVPGSA--DPSQARPAKLVKDTEMTAPSPTTVLYPSSGGNIHSFQAITPCAIFDILSP 178
W + S DP QARPAKLVKD +MT+PSP T LYP++GGNIH F+AIT CAIFDILSP
Sbjct: 121 WAEPDQSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILSP 180
Query: 179 PYSSEHERHCTYFRRSQKKDLPVNLQ-LDGVTASEVTWLEEFQPPDDFVIRRGLYRGPVI 237
PYSS H RHC YFR+S DLP ++ ++G S VTWLEE+QPPD+FVI R YRGPVI
Sbjct: 181 PYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGPVI 240
Query: 238 R 238
R
Sbjct: 241 R 241
>AT5G15120.1 | Symbols: | Protein of unknown function (DUF1637) |
chr5:4898814-4900351 FORWARD LENGTH=293
Length = 293
Score = 203 bits (517), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 113/244 (46%), Positives = 146/244 (59%), Gaps = 19/244 (7%)
Query: 5 VQRLYRLCKASFSPDGPV---SQEAIEKVCEKLEKIKPSDVGLEQEAQVVRNWSGPVLER 61
V+RL+ CK FS GP S++ I+++ E L+ +KP DVGL R SG
Sbjct: 58 VRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRPNSGV---- 113
Query: 62 HGSHKSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSLYVKSYDW 121
+S PPI YLHLH+CD FSIGIFC+PPS +IPLHNHPGMTV SKLL+G++++KSYDW
Sbjct: 114 --EARSSPPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDW 171
Query: 122 IDVPGSADPSQARPAKLVKDTEMTAPSPTTVLYPSSGGNIHSFQAITPCAIFDILSPPYS 181
+ V S+ R AKL D+ TAP ++LYP GGN+H F AIT CA+ D+L PPY
Sbjct: 172 V-VDAPMRDSKTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLGPPYC 230
Query: 182 SEHERHCTYFRRSQKKDLPVNLQLDGVTASE-----VTWLEEF--QPPDDFVIRRGLYRG 234
+ RHCTYF L + + D V +SE WL+E P D + LYRG
Sbjct: 231 NPEGRHCTYFLEFPLDKL--SSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALYRG 288
Query: 235 PVIR 238
P +
Sbjct: 289 PKVE 292
>AT1G18490.1 | Symbols: | Protein of unknown function (DUF1637) |
chr1:6367116-6368640 FORWARD LENGTH=282
Length = 282
Score = 192 bits (487), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 109/256 (42%), Positives = 146/256 (57%), Gaps = 29/256 (11%)
Query: 5 VQRLYRLCKASFSPDGPV-SQEAIEKVCEKLEKIKPSDVGLEQEAQVV-RNWSGPVLERH 62
VQ LY LCK +F+ P + AI+K+C L+ + P+DVGLE+ +Q R + + R
Sbjct: 35 VQELYDLCKETFTGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDDDRGYGVSGVSRF 94
Query: 63 GS-HKSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSLYVKSYDW 121
+ PI +L +HECD+F++ IFC P SS+IPLH+HP M V SK+LYGSL+VK+YDW
Sbjct: 95 NRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDW 154
Query: 122 ID----------VPGSADPSQARPAKLVKDTEMTAPSPTTVLYPSSGGNIHSFQAITPCA 171
++ VPGS AR AKLV D +T S LYP +GGN+H F A+TPCA
Sbjct: 155 VEPPCIITQDKGVPGSL---PARLAKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCA 211
Query: 172 IFDILSPPYSSEHERHCTYFRRSQKKDLPVNL--------QLDGVTASEVTWLEEFQPPD 223
+ DILSPPY R C+Y+ D P + ++D E WL + PD
Sbjct: 212 VLDILSPPYKESVGRSCSYYM-----DYPFSTFALENGMKKVDEGKEDEYAWLVQIDTPD 266
Query: 224 DFVIRRGLYRGPVIRT 239
D +R G Y GP IR
Sbjct: 267 DLHMRPGSYTGPTIRV 282
>AT5G39890.1 | Symbols: | Protein of unknown function (DUF1637) |
chr5:15974543-15976013 FORWARD LENGTH=276
Length = 276
Score = 184 bits (468), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 107/246 (43%), Positives = 146/246 (59%), Gaps = 29/246 (11%)
Query: 5 VQRLYRLCKASFSPDGPV----SQEAIEKVCEKLEKIKPSDVGLEQEAQVVRNWSGPVLE 60
VQ+L+ CK F+ DG SQE IE + L++IKP DVG+ + R+ V
Sbjct: 47 VQKLFDTCKKVFA-DGKSGTVPSQENIEMLRAVLDEIKPEDVGVNPKMSYFRS---TVTG 102
Query: 61 RHGSHKSLPPIKYLHLHECDSFSIGIFCMPPSSIIPLHNHPGMTVLSKLLYGSLYVKSYD 120
R P + YLH++ C FSI IFC+PPS +IPLHNHP MTV SKLL+G++++KSYD
Sbjct: 103 RS------PLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYD 156
Query: 121 WIDVPGSADP-SQARPAKLVKDTEMTAPSPTTVLYPSSGGNIHSFQAITPCAIFDILSPP 179
W VP S P S R AK+ D++ TAP T++LYP+ GGN+H F A T CA+ D++ PP
Sbjct: 157 W--VPDSPQPSSDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPP 214
Query: 180 YSSEHERHCTYFRRSQKKDLPV-NLQLDGVTASE-----VTWLEEF-QPPDDFVIRRGLY 232
YS RHCTY+ D P + +DGV +E WL+E + P+D + +Y
Sbjct: 215 YSDPAGRHCTYYF-----DYPFSSFSVDGVVVAEEEKEGYAWLKEREEKPEDLTVTALMY 269
Query: 233 RGPVIR 238
GP I+
Sbjct: 270 SGPTIK 275