Miyakogusa Predicted Gene
- Lj1g3v0250220.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0250220.1 Non Chatacterized Hit- tr|I1MAW3|I1MAW3_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,79.31,0.002,UNCHARACTERIZED,NULL; no description,RmlC-like jelly
roll fold; DUF1637,Cysteamine dioxygenase; RmlC,gene.g29107.t1.1
(221 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G18490.1 | Symbols: | Protein of unknown function (DUF1637) ... 154 6e-38
AT2G42670.1 | Symbols: | Protein of unknown function (DUF1637) ... 149 2e-36
AT2G42670.2 | Symbols: | Protein of unknown function (DUF1637) ... 148 3e-36
AT3G58670.3 | Symbols: | Protein of unknown function (DUF1637) ... 147 6e-36
AT3G58670.2 | Symbols: | Protein of unknown function (DUF1637) ... 147 6e-36
AT3G58670.1 | Symbols: | Protein of unknown function (DUF1637) ... 147 6e-36
AT5G15120.1 | Symbols: | Protein of unknown function (DUF1637) ... 146 1e-35
AT5G39890.1 | Symbols: | Protein of unknown function (DUF1637) ... 145 2e-35
>AT1G18490.1 | Symbols: | Protein of unknown function (DUF1637) |
chr1:6367116-6368640 FORWARD LENGTH=282
Length = 282
Score = 154 bits (388), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 91/226 (40%), Positives = 118/226 (52%), Gaps = 34/226 (15%)
Query: 9 ELNEFADNFEAIDVGIDEFGFCESPTSDSSRGLLCEHGFSE-------ITYIHIHECDDF 61
+L D+ DVG++E S D G+ F+ IT++ IHECD F
Sbjct: 60 KLCSVLDSVSPADVGLEEV----SQDDDRGYGVSGVSRFNRVGRWAQPITFLDIHECDTF 115
Query: 62 SIGVFCLPAGKVFPLHDHPEMTVLSKILYGSVYVKAYDWI---------KLDSTKCPTRL 112
++ +FC P V PLHDHPEM V SKILYGS++VKAYDW+ K P RL
Sbjct: 116 TMCIFCFPTSSVIPLHDHPEMAVFSKILYGSLHVKAYDWVEPPCIITQDKGVPGSLPARL 175
Query: 113 AGKVIDQVMKAPHEPTVLFPRSGGNIHSFRALTPCAILDVLSPPYSEEFARPSTYYSDVP 172
A V D+V+ E L+P++GGN+H F ALTPCA+LD+LSPPY E R +YY D P
Sbjct: 176 AKLVSDKVITPQSEIPALYPKTGGNLHCFTALTPCAVLDILSPPYKESVGRSCSYYMDYP 235
Query: 173 IPSL---NG-----------YAILEEKPLPSDLVVHGAQYLGPSIE 204
+ NG YA L + P DL + Y GP+I
Sbjct: 236 FSTFALENGMKKVDEGKEDEYAWLVQIDTPDDLHMRPGSYTGPTIR 281
>AT2G42670.1 | Symbols: | Protein of unknown function (DUF1637) |
chr2:17772004-17773700 REVERSE LENGTH=241
Length = 241
Score = 149 bits (376), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 119/217 (54%), Gaps = 24/217 (11%)
Query: 9 ELNEFADNFEAIDVGIDEFGFCESPTSDSSRGLLCEHGFSE-----ITYIHIHECDDFSI 63
++ + + DVGI++ ++ + S G L E S I Y+H+HECD FSI
Sbjct: 29 KVRNVLEKIKPSDVGIEQ----DAQLARSRSGPLNERNGSNQSPPAIKYLHLHECDSFSI 84
Query: 64 GVFCLPAGKVFPLHDHPEMTVLSKILYGSVYVKAYDWIKLDSTK----CPTRLAGKVIDQ 119
G+FC+P + PLH+HP MTVLSK++YGS++VK+YDW++ T+ R A V D
Sbjct: 85 GIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYDWLEPQLTEPEDPSQARPAKLVKDT 144
Query: 120 VMKAPHEPTVLFPRSGGNIHSFRALTPCAILDVLSPPYSEEFARPSTYYSDVPIPSLNG- 178
M A T L+P+SGGNIH F+A+T CAILD+L+PPYS E R TY+ L G
Sbjct: 145 EMTAQSPVTTLYPKSGGNIHCFKAITHCAILDILAPPYSSEHDRHCTYFRKSRREDLPGE 204
Query: 179 ----------YAILEEKPLPSDLVVHGAQYLGPSIET 205
LEE P D V+ Y GP I T
Sbjct: 205 LEVDGEVVTDVTWLEEFQPPDDFVIRRIPYRGPVIRT 241
>AT2G42670.2 | Symbols: | Protein of unknown function (DUF1637) |
chr2:17772004-17773700 REVERSE LENGTH=242
Length = 242
Score = 148 bits (374), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 89/218 (40%), Positives = 121/218 (55%), Gaps = 25/218 (11%)
Query: 9 ELNEFADNFEAIDVGIDEFGFCESPTSDSSRGLLCEHGFSE-----ITYIHIHECDDFSI 63
++ + + DVGI++ ++ + S G L E S I Y+H+HECD FSI
Sbjct: 29 KVRNVLEKIKPSDVGIEQ----DAQLARSRSGPLNERNGSNQSPPAIKYLHLHECDSFSI 84
Query: 64 GVFCLPAGKVFPLHDHPEMTVLSKILYGSVYVKAYDWI--KLDSTKCPT---RLAGKVID 118
G+FC+P + PLH+HP MTVLSK++YGS++VK+YDW+ +L + P+ R A V D
Sbjct: 85 GIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYDWLEPQLTEPEDPSQEARPAKLVKD 144
Query: 119 QVMKAPHEPTVLFPRSGGNIHSFRALTPCAILDVLSPPYSEEFARPSTYYSDVPIPSLNG 178
M A T L+P+SGGNIH F+A+T CAILD+L+PPYS E R TY+ L G
Sbjct: 145 TEMTAQSPVTTLYPKSGGNIHCFKAITHCAILDILAPPYSSEHDRHCTYFRKSRREDLPG 204
Query: 179 -----------YAILEEKPLPSDLVVHGAQYLGPSIET 205
LEE P D V+ Y GP I T
Sbjct: 205 ELEVDGEVVTDVTWLEEFQPPDDFVIRRIPYRGPVIRT 242
>AT3G58670.3 | Symbols: | Protein of unknown function (DUF1637) |
chr3:21703693-21705314 REVERSE LENGTH=242
Length = 242
Score = 147 bits (371), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 78/176 (44%), Positives = 102/176 (57%), Gaps = 16/176 (9%)
Query: 45 HGFSEITYIHIHECDDFSIGVFCLPAGKVFPLHDHPEMTVLSKILYGSVYVKAYDWIKLD 104
H I Y+ +HECD FSIG+FC+P G + PLH+HP MTVLSK++YGS++VK+YDW + D
Sbjct: 66 HSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYDWAEPD 125
Query: 105 STK----CPTRLAGKVIDQVMKAPHEPTVLFPRSGGNIHSFRALTPCAILDVLSPPYSEE 160
++ R A V D M +P T L+P +GGNIH F+A+T CAI D+LSPPYS
Sbjct: 126 QSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILSPPYSST 185
Query: 161 FARPSTYYSDVP-------IPSLNGYAI-----LEEKPLPSDLVVHGAQYLGPSIE 204
R Y+ P I +NG I LEE P + V+ Y GP I
Sbjct: 186 HGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGPVIR 241
>AT3G58670.2 | Symbols: | Protein of unknown function (DUF1637) |
chr3:21703693-21705314 REVERSE LENGTH=242
Length = 242
Score = 147 bits (371), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 78/176 (44%), Positives = 102/176 (57%), Gaps = 16/176 (9%)
Query: 45 HGFSEITYIHIHECDDFSIGVFCLPAGKVFPLHDHPEMTVLSKILYGSVYVKAYDWIKLD 104
H I Y+ +HECD FSIG+FC+P G + PLH+HP MTVLSK++YGS++VK+YDW + D
Sbjct: 66 HSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYDWAEPD 125
Query: 105 STK----CPTRLAGKVIDQVMKAPHEPTVLFPRSGGNIHSFRALTPCAILDVLSPPYSEE 160
++ R A V D M +P T L+P +GGNIH F+A+T CAI D+LSPPYS
Sbjct: 126 QSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILSPPYSST 185
Query: 161 FARPSTYYSDVP-------IPSLNGYAI-----LEEKPLPSDLVVHGAQYLGPSIE 204
R Y+ P I +NG I LEE P + V+ Y GP I
Sbjct: 186 HGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGPVIR 241
>AT3G58670.1 | Symbols: | Protein of unknown function (DUF1637) |
chr3:21703693-21705314 REVERSE LENGTH=242
Length = 242
Score = 147 bits (371), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 78/176 (44%), Positives = 102/176 (57%), Gaps = 16/176 (9%)
Query: 45 HGFSEITYIHIHECDDFSIGVFCLPAGKVFPLHDHPEMTVLSKILYGSVYVKAYDWIKLD 104
H I Y+ +HECD FSIG+FC+P G + PLH+HP MTVLSK++YGS++VK+YDW + D
Sbjct: 66 HSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYDWAEPD 125
Query: 105 STK----CPTRLAGKVIDQVMKAPHEPTVLFPRSGGNIHSFRALTPCAILDVLSPPYSEE 160
++ R A V D M +P T L+P +GGNIH F+A+T CAI D+LSPPYS
Sbjct: 126 QSELDDPLQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCAIFDILSPPYSST 185
Query: 161 FARPSTYYSDVP-------IPSLNGYAI-----LEEKPLPSDLVVHGAQYLGPSIE 204
R Y+ P I +NG I LEE P + V+ Y GP I
Sbjct: 186 HGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRVPYRGPVIR 241
>AT5G15120.1 | Symbols: | Protein of unknown function (DUF1637) |
chr5:4898814-4900351 FORWARD LENGTH=293
Length = 293
Score = 146 bits (368), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 87/216 (40%), Positives = 121/216 (56%), Gaps = 28/216 (12%)
Query: 9 ELNEFADNFEAIDVGIDEFGFCESPTSDSSRGLLCEHGFSE--ITYIHIHECDDFSIGVF 66
+L E D+ + DVG+ +PT R S ITY+H+H+CD FSIG+F
Sbjct: 85 QLREILDDMKPEDVGL-------TPTMPYFRPNSGVEARSSPPITYLHLHQCDQFSIGIF 137
Query: 67 CLPAGKVFPLHDHPEMTVLSKILYGSVYVKAYDWIKLDS--TKCPTRLAGKVIDQVMKAP 124
CLP V PLH+HP MTV SK+L+G++++K+YDW+ +D+ TRLA +D AP
Sbjct: 138 CLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWV-VDAPMRDSKTRLAKLKVDSTFTAP 196
Query: 125 HEPTVLFPRSGGNIHSFRALTPCAILDVLSPPYSEEFARPSTYYSDVPIPSLN------- 177
++L+P GGN+H F A+T CA+LDVL PPY R TY+ + P+ L+
Sbjct: 197 CNASILYPEDGGNMHRFTAITACAVLDVLGPPYCNPEGRHCTYFLEFPLDKLSSEDDDVL 256
Query: 178 -------GYAILEEK-PLPSDLV-VHGAQYLGPSIE 204
GYA L+E+ P D V GA Y GP +E
Sbjct: 257 SSEEEKEGYAWLQERDDNPEDHTNVVGALYRGPKVE 292
>AT5G39890.1 | Symbols: | Protein of unknown function (DUF1637) |
chr5:15974543-15976013 FORWARD LENGTH=276
Length = 276
Score = 145 bits (366), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/170 (45%), Positives = 105/170 (61%), Gaps = 16/170 (9%)
Query: 50 ITYIHIHECDDFSIGVFCLPAGKVFPLHDHPEMTVLSKILYGSVYVKAYDWIKLDSTK-- 107
+TY+HI+ C FSI +FCLP V PLH+HPEMTV SK+L+G++++K+YDW+ DS +
Sbjct: 107 VTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFGTMHIKSYDWVP-DSPQPS 165
Query: 108 CPTRLAGKVIDQVMKAPHEPTVLFPRSGGNIHSFRALTPCAILDVLSPPYSEEFARPSTY 167
TRLA +D AP + ++L+P GGN+H F A T CA+LDV+ PPYS+ R TY
Sbjct: 166 SDTRLAKVKVDSDFTAPCDTSILYPADGGNMHCFTAKTACAVLDVIGPPYSDPAGRHCTY 225
Query: 168 YSDVPIPSLN------------GYAILEEK-PLPSDLVVHGAQYLGPSIE 204
Y D P S + GYA L+E+ P DL V Y GP+I+
Sbjct: 226 YFDYPFSSFSVDGVVVAEEEKEGYAWLKEREEKPEDLTVTALMYSGPTIK 275