Miyakogusa Predicted Gene

Lj2g3v1034740.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v1034740.1 Non Characterized Hit- tr|I1LWJ1|I1LWJ1_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,83.63,0,RmlC-like
cupins,RmlC-like cupin domain; DUF1637,Cysteamine dioxygenase; no
description,RmlC-like je,CUFF.36075.1
         (281 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G39890.1 | Symbols:  | Protein of unknown function (DUF1637) ...   338   3e-93
AT5G15120.1 | Symbols:  | Protein of unknown function (DUF1637) ...   301   4e-82
AT1G18490.1 | Symbols:  | Protein of unknown function (DUF1637) ...   203   1e-52
AT2G42670.2 | Symbols:  | Protein of unknown function (DUF1637) ...   199   2e-51
AT2G42670.1 | Symbols:  | Protein of unknown function (DUF1637) ...   195   2e-50
AT3G58670.3 | Symbols:  | Protein of unknown function (DUF1637) ...   190   1e-48
AT3G58670.2 | Symbols:  | Protein of unknown function (DUF1637) ...   190   1e-48
AT3G58670.1 | Symbols:  | Protein of unknown function (DUF1637) ...   190   1e-48
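
The summary rows above can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself: it assumes each row ends with the bit score and the E-value as its last two whitespace-separated fields, as in the table above. The names SUMMARY and parse_hit_line are hypothetical, and only three of the eight rows are copied in for brevity.

```python
# Three rows copied verbatim from the summary table above.
SUMMARY = """\
AT5G39890.1 | Symbols:  | Protein of unknown function (DUF1637) ...   338   3e-93
AT5G15120.1 | Symbols:  | Protein of unknown function (DUF1637) ...   301   4e-82
AT1G18490.1 | Symbols:  | Protein of unknown function (DUF1637) ...   203   1e-52
"""


def parse_hit_line(line):
    """Split one summary row into (subject_id, bit_score, e_value)."""
    # Assumption: the last two whitespace-separated fields are score and E-value.
    description, score, evalue = line.rsplit(None, 2)
    # Assumption: guard for E-values written without a leading digit, e.g. "e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    subject_id = description.split()[0]
    return subject_id, float(score), float(evalue)


hits = [parse_hit_line(row) for row in SUMMARY.splitlines() if row.strip()]

# The best hit is the one with the smallest E-value.
best = min(hits, key=lambda hit: hit[2])
print(best[0], best[1], best[2])
```

Splitting from the right keeps the parser independent of how many "|" separators or spaces the description column contains.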

>AT5G39890.1 | Symbols:  | Protein of unknown function (DUF1637) |
           chr5:15974543-15976013 FORWARD LENGTH=276
          Length = 276

 Score =  338 bits (866), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 166/259 (64%), Positives = 196/259 (75%), Gaps = 14/259 (5%)

Query: 25  SSGSKRNRRRQKK--MPPVQKLFDTCKEVFASGGTGFIPPPQDIQRLQAVLDAIRPEDVG 82
           S+  K+ +RR KK  + PVQKLFDTCK+VFA G +G +P  ++I+ L+AVLD I+PEDVG
Sbjct: 29  SNSRKKIQRRSKKTLICPVQKLFDTCKKVFADGKSGTVPSQENIEMLRAVLDEIKPEDVG 88

Query: 83  LKPDMPHFRSSSAQRIPKITYLHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFG 142
           + P M +FRS+   R P +TYLHIY C +FS+ IFCLPPSGVIPLHNHP MTVFSKLLFG
Sbjct: 89  VNPKMSYFRSTVTGRSPLVTYLHIYACHRFSICIFCLPPSGVIPLHNHPEMTVFSKLLFG 148

Query: 143 TMHIKSYDWVVDLPPESPTIVKPTENQALEMRLAKVKVDADFTAPCNPSILYPEDGGNMH 202
           TMHIKSYDWV D P  S            + RLAKVKVD+DFTAPC+ SILYP DGGNMH
Sbjct: 149 TMHIKSYDWVPDSPQPSS-----------DTRLAKVKVDSDFTAPCDTSILYPADGGNMH 197

Query: 203 CFTAVTACAVLDVLGPPYSDYEGRHCTYYNNYPFSDISVEGISIPEEERNGYEWLQEKEQ 262
           CFTA TACAVLDV+GPPYSD  GRHCTYY +YPFS  SV+G+ + EEE+ GY WL+E+E+
Sbjct: 198 CFTAKTACAVLDVIGPPYSDPAGRHCTYYFDYPFSSFSVDGVVVAEEEKEGYAWLKEREE 257

Query: 263 L-EDLEVDGKMYSGPKIHE 280
             EDL V   MYSGP I E
Sbjct: 258 KPEDLTVTALMYSGPTIKE 276


>AT5G15120.1 | Symbols:  | Protein of unknown function (DUF1637) |
           chr5:4898814-4900351 FORWARD LENGTH=293
          Length = 293

 Score =  301 bits (771), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 148/247 (59%), Positives = 184/247 (74%), Gaps = 18/247 (7%)

Query: 41  VQKLFDTCKEVFASGGTGFIPPPQDIQRLQAVLDAIRPEDVGLKPDMPHFRSSS---AQR 97
           V++LF+TCKEVF++GG G IP    IQ+L+ +LD ++PEDVGL P MP+FR +S   A+ 
Sbjct: 58  VRRLFNTCKEVFSNGGPGVIPSEDKIQQLREILDDMKPEDVGLTPTMPYFRPNSGVEARS 117

Query: 98  IPKITYLHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDLPP 157
            P ITYLH+++C++FS+GIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVD P 
Sbjct: 118 SPPITYLHLHQCDQFSIGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDWVVDAP- 176

Query: 158 ESPTIVKPTENQALEMRLAKVKVDADFTAPCNPSILYPEDGGNMHCFTAVTACAVLDVLG 217
                      +  + RLAK+KVD+ FTAPCN SILYPEDGGNMH FTA+TACAVLDVLG
Sbjct: 177 ----------MRDSKTRLAKLKVDSTFTAPCNASILYPEDGGNMHRFTAITACAVLDVLG 226

Query: 218 PPYSDYEGRHCTYYNNYPFSDISVEGISI--PEEERNGYEWLQEKEQ--LEDLEVDGKMY 273
           PPY + EGRHCTY+  +P   +S E   +   EEE+ GY WLQE++    +   V G +Y
Sbjct: 227 PPYCNPEGRHCTYFLEFPLDKLSSEDDDVLSSEEEKEGYAWLQERDDNPEDHTNVVGALY 286

Query: 274 SGPKIHE 280
            GPK+ +
Sbjct: 287 RGPKVED 293


>AT1G18490.1 | Symbols:  | Protein of unknown function (DUF1637) |
           chr1:6367116-6368640 FORWARD LENGTH=282
          Length = 282

 Score =  203 bits (517), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 114/263 (43%), Positives = 156/263 (59%), Gaps = 26/263 (9%)

Query: 33  RRQKKMPPVQKLFDTCKEVFASGGTGFIPPPQD--IQRLQAVLDAIRPEDVGL----KPD 86
           R Q+K P VQ+L+D CKE F    TG  P P    IQ+L +VLD++ P DVGL    + D
Sbjct: 27  RNQEKSPKVQELYDLCKETF----TGKAPSPASMAIQKLCSVLDSVSPADVGLEEVSQDD 82

Query: 87  MPHFRSSSAQRIPK-------ITYLHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKL 139
              +  S   R  +       IT+L I+EC+ F+M IFC P S VIPLH+HP M VFSK+
Sbjct: 83  DRGYGVSGVSRFNRVGRWAQPITFLDIHECDTFTMCIFCFPTSSVIPLHDHPEMAVFSKI 142

Query: 140 LFGTMHIKSYDWVVDLPPESPTIVKPTEN--QALEMRLAKVKVDADFTAPCNPSILYPED 197
           L+G++H+K+YDWV     E P I+   +    +L  RLAK+  D   T       LYP+ 
Sbjct: 143 LYGSLHVKAYDWV-----EPPCIITQDKGVPGSLPARLAKLVSDKVITPQSEIPALYPKT 197

Query: 198 GGNMHCFTAVTACAVLDVLGPPYSDYEGRHCTYYNNYPFSDISVEG--ISIPEEERNGYE 255
           GGN+HCFTA+T CAVLD+L PPY +  GR C+YY +YPFS  ++E     + E + + Y 
Sbjct: 198 GGNLHCFTALTPCAVLDILSPPYKESVGRSCSYYMDYPFSTFALENGMKKVDEGKEDEYA 257

Query: 256 WLQEKEQLEDLEVDGKMYSGPKI 278
           WL + +  +DL +    Y+GP I
Sbjct: 258 WLVQIDTPDDLHMRPGSYTGPTI 280


>AT2G42670.2 | Symbols:  | Protein of unknown function (DUF1637) |
           chr2:17772004-17773700 REVERSE LENGTH=242
          Length = 242

 Score =  199 bits (506), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 106/248 (42%), Positives = 151/248 (60%), Gaps = 22/248 (8%)

Query: 41  VQKLFDTCKEVFASGGTGFIPPPQD-IQRLQAVLDAIRPEDVGLKPDMPHFRSSSA---- 95
            Q+L++TCK  F+S G    P  +D +++++ VL+ I+P DVG++ D    RS S     
Sbjct: 5   AQRLYNTCKASFSSDG----PITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60

Query: 96  -----QRIPKITYLHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYD 150
                Q  P I YLH++EC+ FS+GIFC+PPS +IPLHNHPGMTV SKL++G+MH+KSYD
Sbjct: 61  RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120

Query: 151 WVVDLPPESPTIVKPTENQALEMRLAKVKVDADFTAPCNPSILYPEDGGNMHCFTAVTAC 210
           W+       P + +P E+ + E R AK+  D + TA    + LYP+ GGN+HCF A+T C
Sbjct: 121 WL------EPQLTEP-EDPSQEARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHC 173

Query: 211 AVLDVLGPPYSDYEGRHCTYYNNYPFSDISVEGISIPEEERNGYEWLQEKEQLEDLEVDG 270
           A+LD+L PPYS    RHCTY+      D+  E + +  E      WL+E +  +D  +  
Sbjct: 174 AILDILAPPYSSEHDRHCTYFRKSRREDLPGE-LEVDGEVVTDVTWLEEFQPPDDFVIRR 232

Query: 271 KMYSGPKI 278
             Y GP I
Sbjct: 233 IPYRGPVI 240


>AT2G42670.1 | Symbols:  | Protein of unknown function (DUF1637) |
           chr2:17772004-17773700 REVERSE LENGTH=241
          Length = 241

 Score =  195 bits (496), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 104/248 (41%), Positives = 149/248 (60%), Gaps = 23/248 (9%)

Query: 41  VQKLFDTCKEVFASGGTGFIPPPQD-IQRLQAVLDAIRPEDVGLKPDMPHFRSSSA---- 95
            Q+L++TCK  F+S G    P  +D +++++ VL+ I+P DVG++ D    RS S     
Sbjct: 5   AQRLYNTCKASFSSDG----PITEDALEKVRNVLEKIKPSDVGIEQDAQLARSRSGPLNE 60

Query: 96  -----QRIPKITYLHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYD 150
                Q  P I YLH++EC+ FS+GIFC+PPS +IPLHNHPGMTV SKL++G+MH+KSYD
Sbjct: 61  RNGSNQSPPAIKYLHLHECDSFSIGIFCMPPSSMIPLHNHPGMTVLSKLVYGSMHVKSYD 120

Query: 151 WVVDLPPESPTIVKPTENQALEMRLAKVKVDADFTAPCNPSILYPEDGGNMHCFTAVTAC 210
           W+       P + +P +    + R AK+  D + TA    + LYP+ GGN+HCF A+T C
Sbjct: 121 WL------EPQLTEPEDPS--QARPAKLVKDTEMTAQSPVTTLYPKSGGNIHCFKAITHC 172

Query: 211 AVLDVLGPPYSDYEGRHCTYYNNYPFSDISVEGISIPEEERNGYEWLQEKEQLEDLEVDG 270
           A+LD+L PPYS    RHCTY+      D+  E + +  E      WL+E +  +D  +  
Sbjct: 173 AILDILAPPYSSEHDRHCTYFRKSRREDLPGE-LEVDGEVVTDVTWLEEFQPPDDFVIRR 231

Query: 271 KMYSGPKI 278
             Y GP I
Sbjct: 232 IPYRGPVI 239


>AT3G58670.3 | Symbols:  | Protein of unknown function (DUF1637) |
           chr3:21703693-21705314 REVERSE LENGTH=242
          Length = 242

 Score =  190 bits (482), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 98/249 (39%), Positives = 144/249 (57%), Gaps = 20/249 (8%)

Query: 41  VQKLFDTCKEVFASGGTGFIPPPQDIQRLQAVLDAIRPEDVGLK---------PDMPHFR 91
           +Q+LF+TCK   +  G       + + +++ VL+ I+P DVGL+         P   + R
Sbjct: 5   IQRLFNTCKSSLSPNGPV---SEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNER 61

Query: 92  SSSAQRIPKITYLHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDW 151
           + +   +P I YL ++EC+ FS+GIFC+PP  +IPLHNHPGMTV SKL++G+MH+KSYDW
Sbjct: 62  NGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYDW 121

Query: 152 VVDLPPESPTIVKPTENQALEMRLAKVKVDADFTAPCNPSILYPEDGGNMHCFTAVTACA 211
                P+   +  P     L+ R AK+  D D T+P   + LYP  GGN+HCF A+T CA
Sbjct: 122 A---EPDQSELDDP-----LQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCA 173

Query: 212 VLDVLGPPYSDYEGRHCTYYNNYPFSDISVEGISIPEEERNGYEWLQEKEQLEDLEVDGK 271
           + D+L PPYS   GRHC Y+   P  D+  E   +  E  +   WL+E +  ++  +   
Sbjct: 174 IFDILSPPYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRV 233

Query: 272 MYSGPKIHE 280
            Y GP I +
Sbjct: 234 PYRGPVIRK 242


>AT3G58670.2 | Symbols:  | Protein of unknown function (DUF1637) |
           chr3:21703693-21705314 REVERSE LENGTH=242
          Length = 242

 Score =  190 bits (482), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 98/249 (39%), Positives = 144/249 (57%), Gaps = 20/249 (8%)

Query: 41  VQKLFDTCKEVFASGGTGFIPPPQDIQRLQAVLDAIRPEDVGLK---------PDMPHFR 91
           +Q+LF+TCK   +  G       + + +++ VL+ I+P DVGL+         P   + R
Sbjct: 5   IQRLFNTCKSSLSPNGPV---SEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNER 61

Query: 92  SSSAQRIPKITYLHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDW 151
           + +   +P I YL ++EC+ FS+GIFC+PP  +IPLHNHPGMTV SKL++G+MH+KSYDW
Sbjct: 62  NGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYDW 121

Query: 152 VVDLPPESPTIVKPTENQALEMRLAKVKVDADFTAPCNPSILYPEDGGNMHCFTAVTACA 211
                P+   +  P     L+ R AK+  D D T+P   + LYP  GGN+HCF A+T CA
Sbjct: 122 A---EPDQSELDDP-----LQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCA 173

Query: 212 VLDVLGPPYSDYEGRHCTYYNNYPFSDISVEGISIPEEERNGYEWLQEKEQLEDLEVDGK 271
           + D+L PPYS   GRHC Y+   P  D+  E   +  E  +   WL+E +  ++  +   
Sbjct: 174 IFDILSPPYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRV 233

Query: 272 MYSGPKIHE 280
            Y GP I +
Sbjct: 234 PYRGPVIRK 242


>AT3G58670.1 | Symbols:  | Protein of unknown function (DUF1637) |
           chr3:21703693-21705314 REVERSE LENGTH=242
          Length = 242

 Score =  190 bits (482), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 98/249 (39%), Positives = 144/249 (57%), Gaps = 20/249 (8%)

Query: 41  VQKLFDTCKEVFASGGTGFIPPPQDIQRLQAVLDAIRPEDVGLK---------PDMPHFR 91
           +Q+LF+TCK   +  G       + + +++ VL+ I+P DVGL+         P   + R
Sbjct: 5   IQRLFNTCKSSLSPNGPV---SEEALDKVRNVLEKIKPSDVGLEQEAQLVRNWPGPGNER 61

Query: 92  SSSAQRIPKITYLHIYECEKFSMGIFCLPPSGVIPLHNHPGMTVFSKLLFGTMHIKSYDW 151
           + +   +P I YL ++EC+ FS+GIFC+PP  +IPLHNHPGMTV SKL++G+MH+KSYDW
Sbjct: 62  NGNHHSLPAIKYLQLHECDSFSIGIFCMPPGSIIPLHNHPGMTVLSKLVYGSMHVKSYDW 121

Query: 152 VVDLPPESPTIVKPTENQALEMRLAKVKVDADFTAPCNPSILYPEDGGNMHCFTAVTACA 211
                P+   +  P     L+ R AK+  D D T+P   + LYP  GGN+HCF A+T CA
Sbjct: 122 A---EPDQSELDDP-----LQARPAKLVKDIDMTSPSPATTLYPTTGGNIHCFKAITHCA 173

Query: 212 VLDVLGPPYSDYEGRHCTYYNNYPFSDISVEGISIPEEERNGYEWLQEKEQLEDLEVDGK 271
           + D+L PPYS   GRHC Y+   P  D+  E   +  E  +   WL+E +  ++  +   
Sbjct: 174 IFDILSPPYSSTHGRHCNYFRKSPMLDLPGEIEVMNGEVISNVTWLEEYQPPDNFVIWRV 233

Query: 272 MYSGPKIHE 280
            Y GP I +
Sbjct: 234 PYRGPVIRK 242