Miyakogusa Predicted Gene

Lj3g3v0339230.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v0339230.1 Non Characterized Hit- tr|I1M581|I1M581_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.32547
PE,80.17,0,seg,NULL; SANT  SWI3, ADA2, N-CoR and TFIIIB''
DNA-bin,SANT/Myb domain; SUBFAMILY NOT NAMED,NULL; FA,CUFF.40566.1
         (243 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G07565.1 | Symbols:  | Protein of unknown function (DUF3755) ...   277   5e-75
AT3G07565.3 | Symbols:  | Protein of unknown function (DUF3755) ...   272   1e-73
AT3G07565.4 | Symbols:  | Protein of unknown function (DUF3755) ...   265   2e-71
AT3G07565.2 | Symbols:  | Protein of unknown function (DUF3755) ...   157   5e-39
AT1G10820.1 | Symbols:  | Protein of unknown function (DUF3755) ...   143   1e-34
AT1G10820.2 | Symbols:  | Protein of unknown function (DUF3755) ...   143   1e-34
AT1G60670.2 | Symbols:  | Protein of unknown function (DUF3755) ...   137   5e-33
AT1G68160.1 | Symbols:  | Protein of unknown function (DUF3755) ...    96   2e-20
AT2G43470.1 | Symbols:  | Protein of unknown function (DUF3755) ...    93   2e-19
AT1G60670.1 | Symbols:  | Protein of unknown function (DUF3755) ...    91   7e-19
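
The rows of the summary table above can be read programmatically. The following is a minimal Python sketch, assuming each row ends with the bit score and the E-value as its last two whitespace-separated fields, as in this report. The function name `parse_hit_row` is hypothetical and is not part of BLAST.

```python
def parse_hit_row(line):
    """Split one summary row into (subject ID, bit score, E-value).

    Assumes the layout used in this report: the last two
    whitespace-separated fields are the bit score and the E-value,
    and the first field is the subject identifier.
    """
    description, bits, evalue = line.rsplit(None, 2)
    subject_id = description.split()[0]
    return subject_id, float(bits), float(evalue)


# Two rows copied from the table above.
rows = [
    "AT3G07565.1 | Symbols:  | Protein of unknown function (DUF3755) ...   277   5e-75",
    "AT1G60670.1 | Symbols:  | Protein of unknown function (DUF3755) ...    91   7e-19",
]
hits = [parse_hit_row(row) for row in rows]
for subject_id, bits, evalue in hits:
    print(subject_id, bits, evalue)
```

Splitting from the right avoids having to parse the free-text description, which itself contains spaces and vertical bars.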

>AT3G07565.1 | Symbols:  | Protein of unknown function (DUF3755) |
           chr3:2413823-2415872 FORWARD LENGTH=258
          Length = 258

 Score =  277 bits (708), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 134/217 (61%), Positives = 165/217 (76%), Gaps = 2/217 (0%)

Query: 24  GNSVPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQN 83
           G +   A+  +  I  ++HNPGIS DW+ EEQ++LED L K+ TE ++ RYAKIA  +++
Sbjct: 40  GGNTGAAADNSQTIGALRHNPGISTDWTLEEQSLLEDLLVKYATEPSVFRYAKIAMKMKD 99

Query: 84  KTVRDVALRVRWMNKKENSKRRKDDHNLSRKSKDKKERVSDSAAKPS-HFPARSNVPPYA 142
           KTVRDVALR RWM KKEN KRRK+DH+ SRKSKDKKE+ +DS+AK S H     N P YA
Sbjct: 100 KTVRDVALRCRWMTKKENGKRRKEDHS-SRKSKDKKEKATDSSAKSSSHLNVHPNGPSYA 158

Query: 143 PPMITMDNDDGISYTAIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFK 202
           PPM+ +D DDGISY AIGG + +LLEQNAQ  NQ+S N SAFQ+ EN+N+ C+ RDNI  
Sbjct: 159 PPMMPIDTDDGISYKAIGGVSGDLLEQNAQMFNQLSTNFSAFQLHENVNILCKARDNILA 218

Query: 203 IMNDLCDTPEVMKQMPPLPVKVNDELANSILPRTHHH 239
           I+NDL D PEVMKQMPPLPVK+N+ELANSILPR  H 
Sbjct: 219 ILNDLNDMPEVMKQMPPLPVKLNEELANSILPRPSHQ 255


>AT3G07565.3 | Symbols:  | Protein of unknown function (DUF3755) |
           chr3:2413823-2415872 FORWARD LENGTH=259
          Length = 259

 Score =  272 bits (696), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 134/218 (61%), Positives = 165/218 (75%), Gaps = 3/218 (1%)

Query: 24  GNSVPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQN 83
           G +   A+  +  I  ++HNPGIS DW+ EEQ++LED L K+ TE ++ RYAKIA  +++
Sbjct: 40  GGNTGAAADNSQTIGALRHNPGISTDWTLEEQSLLEDLLVKYATEPSVFRYAKIAMKMKD 99

Query: 84  KTVRDVALRVRWMNKKENSKRRKDDHNLSRKSKDKK-ERVSDSAAKPS-HFPARSNVPPY 141
           KTVRDVALR RWM KKEN KRRK+DH+ SRKSKDKK E+ +DS+AK S H     N P Y
Sbjct: 100 KTVRDVALRCRWMTKKENGKRRKEDHS-SRKSKDKKQEKATDSSAKSSSHLNVHPNGPSY 158

Query: 142 APPMITMDNDDGISYTAIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIF 201
           APPM+ +D DDGISY AIGG + +LLEQNAQ  NQ+S N SAFQ+ EN+N+ C+ RDNI 
Sbjct: 159 APPMMPIDTDDGISYKAIGGVSGDLLEQNAQMFNQLSTNFSAFQLHENVNILCKARDNIL 218

Query: 202 KIMNDLCDTPEVMKQMPPLPVKVNDELANSILPRTHHH 239
            I+NDL D PEVMKQMPPLPVK+N+ELANSILPR  H 
Sbjct: 219 AILNDLNDMPEVMKQMPPLPVKLNEELANSILPRPSHQ 256


>AT3G07565.4 | Symbols:  | Protein of unknown function (DUF3755) |
           chr3:2413823-2415872 FORWARD LENGTH=268
          Length = 268

 Score =  265 bits (677), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 134/227 (59%), Positives = 165/227 (72%), Gaps = 12/227 (5%)

Query: 24  GNSVPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQN 83
           G +   A+  +  I  ++HNPGIS DW+ EEQ++LED L K+ TE ++ RYAKIA  +++
Sbjct: 40  GGNTGAAADNSQTIGALRHNPGISTDWTLEEQSLLEDLLVKYATEPSVFRYAKIAMKMKD 99

Query: 84  KTVRDVALRVRWMNKKENSKRRKDDHNLSRKSKDKK-ERVSDSAAK-PSHFPARSNVPPY 141
           KTVRDVALR RWM KKEN KRRK+DH+ SRKSKDKK E+ +DS+AK  SH     N P Y
Sbjct: 100 KTVRDVALRCRWMTKKENGKRRKEDHS-SRKSKDKKQEKATDSSAKSSSHLNVHPNGPSY 158

Query: 142 APPMITMDNDDGISYTAIGGPTSELLEQNAQALNQISANLSAFQI---------QENINL 192
           APPM+ +D DDGISY AIGG + +LLEQNAQ  NQ+S N SAFQ+          EN+N+
Sbjct: 159 APPMMPIDTDDGISYKAIGGVSGDLLEQNAQMFNQLSTNFSAFQVNSTSTFHLLHENVNI 218

Query: 193 FCQTRDNIFKIMNDLCDTPEVMKQMPPLPVKVNDELANSILPRTHHH 239
            C+ RDNI  I+NDL D PEVMKQMPPLPVK+N+ELANSILPR  H 
Sbjct: 219 LCKARDNILAILNDLNDMPEVMKQMPPLPVKLNEELANSILPRPSHQ 265


>AT3G07565.2 | Symbols:  | Protein of unknown function (DUF3755) |
           chr3:2413823-2415240 FORWARD LENGTH=192
          Length = 192

 Score =  157 bits (398), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 78/137 (56%), Positives = 100/137 (72%), Gaps = 2/137 (1%)

Query: 24  GNSVPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQN 83
           G +   A+  +  I  ++HNPGIS DW+ EEQ++LED L K+ TE ++ RYAKIA  +++
Sbjct: 40  GGNTGAAADNSQTIGALRHNPGISTDWTLEEQSLLEDLLVKYATEPSVFRYAKIAMKMKD 99

Query: 84  KTVRDVALRVRWMNKKENSKRRKDDHNLSRKSKDKKERVSDSAAK-PSHFPARSNVPPYA 142
           KTVRDVALR RWM KKEN KRRK+DH+ SRKSKDKKE+ +DS+AK  SH     N P YA
Sbjct: 100 KTVRDVALRCRWMTKKENGKRRKEDHS-SRKSKDKKEKATDSSAKSSSHLNVHPNGPSYA 158

Query: 143 PPMITMDNDDGISYTAI 159
           PPM+ +D DDGISY  +
Sbjct: 159 PPMMPIDTDDGISYKGL 175


>AT1G10820.1 | Symbols:  | Protein of unknown function (DUF3755) |
           chr1:3601437-3604650 REVERSE LENGTH=232
          Length = 232

 Score =  143 bits (360), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 83/212 (39%), Positives = 127/212 (59%), Gaps = 10/212 (4%)

Query: 27  VPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTV 86
           +P   A       +K    + +DWS EEQ +LE+GL+K   E  I +Y KIA  L +KTV
Sbjct: 9   LPTVDASGSVAAGVKQEAALVMDWSVEEQYVLENGLAKLKDEPKISKYVKIAATLPDKTV 68

Query: 87  RDVALRVRWMNKKENSKRRKDDHNLSRKSKDKKERVSDSAAKPSHFPARSNVPPYAPPMI 146
           RDVALR RWM +K   +R+++D+N ++    +K  V D++ + +     SNVP      +
Sbjct: 69  RDVALRCRWMTRK---RRKREDNNAAKNISTRK--VVDTSPELNML---SNVPQQNALYV 120

Query: 147 T--MDNDDGISYTAIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFKIM 204
              M +     +  +     +LL+QNAQA +QIS NLSA ++Q+NI+LF Q R+NI  I+
Sbjct: 121 LNNMCHSTRTPFEGLSDAVMDLLQQNAQAFSQISYNLSACKLQDNISLFHQARNNISAIL 180

Query: 205 NDLCDTPEVMKQMPPLPVKVNDELANSILPRT 236
            D+ + P +M +MP LPV +ND+LA+++L  T
Sbjct: 181 TDMKEMPGIMSRMPALPVSINDDLASNLLSST 212


>AT1G10820.2 | Symbols:  | Protein of unknown function (DUF3755) |
           chr1:3601437-3604650 REVERSE LENGTH=258
          Length = 258

 Score =  143 bits (360), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 124/199 (62%), Gaps = 10/199 (5%)

Query: 40  MKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTVRDVALRVRWMNKK 99
           +K    + +DWS EEQ +LE+GL+K   E  I +Y KIA  L +KTVRDVALR RWM +K
Sbjct: 48  VKQEAALVMDWSVEEQYVLENGLAKLKDEPKISKYVKIAATLPDKTVRDVALRCRWMTRK 107

Query: 100 ENSKRRKDDHNLSRKSKDKKERVSDSAAKPSHFPARSNVPPYAPPMIT--MDNDDGISYT 157
              +R+++D+N ++    +K  V D++ + +     SNVP      +   M +     + 
Sbjct: 108 ---RRKREDNNAAKNISTRK--VVDTSPELNML---SNVPQQNALYVLNNMCHSTRTPFE 159

Query: 158 AIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFKIMNDLCDTPEVMKQM 217
            +     +LL+QNAQA +QIS NLSA ++Q+NI+LF Q R+NI  I+ D+ + P +M +M
Sbjct: 160 GLSDAVMDLLQQNAQAFSQISYNLSACKLQDNISLFHQARNNISAILTDMKEMPGIMSRM 219

Query: 218 PPLPVKVNDELANSILPRT 236
           P LPV +ND+LA+++L  T
Sbjct: 220 PALPVSINDDLASNLLSST 238


>AT1G60670.2 | Symbols:  | Protein of unknown function (DUF3755) |
           chr1:22344099-22347140 FORWARD LENGTH=254
          Length = 254

 Score =  137 bits (346), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 123/208 (59%), Gaps = 13/208 (6%)

Query: 36  NIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTVRDVALRVRW 95
           ++  +KH   +++DWS EEQ ILE GLSK   E  + +Y KIA  L +K+VRDVA+R +W
Sbjct: 40  SVTGLKHEASLAVDWSVEEQYILEKGLSKFKDEPQVTKYVKIAATLPDKSVRDVAMRCKW 99

Query: 96  MNKKENSKRRKDDHNLSRKSKDKKERVSDSAAKPSHFPARSNV-PPYAPPMITMDNDDGI 154
           M +K   +R+ ++H+   K   +K  V D   K + F         YA  M  M     +
Sbjct: 100 MTQK---RRKGEEHSTGTKVSYRK--VVDLPPKLNMFSTEPQQNATYA--MNHMCQSARM 152

Query: 155 SYTAIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFKIMNDLCDTPEVM 214
            +  +     E L QNAQA +QIS+NLS  + Q+N++LF   R+NI  I+ND+ + P ++
Sbjct: 153 PFEGLSDAVMERLRQNAQAFSQISSNLSVCKPQDNVSLFYMARNNISAILNDMKEMPGII 212

Query: 215 KQMPPLPVKVNDELANSIL-----PRTH 237
            +MPPLPV +N++LA+S++     PR++
Sbjct: 213 SRMPPLPVSINNDLASSLVTSATQPRSY 240


>AT1G68160.1 | Symbols:  | Protein of unknown function (DUF3755) |
           chr1:25546168-25548625 REVERSE LENGTH=273
          Length = 273

 Score = 95.9 bits (237), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/181 (34%), Positives = 104/181 (57%), Gaps = 2/181 (1%)

Query: 49  DWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTVRDVALRVRWMNKKENSKRRKDD 108
           +WS EEQ IL+ GL K+    +I  Y +I   L +K++RD+ALR RW+ +K       + 
Sbjct: 79  EWSNEEQYILDAGLEKYKDMPSIDMYIQIGNTLPDKSIRDIALRCRWLRRKRRKSEELNC 138

Query: 109 HNLSRKSKDKKERVSDSAAKPSHFPARSNVPPYAPPMITMDNDDGISYTAIGGPTSELLE 168
              +  SK K+   S  ++ PS  P      P++ P  +      I+   +    + L+E
Sbjct: 139 GRRASSSKGKQVESSSKSSIPSVLPHNMASYPFSGP--STSTSKQITSEDLSSYATNLIE 196

Query: 169 QNAQALNQISANLSAFQIQENINLFCQTRDNIFKIMNDLCDTPEVMKQMPPLPVKVNDEL 228
           QN +A +QI ANLS+++  +N++LF Q R+N+  I N++ + P +M +MPPLPV +ND+L
Sbjct: 197 QNVRAFSQIRANLSSYKAGDNLDLFRQARNNLITIQNEINNMPGLMNKMPPLPVTINDDL 256

Query: 229 A 229
           +
Sbjct: 257 S 257


>AT2G43470.1 | Symbols:  | Protein of unknown function (DUF3755) |
           chr2:18049944-18051218 REVERSE LENGTH=210
          Length = 210

 Score = 92.8 bits (229), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 67/189 (35%), Positives = 102/189 (53%), Gaps = 23/189 (12%)

Query: 45  GISLDWSPEEQAILEDGLSKHVTESN--IVRYAKIAQLLQNKTVRDVALRVRWMNKKENS 102
           GI+L+W+  E  IL   L  + ++S   + RY +I + LQ+KT+RDVA R RW+  K+ +
Sbjct: 25  GIALNWTTAEDDILIQLLDSYSSDSRSAVTRYLQILEFLQDKTIRDVAARSRWIYNKKIA 84

Query: 103 KRRKDDHN-LSRKSKDKKERVSDSAAKPSHFPARSNVPPYAPPMITMDNDDGISYTAIGG 161
           K++K+DHN L     D +E V+   A   + P++   P                  +  G
Sbjct: 85  KKKKEDHNGLGTTRVDNEEIVNMVLASQVYQPSQVFQP------------------SQHG 126

Query: 162 PTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFKIMNDLCD-TPEVMKQMP-P 219
             +ELL  N Q  NQI ANL+   + +N++LF + R+NI  ++ DL +   E  K MP  
Sbjct: 127 VHNELLNHNKQWFNQIYANLTFLNLTDNLDLFRKIRENIKSLLKDLNENVSETWKNMPSS 186

Query: 220 LPVKVNDEL 228
           LP K+NDEL
Sbjct: 187 LPEKLNDEL 195


>AT1G60670.1 | Symbols:  | Protein of unknown function (DUF3755) |
           chr1:22344099-22346773 FORWARD LENGTH=201
          Length = 201

 Score = 90.9 bits (224), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 56/149 (37%), Positives = 81/149 (54%), Gaps = 8/149 (5%)

Query: 35  VNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTVRDVALRVR 94
            ++  +KH   +++DWS EEQ ILE GLSK   E  + +Y KIA  L +K+VRDVA+R +
Sbjct: 39  TSVTGLKHEASLAVDWSVEEQYILEKGLSKFKDEPQVTKYVKIAATLPDKSVRDVAMRCK 98

Query: 95  WMNKKENSKRRKDDHNLSRKSKDKKERVSDSAAKPSHFPAR-SNVPPYAPPMITMDNDDG 153
           WM +K   +R+ ++H+   K   +K  V D   K + F         YA  M  M     
Sbjct: 99  WMTQK---RRKGEEHSTGTKVSYRK--VVDLPPKLNMFSTEPQQNATYA--MNHMCQSAR 151

Query: 154 ISYTAIGGPTSELLEQNAQALNQISANLS 182
           + +  +     E L QNAQA +QIS+NLS
Sbjct: 152 MPFEGLSDAVMERLRQNAQAFSQISSNLS 180
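
The percentages on the Identities, Positives and Gaps lines of this report are consistent with integer truncation of count / aligned length rather than rounding (for example, 165/218 is 75.7% but is shown as 75%). A minimal Python sketch of that arithmetic follows. The function name `blast_percent` is hypothetical, and the truncation rule is inferred from the numbers in this report, not taken from BLAST documentation.

```python
def blast_percent(count, aligned_length):
    """Percentage as shown in this report: truncated, not rounded.

    The truncation rule is an assumption inferred from the values
    printed in the alignments above.
    """
    return (100 * count) // aligned_length


# Values copied from the alignments above: (count, aligned length).
examples = [(134, 217), (165, 218), (2, 217), (23, 189)]
percents = [blast_percent(count, length) for count, length in examples]
print(percents)
```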