Miyakogusa Predicted Gene
- Lj3g3v0339230.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0339230.1 Non Characterized Hit- tr|I1M581|I1M581_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.32547
PE,80.17,0,seg,NULL; SANT SWI3, ADA2, N-CoR and TFIIIB''
DNA-bin,SANT/Myb domain; SUBFAMILY NOT NAMED,NULL; FA,CUFF.40566.1
(243 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT3G07565.1 | Symbols: | Protein of unknown function (DUF3755) ...   277   5e-75
AT3G07565.3 | Symbols: | Protein of unknown function (DUF3755) ...   272   1e-73
AT3G07565.4 | Symbols: | Protein of unknown function (DUF3755) ...   265   2e-71
AT3G07565.2 | Symbols: | Protein of unknown function (DUF3755) ...   157   5e-39
AT1G10820.1 | Symbols: | Protein of unknown function (DUF3755) ...   143   1e-34
AT1G10820.2 | Symbols: | Protein of unknown function (DUF3755) ...   143   1e-34
AT1G60670.2 | Symbols: | Protein of unknown function (DUF3755) ...   137   5e-33
AT1G68160.1 | Symbols: | Protein of unknown function (DUF3755) ...    96   2e-20
AT2G43470.1 | Symbols: | Protein of unknown function (DUF3755) ...    93   2e-19
AT1G60670.1 | Symbols: | Protein of unknown function (DUF3755) ...    91   7e-19
>AT3G07565.1 | Symbols: | Protein of unknown function (DUF3755) |
chr3:2413823-2415872 FORWARD LENGTH=258
Length = 258
Score = 277 bits (708), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 134/217 (61%), Positives = 165/217 (76%), Gaps = 2/217 (0%)
Query: 24 GNSVPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQN 83
G + A+ + I ++HNPGIS DW+ EEQ++LED L K+ TE ++ RYAKIA +++
Sbjct: 40 GGNTGAAADNSQTIGALRHNPGISTDWTLEEQSLLEDLLVKYATEPSVFRYAKIAMKMKD 99
Query: 84 KTVRDVALRVRWMNKKENSKRRKDDHNLSRKSKDKKERVSDSAAKPS-HFPARSNVPPYA 142
KTVRDVALR RWM KKEN KRRK+DH+ SRKSKDKKE+ +DS+AK S H N P YA
Sbjct: 100 KTVRDVALRCRWMTKKENGKRRKEDHS-SRKSKDKKEKATDSSAKSSSHLNVHPNGPSYA 158
Query: 143 PPMITMDNDDGISYTAIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFK 202
PPM+ +D DDGISY AIGG + +LLEQNAQ NQ+S N SAFQ+ EN+N+ C+ RDNI
Sbjct: 159 PPMMPIDTDDGISYKAIGGVSGDLLEQNAQMFNQLSTNFSAFQLHENVNILCKARDNILA 218
Query: 203 IMNDLCDTPEVMKQMPPLPVKVNDELANSILPRTHHH 239
I+NDL D PEVMKQMPPLPVK+N+ELANSILPR H
Sbjct: 219 ILNDLNDMPEVMKQMPPLPVKLNEELANSILPRPSHQ 255
>AT3G07565.3 | Symbols: | Protein of unknown function (DUF3755) |
chr3:2413823-2415872 FORWARD LENGTH=259
Length = 259
Score = 272 bits (696), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 134/218 (61%), Positives = 165/218 (75%), Gaps = 3/218 (1%)
Query: 24 GNSVPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQN 83
G + A+ + I ++HNPGIS DW+ EEQ++LED L K+ TE ++ RYAKIA +++
Sbjct: 40 GGNTGAAADNSQTIGALRHNPGISTDWTLEEQSLLEDLLVKYATEPSVFRYAKIAMKMKD 99
Query: 84 KTVRDVALRVRWMNKKENSKRRKDDHNLSRKSKDKK-ERVSDSAAKPS-HFPARSNVPPY 141
KTVRDVALR RWM KKEN KRRK+DH+ SRKSKDKK E+ +DS+AK S H N P Y
Sbjct: 100 KTVRDVALRCRWMTKKENGKRRKEDHS-SRKSKDKKQEKATDSSAKSSSHLNVHPNGPSY 158
Query: 142 APPMITMDNDDGISYTAIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIF 201
APPM+ +D DDGISY AIGG + +LLEQNAQ NQ+S N SAFQ+ EN+N+ C+ RDNI
Sbjct: 159 APPMMPIDTDDGISYKAIGGVSGDLLEQNAQMFNQLSTNFSAFQLHENVNILCKARDNIL 218
Query: 202 KIMNDLCDTPEVMKQMPPLPVKVNDELANSILPRTHHH 239
I+NDL D PEVMKQMPPLPVK+N+ELANSILPR H
Sbjct: 219 AILNDLNDMPEVMKQMPPLPVKLNEELANSILPRPSHQ 256
>AT3G07565.4 | Symbols: | Protein of unknown function (DUF3755) |
chr3:2413823-2415872 FORWARD LENGTH=268
Length = 268
Score = 265 bits (677), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 134/227 (59%), Positives = 165/227 (72%), Gaps = 12/227 (5%)
Query: 24 GNSVPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQN 83
G + A+ + I ++HNPGIS DW+ EEQ++LED L K+ TE ++ RYAKIA +++
Sbjct: 40 GGNTGAAADNSQTIGALRHNPGISTDWTLEEQSLLEDLLVKYATEPSVFRYAKIAMKMKD 99
Query: 84 KTVRDVALRVRWMNKKENSKRRKDDHNLSRKSKDKK-ERVSDSAAK-PSHFPARSNVPPY 141
KTVRDVALR RWM KKEN KRRK+DH+ SRKSKDKK E+ +DS+AK SH N P Y
Sbjct: 100 KTVRDVALRCRWMTKKENGKRRKEDHS-SRKSKDKKQEKATDSSAKSSSHLNVHPNGPSY 158
Query: 142 APPMITMDNDDGISYTAIGGPTSELLEQNAQALNQISANLSAFQI---------QENINL 192
APPM+ +D DDGISY AIGG + +LLEQNAQ NQ+S N SAFQ+ EN+N+
Sbjct: 159 APPMMPIDTDDGISYKAIGGVSGDLLEQNAQMFNQLSTNFSAFQVNSTSTFHLLHENVNI 218
Query: 193 FCQTRDNIFKIMNDLCDTPEVMKQMPPLPVKVNDELANSILPRTHHH 239
C+ RDNI I+NDL D PEVMKQMPPLPVK+N+ELANSILPR H
Sbjct: 219 LCKARDNILAILNDLNDMPEVMKQMPPLPVKLNEELANSILPRPSHQ 265
>AT3G07565.2 | Symbols: | Protein of unknown function (DUF3755) |
chr3:2413823-2415240 FORWARD LENGTH=192
Length = 192
Score = 157 bits (398), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 78/137 (56%), Positives = 100/137 (72%), Gaps = 2/137 (1%)
Query: 24 GNSVPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQN 83
G + A+ + I ++HNPGIS DW+ EEQ++LED L K+ TE ++ RYAKIA +++
Sbjct: 40 GGNTGAAADNSQTIGALRHNPGISTDWTLEEQSLLEDLLVKYATEPSVFRYAKIAMKMKD 99
Query: 84 KTVRDVALRVRWMNKKENSKRRKDDHNLSRKSKDKKERVSDSAAK-PSHFPARSNVPPYA 142
KTVRDVALR RWM KKEN KRRK+DH+ SRKSKDKKE+ +DS+AK SH N P YA
Sbjct: 100 KTVRDVALRCRWMTKKENGKRRKEDHS-SRKSKDKKEKATDSSAKSSSHLNVHPNGPSYA 158
Query: 143 PPMITMDNDDGISYTAI 159
PPM+ +D DDGISY +
Sbjct: 159 PPMMPIDTDDGISYKGL 175
>AT1G10820.1 | Symbols: | Protein of unknown function (DUF3755) |
chr1:3601437-3604650 REVERSE LENGTH=232
Length = 232
Score = 143 bits (360), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/212 (39%), Positives = 127/212 (59%), Gaps = 10/212 (4%)
Query: 27 VPEASAVAVNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTV 86
+P A +K + +DWS EEQ +LE+GL+K E I +Y KIA L +KTV
Sbjct: 9 LPTVDASGSVAAGVKQEAALVMDWSVEEQYVLENGLAKLKDEPKISKYVKIAATLPDKTV 68
Query: 87 RDVALRVRWMNKKENSKRRKDDHNLSRKSKDKKERVSDSAAKPSHFPARSNVPPYAPPMI 146
RDVALR RWM +K +R+++D+N ++ +K V D++ + + SNVP +
Sbjct: 69 RDVALRCRWMTRK---RRKREDNNAAKNISTRK--VVDTSPELNML---SNVPQQNALYV 120
Query: 147 T--MDNDDGISYTAIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFKIM 204
M + + + +LL+QNAQA +QIS NLSA ++Q+NI+LF Q R+NI I+
Sbjct: 121 LNNMCHSTRTPFEGLSDAVMDLLQQNAQAFSQISYNLSACKLQDNISLFHQARNNISAIL 180
Query: 205 NDLCDTPEVMKQMPPLPVKVNDELANSILPRT 236
D+ + P +M +MP LPV +ND+LA+++L T
Sbjct: 181 TDMKEMPGIMSRMPALPVSINDDLASNLLSST 212
>AT1G10820.2 | Symbols: | Protein of unknown function (DUF3755) |
chr1:3601437-3604650 REVERSE LENGTH=258
Length = 258
Score = 143 bits (360), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 81/199 (40%), Positives = 124/199 (62%), Gaps = 10/199 (5%)
Query: 40 MKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTVRDVALRVRWMNKK 99
+K + +DWS EEQ +LE+GL+K E I +Y KIA L +KTVRDVALR RWM +K
Sbjct: 48 VKQEAALVMDWSVEEQYVLENGLAKLKDEPKISKYVKIAATLPDKTVRDVALRCRWMTRK 107
Query: 100 ENSKRRKDDHNLSRKSKDKKERVSDSAAKPSHFPARSNVPPYAPPMIT--MDNDDGISYT 157
+R+++D+N ++ +K V D++ + + SNVP + M + +
Sbjct: 108 ---RRKREDNNAAKNISTRK--VVDTSPELNML---SNVPQQNALYVLNNMCHSTRTPFE 159
Query: 158 AIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFKIMNDLCDTPEVMKQM 217
+ +LL+QNAQA +QIS NLSA ++Q+NI+LF Q R+NI I+ D+ + P +M +M
Sbjct: 160 GLSDAVMDLLQQNAQAFSQISYNLSACKLQDNISLFHQARNNISAILTDMKEMPGIMSRM 219
Query: 218 PPLPVKVNDELANSILPRT 236
P LPV +ND+LA+++L T
Sbjct: 220 PALPVSINDDLASNLLSST 238
>AT1G60670.2 | Symbols: | Protein of unknown function (DUF3755) |
chr1:22344099-22347140 FORWARD LENGTH=254
Length = 254
Score = 137 bits (346), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 123/208 (59%), Gaps = 13/208 (6%)
Query: 36 NIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTVRDVALRVRW 95
++ +KH +++DWS EEQ ILE GLSK E + +Y KIA L +K+VRDVA+R +W
Sbjct: 40 SVTGLKHEASLAVDWSVEEQYILEKGLSKFKDEPQVTKYVKIAATLPDKSVRDVAMRCKW 99
Query: 96 MNKKENSKRRKDDHNLSRKSKDKKERVSDSAAKPSHFPARSNV-PPYAPPMITMDNDDGI 154
M +K +R+ ++H+ K +K V D K + F YA M M +
Sbjct: 100 MTQK---RRKGEEHSTGTKVSYRK--VVDLPPKLNMFSTEPQQNATYA--MNHMCQSARM 152
Query: 155 SYTAIGGPTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFKIMNDLCDTPEVM 214
+ + E L QNAQA +QIS+NLS + Q+N++LF R+NI I+ND+ + P ++
Sbjct: 153 PFEGLSDAVMERLRQNAQAFSQISSNLSVCKPQDNVSLFYMARNNISAILNDMKEMPGII 212
Query: 215 KQMPPLPVKVNDELANSIL-----PRTH 237
+MPPLPV +N++LA+S++ PR++
Sbjct: 213 SRMPPLPVSINNDLASSLVTSATQPRSY 240
>AT1G68160.1 | Symbols: | Protein of unknown function (DUF3755) |
chr1:25546168-25548625 REVERSE LENGTH=273
Length = 273
Score = 95.9 bits (237), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 104/181 (57%), Gaps = 2/181 (1%)
Query: 49 DWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTVRDVALRVRWMNKKENSKRRKDD 108
+WS EEQ IL+ GL K+ +I Y +I L +K++RD+ALR RW+ +K +
Sbjct: 79 EWSNEEQYILDAGLEKYKDMPSIDMYIQIGNTLPDKSIRDIALRCRWLRRKRRKSEELNC 138
Query: 109 HNLSRKSKDKKERVSDSAAKPSHFPARSNVPPYAPPMITMDNDDGISYTAIGGPTSELLE 168
+ SK K+ S ++ PS P P++ P + I+ + + L+E
Sbjct: 139 GRRASSSKGKQVESSSKSSIPSVLPHNMASYPFSGP--STSTSKQITSEDLSSYATNLIE 196
Query: 169 QNAQALNQISANLSAFQIQENINLFCQTRDNIFKIMNDLCDTPEVMKQMPPLPVKVNDEL 228
QN +A +QI ANLS+++ +N++LF Q R+N+ I N++ + P +M +MPPLPV +ND+L
Sbjct: 197 QNVRAFSQIRANLSSYKAGDNLDLFRQARNNLITIQNEINNMPGLMNKMPPLPVTINDDL 256
Query: 229 A 229
+
Sbjct: 257 S 257
>AT2G43470.1 | Symbols: | Protein of unknown function (DUF3755) |
chr2:18049944-18051218 REVERSE LENGTH=210
Length = 210
Score = 92.8 bits (229), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/189 (35%), Positives = 102/189 (53%), Gaps = 23/189 (12%)
Query: 45 GISLDWSPEEQAILEDGLSKHVTESN--IVRYAKIAQLLQNKTVRDVALRVRWMNKKENS 102
GI+L+W+ E IL L + ++S + RY +I + LQ+KT+RDVA R RW+ K+ +
Sbjct: 25 GIALNWTTAEDDILIQLLDSYSSDSRSAVTRYLQILEFLQDKTIRDVAARSRWIYNKKIA 84
Query: 103 KRRKDDHN-LSRKSKDKKERVSDSAAKPSHFPARSNVPPYAPPMITMDNDDGISYTAIGG 161
K++K+DHN L D +E V+ A + P++ P + G
Sbjct: 85 KKKKEDHNGLGTTRVDNEEIVNMVLASQVYQPSQVFQP------------------SQHG 126
Query: 162 PTSELLEQNAQALNQISANLSAFQIQENINLFCQTRDNIFKIMNDLCD-TPEVMKQMP-P 219
+ELL N Q NQI ANL+ + +N++LF + R+NI ++ DL + E K MP
Sbjct: 127 VHNELLNHNKQWFNQIYANLTFLNLTDNLDLFRKIRENIKSLLKDLNENVSETWKNMPSS 186
Query: 220 LPVKVNDEL 228
LP K+NDEL
Sbjct: 187 LPEKLNDEL 195
>AT1G60670.1 | Symbols: | Protein of unknown function (DUF3755) |
chr1:22344099-22346773 FORWARD LENGTH=201
Length = 201
Score = 90.9 bits (224), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 81/149 (54%), Gaps = 8/149 (5%)
Query: 35 VNIPNMKHNPGISLDWSPEEQAILEDGLSKHVTESNIVRYAKIAQLLQNKTVRDVALRVR 94
++ +KH +++DWS EEQ ILE GLSK E + +Y KIA L +K+VRDVA+R +
Sbjct: 39 TSVTGLKHEASLAVDWSVEEQYILEKGLSKFKDEPQVTKYVKIAATLPDKSVRDVAMRCK 98
Query: 95 WMNKKENSKRRKDDHNLSRKSKDKKERVSDSAAKPSHFPAR-SNVPPYAPPMITMDNDDG 153
WM +K +R+ ++H+ K +K V D K + F YA M M
Sbjct: 99 WMTQK---RRKGEEHSTGTKVSYRK--VVDLPPKLNMFSTEPQQNATYA--MNHMCQSAR 151
Query: 154 ISYTAIGGPTSELLEQNAQALNQISANLS 182
+ + + E L QNAQA +QIS+NLS
Sbjct: 152 MPFEGLSDAVMERLRQNAQAFSQISSNLS 180
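The per-alignment statistics lines in this report ("Score = ...", "Identities = ...") follow a regular layout, so they can be pulled out with two regular expressions. The sketch below is a minimal illustration in Python, not part of BLAST itself. The names SAMPLE, parse_hsp_stats, floor_percent and percentages_match are hypothetical helpers introduced here. The percentages printed in this report agree with truncation (floor) rather than rounding, for example 165/218 is reported as 75%. That observation is drawn only from the numbers shown above and is treated as an assumption in the code.

```python
import re

# Two statistics blocks copied verbatim from the report above.
SAMPLE = """\
Score = 277 bits (708), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 134/217 (61%), Positives = 165/217 (76%), Gaps = 2/217 (0%)
Score = 272 bits (696), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 134/218 (61%), Positives = 165/218 (75%), Gaps = 3/218 (1%)
"""

# Patterns written against the line layout seen in this report only.
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+?),")
COUNT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_stats(text):
    """Return one dict per 'Score =' line, extended with the counts that follow it."""
    records = []
    for line in text.splitlines():
        m = SCORE_RE.search(line)
        if m:
            records.append({
                "bits": float(m.group(1)),
                "raw_score": int(m.group(2)),
                "evalue": float(m.group(3)),
            })
            continue
        # Attach Identities / Positives / Gaps to the most recent score line.
        for name, num, den, pct in COUNT_RE.findall(line):
            if records:
                records[-1][name.lower()] = (int(num), int(den), int(pct))
    return records


def floor_percent(num, den):
    """Percentage truncated to an integer (assumed to match this report's figures)."""
    return 100 * num // den


def percentages_match(record):
    """True if every reported percentage equals the truncated ratio."""
    return all(
        floor_percent(num, den) == reported
        for num, den, reported in (
            record[key] for key in ("identities", "positives", "gaps")
        )
    )


if __name__ == "__main__":
    for rec in parse_hsp_stats(SAMPLE):
        print(rec["bits"], rec["evalue"], percentages_match(rec))
```

The same functions can be applied to the full report text instead of SAMPLE. Hit identifiers (the lines starting with ">") are not captured here and would need an additional pattern.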