Miyakogusa Predicted Gene
- Lj0g3v0196769.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0196769.1 Non Characterized Hit- tr|I1KE43|I1KE43_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.57417
PE,89.47,0,seg,NULL; ORGANIC SOLUTE TRANSPORTER-RELATED,Organic solute
transporter Ost-alpha; Solute_trans_a,Or,CUFF.12470.1
(245 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Sequences producing significant alignments: Score (bits) E value
AT4G21570.1 | Symbols: | Protein of unknown function (DUF300) |... 324 4e-89
AT1G11200.1 | Symbols: | Protein of unknown function (DUF300) |... 317 6e-87
AT1G77220.1 | Symbols: | Protein of unknown function (DUF300) |... 86 3e-17
AT5G26740.3 | Symbols: | Protein of unknown function (DUF300) |... 79 3e-15
AT5G26740.2 | Symbols: | Protein of unknown function (DUF300) |... 79 3e-15
AT5G26740.1 | Symbols: | Protein of unknown function (DUF300) |... 79 3e-15
AT3G05940.1 | Symbols: | Protein of unknown function (DUF300) |... 74 1e-13
AT1G23070.1 | Symbols: | Protein of unknown function (DUF300) |... 67 9e-12
AT4G38360.1 | Symbols: LAZ1 | Protein of unknown function (DUF30... 67 1e-11
AT4G38360.2 | Symbols: LAZ1 | Protein of unknown function (DUF30... 65 4e-11
>AT4G21570.1 | Symbols: | Protein of unknown function (DUF300) |
chr4:11471126-11472269 REVERSE LENGTH=294
Length = 294
Score = 324 bits (830), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 151/221 (68%), Positives = 179/221 (80%)
Query: 1 MNPAQIVLFGSTFCVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGL 60
+ P QI + S F V++ +HF+++L+S+H+ +WK PKEQKAI+II++MAP+YAV S++GL
Sbjct: 7 LKPPQITFYCSAFSVLLTLHFTIQLVSQHLFHWKNPKEQKAILIIVLMAPIYAVVSFIGL 66
Query: 61 INFFGSETFFTFLDSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM 120
+ GSETFF FL+SIKECYEALVIAKFLALMYSYLNIS+SKNI+PD IKGREIHHSFPM
Sbjct: 67 LEVKGSETFFLFLESIKECYEALVIAKFLALMYSYLNISMSKNILPDGIKGREIHHSFPM 126
Query: 121 TLFQPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVYPSWINWTITIIXXXX 180
TLFQPH RLD HTLKLLKYWTWQFVV+RP+CS LMI LQ + YPSW++WT TII
Sbjct: 127 TLFQPHVVRLDRHTLKLLKYWTWQFVVIRPVCSTLMIALQLIGFYPSWLSWTFTIIVNFS 186
Query: 181 XXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
FYHVFAKEL PH PL+KFLCIKGIVFF FWQ
Sbjct: 187 VSLALYSLVIFYHVFAKELAPHNPLAKFLCIKGIVFFVFWQ 227
>AT1G11200.1 | Symbols: | Protein of unknown function (DUF300) |
chr1:3753896-3755459 FORWARD LENGTH=295
Length = 295
Score = 317 bits (811), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 142/221 (64%), Positives = 181/221 (81%)
Query: 1 MNPAQIVLFGSTFCVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGL 60
++PA+I + GS FCV++++HF+M+L+S+H+ WKKP EQ+AI+II++MAP+YA++S+VGL
Sbjct: 7 LSPAEITVMGSVFCVLLSMHFTMQLVSQHLFYWKKPNEQRAILIIVLMAPVYAINSFVGL 66
Query: 61 INFFGSETFFTFLDSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM 120
++ GS+ FF FLD++KECYEALVIAKFLALMYSY+NIS+S I+PDE KGREIHHSFPM
Sbjct: 67 LDAKGSKPFFMFLDAVKECYEALVIAKFLALMYSYVNISMSARIIPDEFKGREIHHSFPM 126
Query: 121 TLFQPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVYPSWINWTITIIXXXX 180
TLF P TT LD+ TLK LK WTWQF ++RP+CSILMITLQ L +YP W++W T I
Sbjct: 127 TLFVPRTTHLDYLTLKQLKQWTWQFCIIRPVCSILMITLQILGIYPVWLSWIFTAILNVS 186
Query: 181 XXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
FYHVFAKELEPHKPL+KF+C+KGIVFFCFWQ
Sbjct: 187 VSLALYSLVKFYHVFAKELEPHKPLTKFMCVKGIVFFCFWQ 227
>AT1G77220.1 | Symbols: | Protein of unknown function (DUF300) |
chr1:29013232-29015530 FORWARD LENGTH=484
Length = 484
Score = 85.5 bits (210), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 69/235 (29%), Positives = 114/235 (48%), Gaps = 26/235 (11%)
Query: 7 VLFGSTFCVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGS 66
+L S F V+IA+ M LI EH+ ++ +P+EQK +I +I+M P+YAV+S++ L+N S
Sbjct: 43 ILSASVF-VVIAILLPMYLIFEHLASYNQPEEQKFLIGLILMVPVYAVESFLSLVN---S 98
Query: 67 ETFFTFLDSIKECYEALVIAKFLALMYSYLN--------------ISLSKNIVPDEIKGR 112
E F + I++CYEA + F + + L+ I+ S ++
Sbjct: 99 EAAFN-CEVIRDCYEAFALYCFERYLIACLDGEERTIEFMEQQTVITQSTPLLEGTCSYG 157
Query: 113 EIHHSFPMTLFQPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVYPSW-INW 171
+ H FPM F + L +K Q+++L+ +C++L + L+ VY W
Sbjct: 158 VVEHPFPMNCFVKDWS-LGPQFYHAVKIGIVQYMILKMICALLAMILEAFGVYGEGKFAW 216
Query: 172 T-----ITIIXXXXXXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
+ ++ FY+V +L P KPL+KFL K IVF +WQ
Sbjct: 217 NYGYPYLAVVLNFSQTWALYCLVQFYNVIKDKLAPIKPLAKFLTFKSIVFLTWWQ 271
>AT5G26740.3 | Symbols: | Protein of unknown function (DUF300) |
chr5:9292436-9294407 FORWARD LENGTH=422
Length = 422
Score = 78.6 bits (192), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 101/216 (46%), Gaps = 17/216 (7%)
Query: 14 CVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFL 73
C + A+ ++ I H+LN+ +P Q+ I+ II M P+YA S++ L+ S +F
Sbjct: 16 CTVGAIALAIFHIYRHLLNYTEPTYQRYIVRIIFMVPVYAFMSFLSLV-LPKSSIYF--- 71
Query: 74 DSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM--TLFQPHTTRLD 131
DSI+E YEA VI FL+L +++ V + GR + S+ + F P T LD
Sbjct: 72 DSIREVYEAWVIYNFLSLCLAWVG---GPGSVVLSLSGRSLKPSWSLMTCCFPPLT--LD 126
Query: 132 HHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY------PSWINWTITIIXXXXXXXXX 185
++ K QFV+L+P+ + + L Y P +TII
Sbjct: 127 GRFIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFNPDQAYLYLTIIYTISYTVAL 186
Query: 186 XXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
FY L+P P+ KF+ IK +VF +WQ
Sbjct: 187 YALVLFYMACRDLLQPFNPVPKFVIIKSVVFLTYWQ 222
>AT5G26740.2 | Symbols: | Protein of unknown function (DUF300) |
chr5:9292436-9294407 FORWARD LENGTH=422
Length = 422
Score = 78.6 bits (192), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 101/216 (46%), Gaps = 17/216 (7%)
Query: 14 CVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFL 73
C + A+ ++ I H+LN+ +P Q+ I+ II M P+YA S++ L+ S +F
Sbjct: 16 CTVGAIALAIFHIYRHLLNYTEPTYQRYIVRIIFMVPVYAFMSFLSLV-LPKSSIYF--- 71
Query: 74 DSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM--TLFQPHTTRLD 131
DSI+E YEA VI FL+L +++ V + GR + S+ + F P T LD
Sbjct: 72 DSIREVYEAWVIYNFLSLCLAWVG---GPGSVVLSLSGRSLKPSWSLMTCCFPPLT--LD 126
Query: 132 HHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY------PSWINWTITIIXXXXXXXXX 185
++ K QFV+L+P+ + + L Y P +TII
Sbjct: 127 GRFIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFNPDQAYLYLTIIYTISYTVAL 186
Query: 186 XXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
FY L+P P+ KF+ IK +VF +WQ
Sbjct: 187 YALVLFYMACRDLLQPFNPVPKFVIIKSVVFLTYWQ 222
>AT5G26740.1 | Symbols: | Protein of unknown function (DUF300) |
chr5:9292436-9294407 FORWARD LENGTH=422
Length = 422
Score = 78.6 bits (192), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/216 (30%), Positives = 101/216 (46%), Gaps = 17/216 (7%)
Query: 14 CVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFL 73
C + A+ ++ I H+LN+ +P Q+ I+ II M P+YA S++ L+ S +F
Sbjct: 16 CTVGAIALAIFHIYRHLLNYTEPTYQRYIVRIIFMVPVYAFMSFLSLV-LPKSSIYF--- 71
Query: 74 DSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM--TLFQPHTTRLD 131
DSI+E YEA VI FL+L +++ V + GR + S+ + F P T LD
Sbjct: 72 DSIREVYEAWVIYNFLSLCLAWVG---GPGSVVLSLSGRSLKPSWSLMTCCFPPLT--LD 126
Query: 132 HHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY------PSWINWTITIIXXXXXXXXX 185
++ K QFV+L+P+ + + L Y P +TII
Sbjct: 127 GRFIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFNPDQAYLYLTIIYTISYTVAL 186
Query: 186 XXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
FY L+P P+ KF+ IK +VF +WQ
Sbjct: 187 YALVLFYMACRDLLQPFNPVPKFVIIKSVVFLTYWQ 222
>AT3G05940.1 | Symbols: | Protein of unknown function (DUF300) |
chr3:1777592-1779648 REVERSE LENGTH=422
Length = 422
Score = 73.6 bits (179), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 101/214 (47%), Gaps = 13/214 (6%)
Query: 14 CVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFL 73
C + A+ ++ I +H+LN+ +P Q+ I+ I+ M P+YA+ S++ L+ S +F
Sbjct: 16 CTVGAIALALFHIYKHLLNYTEPIYQRYIVRIVFMVPVYALMSFLALV-LPKSSIYF--- 71
Query: 74 DSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPMTLFQPHTTRLDHH 133
+SI+E YEA VI FL+L +++ S I + GR + S+ + LD
Sbjct: 72 NSIREVYEAWVIYNFLSLCLAWVGGPGSVVI---SLTGRSLKPSWHLMTCCIPPLPLDGR 128
Query: 134 TLKLLKYWTWQFVVLRPMCSILMITLQYLEVY------PSWINWTITIIXXXXXXXXXXX 187
++ K QFV+L+P+ + + L Y P +TII
Sbjct: 129 FIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFSPDQSYLYLTIIYTISYTVALYA 188
Query: 188 XXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
FY L+P P+ KF+ IK +VF +WQ
Sbjct: 189 LVLFYVACKDLLQPFNPVPKFVIIKSVVFLTYWQ 222
>AT1G23070.1 | Symbols: | Protein of unknown function (DUF300) |
chr1:8174011-8175758 REVERSE LENGTH=403
Length = 403
Score = 67.4 bits (163), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 64/240 (26%), Positives = 108/240 (45%), Gaps = 26/240 (10%)
Query: 7 VLFGSTFCVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGS 66
++ G +F +A+ S+ I +H+ + P EQK I+ ++ M P+YA +S + L N
Sbjct: 17 LIIGGSFAT-VAICLSLYSILQHLRFYTNPAEQKWIVSVLFMVPVYATESIISLSN---- 71
Query: 67 ETFFTFLDSIKECYEALVIAKFLALMYS----------YLNISLSKNIVPD---EIKGRE 113
F D ++ CYEA + F + + + YL K ++ + E K ++
Sbjct: 72 SKFSLPCDILRNCYEAFALYSFGSYLVACLGGERRVVEYLENESKKPLLEEGANESKKKK 131
Query: 114 IHHSFPMTLFQPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVYPSW-INWT 172
+SF L P+ L + K+ Q+++L+ C+ L L+ L VY W
Sbjct: 132 KKNSFWKFLCDPYV--LGRELFVIEKFGLVQYMILKTFCAFLTFLLELLGVYGDGEFKWY 189
Query: 173 -----ITIIXXXXXXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQVFSYAL 227
I ++ FY+V + L+ KPL+KF+ K IVF +WQ F AL
Sbjct: 190 YGYPYIVVVLNFSQMWALFCLVQFYNVTHERLKEIKPLAKFISFKAIVFATWWQGFGIAL 249
>AT4G38360.1 | Symbols: LAZ1 | Protein of unknown function (DUF300)
| chr4:17967389-17969170 FORWARD LENGTH=304
Length = 304
Score = 67.0 bits (162), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 108/235 (45%), Gaps = 29/235 (12%)
Query: 15 VMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFLD 74
+++ + S+ L+ +H+ +K P+EQK +I +I+M P Y+++S+ L+ +
Sbjct: 28 LVLTLSLSLFLVFDHLSTYKNPEEQKFLIGVILMVPCYSIESFASLVK----PSISVDCG 83
Query: 75 SIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGRE---------------IHHSFP 119
+++CYE+ + F + + + + I E +GR+ I H FP
Sbjct: 84 ILRDCYESFAMYCFGRYLVACIG-GEERTIEFMERQGRKSFKTPLLDHKDEKGIIKHPFP 142
Query: 120 MTLF-QPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY-PSWINWT----- 172
M LF +P RL +++K+ Q+++++ + ++ + L+ VY W
Sbjct: 143 MNLFLKPW--RLSPWFYQVVKFGIVQYMIIKSLTALTALILEAFGVYCEGEFKWGCGYPY 200
Query: 173 ITIIXXXXXXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQVFSYAL 227
+ ++ FY EL +PL+KFL K IVF +WQ + AL
Sbjct: 201 LAVVLNFSQSWALYCLVQFYGATKDELAHIQPLAKFLTFKSIVFLTWWQGVAIAL 255
>AT4G38360.2 | Symbols: LAZ1 | Protein of unknown function (DUF300)
| chr4:17967389-17969798 FORWARD LENGTH=485
Length = 485
Score = 65.1 bits (157), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 108/235 (45%), Gaps = 29/235 (12%)
Query: 15 VMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFLD 74
+++ + S+ L+ +H+ +K P+EQK +I +I+M P Y+++S+ L+ +
Sbjct: 28 LVLTLSLSLFLVFDHLSTYKNPEEQKFLIGVILMVPCYSIESFASLVK----PSISVDCG 83
Query: 75 SIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGRE---------------IHHSFP 119
+++CYE+ + F + + + + I E +GR+ I H FP
Sbjct: 84 ILRDCYESFAMYCFGRYLVACIG-GEERTIEFMERQGRKSFKTPLLDHKDEKGIIKHPFP 142
Query: 120 MTLF-QPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY-PSWINWT----- 172
M LF +P RL +++K+ Q+++++ + ++ + L+ VY W
Sbjct: 143 MNLFLKPW--RLSPWFYQVVKFGIVQYMIIKSLTALTALILEAFGVYCEGEFKWGCGYPY 200
Query: 173 ITIIXXXXXXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQVFSYAL 227
+ ++ FY EL +PL+KFL K IVF +WQ + AL
Sbjct: 201 LAVVLNFSQSWALYCLVQFYGATKDELAHIQPLAKFLTFKSIVFLTWWQGVAIAL 255
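
The percentages on the "Identities = ..." lines above can be recomputed from the raw counts. Below is a minimal Python sketch for doing so. The helper names `parse_stats` and `percentages_consistent` are hypothetical and not part of BLAST. The rounding rule (truncation toward zero) is an assumption inferred from the values in this report, for example 179/221 is reported as 80% rather than 81%.

```python
import re

# Matches one "Name = matched/total (pct%)" field of a BLAST statistics line.
FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (matched, total, reported_pct)} for one statistics line."""
    return {
        name: (int(matched), int(total), int(pct))
        for name, matched, total, pct in FIELD.findall(line)
    }


def percentages_consistent(line):
    """True if every reported percentage equals floor(100 * matched / total).

    Truncation (rather than rounding) is an assumption based on this report.
    """
    return all(
        100 * matched // total == pct
        for matched, total, pct in parse_stats(line).values()
    )


# Statistics line copied from the AT1G77220.1 alignment in this report.
line = "Identities = 69/235 (29%), Positives = 114/235 (48%), Gaps = 26/235 (11%)"
print(parse_stats(line))
print(percentages_consistent(line))
```

The same check can be run over every statistics line in the report to spot lines damaged during extraction.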