Miyakogusa Predicted Gene

Lj0g3v0196769.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0196769.1 Non Characterized Hit- tr|I1KE43|I1KE43_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.57417
PE,89.47,0,seg,NULL; ORGANIC SOLUTE TRANSPORTER-RELATED,Organic solute
transporter Ost-alpha; Solute_trans_a,Or,CUFF.12470.1
         (245 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G21570.1 | Symbols:  | Protein of unknown function (DUF300) |...   324   4e-89
AT1G11200.1 | Symbols:  | Protein of unknown function (DUF300) |...   317   6e-87
AT1G77220.1 | Symbols:  | Protein of unknown function (DUF300) |...    86   3e-17
AT5G26740.3 | Symbols:  | Protein of unknown function (DUF300) |...    79   3e-15
AT5G26740.2 | Symbols:  | Protein of unknown function (DUF300) |...    79   3e-15
AT5G26740.1 | Symbols:  | Protein of unknown function (DUF300) |...    79   3e-15
AT3G05940.1 | Symbols:  | Protein of unknown function (DUF300) |...    74   1e-13
AT1G23070.1 | Symbols:  | Protein of unknown function (DUF300) |...    67   9e-12
AT4G38360.1 | Symbols: LAZ1 | Protein of unknown function (DUF30...    67   1e-11
AT4G38360.2 | Symbols: LAZ1 | Protein of unknown function (DUF30...    65   4e-11
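The summary table above has a fixed shape: subject identifier, a pipe-delimited description, then the bit score and E-value as the last two columns. As a minimal sketch, assuming that layout holds for every line, the rows could be read into tuples as follows. The names `HIT_LINE` and `parse_hit_line` are hypothetical and are not part of BLAST.

```python
import re

# Hypothetical pattern for one summary row: the ID is the first token,
# and the bit score and E-value are the last two whitespace-separated fields.
HIT_LINE = re.compile(
    r"^(\S+)\s+\|.*\s(\d+(?:\.\d+)?)\s+(\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s*$"
)

def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value), or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return m.group(1), float(m.group(2)), float(m.group(3))

# Two rows copied from the table above.
rows = [
    "AT4G21570.1 | Symbols:  | Protein of unknown function (DUF300) |...   324   4e-89",
    "AT4G38360.1 | Symbols: LAZ1 | Protein of unknown function (DUF30...    67   1e-11",
]
hits = [parse_hit_line(r) for r in rows]
```

Rows that do not follow this layout return `None` rather than raising, so header and blank lines can be passed through the same function and filtered out.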

>AT4G21570.1 | Symbols:  | Protein of unknown function (DUF300) |
           chr4:11471126-11472269 REVERSE LENGTH=294
          Length = 294

 Score =  324 bits (830), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 151/221 (68%), Positives = 179/221 (80%)

Query: 1   MNPAQIVLFGSTFCVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGL 60
           + P QI  + S F V++ +HF+++L+S+H+ +WK PKEQKAI+II++MAP+YAV S++GL
Sbjct: 7   LKPPQITFYCSAFSVLLTLHFTIQLVSQHLFHWKNPKEQKAILIIVLMAPIYAVVSFIGL 66

Query: 61  INFFGSETFFTFLDSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM 120
           +   GSETFF FL+SIKECYEALVIAKFLALMYSYLNIS+SKNI+PD IKGREIHHSFPM
Sbjct: 67  LEVKGSETFFLFLESIKECYEALVIAKFLALMYSYLNISMSKNILPDGIKGREIHHSFPM 126

Query: 121 TLFQPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVYPSWINWTITIIXXXX 180
           TLFQPH  RLD HTLKLLKYWTWQFVV+RP+CS LMI LQ +  YPSW++WT TII    
Sbjct: 127 TLFQPHVVRLDRHTLKLLKYWTWQFVVIRPVCSTLMIALQLIGFYPSWLSWTFTIIVNFS 186

Query: 181 XXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
                     FYHVFAKEL PH PL+KFLCIKGIVFF FWQ
Sbjct: 187 VSLALYSLVIFYHVFAKELAPHNPLAKFLCIKGIVFFVFWQ 227


>AT1G11200.1 | Symbols:  | Protein of unknown function (DUF300) |
           chr1:3753896-3755459 FORWARD LENGTH=295
          Length = 295

 Score =  317 bits (811), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 142/221 (64%), Positives = 181/221 (81%)

Query: 1   MNPAQIVLFGSTFCVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGL 60
           ++PA+I + GS FCV++++HF+M+L+S+H+  WKKP EQ+AI+II++MAP+YA++S+VGL
Sbjct: 7   LSPAEITVMGSVFCVLLSMHFTMQLVSQHLFYWKKPNEQRAILIIVLMAPVYAINSFVGL 66

Query: 61  INFFGSETFFTFLDSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM 120
           ++  GS+ FF FLD++KECYEALVIAKFLALMYSY+NIS+S  I+PDE KGREIHHSFPM
Sbjct: 67  LDAKGSKPFFMFLDAVKECYEALVIAKFLALMYSYVNISMSARIIPDEFKGREIHHSFPM 126

Query: 121 TLFQPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVYPSWINWTITIIXXXX 180
           TLF P TT LD+ TLK LK WTWQF ++RP+CSILMITLQ L +YP W++W  T I    
Sbjct: 127 TLFVPRTTHLDYLTLKQLKQWTWQFCIIRPVCSILMITLQILGIYPVWLSWIFTAILNVS 186

Query: 181 XXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
                     FYHVFAKELEPHKPL+KF+C+KGIVFFCFWQ
Sbjct: 187 VSLALYSLVKFYHVFAKELEPHKPLTKFMCVKGIVFFCFWQ 227


>AT1G77220.1 | Symbols:  | Protein of unknown function (DUF300) |
           chr1:29013232-29015530 FORWARD LENGTH=484
          Length = 484

 Score = 85.5 bits (210), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 69/235 (29%), Positives = 114/235 (48%), Gaps = 26/235 (11%)

Query: 7   VLFGSTFCVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGS 66
           +L  S F V+IA+   M LI EH+ ++ +P+EQK +I +I+M P+YAV+S++ L+N   S
Sbjct: 43  ILSASVF-VVIAILLPMYLIFEHLASYNQPEEQKFLIGLILMVPVYAVESFLSLVN---S 98

Query: 67  ETFFTFLDSIKECYEALVIAKFLALMYSYLN--------------ISLSKNIVPDEIKGR 112
           E  F   + I++CYEA  +  F   + + L+              I+ S  ++       
Sbjct: 99  EAAFN-CEVIRDCYEAFALYCFERYLIACLDGEERTIEFMEQQTVITQSTPLLEGTCSYG 157

Query: 113 EIHHSFPMTLFQPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVYPSW-INW 171
            + H FPM  F    + L       +K    Q+++L+ +C++L + L+   VY      W
Sbjct: 158 VVEHPFPMNCFVKDWS-LGPQFYHAVKIGIVQYMILKMICALLAMILEAFGVYGEGKFAW 216

Query: 172 T-----ITIIXXXXXXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
                 + ++              FY+V   +L P KPL+KFL  K IVF  +WQ
Sbjct: 217 NYGYPYLAVVLNFSQTWALYCLVQFYNVIKDKLAPIKPLAKFLTFKSIVFLTWWQ 271


>AT5G26740.3 | Symbols:  | Protein of unknown function (DUF300) |
           chr5:9292436-9294407 FORWARD LENGTH=422
          Length = 422

 Score = 78.6 bits (192), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/216 (30%), Positives = 101/216 (46%), Gaps = 17/216 (7%)

Query: 14  CVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFL 73
           C + A+  ++  I  H+LN+ +P  Q+ I+ II M P+YA  S++ L+    S  +F   
Sbjct: 16  CTVGAIALAIFHIYRHLLNYTEPTYQRYIVRIIFMVPVYAFMSFLSLV-LPKSSIYF--- 71

Query: 74  DSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM--TLFQPHTTRLD 131
           DSI+E YEA VI  FL+L  +++        V   + GR +  S+ +    F P T  LD
Sbjct: 72  DSIREVYEAWVIYNFLSLCLAWVG---GPGSVVLSLSGRSLKPSWSLMTCCFPPLT--LD 126

Query: 132 HHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY------PSWINWTITIIXXXXXXXXX 185
              ++  K    QFV+L+P+   + + L     Y      P      +TII         
Sbjct: 127 GRFIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFNPDQAYLYLTIIYTISYTVAL 186

Query: 186 XXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
                FY      L+P  P+ KF+ IK +VF  +WQ
Sbjct: 187 YALVLFYMACRDLLQPFNPVPKFVIIKSVVFLTYWQ 222


>AT5G26740.2 | Symbols:  | Protein of unknown function (DUF300) |
           chr5:9292436-9294407 FORWARD LENGTH=422
          Length = 422

 Score = 78.6 bits (192), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/216 (30%), Positives = 101/216 (46%), Gaps = 17/216 (7%)

Query: 14  CVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFL 73
           C + A+  ++  I  H+LN+ +P  Q+ I+ II M P+YA  S++ L+    S  +F   
Sbjct: 16  CTVGAIALAIFHIYRHLLNYTEPTYQRYIVRIIFMVPVYAFMSFLSLV-LPKSSIYF--- 71

Query: 74  DSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM--TLFQPHTTRLD 131
           DSI+E YEA VI  FL+L  +++        V   + GR +  S+ +    F P T  LD
Sbjct: 72  DSIREVYEAWVIYNFLSLCLAWVG---GPGSVVLSLSGRSLKPSWSLMTCCFPPLT--LD 126

Query: 132 HHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY------PSWINWTITIIXXXXXXXXX 185
              ++  K    QFV+L+P+   + + L     Y      P      +TII         
Sbjct: 127 GRFIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFNPDQAYLYLTIIYTISYTVAL 186

Query: 186 XXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
                FY      L+P  P+ KF+ IK +VF  +WQ
Sbjct: 187 YALVLFYMACRDLLQPFNPVPKFVIIKSVVFLTYWQ 222


>AT5G26740.1 | Symbols:  | Protein of unknown function (DUF300) |
           chr5:9292436-9294407 FORWARD LENGTH=422
          Length = 422

 Score = 78.6 bits (192), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/216 (30%), Positives = 101/216 (46%), Gaps = 17/216 (7%)

Query: 14  CVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFL 73
           C + A+  ++  I  H+LN+ +P  Q+ I+ II M P+YA  S++ L+    S  +F   
Sbjct: 16  CTVGAIALAIFHIYRHLLNYTEPTYQRYIVRIIFMVPVYAFMSFLSLV-LPKSSIYF--- 71

Query: 74  DSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPM--TLFQPHTTRLD 131
           DSI+E YEA VI  FL+L  +++        V   + GR +  S+ +    F P T  LD
Sbjct: 72  DSIREVYEAWVIYNFLSLCLAWVG---GPGSVVLSLSGRSLKPSWSLMTCCFPPLT--LD 126

Query: 132 HHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY------PSWINWTITIIXXXXXXXXX 185
              ++  K    QFV+L+P+   + + L     Y      P      +TII         
Sbjct: 127 GRFIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFNPDQAYLYLTIIYTISYTVAL 186

Query: 186 XXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
                FY      L+P  P+ KF+ IK +VF  +WQ
Sbjct: 187 YALVLFYMACRDLLQPFNPVPKFVIIKSVVFLTYWQ 222


>AT3G05940.1 | Symbols:  | Protein of unknown function (DUF300) |
           chr3:1777592-1779648 REVERSE LENGTH=422
          Length = 422

 Score = 73.6 bits (179), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 61/214 (28%), Positives = 101/214 (47%), Gaps = 13/214 (6%)

Query: 14  CVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFL 73
           C + A+  ++  I +H+LN+ +P  Q+ I+ I+ M P+YA+ S++ L+    S  +F   
Sbjct: 16  CTVGAIALALFHIYKHLLNYTEPIYQRYIVRIVFMVPVYALMSFLALV-LPKSSIYF--- 71

Query: 74  DSIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGREIHHSFPMTLFQPHTTRLDHH 133
           +SI+E YEA VI  FL+L  +++    S  I    + GR +  S+ +         LD  
Sbjct: 72  NSIREVYEAWVIYNFLSLCLAWVGGPGSVVI---SLTGRSLKPSWHLMTCCIPPLPLDGR 128

Query: 134 TLKLLKYWTWQFVVLRPMCSILMITLQYLEVY------PSWINWTITIIXXXXXXXXXXX 187
            ++  K    QFV+L+P+   + + L     Y      P      +TII           
Sbjct: 129 FIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFSPDQSYLYLTIIYTISYTVALYA 188

Query: 188 XXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQ 221
              FY      L+P  P+ KF+ IK +VF  +WQ
Sbjct: 189 LVLFYVACKDLLQPFNPVPKFVIIKSVVFLTYWQ 222


>AT1G23070.1 | Symbols:  | Protein of unknown function (DUF300) |
           chr1:8174011-8175758 REVERSE LENGTH=403
          Length = 403

 Score = 67.4 bits (163), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 64/240 (26%), Positives = 108/240 (45%), Gaps = 26/240 (10%)

Query: 7   VLFGSTFCVMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGS 66
           ++ G +F   +A+  S+  I +H+  +  P EQK I+ ++ M P+YA +S + L N    
Sbjct: 17  LIIGGSFAT-VAICLSLYSILQHLRFYTNPAEQKWIVSVLFMVPVYATESIISLSN---- 71

Query: 67  ETFFTFLDSIKECYEALVIAKFLALMYS----------YLNISLSKNIVPD---EIKGRE 113
             F    D ++ CYEA  +  F + + +          YL     K ++ +   E K ++
Sbjct: 72  SKFSLPCDILRNCYEAFALYSFGSYLVACLGGERRVVEYLENESKKPLLEEGANESKKKK 131

Query: 114 IHHSFPMTLFQPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVYPSW-INWT 172
             +SF   L  P+   L      + K+   Q+++L+  C+ L   L+ L VY      W 
Sbjct: 132 KKNSFWKFLCDPYV--LGRELFVIEKFGLVQYMILKTFCAFLTFLLELLGVYGDGEFKWY 189

Query: 173 -----ITIIXXXXXXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQVFSYAL 227
                I ++              FY+V  + L+  KPL+KF+  K IVF  +WQ F  AL
Sbjct: 190 YGYPYIVVVLNFSQMWALFCLVQFYNVTHERLKEIKPLAKFISFKAIVFATWWQGFGIAL 249


>AT4G38360.1 | Symbols: LAZ1 | Protein of unknown function (DUF300)
           | chr4:17967389-17969170 FORWARD LENGTH=304
          Length = 304

 Score = 67.0 bits (162), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 108/235 (45%), Gaps = 29/235 (12%)

Query: 15  VMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFLD 74
           +++ +  S+ L+ +H+  +K P+EQK +I +I+M P Y+++S+  L+      +      
Sbjct: 28  LVLTLSLSLFLVFDHLSTYKNPEEQKFLIGVILMVPCYSIESFASLVK----PSISVDCG 83

Query: 75  SIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGRE---------------IHHSFP 119
            +++CYE+  +  F   + + +     + I   E +GR+               I H FP
Sbjct: 84  ILRDCYESFAMYCFGRYLVACIG-GEERTIEFMERQGRKSFKTPLLDHKDEKGIIKHPFP 142

Query: 120 MTLF-QPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY-PSWINWT----- 172
           M LF +P   RL     +++K+   Q+++++ + ++  + L+   VY      W      
Sbjct: 143 MNLFLKPW--RLSPWFYQVVKFGIVQYMIIKSLTALTALILEAFGVYCEGEFKWGCGYPY 200

Query: 173 ITIIXXXXXXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQVFSYAL 227
           + ++              FY     EL   +PL+KFL  K IVF  +WQ  + AL
Sbjct: 201 LAVVLNFSQSWALYCLVQFYGATKDELAHIQPLAKFLTFKSIVFLTWWQGVAIAL 255


>AT4G38360.2 | Symbols: LAZ1 | Protein of unknown function (DUF300)
           | chr4:17967389-17969798 FORWARD LENGTH=485
          Length = 485

 Score = 65.1 bits (157), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 108/235 (45%), Gaps = 29/235 (12%)

Query: 15  VMIAVHFSMKLISEHVLNWKKPKEQKAIIIIIMMAPLYAVDSYVGLINFFGSETFFTFLD 74
           +++ +  S+ L+ +H+  +K P+EQK +I +I+M P Y+++S+  L+      +      
Sbjct: 28  LVLTLSLSLFLVFDHLSTYKNPEEQKFLIGVILMVPCYSIESFASLVK----PSISVDCG 83

Query: 75  SIKECYEALVIAKFLALMYSYLNISLSKNIVPDEIKGRE---------------IHHSFP 119
            +++CYE+  +  F   + + +     + I   E +GR+               I H FP
Sbjct: 84  ILRDCYESFAMYCFGRYLVACIG-GEERTIEFMERQGRKSFKTPLLDHKDEKGIIKHPFP 142

Query: 120 MTLF-QPHTTRLDHHTLKLLKYWTWQFVVLRPMCSILMITLQYLEVY-PSWINWT----- 172
           M LF +P   RL     +++K+   Q+++++ + ++  + L+   VY      W      
Sbjct: 143 MNLFLKPW--RLSPWFYQVVKFGIVQYMIIKSLTALTALILEAFGVYCEGEFKWGCGYPY 200

Query: 173 ITIIXXXXXXXXXXXXXXFYHVFAKELEPHKPLSKFLCIKGIVFFCFWQVFSYAL 227
           + ++              FY     EL   +PL+KFL  K IVF  +WQ  + AL
Sbjct: 201 LAVVLNFSQSWALYCLVQFYGATKDELAHIQPLAKFLTFKSIVFLTWWQGVAIAL 255