Miyakogusa Predicted Gene

Lj5g3v1889530.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v1889530.1 Non Chatacterized Hit- tr|B7FK17|B7FK17_MEDTR
Uncharacterized protein OS=Medicago truncatula PE=2
SV,51.67,2e-18,DUF1475,Protein of unknown function
DUF1475,CUFF.56136.1
         (242 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G22750.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   255   2e-68
AT1G22750.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   253   1e-67
AT1G22750.4 | Symbols:  | unknown protein; CONTAINS InterPro DOM...   250   5e-67
AT1G22750.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   250   6e-67

>AT1G22750.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: vacuole;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
           growth stages; CONTAINS InterPro DOMAIN/s: Protein of
           unknown function DUF1475 (InterPro:IPR009943); Has 185
           Blast hits to 155 proteins in 21 species: Archae - 0;
           Bacteria - 8; Metazoa - 3; Fungi - 0; Plants - 64;
           Viruses - 0; Other Eukaryotes - 110 (source: NCBI
           BLink). | chr1:8050911-8052618 FORWARD LENGTH=244
          Length = 244

 Score =  255 bits (651), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 131/243 (53%), Positives = 165/243 (67%), Gaps = 2/243 (0%)

Query: 1   MGKIPLASGLRTLFTVLGALMLATLLYTIFTDGLPF--RKELLTPWMAATLVDFYINIVP 58
           M    L +GL+ +  V+  LMLATL+YTI TDGLP   R+++ TPW   T++DFYIN+VP
Sbjct: 1   MATTSLVTGLKVVLPVMFCLMLATLVYTIITDGLPLPDRQDVFTPWFVTTILDFYINLVP 60

Query: 59  FSVWIAYKESNWISSLLWIIFLVCFGSITTSAYIVLQFVKLSSQECSQDPMYYVLLRHPH 118
            +VWI YKES W  S+LW I L+ FGS+TT  Y+ LQ +KL++QE S+DPMYY+LLR   
Sbjct: 61  IAVWIVYKESTWSGSILWTILLIIFGSLTTCVYLFLQLLKLTNQEASEDPMYYLLLRDSI 120

Query: 119 KNDAEPKGKISFVAMLRIIFSILGVVMLGTLVYTLVTDGSPFRTELLTPWMVATLVDFYI 178
           K+    + K S V   R +F  LG VMLG LVYT  T GSPF  ELL PWMV  LV+FYI
Sbjct: 121 KDGVGLRDKNSLVVTARFVFGALGCVMLGALVYTCFTYGSPFHMELLYPWMVVLLVNFYI 180

Query: 179 NVTALAVWVAYKESSWICAVLWIILLICFGSITTCVYIVCQLLQISIQDHAYLVLVPHGD 238
           +V  L+VWV YKESS I  +LW+ LLI  GS+ T   IV QL ++S  D  YLVLV + +
Sbjct: 181 DVAVLSVWVVYKESSLIIGILWVALLIGLGSVGTSAVIVVQLFRLSPLDPLYLVLVNNSN 240

Query: 239 RKQ 241
           RKQ
Sbjct: 241 RKQ 243


>AT1G22750.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: vacuole;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
           growth stages; CONTAINS InterPro DOMAIN/s: Protein of
           unknown function DUF1475 (InterPro:IPR009943); Has 185
           Blast hits to 155 proteins in 21 species: Archae - 0;
           Bacteria - 8; Metazoa - 3; Fungi - 0; Plants - 64;
           Viruses - 0; Other Eukaryotes - 110 (source: NCBI
           BLink). | chr1:8050911-8052631 FORWARD LENGTH=247
          Length = 247

 Score =  253 bits (645), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 130/242 (53%), Positives = 164/242 (67%), Gaps = 2/242 (0%)

Query: 1   MGKIPLASGLRTLFTVLGALMLATLLYTIFTDGLPF--RKELLTPWMAATLVDFYINIVP 58
           M    L +GL+ +  V+  LMLATL+YTI TDGLP   R+++ TPW   T++DFYIN+VP
Sbjct: 1   MATTSLVTGLKVVLPVMFCLMLATLVYTIITDGLPLPDRQDVFTPWFVTTILDFYINLVP 60

Query: 59  FSVWIAYKESNWISSLLWIIFLVCFGSITTSAYIVLQFVKLSSQECSQDPMYYVLLRHPH 118
            +VWI YKES W  S+LW I L+ FGS+TT  Y+ LQ +KL++QE S+DPMYY+LLR   
Sbjct: 61  IAVWIVYKESTWSGSILWTILLIIFGSLTTCVYLFLQLLKLTNQEASEDPMYYLLLRDSI 120

Query: 119 KNDAEPKGKISFVAMLRIIFSILGVVMLGTLVYTLVTDGSPFRTELLTPWMVATLVDFYI 178
           K+    + K S V   R +F  LG VMLG LVYT  T GSPF  ELL PWMV  LV+FYI
Sbjct: 121 KDGVGLRDKNSLVVTARFVFGALGCVMLGALVYTCFTYGSPFHMELLYPWMVVLLVNFYI 180

Query: 179 NVTALAVWVAYKESSWICAVLWIILLICFGSITTCVYIVCQLLQISIQDHAYLVLVPHGD 238
           +V  L+VWV YKESS I  +LW+ LLI  GS+ T   IV QL ++S  D  YLVLV + +
Sbjct: 181 DVAVLSVWVVYKESSLIIGILWVALLIGLGSVGTSAVIVVQLFRLSPLDPLYLVLVNNSN 240

Query: 239 RK 240
           RK
Sbjct: 241 RK 242


>AT1G22750.4 | Symbols:  | unknown protein; CONTAINS InterPro
           DOMAIN/s: Protein of unknown function DUF1475
           (InterPro:IPR009943); Has 186 Blast hits to 155 proteins
           in 21 species: Archae - 0; Bacteria - 8; Metazoa - 3;
           Fungi - 0; Plants - 65; Viruses - 0; Other Eukaryotes -
           110 (source: NCBI BLink). | chr1:8050911-8052714 FORWARD
           LENGTH=257
          Length = 257

 Score =  250 bits (639), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 129/241 (53%), Positives = 163/241 (67%), Gaps = 2/241 (0%)

Query: 1   MGKIPLASGLRTLFTVLGALMLATLLYTIFTDGLPF--RKELLTPWMAATLVDFYINIVP 58
           M    L +GL+ +  V+  LMLATL+YTI TDGLP   R+++ TPW   T++DFYIN+VP
Sbjct: 1   MATTSLVTGLKVVLPVMFCLMLATLVYTIITDGLPLPDRQDVFTPWFVTTILDFYINLVP 60

Query: 59  FSVWIAYKESNWISSLLWIIFLVCFGSITTSAYIVLQFVKLSSQECSQDPMYYVLLRHPH 118
            +VWI YKES W  S+LW I L+ FGS+TT  Y+ LQ +KL++QE S+DPMYY+LLR   
Sbjct: 61  IAVWIVYKESTWSGSILWTILLIIFGSLTTCVYLFLQLLKLTNQEASEDPMYYLLLRDSI 120

Query: 119 KNDAEPKGKISFVAMLRIIFSILGVVMLGTLVYTLVTDGSPFRTELLTPWMVATLVDFYI 178
           K+    + K S V   R +F  LG VMLG LVYT  T GSPF  ELL PWMV  LV+FYI
Sbjct: 121 KDGVGLRDKNSLVVTARFVFGALGCVMLGALVYTCFTYGSPFHMELLYPWMVVLLVNFYI 180

Query: 179 NVTALAVWVAYKESSWICAVLWIILLICFGSITTCVYIVCQLLQISIQDHAYLVLVPHGD 238
           +V  L+VWV YKESS I  +LW+ LLI  GS+ T   IV QL ++S  D  YLVLV + +
Sbjct: 181 DVAVLSVWVVYKESSLIIGILWVALLIGLGSVGTSAVIVVQLFRLSPLDPLYLVLVNNSN 240

Query: 239 R 239
           R
Sbjct: 241 R 241


>AT1G22750.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: vacuole;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
           growth stages; CONTAINS InterPro DOMAIN/s: Protein of
           unknown function DUF1475 (InterPro:IPR009943); Has 35333
           Blast hits to 34131 proteins in 2444 species: Archae -
           798; Bacteria - 22429; Metazoa - 974; Fungi - 991;
           Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr1:8050911-8052618 FORWARD
           LENGTH=241
          Length = 241

 Score =  250 bits (639), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 129/241 (53%), Positives = 163/241 (67%), Gaps = 2/241 (0%)

Query: 1   MGKIPLASGLRTLFTVLGALMLATLLYTIFTDGLPF--RKELLTPWMAATLVDFYINIVP 58
           M    L +GL+ +  V+  LMLATL+YTI TDGLP   R+++ TPW   T++DFYIN+VP
Sbjct: 1   MATTSLVTGLKVVLPVMFCLMLATLVYTIITDGLPLPDRQDVFTPWFVTTILDFYINLVP 60

Query: 59  FSVWIAYKESNWISSLLWIIFLVCFGSITTSAYIVLQFVKLSSQECSQDPMYYVLLRHPH 118
            +VWI YKES W  S+LW I L+ FGS+TT  Y+ LQ +KL++QE S+DPMYY+LLR   
Sbjct: 61  IAVWIVYKESTWSGSILWTILLIIFGSLTTCVYLFLQLLKLTNQEASEDPMYYLLLRDSI 120

Query: 119 KNDAEPKGKISFVAMLRIIFSILGVVMLGTLVYTLVTDGSPFRTELLTPWMVATLVDFYI 178
           K+    + K S V   R +F  LG VMLG LVYT  T GSPF  ELL PWMV  LV+FYI
Sbjct: 121 KDGVGLRDKNSLVVTARFVFGALGCVMLGALVYTCFTYGSPFHMELLYPWMVVLLVNFYI 180

Query: 179 NVTALAVWVAYKESSWICAVLWIILLICFGSITTCVYIVCQLLQISIQDHAYLVLVPHGD 238
           +V  L+VWV YKESS I  +LW+ LLI  GS+ T   IV QL ++S  D  YLVLV + +
Sbjct: 181 DVAVLSVWVVYKESSLIIGILWVALLIGLGSVGTSAVIVVQLFRLSPLDPLYLVLVNNSN 240

Query: 239 R 239
           R
Sbjct: 241 R 241