Miyakogusa Predicted Gene
- Lj5g3v1889560.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1889560.1 Non Characterized Hit- tr|D8SX59|D8SX59_SELML
Putative uncharacterized protein OS=Selaginella
moelle,46.23,8e-19,seg,NULL; DUF1475,Protein of unknown function
DUF1475,CUFF.56119.1
(200 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
AT1G22750.4 | Symbols: | unknown protein; CONTAINS InterPro DOM...   129   2e-30
AT1G22750.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   129   2e-30
AT1G22750.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   128   2e-30
AT1G22750.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   128   2e-30
>AT1G22750.4 | Symbols: | unknown protein; CONTAINS InterPro
DOMAIN/s: Protein of unknown function DUF1475
(InterPro:IPR009943); Has 186 Blast hits to 155 proteins
in 21 species: Archae - 0; Bacteria - 8; Metazoa - 3;
Fungi - 0; Plants - 65; Viruses - 0; Other Eukaryotes -
110 (source: NCBI BLink). | chr1:8050911-8052714 FORWARD
LENGTH=257
Length = 257
Score = 129 bits (323), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 92/144 (63%), Gaps = 3/144 (2%)
Query: 1   MKLTSSQASSEDLIYYVLLRTPHKNDPELKGKLSFVVMLRILFSILGVVMLGTLVYTLVT 60
           +KLT+ +AS ED +YY+LLR   K+   L+ K S VV  R +F  LG VMLG LVYT  T
Sbjct: 99  LKLTNQEAS-EDPMYYLLLRDSIKDGVGLRDKNSLVVTARFVFGALGCVMLGALVYTCFT 157

Query: 61  AGSPFRMEIFTPWMSATLIDFYVNVVALAVWVTYKEPSWICAVFWIILLICFGSIATCTY 120
            GSPF ME+  PWM   L++FY++V  L+VWV YKE S I  + W+ LLI  GS+ T
Sbjct: 158 YGSPFHMELLYPWMVVLLVNFYIDVAVLSVWVVYKESSLIIGILWVALLIGLGSVGTSAV 217

Query: 121 IVWKLLQIQ--DPAYLVLVRQAVK 142
           IV +L ++   DP YLVLV  + +
Sbjct: 218 IVVQLFRLSPLDPLYLVLVNNSNR 241
Score = 104 bits (260), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 79/117 (67%), Gaps = 8/117 (6%)
Query: 34  SFVVMLRILFSILGVVMLGTLVYTLVTAGSPF--RMEIFTPWMSATLIDFYVNVVALAVW 91
           S V  L+++  ++  +ML TLVYT++T G P   R ++FTPW   T++DFY+N+V +AVW
Sbjct: 5   SLVTGLKVVLPVMFCLMLATLVYTIITDGLPLPDRQDVFTPWFVTTILDFYINLVPIAVW 64

Query: 92  VTYKEPSWICAVFWIILLICFGSIATCTYIVWKLLQI------QDPAYLVLVRQAVK 142
           + YKE +W  ++ W ILLI FGS+ TC Y+  +LL++      +DP Y +L+R ++K
Sbjct: 65  IVYKESTWSGSILWTILLIIFGSLTTCVYLFLQLLKLTNQEASEDPMYYLLLRDSIK 121
>AT1G22750.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: vacuole;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF1475 (InterPro:IPR009943); Has 185
Blast hits to 155 proteins in 21 species: Archae - 0;
Bacteria - 8; Metazoa - 3; Fungi - 0; Plants - 64;
Viruses - 0; Other Eukaryotes - 110 (source: NCBI
BLink). | chr1:8050911-8052618 FORWARD LENGTH=244
Length = 244
Score = 129 bits (323), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 92/144 (63%), Gaps = 3/144 (2%)
Query: 1   MKLTSSQASSEDLIYYVLLRTPHKNDPELKGKLSFVVMLRILFSILGVVMLGTLVYTLVT 60
           +KLT+ +AS ED +YY+LLR   K+   L+ K S VV  R +F  LG VMLG LVYT  T
Sbjct: 99  LKLTNQEAS-EDPMYYLLLRDSIKDGVGLRDKNSLVVTARFVFGALGCVMLGALVYTCFT 157

Query: 61  AGSPFRMEIFTPWMSATLIDFYVNVVALAVWVTYKEPSWICAVFWIILLICFGSIATCTY 120
            GSPF ME+  PWM   L++FY++V  L+VWV YKE S I  + W+ LLI  GS+ T
Sbjct: 158 YGSPFHMELLYPWMVVLLVNFYIDVAVLSVWVVYKESSLIIGILWVALLIGLGSVGTSAV 217

Query: 121 IVWKLLQIQ--DPAYLVLVRQAVK 142
           IV +L ++   DP YLVLV  + +
Sbjct: 218 IVVQLFRLSPLDPLYLVLVNNSNR 241
Score = 104 bits (260), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 79/117 (67%), Gaps = 8/117 (6%)
Query: 34  SFVVMLRILFSILGVVMLGTLVYTLVTAGSPF--RMEIFTPWMSATLIDFYVNVVALAVW 91
           S V  L+++  ++  +ML TLVYT++T G P   R ++FTPW   T++DFY+N+V +AVW
Sbjct: 5   SLVTGLKVVLPVMFCLMLATLVYTIITDGLPLPDRQDVFTPWFVTTILDFYINLVPIAVW 64

Query: 92  VTYKEPSWICAVFWIILLICFGSIATCTYIVWKLLQI------QDPAYLVLVRQAVK 142
           + YKE +W  ++ W ILLI FGS+ TC Y+  +LL++      +DP Y +L+R ++K
Sbjct: 65  IVYKESTWSGSILWTILLIIFGSLTTCVYLFLQLLKLTNQEASEDPMYYLLLRDSIK 121
>AT1G22750.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: vacuole;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF1475 (InterPro:IPR009943); Has 185
Blast hits to 155 proteins in 21 species: Archae - 0;
Bacteria - 8; Metazoa - 3; Fungi - 0; Plants - 64;
Viruses - 0; Other Eukaryotes - 110 (source: NCBI
BLink). | chr1:8050911-8052631 FORWARD LENGTH=247
Length = 247
Score = 128 bits (322), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 92/144 (63%), Gaps = 3/144 (2%)
Query: 1   MKLTSSQASSEDLIYYVLLRTPHKNDPELKGKLSFVVMLRILFSILGVVMLGTLVYTLVT 60
           +KLT+ +AS ED +YY+LLR   K+   L+ K S VV  R +F  LG VMLG LVYT  T
Sbjct: 99  LKLTNQEAS-EDPMYYLLLRDSIKDGVGLRDKNSLVVTARFVFGALGCVMLGALVYTCFT 157

Query: 61  AGSPFRMEIFTPWMSATLIDFYVNVVALAVWVTYKEPSWICAVFWIILLICFGSIATCTY 120
            GSPF ME+  PWM   L++FY++V  L+VWV YKE S I  + W+ LLI  GS+ T
Sbjct: 158 YGSPFHMELLYPWMVVLLVNFYIDVAVLSVWVVYKESSLIIGILWVALLIGLGSVGTSAV 217

Query: 121 IVWKLLQIQ--DPAYLVLVRQAVK 142
           IV +L ++   DP YLVLV  + +
Sbjct: 218 IVVQLFRLSPLDPLYLVLVNNSNR 241
Score = 104 bits (259), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 79/117 (67%), Gaps = 8/117 (6%)
Query: 34  SFVVMLRILFSILGVVMLGTLVYTLVTAGSPF--RMEIFTPWMSATLIDFYVNVVALAVW 91
           S V  L+++  ++  +ML TLVYT++T G P   R ++FTPW   T++DFY+N+V +AVW
Sbjct: 5   SLVTGLKVVLPVMFCLMLATLVYTIITDGLPLPDRQDVFTPWFVTTILDFYINLVPIAVW 64

Query: 92  VTYKEPSWICAVFWIILLICFGSIATCTYIVWKLLQI------QDPAYLVLVRQAVK 142
           + YKE +W  ++ W ILLI FGS+ TC Y+  +LL++      +DP Y +L+R ++K
Sbjct: 65  IVYKESTWSGSILWTILLIIFGSLTTCVYLFLQLLKLTNQEASEDPMYYLLLRDSIK 121
>AT1G22750.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: vacuole;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
growth stages; CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF1475 (InterPro:IPR009943); Has 35333
Blast hits to 34131 proteins in 2444 species: Archae -
798; Bacteria - 22429; Metazoa - 974; Fungi - 991;
Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr1:8050911-8052618 FORWARD
LENGTH=241
Length = 241
Score = 128 bits (322), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 92/144 (63%), Gaps = 3/144 (2%)
Query: 1   MKLTSSQASSEDLIYYVLLRTPHKNDPELKGKLSFVVMLRILFSILGVVMLGTLVYTLVT 60
           +KLT+ +AS ED +YY+LLR   K+   L+ K S VV  R +F  LG VMLG LVYT  T
Sbjct: 99  LKLTNQEAS-EDPMYYLLLRDSIKDGVGLRDKNSLVVTARFVFGALGCVMLGALVYTCFT 157

Query: 61  AGSPFRMEIFTPWMSATLIDFYVNVVALAVWVTYKEPSWICAVFWIILLICFGSIATCTY 120
            GSPF ME+  PWM   L++FY++V  L+VWV YKE S I  + W+ LLI  GS+ T
Sbjct: 158 YGSPFHMELLYPWMVVLLVNFYIDVAVLSVWVVYKESSLIIGILWVALLIGLGSVGTSAV 217

Query: 121 IVWKLLQIQ--DPAYLVLVRQAVK 142
           IV +L ++   DP YLVLV  + +
Sbjct: 218 IVVQLFRLSPLDPLYLVLVNNSNR 241
Score = 104 bits (259), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 79/117 (67%), Gaps = 8/117 (6%)
Query: 34  SFVVMLRILFSILGVVMLGTLVYTLVTAGSPF--RMEIFTPWMSATLIDFYVNVVALAVW 91
           S V  L+++  ++  +ML TLVYT++T G P   R ++FTPW   T++DFY+N+V +AVW
Sbjct: 5   SLVTGLKVVLPVMFCLMLATLVYTIITDGLPLPDRQDVFTPWFVTTILDFYINLVPIAVW 64

Query: 92  VTYKEPSWICAVFWIILLICFGSIATCTYIVWKLLQI------QDPAYLVLVRQAVK 142
           + YKE +W  ++ W ILLI FGS+ TC Y+  +LL++      +DP Y +L+R ++K
Sbjct: 65  IVYKESTWSGSILWTILLIIFGSLTTCVYLFLQLLKLTNQEASEDPMYYLLLRDSIK 121
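
Note on the statistics lines: the percentages reported for each alignment in this report are consistent with truncation rather than rounding (for example, 92/144 is about 63.9% but is shown as 63%). The following is a minimal Python sketch, not part of the BLAST output, that recomputes those percentages under that assumption. The names STATS and check_stats are hypothetical helpers introduced only for illustration.

```python
import re

# Regex for the per-alignment statistics line of a legacy BLASTP text report.
# The "Gaps" field is optional because ungapped alignments omit it.
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\)"
    r"(?:, Gaps = (\d+)/(\d+) \((\d+)%\))?"
)


def check_stats(line):
    """Return {label: (count, length, reported_pct, recomputed_pct)}.

    The recomputed percentage uses integer division (truncation), which is
    an assumption inferred from the numbers in this report.
    """
    m = STATS.search(line)
    if m is None:
        raise ValueError("not a BLAST statistics line")
    groups = m.groups()
    result = {}
    for i, label in enumerate(("identities", "positives", "gaps")):
        count, length, pct = groups[3 * i:3 * i + 3]
        if count is None:  # optional Gaps field absent
            continue
        count, length, pct = int(count), int(length), int(pct)
        result[label] = (count, length, pct, 100 * count // length)
    return result


line = "Identities = 70/144 (48%), Positives = 92/144 (63%), Gaps = 3/144 (2%)"
for label, (count, length, reported, recomputed) in check_stats(line).items():
    print(label, f"{count}/{length}", reported, recomputed)
```

In each tuple the last two values should agree if the truncation assumption holds for that line; a mismatch would suggest either a different rounding rule or a damaged statistics line.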