Miyakogusa Predicted Gene

Lj1g3v2069890.2
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v2069890.2 tr|Q6NNL9|Q6NNL9_ARATH At5g11630 OS=Arabidopsis
thaliana GN=At5g11630 PE=4 SV=1,56.04,1e-18,seg,NULL,CUFF.28418.2
         (91 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G11630.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    92   6e-20
AT5G11630.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    81   1e-16
AT5G47455.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...    71   1e-13
AT4G17310.2 | Symbols:  | unknown protein; LOCATED IN: chloropla...    71   1e-13
AT5G47455.4 | Symbols:  | unknown protein; BEST Arabidopsis thal...    59   9e-10
AT5G47455.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    58   1e-09
AT5G47455.5 | Symbols:  | unknown protein; BEST Arabidopsis thal...    58   1e-09
AT4G17310.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...    58   2e-09
AT5G47455.7 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    57   3e-09
AT5G47455.6 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    57   3e-09
AT5G47455.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    57   3e-09
AT4G39300.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    50   3e-07
AT4G39300.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    47   2e-06

>AT5G11630.2 | Symbols:  | unknown protein; FUNCTIONS IN:
          molecular_function unknown; INVOLVED IN:
          biological_process unknown; LOCATED IN: chloroplast;
          EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
          growth stages; BEST Arabidopsis thaliana protein match
          is: unknown protein (TAIR:AT4G17310.2); Has 82 Blast
          hits to 82 proteins in 10 species: Archae - 0; Bacteria
          - 0; Metazoa - 0; Fungi - 0; Plants - 82; Viruses - 0;
          Other Eukaryotes - 0 (source: NCBI BLink). |
          chr5:3740545-3741529 FORWARD LENGTH=91
          Length = 91

 Score = 92.4 bits (228), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 49/91 (53%), Positives = 62/91 (68%)

Query: 1  MASRYRSFSQPAMSLIKSTITKPTTNPKPSPFLFKXXXXXXXXXXVAELGCVQSLLPLHS 60
          MASR RS S+PA S  +S + KP+  PK +               + +LG +QSLLPL+S
Sbjct: 1  MASRCRSLSKPAFSAFRSAMNKPSIRPKSASSFIGVPPSPGFSRPIGQLGSLQSLLPLYS 60

Query: 61 AVSSARLTSCLGIDSRTSRSLSQEMGLSTPR 91
          AV+SARLTSCLGIDS+ SRSL+QE+GLS PR
Sbjct: 61 AVASARLTSCLGIDSQNSRSLAQELGLSVPR 91


>AT5G11630.1 | Symbols:  | unknown protein; FUNCTIONS IN:
          molecular_function unknown; INVOLVED IN:
          biological_process unknown; LOCATED IN: chloroplast;
          EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
          growth stages; BEST Arabidopsis thaliana protein match
          is: unknown protein (TAIR:AT4G17310.1); Has 90 Blast
          hits to 90 proteins in 10 species: Archae - 0; Bacteria
          - 0; Metazoa - 0; Fungi - 0; Plants - 90; Viruses - 0;
          Other Eukaryotes - 0 (source: NCBI BLink). |
          chr5:3740545-3740909 FORWARD LENGTH=93
          Length = 93

 Score = 80.9 bits (198), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 56/85 (65%)

Query: 1  MASRYRSFSQPAMSLIKSTITKPTTNPKPSPFLFKXXXXXXXXXXVAELGCVQSLLPLHS 60
          MASR RS S+PA S  +S + KP+  PK +               + +LG +QSLLPL+S
Sbjct: 1  MASRCRSLSKPAFSAFRSAMNKPSIRPKSASSFIGVPPSPGFSRPIGQLGSLQSLLPLYS 60

Query: 61 AVSSARLTSCLGIDSRTSRSLSQEM 85
          AV+SARLTSCLGIDS+ SRSL+Q M
Sbjct: 61 AVASARLTSCLGIDSQNSRSLAQGM 85


>AT5G47455.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G17310.2); Has 138 Blast hits to 138 proteins
           in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 138; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr5:19250805-19252229 FORWARD
           LENGTH=106
          Length = 106

 Score = 71.2 bits (173), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 34/45 (75%), Positives = 40/45 (88%)

Query: 47  AELGCVQSLLPLHSAVSSARLTSCLGIDSRTSRSLSQEMGLSTPR 91
           +ELGCVQSLLPLHS V++ARLTSCL   SR+SR+L+QEMGLS PR
Sbjct: 62  SELGCVQSLLPLHSTVAAARLTSCLSTTSRSSRALTQEMGLSVPR 106


>AT4G17310.2 | Symbols:  | unknown protein; LOCATED IN:
          chloroplast; EXPRESSED IN: 9 plant structures;
          EXPRESSED DURING: LP.04 four leaves visible, 4
          anthesis, petal differentiation and expansion stage;
          BEST Arabidopsis thaliana protein match is: unknown
          protein (TAIR:AT5G47455.3); Has 35333 Blast hits to
          34131 proteins in 2444 species: Archae - 798; Bacteria
          - 22429; Metazoa - 974; Fungi - 991; Plants - 531;
          Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
          BLink). | chr4:9685429-9686848 REVERSE LENGTH=97
          Length = 97

 Score = 71.2 bits (173), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 34/45 (75%), Positives = 40/45 (88%)

Query: 47 AELGCVQSLLPLHSAVSSARLTSCLGIDSRTSRSLSQEMGLSTPR 91
          +ELGCVQSLLPLHS V++ARLTSCL   SR+SR+LSQE+GLS PR
Sbjct: 53 SELGCVQSLLPLHSTVAAARLTSCLSTTSRSSRALSQELGLSVPR 97


>AT5G47455.4 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G17310.2); Has 125 Blast hits to 125 proteins
           in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr5:19250805-19252261 FORWARD
           LENGTH=116
          Length = 116

 Score = 58.5 bits (140), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 28/42 (66%), Positives = 35/42 (83%)

Query: 47  AELGCVQSLLPLHSAVSSARLTSCLGIDSRTSRSLSQEMGLS 88
           +ELGCVQSLLPLHS V++ARLTSCL   SR+SR+L+Q+   S
Sbjct: 62  SELGCVQSLLPLHSTVAAARLTSCLSTTSRSSRALTQDGTFS 103


>AT5G47455.2 | Symbols:  | unknown protein; BEST Arabidopsis
          thaliana protein match is: unknown protein
          (TAIR:AT4G17310.2); Has 132 Blast hits to 132 proteins
          in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
          Fungi - 0; Plants - 132; Viruses - 0; Other Eukaryotes
          - 0 (source: NCBI BLink). | chr5:19250805-19251954
          FORWARD LENGTH=104
          Length = 104

 Score = 58.2 bits (139), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 27/38 (71%), Positives = 34/38 (89%)

Query: 47 AELGCVQSLLPLHSAVSSARLTSCLGIDSRTSRSLSQE 84
          +ELGCVQSLLPLHS V++ARLTSCL   SR+SR+L+Q+
Sbjct: 62 SELGCVQSLLPLHSTVAAARLTSCLSTTSRSSRALTQD 99


>AT5G47455.5 | Symbols:  | unknown protein; BEST Arabidopsis
          thaliana protein match is: unknown protein
          (TAIR:AT4G17310.2); Has 132 Blast hits to 132 proteins
          in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0;
          Fungi - 0; Plants - 132; Viruses - 0; Other Eukaryotes
          - 0 (source: NCBI BLink). | chr5:19250805-19251954
          FORWARD LENGTH=104
          Length = 104

 Score = 58.2 bits (139), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 27/38 (71%), Positives = 34/38 (89%)

Query: 47 AELGCVQSLLPLHSAVSSARLTSCLGIDSRTSRSLSQE 84
          +ELGCVQSLLPLHS V++ARLTSCL   SR+SR+L+Q+
Sbjct: 62 SELGCVQSLLPLHSTVAAARLTSCLSTTSRSSRALTQD 99


>AT4G17310.1 | Symbols:  | unknown protein; LOCATED IN:
          chloroplast; EXPRESSED IN: 9 plant structures;
          EXPRESSED DURING: LP.04 four leaves visible, 4
          anthesis, petal differentiation and expansion stage;
          BEST Arabidopsis thaliana protein match is: unknown
          protein (TAIR:AT5G47455.7); Has 164 Blast hits to 164
          proteins in 12 species: Archae - 0; Bacteria - 0;
          Metazoa - 0; Fungi - 0; Plants - 164; Viruses - 0;
          Other Eukaryotes - 0 (source: NCBI BLink). |
          chr4:9686466-9686848 REVERSE LENGTH=99
          Length = 99

 Score = 57.8 bits (138), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 28/37 (75%), Positives = 33/37 (89%)

Query: 47 AELGCVQSLLPLHSAVSSARLTSCLGIDSRTSRSLSQ 83
          +ELGCVQSLLPLHS V++ARLTSCL   SR+SR+LSQ
Sbjct: 53 SELGCVQSLLPLHSTVAAARLTSCLSTTSRSSRALSQ 89


>AT5G47455.7 | Symbols:  | unknown protein; FUNCTIONS IN:
          molecular_function unknown; INVOLVED IN:
          biological_process unknown; LOCATED IN: chloroplast;
          EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 8
          growth stages; BEST Arabidopsis thaliana protein match
          is: unknown protein (TAIR:AT4G17310.1); Has 147 Blast
          hits to 147 proteins in 13 species: Archae - 0;
          Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 147;
          Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
          | chr5:19250805-19251665 FORWARD LENGTH=100
          Length = 100

 Score = 57.0 bits (136), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 27/37 (72%), Positives = 33/37 (89%)

Query: 47 AELGCVQSLLPLHSAVSSARLTSCLGIDSRTSRSLSQ 83
          +ELGCVQSLLPLHS V++ARLTSCL   SR+SR+L+Q
Sbjct: 62 SELGCVQSLLPLHSTVAAARLTSCLSTTSRSSRALTQ 98


>AT5G47455.6 | Symbols:  | unknown protein; FUNCTIONS IN:
          molecular_function unknown; INVOLVED IN:
          biological_process unknown; LOCATED IN: chloroplast;
          EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 8
          growth stages; BEST Arabidopsis thaliana protein match
          is: unknown protein (TAIR:AT4G17310.1); Has 35333 Blast
          hits to 34131 proteins in 2444 species: Archae - 798;
          Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
          531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
          BLink). | chr5:19250805-19251665 FORWARD LENGTH=100
          Length = 100

 Score = 57.0 bits (136), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 27/37 (72%), Positives = 33/37 (89%)

Query: 47 AELGCVQSLLPLHSAVSSARLTSCLGIDSRTSRSLSQ 83
          +ELGCVQSLLPLHS V++ARLTSCL   SR+SR+L+Q
Sbjct: 62 SELGCVQSLLPLHSTVAAARLTSCLSTTSRSSRALTQ 98


>AT5G47455.1 | Symbols:  | unknown protein; FUNCTIONS IN:
          molecular_function unknown; INVOLVED IN:
          biological_process unknown; LOCATED IN: chloroplast;
          EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 8
          growth stages; BEST Arabidopsis thaliana protein match
          is: unknown protein (TAIR:AT4G17310.1); Has 147 Blast
          hits to 147 proteins in 13 species: Archae - 0;
          Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 147;
          Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
          | chr5:19250805-19251665 FORWARD LENGTH=100
          Length = 100

 Score = 57.0 bits (136), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 27/37 (72%), Positives = 33/37 (89%)

Query: 47 AELGCVQSLLPLHSAVSSARLTSCLGIDSRTSRSLSQ 83
          +ELGCVQSLLPLHS V++ARLTSCL   SR+SR+L+Q
Sbjct: 62 SELGCVQSLLPLHSTVAAARLTSCLSTTSRSSRALTQ 98


>AT4G39300.2 | Symbols:  | unknown protein; FUNCTIONS IN:
          molecular_function unknown; INVOLVED IN:
          biological_process unknown; LOCATED IN: chloroplast;
          EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
          growth stages; BEST Arabidopsis thaliana protein match
          is: unknown protein (TAIR:AT5G11630.2); Has 89 Blast
          hits to 89 proteins in 11 species: Archae - 0; Bacteria
          - 0; Metazoa - 0; Fungi - 0; Plants - 89; Viruses - 0;
          Other Eukaryotes - 0 (source: NCBI BLink). |
          chr4:18286835-18288156 FORWARD LENGTH=96
          Length = 96

 Score = 50.1 bits (118), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 33/87 (37%), Positives = 49/87 (56%), Gaps = 8/87 (9%)

Query: 1  MASRYRSFSQPAMSLIKSTITKPTTNPKPSPFLFKXXXXXXXXXXVAELGCVQSLLPLHS 60
          + SR R+ +Q +++L      KPTT    SPF             ++ LG V++++PLHS
Sbjct: 16 LVSRSRTVTQKSLNL------KPTTTS--SPFASMSQSIPRASRVLSALGSVETMIPLHS 67

Query: 61 AVSSARLTSCLGIDSRTSRSLSQEMGL 87
          AV+SARL S +  DS     LSQE+G+
Sbjct: 68 AVASARLRSSIAADSSCWSLLSQELGV 94


>AT4G39300.1 | Symbols:  | unknown protein; FUNCTIONS IN:
          molecular_function unknown; INVOLVED IN:
          biological_process unknown; LOCATED IN: chloroplast;
          EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
          growth stages; Has 30201 Blast hits to 17322 proteins
          in 780 species: Archae - 12; Bacteria - 1396; Metazoa -
          17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other
          Eukaryotes - 2996 (source: NCBI BLink). |
          chr4:18286835-18287823 FORWARD LENGTH=96
          Length = 96

 Score = 47.4 bits (111), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 50/90 (55%), Gaps = 10/90 (11%)

Query: 1  MASRYRSFSQPAMSLIKSTITKPTTNPKPSPFLFKXXXXXXXXXXVAELGCVQSLLPLHS 60
          + SR R+ +Q +++L      KPTT    SPF             ++ LG V++++PLHS
Sbjct: 16 LVSRSRTVTQKSLNL------KPTTTS--SPFASMSQSIPRASRVLSALGSVETMIPLHS 67

Query: 61 AVSSARLTSCLGIDSRTSRSLSQEMGLSTP 90
          AV+SARL S +  DS     LSQ  GL+TP
Sbjct: 68 AVASARLRSSIAADSSCWSLLSQ--GLATP 95