Miyakogusa Predicted Gene

Lj4g3v1880630.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v1880630.1 tr|Q9XI35|Q9XI35_ARATH AT1G15350 protein
OS=Arabidopsis thaliana GN=F9L1.29 PE=4
SV=1,42.73,3e-19,DUF4050,Domain of unknown function DUF4050;
seg,NULL,CUFF.49811.1
         (173 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   172   1e-43
AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   172   1e-43
AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   127   4e-30
AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   123   5e-29
AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   123   5e-29
AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   122   9e-29
AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   122   9e-29
AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   100   5e-22
AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    80   8e-16
AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    68   4e-12
AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    68   4e-12

>AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1). | chr5:8799934-8802333 REVERSE
           LENGTH=169
          Length = 169

 Score =  172 bits (435), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 84/168 (50%), Positives = 121/168 (72%), Gaps = 5/168 (2%)

Query: 5   IRTAWITDLFAXXXXXXXXXXKPTPIIAVDEPAKGLRIQGQSVRKPTISDGFWSSSTCDL 64
           I  +WI  LF           KP  I+AVDEP+KGLRIQG+ V+KP++S+ FWS+STC++
Sbjct: 6   IIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEM 65

Query: 65  DNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQ 124
           DNST+QSQ+S+SS+S  NN    ++ ++S     EFVN GL LWN++R QW+ +G S+K+
Sbjct: 66  DNSTLQSQRSMSSISFTNN----TSTSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKK 121

Query: 125 TQQKRDPKLNWNTSYESLIGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
             + R+P ++WN +YESL+G  + F + +PL EMV+FLVDVWE+EG+Y
Sbjct: 122 A-KVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLY 168


>AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:8799934-8802333
           REVERSE LENGTH=169
          Length = 169

 Score =  172 bits (435), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 84/168 (50%), Positives = 121/168 (72%), Gaps = 5/168 (2%)

Query: 5   IRTAWITDLFAXXXXXXXXXXKPTPIIAVDEPAKGLRIQGQSVRKPTISDGFWSSSTCDL 64
           I  +WI  LF           KP  I+AVDEP+KGLRIQG+ V+KP++S+ FWS+STC++
Sbjct: 6   IIPSWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEM 65

Query: 65  DNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQ 124
           DNST+QSQ+S+SS+S  NN    ++ ++S     EFVN GL LWN++R QW+ +G S+K+
Sbjct: 66  DNSTLQSQRSMSSISFTNN----TSTSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKK 121

Query: 125 TQQKRDPKLNWNTSYESLIGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
             + R+P ++WN +YESL+G  + F + +PL EMV+FLVDVWE+EG+Y
Sbjct: 122 A-KVREPTISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLY 168


>AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25360.2);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr4:15615184-15616200
           REVERSE LENGTH=161
          Length = 161

 Score =  127 bits (319), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 69/145 (47%), Positives = 95/145 (65%), Gaps = 14/145 (9%)

Query: 30  IIAVDEPAKGLRIQGQSVRKPTIS-DGFWSSSTCDLD-NSTIQSQKSISSVSTLNNILYH 87
           ++ VDEP+KGL+IQG+ V+K + S D FWS+STCD+D N TIQSQ S        N  + 
Sbjct: 28  VVEVDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNITIQSQSS--------NPPFD 79

Query: 88  SNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESLIGTRQ 147
              ++S  T  EFVN GL+LWN +R QW       +Q     +P ++WN++Y+SL+ T +
Sbjct: 80  PQCSTSNST--EFVNHGLILWNHTRQQWRECL--TRQQCLVPEPAISWNSTYDSLLSTNK 135

Query: 148 PFPKSVPLSEMVEFLVDVWEREGMY 172
            FP+ +PL EMV FLVDVWE EG+Y
Sbjct: 136 LFPQPIPLKEMVHFLVDVWEEEGLY 160


>AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr3:5340243-5341216 FORWARD
           LENGTH=162
          Length = 162

 Score =  123 bits (309), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 60/129 (46%), Positives = 91/129 (70%), Gaps = 5/129 (3%)

Query: 47  VRKPTI--SDGFWSSSTCDLDNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEG 104
           +RKP++  S+ FW+++T D++++   +  S+SS+ST N  +      SS+   AEFVN G
Sbjct: 37  LRKPSVVASEDFWTNTTLDMESN---AHGSVSSISTTNLTIDSQGCGSSSNEPAEFVNHG 93

Query: 105 LLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESLIGTRQPFPKSVPLSEMVEFLVD 164
           L+LWN++R QWVG  RS+ +    R+P LN N +YESL+G+ + FP+ +PL EMV+FLV+
Sbjct: 94  LVLWNQTRQQWVGDKRSESRKSVGREPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVE 153

Query: 165 VWEREGMYG 173
           VWE EG+YG
Sbjct: 154 VWEEEGLYG 162


>AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G15350.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:5340243-5341216 FORWARD LENGTH=161
          Length = 161

 Score =  123 bits (309), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 60/129 (46%), Positives = 91/129 (70%), Gaps = 5/129 (3%)

Query: 47  VRKPTI--SDGFWSSSTCDLDNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEG 104
           +RKP++  S+ FW+++T D++++   +  S+SS+ST N  +      SS+   AEFVN G
Sbjct: 36  LRKPSVVASEDFWTNTTLDMESN---AHGSVSSISTTNLTIDSQGCGSSSNEPAEFVNHG 92

Query: 105 LLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESLIGTRQPFPKSVPLSEMVEFLVD 164
           L+LWN++R QWVG  RS+ +    R+P LN N +YESL+G+ + FP+ +PL EMV+FLV+
Sbjct: 93  LVLWNQTRQQWVGDKRSESRKSVGREPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVE 152

Query: 165 VWEREGMYG 173
           VWE EG+YG
Sbjct: 153 VWEEEGLYG 161


>AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score =  122 bits (307), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 59/140 (42%), Positives = 91/140 (65%), Gaps = 7/140 (5%)

Query: 34  DEPAKGLRIQGQSVRKPTISDGFWSSSTCDLDNSTIQSQKSISSVSTLNNILYHSNGTSS 93
           D P+  +    +  +KP++S+ FWS+ST D+DN T  SQ S+SS    N      +   +
Sbjct: 20  DPPSNSI---ARPCKKPSVSEDFWSTSTVDMDNITFPSQGSLSSS---NQTFDSQSAARN 73

Query: 94  AGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTS-YESLIGTRQPFPKS 152
           +    E+VN+GLLLWN++R +WVG  +        +  KLNWNT+ Y+SL+G+ + FP+ 
Sbjct: 74  SNAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQP 133

Query: 153 VPLSEMVEFLVDVWEREGMY 172
           +PL+EMV+FLVD+WE+EG+Y
Sbjct: 134 IPLTEMVDFLVDIWEQEGLY 153


>AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score =  122 bits (307), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 59/140 (42%), Positives = 91/140 (65%), Gaps = 7/140 (5%)

Query: 34  DEPAKGLRIQGQSVRKPTISDGFWSSSTCDLDNSTIQSQKSISSVSTLNNILYHSNGTSS 93
           D P+  +    +  +KP++S+ FWS+ST D+DN T  SQ S+SS    N      +   +
Sbjct: 20  DPPSNSI---ARPCKKPSVSEDFWSTSTVDMDNITFPSQGSLSSS---NQTFDSQSAARN 73

Query: 94  AGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTS-YESLIGTRQPFPKS 152
           +    E+VN+GLLLWN++R +WVG  +        +  KLNWNT+ Y+SL+G+ + FP+ 
Sbjct: 74  SNAPPEYVNQGLLLWNQTRERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQP 133

Query: 153 VPLSEMVEFLVDVWEREGMY 172
           +PL+EMV+FLVD+WE+EG+Y
Sbjct: 134 IPLTEMVDFLVDIWEQEGLY 153


>AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT3G15770.2); Has 145 Blast
           hits to 145 proteins in 25 species: Archae - 0; Bacteria
           - 0; Metazoa - 0; Fungi - 0; Plants - 138; Viruses - 0;
           Other Eukaryotes - 7 (source: NCBI BLink). |
           chr1:5278481-5279056 REVERSE LENGTH=108
          Length = 108

 Score =  100 bits (249), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 48/110 (43%), Positives = 72/110 (65%), Gaps = 4/110 (3%)

Query: 64  LDNSTIQSQKSISSVSTLNNILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKK 123
           +DN T  SQ S+SS    N      +   ++    E+VN+GLLLWN++R +WVG  +   
Sbjct: 1   MDNITFPSQGSLSSS---NQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDKPNN 57

Query: 124 QTQQKRDPKLNWNTS-YESLIGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
                +  KLNWNT+ Y+SL+G+ + FP+ +PL+EMV+FLVD+WE+EG+Y
Sbjct: 58  PVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLY 107


>AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr3:20337078-20337786 REVERSE
           LENGTH=112
          Length = 112

 Score = 79.7 bits (195), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 56/81 (69%), Gaps = 1/81 (1%)

Query: 92  SSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESLIGTRQPFPK 151
           +++  T   VN G  +W E+R +WVG  +S+++    +D  ++W+T+YE L+ T +PF +
Sbjct: 32  TNSENTLTLVNHGAKMWQENREKWVG-DQSRQRKNTAKDQIISWSTTYEDLLSTHEPFSE 90

Query: 152 SVPLSEMVEFLVDVWEREGMY 172
           S+PL EMV+FLVD+W  EG+Y
Sbjct: 91  SIPLPEMVDFLVDIWYDEGLY 111


>AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 67.8 bits (164), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 83  NILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESL 142
           N+  +S+      +   FVN   + W E R +WVG   S + ++   +P + +N +YE L
Sbjct: 9   NVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGD-PSNRTSEMPDEPVIGFNATYEDL 67

Query: 143 IGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
           + +  PF K +PL+EMV+FL D+W  +G++
Sbjct: 68  LTSNTPFNKPIPLAEMVDFLFDIWHGDGLF 97


>AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 67.8 bits (164), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 83  NILYHSNGTSSAGTTAEFVNEGLLLWNESRLQWVGSGRSKKQTQQKRDPKLNWNTSYESL 142
           N+  +S+      +   FVN   + W E R +WVG   S + ++   +P + +N +YE L
Sbjct: 9   NVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGD-PSNRTSEMPDEPVIGFNATYEDL 67

Query: 143 IGTRQPFPKSVPLSEMVEFLVDVWEREGMY 172
           + +  PF K +PL+EMV+FL D+W  +G++
Sbjct: 68  LTSNTPFNKPIPLAEMVDFLFDIWHGDGLF 97