Miyakogusa Predicted Gene

Lj3g3v0065550.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v0065550.1 Non Characterized Hit- tr|C6T0Q9|C6T0Q9_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.1641
PE=,81.53,0,DUF4050,Domain of unknown function DUF4050,CUFF.40268.1
         (157 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   177   2e-45
AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   177   2e-45
AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   162   6e-41
AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   162   6e-41
AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   152   8e-38
AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   148   2e-36
AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   143   4e-35
AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   126   5e-30
AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    91   2e-19
AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    86   1e-17
AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    86   1e-17

>AT5G25360.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1). | chr5:8799934-8802333 REVERSE
           LENGTH=169
          Length = 169

 Score =  177 bits (449), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 91/146 (62%), Positives = 105/146 (71%), Gaps = 3/146 (2%)

Query: 12  KPTLIATADVPSKRLTQHNKAVKKASTSEDFWTTSTHDMDNSAVQSQGSISSTSVTNLPA 71
           KP LI   D PSK L    + VKK S SEDFW+TST +MDNS +QSQ S+SS S TN   
Sbjct: 27  KPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMSSISFTN-NT 85

Query: 72  DPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATYDSLLASNK 131
              A +SNPTEFVNHGL LWNQTRQ+W+ N    +K   + REP +SWNATY+SLL  NK
Sbjct: 86  STSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKK--AKVREPTISWNATYESLLGMNK 143

Query: 132 PFPQPIPLAEMVDFLVDIWEQEGLYD 157
            F +PIPL EMVDFLVD+WEQEGLYD
Sbjct: 144 RFSRPIPLPEMVDFLVDVWEQEGLYD 169


>AT5G25360.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:8799934-8802333
           REVERSE LENGTH=169
          Length = 169

 Score =  177 bits (449), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 91/146 (62%), Positives = 105/146 (71%), Gaps = 3/146 (2%)

Query: 12  KPTLIATADVPSKRLTQHNKAVKKASTSEDFWTTSTHDMDNSAVQSQGSISSTSVTNLPA 71
           KP LI   D PSK L    + VKK S SEDFW+TST +MDNS +QSQ S+SS S TN   
Sbjct: 27  KPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDNSTLQSQRSMSSISFTN-NT 85

Query: 72  DPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATYDSLLASNK 131
              A +SNPTEFVNHGL LWNQTRQ+W+ N    +K   + REP +SWNATY+SLL  NK
Sbjct: 86  STSASTSNPTEFVNHGLNLWNQTRQQWLANGTSQKK--AKVREPTISWNATYESLLGMNK 143

Query: 132 PFPQPIPLAEMVDFLVDIWEQEGLYD 157
            F +PIPL EMVDFLVD+WEQEGLYD
Sbjct: 144 RFSRPIPLPEMVDFLVDVWEQEGLYD 169


>AT1G15350.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score =  162 bits (411), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 87/158 (55%), Positives = 107/158 (67%), Gaps = 7/158 (4%)

Query: 3   MGGCLGLHK--KPTLIATADVPSKRLTQHNKAVKKASTSEDFWTTSTHDMDNSAVQSQGS 60
           MGGC+G ++  + T  +  D PS  +    +  KK S SEDFW+TST DMDN    SQGS
Sbjct: 1   MGGCVGCYREHRSTAASLKDPPSNSIA---RPCKKPSVSEDFWSTSTVDMDNITFPSQGS 57

Query: 61  ISSTSVTNLPADPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWN 120
           +SS++ T         S+ P E+VN GL LWNQTR+RW+G +K P    + ++  KL+WN
Sbjct: 58  LSSSNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDK-PNNPVDHNQGAKLNWN 116

Query: 121 -ATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
            ATYDSLL SNK FPQPIPL EMVDFLVDIWEQEGLYD
Sbjct: 117 TATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 154


>AT1G15350.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:5278481-5279486 REVERSE
           LENGTH=154
          Length = 154

 Score =  162 bits (411), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 87/158 (55%), Positives = 107/158 (67%), Gaps = 7/158 (4%)

Query: 3   MGGCLGLHK--KPTLIATADVPSKRLTQHNKAVKKASTSEDFWTTSTHDMDNSAVQSQGS 60
           MGGC+G ++  + T  +  D PS  +    +  KK S SEDFW+TST DMDN    SQGS
Sbjct: 1   MGGCVGCYREHRSTAASLKDPPSNSIA---RPCKKPSVSEDFWSTSTVDMDNITFPSQGS 57

Query: 61  ISSTSVTNLPADPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWN 120
           +SS++ T         S+ P E+VN GL LWNQTR+RW+G +K P    + ++  KL+WN
Sbjct: 58  LSSSNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDK-PNNPVDHNQGAKLNWN 116

Query: 121 -ATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
            ATYDSLL SNK FPQPIPL EMVDFLVDIWEQEGLYD
Sbjct: 117 TATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 154


>AT4G32342.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25360.2);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr4:15615184-15616200
           REVERSE LENGTH=161
          Length = 161

 Score =  152 bits (384), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 82/158 (51%), Positives = 103/158 (65%), Gaps = 14/158 (8%)

Query: 1   MHMGGCLGLHKKPTLIATADVPSKRLTQHNKAVKK-ASTSEDFWTTSTHDMD-NSAVQSQ 58
           ++  GC    ++  L+   D PSK L    K VKK +++S+DFW+TST DMD N  +QSQ
Sbjct: 15  INCFGCCNRERR--LVVEVDEPSKGLKIQGKIVKKDSASSDDFWSTSTCDMDHNITIQSQ 72

Query: 59  GSISSTSVTNLPADPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLS 118
            S       N P DP   +SN TEFVNHGL LWN TRQ+W    +   +Q     EP +S
Sbjct: 73  SS-------NPPFDPQCSTSNSTEFVNHGLILWNHTRQQW---RECLTRQQCLVPEPAIS 122

Query: 119 WNATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLY 156
           WN+TYDSLL++NK FPQPIPL EMV FLVD+WE+EGLY
Sbjct: 123 WNSTYDSLLSTNKLFPQPIPLKEMVHFLVDVWEEEGLY 160


>AT3G15770.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G15350.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:5340243-5341216 FORWARD LENGTH=161
          Length = 161

 Score =  148 bits (373), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/164 (51%), Positives = 105/164 (64%), Gaps = 14/164 (8%)

Query: 3   MGGCLGLHKKPTLIATADVP----SKRLTQHN-KAVKKAST--SEDFWTTSTHDMDNSAV 55
           MG CL    K     + DVP       L + +   ++K S   SEDFWT +T DM+++A 
Sbjct: 1   MGSCLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA- 59

Query: 56  QSQGSISSTSVTNLPADPH-AGSSN--PTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQS 112
              GS+SS S TNL  D    GSS+  P EFVNHGL LWNQTRQ+W+G +KR E +    
Sbjct: 60  --HGSVSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQTRQQWVG-DKRSESRKSVG 116

Query: 113 REPKLSWNATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLY 156
           REP L+ N TY+SLL SNK FP+PIPL EMV FLV++WE+EGLY
Sbjct: 117 REPILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGLY 160


>AT3G15770.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 143 Blast hits to 143 proteins
           in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr3:5340243-5341216 FORWARD
           LENGTH=162
          Length = 162

 Score =  143 bits (361), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 82/161 (50%), Positives = 103/161 (63%), Gaps = 14/161 (8%)

Query: 6   CLGLHKKPTLIATADVP----SKRLTQHN-KAVKKAST--SEDFWTTSTHDMDNSAVQSQ 58
           CL    K     + DVP       L + +   ++K S   SEDFWT +T DM+++A    
Sbjct: 5   CLACFDKSKAKTSVDVPLNGTKDVLVEEDWSELRKPSVVASEDFWTNTTLDMESNA---H 61

Query: 59  GSISSTSVTNLPADPH-AGSSN--PTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREP 115
           GS+SS S TNL  D    GSS+  P EFVNHGL LWNQTRQ+W+G +KR E +    REP
Sbjct: 62  GSVSSISTTNLTIDSQGCGSSSNEPAEFVNHGLVLWNQTRQQWVG-DKRSESRKSVGREP 120

Query: 116 KLSWNATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLY 156
            L+ N TY+SLL SNK FP+PIPL EMV FLV++WE+EGLY
Sbjct: 121 ILNENVTYESLLGSNKRFPRPIPLDEMVQFLVEVWEEEGLY 161


>AT1G15350.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 12
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT3G15770.2); Has 145 Blast
           hits to 145 proteins in 25 species: Archae - 0; Bacteria
           - 0; Metazoa - 0; Fungi - 0; Plants - 138; Viruses - 0;
           Other Eukaryotes - 7 (source: NCBI BLink). |
           chr1:5278481-5279056 REVERSE LENGTH=108
          Length = 108

 Score =  126 bits (317), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 66/109 (60%), Positives = 78/109 (71%), Gaps = 2/109 (1%)

Query: 50  MDNSAVQSQGSISSTSVTNLPADPHAGSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQT 109
           MDN    SQGS+SS++ T         S+ P E+VN GL LWNQTR+RW+G +K P    
Sbjct: 1   MDNITFPSQGSLSSSNQTFDSQSAARNSNAPPEYVNQGLLLWNQTRERWVGKDK-PNNPV 59

Query: 110 EQSREPKLSWN-ATYDSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
           + ++  KL+WN ATYDSLL SNK FPQPIPL EMVDFLVDIWEQEGLYD
Sbjct: 60  DHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIWEQEGLYD 108


>AT3G54880.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G25360.2); Has 137 Blast hits to 137 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 136; Viruses - 0; Other Eukaryotes -
           1 (source: NCBI BLink). | chr3:20337078-20337786 REVERSE
           LENGTH=112
          Length = 112

 Score = 91.3 bits (225), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 39/82 (47%), Positives = 56/82 (68%), Gaps = 2/82 (2%)

Query: 76  GSSNPTEFVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATYDSLLASNKPFPQ 135
            S N    VNHG  +W + R++W+G+  R  K T  +++  +SW+ TY+ LL++++PF +
Sbjct: 33  NSENTLTLVNHGAKMWQENREKWVGDQSRQRKNT--AKDQIISWSTTYEDLLSTHEPFSE 90

Query: 136 PIPLAEMVDFLVDIWEQEGLYD 157
            IPL EMVDFLVDIW  EGLYD
Sbjct: 91  SIPLPEMVDFLVDIWYDEGLYD 112


>AT5G03440.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 85.5 bits (210), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 58/94 (61%), Gaps = 8/94 (8%)

Query: 70  PADPHAGSSNPTE------FVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATY 123
           P +  + SSN  E      FVNH    W + R++W+G+     + +E   EP + +NATY
Sbjct: 7   PGNVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGDPSN--RTSEMPDEPVIGFNATY 64

Query: 124 DSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
           + LL SN PF +PIPLAEMVDFL DIW  +GL++
Sbjct: 65  EDLLTSNTPFNKPIPLAEMVDFLFDIWHGDGLFE 98


>AT5G03440.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54880.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:857179-857898 REVERSE LENGTH=98
          Length = 98

 Score = 85.5 bits (210), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 58/94 (61%), Gaps = 8/94 (8%)

Query: 70  PADPHAGSSNPTE------FVNHGLTLWNQTRQRWIGNNKRPEKQTEQSREPKLSWNATY 123
           P +  + SSN  E      FVNH    W + R++W+G+     + +E   EP + +NATY
Sbjct: 7   PGNVSSNSSNDKEKSSEEIFVNHAEIAWQEMRKKWVGDPSN--RTSEMPDEPVIGFNATY 64

Query: 124 DSLLASNKPFPQPIPLAEMVDFLVDIWEQEGLYD 157
           + LL SN PF +PIPLAEMVDFL DIW  +GL++
Sbjct: 65  EDLLTSNTPFNKPIPLAEMVDFLFDIWHGDGLFE 98