Miyakogusa Predicted Gene

Lj6g3v1333100.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v1333100.1 Non Chatacterized Hit- tr|I1LVM4|I1LVM4_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.32283
PE,40.14,1e-17,seg,NULL,CUFF.59354.1
         (163 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G19180.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    79   1e-15
AT1G16840.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    65   1e-11
AT1G16840.3 | Symbols:  | unknown protein; BEST Arabidopsis thal...    65   1e-11
AT1G16840.4 | Symbols:  | unknown protein; BEST Arabidopsis thal...    65   1e-11
AT1G16840.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    59   1e-09
AT1G78890.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    56   1e-08

>AT2G19180.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G16840.1); Has 64 Blast hits to 64 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 64; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr2:8324742-8325575 FORWARD
           LENGTH=179
          Length = 179

 Score = 79.3 bits (194), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 77/147 (52%), Gaps = 7/147 (4%)

Query: 24  AVCHRHRSSRATKATLTEIDAEHEV---TLKMFDDLIQRILVKKATPDWLPFLPGYSFWX 80
            VC   R S   +    EID+++E     LK  ++ ++RI+V  +TPDWLPF PG SFW 
Sbjct: 31  VVCFSRRFSSIPQVIELEIDSKNEAEAAILKKLNEFVRRIIVHNSTPDWLPFAPGSSFWV 90

Query: 81  XX-XXXXXXXXHLAHRFNSSDQPQDALNLESHHGWPDPNYFL---QGNAPAHSGESGVEL 136
                      +L  +  +    +++L+L S +GWP  ++F+    G++     E+ VEL
Sbjct: 91  PPHQITATKIANLVDKVTNPLTEEESLSLSSPYGWPCSSFFIPPPDGSSSTQEEEASVEL 150

Query: 137 NLPEEGTVKVKVITFSDNVANSEDEEG 163
            +P    ++VK+  + D + + + E+G
Sbjct: 151 KIPGNEMLEVKLAHYPDPIYSFKPEDG 177


>AT1G16840.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G78890.1); Has 71 Blast hits to 71 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 71; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr1:5763111-5763984 REVERSE
           LENGTH=161
          Length = 161

 Score = 65.5 bits (158), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 18/116 (15%)

Query: 50  LKMFDDLIQRILVKKATPDWLPFLPGYSFWXXXXXXXXXXXHLAHRFNSSDQP---QDAL 106
           ++  +D + RI V++A PDWLPF+PG S+W            +A        P   +++L
Sbjct: 60  IQKLEDAVHRIFVRRAQPDWLPFVPGASYWVPPPGSGSQSHGIAQLVVKLANPLTHEESL 119

Query: 107 NLESHHGWPDPNYFLQGNAPAHSGESGVELNLPEEGTVKVKVITFSDNVANSEDEE 162
           +  S HGWP  +YFL+G  P                 ++ K  T S++ ++SEDEE
Sbjct: 120 STNSSHGWPSSDYFLKGVQPQ---------------LMETKTETTSNSESHSEDEE 160


>AT1G16840.3 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G78890.1); Has 71 Blast hits to 71 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 71; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr1:5763111-5763984 REVERSE
           LENGTH=161
          Length = 161

 Score = 65.5 bits (158), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 18/116 (15%)

Query: 50  LKMFDDLIQRILVKKATPDWLPFLPGYSFWXXXXXXXXXXXHLAHRFNSSDQP---QDAL 106
           ++  +D + RI V++A PDWLPF+PG S+W            +A        P   +++L
Sbjct: 60  IQKLEDAVHRIFVRRAQPDWLPFVPGASYWVPPPGSGSQSHGIAQLVVKLANPLTHEESL 119

Query: 107 NLESHHGWPDPNYFLQGNAPAHSGESGVELNLPEEGTVKVKVITFSDNVANSEDEE 162
           +  S HGWP  +YFL+G  P                 ++ K  T S++ ++SEDEE
Sbjct: 120 STNSSHGWPSSDYFLKGVQPQ---------------LMETKTETTSNSESHSEDEE 160


>AT1G16840.4 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G78890.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr1:5763111-5763984 REVERSE LENGTH=161
          Length = 161

 Score = 65.5 bits (158), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 18/116 (15%)

Query: 50  LKMFDDLIQRILVKKATPDWLPFLPGYSFWXXXXXXXXXXXHLAHRFNSSDQP---QDAL 106
           ++  +D + RI V++A PDWLPF+PG S+W            +A        P   +++L
Sbjct: 60  IQKLEDAVHRIFVRRAQPDWLPFVPGASYWVPPPGSGSQSHGIAQLVVKLANPLTHEESL 119

Query: 107 NLESHHGWPDPNYFLQGNAPAHSGESGVELNLPEEGTVKVKVITFSDNVANSEDEE 162
           +  S HGWP  +YFL+G  P                 ++ K  T S++ ++SEDEE
Sbjct: 120 STNSSHGWPSSDYFLKGVQPQ---------------LMETKTETTSNSESHSEDEE 160


>AT1G16840.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G78890.1); Has 71 Blast hits to 71 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 71; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr1:5763499-5763984 REVERSE
           LENGTH=161
          Length = 161

 Score = 59.3 bits (142), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 9/98 (9%)

Query: 36  KATLTEIDAE------HEVTLKMFDDLIQRILVKKATPDWLPFLPGYSFWXXXXXXXXXX 89
           +  L EID         +  ++  +D + RI V++A PDWLPF+PG S+W          
Sbjct: 40  RGDLYEIDTSAASQSPSDPLIQKLEDAVHRIFVRRAQPDWLPFVPGASYWVPPPGSGSQS 99

Query: 90  XHLAHRFNSSDQP---QDALNLESHHGWPDPNYFLQGN 124
             +A        P   +++L+  S HGWP  +YFL+G+
Sbjct: 100 HGIAQLVVKLANPLTHEESLSTNSSHGWPSSDYFLKGS 137


>AT1G78890.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G16840.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr1:29656623-29657537 FORWARD LENGTH=155
          Length = 155

 Score = 55.8 bits (133), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 1/91 (1%)

Query: 37  ATLTEIDAEHEVTLKMFDDLIQRILVKKATPDWLPFLPGYSFWXXXXXXXX-XXXHLAHR 95
             + EID   +  +   +D + RI+V+++ PDWLPF+PG SFW             L  +
Sbjct: 46  GVIYEIDIAADPLVNKLEDAVHRIMVRRSAPDWLPFVPGASFWVPPPRSQSHGIAKLVEK 105

Query: 96  FNSSDQPQDALNLESHHGWPDPNYFLQGNAP 126
             +    ++++++ S  GWP  +YF++G  P
Sbjct: 106 LANPISDEESISISSVRGWPCSDYFIKGVKP 136