Miyakogusa Predicted Gene

Lj1g3v4955320.2
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v4955320.2 Non Characterized Hit- tr|A5C2D9|A5C2D9_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,43.97,1e-16,seg,NULL; FAMILY NOT NAMED,NULL;
coiled-coil,NULL,CUFF.33705.2
         (148 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G48860.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   130   3e-31
AT3G48860.2 | Symbols:  | unknown protein; INVOLVED IN: biologic...   130   4e-31
AT4G08630.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   128   2e-30
AT5G13260.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   127   3e-30
AT5G23700.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   126   5e-30
AT4G25070.2 | Symbols:  | unknown protein; EXPRESSED IN: culture...   112   1e-25
AT4G25070.1 | Symbols:  | unknown protein; EXPRESSED IN: culture...   112   1e-25

>AT3G48860.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G23700.1); Has 12232 Blast
           hits to 9546 proteins in 892 species: Archae - 172;
           Bacteria - 1174; Metazoa - 6487; Fungi - 1343; Plants -
           856; Viruses - 50; Other Eukaryotes - 2150 (source: NCBI
           BLink). | chr3:18117619-18120865 FORWARD LENGTH=494
          Length = 494

 Score =  130 bits (327), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 66/121 (54%), Positives = 84/121 (69%), Gaps = 2/121 (1%)

Query: 5   EEATSALEKLRLVTQRMILTPEEMEEVVLKRCWLARYWGLCVRHGIHAEIAEAKCKYWST 64
           +EA S  + LR +TQRMILT +EMEEVVLKRCWLARYWGL V+HGI A+IA ++ ++WS 
Sbjct: 320 QEAESEAKSLRTMTQRMILTQDEMEEVVLKRCWLARYWGLAVQHGICADIAPSRQEHWSK 379

Query: 65  FAPNPVEVVLAAGEKAKXXX--XXXXXXXXXXXXXNELSGEGNVENMLFVEQGLRELVSL 122
            AP P E+V +A +KAK                  ++L+GEGN+E+ML VE GLREL SL
Sbjct: 380 LAPLPFELVTSAAQKAKELSWDKGGNDRSKAARDLSDLTGEGNIESMLSVEMGLRELASL 439

Query: 123 K 123
           K
Sbjct: 440 K 440


>AT3G48860.2 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G23700.1); Has 12429 Blast
           hits to 9751 proteins in 897 species: Archae - 180;
           Bacteria - 1190; Metazoa - 6552; Fungi - 1361; Plants -
           886; Viruses - 50; Other Eukaryotes - 2210 (source: NCBI
           BLink). | chr3:18117619-18121853 FORWARD LENGTH=577
          Length = 577

 Score =  130 bits (326), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 66/121 (54%), Positives = 84/121 (69%), Gaps = 2/121 (1%)

Query: 5   EEATSALEKLRLVTQRMILTPEEMEEVVLKRCWLARYWGLCVRHGIHAEIAEAKCKYWST 64
           +EA S  + LR +TQRMILT +EMEEVVLKRCWLARYWGL V+HGI A+IA ++ ++WS 
Sbjct: 320 QEAESEAKSLRTMTQRMILTQDEMEEVVLKRCWLARYWGLAVQHGICADIAPSRQEHWSK 379

Query: 65  FAPNPVEVVLAAGEKAKXXX--XXXXXXXXXXXXXNELSGEGNVENMLFVEQGLRELVSL 122
            AP P E+V +A +KAK                  ++L+GEGN+E+ML VE GLREL SL
Sbjct: 380 LAPLPFELVTSAAQKAKELSWDKGGNDRSKAARDLSDLTGEGNIESMLSVEMGLRELASL 439

Query: 123 K 123
           K
Sbjct: 440 K 440


>AT4G08630.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G48860.2); Has 1487 Blast hits to 747 proteins
           in 184 species: Archae - 0; Bacteria - 56; Metazoa -
           305; Fungi - 197; Plants - 180; Viruses - 3; Other
           Eukaryotes - 746 (source: NCBI BLink). |
           chr4:5506998-5511959 REVERSE LENGTH=845
          Length = 845

 Score =  128 bits (321), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 70/174 (40%), Positives = 92/174 (52%), Gaps = 36/174 (20%)

Query: 6   EATSALEKLRLVTQRMILTPEEMEEVVLKRCWLARYWGLCVRHGIHAEIAEAKCKYWSTF 65
           E    L  L+ VT+R+ILT EEMEEVVLKRCWL+RYWGLCVRHGI  +IA  K +YWS+F
Sbjct: 541 EVELELNSLKTVTKRLILTQEEMEEVVLKRCWLSRYWGLCVRHGIQPDIAGGKHEYWSSF 600

Query: 66  APNPVEVVLAAGEKAK------------------------------------XXXXXXXX 89
           AP P+E+VL+AG++A+                                            
Sbjct: 601 APLPLEIVLSAGQRARDGVSQCNIFHLAAEISLELFGIVLTSLVLTLWSPHQAANNTYGE 660

Query: 90  XXXXXXXXNELSGEGNVENMLFVEQGLRELVSLKKGGRSIGSCNGSTQASKCIE 143
                    E SGEGN+ENM++VE+GLREL SLK     I   +    + +C++
Sbjct: 661 REKSLQNLQETSGEGNLENMIWVEKGLRELASLKNQSSVIQETDLKYDSLRCLK 714


>AT5G13260.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G48860.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:4243164-4246677 FORWARD LENGTH=537
          Length = 537

 Score =  127 bits (319), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 66/122 (54%), Positives = 83/122 (68%), Gaps = 5/122 (4%)

Query: 7   ATSALEKLRLVTQRMILTPEEMEEVVLKRCWLARYWGLCVRHGIHAEIAEAKCKYWSTFA 66
           A S +  LR +T RMILT +EMEEVVLKRCWLARYWGL  R+GI ++IA +K +YWS+ A
Sbjct: 298 AESEVNGLRTMTHRMILTQKEMEEVVLKRCWLARYWGLASRYGICSDIATSKYEYWSSLA 357

Query: 67  PNPVEVVLAAGEKAKXXXXXXXXXXXXXXXX-----NELSGEGNVENMLFVEQGLRELVS 121
           P P E+VL+AG+KAK                     N+L+GEGN+E+ML VE GL+EL S
Sbjct: 358 PLPFEIVLSAGQKAKEESWEKESEENEKRSQLVQDINDLTGEGNIESMLSVEMGLKELTS 417

Query: 122 LK 123
           LK
Sbjct: 418 LK 419


>AT5G23700.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G48860.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:7992851-7996420 FORWARD LENGTH=573
          Length = 573

 Score =  126 bits (317), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 63/119 (52%), Positives = 82/119 (68%), Gaps = 6/119 (5%)

Query: 5   EEATSALEKLRLVTQRMILTPEEMEEVVLKRCWLARYWGLCVRHGIHAEIAEAKCKYWST 64
           +EA S  + LR++TQRM+LT +EMEEV LKRCWLARYWGL V+HGI A+IA ++ + WS 
Sbjct: 294 QEAESEAKALRIMTQRMVLTQDEMEEVALKRCWLARYWGLAVQHGICADIAPSRHEKWSA 353

Query: 65  FAPNPVEVVLAAGEKAKXXXXXXXXXXXXXXXXNELSGEGNVENMLFVEQGLRELVSLK 123
            AP P E+V++A +K K                ++L GEGN+E+ML VE GLREL SLK
Sbjct: 354 LAPLPFELVISAAQKTK------DDQSKTARFLSDLPGEGNIESMLSVEMGLRELASLK 406


>AT4G25070.2 | Symbols:  | unknown protein; EXPRESSED IN: cultured
           cell; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT3G48860.2); Has 30201 Blast hits
           to 17322 proteins in 780 species: Archae - 12; Bacteria
           - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037;
           Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr4:12872482-12876468 FORWARD LENGTH=767
          Length = 767

 Score =  112 bits (280), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 57/122 (46%), Positives = 80/122 (65%), Gaps = 3/122 (2%)

Query: 5   EEATSALEKLRLVTQRMILTPEEMEEVVLKRCWLARYWGLCVRHGIHAEIAEAKCKYWST 64
           +E  + ++ LR +  R IL+ EEMEEVVLKRCWLARYW L V+HGI  +I+ ++ ++WS 
Sbjct: 511 QEVEAEIKSLRTMIHRTILSQEEMEEVVLKRCWLARYWELAVQHGICEDISTSRYEHWSA 570

Query: 65  FAPNPVEVVLAAGEKAK---XXXXXXXXXXXXXXXXNELSGEGNVENMLFVEQGLRELVS 121
            AP P EVVL+A +K++                   ++L+GEGN+E+ML VE GLRE+ S
Sbjct: 571 LAPLPSEVVLSAAQKSEDSWQTGGSDRTWSKVISNFSDLNGEGNIESMLAVETGLREIAS 630

Query: 122 LK 123
           LK
Sbjct: 631 LK 632


>AT4G25070.1 | Symbols:  | unknown protein; EXPRESSED IN: cultured
           cell; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT3G48860.2); Has 14837 Blast hits
           to 10961 proteins in 1163 species: Archae - 189;
           Bacteria - 1924; Metazoa - 7665; Fungi - 1127; Plants -
           653; Viruses - 80; Other Eukaryotes - 3199 (source: NCBI
           BLink). | chr4:12872482-12876468 FORWARD LENGTH=765
          Length = 765

 Score =  112 bits (279), Expect = 1e-25,   Method: Composition-based stats.
 Identities = 57/122 (46%), Positives = 80/122 (65%), Gaps = 3/122 (2%)

Query: 5   EEATSALEKLRLVTQRMILTPEEMEEVVLKRCWLARYWGLCVRHGIHAEIAEAKCKYWST 64
           +E  + ++ LR +  R IL+ EEMEEVVLKRCWLARYW L V+HGI  +I+ ++ ++WS 
Sbjct: 509 QEVEAEIKSLRTMIHRTILSQEEMEEVVLKRCWLARYWELAVQHGICEDISTSRYEHWSA 568

Query: 65  FAPNPVEVVLAAGEKAK---XXXXXXXXXXXXXXXXNELSGEGNVENMLFVEQGLRELVS 121
            AP P EVVL+A +K++                   ++L+GEGN+E+ML VE GLRE+ S
Sbjct: 569 LAPLPSEVVLSAAQKSEDSWQTGGSDRTWSKVISNFSDLNGEGNIESMLAVETGLREIAS 628

Query: 122 LK 123
           LK
Sbjct: 629 LK 630