Miyakogusa Predicted Gene

Lj5g3v0068810.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0068810.1 Non Characterized Hit- tr|K4B4V9|K4B4V9_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,35.76,1e-18,Myb_DNA-bind_3,Myb/SANT-like domain;
seg,NULL,CUFF.52471.1
         (321 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G27260.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    73   3e-13
AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    59   5e-09
AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    55   8e-08
AT2G29880.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    49   4e-06

>AT5G27260.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G29880.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:9603943-9604930
           FORWARD LENGTH=303
          Length = 303

 Score = 72.8 bits (177), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 12/155 (7%)

Query: 9   QSKKRQ------WTAEEDAVLVAGLLQLVDDGWKADANSFKPGYTKVLEKHLQTKIPD-- 60
           Q KKR+      W+ EE  +LV  L++ +++ W+ D+N      T  +E     +I    
Sbjct: 5   QPKKRKKGDYNPWSPEETKLLVQLLVEGINNNWR-DSNGTISKLT--VETKFMPEINKEF 61

Query: 61  CKLKASPHIESRVKHIKKQYFAIKDMFGPSASGFGWDPTRNMIVVEREIYREWCKSHPVA 120
           C+ K   H  SR+K++K QY +  D+    +SGFGWDP         E++ ++ K+HP  
Sbjct: 62  CRSKNYNHYLSRMKYLKIQYQSCLDL-QRFSSGFGWDPLTKRFTASDEVWSDYLKAHPNN 120

Query: 121 VGLYGKPFPHFDSLDIVFGKDRATGTHAESPADAA 155
             L    F  FD L I+FG+  ATG +A    D+ 
Sbjct: 121 KQLRYDTFEFFDELQIIFGEGVATGKNAIGLCDST 155


>AT1G30140.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
           in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
           10 (source: NCBI BLink). | chr1:10598764-10599527
           FORWARD LENGTH=222
          Length = 222

 Score = 58.9 bits (141), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 68/144 (47%), Gaps = 10/144 (6%)

Query: 14  QWTAEEDAVLVAGLLQLVDDGWKADANSFKPGYTKVLEKHLQT--KIPDCKLKASPHIES 71
           QWT +E  VL+    +L+   W+  +     G   V  K L    K   C  K   +  S
Sbjct: 16  QWTPDETDVLI----ELIRQNWRDSSGII--GKLTVESKLLPALNKRLGCN-KNHKNYMS 68

Query: 72  RVKHIKKQYFAIKDMFGPSASGFGWDPTRNMIVVEREIYREWCKSHPVAVGLYGKPFPHF 131
           R+K +K  Y +  D+    +SGFGWDP         E++R++ K+HP    +  +   HF
Sbjct: 69  RLKFLKNLYQSYLDL-KRFSSGFGWDPETKKFTAPDEVWRDYLKAHPNHKHMQTESIDHF 127

Query: 132 DSLDIVFGKDRATGTHAESPADAA 155
           + L I+FG   ATG+ A   +D+ 
Sbjct: 128 EDLQIIFGDVVATGSFAVGMSDST 151


>AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10617263-10620034 FORWARD LENGTH=774
          Length = 774

 Score = 54.7 bits (130), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 46/80 (57%), Gaps = 2/80 (2%)

Query: 69  IESRVKHIKKQYFAIKDMFGPSASGFGWDPTRNMIVVEREIYREWCKSHPVAVGLYGKPF 128
           +++R KH+++ Y  IK  F    +GF WD  R+M++ + +I+  + ++HP A     K  
Sbjct: 378 LKNRYKHLRRLYNDIK--FLLEQNGFSWDARRDMVIADDDIWNTYIQAHPEARSYRVKTI 435

Query: 129 PHFDSLDIVFGKDRATGTHA 148
           P + +L  +FGK+ + G + 
Sbjct: 436 PSYPNLCFIFGKETSDGRYT 455


>AT2G29880.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G27260.1); Has 260 Blast hits to 212 proteins
           in 20 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 10; Plants - 240; Viruses - 0; Other Eukaryotes
           - 9 (source: NCBI BLink). | chr2:12742536-12743545
           FORWARD LENGTH=308
          Length = 308

 Score = 49.3 bits (116), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 38/152 (25%), Positives = 64/152 (42%), Gaps = 8/152 (5%)

Query: 3   GTTESQQSKKRQ------WTAEEDAVLVAGLLQLVDDGWKADANSFKPGYTKVLEKHLQT 56
           G    + SKK++      W+ +E   L A L+  +  GW+    +      +     L  
Sbjct: 4   GDQAGETSKKKKKGPYMSWSDQECYELTAILVDAIKRGWRDKNGTISKTTVERKILPLLN 63

Query: 57  KIPDCKLKASPHIESRVKHIKKQYFAIKDMFGPSASGFGWDPTRNMIVVEREIYREWCKS 116
           K   C    + ++ SR+K +KK+Y     +F  S SGFGWDP         +++  +   
Sbjct: 64  KKFKCNKTYTNYL-SRMKSMKKEYSVYAALFWFS-SGFGWDPITKQFTAPDDVWAAYLMG 121

Query: 117 HPVAVGLYGKPFPHFDSLDIVFGKDRATGTHA 148
           HP    +    F  F+ L ++F    A G +A
Sbjct: 122 HPNHHHMRTSTFEDFEDLQLIFESAIAKGNNA 153