Miyakogusa Predicted Gene

Lj4g3v0685320.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v0685320.1 CUFF.47932.1
         (777 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G22795.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   108   1e-23
AT4G37820.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   105   9e-23
AT4G33740.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    87   4e-17
AT4G33740.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...    87   4e-17
AT4G33740.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    87   4e-17

>AT2G22795.1 | Symbols:  | unknown protein; BEST Arabidopsis
          thaliana protein match is: unknown protein
          (TAIR:AT4G37820.1); Has 799854 Blast hits to 188815
          proteins in 4452 species: Archae - 4529; Bacteria -
          144236; Metazoa - 287749; Fungi - 87083; Plants -
          43826; Viruses - 3662; Other Eukaryotes - 228769
          (source: NCBI BLink). | chr2:9697380-9699584 REVERSE
          LENGTH=734
          Length = 734

 Score =  108 bits (271), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 46/73 (63%), Positives = 56/73 (76%), Gaps = 4/73 (5%)

Query: 1  MFRLSPRRSQRSKGFKVKHALQICVLLGVCIWLVYQIKHTNEKKASYGESAKT----GND 56
          MFR SPRR QRSKGFKVKH +Q+ +LL V IWL+YQ+KH++EKKA + ESAK      + 
Sbjct: 1  MFRSSPRRGQRSKGFKVKHCIQLTLLLSVGIWLLYQVKHSHEKKAQFEESAKIVVGGVDK 60

Query: 57 VVRLGRKDLNPLV 69
          VV+LGRKDL P V
Sbjct: 61 VVKLGRKDLIPRV 73


>AT4G37820.1 | Symbols:  | unknown protein; BEST Arabidopsis
          thaliana protein match is: unknown protein
          (TAIR:AT2G22795.1); Has 433572 Blast hits to 177005
          proteins in 4263 species: Archae - 2016; Bacteria -
          67591; Metazoa - 157995; Fungi - 49745; Plants - 22011;
          Viruses - 2192; Other Eukaryotes - 132022 (source: NCBI
          BLink). | chr4:17785692-17787290 FORWARD LENGTH=532
          Length = 532

 Score =  105 bits (263), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 44/77 (57%), Positives = 56/77 (72%), Gaps = 6/77 (7%)

Query: 1  MFRLSPRRSQRSKGFKVKHALQICVLLGVCIWLVYQIKHTNEKKASYGESAK-----TGN 55
          M R +PRR QRSKG KVKH +Q+ +LLGV IWL+YQ+KH++EKKA +  ++K       N
Sbjct: 1  MVR-TPRRGQRSKGIKVKHCIQLTLLLGVGIWLIYQMKHSHEKKAEFEGTSKIVVDDIDN 59

Query: 56 DVVRLGRKDLNPLVEET 72
           VV LGRKDL P +EET
Sbjct: 60 TVVNLGRKDLRPRIEET 76


>AT4G33740.3 | Symbols:  | unknown protein; FUNCTIONS IN:
          molecular_function unknown; INVOLVED IN:
          biological_process unknown; LOCATED IN:
          cellular_component unknown; BEST Arabidopsis thaliana
          protein match is: unknown protein (TAIR:AT4G37820.1);
          Has 138092 Blast hits to 73110 proteins in 2951
          species: Archae - 732; Bacteria - 17903; Metazoa -
          48520; Fungi - 16808; Plants - 7078; Viruses - 1044;
          Other Eukaryotes - 46007 (source: NCBI BLink). |
          chr4:16187384-16188802 FORWARD LENGTH=472
          Length = 472

 Score = 87.4 bits (215), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 38/70 (54%), Positives = 49/70 (70%), Gaps = 10/70 (14%)

Query: 8  RSQR-SKGFKVKHALQICVLLGVCIWLVYQIKHTNEKKASYGES---------AKTGNDV 57
          RSQR SKG K KH LQICVLLGVCIWL+YQ+K++++KK  + E          ++  + V
Sbjct: 7  RSQRGSKGIKGKHVLQICVLLGVCIWLIYQVKYSHDKKKEFYEKDVEKSTVLLSEVEDGV 66

Query: 58 VRLGRKDLNP 67
          V+LGRKDL P
Sbjct: 67 VKLGRKDLLP 76


>AT4G33740.2 | Symbols:  | unknown protein; BEST Arabidopsis
          thaliana protein match is: unknown protein
          (TAIR:AT4G37820.1); Has 138092 Blast hits to 73110
          proteins in 2951 species: Archae - 732; Bacteria -
          17903; Metazoa - 48520; Fungi - 16808; Plants - 7078;
          Viruses - 1044; Other Eukaryotes - 46007 (source: NCBI
          BLink). | chr4:16187384-16188802 FORWARD LENGTH=472
          Length = 472

 Score = 87.4 bits (215), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 38/70 (54%), Positives = 49/70 (70%), Gaps = 10/70 (14%)

Query: 8  RSQR-SKGFKVKHALQICVLLGVCIWLVYQIKHTNEKKASYGES---------AKTGNDV 57
          RSQR SKG K KH LQICVLLGVCIWL+YQ+K++++KK  + E          ++  + V
Sbjct: 7  RSQRGSKGIKGKHVLQICVLLGVCIWLIYQVKYSHDKKKEFYEKDVEKSTVLLSEVEDGV 66

Query: 58 VRLGRKDLNP 67
          V+LGRKDL P
Sbjct: 67 VKLGRKDLLP 76


>AT4G33740.1 | Symbols:  | unknown protein; BEST Arabidopsis
          thaliana protein match is: unknown protein
          (TAIR:AT4G37820.1); Has 138210 Blast hits to 73191
          proteins in 2959 species: Archae - 732; Bacteria -
          18006; Metazoa - 48521; Fungi - 16820; Plants - 7078;
          Viruses - 1046; Other Eukaryotes - 46007 (source: NCBI
          BLink). | chr4:16187384-16188802 FORWARD LENGTH=472
          Length = 472

 Score = 87.4 bits (215), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 38/70 (54%), Positives = 49/70 (70%), Gaps = 10/70 (14%)

Query: 8  RSQR-SKGFKVKHALQICVLLGVCIWLVYQIKHTNEKKASYGES---------AKTGNDV 57
          RSQR SKG K KH LQICVLLGVCIWL+YQ+K++++KK  + E          ++  + V
Sbjct: 7  RSQRGSKGIKGKHVLQICVLLGVCIWLIYQVKYSHDKKKEFYEKDVEKSTVLLSEVEDGV 66

Query: 58 VRLGRKDLNP 67
          V+LGRKDL P
Sbjct: 67 VKLGRKDLLP 76