Miyakogusa Predicted Gene

Lj4g3v2264010.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v2264010.1 tr|C1MHW5|C1MHW5_MICPC Predicted protein
OS=Micromonas pusilla (strain CCMP1545)
GN=MICPUCDRAFT_4627,39.82,0.000000000000001,seg,NULL; FAMILY NOT
NAMED,NULL,CUFF.50626.1
         (528 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G16180.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   760   0.0  
AT4G16180.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   140   2e-33
AT3G28720.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    57   4e-08
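
The hit summary above can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself: it assumes each summary line starts with the subject ID and ends with the bit score and E-value, as the three lines above do. The names `SUMMARY`, `HIT_LINE`, and `parse_hits` are hypothetical helpers introduced for illustration.

```python
import re

# Hit-summary lines copied from the table above.
SUMMARY = """\
AT4G16180.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   760   0.0
AT4G16180.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   140   2e-33
AT3G28720.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    57   4e-08
"""

# Subject ID first, then free text, then bit score and E-value as the
# last two whitespace-separated fields (an assumption about the layout).
HIT_LINE = re.compile(r"^(\S+)\s.*?\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")


def parse_hits(text):
    """Return a list of (subject_id, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = HIT_LINE.match(line)
        if not match:
            continue
        subject, bits, evalue = match.groups()
        # Legacy BLAST sometimes prints E-values such as "e-100"
        # without a leading digit; prepend "1" so float() accepts it.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hits.append((subject, float(bits), float(evalue)))
    return hits


if __name__ == "__main__":
    for subject, bits, evalue in parse_hits(SUMMARY):
        print(subject, bits, evalue)
```

For anything beyond a quick look, a maintained BLAST parser is likely a safer choice than a hand-written regular expression.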

>AT4G16180.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT3G28720.1); Has 5 Blast hits to 5
           proteins in 3 species: Archae - 0; Bacteria - 0; Metazoa
           - 0; Fungi - 0; Plants - 4; Viruses - 0; Other
           Eukaryotes - 1 (source: NCBI BLink). |
           chr4:9165365-9170323 REVERSE LENGTH=820
          Length = 820

 Score =  760 bits (1963), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/515 (71%), Positives = 427/515 (82%), Gaps = 3/515 (0%)

Query: 1   MVPAGKARETEFGREVPLFEVEATAVEPVFQKLYSYIFDMDSVGSSVTEMDRPVPSAIFI 60
           MVPAG A E +FGR +P ++VEA  VE  F +LYSYIFD+D    S    D+P+PSAIF+
Sbjct: 154 MVPAGTALEADFGRHLPAYDVEAIKVESAFNQLYSYIFDIDVGSGSAATADKPIPSAIFV 213

Query: 61  VNFDKVRVDPRNKEIDLDSLMYGKIPKLTEEDMKGQEXXXXXXXXXXXXXATQVWLSSGR 120
           VNFDKVR+DP+N EIDLDSLM+ K+P+L++ D + QE             A+QVWL+SGR
Sbjct: 214 VNFDKVRMDPKNTEIDLDSLMFAKLPELSDADKEKQEADYIYRYRYNGGGASQVWLASGR 273

Query: 121 FVVIDLSAGPCTYGKIEAEEGSVSSRTLPRLRNVMHLS--STPSYQSSSDIFLGQLASLV 178
           +VVIDLSAGPCTYGKIE EEGSVS RT+PR+RN++     S   +QS+ DIF GQLA+LV
Sbjct: 274 YVVIDLSAGPCTYGKIETEEGSVSPRTVPRIRNIVLPGNVSPVGHQSTHDIFSGQLAALV 333

Query: 179 STTVEHVIAPDVRFETVDLASRLLIPIIVLQNHNRYNIMEKGHNYSINIEEIRAEVKNLL 238
           +TT+EHVIAPDVRFETVDLA+R+L+PIIVLQNHNRYNIME+G NYSINIEEI +EVK ++
Sbjct: 334 ATTIEHVIAPDVRFETVDLATRVLVPIIVLQNHNRYNIMERGQNYSINIEEIESEVKKMI 393

Query: 239 NDGQEVVIVGGAHSLHRHEKLEIAVSKAMRGHSLQETKNDGRFHVHTKTYLDGAILKEEM 298
           + GQEVVIVGGAH LHRHEKL IAVSKAMRGHSLQETK DGRFHVHTKTYLDGAILKEEM
Sbjct: 394 HHGQEVVIVGGAHPLHRHEKLAIAVSKAMRGHSLQETKKDGRFHVHTKTYLDGAILKEEM 453

Query: 299 ERSADVLSAGLLEVADPSLSSKYFLRQHWMDESEGSTDSILKHKPLWASYNS-XXXXXXX 357
           ERS DVL+AGLL+V+DP LS+KYFLRQ W DESEGS+DSI+KH+PLW+SY+S        
Sbjct: 454 ERSTDVLAAGLLDVSDPGLSNKYFLRQSWDDESEGSSDSIVKHRPLWSSYSSKLQKGKKK 513

Query: 358 XXXXXQGDLQPTYGTRVVPVFVLSLADVDSNLMMEDESMVWTSNDVVIVLEHQNAKIPLS 417
                +GDL  TYGTRV+PVF+LSLADVD  LMMEDES+VW S+DVVIVL+H N KIPLS
Sbjct: 514 KAVKKKGDLYRTYGTRVIPVFILSLADVDPMLMMEDESLVWASSDVVIVLQHLNEKIPLS 573

Query: 418 YVSETYRRHALPSQAQRHILAGIASVVGGLSAPYEKASHVHERPVVNWLWAAGCHPFGPF 477
           YVSET R+HA+PSQ QRH+LAGIAS +GG+SAPYEK SH HERP+ NWLWAAGCHPFGPF
Sbjct: 574 YVSETERQHAVPSQVQRHVLAGIASALGGVSAPYEKTSHAHERPITNWLWAAGCHPFGPF 633

Query: 478 SNTSHISQMLRDVALRNSIYARVDSVLRKIRETSE 512
           SN S ISQML+DVALRN+IYARVDS LRKIRETSE
Sbjct: 634 SNVSLISQMLQDVALRNTIYARVDSALRKIRETSE 668


>AT4G16180.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           Has 25 Blast hits to 25 proteins in 9 species: Archae -
           0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 19;
           Viruses - 0; Other Eukaryotes - 6 (source: NCBI BLink).
           | chr4:9168562-9170323 REVERSE LENGTH=273
          Length = 273

 Score =  140 bits (353), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 65/120 (54%), Positives = 84/120 (70%)

Query: 1   MVPAGKARETEFGREVPLFEVEATAVEPVFQKLYSYIFDMDSVGSSVTEMDRPVPSAIFI 60
           MVPAG A E +FGR +P ++VEA  VE  F +LYSYIFD+D    S    D+P+PSAIF+
Sbjct: 154 MVPAGTALEADFGRHLPAYDVEAIKVESAFNQLYSYIFDIDVGSGSAATADKPIPSAIFV 213

Query: 61  VNFDKVRVDPRNKEIDLDSLMYGKIPKLTEEDMKGQEXXXXXXXXXXXXXATQVWLSSGR 120
           VNFDKVR+DP+N EIDLDSLM+ K+P+L++ D + QE             A+QVWL+SGR
Sbjct: 214 VNFDKVRMDPKNTEIDLDSLMFAKLPELSDADKEKQEADYIYRYRYNGGGASQVWLASGR 273


>AT3G28720.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 25 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G58100.1); Has 1610 Blast hits to 344 proteins
           in 85 species: Archae - 0; Bacteria - 567; Metazoa - 95;
           Fungi - 71; Plants - 145; Viruses - 0; Other Eukaryotes
           - 732 (source: NCBI BLink). | chr3:10782276-10784339
           FORWARD LENGTH=687
          Length = 687

 Score = 57.0 bits (136), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 65/132 (49%), Gaps = 1/132 (0%)

Query: 373 RVVPVFVLSLADVDSNLMMEDESMVWTSNDVVIVLEHQNAKIPLSYVSETYRRHALPSQA 432
           RV+PV+V  L D+++ L+++         D+VI +  +  +    Y              
Sbjct: 433 RVLPVYVFDL-DINTPLLLDRYHQSVAFRDMVIAVRTRGTQTVSDYTCNGRHVFVHTRDL 491

Query: 433 QRHILAGIASVVGGLSAPYEKASHVHERPVVNWLWAAGCHPFGPFSNTSHISQMLRDVAL 492
           +R ++  I   + G+S+ +   S  H   +V++ W+ G  PFGPFS+ S +S + +D A 
Sbjct: 492 ERPLVGSILQSMWGVSSTHLTWSPRHNTTLVDYTWSIGQTPFGPFSDISSLSFVQKDAAK 551

Query: 493 RNSIYARVDSVL 504
           RN I   +++ +
Sbjct: 552 RNVILTSLNTTI 563
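
The bit scores and E-values reported above are related, to a first approximation, by E = m * n * 2^(-S'), where S' is the bit score, m the query length, and n the total number of letters in the database. The sketch below applies that relation to the three hits using the raw lengths printed in this report (528 query letters; 14,482,855 database letters). BLAST itself uses edge-corrected effective lengths, so the values computed here are expected to match the reported ones only in order of magnitude. The function name `approx_evalue` is a hypothetical helper, not a BLAST API.

```python
# Lengths taken from the report header above.
QUERY_LEN = 528            # "(528 letters)"
DB_LETTERS = 14_482_855    # "14,482,855 total letters"

# (subject, bit score, reported E-value) from the alignment headers.
HITS = [
    ("AT4G16180.2", 760.0, 0.0),
    ("AT4G16180.1", 140.0, 2e-33),
    ("AT3G28720.1", 57.0, 4e-08),
]


def approx_evalue(bit_score, query_len, db_letters):
    """Approximate E-value as m * n * 2**(-bit_score).

    Uses raw lengths rather than BLAST's effective lengths, so the
    result is an upper-side estimate, not the exact reported value.
    """
    return query_len * db_letters * 2.0 ** (-bit_score)


if __name__ == "__main__":
    for subject, bits, reported in HITS:
        estimate = approx_evalue(bits, QUERY_LEN, DB_LETTERS)
        print(f"{subject}: bits={bits:g} "
              f"estimated E={estimate:.1e} reported E={reported:g}")
```

For the top hit the estimate is far smaller than anything BLAST prints, which is consistent with the reported value of 0.0.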