Miyakogusa Predicted Gene

Lj0g3v0305589.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0305589.1 Non Characterized Hit- tr|D8SUN4|D8SUN4_SELML
Putative uncharacterized protein OS=Selaginella
moelle,28.32,3e-17,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.20567.1
         (438 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G27390.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   483   e-136
AT5G40640.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...   467   e-132
AT4G12680.1 | Symbols:  | unknown protein; INVOLVED IN: vegetati...   320   1e-87
AT4G37030.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   249   2e-66

>AT3G27390.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G40640.1); Has 101 Blast
           hits to 99 proteins in 12 species: Archae - 0; Bacteria
           - 0; Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0;
           Other Eukaryotes - 0 (source: NCBI BLink). |
           chr3:10133372-10136111 REVERSE LENGTH=588
          Length = 588

 Score =  483 bits (1242), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 240/422 (56%), Positives = 309/422 (73%), Gaps = 3/422 (0%)

Query: 1   MEPPVEFWASLWSFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCI 60
           MEPP+ F ASL+ F+ FLPYFIGL  LG IKG++ CPL+CL++TIGNSA+IL L  +H +
Sbjct: 1   MEPPIGFRASLFQFLLFLPYFIGLLFLGFIKGIVLCPLVCLVVTIGNSAVILSLLPVHIV 60

Query: 61  WTYYCVVRSKQLGPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAV 120
           WT+Y +V +KQ+GP+ K+ +C CL PA +ILWP           A YGFFSPI ATF+AV
Sbjct: 61  WTFYSIVSAKQVGPILKIFLCLCL-PAAIILWPIVGILGSVLGGALYGFFSPIFATFDAV 119

Query: 121 EGGKENKIVHCFIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKEE-GNGRYYEIRPL 179
             GK  +  HCF DGTWST+ ++F +V+D  + CFH+YFS+MD+LK+   + +YYEIR L
Sbjct: 120 GEGKPYQFFHCFYDGTWSTMQRSFTVVRDFKDVCFHSYFSLMDELKQSCPDRKYYEIRLL 179

Query: 180 YVPGAVVASVLGIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLAG 239
            +PGA+V SVLGI++D PVIS VAI KSPYMLFKGW+RL HDLIGREGPFLET+CVP+AG
Sbjct: 180 QLPGALVVSVLGILVDPPVISLVAICKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAG 239

Query: 240 LAILLWPLAVVGAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDVL 299
           LAILLWPLAV GAV+ SVI+SI LG  AGVV+Y+E+S +YGL YIVA++S+YDEYS D+L
Sbjct: 240 LAILLWPLAVTGAVIGSVISSIFLGAYAGVVSYQESSFYYGLCYIVASVSIYDEYSTDIL 299

Query: 300 DMPQRSCFPRPPFRKKDEMPXXXXXXXXXXXXXXXXXXXXXXXXXKNSIAELKPFELLDG 359
           D+P+ SCFPRP +R+KDE P                         +  + ++KP +LL+ 
Sbjct: 300 DLPEGSCFPRPKYRRKDEEP-TPFSGPVPRLGSVKNASSMRGGSVRVPMIDIKPLDLLNE 358

Query: 360 LCKECLQMGERLVSEGLITSEDIQETWFGKESRVISVGLPAYCLLQALLRSAKANSPGIL 419
           L  EC + GE L ++GLI S+DI+E    K S+VISVGLPAY LL  +LRS KANS G+L
Sbjct: 359 LFVECRRYGEVLATKGLINSKDIEEARSSKGSQVISVGLPAYGLLYEILRSVKANSSGLL 418

Query: 420 IS 421
           +S
Sbjct: 419 LS 420


>AT5G40640.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 19 plant structures; EXPRESSED
           DURING: 7 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT3G27390.1);
           Has 104 Blast hits to 102 proteins in 14 species: Archae
           - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 101;
           Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
           | chr5:16277345-16280258 FORWARD LENGTH=586
          Length = 586

 Score =  467 bits (1201), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 235/421 (55%), Positives = 293/421 (69%), Gaps = 3/421 (0%)

Query: 1   MEPPVEFWASLWSFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCI 60
           MEPP    ASLW FI F+PYF GL LLG IKG++ CPLICL + IGNSAIILGL  +H I
Sbjct: 1   MEPPTGILASLWQFILFIPYFTGLLLLGVIKGIVLCPLICLTVAIGNSAIILGLLPVHAI 60

Query: 61  WTYYCVVRSKQLGPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAV 120
           WT Y +  +KQLGP+ K+ +C C+ P  +ILW            A YGF SPI ATF+AV
Sbjct: 61  WTLYSIASAKQLGPILKIFLCLCV-PLGVILWLVVSILGSVLGGAIYGFLSPIFATFDAV 119

Query: 121 EGGKENKIVHCFIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKEE-GNGRYYEIRPL 179
             GK N   HCF DGTWSTV  +F +V D  + CFH+YFS MDDL+    N  YYEIR L
Sbjct: 120 GEGKSNPFFHCFYDGTWSTVQGSFTVVCDFKDVCFHSYFSFMDDLRTSTANRHYYEIRLL 179

Query: 180 YVPGAVVASVLGIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLAG 239
            +PGAV+ +VLGI++D PVIS +A+ KSPYMLFKGW+RL HDLIGREGPFLET+CVP+AG
Sbjct: 180 QIPGAVIVAVLGILVDFPVISLLALCKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAG 239

Query: 240 LAILLWPLAVVGAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDVL 299
           L ILLWPLAVVGAVL SV++S+ LG   GVV+Y+E+S F+GL Y+VA++S+YDEYSNDVL
Sbjct: 240 LVILLWPLAVVGAVLGSVVSSVFLGAYGGVVSYQESSFFFGLCYVVASVSIYDEYSNDVL 299

Query: 300 DMPQRSCFPRPPFRKKDEMPXXXXXXXXXXXXXXXXXXXXXXXXXKNSIAELKPFELLDG 359
           DMP+ SCFPRP +R+ +E                           K  + +LKP +LL+ 
Sbjct: 300 DMPEGSCFPRPIYRRNEE-GASTAFSGGLSRPNSFKTTPSRGGSNKGPMIDLKPLDLLEA 358

Query: 360 LCKECLQMGERLVSEGLITSEDIQETWFGKESRVISVGLPAYCLLQALLRSAKANSPGIL 419
           L  EC + GE +V++G+I S+DI+E    K S+VIS GLPAY LL  LLRS K+NS G+L
Sbjct: 359 LFVECRRHGEIMVTKGIINSKDIEEAKSSKGSQVISFGLPAYSLLHELLRSIKSNSTGLL 418

Query: 420 I 420
           +
Sbjct: 419 L 419


>AT4G12680.1 | Symbols:  | unknown protein; INVOLVED IN: vegetative
           to reproductive phase transition of meristem; LOCATED
           IN: endomembrane system; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G40640.1);
           Has 103 Blast hits to 103 proteins in 14 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 103;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr4:7475104-7478174 FORWARD LENGTH=575
          Length = 575

 Score =  320 bits (821), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 188/432 (43%), Positives = 267/432 (61%), Gaps = 12/432 (2%)

Query: 1   MEPPVEFWASLWSFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCI 60
           ME P  F+  LWSF+ FLPYF  L LLG  K +I  P+   I+ +GNS +I+GLW  H I
Sbjct: 1   MEVPKGFFEKLWSFVSFLPYFFLLLLLGVTKALIIGPISSAIILVGNSCVIIGLWPAHFI 60

Query: 61  WTYYCVVRSKQLGPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAV 120
           WTYYC+ R+K++G + K  +   L P  L+LWP            AYGFF+P++ATFEAV
Sbjct: 61  WTYYCLARTKRIGLVLK-TLALVLFPLPLLLWPVAGIVGSLFGGIAYGFFTPLMATFEAV 119

Query: 121 EGGKENKIVHCFIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKE--EGNGRYYEIRP 178
                +K  HCF+DG++ST+  +  +V D  + CFH+YFS MD+L+E    +    EI+ 
Sbjct: 120 GESVTSKCYHCFVDGSFSTIKGSCTVVTDFTDFCFHSYFSYMDELREMVSADVEPLEIKL 179

Query: 179 LYVPGAVVASVLGIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLA 238
             +P  ++AS++G+++DV +I+ VA+YKSPYML KGW RLL DL+GREGPFLE++CVP A
Sbjct: 180 SRLPSCLLASLIGVMVDVLLITAVAVYKSPYMLLKGWKRLLEDLVGREGPFLESVCVPFA 239

Query: 239 GLAILLWPLAVVGAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDV 298
           GLAILLWPLAV GAV+ASV++S  LG+ +GV+ ++E S   GL YI+AA+SL+DEY ND+
Sbjct: 240 GLAILLWPLAVAGAVIASVLSSFFLGLYSGVIVHQEDSFRMGLNYIIAAVSLFDEYVNDL 299

Query: 299 LDMPQRSCFPRPPFRKKDEM---------PXXXXXXXXXXXXXXXXXXXXXXXXXKNSIA 349
           L + + +  PRP +R K E                                    K +I 
Sbjct: 300 LYLREGTSLPRPCYRTKTETVHGKRILGESKNVDLKSKRSSSLGSKLVSEQSRTLKKAIT 359

Query: 350 ELKPFELLDGLCKECLQMGERLVSEGLITSEDIQETWFGKESRVISVGLPAYCLLQALLR 409
             KP ++ + L K C   G  L+ +GLI  +D++E      S+ + + LPA+ +LQ LL 
Sbjct: 360 LYKPVQVWEWLFKSCEVNGRILLRDGLIDVKDVEECLVKGNSKKLYIKLPAWTVLQCLLA 419

Query: 410 SAKANSPGILIS 421
           SAK+NS G++I+
Sbjct: 420 SAKSNSSGLVIT 431


>AT4G37030.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT4G12680.1); Has 101 Blast hits
           to 99 proteins in 12 species: Archae - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0; Other
           Eukaryotes - 0 (source: NCBI BLink). |
           chr4:17452150-17454629 FORWARD LENGTH=569
          Length = 569

 Score =  249 bits (637), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 139/426 (32%), Positives = 226/426 (53%), Gaps = 21/426 (4%)

Query: 13  SFICFLPYFIGLWLLGHIKGVIFCPLICLIMTIGNSAIILGLWVIHCIWTYYCVVRSKQL 72
           S++ F   F   + LG IKG+I  P+  L + +GN  +IL L+  H  WT Y V ++ + 
Sbjct: 15  SYVIFA--FCSAFFLGAIKGLIVGPIAGLTLIVGNVGVILCLFPAHVTWTIYAVAKTNRF 72

Query: 73  GPLFKLVMCTCLLPALLILWPXXXXXXXXXXXAAYGFFSPIVATFEAVEGGKE-NKIVHC 131
               K+ +   L PAL  +W              YGFF+P ++ FEA     E NK  HC
Sbjct: 73  DIPLKVAILVAL-PALFGIWLGLSLAISVLVGVGYGFFTPWISAFEAFRQDTESNKFFHC 131

Query: 132 FIDGTWSTVLKTFDIVKDVGNACFHTYFSVMDDLKEEG-NGRYYEIRPLYVPGAVVASVL 190
            +DGTW T+  +  +V D  + C+H+Y   + +L+E   +     +R ++VPG ++  +L
Sbjct: 132 LVDGTWGTIKGSCIVVTDFADFCYHSYPLYLKELRESPVSDELQTLRLIHVPGCIIVGIL 191

Query: 191 GIIIDVPVISFVAIYKSPYMLFKGWNRLLHDLIGREGPFLETICVPLAGLAILLWPLAVV 250
           G++ID+P+ + +A+ KSPY+L KGW RL  D I REGPFLE  C+P+AGL +LLWP+ V+
Sbjct: 192 GLVIDIPLFTAIAVIKSPYLLLKGWYRLAQDAINREGPFLEIACIPVAGLTVLLWPIVVI 251

Query: 251 GAVLASVIASIILGVRAGVVAYEETSVFYGLRYIVAALSLYDEYSNDVLDMPQRSCFPRP 310
           G +L ++ +SI +G+   VV ++E S   G+ Y++A +  +DEY+ND L + + + FP+P
Sbjct: 252 GFILVTIFSSIFVGLYGAVVVFQERSFRRGVSYVIAVVGEFDEYTNDWLYLREGTIFPKP 311

Query: 311 PFRK-----KDEMPXXXXXXXXXXXXXXXXX--------XXXXXXXXKNSIAELKPFELL 357
            +R        E+                                  + +I E++  ++ 
Sbjct: 312 RYRMGRGSFSSEVSVIVHPSDVTRVNSSGSVDAPAMLVPSLVHSVSVREAIQEVRMVQIW 371

Query: 358 DGLCKECLQMGERLVSEGLITSEDIQETW---FGKESRVISVGLPAYCLLQALLRSAKAN 414
           + +       G+ L+   ++T  D+ E+     G ES +I+VGLP+Y LL  LL S KA 
Sbjct: 372 EHMMGWFEMQGKELLDAEVLTPTDLYESLKGRHGNESSIINVGLPSYALLHTLLSSIKAG 431

Query: 415 SPGILI 420
             G+L+
Sbjct: 432 VHGVLL 437