Miyakogusa Predicted Gene

Lj0g3v0101219.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0101219.1 Non Characterized Hit- tr|G7J850|G7J850_MEDTR
Membrane protein, putative OS=Medicago truncatula
GN=M,81.71,0,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.5675.1
         (430 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G12680.1 | Symbols:  | unknown protein; INVOLVED IN: vegetati...   525   e-149
AT5G40640.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...   281   8e-76
AT3G27390.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   277   1e-74
AT4G37030.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   250   2e-66

>AT4G12680.1 | Symbols:  | unknown protein; INVOLVED IN: vegetative
           to reproductive phase transition of meristem; LOCATED
           IN: endomembrane system; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G40640.1);
           Has 103 Blast hits to 103 proteins in 14 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 103;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr4:7475104-7478174 FORWARD LENGTH=575
          Length = 575

 Score =  525 bits (1352), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 258/431 (59%), Positives = 318/431 (73%), Gaps = 3/431 (0%)

Query: 1   MVQDVTDFCYYSYFSYTDELRENLPPNEKPIDIRXXXXXXXXXXXXXXXXFAMAIIISIA 60
           +V D TDFC++SYFSY DELRE +  + +P++I+                  + +I ++A
Sbjct: 145 VVTDFTDFCFHSYFSYMDELREMVSADVEPLEIKLSRLPSCLLASLIGVMVDVLLITAVA 204

Query: 61  IWKSPYMLFRGWKRLLEDLIGRKGPFLETECVPFAGLAIILWPLAVVGAVLAASIISFFL 120
           ++KSPYML +GWKRLLEDL+GR+GPFLE+ CVPFAGLAI+LWPLAV GAV+A+ + SFFL
Sbjct: 205 VYKSPYMLLKGWKRLLEDLVGREGPFLESVCVPFAGLAILLWPLAVAGAVIASVLSSFFL 264

Query: 121 GLYSGVVVHQEDSMQMGFAYIVSVVSLFDEYVNDLLYLREGSCIPRPIYRRKMTHALESK 180
           GLYSGV+VHQEDS +MG  YI++ VSLFDEYVNDLLYLREG+ +PRP YR K       +
Sbjct: 265 GLYSGVIVHQEDSFRMGLNYIIAAVSLFDEYVNDLLYLREGTSLPRPCYRTKTETVHGKR 324

Query: 181 SLGGS-NHNLKIRRDSSQNSKHILQQTRSLKWKIQQYKPVQVWDWLFKSCEVNGRIVLRD 239
            LG S N +LK +R SS  SK + +Q+R+LK  I  YKPVQVW+WLFKSCEVNGRI+LRD
Sbjct: 325 ILGESKNVDLKSKRSSSLGSKLVSEQSRTLKKAITLYKPVQVWEWLFKSCEVNGRILLRD 384

Query: 240 GLISVKEIEECILKGNCKKLGIKLPAWSLLQCLLTSAKSNSDGLVISDEVELTRMNGPKD 299
           GLI VK++EEC++KGN KKL IKLPAW++LQCLL SAKSNS GLVI+D VELT +N P+D
Sbjct: 385 GLIDVKDVEECLVKGNSKKLYIKLPAWTVLQCLLASAKSNSSGLVITDGVELTELNSPRD 444

Query: 300 KVFEWFIGPLLIMXXXXXXXXXXXXXXXXXXXXVMRCKNDIPEEWDSTGFPSNDNVRRAQ 359
           KVF W +GPLLIM                    VM CKN+  E+WD+TGFPS+D VR+AQ
Sbjct: 445 KVFVWLVGPLLIMKEQIKNLKLTEDEEFCLRKLVMVCKNERTEDWDNTGFPSSDTVRKAQ 504

Query: 360 LQAIIRRLQGIVASMSRIPTFRRRFRNLVKVLYMEALQASASASHIGANAIPKHREKGSL 419
           LQAIIRRLQG+VASMSRIPTFRRRF NLVKVLY+EAL+  AS +  G    P   + G+L
Sbjct: 505 LQAIIRRLQGMVASMSRIPTFRRRFMNLVKVLYIEALEMGASGNRAGGILKPNSDQTGNL 564

Query: 420 QRKE--DNNVV 428
            R E  D +VV
Sbjct: 565 DRTETPDMDVV 575


>AT5G40640.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 19 plant structures; EXPRESSED
           DURING: 7 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT3G27390.1);
           Has 104 Blast hits to 102 proteins in 14 species: Archae
           - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 101;
           Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).
           | chr5:16277345-16280258 FORWARD LENGTH=586
          Length = 586

 Score =  281 bits (718), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 167/398 (41%), Positives = 218/398 (54%), Gaps = 21/398 (5%)

Query: 1   MVQDVTDFCYYSYFSYTDELRENLPPNEKPIDIRXXXXXXXXXXXXXXXXFAMAIIISIA 60
           +V D  D C++SYFS+ D+LR +   N    +IR                    +I  +A
Sbjct: 145 VVCDFKDVCFHSYFSFMDDLRTS-TANRHYYEIRLLQIPGAVIVAVLGILVDFPVISLLA 203

Query: 61  IWKSPYMLFRGWKRLLEDLIGRKGPFLETECVPFAGLAIILWPLAVVGAVLAASIISFFL 120
           + KSPYMLF+GW RL  DLIGR+GPFLET CVP AGL I+LWPLAVVGAVL + + S FL
Sbjct: 204 LCKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAGLVILLWPLAVVGAVLGSVVSSVFL 263

Query: 121 GLYSGVVVHQEDSMQMGFAYIVSVVSLFDEYVNDLLYLREGSCIPRPIYRRKMTHALESK 180
           G Y GVV +QE S   G  Y+V+ VS++DEY ND+L + EGSC PRPIYRR    A  + 
Sbjct: 264 GAYGGVVSYQESSFFFGLCYVVASVSIYDEYSNDVLDMPEGSCFPRPIYRRNEEGASTAF 323

Query: 181 SLGGSNHNLKIRRDSSQNSKHILQQTRSLKWKIQQYKPVQVWDWLFKSCEVNGRIVLRDG 240
           S G S  N         + K    +  S K  +   KP+ + + LF  C  +G I++  G
Sbjct: 324 SGGLSRPN---------SFKTTPSRGGSNKGPMIDLKPLDLLEALFVECRRHGEIMVTKG 374

Query: 241 LISVKEIEECILKGNCKKLGIKLPAWSLLQCLLTSAKSNSDGLVISDEV-ELTRMNGPKD 299
           +I+ K+IEE       + +   LPA+SLL  LL S KSNS GL++ D V E+T  N PKD
Sbjct: 375 IINSKDIEEAKSSKGSQVISFGLPAYSLLHELLRSIKSNSTGLLLGDGVTEITTRNRPKD 434

Query: 300 KVFEWFIGPLLIMXXXXXXXXXXXXXXXXXXXXVM------RCKNDIPEEWDSTGFPSND 353
             F+WF+ P LI+                    V+      R K+ I E       P   
Sbjct: 435 AFFDWFLNPFLILKDQIEAANLSEEEEEYLGKLVLLFGDSERLKSSIVESES----PPLT 490

Query: 354 NVRRAQLQAIIRRLQGIVASMSRIPTFRRRFRNLVKVL 391
            +R+A+L A  RRLQG+  S+SR PTFRR F  LVK L
Sbjct: 491 ELRKAELDAFARRLQGLTKSVSRYPTFRRHFVELVKKL 528


>AT3G27390.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G40640.1); Has 101 Blast
           hits to 99 proteins in 12 species: Archae - 0; Bacteria
           - 0; Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0;
           Other Eukaryotes - 0 (source: NCBI BLink). |
           chr3:10133372-10136111 REVERSE LENGTH=588
          Length = 588

 Score =  277 bits (708), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 163/396 (41%), Positives = 220/396 (55%), Gaps = 17/396 (4%)

Query: 1   MVQDVTDFCYYSYFSYTDELRENLPPNEKPIDIRXXXXXXXXXXXXXXXXFAMAIIISIA 60
           +V+D  D C++SYFS  DEL+++ P + K  +IR                    +I  +A
Sbjct: 145 VVRDFKDVCFHSYFSLMDELKQSCP-DRKYYEIRLLQLPGALVVSVLGILVDPPVISLVA 203

Query: 61  IWKSPYMLFRGWKRLLEDLIGRKGPFLETECVPFAGLAIILWPLAVVGAVLAASIISFFL 120
           I KSPYMLF+GW RL  DLIGR+GPFLET CVP AGLAI+LWPLAV GAV+ + I S FL
Sbjct: 204 ICKSPYMLFKGWHRLFHDLIGREGPFLETMCVPIAGLAILLWPLAVTGAVIGSVISSIFL 263

Query: 121 GLYSGVVVHQEDSMQMGFAYIVSVVSLFDEYVNDLLYLREGSCIPRPIYRRKMTHALESK 180
           G Y+GVV +QE S   G  YIV+ VS++DEY  D+L L EGSC PRP YRRK     E  
Sbjct: 264 GAYAGVVSYQESSFYYGLCYIVASVSIYDEYSTDILDLPEGSCFPRPKYRRKDE---EPT 320

Query: 181 SLGGSNHNLKIRRDSSQNSKHILQQTRSLKWKIQQYKPVQVWDWLFKSCEVNGRIVLRDG 240
              G    L   +++S        +  S++  +   KP+ + + LF  C   G ++   G
Sbjct: 321 PFSGPVPRLGSVKNASS------MRGGSVRVPMIDIKPLDLLNELFVECRRYGEVLATKG 374

Query: 241 LISVKEIEECILKGNCKKLGIKLPAWSLLQCLLTSAKSNSDGLVISDEV-ELTRMNGPKD 299
           LI+ K+IEE       + + + LPA+ LL  +L S K+NS GL++SD V E+T MN PKD
Sbjct: 375 LINSKDIEEARSSKGSQVISVGLPAYGLLYEILRSVKANSSGLLLSDGVTEITTMNRPKD 434

Query: 300 KVFEWFIGPLLIMXXXXXXXXXXXXXXXXXXXXVMRCKNDIPEEWDS----TGFPSNDNV 355
             F+WF+ P LI+                    V+   +  PE   S    +  P     
Sbjct: 435 VFFDWFLNPFLILKEQMKATNLSEEEEEYLGRLVLLFGD--PERLKSSNAISASPPLTER 492

Query: 356 RRAQLQAIIRRLQGIVASMSRIPTFRRRFRNLVKVL 391
           +RA+L A  RR+QG+  ++SR PTFRR F  LVK L
Sbjct: 493 KRAELDAFARRMQGLTKTVSRYPTFRRHFVALVKKL 528


>AT4G37030.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT4G12680.1); Has 101 Blast hits
           to 99 proteins in 12 species: Archae - 0; Bacteria - 0;
           Metazoa - 0; Fungi - 0; Plants - 101; Viruses - 0; Other
           Eukaryotes - 0 (source: NCBI BLink). |
           chr4:17452150-17454629 FORWARD LENGTH=569
          Length = 569

 Score =  250 bits (638), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 141/403 (34%), Positives = 221/403 (54%), Gaps = 16/403 (3%)

Query: 1   MVQDVTDFCYYSYFSYTDELRENLPPNEKPIDIRXXXXXXXXXXXXXXXXFAMAIIISIA 60
           +V D  DFCY+SY  Y  ELRE+ P +++   +R                  + +  +IA
Sbjct: 146 VVTDFADFCYHSYPLYLKELRES-PVSDELQTLRLIHVPGCIIVGILGLVIDIPLFTAIA 204

Query: 61  IWKSPYMLFRGWKRLLEDLIGRKGPFLETECVPFAGLAIILWPLAVVGAVLAASIISFFL 120
           + KSPY+L +GW RL +D I R+GPFLE  C+P AGL ++LWP+ V+G +L     S F+
Sbjct: 205 VIKSPYLLLKGWYRLAQDAINREGPFLEIACIPVAGLTVLLWPIVVIGFILVTIFSSIFV 264

Query: 121 GLYSGVVVHQEDSMQMGFAYIVSVVSLFDEYVNDLLYLREGSCIPRPIYRRKMTHALESK 180
           GLY  VVV QE S + G +Y+++VV  FDEY ND LYLREG+  P+P YR  M     S 
Sbjct: 265 GLYGAVVVFQERSFRRGVSYVIAVVGEFDEYTNDWLYLREGTIFPKPRYR--MGRGSFSS 322

Query: 181 SLGGSNHNLKIRRDSSQNSKHI-------LQQTRSLKWKIQQYKPVQVWDWLFKSCEVNG 233
            +    H   + R +S  S          L  + S++  IQ+ + VQ+W+ +    E+ G
Sbjct: 323 EVSVIVHPSDVTRVNSSGSVDAPAMLVPSLVHSVSVREAIQEVRMVQIWEHMMGWFEMQG 382

Query: 234 RIVLRDGLISVKEIEECILKG----NCKKLGIKLPAWSLLQCLLTSAKSNSDGLVISDEV 289
           + +L   +++  ++ E  LKG        + + LP+++LL  LL+S K+   G+++ D  
Sbjct: 383 KELLDAEVLTPTDLYES-LKGRHGNESSIINVGLPSYALLHTLLSSIKAGVHGVLLLDGS 441

Query: 290 ELTRMNGPKDKVFEWFIGPLLIMXXXXXXXXXXXXXXXXXXXXVMRCKNDI-PEEWDSTG 348
           E+T +N P+DK  +W   P++++                    V+   ++   E WD+  
Sbjct: 442 EVTHLNRPQDKFLDWVFNPIMVLKDQIRALKLGESEVKYLEKVVLFGNHEQRMEAWDNHS 501

Query: 349 FPSNDNVRRAQLQAIIRRLQGIVASMSRIPTFRRRFRNLVKVL 391
            P  +N+R AQ+Q I RR+ G+V S+S++PT+RRRFR +VK L
Sbjct: 502 NPPQENLRTAQIQGISRRMMGMVRSVSKLPTYRRRFRQVVKAL 544
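
The "Identities / Positives / Gaps" lines above give each count as a fraction of the alignment length, followed by a percentage. The following is a minimal sketch, not part of the BLAST output, of how those lines could be parsed and the percentages recomputed. The names STATS and parse_stats are hypothetical, and the pattern assumes the exact line layout shown in this report. In the four alignments here the reported values are consistent with truncation rather than rounding (258/431 is 59.86% but is printed as 59%), so the sketch uses integer division.

```python
import re

# Matches the per-alignment statistics line as laid out in this report.
# Assumption: the layout is exactly "Identities = a/n (p%), Positives = ..., Gaps = ...".
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\), "
    r"Gaps = (\d+)/(\d+) \((\d+)%\)"
)


def parse_stats(line):
    """Return counts and percentages from one statistics line, or None if no match."""
    m = STATS.search(line)
    if m is None:
        return None
    ident, alen, ident_pct, pos, _, pos_pct, gaps, _, gap_pct = map(int, m.groups())
    return {
        "length": alen,
        "identities": ident,
        "positives": pos,
        "gaps": gaps,
        # Percentages as printed in the report.
        "reported_pct": (ident_pct, pos_pct, gap_pct),
        # Percentages recomputed with integer division, which truncates.
        "truncated_pct": tuple(100 * n // alen for n in (ident, pos, gaps)),
    }


if __name__ == "__main__":
    line = " Identities = 258/431 (59%), Positives = 318/431 (73%), Gaps = 3/431 (0%)"
    print(parse_stats(line))
```

Comparing "reported_pct" with "truncated_pct" for each hit is a quick sanity check that a line was parsed correctly before the counts are used elsewhere.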