Miyakogusa Predicted Gene

Lj1g3v3740580.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v3740580.1 Non Chatacterized Hit- tr|I1MLP6|I1MLP6_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,90.2,1e-18,
,CUFF.31129.1
         (384 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G01720.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   505   e-143
AT5G25265.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    55   1e-07
AT5G13500.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    54   1e-07
AT5G13500.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    54   1e-07
AT5G13500.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    54   1e-07

>AT3G01720.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 24 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 374 Blast hits to 211 proteins in 23 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 316;
           Viruses - 0; Other Eukaryotes - 58 (source: NCBI BLink).
           | chr3:262412-265608 REVERSE LENGTH=802
          Length = 802

 Score =  505 bits (1300), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 248/384 (64%), Positives = 290/384 (75%), Gaps = 7/384 (1%)

Query: 1   MHSFHLRGQPGNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAV 60
           MHSF   GQPGNITRLLSC+ E LK YKGH LAPTHYVPSMSRHPLTGDWYPAINKPAAV
Sbjct: 414 MHSFRQSGQPGNITRLLSCTDEALKNYKGHDLAPTHYVPSMSRHPLTGDWYPAINKPAAV 473

Query: 61  LHWLNHANTDAEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLIGWGNELAKLHTS 120
           +HWL+H N DAE++VILDADMILRGPITPWEFKA RGRPVSTPYDYLIG  N+LA+LHT 
Sbjct: 474 VHWLHHTNIDAEYVVILDADMILRGPITPWEFKAARGRPVSTPYDYLIGCDNDLARLHTR 533

Query: 121 HPDACDKVGGVIIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMYGY 180
           +P+ACDKVGGVIIMHI+DLRKFA+ WL KT+EVRAD+ H+ + +TGDIYESGWISEMYGY
Sbjct: 534 NPEACDKVGGVIIMHIEDLRKFAMYWLLKTQEVRADKEHYGKELTGDIYESGWISEMYGY 593

Query: 181 SFGAAELKLRHTINGEIMTYPGYVHEPGVKYRVFHYGLRFSVGNWSFNKAEWRDVDLVNR 240
           SFGAAEL LRH+IN EIM YPGYV EPG  YRVFHYGL F VGNWSF+KA WR+ DL+N+
Sbjct: 594 SFGAAELNLRHSINKEIMIYPGYVPEPGADYRVFHYGLEFKVGNWSFDKANWRNTDLINK 653

Query: 241 CWSKFPEPPDPSTLDHDDKDNFQRNLLSIECVKTLNEALHLHHERKECPRAGSLSTSKGD 300
           CW+KFP+PP PS +   D D  QR+LLSIEC + LNEAL LHH+R+ CP  GS ST K  
Sbjct: 654 CWAKFPDPPSPSAVHQTDNDLRQRDLLSIECGQKLNEALFLHHKRRNCPEPGSESTEKIS 713

Query: 301 KIEEFGDFNGKFDSKNNHMSANDSEGLTTVPKDRIGIPSSFRFWVIFLCSFSGFGFLVII 360
              + G+   K          +D    ++   +  G  S+ + WVI L   SG GFLV++
Sbjct: 714 VSRKVGNIETK------QTQGSDETKESSGSSESEGRFSTLKLWVIALWLISGVGFLVVM 767

Query: 361 FVVHS-GHKRRGMKMKHLRSRRRN 383
            +V S    R   + K  R++RR 
Sbjct: 768 LLVFSTRRGRGTTRGKGYRNKRRT 791



 Score =  345 bits (885), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 170/294 (57%), Positives = 206/294 (70%), Gaps = 4/294 (1%)

Query: 1   MHSFHLRGQPGNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAV 60
           MHSF   GQPG ITRLLSC+ +  K Y+G +LAPT  VPS SRHP TGDWYPAINKP  V
Sbjct: 50  MHSFLKSGQPGPITRLLSCTDDQKKTYRGMNLAPTFEVPSWSRHPKTGDWYPAINKPVGV 109

Query: 61  LHWLNHANTD--AEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLIGWGNELAKLH 118
           L+WL H+      +++VILDADMI+RGPI PWE  AERGRP +  Y YL+G  N L +LH
Sbjct: 110 LYWLQHSEEAKHVDWVVILDADMIIRGPIIPWELGAERGRPFAAHYGYLVGCDNLLVRLH 169

Query: 119 TSHPDACDKVGGVIIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMY 178
           T HP+ CDKVGG++ MHIDDLR  A LWL KTE+VR D AH+  N+TGDIY  GWISEMY
Sbjct: 170 TKHPELCDKVGGLLAMHIDDLRVLAPLWLSKTEDVRQDTAHWTTNLTGDIYGKGWISEMY 229

Query: 179 GYSFGAAELKLRHTINGEIMTYPGYVHEPGVKYRVFHYGLRFSVGNWSFNKAEWRDVDLV 238
           GYSFGAAE  L+H IN ++M YPGYV   GV+  + HYGL FS+GNWSF K +  + ++V
Sbjct: 230 GYSFGAAEAGLKHKINDDLMIYPGYVPREGVEPVLMHYGLPFSIGNWSFTKLDHHEDNIV 289

Query: 239 NRCWSKFPEPPDPSTLDHDDKDNFQRN--LLSIECVKTLNEALHLHHERKECPR 290
             C   FPEPP P  +   + D  +R   +LS+EC+ TLNE L L H    CP+
Sbjct: 290 YDCNRLFPEPPYPREVKIMEPDPSKRRGLILSLECMNTLNEGLILRHAENGCPK 343


>AT5G25265.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane,
           membrane; EXPRESSED IN: cultured cell, leaf; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G25260.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:8754794-8756855 REVERSE LENGTH=366
          Length = 366

 Score = 54.7 bits (130), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 56/224 (25%), Positives = 99/224 (44%), Gaps = 36/224 (16%)

Query: 30  HSLAPTHY---VPSMSRHPLTG---DWYPAINKPAAVLHWLNHANTDAEFIVILDADMIL 83
           HS  P  Y   +P+    PL       Y  +N+P A + WL   +   ++I++ + D I+
Sbjct: 114 HSGKPDQYMDEIPTFVAQPLPSGMDQGYVVLNRPWAFVQWLQQTDIKEDYILMSEPDHII 173

Query: 84  RGPITPWEFKAERGRPVSTPYDYLIGWGNELAKLHTSHPDA------CDKVGGV-IIMHI 136
             PI      A+ G   + P+ Y+     E   L   +P+        D +G   +I+  
Sbjct: 174 VKPI---PNLAKDGLGAAFPFFYIEPKKYEKV-LRKYYPEVRGPVTNIDPIGNSPVIVGK 229

Query: 137 DDLRKFALLWLHKT----EEVRADRAHFARNITGDIYESGWISEMYGYSFGAAELKLRHT 192
           D L+K A  W++ +    ++  AD+A             GW+ EMY Y+  +A   + + 
Sbjct: 230 DALKKIAPTWMNVSLAMKKDPEADKAF------------GWVLEMYAYAVSSALHGVSNI 277

Query: 193 INGEIMTYPGYVHEPGVKYRV-FHYGLRFSV-GNWSFNK-AEWR 233
           ++ + M  P +  E G KY + + YG  + + G  ++ K  EWR
Sbjct: 278 LHKDFMIQPPWDIEVGDKYIIHYTYGCDYDMKGKLTYGKIGEWR 321


>AT5G13500.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr5:4338676-4340827 FORWARD
           LENGTH=358
          Length = 358

 Score = 54.3 bits (129), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/232 (21%), Positives = 100/232 (43%), Gaps = 26/232 (11%)

Query: 11  GNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHANTD 70
           G  TR+L   + D  + +  +       P + R       Y  +N+P A + WL  A   
Sbjct: 98  GGFTRILHSGNSDNLMDEIPTFVVDPLPPGLDRG------YVVLNRPWAFVQWLERATIK 151

Query: 71  AEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLI--GWGNELAKLHTSHPDACDKV 128
            +++++ + D +    + P    A  G P + P+ Y+    + N + K + +       +
Sbjct: 152 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 208

Query: 129 GGV----IIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMYGYSFGA 184
             +    +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +
Sbjct: 209 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 260

Query: 185 AELKLRHTINGEIMTYPGYVHEPGVKYRV-FHYGLRFSV-GNWSFNK-AEWR 233
           A   +RH +  + M  P +      K+ + + YG  +++ G  ++ K  EWR
Sbjct: 261 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWR 312


>AT5G13500.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 228 Blast hits to 200 proteins in 21 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213;
           Viruses - 0; Other Eukaryotes - 15 (source: NCBI BLink).
           | chr5:4338676-4340827 FORWARD LENGTH=358
          Length = 358

 Score = 54.3 bits (129), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/232 (21%), Positives = 100/232 (43%), Gaps = 26/232 (11%)

Query: 11  GNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHANTD 70
           G  TR+L   + D  + +  +       P + R       Y  +N+P A + WL  A   
Sbjct: 98  GGFTRILHSGNSDNLMDEIPTFVVDPLPPGLDRG------YVVLNRPWAFVQWLERATIK 151

Query: 71  AEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLI--GWGNELAKLHTSHPDACDKV 128
            +++++ + D +    + P    A  G P + P+ Y+    + N + K + +       +
Sbjct: 152 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 208

Query: 129 GGV----IIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMYGYSFGA 184
             +    +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +
Sbjct: 209 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 260

Query: 185 AELKLRHTINGEIMTYPGYVHEPGVKYRV-FHYGLRFSV-GNWSFNK-AEWR 233
           A   +RH +  + M  P +      K+ + + YG  +++ G  ++ K  EWR
Sbjct: 261 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWR 312


>AT5G13500.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G25265.1);
           Has 1807 Blast hits to 1807 proteins in 277 species:
           Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
           Plants - 385; Viruses - 0; Other Eukaryotes - 339
           (source: NCBI BLink). | chr5:4338676-4340827 FORWARD
           LENGTH=358
          Length = 358

 Score = 54.3 bits (129), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/232 (21%), Positives = 100/232 (43%), Gaps = 26/232 (11%)

Query: 11  GNITRLLSCSHEDLKLYKGHSLAPTHYVPSMSRHPLTGDWYPAINKPAAVLHWLNHANTD 70
           G  TR+L   + D  + +  +       P + R       Y  +N+P A + WL  A   
Sbjct: 98  GGFTRILHSGNSDNLMDEIPTFVVDPLPPGLDRG------YVVLNRPWAFVQWLERATIK 151

Query: 71  AEFIVILDADMILRGPITPWEFKAERGRPVSTPYDYLI--GWGNELAKLHTSHPDACDKV 128
            +++++ + D +    + P    A  G P + P+ Y+    + N + K + +       +
Sbjct: 152 EDYVLMAEPDHVF---VNPLPNLAVGGFPAAFPFFYITPEKYENIVRKYYPAEMGPVTNI 208

Query: 129 GGV----IIMHIDDLRKFALLWLHKTEEVRADRAHFARNITGDIYESGWISEMYGYSFGA 184
             +    +I+  + L K A  W++ +  ++ D        T   +  GW+ EMYGY+  +
Sbjct: 209 DPIGNSPVIISKESLEKIAPTWMNVSLTMKNDPE------TDKAF--GWVLEMYGYAIAS 260

Query: 185 AELKLRHTINGEIMTYPGYVHEPGVKYRV-FHYGLRFSV-GNWSFNK-AEWR 233
           A   +RH +  + M  P +      K+ + + YG  +++ G  ++ K  EWR
Sbjct: 261 AIHGVRHILRKDFMLQPPWDLSTKGKFIIHYTYGCDYNMKGELTYGKIGEWR 312