Miyakogusa Predicted Gene

Lj5g3v1598310.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v1598310.1 Non Characterized Hit- tr|I1JBF3|I1JBF3_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.57568
PE,86.45,0,seg,NULL; DUF4033,Domain of unknown function
DUF4033,CUFF.55547.1
         (248 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G01995.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   291   4e-79
AT1G03055.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   130   9e-31
AT1G64680.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   130   1e-30

>AT4G01995.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G64680.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr4:873075-874619 FORWARD LENGTH=258
          Length = 258

 Score =  291 bits (744), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 144/218 (66%), Positives = 172/218 (78%), Gaps = 7/218 (3%)

Query: 29  AAPKIEYKPNVVDDLFLNLFRNKLVQEVGWDSKKPGYDGLIEVANRLMMKGTTNSDTIEA 88
            APK+EYKP  +DD F+  FRNKLV+EVG DS+KPGY GLIE+   L++KG T S+T +A
Sbjct: 46  GAPKLEYKPGPLDDFFMQSFRNKLVEEVGSDSEKPGYVGLIELVKLLLLKGRTRSETSDA 105

Query: 89  TVRILRSLFPPFLLELYKMLIAPLGGGKIAAIMVARVTALTCQWLMGPCKVNSVDLPDGT 148
            VRIL+SLFPP +LELYK+LIAP+  GK+AA+MVARVT LTCQWLMGP KVN +DLP+G 
Sbjct: 106 AVRILKSLFPPLILELYKLLIAPIAQGKLAALMVARVTVLTCQWLMGPSKVNIIDLPNGE 165

Query: 149 SCSSGVYVERCKYLEESKCVGICTNTCKFPTQAFFKDHMGVPLLMEPNFGDYSCQFKFGV 208
           S  SGV+VE+C+YLEESKCVG+C NTCK PTQ FFKD+MGVPL+MEPNF DYSCQFKFGV
Sbjct: 166 SWDSGVFVEKCQYLEESKCVGVCINTCKLPTQTFFKDYMGVPLVMEPNFKDYSCQFKFGV 225

Query: 209 LPPLPEDDTVLKEPCLEACPNASRRRIVTRKTDITECP 246
            P  PEDD  + EPC E C  A RR++ +      ECP
Sbjct: 226 AP--PEDDGNVNEPCFETCSIAGRRKLKS-----GECP 256


>AT1G03055.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G64680.1); Has 143 Blast hits to 143 proteins
           in 26 species: Archae - 0; Bacteria - 6; Metazoa - 0;
           Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes -
           15 (source: NCBI BLink). | chr1:710102-711763 REVERSE
           LENGTH=264
          Length = 264

 Score =  130 bits (327), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 67/175 (38%), Positives = 101/175 (57%), Gaps = 11/175 (6%)

Query: 56  VGWDSKKPGYDGLIEVANRLMMKGTTNSDTIEATVRILRSL---FPPFLLELYKMLIAPL 112
           +   SK   YD L++ A R+    + N DT +    +L SL    P  +  L KM   P 
Sbjct: 88  ISSSSKSTDYDRLVDTATRV----SRNFDTKQQHEFVLSSLDRALPTVISSLIKMAFPP- 142

Query: 113 GGGKIAAIMVARVTALTCQWLMGPCKVNSVDLPDGTSCSSGVYVERCKYLEESKCVGICT 172
              K++  + A  T ++  WL+GP +V   ++ +G    S VY+E+C++LE+S CVG+CT
Sbjct: 143 --SKVSRELFALFTTISFAWLVGPSEVRETEV-NGRKEKSVVYIEKCRFLEQSNCVGMCT 199

Query: 173 NTCKFPTQAFFKDHMGVPLLMEPNFGDYSCQFKFGVLPPLPEDDTVLKEPCLEAC 227
           + CK P+Q F K+ +G+P+ MEP+F D SC+  FG  PP  EDD  +K+PC E C
Sbjct: 200 HICKIPSQIFIKNSLGMPIYMEPDFNDLSCKMMFGREPPEIEDDPAMKQPCFEFC 254


>AT1G64680.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G03055.1); Has 146 Blast hits to 146 proteins
           in 26 species: Archae - 0; Bacteria - 6; Metazoa - 0;
           Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes -
           15 (source: NCBI BLink). | chr1:24036071-24037062
           FORWARD LENGTH=250
          Length = 250

 Score =  130 bits (326), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 74/205 (36%), Positives = 110/205 (53%), Gaps = 21/205 (10%)

Query: 32  KIEYKPNVVDDLFLNLFRNKLVQEVG------------WDSKKPGYDGLIEVANRLMMKG 79
           K  Y+  +V+ +F+ LF  K+  + G            W+     Y+  +EV+ R+M +G
Sbjct: 36  KTRYEDGLVERVFMGLFARKM-DKFGSKKKKDTKEKGFWEYD---YESFVEVSKRVM-QG 90

Query: 80  TTNSDTIEATVRILRSLFPPFLLELYKMLIAPLGGGKIAAIMVARVTALTCQWLMGPCKV 139
            +     EA   +L S+ PP   E ++ L  P    K AA   A +T     WL+GP +V
Sbjct: 91  RSRVQQQEAVREVLLSMLPPGAPEQFRKLFPPT---KWAAEFNAALTVPFFHWLVGPSQV 147

Query: 140 NSVDLPDGTSCSSGVYVERCKYLEESKCVGICTNTCKFPTQAFFKDHMGVPLLMEPNFGD 199
             V++ +G    SGV +++C+YLE S CVG+C N CK PTQ FF +  G+PL M PN+ D
Sbjct: 148 IEVEV-NGVKQRSGVRIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNYED 206

Query: 200 YSCQFKFGVLPPLPEDDTVLKEPCL 224
            SC+  +G  PP  E+D   K+PCL
Sbjct: 207 MSCEMIYGQAPPAFEEDVATKQPCL 231
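

Note on the percentages in the alignment summaries: the values reported above appear to be truncated rather than rounded (for example, 172/218 is about 78.9% but is shown as 78%). The sketch below re-derives each percentage with integer division and compares it with the reported figure. The function name `check_summary` and the regular expression are illustrative assumptions, not part of BLAST; the three input lines are copied from the report above.

```python
import re

# Matches fields such as "Identities = 144/218 (66%)" in a BLAST summary line.
SUMMARY = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def check_summary(line):
    """Map each field to (count, alignment_length, reported_pct, floored_pct)."""
    result = {}
    for name, num, den, pct in SUMMARY.findall(line):
        num, den, pct = int(num), int(den), int(pct)
        # Integer division truncates toward zero, e.g. 17200 // 218 == 78.
        result[name] = (num, den, pct, num * 100 // den)
    return result

# Summary lines copied from the three alignments in this report.
lines = [
    "Identities = 144/218 (66%), Positives = 172/218 (78%), Gaps = 7/218 (3%)",
    "Identities = 67/175 (38%), Positives = 101/175 (57%), Gaps = 11/175 (6%)",
    "Identities = 74/205 (36%), Positives = 110/205 (53%), Gaps = 21/205 (10%)",
]

for line in lines:
    for name, (num, den, reported, floored) in check_summary(line).items():
        status = "ok" if reported == floored else "MISMATCH"
        print(f"{name:10s} {num:3d}/{den:3d} reported={reported}% floored={floored}% {status}")
```

Under this assumption all nine reported values are reproduced exactly, whereas conventional rounding would give 79%, 58%, and 54% for the three "Positives" fields.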