Miyakogusa Predicted Gene

Lj1g3v4449220.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v4449220.1 Non Characterized Hit- tr|I1JF58|I1JF58_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.13613
PE,65.15,0,DUF4033,Domain of unknown function DUF4033,CUFF.32393.1
         (264 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G03055.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   250   8e-67
AT1G64680.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   163   9e-41
AT1G03055.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   140   1e-33
AT4G01995.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   116   2e-26

>AT1G03055.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G64680.1); Has 143 Blast hits to 143 proteins
           in 26 species: Archae - 0; Bacteria - 6; Metazoa - 0;
           Fungi - 0; Plants - 122; Viruses - 0; Other Eukaryotes -
           15 (source: NCBI BLink). | chr1:710102-711763 REVERSE
           LENGTH=264
          Length = 264

 Score =  250 bits (638), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 114/214 (53%), Positives = 159/214 (74%), Gaps = 2/214 (0%)

Query: 48  AARKTIGESDSPMKTNIYKDNWFDRLAINHLSKSVQEATGVINHK--SGYEGLVEAANVA 105
           AA++T     S  K    +D++F ++AIN+LSK++Q+A G+ +    + Y+ LV+ A   
Sbjct: 48  AAKETARIETSNTKNASIEDSFFSKIAINYLSKNLQDAAGISSSSKSTDYDRLVDTATRV 107

Query: 106 KHKFSPVQQQEVVIQALDKAFPKPILDLIKTLLPPSKFAREYYAVFTTLFFAWLVGPSEV 165
              F   QQ E V+ +LD+A P  I  LIK   PPSK +RE +A+FTT+ FAWLVGPSEV
Sbjct: 108 SRNFDTKQQHEFVLSSLDRALPTVISSLIKMAFPPSKVSRELFALFTTISFAWLVGPSEV 167

Query: 166 RESEVNGRREKNVVYVKKCRFLEATNCVGMCTNICKMPSQSFIKDSLGMSFNMVPNFDDM 225
           RE+EVNGR+EK+VVY++KCRFLE +NCVGMCT+ICK+PSQ FIK+SLGM   M P+F+D+
Sbjct: 168 RETEVNGRKEKSVVYIEKCRFLEQSNCVGMCTHICKIPSQIFIKNSLGMPIYMEPDFNDL 227

Query: 226 SCEMIFGQDPPALADDPALKQPCYKLCKAYKQHG 259
           SC+M+FG++PP + DDPA+KQPC++ CK+ K +G
Sbjct: 228 SCKMMFGREPPEIEDDPAMKQPCFEFCKSNKSYG 261


>AT1G64680.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G03055.1); Has 146 Blast hits to 146 proteins
           in 26 species: Archae - 0; Bacteria - 6; Metazoa - 0;
           Fungi - 0; Plants - 125; Viruses - 0; Other Eukaryotes -
           15 (source: NCBI BLink). | chr1:24036071-24037062
           FORWARD LENGTH=250
          Length = 250

 Score =  163 bits (413), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 78/169 (46%), Positives = 103/169 (60%), Gaps = 1/169 (0%)

Query: 95  YEGLVEAANVAKHKFSPVQQQEVVIQALDKAFPKPILDLIKTLLPPSKFAREYYAVFTTL 154
           YE  VE +       S VQQQE V + L    P    +  + L PP+K+A E+ A  T  
Sbjct: 77  YESFVEVSKRVMQGRSRVQQQEAVREVLLSMLPPGAPEQFRKLFPPTKWAAEFNAALTVP 136

Query: 155 FFAWLVGPSEVRESEVNGRREKNVVYVKKCRFLEATNCVGMCTNICKMPSQSFIKDSLGM 214
           FF WLVGPS+V E EVNG ++++ V +KKCR+LE + CVGMC N+CK+P+Q F  +  G+
Sbjct: 137 FFHWLVGPSQVIEVEVNGVKQRSGVRIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGL 196

Query: 215 SFNMVPNFDDMSCEMIFGQDPPALADDPALKQPCYK-LCKAYKQHGPSC 262
              M PN++DMSCEMI+GQ PPA  +D A KQPC   +C       P C
Sbjct: 197 PLTMNPNYEDMSCEMIYGQAPPAFEEDVATKQPCLADICSMSNPSSPIC 245


>AT1G03055.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G64680.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr1:710912-711763 REVERSE LENGTH=200
          Length = 200

 Score =  140 bits (352), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 68/141 (48%), Positives = 96/141 (68%), Gaps = 2/141 (1%)

Query: 48  AARKTIGESDSPMKTNIYKDNWFDRLAINHLSKSVQEATGVINHK--SGYEGLVEAANVA 105
           AA++T     S  K    +D++F ++AIN+LSK++Q+A G+ +    + Y+ LV+ A   
Sbjct: 48  AAKETARIETSNTKNASIEDSFFSKIAINYLSKNLQDAAGISSSSKSTDYDRLVDTATRV 107

Query: 106 KHKFSPVQQQEVVIQALDKAFPKPILDLIKTLLPPSKFAREYYAVFTTLFFAWLVGPSEV 165
              F   QQ E V+ +LD+A P  I  LIK   PPSK +RE +A+FTT+ FAWLVGPSEV
Sbjct: 108 SRNFDTKQQHEFVLSSLDRALPTVISSLIKMAFPPSKVSRELFALFTTISFAWLVGPSEV 167

Query: 166 RESEVNGRREKNVVYVKKCRF 186
           RE+EVNGR+EK+VVY++KCR 
Sbjct: 168 RETEVNGRKEKSVVYIEKCRL 188


>AT4G01995.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G64680.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr4:873075-874619 FORWARD LENGTH=258
          Length = 258

 Score =  116 bits (290), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 69/193 (35%), Positives = 102/193 (52%), Gaps = 7/193 (3%)

Query: 65  YKDNWFDRLAINHLSKSVQEATGVINHKSGYEGLVEAANVAKHKF-SPVQQQEVVIQALD 123
           YK    D   +      + E  G  + K GY GL+E   +   K  +  +  +  ++ L 
Sbjct: 52  YKPGPLDDFFMQSFRNKLVEEVGSDSEKPGYVGLIELVKLLLLKGRTRSETSDAAVRILK 111

Query: 124 KAFPKPILDLIKTLLPP---SKFAREYYAVFTTLFFAWLVGPSEVRESEV-NGRREKNVV 179
             FP  IL+L K L+ P    K A    A  T L   WL+GPS+V   ++ NG    + V
Sbjct: 112 SLFPPLILELYKLLIAPIAQGKLAALMVARVTVLTCQWLMGPSKVNIIDLPNGESWDSGV 171

Query: 180 YVKKCRFLEATNCVGMCTNICKMPSQSFIKDSLGMSFNMVPNFDDMSCEMIFGQDPPALA 239
           +V+KC++LE + CVG+C N CK+P+Q+F KD +G+   M PNF D SC+  FG  PP   
Sbjct: 172 FVEKCQYLEESKCVGVCINTCKLPTQTFFKDYMGVPLVMEPNFKDYSCQFKFGVAPP--E 229

Query: 240 DDPALKQPCYKLC 252
           DD  + +PC++ C
Sbjct: 230 DDGNVNEPCFETC 242
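

Note: the summary table near the top of this report ("Sequences producing significant alignments") can be read programmatically. The following is a minimal sketch, not part of the BLAST output, and it assumes each hit row starts with the subject ID and ends with the bit score and E-value as the last two whitespace-separated fields. The function name parse_hit_rows is hypothetical.

```python
def parse_hit_rows(lines):
    """Return (subject_id, bit_score, e_value) tuples from summary-table rows.

    Assumption: a hit row has the subject ID as its first field and the
    bit score and E-value as its last two fields. Lines that do not fit
    (headers, blank lines) are skipped.
    """
    hits = []
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue
        bits_text, evalue_text = fields[-2], fields[-1]
        # Legacy BLAST may print very small E-values as "e-100";
        # prefix a "1" so float() can read them.
        if evalue_text.startswith("e"):
            evalue_text = "1" + evalue_text
        try:
            bits = float(bits_text)
            evalue = float(evalue_text)
        except ValueError:
            continue  # not a hit row
        hits.append((fields[0], bits, evalue))
    return hits


# Rows copied from the summary table of this report.
rows = [
    "Sequences producing significant alignments:                      (bits) Value",
    "",
    "AT1G03055.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   250   8e-67",
    "AT1G64680.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   163   9e-41",
    "AT1G03055.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   140   1e-33",
    "AT4G01995.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   116   2e-26",
]

for subject_id, bits, evalue in parse_hit_rows(rows):
    print(subject_id, bits, evalue)
```

Splitting on whitespace rather than fixed columns is a deliberate choice here, because the description field is truncated to a variable width in this layout.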