Miyakogusa Predicted Gene

Lj3g3v0808820.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v0808820.1 Non Chatacterized Hit- tr|I1R3V5|I1R3V5_ORYGL
Uncharacterized protein (Fragment) OS=Oryza
glaberrima,42.39,1e-17,seg,NULL,
NODE_61267_length_1879_cov_14.011176.path1.1
         (379 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G51940.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   132   3e-31
AT5G03990.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   114   1e-25

>AT3G51940.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G03990.1); Has 215 Blast hits to 164 proteins
           in 38 species: Archae - 0; Bacteria - 35; Metazoa - 16;
           Fungi - 18; Plants - 121; Viruses - 0; Other Eukaryotes
           - 25 (source: NCBI BLink). | chr3:19273708-19275157
           REVERSE LENGTH=453
          Length = 453

 Score =  132 bits (333), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 111/353 (31%), Positives = 166/353 (47%), Gaps = 63/353 (17%)

Query: 38  GIPLWEKKYCTKIGSVPWQKIVDSK--KSMYCHSNVHNWNDSAAEEAFQNAKKRYWARIN 95
           GIP+WEK++C  IGSVPWQK+V++K  KS Y + NV  W+DSA E+ F N KKR+W+++N
Sbjct: 32  GIPVWEKRFCEVIGSVPWQKVVEAKDFKSWY-NGNVITWDDSACEDTFHNEKKRFWSQVN 90

Query: 96  NLPCDISLPDPESFIEQIDWNPYIDPEQIKELDKAFFVLPDEEQGDATKNKRTKTSVDDE 155
            L CD+S+PDP+ +I ++DW+ ++DPE I++L+KA+F  PD+       N   K    D+
Sbjct: 91  GLHCDVSIPDPDLYISEVDWDTFVDPELIRDLEKAYFAPPDD------VNIGFKRGRGDK 144

Query: 156 DAWKPTGTPLSRVLENKEWNQEDYHD----DSGNMDNTDNPWE---CSVTRQNGGLTGND 208
           +       P +R+LE    N +D  +     SG      + WE   C V  +      ND
Sbjct: 145 NWSGCDTVPEARMLETPWKNSDDIIETGKKSSGWNLTEGSSWEAKPCCVNEK-----AND 199

Query: 209 NPWECSVTPQNGGLTDNSWKGDHAQSWGWNEGRDHDNQCRDWNSGFSQKDKGWGKVGCSS 268
                  T   G LT   W+         N+    D     W      KD GW K G  +
Sbjct: 200 -------TASGGCLTTEEWRE--------NQWIAKDRVNDSWEYSGQGKDDGWDKSGHQN 244

Query: 269 WSQQQSNDWASFSNSWGCKSSQQNVTPVNTGWGNRGANVSGWKQQ------------ENT 316
              + S ++    N W  + S    T  +T WG  G +  GW+ +            +N 
Sbjct: 245 KKVKGSEEYKKIDNPWEAQPSCIKETAKDTTWG--GCSGEGWEDRGWNNDSWGSGGWDNR 302

Query: 317 DLS-RGLQFK-----------RNNGGCSAWNQSYQRREGSFRHNSGYNSSQFQ 357
           DL  +G++ K           R   GC+ W   +     +FR  SG N+  +Q
Sbjct: 303 DLGNQGMEMKEWRGKGYSRDFREPKGCNPWKGGFVPDNVAFRE-SGVNAGGWQ 354


>AT5G03990.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G51940.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:1075957-1077358 FORWARD LENGTH=302
          Length = 302

 Score =  114 bits (285), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 48/101 (47%), Positives = 73/101 (72%)

Query: 39  IPLWEKKYCTKIGSVPWQKIVDSKKSMYCHSNVHNWNDSAAEEAFQNAKKRYWARINNLP 98
           +P WEK +C  IGSVPW K+V++K+ M+ +  V  W+DSA E+AF+NAK R+WA IN L 
Sbjct: 41  VPAWEKDFCAVIGSVPWWKVVEAKRFMHIYDRVVQWDDSAGEDAFKNAKSRFWAEINGLT 100

Query: 99  CDISLPDPESFIEQIDWNPYIDPEQIKELDKAFFVLPDEEQ 139
           CD+SLPDP+ +I+ +DW+  +D E I +L++    L +E++
Sbjct: 101 CDLSLPDPDVYIDDVDWDAEVDNELILDLERGPDPLTEEQE 141