Miyakogusa Predicted Gene

Lj0g3v0201229.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0201229.1 tr|Q655R5|Q655R5_ORYSJ Dentin
sialophosphoprotein-like OS=Oryza sativa subsp. japonica
GN=P0686E06.3,35.74,2e-18, ,CUFF.12776.1
         (260 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G06660.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   161   5e-40
AT2G30820.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   158   3e-39
AT2G30820.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   158   3e-39
AT1G04030.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    52   6e-07

>AT1G06660.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G30820.2); Has 166 Blast hits to 144 proteins
           in 35 species: Archae - 0; Bacteria - 17; Metazoa - 13;
           Fungi - 20; Plants - 104; Viruses - 0; Other Eukaryotes
           - 12 (source: NCBI BLink). | chr1:2037461-2040148
           REVERSE LENGTH=481
          Length = 481

 Score =  161 bits (407), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 119/270 (44%), Positives = 150/270 (55%), Gaps = 50/270 (18%)

Query: 24  TETERKNKSVRFECVSSDDGDVKKIKSPNNQSACKP-----------SPYPTPLKLFDEM 72
           T T  KNKSVRFEC   D        S  N S+ KP           SP PTPLKL DEM
Sbjct: 214 TFTAGKNKSVRFEC---DLDQSNSSNSSENGSSRKPEMGGKICFTVSSPNPTPLKLSDEM 270

Query: 73  QTPGTVYPASLEKSRDCKHRVRSQFVYSNDNPSENVFLSKILEEKDLNTEQDSSELSDYV 132
           QTPGT+YPA++E     + R+RSQFV+S  N  EN  L K+         +DS E  DY 
Sbjct: 271 QTPGTIYPANMESGGRGRPRIRSQFVHSVSNIMENASLYKVY--------KDSHEGLDYE 322

Query: 133 KQAQSATPTPE---KGLKKFANENESVMEASLSSWLR-------------PASVIM---E 173
           +Q ++ TP+ E   + +++ ++E  S  EAS S WL              P   ++   +
Sbjct: 323 EQIEAETPSSETYGEKVEESSDEKLSKFEASFSPWLNQINENIAALNERTPGVGVITPGD 382

Query: 174 DKLVGVVAAHGNEDEDSHISHPKWWDGNGIPNSTTKYKEDQKVMWHATPFEERLNKALVE 233
             ++G+VAA   E+E + IS PK WDGNGIPNSTTKYKEDQKV WHATPFE RL KAL E
Sbjct: 383 RPIIGLVAAQWIENEQTEIS-PKMWDGNGIPNSTTKYKEDQKVSWHATPFEVRLEKALSE 441

Query: 234 ---GNVISQRKLVCGKPVAFEEIDESDTAL 260
               ++  QRKL     V  EE+ E DT +
Sbjct: 442 EGGQSLFPQRKL----EVMMEEV-EGDTDI 466


>AT2G30820.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G06660.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr2:13129800-13132391 FORWARD LENGTH=421
          Length = 421

 Score =  158 bits (400), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 101/225 (44%), Positives = 136/225 (60%), Gaps = 25/225 (11%)

Query: 24  TETERKNKSVRFECVSSDDGDVKKIKSPNNQSACKP-----------SPYPTPLKLFDEM 72
           T    K KSVR E    D G      S  N++A KP           SPY T +KL DE+
Sbjct: 173 TGVREKTKSVRSEI---DFGQSYSSSSSKNRTASKPEMVGKTSISATSPYTTSMKLSDEI 229

Query: 73  QTPGTVYPASLEKSRDCKHRVRSQFVYSNDNPSENVFLSKILEEKDLNTEQDSSELSDYV 132
           QTPGT++PA++E +   + R+RSQFV+S  N   N  L K+ E+ + N EQ  +++  Y 
Sbjct: 230 QTPGTIFPANMESAGRERRRIRSQFVHSASNLIVNASLCKLHEDSNANLEQ--AKVQAYK 287

Query: 133 KQAQSATPTPE---KGLKKFANENESVMEASLSSWLRPASVIMEDK-LVGVVAAHGNEDE 188
           ++ ++ +PT     + L++ ++  + + E S S    P S+   D+ ++G+VAAH NE E
Sbjct: 288 EKTENESPTSTICGEKLEESSDGKKQIGEISSS----PLSINPGDRPIIGMVAAHWNEKE 343

Query: 189 DSHISHPKWWDGNGIPNSTTKYKEDQKVMWHATPFEERLNKALVE 233
            S IS PKWWDGNGIPNST KYKEDQKV WHATPFEERL KAL E
Sbjct: 344 HSQIS-PKWWDGNGIPNSTNKYKEDQKVSWHATPFEERLEKALSE 387


>AT2G30820.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G06660.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr2:13129800-13132391 FORWARD LENGTH=421
          Length = 421

 Score =  158 bits (400), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 101/225 (44%), Positives = 136/225 (60%), Gaps = 25/225 (11%)

Query: 24  TETERKNKSVRFECVSSDDGDVKKIKSPNNQSACKP-----------SPYPTPLKLFDEM 72
           T    K KSVR E    D G      S  N++A KP           SPY T +KL DE+
Sbjct: 173 TGVREKTKSVRSEI---DFGQSYSSSSSKNRTASKPEMVGKTSISATSPYTTSMKLSDEI 229

Query: 73  QTPGTVYPASLEKSRDCKHRVRSQFVYSNDNPSENVFLSKILEEKDLNTEQDSSELSDYV 132
           QTPGT++PA++E +   + R+RSQFV+S  N   N  L K+ E+ + N EQ  +++  Y 
Sbjct: 230 QTPGTIFPANMESAGRERRRIRSQFVHSASNLIVNASLCKLHEDSNANLEQ--AKVQAYK 287

Query: 133 KQAQSATPTPE---KGLKKFANENESVMEASLSSWLRPASVIMEDK-LVGVVAAHGNEDE 188
           ++ ++ +PT     + L++ ++  + + E S S    P S+   D+ ++G+VAAH NE E
Sbjct: 288 EKTENESPTSTICGEKLEESSDGKKQIGEISSS----PLSINPGDRPIIGMVAAHWNEKE 343

Query: 189 DSHISHPKWWDGNGIPNSTTKYKEDQKVMWHATPFEERLNKALVE 233
            S IS PKWWDGNGIPNST KYKEDQKV WHATPFEERL KAL E
Sbjct: 344 HSQIS-PKWWDGNGIPNSTNKYKEDQKVSWHATPFEERLEKALSE 387


>AT1G04030.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G44040.1); Has 1835 Blast hits to 1511 proteins
           in 238 species: Archae - 7; Bacteria - 164; Metazoa -
           377; Fungi - 135; Plants - 187; Viruses - 22; Other
           Eukaryotes - 943 (source: NCBI BLink). |
           chr1:1040597-1042313 FORWARD LENGTH=434
          Length = 434

 Score = 51.6 bits (122), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 26/30 (86%)

Query: 202 GIPNSTTKYKEDQKVMWHATPFEERLNKAL 231
           GIPN+++KY+ED+ V WH+TPFE RL KAL
Sbjct: 400 GIPNTSSKYREDKSVNWHSTPFEARLEKAL 429