Miyakogusa Predicted Gene

Lj3g3v3236620.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v3236620.1 tr|B9HPB2|B9HPB2_POPTR Predicted protein
OS=Populus trichocarpa GN=POPTRDRAFT_821523 PE=4
SV=1,30.08,4e-18,seg,NULL,CUFF.45474.1
         (356 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G59050.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    76   3e-14
AT3G54000.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Uncharacte...    56   4e-08
AT5G59050.2 | Symbols:  | unknown protein; Has 35333 Blast hits ...    50   3e-06

>AT5G59050.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G54000.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:23839494-23842519 REVERSE LENGTH=312
          Length = 312

 Score = 76.3 bits (186), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 109/354 (30%), Positives = 146/354 (41%), Gaps = 95/354 (26%)

Query: 25  LNPHFPPHSTNAQGRKDMDNDDDLLAELTHQMQMAQFMLQEDNKFDFP-GIGSENLEPWE 83
           L P F     ++   K+ D +D+ + ELT QM    +MLQ+D K     G GS       
Sbjct: 32  LRPDFVSDEPDSPKAKNEDEEDEYITELTRQM--TNYMLQDDEKHQKSCGSGS------- 82

Query: 84  SIDWSPQSTLWSPLCSKHGSSEGLSSKEPSPPTTPDNTRDWESVNDMVGMLEKM--KLDE 141
               SPQSTLWSP  S   S  G  S+EPSPP TP               +EK+  K+D 
Sbjct: 83  ---GSPQSTLWSPFASGLSSPIG-PSREPSPPLTPATVP-----------VEKIMTKIDT 127

Query: 142 RGSSKYHSGYGVGVCSDQSLFQEQIRATXXXXXXXXXXXXXXXXXTDH--SHKSQQFQXX 199
           +  +       +   S Q+L  +QIR+                        HK++ +   
Sbjct: 128 KPVT-------IPFQSKQALIDDQIRSIQANFQKIKKEKEKERQRNADVLGHKARNYHH- 179

Query: 200 XXXXXXXXXXXXXXXXXXXXXXXXXQTRPGSLHQ--QTGSEMQAIFPGSSGSRATSCGTG 257
                                          LHQ  +  S ++A+F   SGSR  S GTG
Sbjct: 180 -------------------------------LHQNQRPRSGVKAVFVDGSGSRTGSGGTG 208

Query: 258 VFLPRA-GTNAPSSSESRNKRPGNKGCPTVLIPARVVQALQLHFDQIAATSGPTKPAGFP 316
           VFLPR  GT      ESR K     GC TV+IPARVV+AL++HFD++   S  T  +  P
Sbjct: 209 VFLPRGHGTVV----ESRKK----SGCSTVIIPARVVEALKVHFDKLGVPS--TFSSDIP 258

Query: 317 ALNDVLV-------NNRNGLYSLQKRQSRRKPE-------HVQNDMILPREWTY 356
             +D L+          N   SL + QS    E       H +    LP+EWTY
Sbjct: 259 PFHDALLVSMNNKKIKSNKNTSLSRVQSGSPYEMEMSAESHQEPPADLPQEWTY 312


>AT3G54000.1 | Symbols:  | CONTAINS InterPro DOMAIN/s:
           Uncharacterised conserved protein UCP022260
           (InterPro:IPR016802); Has 94 Blast hits to 94 proteins
           in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr3:19996757-19998197 FORWARD
           LENGTH=352
          Length = 352

 Score = 55.8 bits (133), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/120 (36%), Positives = 59/120 (49%), Gaps = 16/120 (13%)

Query: 239 MQAIFPGSSGSRATSCGTGVFLPRAGTNAPSSSESRNKRPGNKGCPTVLIPARVVQALQL 298
           M+A+F G    +  S GTGVFLPR+  N  S +E+R K        TVL+PAR+ Q L L
Sbjct: 247 MRAVFIGDHTGKRGSTGTGVFLPRS-VNHTSRTETREK----PTISTVLVPARLAQVLNL 301

Query: 299 HFDQIAATSGPTKPAGFPALNDVLVNNR--NGLYSLQKRQSRRKPEHVQNDMILPREWTY 356
           +  +   ++          LNDV    R  NG +S Q     R  + VQ +  LP EW Y
Sbjct: 302 NLGEPVRSTA--------TLNDVSWRQRSNNGGFSSQMVGGVRAEQSVQ-EPRLPSEWAY 352


>AT5G59050.2 | Symbols:  | unknown protein; Has 35333 Blast hits to
           34131 proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:23841849-23842519 REVERSE LENGTH=193
          Length = 193

 Score = 49.7 bits (117), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 48/94 (51%), Gaps = 12/94 (12%)

Query: 25  LNPHFPPHSTNAQGRKDMDNDDDLLAELTHQMQMAQFMLQEDNKFDFPGIGSENLEPWES 84
           L P F     ++   K+ D +D+ + ELT QM    +MLQ+D K      GS +      
Sbjct: 32  LRPDFVSDEPDSPKAKNEDEEDEYITELTRQM--TNYMLQDDEKHQ-KSCGSGS------ 82

Query: 85  IDWSPQSTLWSPLCSKHGSSEGLSSKEPSPPTTP 118
              SPQSTLWSP  S   S  G  S+EPSPP TP
Sbjct: 83  --GSPQSTLWSPFASGLSSPIG-PSREPSPPLTP 113