Miyakogusa Predicted Gene

Lj5g3v0279280.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0279280.1 Non Chatacterized Hit- tr|B6SWB0|B6SWB0_MAIZE
Putative uncharacterized protein OS=Zea mays PE=2
SV=1,39.7,9e-17,SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.52707.1
         (473 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G76660.1 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   454   e-128
AT5G52430.1 | Symbols:  | hydroxyproline-rich glycoprotein famil...   134   1e-31
AT1G63720.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   108   1e-23
AT4G25620.1 | Symbols:  | hydroxyproline-rich glycoprotein famil...    91   2e-18

>AT1G76660.1 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           plasma membrane; EXPRESSED IN: 22 plant structures;
           EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
           thaliana protein match is: hydroxyproline-rich
           glycoprotein family protein (TAIR:AT5G52430.1); Has 353
           Blast hits to 231 proteins in 60 species: Archae - 0;
           Bacteria - 6; Metazoa - 57; Fungi - 22; Plants - 125;
           Viruses - 4; Other Eukaryotes - 139 (source: NCBI
           BLink). | chr1:28769157-28771036 REVERSE LENGTH=431
          Length = 431

 Score =  454 bits (1167), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 263/476 (55%), Positives = 310/476 (65%), Gaps = 57/476 (11%)

Query: 1   MGSEQNRFPQHERRKRWGGCWGAFSCFGSQKGGKRIVPASRIPDS-NGSAAQPNGPQAVG 59
           MGSEQ      ++RKRWGGC G FSCF SQKGGKRIVPASRIP+  N SA+QPNG    G
Sbjct: 1   MGSEQ------DQRKRWGGCLGVFSCFKSQKGGKRIVPASRIPEGGNVSASQPNGAHQAG 54

Query: 60  LTNQTT--GXXXXXXXXXXXXXXFTHSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPY 117
           + N     G              FT+SALPST QSP+C+LSL+ANSPGGPSS+M+ATGPY
Sbjct: 55  VLNNQAAGGINLSLLAPPSSPASFTNSALPSTTQSPNCYLSLAANSPGGPSSSMYATGPY 114

Query: 118 AHETQLVSPPVFSNFTTEPSTAPLTPPPELAHLTTPSSPDVPFAHFLSSSADLKNRDKNN 177
           AHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A FL+SS DLKN  K +
Sbjct: 115 AHETQLVSPPVFSTFTTEPSTAPFTPPPELARLTAPSSPDVPYARFLTSSMDLKNSGKGH 174

Query: 178 YINANDLQATYSLYPGSPASSLISPISRNSGDCLSSSFPEREFRPQWNSSIPLENGKYQR 237
           Y   NDLQATYSLYPGSPAS+L SPISR SGD L S        PQ        NGK  R
Sbjct: 175 Y---NDLQATYSLYPGSPASALRSPISRASGDGLLS--------PQ--------NGKCSR 215

Query: 238 TGSGRVSGHDTNSVTIASQDTNFFCPATYAQFYLDQNPPFPHNGGRLSVSKDSDVQSTG- 296
           + SG   G+DTN V+   Q++NFFCP T+A+FYLD +P  P NGGRLSVSKDSDV  T  
Sbjct: 216 SDSGNTFGYDTNGVSTPLQESNFFCPETFAKFYLDHDPSVPQNGGRLSVSKDSDVYPTNG 275

Query: 297 -GNGHQSRHARSPKQDVEEIEAYRASFGFSADEIISTSQYVEISDVMDDSFTMMPFTTGK 355
            GNG+Q+R  RSPKQD+EE+EAYRASFGFSADEII+TSQYVEI+DVMD SF    ++   
Sbjct: 276 YGNGNQNRQNRSPKQDMEELEAYRASFGFSADEIITTSQYVEITDVMDGSFNTSAYS--- 332

Query: 356 SAMEESIEPSLMKGFKAHETHVAMQNLKSLRLDPGPVGKEARSPVPICHGYEDNKSPGYC 415
                   PS  +     E ++  Q       D      + +SP    + Y+D+K     
Sbjct: 333 --------PSDGQKLLRREANLLSQTSPKSEADLDSQVVDFQSPKS-SNSYKDHKQ---- 379

Query: 416 SNSSGLSTPENHTLLDDEDIFSKMGSSRICRKYQMGLSCSDAEVDYRRGRSLREGK 471
                     N    D+E + S++GS +  R Y   +S SDAEV+YRRGRSLRE +
Sbjct: 380 ---------RNRIHADEEALLSRVGSVKGSRSYH--ISSSDAEVEYRRGRSLRESR 424


>AT5G52430.1 | Symbols:  | hydroxyproline-rich glycoprotein family
           protein | chr5:21283093-21285045 REVERSE LENGTH=438
          Length = 438

 Score =  134 bits (338), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 97/228 (42%), Positives = 122/228 (53%), Gaps = 21/228 (9%)

Query: 9   PQHERRKRWGGCWGAFSCFGSQKGGKRIVPASRIPDSNGSAAQPNGPQAVGLTNQTTGXX 68
           P   ++ RWG CW  +SCFG+QK  KRI  A  +P+   S     G   V + N  T   
Sbjct: 28  PSSSQKGRWGKCWSLYSCFGTQKNNKRIGNAVLVPEPVTS-----GVPVVTVQNSATSTT 82

Query: 69  XXXXXXX--XXXXXFTHSALPSTAQSPSCFLSLSAN--SPGGPSSTMFATGPYAHETQLV 124
                         F  S   S + SP   LSL++N  SP  P S +F  GPYA+ETQ V
Sbjct: 83  VVLPFIAPPSSPASFLQSDPSSVSHSPVGPLSLTSNTFSPKEPQS-VFTVGPYANETQPV 141

Query: 125 SPPVFSNFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAHFLSSSADLKNRDKNNYINAND 183
           +PPVFS F TEPSTAP TPPPE + H+TTPSSP+VPFA  L+SS +L  RD  + +N   
Sbjct: 142 TPPVFSAFITEPSTAPYTPPPESSVHITTPSSPEVPFAQLLTSSLELTRRDSTSGMNQKF 201

Query: 184 LQATYS-----LYPGSP-ASSLISPISRNSGDCLSSSFPER----EFR 221
             + Y      + PGSP   +LISP S  S    SS +P +    EFR
Sbjct: 202 SSSHYEFRSNQVCPGSPGGGNLISPGSVISNSGTSSPYPGKSPMVEFR 249


>AT1G63720.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: hydroxyproline-rich glycoprotein family protein
           (TAIR:AT5G52430.1); Has 490 Blast hits to 394 proteins
           in 96 species: Archae - 0; Bacteria - 2; Metazoa - 132;
           Fungi - 88; Plants - 175; Viruses - 14; Other Eukaryotes
           - 79 (source: NCBI BLink). | chr1:23636122-23637348
           REVERSE LENGTH=358
          Length = 358

 Score =  108 bits (269), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 81/215 (37%), Positives = 110/215 (51%), Gaps = 11/215 (5%)

Query: 9   PQHERRKRWGGCWGAFSCFGSQKGGKRIVPASRIPDSNGSAAQPNGPQAVGLTNQTTGXX 68
           P H++RK W   W    CFGS +  KRI  +  +P+    ++  +     G  +  T   
Sbjct: 33  PIHKKRKWWNR-WSLLKCFGSSRQRKRIGNSVLVPEPVSMSSSNSTTSNSGYRSVIT--T 89

Query: 69  XXXXXXXXXXXXFTHSALPSTAQSPSCFLSLSANSPGGPSSTMFATGPYAHETQLVSPPV 128
                       F  S  PS  QSP   LS S   P     ++FA GPYAHETQLVSPPV
Sbjct: 90  LPFIAPPSSPASFFQSEPPSATQSPVGILSFSP-LPCNNRPSIFAIGPYAHETQLVSPPV 148

Query: 129 FSNFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAHFLSSSADLKNRDKNNYINANDL 184
           FS +TTEPS+AP+TPP + + +    TTPSSP+VPFA   +S+    +      ++++  
Sbjct: 149 FSTYTTEPSSAPITPPLDDSSIYLTTTTPSSPEVPFAQLFNSNHQTGSYGYKFPMSSSYE 208

Query: 185 QATYSLYPGSPASSLISPISRNSGDCLSSSFPERE 219
              Y L PGSP   LISP   + G   +S FP+ E
Sbjct: 209 FQFYQLPPGSPLGQLISP---SPGSGPTSPFPDGE 240


>AT4G25620.1 | Symbols:  | hydroxyproline-rich glycoprotein family
           protein | chr4:13067447-13069296 REVERSE LENGTH=449
          Length = 449

 Score = 90.9 bits (224), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 87/234 (37%), Positives = 116/234 (49%), Gaps = 32/234 (13%)

Query: 1   MGSEQNRFPQHERRKRWGGCWGAFSCFGSQKGGKRIVPASRIPD--SNGSAAQPNGPQAV 58
           + +E    P   ++KR G  W  + CFGS+K  KRI  A  +P+  ++G+A  P      
Sbjct: 20  VSAESRTQPSSVQKKR-GSWWSLYWCFGSKKNNKRIGHAVLVPEPAASGAAVAP----VQ 74

Query: 59  GLTNQTTGXXXXXXXXXXXXXXFTHSALPSTAQSP--SCFLSLSANSPGGPSSTMFATGP 116
             ++ +T               F  S  PS + +P      SL+ N P  PS+  F  GP
Sbjct: 75  NSSSNSTSIFMPFIAPPSSPASFLPSGPPSASHTPDPGLLCSLTVNEP--PSA--FTIGP 130

Query: 117 YAHETQLVSPPVFSNFTTEPSTAPLTPPPELAHLTTPSSPDVPFAHFLSSSADLKNRDKN 176
           YAHETQ V+PPVFS FTTEPSTAP T         +PSSP+VPFA  L+SS +   R+  
Sbjct: 131 YAHETQPVTPPVFSAFTTEPSTAPFT-----PPPESPSSPEVPFAQLLTSSLERARRNSG 185

Query: 177 NYINANDLQATYS-----LYPGSPASSLISPISRNSGDCLSSSFPER----EFR 221
             +N     A Y      +YPGSP  +LISP     G   SS +P +    EFR
Sbjct: 186 GGMNQKFSAAHYEFKSCQVYPGSPGGNLISP-----GSGTSSPYPGKCSIIEFR 234