Miyakogusa Predicted Gene

Lj0g3v0115719.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0115719.1 Non Chatacterized Hit- tr|D7LC59|D7LC59_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,31.22,8e-19,LEA_2,Late embryogenesis abundant protein, LEA-14;
seg,NULL,gene.g8746.t1.1
         (202 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G30505.1 | Symbols:  | Late embryogenesis abundant (LEA) hydr...    87   8e-18
AT4G01110.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    59   2e-09
AT2G46300.1 | Symbols:  | Late embryogenesis abundant (LEA) hydr...    51   5e-07
AT1G01453.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    47   7e-06
AT1G01453.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    47   8e-06

>AT2G30505.1 | Symbols:  | Late embryogenesis abundant (LEA)
           hydroxyproline-rich glycoprotein family |
           chr2:13001121-13002086 REVERSE LENGTH=321
          Length = 321

 Score = 87.0 bits (214), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 89/189 (47%), Gaps = 16/189 (8%)

Query: 27  FFACCAWGCXXXXXXXXXXXXAGISYLGFLKAGMPKV---------------DSSQKMDA 71
           F  CCA  C             G+S    +K+ +P+V                +   M+A
Sbjct: 133 FRKCCACTCMFVSVVLIIVLLVGLSANSSIKSILPQVLVTNLKFSRLDIAKSSTDLLMNA 192

Query: 72  DISLGLRISNKNEKLKLLYGPLSVDVTSEDVPLGMAKLKGFSQMPKNDTDLDMTMALHNA 131
           +++  L++SN N+K  L Y P+  D++SE++ LG   L GF Q P N T L +   L  +
Sbjct: 193 NLNTVLQLSNNNDKTVLYYSPMKADISSENINLGKKTLSGFKQDPGNVTSLKILTRLRKS 252

Query: 132 DVDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCHQIKQMDVDFGRR 191
            V    A  L +     E + DV++ G + +     ++  IP + +C  +KQ DV  G +
Sbjct: 253 KVYDVDATLLTNKEKTLEALVDVFLRGKLSVDWLGFKV-HIPIVIACESVKQSDVINGLK 311

Query: 192 PECDVKMFA 200
           P CDV++F+
Sbjct: 312 PACDVRIFS 320


>AT4G01110.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G01453.1); Has 273 Blast hits to 272 proteins
           in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 273; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr4:480176-481056 REVERSE
           LENGTH=261
          Length = 261

 Score = 58.9 bits (141), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 68/142 (47%), Gaps = 5/142 (3%)

Query: 64  DSSQKMDADISLGLRISNKNEKLKLLYGPLSVDVT-SED---VPLGMAKLKGFSQMPKND 119
           D   ++ A+ +  L   N N KL+  YG + V V+  ED     LG  K+KGF + P N 
Sbjct: 116 DGLSQLTAEATARLDFRNPNGKLRYYYGNVDVAVSVGEDDFETSLGSTKVKGFVEKPGNR 175

Query: 120 TDLDMTMALHNADVDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCH 179
           T + + + +    VD      L++D+ + ++V  V     +GL VG  ++  +    SC 
Sbjct: 176 TVVIVPIKVKKQQVDDPTVKRLRADMKSKKLVVKVMAKTKVGLGVGRRKIVTVGVTISCG 235

Query: 180 QIKQMDVDFGRRPECDVKMFAF 201
            ++   +D  +  +C +KM  +
Sbjct: 236 GVRLQTLD-SKMSKCTIKMLKW 256


>AT2G46300.1 | Symbols:  | Late embryogenesis abundant (LEA)
           hydroxyproline-rich glycoprotein family |
           chr2:19008392-19009247 FORWARD LENGTH=252
          Length = 252

 Score = 50.8 bits (120), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 60/124 (48%), Gaps = 9/124 (7%)

Query: 77  LRISNKNEKLKLLYGPLSVDVT----SEDVPLGMAKLKGFSQMPKNDTDLDMTMALHNAD 132
           + + N N KL   YG  +VD++    +++  +G   + GF Q PKN T + +   + N  
Sbjct: 121 VEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKVETTVKNQL 180

Query: 133 VDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCH--QIKQMDVDFGR 190
           V++  A  L +   + ++V +V     +GL VG +++  +     C    + ++D D   
Sbjct: 181 VERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLNKLDTD--- 237

Query: 191 RPEC 194
            P+C
Sbjct: 238 SPKC 241


>AT1G01453.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT4G01110.1);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr1:166853-167798 REVERSE
           LENGTH=267
          Length = 267

 Score = 47.4 bits (111), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 34/137 (24%), Positives = 59/137 (43%), Gaps = 5/137 (3%)

Query: 69  MDADISLGLRISNKNEKLKLLYGPLSVDVT----SEDVPLGMAKLKGFSQMPKNDTDLDM 124
           + AD +  L   N N KL   YG   V V       +  L   K+KGF + P N T + +
Sbjct: 127 LSADTTSVLDFRNPNGKLTFYYGDTDVAVILGEKDFETNLESTKVKGFIEKPGNRTAVIV 186

Query: 125 TMALHNADVDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCHQIKQM 184
              +    VD   A  L+ ++ + +++  V     +GL VGS ++  +     C  +   
Sbjct: 187 PTTVRKRQVDDPTAKRLQVELKSKKLLVTVTAKTKVGLAVGSRKIVTVGVSLRCGGVILQ 246

Query: 185 DVDFGRRPECDVKMFAF 201
            +D  +  +C +KM  +
Sbjct: 247 TLD-SKMAQCTIKMLKW 262


>AT1G01453.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT4G01110.1);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr1:166929-167798 REVERSE
           LENGTH=289
          Length = 289

 Score = 47.0 bits (110), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 34/137 (24%), Positives = 59/137 (43%), Gaps = 5/137 (3%)

Query: 69  MDADISLGLRISNKNEKLKLLYGPLSVDVT----SEDVPLGMAKLKGFSQMPKNDTDLDM 124
           + AD +  L   N N KL   YG   V V       +  L   K+KGF + P N T + +
Sbjct: 127 LSADTTSVLDFRNPNGKLTFYYGDTDVAVILGEKDFETNLESTKVKGFIEKPGNRTAVIV 186

Query: 125 TMALHNADVDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCHQIKQM 184
              +    VD   A  L+ ++ + +++  V     +GL VGS ++  +     C  +   
Sbjct: 187 PTTVRKRQVDDPTAKRLQVELKSKKLLVTVTAKTKVGLAVGSRKIVTVGVSLRCGGVILQ 246

Query: 185 DVDFGRRPECDVKMFAF 201
            +D  +  +C +KM  +
Sbjct: 247 TLD-SKMAQCTIKMLKW 262