Miyakogusa Predicted Gene
- Lj0g3v0115719.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0115719.1 Non Characterized Hit- tr|D7LC59|D7LC59_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,31.22,8e-19,LEA_2,Late embryogenesis abundant protein, LEA-14;
seg,NULL,gene.g8746.t1.1
(202 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT2G30505.1 | Symbols: | Late embryogenesis abundant (LEA) hydr...    87   8e-18
AT4G01110.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    59   2e-09
AT2G46300.1 | Symbols: | Late embryogenesis abundant (LEA) hydr...    51   5e-07
AT1G01453.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    47   7e-06
AT1G01453.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    47   8e-06
>AT2G30505.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr2:13001121-13002086 REVERSE LENGTH=321
Length = 321
Score = 87.0 bits (214), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 89/189 (47%), Gaps = 16/189 (8%)
Query: 27 FFACCAWGCXXXXXXXXXXXXAGISYLGFLKAGMPKV---------------DSSQKMDA 71
F CCA C G+S +K+ +P+V + M+A
Sbjct: 133 FRKCCACTCMFVSVVLIIVLLVGLSANSSIKSILPQVLVTNLKFSRLDIAKSSTDLLMNA 192
Query: 72 DISLGLRISNKNEKLKLLYGPLSVDVTSEDVPLGMAKLKGFSQMPKNDTDLDMTMALHNA 131
+++ L++SN N+K L Y P+ D++SE++ LG L GF Q P N T L + L +
Sbjct: 193 NLNTVLQLSNNNDKTVLYYSPMKADISSENINLGKKTLSGFKQDPGNVTSLKILTRLRKS 252
Query: 132 DVDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCHQIKQMDVDFGRR 191
V A L + E + DV++ G + + ++ IP + +C +KQ DV G +
Sbjct: 253 KVYDVDATLLTNKEKTLEALVDVFLRGKLSVDWLGFKV-HIPIVIACESVKQSDVINGLK 311
Query: 192 PECDVKMFA 200
P CDV++F+
Sbjct: 312 PACDVRIFS 320
>AT4G01110.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G01453.1); Has 273 Blast hits to 272 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 273; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr4:480176-481056 REVERSE
LENGTH=261
Length = 261
Score = 58.9 bits (141), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 68/142 (47%), Gaps = 5/142 (3%)
Query: 64 DSSQKMDADISLGLRISNKNEKLKLLYGPLSVDVT-SED---VPLGMAKLKGFSQMPKND 119
D ++ A+ + L N N KL+ YG + V V+ ED LG K+KGF + P N
Sbjct: 116 DGLSQLTAEATARLDFRNPNGKLRYYYGNVDVAVSVGEDDFETSLGSTKVKGFVEKPGNR 175
Query: 120 TDLDMTMALHNADVDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCH 179
T + + + + VD L++D+ + ++V V +GL VG ++ + SC
Sbjct: 176 TVVIVPIKVKKQQVDDPTVKRLRADMKSKKLVVKVMAKTKVGLGVGRRKIVTVGVTISCG 235
Query: 180 QIKQMDVDFGRRPECDVKMFAF 201
++ +D + +C +KM +
Sbjct: 236 GVRLQTLD-SKMSKCTIKMLKW 256
>AT2G46300.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr2:19008392-19009247 FORWARD LENGTH=252
Length = 252
Score = 50.8 bits (120), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 60/124 (48%), Gaps = 9/124 (7%)
Query: 77 LRISNKNEKLKLLYGPLSVDVT----SEDVPLGMAKLKGFSQMPKNDTDLDMTMALHNAD 132
+ + N N KL YG +VD++ +++ +G + GF Q PKN T + + + N
Sbjct: 121 VEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKVETTVKNQL 180
Query: 133 VDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCH--QIKQMDVDFGR 190
V++ A L + + ++V +V +GL VG +++ + C + ++D D
Sbjct: 181 VERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLNKLDTD--- 237
Query: 191 RPEC 194
P+C
Sbjct: 238 SPKC 241
>AT1G01453.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT4G01110.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr1:166853-167798 REVERSE
LENGTH=267
Length = 267
Score = 47.4 bits (111), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 59/137 (43%), Gaps = 5/137 (3%)
Query: 69 MDADISLGLRISNKNEKLKLLYGPLSVDVT----SEDVPLGMAKLKGFSQMPKNDTDLDM 124
+ AD + L N N KL YG V V + L K+KGF + P N T + +
Sbjct: 127 LSADTTSVLDFRNPNGKLTFYYGDTDVAVILGEKDFETNLESTKVKGFIEKPGNRTAVIV 186
Query: 125 TMALHNADVDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCHQIKQM 184
+ VD A L+ ++ + +++ V +GL VGS ++ + C +
Sbjct: 187 PTTVRKRQVDDPTAKRLQVELKSKKLLVTVTAKTKVGLAVGSRKIVTVGVSLRCGGVILQ 246
Query: 185 DVDFGRRPECDVKMFAF 201
+D + +C +KM +
Sbjct: 247 TLD-SKMAQCTIKMLKW 262
>AT1G01453.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT4G01110.1);
Has 30201 Blast hits to 17322 proteins in 780 species:
Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
2996 (source: NCBI BLink). | chr1:166929-167798 REVERSE
LENGTH=289
Length = 289
Score = 47.0 bits (110), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 59/137 (43%), Gaps = 5/137 (3%)
Query: 69 MDADISLGLRISNKNEKLKLLYGPLSVDVT----SEDVPLGMAKLKGFSQMPKNDTDLDM 124
+ AD + L N N KL YG V V + L K+KGF + P N T + +
Sbjct: 127 LSADTTSVLDFRNPNGKLTFYYGDTDVAVILGEKDFETNLESTKVKGFIEKPGNRTAVIV 186
Query: 125 TMALHNADVDKYAADDLKSDINANEMVFDVYVSGHIGLKVGSLQMTDIPFLASCHQIKQM 184
+ VD A L+ ++ + +++ V +GL VGS ++ + C +
Sbjct: 187 PTTVRKRQVDDPTAKRLQVELKSKKLLVTVTAKTKVGLAVGSRKIVTVGVSLRCGGVILQ 246
Query: 185 DVDFGRRPECDVKMFAF 201
+D + +C +KM +
Sbjct: 247 TLD-SKMAQCTIKMLKW 262