Miyakogusa Predicted Gene
- Lj0g3v0275279.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0275279.1 tr|E4MWC7|E4MWC7_THEHA mRNA, clone: RTFL01-07-P23
OS=Thellungiella halophila PE=2 SV=1,30.84,9e-17,seg,NULL,CUFF.18241.1
(275 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT4G02920.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    77   1e-14
AT4G02920.2 | Symbols: | unknown protein; BEST Arabidopsis thal...    77   1e-14
>AT4G02920.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G03340.1); Has 41 Blast hits to 41 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 41; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr4:1292816-1294670 FORWARD
LENGTH=418
Length = 418
Score = 77.0 bits (188), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 108/230 (46%), Gaps = 8/230 (3%)
Query: 4   KLSLMASHGYP-SGLLLHQELGLPGSYMKGCQTLLPSPVAKPDMIRYQSPCLKPNLCEEL 62
           KL  M SHGY   GL L Q+L      +K  ++ L +P A+ ++I    P    NL  EL
Sbjct: 3   KLCFMTSHGYSIPGLGLPQDL-CNTEIIKNSRSHLVNPGARQEII----PASSFNLNTEL 57

Query: 63  TKIRSDCFDCNQFVNADFSTQRPVLLDVQAPCSEALLFGFGIVEKCTKHDQVLNFLMSGT 122
            +        +QFV  D +  +P+L+DV     E+L+  FGI +K  + ++V+ FL+S +
Sbjct: 58  LEPWKPVSSFSQFVEIDSAMMKPLLMDVHETAPESLILSFGIADKFARQEKVMEFLLSQS 117

Query: 123 AETGIGGANXXXXXXXXXXXXXGMDDPQQPLASFIYPYSKFDIQKSLLYFAQDPALSSKI 182
            E    G +                   +P  +    Y   ++ K +L   +D   + +
Sbjct: 118 EEFKEKGFDMSLLNELMEFESMKSSSQLRPYDTSSVLYLNQELGKPVLDLVRDMMENPEF 177

Query: 183 TVLPDGQITFMGTGI-EMKDLLSVVAESYLTK-TLHKGEKHSVLVPHFIR 230
           +V  +G + F  +   E+ DLLS+ +E  L++ +  K  + S L+PHF R
Sbjct: 178 SVRSNGHVLFSSSSNPELNDLLSIASEFNLSRNSTTKWRQLSPLIPHFQR 227
>AT4G02920.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G03340.1); Has 41 Blast hits to 41 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 41; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr4:1292816-1294670 FORWARD
LENGTH=419
Length = 419
Score = 76.6 bits (187), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 107/230 (46%), Gaps = 7/230 (3%)
Query: 4   KLSLMASHGYP-SGLLLHQELGLPGSYMKGCQTLLPSPVAKPDMIRYQSPCLKPNLCEEL 62
           KL  M SHGY   GL L Q+L       +  ++ L +P A+ ++I    P    NL  EL
Sbjct: 3   KLCFMTSHGYSIPGLGLPQDLCNTEIIKQNSRSHLVNPGARQEII----PASSFNLNTEL 58

Query: 63  TKIRSDCFDCNQFVNADFSTQRPVLLDVQAPCSEALLFGFGIVEKCTKHDQVLNFLMSGT 122
            +        +QFV  D +  +P+L+DV     E+L+  FGI +K  + ++V+ FL+S +
Sbjct: 59  LEPWKPVSSFSQFVEIDSAMMKPLLMDVHETAPESLILSFGIADKFARQEKVMEFLLSQS 118

Query: 123 AETGIGGANXXXXXXXXXXXXXGMDDPQQPLASFIYPYSKFDIQKSLLYFAQDPALSSKI 182
            E    G +                   +P  +    Y   ++ K +L   +D   + +
Sbjct: 119 EEFKEKGFDMSLLNELMEFESMKSSSQLRPYDTSSVLYLNQELGKPVLDLVRDMMENPEF 178

Query: 183 TVLPDGQITFMGTGI-EMKDLLSVVAESYLTK-TLHKGEKHSVLVPHFIR 230
           +V  +G + F  +   E+ DLLS+ +E  L++ +  K  + S L+PHF R
Sbjct: 179 SVRSNGHVLFSSSSNPELNDLLSIASEFNLSRNSTTKWRQLSPLIPHFQR 228
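
The percentages on the "Identities / Positives / Gaps" lines above can be rechecked from the raw counts. The following is a minimal Python sketch, not part of the BLAST output. The names `STAT_RE`, `parse_stats`, and `floor_percent` are hypothetical helpers introduced here for illustration. It assumes the percentages are obtained by integer truncation, which is consistent with every value in this report (for example, 64/230 is about 27.8% and is shown as 27%).

```python
import re

# Matches each "Name = count/length (percent%)" field on a statistics line.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field name: (count, alignment length, reported percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT_RE.findall(line)
    }


def floor_percent(count, length):
    """Percentage truncated to an integer (assumed rounding rule)."""
    return count * 100 // length


# Statistics line copied from the first hit (AT4G02920.1).
line = "Identities = 64/230 (27%), Positives = 108/230 (46%), Gaps = 8/230 (3%)"

for name, (count, length, reported) in parse_stats(line).items():
    recomputed = floor_percent(count, length)
    status = "ok" if recomputed == reported else "mismatch"
    print(f"{name}: {count}/{length} reported {reported}%, recomputed {recomputed}% ({status})")
```

The same check applied to the second hit (63/230, 107/230, 7/230) also reproduces the reported 27%, 46%, and 3%.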