Miyakogusa Predicted Gene
- Lj4g3v2371630.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2371630.1 Non Chatacterized Hit- tr|D8S8C2|D8S8C2_SELML
Putative uncharacterized protein OS=Selaginella moelle,27.64,3e-17,
,CUFF.50842.1
(376 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G15640.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 446 e-125
AT3G21465.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 439 e-123
AT3G21465.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 338 4e-93
>AT4G15640.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G21465.1); Has 38 Blast hits to 38 proteins in
14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 38; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr4:8919245-8921852 REVERSE
LENGTH=390
Length = 390
Score = 446 bits (1146), Expect = e-125, Method: Compositional matrix adjust.
Identities = 218/393 (55%), Positives = 284/393 (72%), Gaps = 24/393 (6%)
Query: 1 MQVLSNARRVSRLLQSPISLSSHLPNSQPPIFSGLAQPN---HWQVKTGPLHFF------ 51
MQ L N+RR++ +L+ ++ ++L FS +Q + W+ +
Sbjct: 1 MQFLQNSRRIATVLKPIVTTETNLR-----FFSSASQISRSFQWRSQASSSSSVLKGIDS 55
Query: 52 ----------VSNASFTSGWEPLQATPTEAVKELYDKMLESVNTKRSMPPNAWLWQMIAN 101
+ +SFTS E L PT VK+L+DKML SVN KRSMPPNAWLW +I N
Sbjct: 56 SSHLSSMSLLIPRSSFTSEAEKLAGNPTVTVKDLHDKMLNSVNVKRSMPPNAWLWLLIEN 115
Query: 102 CKHQHDIGLLFEILQKLRTFRLSNLRIHDDFNCNLCREVAKACVHAGALDFGKKALWKHN 161
C++Q DI LLF++LQ LR FRLSNLRIHD+FNCNLC++VAK CV GA+D GKKALWKHN
Sbjct: 116 CQNQDDIHLLFDVLQNLRRFRLSNLRIHDNFNCNLCQQVAKTCVRVGAIDSGKKALWKHN 175
Query: 162 VYGLAPSVASAHHLLTYAKNHNDTKLLVEVMKLLKRNDVPLQPGTADIVFSICYNTDEWD 221
V+GL PSVASAHHL++YA H +++L+ EVM+LLK ND+PLQPGTAD+VF IC++TD+WD
Sbjct: 176 VHGLTPSVASAHHLMSYALEHKNSELMEEVMQLLKTNDLPLQPGTADLVFRICHDTDKWD 235
Query: 222 LINKYGKRFVLAGVKLRPTSIDTWMKFAAKRGDTESLWKIEKMRSDTKKPHTLATGFSCA 281
L+ KY K+F AGVKLR T+ D WM+FAAKRGDTESLWK++K RS+T HTL+T FSCA
Sbjct: 236 LLAKYSKKFSKAGVKLRKTTFDVWMEFAAKRGDTESLWKVDKQRSETYSQHTLSTAFSCA 295
Query: 282 KGLLLEHKPSEAAAKIQVLNQTLSDAKKSGIKDELQKLISEWPLEVIKHKKEEERKTLAA 341
KG LLE KP EAAA IQ++ Q D KKS I E +KL++EWP++VIKH+ +E++K LAA
Sbjct: 296 KGFLLESKPEEAAAVIQIICQAYPDEKKSAISTEFEKLVNEWPVDVIKHQTDEDKKALAA 355
Query: 342 SLKSDIRAMVSDLLNTGLEVNFSLEELNSKEGI 374
SLKS I +MV+ LL++GL+V+ L+ELN E +
Sbjct: 356 SLKSVIPSMVNTLLSSGLKVSVDLDELNKNEAL 388
>AT3G21465.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G15640.1); Has 38 Blast hits to 38 proteins in
14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 38; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:7560750-7563202 FORWARD
LENGTH=388
Length = 388
Score = 439 bits (1130), Expect = e-123, Method: Compositional matrix adjust.
Identities = 213/387 (55%), Positives = 284/387 (73%), Gaps = 14/387 (3%)
Query: 1 MQVLSNARRVSR------LLQSPISLSSHLPNS------QPPIFSG-LAQPNHWQVKTGP 47
MQ LS++RR++R L+++ + S+ P+ +F G + N ++
Sbjct: 1 MQYLSHSRRIARALKPLALVETDVRFVSNAPSHVVHRSLSHSVFPGKMDTCNQLYAQSPM 60
Query: 48 LHFFVSNASFTSGWEPLQATPTEAVKELYDKMLESVNTKRSMPPNAWLWQMIANCKHQHD 107
+ A+F+S + ++ PTEAVKEL+ K+L+SVN KRSMPPNAWLW +I NC+++ D
Sbjct: 61 TGLMMLRANFSSEAKHVE-NPTEAVKELHSKILDSVNVKRSMPPNAWLWSLIDNCRNEDD 119
Query: 108 IGLLFEILQKLRTFRLSNLRIHDDFNCNLCREVAKACVHAGALDFGKKALWKHNVYGLAP 167
I LF++LQ LR FRLSNLRIHD+FNCNLC++VAK CV GA++ GK+ALWKHNV+GL P
Sbjct: 120 ISFLFDVLQNLRRFRLSNLRIHDNFNCNLCQQVAKTCVRVGAINHGKRALWKHNVHGLTP 179
Query: 168 SVASAHHLLTYAKNHNDTKLLVEVMKLLKRNDVPLQPGTADIVFSICYNTDEWDLINKYG 227
SVASAHHLL+YA H D KL+ EVMKLLK N++PLQPGTAD+VF IC++TD WDL+ KY
Sbjct: 180 SVASAHHLLSYALKHKDAKLMDEVMKLLKMNNLPLQPGTADLVFRICHDTDNWDLLVKYS 239
Query: 228 KRFVLAGVKLRPTSIDTWMKFAAKRGDTESLWKIEKMRSDTKKPHTLATGFSCAKGLLLE 287
K+F AGVKLR T+ D WM+FAAKRGDTESLW ++K+RS+T HTL+ FSCAKG LLE
Sbjct: 240 KKFCKAGVKLRKTTFDVWMEFAAKRGDTESLWNVDKLRSETYTQHTLSGAFSCAKGFLLE 299
Query: 288 HKPSEAAAKIQVLNQTLSDAKKSGIKDELQKLISEWPLEVIKHKKEEERKTLAASLKSDI 347
HKP EAAA IQ++ Q D KKS ++ E +KL++EW +++IKH+ E+++K +AASLKSDI
Sbjct: 300 HKPEEAAAVIQIICQAYPDEKKSALEAEFKKLVNEWSVDIIKHQNEQDKKDVAASLKSDI 359
Query: 348 RAMVSDLLNTGLEVNFSLEELNSKEGI 374
AMV+ L+N+GL V L ELN E +
Sbjct: 360 PAMVNALVNSGLRVRVDLNELNKNEAL 386
>AT3G21465.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G15640.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr3:7560750-7562698 FORWARD LENGTH=312
Length = 312
Score = 338 bits (866), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 164/295 (55%), Positives = 216/295 (73%), Gaps = 14/295 (4%)
Query: 1 MQVLSNARRVSR------LLQSPISLSSHLPNS------QPPIFSG-LAQPNHWQVKTGP 47
MQ LS++RR++R L+++ + S+ P+ +F G + N ++
Sbjct: 1 MQYLSHSRRIARALKPLALVETDVRFVSNAPSHVVHRSLSHSVFPGKMDTCNQLYAQSPM 60
Query: 48 LHFFVSNASFTSGWEPLQATPTEAVKELYDKMLESVNTKRSMPPNAWLWQMIANCKHQHD 107
+ A+F+S + ++ PTEAVKEL+ K+L+SVN KRSMPPNAWLW +I NC+++ D
Sbjct: 61 TGLMMLRANFSSEAKHVE-NPTEAVKELHSKILDSVNVKRSMPPNAWLWSLIDNCRNEDD 119
Query: 108 IGLLFEILQKLRTFRLSNLRIHDDFNCNLCREVAKACVHAGALDFGKKALWKHNVYGLAP 167
I LF++LQ LR FRLSNLRIHD+FNCNLC++VAK CV GA++ GK+ALWKHNV+GL P
Sbjct: 120 ISFLFDVLQNLRRFRLSNLRIHDNFNCNLCQQVAKTCVRVGAINHGKRALWKHNVHGLTP 179
Query: 168 SVASAHHLLTYAKNHNDTKLLVEVMKLLKRNDVPLQPGTADIVFSICYNTDEWDLINKYG 227
SVASAHHLL+YA H D KL+ EVMKLLK N++PLQPGTAD+VF IC++TD WDL+ KY
Sbjct: 180 SVASAHHLLSYALKHKDAKLMDEVMKLLKMNNLPLQPGTADLVFRICHDTDNWDLLVKYS 239
Query: 228 KRFVLAGVKLRPTSIDTWMKFAAKRGDTESLWKIEKMRSDTKKPHTLATGFSCAK 282
K+F AGVKLR T+ D WM+FAAKRGDTESLW ++K+RS+T HTL+ FSCAK
Sbjct: 240 KKFCKAGVKLRKTTFDVWMEFAAKRGDTESLWNVDKLRSETYTQHTLSGAFSCAK 294