Miyakogusa Predicted Gene

Lj4g3v2371630.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v2371630.1 Non Chatacterized Hit- tr|D8S8C2|D8S8C2_SELML
Putative uncharacterized protein OS=Selaginella moelle,27.64,3e-17,
,CUFF.50842.1
         (376 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT4G15640.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   446   e-125
AT3G21465.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   439   e-123
AT3G21465.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   338   4e-93

>AT4G15640.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G21465.1); Has 38 Blast hits to 38 proteins in
           14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 38; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr4:8919245-8921852 REVERSE
           LENGTH=390
          Length = 390

 Score =  446 bits (1146), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 218/393 (55%), Positives = 284/393 (72%), Gaps = 24/393 (6%)

Query: 1   MQVLSNARRVSRLLQSPISLSSHLPNSQPPIFSGLAQPN---HWQVKTGPLHFF------ 51
           MQ L N+RR++ +L+  ++  ++L       FS  +Q +    W+ +             
Sbjct: 1   MQFLQNSRRIATVLKPIVTTETNLR-----FFSSASQISRSFQWRSQASSSSSVLKGIDS 55

Query: 52  ----------VSNASFTSGWEPLQATPTEAVKELYDKMLESVNTKRSMPPNAWLWQMIAN 101
                     +  +SFTS  E L   PT  VK+L+DKML SVN KRSMPPNAWLW +I N
Sbjct: 56  SSHLSSMSLLIPRSSFTSEAEKLAGNPTVTVKDLHDKMLNSVNVKRSMPPNAWLWLLIEN 115

Query: 102 CKHQHDIGLLFEILQKLRTFRLSNLRIHDDFNCNLCREVAKACVHAGALDFGKKALWKHN 161
           C++Q DI LLF++LQ LR FRLSNLRIHD+FNCNLC++VAK CV  GA+D GKKALWKHN
Sbjct: 116 CQNQDDIHLLFDVLQNLRRFRLSNLRIHDNFNCNLCQQVAKTCVRVGAIDSGKKALWKHN 175

Query: 162 VYGLAPSVASAHHLLTYAKNHNDTKLLVEVMKLLKRNDVPLQPGTADIVFSICYNTDEWD 221
           V+GL PSVASAHHL++YA  H +++L+ EVM+LLK ND+PLQPGTAD+VF IC++TD+WD
Sbjct: 176 VHGLTPSVASAHHLMSYALEHKNSELMEEVMQLLKTNDLPLQPGTADLVFRICHDTDKWD 235

Query: 222 LINKYGKRFVLAGVKLRPTSIDTWMKFAAKRGDTESLWKIEKMRSDTKKPHTLATGFSCA 281
           L+ KY K+F  AGVKLR T+ D WM+FAAKRGDTESLWK++K RS+T   HTL+T FSCA
Sbjct: 236 LLAKYSKKFSKAGVKLRKTTFDVWMEFAAKRGDTESLWKVDKQRSETYSQHTLSTAFSCA 295

Query: 282 KGLLLEHKPSEAAAKIQVLNQTLSDAKKSGIKDELQKLISEWPLEVIKHKKEEERKTLAA 341
           KG LLE KP EAAA IQ++ Q   D KKS I  E +KL++EWP++VIKH+ +E++K LAA
Sbjct: 296 KGFLLESKPEEAAAVIQIICQAYPDEKKSAISTEFEKLVNEWPVDVIKHQTDEDKKALAA 355

Query: 342 SLKSDIRAMVSDLLNTGLEVNFSLEELNSKEGI 374
           SLKS I +MV+ LL++GL+V+  L+ELN  E +
Sbjct: 356 SLKSVIPSMVNTLLSSGLKVSVDLDELNKNEAL 388


>AT3G21465.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G15640.1); Has 38 Blast hits to 38 proteins in
           14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 38; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr3:7560750-7563202 FORWARD
           LENGTH=388
          Length = 388

 Score =  439 bits (1130), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 213/387 (55%), Positives = 284/387 (73%), Gaps = 14/387 (3%)

Query: 1   MQVLSNARRVSR------LLQSPISLSSHLPNS------QPPIFSG-LAQPNHWQVKTGP 47
           MQ LS++RR++R      L+++ +   S+ P+          +F G +   N    ++  
Sbjct: 1   MQYLSHSRRIARALKPLALVETDVRFVSNAPSHVVHRSLSHSVFPGKMDTCNQLYAQSPM 60

Query: 48  LHFFVSNASFTSGWEPLQATPTEAVKELYDKMLESVNTKRSMPPNAWLWQMIANCKHQHD 107
               +  A+F+S  + ++  PTEAVKEL+ K+L+SVN KRSMPPNAWLW +I NC+++ D
Sbjct: 61  TGLMMLRANFSSEAKHVE-NPTEAVKELHSKILDSVNVKRSMPPNAWLWSLIDNCRNEDD 119

Query: 108 IGLLFEILQKLRTFRLSNLRIHDDFNCNLCREVAKACVHAGALDFGKKALWKHNVYGLAP 167
           I  LF++LQ LR FRLSNLRIHD+FNCNLC++VAK CV  GA++ GK+ALWKHNV+GL P
Sbjct: 120 ISFLFDVLQNLRRFRLSNLRIHDNFNCNLCQQVAKTCVRVGAINHGKRALWKHNVHGLTP 179

Query: 168 SVASAHHLLTYAKNHNDTKLLVEVMKLLKRNDVPLQPGTADIVFSICYNTDEWDLINKYG 227
           SVASAHHLL+YA  H D KL+ EVMKLLK N++PLQPGTAD+VF IC++TD WDL+ KY 
Sbjct: 180 SVASAHHLLSYALKHKDAKLMDEVMKLLKMNNLPLQPGTADLVFRICHDTDNWDLLVKYS 239

Query: 228 KRFVLAGVKLRPTSIDTWMKFAAKRGDTESLWKIEKMRSDTKKPHTLATGFSCAKGLLLE 287
           K+F  AGVKLR T+ D WM+FAAKRGDTESLW ++K+RS+T   HTL+  FSCAKG LLE
Sbjct: 240 KKFCKAGVKLRKTTFDVWMEFAAKRGDTESLWNVDKLRSETYTQHTLSGAFSCAKGFLLE 299

Query: 288 HKPSEAAAKIQVLNQTLSDAKKSGIKDELQKLISEWPLEVIKHKKEEERKTLAASLKSDI 347
           HKP EAAA IQ++ Q   D KKS ++ E +KL++EW +++IKH+ E+++K +AASLKSDI
Sbjct: 300 HKPEEAAAVIQIICQAYPDEKKSALEAEFKKLVNEWSVDIIKHQNEQDKKDVAASLKSDI 359

Query: 348 RAMVSDLLNTGLEVNFSLEELNSKEGI 374
            AMV+ L+N+GL V   L ELN  E +
Sbjct: 360 PAMVNALVNSGLRVRVDLNELNKNEAL 386


>AT3G21465.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G15640.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr3:7560750-7562698 FORWARD LENGTH=312
          Length = 312

 Score =  338 bits (866), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 164/295 (55%), Positives = 216/295 (73%), Gaps = 14/295 (4%)

Query: 1   MQVLSNARRVSR------LLQSPISLSSHLPNS------QPPIFSG-LAQPNHWQVKTGP 47
           MQ LS++RR++R      L+++ +   S+ P+          +F G +   N    ++  
Sbjct: 1   MQYLSHSRRIARALKPLALVETDVRFVSNAPSHVVHRSLSHSVFPGKMDTCNQLYAQSPM 60

Query: 48  LHFFVSNASFTSGWEPLQATPTEAVKELYDKMLESVNTKRSMPPNAWLWQMIANCKHQHD 107
               +  A+F+S  + ++  PTEAVKEL+ K+L+SVN KRSMPPNAWLW +I NC+++ D
Sbjct: 61  TGLMMLRANFSSEAKHVE-NPTEAVKELHSKILDSVNVKRSMPPNAWLWSLIDNCRNEDD 119

Query: 108 IGLLFEILQKLRTFRLSNLRIHDDFNCNLCREVAKACVHAGALDFGKKALWKHNVYGLAP 167
           I  LF++LQ LR FRLSNLRIHD+FNCNLC++VAK CV  GA++ GK+ALWKHNV+GL P
Sbjct: 120 ISFLFDVLQNLRRFRLSNLRIHDNFNCNLCQQVAKTCVRVGAINHGKRALWKHNVHGLTP 179

Query: 168 SVASAHHLLTYAKNHNDTKLLVEVMKLLKRNDVPLQPGTADIVFSICYNTDEWDLINKYG 227
           SVASAHHLL+YA  H D KL+ EVMKLLK N++PLQPGTAD+VF IC++TD WDL+ KY 
Sbjct: 180 SVASAHHLLSYALKHKDAKLMDEVMKLLKMNNLPLQPGTADLVFRICHDTDNWDLLVKYS 239

Query: 228 KRFVLAGVKLRPTSIDTWMKFAAKRGDTESLWKIEKMRSDTKKPHTLATGFSCAK 282
           K+F  AGVKLR T+ D WM+FAAKRGDTESLW ++K+RS+T   HTL+  FSCAK
Sbjct: 240 KKFCKAGVKLRKTTFDVWMEFAAKRGDTESLWNVDKLRSETYTQHTLSGAFSCAK 294