Miyakogusa Predicted Gene

Lj6g3v0433920.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v0433920.1 tr|D7MH52|D7MH52_ARALL Predicted protein
OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_658867
PE,43.44,6e-19, ,CUFF.57825.1
         (150 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G70780.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   135   8e-33
AT1G23150.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   119   7e-28
AT5G37730.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   113   6e-26
AT2G01554.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    61   3e-10
AT2G27830.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    53   8e-08
AT4G22758.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...    49   2e-06

>AT1G70780.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: mitochondrion;
           EXPRESSED IN: sperm cell, male gametophyte, pollen tube;
           EXPRESSED DURING: L mature pollen stage, M germinated
           pollen stage; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G23150.1); Has 143 Blast
           hits to 143 proteins in 17 species: Archae - 0; Bacteria
           - 0; Metazoa - 0; Fungi - 0; Plants - 143; Viruses - 0;
           Other Eukaryotes - 0 (source: NCBI BLink). |
           chr1:26695462-26695975 REVERSE LENGTH=140
          Length = 140

 Score =  135 bits (341), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 70/131 (53%), Positives = 88/131 (67%), Gaps = 3/131 (2%)

Query: 23  DQKAKTNRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLGFDASNFL 82
           +Q AK NR LI++ +LGSAGPIRFV  E+ LV+ VIDTALK YAREGRLP+LG D ++FL
Sbjct: 10  NQNAKGNRILISVTVLGSAGPIRFVAYEDDLVASVIDTALKGYAREGRLPLLGSDFNDFL 69

Query: 83  LYRANAGFDALNPLEPIGSYGERNFVLCKKPV---YHPSKTEPQSELLSQKSSGGWKAWL 139
           LY    G +AL+  + IGS G RNF+LC+KP       S     S +   +  G  KAW+
Sbjct: 70  LYCPMVGPEALSTWDAIGSLGARNFMLCRKPEEKKVEESNGRSDSTINGARKGGSLKAWI 129

Query: 140 NKSFGLKILSH 150
           NKSF LK+ SH
Sbjct: 130 NKSFNLKVSSH 140


>AT1G23150.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G70780.1); Has 124 Blast hits to 124 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 124; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr1:8206948-8207461 FORWARD
           LENGTH=141
          Length = 141

 Score =  119 bits (298), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 69/134 (51%), Positives = 87/134 (64%), Gaps = 6/134 (4%)

Query: 23  DQKAKTNRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLGFDASNFL 82
           +Q  K NR LI++  LGSAGPIRFV NE  LV+ VIDTALK YAREGRLP+LG D ++F+
Sbjct: 8   NQIVKGNRILISVTFLGSAGPIRFVANEGDLVASVIDTALKCYAREGRLPILGSDFNDFV 67

Query: 83  LYRANAGFDALNPLEPIGSYGERNFVLCKKPVYHPSKTEPQSEL-----LSQKSSGG-WK 136
            Y    G  AL+P E IGS G RNF+LCKK        E +         ++K +GG +K
Sbjct: 68  FYCPMVGPGALSPWEAIGSVGVRNFMLCKKKPEEKKVEEDKGRSNFPINGARKGAGGSFK 127

Query: 137 AWLNKSFGLKILSH 150
           AW+NKS  LK+ +H
Sbjct: 128 AWINKSLRLKVTTH 141


>AT5G37730.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G23150.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:14986141-14986884
           REVERSE LENGTH=182
          Length = 182

 Score =  113 bits (282), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 57/123 (46%), Positives = 75/123 (60%), Gaps = 1/123 (0%)

Query: 15  NRGSVYSDDQKAKTNRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVL 74
           NR        K K  + L+++N+LGS GPIRF+ NE+  VS  I+T LK+YAR+GR+PVL
Sbjct: 2   NRNENVKGVMKRKNKKLLVSVNVLGSVGPIRFLANEDDEVSSAINTTLKAYARQGRIPVL 61

Query: 75  GFDASNFLLYRANAGFDALNPLEPIGSYGERNFVLCKKPVYHPSKTEPQSELLSQKSSGG 134
           GFD  NF+ Y  NAGF+ L+P E IGS    NF++CKK      K E   E  + +   G
Sbjct: 62  GFDVDNFIFYSINAGFNTLHPQEKIGSMDVTNFLMCKKEPRPLEKVEGIRESRA-RIGHG 120

Query: 135 WKA 137
           WK 
Sbjct: 121 WKT 123


>AT2G01554.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT1G70780.1);
           Has 30201 Blast hits to 17322 proteins in 780 species:
           Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi -
           3422; Plants - 5037; Viruses - 0; Other Eukaryotes -
           2996 (source: NCBI BLink). | chr2:249687-250181 FORWARD
           LENGTH=105
          Length = 105

 Score = 60.8 bits (146), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 64/134 (47%), Gaps = 38/134 (28%)

Query: 20  YSDDQKAKT-NRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLGFDA 78
           +  +QK+KT NRFL++IN+LGSAG                      +    ++P      
Sbjct: 7   HQRNQKSKTTNRFLVSINVLGSAG----------------------FHFSDQIPAF---- 40

Query: 79  SNFLLYRANAGFDALN-PLE-PIGSYGERNFVLCKKPVYHPSKTEPQSELLSQKSSGGWK 136
                    + F AL  PL+  IGS G RNFVL  K      +    +   ++K+SG WK
Sbjct: 41  ---------SDFIALTVPLKGKIGSTGSRNFVLSNKLETQNLEDSMMTTTTTRKTSGRWK 91

Query: 137 AWLNKSFGLKILSH 150
           AWLNKSFGL + SH
Sbjct: 92  AWLNKSFGLMVPSH 105


>AT2G27830.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G22758.1); Has 131 Blast hits to 131 proteins
           in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 0; Plants - 131; Viruses - 0; Other Eukaryotes -
           0 (source: NCBI BLink). | chr2:11860734-11861306 FORWARD
           LENGTH=190
          Length = 190

 Score = 52.8 bits (125), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 1/87 (1%)

Query: 27  KTNRFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLG-FDASNFLLYR 85
           +  + L+ + + GS G ++ +++ E  VS +ID A++ Y +E R P L   + S F L+ 
Sbjct: 68  RLTKLLLNVTVQGSLGAVQIIISPESTVSDLIDAAVRQYVKEARRPFLPESEPSRFDLHY 127

Query: 86  ANAGFDALNPLEPIGSYGERNFVLCKK 112
           +    +++   E + S G RNF LC +
Sbjct: 128 SQFSLESIVRDEKLISLGSRNFFLCGR 154


>AT4G22758.1 | Symbols:  | unknown protein; LOCATED IN: chloroplast;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT2G27830.1). | chr4:11958477-11959904
           FORWARD LENGTH=255
          Length = 255

 Score = 48.5 bits (114), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 1/83 (1%)

Query: 30  RFLITINILGSAGPIRFVVNEEKLVSGVIDTALKSYAREGRLPVLGFDASNFLLYRANAG 89
           + +I++ + GS GP+R +V     V   I   +  Y +EGR P L  D++ F L++++  
Sbjct: 123 KVIISVAVEGSPGPVRAMVKLSCNVEETIKIVVDKYCKEGRTPKLDRDSA-FELHQSHFS 181

Query: 90  FDALNPLEPIGSYGERNFVLCKK 112
              L   E IG  G R+F + KK
Sbjct: 182 IQCLEKREIIGELGSRSFYMRKK 204