Miyakogusa Predicted Gene

Lj2g3v1902120.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj2g3v1902120.1 tr|A9TF35|A9TF35_PHYPA Predicted protein
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_169716,79.37,5e-18,seg,NULL;
coiled-coil,NULL,CUFF.38125.1
         (190 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G52550.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   134   5e-32
AT4G25690.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   124   5e-29
AT4G25690.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   124   5e-29
AT4G25670.2 | Symbols:  | unknown protein; LOCATED IN: cellular_...   116   9e-27
AT4G25670.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   116   9e-27

>AT5G52550.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G25670.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:21327914-21328996 REVERSE LENGTH=360
          Length = 360

 Score =  134 bits (336), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 80/178 (44%), Positives = 103/178 (57%), Gaps = 20/178 (11%)

Query: 22  DELDRLKQAEKKKRRLEKALSTSAAIISELGXXXXXXXXXXXRLDEEGAAIAEAVALHVL 81
           +EL+R+KQAE+KKRR+EK+++TSAAI +EL            RLDEEGAAIAEAVALHVL
Sbjct: 188 EELERIKQAERKKRRIEKSIATSAAIRAELEKKKLRKLEEQRRLDEEGAAIAEAVALHVL 247

Query: 82  LGEDSDESCNVVIN-EGGCNSWNYNHGLDFFMGGKRACFPHLDGGTWSV----TAENGNW 136
           LGED D+S    +N E G   W+Y   ++ F GG    FPH    +++V       + NW
Sbjct: 248 LGEDCDDSYRNTLNQETGFKPWDYTTKINLFSGGINRFFPHQRCSSYAVHDNNRTRDSNW 307

Query: 137 SFSSGPFEFEKNVHEPLYEDAGW----GCTGLSADLIAAQAARSLHIAEDTDEDRILF 190
           S  S         +EP     GW       G+SADL   QA  SL I+E+TD D I+F
Sbjct: 308 SSVS---------YEPFAR--GWDNNNNNMGISADLFDTQAVSSLQISENTDVDAIVF 354



 Score = 57.4 bits (137), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 31/53 (58%), Positives = 38/53 (71%)

Query: 22  DELDRLKQAEKKKRRLEKALSTSAAIISELGXXXXXXXXXXXRLDEEGAAIAE 74
           DEL+R+KQAE KK RLEK+++TSAAI++EL            RL EEGAAIAE
Sbjct: 85  DELERIKQAENKKNRLEKSIATSAAIMAELEKKKLRKLEEQKRLAEEGAAIAE 137


>AT4G25690.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G25670.2); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr4:13090421-13090996 REVERSE LENGTH=191
          Length = 191

 Score =  124 bits (310), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 82/179 (45%), Positives = 96/179 (53%), Gaps = 26/179 (14%)

Query: 22  DELDRLKQAEKKKRRLEKALSTSAAIISELGXXXXXXXXXXXRLDEEGAAIAEAVALHVL 81
           DE DR+KQAEKKKRRLEKAL+TSAAI +EL            RLDEEGAAIAEAVALHVL
Sbjct: 23  DEFDRIKQAEKKKRRLEKALATSAAIRAELEKKKQKRLEEQQRLDEEGAAIAEAVALHVL 82

Query: 82  LGEDSDESCNVVINEGGCNSWNYNHGLDFFMGGKRACFPHLDGGTWSVT----AENG--- 134
           LGEDSD+S  V   E          G+D F   +    P     +++V       NG   
Sbjct: 83  LGEDSDDSSRVKFGE------ETGFGMDLFRDERTNYVPRQSCASYAVQGIGFVSNGYGL 136

Query: 135 ---NWSFSSGPFEFEKNVHEPLYEDAGWGCTGLSADLIAAQAARSLHIAEDTDEDRILF 190
              NWS S  PF           +D       +SADLIAAQA  SL I+ED D +  +F
Sbjct: 137 GDSNWSVSYKPF----------MKDVWDNNMVISADLIAAQAVSSLQISEDADRNAYVF 185


>AT4G25690.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G25670.2); Has 73 Blast hits to 60 proteins in
           13 species: Archae - 0; Bacteria - 2; Metazoa - 4; Fungi
           - 0; Plants - 67; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr4:13090421-13090996 REVERSE
           LENGTH=191
          Length = 191

 Score =  124 bits (310), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 82/179 (45%), Positives = 96/179 (53%), Gaps = 26/179 (14%)

Query: 22  DELDRLKQAEKKKRRLEKALSTSAAIISELGXXXXXXXXXXXRLDEEGAAIAEAVALHVL 81
           DE DR+KQAEKKKRRLEKAL+TSAAI +EL            RLDEEGAAIAEAVALHVL
Sbjct: 23  DEFDRIKQAEKKKRRLEKALATSAAIRAELEKKKQKRLEEQQRLDEEGAAIAEAVALHVL 82

Query: 82  LGEDSDESCNVVINEGGCNSWNYNHGLDFFMGGKRACFPHLDGGTWSVT----AENG--- 134
           LGEDSD+S  V   E          G+D F   +    P     +++V       NG   
Sbjct: 83  LGEDSDDSSRVKFGE------ETGFGMDLFRDERTNYVPRQSCASYAVQGIGFVSNGYGL 136

Query: 135 ---NWSFSSGPFEFEKNVHEPLYEDAGWGCTGLSADLIAAQAARSLHIAEDTDEDRILF 190
              NWS S  PF           +D       +SADLIAAQA  SL I+ED D +  +F
Sbjct: 137 GDSNWSVSYKPF----------MKDVWDNNMVISADLIAAQAVSSLQISEDADRNAYVF 185


>AT4G25670.2 | Symbols:  | unknown protein; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: male
           gametophyte, pollen tube; EXPRESSED DURING: M germinated
           pollen stage; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT4G25690.2). |
           chr4:13085431-13085997 REVERSE LENGTH=188
          Length = 188

 Score =  116 bits (291), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 75/170 (44%), Positives = 92/170 (54%), Gaps = 11/170 (6%)

Query: 22  DELDRLKQAEKKKRRLEKALSTSAAIISELGXXXXXXXXXXXRLDEEGAAIAEAVALHVL 81
           DE DR+KQAEKKKRRLEKAL+TSAAI +EL            RLDEEGAAIAEAVALHVL
Sbjct: 23  DEFDRIKQAEKKKRRLEKALATSAAIRAELEKKKQKRLEEQQRLDEEGAAIAEAVALHVL 82

Query: 82  LGEDSDESCNVVINEGGCNSWNYNHGLDFFMGGKRACFPHLDGGTWSVTAENGNWSFSSG 141
           LGEDSD+S  V   E           +D F   +    P     +++V        F S 
Sbjct: 83  LGEDSDDSSRVKFGE------EKGFTMDLFRDERTNYVPRQSCASYAVQG----IGFVSN 132

Query: 142 PFEFEKNVHEPLYEDAGW-GCTGLSADLIAAQAARSLHIAEDTDEDRILF 190
            +    +   P      W    G+SADLIAAQA  +L I+E+ D +  +F
Sbjct: 133 GYGLGDSNWSPFTRRGAWDNNMGISADLIAAQAVSALQISENADGNAFVF 182


>AT4G25670.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G25690.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr4:13085431-13085997 REVERSE LENGTH=188
          Length = 188

 Score =  116 bits (291), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 75/170 (44%), Positives = 92/170 (54%), Gaps = 11/170 (6%)

Query: 22  DELDRLKQAEKKKRRLEKALSTSAAIISELGXXXXXXXXXXXRLDEEGAAIAEAVALHVL 81
           DE DR+KQAEKKKRRLEKAL+TSAAI +EL            RLDEEGAAIAEAVALHVL
Sbjct: 23  DEFDRIKQAEKKKRRLEKALATSAAIRAELEKKKQKRLEEQQRLDEEGAAIAEAVALHVL 82

Query: 82  LGEDSDESCNVVINEGGCNSWNYNHGLDFFMGGKRACFPHLDGGTWSVTAENGNWSFSSG 141
           LGEDSD+S  V   E           +D F   +    P     +++V        F S 
Sbjct: 83  LGEDSDDSSRVKFGE------EKGFTMDLFRDERTNYVPRQSCASYAVQG----IGFVSN 132

Query: 142 PFEFEKNVHEPLYEDAGW-GCTGLSADLIAAQAARSLHIAEDTDEDRILF 190
            +    +   P      W    G+SADLIAAQA  +L I+E+ D +  +F
Sbjct: 133 GYGLGDSNWSPFTRRGAWDNNMGISADLIAAQAVSALQISENADGNAFVF 182