Miyakogusa Predicted Gene

Lj1g3v0415000.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v0415000.1 Non Chatacterized Hit- tr|D7MMX4|D7MMX4_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,47.29,5e-19,seg,NULL; NHL REPEAT-CONTAINING PROTEIN,NULL;
FAMILY NOT NAMED,NULL,CUFF.25710.1
         (151 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G14890.1 | Symbols:  | NHL domain-containing protein | chr5:4...    74   4e-14
AT3G01430.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...    71   3e-13
AT5G62865.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    70   5e-13
AT3G48020.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    67   6e-12

>AT5G14890.1 | Symbols:  | NHL domain-containing protein |
           chr5:4818056-4821534 FORWARD LENGTH=754
          Length = 754

 Score = 73.6 bits (179), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 68/138 (49%), Gaps = 29/138 (21%)

Query: 17  LCCFGSRRST----TSWWERVRATXXXXXXXXXXXHPTTWSAGDRWWSRGFMRAREWSEI 72
           L C GS + +    + WW+R+R                     +RWW  G+M+ REWSEI
Sbjct: 614 LPCLGSSQPSGPNGSVWWQRIRTV-------------DKLEPDERWWVSGWMKMREWSEI 660

Query: 73  VAGPRWKTFIXXXXXXXXXXXXXHAG-------KYQYDPLSYALNFDEGPGQNGDFEDDV 125
           VAGP+WKTFI               G        ++YD  SY+LNFD+G  Q G FED+ 
Sbjct: 661 VAGPKWKTFIRRFGRNHCCNGGIDGGCNRPEHVSFRYDSWSYSLNFDDG-KQTGHFEDEF 719

Query: 126 VSDGFRNFSARYAMAPPL 143
               +R++S R+A AP L
Sbjct: 720 ---PYRDYSMRFA-APSL 733


>AT3G01430.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: NHL domain-containing protein (TAIR:AT5G14890.1);
           Has 98 Blast hits to 98 proteins in 12 species: Archae -
           0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 98;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr3:165595-166137 REVERSE LENGTH=180
          Length = 180

 Score = 70.9 bits (172), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 77/174 (44%), Gaps = 49/174 (28%)

Query: 1   MEELDSTGNWQDT--SNSLCCF---------GSRRSTTSWWERVRATXXXXXXXXXXXHP 49
           + E+D+T +  +   +   CCF          S R  + WW+R+                
Sbjct: 8   IAEVDATDDMHEALFAKRGCCFLMPCLASSQPSTRGGSVWWQRITTVDKL---------- 57

Query: 50  TTWSAGDRWWSRGFMRAREWSEIVAGPRWKTFI--------------------XXXXXXX 89
                 +RWW RG+ R REWSE+VAGPRWKT+I                           
Sbjct: 58  ---EPDERWWIRGWRRMREWSELVAGPRWKTYIRRFGRSNCCGGGGGRVGNSSGGCGGGA 114

Query: 90  XXXXXXHAGKYQYDPLSYALNFDEGPGQNGDFEDDVVSDGFRNFSARYAMAPPL 143
                   GK++YD LSY+LNFD+G  Q G F+D+     +R++S R+A AP L
Sbjct: 115 MPNRSSDQGKFRYDQLSYSLNFDDG-NQTGHFDDEF---PYRDYSMRFA-APSL 163


>AT5G62865.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G48020.1). | chr5:25234064-25234567 FORWARD
           LENGTH=167
          Length = 167

 Score = 70.5 bits (171), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 64/132 (48%), Gaps = 19/132 (14%)

Query: 18  CCFGS-RRSTTSW------WERVRATXXXXXXXXXXXHPTTWSAGDRWWSRGFMRAREWS 70
           CCF S RRS +S       W R+R              P       RWW R  ++ REWS
Sbjct: 21  CCFPSFRRSRSSTAVGYSSWGRIRTVDDSNHSGDHGDEP-------RWWIRASLKIREWS 73

Query: 71  EIVAGPRWKTFIXXXXXXXXXXXXXHAG-KYQYDPLSYALNFDEGPGQNGDFEDDVVSDG 129
           EIVAGPRWKTFI              A  K+QYDPLSY+LNF      + + ++ V   G
Sbjct: 74  EIVAGPRWKTFIRRFNRDPRRGRDWDASEKFQYDPLSYSLNF----DDDDEEDEYVGLGG 129

Query: 130 FRNFSARYAMAP 141
            R+FS R+A  P
Sbjct: 130 LRSFSTRFASVP 141


>AT3G48020.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 11 plant structures; EXPRESSED DURING:
           LP.04 four leaves visible, 4 anthesis; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G62865.1); Has 82 Blast hits to 82 proteins in
           12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 82; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr3:17724593-17725000 FORWARD
           LENGTH=135
          Length = 135

 Score = 66.6 bits (161), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 21/127 (16%)

Query: 16  SLCCFGSRRSTTSWWERVRATXXXXXXXXXXXHPTTWSAGDRWWSRGFMRAREWSEIVAG 75
           S CC  + +S  SWW+R+                       RWW R F++ REWSEIVAG
Sbjct: 13  SSCCSTTVKS--SWWQRIHRNNHQE---------------PRWWVRAFLKIREWSEIVAG 55

Query: 76  PRWKTFIXXXXXXXXXXXX-XHAGKYQYDPLSYALNFDEGPGQNGDFEDDVVSDGFRNFS 134
           PRWKTFI               + K++YDP+SY L+F++    + D        G R+FS
Sbjct: 56  PRWKTFIRRFNRDPRRGQDWDDSDKFRYDPVSYTLSFEDEDKDDDDEAG---VGGVRSFS 112

Query: 135 ARYAMAP 141
            RYA  P
Sbjct: 113 MRYASVP 119