Miyakogusa Predicted Gene
- Lj1g3v0415000.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0415000.1 Non Chatacterized Hit- tr|D7MMX4|D7MMX4_ARALL
Putative uncharacterized protein OS=Arabidopsis
lyrata,47.29,5e-19,seg,NULL; NHL REPEAT-CONTAINING PROTEIN,NULL;
FAMILY NOT NAMED,NULL,CUFF.25710.1
(151 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G14890.1 | Symbols: | NHL domain-containing protein | chr5:4... 74 4e-14
AT3G01430.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 71 3e-13
AT5G62865.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 70 5e-13
AT3G48020.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 67 6e-12
>AT5G14890.1 | Symbols: | NHL domain-containing protein |
chr5:4818056-4821534 FORWARD LENGTH=754
Length = 754
Score = 73.6 bits (179), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 68/138 (49%), Gaps = 29/138 (21%)
Query: 17 LCCFGSRRST----TSWWERVRATXXXXXXXXXXXHPTTWSAGDRWWSRGFMRAREWSEI 72
L C GS + + + WW+R+R +RWW G+M+ REWSEI
Sbjct: 614 LPCLGSSQPSGPNGSVWWQRIRTV-------------DKLEPDERWWVSGWMKMREWSEI 660
Query: 73 VAGPRWKTFIXXXXXXXXXXXXXHAG-------KYQYDPLSYALNFDEGPGQNGDFEDDV 125
VAGP+WKTFI G ++YD SY+LNFD+G Q G FED+
Sbjct: 661 VAGPKWKTFIRRFGRNHCCNGGIDGGCNRPEHVSFRYDSWSYSLNFDDG-KQTGHFEDEF 719
Query: 126 VSDGFRNFSARYAMAPPL 143
+R++S R+A AP L
Sbjct: 720 ---PYRDYSMRFA-APSL 733
>AT3G01430.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: NHL domain-containing protein (TAIR:AT5G14890.1);
Has 98 Blast hits to 98 proteins in 12 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 98;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr3:165595-166137 REVERSE LENGTH=180
Length = 180
Score = 70.9 bits (172), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 77/174 (44%), Gaps = 49/174 (28%)
Query: 1 MEELDSTGNWQDT--SNSLCCF---------GSRRSTTSWWERVRATXXXXXXXXXXXHP 49
+ E+D+T + + + CCF S R + WW+R+
Sbjct: 8 IAEVDATDDMHEALFAKRGCCFLMPCLASSQPSTRGGSVWWQRITTVDKL---------- 57
Query: 50 TTWSAGDRWWSRGFMRAREWSEIVAGPRWKTFI--------------------XXXXXXX 89
+RWW RG+ R REWSE+VAGPRWKT+I
Sbjct: 58 ---EPDERWWIRGWRRMREWSELVAGPRWKTYIRRFGRSNCCGGGGGRVGNSSGGCGGGA 114
Query: 90 XXXXXXHAGKYQYDPLSYALNFDEGPGQNGDFEDDVVSDGFRNFSARYAMAPPL 143
GK++YD LSY+LNFD+G Q G F+D+ +R++S R+A AP L
Sbjct: 115 MPNRSSDQGKFRYDQLSYSLNFDDG-NQTGHFDDEF---PYRDYSMRFA-APSL 163
>AT5G62865.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G48020.1). | chr5:25234064-25234567 FORWARD
LENGTH=167
Length = 167
Score = 70.5 bits (171), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/132 (39%), Positives = 64/132 (48%), Gaps = 19/132 (14%)
Query: 18 CCFGS-RRSTTSW------WERVRATXXXXXXXXXXXHPTTWSAGDRWWSRGFMRAREWS 70
CCF S RRS +S W R+R P RWW R ++ REWS
Sbjct: 21 CCFPSFRRSRSSTAVGYSSWGRIRTVDDSNHSGDHGDEP-------RWWIRASLKIREWS 73
Query: 71 EIVAGPRWKTFIXXXXXXXXXXXXXHAG-KYQYDPLSYALNFDEGPGQNGDFEDDVVSDG 129
EIVAGPRWKTFI A K+QYDPLSY+LNF + + ++ V G
Sbjct: 74 EIVAGPRWKTFIRRFNRDPRRGRDWDASEKFQYDPLSYSLNF----DDDDEEDEYVGLGG 129
Query: 130 FRNFSARYAMAP 141
R+FS R+A P
Sbjct: 130 LRSFSTRFASVP 141
>AT3G48020.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 11 plant structures; EXPRESSED DURING:
LP.04 four leaves visible, 4 anthesis; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G62865.1); Has 82 Blast hits to 82 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 82; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:17724593-17725000 FORWARD
LENGTH=135
Length = 135
Score = 66.6 bits (161), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 61/127 (48%), Gaps = 21/127 (16%)
Query: 16 SLCCFGSRRSTTSWWERVRATXXXXXXXXXXXHPTTWSAGDRWWSRGFMRAREWSEIVAG 75
S CC + +S SWW+R+ RWW R F++ REWSEIVAG
Sbjct: 13 SSCCSTTVKS--SWWQRIHRNNHQE---------------PRWWVRAFLKIREWSEIVAG 55
Query: 76 PRWKTFIXXXXXXXXXXXX-XHAGKYQYDPLSYALNFDEGPGQNGDFEDDVVSDGFRNFS 134
PRWKTFI + K++YDP+SY L+F++ + D G R+FS
Sbjct: 56 PRWKTFIRRFNRDPRRGQDWDDSDKFRYDPVSYTLSFEDEDKDDDDEAG---VGGVRSFS 112
Query: 135 ARYAMAP 141
RYA P
Sbjct: 113 MRYASVP 119