Miyakogusa Predicted Gene
- Lj4g3v1535080.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v1535080.1 Non Chatacterized Hit- tr|K4C8K9|K4C8K9_SOLLC
Uncharacterized protein OS=Solanum lycopersicum GN=Sol,48.12,3e-19,NHL
REPEAT-CONTAINING PROTEIN,NULL; FAMILY NOT NAMED,NULL,CUFF.49357.1
(144 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                       (bits) Value

AT5G62865.1 | Symbols: | unknown protein; BEST Arabidopsis thal...    85   2e-17
AT5G14890.1 | Symbols: | NHL domain-containing protein | chr5:4...    84   4e-17
AT3G01430.1 | Symbols: | BEST Arabidopsis thaliana protein matc...    78   2e-15
AT3G48020.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    77   3e-15
AT5G25240.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    54   4e-08
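
Note: the summary lines above end in a bit score and an E-value, so they can be read back with a small regular expression. The following is a minimal sketch, not part of the BLAST output; the names `HIT_LINE` and `parse_hit_line` are hypothetical, and the pattern assumes each line starts with the subject identifier followed by `|`.

```python
import re

# Hypothetical pattern: subject ID, any description, then bit score and E-value
# as the last two whitespace-separated fields.
HIT_LINE = re.compile(r"^(\S+)\s+\|.*\s(\d+(?:\.\d+)?)\s+(\S+)$")


def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value), or None if the line does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    evalue_text = m.group(3)
    # Assumption: some legacy reports print very small E-values as "e-100";
    # prepend "1" so float() can read them.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    return m.group(1), float(m.group(2)), float(evalue_text)


if __name__ == "__main__":
    line = "AT5G14890.1 | Symbols: | NHL domain-containing protein | chr5:4... 84 4e-17"
    print(parse_hit_line(line))
```

The pattern uses `\s+` between fields, so it should accept the lines whether or not the original column padding has been preserved.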
>AT5G62865.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G48020.1). | chr5:25234064-25234567 FORWARD
LENGTH=167
Length = 167
Score = 84.7 bits (208), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 78/151 (51%), Gaps = 23/151 (15%)
Query: 3 IELESESTTPTGHQRSNAFCFCFGPRRRS--------SWWQRVRT----SHSTVSGD--R 48
+EL T R + C CF RRS S W R+RT +HS GD R
Sbjct: 1 MELSQSDPTRDPDTRYDQRCCCFPSFRRSRSSTAVGYSSWGRIRTVDDSNHSGDHGDEPR 60
Query: 49 WWSRGIRALKKVREWSEILAGPRWKTFIRRLSHH----RSHKRMTKYQYDPFSYALNFDE 104
WW IRA K+REWSEI+AGPRWKTFIRR + R K+QYDP SY+LNFD+
Sbjct: 61 WW---IRASLKIREWSEIVAGPRWKTFIRRFNRDPRRGRDWDASEKFQYDPLSYSLNFDD 117
Query: 105 GQNGD--FPDDGFRNFSTRYAVAAVKPVSSP 133
D G R+FSTR+A V +P
Sbjct: 118 DDEEDEYVGLGGLRSFSTRFASVPVYSGKAP 148
>AT5G14890.1 | Symbols: | NHL domain-containing protein |
chr5:4818056-4821534 FORWARD LENGTH=754
Length = 754
Score = 83.6 bits (205), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 52/131 (39%), Positives = 73/131 (55%), Gaps = 24/131 (18%)
Query: 22 CF---CFGPRRRSS-----WWQRVRTSHSTVSGDRWWSRGIRALKKVREWSEILAGPRWK 73
CF C G + S WWQR+RT +RWW G K+REWSEI+AGP+WK
Sbjct: 611 CFILPCLGSSQPSGPNGSVWWQRIRTVDKLEPDERWWVSG---WMKMREWSEIVAGPKWK 667
Query: 74 TFIRRLSHHR----------SHKRMTKYQYDPFSYALNFDEG-QNGDFPDD-GFRNFSTR 121
TFIRR + + ++YD +SY+LNFD+G Q G F D+ +R++S R
Sbjct: 668 TFIRRFGRNHCCNGGIDGGCNRPEHVSFRYDSWSYSLNFDDGKQTGHFEDEFPYRDYSMR 727
Query: 122 YAVAAVKPVSS 132
+A ++ PVS+
Sbjct: 728 FAAPSL-PVST 737
>AT3G01430.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: NHL domain-containing protein (TAIR:AT5G14890.1);
Has 98 Blast hits to 98 proteins in 12 species: Archae -
0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 98;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr3:165595-166137 REVERSE LENGTH=180
Length = 180
Score = 78.2 bits (191), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/128 (39%), Positives = 71/128 (55%), Gaps = 31/128 (24%)
Query: 31 SSWWQRVRTSHSTVSGDRWWSRGIRALKKVREWSEILAGPRWKTFIRRLSH--------- 81
S WWQR+ T +RWW RG R ++REWSE++AGPRWKT+IRR
Sbjct: 45 SVWWQRITTVDKLEPDERWWIRGWR---RMREWSELVAGPRWKTYIRRFGRSNCCGGGGG 101
Query: 82 ---------------HRSHKRMTKYQYDPFSYALNFDEG-QNGDFPDD-GFRNFSTRYAV 124
+RS + K++YD SY+LNFD+G Q G F D+ +R++S R+A
Sbjct: 102 RVGNSSGGCGGGAMPNRSSDQ-GKFRYDQLSYSLNFDDGNQTGHFDDEFPYRDYSMRFAA 160
Query: 125 AAVKPVSS 132
++ PVS+
Sbjct: 161 PSL-PVST 167
>AT3G48020.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 11 plant structures; EXPRESSED DURING:
LP.04 four leaves visible, 4 anthesis; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G62865.1); Has 82 Blast hits to 82 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 82; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:17724593-17725000 FORWARD
LENGTH=135
Length = 135
Score = 77.4 bits (189), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 69/126 (54%), Gaps = 14/126 (11%)
Query: 24 CFGPRRRSSWWQRV-RTSHSTVSGDRWWSRGIRALKKVREWSEILAGPRWKTFIRRLSHH 82
C +SSWWQR+ R +H RWW +RA K+REWSEI+AGPRWKTFIRR +
Sbjct: 15 CCSTTVKSSWWQRIHRNNHQE---PRWW---VRAFLKIREWSEIVAGPRWKTFIRRFNRD 68
Query: 83 ----RSHKRMTKYQYDPFSYALNFDEGQNGDFPD---DGFRNFSTRYAVAAVKPVSSPEK 135
+ K++YDP SY L+F++ D + G R+FS RYA V SP
Sbjct: 69 PRRGQDWDDSDKFRYDPVSYTLSFEDEDKDDDDEAGVGGVRSFSMRYASVPVASGKSPAV 128
Query: 136 GSDVAV 141
S AV
Sbjct: 129 ISVDAV 134
>AT5G25240.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 13 growth stages; Has 1807
Blast hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:8746779-8747174 REVERSE LENGTH=131
Length = 131
Score = 53.5 bits (127), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 9/115 (7%)
Query: 3 IELESESTTPTGHQRSNAFCFC-------FGPRRRSSWWQRVRTSHS-TVSGDRWWSRGI 54
+ + E+ ++ + AFC C F RR R R S + +R + G
Sbjct: 1 MATDRENLLSDDYEETAAFCGCGYFRSFSFTRWRRGDDESRSRGGWSGCLQEERRGNWGS 60
Query: 55 RALKKVREWSEILAGPRWKTFIRRLSHHRSH-KRMTKYQYDPFSYALNFDEGQNG 108
LK ++E SE +AGP+WK FIR S R +R + YD +Y+LNFD+G +G
Sbjct: 61 EKLKGLKEISEKIAGPKWKNFIRSFSSGRKKMRRDVDFTYDLKNYSLNFDDGGDG 115