Miyakogusa Predicted Gene
- Lj4g3v2789340.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2789340.1 Non Chatacterized Hit- tr|C5XW36|C5XW36_SORBI
Putative uncharacterized protein Sb04g004670
OS=Sorghu,28.85,3e-18,FtsH protease domain-like,NULL;
seg,NULL,CUFF.51670.1
(330 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G21960.1 | Symbols: | unknown protein; LOCATED IN: chloropla... 442 e-124
AT1G56180.1 | Symbols: | unknown protein; INVOLVED IN: biologic... 91 1e-18
AT5G27290.1 | Symbols: | unknown protein; LOCATED IN: chloropla... 75 6e-14
AT1G54680.3 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 59 4e-09
AT1G54680.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 57 2e-08
AT1G54680.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 56 3e-08
AT5G27290.2 | Symbols: | unknown protein; LOCATED IN: chloropla... 51 1e-06
>AT2G21960.1 | Symbols: | unknown protein; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G56180.1); Has 224 Blast
hits to 222 proteins in 59 species: Archae - 0; Bacteria
- 65; Metazoa - 0; Fungi - 0; Plants - 134; Viruses - 0;
Other Eukaryotes - 25 (source: NCBI BLink). |
chr2:9354990-9356958 FORWARD LENGTH=332
Length = 332
Score = 442 bits (1136), Expect = e-124, Method: Compositional matrix adjust.
Identities = 217/283 (76%), Positives = 250/283 (88%)
Query: 41 GVELNTLESAIAKKDSNAVKEALDQLSEGGWAKKWGSQPYVXXXXXXXXXXXXXGIKNAE 100
G +L++LESAI KKDSN VKEALD+LSE GWAKKW SQPY+ GIKNAE
Sbjct: 50 GFDLSSLESAINKKDSNGVKEALDKLSEEGWAKKWSSQPYLSRRTTSLRELTTLGIKNAE 109
Query: 101 NLAIPSVRNDAAFLFTVVGTTGFLGILTGQLPGDWGFFVPYLIGSISLVVLAVGSISPGL 160
LAIPSVRNDAAFLFTVVG+TGF+ +L GQLPGDWGFFVPYL+GSISLVVLAVGS+SPGL
Sbjct: 110 TLAIPSVRNDAAFLFTVVGSTGFIAVLAGQLPGDWGFFVPYLVGSISLVVLAVGSVSPGL 169
Query: 161 LQAAIGSFSTVFPDYQERIARHEAAHFLIAYLLGVPILGYSLDIGKEHVNLIDQRLEKLI 220
LQAAI FST FPDYQERIA HEAAHFL+AYL+G+PILGYSLDIGKEHVNLID+RL KLI
Sbjct: 170 LQAAISGFSTFFPDYQERIAAHEAAHFLVAYLIGLPILGYSLDIGKEHVNLIDERLAKLI 229
Query: 221 YSGQLNAKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNL 280
YSG+L++KE+DRLA V+MAGLAAEGL YDKVIGQSADLF+LQRFINR++P++S +QQQNL
Sbjct: 230 YSGKLDSKELDRLAAVAMAGLAAEGLKYDKVIGQSADLFSLQRFINRSQPKISNEQQQNL 289
Query: 281 TRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIESVA 323
TRWAV+++ASLLKNNK HEALMA+M+K ASV+ECIQTIE+ +
Sbjct: 290 TRWAVLYSASLLKNNKTIHEALMAAMSKNASVLECIQTIETAS 332
>AT1G56180.1 | Symbols: | unknown protein; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G27290.1); Has 436 Blast
hits to 436 proteins in 83 species: Archae - 0; Bacteria
- 153; Metazoa - 0; Fungi - 0; Plants - 160; Viruses -
0; Other Eukaryotes - 123 (source: NCBI BLink). |
chr1:21026243-21028047 REVERSE LENGTH=389
Length = 389
Score = 90.9 bits (224), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 84/166 (50%), Gaps = 9/166 (5%)
Query: 164 AIGSFSTVFPDYQERIARHEAAHFLIAYLLGVPILGYSLDI---------GKEHVNLIDQ 214
+ S +P ++ RI HEA H L+AYL+G PI G LD G+ DQ
Sbjct: 217 CLAQVSCYWPPHKRRIVVHEAGHLLVAYLMGCPIRGVILDPVVAMQMGVQGQAGTQFWDQ 276
Query: 215 RLEKLIYSGQLNAKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSK 274
++E I G+L+ DR ++V AG+AAE L+Y + G D + +P LS
Sbjct: 277 KMESEIAEGRLSGSSFDRYSMVLFAGIAAEALVYGEAEGGENDENLFRSISVLLEPPLSV 336
Query: 275 DQQQNLTRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIE 320
Q N RW+V+ + +LLK +K +H A + ++ + + I+ IE
Sbjct: 337 AQMSNQARWSVLQSYNLLKWHKAAHRAAVEALQVGSPLSIVIRRIE 382
>AT5G27290.1 | Symbols: | unknown protein; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G54680.3); Has 30201 Blast
hits to 17322 proteins in 780 species: Archae - 12;
Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
BLink). | chr5:9618590-9620473 REVERSE LENGTH=341
Length = 341
Score = 75.1 bits (183), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 82/157 (52%), Gaps = 13/157 (8%)
Query: 175 YQERIARHEAAHFLIAYLLGVPILGYSLD----IGKE-HVNL------IDQRLEKLIYSG 223
Y R+ +HEA HFL+AYL+G+ GY+L + KE +N+ +D + + SG
Sbjct: 179 YHNRVVQHEAGHFLVAYLVGILPRGYTLSSLEALQKEGSLNIQAGSAFVDYEFLEEVNSG 238
Query: 224 QLNAKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNLTRW 283
+++A ++R + +++AG+A E L+Y G D+ L + K Q RW
Sbjct: 239 KVSATMLNRFSCIALAGVATEYLLYGYAEGGLDDISKLDGLVKSLGFTQKKADSQ--VRW 296
Query: 284 AVMFAASLLKNNKESHEALMASMTKKASVVECIQTIE 320
+V+ LL+ ++ + L +M+K SV CIQ IE
Sbjct: 297 SVLNTILLLRRHEIARSKLAQAMSKGESVGSCIQIIE 333
>AT1G54680.3 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G27290.1). |
chr1:20413211-20414482 FORWARD LENGTH=217
Length = 217
Score = 58.9 bits (141), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 82/169 (48%), Gaps = 21/169 (12%)
Query: 179 IARHEAAHFLIAYLLGVPILGY---SLDIGKEHVNLIDQRLE-----------KLIYSGQ 224
+ +HE+ HFL+ YLLGV Y +L+ +++V+ + R+E K GQ
Sbjct: 51 VVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVSNVTGRVEFVGFEFLKQLMKDDVDGQ 110
Query: 225 LN-----AKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQN 279
+N +K ++ + V + G+ E +++ G +D+ L + ++ +++
Sbjct: 111 MNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLG--FTESEKEA 168
Query: 280 LTRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIESVAAEFNI 328
+WAV SLL ++KE+ +L +M K + CI+ IES + I
Sbjct: 169 HIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 217
>AT1G54680.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G27290.1);
Has 200 Blast hits to 200 proteins in 57 species: Archae
- 0; Bacteria - 59; Metazoa - 0; Fungi - 0; Plants -
127; Viruses - 0; Other Eukaryotes - 14 (source: NCBI
BLink). | chr1:20413211-20414482 FORWARD LENGTH=223
Length = 223
Score = 56.6 bits (135), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 82/175 (46%), Gaps = 27/175 (15%)
Query: 179 IARHEAAHFLIAYLLGVPILGY---SLDIGKEHVNLIDQRLE-----------------K 218
+ +HE+ HFL+ YLLGV Y +L+ +++V+ + R+E K
Sbjct: 51 VVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVSNVTGRVEFVGFEFLKQVGAANQLMK 110
Query: 219 LIYSGQLN-----AKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLS 273
GQ+N +K ++ + V + G+ E +++ G +D+ L + +
Sbjct: 111 DDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLG--FT 168
Query: 274 KDQQQNLTRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIESVAAEFNI 328
+ +++ +WAV SLL ++KE+ +L +M K + CI+ IES + I
Sbjct: 169 ESEKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 223
>AT1G54680.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G27290.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr1:20413337-20414482 FORWARD LENGTH=219
Length = 219
Score = 56.2 bits (134), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/175 (24%), Positives = 82/175 (46%), Gaps = 27/175 (15%)
Query: 179 IARHEAAHFLIAYLLGVPILGY---SLDIGKEHVNLIDQRLE-----------------K 218
+ +HE+ HFL+ YLLGV Y +L+ +++V+ + R+E K
Sbjct: 47 VVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVSNVTGRVEFVGFEFLKQVGAANQLMK 106
Query: 219 LIYSGQLN-----AKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLS 273
GQ+N +K ++ + V + G+ E +++ G +D+ L + +
Sbjct: 107 DDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLG--FT 164
Query: 274 KDQQQNLTRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIESVAAEFNI 328
+ +++ +WAV SLL ++KE+ +L +M K + CI+ IES + I
Sbjct: 165 ESEKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 219
>AT5G27290.2 | Symbols: | unknown protein; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G54680.3); Has 199 Blast
hits to 194 proteins in 57 species: Archae - 0; Bacteria
- 61; Metazoa - 0; Fungi - 0; Plants - 129; Viruses - 0;
Other Eukaryotes - 9 (source: NCBI BLink). |
chr5:9618787-9620473 REVERSE LENGTH=262
Length = 262
Score = 50.8 bits (120), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 51/84 (60%), Gaps = 11/84 (13%)
Query: 175 YQERIARHEAAHFLIAYLLGVPILGYSLD----IGKE-HVNL------IDQRLEKLIYSG 223
Y R+ +HEA HFL+AYL+G+ GY+L + KE +N+ +D + + SG
Sbjct: 179 YHNRVVQHEAGHFLVAYLVGILPRGYTLSSLEALQKEGSLNIQAGSAFVDYEFLEEVNSG 238
Query: 224 QLNAKEIDRLAVVSMAGLAAEGLI 247
+++A ++R + +++AG+A E L+
Sbjct: 239 KVSATMLNRFSCIALAGVATEYLL 262