Miyakogusa Predicted Gene
- Lj1g3v2546310.2
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2546310.2 tr|Q0DH96|Q0DH96_ORYSJ Os05g0481700 protein
OS=Oryza sativa subsp. japonica GN=Os05g0481700 PE=4
SV=,79.17,0.00000000000004,seg,NULL; coiled-coil,NULL;
UNCHARACTERIZED,NULL,CUFF.29145.2
(381 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G01970.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 245 4e-65
AT1G30050.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 228 5e-60
AT2G30530.1 | Symbols: | unknown protein; LOCATED IN: cellular_... 188 5e-48
AT4G02800.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 103 2e-22
>AT5G01970.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G30050.1); Has 240 Blast hits to 236 proteins
in 72 species: Archae - 0; Bacteria - 15; Metazoa - 51;
Fungi - 19; Plants - 119; Viruses - 0; Other Eukaryotes
- 36 (source: NCBI BLink). | chr5:373014-374651 REVERSE
LENGTH=351
Length = 351
Score = 245 bits (626), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 161/357 (45%), Positives = 197/357 (55%), Gaps = 87/357 (24%)
Query: 1 MAYRRRQGITRASTFKEEFHSSLDNNDNVKDSSVHHEKNXXXXXXXXXXXXXXXXXXXXX 60
MAYRRRQGI + +TFKEE D + S+ K+
Sbjct: 1 MAYRRRQGIGKFATFKEEV-------DRLPPESITAVKD--------------------- 32
Query: 61 XXXXXXXXXXXRREPSLSFAFAPSYDADHQRSKNFPSQFKPL------------------ 102
R P+ S +P++D RSKNF ++ K L
Sbjct: 33 -----------RSPPARS---SPAFD--QPRSKNFTTEPKGLWGVIAQKAKSVIEDDKSS 76
Query: 103 ------------YEPPDSNKMMENPTFRKGLDRITTSLNQLGDTFEKAFEEGRTMVESKT 150
Y + K M+NP R+GLD++T+SLNQ+GDTFEKAFE+GRT+VE+KT
Sbjct: 77 DRSTTASQSRFSYLSDEGFKKMDNPKLRRGLDKLTSSLNQIGDTFEKAFEDGRTLVENKT 136
Query: 151 ADL-----RTQIRRKGNVPEDTN------LASEMRNPWQQPAQTQNPSRQETQLKASRDV 199
AD+ + Q RR+G ED N ++S + +QP Q N ETQLKASRDV
Sbjct: 137 ADIIQETRKLQTRRRGTGGEDENQNQSYGVSSSWKKSPEQPMQL-NHIEHETQLKASRDV 195
Query: 200 XXXXXXXXXXXXXXXXTVKADLAFAKARCAQLEEENKLLRDREGSEKGQNREDDDLIRLQ 259
TVKADLAFAK RCAQLEEENK LR+ EKG N D+DLIRLQ
Sbjct: 196 AMATAAKAKLLLRELKTVKADLAFAKERCAQLEEENKHLRESH-REKGSNPADEDLIRLQ 254
Query: 260 LETLLAEKARLANENETYSRENRFLREVVEYHQLTMQDVVYLDEGMEEVTEVYPVDS 316
LE+LLAEKARLA+EN Y+RENRFLRE+VEYHQLTMQDVVY+DEG EEVT+V P S
Sbjct: 255 LESLLAEKARLAHENSVYARENRFLREIVEYHQLTMQDVVYIDEGSEEVTQVSPFVS 311
>AT1G30050.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G01970.1); Has 246 Blast hits to 244 proteins
in 61 species: Archae - 0; Bacteria - 8; Metazoa - 78;
Fungi - 10; Plants - 117; Viruses - 0; Other Eukaryotes
- 33 (source: NCBI BLink). | chr1:10543177-10544418
FORWARD LENGTH=389
Length = 389
Score = 228 bits (581), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 127/206 (61%), Positives = 146/206 (70%), Gaps = 18/206 (8%)
Query: 114 NPTFRKGLDRITTSLNQLGDTFEKAFEEGRTMVESKTADLRTQIRRKGN--VPEDTN--- 168
NPT RK +D+ITTSLN +GD+FEKAFEEGRT+V S QIRRKG+ + D N
Sbjct: 112 NPTIRKSIDKITTSLNHIGDSFEKAFEEGRTIVAS-------QIRRKGSDLIDSDNNNYH 164
Query: 169 LASEMRNPWQQPAQTQNPSRQETQLKASRDVXXXXXXXXXXXXXXXXTVKADLAFAKARC 228
+S +PWQ Q P+ +E+QLKASRDV TVKADLAFAK RC
Sbjct: 165 QSSGSSSPWQPLTQ---PNPRESQLKASRDVAMATAAKAKLLLRELKTVKADLAFAKERC 221
Query: 229 AQLEEENKLLRDREGSEKGQNR-EDDDLIRLQLETLLAEKARLANENETYSRENRFLREV 287
+QLEEENK LRD +KG N DDDLIRLQLETLLAEKARLA+EN Y+RENRFLRE+
Sbjct: 222 SQLEEENKRLRDNR--DKGNNNPADDDLIRLQLETLLAEKARLAHENSIYARENRFLREI 279
Query: 288 VEYHQLTMQDVVYLDEGMEEVTEVYP 313
VEYHQLTMQDVVY+DEG+EEV EV P
Sbjct: 280 VEYHQLTMQDVVYIDEGIEEVAEVNP 305
>AT2G30530.1 | Symbols: | unknown protein; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 25 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G01970.1); Has 5513 Blast hits to 872 proteins
in 154 species: Archae - 0; Bacteria - 30; Metazoa -
615; Fungi - 144; Plants - 149; Viruses - 12; Other
Eukaryotes - 4563 (source: NCBI BLink). |
chr2:13009071-13010867 FORWARD LENGTH=371
Length = 371
Score = 188 bits (478), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 108/203 (53%), Positives = 133/203 (65%), Gaps = 13/203 (6%)
Query: 113 ENPTFRKGLDRITTSLNQLGDTFEKAFEEGRTMVESKTADLRTQIRRKGNVPEDTNLASE 172
ENP+ ++ LD IT+SLN +G T EEG T VE++TA + + R+K + + +L
Sbjct: 158 ENPSLQRRLDAITSSLNYIGGTIGTVVEEGITAVENRTAGIIQETRKK--IKKKPSLTRN 215
Query: 173 MRNPWQQPAQTQNPSRQETQLKASRDVXXXXXXXXXXXXXXXXTVKADLAFAKARCAQLE 232
+NP Q + E QLKASRDV VK+DLAFAK RCAQLE
Sbjct: 216 QQNPEIQ-------ADLEIQLKASRDVAMAMAAKAKLLLRELKMVKSDLAFAKQRCAQLE 268
Query: 233 EENKLLRD-REGSEKGQNREDDDLIRLQLETLLAEKARLANENETYSRENRFLREVVEYH 291
EENK+LR+ R G + +DDDL+RLQLETLLAEKARLA+EN Y+REN +LR VVEYH
Sbjct: 269 EENKVLRENRSGDSQT---DDDDLVRLQLETLLAEKARLAHENSIYTRENLYLRGVVEYH 325
Query: 292 QLTMQDVVYLDEGMEEVTEVYPV 314
QLTMQDVVY DE EEVTEVYP+
Sbjct: 326 QLTMQDVVYFDEKTEEVTEVYPI 348
>AT4G02800.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G01970.1); Has 3209 Blast
hits to 2720 proteins in 308 species: Archae - 13;
Bacteria - 213; Metazoa - 1207; Fungi - 247; Plants -
183; Viruses - 21; Other Eukaryotes - 1325 (source: NCBI
BLink). | chr4:1250126-1251478 FORWARD LENGTH=333
Length = 333
Score = 103 bits (257), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 84/141 (59%), Gaps = 12/141 (8%)
Query: 182 QTQNPSRQETQ-------LKASRDVXXXXXXXXXXXXXXXXTVKADLAFAKARCAQLEEE 234
QT++ S +E + +K ++++ T+K+DL+F + RC LEEE
Sbjct: 160 QTEDSSNEEEKRPKERKIMKKAKNIAISMAAKANSLARELKTIKSDLSFIQERCGLLEEE 219
Query: 235 NKLLRDREGSEKGQNREDDDLIRLQLETLLAEKARLANENETYSRENRFLREVVEYHQLT 294
NK LRD G KG E+DDL+RLQLE LLAEKARLANEN REN+ L ++VEYHQ+T
Sbjct: 220 NKRLRD--GFVKGVRPEEDDLVRLQLEVLLAEKARLANENANLVRENQCLHQMVEYHQIT 277
Query: 295 MQDVVYLDEGMEEVTEVYPVD 315
QD L E+V + + +D
Sbjct: 278 SQD---LSPSYEQVVQGFCLD 295