Miyakogusa Predicted Gene
- Lj4g3v2375020.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2375020.1 Non Chatacterized Hit- tr|B9ET76|B9ET76_ORYSJ
Putative uncharacterized protein OS=Oryza sativa
subsp,42.05,0.000001,seg,NULL; FAMILY NOT NAMED,NULL;
coiled-coil,NULL,CUFF.50873.1
(339 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G15545.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 355 3e-98
AT1G16520.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 204 7e-53
AT1G56080.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 199 3e-51
>AT4G15545.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G16520.1); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr4:8875932-8877567 FORWARD LENGTH=337
Length = 337
Score = 355 bits (911), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 195/338 (57%), Positives = 237/338 (70%), Gaps = 14/338 (4%)
Query: 2 AAESGDLNFDLPDDLVQVLPSDPFEQLDLARKITSIALSARVNALESESSELRARIAEKD 61
+A +G +FDLPD+L+QVLPSDPFEQLD+ARKITSIALS RV+ALESESS+LR +AEK+
Sbjct: 14 SAITGSRSFDLPDELLQVLPSDPFEQLDVARKITSIALSTRVSALESESSDLRELLAEKE 73
Query: 62 HLIAELQSQLDSLDATLSATADNLHRAEQDKESLINENASLSNTVRKLNRDVSKLEVFRK 121
ELQS ++SL+A+LS L A+ +KE+LI ENASLSNTV++L RDVSKLE FRK
Sbjct: 74 KEFEELQSHVESLEASLSDAFHKLSLADGEKENLIRENASLSNTVKRLQRDVSKLEGFRK 133
Query: 122 TLMRSLQEEDDNSGGAPDMVAKVXXXXXXXXXXXFGDNEAXXXXXXXXXXXXXXDTGNSF 181
TLM SLQ++D N+G ++AK D +
Sbjct: 134 TLMMSLQDDDQNAGTT-QIIAKPTPNDD--------DTPFQPSRHSSIQSQQASEAIEPA 184
Query: 182 AEDHESDAIRPRVPYSLLLASQSTTPRLTPPGSPPSLSASVSPTRTSKPVSPRRHSISFA 241
A D+E+DA +P + SL L SQ+TTPRLTPPGSPP LSAS +P TS+P+SPRRHS+SFA
Sbjct: 185 ATDNENDAPKPSLSASLPLVSQTTTPRLTPPGSPPILSASGTPKTTSRPISPRRHSVSFA 244
Query: 242 TSRGMHDDRTXXXXXXXXXXXXXXXXXXRTRVDGKEFFRQVRNRLSYEQFGAFLANVKEL 301
T+RGM DD RTRVDGKEFFRQVR+RLSYEQFGAFL NVK+L
Sbjct: 245 TTRGMFDD-----TRSSISISEPGSQTARTRVDGKEFFRQVRSRLSYEQFGAFLGNVKDL 299
Query: 302 NSHKQTREVTLQKADEIFGPENKDLYNIFEGLITRNVH 339
N+HKQTRE TL+KA+EIFG +N+DLY IFEGLITRN H
Sbjct: 300 NAHKQTREETLRKAEEIFGGDNRDLYVIFEGLITRNAH 337
>AT1G16520.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G56080.1); Has 243 Blast
hits to 234 proteins in 69 species: Archae - 2; Bacteria
- 2; Metazoa - 61; Fungi - 9; Plants - 125; Viruses - 0;
Other Eukaryotes - 44 (source: NCBI BLink). |
chr1:5648904-5650998 FORWARD LENGTH=325
Length = 325
Score = 204 bits (519), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 129/345 (37%), Positives = 182/345 (52%), Gaps = 38/345 (11%)
Query: 8 LNFDLPDDLVQVLPSDPFEQLDLARKITSIALSARVNALESESSELRARIAEKDHLIAEL 67
L+F+LP++++ V+P DPFEQLDLARKITS+A+++RV+ L+SE ELR ++ K+ ++ EL
Sbjct: 6 LDFELPEEVLSVIPMDPFEQLDLARKITSMAIASRVSNLDSEVVELRQKLLGKESVVREL 65
Query: 68 QSQLDSLDATLSATADNLHRAEQDKESLINENASLSNTVRKLNRDVSKLEVFRKTLMRSL 127
+ + L+ L +D +L E SL+ TV KL RD++KLE F++ L++SL
Sbjct: 66 EEKASRLERDCREADSRLKVVLEDNMNLTKEKDSLAMTVTKLTRDLAKLETFKRQLIKSL 125
Query: 128 QEED------------DNSGGAPDMVAKVXXXXXXXXXXXFGDNEAXXXXXXXXXXXXXX 175
+E D G P ++ D
Sbjct: 126 SDESGPQTEPVDIRTCDQPGSYPGKDGRINAHSIKQAYSGSTDTNNPVVEASKY------ 179
Query: 176 DTGNSFAEDHESDAIRPRVPYSLLLASQSTTPRLTPPGSPPSLSASVSPTRTSKPVSPRR 235
TGN F+ + +PRLTP +P +S SVSP S SP+R
Sbjct: 180 -TGNKFS------------------MTSYISPRLTPTATPKIISTSVSPRGYSAAGSPKR 220
Query: 236 HSISFATSRGMHDDRTXXXXXXXXXXXXXXXXXXRT-RVDGKEFFRQVRNRLSYEQFGAF 294
S + + ++ + RT R+DGKEFFRQ R+RLSYEQF +F
Sbjct: 221 TSGAVSPTKATLWYPSSQQSSAANSPPRNRTLPARTPRMDGKEFFRQARSRLSYEQFSSF 280
Query: 295 LANVKELNSHKQTREVTLQKADEIFGPENKDLYNIFEGLITRNVH 339
LAN+KELN+ KQTRE TL+KADEIFG ENKDLY F+GL+ RN+
Sbjct: 281 LANIKELNAQKQTREETLRKADEIFGEENKDLYLSFQGLLNRNMR 325
>AT1G56080.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 12 plant structures; EXPRESSED DURING: 6
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G16520.1); Has 196 Blast
hits to 193 proteins in 50 species: Archae - 2; Bacteria
- 0; Metazoa - 9; Fungi - 2; Plants - 132; Viruses - 0;
Other Eukaryotes - 51 (source: NCBI BLink). |
chr1:20974457-20976215 REVERSE LENGTH=310
Length = 310
Score = 199 bits (505), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 124/340 (36%), Positives = 180/340 (52%), Gaps = 38/340 (11%)
Query: 1 MAAESGDLNFDLPDDLVQVLPSDPFEQLDLARKITSIALSARVNALESESSELRARIAEK 60
M+ GD F+L D+++ V+P+DP++QLDLARKITS+A+++RV+ LES+ S LR ++ EK
Sbjct: 1 MSQSGGD--FNLSDEILAVIPTDPYDQLDLARKITSMAIASRVSNLESQVSGLRQKLLEK 58
Query: 61 DHLIAELQSQLDSLDATLSATADNLHRAEQDKESLINENASLSNTVRKLNRDVSKLEVFR 120
D L+ EL+ ++ S + +L + L E SL+ T +KL RD +KLE F+
Sbjct: 59 DRLVHELEDRVSSFERLYHEADSSLKNVVDENMKLTQERDSLAITAKKLGRDYAKLEAFK 118
Query: 121 KTLMRSLQEEDDNSGGAPDMVAKVXXXXXXXXXXXFGDNEAXXXXXXXXXXXXXXDTGNS 180
+ LM+SL +++ + D V V + +NE
Sbjct: 119 RQLMQSLNDDNPSQTETAD-VRMVPRGKDENSNGSYSNNEG------------------- 158
Query: 181 FAEDHESDAIRPRVPYSLLLASQSTTPRLTPPGSPPSLSASVSPTRTSKPVSPRRHSISF 240
+E + ++ P+ +P TP G+P LS + SP S SP+ S +
Sbjct: 159 LSEARQRQSMTPQF-----------SPAFTPSGTPKILSTAASPRSYSAASSPKLFSGAA 207
Query: 241 ATSRGMHDDRTXXXXXXXXXXXXX-----XXXXXRTRVDGKEFFRQVRNRLSYEQFGAFL 295
+ + +D R R+DGKEFFRQ R+RLSYEQF AFL
Sbjct: 208 SPTSSHYDIRMWSSTSQQSSVANSPPRSHSVSARHPRIDGKEFFRQARSRLSYEQFSAFL 267
Query: 296 ANVKELNSHKQTREVTLQKADEIFGPENKDLYNIFEGLIT 335
AN+KELN+ KQ RE TLQKA+EIFG EN DLY F+GL+T
Sbjct: 268 ANIKELNARKQGREETLQKAEEIFGKENNDLYISFKGLLT 307