Miyakogusa Predicted Gene
- Lj1g3v3904400.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3904400.1 Non Characterized Hit- tr|J3LXE0|J3LXE0_ORYBR
Uncharacterized protein OS=Oryza brachyantha
GN=OB04G1,28.12,2e-16,seg,NULL; coiled-coil,NULL,CUFF.31477.1
(445 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                  Score      E
Sequences producing significant alignments:                       (bits)   Value
AT5G13970.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   166   3e-41
AT5G13310.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...    83   5e-16
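The hit summary above can be read programmatically. The following is a minimal sketch, assuming each summary line ends with a bit score and an E-value separated by whitespace; `parse_hit_line` is a hypothetical helper name, not part of BLAST or any library.

```python
def parse_hit_line(line):
    """Split one hit-summary line into (sequence id, bit score, E-value).

    Assumes the last two whitespace-separated fields are the bit score
    and the E-value; returns None when the line does not fit that shape.
    """
    parts = line.rsplit(None, 2)
    if len(parts) != 3:
        return None
    description, bits, evalue = parts
    # Assumption: a bare exponent such as "e-100" stands for "1e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    try:
        return description.split()[0], float(bits), float(evalue)
    except ValueError:
        return None


hits = [
    "AT5G13970.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 166 3e-41",
    "AT5G13310.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 83 5e-16",
]
for hit in hits:
    print(parse_hit_line(hit))
```

Splitting from the right keeps the parser independent of the free-text description, which itself contains spaces and `|` separators.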
>AT5G13970.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 23 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G13310.1);
Has 1807 Blast hits to 1807 proteins in 277 species:
Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347;
Plants - 385; Viruses - 0; Other Eukaryotes - 339
(source: NCBI BLink). | chr5:4506383-4507804 REVERSE
LENGTH=404
Length = 404
Score = 166 bits (421), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 133/373 (35%), Positives = 190/373 (50%), Gaps = 75/373 (20%)
Query: 94 EPTKPSDYDDEQWEIRSGIGLDSTLDFEEEEDQYDKQAVGNENSGDRVY--MKDITDDRV 151
E KPSDYDDE+WEI++ IG+DSTLD EEE D DK A+ G++VY MKD+ DD
Sbjct: 79 ELQKPSDYDDEEWEIKNSIGMDSTLDMEEELDDNDKVAL-----GEKVYCCMKDVNDD-- 131
Query: 152 EISPCDRVYMRDMTDDGVEISSCGVLPTKFREFERDLRANHMAARIRLKQDEEATKEIGA 211
D+ VE LP F E E+D RAN +AA++RLK+D EA ++ +
Sbjct: 132 ---------YETEADEWVE------LPASFNEREKDPRANLIAAKLRLKEDAEAVNKLNS 176
Query: 212 LHVSEKSPPDISXXXX----------------------XXDAVNPKSILKSRENPS-EPK 248
LHVSE+ ++S D K ILK REN + + K
Sbjct: 177 LHVSEELQDNLSMSTENEKPFVVSEDNLLGAFKESHVGSSDENGLKPILKRRENQADDSK 236
Query: 249 PQKRVRFDSECDGKGDDEENEGTRDVRRKTTSMEEEEVEALNQPSKAQEFASAVPDYIRN 308
KRVRF S+ D EG D + +S E++VEA+ + + +PDY+RN
Sbjct: 237 SPKRVRFSSDV---KDRTLTEGDNDSVMEASSPNEDKVEAV--------YPTGIPDYMRN 285
Query: 309 PSRYTHYTFDSASDMDDKANKEAYMGFLAQMKGTGSQADDALDDLP-SVTFISKKKSSDV 367
PS+YT YTF+S ++D+++N++AYM FL ++ D L +LP SV F+ K+K
Sbjct: 286 PSKYTRYTFESG-EVDEESNRKAYMDFLNMIRSKDESLVDPLMELPRSVAFVPKRKPMAE 344
Query: 368 TMVENEMVSRQKLDVGKESMNKKAFPVSIAAAGDN-ENSDVCAMEEDEPEVMEDAKKSSQ 426
+ VEN ++K +A A D E+ + AMEEDEPE + K
Sbjct: 345 SKVEN--------------IDKDCEGRRVAIAVDTIEDCTISAMEEDEPETAQHVTKRPG 390
Query: 427 RLNNRKYRKKPQE 439
R + ++ P+E
Sbjct: 391 RQYRARAKEDPEE 403
>AT5G13310.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 7
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G13970.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:4263561-4264989 FORWARD LENGTH=370
Length = 370
Score = 82.8 bits (203), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 87/270 (32%), Positives = 122/270 (45%), Gaps = 64/270 (23%)
Query: 103 DEQWEIRSGIGLDSTLDFEEEEDQYDKQAVGNENSGDRVYMKDITDDRVEISPCDRVYMR 162
D W IR+ +GLD TLD E EED+YDK A+G EN G+
Sbjct: 101 DGVWSIRASMGLDRTLDDEAEEDEYDKVALGEENDGE----------------------- 137
Query: 163 DMTDDGVEISSCGVLPTKFREFERDLRANHMAARIRLKQDE-EATK-EIGALHVSEKSPP 220
CG + RD RAN++AARIRLK+DE EA K A SE P
Sbjct: 138 ----------GCGRI--------RDPRANYVAARIRLKEDEIEANKFNTSASQPSESKEP 179
Query: 221 DISXXXXXXDAVNPKSILKSRENPS--EPKPQKRVRFDSECDGKGDDEENEGTRDVRRKT 278
+A+ K ILK +EN S E + KRVRFDS + E+ + K
Sbjct: 180 ---HAEESSEAMPRKPILKRKENSSDSEARTSKRVRFDSVPEETLKKPEDTCSASASSKI 236
Query: 279 TSMEEEEVEALNQPSKAQEFASAVPDYIRNPSRYTHYTFDSASDMDDKANKEAYMGFLAQ 338
S + + + VPDY+ NPS YT Y+FD + ++D ++ YM
Sbjct: 237 VSHQGKS-------------GARVPDYLLNPSSYTRYSFDPSCELDVESPTGEYMDTPNA 283
Query: 339 MKGTGSQADDALDDLPSVTFISKKKSSDVT 368
++G + ++ P V+FI + K+ DV+
Sbjct: 284 VEGLKNPESES---FPKVSFIPQNKTKDVS 310
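The "Identities / Positives / Gaps" lines in the two alignments above can be checked with a few lines of Python. This is a minimal sketch; `parse_alignment_stats` and `floor_percent` are hypothetical helper names. The printed percentages in this report are consistent with rounding down (for example 133/373 is about 35.7% but is shown as 35%); that is an observation from these two records only, not a documented guarantee.

```python
import re

# Matches fields such as "Identities = 133/373 (35%)".
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Return {name: (count, alignment length, printed percent)}."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STAT.findall(line)
    }


def floor_percent(num, den):
    """Integer percentage rounded down (assumed to mirror the report)."""
    return 100 * num // den


line = "Identities = 133/373 (35%), Positives = 190/373 (50%), Gaps = 75/373 (20%)"
for name, (num, den, printed) in parse_alignment_stats(line).items():
    print(name, num, den, printed, floor_percent(num, den))
```

The same check applied to the second alignment (87/270, 122/270, 64/270) gives 32, 45, and 23, matching the printed values.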