Miyakogusa Predicted Gene
- Lj1g3v1785790.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1785790.1 Non Characterized Hit- tr|D8TLL4|D8TLL4_VOLCA
Putative uncharacterized protein OS=Volvox carteri
GN=,30.7,3e-16,seg,NULL,CUFF.27821.1
(261 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                 Score   E
Sequences producing significant alignments:                     (bits) Value
AT4G32960.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 350 6e-97
AT4G32970.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 293 8e-80
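The summary rows above can be read programmatically. The following is a minimal sketch, assuming each row starts with the subject identifier and ends with the bit score followed by the E-value; the names `SUMMARY_ROWS`, `ROW_RE` and `parse_summary` are illustrative and not part of BLAST.

```python
import re

# The two summary rows, copied verbatim from the hit table above.
SUMMARY_ROWS = """\
AT4G32960.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 350 6e-97
AT4G32970.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 293 8e-80
"""

# Assumed layout: subject ID first, then free text, then bit score and E-value
# as the last two whitespace-separated fields.
ROW_RE = re.compile(
    r"^(?P<subject>\S+)\s.*\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_summary(text):
    """Return a list of (subject_id, bit_score, e_value) tuples."""
    hits = []
    for line in text.splitlines():
        match = ROW_RE.match(line.strip())
        if match is None:
            continue  # skip blank or non-matching lines
        evalue = match.group("evalue")
        # Legacy BLAST may print very small E-values without a mantissa
        # (for example "e-100"); prepend "1" so float() accepts them.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        hits.append(
            (match.group("subject"), float(match.group("bits")), float(evalue))
        )
    return hits


hits = parse_summary(SUMMARY_ROWS)
for subject, bits, evalue in hits:
    print(subject, bits, evalue)
```

For anything beyond a quick check, a dedicated BLAST parser is likely a safer choice than a regular expression, since the description column is free text.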
>AT4G32960.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G32970.1); Has 106 Blast hits to 106 proteins
in 39 species: Archae - 0; Bacteria - 0; Metazoa - 62;
Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr4:15908785-15910141 REVERSE
LENGTH=264
Length = 264
Score = 350 bits (898), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 157/233 (67%), Positives = 197/233 (84%), Gaps = 1/233 (0%)
Query: 16 YTLQPSRICNEDILFCIDIDPQSLVEMKTATGHNGRPLTRLDSIKQSILLFVHSKLTINP 75
          Y L+P RI +EDIL CID+D +S+VEMKT TG NGRPL R++ +KQ+I+LF+H+KL+INP
Sbjct: 16 YALKPGRIKSEDILICIDVDAESMVEMKT-TGTNGRPLIRMECVKQAIILFIHNKLSINP 74
Query: 76 DHRFAFATLSNTVSWLKKDFSSDVESTMAAMRGLSATNISTQPDLTNLFRLAAHEAKKSR 135
          DHRFAFATL+ + +WLKK+F+SD ES +A++RGLS    S++ DLT LFR AA EAK SR
Sbjct: 75 DHRFAFATLAKSAAWLKKEFTSDAESAVASLRGLSGNKSSSRADLTLLFRAAAQEAKVSR 134
Query: 136 AQGRILRVILFYCRSNVRPQHQWPVNQKLYTLDVMYLHDKPGPENCPQEVYDTLVEALEH 195
           AQ RI RVIL YCRS++RP H+WP+NQKL+TLDVMYLHDKP P+NCPQ+VYD+LV+A+EH
Sbjct: 135 AQNRIFRVILIYCRSSMRPTHEWPLNQKLFTLDVMYLHDKPSPDNCPQDVYDSLVDAVEH 194
Query: 196 VSEYEGYILESGQGLARVVFRHVLILLSHPQQRCIQENIDIPKSLAKKAPQGE 248
           VSEYEGYI ESGQGLAR VF+ + +LL+HPQQRC Q+++DIP SLAKK P  E
Sbjct: 195 VSEYEGYIFESGQGLARSVFKPMSMLLTHPQQRCAQDDLDIPMSLAKKVPVTE 247
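The Identities and Gaps figures reported for AT4G32960.1 can be rechecked directly from the aligned rows. The sketch below concatenates the four Query/Sbjct blocks above and counts matching columns; `alignment_stats` is an illustrative helper, not a BLAST function. Positives are not recomputed here because they depend on the substitution matrix. The percentages in the report are consistent with integer truncation (197/233 is about 84.5% and is shown as 84%), so the sketch uses integer division.

```python
# Aligned rows for AT4G32960.1, copied from the four Query/Sbjct blocks above.
QUERY = (
    "YTLQPSRICNEDILFCIDIDPQSLVEMKTATGHNGRPLTRLDSIKQSILLFVHSKLTINP"
    "DHRFAFATLSNTVSWLKKDFSSDVESTMAAMRGLSATNISTQPDLTNLFRLAAHEAKKSR"
    "AQGRILRVILFYCRSNVRPQHQWPVNQKLYTLDVMYLHDKPGPENCPQEVYDTLVEALEH"
    "VSEYEGYILESGQGLARVVFRHVLILLSHPQQRCIQENIDIPKSLAKKAPQGE"
)
SBJCT = (
    "YALKPGRIKSEDILICIDVDAESMVEMKT-TGTNGRPLIRMECVKQAIILFIHNKLSINP"
    "DHRFAFATLAKSAAWLKKEFTSDAESAVASLRGLSGNKSSSRADLTLLFRAAAQEAKVSR"
    "AQNRIFRVILIYCRSSMRPTHEWPLNQKLFTLDVMYLHDKPSPDNCPQDVYDSLVDAVEH"
    "VSEYEGYIFESGQGLARSVFKPMSMLLTHPQQRCAQDDLDIPMSLAKKVPVTE"
)


def alignment_stats(query, sbjct):
    """Return (alignment_length, identities, gaps) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(query, sbjct))
    # A column is an identity when both rows carry the same residue.
    identities = sum(q == s and q != "-" for q, s in pairs)
    # A column is a gap when either row carries the gap character.
    gaps = sum(q == "-" or s == "-" for q, s in pairs)
    return len(pairs), identities, gaps


length, identities, gaps = alignment_stats(QUERY, SBJCT)
print(f"Identities = {identities}/{length} ({100 * identities // length}%)")
print(f"Gaps = {gaps}/{length} ({100 * gaps // length}%)")
```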
>AT4G32970.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G32960.1); Has 552 Blast hits to 489 proteins
in 85 species: Archae - 4; Bacteria - 14; Metazoa - 187;
Fungi - 12; Plants - 225; Viruses - 0; Other Eukaryotes
- 110 (source: NCBI BLink). | chr4:15910671-15914300
REVERSE LENGTH=638
Length = 638
Score = 293 bits (750), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 137/219 (62%), Positives = 169/219 (77%), Gaps = 3/219 (1%)
Query: 25 NEDILFCIDIDPQSLVEMKTATGHNGRPLTRLDSIKQSILLFVHSKLTINPDHRFAFATL 84
          +EDIL C+D+D +S VEMKT TG NG+PL RL+ +K +I  F+H KL+ N DHRFAFATL
Sbjct: 21 SEDILICVDVDAESTVEMKT-TGTNGKPLNRLECLKLAITRFIHDKLSRNSDHRFAFATL 79
Query: 85 SNTVSWLKKDFSSDVESTMAAMRGLSATNISTQPDLTNLFRLAAHEAKKSRAQGRILRVI 144
          S + +WLKK+FS+D +S  A++R +SAT  S   DLT LF+ AA EAK SRAQ RILRVI
Sbjct: 80 SQSAAWLKKEFSNDAKSAAASLREMSATRSSGPADLTFLFQEAAQEAKTSRAQNRILRVI 139
Query: 145 LFYCRSNVRPQHQWPVNQKLYTLDVMYLHDKPGPENCPQEVYDTLVEALEHVSEYEGYIL 204
           L YCRS+VRP H WP+NQKL+TLDVMYLHDK GP+NC  +VYD+LV+A+E VSEYEGYI
Sbjct: 140 LMYCRSSVRPTHDWPINQKLFTLDVMYLHDKSGPDNCTHDVYDSLVDAIERVSEYEGYIF 199
Query: 205 ESGQGLARVVFRHVLILLSHPQQRCIQENIDIPKSLAKK 243
           E   GL++ VFR +  LLSHP QRC Q  +D+PK  AKK
Sbjct: 200 EGSHGLSQSVFRRMSTLLSHPPQRCAQ--VDLPKPPAKK 236
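The Score and Expect values reported for the two hits are linked by the standard relation E = m * n * 2^(-S'), where S' is the bit score and m and n are the query and database lengths. The sketch below is only a rough check under stated assumptions: it uses the raw lengths printed in this report (261 letters, 14,482,855 total letters) rather than the effective lengths BLAST computes internally, and it uses the rounded bit scores, so it is expected to agree with the reported E-values only to within about an order of magnitude. The function name `approx_evalue` is illustrative.

```python
QUERY_LEN = 261            # from "(261 letters)"
DB_LETTERS = 14_482_855    # from "14,482,855 total letters"


def approx_evalue(bit_score, m=QUERY_LEN, n=DB_LETTERS):
    """Approximate E-value from a bit score using raw, uncorrected lengths.

    Assumption: no effective-length correction is applied, so the result
    is an upper-side estimate rather than the value BLAST would print.
    """
    return m * n * 2.0 ** (-bit_score)


# (subject, bit score, E-value as reported above)
for subject, bits, reported in [
    ("AT4G32960.1", 350, 6e-97),
    ("AT4G32970.1", 293, 8e-80),
]:
    print(f"{subject}: approx {approx_evalue(bits):.1e}, reported {reported:.0e}")
```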