Miyakogusa Predicted Gene
- Lj5g3v1853090.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1853090.1 Non Chatacterized Hit- tr|A4RSS8|A4RSS8_OSTLU
Putative uncharacterized protein Dal3 OS=Ostreococcus ,33.33,5e-19,no
description,Ureidoglycolate hydrolase domain; seg,NULL; RmlC-like
cupins,RmlC-like cupin domain,CUFF.56048.1
(233 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G35820.1 | Symbols: | ureidoglycolate hydrolases | chr2:1505... 249 9e-67
AT2G35810.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 243 7e-65
AT2G35830.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 241 3e-64
AT2G35830.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 229 9e-61
>AT2G35820.1 | Symbols: | ureidoglycolate hydrolases |
chr2:15050933-15052004 FORWARD LENGTH=192
Length = 192
Score = 249 bits (637), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 115/183 (62%), Positives = 142/183 (77%), Gaps = 2/183 (1%)
Query: 50 EEPKLLKLKPIEATPATFKDYGQVIQASPDGHGFGPHDAQLDLTRGIPRFYIMHLENRPL 109
+ P +KL PIEATP F DYGQVI+AS DG GFGP+DAQLDL+RGIPRFYIM + + P
Sbjct: 3 KSPVEVKLIPIEATPENFADYGQVIEASRDGAGFGPNDAQLDLSRGIPRFYIMRIRDTPF 62
Query: 110 KFSNITHHASVTQCLGSIGGHAWYLGVAKPSIVDSSEIKDDTDRKVVQSRSGHLYVPPAI 169
FS +THHASVTQCLGSIGGH WYLGVAKP++++ + D+ ++SRSGHLY PPA+
Sbjct: 63 DFSVLTHHASVTQCLGSIGGHVWYLGVAKPTLIEDGDDGKMVDK--LKSRSGHLYAPPAV 120
Query: 170 EDIHVFKVTGSKFLKLNRGTWHAGPIFKADSMDFYNLELSNTNEIDHTTHNFKKDNGVIF 229
E+I VF+V+G KF+KLN GTWH GP+F MDFYNLELSNTN +D TT++F K+ GV
Sbjct: 121 EEIRVFRVSGPKFIKLNHGTWHVGPLFSDSYMDFYNLELSNTNAVDRTTYDFIKNKGVTI 180
Query: 230 SID 232
+D
Sbjct: 181 RVD 183
>AT2G35810.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G35830.2); Has 153 Blast hits to 153 proteins
in 52 species: Archae - 0; Bacteria - 62; Metazoa - 0;
Fungi - 0; Plants - 82; Viruses - 0; Other Eukaryotes -
9 (source: NCBI BLink). | chr2:15049152-15050067 FORWARD
LENGTH=199
Length = 199
Score = 243 bits (621), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 115/186 (61%), Positives = 145/186 (77%), Gaps = 7/186 (3%)
Query: 50 EEPKLLKLKPIEATPATFKDYGQVIQASPDGHGFGPHDAQLDLTRGIPRFYIMHLENRPL 109
+ P + L PIEATP TF +YGQVI+AS DG G+GP+DAQLDL++GIPR YI+ L+ PL
Sbjct: 17 KSPVEVNLIPIEATPETFAEYGQVIEASRDGAGYGPNDAQLDLSKGIPRLYILRLKETPL 76
Query: 110 KFSNITHHASVTQCLGSIGGHAWYLGVAKPSIVDSSEIKDDTDRKV--VQSRSGHLYVPP 167
F ITHHA VTQCLGSIGG WY+GVAKPS+++ DD R+V V+++SGHLY+PP
Sbjct: 77 GFFKITHHAKVTQCLGSIGGDIWYMGVAKPSLIED----DDDGRRVDTVKAKSGHLYIPP 132
Query: 168 AIEDIHVFKVTGSKFLKLNRGTWHAGPIFKADS-MDFYNLELSNTNEIDHTTHNFKKDNG 226
+E+I VF+ +G KF+KL+RGTWHAGP+F S MDFYNLELSNTN +DHT+H+F K+NG
Sbjct: 133 EVEEIRVFRFSGPKFVKLHRGTWHAGPLFSGSSIMDFYNLELSNTNVVDHTSHDFTKNNG 192
Query: 227 VIFSID 232
V F D
Sbjct: 193 VSFRFD 198
>AT2G35830.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT2G35810.1). |
chr2:15052302-15053160 FORWARD LENGTH=189
Length = 189
Score = 241 bits (615), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 118/186 (63%), Positives = 142/186 (76%), Gaps = 6/186 (3%)
Query: 50 EEPKLLKLKPIEATPATFKDYGQVIQASPDGHGFGPHDAQLDLTRGIPRFYIMHLENRPL 109
+ P + L PIEATP F +YGQVI+AS DG GFGPHDAQLDL+RG PR YI+ L+ PL
Sbjct: 3 KSPVEVNLIPIEATPENFAEYGQVIEASRDGAGFGPHDAQLDLSRGTPRLYILRLKETPL 62
Query: 110 KFSNITHHASVTQCLGSIGGHAWYLGVAKPSIVDSSEIKDDTDRKV--VQSRSGHLYVPP 167
F ITHHA VTQCLGSIGG WY+GVAKPS++ E DD R V V+S+SGHLY+PP
Sbjct: 63 GFFKITHHAKVTQCLGSIGGDVWYMGVAKPSLI---EDDDDDGRSVDTVKSKSGHLYIPP 119
Query: 168 AIEDIHVFKVTGSKFLKLNRGTWHAGPIFKADS-MDFYNLELSNTNEIDHTTHNFKKDNG 226
+E+I VF+ +G KF+KL+RGTWHAGP+F S MDFYNLELSNTN +DHT+H+F K+NG
Sbjct: 120 EVEEIRVFRFSGPKFVKLHRGTWHAGPLFSGSSFMDFYNLELSNTNVVDHTSHDFTKNNG 179
Query: 227 VIFSID 232
V F D
Sbjct: 180 VSFGFD 185
>AT2G35830.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G35810.1); Has 155 Blast hits to 155 proteins
in 54 species: Archae - 0; Bacteria - 66; Metazoa - 0;
Fungi - 0; Plants - 82; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr2:15052302-15053160 FORWARD
LENGTH=184
Length = 184
Score = 229 bits (585), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 116/186 (62%), Positives = 139/186 (74%), Gaps = 11/186 (5%)
Query: 50 EEPKLLKLKPIEATPATFKDYGQVIQASPDGHGFGPHDAQLDLTRGIPRFYIMHLENRPL 109
+ P + L PIEATP F +YGQVI+AS DG GFGPHDAQLDL+RG PR L+ PL
Sbjct: 3 KSPVEVNLIPIEATPENFAEYGQVIEASRDGAGFGPHDAQLDLSRGTPR-----LKETPL 57
Query: 110 KFSNITHHASVTQCLGSIGGHAWYLGVAKPSIVDSSEIKDDTDRKV--VQSRSGHLYVPP 167
F ITHHA VTQCLGSIGG WY+GVAKPS++ E DD R V V+S+SGHLY+PP
Sbjct: 58 GFFKITHHAKVTQCLGSIGGDVWYMGVAKPSLI---EDDDDDGRSVDTVKSKSGHLYIPP 114
Query: 168 AIEDIHVFKVTGSKFLKLNRGTWHAGPIFKADS-MDFYNLELSNTNEIDHTTHNFKKDNG 226
+E+I VF+ +G KF+KL+RGTWHAGP+F S MDFYNLELSNTN +DHT+H+F K+NG
Sbjct: 115 EVEEIRVFRFSGPKFVKLHRGTWHAGPLFSGSSFMDFYNLELSNTNVVDHTSHDFTKNNG 174
Query: 227 VIFSID 232
V F D
Sbjct: 175 VSFGFD 180