Miyakogusa Predicted Gene
- Lj3g3v0839390.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0839390.1 Non Chatacterized Hit- tr|B9T0Y0|B9T0Y0_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,65.62,4e-18,seg,NULL,CUFF.41576.1
(359 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT3G08490.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 194 9e-50
AT3G24600.1 | Symbols: | Late embryogenesis abundant protein, g... 72 6e-13
AT5G42860.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 66 4e-11
AT1G45688.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 65 1e-10
AT4G35170.1 | Symbols: | Late embryogenesis abundant (LEA) hydr... 54 2e-07
AT1G45688.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 53 4e-07
AT2G41990.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Late embry... 49 5e-06
>AT3G08490.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: Late embryogenesis abundant protein, group 2
(TAIR:AT3G24600.1); Has 161 Blast hits to 158 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 161; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr3:2574105-2575125 REVERSE
LENGTH=271
Length = 271
Score = 194 bits (492), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 98/206 (47%), Positives = 126/206 (61%), Gaps = 2/206 (0%)
Query: 155 KRYFSYSYADSCGWICLQMSWRLMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEG 214
KR ++S WI LQ+ WR + S G+ALLVFYIAT+PP PN S I + +F L EG
Sbjct: 67 KRLVPLGTSNSSWWIVLQVGWRFLFSLGVALLVFYIATQPPHPNISFRIGRFNQFMLEEG 126
Query: 215 VDRTGVTTKILTCNCSMNLIIENKSRFFGLHIRPPLMDMKFSVLPFASSNGPKLYAES-G 273
VD GV+TK LT NCS LII+NKS FGLHI PP + F L FA + GPKLY S
Sbjct: 127 VDSHGVSTKFLTFNCSTKLIIDNKSNVFGLHIHPPSIKFFFGPLNFAKAQGPKLYGLSHE 186
Query: 274 LTIFTLQLGVKNKPMYGAGRSMQDMLDSGKGLPIAIRVILSSSFEVVPNLIKPRFHHRVE 333
T F L + N+ MYGAG M DML S GLP+ +R + S + VV N+I P++HH+VE
Sbjct: 187 STTFQLYIATTNRAMYGAGTEMNDMLLSRAGLPLILRTSIISDYRVVWNIINPKYHHKVE 246
Query: 334 CIVVLKKDYDRKHRTQAFNSTCKVTS 359
C+++L D +R C++ S
Sbjct: 247 CLLLL-ADKERHSHVTMIREKCRLVS 271
>AT3G24600.1 | Symbols: | Late embryogenesis abundant protein,
group 2 | chr3:8972195-8974867 REVERSE LENGTH=506
Length = 506
Score = 72.0 bits (175), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 73/148 (49%), Gaps = 2/148 (1%)
Query: 188 FYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTCNCSMNLIIENKSRFFGLHIR 247
+ A+ P P S++ I F GEG+DRTGV TKIL+ N S+ + I++ + +FG+H+
Sbjct: 335 LWGASHPFSPIVSVKSVDIHSFYYGEGIDRTGVATKILSFNSSVKVTIDSPAPYFGIHVS 394
Query: 248 PPLMDMKFSVLPFASSNGPKLYA-ESGLTIFTLQLGVKNKPMYGAGRSMQDMLDSGKGLP 306
+ FS L A+ Y I ++L P+YGAG + GK +P
Sbjct: 395 SSTFKLTFSALTLATGQLKSYYQPRKSKHISIVKLTGAEVPLYGAGPHLAASDKKGK-VP 453
Query: 307 IAIRVILSSSFEVVPNLIKPRFHHRVEC 334
+ + + S ++ L+K + + V C
Sbjct: 454 VKLEFEIRSRGNLLGKLVKSKHENHVSC 481
Score = 54.3 bits (129), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 68/141 (48%), Gaps = 10/141 (7%)
Query: 176 RLMVSFGIALLVFYI-------ATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTCN 228
RL++ L +F++ A++ PP ++ + F GEG D TGV TKI+
Sbjct: 111 RLILGVVATLSIFFLLCSVLFGASQSSPPIVYIKGVNVRSFYYGEGSDNTGVPTKIMNVK 170
Query: 229 CSMNLIIENKSRFFGLHIRPPLMDMKFSVLPFASSNGPKLYAESGLTIFTLQLGV--KNK 286
CS+ + N S FG+H+ + + +S ++ K Y + + T ++ +
Sbjct: 171 CSVVITTHNPSTLFGIHVSSTAVSLIYSRQFTLANARLKSYHQPKQSNHTSRINLIGSKV 230
Query: 287 PMYGAGRSMQDMLDSGKGLPI 307
P+YGAG + +SG G+P+
Sbjct: 231 PLYGAGAELVASDNSG-GVPV 250
>AT5G42860.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast
hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr5:17183339-17184857 REVERSE LENGTH=320
Length = 320
Score = 65.9 bits (159), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 94/217 (43%), Gaps = 28/217 (12%)
Query: 166 CGWICLQMSWRLMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKIL 225
C + + + L+ F L+ Y A KP P S++ + K+ G D G+ T ++
Sbjct: 108 CYVLAFIVGFSLL--FAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMI 165
Query: 226 TCNCSMNLIIENKSRFFGLHIRPPLMDMKFSVLPFASSNGPKLYA--ESGLTIFTLQLGV 283
T N ++ ++ N FFG+H+ +D+ FS + S + K Y +S T+ LG
Sbjct: 166 TMNATLRMLYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGD 225
Query: 284 KNKPMYGAGRSM---------------------QDMLDSGKGLPIAIRVILSSSFEVVPN 322
K P+YG+G ++ + +P+ + + S V+
Sbjct: 226 K-IPLYGSGSTLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAYVLGK 284
Query: 323 LIKPRFHHRVECIVVLKKDYDRKHRTQAFNSTCKVTS 359
L++P+F+ R+ C++ + KH + C VTS
Sbjct: 285 LVQPKFYKRIVCLINFEHKKLSKH--IPITNNCTVTS 319
>AT1G45688.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast
hits to 242 proteins in 39 species: Archae - 0; Bacteria
- 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses -
17; Other Eukaryotes - 8 (source: NCBI BLink). |
chr1:17191502-17192870 FORWARD LENGTH=342
Length = 342
Score = 64.7 bits (156), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/207 (24%), Positives = 90/207 (43%), Gaps = 27/207 (13%)
Query: 177 LMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTCNCSMNLIIE 236
+ FG L+ Y A KP P +++ K+ G D GV T ++T N ++ ++
Sbjct: 138 FFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDAGGVGTDMITMNATLRMLYR 197
Query: 237 NKSRFFGLHIRPPLMDMKFSVLPFASSNGPKLYA--ESGLTIFTLQLGVKNKPMYGAGRS 294
N FFG+H+ +D+ FS + S + K Y +S T+ +G K P+YG+G +
Sbjct: 198 NTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSERTVLVHVIGEKI-PLYGSGST 256
Query: 295 ----------------------MQDMLDSGKGLPIAIRVILSSSFEVVPNLIKPRFHHRV 332
+ D +P+ + ++ S V+ L++P+F+ ++
Sbjct: 257 LLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVRSRAYVLGKLVQPKFYKKI 316
Query: 333 ECIVVLKKDYDRKHRTQAFNSTCKVTS 359
EC + + KH N C VT+
Sbjct: 317 ECDINFEHKNLNKHIVITKN--CTVTT 341
>AT4G35170.1 | Symbols: | Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family |
chr4:16736839-16738186 FORWARD LENGTH=299
Length = 299
Score = 53.5 bits (127), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/201 (22%), Positives = 87/201 (43%), Gaps = 11/201 (5%)
Query: 144 DYYYGNGKGGWKRYFSYSYADSCGWICLQMSWRLMVSFGIALLVFYIATKPPPPNFSLEI 203
DY +G +R + Y+ CL + L+++F + L+ + +K P +L+
Sbjct: 92 DYDEMDGPDEKRRRITRFYS------CLLFT--LVLAFTLFCLILWGVSKSFAPIATLKE 143
Query: 204 SKIPEFKLGEGVDRTGVTTKILTCNCSMNLIIENKSRFFGLHIRPPLMDMKFSVLPFASS 263
+ + G D++GV T +LT N ++ ++ N + FF +H+ + + +S L AS
Sbjct: 144 MVLENLNVQSGNDQSGVLTDMLTLNSTVRILYRNPATFFTVHVTSAPLQLSYSQLILASG 203
Query: 264 N-GPKLYAESGLTIFTLQLGVKNKPMYGAGRSM--QDMLDSGKGLPIAIRVILSSSFEVV 320
G I ++ P+YG ++ Q LP+ + L + V+
Sbjct: 204 QMGEFSQRRKSERIIETKVFGDQIPLYGGVPALFGQRAEPDQVVLPLNLTFTLRARAYVL 263
Query: 321 PNLIKPRFHHRVECIVVLKKD 341
L+K FH ++C + D
Sbjct: 264 GRLVKTTFHSNIKCSITFYGD 284
>AT1G45688.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: plasma membrane;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G42860.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr1:17191502-17192464 FORWARD LENGTH=248
Length = 248
Score = 52.8 bits (125), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 43/88 (48%)
Query: 177 LMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTCNCSMNLIIE 236
+ FG L+ Y A KP P +++ K+ G D GV T ++T N ++ ++
Sbjct: 138 FFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDAGGVGTDMITMNATLRMLYR 197
Query: 237 NKSRFFGLHIRPPLMDMKFSVLPFASSN 264
N FFG+H+ +D+ FS + S +
Sbjct: 198 NTGTFFGVHVTSTPIDLSFSQIKIGSGS 225
>AT2G41990.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Late
embryogenesis abundant protein, group 2
(InterPro:IPR004864); BEST Arabidopsis thaliana protein
match is: Late embryogenesis abundant (LEA)
hydroxyproline-rich glycoprotein family
(TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes -
0 (source: NCBI BLink). | chr2:17527396-17528527 FORWARD
LENGTH=297
Length = 297
Score = 49.3 bits (116), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/177 (23%), Positives = 79/177 (44%), Gaps = 11/177 (6%)
Query: 168 WICLQMSWRLMVSFGIALLVFYIATKPPPPNFSLEISKIPEFKLGEGVDRTGVTTKILTC 227
W+ L + + F + L+ + A+K PP +++ + + L G D +GV T +L+
Sbjct: 117 WLLLSV----IFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDMLSL 172
Query: 228 NCSMNLIIENKSRFFGLHI--RPPLMDMKFSVLPFASSNGPKLYAESGLTIFTLQLGVKN 285
N ++ + N S FF +H+ P L+ +L N + + T+ G
Sbjct: 173 NSTVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVVQG-HQ 231
Query: 286 KPMYGAGRSMQDMLDSGKGLPIAIRVILSSSFEVVPNLIKPRFHHRVECIVVLKKDY 342
P+YG D L LP+ + ++L S ++ L+ +F+ R+ C L ++
Sbjct: 232 IPLYGGVSFHLDTLS----LPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDANH 284