Miyakogusa Predicted Gene
- Lj0g3v0048639.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0048639.1 Non Characterized Hit- tr|D8S8A7|D8S8A7_SELML
Putative uncharacterized protein (Fragment)
OS=Selagin,35.2,7e-17,seg,NULL,
NODE_36379_length_2067_cov_94.238029.path1.1
(549 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
AT3G55760.3 | Symbols: | unknown protein; EXPRESSED IN: 16 plan... 613 e-176
AT3G55760.2 | Symbols: | unknown protein; LOCATED IN: chloropla... 613 e-176
AT3G55760.1 | Symbols: | unknown protein; LOCATED IN: chloropla... 613 e-176
AT1G42430.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 201 1e-51
AT1G42430.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 200 2e-51
>AT3G55760.3 | Symbols: | unknown protein; EXPRESSED IN: 16 plant
structures; EXPRESSED DURING: 10 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT1G42430.2). | chr3:20700648-20702886 FORWARD
LENGTH=578
Length = 578
Score = 613 bits (1581), Expect = e-176, Method: Compositional matrix adjust.
Identities = 313/519 (60%), Positives = 379/519 (73%), Gaps = 22/519 (4%)
Query: 44 YLDMWKKAVDRDRKTSQFNKIAASAPVENVDHSH--DLEKKTDEFHKLLHVSTQERDRIQ 101
YLDMWK AVDR++K F KIA + + + DLEKK+DEF K+L VS +ERDRIQ
Sbjct: 65 YLDMWKNAVDREKKEKAFEKIAENVVAVDGEKEKGGDLEKKSDEFQKILEVSVEERDRIQ 124
Query: 102 RMQVIDRXXXXXXXXXXXL-------------NESNTSATASTQYQHESGSGTQSGSVLV 148
RMQV+DR L NE NT + T+ + G S +V V
Sbjct: 125 RMQVVDRAAAAISAARAILASNNSGDGKEGFPNEDNTVTSEVTETPKNAKLGMWSRTVYV 184
Query: 149 PESSEPPRNAIPGPDFWSWTPPPLDGDVPSDGSSGLKLDTKSSVHSTLPNPVAERERSPQ 208
P S E PGPDFWSWTPP G S S L+ K + TLPNPV E+++S
Sbjct: 185 PRS-ETSGTETPGPDFWSWTPP--QGSEIS--SVDLQAVEKPAEFPTLPNPVLEKDKSAD 239
Query: 209 SLSIPFESLLSQSKDSPHTLPPFQSSLEL-EAAASNLESPSLEEEQSRGALSSGHAADAV 267
SLSIP+ES+LS + S T+PPF+S +E+ + A + S +L E +SS +A +
Sbjct: 240 SLSIPYESMLSSERHS-FTIPPFESLIEVRKEAETKPSSETLSTEHDLDLISSANAEEVA 298
Query: 268 RALSEADKSSPIGVSPDGLRWWKESGIEQRPDGVICRWTMTRGVSADKAVEWQEKFWEAS 327
R L D+SS GVS DGL+WWK++G+E+RPDGV+CRWTM RGV+AD VEWQ+K+WEAS
Sbjct: 299 RVLDSLDESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEAS 358
Query: 328 DEVGYKELGSEKSGRDASGNVWHEFWRESMHEENGLMHMEKTADKWGSNGQGNEWQEKWW 387
D+ G+KELGSEKSGRDA+GNVW EFWRESM +ENG++HMEKTADKWG +GQG+EWQEKWW
Sbjct: 359 DDFGFKELGSEKSGRDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWW 418
Query: 388 ERYNASGQAEKWAHKWCSIDPNTPLEAGHAHVWHERWGETYDGYGGSTKYTDKWAERSQD 447
E Y+A+G++EKWAHKWCSID NTPL+AGHAHVWHERWGE YDG GGSTKYTDKWAER
Sbjct: 419 EHYDATGKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVG 478
Query: 448 GGWEKWGDKWDENFDLNGHGIKQGETWWEGKHGERWNRTWGEQHNGSGWVHKYGKSSSGE 507
GW+KWGDKWDENF+ + G+KQGETWWEGKHG+RWNR+WGE HNGSGWVHKYGKSSSGE
Sbjct: 479 DGWDKWGDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGE 538
Query: 508 HWDTHEGQDTWYERFPHFGFFHCYENSVQLREVPKPSEI 546
HWDTH Q+TWYE+FPHFGFFHC++NSVQLR V KPS++
Sbjct: 539 HWDTHVPQETWYEKFPHFGFFHCFDNSVQLRAVKKPSDM 577
>AT3G55760.2 | Symbols: | unknown protein; LOCATED IN: chloroplast;
EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT1G42430.2); Has 176 Blast
hits to 125 proteins in 40 species: Archae - 0; Bacteria
- 3; Metazoa - 19; Fungi - 9; Plants - 81; Viruses - 0;
Other Eukaryotes - 64 (source: NCBI BLink). |
chr3:20700648-20702886 FORWARD LENGTH=578
Length = 578
Score = 613 bits (1581), Expect = e-176, Method: Compositional matrix adjust.
Identities = 313/519 (60%), Positives = 379/519 (73%), Gaps = 22/519 (4%)
Query: 44 YLDMWKKAVDRDRKTSQFNKIAASAPVENVDHSH--DLEKKTDEFHKLLHVSTQERDRIQ 101
YLDMWK AVDR++K F KIA + + + DLEKK+DEF K+L VS +ERDRIQ
Sbjct: 65 YLDMWKNAVDREKKEKAFEKIAENVVAVDGEKEKGGDLEKKSDEFQKILEVSVEERDRIQ 124
Query: 102 RMQVIDRXXXXXXXXXXXL-------------NESNTSATASTQYQHESGSGTQSGSVLV 148
RMQV+DR L NE NT + T+ + G S +V V
Sbjct: 125 RMQVVDRAAAAISAARAILASNNSGDGKEGFPNEDNTVTSEVTETPKNAKLGMWSRTVYV 184
Query: 149 PESSEPPRNAIPGPDFWSWTPPPLDGDVPSDGSSGLKLDTKSSVHSTLPNPVAERERSPQ 208
P S E PGPDFWSWTPP G S S L+ K + TLPNPV E+++S
Sbjct: 185 PRS-ETSGTETPGPDFWSWTPP--QGSEIS--SVDLQAVEKPAEFPTLPNPVLEKDKSAD 239
Query: 209 SLSIPFESLLSQSKDSPHTLPPFQSSLEL-EAAASNLESPSLEEEQSRGALSSGHAADAV 267
SLSIP+ES+LS + S T+PPF+S +E+ + A + S +L E +SS +A +
Sbjct: 240 SLSIPYESMLSSERHS-FTIPPFESLIEVRKEAETKPSSETLSTEHDLDLISSANAEEVA 298
Query: 268 RALSEADKSSPIGVSPDGLRWWKESGIEQRPDGVICRWTMTRGVSADKAVEWQEKFWEAS 327
R L D+SS GVS DGL+WWK++G+E+RPDGV+CRWTM RGV+AD VEWQ+K+WEAS
Sbjct: 299 RVLDSLDESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEAS 358
Query: 328 DEVGYKELGSEKSGRDASGNVWHEFWRESMHEENGLMHMEKTADKWGSNGQGNEWQEKWW 387
D+ G+KELGSEKSGRDA+GNVW EFWRESM +ENG++HMEKTADKWG +GQG+EWQEKWW
Sbjct: 359 DDFGFKELGSEKSGRDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWW 418
Query: 388 ERYNASGQAEKWAHKWCSIDPNTPLEAGHAHVWHERWGETYDGYGGSTKYTDKWAERSQD 447
E Y+A+G++EKWAHKWCSID NTPL+AGHAHVWHERWGE YDG GGSTKYTDKWAER
Sbjct: 419 EHYDATGKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVG 478
Query: 448 GGWEKWGDKWDENFDLNGHGIKQGETWWEGKHGERWNRTWGEQHNGSGWVHKYGKSSSGE 507
GW+KWGDKWDENF+ + G+KQGETWWEGKHG+RWNR+WGE HNGSGWVHKYGKSSSGE
Sbjct: 479 DGWDKWGDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGE 538
Query: 508 HWDTHEGQDTWYERFPHFGFFHCYENSVQLREVPKPSEI 546
HWDTH Q+TWYE+FPHFGFFHC++NSVQLR V KPS++
Sbjct: 539 HWDTHVPQETWYEKFPHFGFFHCFDNSVQLRAVKKPSDM 577
>AT3G55760.1 | Symbols: | unknown protein; LOCATED IN: chloroplast
stroma, chloroplast; EXPRESSED IN: 16 plant structures;
EXPRESSED DURING: 10 growth stages; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G42430.2); Has 176 Blast hits to 125 proteins
in 40 species: Archae - 0; Bacteria - 3; Metazoa - 19;
Fungi - 9; Plants - 81; Viruses - 0; Other Eukaryotes -
64 (source: NCBI BLink). | chr3:20700648-20702886
FORWARD LENGTH=578
Length = 578
Score = 613 bits (1581), Expect = e-176, Method: Compositional matrix adjust.
Identities = 313/519 (60%), Positives = 379/519 (73%), Gaps = 22/519 (4%)
Query: 44 YLDMWKKAVDRDRKTSQFNKIAASAPVENVDHSH--DLEKKTDEFHKLLHVSTQERDRIQ 101
YLDMWK AVDR++K F KIA + + + DLEKK+DEF K+L VS +ERDRIQ
Sbjct: 65 YLDMWKNAVDREKKEKAFEKIAENVVAVDGEKEKGGDLEKKSDEFQKILEVSVEERDRIQ 124
Query: 102 RMQVIDRXXXXXXXXXXXL-------------NESNTSATASTQYQHESGSGTQSGSVLV 148
RMQV+DR L NE NT + T+ + G S +V V
Sbjct: 125 RMQVVDRAAAAISAARAILASNNSGDGKEGFPNEDNTVTSEVTETPKNAKLGMWSRTVYV 184
Query: 149 PESSEPPRNAIPGPDFWSWTPPPLDGDVPSDGSSGLKLDTKSSVHSTLPNPVAERERSPQ 208
P S E PGPDFWSWTPP G S S L+ K + TLPNPV E+++S
Sbjct: 185 PRS-ETSGTETPGPDFWSWTPP--QGSEIS--SVDLQAVEKPAEFPTLPNPVLEKDKSAD 239
Query: 209 SLSIPFESLLSQSKDSPHTLPPFQSSLEL-EAAASNLESPSLEEEQSRGALSSGHAADAV 267
SLSIP+ES+LS + S T+PPF+S +E+ + A + S +L E +SS +A +
Sbjct: 240 SLSIPYESMLSSERHS-FTIPPFESLIEVRKEAETKPSSETLSTEHDLDLISSANAEEVA 298
Query: 268 RALSEADKSSPIGVSPDGLRWWKESGIEQRPDGVICRWTMTRGVSADKAVEWQEKFWEAS 327
R L D+SS GVS DGL+WWK++G+E+RPDGV+CRWTM RGV+AD VEWQ+K+WEAS
Sbjct: 299 RVLDSLDESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEAS 358
Query: 328 DEVGYKELGSEKSGRDASGNVWHEFWRESMHEENGLMHMEKTADKWGSNGQGNEWQEKWW 387
D+ G+KELGSEKSGRDA+GNVW EFWRESM +ENG++HMEKTADKWG +GQG+EWQEKWW
Sbjct: 359 DDFGFKELGSEKSGRDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWW 418
Query: 388 ERYNASGQAEKWAHKWCSIDPNTPLEAGHAHVWHERWGETYDGYGGSTKYTDKWAERSQD 447
E Y+A+G++EKWAHKWCSID NTPL+AGHAHVWHERWGE YDG GGSTKYTDKWAER
Sbjct: 419 EHYDATGKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVG 478
Query: 448 GGWEKWGDKWDENFDLNGHGIKQGETWWEGKHGERWNRTWGEQHNGSGWVHKYGKSSSGE 507
GW+KWGDKWDENF+ + G+KQGETWWEGKHG+RWNR+WGE HNGSGWVHKYGKSSSGE
Sbjct: 479 DGWDKWGDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGE 538
Query: 508 HWDTHEGQDTWYERFPHFGFFHCYENSVQLREVPKPSEI 546
HWDTH Q+TWYE+FPHFGFFHC++NSVQLR V KPS++
Sbjct: 539 HWDTHVPQETWYEKFPHFGFFHCFDNSVQLRAVKKPSDM 577
>AT1G42430.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G55760.3); Has 186 Blast hits to 143 proteins
in 47 species: Archae - 0; Bacteria - 23; Metazoa - 14;
Fungi - 6; Plants - 87; Viruses - 0; Other Eukaryotes -
56 (source: NCBI BLink). | chr1:15891512-15894322
FORWARD LENGTH=426
Length = 426
Score = 201 bits (510), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/264 (44%), Positives = 164/264 (62%), Gaps = 16/264 (6%)
Query: 280 GVSPDGLRWWKESGIEQRPDGVICRWTMTRGVSADKAVEWQEKFWEASDEVGYKELGSEK 339
G + DG W++ESG + +G CRW+ G S D + EW E +WE SD GYKELG EK
Sbjct: 145 GTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSDWTGYKELGVEK 204
Query: 340 SGRDASGNVWHEFWRESMHEE--NGLMHMEKTADKWGSNGQGNE-WQEKWWERYNASGQA 396
SG+++ G+ W E W+E +H++ + L +E++A K +G N W EKWWE+Y+A G
Sbjct: 205 SGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 264
Query: 397 EKWAHKWCSIDPNTPLEAGHAHVWHERWGETYDGYGGSTKYTDKWAERSQDGGWEKWGDK 456
EK AHK+ ++ + W E+WGE YDG G K+TDKWAE KWGDK
Sbjct: 265 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 312
Query: 457 WDENFDLNGHGIKQGETWWEGKHGERWNRTWGEQHNGSGWVHKYGKSSSGEHWDTHEGQD 516
W+E F +G G +QGETW + +RW+RTWGE+H G+G VHKYGKS++GE WD ++
Sbjct: 313 WEEKF-FSGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 371
Query: 517 TWYERFPHFGFFHCYENSVQLREV 540
T+YE PH+G+ +S QL +
Sbjct: 372 TYYEAEPHYGWADVVGDSTQLLSI 395
>AT1G42430.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT3G55760.3). |
chr1:15891512-15894322 FORWARD LENGTH=409
Length = 409
Score = 200 bits (509), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 185/317 (58%), Gaps = 24/317 (7%)
Query: 233 SSLELEAAASNLESPSLEEEQSRGA---LSSGHAAD---AVRALSEADKSSPIGVSPDGL 286
+ ++ ++A SP L QSR +G A + + L+E + G + DG
Sbjct: 77 AGIKTSSSAVPFASPKLTGPQSRDTPPKRDTGIANEKDWGIDLLNE--NVNEAGTNEDGS 134
Query: 287 RWWKESGIEQRPDGVICRWTMTRGVSADKAVEWQEKFWEASDEVGYKELGSEKSGRDASG 346
W++ESG + +G CRW+ G S D + EW E +WE SD GYKELG EKSG+++ G
Sbjct: 135 SWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSDWTGYKELGVEKSGKNSEG 194
Query: 347 NVWHEFWRESMHEE--NGLMHMEKTADKWGSNGQGNE-WQEKWWERYNASGQAEKWAHKW 403
+ W E W+E +H++ + L +E++A K +G N W EKWWE+Y+A G EK AHK+
Sbjct: 195 DSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWTEKGAHKY 254
Query: 404 CSIDPNTPLEAGHAHVWHERWGETYDGYGGSTKYTDKWAERSQDGGWEKWGDKWDENFDL 463
++ + W E+WGE YDG G K+TDKWAE KWGDKW+E F
Sbjct: 255 GRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDKWEEKF-F 301
Query: 464 NGHGIKQGETWWEGKHGERWNRTWGEQHNGSGWVHKYGKSSSGEHWDTHEGQDTWYERFP 523
+G G +QGETW + +RW+RTWGE+H G+G VHKYGKS++GE WD ++T+YE P
Sbjct: 302 SGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEETYYEAEP 361
Query: 524 HFGFFHCYENSVQLREV 540
H+G+ +S QL +
Sbjct: 362 HYGWADVVGDSTQLLSI 378
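
The per-hit statistics lines in this report can be read programmatically. The following is a minimal sketch, not part of the BLAST output: the names `STAT_RE`, `parse_expect`, `parse_stats` and `truncated_percent` are hypothetical helpers introduced here for illustration. It assumes the legacy layout shown above, in which an E-value of 1e-176 is printed as `e-176`, and it assumes the percentages in the `Identities` lines are truncated rather than rounded, which is consistent with every value in this report (for example 118/264 is reported as 44%).

```python
import re

# Statistics lines copied from the report above (one per distinct score).
STAT_LINES = [
    "Score = 613 bits (1581), Expect = e-176, Method: Compositional matrix adjust.",
    "Score = 201 bits (510), Expect = 1e-51, Method: Compositional matrix adjust.",
    "Score = 200 bits (509), Expect = 2e-51, Method: Compositional matrix adjust.",
]

# Captures the bit score, the raw score in parentheses, and the E-value text.
STAT_RE = re.compile(
    r"Score\s*=\s*([\d.]+) bits \((\d+)\),\s*Expect\s*=\s*([\de.+-]+)"
)


def parse_expect(text):
    """Convert an E-value string to float; 'e-176' is treated as '1e-176'."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_stats(line):
    """Return (bit_score, raw_score, e_value) for a 'Score = ...' line, or None."""
    match = STAT_RE.search(line)
    if match is None:
        return None
    return float(match.group(1)), int(match.group(2)), parse_expect(match.group(3))


def truncated_percent(fraction):
    """Turn a count pair such as '313/519' into a whole percentage, rounding down."""
    numerator, denominator = fraction.split("/")
    return 100 * int(numerator) // int(denominator)


if __name__ == "__main__":
    for line in STAT_LINES:
        print(parse_stats(line))
    print(truncated_percent("313/519"), truncated_percent("118/264"))
```

Prefixing a bare exponent with `1` keeps the value parseable by `float`; other report layouts (for example BLAST+ tabular output) would need a different pattern.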