Miyakogusa Predicted Gene
- Lj1g3v1900590.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v1900590.1 Non Characterized Hit- tr|B9ETM2|B9ETM2_ORYSJ
Putative uncharacterized protein OS=Oryza sativa
subsp,42.98,6e-18,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.28097.1
(618 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
AT3G11760.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   301   8e-82
AT5G04860.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   291   1e-78
AT2G10560.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   182   9e-46
AT2G25460.1 | Symbols: | CONTAINS InterPro DOMAIN/s: C2 calcium...   164   2e-40
>AT3G11760.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 14 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G04860.1); Has 84 Blast hits to 73 proteins in
13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 84; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:3718529-3721123 FORWARD
LENGTH=702
Length = 702
Score = 301 bits (771), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 154/293 (52%), Positives = 197/293 (67%), Gaps = 35/293 (11%)
Query: 330 STSSWGVKEFSSRDGKSKLKTNVFFASFDQRSEKASGESACTVLVALIAHWLHSNKG-MP 388
+ SW KE SRDG KL+T+VF AS DQRSE+A+GESACT LVA+IA W N MP
Sbjct: 413 AIGSWEEKEVISRDGHMKLQTSVFLASIDQRSERAAGESACTALVAVIADWFQKNGNLMP 472
Query: 389 TRAEFDSLITQGSSEWRRLCNSNYYSKLFPDKHFDLETIIDANLRPLVVLPQKSYTGFFS 448
+++FDSLI +GS EWR LC + Y + FPDKHFDL+T++ A +RPL V+P KS+ GFF
Sbjct: 473 IKSQFDSLIREGSLEWRNLCENETYMQKFPDKHFDLDTVLQAKIRPLTVIPGKSFVGFFH 532
Query: 449 PE------KFHCLKGAMSFDEIWDEI--------------NTNVVGVYIVSWNDHFFVLK 488
P+ +F L+GAMSFD IW EI + + VYIVSWNDHFFVLK
Sbjct: 533 PDGMINEGRFEFLQGAMSFDSIWAEIISLEESSANGDSYDDDSPPHVYIVSWNDHFFVLK 592
Query: 489 VEADAYYIIDSLGERLFEGCNQAFMLKFDESCVMYG--EVEEEGPLKASVGGARSAESEK 546
VE +AYYIID+LGERL+EGC+QA++LKFD V++ EE G +ESE
Sbjct: 593 VEKEAYYIIDTLGERLYEGCDQAYVLKFDHKTVIHKILHTEEAG-----------SESEP 641
Query: 547 SLEVVCRGKECCKEFIKRFLAAIPVRQLEKEETKKWAVSSTYLHRQLQIDFHY 599
E++ RGKE CKE+IK FLAAIP+R+L+ E+ KK S+ +H +LQI+FHY
Sbjct: 642 ESEILSRGKESCKEYIKNFLAAIPIRELQ-EDIKKGLASTAPVHHRLQIEFHY 693
>AT5G04860.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11760.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1411760-1414459
REVERSE LENGTH=782
Length = 782
Score = 291 bits (744), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 154/300 (51%), Positives = 195/300 (65%), Gaps = 36/300 (12%)
Query: 333 SWGVKEFSSRDGKSKLKTNVFFASFDQRSEKASGESACTVLVALIAHWLHSNKGM-PTRA 391
SW KE SRDG KL VF AS DQRSE+A+GESACT LVA++AHWL SN+ + PTR+
Sbjct: 430 SWETKEIISRDGLMKLTARVFLASIDQRSERAAGESACTALVAVMAHWLGSNRDIIPTRS 489
Query: 392 EFDSLITQGSSEWRRLCNSNYYSKLFPDKHFDLETIIDANLRPLVVLPQKSYTGFFSPE- 450
EFDSLI +GSSEWR +C + Y + FPDKHFDLET++ A +RP+ V+P++S+ GFF PE
Sbjct: 490 EFDSLIREGSSEWRNMCENEEYRERFPDKHFDLETVLQAKVRPICVVPERSFIGFFHPEK 549
Query: 451 ----------KFHCLKGAMSFDEIWDEI-------NTNVVGVYIVSWNDHFFVLKVEADA 493
LKG MSFD IW+E+ + + +YIVSWNDHFFVL V DA
Sbjct: 550 SEEEEGKEDASLDFLKGVMSFDSIWEELMKQEPEESASEPVIYIVSWNDHFFVLLVNHDA 609
Query: 494 YYIIDSLGERLFEGCNQAFMLKFDESCVMYGEVEEEGPLKASVGGARSAESEKSLE---- 549
YYIID+LGERL+EGCNQA++LKFD+ + KA +G + KS +
Sbjct: 610 YYIIDTLGERLYEGCNQAYVLKFDKDAEIKRLPSVIKDNKADMGNQKQGGKNKSEQPERS 669
Query: 550 ----------VVCRGKECCKEFIKRFLAAIPVRQLEKEETKKWAVSSTYLHRQLQIDFHY 599
VVCRGKE C+E+IK FLAAIP++Q+ K + KK VSS LH +LQI+ HY
Sbjct: 670 KESEEQEEEEVVCRGKESCREYIKSFLAAIPIQQV-KADMKKGLVSS--LHHRLQIELHY 726
>AT2G10560.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G04860.1); Has 70 Blast hits to 70 proteins in
12 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 70; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr2:4109862-4110698 REVERSE
LENGTH=278
Length = 278
Score = 182 bits (461), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 103/225 (45%), Positives = 139/225 (61%), Gaps = 35/225 (15%)
Query: 407 LCNSNYYSKLFPDKHFDLETIIDANLRPLVVLPQKSYTGFFSPEK-----------FHCL 455
+C + Y + FPDKHFDLET++ A +RP+ V+P++++ GFF EK L
Sbjct: 1 MCENEEYRERFPDKHFDLETVLQAKVRPICVVPERTFIGFFHREKSKEEEEKEDVSLDFL 60
Query: 456 KGAMSFDEIWDEI-------NTNVVGVYIVSWNDHFFVLKVEADAYYIIDSLGERLFEGC 508
KG MSFD IW+EI + + +YIVSWNDH+FVL V DAYYIID+LGER++EGC
Sbjct: 61 KGVMSFDSIWEEIMKQEPEESASEHVIYIVSWNDHYFVLLVNHDAYYIIDTLGERVYEGC 120
Query: 509 NQAFMLKFDESCVMYGEVEEEGPLKASVGGA------------RSAESEKSLE--VVCRG 554
NQA++LKFD+ + KA +G RS ESE+ E VVCRG
Sbjct: 121 NQAYVLKFDQDAEIKRLPSVIKDNKADMGSQKQGGKNKYEQPERSKESEEQGEEVVVCRG 180
Query: 555 KECCKEFIKRFLAAIPVRQLEKEETKKWAVSSTYLHRQLQIDFHY 599
KE C+E+IK FLAAIP++Q+ K + K+ VSS H +LQI+ +Y
Sbjct: 181 KESCREYIKSFLAAIPIQQV-KADMKEGLVSS--FHHRLQIELYY 222
>AT2G25460.1 | Symbols: | CONTAINS InterPro DOMAIN/s: C2
calcium-dependent membrane targeting
(InterPro:IPR000008); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT5G04860.1); Has 108
Blast hits to 69 proteins in 11 species: Archae - 0;
Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 108;
Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
| chr2:10833175-10835374 REVERSE LENGTH=423
Length = 423
Score = 164 bits (415), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 129/377 (34%), Positives = 190/377 (50%), Gaps = 58/377 (15%)
Query: 101 YNMEFHVLHGKG-DAVGEITKAKMTVVGKASVTVDMAELLAGERMKTGSSSHNHHHLQRK 159
+N+ F+V +G+ DA K K +++GKAS+ D++EL +S ++RK
Sbjct: 93 WNLSFNVFYGENMDA-----KNKKSLIGKASL--DLSEL----------ASKQESTVERK 135
Query: 160 LPIHLKVNGLIIEATLLVCMRLMKL----DDLQGPFENSIHSEKKSGIIGKLKSLTCLGR 215
LPI K + L EATL+V + ++ DD + S+ S I K KS + G
Sbjct: 136 LPIRSKGSVLSKEATLVVNVTFSEVRTEPDDFMQLGQISVDS-----AIPKKKSTSRRGG 190
Query: 216 KNNVKVDXXXXXXXXXXXXXXVFYXXXXXXXXXXXXXXXXXXGFHNSESRLGFRSSGTKT 275
++ V GF
Sbjct: 191 GSHFDSFSSPATASSSGGRSPVVETGLFSSPDENQSEPGQKAGF---------------- 234
Query: 276 PLDTSQSNWSYMSRNRSFK-GWNTKTDASKQETSTTDPCPLTKSEEHSLQFYLQDSTSSW 334
NW + R SF W + + + E + T P + E+ + + ++ + W
Sbjct: 235 -------NW-WKRRRLSFSMTW--RREPREDEVTKTSTKPSEELEKPATEIPIE--ANKW 282
Query: 335 GVKEFSSRDGKSKLKTNVFFASFDQRSEKASGESACTVLVALIAHWLHSNKGM--PTRAE 392
+K+ SRDGKSKLK+ V+ AS DQRSE+A+GE+AC + ++AHW H+N + P+
Sbjct: 283 VMKDLVSRDGKSKLKSEVYLASIDQRSEQAAGEAACAAVAVVVAHWFHANPKLINPSGTA 342
Query: 393 FDSLITQGSSEWRRLCNSNYYSKLFPDKHFDLETIIDANLRPLVVLPQKSYTGFFSPEKF 452
FDSLITQGSS W+ LC+ Y +LFP++HFDLETI+ ANLRP+ V KS+TG FSPE+F
Sbjct: 343 FDSLITQGSSLWQSLCDKESYLRLFPNRHFDLETIVSANLRPVRVCTDKSFTGLFSPERF 402
Query: 453 HCLKGAMSFDEIWDEIN 469
L G MSFD+IWDE++
Sbjct: 403 ASLDGLMSFDQIWDELS 419