Miyakogusa Predicted Gene
- Lj6g3v2218370.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v2218370.1 Non Chatacterized Hit- tr|E5GCK1|E5GCK1_CUCME
Putative uncharacterized protein OS=Cucumis melo subsp,44.95,3e-18,
,CUFF.60849.1
(395 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT5G64170.2 | Symbols: | dentin sialophosphoprotein-related | c... 130 2e-30
AT5G64170.1 | Symbols: | dentin sialophosphoprotein-related | c... 130 2e-30
AT3G54500.2 | Symbols: | BEST Arabidopsis thaliana protein matc... 77 2e-14
AT3G54500.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 77 2e-14
AT3G54500.3 | Symbols: | FUNCTIONS IN: molecular_function unkno... 77 3e-14
AT3G54500.4 | Symbols: | FUNCTIONS IN: molecular_function unkno... 77 3e-14
AT3G12320.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 50 3e-06
AT5G06980.4 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 50 4e-06
>AT5G64170.2 | Symbols: | dentin sialophosphoprotein-related |
chr5:25672904-25675934 REVERSE LENGTH=616
Length = 616
Score = 130 bits (326), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/112 (59%), Positives = 80/112 (71%), Gaps = 5/112 (4%)
Query: 253 EMDSSTVQESSPMSSSLDRTSLEATSFSHLQQALDQLDIRTKLCIRDSLYRLAKSAEQRH 312
E+++S +Q SS +SS +D SLEATSF LQQ ++QLD+RTKLCIRDSLYRLAKSAEQRH
Sbjct: 458 ELETSNMQGSSCVSSVVDDISLEATSFRQLQQVIEQLDVRTKLCIRDSLYRLAKSAEQRH 517
Query: 313 DNVNTNGYIGDDSEACQAMSVQDPSRCTGFMDIEIDTNPIDRSIAHLLFHRP 364
N + A + + + GFMDIE DTNPIDRSIAHLLFHRP
Sbjct: 518 HGGNR-----PEKGAGSHLVTGEADKYAGFMDIETDTNPIDRSIAHLLFHRP 564
>AT5G64170.1 | Symbols: | dentin sialophosphoprotein-related |
chr5:25672904-25675445 REVERSE LENGTH=566
Length = 566
Score = 130 bits (326), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/112 (59%), Positives = 80/112 (71%), Gaps = 5/112 (4%)
Query: 253 EMDSSTVQESSPMSSSLDRTSLEATSFSHLQQALDQLDIRTKLCIRDSLYRLAKSAEQRH 312
E+++S +Q SS +SS +D SLEATSF LQQ ++QLD+RTKLCIRDSLYRLAKSAEQRH
Sbjct: 408 ELETSNMQGSSCVSSVVDDISLEATSFRQLQQVIEQLDVRTKLCIRDSLYRLAKSAEQRH 467
Query: 313 DNVNTNGYIGDDSEACQAMSVQDPSRCTGFMDIEIDTNPIDRSIAHLLFHRP 364
N + A + + + GFMDIE DTNPIDRSIAHLLFHRP
Sbjct: 468 HGGNR-----PEKGAGSHLVTGEADKYAGFMDIETDTNPIDRSIAHLLFHRP 514
>AT3G54500.2 | Symbols: | BEST Arabidopsis thaliana protein match
is: dentin sialophosphoprotein-related
(TAIR:AT5G64170.1); Has 162 Blast hits to 160 proteins
in 41 species: Archae - 0; Bacteria - 0; Metazoa - 23;
Fungi - 11; Plants - 118; Viruses - 0; Other Eukaryotes
- 10 (source: NCBI BLink). | chr3:20175792-20178275
REVERSE LENGTH=569
Length = 569
Score = 77.0 bits (188), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 68/127 (53%), Gaps = 5/127 (3%)
Query: 241 SEVQGVSLGFSPEMDSSTVQESSPMSSSLDRTSLEATSFSHLQQALDQLDIRTKLCIRDS 300
+ +QG++ S + SS Q+ S ++ S E LQ + +LD+ T+ CIRDS
Sbjct: 363 TNLQGLTAMPSFDPSSSLEQDDSGKFAAAVDNSAEFAVLYRLQDVVAKLDMGTRTCIRDS 422
Query: 301 LYRLAKSAEQRH---DNVNTNGYIGDDSEACQAMSVQDPSRCTGFMDIEIDTNPIDRSIA 357
L+RLA SA QRH D ++N DD E + R G D E TNP DR++A
Sbjct: 423 LFRLAGSAAQRHYTSDTSHSNKTSQDDQEVIP--REESRYRYAGMPDTEAVTNPTDRTVA 480
Query: 358 HLLFHRP 364
HLLFHRP
Sbjct: 481 HLLFHRP 487
>AT3G54500.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: dentin sialophosphoprotein-related
(TAIR:AT5G64170.1); Has 183 Blast hits to 175 proteins
in 44 species: Archae - 0; Bacteria - 4; Metazoa - 38;
Fungi - 11; Plants - 120; Viruses - 0; Other Eukaryotes
- 10 (source: NCBI BLink). | chr3:20175792-20178737
REVERSE LENGTH=648
Length = 648
Score = 77.0 bits (188), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 68/127 (53%), Gaps = 5/127 (3%)
Query: 241 SEVQGVSLGFSPEMDSSTVQESSPMSSSLDRTSLEATSFSHLQQALDQLDIRTKLCIRDS 300
+ +QG++ S + SS Q+ S ++ S E LQ + +LD+ T+ CIRDS
Sbjct: 442 TNLQGLTAMPSFDPSSSLEQDDSGKFAAAVDNSAEFAVLYRLQDVVAKLDMGTRTCIRDS 501
Query: 301 LYRLAKSAEQRH---DNVNTNGYIGDDSEACQAMSVQDPSRCTGFMDIEIDTNPIDRSIA 357
L+RLA SA QRH D ++N DD E + R G D E TNP DR++A
Sbjct: 502 LFRLAGSAAQRHYTSDTSHSNKTSQDDQEVIPREESR--YRYAGMPDTEAVTNPTDRTVA 559
Query: 358 HLLFHRP 364
HLLFHRP
Sbjct: 560 HLLFHRP 566
>AT3G54500.3 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: dentin
sialophosphoprotein-related (TAIR:AT5G64170.1). |
chr3:20175792-20178482 REVERSE LENGTH=669
Length = 669
Score = 76.6 bits (187), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 68/127 (53%), Gaps = 5/127 (3%)
Query: 241 SEVQGVSLGFSPEMDSSTVQESSPMSSSLDRTSLEATSFSHLQQALDQLDIRTKLCIRDS 300
+ +QG++ S + SS Q+ S ++ S E LQ + +LD+ T+ CIRDS
Sbjct: 463 TNLQGLTAMPSFDPSSSLEQDDSGKFAAAVDNSAEFAVLYRLQDVVAKLDMGTRTCIRDS 522
Query: 301 LYRLAKSAEQRH---DNVNTNGYIGDDSEACQAMSVQDPSRCTGFMDIEIDTNPIDRSIA 357
L+RLA SA QRH D ++N DD E + R G D E TNP DR++A
Sbjct: 523 LFRLAGSAAQRHYTSDTSHSNKTSQDDQEVIP--REESRYRYAGMPDTEAVTNPTDRTVA 580
Query: 358 HLLFHRP 364
HLLFHRP
Sbjct: 581 HLLFHRP 587
>AT3G54500.4 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: dentin
sialophosphoprotein-related (TAIR:AT5G64170.2). |
chr3:20175792-20178737 REVERSE LENGTH=663
Length = 663
Score = 76.6 bits (187), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 68/127 (53%), Gaps = 5/127 (3%)
Query: 241 SEVQGVSLGFSPEMDSSTVQESSPMSSSLDRTSLEATSFSHLQQALDQLDIRTKLCIRDS 300
+ +QG++ S + SS Q+ S ++ S E LQ + +LD+ T+ CIRDS
Sbjct: 457 TNLQGLTAMPSFDPSSSLEQDDSGKFAAAVDNSAEFAVLYRLQDVVAKLDMGTRTCIRDS 516
Query: 301 LYRLAKSAEQRH---DNVNTNGYIGDDSEACQAMSVQDPSRCTGFMDIEIDTNPIDRSIA 357
L+RLA SA QRH D ++N DD E + R G D E TNP DR++A
Sbjct: 517 LFRLAGSAAQRHYTSDTSHSNKTSQDDQEVIP--REESRYRYAGMPDTEAVTNPTDRTVA 574
Query: 358 HLLFHRP 364
HLLFHRP
Sbjct: 575 HLLFHRP 581
>AT3G12320.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G06980.4); Has 102 Blast hits to 102 proteins
in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 0; Plants - 98; Viruses - 0; Other Eukaryotes -
4 (source: NCBI BLink). | chr3:3924034-3925262 FORWARD
LENGTH=269
Length = 269
Score = 50.1 bits (118), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 54/98 (55%), Gaps = 10/98 (10%)
Query: 270 DRTSLEATSFSHLQQALDQLDIRTKLCIRDSLYRLAKSAEQRHDNVNTNGYIGDDSEACQ 329
D S E LQ+A +L T+ C RD+ YRLA+S++ + D+V+ N SE
Sbjct: 163 DELSSEEFVLQDLQRASQKLTDETRKCFRDTFYRLARSSQDKSDSVSPN------SEELL 216
Query: 330 AMSVQ----DPSRCTGFMDIEIDTNPIDRSIAHLLFHR 363
+ + D +R + +IE +TN IDR++A+L F++
Sbjct: 217 MQTSRYDYGDGNRFSREEEIESETNSIDRAVANLTFNK 254
>AT5G06980.4 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN: response to
karrikin; LOCATED IN: cellular_component unknown;
EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 11
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT3G12320.1). |
chr5:2167799-2169110 FORWARD LENGTH=279
Length = 279
Score = 49.7 bits (117), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 62/115 (53%), Gaps = 13/115 (11%)
Query: 255 DSSTVQESSPMS---SSLDRTSLEATSFSHLQQALDQLDIRTKLCIRDSLYRLAKSAEQR 311
D ++E +PM+ + + S E LQ+A + L T+ C RD+ YRLAK+++Q+
Sbjct: 153 DMPLMEEDAPMNLCEENKEEASAEEVVLQDLQRATEMLTDDTRKCFRDTFYRLAKNSQQK 212
Query: 312 HDNVNTNGYIGDDSEACQAMSVQDPSRCTGFMDI---EIDTNPIDRSIAHLLFHR 363
D+ N++ ++ D + S D S F+ + + N IDR++A+L F++
Sbjct: 213 SDS-NSDEFLEDRT------SSNDSSPSMTFLSVGKLNLKPNSIDRAVANLTFNK 260