Miyakogusa Predicted Gene
- Lj0g3v0351759.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0351759.1 Non Chatacterized Hit- tr|B9SAW9|B9SAW9_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,36,0.000000000000003,Myb_DNA-bind_3,Myb/SANT-like
domain,CUFF.24219.1
(174 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 246 8e-66
AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis thal... 245 1e-65
AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 234 2e-62
AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 234 2e-62
AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 84 6e-17
AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 81 4e-16
AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 81 4e-16
AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 73 1e-13
AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 53 1e-07
AT5G27260.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 53 1e-07
AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 52 3e-07
AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 52 3e-07
>AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=307
Length = 307
Score = 246 bits (627), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 110/174 (63%), Positives = 145/174 (83%), Gaps = 2/174 (1%)
Query: 1 MEHYDLQRQRRDLKNKGRNVVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAV 60
M+ Y + +R+++K+KGRNV+WS+ MDKCLIE LAVQAK+GNK+DKCFN+ AY+AAC+AV
Sbjct: 1 MDQYGIPVERKEMKHKGRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAV 60
Query: 61 NTCFNLKLNNQKVINRLKTIKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAH 120
NT FNL L +QK INRLKTIKKRY+VM+D+LS+DGFWWN +TKMI+C+SDELW++YIA +
Sbjct: 61 NTRFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVN 120
Query: 121 PDARGFGGKQIEMYDELKIVCGNYQAPSRWAKMNNGS--HQMDMKNCEDESTSF 172
PDA+ F GKQIEMY+EL+ VCG+YQ P ++ K+ S H D+K E++S SF
Sbjct: 121 PDAKAFRGKQIEMYEELRTVCGDYQTPGKYNKVKKESSHHLNDVKQFEEDSVSF 174
>AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:1120622-1121674 REVERSE LENGTH=322
Length = 322
Score = 245 bits (625), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 110/174 (63%), Positives = 145/174 (83%), Gaps = 2/174 (1%)
Query: 1 MEHYDLQRQRRDLKNKGRNVVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAV 60
M+ Y + +R+++K+KGRNV+WS+ MDKCLIE LAVQAK+GNK+DKCFN+ AY+AAC+AV
Sbjct: 16 MDQYGIPVERKEMKHKGRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAV 75
Query: 61 NTCFNLKLNNQKVINRLKTIKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAH 120
NT FNL L +QK INRLKTIKKRY+VM+D+LS+DGFWWN +TKMI+C+SDELW++YIA +
Sbjct: 76 NTRFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVN 135
Query: 121 PDARGFGGKQIEMYDELKIVCGNYQAPSRWAKMNNGS--HQMDMKNCEDESTSF 172
PDA+ F GKQIEMY+EL+ VCG+YQ P ++ K+ S H D+K E++S SF
Sbjct: 136 PDAKAFRGKQIEMYEELRTVCGDYQTPGKYNKVKKESSHHLNDVKQFEEDSVSF 189
>AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 18 plant
structures; EXPRESSED DURING: 7 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 234 bits (597), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 101/147 (68%), Positives = 130/147 (88%)
Query: 1 MEHYDLQRQRRDLKNKGRNVVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAV 60
M+ Y + +R+++K+KGRNV+WS+ MDKCLIE LAVQAK+GNK+DKCFN+ AY+AAC+AV
Sbjct: 1 MDQYGIPVERKEMKHKGRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAV 60
Query: 61 NTCFNLKLNNQKVINRLKTIKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAH 120
NT FNL L +QK INRLKTIKKRY+VM+D+LS+DGFWWN +TKMI+C+SDELW++YIA +
Sbjct: 61 NTRFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVN 120
Query: 121 PDARGFGGKQIEMYDELKIVCGNYQAP 147
PDA+ F GKQIEMY+EL+ VCG+YQ P
Sbjct: 121 PDAKAFRGKQIEMYEELRTVCGDYQTP 147
>AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 234 bits (597), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 101/147 (68%), Positives = 130/147 (88%)
Query: 1 MEHYDLQRQRRDLKNKGRNVVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAV 60
M+ Y + +R+++K+KGRNV+WS+ MDKCLIE LAVQAK+GNK+DKCFN+ AY+AAC+AV
Sbjct: 1 MDQYGIPVERKEMKHKGRNVIWSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAV 60
Query: 61 NTCFNLKLNNQKVINRLKTIKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAH 120
NT FNL L +QK INRLKTIKKRY+VM+D+LS+DGFWWN +TKMI+C+SDELW++YIA +
Sbjct: 61 NTRFNLNLTSQKAINRLKTIKKRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAVN 120
Query: 121 PDARGFGGKQIEMYDELKIVCGNYQAP 147
PDA+ F GKQIEMY+EL+ VCG+YQ P
Sbjct: 121 PDAKAFRGKQIEMYEELRTVCGDYQTP 147
>AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 21 plant
structures; EXPRESSED DURING: 12 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10617263-10620034 FORWARD LENGTH=774
Length = 774
Score = 83.6 bits (205), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 71/134 (52%), Gaps = 1/134 (0%)
Query: 20 VVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKT 79
+ W+ MD LI+ L Q +GN++ + F +A++ A N F + N + NR K
Sbjct: 325 IFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKH 384
Query: 80 IKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMYDELKI 139
+++ Y +K +L Q+GF W+ M+ D D++W YI AHP+AR + K I Y L
Sbjct: 385 LRRLYNDIKFLLEQNGFSWDARRDMVIAD-DDIWNTYIQAHPEARSYRVKTIPSYPNLCF 443
Query: 140 VCGNYQAPSRWAKM 153
+ G + R+ ++
Sbjct: 444 IFGKETSDGRYTRL 457
Score = 73.2 bits (178), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 1/118 (0%)
Query: 20 VVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKT 79
+ W+ MD CLI+ + Q GNKI + F E A++ + N F L+ + + NR
Sbjct: 509 IEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYIL 568
Query: 80 IKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMYDEL 137
+ K + ++L+ DGF W+ + I + DE W+ YI HPDA + GK ++ Y L
Sbjct: 569 LMKERDDINNILNLDGFTWDVEKQTIVAE-DEYWEAYIKEHPDATIYKGKTLDSYGNL 625
Score = 68.9 bits (167), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 61/119 (51%), Gaps = 1/119 (0%)
Query: 22 WSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKTIK 81
W++ MD+ +E + Q GNK F++ A+ + N F+ + + + +R +
Sbjct: 172 WTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLL 231
Query: 82 KRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMYDELKIV 140
K YK M+ +L +DGF W+ MI D D +W YI HP AR + K + Y++L +
Sbjct: 232 KYYKDMEAILKEDGFSWDETRLMISAD-DAVWDSYIKDHPLARTYRMKSLPSYNDLDTI 289
Score = 63.9 bits (154), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 72/150 (48%), Gaps = 4/150 (2%)
Query: 15 NKGRNVVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVI 74
N W+ M++ I+ + GN+ FN+ A++ N+ F + + +
Sbjct: 8 NDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLK 67
Query: 75 NRLKTIKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMY 134
+R + K+Y +K +L GF W+ + + D D LW Y+ AHP+AR + K + +
Sbjct: 68 SRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGD-DSLWSLYLKAHPEARVYKTKPVLNF 126
Query: 135 DELKIVCGNYQAPSRWAKMNNGSHQMDMKN 164
+L ++ G A R++ SH +++++
Sbjct: 127 SDLCLIYGYTVADGRYSM---SSHDLEIED 153
>AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 80.9 bits (198), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 69/124 (55%), Gaps = 1/124 (0%)
Query: 20 VVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKT 79
W MD+ I+ + QA+ GN+I+ F + A++ N F + + NR K+
Sbjct: 184 TTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKS 243
Query: 80 IKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMYDELKI 139
+++++ +K +L DGF W+ +M+ D++ +W+ YI AH DAR F + I Y +L +
Sbjct: 244 LRRQFNAIKSILRSDGFAWDNERQMVTADNN-VWQDYIKAHRDARQFMTRPIPYYKDLCV 302
Query: 140 VCGN 143
+CG+
Sbjct: 303 LCGD 306
Score = 70.1 bits (170), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 66/127 (51%), Gaps = 2/127 (1%)
Query: 15 NKGRNVVWSIAMDKCLIEELAVQAKSGNKI-DKCFNENAYSAACLAVNTCFNLKLNNQKV 73
N+ VW+ MD+ IE + Q + GN+ D F++ A+ + F +
Sbjct: 7 NERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVL 66
Query: 74 INRLKTIKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEM 133
NR KT++ +K + ++L +DGF W+ +M+ D + +W +Y+ HPD+R F K I
Sbjct: 67 KNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVAD-NCVWDEYLKIHPDSRSFRIKSIPC 125
Query: 134 YDELKIV 140
Y +L +V
Sbjct: 126 YKDLCLV 132
>AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
- 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 80.9 bits (198), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 69/124 (55%), Gaps = 1/124 (0%)
Query: 20 VVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKT 79
W MD+ I+ + QA+ GN+I+ F + A++ N F + + NR K+
Sbjct: 184 TTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKS 243
Query: 80 IKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMYDELKI 139
+++++ +K +L DGF W+ +M+ D++ +W+ YI AH DAR F + I Y +L +
Sbjct: 244 LRRQFNAIKSILRSDGFAWDNERQMVTADNN-VWQDYIKAHRDARQFMTRPIPYYKDLCV 302
Query: 140 VCGN 143
+CG+
Sbjct: 303 LCGD 306
Score = 70.1 bits (170), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 66/127 (51%), Gaps = 2/127 (1%)
Query: 15 NKGRNVVWSIAMDKCLIEELAVQAKSGNKI-DKCFNENAYSAACLAVNTCFNLKLNNQKV 73
N+ VW+ MD+ IE + Q + GN+ D F++ A+ + F +
Sbjct: 7 NERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGKDVL 66
Query: 74 INRLKTIKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEM 133
NR KT++ +K + ++L +DGF W+ +M+ D + +W +Y+ HPD+R F K I
Sbjct: 67 KNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVAD-NCVWDEYLKIHPDSRSFRIKSIPC 125
Query: 134 YDELKIV 140
Y +L +V
Sbjct: 126 YKDLCLV 132
>AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
- 50 (source: NCBI BLink). | chr2:10617263-10620034
FORWARD LENGTH=797
Length = 797
Score = 73.2 bits (178), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 62/118 (52%), Gaps = 1/118 (0%)
Query: 20 VVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKT 79
+ W+ MD CLI+ + Q GNKI + F E A++ + N F L+ + + NR
Sbjct: 532 IEWTRVMDHCLIDLMLEQVSRGNKIGETFTEQAWADMAESFNAKFGLQTDMFMLENRYIL 591
Query: 80 IKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMYDEL 137
+ K + ++L+ DGF W+ + I + DE W+ YI HPDA + GK ++ Y L
Sbjct: 592 LMKERDDINNILNLDGFTWDVEKQTIVAE-DEYWEAYIKEHPDATIYKGKTLDSYGNL 648
Score = 70.5 bits (171), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 71/157 (45%), Gaps = 24/157 (15%)
Query: 20 VVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKT 79
+ W+ MD LI+ L Q +GN++ + F +A++ A N F + N + NR K
Sbjct: 325 IFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFGSQHNKDVLKNRYKH 384
Query: 80 IKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYI---------------------- 117
+++ Y +K +L Q+GF W+ M+ D D++W YI
Sbjct: 385 LRRLYNDIKFLLEQNGFSWDARRDMVIAD-DDIWNTYIQACHILFLFKISVICLCLQMKH 443
Query: 118 -AAHPDARGFGGKQIEMYDELKIVCGNYQAPSRWAKM 153
AHP+AR + K I Y L + G + R+ ++
Sbjct: 444 VQAHPEARSYRVKTIPSYPNLCFIFGKETSDGRYTRL 480
Score = 68.6 bits (166), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 61/119 (51%), Gaps = 1/119 (0%)
Query: 22 WSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKTIK 81
W++ MD+ +E + Q GNK F++ A+ + N F+ + + + +R +
Sbjct: 172 WTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLL 231
Query: 82 KRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMYDELKIV 140
K YK M+ +L +DGF W+ MI D D +W YI HP AR + K + Y++L +
Sbjct: 232 KYYKDMEAILKEDGFSWDETRLMISAD-DAVWDSYIKDHPLARTYRMKSLPSYNDLDTI 289
Score = 63.9 bits (154), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 72/150 (48%), Gaps = 4/150 (2%)
Query: 15 NKGRNVVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVI 74
N W+ M++ I+ + GN+ FN+ A++ N+ F + + +
Sbjct: 8 NDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLK 67
Query: 75 NRLKTIKKRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMY 134
+R + K+Y +K +L GF W+ + + D D LW Y+ AHP+AR + K + +
Sbjct: 68 SRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGD-DSLWSLYLKAHPEARVYKTKPVLNF 126
Query: 135 DELKIVCGNYQAPSRWAKMNNGSHQMDMKN 164
+L ++ G A R++ SH +++++
Sbjct: 127 SDLCLIYGYTVADGRYSM---SSHDLEIED 153
>AT1G30140.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G27260.1); Has 313 Blast hits to 256 proteins
in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 8; Plants - 295; Viruses - 0; Other Eukaryotes -
10 (source: NCBI BLink). | chr1:10598764-10599527
FORWARD LENGTH=222
Length = 222
Score = 53.1 bits (126), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 52/99 (52%), Gaps = 2/99 (2%)
Query: 54 SAACLAVNTCFNLKLNNQKVINRLKTIKKRYKVMKDMLS-QDGFWWNPNTKMIECDSDEL 112
S A+N N++ ++RLK +K Y+ D+ GF W+P TK DE+
Sbjct: 47 SKLLPALNKRLGCNKNHKNYMSRLKFLKNLYQSYLDLKRFSSGFGWDPETKKFTA-PDEV 105
Query: 113 WKKYIAAHPDARGFGGKQIEMYDELKIVCGNYQAPSRWA 151
W+ Y+ AHP+ + + I+ +++L+I+ G+ A +A
Sbjct: 106 WRDYLKAHPNHKHMQTESIDHFEDLQIIFGDVVATGSFA 144
>AT5G27260.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G29880.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:9603943-9604930
FORWARD LENGTH=303
Length = 303
Score = 52.8 bits (125), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 67/138 (48%), Gaps = 16/138 (11%)
Query: 14 KNKGRNVVWSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQ-- 71
+ KG WS K L++ L V+ I+ + ++ + + L V T F ++N +
Sbjct: 9 RKKGDYNPWSPEETKLLVQ-LLVEG-----INNNWRDSNGTISKLTVETKFMPEINKEFC 62
Query: 72 ------KVINRLKTIKKRYKVMKDMLS-QDGFWWNPNTKMIECDSDELWKKYIAAHPDAR 124
++R+K +K +Y+ D+ GF W+P TK SDE+W Y+ AHP+ +
Sbjct: 63 RSKNYNHYLSRMKYLKIQYQSCLDLQRFSSGFGWDPLTKRFTA-SDEVWSDYLKAHPNNK 121
Query: 125 GFGGKQIEMYDELKIVCG 142
E +DEL+I+ G
Sbjct: 122 QLRYDTFEFFDELQIIFG 139
>AT5G05800.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 24 plant
structures; EXPRESSED DURING: 15 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
- 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
LENGTH=449
Length = 449
Score = 51.6 bits (122), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/135 (22%), Positives = 57/135 (42%), Gaps = 8/135 (5%)
Query: 22 WSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKTIK 81
WS + K ++ L + GN+ D FN+ + +N L ++ N +
Sbjct: 170 WSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHWDCTR 229
Query: 82 KRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMYDELKIVC 141
K +K+ ++ W+P ++ ++E W+ YI +P A F K++ D+L I+
Sbjct: 230 KAWKIWCQLVGASSMKWDPESRSFGA-TEEEWRIYIRENPRAGQFRHKEVPHADQLAIIF 288
Query: 142 G-------NYQAPSR 149
Y PSR
Sbjct: 289 NGVIEPGETYTPPSR 303
>AT5G05800.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
- 339 (source: NCBI BLink). | chr5:1743234-1744751
REVERSE LENGTH=449
Length = 449
Score = 51.6 bits (122), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/135 (22%), Positives = 57/135 (42%), Gaps = 8/135 (5%)
Query: 22 WSIAMDKCLIEELAVQAKSGNKIDKCFNENAYSAACLAVNTCFNLKLNNQKVINRLKTIK 81
WS + K ++ L + GN+ D FN+ + +N L ++ N +
Sbjct: 170 WSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNHWDCTR 229
Query: 82 KRYKVMKDMLSQDGFWWNPNTKMIECDSDELWKKYIAAHPDARGFGGKQIEMYDELKIVC 141
K +K+ ++ W+P ++ ++E W+ YI +P A F K++ D+L I+
Sbjct: 230 KAWKIWCQLVGASSMKWDPESRSFGA-TEEEWRIYIRENPRAGQFRHKEVPHADQLAIIF 288
Query: 142 G-------NYQAPSR 149
Y PSR
Sbjct: 289 NGVIEPGETYTPPSR 303