Miyakogusa Predicted Gene
- Lj5g3v1913900.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1913900.1 CUFF.56178.1
(135 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 70 5e-13
AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis thal... 70 5e-13
AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 70 5e-13
AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 70 5e-13
AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 63 6e-11
AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 63 6e-11
AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 62 8e-11
AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 62 1e-10
>AT4G02550.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 370 Blast hits to 300 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 10; Plants - 354; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=307
Length = 307
Score = 69.7 bits (169), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 54/98 (55%)
Query: 30 WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
W++ MD+ L + L Q GNK D + AY A +++RFNL L + NR+K +
Sbjct: 22 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81
Query: 90 SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
Y ++ DILS+ GF W+ + MI +++ W Y+ V
Sbjct: 82 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAV 119
>AT4G02550.3 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:1120622-1121674 REVERSE LENGTH=322
Length = 322
Score = 69.7 bits (169), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 54/98 (55%)
Query: 30 WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
W++ MD+ L + L Q GNK D + AY A +++RFNL L + NR+K +
Sbjct: 37 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 96
Query: 90 SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
Y ++ DILS+ GF W+ + MI +++ W Y+ V
Sbjct: 97 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAV 134
>AT4G02550.4 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 18 plant
structures; EXPRESSED DURING: 7 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 69.7 bits (169), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 54/98 (55%)
Query: 30 WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
W++ MD+ L + L Q GNK D + AY A +++RFNL L + NR+K +
Sbjct: 22 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81
Query: 90 SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
Y ++ DILS+ GF W+ + MI +++ W Y+ V
Sbjct: 82 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAV 119
>AT4G02550.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 350 Blast hits to 284 proteins
in 18 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 13; Plants - 331; Viruses - 0; Other Eukaryotes
- 6 (source: NCBI BLink). | chr4:1120622-1121629 REVERSE
LENGTH=278
Length = 278
Score = 69.7 bits (169), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 54/98 (55%)
Query: 30 WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
W++ MD+ L + L Q GNK D + AY A +++RFNL L + NR+K +
Sbjct: 22 WSVGMDKCLIEALAVQAKNGNKVDKCFNDKAYTAACVAVNTRFNLNLTSQKAINRLKTIK 81
Query: 90 SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
Y ++ DILS+ GF W+ + MI +++ W Y+ V
Sbjct: 82 KRYRVMRDILSRDGFWWNSSTKMIDCESDELWRRYIAV 119
>AT4G02210.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 63.2 bits (152), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 1/104 (0%)
Query: 24 SRAYFTWNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKN 83
+R TW+ MDR D++ DQ GN+ +G ++ A+ + +++F + +KN
Sbjct: 180 TRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKN 239
Query: 84 RIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
R K R + + IL GF WD + M++ N + W +Y+K
Sbjct: 240 RYKSLRRQFNAIKSILRSDGFAWDNERQMVTADN-NVWQDYIKA 282
Score = 52.0 bits (123), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 59/105 (56%), Gaps = 4/105 (3%)
Query: 25 RAYFTWNLEMDRALADILRDQRSMGNK-SDGAWKGVAYNTAAQILSSRFNLQLIGENV-K 82
R W EMD+ +++ +Q GN+ D + A+ + +++F L G++V K
Sbjct: 9 RLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKF-LYGKDVLK 67
Query: 83 NRIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
NR K R+ + V+++L + GF WD T+ M+ V++ W+EY+K+
Sbjct: 68 NRHKTLRNLFKSVNNLLIEDGFSWDDTRQMV-VADNCVWDEYLKI 111
>AT4G02210.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
- 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
LENGTH=439
Length = 439
Score = 63.2 bits (152), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 1/104 (0%)
Query: 24 SRAYFTWNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKN 83
+R TW+ MDR D++ DQ GN+ +G ++ A+ + +++F + +KN
Sbjct: 180 TRCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKN 239
Query: 84 RIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
R K R + + IL GF WD + M++ N + W +Y+K
Sbjct: 240 RYKSLRRQFNAIKSILRSDGFAWDNERQMVTADN-NVWQDYIKA 282
Score = 52.0 bits (123), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 59/105 (56%), Gaps = 4/105 (3%)
Query: 25 RAYFTWNLEMDRALADILRDQRSMGNK-SDGAWKGVAYNTAAQILSSRFNLQLIGENV-K 82
R W EMD+ +++ +Q GN+ D + A+ + +++F L G++V K
Sbjct: 9 RLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKF-LYGKDVLK 67
Query: 83 NRIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
NR K R+ + V+++L + GF WD T+ M+ V++ W+EY+K+
Sbjct: 68 NRHKTLRNLFKSVNNLLIEDGFSWDDTRQMV-VADNCVWDEYLKI 111
>AT2G24960.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
- 50 (source: NCBI BLink). | chr2:10617263-10620034
FORWARD LENGTH=797
Length = 797
Score = 62.4 bits (150), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 61/116 (52%), Gaps = 3/116 (2%)
Query: 13 TEASNEDKKDDSRAYFTWNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRF 72
T+AS E D +R + W MD L D+L +Q + GN+ + A+N +++F
Sbjct: 312 TKASQEQNSDRTRIF--WTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKF 369
Query: 73 NLQLIGENVKNRIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKVM 128
Q + +KNR K R Y + +L Q+GF WD + M+ ++++D WN Y++
Sbjct: 370 GSQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMV-IADDDIWNTYIQAC 424
Score = 55.5 bits (132), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
Query: 30 WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
W LEMD+ +I+ DQ GNK+ A+ A+ + ++RF+ Q +++R
Sbjct: 172 WTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLL 231
Query: 90 SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVK 126
+Y + IL + GF WD T+ MIS +++ W+ Y+K
Sbjct: 232 KYYKDMEAILKEDGFSWDETRLMIS-ADDAVWDSYIK 267
>AT2G24960.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 21 plant
structures; EXPRESSED DURING: 12 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr2:10617263-10620034 FORWARD LENGTH=774
Length = 774
Score = 62.0 bits (149), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 61/115 (53%), Gaps = 3/115 (2%)
Query: 13 TEASNEDKKDDSRAYFTWNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRF 72
T+AS E D +R + W MD L D+L +Q + GN+ + A+N +++F
Sbjct: 312 TKASQEQNSDRTRIF--WTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKF 369
Query: 73 NLQLIGENVKNRIKLWRSWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVKV 127
Q + +KNR K R Y + +L Q+GF WD + M+ ++++D WN Y++
Sbjct: 370 GSQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMV-IADDDIWNTYIQA 423
Score = 55.5 bits (132), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 1/97 (1%)
Query: 30 WNLEMDRALADILRDQRSMGNKSDGAWKGVAYNTAAQILSSRFNLQLIGENVKNRIKLWR 89
W LEMD+ +I+ DQ GNK+ A+ A+ + ++RF+ Q +++R
Sbjct: 172 WTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLRHRYNKLL 231
Query: 90 SWYGIVSDILSQSGFDWDGTKYMISVSNEDAWNEYVK 126
+Y + IL + GF WD T+ MIS +++ W+ Y+K
Sbjct: 232 KYYKDMEAILKEDGFSWDETRLMIS-ADDAVWDSYIK 267