Miyakogusa Predicted Gene - Lj4g3v2290290.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2290290.1 Non Characterized Hit- tr|I3T5W5|I3T5W5_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2
SV=1,69.86,4e-17,seg,NULL; FAMILY NOT NAMED,NULL,CUFF.50710.1
(294 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score     E
Sequences producing significant alignments:                         (bits)   Value
AT3G06790.2 | Symbols: | plastid developmental protein DAG, put...     264   5e-71
AT3G06790.1 | Symbols: | plastid developmental protein DAG, put...     261   5e-70
AT3G15000.1 | Symbols: | cobalt ion binding | chr3:5050321-5052...     190   1e-48
AT2G35240.1 | Symbols: | plastid developmental protein DAG, put...     162   3e-40
AT2G33430.1 | Symbols: DAL1, DAL | differentiation and greening-...    159   2e-39
AT1G32580.1 | Symbols: | plastid developmental protein DAG, put...     157   1e-38
AT1G11430.1 | Symbols: | plastid developmental protein DAG, put...     156   2e-38
AT4G20020.2 | Symbols: | unknown protein; BEST Arabidopsis thal...     122   3e-28
AT4G20020.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     122   4e-28
AT1G72530.1 | Symbols: | plastid developmental protein DAG, put...     121   4e-28
AT1G72530.2 | Symbols: | plastid developmental protein DAG, put...     115   3e-26
AT5G44780.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     107   1e-23
AT1G53260.1 | Symbols: | LOCATED IN: endomembrane system; BEST ...      92   3e-19
AT1G53260.2 | Symbols: | LOCATED IN: endomembrane system; BEST ...      92   5e-19
AT3G20930.1 | Symbols: | RNA-binding (RRM/RBD/RNP motifs) famil...      59   5e-09
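The summary table above can be read programmatically for downstream filtering. What follows is a minimal Python sketch and not part of the BLAST output. The function name parse_hit_line is hypothetical. The sketch assumes that the bit score and the E-value are always the last two whitespace-separated fields of a hit line and that the sequence identifier is the first field. The handling of E-values printed without a leading digit (such as e-100) is a defensive assumption; no such value occurs in this report.

```python
def parse_hit_line(line):
    """Parse one line of the BLAST summary table.

    Returns a dict with the subject identifier, bit score and E-value,
    or None if the line does not look like a hit line (for example the
    column header).
    """
    # Split off the last two fields; the description may contain spaces.
    parts = line.strip().rsplit(None, 2)
    if len(parts) != 3:
        return None
    rest, bits, evalue = parts
    # Assumption: tolerate E-values written as "e-100" (no leading digit).
    if evalue.startswith("e"):
        evalue = "1" + evalue
    try:
        return {
            "id": rest.split()[0],    # first token, e.g. AT3G06790.2
            "bits": float(bits),      # bit score as shown in the table
            "evalue": float(evalue),  # expectation value
        }
    except ValueError:
        # Non-numeric trailing fields: not a hit line.
        return None


if __name__ == "__main__":
    example = ("AT3G20930.1 | Symbols: | RNA-binding (RRM/RBD/RNP motifs) "
               "famil... 59 5e-09")
    print(parse_hit_line(example))
```

Keeping only hits below a chosen E-value cutoff is then a matter of filtering the parsed dicts on the "evalue" key; the cutoff itself is a choice for the analyst and is not specified by this report.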
>AT3G06790.2 | Symbols: | plastid developmental protein DAG,
putative | chr3:2144564-2145743 REVERSE LENGTH=244
Length = 244
Score = 264 bits (674), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 127/171 (74%), Positives = 141/171 (82%), Gaps = 1/171 (0%)
Query: 51 FSVRTKSSGSGYSPLNDPSPNWSNRPPKETILLDGCDYEHWLIVMEFPENPKPSEQEMVN 110
S R K+SGSGYSPLNDPSPNWSNRPPKETILLDGCDYEHWLIVMEF +PKP+E+EM+N
Sbjct: 57 ISTRPKTSGSGYSPLNDPSPNWSNRPPKETILLDGCDYEHWLIVMEF-TDPKPTEEEMIN 115
Query: 111 AYVKTLTQIVGSEEEAMKKIYSVSTHTYTGFGALISEELSYKVKELPGVLWVLPDSYLDV 170
+YVKTLT ++G EEEA KKIYSV T TYTGFGALISEELS KVK LPGVLWVLPDSYLDV
Sbjct: 116 SYVKTLTSVLGCEEEAKKKIYSVCTSTYTGFGALISEELSCKVKALPGVLWVLPDSYLDV 175
Query: 171 PNKDYGGDLFVDGKVIPRPQYRYAERQXXXXXXXXXXXXXXXXMQVERTDP 221
PNKDYGGDL+V+GKVIPRPQYR+ E++ MQVER +P
Sbjct: 176 PNKDYGGDLYVEGKVIPRPQYRFTEQRHTRPRPRPRYDRRRETMQVERREP 226
>AT3G06790.1 | Symbols: | plastid developmental protein DAG,
putative | chr3:2144564-2145743 REVERSE LENGTH=244
Length = 244
Score = 261 bits (666), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 126/171 (73%), Positives = 141/171 (82%), Gaps = 1/171 (0%)
Query: 51 FSVRTKSSGSGYSPLNDPSPNWSNRPPKETILLDGCDYEHWLIVMEFPENPKPSEQEMVN 110
S R K+SGSGYSPLNDPSPNWSNRPPKETILLDGCDYEHWLIVMEF +PKP+E+EM+N
Sbjct: 57 ISTRPKTSGSGYSPLNDPSPNWSNRPPKETILLDGCDYEHWLIVMEF-TDPKPTEEEMIN 115
Query: 111 AYVKTLTQIVGSEEEAMKKIYSVSTHTYTGFGALISEELSYKVKELPGVLWVLPDSYLDV 170
+YVKTLT ++G +EEA KKIYSV T TYTGFGALISEELS KVK LPGVLWVLPDSYLDV
Sbjct: 116 SYVKTLTSVLGWQEEAKKKIYSVCTSTYTGFGALISEELSCKVKALPGVLWVLPDSYLDV 175
Query: 171 PNKDYGGDLFVDGKVIPRPQYRYAERQXXXXXXXXXXXXXXXXMQVERTDP 221
PNKDYGGDL+V+GKVIPRPQYR+ E++ MQVER +P
Sbjct: 176 PNKDYGGDLYVEGKVIPRPQYRFTEQRHTRPRPRPRYDRRRETMQVERREP 226
>AT3G15000.1 | Symbols: | cobalt ion binding | chr3:5050321-5052121
FORWARD LENGTH=395
Length = 395
Score = 190 bits (482), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 89/136 (65%), Positives = 112/136 (82%), Gaps = 1/136 (0%)
Query: 52 SVRTKSSGSGYSPLNDPSPNWSNRPPKETILLDGCDYEHWLIVMEFPENPKPSEQEMVNA 111
SV+ S+ + S LNDP+PNWSNRPPKETILLDGCD+EHWL+V+E P+ +P+ E++++
Sbjct: 61 SVKGLSTQATSSSLNDPNPNWSNRPPKETILLDGCDFEHWLVVVEPPQG-EPTRDEIIDS 119
Query: 112 YVKTLTQIVGSEEEAMKKIYSVSTHTYTGFGALISEELSYKVKELPGVLWVLPDSYLDVP 171
Y+KTL QIVGSE+EA KIYSVST Y FGAL+SE+LS+K+KEL V WVLPDSYLDV
Sbjct: 120 YIKTLAQIVGSEDEARMKIYSVSTRCYYAFGALVSEDLSHKLKELSNVRWVLPDSYLDVR 179
Query: 172 NKDYGGDLFVDGKVIP 187
NKDYGG+ F+DGK +P
Sbjct: 180 NKDYGGEPFIDGKAVP 195
>AT2G35240.1 | Symbols: | plastid developmental protein DAG,
putative | chr2:14845099-14846262 REVERSE LENGTH=232
Length = 232
Score = 162 bits (409), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 85/152 (55%), Positives = 107/152 (70%), Gaps = 5/152 (3%)
Query: 45 VPDTVKFS-VRTK--SSGSGYSPLNDPSPNWSNRPPKETI-LLDGCDYEHWLIVMEFPEN 100
+P +F+ +RT+ SG YSPL S N+S+RPP E L GCDYEHWLIVME P
Sbjct: 50 IPALTRFTTIRTRMDRSGGSYSPLKSGS-NFSDRPPTEMAPLFPGCDYEHWLIVMEKPGG 108
Query: 101 PKPSEQEMVNAYVKTLTQIVGSEEEAMKKIYSVSTHTYTGFGALISEELSYKVKELPGVL 160
+Q+M++ YV+TL +IVGSEEEA KKIY+VS Y GFG I EE S K++ LPGVL
Sbjct: 109 ENAQKQQMIDCYVQTLAKIVGSEEEARKKIYNVSCERYFGFGCEIDEETSNKLEGLPGVL 168
Query: 161 WVLPDSYLDVPNKDYGGDLFVDGKVIPRPQYR 192
+VLPDSY+D KDYG +LFV+G+V+PRP R
Sbjct: 169 FVLPDSYVDPEFKDYGAELFVNGEVVPRPPER 200
>AT2G33430.1 | Symbols: DAL1, DAL | differentiation and
greening-like 1 | chr2:14162732-14164729 FORWARD
LENGTH=219
Length = 219
Score = 159 bits (402), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 80/144 (55%), Positives = 105/144 (72%), Gaps = 4/144 (2%)
Query: 48 TVKFSVRTKS--SGSGYSPLNDPSPNWSNRPPKETI-LLDGCDYEHWLIVMEFPENPKPS 104
T FS+R + SGS YSPLN S N+S+RPP E L GCDYEHWLIVM+ P +
Sbjct: 42 TRFFSIRCGANRSGSTYSPLNSGS-NFSDRPPTEMAPLFPGCDYEHWLIVMDKPGGEGAT 100
Query: 105 EQEMVNAYVKTLTQIVGSEEEAMKKIYSVSTHTYTGFGALISEELSYKVKELPGVLWVLP 164
+Q+M++ Y++TL ++VGSEEEA K+IY+VS Y GFG I EE S K++ LPGVL+VLP
Sbjct: 101 KQQMIDCYIQTLAKVVGSEEEAKKRIYNVSCERYLGFGCEIDEETSTKLEGLPGVLFVLP 160
Query: 165 DSYLDVPNKDYGGDLFVDGKVIPR 188
DSY+D NKDYG +LFV+G+++ R
Sbjct: 161 DSYVDPENKDYGAELFVNGEIVQR 184
>AT1G32580.1 | Symbols: | plastid developmental protein DAG,
putative | chr1:11784108-11785430 FORWARD LENGTH=229
Length = 229
Score = 157 bits (396), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 77/141 (54%), Positives = 102/141 (72%), Gaps = 4/141 (2%)
Query: 52 SVRTK--SSGSGYSPLNDPSPNWSNRPPKETI-LLDGCDYEHWLIVMEFPENPKPSEQEM 108
++RT+ SG YSPL S N+S+R P E L GCDYEHWLIVM+ P ++Q+M
Sbjct: 55 TIRTRMDRSGGSYSPLKSGS-NFSDRAPTEMAPLFPGCDYEHWLIVMDKPGGENATKQQM 113
Query: 109 VNAYVKTLTQIVGSEEEAMKKIYSVSTHTYTGFGALISEELSYKVKELPGVLWVLPDSYL 168
++ YV+TL +I+GSEEEA KKIY+VS Y GFG I EE S K++ LPGVL++LPDSY+
Sbjct: 114 IDCYVQTLAKIIGSEEEAKKKIYNVSCERYFGFGCEIDEETSNKLEGLPGVLFILPDSYV 173
Query: 169 DVPNKDYGGDLFVDGKVIPRP 189
D NKDYG +LFV+G+++ RP
Sbjct: 174 DQENKDYGAELFVNGEIVQRP 194
>AT1G11430.1 | Symbols: | plastid developmental protein DAG,
putative | chr1:3847273-3848938 FORWARD LENGTH=232
Length = 232
Score = 156 bits (394), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 68/110 (61%), Positives = 86/110 (78%)
Query: 78 KETILLDGCDYEHWLIVMEFPENPKPSEQEMVNAYVKTLTQIVGSEEEAMKKIYSVSTHT 137
+ETI+L GCDY HWLIVMEFP++P PS +M++ Y+ TL ++GS EEA K +Y+ ST T
Sbjct: 77 RETIMLPGCDYNHWLIVMEFPKDPAPSRDQMIDTYLNTLATVLGSMEEAKKNMYAFSTTT 136
Query: 138 YTGFGALISEELSYKVKELPGVLWVLPDSYLDVPNKDYGGDLFVDGKVIP 187
YTGF I EE S K K LPGVLWVLPDSY+DV NKDYGGD +++G++IP
Sbjct: 137 YTGFQCTIDEETSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIP 186
>AT4G20020.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G44780.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:10844360-10846085 REVERSE LENGTH=406
Length = 406
Score = 122 bits (306), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 83/114 (72%), Gaps = 2/114 (1%)
Query: 78 KETILLDGCDYEHWLIVMEFPENPKP-SEQEMVNAYVKTLTQIVG-SEEEAMKKIYSVST 135
++T+L +GCDY HWLI M+F + P S +EMV AY +T Q +G S EEA +++Y+ ST
Sbjct: 77 EDTVLFEGCDYNHWLITMDFSKEETPKSPEEMVAAYEETCAQGLGISVEEAKQRMYACST 136
Query: 136 HTYTGFGALISEELSYKVKELPGVLWVLPDSYLDVPNKDYGGDLFVDGKVIPRP 189
TY GF A+++E+ S K K+LPGV+++LPDSY+D NK+YGGD + +G + RP
Sbjct: 137 TTYQGFQAIMTEQESEKFKDLPGVVFILPDSYIDPQNKEYGGDKYENGVITHRP 190
>AT4G20020.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G44780.1); Has 28928 Blast hits to 16023
proteins in 1033 species: Archae - 4; Bacteria - 4155;
Metazoa - 15463; Fungi - 2938; Plants - 3091; Viruses -
205; Other Eukaryotes - 3072 (source: NCBI BLink). |
chr4:10844433-10846085 REVERSE LENGTH=419
Length = 419
Score = 122 bits (305), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 83/114 (72%), Gaps = 2/114 (1%)
Query: 78 KETILLDGCDYEHWLIVMEFPENPKP-SEQEMVNAYVKTLTQIVG-SEEEAMKKIYSVST 135
++T+L +GCDY HWLI M+F + P S +EMV AY +T Q +G S EEA +++Y+ ST
Sbjct: 77 EDTVLFEGCDYNHWLITMDFSKEETPKSPEEMVAAYEETCAQGLGISVEEAKQRMYACST 136
Query: 136 HTYTGFGALISEELSYKVKELPGVLWVLPDSYLDVPNKDYGGDLFVDGKVIPRP 189
TY GF A+++E+ S K K+LPGV+++LPDSY+D NK+YGGD + +G + RP
Sbjct: 137 TTYQGFQAIMTEQESEKFKDLPGVVFILPDSYIDPQNKEYGGDKYENGVITHRP 190
>AT1G72530.1 | Symbols: | plastid developmental protein DAG,
putative | chr1:27312999-27313937 FORWARD LENGTH=188
Length = 188
Score = 121 bits (304), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 68/163 (41%), Positives = 101/163 (61%), Gaps = 7/163 (4%)
Query: 36 ALAAKQTLPVPDTVKFSVRTKSSGSGYS-PLNDPSPNWSN--RPPKETILLDGCDYEHWL 92
A ++ L + V+F + S SG S +N + +WS R P L++GCDY+HWL
Sbjct: 2 ARIIRRPLNLTAAVRFRLSPLSPFSGNSGSINSETTSWSELIRVPS---LVEGCDYKHWL 58
Query: 93 IVMEFPENPKPSEQEMVNAYVKTLTQIVGSEEEAMKKIYSVSTHTYTGFGALISEELSYK 152
++M+ P N P+ +V ++V+TL +GSEEEA + IYSVST Y FG I E L+YK
Sbjct: 59 VLMK-PPNGYPTRNHIVQSFVETLAMALGSEEEAKRSIYSVSTKYYYAFGCRIHEPLTYK 117
Query: 153 VKELPGVLWVLPDSYLDVPNKDYGGDLFVDGKVIPRPQYRYAE 195
++ LP V WVLPDS++ + YGG+ FVDG+V+P + +A+
Sbjct: 118 IRSLPDVKWVLPDSFIVDGDNRYGGEPFVDGEVVPYDEKYHAD 160
>AT1G72530.2 | Symbols: | plastid developmental protein DAG,
putative | chr1:27312999-27313937 FORWARD LENGTH=192
Length = 192
Score = 115 bits (289), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 68/167 (40%), Positives = 101/167 (60%), Gaps = 11/167 (6%)
Query: 36 ALAAKQTLPVPDTVKFSVRTKSSGSGYS-PLNDPSPNWSN--RPPKETILLDGCDYEHWL 92
A ++ L + V+F + S SG S +N + +WS R P L++GCDY+HWL
Sbjct: 2 ARIIRRPLNLTAAVRFRLSPLSPFSGNSGSINSETTSWSELIRVPS---LVEGCDYKHWL 58
Query: 93 IVMEFPENPKPSEQEMVNAYVKTLTQIVGSEEEAMKKIYSVSTHTYTGFGALISEELSYK 152
++M+ P N P+ +V ++V+TL +GSEEEA + IYSVST Y FG I E L+YK
Sbjct: 59 VLMK-PPNGYPTRNHIVQSFVETLAMALGSEEEAKRSIYSVSTKYYYAFGCRIHEPLTYK 117
Query: 153 VKELPGVLWVLPDSYLDVPNKDYG----GDLFVDGKVIPRPQYRYAE 195
++ LP V WVLPDS++ + YG G+ FVDG+V+P + +A+
Sbjct: 118 IRSLPDVKWVLPDSFIVDGDNRYGVFFAGEPFVDGEVVPYDEKYHAD 164
>AT5G44780.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G20020.2); Has 9661 Blast hits to 6233 proteins
in 635 species: Archae - 4; Bacteria - 1116; Metazoa -
4251; Fungi - 1510; Plants - 1359; Viruses - 43; Other
Eukaryotes - 1378 (source: NCBI BLink). |
chr5:18068100-18070544 FORWARD LENGTH=723
Length = 723
Score = 107 bits (266), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 48/107 (44%), Positives = 73/107 (68%), Gaps = 1/107 (0%)
Query: 84 DGCDYEHWLIVMEFPENPKPSEQEMVNAYVKTLTQIVG-SEEEAMKKIYSVSTHTYTGFG 142
+GCD+ HWLI M FP++ PS +EM++ + +T + + S EEA KKIY++ T +Y GF
Sbjct: 78 EGCDFNHWLITMNFPKDNLPSREEMISIFEQTCAKGLAISLEEAKKKIYAICTTSYQGFQ 137
Query: 143 ALISEELSYKVKELPGVLWVLPDSYLDVPNKDYGGDLFVDGKVIPRP 189
A ++ K ++LPGV +++PDSY+DV NK YGGD + +G + P P
Sbjct: 138 ATMTIGEVEKFRDLPGVQYIIPDSYIDVENKVYGGDKYENGVITPGP 184
>AT1G53260.1 | Symbols: | LOCATED IN: endomembrane system; BEST
Arabidopsis thaliana protein match is: cobalt ion
binding (TAIR:AT3G15000.1); Has 32763 Blast hits to
18534 proteins in 929 species: Archae - 22; Bacteria -
2420; Metazoa - 15140; Fungi - 5401; Plants - 5313;
Viruses - 485; Other Eukaryotes - 3982 (source: NCBI
BLink). | chr1:19859393-19860421 REVERSE LENGTH=271
Length = 271
Score = 92.4 bits (228), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 41/59 (69%), Positives = 48/59 (81%)
Query: 129 KIYSVSTHTYTGFGALISEELSYKVKELPGVLWVLPDSYLDVPNKDYGGDLFVDGKVIP 187
KIYSVS Y FGAL+SE+LS+K+KELP V WVLPDSYLD NKDYGG+ F+DGK +P
Sbjct: 2 KIYSVSHKCYFAFGALVSEDLSHKIKELPKVKWVLPDSYLDGKNKDYGGEPFIDGKAVP 60
>AT1G53260.2 | Symbols: | LOCATED IN: endomembrane system; BEST
Arabidopsis thaliana protein match is: cobalt ion
binding (TAIR:AT3G15000.1); Has 246 Blast hits to 241
proteins in 32 species: Archae - 0; Bacteria - 2;
Metazoa - 7; Fungi - 16; Plants - 212; Viruses - 1;
Other Eukaryotes - 8 (source: NCBI BLink). |
chr1:19859406-19860421 REVERSE LENGTH=230
Length = 230
Score = 91.7 bits (226), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 43/65 (66%), Positives = 51/65 (78%), Gaps = 1/65 (1%)
Query: 129 KIYSVSTHTYTGFGALISEELSYKVKELPGVLWVLPDSYLDVPNKDYGGDLFVDGKVIP- 187
KIYSVS Y FGAL+SE+LS+K+KELP V WVLPDSYLD NKDYGG+ F+DGK +P
Sbjct: 2 KIYSVSHKCYFAFGALVSEDLSHKIKELPKVKWVLPDSYLDGKNKDYGGEPFIDGKAVPY 61
Query: 188 RPQYR 192
P+Y
Sbjct: 62 DPKYH 66
>AT3G20930.1 | Symbols: | RNA-binding (RRM/RBD/RNP motifs) family
protein | chr3:7331739-7333749 FORWARD LENGTH=374
Length = 374
Score = 58.5 bits (140), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 52/87 (59%)
Query: 90 HWLIVMEFPENPKPSEQEMVNAYVKTLTQIVGSEEEAMKKIYSVSTHTYTGFGALISEEL 149
+W+++++ P + S+ MV+ YV+ L +++G+E++A IY S T+ GF I E+
Sbjct: 72 YWMVLLDKPPHWVSSKSAMVDYYVEILAKVLGNEKDAQVSIYDASFDTHFGFCCHIDEDA 131
Query: 150 SYKVKELPGVLWVLPDSYLDVPNKDYG 176
S ++ LPGV+ + P+ K+YG
Sbjct: 132 SRQLASLPGVVSIRPEQDYSSEKKNYG 158
Score = 58.2 bits (139), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 60/108 (55%)
Query: 71 NWSNRPPKETILLDGCDYEHWLIVMEFPENPKPSEQEMVNAYVKTLTQIVGSEEEAMKKI 130
N+ K L D +HW++ ++ P ++ +MV+ V+ L++++ +E++A +
Sbjct: 156 NYGIGSHKGVSLFDHGTVKHWMVRIDKPGVGIVTKAQMVDHCVQLLSKVLWNEKDAQMCL 215
Query: 131 YSVSTHTYTGFGALISEELSYKVKELPGVLWVLPDSYLDVPNKDYGGD 178
Y VS + GF + E + ++ +PGVL V+PD+ + NKDY GD
Sbjct: 216 YHVSWQSDFGFCCDLDERSAVELAGVPGVLAVVPDNSFESLNKDYEGD 263
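The Identities, Positives and Gaps lines in the alignments above give each statistic as a fraction followed by a whole-number percentage. In this report the percentages appear to be truncated rather than rounded (for example, 126/171 is printed as 73%, and 85/152 as 55%). The following Python sketch, which is not part of the BLAST output, extracts the fractions and recomputes the percentage under that truncation assumption. The names parse_stats and percent_floor are hypothetical.

```python
import re

# Matches e.g. "Identities = 127/171 (74%)"; the Gaps field is optional
# in the report, so each statistic is matched independently.
FRACTION = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, alignment_length, reported_percent)}."""
    stats = {}
    for name, num, den, pct in FRACTION.findall(line):
        stats[name.lower()] = (int(num), int(den), int(pct))
    return stats


def percent_floor(count, length):
    """Whole-number percentage, truncated (assumed to mirror the report)."""
    return 100 * count // length


if __name__ == "__main__":
    line = ("Identities = 126/171 (73%), Positives = 141/171 (82%), "
            "Gaps = 1/171 (0%)")
    for name, (count, length, reported) in parse_stats(line).items():
        print(name, count, length, reported, percent_floor(count, length))
```

Working from the raw fractions rather than the printed percentages avoids losing precision when hits are compared across alignments of different length.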