Miyakogusa Predicted Gene
- Lj6g3v2133230.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v2133230.1 Non Characterized Hit- tr|I3SC95|I3SC95_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2 SV=1,98.7,0,FAMILY NOT
NAMED,NULL; seg,NULL,CUFF.60694.1
(231 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT1G11430.1 | Symbols: | plastid developmental protein DAG, put... 297 4e-81
AT3G06790.2 | Symbols: | plastid developmental protein DAG, put... 154 5e-38
AT3G06790.1 | Symbols: | plastid developmental protein DAG, put... 154 6e-38
AT2G33430.1 | Symbols: DAL1, DAL | differentiation and greening-... 142 2e-34
AT1G32580.1 | Symbols: | plastid developmental protein DAG, put... 140 1e-33
AT4G20020.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 138 4e-33
AT4G20020.2 | Symbols: | unknown protein; BEST Arabidopsis thal... 137 4e-33
AT2G35240.1 | Symbols: | plastid developmental protein DAG, put... 137 6e-33
AT5G44780.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 127 7e-30
AT3G15000.1 | Symbols: | cobalt ion binding | chr3:5050321-5052... 122 1e-28
AT1G72530.1 | Symbols: | plastid developmental protein DAG, put... 106 1e-23
AT1G72530.2 | Symbols: | plastid developmental protein DAG, put... 100 8e-22
AT1G53260.2 | Symbols: | LOCATED IN: endomembrane system; BEST ... 68 6e-12
AT1G53260.1 | Symbols: | LOCATED IN: endomembrane system; BEST ... 67 1e-11
AT3G20930.1 | Symbols: | RNA-binding (RRM/RBD/RNP motifs) famil... 65 3e-11
>AT1G11430.1 | Symbols: | plastid developmental protein DAG,
putative | chr1:3847273-3848938 FORWARD LENGTH=232
Length = 232
Score = 297 bits (761), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 141/183 (77%), Positives = 160/183 (87%), Gaps = 2/183 (1%)
Query: 40 RSIRAVTRARNPTRIRAA-LDEDYSAKRSSSSEQRETIMLPGCDYNHWLIVMEFPKDPAP 98
R+I R ++AA +D DYS+KRS+S+EQRETIMLPGCDYNHWLIVMEFPKDPAP
Sbjct: 43 RNISTAGSRRRVAIVKAATVDSDYSSKRSNSNEQRETIMLPGCDYNHWLIVMEFPKDPAP 102
Query: 99 SREQMIETYLFTLSTVLGSMEEAKKNMYAFSTTTYTGFQCTVDEATSEKFKGLPGVLWVL 158
SR+QMI+TYL TL+TVLGSMEEAKKNMYAFSTTTYTGFQCT+DE TSEKFKGLPGVLWVL
Sbjct: 103 SRDQMIDTYLNTLATVLGSMEEAKKNMYAFSTTTYTGFQCTIDEETSEKFKGLPGVLWVL 162
Query: 159 PDSYIDVKNKDYGGDKYINGEIIPSKYPTYQPKRSGGSRNDSRRYERKRDG-PPTDRRRP 217
PDSYIDVKNKDYGGDKYINGEIIP YPTYQPK+ ++ S+RYERKRDG PP ++R+P
Sbjct: 163 PDSYIDVKNKDYGGDKYINGEIIPCTYPTYQPKQRNNTKYQSKRYERKRDGPPPPEQRKP 222
Query: 218 KQE 220
+QE
Sbjct: 223 RQE 225
>AT3G06790.2 | Symbols: | plastid developmental protein DAG,
putative | chr3:2144564-2145743 REVERSE LENGTH=244
Length = 244
Score = 154 bits (389), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 69/110 (62%), Positives = 85/110 (77%), Gaps = 1/110 (0%)
Query: 73 RETIMLPGCDYNHWLIVMEFPKDPAPSREQMIETYLFTLSTVLGSMEEAKKNMYAFSTTT 132
+ETI+L GCDY HWLIVMEF DP P+ E+MI +Y+ TL++VLG EEAKK +Y+ T+T
Sbjct: 84 KETILLDGCDYEHWLIVMEF-TDPKPTEEEMINSYVKTLTSVLGCEEEAKKKIYSVCTST 142
Query: 133 YTGFQCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIP 182
YTGF + E S K K LPGVLWVLPDSY+DV NKDYGGD Y+ G++IP
Sbjct: 143 YTGFGALISEELSCKVKALPGVLWVLPDSYLDVPNKDYGGDLYVEGKVIP 192
>AT3G06790.1 | Symbols: | plastid developmental protein DAG,
putative | chr3:2144564-2145743 REVERSE LENGTH=244
Length = 244
Score = 154 bits (388), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 69/110 (62%), Positives = 85/110 (77%), Gaps = 1/110 (0%)
Query: 73 RETIMLPGCDYNHWLIVMEFPKDPAPSREQMIETYLFTLSTVLGSMEEAKKNMYAFSTTT 132
+ETI+L GCDY HWLIVMEF DP P+ E+MI +Y+ TL++VLG EEAKK +Y+ T+T
Sbjct: 84 KETILLDGCDYEHWLIVMEF-TDPKPTEEEMINSYVKTLTSVLGWQEEAKKKIYSVCTST 142
Query: 133 YTGFQCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIP 182
YTGF + E S K K LPGVLWVLPDSY+DV NKDYGGD Y+ G++IP
Sbjct: 143 YTGFGALISEELSCKVKALPGVLWVLPDSYLDVPNKDYGGDLYVEGKVIP 192
>AT2G33430.1 | Symbols: DAL1, DAL | differentiation and
greening-like 1 | chr2:14162732-14164729 FORWARD
LENGTH=219
Length = 219
Score = 142 bits (358), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 68/139 (48%), Positives = 92/139 (66%), Gaps = 7/139 (5%)
Query: 77 MLPGCDYNHWLIVMEFPKDPAPSREQMIETYLFTLSTVLGSMEEAKKNMYAFSTTTYTGF 136
+ PGCDY HWLIVM+ P +++QMI+ Y+ TL+ V+GS EEAKK +Y S Y GF
Sbjct: 78 LFPGCDYEHWLIVMDKPGGEGATKQQMIDCYIQTLAKVVGSEEEAKKRIYNVSCERYLGF 137
Query: 137 QCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIP-----SKYPTYQPK 191
C +DE TS K +GLPGVL+VLPDSY+D +NKDYG + ++NGEI+ + QP+
Sbjct: 138 GCEIDEETSTKLEGLPGVLFVLPDSYVDPENKDYGAELFVNGEIVQRSPERQRRVEPQPQ 197
Query: 192 RSGG--SRNDSRRYERKRD 208
R+ ND RY R+R+
Sbjct: 198 RAQDRPRYNDRTRYSRRRE 216
>AT1G32580.1 | Symbols: | plastid developmental protein DAG,
putative | chr1:11784108-11785430 FORWARD LENGTH=229
Length = 229
Score = 140 bits (352), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 57/105 (54%), Positives = 78/105 (74%)
Query: 77 MLPGCDYNHWLIVMEFPKDPAPSREQMIETYLFTLSTVLGSMEEAKKNMYAFSTTTYTGF 136
+ PGCDY HWLIVM+ P +++QMI+ Y+ TL+ ++GS EEAKK +Y S Y GF
Sbjct: 87 LFPGCDYEHWLIVMDKPGGENATKQQMIDCYVQTLAKIIGSEEEAKKKIYNVSCERYFGF 146
Query: 137 QCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEII 181
C +DE TS K +GLPGVL++LPDSY+D +NKDYG + ++NGEI+
Sbjct: 147 GCEIDEETSNKLEGLPGVLFILPDSYVDQENKDYGAELFVNGEIV 191
>AT4G20020.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G44780.1); Has 28928 Blast hits to 16023
proteins in 1033 species: Archae - 4; Bacteria - 4155;
Metazoa - 15463; Fungi - 2938; Plants - 3091; Viruses -
205; Other Eukaryotes - 3072 (source: NCBI BLink). |
chr4:10844433-10846085 REVERSE LENGTH=419
Length = 419
Score = 138 bits (347), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 82/181 (45%), Positives = 107/181 (59%), Gaps = 16/181 (8%)
Query: 37 VAPRSIRAVTRARNPTRIRAALDEDYSAKRSSSSEQRETIMLPGCDYNHWLIVMEFPKDP 96
V RS TRA P R+ + Y + +T++ GCDYNHWLI M+F K+
Sbjct: 45 VIGRSTEVATRA--PARLFST--RQYKLYKEGDEITEDTVLFEGCDYNHWLITMDFSKEE 100
Query: 97 AP-SREQMIETYLFTLSTVLG-SMEEAKKNMYAFSTTTYTGFQCTVDEATSEKFKGLPGV 154
P S E+M+ Y T + LG S+EEAK+ MYA STTTY GFQ + E SEKFK LPGV
Sbjct: 101 TPKSPEEMVAAYEETCAQGLGISVEEAKQRMYACSTTTYQGFQAIMTEQESEKFKDLPGV 160
Query: 155 LWVLPDSYIDVKNKDYGGDKYINGEIIPSKYPTYQ-----PK----RSGGSRNDSRRYER 205
+++LPDSYID +NK+YGGDKY NG +I + P Q P+ RSGG + ++R
Sbjct: 161 VFILPDSYIDPQNKEYGGDKYENG-VITHRPPPIQSGRARPRPRFDRSGGGSGGPQNFQR 219
Query: 206 K 206
Sbjct: 220 N 220
>AT4G20020.2 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G44780.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr4:10844360-10846085 REVERSE LENGTH=406
Length = 406
Score = 137 bits (346), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 82/181 (45%), Positives = 107/181 (59%), Gaps = 16/181 (8%)
Query: 37 VAPRSIRAVTRARNPTRIRAALDEDYSAKRSSSSEQRETIMLPGCDYNHWLIVMEFPKDP 96
V RS TRA P R+ + Y + +T++ GCDYNHWLI M+F K+
Sbjct: 45 VIGRSTEVATRA--PARLFST--RQYKLYKEGDEITEDTVLFEGCDYNHWLITMDFSKEE 100
Query: 97 AP-SREQMIETYLFTLSTVLG-SMEEAKKNMYAFSTTTYTGFQCTVDEATSEKFKGLPGV 154
P S E+M+ Y T + LG S+EEAK+ MYA STTTY GFQ + E SEKFK LPGV
Sbjct: 101 TPKSPEEMVAAYEETCAQGLGISVEEAKQRMYACSTTTYQGFQAIMTEQESEKFKDLPGV 160
Query: 155 LWVLPDSYIDVKNKDYGGDKYINGEIIPSKYPTYQ-----PK----RSGGSRNDSRRYER 205
+++LPDSYID +NK+YGGDKY NG +I + P Q P+ RSGG + ++R
Sbjct: 161 VFILPDSYIDPQNKEYGGDKYENG-VITHRPPPIQSGRARPRPRFDRSGGGSGGPQNFQR 219
Query: 206 K 206
Sbjct: 220 N 220
>AT2G35240.1 | Symbols: | plastid developmental protein DAG,
putative | chr2:14845099-14846262 REVERSE LENGTH=232
Length = 232
Score = 137 bits (345), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 57/106 (53%), Positives = 77/106 (72%)
Query: 77 MLPGCDYNHWLIVMEFPKDPAPSREQMIETYLFTLSTVLGSMEEAKKNMYAFSTTTYTGF 136
+ PGCDY HWLIVME P ++QMI+ Y+ TL+ ++GS EEA+K +Y S Y GF
Sbjct: 90 LFPGCDYEHWLIVMEKPGGENAQKQQMIDCYVQTLAKIVGSEEEARKKIYNVSCERYFGF 149
Query: 137 QCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIP 182
C +DE TS K +GLPGVL+VLPDSY+D + KDYG + ++NGE++P
Sbjct: 150 GCEIDEETSNKLEGLPGVLFVLPDSYVDPEFKDYGAELFVNGEVVP 195
>AT5G44780.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT4G20020.2); Has 9661 Blast hits to 6233 proteins
in 635 species: Archae - 4; Bacteria - 1116; Metazoa -
4251; Fungi - 1510; Plants - 1359; Viruses - 43; Other
Eukaryotes - 1378 (source: NCBI BLink). |
chr5:18068100-18070544 FORWARD LENGTH=723
Length = 723
Score = 127 bits (318), Expect = 7e-30, Method: Composition-based stats.
Identities = 74/159 (46%), Positives = 94/159 (59%), Gaps = 8/159 (5%)
Query: 39 PRSIRAVTRA--RNPTRIRAALDEDYSAKRSSSSEQRETIMLPGCDYNHWLIVMEFPKDP 96
PRS+ + A R+P R+ + Y S + GCD+NHWLI M FPKD
Sbjct: 39 PRSVVKQSTAINRSPARLFSTTQYQYDPYTGEDSFMPDN---EGCDFNHWLITMNFPKDN 95
Query: 97 APSREQMIETYLFTLSTVLG-SMEEAKKNMYAFSTTTYTGFQCTVDEATSEKFKGLPGVL 155
PSRE+MI + T + L S+EEAKK +YA TT+Y GFQ T+ EKF+ LPGV
Sbjct: 96 LPSREEMISIFEQTCAKGLAISLEEAKKKIYAICTTSYQGFQATMTIGEVEKFRDLPGVQ 155
Query: 156 WVLPDSYIDVKNKDYGGDKYINGEIIPSKYPTYQPKRSG 194
+++PDSYIDV+NK YGGDKY NG I P P P + G
Sbjct: 156 YIIPDSYIDVENKVYGGDKYENGVITPGPVPV--PTKEG 192
>AT3G15000.1 | Symbols: | cobalt ion binding | chr3:5050321-5052121
FORWARD LENGTH=395
Length = 395
Score = 122 bits (307), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 57/124 (45%), Positives = 85/124 (68%), Gaps = 6/124 (4%)
Query: 68 SSSEQRETIMLPGCDYNHWLIVMEFPKDPAPSREQMIETYLFTLSTVLGSMEEAKKNMYA 127
S+ +ETI+L GCD+ HWL+V+E P+ P+R+++I++Y+ TL+ ++GS +EA+ +Y+
Sbjct: 82 SNRPPKETILLDGCDFEHWLVVVEPPQG-EPTRDEIIDSYIKTLAQIVGSEDEARMKIYS 140
Query: 128 FSTTTYTGFQCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIPSKYPT 187
ST Y F V E S K K L V WVLPDSY+DV+NKDYGG+ +I+G+ +P
Sbjct: 141 VSTRCYYAFGALVSEDLSHKLKELSNVRWVLPDSYLDVRNKDYGGEPFIDGKAVP----- 195
Query: 188 YQPK 191
Y PK
Sbjct: 196 YDPK 199
>AT1G72530.1 | Symbols: | plastid developmental protein DAG,
putative | chr1:27312999-27313937 FORWARD LENGTH=188
Length = 188
Score = 106 bits (264), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 78/120 (65%), Gaps = 5/120 (4%)
Query: 67 SSSSEQRETIMLP----GCDYNHWLIVMEFPKDPAPSREQMIETYLFTLSTVLGSMEEAK 122
S ++ E I +P GCDY HWL++M+ P + P+R ++++++ TL+ LGS EEAK
Sbjct: 34 SETTSWSELIRVPSLVEGCDYKHWLVLMK-PPNGYPTRNHIVQSFVETLAMALGSEEEAK 92
Query: 123 KNMYAFSTTTYTGFQCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIP 182
+++Y+ ST Y F C + E + K + LP V WVLPDS+I + YGG+ +++GE++P
Sbjct: 93 RSIYSVSTKYYYAFGCRIHEPLTYKIRSLPDVKWVLPDSFIVDGDNRYGGEPFVDGEVVP 152
>AT1G72530.2 | Symbols: | plastid developmental protein DAG,
putative | chr1:27312999-27313937 FORWARD LENGTH=192
Length = 192
Score = 100 bits (249), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 48/124 (38%), Positives = 78/124 (62%), Gaps = 9/124 (7%)
Query: 67 SSSSEQRETIMLP----GCDYNHWLIVMEFPKDPAPSREQMIETYLFTLSTVLGSMEEAK 122
S ++ E I +P GCDY HWL++M+ P + P+R ++++++ TL+ LGS EEAK
Sbjct: 34 SETTSWSELIRVPSLVEGCDYKHWLVLMK-PPNGYPTRNHIVQSFVETLAMALGSEEEAK 92
Query: 123 KNMYAFSTTTYTGFQCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYG----GDKYING 178
+++Y+ ST Y F C + E + K + LP V WVLPDS+I + YG G+ +++G
Sbjct: 93 RSIYSVSTKYYYAFGCRIHEPLTYKIRSLPDVKWVLPDSFIVDGDNRYGVFFAGEPFVDG 152
Query: 179 EIIP 182
E++P
Sbjct: 153 EVVP 156
>AT1G53260.2 | Symbols: | LOCATED IN: endomembrane system; BEST
Arabidopsis thaliana protein match is: cobalt ion
binding (TAIR:AT3G15000.1); Has 246 Blast hits to 241
proteins in 32 species: Archae - 0; Bacteria - 2;
Metazoa - 7; Fungi - 16; Plants - 212; Viruses - 1;
Other Eukaryotes - 8 (source: NCBI BLink). |
chr1:19859406-19860421 REVERSE LENGTH=230
Length = 230
Score = 67.8 bits (164), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 33/67 (49%), Positives = 41/67 (61%), Gaps = 5/67 (7%)
Query: 125 MYAFSTTTYTGFQCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIPSK 184
+Y+ S Y F V E S K K LP V WVLPDSY+D KNKDYGG+ +I+G+ +P
Sbjct: 3 IYSVSHKCYFAFGALVSEDLSHKIKELPKVKWVLPDSYLDGKNKDYGGEPFIDGKAVP-- 60
Query: 185 YPTYQPK 191
Y PK
Sbjct: 61 ---YDPK 64
>AT1G53260.1 | Symbols: | LOCATED IN: endomembrane system; BEST
Arabidopsis thaliana protein match is: cobalt ion
binding (TAIR:AT3G15000.1); Has 32763 Blast hits to
18534 proteins in 929 species: Archae - 22; Bacteria -
2420; Metazoa - 15140; Fungi - 5401; Plants - 5313;
Viruses - 485; Other Eukaryotes - 3982 (source: NCBI
BLink). | chr1:19859393-19860421 REVERSE LENGTH=271
Length = 271
Score = 67.0 bits (162), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/58 (51%), Positives = 38/58 (65%)
Query: 125 MYAFSTTTYTGFQCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGDKYINGEIIP 182
+Y+ S Y F V E S K K LP V WVLPDSY+D KNKDYGG+ +I+G+ +P
Sbjct: 3 IYSVSHKCYFAFGALVSEDLSHKIKELPKVKWVLPDSYLDGKNKDYGGEPFIDGKAVP 60
>AT3G20930.1 | Symbols: | RNA-binding (RRM/RBD/RNP motifs) family
protein | chr3:7331739-7333749 FORWARD LENGTH=374
Length = 374
Score = 65.5 bits (158), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 65/117 (55%), Gaps = 2/117 (1%)
Query: 59 DEDYSAKRSS--SSEQRETIMLPGCDYNHWLIVMEFPKDPAPSREQMIETYLFTLSTVLG 116
++DYS+++ + + + HW++ ++ P ++ QM++ + LS VL
Sbjct: 147 EQDYSSEKKNYGIGSHKGVSLFDHGTVKHWMVRIDKPGVGIVTKAQMVDHCVQLLSKVLW 206
Query: 117 SMEEAKKNMYAFSTTTYTGFQCTVDEATSEKFKGLPGVLWVLPDSYIDVKNKDYGGD 173
+ ++A+ +Y S + GF C +DE ++ + G+PGVL V+PD+ + NKDY GD
Sbjct: 207 NEKDAQMCLYHVSWQSDFGFCCDLDERSAVELAGVPGVLAVVPDNSFESLNKDYEGD 263
Score = 60.1 bits (144), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 53/88 (60%)
Query: 84 NHWLIVMEFPKDPAPSREQMIETYLFTLSTVLGSMEEAKKNMYAFSTTTYTGFQCTVDEA 143
++W+++++ P S+ M++ Y+ L+ VLG+ ++A+ ++Y S T+ GF C +DE
Sbjct: 71 SYWMVLLDKPPHWVSSKSAMVDYYVEILAKVLGNEKDAQVSIYDASFDTHFGFCCHIDED 130
Query: 144 TSEKFKGLPGVLWVLPDSYIDVKNKDYG 171
S + LPGV+ + P+ + K+YG
Sbjct: 131 ASRQLASLPGVVSIRPEQDYSSEKKNYG 158