Miyakogusa Predicted Gene
- Lj1g3v2280650.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v2280650.1 Non Chatacterized Hit- tr|O82329|O82329_ARATH
Putative uncharacterized protein At2g14910
OS=Arabidop,25.7,1e-18,seg,NULL; DUF760,Protein of unknown function
DUF760; coiled-coil,NULL,CUFF.28791.1
(341 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G63610.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 429 e-120
AT1G63610.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 429 e-120
AT2G14910.1 | Symbols: | unknown protein; LOCATED IN: chloropla... 92 4e-19
AT5G14970.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 88 8e-18
AT2G14910.2 | Symbols: | unknown protein; LOCATED IN: chloropla... 80 1e-15
>AT1G63610.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G14910.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr1:23583675-23585599 REVERSE LENGTH=341
Length = 341
Score = 429 bits (1103), Expect = e-120, Method: Compositional matrix adjust.
Identities = 217/284 (76%), Positives = 253/284 (89%), Gaps = 4/284 (1%)
Query: 59 LRVEAYDSSTNPAGDSNSKPPNGTLS--KSRREILLEYVKNVQPEFMEMFVKRAPQQVVD 116
LRV+AY SS++ + DS++ PPNGT KSRR+ILLEYV+NV+PEFMEMFVKRAP+ VV+
Sbjct: 57 LRVKAYGSSSDSSADSST-PPNGTRQQPKSRRDILLEYVQNVKPEFMEMFVKRAPKHVVE 115
Query: 117 AMRQTVTNMIGTLPPQFFAVTITTVAENLAQLMYSVMMTGYMFKNAQYRLELQEGLEQVA 176
AMRQTVTNMIGTLPPQFFAVT+T+VAENLAQLM SV+MTGYMF+NAQYRLELQ+ LEQVA
Sbjct: 116 AMRQTVTNMIGTLPPQFFAVTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQVA 175
Query: 177 LPDVQDKKD-KPDYAPGTQKNVSGEVLRWNHVSGPERIDAKKYXXXXXXXXXXXNRQVGR 235
LP+ +D+K DYAPGTQKNVSGEV+RWN+VSGPE+IDAKKY NRQVGR
Sbjct: 176 LPEPRDQKGGDEDYAPGTQKNVSGEVIRWNNVSGPEKIDAKKYIELLEAEIEELNRQVGR 235
Query: 236 QSSNAQNELLEYLKSLEPRNLKDLTSSAGEDVVLAMNTFIKRLLAVADPSQMKTSVTETS 295
+S+N QNE+LEYLKSLEP+NLK+LTS+AGEDV +AMNTF+KRLLAV+DP+QMKT+VTETS
Sbjct: 236 KSANQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAVSDPNQMKTNVTETS 295
Query: 296 ATELSKLLYWLMVVGYSIRNIEVRYDMERVLGTPPKLAELPPGE 339
A +L+KLLYWLMVVGYSIRNIEVR+DMERVLGT PKLAELPPGE
Sbjct: 296 AADLAKLLYWLMVVGYSIRNIEVRFDMERVLGTQPKLAELPPGE 339
>AT1G63610.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: chloroplast;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT2G14910.1); Has 537 Blast
hits to 411 proteins in 100 species: Archae - 0;
Bacteria - 231; Metazoa - 0; Fungi - 0; Plants - 94;
Viruses - 0; Other Eukaryotes - 212 (source: NCBI
BLink). | chr1:23583675-23585599 REVERSE LENGTH=340
Length = 340
Score = 429 bits (1103), Expect = e-120, Method: Compositional matrix adjust.
Identities = 217/283 (76%), Positives = 253/283 (89%), Gaps = 3/283 (1%)
Query: 59 LRVEAYDSSTNPAGDSNSKPPNGTLS-KSRREILLEYVKNVQPEFMEMFVKRAPQQVVDA 117
LRV+AY SS++ + DS++ PPNGT KSRR+ILLEYV+NV+PEFMEMFVKRAP+ VV+A
Sbjct: 57 LRVKAYGSSSDSSADSST-PPNGTRQPKSRRDILLEYVQNVKPEFMEMFVKRAPKHVVEA 115
Query: 118 MRQTVTNMIGTLPPQFFAVTITTVAENLAQLMYSVMMTGYMFKNAQYRLELQEGLEQVAL 177
MRQTVTNMIGTLPPQFFAVT+T+VAENLAQLM SV+MTGYMF+NAQYRLELQ+ LEQVAL
Sbjct: 116 MRQTVTNMIGTLPPQFFAVTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQVAL 175
Query: 178 PDVQDKKD-KPDYAPGTQKNVSGEVLRWNHVSGPERIDAKKYXXXXXXXXXXXNRQVGRQ 236
P+ +D+K DYAPGTQKNVSGEV+RWN+VSGPE+IDAKKY NRQVGR+
Sbjct: 176 PEPRDQKGGDEDYAPGTQKNVSGEVIRWNNVSGPEKIDAKKYIELLEAEIEELNRQVGRK 235
Query: 237 SSNAQNELLEYLKSLEPRNLKDLTSSAGEDVVLAMNTFIKRLLAVADPSQMKTSVTETSA 296
S+N QNE+LEYLKSLEP+NLK+LTS+AGEDV +AMNTF+KRLLAV+DP+QMKT+VTETSA
Sbjct: 236 SANQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAVSDPNQMKTNVTETSA 295
Query: 297 TELSKLLYWLMVVGYSIRNIEVRYDMERVLGTPPKLAELPPGE 339
+L+KLLYWLMVVGYSIRNIEVR+DMERVLGT PKLAELPPGE
Sbjct: 296 ADLAKLLYWLMVVGYSIRNIEVRFDMERVLGTQPKLAELPPGE 338
>AT2G14910.1 | Symbols: | unknown protein; LOCATED IN: chloroplast;
EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G14970.1); Has 605 Blast
hits to 425 proteins in 102 species: Archae - 0;
Bacteria - 300; Metazoa - 25; Fungi - 0; Plants - 89;
Viruses - 0; Other Eukaryotes - 191 (source: NCBI
BLink). | chr2:6406957-6409015 REVERSE LENGTH=386
Length = 386
Score = 92.4 bits (228), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 135/330 (40%), Gaps = 85/330 (25%)
Query: 71 AGDSNSKPPNGTLSKSRRE----ILLEYVKNVQPEFMEMFVKRAPQQVVDAMRQTVTNMI 126
GD + TL R +L + ++ ++P + + K P +DAM++T++ M+
Sbjct: 57 CGDDGFSLDDFTLHSDSRSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGML 116
Query: 127 GTLPPQFFAVTITTVAENLAQLMYSVMMTGYMFKNAQYRLELQEGLEQVALPDVQDKKDK 186
G LP F V I ++ E L++L+ S MMTGY +NA+YRL L++ L D+
Sbjct: 117 GLLPSDRFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNL------DMSGGGLD 170
Query: 187 PDYAPGTQKNVSGEVLRWNHVSGPERIDAKKYXXXXXXXXXXXNRQVGRQSSNAQ----- 241
+ T+ ++ G +HVS +K+ +GR SS AQ
Sbjct: 171 SHASENTEYDMEGTFPDEDHVS------SKRDSRTQNLSETIDEEGLGRVSSEAQEYILR 224
Query: 242 --------------------------------NELLEYLKSLEPRNLKDLTSSAGEDVVL 269
N+LL+YL+SL+P + +L+ A +V
Sbjct: 225 LQSQLSSVKKELQEMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKE 284
Query: 270 AMNTFIKRLLAVADPS-QMKTSVTETSATE------------------------------ 298
+++ + LLA P K +E TE
Sbjct: 285 TIHSVVHGLLATLSPKMHSKFPASEVPPTETVKAKSDEDCAELVENTSLQFQPLISLTRD 344
Query: 299 -LSKLLYWLMVVGYSIRNIEVRYDMERVLG 327
L++LL+W M++G+ +R +E R ++ VL
Sbjct: 345 YLARLLFWCMLLGHYLRGLEYRMELMEVLS 374
>AT5G14970.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT2G14910.1); Has 579 Blast hits to 397 proteins
in 95 species: Archae - 0; Bacteria - 294; Metazoa - 0;
Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes -
199 (source: NCBI BLink). | chr5:4847368-4848760 FORWARD
LENGTH=355
Length = 355
Score = 88.2 bits (217), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 67/277 (24%), Positives = 128/277 (46%), Gaps = 41/277 (14%)
Query: 90 ILLEYVKNVQPEFMEMFVKRAPQQVVDAMRQTVTNMIGTLPPQFFAVTITTVAENLAQLM 149
++ V +QP + K D+M+QT+++M+G LP F+V++T + L +L+
Sbjct: 79 VIANLVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPSDQFSVSVTISEQPLYRLL 138
Query: 150 YSVMMTGYMFKNAQYRLELQEGLEQVALP--DVQDKKDKPDYAPGTQKNVSGEVLRWNHV 207
S ++TGY NA+YR+ L+ + P + +D+ K + G++K +S ++ N V
Sbjct: 139 ISSIITGYTLWNAEYRVSLRRNFDIPIDPRKEEEDQSSKDNVRFGSEKGMSEDL--GNCV 196
Query: 208 SGPERI----------DAKKYXXXXXXXXXXXNRQVGRQSSNA---------QNELLEYL 248
ER+ +A Y ++ Q A +N+LL+YL
Sbjct: 197 EEFERLSPQVFGDLSPEALSYIQLLQSELSSMKEELDSQKKKALRIECEKGNRNDLLDYL 256
Query: 249 KSLEPRNLKDLTSSAGEDVVLAMNTFIKRLLAVADPSQMKTSV----------------- 291
+SL+P + +L+ + +V +N ++ +L Q ++
Sbjct: 257 RSLDPEMVTELSQLSSPEVEEIVNQLVQNVLERLFEDQTTSNFMQNPGIRTTEGGDGTGR 316
Query: 292 -TETSATELSKLLYWLMVVGYSIRNIEVRYDMERVLG 327
+TS L+KLL+W M++G+ +R +E R + V+G
Sbjct: 317 KVDTSRDYLAKLLFWCMLLGHHLRGLENRLHLSCVVG 353
>AT2G14910.2 | Symbols: | unknown protein; LOCATED IN: chloroplast;
EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
growth stages; BEST Arabidopsis thaliana protein match
is: unknown protein (TAIR:AT5G14970.1); Has 35333 Blast
hits to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr2:6407195-6409015 REVERSE LENGTH=366
Length = 366
Score = 80.5 bits (197), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 109/255 (42%), Gaps = 53/255 (20%)
Query: 71 AGDSNSKPPNGTLSKSRRE----ILLEYVKNVQPEFMEMFVKRAPQQVVDAMRQTVTNMI 126
GD + TL R +L + ++ ++P + + K P +DAM++T++ M+
Sbjct: 57 CGDDGFSLDDFTLHSDSRSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGML 116
Query: 127 GTLPPQFFAVTITTVAENLAQLMYSVMMTGYMFKNAQYRLELQEGLEQVALPDVQDKKDK 186
G LP F V I ++ E L++L+ S MMTGY +NA+YRL L++ L D+
Sbjct: 117 GLLPSDRFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNL------DMSGGGLD 170
Query: 187 PDYAPGTQKNVSGEVLRWNHVSGPERIDAKKYXXXXXXXXXXXNRQVGRQSSNAQ----- 241
+ T+ ++ G +HVS +K+ +GR SS AQ
Sbjct: 171 SHASENTEYDMEGTFPDEDHVS------SKRDSRTQNLSETIDEEGLGRVSSEAQEYILR 224
Query: 242 --------------------------------NELLEYLKSLEPRNLKDLTSSAGEDVVL 269
N+LL+YL+SL+P + +L+ A +V
Sbjct: 225 LQSQLSSVKKELQEMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKE 284
Query: 270 AMNTFIKRLLAVADP 284
+++ + LLA P
Sbjct: 285 TIHSVVHGLLATLSP 299