Miyakogusa Predicted Gene

Lj1g3v2280650.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v2280650.1 Non Characterized Hit- tr|O82329|O82329_ARATH
Putative uncharacterized protein At2g14910
OS=Arabidop,25.7,1e-18,seg,NULL; DUF760,Protein of unknown function
DUF760; coiled-coil,NULL,CUFF.28791.1
         (341 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G63610.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   429   e-120
AT1G63610.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   429   e-120
AT2G14910.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...    92   4e-19
AT5G14970.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    88   8e-18
AT2G14910.2 | Symbols:  | unknown protein; LOCATED IN: chloropla...    80   1e-15

>AT1G63610.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G14910.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr1:23583675-23585599 REVERSE LENGTH=341
          Length = 341

 Score =  429 bits (1103), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 217/284 (76%), Positives = 253/284 (89%), Gaps = 4/284 (1%)

Query: 59  LRVEAYDSSTNPAGDSNSKPPNGTLS--KSRREILLEYVKNVQPEFMEMFVKRAPQQVVD 116
           LRV+AY SS++ + DS++ PPNGT    KSRR+ILLEYV+NV+PEFMEMFVKRAP+ VV+
Sbjct: 57  LRVKAYGSSSDSSADSST-PPNGTRQQPKSRRDILLEYVQNVKPEFMEMFVKRAPKHVVE 115

Query: 117 AMRQTVTNMIGTLPPQFFAVTITTVAENLAQLMYSVMMTGYMFKNAQYRLELQEGLEQVA 176
           AMRQTVTNMIGTLPPQFFAVT+T+VAENLAQLM SV+MTGYMF+NAQYRLELQ+ LEQVA
Sbjct: 116 AMRQTVTNMIGTLPPQFFAVTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQVA 175

Query: 177 LPDVQDKKD-KPDYAPGTQKNVSGEVLRWNHVSGPERIDAKKYXXXXXXXXXXXNRQVGR 235
           LP+ +D+K    DYAPGTQKNVSGEV+RWN+VSGPE+IDAKKY           NRQVGR
Sbjct: 176 LPEPRDQKGGDEDYAPGTQKNVSGEVIRWNNVSGPEKIDAKKYIELLEAEIEELNRQVGR 235

Query: 236 QSSNAQNELLEYLKSLEPRNLKDLTSSAGEDVVLAMNTFIKRLLAVADPSQMKTSVTETS 295
           +S+N QNE+LEYLKSLEP+NLK+LTS+AGEDV +AMNTF+KRLLAV+DP+QMKT+VTETS
Sbjct: 236 KSANQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAVSDPNQMKTNVTETS 295

Query: 296 ATELSKLLYWLMVVGYSIRNIEVRYDMERVLGTPPKLAELPPGE 339
           A +L+KLLYWLMVVGYSIRNIEVR+DMERVLGT PKLAELPPGE
Sbjct: 296 AADLAKLLYWLMVVGYSIRNIEVRFDMERVLGTQPKLAELPPGE 339


>AT1G63610.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT2G14910.1); Has 537 Blast
           hits to 411 proteins in 100 species: Archae - 0;
           Bacteria - 231; Metazoa - 0; Fungi - 0; Plants - 94;
           Viruses - 0; Other Eukaryotes - 212 (source: NCBI
           BLink). | chr1:23583675-23585599 REVERSE LENGTH=340
          Length = 340

 Score =  429 bits (1103), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 217/283 (76%), Positives = 253/283 (89%), Gaps = 3/283 (1%)

Query: 59  LRVEAYDSSTNPAGDSNSKPPNGTLS-KSRREILLEYVKNVQPEFMEMFVKRAPQQVVDA 117
           LRV+AY SS++ + DS++ PPNGT   KSRR+ILLEYV+NV+PEFMEMFVKRAP+ VV+A
Sbjct: 57  LRVKAYGSSSDSSADSST-PPNGTRQPKSRRDILLEYVQNVKPEFMEMFVKRAPKHVVEA 115

Query: 118 MRQTVTNMIGTLPPQFFAVTITTVAENLAQLMYSVMMTGYMFKNAQYRLELQEGLEQVAL 177
           MRQTVTNMIGTLPPQFFAVT+T+VAENLAQLM SV+MTGYMF+NAQYRLELQ+ LEQVAL
Sbjct: 116 MRQTVTNMIGTLPPQFFAVTVTSVAENLAQLMMSVLMTGYMFRNAQYRLELQQSLEQVAL 175

Query: 178 PDVQDKKD-KPDYAPGTQKNVSGEVLRWNHVSGPERIDAKKYXXXXXXXXXXXNRQVGRQ 236
           P+ +D+K    DYAPGTQKNVSGEV+RWN+VSGPE+IDAKKY           NRQVGR+
Sbjct: 176 PEPRDQKGGDEDYAPGTQKNVSGEVIRWNNVSGPEKIDAKKYIELLEAEIEELNRQVGRK 235

Query: 237 SSNAQNELLEYLKSLEPRNLKDLTSSAGEDVVLAMNTFIKRLLAVADPSQMKTSVTETSA 296
           S+N QNE+LEYLKSLEP+NLK+LTS+AGEDV +AMNTF+KRLLAV+DP+QMKT+VTETSA
Sbjct: 236 SANQQNEILEYLKSLEPQNLKELTSTAGEDVAVAMNTFVKRLLAVSDPNQMKTNVTETSA 295

Query: 297 TELSKLLYWLMVVGYSIRNIEVRYDMERVLGTPPKLAELPPGE 339
            +L+KLLYWLMVVGYSIRNIEVR+DMERVLGT PKLAELPPGE
Sbjct: 296 ADLAKLLYWLMVVGYSIRNIEVRFDMERVLGTQPKLAELPPGE 338


>AT2G14910.1 | Symbols:  | unknown protein; LOCATED IN: chloroplast;
           EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G14970.1); Has 605 Blast
           hits to 425 proteins in 102 species: Archae - 0;
           Bacteria - 300; Metazoa - 25; Fungi - 0; Plants - 89;
           Viruses - 0; Other Eukaryotes - 191 (source: NCBI
           BLink). | chr2:6406957-6409015 REVERSE LENGTH=386
          Length = 386

 Score = 92.4 bits (228), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 135/330 (40%), Gaps = 85/330 (25%)

Query: 71  AGDSNSKPPNGTLSKSRRE----ILLEYVKNVQPEFMEMFVKRAPQQVVDAMRQTVTNMI 126
            GD      + TL    R     +L + ++ ++P  + +  K  P   +DAM++T++ M+
Sbjct: 57  CGDDGFSLDDFTLHSDSRSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGML 116

Query: 127 GTLPPQFFAVTITTVAENLAQLMYSVMMTGYMFKNAQYRLELQEGLEQVALPDVQDKKDK 186
           G LP   F V I ++ E L++L+ S MMTGY  +NA+YRL L++ L      D+      
Sbjct: 117 GLLPSDRFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNL------DMSGGGLD 170

Query: 187 PDYAPGTQKNVSGEVLRWNHVSGPERIDAKKYXXXXXXXXXXXNRQVGRQSSNAQ----- 241
              +  T+ ++ G     +HVS      +K+               +GR SS AQ     
Sbjct: 171 SHASENTEYDMEGTFPDEDHVS------SKRDSRTQNLSETIDEEGLGRVSSEAQEYILR 224

Query: 242 --------------------------------NELLEYLKSLEPRNLKDLTSSAGEDVVL 269
                                           N+LL+YL+SL+P  + +L+  A  +V  
Sbjct: 225 LQSQLSSVKKELQEMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKE 284

Query: 270 AMNTFIKRLLAVADPS-QMKTSVTETSATE------------------------------ 298
            +++ +  LLA   P    K   +E   TE                              
Sbjct: 285 TIHSVVHGLLATLSPKMHSKFPASEVPPTETVKAKSDEDCAELVENTSLQFQPLISLTRD 344

Query: 299 -LSKLLYWLMVVGYSIRNIEVRYDMERVLG 327
            L++LL+W M++G+ +R +E R ++  VL 
Sbjct: 345 YLARLLFWCMLLGHYLRGLEYRMELMEVLS 374


>AT5G14970.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G14910.1); Has 579 Blast hits to 397 proteins
           in 95 species: Archae - 0; Bacteria - 294; Metazoa - 0;
           Fungi - 0; Plants - 86; Viruses - 0; Other Eukaryotes -
           199 (source: NCBI BLink). | chr5:4847368-4848760 FORWARD
           LENGTH=355
          Length = 355

 Score = 88.2 bits (217), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 67/277 (24%), Positives = 128/277 (46%), Gaps = 41/277 (14%)

Query: 90  ILLEYVKNVQPEFMEMFVKRAPQQVVDAMRQTVTNMIGTLPPQFFAVTITTVAENLAQLM 149
           ++   V  +QP    +  K       D+M+QT+++M+G LP   F+V++T   + L +L+
Sbjct: 79  VIANLVNRIQPLDTSVISKGLSDSAKDSMKQTISSMLGLLPSDQFSVSVTISEQPLYRLL 138

Query: 150 YSVMMTGYMFKNAQYRLELQEGLEQVALP--DVQDKKDKPDYAPGTQKNVSGEVLRWNHV 207
            S ++TGY   NA+YR+ L+   +    P  + +D+  K +   G++K +S ++   N V
Sbjct: 139 ISSIITGYTLWNAEYRVSLRRNFDIPIDPRKEEEDQSSKDNVRFGSEKGMSEDL--GNCV 196

Query: 208 SGPERI----------DAKKYXXXXXXXXXXXNRQVGRQSSNA---------QNELLEYL 248
              ER+          +A  Y             ++  Q   A         +N+LL+YL
Sbjct: 197 EEFERLSPQVFGDLSPEALSYIQLLQSELSSMKEELDSQKKKALRIECEKGNRNDLLDYL 256

Query: 249 KSLEPRNLKDLTSSAGEDVVLAMNTFIKRLLAVADPSQMKTSV----------------- 291
           +SL+P  + +L+  +  +V   +N  ++ +L      Q  ++                  
Sbjct: 257 RSLDPEMVTELSQLSSPEVEEIVNQLVQNVLERLFEDQTTSNFMQNPGIRTTEGGDGTGR 316

Query: 292 -TETSATELSKLLYWLMVVGYSIRNIEVRYDMERVLG 327
             +TS   L+KLL+W M++G+ +R +E R  +  V+G
Sbjct: 317 KVDTSRDYLAKLLFWCMLLGHHLRGLENRLHLSCVVG 353


>AT2G14910.2 | Symbols:  | unknown protein; LOCATED IN: chloroplast;
           EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G14970.1); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr2:6407195-6409015 REVERSE LENGTH=366
          Length = 366

 Score = 80.5 bits (197), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/255 (25%), Positives = 109/255 (42%), Gaps = 53/255 (20%)

Query: 71  AGDSNSKPPNGTLSKSRRE----ILLEYVKNVQPEFMEMFVKRAPQQVVDAMRQTVTNMI 126
            GD      + TL    R     +L + ++ ++P  + +  K  P   +DAM++T++ M+
Sbjct: 57  CGDDGFSLDDFTLHSDSRSPKKCVLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGML 116

Query: 127 GTLPPQFFAVTITTVAENLAQLMYSVMMTGYMFKNAQYRLELQEGLEQVALPDVQDKKDK 186
           G LP   F V I ++ E L++L+ S MMTGY  +NA+YRL L++ L      D+      
Sbjct: 117 GLLPSDRFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNL------DMSGGGLD 170

Query: 187 PDYAPGTQKNVSGEVLRWNHVSGPERIDAKKYXXXXXXXXXXXNRQVGRQSSNAQ----- 241
              +  T+ ++ G     +HVS      +K+               +GR SS AQ     
Sbjct: 171 SHASENTEYDMEGTFPDEDHVS------SKRDSRTQNLSETIDEEGLGRVSSEAQEYILR 224

Query: 242 --------------------------------NELLEYLKSLEPRNLKDLTSSAGEDVVL 269
                                           N+LL+YL+SL+P  + +L+  A  +V  
Sbjct: 225 LQSQLSSVKKELQEMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKE 284

Query: 270 AMNTFIKRLLAVADP 284
            +++ +  LLA   P
Sbjct: 285 TIHSVVHGLLATLSP 299