Miyakogusa Predicted Gene

Lj4g3v2789340.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v2789340.1 Non Characterized Hit- tr|C5XW36|C5XW36_SORBI
Putative uncharacterized protein Sb04g004670
OS=Sorghu,28.85,3e-18,FtsH protease domain-like,NULL;
seg,NULL,CUFF.51670.1
         (330 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G21960.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...   442   e-124
AT1G56180.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...    91   1e-18
AT5G27290.1 | Symbols:  | unknown protein; LOCATED IN: chloropla...    75   6e-14
AT1G54680.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    59   4e-09
AT1G54680.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    57   2e-08
AT1G54680.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    56   3e-08
AT5G27290.2 | Symbols:  | unknown protein; LOCATED IN: chloropla...    51   1e-06
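
The summary table above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output itself; the helper name `parse_summary_line` is hypothetical. It assumes each row ends with two whitespace-separated fields (bit score, then E-value) and that an E-value printed as `e-124` abbreviates `1e-124`.

```python
def parse_summary_line(line):
    """Split one summary row into (identifier, bit score, E-value).

    Assumption: the last two whitespace-separated fields are the bit
    score and the E-value; everything before them is the description.
    """
    head, bits, evalue = line.rsplit(None, 2)
    # Legacy BLAST text output drops the leading "1" for values like 1e-124.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    identifier = head.split()[0]
    return identifier, float(bits), float(evalue)


row = ("AT2G21960.1 | Symbols:  | unknown protein; "
       "LOCATED IN: chloropla...   442   e-124")
print(parse_summary_line(row))
```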

>AT2G21960.1 | Symbols:  | unknown protein; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G56180.1); Has 224 Blast
           hits to 222 proteins in 59 species: Archae - 0; Bacteria
           - 65; Metazoa - 0; Fungi - 0; Plants - 134; Viruses - 0;
           Other Eukaryotes - 25 (source: NCBI BLink). |
           chr2:9354990-9356958 FORWARD LENGTH=332
          Length = 332

 Score =  442 bits (1136), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 217/283 (76%), Positives = 250/283 (88%)

Query: 41  GVELNTLESAIAKKDSNAVKEALDQLSEGGWAKKWGSQPYVXXXXXXXXXXXXXGIKNAE 100
           G +L++LESAI KKDSN VKEALD+LSE GWAKKW SQPY+             GIKNAE
Sbjct: 50  GFDLSSLESAINKKDSNGVKEALDKLSEEGWAKKWSSQPYLSRRTTSLRELTTLGIKNAE 109

Query: 101 NLAIPSVRNDAAFLFTVVGTTGFLGILTGQLPGDWGFFVPYLIGSISLVVLAVGSISPGL 160
            LAIPSVRNDAAFLFTVVG+TGF+ +L GQLPGDWGFFVPYL+GSISLVVLAVGS+SPGL
Sbjct: 110 TLAIPSVRNDAAFLFTVVGSTGFIAVLAGQLPGDWGFFVPYLVGSISLVVLAVGSVSPGL 169

Query: 161 LQAAIGSFSTVFPDYQERIARHEAAHFLIAYLLGVPILGYSLDIGKEHVNLIDQRLEKLI 220
           LQAAI  FST FPDYQERIA HEAAHFL+AYL+G+PILGYSLDIGKEHVNLID+RL KLI
Sbjct: 170 LQAAISGFSTFFPDYQERIAAHEAAHFLVAYLIGLPILGYSLDIGKEHVNLIDERLAKLI 229

Query: 221 YSGQLNAKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNL 280
           YSG+L++KE+DRLA V+MAGLAAEGL YDKVIGQSADLF+LQRFINR++P++S +QQQNL
Sbjct: 230 YSGKLDSKELDRLAAVAMAGLAAEGLKYDKVIGQSADLFSLQRFINRSQPKISNEQQQNL 289

Query: 281 TRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIESVA 323
           TRWAV+++ASLLKNNK  HEALMA+M+K ASV+ECIQTIE+ +
Sbjct: 290 TRWAVLYSASLLKNNKTIHEALMAAMSKNASVLECIQTIETAS 332


>AT1G56180.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G27290.1); Has 436 Blast
           hits to 436 proteins in 83 species: Archae - 0; Bacteria
           - 153; Metazoa - 0; Fungi - 0; Plants - 160; Viruses -
           0; Other Eukaryotes - 123 (source: NCBI BLink). |
           chr1:21026243-21028047 REVERSE LENGTH=389
          Length = 389

 Score = 90.9 bits (224), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/166 (32%), Positives = 84/166 (50%), Gaps = 9/166 (5%)

Query: 164 AIGSFSTVFPDYQERIARHEAAHFLIAYLLGVPILGYSLDI---------GKEHVNLIDQ 214
            +   S  +P ++ RI  HEA H L+AYL+G PI G  LD          G+      DQ
Sbjct: 217 CLAQVSCYWPPHKRRIVVHEAGHLLVAYLMGCPIRGVILDPVVAMQMGVQGQAGTQFWDQ 276

Query: 215 RLEKLIYSGQLNAKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSK 274
           ++E  I  G+L+    DR ++V  AG+AAE L+Y +  G   D    +      +P LS 
Sbjct: 277 KMESEIAEGRLSGSSFDRYSMVLFAGIAAEALVYGEAEGGENDENLFRSISVLLEPPLSV 336

Query: 275 DQQQNLTRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIE 320
            Q  N  RW+V+ + +LLK +K +H A + ++   + +   I+ IE
Sbjct: 337 AQMSNQARWSVLQSYNLLKWHKAAHRAAVEALQVGSPLSIVIRRIE 382


>AT5G27290.1 | Symbols:  | unknown protein; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G54680.3); Has 30201 Blast
           hits to 17322 proteins in 780 species: Archae - 12;
           Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
           5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr5:9618590-9620473 REVERSE LENGTH=341
          Length = 341

 Score = 75.1 bits (183), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 82/157 (52%), Gaps = 13/157 (8%)

Query: 175 YQERIARHEAAHFLIAYLLGVPILGYSLD----IGKE-HVNL------IDQRLEKLIYSG 223
           Y  R+ +HEA HFL+AYL+G+   GY+L     + KE  +N+      +D    + + SG
Sbjct: 179 YHNRVVQHEAGHFLVAYLVGILPRGYTLSSLEALQKEGSLNIQAGSAFVDYEFLEEVNSG 238

Query: 224 QLNAKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNLTRW 283
           +++A  ++R + +++AG+A E L+Y    G   D+  L   +        K   Q   RW
Sbjct: 239 KVSATMLNRFSCIALAGVATEYLLYGYAEGGLDDISKLDGLVKSLGFTQKKADSQ--VRW 296

Query: 284 AVMFAASLLKNNKESHEALMASMTKKASVVECIQTIE 320
           +V+    LL+ ++ +   L  +M+K  SV  CIQ IE
Sbjct: 297 SVLNTILLLRRHEIARSKLAQAMSKGESVGSCIQIIE 333


>AT1G54680.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G27290.1). |
           chr1:20413211-20414482 FORWARD LENGTH=217
          Length = 217

 Score = 58.9 bits (141), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 43/169 (25%), Positives = 82/169 (48%), Gaps = 21/169 (12%)

Query: 179 IARHEAAHFLIAYLLGVPILGY---SLDIGKEHVNLIDQRLE-----------KLIYSGQ 224
           + +HE+ HFL+ YLLGV    Y   +L+  +++V+ +  R+E           K    GQ
Sbjct: 51  VVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVSNVTGRVEFVGFEFLKQLMKDDVDGQ 110

Query: 225 LN-----AKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQN 279
           +N     +K ++  + V + G+  E +++    G  +D+  L   +       ++ +++ 
Sbjct: 111 MNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLG--FTESEKEA 168

Query: 280 LTRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIESVAAEFNI 328
             +WAV    SLL ++KE+  +L  +M K   +  CI+ IES  +   I
Sbjct: 169 HIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 217


>AT1G54680.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G27290.1);
           Has 200 Blast hits to 200 proteins in 57 species: Archae
           - 0; Bacteria - 59; Metazoa - 0; Fungi - 0; Plants -
           127; Viruses - 0; Other Eukaryotes - 14 (source: NCBI
           BLink). | chr1:20413211-20414482 FORWARD LENGTH=223
          Length = 223

 Score = 56.6 bits (135), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 82/175 (46%), Gaps = 27/175 (15%)

Query: 179 IARHEAAHFLIAYLLGVPILGY---SLDIGKEHVNLIDQRLE-----------------K 218
           + +HE+ HFL+ YLLGV    Y   +L+  +++V+ +  R+E                 K
Sbjct: 51  VVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVSNVTGRVEFVGFEFLKQVGAANQLMK 110

Query: 219 LIYSGQLN-----AKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLS 273
               GQ+N     +K ++  + V + G+  E +++    G  +D+  L   +       +
Sbjct: 111 DDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLG--FT 168

Query: 274 KDQQQNLTRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIESVAAEFNI 328
           + +++   +WAV    SLL ++KE+  +L  +M K   +  CI+ IES  +   I
Sbjct: 169 ESEKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 223


>AT1G54680.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G27290.1); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr1:20413337-20414482 FORWARD LENGTH=219
          Length = 219

 Score = 56.2 bits (134), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/175 (24%), Positives = 82/175 (46%), Gaps = 27/175 (15%)

Query: 179 IARHEAAHFLIAYLLGVPILGY---SLDIGKEHVNLIDQRLE-----------------K 218
           + +HE+ HFL+ YLLGV    Y   +L+  +++V+ +  R+E                 K
Sbjct: 47  VVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVSNVTGRVEFVGFEFLKQVGAANQLMK 106

Query: 219 LIYSGQLN-----AKEIDRLAVVSMAGLAAEGLIYDKVIGQSADLFTLQRFINRTKPQLS 273
               GQ+N     +K ++  + V + G+  E +++    G  +D+  L   +       +
Sbjct: 107 DDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLG--FT 164

Query: 274 KDQQQNLTRWAVMFAASLLKNNKESHEALMASMTKKASVVECIQTIESVAAEFNI 328
           + +++   +WAV    SLL ++KE+  +L  +M K   +  CI+ IES  +   I
Sbjct: 165 ESEKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 219


>AT5G27290.2 | Symbols:  | unknown protein; LOCATED IN: chloroplast;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G54680.3); Has 199 Blast
           hits to 194 proteins in 57 species: Archae - 0; Bacteria
           - 61; Metazoa - 0; Fungi - 0; Plants - 129; Viruses - 0;
           Other Eukaryotes - 9 (source: NCBI BLink). |
           chr5:9618787-9620473 REVERSE LENGTH=262
          Length = 262

 Score = 50.8 bits (120), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 51/84 (60%), Gaps = 11/84 (13%)

Query: 175 YQERIARHEAAHFLIAYLLGVPILGYSLD----IGKE-HVNL------IDQRLEKLIYSG 223
           Y  R+ +HEA HFL+AYL+G+   GY+L     + KE  +N+      +D    + + SG
Sbjct: 179 YHNRVVQHEAGHFLVAYLVGILPRGYTLSSLEALQKEGSLNIQAGSAFVDYEFLEEVNSG 238

Query: 224 QLNAKEIDRLAVVSMAGLAAEGLI 247
           +++A  ++R + +++AG+A E L+
Sbjct: 239 KVSATMLNRFSCIALAGVATEYLL 262
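
The `Identities`, `Positives`, and `Gaps` lines in the alignments above give each statistic as a fraction plus a whole-number percentage. The percentages in this report appear to be truncated rather than rounded (for example, 217/283 is about 76.68% but is shown as 76%). The following Python sketch, with the hypothetical helper `parse_hsp_stats`, recovers the unrounded values from such a line; it assumes the `Label = count/length` layout seen in this report.

```python
import re

# Matches "Identities = 217/283", "Positives = 250/283", "Gaps = 9/166".
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def parse_hsp_stats(line):
    """Return {label: (count, aligned_length, percent)} for one statistics line."""
    stats = {}
    for label, count, length in FRACTION_RE.findall(line):
        count, length = int(count), int(length)
        stats[label] = (count, length, 100.0 * count / length)
    return stats


line = " Identities = 217/283 (76%), Positives = 250/283 (88%)"
for label, (count, length, pct) in parse_hsp_stats(line).items():
    print(f"{label}: {count}/{length} = {pct:.2f}%")
```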