Miyakogusa Predicted Gene

Lj4g3v1539340.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v1539340.1 tr|Q9FJW1|Q9FJW1_ARATH At5g67620 OS=Arabidopsis
thaliana GN=At5g67620 PE=2 SV=1,33.67,4e-19,seg,NULL; DUF4228,Protein
of unknown function DUF4228,CUFF.49382.1
         (183 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G50090.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   152   9e-38
AT5G50090.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   152   2e-37
AT1G60010.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   142   1e-34
AT5G62900.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   140   5e-34
AT1G10530.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   130   5e-31
AT5G67620.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    97   5e-21

>AT5G50090.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN: N-terminal
           protein myristoylation; LOCATED IN: cellular_component
           unknown; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT5G62900.1); Has 1807 Blast hits
           to 1807 proteins in 277 species: Archae - 0; Bacteria -
           0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses -
           0; Other Eukaryotes - 339 (source: NCBI BLink). |
           chr5:20369961-20370878 FORWARD LENGTH=159
          Length = 159

 Score =  152 bits (385), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 93/183 (50%), Positives = 113/183 (61%), Gaps = 25/183 (13%)

Query: 1   MGNCQAIDTATLVIQQPNGKEEKLYWPVSASEVMKTNPGHYVALLISTTLCTSKDSENCP 60
           MGNCQA+DTA +VIQ PNGKEEKL  PVSAS VMK NPGH V+LLISTT           
Sbjct: 1   MGNCQAVDTARVVIQHPNGKEEKLSCPVSASYVMKMNPGHCVSLLISTT----------A 50

Query: 61  NKKSDSNTTNPVRLTRIKLLKPTDTLVLGQVYRLISAQEVMKGLLAKKQAKMKRNESESA 120
              + S    P+RLTRIKLL+PTDTLVLG VYRLI+ +EVMKGL+AKK +K+K+    S 
Sbjct: 51  LSSASSGHGGPLRLTRIKLLRPTDTLVLGHVYRLITTKEVMKGLMAKKCSKLKKESKGSD 110

Query: 121 PKINQKKEMMDKPARRSEPEENQEAKNERHGSRXXXXXXXXXXXXXKSRTWQPSLKSISE 180
            K+   K +     +    ++ Q  K E+  SR              SR+WQPSL+SISE
Sbjct: 111 DKLEMVKAI--NSTKLDNEDQLQMKKQEKERSRI-------------SRSWQPSLQSISE 155

Query: 181 ATS 183
             S
Sbjct: 156 GGS 158


>AT5G50090.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G62900.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:20369961-20370878 FORWARD LENGTH=153
          Length = 153

 Score =  152 bits (383), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 95/183 (51%), Positives = 114/183 (62%), Gaps = 31/183 (16%)

Query: 1   MGNCQAIDTATLVIQQPNGKEEKLYWPVSASEVMKTNPGHYVALLISTTLCTSKDSENCP 60
           MGNCQA+DTA +VIQ PNGKEEKL  PVSAS VMK NPGH V+LLISTT           
Sbjct: 1   MGNCQAVDTARVVIQHPNGKEEKLSCPVSASYVMKMNPGHCVSLLISTT----------A 50

Query: 61  NKKSDSNTTNPVRLTRIKLLKPTDTLVLGQVYRLISAQEVMKGLLAKKQAKMKRNESESA 120
              + S    P+RLTRIKLL+PTDTLVLG VYRLI+ +EVMKGL+AKK +K+K+    S 
Sbjct: 51  LSSASSGHGGPLRLTRIKLLRPTDTLVLGHVYRLITTKEVMKGLMAKKCSKLKKESKGSD 110

Query: 121 PKINQKKEMMDKPARRSEPEENQEAKNERHGSRXXXXXXXXXXXXXKSRTWQPSLKSISE 180
            K+   K      A  S   +N++ + ER  SR              SR+WQPSL+SISE
Sbjct: 111 DKLEMVK------AINSTKLDNEDQEKER--SRI-------------SRSWQPSLQSISE 149

Query: 181 ATS 183
             S
Sbjct: 150 GGS 152


>AT1G60010.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN: N-terminal
           protein myristoylation; LOCATED IN: cellular_component
           unknown; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT1G10530.1);
           Has 185 Blast hits to 185 proteins in 18 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 3; Plants - 180;
           Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink).
           | chr1:22095660-22096434 REVERSE LENGTH=173
          Length = 173

 Score =  142 bits (359), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/193 (44%), Positives = 116/193 (60%), Gaps = 30/193 (15%)

Query: 1   MGNCQAIDTATLVIQQPNGKEEKLYWPVSASEVMKTNPGHYVALLI--------STTLCT 52
           MGNCQA+D A LV+Q P+GK ++ Y PVS SE+M+  PGHYV+L+I        +TT  T
Sbjct: 1   MGNCQAVDAAALVLQHPDGKIDRYYGPVSVSEIMRMYPGHYVSLIIPLPEKNIPATTTTT 60

Query: 53  SKDSENCPNKKSDSNTTNPVRLTRIKLLKPTDTLVLGQVYRLISAQEVMKGLLAKKQAKM 112
              SE              VR TR+KLL+PT+ LVLG  YRLI++QEVMK L AKK AK 
Sbjct: 61  DDKSER-----------KVVRFTRVKLLRPTENLVLGHAYRLITSQEVMKVLRAKKYAKT 109

Query: 113 KRNESES--APKINQKKEMMDKPARRSEPEENQEAKNERHGSRXXXXXXXXXXXXXKSRT 170
           K+++SE+    K    ++ +D+    S+  +N E K+E+  S              +S+T
Sbjct: 110 KKHQSETSKEKKKPSSEKKIDE---ESDKNQNLETKDEKQRS------VLTNSASSRSKT 160

Query: 171 WQPSLKSISEATS 183
           W+PSL+SISEATS
Sbjct: 161 WRPSLQSISEATS 173


>AT5G62900.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN: N-terminal
           protein myristoylation; LOCATED IN: cellular_component
           unknown; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 12 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G50090.1);
           Has 157 Blast hits to 157 proteins in 14 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 157;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr5:25248872-25249725 FORWARD LENGTH=161
          Length = 161

 Score =  140 bits (353), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 80/183 (43%), Positives = 111/183 (60%), Gaps = 22/183 (12%)

Query: 1   MGNCQAIDTATLVIQQPNGKEEKLYWPVSASEVMKTNPGHYVALLISTTLCTSKDSENCP 60
           MGNCQA + AT VIQQP+GK  + Y  V+ASEV+K++PGH+VALL+S+ +         P
Sbjct: 1   MGNCQAAEAATTVIQQPDGKSVRFYCTVNASEVIKSHPGHHVALLLSSAV---------P 51

Query: 61  NKKSDSNTTNPVRLTRIKLLKPTDTLVLGQVYRLISAQEVMKGLLAKKQAKMKRNESESA 120
           +  S       +R+TRIKLL+P+D L+LG VYRLIS++EVMKG+ AKK  KMK+   E  
Sbjct: 52  HGGS-------LRVTRIKLLRPSDNLLLGHVYRLISSEEVMKGIRAKKSGKMKKIHGE-- 102

Query: 121 PKINQKKEMMDKPARRSEPEENQEAKNERHGSRXXXXXXXXXXXXXKSRTWQPSLKSISE 180
              +  +E ++    RSE   +++ +   H                K R WQPSL+SISE
Sbjct: 103 --FSVAEEEINPLTLRSESASDKDTQRRIH--EKQRGMMNTGGATNKVRAWQPSLQSISE 158

Query: 181 ATS 183
           +TS
Sbjct: 159 STS 161


>AT1G10530.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN: N-terminal
           protein myristoylation; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT1G60010.1);
           Has 143 Blast hits to 143 proteins in 14 species: Archae
           - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 143;
           Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).
           | chr1:3471805-3472526 REVERSE LENGTH=166
          Length = 166

 Score =  130 bits (327), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 80/185 (43%), Positives = 106/185 (57%), Gaps = 21/185 (11%)

Query: 1   MGNCQAIDTATLVIQQPNGKEEKLYWPVSASEVMKTNPGHYVALLISTTLCTSKDSENCP 60
           MGNCQA++ A LV+Q P G  ++ Y  VS +EVM   PGHYV+L+I     + ++ +N P
Sbjct: 1   MGNCQAVNAAVLVLQHPGGIIDRYYSSVSVTEVMAMYPGHYVSLIIP---LSEEEEKNIP 57

Query: 61  --NKKSDSNTTNPVRLTRIKLLKPTDTLVLGQVYRLISAQEVMKGLLAKKQAKMKRNESE 118
              K  D      VR TR++LL+PT+ LVLG  YRLI++QEVMK L  KK AK K+++ E
Sbjct: 58  ATEKGDDKKQRKAVRFTRVQLLRPTENLVLGHAYRLITSQEVMKVLREKKSAKTKKHQIE 117

Query: 119 SAPKINQKKEMMDKPARRSEPEENQEAKNERHGSRXXXXXXXXXXXXXKSRTWQPSLKSI 178
              K    K+  DK      PE+ Q  K  R                 KS+TW+PSL+SI
Sbjct: 118 ---KTTTAKKFSDKKV----PEKKQ-GKQFR--------VIRNSTSLLKSKTWRPSLQSI 161

Query: 179 SEATS 183
           SEATS
Sbjct: 162 SEATS 166


>AT5G67620.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN: N-terminal
           protein myristoylation; LOCATED IN: microtubule; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G62900.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:26964891-26965720
           REVERSE LENGTH=182
          Length = 182

 Score = 97.4 bits (241), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 48/115 (41%), Positives = 74/115 (64%), Gaps = 16/115 (13%)

Query: 1   MGNCQAIDTATLVIQQP-NGKEEKLYWPVSASEVMKTNPGHYVALLISTTLCTSKDSENC 59
           MGNCQA + AT++I  P   K E++YW V+AS++MK+NPGHYVA+++++   T K+ +  
Sbjct: 1   MGNCQAAEAATVLIHHPAENKVERIYWSVTASDIMKSNPGHYVAVVVTSP--TMKNEKGL 58

Query: 60  PNKKSDSNTTNPVRLTRIKLLKPTDTLVLGQVYRLISAQEVMKGLLAKKQAKMKR 114
           P             L ++KLL+P DTL++G VYRL+S +EV+     KK  K+ +
Sbjct: 59  P-------------LKQLKLLRPDDTLLIGHVYRLVSFEEVLNEFATKKCVKLGK 100