Miyakogusa Predicted Gene

Lj0g3v0359799.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0359799.1 Non Characterized Hit- tr|C5XYY7|C5XYY7_SORBI
Putative uncharacterized protein Sb04g008610
OS=Sorghu,28.48,8e-18,coiled-coil,NULL; seg,NULL,CUFF.24775.1
         (457 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G58110.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   197   1e-50
AT3G58110.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   197   2e-50
AT2G42370.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   177   1e-44

>AT3G58110.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT2G42370.1). |
           chr3:21516775-21519129 FORWARD LENGTH=754
          Length = 754

 Score =  197 bits (502), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 141/410 (34%), Positives = 206/410 (50%), Gaps = 14/410 (3%)

Query: 25  LEEHNIELSLGQDNAXXXXXXXXXXXXXXIMMEFELSKEEEPGMWLIDQRNNVGEPFLRQ 84
           +EEH +EL+LGQ+                  M+ E +K+EE   W  +  ++ G  FLR+
Sbjct: 339 VEEHMLELNLGQETVSEMVSGEERGPVEGQPMDVEENKKEEDERWAWNGDSHAGSHFLRR 398

Query: 85  CRNVDVN--GMDCGLVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXFHLSPKYSNHFEGMT 142
           C +        D  +                              F   P   +  +G+ 
Sbjct: 399 CNHSSAREGDEDNHIEGSMEMGEDEPIEDVEEEETEEDTEKHEGGFPFFPN-GDSLQGVG 457

Query: 143 SGTGSIIHAMEAGQRPFSSGIDLHDNP-GGDFLSSRDDPPMISGS---SLFGNGH-KRDI 197
            G   +  A   G   ++SG+ +H N  GGDFL+SR +  M  GS   SLFGNG+ KR+I
Sbjct: 458 QGNLMLGDASPLG---YNSGLQIHGNSIGGDFLASRGEMHMAMGSGSSSLFGNGNNKREI 514

Query: 198 GLVDNHNSHHFLNVSNKRMRSDSP-WNSKPVDFEMCMEQMEHCMGKVRMMYASKDEAFEE 256
              +N  ++H  N  NKR+R++ P W+ KP   +MC++QM +   K R+ +A KD   E+
Sbjct: 515 EH-ENGITYHSHNPINKRLRTEEPSWDEKPPPVDMCLDQMAYWAEKARLSFAEKDREREQ 573

Query: 257 SNMNGQLLLNELQKRDDEIDRLHKAKIEESQRRQMEMYRLEKELYMMQSLVEGYRKAMKE 316
           S +N Q L+NELQ +   I  L + K EE QR+ + +Y+LE EL MM S+VEGYRKA+K 
Sbjct: 574 SVINQQYLMNELQSKTAMIQELERTKFEEQQRKDIMIYKLESELRMMTSVVEGYRKALKI 633

Query: 317 TQKAFAEYRARCP-QADEPLYKDVPGSGGLVLSVTXXXXXXXXXXXXXXXXXXXXXXXFG 375
           TQKA  E+R RCP + D+ +Y DV GSGGLVLS T                         
Sbjct: 634 TQKASREHRKRCPLRDDKQVYMDVKGSGGLVLSTTEIEKLRLKQEEEDRMQRVLAKRQID 693

Query: 376 DVELTYMGELESHMIVIESFNDRLMAMENQVKHLKEVKAKSKVSDPPECA 425
           D E  ++ + E HM  +E  N+RL+  E++VK L+E  ++SK  +  E A
Sbjct: 694 DFEHNWLNKFEEHMEAVELLNERLIENEDEVKILRETLSESKNIETSEVA 743


>AT3G58110.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: cultured cell;
           BEST Arabidopsis thaliana protein match is: unknown
           protein (TAIR:AT2G42370.1); Has 2534 Blast hits to 1905
           proteins in 233 species: Archae - 11; Bacteria - 102;
           Metazoa - 890; Fungi - 241; Plants - 124; Viruses - 59;
           Other Eukaryotes - 1107 (source: NCBI BLink). |
           chr3:21516775-21519129 FORWARD LENGTH=784
          Length = 784

 Score =  197 bits (500), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 141/410 (34%), Positives = 206/410 (50%), Gaps = 14/410 (3%)

Query: 25  LEEHNIELSLGQDNAXXXXXXXXXXXXXXIMMEFELSKEEEPGMWLIDQRNNVGEPFLRQ 84
           +EEH +EL+LGQ+                  M+ E +K+EE   W  +  ++ G  FLR+
Sbjct: 369 VEEHMLELNLGQETVSEMVSGEERGPVEGQPMDVEENKKEEDERWAWNGDSHAGSHFLRR 428

Query: 85  CRNVDVN--GMDCGLVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXFHLSPKYSNHFEGMT 142
           C +        D  +                              F   P   +  +G+ 
Sbjct: 429 CNHSSAREGDEDNHIEGSMEMGEDEPIEDVEEEETEEDTEKHEGGFPFFPN-GDSLQGVG 487

Query: 143 SGTGSIIHAMEAGQRPFSSGIDLHDNP-GGDFLSSRDDPPMISGS---SLFGNGH-KRDI 197
            G   +  A   G   ++SG+ +H N  GGDFL+SR +  M  GS   SLFGNG+ KR+I
Sbjct: 488 QGNLMLGDASPLG---YNSGLQIHGNSIGGDFLASRGEMHMAMGSGSSSLFGNGNNKREI 544

Query: 198 GLVDNHNSHHFLNVSNKRMRSDSP-WNSKPVDFEMCMEQMEHCMGKVRMMYASKDEAFEE 256
              +N  ++H  N  NKR+R++ P W+ KP   +MC++QM +   K R+ +A KD   E+
Sbjct: 545 EH-ENGITYHSHNPINKRLRTEEPSWDEKPPPVDMCLDQMAYWAEKARLSFAEKDREREQ 603

Query: 257 SNMNGQLLLNELQKRDDEIDRLHKAKIEESQRRQMEMYRLEKELYMMQSLVEGYRKAMKE 316
           S +N Q L+NELQ +   I  L + K EE QR+ + +Y+LE EL MM S+VEGYRKA+K 
Sbjct: 604 SVINQQYLMNELQSKTAMIQELERTKFEEQQRKDIMIYKLESELRMMTSVVEGYRKALKI 663

Query: 317 TQKAFAEYRARCP-QADEPLYKDVPGSGGLVLSVTXXXXXXXXXXXXXXXXXXXXXXXFG 375
           TQKA  E+R RCP + D+ +Y DV GSGGLVLS T                         
Sbjct: 664 TQKASREHRKRCPLRDDKQVYMDVKGSGGLVLSTTEIEKLRLKQEEEDRMQRVLAKRQID 723

Query: 376 DVELTYMGELESHMIVIESFNDRLMAMENQVKHLKEVKAKSKVSDPPECA 425
           D E  ++ + E HM  +E  N+RL+  E++VK L+E  ++SK  +  E A
Sbjct: 724 DFEHNWLNKFEEHMEAVELLNERLIENEDEVKILRETLSESKNIETSEVA 773


>AT2G42370.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G58110.2); Has 205 Blast hits to 191 proteins
           in 60 species: Archae - 3; Bacteria - 23; Metazoa - 73;
           Fungi - 8; Plants - 34; Viruses - 0; Other Eukaryotes -
           64 (source: NCBI BLink). | chr2:17643334-17645533
           FORWARD LENGTH=715
          Length = 715

 Score =  177 bits (449), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 100/256 (39%), Positives = 153/256 (59%), Gaps = 4/256 (1%)

Query: 159 FSSGIDLHDNPGGDFLSSRDDPPMISGSSLFGNGHKRDIGLVDNHNSHHFLN-VSNKRMR 217
           ++SG+ +H +   DFL+ R    M+ G S FGN +KR+ G  +N  S+HF N  S KR++
Sbjct: 445 YNSGLQVHGSSTCDFLAPRAVMHMVPGRSHFGNDNKREFGH-ENDISYHFDNPASTKRLK 503

Query: 218 SDSPWNSKPVDFEMCMEQMEHCMGKVRMMYASKDEAFEESNMNGQLLLNELQKRDDEIDR 277
           + S W+ KPV F++CMEQ++H   K ++ Y  KD+A  ESNM  Q+L NELQ+R+D I +
Sbjct: 504 TPS-WDDKPVPFDICMEQIKHLADKAKLSYVEKDQACGESNMREQMLQNELQRREDIIQQ 562

Query: 278 LHKAKIEESQRRQMEMYRLEKELYMMQSLVEGYRKAMKETQKAFAEYRARCPQADEPLYK 337
           LHK   EE  ++ +E+Y+LE EL MM S++  Y+KA+KE+QKA  ++R  CP  D+P+Y 
Sbjct: 563 LHKESYEELHKKNVEIYKLENELRMMTSVLAWYQKALKESQKACRKHRKVCPLLDKPIYI 622

Query: 338 DVPGSGGLVLSVTXXXXXXXXXXXXXXXXXXXXXXXFGDVELTYMGELESHM-IVIESFN 396
           DV G+GGLVLS                           +V   ++ E E ++   +E  +
Sbjct: 623 DVKGTGGLVLSTAEIEKLRLKEEKEEGMRRVLIERQVKEVGSLWIKEYEVNLKKKVELLD 682

Query: 397 DRLMAMENQVKHLKEV 412
            +L+  +N++K LKE 
Sbjct: 683 GKLIGFQNKMKLLKET 698
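

Note on the percentages: in the Identities/Positives/Gaps lines above, the values in parentheses appear to be truncated rather than rounded (153/256 is about 59.8% but is reported as 59%). The following is a minimal sketch, not part of the BLAST output, that parses one such line and recomputes the percentages under that assumption. The names `STATS_RE`, `parse_stats` and `truncated_percent` are hypothetical helpers introduced here for illustration only.

```python
import re

# Matches e.g. "Identities = 100/256 (39%)" in a BLAST statistics line.
STATS_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STATS_RE.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage, rounded down (assumed reporting convention)."""
    return count * 100 // length


# Statistics line copied from the AT2G42370.1 alignment in this report.
line = " Identities = 100/256 (39%), Positives = 153/256 (59%), Gaps = 4/256 (1%)"

for name, (count, length, reported) in parse_stats(line).items():
    print(name, truncated_percent(count, length), reported)
```

The same check holds for the two AT3G58110 alignments (141/410, 206/410 and 14/410), which suggests the truncation assumption is consistent with every statistics line in this report.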