Miyakogusa Predicted Gene

Lj0g3v0325869.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0325869.1 Non Chatacterized Hit- tr|D7U2W3|D7U2W3_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,32.5,0.0000000000005,SUBFAMILY NOT NAMED,NULL;
NFRKB-RELATED,Nuclear factor related to kappa-B-binding protein;
seg,NULL,CUFF.22145.1
         (389 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G45830.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   320   1e-87
AT1G02290.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   121   7e-28
AT5G13950.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    73   3e-13
AT5G13950.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    73   3e-13
AT5G13950.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    73   4e-13

>AT3G45830.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G02290.1); Has 499 Blast hits to 438 proteins
           in 100 species: Archae - 0; Bacteria - 7; Metazoa - 236;
           Fungi - 15; Plants - 108; Viruses - 2; Other Eukaryotes
           - 131 (source: NCBI BLink). | chr3:16841277-16845173
           FORWARD LENGTH=1298
          Length = 1298

 Score =  320 bits (819), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 172/332 (51%), Positives = 210/332 (63%), Gaps = 15/332 (4%)

Query: 1   MAIEKNSFKVTRLDSEGSPLSRETMSSDEDEARRGNSAVXXXXXXXXXXXXXXXXX---X 57
           MAIEK++ KV+R D E S  S ++MSS E+  RR NS V                     
Sbjct: 1   MAIEKSNVKVSRFDLEYSHGSGDSMSSYEE--RRKNSVVNNVDSEDEDDDFDEDDSGAGS 58

Query: 58  XXXXLLELGETGAEFCQIGNQTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYL 117
               LLEL ETGAEFCQ+GN TCSIP ELYDL  LEDILSVDVWNE L+E+ER  L+ YL
Sbjct: 59  DDFDLLELAETGAEFCQVGNVTCSIPFELYDLPSLEDILSVDVWNECLTEKERFSLSSYL 118

Query: 118 PDVDQETFVQSLRELFTGCNLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYH 177
           PDVDQ TF+++L+ELF GCN HFGSPVKKLFDMLKGG CEPR  LY EG ++F + +HYH
Sbjct: 119 PDVDQLTFMRTLKELFEGCNFHFGSPVKKLFDMLKGGQCEPRNTLYLEGRSLFLRTKHYH 178

Query: 178 LLRKHQNNMVSSLCQIRDAWRNCRGYSIEERLRVLNITRSQKSLMYXXXXXXXXXXXXXX 237
            LRK+ N+MV +LCQ RDAW +C+GYSI+E+LRVLNI +SQK+LM               
Sbjct: 179 SLRKYHNDMVVNLCQTRDAWTSCKGYSIDEKLRVLNIVKSQKTLMREKKDDFEDDSSEKD 238

Query: 238 XXV--IWSRKNKDRKSAAQKTGRYPLHGVGSGLELQPRGRSAVLEQEKYGKQNPKGILKL 295
                 W RK KDRKS  +K  R+  +GV SGLE  PR + A +EQ+ YGK   K     
Sbjct: 239 EPFDKPWGRKGKDRKSTQKKLARHAGYGVDSGLEF-PRRQLAAVEQDLYGKPKSK----- 292

Query: 296 AGPKIPSAKDPTGNFSSAYHALDMNPGLNGSA 327
             PK P AK   G +++ Y+   MN   N S+
Sbjct: 293 --PKFPFAKTSVGPYATGYNGYGMNSAYNPSS 322


>AT1G02290.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G45830.1); Has 134 Blast hits to 134 proteins
           in 37 species: Archae - 0; Bacteria - 0; Metazoa - 54;
           Fungi - 0; Plants - 78; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:450646-451977 REVERSE
           LENGTH=443
          Length = 443

 Score =  121 bits (304), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 50/133 (37%), Positives = 85/133 (63%)

Query: 65  LGETGAEFCQIGNQTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYLPDVDQET 124
           + +   E   +  Q C+IP ELYDL  L  ILSV+ WN +L+EEER  L+ +LPD+D +T
Sbjct: 21  IAQVNCELALVEGQLCNIPYELYDLPDLTGILSVETWNSLLTEEERFFLSCFLPDMDPQT 80

Query: 125 FVQSLRELFTGCNLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYHLLRKHQN 184
           F  +++EL  G NL+FG+P  K +  L GGL  P+VA ++EG+   +++++Y+ L+ +  
Sbjct: 81  FSLTMQELLDGANLYFGNPEDKFYKNLLGGLFTPKVACFKEGVMFVKRRKYYYSLKFYHE 140

Query: 185 NMVSSLCQIRDAW 197
            ++ +  +++  W
Sbjct: 141 KLIRTFTEMQRVW 153


>AT5G13950.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G02290.1); Has 147 Blast hits to 145 proteins
           in 44 species: Archae - 0; Bacteria - 2; Metazoa - 56;
           Fungi - 6; Plants - 81; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr5:4496196-4500206 REVERSE
           LENGTH=939
          Length = 939

 Score = 72.8 bits (177), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 65/125 (52%), Gaps = 4/125 (3%)

Query: 78  QTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYLPD-VDQETFVQSLRELFTGC 136
           Q C +P E + L  L ++LS +VW   LS+ ER  L ++LP+ VD E  VQ+   L  G 
Sbjct: 85  QVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYLRQFLPEGVDVEQVVQA---LLDGE 141

Query: 137 NLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYHLLRKHQNNMVSSLCQIRDA 196
           N HFG+P       +  G   P   + RE      K+++Y  L K+  +++  L  +++ 
Sbjct: 142 NFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRYYSNLEKYHQDIIDYLQTLKEK 201

Query: 197 WRNCR 201
           W +C+
Sbjct: 202 WESCK 206


>AT5G13950.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G02290.1); Has 147 Blast hits to 145 proteins
           in 44 species: Archae - 0; Bacteria - 2; Metazoa - 56;
           Fungi - 6; Plants - 81; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr5:4496196-4500206 REVERSE
           LENGTH=939
          Length = 939

 Score = 72.8 bits (177), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 65/125 (52%), Gaps = 4/125 (3%)

Query: 78  QTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYLPD-VDQETFVQSLRELFTGC 136
           Q C +P E + L  L ++LS +VW   LS+ ER  L ++LP+ VD E  VQ+   L  G 
Sbjct: 85  QVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYLRQFLPEGVDVEQVVQA---LLDGE 141

Query: 137 NLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYHLLRKHQNNMVSSLCQIRDA 196
           N HFG+P       +  G   P   + RE      K+++Y  L K+  +++  L  +++ 
Sbjct: 142 NFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRYYSNLEKYHQDIIDYLQTLKEK 201

Query: 197 WRNCR 201
           W +C+
Sbjct: 202 WESCK 206


>AT5G13950.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 23 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT1G02290.1). | chr5:4496196-4500206 REVERSE
           LENGTH=954
          Length = 954

 Score = 72.8 bits (177), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 65/125 (52%), Gaps = 4/125 (3%)

Query: 78  QTCSIPLELYDLAGLEDILSVDVWNEILSEEERLELAKYLPD-VDQETFVQSLRELFTGC 136
           Q C +P E + L  L ++LS +VW   LS+ ER  L ++LP+ VD E  VQ+   L  G 
Sbjct: 85  QVCPVPHETFQLENLSEVLSNEVWRSCLSDGERNYLRQFLPEGVDVEQVVQA---LLDGE 141

Query: 137 NLHFGSPVKKLFDMLKGGLCEPRVALYREGLNVFQKKQHYHLLRKHQNNMVSSLCQIRDA 196
           N HFG+P       +  G   P   + RE      K+++Y  L K+  +++  L  +++ 
Sbjct: 142 NFHFGNPSLDWGTAVCSGKAHPDQIVSREECLRADKRRYYSNLEKYHQDIIDYLQTLKEK 201

Query: 197 WRNCR 201
           W +C+
Sbjct: 202 WESCK 206