Miyakogusa Predicted Gene

Lj3g3v0917050.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj3g3v0917050.1 Non Chatacterized Hit- tr|I1KTH6|I1KTH6_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.10814
PE,74.13,0,seg,NULL; coiled-coil,NULL,CUFF.41617.1
         (327 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G14750.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   216   1e-56
AT1G67170.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   120   2e-27
AT1G55170.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   110   9e-25
AT2G30120.2 | Symbols:  | unknown protein; EXPRESSED IN: 22 plan...    90   2e-18
AT5G61920.2 | Symbols:  | unknown protein; INVOLVED IN: biologic...    65   8e-11
AT5G61920.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...    65   8e-11
AT2G30120.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    62   7e-10

>AT3G14750.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G67170.1); Has 4036 Blast hits to 3091 proteins
           in 519 species: Archae - 61; Bacteria - 669; Metazoa -
           1503; Fungi - 255; Plants - 421; Viruses - 4; Other
           Eukaryotes - 1123 (source: NCBI BLink). |
           chr3:4953765-4955373 REVERSE LENGTH=331
          Length = 331

 Score =  216 bits (551), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 122/256 (47%), Positives = 156/256 (60%), Gaps = 10/256 (3%)

Query: 1   MSGRNRGXXXXXX---XXAGLSPPVHDPMFGARGGGAIXXXXXXXXXXXXXAFLEELRES 57
           MSGRNRG           +GL  PVH P F                     + +++ RE 
Sbjct: 1   MSGRNRGPPPPSMKGGSYSGLQAPVHQPPF------VRGLGGGPVPPPPHPSMIDDSREP 54

Query: 58  QXXXXXXXXXXXXAAVIEERLAAQHGEIQGLLGDNQSLAATHVALKQEQEAAQHELQRMA 117
           Q            + ++E+RLAAQ+ ++QGLL DNQ LAATHVALKQE E AQHELQR+ 
Sbjct: 55  QFRVDARGLPPQFS-ILEDRLAAQNQDVQGLLADNQRLAATHVALKQELEVAQHELQRIM 113

Query: 118 HFNESLRADTEGRMRELCDKSXXXXXXXXXXXXXXXXXXXXXXDIKELTVVRQDLSGQVQ 177
           H+ +SLRA+ E  MRE+ DKS                      DIKE T  RQ+L+ QV 
Sbjct: 114 HYIDSLRAEEEIMMREMYDKSMRSEMELREVDAMRAEIQKIRADIKEFTSGRQELTSQVH 173

Query: 178 AMTHDLGRMNNDLKQVPALKADVEAMRQELQRARAAIEYEKKGFAENYEHGQVMEKKLIS 237
            MT DL R+  DL+Q+P L A++E  +QELQRARAAI+YEKKG+AENYEHG++ME KL++
Sbjct: 174 LMTQDLARLTADLQQIPTLTAEIENTKQELQRARAAIDYEKKGYAENYEHGKIMEHKLVA 233

Query: 238 MAREMEKLRAEISNAE 253
           MARE+EKLRAEI+N+E
Sbjct: 234 MARELEKLRAEIANSE 249


>AT1G67170.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 5478 Blast hits to 4354 proteins
           in 533 species: Archae - 87; Bacteria - 653; Metazoa -
           2554; Fungi - 380; Plants - 418; Viruses - 16; Other
           Eukaryotes - 1370 (source: NCBI BLink). |
           chr1:25127727-25129145 FORWARD LENGTH=359
          Length = 359

 Score =  120 bits (300), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 107/179 (59%)

Query: 73  VIEERLAAQHGEIQGLLGDNQSLAATHVALKQEQEAAQHELQRMAHFNESLRADTEGRMR 132
           V+E++  AQHGE+Q L  +NQ L  TH +L+QE  AAQHE+Q +     S++++ E RM 
Sbjct: 57  VMEQKFVAQHGELQRLAIENQRLGGTHGSLRQELAAAQHEIQMLHAQIGSMKSEREQRMM 116

Query: 133 ELCDKSXXXXXXXXXXXXXXXXXXXXXXDIKELTVVRQDLSGQVQAMTHDLGRMNNDLKQ 192
            L +K                       + + L V R++L  +V  +T +L +  +D++Q
Sbjct: 117 GLAEKVAKMETELQKSEAVKLEMQQARAEARSLVVAREELMSKVHQLTQELQKSRSDVQQ 176

Query: 193 VPALKADVEAMRQELQRARAAIEYEKKGFAENYEHGQVMEKKLISMAREMEKLRAEISN 251
           +PAL +++E +RQE Q+ RA  +YEKK + ++ E  Q MEK  ++MARE+EKL+A++ N
Sbjct: 177 IPALMSELENLRQEYQQCRATYDYEKKFYNDHLESLQAMEKNYMTMAREVEKLQAQLMN 235


>AT1G55170.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 13439 Blast hits to 8993
           proteins in 828 species: Archae - 344; Bacteria - 1469;
           Metazoa - 6958; Fungi - 1008; Plants - 683; Viruses -
           29; Other Eukaryotes - 2948 (source: NCBI BLink). |
           chr1:20580578-20581706 FORWARD LENGTH=283
          Length = 283

 Score =  110 bits (276), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 65/173 (37%), Positives = 101/173 (58%)

Query: 81  QHGEIQGLLGDNQSLAATHVALKQEQEAAQHELQRMAHFNESLRADTEGRMRELCDKSXX 140
           Q  EI+ LL DN  LA   + L++E  AA+ EL RM      LRA+ + ++RE  +K   
Sbjct: 55  QDAEIRRLLSDNHGLADDRMVLERELVAAKEELHRMNLMISDLRAEQDLQLREFSEKRHK 114

Query: 141 XXXXXXXXXXXXXXXXXXXXDIKELTVVRQDLSGQVQAMTHDLGRMNNDLKQVPALKADV 200
                               ++++L  ++++LSG VQ +  DL ++ +D KQ+P ++A+V
Sbjct: 115 LEGDVRAMESYKKEASQLRGEVQKLDEIKRELSGNVQLLRKDLAKLQSDNKQIPGMRAEV 174

Query: 201 EAMRQELQRARAAIEYEKKGFAENYEHGQVMEKKLISMAREMEKLRAEISNAE 253
           + +++EL  AR AIEYEKK   E  E  Q MEK ++SMARE+EKLRAE++  +
Sbjct: 175 KDLQKELMHARDAIEYEKKEKFELMEQRQTMEKNMVSMAREVEKLRAELATVD 227


>AT2G30120.2 | Symbols:  | unknown protein; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 2527 Blast hits to 2101 proteins
           in 358 species: Archae - 77; Bacteria - 245; Metazoa -
           1087; Fungi - 215; Plants - 350; Viruses - 4; Other
           Eukaryotes - 549 (source: NCBI BLink). |
           chr2:12860607-12861862 REVERSE LENGTH=288
          Length = 288

 Score = 89.7 bits (221), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 96/183 (52%)

Query: 71  AAVIEERLAAQHGEIQGLLGDNQSLAATHVALKQEQEAAQHELQRMAHFNESLRADTEGR 130
           + ++E+R+A QH EIQ LL DNQ LA  H+ LK +   A+ EL+R+      ++A+ E +
Sbjct: 36  SVILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEAK 95

Query: 131 MRELCDKSXXXXXXXXXXXXXXXXXXXXXXDIKELTVVRQDLSGQVQAMTHDLGRMNNDL 190
           +RE+   +                      D++ L   RQ+L+ ++     ++ +   + 
Sbjct: 96  VREVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPNS 155

Query: 191 KQVPALKADVEAMRQELQRARAAIEYEKKGFAENYEHGQVMEKKLISMAREMEKLRAEIS 250
            +   +K ++E +R E+++ RAA+E EKK  A N  H + MEK +  + RE+ KL  E+ 
Sbjct: 156 DRAIEVKLEIEILRGEIRKGRAALELEKKTRASNLHHERGMEKTIDHLNREIVKLEEELV 215

Query: 251 NAE 253
           + E
Sbjct: 216 DLE 218


>AT5G61920.2 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
           anthesis, F mature embryo stage, petal differentiation
           and expansion stage, E expanded cotyledon stage, D
           bilateral stage; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G67170.1); Has 30201 Blast
           hits to 17322 proteins in 780 species: Archae - 12;
           Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
           5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
          Length = 238

 Score = 64.7 bits (156), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 52/181 (28%), Positives = 91/181 (50%), Gaps = 2/181 (1%)

Query: 73  VIEERLAAQHGEIQGLLGDNQSLAATHVALKQEQEAAQHELQRM-AHFNESLRADTEGRM 131
           ++E ++A Q  EI  L  DN+ LA+++VALK++   A  E+Q + AH  ++   D E ++
Sbjct: 52  ILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKT-ETDHEIQI 110

Query: 132 RELCDKSXXXXXXXXXXXXXXXXXXXXXXDIKELTVVRQDLSGQVQAMTHDLGRMNNDLK 191
           R   +K                       +   L   R++L+ +V+    DL ++  + +
Sbjct: 111 RSTLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEAE 170

Query: 192 QVPALKADVEAMRQELQRARAAIEYEKKGFAENYEHGQVMEKKLISMAREMEKLRAEISN 251
            + A   ++E +++E QR R   E EK G  E     + ME+K+I   + +EKLR+EIS 
Sbjct: 171 SLEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEIST 230

Query: 252 A 252
           A
Sbjct: 231 A 231


>AT5G61920.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
           anthesis, F mature embryo stage, petal differentiation
           and expansion stage, E expanded cotyledon stage, D
           bilateral stage; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G67170.1); Has 1807 Blast
           hits to 1807 proteins in 277 species: Archae - 0;
           Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
           Viruses - 0; Other Eukaryotes - 339 (source: NCBI
           BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
          Length = 238

 Score = 64.7 bits (156), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 52/181 (28%), Positives = 91/181 (50%), Gaps = 2/181 (1%)

Query: 73  VIEERLAAQHGEIQGLLGDNQSLAATHVALKQEQEAAQHELQRM-AHFNESLRADTEGRM 131
           ++E ++A Q  EI  L  DN+ LA+++VALK++   A  E+Q + AH  ++   D E ++
Sbjct: 52  ILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKT-ETDHEIQI 110

Query: 132 RELCDKSXXXXXXXXXXXXXXXXXXXXXXDIKELTVVRQDLSGQVQAMTHDLGRMNNDLK 191
           R   +K                       +   L   R++L+ +V+    DL ++  + +
Sbjct: 111 RSTLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEAE 170

Query: 192 QVPALKADVEAMRQELQRARAAIEYEKKGFAENYEHGQVMEKKLISMAREMEKLRAEISN 251
            + A   ++E +++E QR R   E EK G  E     + ME+K+I   + +EKLR+EIS 
Sbjct: 171 SLEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEIST 230

Query: 252 A 252
           A
Sbjct: 231 A 231


>AT2G30120.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 275 Blast hits to 241 proteins
           in 42 species: Archae - 4; Bacteria - 15; Metazoa - 15;
           Fungi - 4; Plants - 188; Viruses - 0; Other Eukaryotes -
           49 (source: NCBI BLink). | chr2:12861332-12861862
           REVERSE LENGTH=176
          Length = 176

 Score = 61.6 bits (148), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 71/141 (50%)

Query: 71  AAVIEERLAAQHGEIQGLLGDNQSLAATHVALKQEQEAAQHELQRMAHFNESLRADTEGR 130
           + ++E+R+A QH EIQ LL DNQ LA  H+ LK +   A+ EL+R+      ++A+ E +
Sbjct: 36  SVILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEAK 95

Query: 131 MRELCDKSXXXXXXXXXXXXXXXXXXXXXXDIKELTVVRQDLSGQVQAMTHDLGRMNNDL 190
           +RE+   +                      D++ L   RQ+L+ ++     ++ +   + 
Sbjct: 96  VREVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPNS 155

Query: 191 KQVPALKADVEAMRQELQRAR 211
            +   +K ++E +R E+++ R
Sbjct: 156 DRAIEVKLEIEILRGEIRKGR 176