Miyakogusa Predicted Gene

Lj1g3v3105130.2
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v3105130.2 Non Chatacterized Hit- tr|I1KWP5|I1KWP5_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.28833
PE,70.87,0,coiled-coil,NULL; seg,NULL,CUFF.30060.2
         (420 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G67170.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   281   4e-76
AT3G14750.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   151   8e-37
AT1G55170.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   132   5e-31
AT5G61920.2 | Symbols:  | unknown protein; INVOLVED IN: biologic...   100   1e-21
AT5G61920.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   100   1e-21
AT2G30120.2 | Symbols:  | unknown protein; EXPRESSED IN: 22 plan...    90   3e-18
AT2G30120.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    60   3e-09

>AT1G67170.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 5478 Blast hits to 4354 proteins
           in 533 species: Archae - 87; Bacteria - 653; Metazoa -
           2554; Fungi - 380; Plants - 418; Viruses - 16; Other
           Eukaryotes - 1370 (source: NCBI BLink). |
           chr1:25127727-25129145 FORWARD LENGTH=359
          Length = 359

 Score =  281 bits (720), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 151/280 (53%), Positives = 192/280 (68%), Gaps = 9/280 (3%)

Query: 24  QVLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQM 83
           +V+EQK  +QH E+QRLA ENQRL  THG+LRQ++AAAQHE+QMLHA    MK+EREQ+M
Sbjct: 56  EVMEQKFVAQHGELQRLAIENQRLGGTHGSLRQELAAAQHEIQMLHAQIGSMKSEREQRM 115

Query: 84  RAVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQ 143
             + +K+ KME +LQ +E VK+E+QQAR E +SL+V+REEL++K H L QE+Q+  +DVQ
Sbjct: 116 MGLAEKVAKMETELQKSEAVKLEMQQARAEARSLVVAREELMSKVHQLTQELQKSRSDVQ 175

Query: 144 QIPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTN 203
           QIPAL+SELE LRQEYQ CRAT++YEKK Y+DHLESLQ MEKNY++M+REVEKL+A+L N
Sbjct: 176 QIPALMSELENLRQEYQQCRATYDYEKKFYNDHLESLQAMEKNYMTMAREVEKLQAQLMN 235

Query: 204 KANVDQRSSGPYGGTSGTNENEASGLPVGQNAYEDGYAVAQGRAPLPPASSGGNTNIXXX 263
            AN D+R+ GPYG      E +ASG   G   YED +   QG  P P A +    N    
Sbjct: 236 NANSDRRAGGPYGNNINA-EIDASGHQSGNGYYEDAFG-PQGYIPQPVAGNATGPNSVVG 293

Query: 264 XXXXXXXXXXXXXYDAPRGPGYVAPTGPTYDAQRSGAYDP 303
                        Y  P+ PGY  P GP       G+YDP
Sbjct: 294 AAQYPYQGVTQPGY-FPQRPGYNFPRGP------PGSYDP 326


>AT3G14750.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G67170.1); Has 4036 Blast hits to 3091 proteins
           in 519 species: Archae - 61; Bacteria - 669; Metazoa -
           1503; Fungi - 255; Plants - 421; Viruses - 4; Other
           Eukaryotes - 1123 (source: NCBI BLink). |
           chr3:4953765-4955373 REVERSE LENGTH=331
          Length = 331

 Score =  151 bits (381), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 85/223 (38%), Positives = 138/223 (61%), Gaps = 8/223 (3%)

Query: 25  VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
           +LE +L +Q+ ++Q L  +NQRLAATH AL+Q++  AQHELQ +  +   ++AE E  MR
Sbjct: 69  ILEDRLAAQNQDVQGLLADNQRLAATHVALKQELEVAQHELQRIMHYIDSLRAEEEIMMR 128

Query: 85  AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
            + DK ++ E +L+  + ++ E+Q+ R +++     R+EL ++ H + Q++ R+ AD+QQ
Sbjct: 129 EMYDKSMRSEMELREVDAMRAEIQKIRADIKEFTSGRQELTSQVHLMTQDLARLTADLQQ 188

Query: 145 IPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTNK 204
           IP L +E+E  +QE Q  RA  +YEKK Y+++ E  + ME   V+M+RE+EKLRAE+ N 
Sbjct: 189 IPTLTAEIENTKQELQRARAAIDYEKKGYAENYEHGKIMEHKLVAMARELEKLRAEIAN- 247

Query: 205 ANVDQRSSGPYGG-----TSGTNENEASGLPVGQNAYEDGYAV 242
           +     ++GP G        G   N  +G PV  N Y+  Y +
Sbjct: 248 SETSAYANGPVGNPGGVAYGGGYGNPEAGYPV--NPYQPNYTM 288


>AT1G55170.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 13439 Blast hits to 8993
           proteins in 828 species: Archae - 344; Bacteria - 1469;
           Metazoa - 6958; Fungi - 1008; Plants - 683; Viruses -
           29; Other Eukaryotes - 2948 (source: NCBI BLink). |
           chr1:20580578-20581706 FORWARD LENGTH=283
          Length = 283

 Score =  132 bits (332), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 130/232 (56%), Gaps = 4/232 (1%)

Query: 24  QVLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQM 83
           Q+ E ++  Q  E++RL ++N  LA     L +++ AA+ EL  ++   + ++AE++ Q+
Sbjct: 46  QIQEGEIRRQDAEIRRLLSDNHGLADDRMVLERELVAAKEELHRMNLMISDLRAEQDLQL 105

Query: 84  RAVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQ 143
           R   +K  K+E D++A E  K E  Q R EVQ L   + EL      LR+++ ++ +D +
Sbjct: 106 REFSEKRHKLEGDVRAMESYKKEASQLRGEVQKLDEIKRELSGNVQLLRKDLAKLQSDNK 165

Query: 144 QIPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTN 203
           QIP + +E++ L++E  H R   EYEKK   + +E  Q MEKN VSM+REVEKLRAEL  
Sbjct: 166 QIPGMRAEVKDLQKELMHARDAIEYEKKEKFELMEQRQTMEKNMVSMAREVEKLRAEL-- 223

Query: 204 KANVDQRSSGPYGGTSGTNENEASGLPVGQNAYEDGYAVAQGRAPLPPASSG 255
            A VD R  G +GG+ G N N   G   G     D Y  +  R+      SG
Sbjct: 224 -ATVDSRPWG-FGGSYGMNYNNMDGTFRGSYGENDTYLGSSERSQYYSHGSG 273


>AT5G61920.2 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
           anthesis, F mature embryo stage, petal differentiation
           and expansion stage, E expanded cotyledon stage, D
           bilateral stage; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G67170.1); Has 30201 Blast
           hits to 17322 proteins in 780 species: Archae - 12;
           Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
           5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
          Length = 238

 Score =  100 bits (250), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/182 (34%), Positives = 107/182 (58%)

Query: 25  VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
           +LE K+  Q  E+ RL+ +N++LA+++ AL++D+  A  E+Q L AH    + + E Q+R
Sbjct: 52  ILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQIR 111

Query: 85  AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
           +  +KI KME  ++  E ++ E+Q A  E   L   REEL +K     +++++V  + + 
Sbjct: 112 STLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEAES 171

Query: 145 IPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTNK 204
           + A   ELE L++E+Q  R  FE EK    + L  L+GME+  +   + +EKLR+E++  
Sbjct: 172 LEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEISTA 231

Query: 205 AN 206
            N
Sbjct: 232 RN 233


>AT5G61920.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
           anthesis, F mature embryo stage, petal differentiation
           and expansion stage, E expanded cotyledon stage, D
           bilateral stage; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G67170.1); Has 1807 Blast
           hits to 1807 proteins in 277 species: Archae - 0;
           Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
           Viruses - 0; Other Eukaryotes - 339 (source: NCBI
           BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
          Length = 238

 Score =  100 bits (250), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/182 (34%), Positives = 107/182 (58%)

Query: 25  VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
           +LE K+  Q  E+ RL+ +N++LA+++ AL++D+  A  E+Q L AH    + + E Q+R
Sbjct: 52  ILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQIR 111

Query: 85  AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
           +  +KI KME  ++  E ++ E+Q A  E   L   REEL +K     +++++V  + + 
Sbjct: 112 STLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEAES 171

Query: 145 IPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTNK 204
           + A   ELE L++E+Q  R  FE EK    + L  L+GME+  +   + +EKLR+E++  
Sbjct: 172 LEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEISTA 231

Query: 205 AN 206
            N
Sbjct: 232 RN 233


>AT2G30120.2 | Symbols:  | unknown protein; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 2527 Blast hits to 2101 proteins
           in 358 species: Archae - 77; Bacteria - 245; Metazoa -
           1087; Fungi - 215; Plants - 350; Viruses - 4; Other
           Eukaryotes - 549 (source: NCBI BLink). |
           chr2:12860607-12861862 REVERSE LENGTH=288
          Length = 288

 Score = 89.7 bits (221), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 60/179 (33%), Positives = 95/179 (53%)

Query: 25  VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
           +LE ++  QH E+Q L  +NQRLA  H  L+  +  A+ EL+ L      +KAE E ++R
Sbjct: 38  ILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEAKVR 97

Query: 85  AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
            V    ++MEA+ +  + +  EL Q R +VQ L   R+EL  +      E+ +   +  +
Sbjct: 98  EVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPNSDR 157

Query: 145 IPALLSELEGLRQEYQHCRATFEYEKKLYSDHLESLQGMEKNYVSMSREVEKLRAELTN 203
              +  E+E LR E +  RA  E EKK  + +L   +GMEK    ++RE+ KL  EL +
Sbjct: 158 AIEVKLEIEILRGEIRKGRAALELEKKTRASNLHHERGMEKTIDHLNREIVKLEEELVD 216


>AT2G30120.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 275 Blast hits to 241 proteins
           in 42 species: Archae - 4; Bacteria - 15; Metazoa - 15;
           Fungi - 4; Plants - 188; Viruses - 0; Other Eukaryotes -
           49 (source: NCBI BLink). | chr2:12861332-12861862
           REVERSE LENGTH=176
          Length = 176

 Score = 59.7 bits (143), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 72/139 (51%)

Query: 25  VLEQKLESQHVEMQRLATENQRLAATHGALRQDVAAAQHELQMLHAHTAGMKAEREQQMR 84
           +LE ++  QH E+Q L  +NQRLA  H  L+  +  A+ EL+ L      +KAE E ++R
Sbjct: 38  ILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEAKVR 97

Query: 85  AVQDKIVKMEADLQAAEPVKMELQQARREVQSLMVSREELIAKAHHLRQEIQRVHADVQQ 144
            V    ++MEA+ +  + +  EL Q R +VQ L   R+EL  +      E+ +   +  +
Sbjct: 98  EVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPNSDR 157

Query: 145 IPALLSELEGLRQEYQHCR 163
              +  E+E LR E +  R
Sbjct: 158 AIEVKLEIEILRGEIRKGR 176