Miyakogusa Predicted Gene

Lj4g3v2021640.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj4g3v2021640.1 Non Chatacterized Hit- tr|I3SVM3|I3SVM3_LOTJA
Uncharacterized protein OS=Lotus japonicus PE=2
SV=1,99.64,0,coiled-coil,NULL; seg,NULL,CUFF.50144.1
         (304 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G67170.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   154   6e-38
AT5G61920.2 | Symbols:  | unknown protein; INVOLVED IN: biologic...   140   8e-34
AT5G61920.1 | Symbols:  | unknown protein; INVOLVED IN: biologic...   140   8e-34
AT3G14750.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   135   5e-32
AT1G55170.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   114   7e-26
AT2G30120.2 | Symbols:  | unknown protein; EXPRESSED IN: 22 plan...    74   1e-13
AT2G30120.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    61   1e-09

>AT1G67170.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 5478 Blast hits to 4354 proteins
           in 533 species: Archae - 87; Bacteria - 653; Metazoa -
           2554; Fungi - 380; Plants - 418; Viruses - 16; Other
           Eukaryotes - 1370 (source: NCBI BLink). |
           chr1:25127727-25129145 FORWARD LENGTH=359
          Length = 359

 Score =  154 bits (390), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 162/281 (57%), Gaps = 27/281 (9%)

Query: 1   MDARGKV-PTSSFERRLVQASG--------MMRHGQLPGLXXXXXXXXXXXXXXHRSLES 51
           M+++G++ P+    RR +   G           HG +P                + S   
Sbjct: 1   MESKGRIHPSHHHMRRPLPGPGGCIAHPETFGNHGAIP---------PSAAQGVYPSFNM 51

Query: 52  LPQSHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTES 111
           LP   ++E K   Q  E++RLA +N+RL  TH +LR +L +A  ++Q L + I S+++E 
Sbjct: 52  LPPPEVMEQKFVAQHGELQRLAIENQRLGGTHGSLRQELAAAQHEIQMLHAQIGSMKSER 111

Query: 112 DIQIRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAI 171
           + ++  L +K+AKME +++  ++V+ ++QQA  EA+SL  +R+EL +++ +  QE++K+ 
Sbjct: 112 EQRMMGLAEKVAKMETELQKSEAVKLEMQQARAEARSLVVAREELMSKVHQLTQELQKSR 171

Query: 172 SDVKSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQA 231
           SDV+ +P L +EL++L QE Q+ R+T++YEK    + ++ ++  EKN + MAREVE LQA
Sbjct: 172 SDVQQIPALMSELENLRQEYQQCRATYDYEKKFYNDHLESLQAMEKNYMTMAREVEKLQA 231

Query: 232 EIL---NAEKRANAPNLFRATTPVD------GSGSFSDPYG 263
           +++   N+++RA  P        +D      G+G + D +G
Sbjct: 232 QLMNNANSDRRAGGPYGNNINAEIDASGHQSGNGYYEDAFG 272


>AT5G61920.2 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
           anthesis, F mature embryo stage, petal differentiation
           and expansion stage, E expanded cotyledon stage, D
           bilateral stage; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G67170.1); Has 30201 Blast
           hits to 17322 proteins in 780 species: Archae - 12;
           Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants -
           5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI
           BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
          Length = 238

 Score =  140 bits (354), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 79/186 (42%), Positives = 128/186 (68%)

Query: 55  SHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDIQ 114
           S +LE+K+A Q  EI+RL+ DNR+LA ++VAL++DL  A ++VQ L++HIR  +T+ +IQ
Sbjct: 50  SDILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQ 109

Query: 115 IRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAISDV 174
           IR  L+KIAKME  ++  +++R+++Q A IEA  LA  R+EL+++++   +++KK   + 
Sbjct: 110 IRSTLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEA 169

Query: 175 KSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAEIL 234
           +SL     EL+ L +E QRLR  FE EKS N+E + Q+K  E+ +I   + +E L++EI 
Sbjct: 170 ESLEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEIS 229

Query: 235 NAEKRA 240
            A  +A
Sbjct: 230 TARNKA 235


>AT5G61920.1 | Symbols:  | unknown protein; INVOLVED IN:
           biological_process unknown; LOCATED IN: chloroplast;
           EXPRESSED IN: 6 plant structures; EXPRESSED DURING: 4
           anthesis, F mature embryo stage, petal differentiation
           and expansion stage, E expanded cotyledon stage, D
           bilateral stage; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT1G67170.1); Has 1807 Blast
           hits to 1807 proteins in 277 species: Archae - 0;
           Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
           Viruses - 0; Other Eukaryotes - 339 (source: NCBI
           BLink). | chr5:24864830-24865628 FORWARD LENGTH=238
          Length = 238

 Score =  140 bits (354), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 79/186 (42%), Positives = 128/186 (68%)

Query: 55  SHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDIQ 114
           S +LE+K+A Q  EI+RL+ DNR+LA ++VAL++DL  A ++VQ L++HIR  +T+ +IQ
Sbjct: 50  SDILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQ 109

Query: 115 IRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAISDV 174
           IR  L+KIAKME  ++  +++R+++Q A IEA  LA  R+EL+++++   +++KK   + 
Sbjct: 110 IRSTLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEA 169

Query: 175 KSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAEIL 234
           +SL     EL+ L +E QRLR  FE EKS N+E + Q+K  E+ +I   + +E L++EI 
Sbjct: 170 ESLEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEIS 229

Query: 235 NAEKRA 240
            A  +A
Sbjct: 230 TARNKA 235


>AT3G14750.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G67170.1); Has 4036 Blast hits to 3091 proteins
           in 519 species: Archae - 61; Bacteria - 669; Metazoa -
           1503; Fungi - 255; Plants - 421; Viruses - 4; Other
           Eukaryotes - 1123 (source: NCBI BLink). |
           chr3:4953765-4955373 REVERSE LENGTH=331
          Length = 331

 Score =  135 bits (339), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 123/190 (64%)

Query: 53  PQSHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESD 112
           PQ  +LED++A Q ++++ L  DN+RLA THVAL+ +L  A  ++Q++  +I S++ E +
Sbjct: 65  PQFSILEDRLAAQNQDVQGLLADNQRLAATHVALKQELEVAQHELQRIMHYIDSLRAEEE 124

Query: 113 IQIRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAIS 172
           I +R + DK  + E+++R  D++R ++Q+   + +   + RQEL++++    Q++ +  +
Sbjct: 125 IMMREMYDKSMRSEMELREVDAMRAEIQKIRADIKEFTSGRQELTSQVHLMTQDLARLTA 184

Query: 173 DVKSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAE 232
           D++ +P L AE+++  QE QR R+  +YEK    E  +  KI E  L+AMARE+E L+AE
Sbjct: 185 DLQQIPTLTAEIENTKQELQRARAAIDYEKKGYAENYEHGKIMEHKLVAMARELEKLRAE 244

Query: 233 ILNAEKRANA 242
           I N+E  A A
Sbjct: 245 IANSETSAYA 254


>AT1G55170.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 13439 Blast hits to 8993
           proteins in 828 species: Archae - 344; Bacteria - 1469;
           Metazoa - 6958; Fungi - 1008; Plants - 683; Viruses -
           29; Other Eukaryotes - 2948 (source: NCBI BLink). |
           chr1:20580578-20581706 FORWARD LENGTH=283
          Length = 283

 Score =  114 bits (286), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 119/208 (57%), Gaps = 6/208 (2%)

Query: 59  EDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDIQIRVL 118
           E ++  Q+ EI RL  DN  LA   + L  +LV+A +++ ++   I  ++ E D+Q+R  
Sbjct: 49  EGEIRRQDAEIRRLLSDNHGLADDRMVLERELVAAKEELHRMNLMISDLRAEQDLQLREF 108

Query: 119 LDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAISDVKSLP 178
            +K  K+E D+RA +S +K+  Q   E Q L   ++ELS  +Q   +++ K  SD K +P
Sbjct: 109 SEKRHKLEGDVRAMESYKKEASQLRGEVQKLDEIKRELSGNVQLLRKDLAKLQSDNKQIP 168

Query: 179 DLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAEILNAEK 238
            ++AE+ DL +E    R   EYEK +  EL++Q +  EKN+++MAREVE L+AE+   + 
Sbjct: 169 GMRAEVKDLQKELMHARDAIEYEKKEKFELMEQRQTMEKNMVSMAREVEKLRAELATVDS 228

Query: 239 RANAPNLFRATTPVDGS---GSFSDPYG 263
           R   P  F  +  ++ +   G+F   YG
Sbjct: 229 R---PWGFGGSYGMNYNNMDGTFRGSYG 253


>AT2G30120.2 | Symbols:  | unknown protein; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 2527 Blast hits to 2101 proteins
           in 358 species: Archae - 77; Bacteria - 245; Metazoa -
           1087; Fungi - 215; Plants - 350; Viruses - 4; Other
           Eukaryotes - 549 (source: NCBI BLink). |
           chr2:12860607-12861862 REVERSE LENGTH=288
          Length = 288

 Score = 73.9 bits (180), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 106/210 (50%), Gaps = 1/210 (0%)

Query: 54  QSHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDI 113
           +S +LED++A Q  EI+ L  DN+RLA  H+ L+D L  A +++++L      ++ E + 
Sbjct: 35  RSVILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEA 94

Query: 114 QIRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKAISD 173
           ++R +     +ME + R  D +  +L Q   + Q L + RQEL+ E+     E+ KA  +
Sbjct: 95  KVREVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPN 154

Query: 174 VKSLPDLQAELDDLVQERQRLRSTFEYEKSKNIELVDQMKIKEKNLIAMAREVEVLQAEI 233
                +++ E++ L  E ++ R+  E EK      +   +  EK +  + RE+  L+ E+
Sbjct: 155 SDRAIEVKLEIEILRGEIRKGRAALELEKKTRASNLHHERGMEKTIDHLNREIVKLEEEL 214

Query: 234 LNAEKRANAPNLFRATTPVDGSGSFSDPYG 263
           ++ E +A   N      P    G  +  YG
Sbjct: 215 VDLETKAREANAAAEAAPTPSPG-LAASYG 243


>AT2G30120.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G14750.1); Has 275 Blast hits to 241 proteins
           in 42 species: Archae - 4; Bacteria - 15; Metazoa - 15;
           Fungi - 4; Plants - 188; Viruses - 0; Other Eukaryotes -
           49 (source: NCBI BLink). | chr2:12861332-12861862
           REVERSE LENGTH=176
          Length = 176

 Score = 60.8 bits (146), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 66/117 (56%)

Query: 54  QSHLLEDKVADQEEEIERLAGDNRRLAKTHVALRDDLVSAAQDVQKLKSHIRSIQTESDI 113
           +S +LED++A Q  EI+ L  DN+RLA  H+ L+D L  A +++++L      ++ E + 
Sbjct: 35  RSVILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEA 94

Query: 114 QIRVLLDKIAKMEVDIRAGDSVRKDLQQALIEAQSLAASRQELSAEIQRAAQEVKKA 170
           ++R +     +ME + R  D +  +L Q   + Q L + RQEL+ E+     E+ KA
Sbjct: 95  KVREVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKA 151