Miyakogusa Predicted Gene

Lj0g3v0197229.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0197229.1 Non Characterized Hit- tr|I1M029|I1M029_SOYBN
Uncharacterized protein (Fragment) OS=Glycine max PE=4,83.02,9e-19,NHL
REPEAT-CONTAINING PROTEIN,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.12481.1
         (362 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G70280.2 | Symbols:  | NHL domain-containing protein | chr1:2...   234   5e-62
AT1G70280.1 | Symbols:  | NHL domain-containing protein | chr1:2...   234   5e-62
AT5G14890.1 | Symbols:  | NHL domain-containing protein | chr5:4...   202   4e-52
AT1G23880.1 | Symbols:  | NHL domain-containing protein | chr1:8...   187   1e-47
AT3G14860.1 | Symbols:  | NHL domain-containing protein | chr3:4...    96   5e-20
AT3G14860.2 | Symbols:  | NHL domain-containing protein | chr3:4...    96   5e-20
AT1G23890.1 | Symbols:  | NHL domain-containing protein | chr1:8...    63   3e-10
AT1G23890.2 | Symbols:  | NHL domain-containing protein | chr1:8...    62   5e-10
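
The summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself; it assumes each hit line ends with two whitespace-separated numbers (bit score, then E-value), as in this report. The names `Hit`, `parse_summary_line`, and `parse_hits` are hypothetical helpers introduced for illustration.

```python
from typing import Iterable, List, NamedTuple, Optional


class Hit(NamedTuple):
    subject_id: str     # e.g. "AT1G70280.2"
    description: str    # remainder of the (possibly truncated) definition line
    bit_score: float
    e_value: float


def parse_summary_line(line: str) -> Optional[Hit]:
    """Parse one line of the 'Sequences producing significant alignments' table.

    Returns None for headers, blank lines, or anything that does not end
    in two numeric fields.
    """
    parts = line.rsplit(None, 2)  # split off the last two whitespace-separated fields
    if len(parts) != 3:
        return None
    head, score_text, evalue_text = parts
    # Assumption: some legacy BLAST reports print E-values such as "e-100"
    # without a leading digit; treat those as "1e-100".
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    try:
        bit_score = float(score_text)
        e_value = float(evalue_text)
    except ValueError:
        return None
    subject_id, _, description = head.partition(" ")
    return Hit(subject_id, description.strip(), bit_score, e_value)


def parse_hits(lines: Iterable[str], max_e_value: float = 1e-5) -> List[Hit]:
    """Collect hits whose E-value is at or below max_e_value (an arbitrary default)."""
    hits = []
    for line in lines:
        hit = parse_summary_line(line)
        if hit is not None and hit.e_value <= max_e_value:
            hits.append(hit)
    return hits
```

Applied to the eight lines of the table, `parse_hits(table_lines, max_e_value=1e-40)` would keep only the four hits with E-values of 1e-47 or smaller. For anything beyond a quick look, a dedicated BLAST parser is likely a safer choice than this line-based sketch.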

>AT1G70280.2 | Symbols:  | NHL domain-containing protein |
           chr1:26466086-26468471 REVERSE LENGTH=509
          Length = 509

 Score =  234 bits (598), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 156/366 (42%), Positives = 196/366 (53%), Gaps = 30/366 (8%)

Query: 2   AIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQAI 61
           AIRKIS+ G+TTIA              PSEDAKFSNDFDVVY G SCSLLV+DRGN+AI
Sbjct: 165 AIRKISEGGVTTIAGGKTVRNGGHVDG-PSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAI 223

Query: 62  REIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPDDPRG 121
           REIQLH DDC       Y   S F LGIAVL+A GFFGYMLALLQ RV ++ SS +D   
Sbjct: 224 REIQLHFDDCA------YQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGSIVSSHNDQEM 277

Query: 122 PLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSSSMGE 181
                   F A+  QK                +++ +KQ+E F VSLG+L+ N+  S+ E
Sbjct: 278 --------FEADPDQK---PMKHSRPSLIPAGDEQLEKQEETFVVSLGKLVSNAWESVME 326

Query: 182 ILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGDEPPGIEARTPT 241
           IL       ++K                A   +S PWP+QESFVI D D PP +E R PT
Sbjct: 327 IL-------RKKQTGTSFQQYHGTTKQSAAFSTSTPWPIQESFVIRDEDGPPPVEPRNPT 379

Query: 242 LRKPYPFMPNEIEKPQQFKQTQGYLNRWDDGGYDEXXXXXXXXXXXXXXXXXXXVQNR-Y 300
            RK Y FM  + EK QQ +Q++ + + WD    ++                      R Y
Sbjct: 380 PRKTYAFMSKDAEKMQQLRQSRAFYSSWDAEFPNQQQQQQKQHQKHQHQQQQQQQHRRHY 439

Query: 301 SSTPQGYYEQNRE-TNEIVFGAVQEHDGRREAM-VIKAVDYGDP--KFSHQNVRPRLNYV 356
           SS P  YYEQ+ E +NEIVFGAVQE   +R A    K ++ GD     + QN+  R + V
Sbjct: 440 SSIPHTYYEQDSEKSNEIVFGAVQEQSSKRVAKPKPKPIESGDQMNNNTQQNLHYRSHSV 499

Query: 357 GYSHGY 362
            Y +GY
Sbjct: 500 SYPYGY 505


>AT1G70280.1 | Symbols:  | NHL domain-containing protein |
           chr1:26466086-26468116 REVERSE LENGTH=447
          Length = 447

 Score =  234 bits (598), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 157/366 (42%), Positives = 197/366 (53%), Gaps = 30/366 (8%)

Query: 2   AIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQAI 61
           AIRKIS+ G+TTIA              PSEDAKFSNDFDVVY G SCSLLV+DRGN+AI
Sbjct: 103 AIRKISEGGVTTIAGGKTVRNGGHVDG-PSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAI 161

Query: 62  REIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPDDPRG 121
           REIQLH DDC       Y   S F LGIAVL+A GFFGYMLALLQ RV ++ SS +D   
Sbjct: 162 REIQLHFDDCA------YQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGSIVSSHNDQEM 215

Query: 122 PLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSSSMGE 181
                   F A+  QK                +++ +KQ+E F VSLG+L+ N+  S+ E
Sbjct: 216 --------FEADPDQK---PMKHSRPSLIPAGDEQLEKQEETFVVSLGKLVSNAWESVME 264

Query: 182 ILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGDEPPGIEARTPT 241
           IL    TG+  +                A   +S PWP+QESFVI D D PP +E R PT
Sbjct: 265 ILRKKQTGTSFQ-------QYHGTTKQSAAFSTSTPWPIQESFVIRDEDGPPPVEPRNPT 317

Query: 242 LRKPYPFMPNEIEKPQQFKQTQGYLNRWDDGGYDEXXXXXXXXXXXXXXXXXXXVQNR-Y 300
            RK Y FM  + EK QQ +Q++ + + WD    ++                      R Y
Sbjct: 318 PRKTYAFMSKDAEKMQQLRQSRAFYSSWDAEFPNQQQQQQKQHQKHQHQQQQQQQHRRHY 377

Query: 301 SSTPQGYYEQNRE-TNEIVFGAVQEHDGRREAM-VIKAVDYGDP--KFSHQNVRPRLNYV 356
           SS P  YYEQ+ E +NEIVFGAVQE   +R A    K ++ GD     + QN+  R + V
Sbjct: 378 SSIPHTYYEQDSEKSNEIVFGAVQEQSSKRVAKPKPKPIESGDQMNNNTQQNLHYRSHSV 437

Query: 357 GYSHGY 362
            Y +GY
Sbjct: 438 SYPYGY 443


>AT5G14890.1 | Symbols:  | NHL domain-containing protein |
           chr5:4818056-4821534 FORWARD LENGTH=754
          Length = 754

 Score =  202 bits (513), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 137/359 (38%), Positives = 182/359 (50%), Gaps = 53/359 (14%)

Query: 1   MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
           MAIRKISD+G++TIA                E  +FS+DFD++Y   SCSLLV+DRGNQ 
Sbjct: 173 MAIRKISDDGVSTIAAGGRWSGGSK-----EESMRFSDDFDLIYVSSSCSLLVIDRGNQL 227

Query: 61  IREIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPDDPR 120
           I+EIQLH  DC++    E D DS  HLG A+L+A  FFGYMLALL  RVR++FSS     
Sbjct: 228 IKEIQLHDHDCSQ---PEPDTDS-LHLGTALLVAAVFFGYMLALLVRRVRSLFSSSSHDT 283

Query: 121 GPLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSSSMG 180
              R   TP       +R                +    ++EGF  SLG+L+V + SS+ 
Sbjct: 284 KSKRHVATPSMTMAPYQRYPRPVRQPLIPPQHESE----KEEGFLGSLGKLVVKTGSSVS 339

Query: 181 EILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGDEPPGIEARTP 240
           E++    +GS+    P              ++   N WP+QESF IP+ D PP +E R+ 
Sbjct: 340 EMM----SGSRNVIPPNFHQYH--------HQQEPNQWPVQESFAIPEEDGPPALEPRSG 387

Query: 241 TLRKPYPFMPNEIEKPQQFKQTQGYLNRWDDGGYDEXXXXXXXXXXXXXXXXXXXVQNRY 300
           T            +KP  + + QG                                Q + 
Sbjct: 388 T----------NPDKP--YLRAQG----------------TNQNRSYYQDYDQYQNQQKR 419

Query: 301 SSTPQGYYEQNRETNEIVFGAVQEHDGRREAMVIKAVDYGDPKFSHQNVRPRLNYVGYS 359
           +      +E NRE NEIVFGAVQE DGRREAMVIKAVD+ +     +N+RPR+NY+GYS
Sbjct: 420 NVNDTASFEDNREKNEIVFGAVQEQDGRREAMVIKAVDFNEAINDQRNLRPRINYMGYS 478


>AT1G23880.1 | Symbols:  | NHL domain-containing protein |
           chr1:8436125-8438636 FORWARD LENGTH=545
          Length = 545

 Score =  187 bits (474), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 143/375 (38%), Positives = 179/375 (47%), Gaps = 72/375 (19%)

Query: 2   AIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQAI 61
           AIRKIS+ G+TTIA              PSEDAKFSNDFDVVY G SCSLLV+DRGNQAI
Sbjct: 227 AIRKISEAGVTTIAGGKMVRGGGHVDG-PSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAI 285

Query: 62  REIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFS------- 114
           REIQLH DDC     D+Y   S F LGIAVL+A  FFGYMLALLQ R+ ++ S       
Sbjct: 286 REIQLHFDDCA----DQY--GSGFPLGIAVLVAAVFFGYMLALLQRRLSSIVSYHTDQEV 339

Query: 115 ---SPD-DPRGPLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGR 170
               PD DP  P+R                              DE +KQ+E F  +L  
Sbjct: 340 FEAVPDQDPIKPVRPP-----------------------LILTGDEQEKQEESFLGTLQI 376

Query: 171 LLVNSSSSMGEILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGD 230
            + N+     E+    F G ++K                 +  S+  WP+QESFVI + D
Sbjct: 377 FISNAWVFSVELFSGMFPGLRKK---QTVGLNFNHQETKHSAFSTTSWPIQESFVIHNKD 433

Query: 231 EPPGIEARTPTLRKPYPFMPNE-IEKPQQFKQTQGYLNRWDDGGYDEXXXXXXXXXXXXX 289
           EPP +E+R  T  K YPFM  +  EK QQ +Q++  L R  D  +               
Sbjct: 434 EPPPVESRNATPGKIYPFMSKDATEKMQQLRQSRA-LYRSLDAEF---------LQEQQQ 483

Query: 290 XXXXXXVQNRYSSTPQGYYEQNRE-TNEIVFGAVQEHDGRREAMVIKAVDYGDPKFSHQN 348
                     +S+ P   YEQ+ E TNEIVFG  QE D                  +HQN
Sbjct: 484 EKHQQYHHRHHSTIPYTLYEQSSEKTNEIVFGPGQEQDQMN---------------THQN 528

Query: 349 VRPRLN-YVGYSHGY 362
           +  R + +V Y +GY
Sbjct: 529 IHHRAHQFVSYPYGY 543


>AT3G14860.1 | Symbols:  | NHL domain-containing protein |
           chr3:4998591-5000894 REVERSE LENGTH=492
          Length = 492

 Score = 95.5 bits (236), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 52/107 (48%), Positives = 65/107 (60%), Gaps = 9/107 (8%)

Query: 1   MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
           +AIRKI D G+TTIA              PSEDAKFSNDFDVVY   +CSLLV+DRGN A
Sbjct: 171 LAIRKIGDSGVTTIAGGKSNIAGYRDG--PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 228

Query: 61  IREIQLHQDDCTKYKYDEYDNDSSFHLG-IAVLIAVGFFGYMLALLQ 106
           +R+I L ++DC      +Y +DSS  L  I ++I     GY   +LQ
Sbjct: 229 LRQISLSEEDC------DYQDDSSISLTDILLVIGAVLIGYATCMLQ 269


>AT3G14860.2 | Symbols:  | NHL domain-containing protein |
           chr3:4998591-5000894 REVERSE LENGTH=493
          Length = 493

 Score = 95.5 bits (236), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 52/107 (48%), Positives = 65/107 (60%), Gaps = 9/107 (8%)

Query: 1   MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
           +AIRKI D G+TTIA              PSEDAKFSNDFDVVY   +CSLLV+DRGN A
Sbjct: 171 LAIRKIGDSGVTTIAGGKSNIAGYRDG--PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 228

Query: 61  IREIQLHQDDCTKYKYDEYDNDSSFHLG-IAVLIAVGFFGYMLALLQ 106
           +R+I L ++DC      +Y +DSS  L  I ++I     GY   +LQ
Sbjct: 229 LRQISLSEEDC------DYQDDSSISLTDILLVIGAVLIGYATCMLQ 269


>AT1G23890.1 | Symbols:  | NHL domain-containing protein |
           chr1:8439321-8440803 REVERSE LENGTH=261
          Length = 261

 Score = 63.2 bits (152), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 7/101 (6%)

Query: 2   AIRKISDEG-ITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
           AIRKIS  G +TTIA              P+++A FS+DF++ +  + C LLV D GN+ 
Sbjct: 123 AIRKISSSGSVTTIAGGISKAFGHRDG--PAQNATFSSDFEITFVPQRCCLLVSDHGNEM 180

Query: 61  IREIQLHQDDCTKYKYDEYDNDSSFHLGIAVL----IAVGF 97
           IR+I L ++DC +  +      S + +GI +     +A+GF
Sbjct: 181 IRQINLKEEDCLENSHSNLGTYSLWSIGIVLSCILGVAIGF 221


>AT1G23890.2 | Symbols:  | NHL domain-containing protein |
           chr1:8438900-8440803 REVERSE LENGTH=400
          Length = 400

 Score = 62.4 bits (150), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 7/101 (6%)

Query: 2   AIRKISDEG-ITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
           AIRKIS  G +TTIA              P+++A FS+DF++ +  + C LLV D GN+ 
Sbjct: 123 AIRKISSSGSVTTIAGGISKAFGHRDG--PAQNATFSSDFEITFVPQRCCLLVSDHGNEM 180

Query: 61  IREIQLHQDDCTKYKYDEYDNDSSFHLGIAVL----IAVGF 97
           IR+I L ++DC +  +      S + +GI +     +A+GF
Sbjct: 181 IRQINLKEEDCLENSHSNLGTYSLWSIGIVLSCILGVAIGF 221