Miyakogusa Predicted Gene
- Lj0g3v0197229.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0197229.1 Non Chatacterized Hit- tr|I1M029|I1M029_SOYBN
Uncharacterized protein (Fragment) OS=Glycine max PE=4,83.02,9e-19,NHL
REPEAT-CONTAINING PROTEIN,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.12481.1
(362 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G70280.2 | Symbols: | NHL domain-containing protein | chr1:2... 234 5e-62
AT1G70280.1 | Symbols: | NHL domain-containing protein | chr1:2... 234 5e-62
AT5G14890.1 | Symbols: | NHL domain-containing protein | chr5:4... 202 4e-52
AT1G23880.1 | Symbols: | NHL domain-containing protein | chr1:8... 187 1e-47
AT3G14860.1 | Symbols: | NHL domain-containing protein | chr3:4... 96 5e-20
AT3G14860.2 | Symbols: | NHL domain-containing protein | chr3:4... 96 5e-20
AT1G23890.1 | Symbols: | NHL domain-containing protein | chr1:8... 63 3e-10
AT1G23890.2 | Symbols: | NHL domain-containing protein | chr1:8... 62 5e-10
>AT1G70280.2 | Symbols: | NHL domain-containing protein |
chr1:26466086-26468471 REVERSE LENGTH=509
Length = 509
Score = 234 bits (598), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 156/366 (42%), Positives = 196/366 (53%), Gaps = 30/366 (8%)
Query: 2 AIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQAI 61
AIRKIS+ G+TTIA PSEDAKFSNDFDVVY G SCSLLV+DRGN+AI
Sbjct: 165 AIRKISEGGVTTIAGGKTVRNGGHVDG-PSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAI 223
Query: 62 REIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPDDPRG 121
REIQLH DDC Y S F LGIAVL+A GFFGYMLALLQ RV ++ SS +D
Sbjct: 224 REIQLHFDDCA------YQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGSIVSSHNDQEM 277
Query: 122 PLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSSSMGE 181
F A+ QK +++ +KQ+E F VSLG+L+ N+ S+ E
Sbjct: 278 --------FEADPDQK---PMKHSRPSLIPAGDEQLEKQEETFVVSLGKLVSNAWESVME 326
Query: 182 ILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGDEPPGIEARTPT 241
IL ++K A +S PWP+QESFVI D D PP +E R PT
Sbjct: 327 IL-------RKKQTGTSFQQYHGTTKQSAAFSTSTPWPIQESFVIRDEDGPPPVEPRNPT 379
Query: 242 LRKPYPFMPNEIEKPQQFKQTQGYLNRWDDGGYDEXXXXXXXXXXXXXXXXXXXVQNR-Y 300
RK Y FM + EK QQ +Q++ + + WD ++ R Y
Sbjct: 380 PRKTYAFMSKDAEKMQQLRQSRAFYSSWDAEFPNQQQQQQKQHQKHQHQQQQQQQHRRHY 439
Query: 301 SSTPQGYYEQNRE-TNEIVFGAVQEHDGRREAM-VIKAVDYGDP--KFSHQNVRPRLNYV 356
SS P YYEQ+ E +NEIVFGAVQE +R A K ++ GD + QN+ R + V
Sbjct: 440 SSIPHTYYEQDSEKSNEIVFGAVQEQSSKRVAKPKPKPIESGDQMNNNTQQNLHYRSHSV 499
Query: 357 GYSHGY 362
Y +GY
Sbjct: 500 SYPYGY 505
>AT1G70280.1 | Symbols: | NHL domain-containing protein |
chr1:26466086-26468116 REVERSE LENGTH=447
Length = 447
Score = 234 bits (598), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 157/366 (42%), Positives = 197/366 (53%), Gaps = 30/366 (8%)
Query: 2 AIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQAI 61
AIRKIS+ G+TTIA PSEDAKFSNDFDVVY G SCSLLV+DRGN+AI
Sbjct: 103 AIRKISEGGVTTIAGGKTVRNGGHVDG-PSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAI 161
Query: 62 REIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPDDPRG 121
REIQLH DDC Y S F LGIAVL+A GFFGYMLALLQ RV ++ SS +D
Sbjct: 162 REIQLHFDDCA------YQYGSGFPLGIAVLVAAGFFGYMLALLQRRVGSIVSSHNDQEM 215
Query: 122 PLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSSSMGE 181
F A+ QK +++ +KQ+E F VSLG+L+ N+ S+ E
Sbjct: 216 --------FEADPDQK---PMKHSRPSLIPAGDEQLEKQEETFVVSLGKLVSNAWESVME 264
Query: 182 ILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGDEPPGIEARTPT 241
IL TG+ + A +S PWP+QESFVI D D PP +E R PT
Sbjct: 265 ILRKKQTGTSFQ-------QYHGTTKQSAAFSTSTPWPIQESFVIRDEDGPPPVEPRNPT 317
Query: 242 LRKPYPFMPNEIEKPQQFKQTQGYLNRWDDGGYDEXXXXXXXXXXXXXXXXXXXVQNR-Y 300
RK Y FM + EK QQ +Q++ + + WD ++ R Y
Sbjct: 318 PRKTYAFMSKDAEKMQQLRQSRAFYSSWDAEFPNQQQQQQKQHQKHQHQQQQQQQHRRHY 377
Query: 301 SSTPQGYYEQNRE-TNEIVFGAVQEHDGRREAM-VIKAVDYGDP--KFSHQNVRPRLNYV 356
SS P YYEQ+ E +NEIVFGAVQE +R A K ++ GD + QN+ R + V
Sbjct: 378 SSIPHTYYEQDSEKSNEIVFGAVQEQSSKRVAKPKPKPIESGDQMNNNTQQNLHYRSHSV 437
Query: 357 GYSHGY 362
Y +GY
Sbjct: 438 SYPYGY 443
>AT5G14890.1 | Symbols: | NHL domain-containing protein |
chr5:4818056-4821534 FORWARD LENGTH=754
Length = 754
Score = 202 bits (513), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 137/359 (38%), Positives = 182/359 (50%), Gaps = 53/359 (14%)
Query: 1 MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
MAIRKISD+G++TIA E +FS+DFD++Y SCSLLV+DRGNQ
Sbjct: 173 MAIRKISDDGVSTIAAGGRWSGGSK-----EESMRFSDDFDLIYVSSSCSLLVIDRGNQL 227
Query: 61 IREIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPDDPR 120
I+EIQLH DC++ E D DS HLG A+L+A FFGYMLALL RVR++FSS
Sbjct: 228 IKEIQLHDHDCSQ---PEPDTDS-LHLGTALLVAAVFFGYMLALLVRRVRSLFSSSSHDT 283
Query: 121 GPLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSSSMG 180
R TP +R + ++EGF SLG+L+V + SS+
Sbjct: 284 KSKRHVATPSMTMAPYQRYPRPVRQPLIPPQHESE----KEEGFLGSLGKLVVKTGSSVS 339
Query: 181 EILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGDEPPGIEARTP 240
E++ +GS+ P ++ N WP+QESF IP+ D PP +E R+
Sbjct: 340 EMM----SGSRNVIPPNFHQYH--------HQQEPNQWPVQESFAIPEEDGPPALEPRSG 387
Query: 241 TLRKPYPFMPNEIEKPQQFKQTQGYLNRWDDGGYDEXXXXXXXXXXXXXXXXXXXVQNRY 300
T +KP + + QG Q +
Sbjct: 388 T----------NPDKP--YLRAQG----------------TNQNRSYYQDYDQYQNQQKR 419
Query: 301 SSTPQGYYEQNRETNEIVFGAVQEHDGRREAMVIKAVDYGDPKFSHQNVRPRLNYVGYS 359
+ +E NRE NEIVFGAVQE DGRREAMVIKAVD+ + +N+RPR+NY+GYS
Sbjct: 420 NVNDTASFEDNREKNEIVFGAVQEQDGRREAMVIKAVDFNEAINDQRNLRPRINYMGYS 478
>AT1G23880.1 | Symbols: | NHL domain-containing protein |
chr1:8436125-8438636 FORWARD LENGTH=545
Length = 545
Score = 187 bits (474), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 143/375 (38%), Positives = 179/375 (47%), Gaps = 72/375 (19%)
Query: 2 AIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQAI 61
AIRKIS+ G+TTIA PSEDAKFSNDFDVVY G SCSLLV+DRGNQAI
Sbjct: 227 AIRKISEAGVTTIAGGKMVRGGGHVDG-PSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAI 285
Query: 62 REIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFS------- 114
REIQLH DDC D+Y S F LGIAVL+A FFGYMLALLQ R+ ++ S
Sbjct: 286 REIQLHFDDCA----DQY--GSGFPLGIAVLVAAVFFGYMLALLQRRLSSIVSYHTDQEV 339
Query: 115 ---SPD-DPRGPLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGR 170
PD DP P+R DE +KQ+E F +L
Sbjct: 340 FEAVPDQDPIKPVRPP-----------------------LILTGDEQEKQEESFLGTLQI 376
Query: 171 LLVNSSSSMGEILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGD 230
+ N+ E+ F G ++K + S+ WP+QESFVI + D
Sbjct: 377 FISNAWVFSVELFSGMFPGLRKK---QTVGLNFNHQETKHSAFSTTSWPIQESFVIHNKD 433
Query: 231 EPPGIEARTPTLRKPYPFMPNE-IEKPQQFKQTQGYLNRWDDGGYDEXXXXXXXXXXXXX 289
EPP +E+R T K YPFM + EK QQ +Q++ L R D +
Sbjct: 434 EPPPVESRNATPGKIYPFMSKDATEKMQQLRQSRA-LYRSLDAEF---------LQEQQQ 483
Query: 290 XXXXXXVQNRYSSTPQGYYEQNRE-TNEIVFGAVQEHDGRREAMVIKAVDYGDPKFSHQN 348
+S+ P YEQ+ E TNEIVFG QE D +HQN
Sbjct: 484 EKHQQYHHRHHSTIPYTLYEQSSEKTNEIVFGPGQEQDQMN---------------THQN 528
Query: 349 VRPRLN-YVGYSHGY 362
+ R + +V Y +GY
Sbjct: 529 IHHRAHQFVSYPYGY 543
>AT3G14860.1 | Symbols: | NHL domain-containing protein |
chr3:4998591-5000894 REVERSE LENGTH=492
Length = 492
Score = 95.5 bits (236), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/107 (48%), Positives = 65/107 (60%), Gaps = 9/107 (8%)
Query: 1 MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
+AIRKI D G+TTIA PSEDAKFSNDFDVVY +CSLLV+DRGN A
Sbjct: 171 LAIRKIGDSGVTTIAGGKSNIAGYRDG--PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 228
Query: 61 IREIQLHQDDCTKYKYDEYDNDSSFHLG-IAVLIAVGFFGYMLALLQ 106
+R+I L ++DC +Y +DSS L I ++I GY +LQ
Sbjct: 229 LRQISLSEEDC------DYQDDSSISLTDILLVIGAVLIGYATCMLQ 269
>AT3G14860.2 | Symbols: | NHL domain-containing protein |
chr3:4998591-5000894 REVERSE LENGTH=493
Length = 493
Score = 95.5 bits (236), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/107 (48%), Positives = 65/107 (60%), Gaps = 9/107 (8%)
Query: 1 MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
+AIRKI D G+TTIA PSEDAKFSNDFDVVY +CSLLV+DRGN A
Sbjct: 171 LAIRKIGDSGVTTIAGGKSNIAGYRDG--PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 228
Query: 61 IREIQLHQDDCTKYKYDEYDNDSSFHLG-IAVLIAVGFFGYMLALLQ 106
+R+I L ++DC +Y +DSS L I ++I GY +LQ
Sbjct: 229 LRQISLSEEDC------DYQDDSSISLTDILLVIGAVLIGYATCMLQ 269
>AT1G23890.1 | Symbols: | NHL domain-containing protein |
chr1:8439321-8440803 REVERSE LENGTH=261
Length = 261
Score = 63.2 bits (152), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 7/101 (6%)
Query: 2 AIRKISDEG-ITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
AIRKIS G +TTIA P+++A FS+DF++ + + C LLV D GN+
Sbjct: 123 AIRKISSSGSVTTIAGGISKAFGHRDG--PAQNATFSSDFEITFVPQRCCLLVSDHGNEM 180
Query: 61 IREIQLHQDDCTKYKYDEYDNDSSFHLGIAVL----IAVGF 97
IR+I L ++DC + + S + +GI + +A+GF
Sbjct: 181 IRQINLKEEDCLENSHSNLGTYSLWSIGIVLSCILGVAIGF 221
>AT1G23890.2 | Symbols: | NHL domain-containing protein |
chr1:8438900-8440803 REVERSE LENGTH=400
Length = 400
Score = 62.4 bits (150), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 56/101 (55%), Gaps = 7/101 (6%)
Query: 2 AIRKISDEG-ITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
AIRKIS G +TTIA P+++A FS+DF++ + + C LLV D GN+
Sbjct: 123 AIRKISSSGSVTTIAGGISKAFGHRDG--PAQNATFSSDFEITFVPQRCCLLVSDHGNEM 180
Query: 61 IREIQLHQDDCTKYKYDEYDNDSSFHLGIAVL----IAVGF 97
IR+I L ++DC + + S + +GI + +A+GF
Sbjct: 181 IRQINLKEEDCLENSHSNLGTYSLWSIGIVLSCILGVAIGF 221