Miyakogusa Predicted Gene
- Lj0g3v0197229.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0197229.1 Non Characterized Hit- tr|I1M029|I1M029_SOYBN
Uncharacterized protein (Fragment) OS=Glycine max PE=4,83.02,9e-19,NHL
REPEAT-CONTAINING PROTEIN,NULL; FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.12481.1
(362 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr6g007720.1 | NHL repeat protein | HC | chr6:1850293-1854343... 390 e-108
Medtr8g063760.1 | NHL repeat protein | HC | chr8:26718504-267142... 266 3e-71
Medtr5g034750.1 | NHL repeat protein | HC | chr5:15072581-150770... 201 1e-51
Medtr2g075860.1 | NHL repeat protein | HC | chr2:31740677-317353... 94 2e-19
Medtr8g058630.1 | NHL repeat protein | HC | chr8:20081350-200803... 81 2e-15
Medtr5g034550.1 | NHL repeat protein | HC | chr5:15003202-150038... 69 6e-12
Medtr8g063700.1 | plant/T23E23-13 protein, putative | HC | chr8:... 58 1e-08
>Medtr6g007720.1 | NHL repeat protein | HC | chr6:1850293-1854343 |
20130731
Length = 562
Score = 390 bits (1001), Expect = e-108, Method: Compositional matrix adjust.
Identities = 222/402 (55%), Positives = 246/402 (61%), Gaps = 46/402 (11%)
Query: 1 MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
MAIRKISDEG+TTIA PSEDAKFSNDFD++YA SCSLLV DRGNQA
Sbjct: 167 MAIRKISDEGVTTIAGGGKRGQLGGHVDGPSEDAKFSNDFDLIYARSSCSLLVDDRGNQA 226
Query: 61 IREIQLHQDDC---TKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPD 117
IREIQL+QDDC T DEY+ D+SF LGIA L++ GFFGYMLALL+ RV MFSS D
Sbjct: 227 IREIQLNQDDCITSTTTTNDEYEYDNSFPLGIAALVSAGFFGYMLALLKRRVTDMFSSSD 286
Query: 118 DPRGPLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSS 177
D R +RTKGTPFA++Q+ EDEF+K DEGFFVSLGRLLVNSSS
Sbjct: 287 DSRAHIRTKGTPFASQQRPP-----PKSVRPPLIPNEDEFEKHDEGFFVSLGRLLVNSSS 341
Query: 178 SMGEILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGDE-PPGIE 236
SMGEI S F GSKRKPL ANR SN WPMQESFVIPDGDE PP +E
Sbjct: 342 SMGEIFLSLFLGSKRKPLSYHQYQQHQQQYHYANRQHSNSWPMQESFVIPDGDEPPPNME 401
Query: 237 ARTPTLRKPYPFMPNEIEKPQQF--------------------------KQTQGYLNRWD 270
+TPT RK YP+ E+E ++ K + YLN D
Sbjct: 402 TKTPTQRKTYPYTNKELEMLEKTRDNGFYETNIFPPPVPINRQTESTISKHNRAYLNMLD 461
Query: 271 DGGYD---------EXXXXXXXXXXXXXXXXXXXVQNRYSS-TPQGYYEQNRETNEIVFG 320
YD VQ+ YSS TP YYE+N ETNEIVFG
Sbjct: 462 K-SYDHEQLQQHHHHHHQEQQQHQNHHQQPQHSKVQSHYSSTTPSSYYEKNCETNEIVFG 520
Query: 321 AVQEHDGRREAMVIKAVDYGDPKFSHQNVRPRLNYVGYSHGY 362
AVQEHDGRREAMVIKAVDYGDPKFSH N+RPRLNYVGYSH Y
Sbjct: 521 AVQEHDGRREAMVIKAVDYGDPKFSHHNIRPRLNYVGYSHNY 562
>Medtr8g063760.1 | NHL repeat protein | HC | chr8:26718504-26714203
| 20130731
Length = 521
Score = 266 bits (679), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 162/367 (44%), Positives = 208/367 (56%), Gaps = 22/367 (5%)
Query: 1 MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
MAIRKISD G+TTIA PSE+AKFS+DFDVVY G SCSLLVVDRGNQA
Sbjct: 170 MAIRKISDSGVTTIAGGKWSRGGGHVDG-PSEEAKFSDDFDVVYVGSSCSLLVVDRGNQA 228
Query: 61 IREIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPDDPR 120
IREIQLH DDC Y+Y S F LGIA+L+ GFFGYMLALLQ R+ + S D +
Sbjct: 229 IREIQLHFDDCA-YRY-----GSDFPLGIAMLVGAGFFGYMLALLQRRLGTIVES-QDAQ 281
Query: 121 GPLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSSSMG 180
PL + + Q+ +E E +KQ+E FF SLG+LL N+ SSM
Sbjct: 282 VPLTVMPSVSRSTYQKP-----LKSVRPPLIPSEYEPEKQEESFFGSLGKLLANAGSSMV 336
Query: 181 EILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGDEPPGIEARTP 240
EI+G F +R+ P ++ N WP QESFVIP DEPP I+ R P
Sbjct: 337 EIMGGLFPVFRRR--PQSYHQFQRQTLIQQSQKQVNDWPAQESFVIPREDEPPSIDTRAP 394
Query: 241 TLRKPYPFMPNEIEKPQQFKQTQGYLNRWD-DGGYDEXXXXXXXXXXXXXXXXXXXVQNR 299
T RK YPFM + EK QQ +Q++ + + WD D + ++
Sbjct: 395 TPRKTYPFMSKDAEKIQQLRQSKAFYSGWDGDQHQQQQPQPQPQPQQQQQQQQQQQQKHH 454
Query: 300 Y-----SSTPQGYYEQ-NRETNEIVFGAVQEHDGRREAMVIKAVDYGDPKFSHQNVRPRL 353
Y SS P +YEQ N TNE+VFGAVQE DG++E++VI V+YG + H++ R R+
Sbjct: 455 YRHQYQSSVPHTFYEQTNETTNEVVFGAVQEQDGKKESVVITPVEYGGSLYEHRDFRSRM 514
Query: 354 NYVGYSH 360
+Y+GY +
Sbjct: 515 SYMGYKY 521
>Medtr5g034750.1 | NHL repeat protein | HC | chr5:15072581-15077047
| 20130731
Length = 560
Score = 201 bits (510), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 123/270 (45%), Positives = 152/270 (56%), Gaps = 20/270 (7%)
Query: 1 MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
MAIRKISD G+TTIA PSE+AKFSNDFDVVY G SCSLLV+DRGNQA
Sbjct: 167 MAIRKISDSGVTTIAGGKLSRGGGHVDG-PSEEAKFSNDFDVVYVGSSCSLLVIDRGNQA 225
Query: 61 IREIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPDDPR 120
IREIQL DDC Y +S F LGIA+L+ GFFGYMLALLQ R+ + +S D
Sbjct: 226 IREIQLRFDDCA------YQYESGFPLGIAMLLGAGFFGYMLALLQRRLSTIVASQDMTL 279
Query: 121 GPLRTKGTPFAAEQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSSSMG 180
+ + F+ QK +EDE KQ+EG F S+G+LL N+ +S+
Sbjct: 280 AE-SSAMSDFSPSPYQK----PLKSVRPPLIPSEDESYKQEEGLFASIGKLLTNAGASVV 334
Query: 181 EILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGDEPPGIEARTP 240
EI+ G ++KP N WP+QESFVI + DEPP I+ RTP
Sbjct: 335 EIM-----GFRKKP---QSYEFQSQPLFHQPERQINAWPVQESFVITNEDEPPSIDPRTP 386
Query: 241 TLRKPYPFMPNEIEKPQQFKQTQGYLNRWD 270
T +K YPFM + EK QQ Q + N W+
Sbjct: 387 TPKKTYPFMIKDTEKMQQLWQGRALYNGWE 416
Score = 64.7 bits (156), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 34/62 (54%), Positives = 46/62 (74%), Gaps = 3/62 (4%)
Query: 297 QNRY-SSTPQGYYEQNRE-TNEIVFGAVQEHDGRREAMVIKAVDYGDPKFSHQNVRPRLN 354
+N+Y SS YYEQ+ E TNEIVFGAVQE D +E++VIK +DYGD + H N+R R++
Sbjct: 497 RNQYHSSVAHTYYEQSHEETNEIVFGAVQEQD-EKESVVIKPLDYGDSFYDHHNMRSRIS 555
Query: 355 YV 356
Y+
Sbjct: 556 YI 557
>Medtr2g075860.1 | NHL repeat protein | HC | chr2:31740677-31735332
| 20130731
Length = 493
Score = 94.0 bits (232), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 111/241 (46%), Gaps = 43/241 (17%)
Query: 1 MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
+AIRKI D G+TTIA PSEDAKFSNDFDVVY +CSLLV+DRGN A
Sbjct: 164 LAIRKIGDAGVTTIAGGKSNVAGYRDG--PSEDAKFSNDFDVVYVRPTCSLLVIDRGNAA 221
Query: 61 IREIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGFFGYMLALLQWRVRAMFSSPDDPR 120
+R+I L Q+DC +Y + S I +++ GY +LQ + F S
Sbjct: 222 LRKIILDQEDC------DYQSSSISSTDILIVVGAVLVGYATCMLQQGFGSSFFS----- 270
Query: 121 GPLRTKGTPFAA-EQQQKRXXXXXXXXXXXXXXAEDEFDKQDEGFFVSLGRLLVNSSSSM 179
R+ G F E KR E K+D G + S G+L+ + S
Sbjct: 271 -KTRSSGQEFKGRESNDKRMPIP-------------ESSKEDPG-WPSFGQLIADLSKLS 315
Query: 180 GEILGSFFTGSKRKPLPXXXXXXXXXXXXXANRHSSNPWPMQESFVIPDGD-EPPGIEAR 238
E L S FT + +P N + P+++ V+P+ + +PP ++ +
Sbjct: 316 LEALASAFT----QFMPSHFKF---------NSRKTGLTPLKDRLVMPEDEVQPPLVKRK 362
Query: 239 T 239
T
Sbjct: 363 T 363
>Medtr8g058630.1 | NHL repeat protein | HC | chr8:20081350-20080362
| 20130731
Length = 150
Score = 80.9 bits (198), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/71 (57%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 1 MAIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQA 60
MAIRKI D G+TTIA P EDAK SNDFDVVY +CSLLV+DRGN A
Sbjct: 60 MAIRKIGDAGVTTIAGGKSNVAGYRDG--PGEDAKLSNDFDVVYIRPTCSLLVIDRGNAA 117
Query: 61 IREIQLHQDDC 71
+R+I L+Q+DC
Sbjct: 118 LRQIFLNQEDC 128
>Medtr5g034550.1 | NHL repeat protein | HC | chr5:15003202-15003898
| 20130731
Length = 154
Score = 68.9 bits (167), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 39/75 (52%), Positives = 45/75 (60%), Gaps = 17/75 (22%)
Query: 40 FDVVYAGRSCSLLVVDRGNQAIREIQLHQDDCTKYKYDEYDNDSSFHLG----------- 88
FDV+Y G S SLLV+DRG QAIREIQL DDC Y +S F LG
Sbjct: 80 FDVIYVGSSYSLLVIDRGKQAIREIQLRFDDCA------YQYESRFPLGKLNKFKVCLYR 133
Query: 89 IAVLIAVGFFGYMLA 103
IA+L+ GFFGYM+A
Sbjct: 134 IAMLVGAGFFGYMMA 148
>Medtr8g063700.1 | plant/T23E23-13 protein, putative | HC |
chr8:26697994-26701680 | 20130731
Length = 384
Score = 58.2 bits (139), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 2/96 (2%)
Query: 2 AIRKISDEGITTIAXXXXXXXXXXXXXXPSEDAKFSNDFDVVYAGRSCSLLVVDRGNQAI 61
IRKIS G+TTIA P+++A FSNDF++ + C+LLV D +Q +
Sbjct: 128 VIRKISTNGVTTIAGGSSEKSSIKDG--PAQNASFSNDFELTFIPALCALLVSDHMHQLV 185
Query: 62 REIQLHQDDCTKYKYDEYDNDSSFHLGIAVLIAVGF 97
+I L ++DCT ++ LG+ + +G
Sbjct: 186 HQINLKEEDCTLGSKSALGAVMTWTLGLGLSCILGL 221