Miyakogusa Predicted Gene
- Lj3g3v1061470.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v1061470.1 Non Characterized Hit- tr|D8TA08|D8TA08_SELML
Putative uncharacterized protein (Fragment) OS=Selagin,35.43,6e-18,no
description,Six-bladed beta-propeller, TolB-like; NHL
REPEAT-CONTAINING PROTEIN,NULL; FAMILY NOT ,CUFF.42103.1
(481 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr2g075860.1 | NHL repeat protein | HC | chr2:31740677-317353... 691 0.0
Medtr5g034750.1 | NHL repeat protein | HC | chr5:15072581-150770... 241 9e-64
Medtr8g063760.1 | NHL repeat protein | HC | chr8:26718504-267142... 239 5e-63
Medtr8g058630.1 | NHL repeat protein | HC | chr8:20081350-200803... 226 3e-59
Medtr6g007720.1 | NHL repeat protein | HC | chr6:1850293-1854343... 220 3e-57
Medtr8g063700.1 | plant/T23E23-13 protein, putative | HC | chr8:... 139 4e-33
Medtr8g058280.1 | transmembrane protein, putative | HC | chr8:20... 134 2e-31
Medtr5g034550.1 | NHL repeat protein | HC | chr5:15003202-150038... 113 5e-25
Medtr8g058310.1 | hypothetical protein | HC | chr8:20082079-2008... 60 3e-09
>Medtr2g075860.1 | NHL repeat protein | HC | chr2:31740677-31735332
| 20130731
Length = 493
Score = 691 bits (1783), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/475 (75%), Positives = 381/475 (80%), Gaps = 18/475 (3%)
Query: 21 SLRFQ-AHAAPAGPLIKHLSSLIKWTRSAS-KTPHSDENVLQFENGYVVETVVEGNEIGV 78
SL FQ HAAPAGPLIKHLSSLIKWTRSA+ KTPHSD NVLQFENGYVVETVVEGNEIGV
Sbjct: 23 SLHFQPTHAAPAGPLIKHLSSLIKWTRSATTKTPHSDGNVLQFENGYVVETVVEGNEIGV 82
Query: 79 IPYRIRVSEEDGELLSVDETNSNVVRITLPLSQYSRGRLVAGSFQGYTGHVDGKPNDARF 138
IPYRIRVSEEDGEL +VDE NSN+VRIT PLSQYSRGRLVAGSFQGYT HVDGKP+DARF
Sbjct: 83 IPYRIRVSEEDGELFAVDEINSNIVRITPPLSQYSRGRLVAGSFQGYTDHVDGKPSDARF 142
Query: 139 NHPKGIAMDDKGNTYVADIQNMAIRKIGDAGVTTIAGGKSNVAGYRDGPSEDAKFSNDFD 198
NHPKGI MDDKGN YVAD QN+AIRKIGDAGVTTIAGGKSNVAGYRDGPSEDAKFSNDFD
Sbjct: 143 NHPKGITMDDKGNVYVADTQNLAIRKIGDAGVTTIAGGKSNVAGYRDGPSEDAKFSNDFD 202
Query: 199 VVYVRPTCSLLVIDRGNAALRKISLDQEDCDYQSSSISSTDILTVIGAVMVGYATCMLQQ 258
VVYVRPTCSLLVIDRGNAALRKI LDQEDCDYQSSSISSTDIL V+GAV+VGYATCMLQQ
Sbjct: 203 VVYVRPTCSLLVIDRGNAALRKIILDQEDCDYQSSSISSTDILIVVGAVLVGYATCMLQQ 262
Query: 259 GFGLPFFSKTKPSASEFKEQVSSEKHMPFLESSKEEPGWPSFGQLIVDLSKLSLEALARA 318
GFG FFSKT+ S EFK + S++K MP ESSKE+PGWPSFGQLI DLSKLSLEALA A
Sbjct: 263 GFGSSFFSKTRSSGQEFKGRESNDKRMPIPESSKEDPGWPSFGQLIADLSKLSLEALASA 322
Query: 319 FAQVIPSHLISGSPKRGLTPLNDRFLMPEDQV---LVNRKTTPAPLIENRHVPQVHHTPR 375
F Q +PSH S K GLTPL DR +MPED+V LV RKTTP + ENR +PQVH
Sbjct: 323 FTQFMPSHFKFNSRKTGLTPLKDRLVMPEDEVQPPLVKRKTTPVTVTENRQMPQVH---- 378
Query: 376 TAEXXXXXXXXXXXXXXXXXXXXXXXHRSSKR---PEFYGSTEIP-PYTKSKSQKERPRH 431
TA HRSSKR EFYGS E+P Y KSKSQKERPRH
Sbjct: 379 TATITEKYSEAKPPKVKSSSFKDPSKHRSSKRSEYAEFYGSGEVPSSYAKSKSQKERPRH 438
Query: 432 RQREKSGEVVF---GAEPKPVETK-AVDQSATKFDHYNMR-AKYVSGESYRFNSQ 481
R REKSGEVVF GAE KPVE + AVD S +KFD Y+MR YV GES+RFNSQ
Sbjct: 439 RHREKSGEVVFPTNGAEAKPVEPRAAVDHSNSKFDRYSMRTGGYVPGESFRFNSQ 493
>Medtr5g034750.1 | NHL repeat protein | HC | chr5:15072581-15077047
| 20130731
Length = 560
Score = 241 bits (616), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 125/231 (54%), Positives = 163/231 (70%), Gaps = 5/231 (2%)
Query: 32 GPLIKHLSSLIKWTRS---ASKTPHSDENVLQFENGYVVETVVEGNEIGVIPYRIRVSEE 88
G L + + KW S +K + +++++FE+GY VETV +G+++G+ PY + V
Sbjct: 37 GFLSNAVPAFTKWVFSLKPTTKKAIAGKSMMKFESGYNVETVFDGSKLGIEPYAVEVLS- 95
Query: 89 DGELLSVDETNSNVVRITLPLSQYSRGRLVAGSFQGYTGHVDGKPNDARFNHPKGIAMDD 148
+GELL +D NSN+ +I+ LS YSR +LVAGS +GY+GHVDGK +AR NHPKGI +DD
Sbjct: 96 NGELLILDSENSNIYKISSSLSLYSRPKLVAGSAEGYSGHVDGKLREARMNHPKGITVDD 155
Query: 149 KGNTYVADIQNMAIRKIGDAGVTTIAGGK-SNVAGYRDGPSEDAKFSNDFDVVYVRPTCS 207
+GN YVADI NMAIRKI D+GVTTIAGGK S G+ DGPSE+AKFSNDFDVVYV +CS
Sbjct: 156 RGNIYVADIMNMAIRKISDSGVTTIAGGKLSRGGGHVDGPSEEAKFSNDFDVVYVGSSCS 215
Query: 208 LLVIDRGNAALRKISLDQEDCDYQSSSISSTDILTVIGAVMVGYATCMLQQ 258
LLVIDRGN A+R+I L +DC YQ S I ++GA GY +LQ+
Sbjct: 216 LLVIDRGNQAIREIQLRFDDCAYQYESGFPLGIAMLLGAGFFGYMLALLQR 266
>Medtr8g063760.1 | NHL repeat protein | HC | chr8:26718504-26714203
| 20130731
Length = 521
Score = 239 bits (610), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 123/239 (51%), Positives = 166/239 (69%), Gaps = 8/239 (3%)
Query: 24 FQAHAAPAGPLIKHLSSLIKWTRSASKTPHSDENVLQFENGYVVETVVEGNEIGVIPYRI 83
F ++A PA S + ++ +KT +++++FE+GY VETV +G+++G+ PY +
Sbjct: 41 FLSNAVPA------FSKWVWSLKATTKTGVLSKSMMKFESGYNVETVFDGSKLGIEPYAV 94
Query: 84 RVSEEDGELLSVDETNSNVVRITLPLSQYSRGRLVAGSFQGYTGHVDGKPNDARFNHPKG 143
V +GELL +D NSN+ RI+ LS YSR +LVAGS +GY+GHVDG+ +AR NHPKG
Sbjct: 95 EVLH-NGELLILDSANSNLYRISSSLSLYSRPKLVAGSAEGYSGHVDGRLREARMNHPKG 153
Query: 144 IAMDDKGNTYVADIQNMAIRKIGDAGVTTIAGGK-SNVAGYRDGPSEDAKFSNDFDVVYV 202
I +DD+GN YVAD NMAIRKI D+GVTTIAGGK S G+ DGPSE+AKFS+DFDVVYV
Sbjct: 154 ITVDDRGNIYVADTANMAIRKISDSGVTTIAGGKWSRGGGHVDGPSEEAKFSDDFDVVYV 213
Query: 203 RPTCSLLVIDRGNAALRKISLDQEDCDYQSSSISSTDILTVIGAVMVGYATCMLQQGFG 261
+CSLLV+DRGN A+R+I L +DC Y+ S I ++GA GY +LQ+ G
Sbjct: 214 GSSCSLLVVDRGNQAIREIQLHFDDCAYRYGSDFPLGIAMLVGAGFFGYMLALLQRRLG 272
>Medtr8g058630.1 | NHL repeat protein | HC | chr8:20081350-20080362
| 20130731
Length = 150
Score = 226 bits (577), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 108/128 (84%), Positives = 117/128 (91%)
Query: 113 SRGRLVAGSFQGYTGHVDGKPNDARFNHPKGIAMDDKGNTYVADIQNMAIRKIGDAGVTT 172
SR RLVAGSF G TGHVDGK +DARF++PKGIA+DDKGN YVAD QNMAIRKIGDAGVTT
Sbjct: 13 SRERLVAGSFLGRTGHVDGKLSDARFHYPKGIALDDKGNVYVADTQNMAIRKIGDAGVTT 72
Query: 173 IAGGKSNVAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRKISLDQEDCDYQS 232
IAGGKSNVAGYRDGP EDAK SNDFDVVY+RPTCSLLVIDRGNAALR+I L+QEDC+YQS
Sbjct: 73 IAGGKSNVAGYRDGPGEDAKLSNDFDVVYIRPTCSLLVIDRGNAALRQIFLNQEDCNYQS 132
Query: 233 SSISSTDI 240
SSIS T +
Sbjct: 133 SSISLTGL 140
>Medtr6g007720.1 | NHL repeat protein | HC | chr6:1850293-1854343 |
20130731
Length = 562
Score = 220 bits (560), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 113/239 (47%), Positives = 162/239 (67%), Gaps = 20/239 (8%)
Query: 38 LSSLIKWTRSASKTP-------HSDENVLQFENGYVVETVVEGNEIGVIPYRIRVSEEDG 90
+SSL+KW S P HS ++++FE+GY VET+ +G+++G+ P+ I +S+ DG
Sbjct: 40 VSSLLKWIWSLKSKPKVKVPVQHS-RSMVKFESGYNVETIFDGSKLGIEPHSIEISQ-DG 97
Query: 91 ELLSVDETNSNVVRITLPLSQYSRGRLVAGSFQGYTGHVDGKPNDARFNHPKGIAMDDKG 150
E L +D NSN+ +I+ P+S+YS+ +L+AGS +GY GH+DG+ DAR NHPKG+ +DD G
Sbjct: 98 EYLVLDSENSNIYKISSPMSRYSKPKLLAGSSEGYIGHIDGRSRDARLNHPKGLTVDDSG 157
Query: 151 NTYVADIQNMAIRKIGDAGVTTIA--GGKSNVAGYRDGPSEDAKFSNDFDVVYVRPTCSL 208
N Y+AD NMAIRKI D GVTTIA G + + G+ DGPSEDAKFSNDFD++Y R +CSL
Sbjct: 158 NIYIADTLNMAIRKISDEGVTTIAGGGKRGQLGGHVDGPSEDAKFSNDFDLIYARSSCSL 217
Query: 209 LVIDRGNAALRKISLDQEDC---------DYQSSSISSTDILTVIGAVMVGYATCMLQQ 258
LV DRGN A+R+I L+Q+DC +Y+ + I ++ A GY +L++
Sbjct: 218 LVDDRGNQAIREIQLNQDDCITSTTTTNDEYEYDNSFPLGIAALVSAGFFGYMLALLKR 276
>Medtr8g063700.1 | plant/T23E23-13 protein, putative | HC |
chr8:26697994-26701680 | 20130731
Length = 384
Score = 139 bits (351), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 94/275 (34%), Positives = 146/275 (53%), Gaps = 27/275 (9%)
Query: 62 ENGYVVETVVEGNEIGVIPYRIRVSEEDGELLSVDETNSNVVRITLPLSQYSRGRLVAGS 121
E GY + T+++G+++ + P+ I +L+ +D TNS + LP+SQ S + +G+
Sbjct: 31 EEGYTITTILDGHKLHINPFSILQRPISSDLIVLDSTNSTFYTVQLPISQESVFKRFSGN 90
Query: 122 FQGYTGHVDGKPNDARFNHPKGIAMDDKGNTYVADIQNMAIRKIGDAGVTTIAGGKSNVA 181
G G+ DG ARF+ P+ A+D +GN YVAD N IRKI GVTTIAGG S +
Sbjct: 91 --GSPGYEDGDVGLARFDKPRSFAVDFRGNVYVADRVNKVIRKISTNGVTTIAGGSSEKS 148
Query: 182 GYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRKISLDQEDCDYQSSSISSTDIL 241
+DGP+++A FSNDF++ ++ C+LLV D + + +I+L +EDC S S
Sbjct: 149 SIKDGPAQNASFSNDFELTFIPALCALLVSDHMHQLVHQINLKEEDCTLGSKS------- 201
Query: 242 TVIGAVM---VGYA-TCMLQQGFGL---PFF------SKTKPSASEFKEQVSSEKHMPFL 288
+GAVM +G +C+L G+ P+ S+ +A+ Q + K +P L
Sbjct: 202 -ALGAVMTWTLGLGLSCILGLVIGIVIRPYIIPHEHTSRCHFTATWKHCQTNLVKLVPTL 260
Query: 289 ----ESSKEEPGWPSFGQLIVDLSKLSLEALARAF 319
+S+ G S + V L +LSL L F
Sbjct: 261 YSGIKSAVASCGCSSVFTVAVRLWELSLSLLVLMF 295
>Medtr8g058280.1 | transmembrane protein, putative | HC |
chr8:20080057-20074260 | 20130731
Length = 273
Score = 134 bits (337), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/251 (41%), Positives = 123/251 (49%), Gaps = 50/251 (19%)
Query: 241 LTVIGAVMVGYATCMLQQGFGLPFFSKTKPSASEFKEQVSSEKHMPFLESSKEEPGWPSF 300
L I V + Y MLQQ PS +FK + SS+K LE +KEE GWPSF
Sbjct: 59 LYFILGVYIAY---MLQQ-----------PSERDFKGEASSDKSKSTLERTKEETGWPSF 104
Query: 301 GQLIVDLSKLSLEALARAFAQVIPSHLISGSPKRGLTPLNDRFLMPEDQV---LVNRKTT 357
Q +VDL L S +RGLTPL DR LMPED++ LV R++
Sbjct: 105 RQ-VVDL-------------------LKPDSTRRGLTPLKDRLLMPEDELEPLLVKRQSA 144
Query: 358 PAP-LIENRHVPQVHHTPRTAEXXXXXXXXXXXXXXXXXXXXXXXHRSSKR---PEFYGS 413
AP L E R + H AE H SSKR EFYGS
Sbjct: 145 LAPPLTETRKI----HLKSAAEKYSETKIAKVKSSTPKDPSLPSKHHSSKRQEYAEFYGS 200
Query: 414 TEIPPYTKSKSQKERPRHRQREKSGEV--VFGAEPKPVETKAVDQSATKFDHYN---MRA 468
+EIP TKSK QK+R RHR+REKS EV G E KP+E +A TK+D YN MR
Sbjct: 201 SEIPAPTKSKIQKQRSRHRRREKSEEVSGAVGTEQKPLEMRAAGYFNTKYDQYNYNMMRP 260
Query: 469 KYVSGESYRFN 479
KYV ++ RFN
Sbjct: 261 KYVPEDTSRFN 271
>Medtr5g034550.1 | NHL repeat protein | HC | chr5:15003202-15003898
| 20130731
Length = 154
Score = 113 bits (282), Expect = 5e-25, Method: Composition-based stats.
Identities = 62/123 (50%), Positives = 75/123 (60%), Gaps = 7/123 (5%)
Query: 112 YSRGRLVAGSFQGYTGHVDGKPNDARFNHPKGIAMDDKGNTYVADIQNMAIRKIGDAGVT 171
Y R +LVAGS +GY+GHVD K +AR NHPKGI +DD+GN YVADI NMAIRKI
Sbjct: 2 YGRPKLVAGSAEGYSGHVDEKLREARMNHPKGITVDDRGNIYVADIINMAIRKIS----- 56
Query: 172 TIAGGKSNVAGYRDGPSEDAKFSNDFDVVYVRPTCSLLVIDRGNAALRKISLDQEDCDYQ 231
G + S + FDV+YV + SLLVIDRG A+R+I L +DC YQ
Sbjct: 57 --LGNNMTYLSFLYEESLILFYLLLFDVIYVGSSYSLLVIDRGKQAIREIQLRFDDCAYQ 114
Query: 232 SSS 234
S
Sbjct: 115 YES 117
>Medtr8g058310.1 | hypothetical protein | HC |
chr8:20082079-20081626 | 20130731
Length = 66
Score = 60.5 bits (145), Expect = 3e-09, Method: Composition-based stats.
Identities = 32/50 (64%), Positives = 35/50 (70%), Gaps = 1/50 (2%)
Query: 21 SLRFQAHAAP-AGPLIKHLSSLIKWTRSASKTPHSDENVLQFENGYVVET 69
SL F AHA P G LI HLSSL+ S +KT SD NV+QFENGYVVET
Sbjct: 14 SLHFSAHATPLGGTLINHLSSLLIRKLSNTKTSKSDGNVVQFENGYVVET 63