Miyakogusa Predicted Gene
- Lj1g3v4752850.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4752850.1 Non Characterized Hit- tr|I1NAV7|I1NAV7_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.43823 PE,75.76,0,FAMILY
NOT NAMED,NULL; seg,NULL; NT-C2,EEIG1/EHBP1 N-terminal
domain,CUFF.33113.1
(753 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr1g056180.1 | EEIG1/EHBP1 protein amine-terminal domain prot... 984 0.0
Medtr7g106540.2 | EEIG1/EHBP1 protein amine-terminal domain prot... 866 0.0
Medtr7g106540.1 | EEIG1/EHBP1 protein amine-terminal domain prot... 640 0.0
Medtr3g107090.1 | hypothetical protein | HC | chr3:49452455-4944... 298 1e-80
>Medtr1g056180.1 | EEIG1/EHBP1 protein amine-terminal domain protein
| HC | chr1:24589156-24585076 | 20130731
Length = 753
Score = 984 bits (2544), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 508/762 (66%), Positives = 583/762 (76%), Gaps = 31/762 (4%)
Query: 1 MVVKMMRWRPWPPP-SKRFHVRLTVRKLTGC-DLLRDDSRSKLTLEIRWKGPKSSLPSLR 58
MVVKMM+WRPWPPP S++F V+L ++ L+G DL +++ +EIRWKGPK +L SLR
Sbjct: 1 MVVKMMKWRPWPPPISRKFEVKLLIKTLSGGFDLSPENT---FAVEIRWKGPKLALSSLR 57
Query: 59 WNSVARNFTAEAAVDTTA-GAVTWDEEFQSLCNLTADKHNAFHPWEIAFTLFNGLNQNIK 117
N+V RNFT EA V WDEEF S NL+A+K N FHPWEIAFT+FNGLNQ K
Sbjct: 58 RNAVVRNFTGEAHTKGDEHDVVLWDEEFCSFVNLSANKENGFHPWEIAFTVFNGLNQRPK 117
Query: 118 RKVPIIGTALLNIAEFASPTDQKDFDLNIPLTLPGG-SVEPSPSLCISISLVEISGAQGS 176
K+P++GT LN+AE+AS DQKDFDL+IPLT+PGG SV+PS SL ISISLVE+ AQ +
Sbjct: 118 NKIPVVGTGSLNLAEYASVVDQKDFDLSIPLTIPGGASVDPSLSLTISISLVELRVAQEN 177
Query: 177 LESVHRTIVPVSSPPAQSGETTMAEKGDELSAIKAGLRKVKIFTEYVXXXXXXXXXXXXX 236
E H++IVPV+SP QSGE+ + DE+S IKAGLRKVKI TE+V
Sbjct: 178 SELGHKSIVPVASPLNQSGES----EKDEVSTIKAGLRKVKILTEFVSTRKSRKPSREEE 233
Query: 237 XXXXXXXXXXXDGECNYPVXXXXXX--XXXXXXXXXXXXXXXFRKSFSYGPLAYANAGGA 294
DGE NYP RKSFSYG LA+ANAGG+
Sbjct: 234 GSEGNYSARSEDGEYNYPFDSDSLDDFEEGDSHSEEVKEDSSVRKSFSYGKLAFANAGGS 293
Query: 295 FCSNMRVNCDDEGWVYYSHRMSDAGCLRMEDSTLSSSEPNV-QSSMRSILSWRKRKLSFR 353
F S+MRV DDE WVYYS+ SD L EDS +SSSEP V QSS RS+L WRKRKLSFR
Sbjct: 294 FYSSMRVKGDDEDWVYYSNHKSDVESLPKEDSIVSSSEPYVAQSSRRSLLPWRKRKLSFR 353
Query: 354 SPKKANKGEPLLKKAYAEEGGDDIDFDRRQLSSDESLSLRLYKNEDDSCANRSSISEFGD 413
SPK +KGEPLLKKAY EEGGDDIDFDRRQLSSDES+S +K EDDS ANR+S+SEFGD
Sbjct: 354 SPK--SKGEPLLKKAYGEEGGDDIDFDRRQLSSDESISFGSHKAEDDSGANRTSVSEFGD 411
Query: 414 DNFAVGSWEQKEVMSRDGHMKLQTQVFFASIDQRSERAAGESACTALVAVIADWFQNSPD 473
DNFAVGSWEQKEVMSRDGHMKLQTQVFFASIDQRSERAAGESACTALVAVIADWFQN+ D
Sbjct: 412 DNFAVGSWEQKEVMSRDGHMKLQTQVFFASIDQRSERAAGESACTALVAVIADWFQNNRD 471
Query: 474 LMPIKSQFDSLIREGSSEWRSMCDNETYRERFPDKHFDLETVIQAKIRPLSVVPSKSFIG 533
LMPIKSQFDSLIR+GS EWR++C+N+TYRERFPDKHFDLETV+QAKIRPLSVVP KSFIG
Sbjct: 472 LMPIKSQFDSLIRDGSLEWRNLCENQTYRERFPDKHFDLETVVQAKIRPLSVVPEKSFIG 531
Query: 534 FFHPEGMDEEKFDILHGAMSFDNIWDEISGSGHESLSNGEPHVYIVSWNDHFFILKVEAD 593
FFHPEGMDE +FD LHGAMSFDNIWDEIS +G++S N EP ++I+SWNDHFFILKVEAD
Sbjct: 532 FFHPEGMDEGRFDFLHGAMSFDNIWDEISNAGNDSTCNNEPRIFIISWNDHFFILKVEAD 591
Query: 594 CYYIIDTLGERLYEGCNQAYILKFDSSTVIHKMQNAAKSSLEDKTTSNQQTVAEVLERD- 652
Y IIDTLGERLYEGCNQAYILKFDS+TVI+KM N +SS+E+ TT QQTVA+VLE +
Sbjct: 592 SYCIIDTLGERLYEGCNQAYILKFDSNTVIYKMPNVTQSSVEN-TTGEQQTVADVLEHND 650
Query: 653 ------NSKEVDSSSGVAEQ----QEEEVVLCRGKEACKEYIKSFLAAIPIRELQADVKK 702
N KE++S + +Q +EE+ VLC+GKEACKEYIKSFLAAIPIRELQADVKK
Sbjct: 651 RQVQQINDKELESGAEAGDQSKSEREEDEVLCKGKEACKEYIKSFLAAIPIRELQADVKK 710
Query: 703 GLVLMSSTQVHHRLQIEFHYTQLLQSCPATPAVELEASIAAT 744
GL+ SST +HHRLQIEFHYTQLLQSC P E EAS+A T
Sbjct: 711 GLI--SSTPLHHRLQIEFHYTQLLQSCDVVPVAE-EASVAVT 749
>Medtr7g106540.2 | EEIG1/EHBP1 protein amine-terminal domain protein
| HC | chr7:43374781-43371223 | 20130731
Length = 742
Score = 866 bits (2238), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/770 (61%), Positives = 556/770 (72%), Gaps = 49/770 (6%)
Query: 1 MVVKMMRWRPWPP-PSKRFHVRLTVRKLTGCDLLRDDSRSKLTLEIRWKGPKSSLPSLRW 59
MVVKMMRWRPWPP +K++HVRL V+KL GC+ +R S SKL ++++WKGPKS+L SLR
Sbjct: 1 MVVKMMRWRPWPPLVTKKYHVRLNVKKLIGCNPIRQSS-SKLAIQVKWKGPKSTLYSLRR 59
Query: 60 NSVARNFTAEAAV---DTTAG-AVTWDEEFQSLCNLTA-DKHNA-FHPWEIAFTLFNGLN 113
SVARNFT EA + D +AG V +DEEF LCNL +K N FHPWEIAF LF+G N
Sbjct: 60 GSVARNFTREAVMEYDDFSAGFVVNFDEEFHHLCNLNGYNKDNVLFHPWEIAFNLFHGFN 119
Query: 114 QNIKRKVPIIGTALLNIAEFASPTDQKDFDLNIPLTLPGGSVEPSPSLCISISLVEISGA 173
+ ++K ++GTALLNIAEFA+ DQ F LNIPLT PG S + SP LCISISLVE +
Sbjct: 120 GSPRKKGTVVGTALLNIAEFANSIDQTSFGLNIPLTNPGDSAQQSPLLCISISLVERA-- 177
Query: 174 QGSLESVHRTIVPVSSPPAQSGETTMAEKGDELSAIKAGLRKVKIFTEYVXXXXXXXXXX 233
+VHR+IVPVSSP AQS E TM EKGD LSAIKAGLRKV+ FTE+V
Sbjct: 178 -----TVHRSIVPVSSPSAQSEEITMTEKGD-LSAIKAGLRKVRFFTEFVSSRKSEKACR 231
Query: 234 XXXXXXXXXXXXXXDGECNYPVXXXXXXXXXXXXXXXXXXXXXFRKSFSYGPLAYANAGG 293
D NY + SF YG LA ANAGG
Sbjct: 232 GEEGSSDSKYSRSGDNTHNYSLDFDSLDDLGDSDRGNEDSSVG--NSFRYGTLASANAGG 289
Query: 294 A--FCSNMRVNCDDEG-WVYYSHRMSDAGCLRMEDSTLSSSEPNV-QSSMRSILSWRKRK 349
+ + SN R+NCD++ WVY+S+ M DAG M++ST SSSEP + QS+ RSIL WRKRK
Sbjct: 290 SCSYYSNTRMNCDEKDIWVYHSYCMPDAGRSHMDESTTSSSEPYLSQSTKRSILPWRKRK 349
Query: 350 LSFRSPKKANKGEPLLKKAYAEEGGDDIDFDRRQLSSDESLSL--RLYKNEDDSCANRSS 407
+GEPLLKKAYAE+GGDDIDFDRRQLSS+ S+ + +K D +
Sbjct: 350 F---------RGEPLLKKAYAEDGGDDIDFDRRQLSSESDESISSKFHKTGD------TP 394
Query: 408 ISEFGDDNFAVGSWEQKEVMSRDGHMKLQTQVFFASIDQRSERAAGESACTALVAVIADW 467
ISEFGDD F VGSWEQKE+ SRDG MKL+TQVFFASIDQRSERAAGESACTALVAVIADW
Sbjct: 395 ISEFGDDMFVVGSWEQKEITSRDGQMKLETQVFFASIDQRSERAAGESACTALVAVIADW 454
Query: 468 FQNSPDLMPIKSQFDSLIREGSSEWRSMCDNETYRERFPDKHFDLETVIQAKIRPLSVVP 527
FQN D MP+KSQFD LIREGSSEWR++ DNET R RFPD HFDLETV++AKIRPLSVVP
Sbjct: 455 FQNYHDFMPLKSQFDGLIREGSSEWRNLRDNETCRNRFPDGHFDLETVLEAKIRPLSVVP 514
Query: 528 SKSFIGFFHPEGMDEEKFDILHGAMSFDNIWDEISGSGHESLSNGEPHVYIVSWNDHFFI 587
KSFIGFFHPEG+DEE+F+ L G+MSFDNIWDEIS S HE LSNG PHV+IVSWNDHFFI
Sbjct: 515 GKSFIGFFHPEGVDEERFEFLRGSMSFDNIWDEISDSEHECLSNGGPHVFIVSWNDHFFI 574
Query: 588 LKVEADCYYIIDTLGERLYEGCNQAYILKFDSSTVIHKMQNAAKSSLEDKTTSNQQTVAE 647
LKVE+DCYYIIDTLGERL+EGCNQAYILKFDS+TVIHK ++ +SS E K T +TVA+
Sbjct: 575 LKVESDCYYIIDTLGERLHEGCNQAYILKFDSNTVIHKTKSVPQSSNEKKATGGWKTVAQ 634
Query: 648 VLERDNS--KEVDSSS-GVAEQQEEEVVLCRGKEACKEYIKSFLAAIPIRELQADVKKGL 704
+LER+ K+V++ G + + +EEVV+CRGKEACKEYIKSFLAAIPIRELQAD+ K
Sbjct: 635 LLERNKKPFKDVNNGKEGDSIEDQEEVVVCRGKEACKEYIKSFLAAIPIRELQADIMKNN 694
Query: 705 VLMSSTQVHHRLQIEFHYTQLLQSC-PATPAVELEASIAATPETLALAIT 753
+ S H RLQIEFHYTQLL C TPAV EA++ A +TLALAI
Sbjct: 695 MTFSP---HQRLQIEFHYTQLLPPCLSTTPAV--EATMFAA-DTLALAIN 738
>Medtr7g106540.1 | EEIG1/EHBP1 protein amine-terminal domain protein
| HC | chr7:43373739-43371223 | 20130731
Length = 545
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/565 (62%), Positives = 407/565 (72%), Gaps = 34/565 (6%)
Query: 199 MAEKGDELSAIKAGLRKVKIFTEYVXXXXXXXXXXXXXXXXXXXXXXXXDGECNYPVXXX 258
M EKGD LSAIKAGLRKV+ FTE+V D NY +
Sbjct: 1 MTEKGD-LSAIKAGLRKVRFFTEFVSSRKSEKACRGEEGSSDSKYSRSGDNTHNYSLDFD 59
Query: 259 XXXXXXXXXXXXXXXXXXFRKSFSYGPLAYANAGGA--FCSNMRVNCDDEG-WVYYSHRM 315
SF YG LA ANAGG+ + SN R+NCD++ WVY+S+ M
Sbjct: 60 SLDDLGDSDRGNEDSSVG--NSFRYGTLASANAGGSCSYYSNTRMNCDEKDIWVYHSYCM 117
Query: 316 SDAGCLRMEDSTLSSSEPNV-QSSMRSILSWRKRKLSFRSPKKANKGEPLLKKAYAEEGG 374
DAG M++ST SSSEP + QS+ RSIL WRKRK +GEPLLKKAYAE+GG
Sbjct: 118 PDAGRSHMDESTTSSSEPYLSQSTKRSILPWRKRKF---------RGEPLLKKAYAEDGG 168
Query: 375 DDIDFDRRQLSSDESLSL--RLYKNEDDSCANRSSISEFGDDNFAVGSWEQKEVMSRDGH 432
DDIDFDRRQLSS+ S+ + +K D + ISEFGDD F VGSWEQKE+ SRDG
Sbjct: 169 DDIDFDRRQLSSESDESISSKFHKTGD------TPISEFGDDMFVVGSWEQKEITSRDGQ 222
Query: 433 MKLQTQVFFASIDQRSERAAGESACTALVAVIADWFQNSPDLMPIKSQFDSLIREGSSEW 492
MKL+TQVFFASIDQRSERAAGESACTALVAVIADWFQN D MP+KSQFD LIREGSSEW
Sbjct: 223 MKLETQVFFASIDQRSERAAGESACTALVAVIADWFQNYHDFMPLKSQFDGLIREGSSEW 282
Query: 493 RSMCDNETYRERFPDKHFDLETVIQAKIRPLSVVPSKSFIGFFHPEGMDEEKFDILHGAM 552
R++ DNET R RFPD HFDLETV++AKIRPLSVVP KSFIGFFHPEG+DEE+F+ L G+M
Sbjct: 283 RNLRDNETCRNRFPDGHFDLETVLEAKIRPLSVVPGKSFIGFFHPEGVDEERFEFLRGSM 342
Query: 553 SFDNIWDEISGSGHESLSNGEPHVYIVSWNDHFFILKVEADCYYIIDTLGERLYEGCNQA 612
SFDNIWDEIS S HE LSNG PHV+IVSWNDHFFILKVE+DCYYIIDTLGERL+EGCNQA
Sbjct: 343 SFDNIWDEISDSEHECLSNGGPHVFIVSWNDHFFILKVESDCYYIIDTLGERLHEGCNQA 402
Query: 613 YILKFDSSTVIHKMQNAAKSSLEDKTTSNQQTVAEVLERDNS--KEVDSSS-GVAEQQEE 669
YILKFDS+TVIHK ++ +SS E K T +TVA++LER+ K+V++ G + + +E
Sbjct: 403 YILKFDSNTVIHKTKSVPQSSNEKKATGGWKTVAQLLERNKKPFKDVNNGKEGDSIEDQE 462
Query: 670 EVVLCRGKEACKEYIKSFLAAIPIRELQADVKKGLVLMSSTQVHHRLQIEFHYTQLLQSC 729
EVV+CRGKEACKEYIKSFLAAIPIRELQAD+ K + S H RLQIEFHYTQLL C
Sbjct: 463 EVVVCRGKEACKEYIKSFLAAIPIRELQADIMKNNMTFSP---HQRLQIEFHYTQLLPPC 519
Query: 730 -PATPAVELEASIAATPETLALAIT 753
TPAV EA++ A +TLALAI
Sbjct: 520 LSTTPAV--EATMFAA-DTLALAIN 541
>Medtr3g107090.1 | hypothetical protein | HC |
chr3:49452455-49448631 | 20130731
Length = 619
Score = 298 bits (763), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 154/303 (50%), Positives = 206/303 (67%), Gaps = 29/303 (9%)
Query: 421 WEQKEVMSRDGHMKLQTQVFFASIDQRSERAAGESACTALVAVIADWFQNSPDLMPIKSQ 480
WE K ++SRDG KL+T VFFAS DQRSE+A+GESACT LVA+I+ W ++ + P ++Q
Sbjct: 323 WEMKYILSRDGQSKLKTNVFFASFDQRSEKASGESACTVLVALISHWLHSNQGI-PTRTQ 381
Query: 481 FDSLIREGSSEWRSMCDNETYRERFPDKHFDLETVIQAKIRPLSVVPSKSFIGFFHPEGM 540
FD+LI +GSSEWR +C N+ Y + FPDKHFDLETVI A +RPL V+P KS+ GFF P
Sbjct: 382 FDNLITQGSSEWRKLCKNDYYSKHFPDKHFDLETVIDANLRPLVVLPQKSYTGFFSP--- 438
Query: 541 DEEKFDILHGAMSFD-NIWDEISGSGHESLSNGEPHVYIVSWNDHFFILKVEADCYYIID 599
EKF L GAMSFD + + + ++ + EP +YIVSWNDHFFILKVE D YYIID
Sbjct: 439 --EKFHCLEGAMSFDEIWNEINNINNKNTVVDFEPRIYIVSWNDHFFILKVEVDAYYIID 496
Query: 600 TLGERLYEGCNQAYILKFDSSTVIHKMQNAAKSSLEDKTTSNQQTVAEVLERDNSKEVDS 659
+LGERL+EGC +A++LKFD S V++ K N++ V +K+V +
Sbjct: 497 SLGERLFEGCQRAFVLKFDDSCVMY-----------GKVDKNEEEV--------NKDVGT 537
Query: 660 SSGVAEQQEEEVVLCRGKEACKEYIKSFLAAIPIRELQADVKKGLVLMSSTQVHHRLQIE 719
S ++ E+V CRGKE CKE+IK FLAAI +R+L+ + KK V S+ +H +LQI+
Sbjct: 538 KSSSRGEESFEIV-CRGKECCKEFIKRFLAAISVRQLEKEEKKWSV--SNPYLHRQLQID 594
Query: 720 FHY 722
FH+
Sbjct: 595 FHF 597