Miyakogusa Predicted Gene
- Lj5g3v1697640.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1697640.1 tr|C1EEU2|C1EEU2_MICSR Predicted protein
OS=Micromonas sp. (strain RCC299 / NOUM17)
GN=MICPUN_106293,24.46,1e-17,FAMILY NOT NAMED,NULL;
seg,NULL,CUFF.55759.1
(533 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr1g056180.1 | EEIG1/EHBP1 protein amine-terminal domain prot... 447 e-125
Medtr7g106540.2 | EEIG1/EHBP1 protein amine-terminal domain prot... 359 3e-99
Medtr7g106540.1 | EEIG1/EHBP1 protein amine-terminal domain prot... 358 5e-99
Medtr3g107090.1 | hypothetical protein | HC | chr3:49452455-4944... 249 6e-66
>Medtr1g056180.1 | EEIG1/EHBP1 protein amine-terminal domain protein
| HC | chr1:24589156-24585076 | 20130731
Length = 753
Score = 447 bits (1150), Expect = e-125, Method: Compositional matrix adjust.
Identities = 257/536 (47%), Positives = 338/536 (63%), Gaps = 43/536 (8%)
Query: 34 EFSTLKAGLRRVKTFTDYVSTRRAKKASYKDEGSDGRSSTRSEDSEYKYTSDLDSQDN-- 91
E ST+KAGLR+VK T++VSTR+++K S ++EGS+G S RSED EY Y D DS D+
Sbjct: 202 EVSTIKAGLRKVKILTEFVSTRKSRKPSREEEGSEGNYSARSEDGEYNYPFDSDSLDDFE 261
Query: 92 DDVNKSEERDEDSCVRHSMSYETLTCGNNVGESPYTSSTFNGKNEFKLNSGSQNSYFGDA 151
+ + SEE EDS VR S SY L N G S Y+S G +E + + S ++
Sbjct: 262 EGDSHSEEVKEDSSVRKSFSYGKLAFAN-AGGSFYSSMRVKGDDEDWVYYSNHKSDV-ES 319
Query: 152 LVENYNTCDQVEYHI---SKYRILSWRKRKLQFRSRSFKVKGELLLKKHXXXXXXXXXXX 208
L + + E ++ S+ +L WRKRKL FRS K KGE LLKK
Sbjct: 320 LPKEDSIVSSSEPYVAQSSRRSLLPWRKRKLSFRSP--KSKGEPLLKKAYGEEGGDDIDF 377
Query: 209 XRRVLSSSDEYTCQKWHKTQENITTTQSSISGFGENNFTVGTWEHKEVISQDGLMKLHTE 268
RR LSS DE HK +++ ++S+S FG++NF VG+WE KEV+S+DG MKL T+
Sbjct: 378 DRRQLSS-DESISFGSHKAEDDSGANRTSVSEFGDDNFAVGSWEQKEVMSRDGHMKLQTQ 436
Query: 269 IFFASIDQRSECAAGESACAVLVVLIVDWLKLNQAEIPIKCEFDSLIKDGSSEWRIICEN 328
+FFASIDQRSE AAGESAC LV +I DW + N+ +PIK +FDSLI+DGS EWR +CEN
Sbjct: 437 VFFASIDQRSERAAGESACTALVAVIADWFQNNRDLMPIKSQFDSLIRDGSLEWRNLCEN 496
Query: 329 KDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYVXXXXXXXXXXXXXXXXXXXAMSFDSI 388
+ Y + FPDKHFDL+TV +AK R +SVVPEKS++ AMSFD+I
Sbjct: 497 QTYRERFPDKHFDLETVVQAKIRPLSVVPEKSFI-GFFHPEGMDEGRFDFLHGAMSFDNI 555
Query: 389 WEEISHCASELHLFSEAIVYIVSWNDHFFVLKVQHDANYIIDTLGERLHEGCNQAYILKF 448
W+EIS+ ++ +E ++I+SWNDHFF+LKV+ D+ IIDTLGERL+EGCNQAYILKF
Sbjct: 556 WDEISNAGNDSTCNNEPRIFIISWNDHFFILKVEADSYCIIDTLGERLYEGCNQAYILKF 615
Query: 449 DANTRIEKLCN------ENQVLDAKPPSD----------EVNDV---------------- 476
D+NT I K+ N EN + + +D ++ND
Sbjct: 616 DSNTVIYKMPNVTQSSVENTTGEQQTVADVLEHNDRQVQQINDKELESGAEAGDQSKSER 675
Query: 477 RKKEIICRGKESCKEYIKRFLASIPVRELQDDVKRGLKASMPLHRRLQIEFHYTTL 532
+ E++C+GKE+CKEYIK FLA+IP+RELQ DVK+GL +S PLH RLQIEFHYT L
Sbjct: 676 EEDEVLCKGKEACKEYIKSFLAAIPIRELQADVKKGLISSTPLHHRLQIEFHYTQL 731
>Medtr7g106540.2 | EEIG1/EHBP1 protein amine-terminal domain protein
| HC | chr7:43374781-43371223 | 20130731
Length = 742
Score = 359 bits (922), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 227/536 (42%), Positives = 304/536 (56%), Gaps = 63/536 (11%)
Query: 34 EFSTLKAGLRRVKTFTDYVSTRRAKKASYKDEGSDGRSSTRSEDSEYKYTSDLDSQDNDD 93
+ S +KAGLR+V+ FT++VS+R+++KA +EGS +RS D+ + Y+ LD DD
Sbjct: 203 DLSAIKAGLRKVRFFTEFVSSRKSEKACRGEEGSSDSKYSRSGDNTHNYS--LDFDSLDD 260
Query: 94 VNKSEERDEDSCVRHSMSYETLTCGNNVGESPYTSSTFNGKNEFKLNSGSQNSYF----- 148
+ S+ +EDS V +S Y TL N G Y S+T ++N ++ +
Sbjct: 261 LGDSDRGNEDSSVGNSFRYGTLASANAGGSCSYYSNT-------RMNCDEKDIWVYHSYC 313
Query: 149 ----GDALVENYNTCDQVEY--HISKYRILSWRKRKLQFRSRSFKVKGELLLKKHXXXXX 202
G + ++ T Y +K IL WRKRK +GE LLKK
Sbjct: 314 MPDAGRSHMDESTTSSSEPYLSQSTKRSILPWRKRKF---------RGEPLLKKAYAEDG 364
Query: 203 XXXXXXXRRVLSSSDEYTCQ-KWHKTQENITTTQSSISGFGENNFTVGTWEHKEVISQDG 261
RR LSS + + K+HKT + + IS FG++ F VG+WE KE+ S+DG
Sbjct: 365 GDDIDFDRRQLSSESDESISSKFHKTGD------TPISEFGDDMFVVGSWEQKEITSRDG 418
Query: 262 LMKLHTEIFFASIDQRSECAAGESACAVLVVLIVDWLKLNQAEIPIKCEFDSLIKDGSSE 321
MKL T++FFASIDQRSE AAGESAC LV +I DW + +P+K +FD LI++GSSE
Sbjct: 419 QMKLETQVFFASIDQRSERAAGESACTALVAVIADWFQNYHDFMPLKSQFDGLIREGSSE 478
Query: 322 WRIICENKDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYVXXXXXXXXXXXXXXXXXXX 381
WR + +N+ FPD HFDL+TV AK R +SVVP KS++
Sbjct: 479 WRNLRDNETCRNRFPDGHFDLETVLEAKIRPLSVVPGKSFI-GFFHPEGVDEERFEFLRG 537
Query: 382 AMSFDSIWEEISHCASELHLFSEAIVYIVSWNDHFFVLKVQHDANYIIDTLGERLHEGCN 441
+MSFD+IW+EIS E V+IVSWNDHFF+LKV+ D YIIDTLGERLHEGCN
Sbjct: 538 SMSFDNIWDEISDSEHECLSNGGPHVFIVSWNDHFFILKVESDCYYIIDTLGERLHEGCN 597
Query: 442 QAYILKFDANTRIEKL------CNENQVLDA-----------KPPSDEVNDVRKKE---- 480
QAYILKFD+NT I K NE + K P +VN+ ++ +
Sbjct: 598 QAYILKFDSNTVIHKTKSVPQSSNEKKATGGWKTVAQLLERNKKPFKDVNNGKEGDSIED 657
Query: 481 ----IICRGKESCKEYIKRFLASIPVRELQDDVKRGLKASMPLHRRLQIEFHYTTL 532
++CRGKE+CKEYIK FLA+IP+RELQ D+ + P H+RLQIEFHYT L
Sbjct: 658 QEEVVVCRGKEACKEYIKSFLAAIPIRELQADIMKNNMTFSP-HQRLQIEFHYTQL 712
>Medtr7g106540.1 | EEIG1/EHBP1 protein amine-terminal domain protein
| HC | chr7:43373739-43371223 | 20130731
Length = 545
Score = 358 bits (920), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 227/536 (42%), Positives = 304/536 (56%), Gaps = 63/536 (11%)
Query: 34 EFSTLKAGLRRVKTFTDYVSTRRAKKASYKDEGSDGRSSTRSEDSEYKYTSDLDSQDNDD 93
+ S +KAGLR+V+ FT++VS+R+++KA +EGS +RS D+ + Y+ LD DD
Sbjct: 6 DLSAIKAGLRKVRFFTEFVSSRKSEKACRGEEGSSDSKYSRSGDNTHNYS--LDFDSLDD 63
Query: 94 VNKSEERDEDSCVRHSMSYETLTCGNNVGESPYTSSTFNGKNEFKLNSGSQNSYF----- 148
+ S+ +EDS V +S Y TL N G Y S+T ++N ++ +
Sbjct: 64 LGDSDRGNEDSSVGNSFRYGTLASANAGGSCSYYSNT-------RMNCDEKDIWVYHSYC 116
Query: 149 ----GDALVENYNTCDQVEY--HISKYRILSWRKRKLQFRSRSFKVKGELLLKKHXXXXX 202
G + ++ T Y +K IL WRKRK +GE LLKK
Sbjct: 117 MPDAGRSHMDESTTSSSEPYLSQSTKRSILPWRKRKF---------RGEPLLKKAYAEDG 167
Query: 203 XXXXXXXRRVLSSSDEYTCQ-KWHKTQENITTTQSSISGFGENNFTVGTWEHKEVISQDG 261
RR LSS + + K+HKT + + IS FG++ F VG+WE KE+ S+DG
Sbjct: 168 GDDIDFDRRQLSSESDESISSKFHKTGD------TPISEFGDDMFVVGSWEQKEITSRDG 221
Query: 262 LMKLHTEIFFASIDQRSECAAGESACAVLVVLIVDWLKLNQAEIPIKCEFDSLIKDGSSE 321
MKL T++FFASIDQRSE AAGESAC LV +I DW + +P+K +FD LI++GSSE
Sbjct: 222 QMKLETQVFFASIDQRSERAAGESACTALVAVIADWFQNYHDFMPLKSQFDGLIREGSSE 281
Query: 322 WRIICENKDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYVXXXXXXXXXXXXXXXXXXX 381
WR + +N+ FPD HFDL+TV AK R +SVVP KS++
Sbjct: 282 WRNLRDNETCRNRFPDGHFDLETVLEAKIRPLSVVPGKSFI-GFFHPEGVDEERFEFLRG 340
Query: 382 AMSFDSIWEEISHCASELHLFSEAIVYIVSWNDHFFVLKVQHDANYIIDTLGERLHEGCN 441
+MSFD+IW+EIS E V+IVSWNDHFF+LKV+ D YIIDTLGERLHEGCN
Sbjct: 341 SMSFDNIWDEISDSEHECLSNGGPHVFIVSWNDHFFILKVESDCYYIIDTLGERLHEGCN 400
Query: 442 QAYILKFDANTRIEKL------CNENQVLDA-----------KPPSDEVNDVRKKE---- 480
QAYILKFD+NT I K NE + K P +VN+ ++ +
Sbjct: 401 QAYILKFDSNTVIHKTKSVPQSSNEKKATGGWKTVAQLLERNKKPFKDVNNGKEGDSIED 460
Query: 481 ----IICRGKESCKEYIKRFLASIPVRELQDDVKRGLKASMPLHRRLQIEFHYTTL 532
++CRGKE+CKEYIK FLA+IP+RELQ D+ + P H+RLQIEFHYT L
Sbjct: 461 QEEVVVCRGKEACKEYIKSFLAAIPIRELQADIMKNNMTFSP-HQRLQIEFHYTQL 515
>Medtr3g107090.1 | hypothetical protein | HC |
chr3:49452455-49448631 | 20130731
Length = 619
Score = 249 bits (635), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 144/290 (49%), Positives = 189/290 (65%), Gaps = 26/290 (8%)
Query: 251 WEHKEVISQDGLMKLHTEIFFASIDQRSECAAGESACAVLVVLIVDWLKLNQAEIPIKCE 310
WE K ++S+DG KL T +FFAS DQRSE A+GESAC VLV LI WL NQ IP + +
Sbjct: 323 WEMKYILSRDGQSKLKTNVFFASFDQRSEKASGESACTVLVALISHWLHSNQG-IPTRTQ 381
Query: 311 FDSLIKDGSSEWRIICENKDYMKNFPDKHFDLDTVFRAKTRRVSVVPEKSYVXXXXXXXX 370
FD+LI GSSEWR +C+N Y K+FPDKHFDL+TV A R + V+P+KSY
Sbjct: 382 FDNLITQGSSEWRKLCKNDYYSKHFPDKHFDLETVIDANLRPLVVLPQKSYTGFFSPEKF 441
Query: 371 XXXXXXXXXXXAMSFDSIWEEISHCASELHLFS-EAIVYIVSWNDHFFVLKVQHDANYII 429
AMSFD IW EI++ ++ + E +YIVSWNDHFF+LKV+ DA YII
Sbjct: 442 HCLEG------AMSFDEIWNEINNINNKNTVVDFEPRIYIVSWNDHFFILKVEVDAYYII 495
Query: 430 DTLGERLHEGCNQAYILKFDANTRIEKLCNENQVLDAKPPSDEVN-DVRKK--------- 479
D+LGERL EGC +A++LKFD + + ++N+ +EVN DV K
Sbjct: 496 DSLGERLFEGCQRAFVLKFDDSCVMYGKVDKNE--------EEVNKDVGTKSSSRGEESF 547
Query: 480 EIICRGKESCKEYIKRFLASIPVRELQDDVKRGLKASMPLHRRLQIEFHY 529
EI+CRGKE CKE+IKRFLA+I VR+L+ + K+ ++ LHR+LQI+FH+
Sbjct: 548 EIVCRGKECCKEFIKRFLAAISVRQLEKEEKKWSVSNPYLHRQLQIDFHF 597