Miyakogusa Predicted Gene
- Lj0g3v0099839.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0099839.1 Non Characterized Hit- tr|I1NI08|I1NI08_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,81.03,0,SUBFAMILY NOT
NAMED,NULL; FAMILY NOT NAMED,NULL; MHD1,Munc13 homology 1;
MHD2,Mammalian uncoordinate,CUFF.5606.1
(368 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr1g073760.1 | DUF810 family protein | HC | chr1:32731874-327... 570 e-163
Medtr1g044480.1 | DUF810 family protein | HC | chr1:16796191-167... 514 e-146
Medtr6g043090.1 | DUF810 family protein | HC | chr6:14880864-148... 440 e-124
Medtr3g109630.1 | DUF810 family protein | HC | chr3:51284444-512... 212 5e-55
Medtr1g114270.1 | DUF810 family protein | HC | chr1:51577527-515... 175 6e-44
Medtr8g040190.1 | plant/MOJ9-14 protein | HC | chr8:14975472-149... 130 2e-30
Medtr8g040190.2 | plant/MOJ9-14 protein | HC | chr8:14975590-149... 130 2e-30
>Medtr1g073760.1 | DUF810 family protein | HC | chr1:32731874-32736073
| 20130731
Length = 1016
Score = 570 bits (1469), Expect = e-163, Method: Compositional matrix adjust.
Identities = 277/370 (74%), Positives = 315/370 (85%), Gaps = 2/370 (0%)
Query: 1 MKLATTTAEEFFQVPIAITEDLVQDLADGLENLFLDYMKFVAACGSRQSYIPMLPPLTRC 60
MKL T +EFFQVPIAITEDLVQDL DGLE LFLDYMKFVA+CGS+QSYIP+LPPLTRC
Sbjct: 647 MKLVETIVDEFFQVPIAITEDLVQDLVDGLEILFLDYMKFVASCGSKQSYIPLLPPLTRC 706
Query: 61 NRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYVRLNTLHYLLTHI 120
NRDS+F KLWKKAAPCG+N + + + G E H+PR STSRGTQRLY+R+NTLHYL+T+I
Sbjct: 707 NRDSRFLKLWKKAAPCGSNNEGTYQINGTHESHNPRPSTSRGTQRLYIRINTLHYLMTNI 766
Query: 121 HSLEKSISMNPGIVPSNRLRFANNRRAQSN--SYFESVNLSILAACQHVSEVAAYRLIFL 178
HSLEK+++MNPG+VPSNRLRF NNRRAQ+N SYFES+NLSILAAC+HVSEV +YRL+FL
Sbjct: 767 HSLEKALAMNPGVVPSNRLRFTNNRRAQNNNNSYFESLNLSILAACRHVSEVTSYRLVFL 826
Query: 179 DSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKEVMKASFDALLM 238
DSS VFYD LYVG V RG I+P ++ LKQNL M+TILTDRAQPLA KE+MKASFDA LM
Sbjct: 827 DSSSVFYDRLYVGGVTRGHIRPVVRNLKQNLIQMSTILTDRAQPLATKEIMKASFDAFLM 886
Query: 239 VLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXXXXXXIALMGQN 298
VLLAGG SR+FHR DH + +DFE+LKRVFSN V+GLIA IALMGQ+
Sbjct: 887 VLLAGGTSRVFHRHDHMALRDDFENLKRVFSNNVEGLIAENVVDGEAAVVQGVIALMGQS 946
Query: 299 TEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVLCYRNDRAADQFL 358
TEQLMEDF+I CESSGIG+MGNGQKLPMPPTT KWNRADPNT+LRVLCYRND AA+QFL
Sbjct: 947 TEQLMEDFTIAACESSGIGMMGNGQKLPMPPTTCKWNRADPNTMLRVLCYRNDGAANQFL 1006
Query: 359 KRTFQLAKRR 368
KRTFQLAKRR
Sbjct: 1007 KRTFQLAKRR 1016
>Medtr1g044480.1 | DUF810 family protein | HC | chr1:16796191-16792012
| 20130731
Length = 1034
Score = 514 bits (1325), Expect = e-146, Method: Compositional matrix adjust.
Identities = 253/372 (68%), Positives = 297/372 (79%), Gaps = 4/372 (1%)
Query: 1 MKLATTTAEEFFQVPIAITEDLVQDLADGLENLFLDYMKFVAACGSRQSYIPMLPPLTRC 60
+KLA TT EEFFQ+PI ITE+LVQDLA+GLE+L DYM FVAACGS+QSYIP LP LTRC
Sbjct: 663 VKLAKTTVEEFFQIPIGITEELVQDLANGLESLLQDYMMFVAACGSKQSYIPPLPALTRC 722
Query: 61 NRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYVRLNTLHYLLTHI 120
NRDSKF+KLWK+AAPC N +L H+ G EGH+P+ STSRGTQRLY+RLNTLHYLL I
Sbjct: 723 NRDSKFSKLWKRAAPCATNLSELDHINGTNEGHNPKPSTSRGTQRLYIRLNTLHYLLVQI 782
Query: 121 HSLEKSISMNPGIVPSNRLRFANNRRAQSN---SYFESVNLSILAACQHVSEVAAYRLIF 177
SLEK +S N IVPS R F +N R QS SYFE+V S+ AACQ+VSEVAAYRLIF
Sbjct: 783 QSLEKLLSQNHCIVPSTRHSFTSNLRTQSTKSGSYFETVISSLPAACQNVSEVAAYRLIF 842
Query: 178 LDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKEVMKASFDALL 237
LDSS VFYD+LYV V +I+PAL++ KQNL+L+TT+L DRAQPLAMKEVM+ASFDA L
Sbjct: 843 LDSSSVFYDTLYVDDVANARIRPALRIAKQNLTLLTTLLVDRAQPLAMKEVMRASFDAFL 902
Query: 238 MVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXXXXXXIALMGQ 297
MVLLAGG+SR+F+RSDH +I EDFE L RVF +C +GL++ + LM Q
Sbjct: 903 MVLLAGGNSRVFNRSDHVMIQEDFESLNRVFCSCGEGLVSENVVEREAAVVKGVVGLMAQ 962
Query: 298 NTEQLMEDFSIVTCESSGIGIMG-NGQKLPMPPTTGKWNRADPNTILRVLCYRNDRAADQ 356
NTEQLMEDFSI +CE GIG+M NGQKLPMPPTTG+W+R+DPNTILRVLC+RN+RAA+
Sbjct: 963 NTEQLMEDFSIASCEKGGIGVMNMNGQKLPMPPTTGRWHRSDPNTILRVLCHRNERAANY 1022
Query: 357 FLKRTFQLAKRR 368
FLKRTF LAKRR
Sbjct: 1023 FLKRTFHLAKRR 1034
>Medtr6g043090.1 | DUF810 family protein | HC | chr6:14880864-14875572
| 20130731
Length = 1147
Score = 440 bits (1132), Expect = e-124, Method: Compositional matrix adjust.
Identities = 212/371 (57%), Positives = 271/371 (73%), Gaps = 7/371 (1%)
Query: 1 MKLATTTAEEFFQVPIAITEDLVQDLADGLENLFLDYMKFVAACGSRQSYIPMLPPLTRC 60
M LA +EFFQ+P+AITEDLVQ+L DGL +F +Y F+A CG +++YIP LPPLTRC
Sbjct: 781 MNLAKKIVQEFFQIPVAITEDLVQELVDGLHKIFREYTMFIATCGLKENYIPSLPPLTRC 840
Query: 61 NRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYVRLNTLHYLLTHI 120
NR+SKF KLWK A+PC + +D HM G +E +HP S TSRGTQRLY+RLNTLHYLL+HI
Sbjct: 841 NRNSKFHKLWKIASPCNVSCED-PHMYGIYEANHPHSCTSRGTQRLYIRLNTLHYLLSHI 899
Query: 121 HSLEKSISMNPGIVPSNRLRFANNRR---AQSNSYFESVNLSILAACQHVSEVAAYRLIF 177
L+KS+++ G+VP +R R + N + ++ SYFE+V SI+AAC+HVSEVA++RLIF
Sbjct: 900 SILDKSLTLTQGVVPCDRGRRSTNTQNTQGKTTSYFETVENSIIAACKHVSEVASHRLIF 959
Query: 178 LDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKEVMKASFDALL 237
DS+ FY+SLY G V +I AL +LK N+ LM+ ILT+RAQPL +KE+MK DA L
Sbjct: 960 FDSNSFFYESLYAGDVANARINNALIILKHNIKLMSAILTERAQPLLIKEIMKTCNDAFL 1019
Query: 238 MVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXXXXXXIALMGQ 297
+VLLAGG +RMF+ SDH I EDF+ LK+ F C + LIA I LMG
Sbjct: 1020 LVLLAGGTTRMFNESDHVSIQEDFQCLKQEFYRCGEELIAESVVDKEGEVVEGVIGLMGT 1079
Query: 298 NTEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVLCYRNDRAADQF 357
+TE+L+E+ S ++ E+ G+ NG KLPMPPTTGKWNR DPNTILRVLCYRNDR A+ F
Sbjct: 1080 STEELLENLSNLSSEN---GVNENGTKLPMPPTTGKWNRTDPNTILRVLCYRNDRVANHF 1136
Query: 358 LKRTFQLAKRR 368
LKRT+Q+AKRR
Sbjct: 1137 LKRTYQIAKRR 1147
>Medtr3g109630.1 | DUF810 family protein | HC |
chr3:51284444-51289909 | 20130731
Length = 998
Score = 212 bits (539), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 135/373 (36%), Positives = 200/373 (53%), Gaps = 25/373 (6%)
Query: 1 MKLATTTAEEFFQVPIAITEDLVQDLADGLENLFLDYM-KFVAACGSRQSYIPMLPPLTR 59
+++ T + FFQ+PI + L+ ++ G++ Y+ K + CGSR ++IP +P LTR
Sbjct: 644 LRIINETLDAFFQLPIPMHPALLPEVMHGVDRCLQYYVAKAKSGCGSRNTFIPTMPALTR 703
Query: 60 CNRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYVRLNTLHYLLTH 119
C SKF KK N Q + + A G +S G +L VR+NTL ++L
Sbjct: 704 CTIGSKFQGFGKKKDK-SPNSQK-RNSQVATNG-----DSSFGIPQLCVRINTLQWILGE 756
Query: 120 IHSLEKSISMNPGIVPSNRLRFANNRRAQ--SN---SYFESVNLSILAACQHVSEVAAYR 174
LEK I LR + + R + SN S FE + L Q + E AYR
Sbjct: 757 FDVLEKRII--------TLLRNSESAREEDFSNGLASKFELSPAACLEGIQQLCEAVAYR 808
Query: 175 LIFLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKEVMKASFD 234
++F D S V +DSLYVG ++ P L+ L++NL ++ + ++ + + E+M+ASFD
Sbjct: 809 IVFHDLSHVLWDSLYVGDPSSSRVDPFLQELERNLMFISDNVHEKIRTRIITEIMRASFD 868
Query: 235 ALLMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXXXXXXIAL 294
L VLLAGG SR F R D +II +DF+ LK +F DGL + L
Sbjct: 869 GFLFVLLAGGPSRAFSRKDSQIIEDDFKVLKELFWANGDGL-PSEIIDRFATTLRSILPL 927
Query: 295 MGQNTEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVLCYRNDRAA 354
+TE L+E F +T E+ ++P+PPT+G+W +DPNT+LRVLCYRND AA
Sbjct: 928 FRTDTESLIEQFRRITVETYKSSARS---RIPLPPTSGQWGPSDPNTLLRVLCYRNDEAA 984
Query: 355 DQFLKRTFQLAKR 367
+FLK+T+ L K+
Sbjct: 985 SKFLKKTYDLPKK 997
>Medtr1g114270.1 | DUF810 family protein | HC |
chr1:51577527-51573509 | 20130731
Length = 922
Score = 175 bits (444), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 120/369 (32%), Positives = 185/369 (50%), Gaps = 32/369 (8%)
Query: 1 MKLATTTAEEFFQVPIAITEDLVQDLADGLENLFLDY-MKFVAACGSRQSYIPMLPPLTR 59
++ T E FF +PI++ L+ +L GL+ Y +K + CG+R ++IP P LTR
Sbjct: 583 LRFIDDTLEAFFLLPISMHAVLLPELISGLDKSIQQYILKAKSGCGNRNTFIPTTPALTR 642
Query: 60 CNRDSKFTKLW-KKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYVRLNTLHYLLT 118
C+ K+ ++ KK P Q + K + SS L VR+NT+ +
Sbjct: 643 CSTKGKYHGVFRKKEKP-----QMIQRRKALVSTTNGDSSFD--VPHLCVRINTMQRIRM 695
Query: 119 HIHSLEKSISMNPGIVPSNRLRFANNRRAQSNSYFESVNLSILAACQHVSEVAAYRLIFL 178
+ LEK I N SN N A S F+ +++ + + E AY+ IF
Sbjct: 696 ELGVLEKRIVANL----SNSNSTGENDIANGVS-FKFSAAAVVEGIRQLCECIAYKAIFQ 750
Query: 179 DSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKEVMKASFDALLM 238
D V +D LYVG V +I+P L L+ L ++++ + D+ + + EVM+ASFD L+
Sbjct: 751 DLCHVLWDGLYVGEVSSTRIEPFLHELEHYLEIISSTVHDKVRTRVIIEVMRASFDGFLL 810
Query: 239 VLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXXXXXXIALMGQN 298
VLLAGG SR F D ++ EDF+ L +F + DGL A L+ +
Sbjct: 811 VLLAGGSSRAFSLQDSFVLEEDFKLLSDLFWSNGDGLPAE---------------LIKKQ 855
Query: 299 TEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVLCYRNDRAADQFL 358
+ + + FS +T E G +LP+PP KW+ +P+T+LRVLCYRND A +FL
Sbjct: 856 SATVRDQFSQLTREMYG---SSAKSRLPLPPKAEKWSPREPDTLLRVLCYRNDETAAKFL 912
Query: 359 KRTFQLAKR 367
K+ + L +
Sbjct: 913 KKNYNLPTK 921
>Medtr8g040190.1 | plant/MOJ9-14 protein | HC | chr8:14975472-14959730
| 20130731
Length = 1102
Score = 130 bits (327), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 175/370 (47%), Gaps = 30/370 (8%)
Query: 2 KLATTTAEEFF--QVPIAITEDLVQDLADGLENLFLDYMKFVA-ACGSRQSYIPMLPPLT 58
++ T ++FF +VP+ TE + + G++N Y V S++ IP +P LT
Sbjct: 755 RIVEETVDQFFGLKVPMRFTE--LNSMFRGIDNALQVYGNLVVHDLASKEDLIPPVPVLT 812
Query: 59 RCNRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYVRLNTLHYLLT 118
R ++++ K + K + + PR + T L V+LNTL+Y ++
Sbjct: 813 RYSKEAGLIKAFVKKELFDTRVLEREETR-------PREISVLTTPTLCVQLNTLYYAIS 865
Query: 119 HIHSLEKSISMNPGIVPSNR--LRFANNRRAQSNSYFESVNLSILAACQHVSEVAAYRLI 176
H++ LE SI S +R + + +++ ++ F+ I AA + + E ++I
Sbjct: 866 HLNKLEDSIWERWTHKRSQEKLIRKSIDDKSKKDT-FDGSRTVINAAMERICEYTGTKII 924
Query: 177 FLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKEVMKASFDAL 236
F D F D+LY V ++ ++ L LS + I+ + + + +++AS D L
Sbjct: 925 FCDLRVPFIDNLYKPSVSGSRVDVLIEPLDMELSQLCDIVVEPLRDRIVTSLLQASLDGL 984
Query: 237 LMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXXXXXXIALMG 296
L V+L GG SR+F D +++ ED E LK F + DGL I L G
Sbjct: 985 LRVILDGGPSRVFFPGDAKLLEEDLEALKEFFISGGDGL-PRGVVENQVARVRVVIKLHG 1043
Query: 297 QNTEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVLCYRNDRAADQ 356
T +L+ED +SG+ + G KL AD T+LR+LC+R+D A Q
Sbjct: 1044 YETRELIEDLK----SASGLEMQGGKGKLG----------ADSKTLLRILCHRSDSEASQ 1089
Query: 357 FLKRTFQLAK 366
FLK+ F++ K
Sbjct: 1090 FLKKQFKIPK 1099
>Medtr8g040190.2 | plant/MOJ9-14 protein | HC | chr8:14975590-14959726
| 20130731
Length = 1102
Score = 130 bits (327), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/370 (28%), Positives = 175/370 (47%), Gaps = 30/370 (8%)
Query: 2 KLATTTAEEFF--QVPIAITEDLVQDLADGLENLFLDYMKFVA-ACGSRQSYIPMLPPLT 58
++ T ++FF +VP+ TE + + G++N Y V S++ IP +P LT
Sbjct: 755 RIVEETVDQFFGLKVPMRFTE--LNSMFRGIDNALQVYGNLVVHDLASKEDLIPPVPVLT 812
Query: 59 RCNRDSKFTKLWKKAAPCGANFQDLHHMKGAFEGHHPRSSTSRGTQRLYVRLNTLHYLLT 118
R ++++ K + K + + PR + T L V+LNTL+Y ++
Sbjct: 813 RYSKEAGLIKAFVKKELFDTRVLEREETR-------PREISVLTTPTLCVQLNTLYYAIS 865
Query: 119 HIHSLEKSISMNPGIVPSNR--LRFANNRRAQSNSYFESVNLSILAACQHVSEVAAYRLI 176
H++ LE SI S +R + + +++ ++ F+ I AA + + E ++I
Sbjct: 866 HLNKLEDSIWERWTHKRSQEKLIRKSIDDKSKKDT-FDGSRTVINAAMERICEYTGTKII 924
Query: 177 FLDSSFVFYDSLYVGRVQRGQIKPALKVLKQNLSLMTTILTDRAQPLAMKEVMKASFDAL 236
F D F D+LY V ++ ++ L LS + I+ + + + +++AS D L
Sbjct: 925 FCDLRVPFIDNLYKPSVSGSRVDVLIEPLDMELSQLCDIVVEPLRDRIVTSLLQASLDGL 984
Query: 237 LMVLLAGGHSRMFHRSDHEIIYEDFEHLKRVFSNCVDGLIAXXXXXXXXXXXXXXIALMG 296
L V+L GG SR+F D +++ ED E LK F + DGL I L G
Sbjct: 985 LRVILDGGPSRVFFPGDAKLLEEDLEALKEFFISGGDGL-PRGVVENQVARVRVVIKLHG 1043
Query: 297 QNTEQLMEDFSIVTCESSGIGIMGNGQKLPMPPTTGKWNRADPNTILRVLCYRNDRAADQ 356
T +L+ED +SG+ + G KL AD T+LR+LC+R+D A Q
Sbjct: 1044 YETRELIEDLK----SASGLEMQGGKGKLG----------ADSKTLLRILCHRSDSEASQ 1089
Query: 357 FLKRTFQLAK 366
FLK+ F++ K
Sbjct: 1090 FLKKQFKIPK 1099