Miyakogusa Predicted Gene
- Lj1g3v3183520.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v3183520.1 Non Characterized Hit- tr|I1N524|I1N524_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.19692 PE,88.03,0,no
description,Glycoside hydrolase, catalytic domain;
GLHYDRLASE20,Beta-hexosaminidase subunit alpha,CUFF.30216.1
(297 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr7g023060.2 | glycoside hydrolase family 20 domain protein |... 526 e-150
Medtr7g023060.1 | glycoside hydrolase family 20 domain protein |... 526 e-150
Medtr1g115475.1 | glycoside hydrolase family 20 domain protein |... 518 e-147
Medtr2g062560.1 | glycoside hydrolase family 20 domain protein |... 136 2e-32
>Medtr7g023060.2 | glycoside hydrolase family 20 domain protein | HC
| chr7:7511429-7521878 | 20130731
Length = 558
Score = 526 bits (1356), Expect = e-150, Method: Compositional matrix adjust.
Identities = 240/284 (84%), Positives = 264/284 (92%)
Query: 14 IYSFAKMRGINVMAEVDVPGHAESWGAGYPDLWPSPSCREPLDVSKTFTFDVLSGILTDL 73
I +FAKMRGINVM EVDVPGHAESWGAGYPDLWPSPSC+EPLDVSK FTFDV+SGIL+D+
Sbjct: 275 IVNFAKMRGINVMPEVDVPGHAESWGAGYPDLWPSPSCKEPLDVSKNFTFDVISGILSDM 334
Query: 74 RKIFPFELFHLGGDEVHTDCWSNTSHIKEWLQTHNMTAKDAYQYFVLKAQEIALSKNWSP 133
RKIFPFELFHLGGDEVHTDCW+NTSH+KEWLQ+HNMT KDAY+YFVLKAQ+IALSK W+P
Sbjct: 335 RKIFPFELFHLGGDEVHTDCWTNTSHVKEWLQSHNMTTKDAYEYFVLKAQDIALSKKWTP 394
Query: 134 VNWEETFNTFPTKLNPKTVVHNWLGPGVCPKVVAKGFRCIFSNQGVWYLDHLDVPWDKVY 193
VNWEETFNTFP+KL+P+TVVHNWL GVC K VAKGFRCIFSNQGVWYLDHLDVPWD+VY
Sbjct: 395 VNWEETFNTFPSKLHPETVVHNWLVSGVCAKAVAKGFRCIFSNQGVWYLDHLDVPWDEVY 454
Query: 194 TAEPLEGIHNASEQKLVLGGEVCMWAETADTSDVQQTIWPRAAAAAERLWSQRDSTSMRN 253
TA+PLE IH SE+KL+LGGEVCMW ETAD S+VQQTIWPRAAAAAER+WS+RD TS RN
Sbjct: 455 TADPLEFIHKESEEKLILGGEVCMWGETADASNVQQTIWPRAAAAAERMWSERDFTSTRN 514
Query: 254 ATLTALPRLQNFRCLLNSRGVAAAPVTNFYARRAPTGPGSCYEQ 297
ATLTALPRLQ+FRCLLN RGV AAPVTN+YARRAP G GSCY+Q
Sbjct: 515 ATLTALPRLQHFRCLLNRRGVPAAPVTNYYARRAPDGTGSCYDQ 558
>Medtr7g023060.1 | glycoside hydrolase family 20 domain protein | HC
| chr7:7511429-7521769 | 20130731
Length = 558
Score = 526 bits (1356), Expect = e-150, Method: Compositional matrix adjust.
Identities = 240/284 (84%), Positives = 264/284 (92%)
Query: 14 IYSFAKMRGINVMAEVDVPGHAESWGAGYPDLWPSPSCREPLDVSKTFTFDVLSGILTDL 73
I +FAKMRGINVM EVDVPGHAESWGAGYPDLWPSPSC+EPLDVSK FTFDV+SGIL+D+
Sbjct: 275 IVNFAKMRGINVMPEVDVPGHAESWGAGYPDLWPSPSCKEPLDVSKNFTFDVISGILSDM 334
Query: 74 RKIFPFELFHLGGDEVHTDCWSNTSHIKEWLQTHNMTAKDAYQYFVLKAQEIALSKNWSP 133
RKIFPFELFHLGGDEVHTDCW+NTSH+KEWLQ+HNMT KDAY+YFVLKAQ+IALSK W+P
Sbjct: 335 RKIFPFELFHLGGDEVHTDCWTNTSHVKEWLQSHNMTTKDAYEYFVLKAQDIALSKKWTP 394
Query: 134 VNWEETFNTFPTKLNPKTVVHNWLGPGVCPKVVAKGFRCIFSNQGVWYLDHLDVPWDKVY 193
VNWEETFNTFP+KL+P+TVVHNWL GVC K VAKGFRCIFSNQGVWYLDHLDVPWD+VY
Sbjct: 395 VNWEETFNTFPSKLHPETVVHNWLVSGVCAKAVAKGFRCIFSNQGVWYLDHLDVPWDEVY 454
Query: 194 TAEPLEGIHNASEQKLVLGGEVCMWAETADTSDVQQTIWPRAAAAAERLWSQRDSTSMRN 253
TA+PLE IH SE+KL+LGGEVCMW ETAD S+VQQTIWPRAAAAAER+WS+RD TS RN
Sbjct: 455 TADPLEFIHKESEEKLILGGEVCMWGETADASNVQQTIWPRAAAAAERMWSERDFTSTRN 514
Query: 254 ATLTALPRLQNFRCLLNSRGVAAAPVTNFYARRAPTGPGSCYEQ 297
ATLTALPRLQ+FRCLLN RGV AAPVTN+YARRAP G GSCY+Q
Sbjct: 515 ATLTALPRLQHFRCLLNRRGVPAAPVTNYYARRAPDGTGSCYDQ 558
>Medtr1g115475.1 | glycoside hydrolase family 20 domain protein | HC
| chr1:52161852-52155998 | 20130731
Length = 536
Score = 518 bits (1333), Expect = e-147, Method: Compositional matrix adjust.
Identities = 238/284 (83%), Positives = 257/284 (90%)
Query: 14 IYSFAKMRGINVMAEVDVPGHAESWGAGYPDLWPSPSCREPLDVSKTFTFDVLSGILTDL 73
I +F+KMRGINVMAEVDVPGHAESWG GYPDLWPSP+C+ PLDVSK FTFDVLSGI+TD+
Sbjct: 253 IVNFSKMRGINVMAEVDVPGHAESWGVGYPDLWPSPTCKSPLDVSKKFTFDVLSGIMTDI 312
Query: 74 RKIFPFELFHLGGDEVHTDCWSNTSHIKEWLQTHNMTAKDAYQYFVLKAQEIALSKNWSP 133
RKIFPFELFHLGGDEV+TDCW+NT+ + +WLQ H M A DAYQYFVLKAQ +A+SKNWSP
Sbjct: 313 RKIFPFELFHLGGDEVNTDCWTNTTQVNKWLQNHKMAANDAYQYFVLKAQNMAISKNWSP 372
Query: 134 VNWEETFNTFPTKLNPKTVVHNWLGPGVCPKVVAKGFRCIFSNQGVWYLDHLDVPWDKVY 193
VNWEETFNTFPTKL+P+TVVHNWLGPGVCPKVVAKG RCIFSNQGVWYLDH+DVPWD VY
Sbjct: 373 VNWEETFNTFPTKLHPRTVVHNWLGPGVCPKVVAKGLRCIFSNQGVWYLDHVDVPWDVVY 432
Query: 194 TAEPLEGIHNASEQKLVLGGEVCMWAETADTSDVQQTIWPRAAAAAERLWSQRDSTSMRN 253
AEPLEGIH ASEQKLVLGGEVCMWAE ADTSDVQQTIWPRAAAAAERLWS+R TS RN
Sbjct: 433 NAEPLEGIHEASEQKLVLGGEVCMWAERADTSDVQQTIWPRAAAAAERLWSERQYTSGRN 492
Query: 254 ATLTALPRLQNFRCLLNSRGVAAAPVTNFYARRAPTGPGSCYEQ 297
+ TAL RLQ FRCLLN RGV AAPVTNFYAR AP GPGSC+EQ
Sbjct: 493 SNSTALSRLQYFRCLLNRRGVPAAPVTNFYARTAPDGPGSCFEQ 536
>Medtr2g062560.1 | glycoside hydrolase family 20 domain protein | HC
| chr2:26410129-26414001 | 20130731
Length = 568
Score = 136 bits (343), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 98/335 (29%), Positives = 146/335 (43%), Gaps = 60/335 (17%)
Query: 14 IYSFAKMRGINVMAEVDVPGHAESWGAGYPDL--------------WPSPSCREP----L 55
+ F RG+ V+ E+D PGH SW YPD+ WP EP L
Sbjct: 234 VVEFGLDRGVRVIPEIDAPGHTGSWALAYPDIVACANMFWWPAGSDWPDRLAAEPGTGHL 293
Query: 56 DVSKTFTFDVLSGILTDLRKIFPFELFHLGGDEVHTDCWSNTSHIKEWLQTHNMTAKDAY 115
+ T+ VL ++ D+ +FP + +H G DEV CW I+++L ++N T
Sbjct: 294 NPLNPKTYQVLKNVIRDVTTLFPEQFYHSGADEVVPGCWKTDPTIQKFL-SNNGTLSQVL 352
Query: 116 QYFVLKAQEIALSKNWSPVNWEETFNT----FPTKLNPK--TVVHNW-LGPGVCPKVVAK 168
+ F+ LS N + V WE+ P+ + PK ++ W G ++V+
Sbjct: 353 ETFINNTLPFILSLNRTVVYWEDVLLDDTVHVPSTILPKEHVILQTWNNGHNNTKRIVSS 412
Query: 169 GFRCIFSNQGVWYLD--HLDV--------------------------PWDKVYTAEPLEG 200
G+R I S+ +YLD H D W +Y + G
Sbjct: 413 GYRAIVSSSDFYYLDCGHGDFTGNNSIYDNQTGSDKNDGGSWCGPFKTWQNIYNYDITYG 472
Query: 201 IHNASEQKLVLGGEVCMWAETADTSDVQQTIWPRAAAAAERLWS-QRDSTSMRNATLTAL 259
+ E KLVLGGEV +W+E AD + + +WPR +A AE LWS RD ++ A
Sbjct: 473 L-TEEEAKLVLGGEVALWSEQADETVLDSRLWPRTSAMAESLWSGNRDEKGLKRYA-EAT 530
Query: 260 PRLQNFRCLLNSRGVAAAPVTNFYARRAPTGPGSC 294
RL +R + SRG+ A P+ + R PG C
Sbjct: 531 DRLNEWRSRMVSRGIGAEPIQPLWCVR---NPGMC 562