Miyakogusa Predicted Gene
- chr1.CM0017.200.nc
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr1.CM0017.200.nc + phase: 0
(322 letters)
Database: Medicago_aa2.0
38,834 sequences; 10,231,785 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
IMGA|CT025534_17.4 Glycoside hydrolase, family 19; Chitin-bindin... 488 e-138
IMGA|AC148763_10.4 Glycoside hydrolase, family 19; Chitin-bindin... 381 e-106
IMGA|AC169517_10.5 Glycoside hydrolase, family 19; Chitin-bindin... 381 e-106
IMGA|AC148763_19.4 Glycoside hydrolase, family 19; Chitin-bindin... 358 2e-99
IMGA|AC139745_14.5 Glycoside hydrolase, family 19 chr07_pseudomo... 303 7e-83
IMGA|AC160516_8.4 Glycoside hydrolase, family 19 chr07_pseudomol... 190 7e-49
IMGA|AC126778_12.4 Glycoside hydrolase, family 19 chr02_pseudomo... 179 2e-45
IMGA|AC137554_23.4 Glycoside hydrolase, family 19; Chitin-bindin... 178 3e-45
IMGA|AC137554_43.4 Glycoside hydrolase, family 19; Chitin-bindin... 164 3e-41
>IMGA|CT025534_17.4 Glycoside hydrolase, family 19; Chitin-binding,
type 1 chr03_pseudomolecule_IMGAG_V2 32708700-32707178 F
EGN_Mt071002 20080227
Length = 325
Score = 488 bits (1255), Expect = e-138, Method: Compositional matrix adjust.
Identities = 234/312 (75%), Positives = 259/312 (83%), Gaps = 3/312 (0%)
Query: 2 IKMKMRLGIAILVSFILVGWCRGEQCGSQAGGALCPGGICCSKYGWCGSTSEYXXXXXXX 61
+ M++ L + V +++G EQCG QAGGALCPGG+CCSK+GWCGST +Y
Sbjct: 1 MMMRLALVVTTAVLLVIIGCSFAEQCGKQAGGALCPGGLCCSKFGWCGSTGDYCGDGCQS 60
Query: 62 XXXXXXXXXXXXXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTS 121
+ISRDTFN MLKHRDD GC K YTYDAFISAAKA+P+F GDT+
Sbjct: 61 QCSGSSGDLGS--LISRDTFNNMLKHRDDSGCQGKRLYTYDAFISAAKAFPNFANNGDTA 118
Query: 122 TRKREIAAFFGQTSHETTGGWATAPDGPYAWGYCFVREQNPSA-YCSPSSQWPCASGKQY 180
T+KREIAAF GQTSHETTGGWATAPDGPYAWGYCFVREQNPS+ YC PSS++PCASGKQY
Sbjct: 119 TKKREIAAFLGQTSHETTGGWATAPDGPYAWGYCFVREQNPSSTYCQPSSEFPCASGKQY 178
Query: 181 YGRGPIQITWNYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDV 240
YGRGPIQI+WNYNYGQCGRAIG DLLNNPD VATD VISFKTA+WFWMT QSPKPSCHDV
Sbjct: 179 YGRGPIQISWNYNYGQCGRAIGVDLLNNPDLVATDPVISFKTALWFWMTPQSPKPSCHDV 238
Query: 241 ITGRWSPSSADQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYG 300
ITGRWSPSSAD+AAGR+ GYGTVTNIINGGLECG+GQD RVQDRIGFYKRYCD+LGVGYG
Sbjct: 239 ITGRWSPSSADRAAGRLPGYGTVTNIINGGLECGRGQDGRVQDRIGFYKRYCDILGVGYG 298
Query: 301 NNLDCASQRPFG 312
+NLDC SQRPFG
Sbjct: 299 DNLDCFSQRPFG 310
>IMGA|AC148763_10.4 Glycoside hydrolase, family 19; Chitin-binding,
type 1 chr08_pseudomolecule_IMGAG_V2 18118056-18115035 E
EGN_Mt071002 20080227
Length = 320
Score = 381 bits (979), Expect = e-106, Method: Compositional matrix adjust.
Identities = 183/300 (61%), Positives = 221/300 (73%), Gaps = 12/300 (4%)
Query: 23 RGEQCGSQAGGALCPGGICCSKYGWCGSTSEY----------XXXXXXXXXXXXXXXXXX 72
+ EQCGSQA GALCP G+CCSK+G+CG+T +Y
Sbjct: 21 KAEQCGSQANGALCPNGLCCSKFGFCGNTDQYCGDGCQSQCKSSPTPNPPTPSTGGGGDV 80
Query: 73 XXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTSTRKREIAAFFG 132
II F+ MLK+R+D C GFYTYD FI+AA+++ FGTTGD +TRK+E+AAF
Sbjct: 81 GSIIPSSLFDQMLKYRNDQRCAGHGFYTYDGFIAAARSFNGFGTTGDDATRKKELAAFLA 140
Query: 133 QTSHETTGGWATAPDGPYAWGYCFVREQNPSA-YCSPSSQWPCASGKQYYGRGPIQITWN 191
QTSHETTGGW +APDGPYAWGYCFV E++ +CSP WPCA GK+YYGRGPIQ+T N
Sbjct: 141 QTSHETTGGWPSAPDGPYAWGYCFVTEKDAQGDFCSPGD-WPCAPGKRYYGRGPIQLTHN 199
Query: 192 YNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDVITGRWSPSSAD 251
YNYGQ G+AI DL+NNPD V+T+ +SFKTAIWFWMT Q+ KPS HDVITGRW+PS+AD
Sbjct: 200 YNYGQAGKAINEDLINNPDLVSTNPTVSFKTAIWFWMTPQANKPSSHDVITGRWTPSAAD 259
Query: 252 QAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYGNNLDCASQRPF 311
+AGRV GYG +TNIINGGLECG GQD +V+DR+GFY+RYC +LGV GNNLDC +QRPF
Sbjct: 260 SSAGRVPGYGVITNIINGGLECGHGQDPKVEDRVGFYRRYCQILGVNPGNNLDCNNQRPF 319
>IMGA|AC169517_10.5 Glycoside hydrolase, family 19; Chitin-binding,
type 1 chr08_pseudomolecule_IMGAG_V2 18103212-18099801 E
EGN_Mt071002 20080227
Length = 320
Score = 381 bits (978), Expect = e-106, Method: Compositional matrix adjust.
Identities = 186/317 (58%), Positives = 231/317 (72%), Gaps = 13/317 (4%)
Query: 5 KMRLGIAILVSFILVGWCRGEQCGSQAGGALCPGGICCSKYGWCGSTSEY---------X 55
K+ I L++F L + +QCG QA GA+C +CCS++G+CG+T++Y
Sbjct: 6 KLSCLILCLLAFFLGS--KAQQCGRQANGAVCANRLCCSQFGYCGNTADYCGAGCQSQCT 63
Query: 56 XXXXXXXXXXXXXXXXXXXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFG 115
+IS F+ MLK+R+D C A+GFY+YD+FI+AA+++ FG
Sbjct: 64 SNPTPTPTTPTPSGGDVGSLISSSMFDEMLKYRNDPRCAARGFYSYDSFITAARSFNGFG 123
Query: 116 TTGDTSTRKREIAAFFGQTSHETTGGWATAPDGPYAWGYCFVREQN-PSAYCSPSSQWPC 174
TTGD +TRKRE+AAF GQTSHETTGGW TAPDGPYAWGYCFV E+N PS YCSP + WPC
Sbjct: 124 TTGDENTRKREVAAFLGQTSHETTGGWPTAPDGPYAWGYCFVNERNPPSDYCSPGT-WPC 182
Query: 175 ASGKQYYGRGPIQITWNYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPK 234
A GK+YYGRGPIQ+T NYNYG GRAI DL+NNPD V+++ +SF+TA+WFWMT Q K
Sbjct: 183 APGKRYYGRGPIQLTHNYNYGPAGRAINQDLINNPDLVSSNPSVSFRTALWFWMTPQGNK 242
Query: 235 PSCHDVITGRWSPSSADQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDL 294
PS HDVITGRW+PS AD++A RV GYG +TNIINGGLECG+GQD RV+DRIGFYKRYC L
Sbjct: 243 PSSHDVITGRWTPSDADRSARRVPGYGVITNIINGGLECGRGQDPRVEDRIGFYKRYCQL 302
Query: 295 LGVGYGNNLDCASQRPF 311
L G+NLDC +QRPF
Sbjct: 303 LRTTTGDNLDCYNQRPF 319
>IMGA|AC148763_19.4 Glycoside hydrolase, family 19; Chitin-binding,
type 1 chr08_pseudomolecule_IMGAG_V2 18113090-18110769 E
EGN_Mt071002 20080227
Length = 309
Score = 358 bits (919), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 173/301 (57%), Positives = 210/301 (69%), Gaps = 25/301 (8%)
Query: 23 RGEQCGSQAGGALCPGGICCSKYGWCGSTSEY------------XXXXXXXXXXXXXXXX 70
+ EQCGSQA A+CP G+CCSK+GWCG+T +Y
Sbjct: 21 KAEQCGSQANRAVCPNGLCCSKFGWCGTTDQYCGAGCQSQCRSSSTPTPSTPTPGTGGGG 80
Query: 71 XXXXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTSTRKREIAAF 130
++ F+ MLK+R+D CP GFYTYD FI+A +++ FGTTGD +TRKRE+AAF
Sbjct: 81 DVGRLVPSFLFDQMLKYRNDARCPGHGFYTYDGFIAATRSFNGFGTTGDDTTRKRELAAF 140
Query: 131 FGQTSHETTGGWATAPDGPYAWGYCFVREQNPSAYCSPSSQWPCASGKQYYGRGPIQITW 190
QTSHETTGGW++APDGPYAWGYCFV E+N A K+YYGRGPIQ+T
Sbjct: 141 LAQTSHETTGGWSSAPDGPYAWGYCFVNERN-------------AQEKRYYGRGPIQLTH 187
Query: 191 NYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDVITGRWSPSSA 250
+YNYGQ G+AI DL+NNPD V+T+ +SFKTAIWFWMT Q KPS HDVI GRW+PS A
Sbjct: 188 DYNYGQAGKAINQDLINNPDLVSTNPTVSFKTAIWFWMTPQGNKPSSHDVIIGRWTPSGA 247
Query: 251 DQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYGNNLDCASQRP 310
D++AGRV GYG +TNIINGGLECG GQD+RV DRIGFY+RYC +LGV G+NLDC +QR
Sbjct: 248 DRSAGRVPGYGVITNIINGGLECGHGQDARVNDRIGFYRRYCQILGVSPGDNLDCNNQRS 307
Query: 311 F 311
F
Sbjct: 308 F 308
>IMGA|AC139745_14.5 Glycoside hydrolase, family 19
chr07_pseudomolecule_IMGAG_V2 28966483-28963252 E
EGN_Mt071002 20080227
Length = 278
Score = 303 bits (776), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 139/239 (58%), Positives = 176/239 (73%), Gaps = 2/239 (0%)
Query: 75 IISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTSTRKREIAAFFGQT 134
+IS++ ++T+ H+DD CPAK FY Y +FI A+K +P FGTTG +TRKREIAAF Q
Sbjct: 39 LISKNLYDTIFLHKDDTACPAKNFYPYQSFIEASKYFPQFGTTGCLATRKREIAAFLAQI 98
Query: 135 SHETTGGWATAPDGPYAWGYCFVREQNP-SAYC-SPSSQWPCASGKQYYGRGPIQITWNY 192
SHETTGGWATAPDGP++WG CF E +P S YC S WPC GK Y GRGPIQ++WNY
Sbjct: 99 SHETTGGWATAPDGPFSWGLCFKEEISPQSNYCDSTDKDWPCFEGKTYKGRGPIQLSWNY 158
Query: 193 NYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDVITGRWSPSSADQ 252
NYG G+A+G D L NP+ V+ ++VI+FKTA+WFWMT + P PSCH+V+ G++ + AD
Sbjct: 159 NYGPAGKALGFDGLRNPEIVSNNSVIAFKTALWFWMTERKPIPSCHNVMVGKYLATKADI 218
Query: 253 AAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYGNNLDCASQRPF 311
AA R +G+G VTNI+NGGLECG D+RV DRIGF++RY L V G NLDC Q+ F
Sbjct: 219 AANRTAGFGLVTNIVNGGLECGIPNDARVNDRIGFFQRYTKLFNVDTGPNLDCGYQKSF 277
>IMGA|AC160516_8.4 Glycoside hydrolase, family 19
chr07_pseudomolecule_IMGAG_V2 4739217-4735456 F
EGN_Mt071002 20080227
Length = 319
Score = 190 bits (482), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 99/248 (39%), Positives = 139/248 (56%), Gaps = 11/248 (4%)
Query: 81 FNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPS--FGTTGDTSTRKREIAAFFGQTSHET 138
F + R+ A GF+ Y +FI+AA + FGTTG+ +T+ EIAAF G +T
Sbjct: 69 FENLFSKRNTPIAHAVGFWDYHSFINAASLFEPLGFGTTGNKTTQMMEIAAFLGHVGSKT 128
Query: 139 TGGWATAPDGPYAWGYCFVREQNPS-AYCSPSSQ--WPCASGKQYYGRGPIQITWNYNYG 195
+ G+ A GP AWG C+ E +P+ YC + +PC G +YYGRG I I WNYNYG
Sbjct: 129 SCGYGVATGGPLAWGLCYNHEMSPAQTYCDDYYKLTYPCTPGAEYYGRGAIPIYWNYNYG 188
Query: 196 QCGRAIGADLLNNPDAVATDAVISFKTAIWFWMT-AQSPKPSCHDVITGRWSPSSADQAA 254
G A+ +LL++P+ + +A ++F+ AIW WMT + +PS HD G W P+ D
Sbjct: 189 AAGEALKVNLLDHPEYIEQNATLAFQAAIWKWMTPIKKSQPSAHDAFVGNWKPTKNDTME 248
Query: 255 GRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGY-----GNNLDCASQR 309
RV G+G NI+ G CG+G + + + Y Y DLLGVG + L CA QR
Sbjct: 249 NRVPGFGATMNILYGEGVCGQGDVDSMNNIVSHYLYYLDLLGVGRERAGTHDVLTCAEQR 308
Query: 310 PFGSNSQL 317
PF N++L
Sbjct: 309 PFNPNTKL 316
>IMGA|AC126778_12.4 Glycoside hydrolase, family 19
chr02_pseudomolecule_IMGAG_V2 8467656-8465381 F
EGN_Mt071002 20080227
Length = 318
Score = 179 bits (453), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 98/243 (40%), Positives = 137/243 (56%), Gaps = 12/243 (4%)
Query: 81 FNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPS--FGTTGDTSTRKREIAAFFGQTSHET 138
F + R+D A GF+ Y +FI+AA Y FGT+G ++E+AAF G +T
Sbjct: 65 FENLFSKRNDPTAHASGFWDYRSFITAAALYQPLGFGTSGGKHGGQKEVAAFLGHVGSKT 124
Query: 139 TGGWATAPDGPYAWGYCFVREQNPSA-YCSPSSQ--WPCASGKQYYGRGPIQITWNYNYG 195
+ G+ A GP+AWG C+ +E +P YC + +PC+ G YYGRG I I WNYNYG
Sbjct: 125 SCGYGVATGGPFAWGLCYNKELSPDKFYCDDYYKLTYPCSPGAAYYGRGAIPIYWNYNYG 184
Query: 196 QCGRAIGADLLNNPDAVATDAVISFKTAIWFWMT-AQSPKPSCHDVITGRWSPSSADQAA 254
+ G A+ DLLN+P+ + +A ++F+ A+W WMT + PS HDV G W P+ D +
Sbjct: 185 KIGEALKVDLLNHPEYIEQNATLAFQAALWKWMTPPEKHIPSPHDVFVGNWKPTKNDTLS 244
Query: 255 GRVSGYGTVTNIINGGLECGKGQDSR-VQDRIGFYKRYCDLLGVGY---GNN--LDCASQ 308
RV G+G N++ G C +G D+ + + I Y Y DLLGVG G N L CA Q
Sbjct: 245 KRVPGFGATINVLYGDQVCDQGSDNEAMSNIISHYLYYLDLLGVGREEAGPNEILSCAEQ 304
Query: 309 RPF 311
F
Sbjct: 305 AAF 307
>IMGA|AC137554_23.4 Glycoside hydrolase, family 19; Chitin-binding,
type 1 chr02_pseudomolecule_IMGAG_V2 23068533-23070500 F
EGN_Mt071002 20080227
Length = 282
Score = 178 bits (452), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 147/309 (47%), Gaps = 53/309 (17%)
Query: 8 LGIAILVSFILVGWCRGEQCGSQAGGALCPGGICCSKYGWCGSTSEYXXX--------XX 59
L IA + ++ + CG C G+CCS+YG+CG+ Y
Sbjct: 16 LAIAFFIMIMVPKNVSAQNCG-------CAEGVCCSQYGYCGNGDAYCGTGCKQGPCYAG 68
Query: 60 XXXXXXXXXXXXXXXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGD 119
I+++D FN ++ D C K FYT AF+ A +Y FG +G
Sbjct: 69 QTPPSLPNNDANVADILTQDFFNRIIDQADSS-CAGKNFYTRAAFLDALNSYNQFGRSGS 127
Query: 120 TSTRKREIAAFFGQTSHETTGGWATAPDGPYAWGYCFVREQN-PSA-YCSP-SSQWPCAS 176
KRE+AA F +HET +C+ E + PS YC +++WPCA
Sbjct: 128 LDDSKREVAAAFAHFTHETGH-------------FCYTEEIDGPSKDYCDEGNTEWPCAP 174
Query: 177 GKQYYGRGPIQITWNYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPS 236
K YYGRGPIQ++WNYNYG GR G D LN+P+ VA D +SFKTA+W+WM +
Sbjct: 175 NKGYYGRGPIQLSWNYNYGPAGRDNGFDGLNSPETVANDPTVSFKTALWYWMN------N 228
Query: 237 CHDVITGRWSPSSADQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLG 296
H VI G+G ING LEC S VQ R+G+Y +YC LG
Sbjct: 229 VHGVIN---------------QGFGATIRAINGRLECDGANPSTVQTRVGYYTQYCSELG 273
Query: 297 VGYGNNLDC 305
V G+NL C
Sbjct: 274 VAPGDNLTC 282
>IMGA|AC137554_43.4 Glycoside hydrolase, family 19; Chitin-binding,
type 1 chr02_pseudomolecule_IMGAG_V2 23073560-23075156 E
EGN_Mt071002 20080227
Length = 263
Score = 164 bits (416), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/295 (32%), Positives = 146/295 (49%), Gaps = 47/295 (15%)
Query: 24 GEQCGSQAGGALCPGGICCSKYGWCGSTSEY----------XXXXXXXXXXXXXXXXXXX 73
G++ S A C G+CCS++G+CG+T Y
Sbjct: 3 GKKLPSIAQNCGCEEGLCCSEHGYCGNTDPYCGTGCKQGPCYAGQISPSTPGPSNDVNVA 62
Query: 74 XIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTSTRKREIAAFFGQ 133
I++++ FN+++ D C K FY+ F+ A +Y FG G KREIAA F
Sbjct: 63 DIVTQEFFNSIIDQAD-SSCAGKNFYSRAVFLDALGSYNQFGRVGSVDDSKREIAAAFAH 121
Query: 134 TSHETTGGWATAPDGPYAWGYCFVREQNPSA--YCSPS-SQWPCASGKQYYGRGPIQITW 190
+HET +C++ E++ ++ YC S +++PCA K YYGRGPIQ++W
Sbjct: 122 FTHETGH-------------FCYIEEKDGASKDYCDESNTEYPCAPNKGYYGRGPIQLSW 168
Query: 191 NYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDVITGRWSPSSA 250
N+NYG G+ G D LN+P+ VA D ++SFKTA+W+WM H+V+ +
Sbjct: 169 NFNYGPAGKDSGFDELNSPETVANDPLVSFKTALWYWMN------HVHNVMNQQ------ 216
Query: 251 DQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYGNNLDC 305
G+G ING LEC + V+ R+ +Y +YC LGV G+ L+C
Sbjct: 217 --------GFGATVRAINGRLECDGVDPNTVKARVDYYTQYCSQLGVAPGDKLNC 263