Miyakogusa Predicted Gene

chr1.CM0017.200.nc

BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= chr1.CM0017.200.nc + phase: 0 
         (322 letters)

Database: Medicago_aa2.0 
           38,834 sequences; 10,231,785 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

IMGA|CT025534_17.4 Glycoside hydrolase, family 19; Chitin-bindin...   488   e-138
IMGA|AC148763_10.4 Glycoside hydrolase, family 19; Chitin-bindin...   381   e-106
IMGA|AC169517_10.5 Glycoside hydrolase, family 19; Chitin-bindin...   381   e-106
IMGA|AC148763_19.4 Glycoside hydrolase, family 19; Chitin-bindin...   358   2e-99
IMGA|AC139745_14.5 Glycoside hydrolase, family 19 chr07_pseudomo...   303   7e-83
IMGA|AC160516_8.4 Glycoside hydrolase, family 19 chr07_pseudomol...   190   7e-49
IMGA|AC126778_12.4 Glycoside hydrolase, family 19 chr02_pseudomo...   179   2e-45
IMGA|AC137554_23.4 Glycoside hydrolase, family 19; Chitin-bindin...   178   3e-45
IMGA|AC137554_43.4 Glycoside hydrolase, family 19; Chitin-bindin...   164   3e-41

>IMGA|CT025534_17.4 Glycoside hydrolase, family 19; Chitin-binding,
           type 1 chr03_pseudomolecule_IMGAG_V2 32708700-32707178 F
           EGN_Mt071002 20080227
          Length = 325

 Score =  488 bits (1255), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 234/312 (75%), Positives = 259/312 (83%), Gaps = 3/312 (0%)

Query: 2   IKMKMRLGIAILVSFILVGWCRGEQCGSQAGGALCPGGICCSKYGWCGSTSEYXXXXXXX 61
           + M++ L +   V  +++G    EQCG QAGGALCPGG+CCSK+GWCGST +Y       
Sbjct: 1   MMMRLALVVTTAVLLVIIGCSFAEQCGKQAGGALCPGGLCCSKFGWCGSTGDYCGDGCQS 60

Query: 62  XXXXXXXXXXXXXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTS 121
                        +ISRDTFN MLKHRDD GC  K  YTYDAFISAAKA+P+F   GDT+
Sbjct: 61  QCSGSSGDLGS--LISRDTFNNMLKHRDDSGCQGKRLYTYDAFISAAKAFPNFANNGDTA 118

Query: 122 TRKREIAAFFGQTSHETTGGWATAPDGPYAWGYCFVREQNPSA-YCSPSSQWPCASGKQY 180
           T+KREIAAF GQTSHETTGGWATAPDGPYAWGYCFVREQNPS+ YC PSS++PCASGKQY
Sbjct: 119 TKKREIAAFLGQTSHETTGGWATAPDGPYAWGYCFVREQNPSSTYCQPSSEFPCASGKQY 178

Query: 181 YGRGPIQITWNYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDV 240
           YGRGPIQI+WNYNYGQCGRAIG DLLNNPD VATD VISFKTA+WFWMT QSPKPSCHDV
Sbjct: 179 YGRGPIQISWNYNYGQCGRAIGVDLLNNPDLVATDPVISFKTALWFWMTPQSPKPSCHDV 238

Query: 241 ITGRWSPSSADQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYG 300
           ITGRWSPSSAD+AAGR+ GYGTVTNIINGGLECG+GQD RVQDRIGFYKRYCD+LGVGYG
Sbjct: 239 ITGRWSPSSADRAAGRLPGYGTVTNIINGGLECGRGQDGRVQDRIGFYKRYCDILGVGYG 298

Query: 301 NNLDCASQRPFG 312
           +NLDC SQRPFG
Sbjct: 299 DNLDCFSQRPFG 310


>IMGA|AC148763_10.4 Glycoside hydrolase, family 19; Chitin-binding,
           type 1 chr08_pseudomolecule_IMGAG_V2 18118056-18115035 E
           EGN_Mt071002 20080227
          Length = 320

 Score =  381 bits (979), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 183/300 (61%), Positives = 221/300 (73%), Gaps = 12/300 (4%)

Query: 23  RGEQCGSQAGGALCPGGICCSKYGWCGSTSEY----------XXXXXXXXXXXXXXXXXX 72
           + EQCGSQA GALCP G+CCSK+G+CG+T +Y                            
Sbjct: 21  KAEQCGSQANGALCPNGLCCSKFGFCGNTDQYCGDGCQSQCKSSPTPNPPTPSTGGGGDV 80

Query: 73  XXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTSTRKREIAAFFG 132
             II    F+ MLK+R+D  C   GFYTYD FI+AA+++  FGTTGD +TRK+E+AAF  
Sbjct: 81  GSIIPSSLFDQMLKYRNDQRCAGHGFYTYDGFIAAARSFNGFGTTGDDATRKKELAAFLA 140

Query: 133 QTSHETTGGWATAPDGPYAWGYCFVREQNPSA-YCSPSSQWPCASGKQYYGRGPIQITWN 191
           QTSHETTGGW +APDGPYAWGYCFV E++    +CSP   WPCA GK+YYGRGPIQ+T N
Sbjct: 141 QTSHETTGGWPSAPDGPYAWGYCFVTEKDAQGDFCSPGD-WPCAPGKRYYGRGPIQLTHN 199

Query: 192 YNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDVITGRWSPSSAD 251
           YNYGQ G+AI  DL+NNPD V+T+  +SFKTAIWFWMT Q+ KPS HDVITGRW+PS+AD
Sbjct: 200 YNYGQAGKAINEDLINNPDLVSTNPTVSFKTAIWFWMTPQANKPSSHDVITGRWTPSAAD 259

Query: 252 QAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYGNNLDCASQRPF 311
            +AGRV GYG +TNIINGGLECG GQD +V+DR+GFY+RYC +LGV  GNNLDC +QRPF
Sbjct: 260 SSAGRVPGYGVITNIINGGLECGHGQDPKVEDRVGFYRRYCQILGVNPGNNLDCNNQRPF 319


>IMGA|AC169517_10.5 Glycoside hydrolase, family 19; Chitin-binding,
           type 1 chr08_pseudomolecule_IMGAG_V2 18103212-18099801 E
           EGN_Mt071002 20080227
          Length = 320

 Score =  381 bits (978), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 186/317 (58%), Positives = 231/317 (72%), Gaps = 13/317 (4%)

Query: 5   KMRLGIAILVSFILVGWCRGEQCGSQAGGALCPGGICCSKYGWCGSTSEY---------X 55
           K+   I  L++F L    + +QCG QA GA+C   +CCS++G+CG+T++Y          
Sbjct: 6   KLSCLILCLLAFFLGS--KAQQCGRQANGAVCANRLCCSQFGYCGNTADYCGAGCQSQCT 63

Query: 56  XXXXXXXXXXXXXXXXXXXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFG 115
                              +IS   F+ MLK+R+D  C A+GFY+YD+FI+AA+++  FG
Sbjct: 64  SNPTPTPTTPTPSGGDVGSLISSSMFDEMLKYRNDPRCAARGFYSYDSFITAARSFNGFG 123

Query: 116 TTGDTSTRKREIAAFFGQTSHETTGGWATAPDGPYAWGYCFVREQN-PSAYCSPSSQWPC 174
           TTGD +TRKRE+AAF GQTSHETTGGW TAPDGPYAWGYCFV E+N PS YCSP + WPC
Sbjct: 124 TTGDENTRKREVAAFLGQTSHETTGGWPTAPDGPYAWGYCFVNERNPPSDYCSPGT-WPC 182

Query: 175 ASGKQYYGRGPIQITWNYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPK 234
           A GK+YYGRGPIQ+T NYNYG  GRAI  DL+NNPD V+++  +SF+TA+WFWMT Q  K
Sbjct: 183 APGKRYYGRGPIQLTHNYNYGPAGRAINQDLINNPDLVSSNPSVSFRTALWFWMTPQGNK 242

Query: 235 PSCHDVITGRWSPSSADQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDL 294
           PS HDVITGRW+PS AD++A RV GYG +TNIINGGLECG+GQD RV+DRIGFYKRYC L
Sbjct: 243 PSSHDVITGRWTPSDADRSARRVPGYGVITNIINGGLECGRGQDPRVEDRIGFYKRYCQL 302

Query: 295 LGVGYGNNLDCASQRPF 311
           L    G+NLDC +QRPF
Sbjct: 303 LRTTTGDNLDCYNQRPF 319


>IMGA|AC148763_19.4 Glycoside hydrolase, family 19; Chitin-binding,
           type 1 chr08_pseudomolecule_IMGAG_V2 18113090-18110769 E
           EGN_Mt071002 20080227
          Length = 309

 Score =  358 bits (919), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 173/301 (57%), Positives = 210/301 (69%), Gaps = 25/301 (8%)

Query: 23  RGEQCGSQAGGALCPGGICCSKYGWCGSTSEY------------XXXXXXXXXXXXXXXX 70
           + EQCGSQA  A+CP G+CCSK+GWCG+T +Y                            
Sbjct: 21  KAEQCGSQANRAVCPNGLCCSKFGWCGTTDQYCGAGCQSQCRSSSTPTPSTPTPGTGGGG 80

Query: 71  XXXXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTSTRKREIAAF 130
               ++    F+ MLK+R+D  CP  GFYTYD FI+A +++  FGTTGD +TRKRE+AAF
Sbjct: 81  DVGRLVPSFLFDQMLKYRNDARCPGHGFYTYDGFIAATRSFNGFGTTGDDTTRKRELAAF 140

Query: 131 FGQTSHETTGGWATAPDGPYAWGYCFVREQNPSAYCSPSSQWPCASGKQYYGRGPIQITW 190
             QTSHETTGGW++APDGPYAWGYCFV E+N             A  K+YYGRGPIQ+T 
Sbjct: 141 LAQTSHETTGGWSSAPDGPYAWGYCFVNERN-------------AQEKRYYGRGPIQLTH 187

Query: 191 NYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDVITGRWSPSSA 250
           +YNYGQ G+AI  DL+NNPD V+T+  +SFKTAIWFWMT Q  KPS HDVI GRW+PS A
Sbjct: 188 DYNYGQAGKAINQDLINNPDLVSTNPTVSFKTAIWFWMTPQGNKPSSHDVIIGRWTPSGA 247

Query: 251 DQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYGNNLDCASQRP 310
           D++AGRV GYG +TNIINGGLECG GQD+RV DRIGFY+RYC +LGV  G+NLDC +QR 
Sbjct: 248 DRSAGRVPGYGVITNIINGGLECGHGQDARVNDRIGFYRRYCQILGVSPGDNLDCNNQRS 307

Query: 311 F 311
           F
Sbjct: 308 F 308


>IMGA|AC139745_14.5 Glycoside hydrolase, family 19
           chr07_pseudomolecule_IMGAG_V2 28966483-28963252 E
           EGN_Mt071002 20080227
          Length = 278

 Score =  303 bits (776), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 139/239 (58%), Positives = 176/239 (73%), Gaps = 2/239 (0%)

Query: 75  IISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTSTRKREIAAFFGQT 134
           +IS++ ++T+  H+DD  CPAK FY Y +FI A+K +P FGTTG  +TRKREIAAF  Q 
Sbjct: 39  LISKNLYDTIFLHKDDTACPAKNFYPYQSFIEASKYFPQFGTTGCLATRKREIAAFLAQI 98

Query: 135 SHETTGGWATAPDGPYAWGYCFVREQNP-SAYC-SPSSQWPCASGKQYYGRGPIQITWNY 192
           SHETTGGWATAPDGP++WG CF  E +P S YC S    WPC  GK Y GRGPIQ++WNY
Sbjct: 99  SHETTGGWATAPDGPFSWGLCFKEEISPQSNYCDSTDKDWPCFEGKTYKGRGPIQLSWNY 158

Query: 193 NYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDVITGRWSPSSADQ 252
           NYG  G+A+G D L NP+ V+ ++VI+FKTA+WFWMT + P PSCH+V+ G++  + AD 
Sbjct: 159 NYGPAGKALGFDGLRNPEIVSNNSVIAFKTALWFWMTERKPIPSCHNVMVGKYLATKADI 218

Query: 253 AAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYGNNLDCASQRPF 311
           AA R +G+G VTNI+NGGLECG   D+RV DRIGF++RY  L  V  G NLDC  Q+ F
Sbjct: 219 AANRTAGFGLVTNIVNGGLECGIPNDARVNDRIGFFQRYTKLFNVDTGPNLDCGYQKSF 277


>IMGA|AC160516_8.4 Glycoside hydrolase, family 19
           chr07_pseudomolecule_IMGAG_V2 4739217-4735456 F
           EGN_Mt071002 20080227
          Length = 319

 Score =  190 bits (482), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 99/248 (39%), Positives = 139/248 (56%), Gaps = 11/248 (4%)

Query: 81  FNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPS--FGTTGDTSTRKREIAAFFGQTSHET 138
           F  +   R+     A GF+ Y +FI+AA  +    FGTTG+ +T+  EIAAF G    +T
Sbjct: 69  FENLFSKRNTPIAHAVGFWDYHSFINAASLFEPLGFGTTGNKTTQMMEIAAFLGHVGSKT 128

Query: 139 TGGWATAPDGPYAWGYCFVREQNPS-AYCSPSSQ--WPCASGKQYYGRGPIQITWNYNYG 195
           + G+  A  GP AWG C+  E +P+  YC    +  +PC  G +YYGRG I I WNYNYG
Sbjct: 129 SCGYGVATGGPLAWGLCYNHEMSPAQTYCDDYYKLTYPCTPGAEYYGRGAIPIYWNYNYG 188

Query: 196 QCGRAIGADLLNNPDAVATDAVISFKTAIWFWMT-AQSPKPSCHDVITGRWSPSSADQAA 254
             G A+  +LL++P+ +  +A ++F+ AIW WMT  +  +PS HD   G W P+  D   
Sbjct: 189 AAGEALKVNLLDHPEYIEQNATLAFQAAIWKWMTPIKKSQPSAHDAFVGNWKPTKNDTME 248

Query: 255 GRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGY-----GNNLDCASQR 309
            RV G+G   NI+ G   CG+G    + + +  Y  Y DLLGVG       + L CA QR
Sbjct: 249 NRVPGFGATMNILYGEGVCGQGDVDSMNNIVSHYLYYLDLLGVGRERAGTHDVLTCAEQR 308

Query: 310 PFGSNSQL 317
           PF  N++L
Sbjct: 309 PFNPNTKL 316


>IMGA|AC126778_12.4 Glycoside hydrolase, family 19
           chr02_pseudomolecule_IMGAG_V2 8467656-8465381 F
           EGN_Mt071002 20080227
          Length = 318

 Score =  179 bits (453), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 98/243 (40%), Positives = 137/243 (56%), Gaps = 12/243 (4%)

Query: 81  FNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPS--FGTTGDTSTRKREIAAFFGQTSHET 138
           F  +   R+D    A GF+ Y +FI+AA  Y    FGT+G     ++E+AAF G    +T
Sbjct: 65  FENLFSKRNDPTAHASGFWDYRSFITAAALYQPLGFGTSGGKHGGQKEVAAFLGHVGSKT 124

Query: 139 TGGWATAPDGPYAWGYCFVREQNPSA-YCSPSSQ--WPCASGKQYYGRGPIQITWNYNYG 195
           + G+  A  GP+AWG C+ +E +P   YC    +  +PC+ G  YYGRG I I WNYNYG
Sbjct: 125 SCGYGVATGGPFAWGLCYNKELSPDKFYCDDYYKLTYPCSPGAAYYGRGAIPIYWNYNYG 184

Query: 196 QCGRAIGADLLNNPDAVATDAVISFKTAIWFWMT-AQSPKPSCHDVITGRWSPSSADQAA 254
           + G A+  DLLN+P+ +  +A ++F+ A+W WMT  +   PS HDV  G W P+  D  +
Sbjct: 185 KIGEALKVDLLNHPEYIEQNATLAFQAALWKWMTPPEKHIPSPHDVFVGNWKPTKNDTLS 244

Query: 255 GRVSGYGTVTNIINGGLECGKGQDSR-VQDRIGFYKRYCDLLGVGY---GNN--LDCASQ 308
            RV G+G   N++ G   C +G D+  + + I  Y  Y DLLGVG    G N  L CA Q
Sbjct: 245 KRVPGFGATINVLYGDQVCDQGSDNEAMSNIISHYLYYLDLLGVGREEAGPNEILSCAEQ 304

Query: 309 RPF 311
             F
Sbjct: 305 AAF 307


>IMGA|AC137554_23.4 Glycoside hydrolase, family 19; Chitin-binding,
           type 1 chr02_pseudomolecule_IMGAG_V2 23068533-23070500 F
           EGN_Mt071002 20080227
          Length = 282

 Score =  178 bits (452), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 110/309 (35%), Positives = 147/309 (47%), Gaps = 53/309 (17%)

Query: 8   LGIAILVSFILVGWCRGEQCGSQAGGALCPGGICCSKYGWCGSTSEYXXX--------XX 59
           L IA  +  ++      + CG       C  G+CCS+YG+CG+   Y             
Sbjct: 16  LAIAFFIMIMVPKNVSAQNCG-------CAEGVCCSQYGYCGNGDAYCGTGCKQGPCYAG 68

Query: 60  XXXXXXXXXXXXXXXIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGD 119
                          I+++D FN ++   D   C  K FYT  AF+ A  +Y  FG +G 
Sbjct: 69  QTPPSLPNNDANVADILTQDFFNRIIDQADSS-CAGKNFYTRAAFLDALNSYNQFGRSGS 127

Query: 120 TSTRKREIAAFFGQTSHETTGGWATAPDGPYAWGYCFVREQN-PSA-YCSP-SSQWPCAS 176
               KRE+AA F   +HET               +C+  E + PS  YC   +++WPCA 
Sbjct: 128 LDDSKREVAAAFAHFTHETGH-------------FCYTEEIDGPSKDYCDEGNTEWPCAP 174

Query: 177 GKQYYGRGPIQITWNYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPS 236
            K YYGRGPIQ++WNYNYG  GR  G D LN+P+ VA D  +SFKTA+W+WM       +
Sbjct: 175 NKGYYGRGPIQLSWNYNYGPAGRDNGFDGLNSPETVANDPTVSFKTALWYWMN------N 228

Query: 237 CHDVITGRWSPSSADQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLG 296
            H VI                 G+G     ING LEC     S VQ R+G+Y +YC  LG
Sbjct: 229 VHGVIN---------------QGFGATIRAINGRLECDGANPSTVQTRVGYYTQYCSELG 273

Query: 297 VGYGNNLDC 305
           V  G+NL C
Sbjct: 274 VAPGDNLTC 282


>IMGA|AC137554_43.4 Glycoside hydrolase, family 19; Chitin-binding,
           type 1 chr02_pseudomolecule_IMGAG_V2 23073560-23075156 E
           EGN_Mt071002 20080227
          Length = 263

 Score =  164 bits (416), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 97/295 (32%), Positives = 146/295 (49%), Gaps = 47/295 (15%)

Query: 24  GEQCGSQAGGALCPGGICCSKYGWCGSTSEY----------XXXXXXXXXXXXXXXXXXX 73
           G++  S A    C  G+CCS++G+CG+T  Y                             
Sbjct: 3   GKKLPSIAQNCGCEEGLCCSEHGYCGNTDPYCGTGCKQGPCYAGQISPSTPGPSNDVNVA 62

Query: 74  XIISRDTFNTMLKHRDDGGCPAKGFYTYDAFISAAKAYPSFGTTGDTSTRKREIAAFFGQ 133
            I++++ FN+++   D   C  K FY+   F+ A  +Y  FG  G     KREIAA F  
Sbjct: 63  DIVTQEFFNSIIDQAD-SSCAGKNFYSRAVFLDALGSYNQFGRVGSVDDSKREIAAAFAH 121

Query: 134 TSHETTGGWATAPDGPYAWGYCFVREQNPSA--YCSPS-SQWPCASGKQYYGRGPIQITW 190
            +HET               +C++ E++ ++  YC  S +++PCA  K YYGRGPIQ++W
Sbjct: 122 FTHETGH-------------FCYIEEKDGASKDYCDESNTEYPCAPNKGYYGRGPIQLSW 168

Query: 191 NYNYGQCGRAIGADLLNNPDAVATDAVISFKTAIWFWMTAQSPKPSCHDVITGRWSPSSA 250
           N+NYG  G+  G D LN+P+ VA D ++SFKTA+W+WM         H+V+  +      
Sbjct: 169 NFNYGPAGKDSGFDELNSPETVANDPLVSFKTALWYWMN------HVHNVMNQQ------ 216

Query: 251 DQAAGRVSGYGTVTNIINGGLECGKGQDSRVQDRIGFYKRYCDLLGVGYGNNLDC 305
                   G+G     ING LEC     + V+ R+ +Y +YC  LGV  G+ L+C
Sbjct: 217 --------GFGATVRAINGRLECDGVDPNTVKARVDYYTQYCSQLGVAPGDKLNC 263
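
The per-hit Score, Expect and Identities lines in the report above follow a regular layout, so they can be collected with a short script. What follows is a minimal Python sketch, not part of the BLAST distribution: the names parse_hits, parse_evalue and SAMPLE are illustrative assumptions. It assumes the legacy BLASTP 2.2.18 text layout shown here with one alignment block per hit, and it reads an E-value printed as "e-138" as 1e-138.

```python
import re

# Patterns for the legacy BLAST text layout shown in this report.
HEADER_RE = re.compile(r"^>(\S+)")
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s*=\s*([\d.eE+-]+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_evalue(text):
    """Convert a BLAST E-value string to float; 'e-138' is read as 1e-138."""
    return float("1" + text) if text.startswith("e") else float(text)


def parse_hits(report):
    """Return one dict per '>' subject header found in the report text.

    Assumes a single alignment block per hit; if a hit had several blocks,
    later Score/Identities lines would overwrite earlier ones.
    """
    hits = []
    current = None
    for line in report.splitlines():
        m = HEADER_RE.match(line)
        if m:
            current = {"subject": m.group(1)}
            hits.append(current)
            continue
        if current is None:
            continue  # still in the report preamble
        m = SCORE_RE.search(line)
        if m:
            current["bits"] = float(m.group(1))
            current["raw_score"] = int(m.group(2))
            current["evalue"] = parse_evalue(m.group(3))
            continue
        m = IDENT_RE.search(line)
        if m:
            current["identical"] = int(m.group(1))
            current["aligned"] = int(m.group(2))
    return hits


# Two hit blocks copied from the report above, trimmed to the parsed lines.
SAMPLE = """>IMGA|CT025534_17.4 Glycoside hydrolase, family 19; Chitin-binding,
          Length = 325

 Score =  488 bits (1255), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 234/312 (75%), Positives = 259/312 (83%), Gaps = 3/312 (0%)

>IMGA|AC148763_19.4 Glycoside hydrolase, family 19; Chitin-binding,
          Length = 309

 Score =  358 bits (919), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 173/301 (57%), Positives = 210/301 (69%), Gaps = 25/301 (8%)
"""

hits = parse_hits(SAMPLE)
for hit in hits:
    fraction = hit["identical"] / hit["aligned"]
    print(hit["subject"], hit["bits"], hit["evalue"], f"{fraction:.2f}")
```

Applied to the full report text, the same function would return one entry per subject sequence, which can then be filtered by E-value or by the fraction of identical positions.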