Miyakogusa Predicted Gene

Lj0g3v0285999.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0285999.1 Non Chatacterized Hit- tr|I1LKM8|I1LKM8_SOYBN
Uncharacterized protein (Fragment) OS=Glycine max GN=G,73.39,0,no
description,Glycoside hydrolase, catalytic domain; no
description,NULL; SUBFAMILY NOT NAMED,NULL;,CUFF.19073.1
         (551 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G13130.1 | Symbols:  | Cellulase (glycosyl hydrolase family 5...   598   e-171
AT3G26130.1 | Symbols:  | Cellulase (glycosyl hydrolase family 5...   583   e-167
AT3G26140.1 | Symbols:  | Cellulase (glycosyl hydrolase family 5...   580   e-166
AT5G17500.1 | Symbols:  | Glycosyl hydrolase superfamily protein...   525   e-149
AT5G16700.1 | Symbols:  | Glycosyl hydrolase superfamily protein...   438   e-123

>AT1G13130.1 | Symbols:  | Cellulase (glycosyl hydrolase family 5)
           protein | chr1:4474726-4477820 FORWARD LENGTH=552
          Length = 552

 Score =  598 bits (1542), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 291/517 (56%), Positives = 364/517 (70%), Gaps = 13/517 (2%)

Query: 41  PVGALPLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGIKSK 100
           P  + PLSTS RWIV+E       RVKL C NW SHL  +V EGLS+QPVD ++K I   
Sbjct: 29  PNMSYPLSTSSRWIVDENGL----RVKLVCANWPSHLQPVVAEGLSKQPVDAVAKKIVEM 84

Query: 101 GFNCVRLTWSLSLLTNDSL----TVRESFQNLGLLQSISGMQANNPSFIDLPLIKALQAV 156
           GFNCVRLTW L L+TN++L    TVR+SFQ+LGL   I G Q NNPS IDLPLI+A + V
Sbjct: 85  GFNCVRLTWPLDLMTNETLANNVTVRQSFQSLGLNDDIVGFQTNNPSIIDLPLIEAYKTV 144

Query: 157 VKSLGDNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGVTNV 216
           V +LG+NDVMVILDNH+T+  WCC+N DGNGFFGDQ FDP +W+  L KMA  FNGV+NV
Sbjct: 145 VTTLGNNDVMVILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKKMAATFNGVSNV 204

Query: 217 VGMSLRNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQPVQL 276
           VGMSLRNELRGPKQNV DW++YM +GAE VH+AN  VLVILSGL+FD DLSF+  +PV+L
Sbjct: 205 VGMSLRNELRGPKQNVNDWFKYMQQGAEAVHSANNKVLVILSGLSFDADLSFVRSRPVKL 264

Query: 277 TFNKKLVFAAHWYSFSNTQAWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEFGVDM 336
           +F  KLVF  HWYSFS+  +W   +PN  CG+V   +    G+LL QG+PLFLSEFG+D 
Sbjct: 265 SFTGKLVFELHWYSFSDGNSWAANNPNDICGRVLNRIGNGGGYLLNQGFPLFLSEFGIDE 324

Query: 337 RGTNVNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQVRSE 396
           RG N NDNR+  C    AAE D DW+LW L GSYY  +G VGM E+YG+L+ DW  VR+ 
Sbjct: 325 RGVNTNDNRYFGCLTGWAAENDVDWSLWALTGSYYLRQGKVGMNEYYGVLDSDWISVRNS 384

Query: 397 SFLQRISAVQLPFQGPS-EAKPYKVIFHPLSGLCVLKNAQD--LLILGSCSNSDIWEYTE 453
           SFLQ+IS +Q P QGP      Y ++FHPL+GLC++++  D  +L LG C++S+ W YT 
Sbjct: 385 SFLQKISFLQSPLQGPGPRTDAYNLVFHPLTGLCIVRSLDDPKMLTLGPCNSSEPWSYT- 443

Query: 454 QKILSMKGTDFCLQAADGEGKQVKLGKEWSTPNSAWEMISDSNMQLSSKLNNGTSSVCLD 513
           +K L +K    CLQ+   +          ST  S W+ IS S M L+S  +N T S+CLD
Sbjct: 444 KKALRIKDQQLCLQSNGPKNPVTMTRTSCSTSGSKWQTISASRMHLASTTSNKT-SLCLD 502

Query: 514 VDADNAIVTNACKCLNNDKTCDPASQWFKLVDSTRKL 550
           VD  N +V NACKCL+ DK+C+P SQWFK++ +TR L
Sbjct: 503 VDTANNVVANACKCLSKDKSCEPMSQWFKIIKATRPL 539


>AT3G26130.1 | Symbols:  | Cellulase (glycosyl hydrolase family 5)
           protein | chr3:9553708-9555611 REVERSE LENGTH=551
          Length = 551

 Score =  583 bits (1504), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 286/514 (55%), Positives = 372/514 (72%), Gaps = 14/514 (2%)

Query: 44  ALPLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGIKSKGFN 103
           A P ST  RWIV++       RVKL CVNW SHL+  V EGLS+QP+D I++ I S GFN
Sbjct: 20  AFPPSTDSRWIVDDGNKG--RRVKLTCVNWPSHLETAVAEGLSKQPLDAIAEKIVSMGFN 77

Query: 104 CVRLTWSLSLLTNDS----LTVRESFQNLGLLQSISGMQANNPSFIDLPLIKALQAVVKS 159
           CVRLTW L L T++S    +TVR+S +   L +++SG Q +NP+ +DLPLIKA Q VV  
Sbjct: 78  CVRLTWPLYLATDESFSAFMTVRQSLRKFRLFEAVSGFQTHNPTILDLPLIKAFQEVVYC 137

Query: 160 LGDNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGVT-NVVG 218
           L  + VMVILDNHI+Q  WCCS+ DGNGFFGD+H +P +WI GL KMA++F  V+ NVVG
Sbjct: 138 LEKHRVMVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKGLKKMASMFANVSSNVVG 197

Query: 219 MSLRNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQPVQLTF 278
           MSLRNELRGPKQN+ DWY+YM +GAE VH+ NPNVLVI+SGLN+ TDLSFL ++P +++F
Sbjct: 198 MSLRNELRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIVSGLNYATDLSFLRERPFEVSF 257

Query: 279 NKKLVFAAHWYSFSNTQAWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEFGVDMRG 338
            +K+VF  HWY F NT  W   + N+ CG+ T  MM+MSGFLL++G PLF+SEFG+D RG
Sbjct: 258 RRKVVFEIHWYGFWNT--WEGDNLNKICGKETEKMMKMSGFLLEKGIPLFVSEFGIDQRG 315

Query: 339 TNVNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQVRSESF 398
            N NDN+FL+CFMALAA+ D DW+LWTLAGSYY     +G +E YG+L+ +WS +R+ + 
Sbjct: 316 NNANDNKFLSCFMALAADRDLDWSLWTLAGSYYIREKSIGSDESYGVLDFNWSSIRNSTI 375

Query: 399 LQRISAVQLPFQGPSEAKPYKVIFHPLSGLCVLKNAQDLLILGSCSNSDIWEYTEQKILS 458
           LQ ISA+Q PF G  E +P K++FHP +GLC+++ +   L LGSC+ S+ W  +  ++LS
Sbjct: 376 LQMISAIQTPFIGLMETQPKKIMFHPSTGLCIVRKSLFQLKLGSCNRSESWRLSSHRVLS 435

Query: 459 MKGTD-FCLQAADGEGKQVKLGKEWSTPN-SAWEMISDSNMQLSSKLNNGTSSVCLDVDA 516
           +      CL+A + +GK VKL   +S    S W++ SDS MQLSS   NG  SVCLDVD 
Sbjct: 436 LAEEQILCLKAYE-KGKSVKLRLFFSESYCSKWKLFSDSKMQLSSITKNGF-SVCLDVDT 493

Query: 517 D-NAIVTNACKCLNNDKTCDPASQWFKLVDSTRK 549
           + N IVTN+CKCL  + +CDP SQWFKLV STR+
Sbjct: 494 ENNNIVTNSCKCLRGNSSCDPRSQWFKLVTSTRR 527


>AT3G26140.1 | Symbols:  | Cellulase (glycosyl hydrolase family 5)
           protein | chr3:9559742-9563070 REVERSE LENGTH=508
          Length = 508

 Score =  580 bits (1495), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 288/515 (55%), Positives = 372/515 (72%), Gaps = 18/515 (3%)

Query: 44  ALPLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGIKSKGFN 103
           A PLST+ RWI++E      +RVKLACVNW SHL  +V EGLS+Q VD ++K I + GFN
Sbjct: 2   AYPLSTNSRWIIDEKG----QRVKLACVNWPSHLQPVVAEGLSKQSVDDLAKKIMAMGFN 57

Query: 104 CVRLTWSLSLLTNDSL----TVRESFQNLGLLQSISGMQANNPSFIDLPLIKALQAVVKS 159
           CVR TW L L TN++L    TVR+SFQ+LGL   ISG +  NPS IDLPLI+A + VV  
Sbjct: 58  CVRFTWPLDLATNETLANNVTVRQSFQSLGLNDDISGFETKNPSMIDLPLIEAYKKVVAK 117

Query: 160 LGDNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGVTNVVGM 219
           LG+N+VMVILDNH+T+  WCC   DGNGFFGD  FDP  WI GLTK+A  F G TNVVGM
Sbjct: 118 LGNNNVMVILDNHVTKPGWCCGYNDGNGFFGDTFFDPTTWIAGLTKIAMTFKGATNVVGM 177

Query: 220 SLRNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQPVQLTFN 279
           SLRNELRGPKQNV DW++YM +GAE VH ANPNVLVILSGL++DTDLSF+  + V LTF 
Sbjct: 178 SLRNELRGPKQNVDDWFKYMQQGAEAVHEANPNVLVILSGLSYDTDLSFVRSRHVNLTFT 237

Query: 280 KKLVFAAHWYSFSNTQAWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEFGVDMRGT 339
           +KLVF  H YSF+NT  W++ +PN+ACG++ +++    GF L+  +P+FLSEFG+D+RG 
Sbjct: 238 RKLVFELHRYSFTNTNTWSSKNPNEACGEILKSIENGGGFNLRD-FPVFLSEFGIDLRGK 296

Query: 340 NVNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQVRSESFL 399
           NVNDNR++ C +  AAE D DW++WTL GSYY   G+VGM EFYG+L+ DW +VRS+SFL
Sbjct: 297 NVNDNRYIGCILGWAAENDVDWSIWTLQGSYYLREGVVGMSEFYGILDSDWVRVRSQSFL 356

Query: 400 QRISAVQLPFQGP-SEAKPYKVIFHPLSGLCVLKNAQD--LLILGSCSNSDIWEYTEQKI 456
           QR+S +  P QGP S++K Y ++FHPL+GLC+L++  D   + LG C+ S  W YT Q  
Sbjct: 357 QRLSLILSPLQGPGSQSKVYNLVFHPLTGLCMLQSILDPTKVTLGLCNESQPWSYTPQNT 416

Query: 457 LSMKGTDFCLQAADGEGKQVKLGK-EWSTPN-SAWEMISDSNMQLSSKLNNGTSSVCLDV 514
           L++K    CL++  G    VKL +   S+PN S WE IS SNM L++K  N  +S+CLDV
Sbjct: 417 LTLKDKSLCLEST-GPNAPVKLSETSCSSPNLSEWETISASNMLLAAKSTN--NSLCLDV 473

Query: 515 DADNAIVTNACKCLN-NDKTCDPASQWFKLVDSTR 548
           D  N ++ + CKC+   D +CDP SQWFK+V  ++
Sbjct: 474 DETNNLMASNCKCVKGEDSSCDPISQWFKIVKVSK 508


>AT5G17500.1 | Symbols:  | Glycosyl hydrolase superfamily protein |
           chr5:5767379-5769719 FORWARD LENGTH=526
          Length = 526

 Score =  525 bits (1353), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 263/508 (51%), Positives = 334/508 (65%), Gaps = 19/508 (3%)

Query: 46  PLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGIKSKGFNCV 105
           PL T  RWIVN        RVKLAC NW SHL  +V EGLS QP+D ISK IK  GFNCV
Sbjct: 27  PLFTKSRWIVNNKG----HRVKLACANWPSHLKPVVAEGLSSQPMDSISKKIKDMGFNCV 82

Query: 106 RLTWSLSLLTNDSL----TVRESFQNLGLLQSISGMQANNPSFIDLPLIKALQAVVKSLG 161
           RLTW L L+ ND+L    TV++SF+  GL   + G+  +NP  ++ PLI   QAVV SLG
Sbjct: 83  RLTWPLELMINDTLAFNVTVKQSFERYGLDHELQGIYTHNPYIVNTPLINVFQAVVYSLG 142

Query: 162 DNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGVTNVVGMSL 221
            +DVMVILDNH T   WCCSN D + FFGD  F+P+LW++GL KMAT+F  V NVVGMSL
Sbjct: 143 RHDVMVILDNHKTVPGWCCSNDDPDAFFGDPKFNPDLWMLGLKKMATIFMNVKNVVGMSL 202

Query: 222 RNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQPVQLTFNKK 281
           RNELRG      DWY+YM KGAE VH +NPNVLVILSGLNFD DLSFL  +PV L+F KK
Sbjct: 203 RNELRGYNHTSKDWYKYMQKGAEAVHTSNPNVLVILSGLNFDADLSFLKDRPVNLSFKKK 262

Query: 282 LVFAAHWYSFSN-TQAWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEFGVDMRGTN 340
           LV   HWYSF++ T  W + + N  C Q+     R  GF+L QG+PLFLSEFG D RG +
Sbjct: 263 LVLELHWYSFTDGTGQWKSHNVNDFCSQMFSKERRTGGFVLDQGFPLFLSEFGTDQRGGD 322

Query: 341 VNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQVRSESFLQ 400
           +  NR++NC +A AAE D DWA+W + G YY   G  G+ E YG+L+ +W  V + ++L+
Sbjct: 323 LEGNRYMNCMLAWAAEKDLDWAVWAVTGVYYFREGKRGVVEAYGMLDANWHNVHNYTYLR 382

Query: 401 RISAVQLPFQGPS-EAKPYKVIFHPLSGLCVLKNA---QDLLILGSCSNSDIWEYTEQKI 456
           R+S +Q P  GP  +   +K IFHPL+GLC+++ +   +  L LG C+  + W Y+   I
Sbjct: 383 RLSVIQPPHTGPGVKHNHHKKIFHPLTGLCLVRKSHCHESELTLGPCTKDEPWSYSHGGI 442

Query: 457 LSM-KGTDFCLQAADGEGKQVKLGKEWSTPNSAWEMISDSNMQLSSKLNNGTSSVCLDVD 515
           L + +G   CL+     GK VKLG+      +  E IS + M LS   ++G S VCLDVD
Sbjct: 443 LEIRRGHKSCLEGETAVGKSVKLGR----ICTKIEQISATKMHLSFNTSDG-SLVCLDVD 497

Query: 516 ADNAIVTNACKCLNNDKTCDPASQWFKL 543
           +DN +V N+C CL  D TC+PASQWFK+
Sbjct: 498 SDNNVVANSCNCLTGDTTCEPASQWFKI 525


>AT5G16700.1 | Symbols:  | Glycosyl hydrolase superfamily protein |
           chr5:5480763-5483045 FORWARD LENGTH=488
          Length = 488

 Score =  438 bits (1126), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 242/517 (46%), Positives = 315/517 (60%), Gaps = 55/517 (10%)

Query: 38  TIKPVGALPLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGI 97
           T K   + PLST  RWIV+E      +RVKLACVNW +HL   V EGLS+QP+D ISK I
Sbjct: 17  TSKLTTSYPLSTKSRWIVDEKG----QRVKLACVNWPAHLQPTVAEGLSKQPLDSISKKI 72

Query: 98  KSKGFNCVRLTWSLSLLTNDSL----TVRESFQNLGLLQSISGMQANNPSFIDLPLIKAL 153
            S GFNCVRLTW L L+TND+L    TV++SF++L L + + G+Q +NP  + LPL  A 
Sbjct: 73  VSMGFNCVRLTWPLDLVTNDTLALKVTVKQSFESLKLFEDVLGIQTHNPKLLHLPLFNAF 132

Query: 154 QAVVKSLGDNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGV 213
           Q VV +LG+N VMVILDNH+T   WCC + D + FFG  HFDP +W  GL KMATLF   
Sbjct: 133 QEVVSNLGENGVMVILDNHLTTPGWCCGDNDLDAFFGYPHFDPLVWAKGLRKMATLFRNF 192

Query: 214 TNVVGMSLRNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQP 273
           T+V+GMSLRNE RG +     W+R+M +GAE VHAANP +LVILSG++FDT+LSFL  + 
Sbjct: 193 THVIGMSLRNEPRGARDYPDLWFRHMPQGAEAVHAANPKLLVILSGIDFDTNLSFLRDRS 252

Query: 274 VQLTFNKKLVFAAHWYSFSNTQ-AWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEF 332
           V ++F  KLVF  HWYSFS+ + +W   + N  C ++   +    GFLL +G+PL LSEF
Sbjct: 253 VNVSFTDKLVFELHWYSFSDGRDSWRKHNSNDFCVKIIEKVTHNGGFLLGRGFPLILSEF 312

Query: 333 GVDMRGTNVNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQ 392
           G D RG +++ NR++NC +A AAE D DWA+W L G YY                     
Sbjct: 313 GTDQRGGDMSGNRYMNCLVAWAAENDLDWAVWALTGDYY--------------------- 351

Query: 393 VRSESFLQRISAVQLPFQGPSEAKPYKVIFHPLSGLCVLKNAQD---LLILGSCSNSDIW 449
           +R+               GP       ++FHP +GLCV  N  D    L LG C  SD W
Sbjct: 352 LRT---------------GPGLRPNKNLLFHPSTGLCVTNNPSDNIPTLRLGPCPKSDPW 396

Query: 450 EYT-EQKILSMKGTDFCLQAADGEGKQVKLGKEWSTPNSAWEMISDSNMQLSSKLNNGTS 508
            +   + IL +     C++A +  G++VKLG    T  S    IS + M LS K +NG  
Sbjct: 397 TFNPSEGILWI--NKMCVEAPNVVGQKVKLGV--GTKCSKLGQISATKMHLSFKTSNGL- 451

Query: 509 SVCLDVDA-DNAIVTNACKCLNNDKTCDPASQWFKLV 544
            +CLDVD  DN++V N CK L  D +CDPASQWFK++
Sbjct: 452 LLCLDVDERDNSVVANRCKFLTMDASCDPASQWFKVL 488