Miyakogusa Predicted Gene
- Lj0g3v0285999.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0285999.1 Non Characterized Hit- tr|I1LKM8|I1LKM8_SOYBN
Uncharacterized protein (Fragment) OS=Glycine max GN=G,73.39,0,no
description,Glycoside hydrolase, catalytic domain; no
description,NULL; SUBFAMILY NOT NAMED,NULL;,CUFF.19073.1
(551 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
AT1G13130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5...   598   e-171
AT3G26130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5...   583   e-167
AT3G26140.1 | Symbols: | Cellulase (glycosyl hydrolase family 5...   580   e-166
AT5G17500.1 | Symbols: | Glycosyl hydrolase superfamily protein...   525   e-149
AT5G16700.1 | Symbols: | Glycosyl hydrolase superfamily protein...   438   e-123
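
The hit summary above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output: the regular expression, the helper names (`parse_evalue`, `parse_hits`) and the dictionary layout are illustrative assumptions. It relies only on each row ending in a bit score followed by an E-value, and on legacy BLAST abbreviating 1e-171 as "e-171".

```python
import re

# Hit-summary rows copied from the table above.
SUMMARY = """\
AT1G13130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5... 598 e-171
AT3G26130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5... 583 e-167
AT3G26140.1 | Symbols: | Cellulase (glycosyl hydrolase family 5... 580 e-166
AT5G17500.1 | Symbols: | Glycosyl hydrolase superfamily protein... 525 e-149
AT5G16700.1 | Symbols: | Glycosyl hydrolase superfamily protein... 438 e-123
"""

# Assumed row layout: identifier, free-text description, bit score, E-value.
ROW_RE = re.compile(r"^(\S+)\s+\|\s+(.*?)\s+(\d+)\s+(\S+)$")


def parse_evalue(text):
    """Convert a BLAST E-value string to float; 'e-171' is read as 1e-171."""
    return float("1" + text) if text.startswith("e") else float(text)


def parse_hits(block):
    """Return one dict per summary row that matches ROW_RE."""
    hits = []
    for line in block.splitlines():
        match = ROW_RE.match(line.strip())
        if match:
            hits.append({
                "id": match.group(1),
                "description": match.group(2),
                "bits": int(match.group(3)),
                "evalue": parse_evalue(match.group(4)),
            })
    return hits


hits = parse_hits(SUMMARY)
for hit in hits:
    print(hit["id"], hit["bits"], hit["evalue"])
```

Because the pattern uses `\s+` between fields, it should accept the rows whether the columns are padded or separated by single spaces.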
>AT1G13130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5)
protein | chr1:4474726-4477820 FORWARD LENGTH=552
Length = 552
Score = 598 bits (1542), Expect = e-171, Method: Compositional matrix adjust.
Identities = 291/517 (56%), Positives = 364/517 (70%), Gaps = 13/517 (2%)
Query: 41 PVGALPLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGIKSK 100
P + PLSTS RWIV+E RVKL C NW SHL +V EGLS+QPVD ++K I
Sbjct: 29 PNMSYPLSTSSRWIVDENGL----RVKLVCANWPSHLQPVVAEGLSKQPVDAVAKKIVEM 84
Query: 101 GFNCVRLTWSLSLLTNDSL----TVRESFQNLGLLQSISGMQANNPSFIDLPLIKALQAV 156
GFNCVRLTW L L+TN++L TVR+SFQ+LGL I G Q NNPS IDLPLI+A + V
Sbjct: 85 GFNCVRLTWPLDLMTNETLANNVTVRQSFQSLGLNDDIVGFQTNNPSIIDLPLIEAYKTV 144
Query: 157 VKSLGDNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGVTNV 216
V +LG+NDVMVILDNH+T+ WCC+N DGNGFFGDQ FDP +W+ L KMA FNGV+NV
Sbjct: 145 VTTLGNNDVMVILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKKMAATFNGVSNV 204
Query: 217 VGMSLRNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQPVQL 276
VGMSLRNELRGPKQNV DW++YM +GAE VH+AN VLVILSGL+FD DLSF+ +PV+L
Sbjct: 205 VGMSLRNELRGPKQNVNDWFKYMQQGAEAVHSANNKVLVILSGLSFDADLSFVRSRPVKL 264
Query: 277 TFNKKLVFAAHWYSFSNTQAWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEFGVDM 336
+F KLVF HWYSFS+ +W +PN CG+V + G+LL QG+PLFLSEFG+D
Sbjct: 265 SFTGKLVFELHWYSFSDGNSWAANNPNDICGRVLNRIGNGGGYLLNQGFPLFLSEFGIDE 324
Query: 337 RGTNVNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQVRSE 396
RG N NDNR+ C AAE D DW+LW L GSYY +G VGM E+YG+L+ DW VR+
Sbjct: 325 RGVNTNDNRYFGCLTGWAAENDVDWSLWALTGSYYLRQGKVGMNEYYGVLDSDWISVRNS 384
Query: 397 SFLQRISAVQLPFQGPS-EAKPYKVIFHPLSGLCVLKNAQD--LLILGSCSNSDIWEYTE 453
SFLQ+IS +Q P QGP Y ++FHPL+GLC++++ D +L LG C++S+ W YT
Sbjct: 385 SFLQKISFLQSPLQGPGPRTDAYNLVFHPLTGLCIVRSLDDPKMLTLGPCNSSEPWSYT- 443
Query: 454 QKILSMKGTDFCLQAADGEGKQVKLGKEWSTPNSAWEMISDSNMQLSSKLNNGTSSVCLD 513
+K L +K CLQ+ + ST S W+ IS S M L+S +N T S+CLD
Sbjct: 444 KKALRIKDQQLCLQSNGPKNPVTMTRTSCSTSGSKWQTISASRMHLASTTSNKT-SLCLD 502
Query: 514 VDADNAIVTNACKCLNNDKTCDPASQWFKLVDSTRKL 550
VD N +V NACKCL+ DK+C+P SQWFK++ +TR L
Sbjct: 503 VDTANNVVANACKCLSKDKSCEPMSQWFKIIKATRPL 539
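
As a rough plausibility check on the statistics of this first hit, the bit score can be related to the E-value with the standard approximation E = m * n * 2^(-S'), where m is the query length, n the database size in letters and S' the bit score. The sketch below is an illustration only: the function name is hypothetical, and it uses the raw lengths from the report header, whereas BLAST applies effective-length corrections, so reported E-values can be somewhat smaller than this estimate.

```python
import math

QUERY_LEN = 551           # "(551 letters)" in the report header
DB_LETTERS = 14_482_855   # "14,482,855 total letters" in the report header


def approx_log10_evalue(bit_score, m=QUERY_LEN, n=DB_LETTERS):
    """log10 of E = m * n * 2**(-bit_score), without effective-length corrections."""
    return math.log10(m) + math.log10(n) - bit_score * math.log10(2)


log_e = approx_log10_evalue(598)
print(f"log10(E) is about {log_e:.1f}")
```

For the 598-bit hit the estimate falls between 1e-171 and 1e-170, which agrees with the reported Expect = e-171 to within the precision of the approximation.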
>AT3G26130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5)
protein | chr3:9553708-9555611 REVERSE LENGTH=551
Length = 551
Score = 583 bits (1504), Expect = e-167, Method: Compositional matrix adjust.
Identities = 286/514 (55%), Positives = 372/514 (72%), Gaps = 14/514 (2%)
Query: 44 ALPLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGIKSKGFN 103
A P ST RWIV++ RVKL CVNW SHL+ V EGLS+QP+D I++ I S GFN
Sbjct: 20 AFPPSTDSRWIVDDGNKG--RRVKLTCVNWPSHLETAVAEGLSKQPLDAIAEKIVSMGFN 77
Query: 104 CVRLTWSLSLLTNDS----LTVRESFQNLGLLQSISGMQANNPSFIDLPLIKALQAVVKS 159
CVRLTW L L T++S +TVR+S + L +++SG Q +NP+ +DLPLIKA Q VV
Sbjct: 78 CVRLTWPLYLATDESFSAFMTVRQSLRKFRLFEAVSGFQTHNPTILDLPLIKAFQEVVYC 137
Query: 160 LGDNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGVT-NVVG 218
L + VMVILDNHI+Q WCCS+ DGNGFFGD+H +P +WI GL KMA++F V+ NVVG
Sbjct: 138 LEKHRVMVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKGLKKMASMFANVSSNVVG 197
Query: 219 MSLRNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQPVQLTF 278
MSLRNELRGPKQN+ DWY+YM +GAE VH+ NPNVLVI+SGLN+ TDLSFL ++P +++F
Sbjct: 198 MSLRNELRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIVSGLNYATDLSFLRERPFEVSF 257
Query: 279 NKKLVFAAHWYSFSNTQAWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEFGVDMRG 338
+K+VF HWY F NT W + N+ CG+ T MM+MSGFLL++G PLF+SEFG+D RG
Sbjct: 258 RRKVVFEIHWYGFWNT--WEGDNLNKICGKETEKMMKMSGFLLEKGIPLFVSEFGIDQRG 315
Query: 339 TNVNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQVRSESF 398
N NDN+FL+CFMALAA+ D DW+LWTLAGSYY +G +E YG+L+ +WS +R+ +
Sbjct: 316 NNANDNKFLSCFMALAADRDLDWSLWTLAGSYYIREKSIGSDESYGVLDFNWSSIRNSTI 375
Query: 399 LQRISAVQLPFQGPSEAKPYKVIFHPLSGLCVLKNAQDLLILGSCSNSDIWEYTEQKILS 458
LQ ISA+Q PF G E +P K++FHP +GLC+++ + L LGSC+ S+ W + ++LS
Sbjct: 376 LQMISAIQTPFIGLMETQPKKIMFHPSTGLCIVRKSLFQLKLGSCNRSESWRLSSHRVLS 435
Query: 459 MKGTD-FCLQAADGEGKQVKLGKEWSTPN-SAWEMISDSNMQLSSKLNNGTSSVCLDVDA 516
+ CL+A + +GK VKL +S S W++ SDS MQLSS NG SVCLDVD
Sbjct: 436 LAEEQILCLKAYE-KGKSVKLRLFFSESYCSKWKLFSDSKMQLSSITKNGF-SVCLDVDT 493
Query: 517 D-NAIVTNACKCLNNDKTCDPASQWFKLVDSTRK 549
+ N IVTN+CKCL + +CDP SQWFKLV STR+
Sbjct: 494 ENNNIVTNSCKCLRGNSSCDPRSQWFKLVTSTRR 527
>AT3G26140.1 | Symbols: | Cellulase (glycosyl hydrolase family 5)
protein | chr3:9559742-9563070 REVERSE LENGTH=508
Length = 508
Score = 580 bits (1495), Expect = e-166, Method: Compositional matrix adjust.
Identities = 288/515 (55%), Positives = 372/515 (72%), Gaps = 18/515 (3%)
Query: 44 ALPLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGIKSKGFN 103
A PLST+ RWI++E +RVKLACVNW SHL +V EGLS+Q VD ++K I + GFN
Sbjct: 2 AYPLSTNSRWIIDEKG----QRVKLACVNWPSHLQPVVAEGLSKQSVDDLAKKIMAMGFN 57
Query: 104 CVRLTWSLSLLTNDSL----TVRESFQNLGLLQSISGMQANNPSFIDLPLIKALQAVVKS 159
CVR TW L L TN++L TVR+SFQ+LGL ISG + NPS IDLPLI+A + VV
Sbjct: 58 CVRFTWPLDLATNETLANNVTVRQSFQSLGLNDDISGFETKNPSMIDLPLIEAYKKVVAK 117
Query: 160 LGDNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGVTNVVGM 219
LG+N+VMVILDNH+T+ WCC DGNGFFGD FDP WI GLTK+A F G TNVVGM
Sbjct: 118 LGNNNVMVILDNHVTKPGWCCGYNDGNGFFGDTFFDPTTWIAGLTKIAMTFKGATNVVGM 177
Query: 220 SLRNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQPVQLTFN 279
SLRNELRGPKQNV DW++YM +GAE VH ANPNVLVILSGL++DTDLSF+ + V LTF
Sbjct: 178 SLRNELRGPKQNVDDWFKYMQQGAEAVHEANPNVLVILSGLSYDTDLSFVRSRHVNLTFT 237
Query: 280 KKLVFAAHWYSFSNTQAWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEFGVDMRGT 339
+KLVF H YSF+NT W++ +PN+ACG++ +++ GF L+ +P+FLSEFG+D+RG
Sbjct: 238 RKLVFELHRYSFTNTNTWSSKNPNEACGEILKSIENGGGFNLRD-FPVFLSEFGIDLRGK 296
Query: 340 NVNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQVRSESFL 399
NVNDNR++ C + AAE D DW++WTL GSYY G+VGM EFYG+L+ DW +VRS+SFL
Sbjct: 297 NVNDNRYIGCILGWAAENDVDWSIWTLQGSYYLREGVVGMSEFYGILDSDWVRVRSQSFL 356
Query: 400 QRISAVQLPFQGP-SEAKPYKVIFHPLSGLCVLKNAQD--LLILGSCSNSDIWEYTEQKI 456
QR+S + P QGP S++K Y ++FHPL+GLC+L++ D + LG C+ S W YT Q
Sbjct: 357 QRLSLILSPLQGPGSQSKVYNLVFHPLTGLCMLQSILDPTKVTLGLCNESQPWSYTPQNT 416
Query: 457 LSMKGTDFCLQAADGEGKQVKLGK-EWSTPN-SAWEMISDSNMQLSSKLNNGTSSVCLDV 514
L++K CL++ G VKL + S+PN S WE IS SNM L++K N +S+CLDV
Sbjct: 417 LTLKDKSLCLEST-GPNAPVKLSETSCSSPNLSEWETISASNMLLAAKSTN--NSLCLDV 473
Query: 515 DADNAIVTNACKCLN-NDKTCDPASQWFKLVDSTR 548
D N ++ + CKC+ D +CDP SQWFK+V ++
Sbjct: 474 DETNNLMASNCKCVKGEDSSCDPISQWFKIVKVSK 508
>AT5G17500.1 | Symbols: | Glycosyl hydrolase superfamily protein |
chr5:5767379-5769719 FORWARD LENGTH=526
Length = 526
Score = 525 bits (1353), Expect = e-149, Method: Compositional matrix adjust.
Identities = 263/508 (51%), Positives = 334/508 (65%), Gaps = 19/508 (3%)
Query: 46 PLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGIKSKGFNCV 105
PL T RWIVN RVKLAC NW SHL +V EGLS QP+D ISK IK GFNCV
Sbjct: 27 PLFTKSRWIVNNKG----HRVKLACANWPSHLKPVVAEGLSSQPMDSISKKIKDMGFNCV 82
Query: 106 RLTWSLSLLTNDSL----TVRESFQNLGLLQSISGMQANNPSFIDLPLIKALQAVVKSLG 161
RLTW L L+ ND+L TV++SF+ GL + G+ +NP ++ PLI QAVV SLG
Sbjct: 83 RLTWPLELMINDTLAFNVTVKQSFERYGLDHELQGIYTHNPYIVNTPLINVFQAVVYSLG 142
Query: 162 DNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGVTNVVGMSL 221
+DVMVILDNH T WCCSN D + FFGD F+P+LW++GL KMAT+F V NVVGMSL
Sbjct: 143 RHDVMVILDNHKTVPGWCCSNDDPDAFFGDPKFNPDLWMLGLKKMATIFMNVKNVVGMSL 202
Query: 222 RNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQPVQLTFNKK 281
RNELRG DWY+YM KGAE VH +NPNVLVILSGLNFD DLSFL +PV L+F KK
Sbjct: 203 RNELRGYNHTSKDWYKYMQKGAEAVHTSNPNVLVILSGLNFDADLSFLKDRPVNLSFKKK 262
Query: 282 LVFAAHWYSFSN-TQAWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEFGVDMRGTN 340
LV HWYSF++ T W + + N C Q+ R GF+L QG+PLFLSEFG D RG +
Sbjct: 263 LVLELHWYSFTDGTGQWKSHNVNDFCSQMFSKERRTGGFVLDQGFPLFLSEFGTDQRGGD 322
Query: 341 VNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQVRSESFLQ 400
+ NR++NC +A AAE D DWA+W + G YY G G+ E YG+L+ +W V + ++L+
Sbjct: 323 LEGNRYMNCMLAWAAEKDLDWAVWAVTGVYYFREGKRGVVEAYGMLDANWHNVHNYTYLR 382
Query: 401 RISAVQLPFQGPS-EAKPYKVIFHPLSGLCVLKNA---QDLLILGSCSNSDIWEYTEQKI 456
R+S +Q P GP + +K IFHPL+GLC+++ + + L LG C+ + W Y+ I
Sbjct: 383 RLSVIQPPHTGPGVKHNHHKKIFHPLTGLCLVRKSHCHESELTLGPCTKDEPWSYSHGGI 442
Query: 457 LSM-KGTDFCLQAADGEGKQVKLGKEWSTPNSAWEMISDSNMQLSSKLNNGTSSVCLDVD 515
L + +G CL+ GK VKLG+ + E IS + M LS ++G S VCLDVD
Sbjct: 443 LEIRRGHKSCLEGETAVGKSVKLGR----ICTKIEQISATKMHLSFNTSDG-SLVCLDVD 497
Query: 516 ADNAIVTNACKCLNNDKTCDPASQWFKL 543
+DN +V N+C CL D TC+PASQWFK+
Sbjct: 498 SDNNVVANSCNCLTGDTTCEPASQWFKI 525
>AT5G16700.1 | Symbols: | Glycosyl hydrolase superfamily protein |
chr5:5480763-5483045 FORWARD LENGTH=488
Length = 488
Score = 438 bits (1126), Expect = e-123, Method: Compositional matrix adjust.
Identities = 242/517 (46%), Positives = 315/517 (60%), Gaps = 55/517 (10%)
Query: 38 TIKPVGALPLSTSGRWIVNEXXXXXXERVKLACVNWVSHLDAMVVEGLSQQPVDVISKGI 97
T K + PLST RWIV+E +RVKLACVNW +HL V EGLS+QP+D ISK I
Sbjct: 17 TSKLTTSYPLSTKSRWIVDEKG----QRVKLACVNWPAHLQPTVAEGLSKQPLDSISKKI 72
Query: 98 KSKGFNCVRLTWSLSLLTNDSL----TVRESFQNLGLLQSISGMQANNPSFIDLPLIKAL 153
S GFNCVRLTW L L+TND+L TV++SF++L L + + G+Q +NP + LPL A
Sbjct: 73 VSMGFNCVRLTWPLDLVTNDTLALKVTVKQSFESLKLFEDVLGIQTHNPKLLHLPLFNAF 132
Query: 154 QAVVKSLGDNDVMVILDNHITQAMWCCSNTDGNGFFGDQHFDPNLWIMGLTKMATLFNGV 213
Q VV +LG+N VMVILDNH+T WCC + D + FFG HFDP +W GL KMATLF
Sbjct: 133 QEVVSNLGENGVMVILDNHLTTPGWCCGDNDLDAFFGYPHFDPLVWAKGLRKMATLFRNF 192
Query: 214 TNVVGMSLRNELRGPKQNVPDWYRYMSKGAEVVHAANPNVLVILSGLNFDTDLSFLDKQP 273
T+V+GMSLRNE RG + W+R+M +GAE VHAANP +LVILSG++FDT+LSFL +
Sbjct: 193 THVIGMSLRNEPRGARDYPDLWFRHMPQGAEAVHAANPKLLVILSGIDFDTNLSFLRDRS 252
Query: 274 VQLTFNKKLVFAAHWYSFSNTQ-AWTTGDPNQACGQVTRNMMRMSGFLLQQGWPLFLSEF 332
V ++F KLVF HWYSFS+ + +W + N C ++ + GFLL +G+PL LSEF
Sbjct: 253 VNVSFTDKLVFELHWYSFSDGRDSWRKHNSNDFCVKIIEKVTHNGGFLLGRGFPLILSEF 312
Query: 333 GVDMRGTNVNDNRFLNCFMALAAELDFDWALWTLAGSYYTARGIVGMEEFYGLLNGDWSQ 392
G D RG +++ NR++NC +A AAE D DWA+W L G YY
Sbjct: 313 GTDQRGGDMSGNRYMNCLVAWAAENDLDWAVWALTGDYY--------------------- 351
Query: 393 VRSESFLQRISAVQLPFQGPSEAKPYKVIFHPLSGLCVLKNAQD---LLILGSCSNSDIW 449
+R+ GP ++FHP +GLCV N D L LG C SD W
Sbjct: 352 LRT---------------GPGLRPNKNLLFHPSTGLCVTNNPSDNIPTLRLGPCPKSDPW 396
Query: 450 EYT-EQKILSMKGTDFCLQAADGEGKQVKLGKEWSTPNSAWEMISDSNMQLSSKLNNGTS 508
+ + IL + C++A + G++VKLG T S IS + M LS K +NG
Sbjct: 397 TFNPSEGILWI--NKMCVEAPNVVGQKVKLGV--GTKCSKLGQISATKMHLSFKTSNGL- 451
Query: 509 SVCLDVDA-DNAIVTNACKCLNNDKTCDPASQWFKLV 544
+CLDVD DN++V N CK L D +CDPASQWFK++
Sbjct: 452 LLCLDVDERDNSVVANRCKFLTMDASCDPASQWFKVL 488
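
The Identities, Positives and Gaps lines of the five alignments above can be cross-checked against their printed percentages. This is a minimal Python sketch with assumed names (`STAT_RE`, `check_stats`); it recomputes each percentage from the counts and compares it with the value in parentheses. In this report the printed figures match the percentage rounded down rather than rounded to nearest (for example, 286/514 is about 55.6% and is printed as 55%).

```python
import re

# Statistics lines copied from the five alignments in this report.
STATS = """\
Identities = 291/517 (56%), Positives = 364/517 (70%), Gaps = 13/517 (2%)
Identities = 286/514 (55%), Positives = 372/514 (72%), Gaps = 14/514 (2%)
Identities = 288/515 (55%), Positives = 372/515 (72%), Gaps = 18/515 (3%)
Identities = 263/508 (51%), Positives = 334/508 (65%), Gaps = 19/508 (3%)
Identities = 242/517 (46%), Positives = 315/517 (60%), Gaps = 55/517 (10%)
"""

# Matches e.g. "Identities = 291/517 (56%)".
STAT_RE = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+)%\)")


def check_stats(block):
    """Yield (label, count, length, reported_pct, floored_pct) for every statistic."""
    for label, count, length, reported in STAT_RE.findall(block):
        count, length, reported = int(count), int(length), int(reported)
        yield label, count, length, reported, (100 * count) // length


results = list(check_stats(STATS))
for label, count, length, reported, floored in results:
    status = "ok" if reported == floored else "MISMATCH"
    print(f"{label:<10} {count}/{length}  reported {reported}%  floored {floored}%  {status}")
```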