Miyakogusa Predicted Gene
- Lj4g3v2717180.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v2717180.1 CUFF.51555.1
(576 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G13130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5... 613 e-176
AT3G26140.1 | Symbols: | Cellulase (glycosyl hydrolase family 5... 565 e-161
AT3G26130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5... 557 e-159
AT5G17500.1 | Symbols: | Glycosyl hydrolase superfamily protein... 556 e-158
AT5G16700.1 | Symbols: | Glycosyl hydrolase superfamily protein... 447 e-126
>AT1G13130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5)
protein | chr1:4474726-4477820 FORWARD LENGTH=552
Length = 552
Score = 613 bits (1581), Expect = e-176, Method: Compositional matrix adjust.
Identities = 295/506 (58%), Positives = 376/506 (74%), Gaps = 7/506 (1%)
Query: 50 LNTNSRWIVNQDGLRVKLACVNWVSHLDAVVAEGLSKQPVDVISKGIKSMGFNCVRLTWP 109
L+T+SRWIV+++GLRVKL C NW SHL VVAEGLSKQPVD ++K I MGFNCVRLTWP
Sbjct: 35 LSTSSRWIVDENGLRVKLVCANWPSHLQPVVAEGLSKQPVDAVAKKIVEMGFNCVRLTWP 94
Query: 110 ILLVTNDSLSS-LTVRQSFQNLGLLDSVAGVQANNPSIIDLTLIQAFQAVVKSLGDNDVM 168
+ L+TN++L++ +TVRQSFQ+LGL D + G Q NNPSIIDL LI+A++ VV +LG+NDVM
Sbjct: 95 LDLMTNETLANNVTVRQSFQSLGLNDDIVGFQTNNPSIIDLPLIEAYKTVVTTLGNNDVM 154
Query: 169 AILDNHITQPGWCCSNSXXXXXXXXXXXXXXQWILGLTKMATLFNGVPNVVGMSLRNELR 228
ILDNH+T+PGWCC+N W+ L KMA FNGV NVVGMSLRNELR
Sbjct: 155 VILDNHLTKPGWCCANDDGNGFFGDQFFDPTVWVAALKKMAATFNGVSNVVGMSLRNELR 214
Query: 229 GPKQNVNDWYRYMVKGAEAVHAANPDVLVILSGLNFDKDLSFIKNRPVNLTFKGKLVYEA 288
GPKQNVNDW++YM +GAEAVH+AN VLVILSGL+FD DLSF+++RPV L+F GKLV+E
Sbjct: 215 GPKQNVNDWFKYMQQGAEAVHSANNKVLVILSGLSFDADLSFVRSRPVKLSFTGKLVFEL 274
Query: 289 HWYGFTDGQAWVSGNPNQVCGQVAGNMKRTSGFLVDQGWPLFVSEFGVDLRGSNVNDNRY 348
HWY F+DG +W + NPN +CG+V + G+L++QG+PLF+SEFG+D RG N NDNRY
Sbjct: 275 HWYSFSDGNSWAANNPNDICGRVLNRIGNGGGYLLNQGFPLFLSEFGIDERGVNTNDNRY 334
Query: 349 LNCFMAVAAELDLDWALWTLVGSYYFRQGVRGMEEFYGVLRWDWTQVRNTSFLNRINGLQ 408
C AAE D+DW+LW L GSYY RQG GM E+YGVL DW VRN+SFL +I+ LQ
Sbjct: 335 FGCLTGWAAENDVDWSLWALTGSYYLRQGKVGMNEYYGVLDSDWISVRNSSFLQKISFLQ 394
Query: 409 LPFRGPGITKGNPYKLIFHPLTGLCVTRKSLLEP--LTLGPCSFSDGWNYTPQKTLSIKG 466
P +GPG + + Y L+FHPLTGLC+ R SL +P LTLGPC+ S+ W+YT +K L IK
Sbjct: 395 SPLQGPG-PRTDAYNLVFHPLTGLCIVR-SLDDPKMLTLGPCNSSEPWSYT-KKALRIKD 451
Query: 467 TYFCIQAENEGMPAKLS-IICSGPNNKWEMISDSKLHLSSKVNNGSSVCLDVDENNNIVT 525
C+Q+ P ++ CS +KW+ IS S++HL+S +N +S+CLDVD NN+V
Sbjct: 452 QQLCLQSNGPKNPVTMTRTSCSTSGSKWQTISASRMHLASTTSNKTSLCLDVDTANNVVA 511
Query: 526 NSCKCLSRDVKCDPGSQWFKLIDSGR 551
N+CKCLS+D C+P SQWFK+I + R
Sbjct: 512 NACKCLSKDKSCEPMSQWFKIIKATR 537
>AT3G26140.1 | Symbols: | Cellulase (glycosyl hydrolase family 5)
protein | chr3:9559742-9563070 REVERSE LENGTH=508
Length = 508
Score = 565 bits (1457), Expect = e-161, Method: Compositional matrix adjust.
Identities = 279/504 (55%), Positives = 372/504 (73%), Gaps = 10/504 (1%)
Query: 50 LNTNSRWIVNQDGLRVKLACVNWVSHLDAVVAEGLSKQPVDVISKGIKSMGFNCVRLTWP 109
L+TNSRWI+++ G RVKLACVNW SHL VVAEGLSKQ VD ++K I +MGFNCVR TWP
Sbjct: 5 LSTNSRWIIDEKGQRVKLACVNWPSHLQPVVAEGLSKQSVDDLAKKIMAMGFNCVRFTWP 64
Query: 110 ILLVTNDSLSS-LTVRQSFQNLGLLDSVAGVQANNPSIIDLTLIQAFQAVVKSLGDNDVM 168
+ L TN++L++ +TVRQSFQ+LGL D ++G + NPS+IDL LI+A++ VV LG+N+VM
Sbjct: 65 LDLATNETLANNVTVRQSFQSLGLNDDISGFETKNPSMIDLPLIEAYKKVVAKLGNNNVM 124
Query: 169 AILDNHITQPGWCCSNSXXXXXXXXXXXXXXQWILGLTKMATLFNGVPNVVGMSLRNELR 228
ILDNH+T+PGWCC + WI GLTK+A F G NVVGMSLRNELR
Sbjct: 125 VILDNHVTKPGWCCGYNDGNGFFGDTFFDPTTWIAGLTKIAMTFKGATNVVGMSLRNELR 184
Query: 229 GPKQNVNDWYRYMVKGAEAVHAANPDVLVILSGLNFDKDLSFIKNRPVNLTFKGKLVYEA 288
GPKQNV+DW++YM +GAEAVH ANP+VLVILSGL++D DLSF+++R VNLTF KLV+E
Sbjct: 185 GPKQNVDDWFKYMQQGAEAVHEANPNVLVILSGLSYDTDLSFVRSRHVNLTFTRKLVFEL 244
Query: 289 HWYGFTDGQAWVSGNPNQVCGQVAGNMKRTSGFLVDQGWPLFVSEFGVDLRGSNVNDNRY 348
H Y FT+ W S NPN+ CG++ +++ GF + + +P+F+SEFG+DLRG NVNDNRY
Sbjct: 245 HRYSFTNTNTWSSKNPNEACGEILKSIENGGGFNL-RDFPVFLSEFGIDLRGKNVNDNRY 303
Query: 349 LNCFMAVAAELDLDWALWTLVGSYYFRQGVRGMEEFYGVLRWDWTQVRNTSFLNRINGLQ 408
+ C + AAE D+DW++WTL GSYY R+GV GM EFYG+L DW +VR+ SFL R++ +
Sbjct: 304 IGCILGWAAENDVDWSIWTLQGSYYLREGVVGMSEFYGILDSDWVRVRSQSFLQRLSLIL 363
Query: 409 LPFRGPGITKGNPYKLIFHPLTGLCVTRKSLLEP--LTLGPCSFSDGWNYTPQKTLSIKG 466
P +GPG ++ Y L+FHPLTGLC+ +S+L+P +TLG C+ S W+YTPQ TL++K
Sbjct: 364 SPLQGPG-SQSKVYNLVFHPLTGLCML-QSILDPTKVTLGLCNESQPWSYTPQNTLTLKD 421
Query: 467 TYFCIQAENEGMPAKLS-IICSGPN-NKWEMISDSKLHLSSKVNNGSSVCLDVDENNNIV 524
C+++ P KLS CS PN ++WE IS S + L++K N +S+CLDVDE NN++
Sbjct: 422 KSLCLESTGPNAPVKLSETSCSSPNLSEWETISASNMLLAAKSTN-NSLCLDVDETNNLM 480
Query: 525 TNSCKCLS-RDVKCDPGSQWFKLI 547
++CKC+ D CDP SQWFK++
Sbjct: 481 ASNCKCVKGEDSSCDPISQWFKIV 504
>AT3G26130.1 | Symbols: | Cellulase (glycosyl hydrolase family 5)
protein | chr3:9553708-9555611 REVERSE LENGTH=551
Length = 551
Score = 557 bits (1436), Expect = e-159, Method: Compositional matrix adjust.
Identities = 276/509 (54%), Positives = 364/509 (71%), Gaps = 12/509 (2%)
Query: 51 NTNSRWIVNQ--DGLRVKLACVNWVSHLDAVVAEGLSKQPVDVISKGIKSMGFNCVRLTW 108
+T+SRWIV+ G RVKL CVNW SHL+ VAEGLSKQP+D I++ I SMGFNCVRLTW
Sbjct: 24 STDSRWIVDDGNKGRRVKLTCVNWPSHLETAVAEGLSKQPLDAIAEKIVSMGFNCVRLTW 83
Query: 109 PILLVTNDSLSS-LTVRQSFQNLGLLDSVAGVQANNPSIIDLTLIQAFQAVVKSLGDNDV 167
P+ L T++S S+ +TVRQS + L ++V+G Q +NP+I+DL LI+AFQ VV L + V
Sbjct: 84 PLYLATDESFSAFMTVRQSLRKFRLFEAVSGFQTHNPTILDLPLIKAFQEVVYCLEKHRV 143
Query: 168 MAILDNHITQPGWCCSNSXXXXXXXXXXXXXXQWILGLTKMATLFNGVP-NVVGMSLRNE 226
M ILDNHI+QPGWCCS++ WI GL KMA++F V NVVGMSLRNE
Sbjct: 144 MVILDNHISQPGWCCSDNDGNGFFGDKHLNPQVWIKGLKKMASMFANVSSNVVGMSLRNE 203
Query: 227 LRGPKQNVNDWYRYMVKGAEAVHAANPDVLVILSGLNFDKDLSFIKNRPVNLTFKGKLVY 286
LRGPKQN+ DWY+YM +GAEAVH+ NP+VLVI+SGLN+ DLSF++ RP ++F+ K+V+
Sbjct: 204 LRGPKQNIKDWYKYMREGAEAVHSVNPNVLVIVSGLNYATDLSFLRERPFEVSFRRKVVF 263
Query: 287 EAHWYGFTDGQAWVSGNPNQVCGQVAGNMKRTSGFLVDQGWPLFVSEFGVDLRGSNVNDN 346
E HWYGF + W N N++CG+ M + SGFL+++G PLFVSEFG+D RG+N NDN
Sbjct: 264 EIHWYGFWN--TWEGDNLNKICGKETEKMMKMSGFLLEKGIPLFVSEFGIDQRGNNANDN 321
Query: 347 RYLNCFMAVAAELDLDWALWTLVGSYYFRQGVRGMEEFYGVLRWDWTQVRNTSFLNRING 406
++L+CFMA+AA+ DLDW+LWTL GSYY R+ G +E YGVL ++W+ +RN++ L I+
Sbjct: 322 KFLSCFMALAADRDLDWSLWTLAGSYYIREKSIGSDESYGVLDFNWSSIRNSTILQMISA 381
Query: 407 LQLPFRGPGITKGNPYKLIFHPLTGLCVTRKSLLEPLTLGPCSFSDGWNYTPQKTLSI-K 465
+Q PF G+ + P K++FHP TGLC+ RKSL + L LG C+ S+ W + + LS+ +
Sbjct: 382 IQTPF--IGLMETQPKKIMFHPSTGLCIVRKSLFQ-LKLGSCNRSESWRLSSHRVLSLAE 438
Query: 466 GTYFCIQAENEGMPAKLSIICSGPN-NKWEMISDSKLHLSSKVNNGSSVCLDVD-ENNNI 523
C++A +G KL + S +KW++ SDSK+ LSS NG SVCLDVD ENNNI
Sbjct: 439 EQILCLKAYEKGKSVKLRLFFSESYCSKWKLFSDSKMQLSSITKNGFSVCLDVDTENNNI 498
Query: 524 VTNSCKCLSRDVKCDPGSQWFKLIDSGRR 552
VTNSCKCL + CDP SQWFKL+ S RR
Sbjct: 499 VTNSCKCLRGNSSCDPRSQWFKLVTSTRR 527
>AT5G17500.1 | Symbols: | Glycosyl hydrolase superfamily protein |
chr5:5767379-5769719 FORWARD LENGTH=526
Length = 526
Score = 556 bits (1434), Expect = e-158, Method: Compositional matrix adjust.
Identities = 283/504 (56%), Positives = 347/504 (68%), Gaps = 11/504 (2%)
Query: 50 LNTNSRWIVNQDGLRVKLACVNWVSHLDAVVAEGLSKQPVDVISKGIKSMGFNCVRLTWP 109
L T SRWIVN G RVKLAC NW SHL VVAEGLS QP+D ISK IK MGFNCVRLTWP
Sbjct: 28 LFTKSRWIVNNKGHRVKLACANWPSHLKPVVAEGLSSQPMDSISKKIKDMGFNCVRLTWP 87
Query: 110 ILLVTNDSLS-SLTVRQSFQNLGLLDSVAGVQANNPSIIDLTLIQAFQAVVKSLGDNDVM 168
+ L+ ND+L+ ++TV+QSF+ GL + G+ +NP I++ LI FQAVV SLG +DVM
Sbjct: 88 LELMINDTLAFNVTVKQSFERYGLDHELQGIYTHNPYIVNTPLINVFQAVVYSLGRHDVM 147
Query: 169 AILDNHITQPGWCCSNSXXXXXXXXXXXXXXQWILGLTKMATLFNGVPNVVGMSLRNELR 228
ILDNH T PGWCCSN W+LGL KMAT+F V NVVGMSLRNELR
Sbjct: 148 VILDNHKTVPGWCCSNDDPDAFFGDPKFNPDLWMLGLKKMATIFMNVKNVVGMSLRNELR 207
Query: 229 GPKQNVNDWYRYMVKGAEAVHAANPDVLVILSGLNFDKDLSFIKNRPVNLTFKGKLVYEA 288
G DWY+YM KGAEAVH +NP+VLVILSGLNFD DLSF+K+RPVNL+FK KLV E
Sbjct: 208 GYNHTSKDWYKYMQKGAEAVHTSNPNVLVILSGLNFDADLSFLKDRPVNLSFKKKLVLEL 267
Query: 289 HWYGFTDGQA-WVSGNPNQVCGQVAGNMKRTSGFLVDQGWPLFVSEFGVDLRGSNVNDNR 347
HWY FTDG W S N N C Q+ +RT GF++DQG+PLF+SEFG D RG ++ NR
Sbjct: 268 HWYSFTDGTGQWKSHNVNDFCSQMFSKERRTGGFVLDQGFPLFLSEFGTDQRGGDLEGNR 327
Query: 348 YLNCFMAVAAELDLDWALWTLVGSYYFRQGVRGMEEFYGVLRWDWTQVRNTSFLNRINGL 407
Y+NC +A AAE DLDWA+W + G YYFR+G RG+ E YG+L +W V N ++L R++ +
Sbjct: 328 YMNCMLAWAAEKDLDWAVWAVTGVYYFREGKRGVVEAYGMLDANWHNVHNYTYLRRLSVI 387
Query: 408 QLPFRGPGITKGNPYKLIFHPLTGLCVTRKSLLE--PLTLGPCSFSDGWNYTPQKTLSI- 464
Q P GPG+ K N +K IFHPLTGLC+ RKS LTLGPC+ + W+Y+ L I
Sbjct: 388 QPPHTGPGV-KHNHHKKIFHPLTGLCLVRKSHCHESELTLGPCTKDEPWSYSHGGILEIR 446
Query: 465 KGTYFCIQAENE-GMPAKLSIICSGPNNKWEMISDSKLHLSSKVNNGSSVCLDVDENNNI 523
+G C++ E G KL IC+ K E IS +K+HLS ++GS VCLDVD +NN+
Sbjct: 447 RGHKSCLEGETAVGKSVKLGRICT----KIEQISATKMHLSFNTSDGSLVCLDVDSDNNV 502
Query: 524 VTNSCKCLSRDVKCDPGSQWFKLI 547
V NSC CL+ D C+P SQWFK+
Sbjct: 503 VANSCNCLTGDTTCEPASQWFKIF 526
>AT5G16700.1 | Symbols: | Glycosyl hydrolase superfamily protein |
chr5:5480763-5483045 FORWARD LENGTH=488
Length = 488
Score = 447 bits (1151), Expect = e-126, Method: Compositional matrix adjust.
Identities = 241/506 (47%), Positives = 316/506 (62%), Gaps = 51/506 (10%)
Query: 50 LNTNSRWIVNQDGLRVKLACVNWVSHLDAVVAEGLSKQPVDVISKGIKSMGFNCVRLTWP 109
L+T SRWIV++ G RVKLACVNW +HL VAEGLSKQP+D ISK I SMGFNCVRLTWP
Sbjct: 26 LSTKSRWIVDEKGQRVKLACVNWPAHLQPTVAEGLSKQPLDSISKKIVSMGFNCVRLTWP 85
Query: 110 ILLVTNDSLS-SLTVRQSFQNLGLLDSVAGVQANNPSIIDLTLIQAFQAVVKSLGDNDVM 168
+ LVTND+L+ +TV+QSF++L L + V G+Q +NP ++ L L AFQ VV +LG+N VM
Sbjct: 86 LDLVTNDTLALKVTVKQSFESLKLFEDVLGIQTHNPKLLHLPLFNAFQEVVSNLGENGVM 145
Query: 169 AILDNHITQPGWCCSNSXXXXXXXXXXXXXXQWILGLTKMATLFNGVPNVVGMSLRNELR 228
ILDNH+T PGWCC ++ W GL KMATLF +V+GMSLRNE R
Sbjct: 146 VILDNHLTTPGWCCGDNDLDAFFGYPHFDPLVWAKGLRKMATLFRNFTHVIGMSLRNEPR 205
Query: 229 GPKQNVNDWYRYMVKGAEAVHAANPDVLVILSGLNFDKDLSFIKNRPVNLTFKGKLVYEA 288
G + + W+R+M +GAEAVHAANP +LVILSG++FD +LSF+++R VN++F KLV+E
Sbjct: 206 GARDYPDLWFRHMPQGAEAVHAANPKLLVILSGIDFDTNLSFLRDRSVNVSFTDKLVFEL 265
Query: 289 HWYGFTDGQ-AWVSGNPNQVCGQVAGNMKRTSGFLVDQGWPLFVSEFGVDLRGSNVNDNR 347
HWY F+DG+ +W N N C ++ + GFL+ +G+PL +SEFG D RG +++ NR
Sbjct: 266 HWYSFSDGRDSWRKHNSNDFCVKIIEKVTHNGGFLLGRGFPLILSEFGTDQRGGDMSGNR 325
Query: 348 YLNCFMAVAAELDLDWALWTLVGSYYFRQGVRGMEEFYGVLRWDWTQVRNTSFLNRINGL 407
Y+NC +A AAE DLDWA+W L G YY R
Sbjct: 326 YMNCLVAWAAENDLDWAVWALTGDYYLRT------------------------------- 354
Query: 408 QLPFRGPGITKGNPYKLIFHPLTGLCVTRKSL--LEPLTLGPCSFSDGWNYTPQKTLSIK 465
GPG+ L+FHP TGLCVT + L LGPC SD W + P + + +
Sbjct: 355 -----GPGLRPNK--NLLFHPSTGLCVTNNPSDNIPTLRLGPCPKSDPWTFNPSEGI-LW 406
Query: 466 GTYFCIQAEN-EGMPAKLSI--ICSGPNNKWEMISDSKLHLSSKVNNGSSVCLDVDE-NN 521
C++A N G KL + CS K IS +K+HLS K +NG +CLDVDE +N
Sbjct: 407 INKMCVEAPNVVGQKVKLGVGTKCS----KLGQISATKMHLSFKTSNGLLLCLDVDERDN 462
Query: 522 NIVTNSCKCLSRDVKCDPGSQWFKLI 547
++V N CK L+ D CDP SQWFK++
Sbjct: 463 SVVANRCKFLTMDASCDPASQWFKVL 488