Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC001081A_C01 KMC001081A_c01
(946 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_194967.1| cellulose synthase catalytic subunit (RSW1); pr... 419 e-122
gb|AAF89961.1|AF200525_1 cellulose synthase-1 [Zea mays] 369 e-106
gb|AAF89962.1|AF200526_1 cellulose synthase-2 [Zea mays] 363 e-104
pir||T02209 cellulose synthase (EC 2.4.1.-) catalytic chain - ri... 348 4e-99
ref|NP_180124.1| putative cellulose synthase catalytic subunit; ... 319 4e-90
>ref|NP_194967.1| cellulose synthase catalytic subunit (RSW1); protein id:
At4g32410.1 [Arabidopsis thaliana]
gi|7484861|pir||T05351 cellulose synthase (EC 2.4.1.-)
catalytic chain RSW1 - Arabidopsis thaliana
gi|2827139|gb|AAC39334.1| cellulose synthase catalytic
subunit [Arabidopsis thaliana]
gi|4049343|emb|CAA22568.1| cellulose synthase catalytic
subunit (RSW1) [Arabidopsis thaliana]
gi|7270145|emb|CAB79958.1| cellulose synthase catalytic
subunit (RSW1) [Arabidopsis thaliana]
Length = 1081
Score = 419 bits (1078), Expect(2) = e-122
Identities = 207/262 (79%), Positives = 226/262 (86%), Gaps = 1/262 (0%)
Frame = +1
Query: 79 MEANAGMVAGSHKRNELVRIRHDSSDSGPKPLKNLNGQICQICGDNVGISATGDVFVACN 258
MEA+AG+VAGS++RNELVRIRH+S D G KPLKN+NGQICQICGD+VG++ TGDVFVACN
Sbjct: 1 MEASAGLVAGSYRRNELVRIRHES-DGGTKPLKNMNGQICQICGDDVGLAETGDVFVACN 59
Query: 259 ECGFPVCRPCYEYERKDGNQSCPQCKTRYKRQRGSARVDGDEDEDDVDDLENEFNYVQGN 438
EC FPVCRPCYEYERKDG Q CPQCKTR++R RGS RV+GDEDEDDVDD+ENEFNY QG
Sbjct: 60 ECAFPVCRPCYEYERKDGTQCCPQCKTRFRRHRGSPRVEGDEDEDDVDDIENEFNYAQGA 119
Query: 439 AKASRQWEEGSDLSLSSRRDPQQPIPLLTNGQTVSGEIPCATPDTQSVRTTSGPLGPGDK 618
KA Q G + S SSR + QPIPLLT+G TVSGEI TPDTQSVRTTSGPLGP D+
Sbjct: 120 NKARHQ-RHGEEFSSSSRHE-SQPIPLLTHGHTVSGEI--RTPDTQSVRTTSGPLGPSDR 175
Query: 619 -AHSLHYTDPRQPVPVRIVDPSKDLNSYGLGNVDWKERVEGWKLKQEKNMVQMTGKYPEG 795
A S Y DPRQPVPVRIVDPSKDLNSYGLGNVDWKERVEGWKLKQEKNM+QMTGKY EG
Sbjct: 176 NAISSPYIDPRQPVPVRIVDPSKDLNSYGLGNVDWKERVEGWKLKQEKNMLQMTGKYHEG 235
Query: 796 KGGDIEGTGSNGEELQMVDDAR 861
KGG+IEGTGSNGEELQM DD R
Sbjct: 236 KGGEIEGTGSNGEELQMADDTR 257
Score = 42.0 bits (97), Expect(2) = e-122
Identities = 19/27 (70%), Positives = 24/27 (88%)
Frame = +3
Query: 864 PMSRIVPISSTQITPYRVVIILR*LFL 944
PMSR+VPI S+++TPYRVVIILR + L
Sbjct: 259 PMSRVVPIPSSRLTPYRVVIILRLIIL 285
>gb|AAF89961.1|AF200525_1 cellulose synthase-1 [Zea mays]
Length = 1075
Score = 369 bits (946), Expect(2) = e-106
Identities = 177/265 (66%), Positives = 207/265 (77%), Gaps = 4/265 (1%)
Frame = +1
Query: 79 MEANAGMVAGSHKRNELVRIRHDSSDSGP-KPLKNLNGQICQICGDNVGISATGDVFVAC 255
M AN GMVAGSH RNE V IRHD G KP K+ NGQ+CQICGD+VG+SATGDVFVAC
Sbjct: 1 MAANKGMVAGSHNRNEFVMIRHDGDVPGSAKPTKSANGQVCQICGDSVGVSATGDVFVAC 60
Query: 256 NECGFPVCRPCYEYERKDGNQSCPQCKTRYKRQRGSARVDGDEDEDDVDDLENEFNYVQG 435
NEC FPVCRPCYEYERK+GNQ CPQCKTRYKRQ+GS RV GDEDE+DVDDL+NEFNY QG
Sbjct: 61 NECAFPVCRPCYEYERKEGNQCCPQCKTRYKRQKGSPRVHGDEDEEDVDDLDNEFNYKQG 120
Query: 436 NAKASRQWE---EGSDLSLSSRRDPQQPIPLLTNGQTVSGEIPCATPDTQSVRTTSGPLG 606
+ K +W+ + +DLS S+R +P IP LT+GQ +SGEIP A+PD S+R+ +
Sbjct: 121 SGKGP-EWQLQGDDADLSSSARHEPHHRIPRLTSGQQISGEIPDASPDRHSIRSPTS--- 176
Query: 607 PGDKAHSLHYTDPRQPVPVRIVDPSKDLNSYGLGNVDWKERVEGWKLKQEKNMVQMTGKY 786
Y DP PVPVRIVDPSKDLNSYGL +VDWKERVE W++KQ+KNM+Q+T KY
Sbjct: 177 --------SYVDPSVPVPVRIVDPSKDLNSYGLNSVDWKERVESWRVKQDKNMMQVTNKY 228
Query: 787 PEGKGGDIEGTGSNGEELQMVDDAR 861
PE +GGD+EGTGSNGE +QMVDDAR
Sbjct: 229 PEARGGDMEGTGSNGEXMQMVDDAR 253
Score = 38.5 bits (88), Expect(2) = e-106
Identities = 19/27 (70%), Positives = 22/27 (81%)
Frame = +3
Query: 864 PMSRIVPISSTQITPYRVVIILR*LFL 944
P+SRIVPISS Q+ YRVVIILR + L
Sbjct: 255 PLSRIVPISSNQLNLYRVVIILRLIIL 281
>gb|AAF89962.1|AF200526_1 cellulose synthase-2 [Zea mays]
Length = 1074
Score = 363 bits (932), Expect(2) = e-104
Identities = 176/265 (66%), Positives = 206/265 (77%), Gaps = 4/265 (1%)
Frame = +1
Query: 79 MEANAGMVAGSHKRNELVRIRHDSSDSGP-KPLKNLNGQICQICGDNVGISATGDVFVAC 255
M AN GMVAGSH RNE V IRHD P KP K+ NGQ+CQICGD VG+SATGDVFVAC
Sbjct: 1 MAANKGMVAGSHNRNEFVMIRHDGDAPVPAKPTKSANGQVCQICGDTVGVSATGDVFVAC 60
Query: 256 NECGFPVCRPCYEYERKDGNQSCPQCKTRYKRQRGSARVDGDEDEDDVDDLENEFNYVQG 435
NEC FPVCRPCYEYERK+GNQ CPQCKTRYKRQ+GS RV GD++E+DVDDL+NEFNY QG
Sbjct: 61 NECAFPVCRPCYEYERKEGNQCCPQCKTRYKRQKGSPRVHGDDEEEDVDDLDNEFNYKQG 120
Query: 436 NAKASRQWE---EGSDLSLSSRRDPQQPIPLLTNGQTVSGEIPCATPDTQSVRTTSGPLG 606
N K +W+ + +DLS S+R DP IP LT+GQ +SGEIP A+PD S+R+ +
Sbjct: 121 NGKGP-EWQLQGDDADLSSSARHDPHHRIPRLTSGQQISGEIPDASPDRHSIRSPTS--- 176
Query: 607 PGDKAHSLHYTDPRQPVPVRIVDPSKDLNSYGLGNVDWKERVEGWKLKQEKNMVQMTGKY 786
Y DP PVPVRIVDPSKDLNSYGL +VDWKERVE W++KQ+KNM+Q+T KY
Sbjct: 177 --------SYVDPSVPVPVRIVDPSKDLNSYGLNSVDWKERVESWRVKQDKNMLQVTNKY 228
Query: 787 PEGKGGDIEGTGSNGEELQMVDDAR 861
PE + GD+EGTGSNGE++QMVDDAR
Sbjct: 229 PEAR-GDMEGTGSNGEDMQMVDDAR 252
Score = 38.1 bits (87), Expect(2) = e-104
Identities = 18/27 (66%), Positives = 22/27 (80%)
Frame = +3
Query: 864 PMSRIVPISSTQITPYRVVIILR*LFL 944
P+SRIVPISS Q+ YR+VIILR + L
Sbjct: 254 PLSRIVPISSNQLNLYRIVIILRLIIL 280
>pir||T02209 cellulose synthase (EC 2.4.1.-) catalytic chain - rice (fragment)
gi|2781433|gb|AAC39333.1| RSW1-like cellulose synthase
catalytic subunit [Oryza sativa subsp. japonica]
Length = 583
Score = 348 bits (894), Expect(2) = 4e-99
Identities = 171/266 (64%), Positives = 201/266 (75%), Gaps = 5/266 (1%)
Frame = +1
Query: 79 MEANAGMVAGSHKRNELVRIRHDSSDSGP-KPLKNLNGQICQICGDNVGISATGDVFVAC 255
M ANAGMVAGS RNE V IR D P KP K++NGQ+CQICGD VG+SATGDVFVAC
Sbjct: 1 MAANAGMVAGSRNRNEFVMIRPDGDAPPPAKPGKSVNGQVCQICGDTVGVSATGDVFVAC 60
Query: 256 NECGFPVCRPCYEYERKDGNQSCPQCKTRYKRQRGSARVDGDEDEDDVDDLENEFNYVQG 435
NEC FPVCRPCYEYERK+GNQ CPQCKTRYKR +GS RV GDE+E+DVDDL+NEFNY G
Sbjct: 61 NECAFPVCRPCYEYERKEGNQCCPQCKTRYKRHKGSPRVQGDEEEEDVDDLDNEFNYKHG 120
Query: 436 NAKASRQWE---EGSDLSL-SSRRDPQQPIPLLTNGQTVSGEIPCATPDTQSVRTTSGPL 603
N K +W+ +G D+ L SS R Q IP LT+GQ +SGEIP A+PD S+R+ +
Sbjct: 121 NGKGP-EWQIQRQGEDVDLSSSSRHEQHRIPRLTSGQQISGEIPDASPDRHSIRSGTS-- 177
Query: 604 GPGDKAHSLHYTDPRQPVPVRIVDPSKDLNSYGLGNVDWKERVEGWKLKQEKNMVQMTGK 783
Y DP PVPVRIVDPSKDLNSYG+ +VDW+ERV W+ KQ+KNM+Q+ K
Sbjct: 178 ---------SYVDPSVPVPVRIVDPSKDLNSYGINSVDWQERVASWRNKQDKNMMQVANK 228
Query: 784 YPEGKGGDIEGTGSNGEELQMVDDAR 861
YPE +GGD+EGTGSNGE++QMVDDAR
Sbjct: 229 YPEARGGDMEGTGSNGEDIQMVDDAR 254
Score = 36.2 bits (82), Expect(2) = 4e-99
Identities = 17/27 (62%), Positives = 21/27 (76%)
Frame = +3
Query: 864 PMSRIVPISSTQITPYRVVIILR*LFL 944
P+SRIVPI S Q+ YR+VIILR + L
Sbjct: 256 PLSRIVPIPSNQLNLYRIVIILRLIIL 282
>ref|NP_180124.1| putative cellulose synthase catalytic subunit; protein id:
At2g25540.1 [Arabidopsis thaliana]
gi|25412330|pir||F84649 probable cellulose synthase
catalytic subunit [imported] - Arabidopsis thaliana
gi|4432865|gb|AAD20713.1| putative cellulose synthase
catalytic subunit [Arabidopsis thaliana]
Length = 1065
Score = 319 bits (818), Expect(2) = 4e-90
Identities = 163/262 (62%), Positives = 194/262 (73%), Gaps = 7/262 (2%)
Frame = +1
Query: 97 MVAGSHKRNELVRIRHDSSDSGPKPLKNLNGQICQICGDNVGISATGDVFVACNECGFPV 276
MVAGS++R E VR R D SD G KPLK+LNGQICQICGD+VG++ TG+VFVACNECGFP+
Sbjct: 1 MVAGSYRRYEFVRNR-DDSDDGLKPLKDLNGQICQICGDDVGLTKTGNVFVACNECGFPL 59
Query: 277 CRPCYEYERKDGNQSCPQCKTRYKRQRGSARVDGDEDEDDVDDLENEFNYVQGNAKASRQ 456
C+ CYEYERKDG+Q CPQCK R++R GS RV+ DE EDDV+D+ENEF+Y QGN KA R
Sbjct: 60 CQSCYEYERKDGSQCCPQCKARFRRHNGSPRVEVDEKEDDVNDIENEFDYTQGNNKA-RL 118
Query: 457 WEEGSDLSLSSRRDPQQPIPLLTNGQTVSGEIPCATPDTQSVRTTSGPLGPGDKAHSLHY 636
+ S SSR + P+ LLT+G VSGEIP TPD + L P
Sbjct: 119 PHRAEEFSSSSRHEESLPVSLLTHGHPVSGEIP--TPDRNAT------LSP--------C 162
Query: 637 TDPRQP-------VPVRIVDPSKDLNSYGLGNVDWKERVEGWKLKQEKNMVQMTGKYPEG 795
DP+ P +PVRI+DPSKDLNSYGL NVDWK+R++GWKLKQ+KNM+ MTGKY EG
Sbjct: 163 IDPQLPGIYQLLLLPVRILDPSKDLNSYGLVNVDWKKRIQGWKLKQDKNMIHMTGKYHEG 222
Query: 796 KGGDIEGTGSNGEELQMVDDAR 861
KGG+ EGTGSNG+ELQMVDDAR
Sbjct: 223 KGGEFEGTGSNGDELQMVDDAR 244
Score = 35.4 bits (80), Expect(2) = 4e-90
Identities = 15/27 (55%), Positives = 21/27 (77%)
Frame = +3
Query: 864 PMSRIVPISSTQITPYRVVIILR*LFL 944
PMSR+V S ++TPYR+VI+LR + L
Sbjct: 246 PMSRVVHFPSARMTPYRIVIVLRLIIL 272
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 854,061,055
Number of Sequences: 1393205
Number of extensions: 20524183
Number of successful extensions: 89622
Number of sequences better than 10.0: 258
Number of HSP's better than 10.0 without gapping: 70276
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 85821
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 52969081112
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)