Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000412A_C01 KMC000412A_c01
(1146 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAD39534.2| cellulose synthase catalytic subunit [Gossypium h... 409 e-113
ref|NP_196136.1| cellulose synthase catalytic subunit (gb|AAC393... 408 e-113
pir||T52054 cellulose synthase (EC 2.4.1.-) catalytic subunit [v... 408 e-113
gb|AAF89964.1|AF200528_1 cellulose synthase-4 [Zea mays] 304 1e-81
gb|AAF89969.1|AF200533_1 cellulose synthase-9 [Zea mays] 298 8e-80
>gb|AAD39534.2| cellulose synthase catalytic subunit [Gossypium hirsutum]
Length = 1067
Score = 409 bits (1051), Expect = e-113
Identities = 204/273 (74%), Positives = 217/273 (78%)
Frame = +2
Query: 326 MESEGEAGAKPITTSGAQVCQICSDNVGKTVDGEAFIACDVCAFPVCRPCYEYERKDGNQ 505
MESEG+ G KP+ G Q CQIC DNVGK DG+ FIAC++CAFPVCRPCYEYERKDGNQ
Sbjct: 1 MESEGDIGGKPMKNLGGQTCQICGDNVGKNTDGDPFIACNICAFPVCRPCYEYERKDGNQ 60
Query: 506 SCPQCKTRYKRHKGSPAILGDSEEEGGADDGASDFNYDSEHQNQKQKISERMWGWQLTYG 685
SCPQCKTRYK KGSPAILGD E G ADDGASDF Y SE+Q QKQK++ERM GW YG
Sbjct: 61 SCPQCKTRYKWQKGSPAILGDRETGGDADDGASDFIY-SENQEQKQKLAERMQGWNAKYG 119
Query: 686 RSEEVGAPNYDKEVSHNHIPRLTSGQEVSGEFSAGSPERLSMASPTAGGGKRVHNLPYSS 865
R E+VGAP YDKE+SHNHIP LTSGQEVSGE SA SPERLSMASP GGK S
Sbjct: 120 RGEDVGAPTYDKEISHNHIPLLTSGQEVSGELSAASPERLSMASPGVAGGK--------S 171
Query: 866 DVNQSPNIRVGDSGGLGNVAWKERVDGWKMKQEKNVVPMSTGQAASERGAGDIDASTDVL 1045
+ +R S GLGNVAWKERVDGWKMKQEKN VPMST QA SERG GDIDASTDVL
Sbjct: 172 SIRVVDPVREFGSSGLGNVAWKERVDGWKMKQEKNTVPMSTCQATSERGLGDIDASTDVL 231
Query: 1046 VDDSLLNDEARQPLSRKVSVPSSRINPYRMVII 1144
VDDS LNDEARQPLSRKVSV SS+INPYRMVII
Sbjct: 232 VDDSQLNDEARQPLSRKVSVSSSKINPYRMVII 264
>ref|NP_196136.1| cellulose synthase catalytic subunit (gb|AAC39336.1); protein id:
At5g05170.1, supported by cDNA: gi_2827142 [Arabidopsis
thaliana] gi|9759258|dbj|BAB09693.1| cellulose synthase
catalytic subunit [Arabidopsis thaliana]
gi|26983832|gb|AAN86168.1| putative cellulose synthase
catalytic subunit [Arabidopsis thaliana]
Length = 1065
Score = 408 bits (1049), Expect = e-113
Identities = 203/273 (74%), Positives = 226/273 (82%)
Frame = +2
Query: 326 MESEGEAGAKPITTSGAQVCQICSDNVGKTVDGEAFIACDVCAFPVCRPCYEYERKDGNQ 505
MESEGE KP+ Q CQICSDNVGKTVDG+ F+ACD+C+FPVCRPCYEYERKDGNQ
Sbjct: 1 MESEGETAGKPMKNIVPQTCQICSDNVGKTVDGDRFVACDICSFPVCRPCYEYERKDGNQ 60
Query: 506 SCPQCKTRYKRHKGSPAILGDSEEEGGADDGASDFNYDSEHQNQKQKISERMWGWQLTYG 685
SCPQCKTRYKR KGSPAI GD +E+G AD+G +FNY QK+KISERM GW LT G
Sbjct: 61 SCPQCKTRYKRLKGSPAIPGDKDEDGLADEGTVEFNYP-----QKEKISERMLGWHLTRG 115
Query: 686 RSEEVGAPNYDKEVSHNHIPRLTSGQEVSGEFSAGSPERLSMASPTAGGGKRVHNLPYSS 865
+ EE+G P YDKEVSHNH+PRLTS Q+ SGEFSA SPERLS++S T GGKR LPYSS
Sbjct: 116 KGEEMGEPQYDKEVSHNHLPRLTSRQDTSGEFSAASPERLSVSS-TIAGGKR---LPYSS 171
Query: 866 DVNQSPNIRVGDSGGLGNVAWKERVDGWKMKQEKNVVPMSTGQAASERGAGDIDASTDVL 1045
DVNQSPN R+ D GLGNVAWKERVDGWKMKQEKN P+ST QAASERG DIDASTD+L
Sbjct: 172 DVNQSPNRRIVDPVGLGNVAWKERVDGWKMKQEKNTGPVST-QAASERGGVDIDASTDIL 230
Query: 1046 VDDSLLNDEARQPLSRKVSVPSSRINPYRMVII 1144
D++LLNDEARQPLSRKVS+PSSRINPYRMVI+
Sbjct: 231 ADEALLNDEARQPLSRKVSIPSSRINPYRMVIM 263
>pir||T52054 cellulose synthase (EC 2.4.1.-) catalytic subunit [validated] -
Arabidopsis thaliana gi|2827143|gb|AAC39336.1| cellulose
synthase catalytic subunit [Arabidopsis thaliana]
Length = 1065
Score = 408 bits (1049), Expect = e-113
Identities = 203/273 (74%), Positives = 226/273 (82%)
Frame = +2
Query: 326 MESEGEAGAKPITTSGAQVCQICSDNVGKTVDGEAFIACDVCAFPVCRPCYEYERKDGNQ 505
MESEGE KP+ Q CQICSDNVGKTVDG+ F+ACD+C+FPVCRPCYEYERKDGNQ
Sbjct: 1 MESEGETAGKPMKNIVPQTCQICSDNVGKTVDGDRFVACDICSFPVCRPCYEYERKDGNQ 60
Query: 506 SCPQCKTRYKRHKGSPAILGDSEEEGGADDGASDFNYDSEHQNQKQKISERMWGWQLTYG 685
SCPQCKTRYKR KGSPAI GD +E+G AD+G +FNY QK+KISERM GW LT G
Sbjct: 61 SCPQCKTRYKRLKGSPAIPGDKDEDGLADEGTVEFNYP-----QKEKISERMLGWHLTRG 115
Query: 686 RSEEVGAPNYDKEVSHNHIPRLTSGQEVSGEFSAGSPERLSMASPTAGGGKRVHNLPYSS 865
+ EE+G P YDKEVSHNH+PRLTS Q+ SGEFSA SPERLS++S T GGKR LPYSS
Sbjct: 116 KGEEMGEPQYDKEVSHNHLPRLTSRQDTSGEFSAASPERLSVSS-TIAGGKR---LPYSS 171
Query: 866 DVNQSPNIRVGDSGGLGNVAWKERVDGWKMKQEKNVVPMSTGQAASERGAGDIDASTDVL 1045
DVNQSPN R+ D GLGNVAWKERVDGWKMKQEKN P+ST QAASERG DIDASTD+L
Sbjct: 172 DVNQSPNRRIVDPVGLGNVAWKERVDGWKMKQEKNTGPVST-QAASERGGVDIDASTDIL 230
Query: 1046 VDDSLLNDEARQPLSRKVSVPSSRINPYRMVII 1144
D++LLNDEARQPLSRKVS+PSSRINPYRMVI+
Sbjct: 231 ADEALLNDEARQPLSRKVSIPSSRINPYRMVIM 263
>gb|AAF89964.1|AF200528_1 cellulose synthase-4 [Zea mays]
Length = 1077
Score = 304 bits (779), Expect = 1e-81
Identities = 166/286 (58%), Positives = 194/286 (67%), Gaps = 16/286 (5%)
Frame = +2
Query: 335 EGEA-GAKPITTSGAQVCQICSDNVGKTVDGEAFIACDVCAFPVCRPCYEYERKDGNQSC 511
EG+A G K G QVCQIC D VG T +G+ F ACDVC FPVCRPCYEYERKDG Q+C
Sbjct: 2 EGDADGVKSGRRGGGQVCQICGDGVGTTAEGDVFAACDVCGFPVCRPCYEYERKDGTQAC 61
Query: 512 PQCKTRYKRHKGSPAILGDSEEEGGADDGASDFNY-DSEHQNQKQKISERMWGWQLTYGR 688
PQCKT+YKRHKGSPAI G EEG D SDFNY S +++QKQKI++RM W++ G
Sbjct: 62 PQCKTKYKRHKGSPAIRG---EEGDDTDADSDFNYLASGNEDQKQKIADRMRSWRMNVGG 118
Query: 689 SEEVGAPNYDK-----------EVSHNHIPRLTSGQEVSGEFSAGSPERLSMASPTAGGG 835
S +VG P YD E+ +IP +T+ Q +SGE SP+ M SPT G
Sbjct: 119 SGDVGRPKYDSGEIGLTKYDSGEIPRGYIPSVTNSQ-ISGEIPGASPDH-HMMSPTGNIG 176
Query: 836 KRVHNLPYSSDVNQSPNIRVGDSGGLGNVAWKERVDGWKMKQEKNVVPMSTGQ--AASE- 1006
KR PY VN SPN SG +GNVAWKERVDGWKMKQ+K +PM+ G A SE
Sbjct: 177 KRA-PFPY---VNHSPNPSREFSGSIGNVAWKERVDGWKMKQDKGTIPMTNGTSIAPSEG 232
Query: 1007 RGAGDIDASTDVLVDDSLLNDEARQPLSRKVSVPSSRINPYRMVII 1144
RG GDIDASTD ++D+LLNDE RQPLSRKV +PSSRINPYRMVI+
Sbjct: 233 RGVGDIDASTDYNMEDALLNDETRQPLSRKVPLPSSRINPYRMVIV 278
>gb|AAF89969.1|AF200533_1 cellulose synthase-9 [Zea mays]
Length = 1079
Score = 298 bits (764), Expect = 8e-80
Identities = 162/286 (56%), Positives = 194/286 (67%), Gaps = 16/286 (5%)
Frame = +2
Query: 335 EGEA-GAKPITTSGAQVCQICSDNVGKTVDGEAFIACDVCAFPVCRPCYEYERKDGNQSC 511
EG+A G K G QVCQIC D VG T +G+ F ACDVC FPVCRPCYEYERKDG Q+C
Sbjct: 2 EGDADGVKSGRRGGGQVCQICGDGVGTTAEGDVFTACDVCGFPVCRPCYEYERKDGTQAC 61
Query: 512 PQCKTRYKRHKGSPAILGDSEEEGGADDGASDFNYD-SEHQNQKQKISERMWGWQLTYGR 688
PQCK +YKRHKGSPAI G+ ++ ADD ASDFNY S + +QKQKI++RM W++ G
Sbjct: 62 PQCKNKYKRHKGSPAIRGEEGDDTDADD-ASDFNYPASGNDDQKQKIADRMRSWRMNAGG 120
Query: 689 SEEVGAPNYDK-----------EVSHNHIPRLTSGQEVSGEFSAGSPERLSMASPTAGGG 835
S +VG P YD E+ +IP +T+ Q +SGE SP+ M SPT G
Sbjct: 121 SGDVGRPKYDSGEIGLTKYDSGEIPRGYIPSVTNSQ-ISGEIPGASPDH-HMMSPTGNIG 178
Query: 836 KRVHNLPYSSDVNQSPNIRVGDSGGLGNVAWKERVDGWKMKQEKNVVPMSTGQ--AASE- 1006
+R PY +N S N SG +GNVAWKERVDGWKMKQ+K +PM+ G A SE
Sbjct: 179 RRA-PFPY---MNHSSNPSREFSGSVGNVAWKERVDGWKMKQDKGTIPMTNGTSIAPSEG 234
Query: 1007 RGAGDIDASTDVLVDDSLLNDEARQPLSRKVSVPSSRINPYRMVII 1144
RG GDIDASTD ++D+LLNDE RQPLSRKV +PSSRINPYRMVI+
Sbjct: 235 RGVGDIDASTDYNMEDALLNDETRQPLSRKVPLPSSRINPYRMVIV 280
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,062,949,583
Number of Sequences: 1393205
Number of extensions: 25761385
Number of successful extensions: 95533
Number of sequences better than 10.0: 191
Number of HSP's better than 10.0 without gapping: 81202
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 92727
length of database: 448,689,247
effective HSP length: 125
effective length of database: 274,538,622
effective search space used: 70281887232
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)