
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0037a.5
(991 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC77631 similar to PIR|T51579|T51579 cellulose synthase catalyti... 1162 0.0
TC85693 similar to PIR|T05351|T05351 cellulose synthase (EC 2.4.... 1050 0.0
TC86887 homologue to PIR|T10800|T10800 cellulose synthase (EC 2.... 874 0.0
TC89697 similar to GP|4886756|gb|AAD32031.1| cellulose synthase ... 478 e-149
TC79809 homologue to GP|6446577|gb|AAD39534.2| cellulose synthas... 452 e-127
BM780101 similar to GP|9622890|gb cellulose synthase-9 {Zea mays... 401 e-112
TC86230 homologue to GP|6446577|gb|AAD39534.2| cellulose synthas... 222 e-108
TC80467 homologue to GP|18875454|gb|AAK11588.2 cellulose synthas... 306 e-105
TC87399 similar to GP|17381168|gb|AAL36396.1 putative cellulose ... 355 6e-98
TC85440 homologue to GP|6934298|gb|AAF31705.1| heat-shock protei... 349 3e-96
TC77822 weakly similar to GP|21536834|gb|AAM61166.1 unknown {Ara... 348 6e-96
TC82247 similar to PIR|T10797|T10797 cellulose synthase (EC 2.4.... 323 2e-88
BG644386 similar to PIR|T05351|T0 cellulose synthase (EC 2.4.1.-... 307 1e-83
TC86229 homologue to GP|6446577|gb|AAD39534.2| cellulose synthas... 282 6e-82
TC90826 homologue to GP|6446577|gb|AAD39534.2| cellulose synthas... 271 1e-72
TC89444 similar to GP|12619788|gb|AAG60543.1 cellulose synthase-... 233 1e-66
AW773751 similar to PIR|D86157|D8 hypothetical protein AAF02892.... 224 1e-58
AW688489 similar to PIR|T10797|T10 cellulose synthase (EC 2.4.1.... 219 3e-57
TC79459 homologue to GP|6446577|gb|AAD39534.2| cellulose synthas... 214 1e-55
BG452180 similar to PIR|T10800|T10 cellulose synthase (EC 2.4.1.... 174 5e-53
>TC77631 similar to PIR|T51579|T51579 cellulose synthase catalytic subunit
(IRX3) - Arabidopsis thaliana, partial (77%)
Length = 2694
Score = 1162 bits (3006), Expect = 0.0
Identities = 540/797 (67%), Positives = 657/797 (81%), Gaps = 15/797 (1%)
Frame = +1
Query: 210 SGRLSPYRMMVVTRLLLLLLFIQYRIFHPVPDAIGLWLISVVCEIWLTLSWIVDQLPKWF 269
S +++PYRM++V RL++L F++YR+ +PV DA+GLWL S++CEIW +SWI+DQ PKW+
Sbjct: 4 SSKINPYRMVIVARLVILGFFLRYRLMNPVHDAMGLWLTSIICEIWFAISWILDQFPKWY 183
Query: 270 PIDRETYLDRLSIRFEPENKPNMLSPVDIFVTTVDPIKEPPLVTANTVLSILALDYPAHK 329
PIDRETYLDRLS+R+E E +PNML+PVD+FV+TVDP+KEPPL TANTVLSILA+DYP K
Sbjct: 184 PIDRETYLDRLSLRYEREGEPNMLAPVDVFVSTVDPLKEPPLNTANTVLSILAMDYPIDK 363
Query: 330 ISCYVSDDGASMLTFEALQETAEFARKWVPFCKKFSAEPRAPERYFSQKIDFLKDTLQST 389
ISCY+SDDGASM TFEAL ETAEFARKWVPFCKKF EPRAPE YFS+KID+LKD +Q T
Sbjct: 364 ISCYISDDGASMCTFEALSETAEFARKWVPFCKKFLIEPRAPEMYFSEKIDYLKDKVQPT 543
Query: 390 YVKERRTMKREYEEFKVRINALVAKSLRVPPEGWTLKDETPWPGNNTKDHPSMIQILLGH 449
+VKERR+MKREYEEFKVRINALVAK+ +VP GW ++D TPWPGNNTKDHP MIQ+ LGH
Sbjct: 544 FVKERRSMKREYEEFKVRINALVAKAQKVPAGGWIMQDGTPWPGNNTKDHPGMIQVFLGH 723
Query: 450 SEGH--EGNELPCLIYISREKRPAFQHHSKAGAMNALLRVSAVLSNAPFVLNLDCNHYVN 507
S GH EGN+LP L+Y+SREKRP FQHH KAGAMNAL+RVSAVL+NAPF+LNLDC+HY+N
Sbjct: 724 SGGHDSEGNQLPRLVYVSREKRPGFQHHKKAGAMNALVRVSAVLTNAPFMLNLDCDHYIN 903
Query: 508 NSKVVREAMCFFMDIQFGNSIGFVQFPLRFDSLDRNDRYANKNTVLFDINLRCQDGLQGP 567
NSK VREAMCF MD Q G + +VQFP RFD +D +DRYAN+NTV FDIN++ DG+QGP
Sbjct: 904 NSKAVREAMCFLMDPQTGKKVCYVQFPQRFDGIDAHDRYANRNTVFFDINMKGLDGIQGP 1083
Query: 568 AYIGSACIFRRKALNGFDPPKASKRQREVQV-------------HSKQDESGEDGSIKEA 614
Y+G+ C+FRR+AL G++PPK KR + V H+ D +GE ++
Sbjct: 1084VYVGTGCVFRRQALYGYNPPKGPKRPKMVSCDCCPCFGRRKKVKHAMNDANGEAAGLR-G 1260
Query: 615 TDEDKQLLKSHMNVENKFGNSTLFMNSSLTEEGGVDPSSSQEALLKEAIHVLSCRYEDRT 674
++DK+LL S MN E KFG S++F+ S L EEGGV PSSS + LKEAIHV+SC YED+T
Sbjct: 1261MEDDKELLMSQMNFEKKFGQSSIFVTSVLMEEGGVPPSSSPASQLKEAIHVISCGYEDKT 1440
Query: 675 LWGYEVGLSYGSIAADVLTSLKLHSRGWRSVYCMPKRAAFRGTAPINLTERLNQVLRWAV 734
WG E+G YGSI D+LT K+H RGWRS+YCMPKR AF+GTAPINL++RLNQVLRWA+
Sbjct: 1441EWGIELGWIYGSITEDILTGFKMHCRGWRSIYCMPKRVAFKGTAPINLSDRLNQVLRWAL 1620
Query: 735 GSLEILFSRHCPIWYGFKEGRLKGLQRIAYINSTVYPFSSIPLLIYCLIPAICLLTDKFI 794
GS+EI FS HCP+WYG KEG+LK L+R AY N+TVYPF+SIPL+ YC++PA+CLLTDKFI
Sbjct: 1621GSIEIFFSHHCPLWYGHKEGKLKWLERFAYANTTVYPFTSIPLVAYCILPAVCLLTDKFI 1800
Query: 795 TPSVDTFASMIIISLFISIFGSAILELRWSGVSLEEWWRSQQFWVIGSVSAHLFAVAQAL 854
P + TFAS+ ++LF SI + ILEL+WSGVS+EEWWR++QFWVIG VSAHLFAV Q L
Sbjct: 1801MPPISTFASLYFVALFSSIMATGILELKWSGVSIEEWWRNEQFWVIGGVSAHLFAVIQGL 1980
Query: 855 MGGLAKKVNKNFSVVSKAPDDEEFHELYTIRWTALLVPPTTIIIINLIGVVAGFTDAINS 914
+ LA ++ NF+V SKA DDEEF ELY I+WT LL+PPTTI+IIN++GVVAG +DAIN+
Sbjct: 1981LKVLA-GIDTNFTVTSKATDDEEFGELYAIKWTTLLIPPTTILIINIVGVVAGISDAINN 2157
Query: 915 GAHSYGALLGKLFFSLWVIAHLYPFLKGLMGRQNRTPTLIVIWSVLLASIFSLVWVRLDP 974
G S+G L GKLFFS WVI HLYPFLKGLMGRQNRTPT++VIWSVLLASIFSL+WVR+DP
Sbjct: 2158GYQSWGPLFGKLFFSFWVIVHLYPFLKGLMGRQNRTPTIVVIWSVLLASIFSLLWVRIDP 2337
Query: 975 FVLKTKGPDVKQCGISC 991
FV+KTKGPD K CGI+C
Sbjct: 2338FVMKTKGPDTKLCGINC 2388
>TC85693 similar to PIR|T05351|T05351 cellulose synthase (EC 2.4.1.-)
catalytic chain RSW1 - Arabidopsis thaliana, partial
(79%)
Length = 3029
Score = 1050 bits (2714), Expect = 0.0
Identities = 505/880 (57%), Positives = 653/880 (73%), Gaps = 52/880 (5%)
Frame = +3
Query: 164 LDDKEKTDDWKL-NQGNLWPETAAPVD---------------PEKNMNDETRQPLSRKVA 207
+D KE+ + WKL ++ N+ T D E M D+ RQP+SR V
Sbjct: 6 VDWKERVEGWKLKHEKNMVQMTGRYADGKSGGGDIEGTGSNGEELQMVDDARQPMSRIVP 185
Query: 208 IPSGRLSPYRMMVVTRLLLLLLFIQYRIFHPVPDAIGLWLISVVCEIWLTLSWIVDQLPK 267
I S +L+PYR+++V RL++L F+QYR+ HPV DA LWL SV+CEIW SWI+DQ PK
Sbjct: 186 ISSSQLTPYRVVIVFRLIVLGFFLQYRVTHPVKDAYPLWLTSVICEIWFAFSWILDQFPK 365
Query: 268 WFPIDRETYLDRLSIRFEPENKPNMLSPVDIFVTTVDPIKEPPLVTANTVLSILALDYPA 327
W PI+RETYL+RL+IR++ + +P+ L+PVD+FV+TVDP+KEPP+VTANTVLSILA+DYP
Sbjct: 366 WSPINRETYLERLAIRYDRDGEPSQLAPVDVFVSTVDPLKEPPIVTANTVLSILAVDYPV 545
Query: 328 HKISCYVSDDGASMLTFEALQETAEFARKWVPFCKKFSAEPRAPERYFSQKIDFLKDTLQ 387
K+SCYVSDDG++ML+FEAL ETAEFA+ WVPFCKK S EPRAPE YF QKID+LKD +Q
Sbjct: 546 DKVSCYVSDDGSAMLSFEALSETAEFAKMWVPFCKKHSIEPRAPEFYFLQKIDYLKDKVQ 725
Query: 388 STYVKERRTMKREYEEFKVRINALVAKSLRVPPEGWTLKDETPWPGNNTKDHPSMIQILL 447
++VKERR MKR+YEEFKVRINA VAK+ ++P EGWT++D TPWPGNN +DHP MIQ+ L
Sbjct: 726 PSFVKERRAMKRQYEEFKVRINAYVAKAQKMPEEGWTMQDGTPWPGNNPRDHPGMIQVFL 905
Query: 448 GHSEG--HEGNELPCLIYISREKRPAFQHHSKAGAMNALLRVSAVLSNAPFVLNLDCNHY 505
GHS G +GNELP L+Y+SREKRP FQHH KAGAMNAL+RVSAVL+N ++LN+DC+HY
Sbjct: 906 GHSGGLDTDGNELPRLVYVSREKRPGFQHHKKAGAMNALIRVSAVLTNGAYLLNVDCDHY 1085
Query: 506 VNNSKVVREAMCFFMDIQFGNSIGFVQFPLRFDSLDRNDRYANKNTVLFDINLRCQDGLQ 565
NNSK ++EAMCF MD +G +VQFP RFD +D +DRYAN+N V FDINL+ QDG+Q
Sbjct: 1086FNNSKALKEAMCFMMDPAYGKKTCYVQFPQRFDGIDLHDRYANRNIVFFDINLKGQDGIQ 1265
Query: 566 GPAYIGSACIFRRKALNGFDP--------------------------------PKASKRQ 593
GP Y+G+ C F R+AL G+DP K ++
Sbjct: 1266GPVYVGTGCCFNRQALYGYDPVLTEEDLEPNIIVKSCWGSRKKGKGGNKKYGDKKRGVKR 1445
Query: 594 REVQVHSKQDESGEDGSIKEATDEDKQLLKSHMNVENKFGNSTLFMNSSLTEEGGVDPSS 653
E + E E+G E D+++ LL S ++E +FG S +F+ ++ E+GG+ PS+
Sbjct: 1446TESTIPIFNMEDIEEG--VEGYDDERSLLMSQKSLEKRFGQSPVFIAATFMEQGGLPPST 1619
Query: 654 SQEALLKEAIHVLSCRYEDRTLWGYEVGLSYGSIAADVLTSLKLHSRGWRSVYCMPKRAA 713
+ LLKEAIHV+SC YED+T WG E+G YGS+ D+LT K+H+RGW SVYCMP R A
Sbjct: 1620NSTTLLKEAIHVISCGYEDKTEWGKEIGWIYGSVTEDILTGFKMHARGWISVYCMPPRPA 1799
Query: 714 FRGTAPINLTERLNQVLRWAVGSLEILFSRHCPIWYGFKEGRLKGLQRIAYINSTVYPFS 773
F+G+APINL++RLNQVLRWA+GS+EI SRHCP+WYG+ GR++ L R+AYIN+ +YPF+
Sbjct: 1800FKGSAPINLSDRLNQVLRWALGSIEIFLSRHCPLWYGY-NGRMRPLMRLAYINTIIYPFT 1976
Query: 774 SIPLLIYCLIPAICLLTDKFITPSVDTFASMIIISLFISIFGSAILELRWSGVSLEEWWR 833
SIPLL YC++PA CLLT+KFI P + FASM I LF SIF ++ILELRWSGV +E+WWR
Sbjct: 1977SIPLLAYCVLPAFCLLTNKFIIPEISNFASMWFILLFTSIFTTSILELRWSGVGIEDWWR 2156
Query: 834 SQQFWVIGSVSAHLFAVAQALMGGLAKKVNKNFSVVSKAPD-DEEFHELYTIRWTALLVP 892
++QFWVIG SAHLFAV Q L+ LA ++ NF+V SKA D D +F ELY +WT+LL+P
Sbjct: 2157NEQFWVIGGTSAHLFAVFQGLLKVLA-GIDTNFTVTSKANDEDGDFAELYVFKWTSLLIP 2333
Query: 893 PTTIIIINLIGVVAGFTDAINSGAHSYGALLGKLFFSLWVIAHLYPFLKGLMGRQNRTPT 952
PTT++I+NLIG+VAG + AINSG S+G L GKLFF++WVIAHLYPFLKGL+G+ NRTPT
Sbjct: 2334PTTVLIVNLIGIVAGVSFAINSGYQSWGPLFGKLFFAIWVIAHLYPFLKGLLGKSNRTPT 2513
Query: 953 LIVIWSVLLASIFSLVWVRLDPFVL-KTKGPDVKQCGISC 991
++++W+VLLASIFSL+WVR+DPF+ K QCGI+C
Sbjct: 2514IVIVWAVLLASIFSLLWVRIDPFISDPNKSSSNSQCGINC 2633
>TC86887 homologue to PIR|T10800|T10800 cellulose synthase (EC 2.4.1.-)
catalytic chain celA2 - upland cotton (fragment),
partial (98%)
Length = 2463
Score = 874 bits (2258), Expect = 0.0
Identities = 418/716 (58%), Positives = 540/716 (75%), Gaps = 50/716 (6%)
Frame = +3
Query: 326 PAHKISCYVSDDGASMLTFEALQETAEFARKWVPFCKKFSAEPRAPERYFSQKIDFLKDT 385
P K++CYVSDDGASML F+ L ET+EFAR+WVPFCKK+S EPRAPE YF++KID+LKD
Sbjct: 12 PCEKVTCYVSDDGASMLLFDCLAETSEFARRWVPFCKKYSIEPRAPEYYFNEKIDYLKDK 191
Query: 386 LQSTYVKERRTMKREYEEFKVRINALVAKSLRVPPEGWTLKDETPWPGNNTKDHPSMIQI 445
++ T+VKERR+MKREYEEFKV+INALVAK+L+ P EGW ++D TPWPGNNT+DHP MIQ+
Sbjct: 192 VEPTFVKERRSMKREYEEFKVKINALVAKALKKPEEGWVMQDGTPWPGNNTRDHPGMIQV 371
Query: 446 LLGHSEGH--EGNELPCLIYISREKRPAFQHHSKAGAMNALLRVSAVLSNAPFVLNLDCN 503
LG + EG ELP L+YISREKRP + HH KAGAMNAL+RVSAVL+NAPF+LNLDC+
Sbjct: 372 YLGSAGALDVEGKELPKLVYISREKRPGYPHHKKAGAMNALVRVSAVLTNAPFMLNLDCD 551
Query: 504 HYVNNSKVVREAMCFFMDIQFGNSIGFVQFPLRFDSLDRNDRYANKNTVLFDINLRCQDG 563
HY+NNSK +REAMCF MD Q G + +VQFP RFD +DR+DRYAN+NTV FDIN++ DG
Sbjct: 552 HYINNSKALREAMCFLMDPQLGKKLCYVQFPQRFDGIDRHDRYANRNTVFFDINMKGLDG 731
Query: 564 LQGPAYIGSACIFRRKALNGFDPPKASKRQREV--------------------------- 596
+QGP Y+G+ +F R+AL G+DPP + KR +
Sbjct: 732 IQGPVYVGTGTVFNRQALYGYDPPVSEKRPKMTCDCWPKWCCFCCGSRKTKSKKKSGTNG 911
Query: 597 -----QVHSKQDESGED------GSI---------KEATDE-DKQLLKSHMNVENKFGNS 635
+++ K+ G+D GS+ E +E +K L S + E +FG S
Sbjct: 912 RSLFSRLYKKKKMGGKDYVRKGSGSMFDLEEIEEGLEGYEELEKSSLMSQKSFEKRFGQS 1091
Query: 636 TLFMNSSLTEEGGVDPSSSQEALLKEAIHVLSCRYEDRTLWGYEVGLSYGSIAADVLTSL 695
+F+ S+L E GG+ ++ ++L+KEAIH +SC YE++T WG E+G YGS+ D+LT
Sbjct: 1092PVFIASTLMENGGLPEGTNTQSLVKEAIHNISCGYEEKTDWGKEIGWIYGSVTEDILTGF 1271
Query: 696 KLHSRGWRSVYCMPKRAAFRGTAPINLTERLNQVLRWAVGSLEILFSRHCPIWYGFKEGR 755
K+H RGW+SVYCMPKR AF+G+APINL++RL+QVLRWA+GS+EI SRHCP+WYG+ G+
Sbjct: 1272KMHCRGWKSVYCMPKRPAFKGSAPINLSDRLHQVLRWALGSVEIFLSRHCPLWYGYG-GK 1448
Query: 756 LKGLQRIAYINSTVYPFSSIPLLIYCLIPAICLLTDKFITPSVDTFASMIIISLFISIFG 815
LK L+R+AY N+ VYPF+SIPLL YC IPA+CLLT KFI P++ AS+ ++LFISI
Sbjct: 1449LKYLERLAYTNTIVYPFTSIPLLAYCTIPAVCLLTGKFIIPTLTNLASVWFMALFISIIL 1628
Query: 816 SAILELRWSGVSLEEWWRSQQFWVIGSVSAHLFAVAQALMGGLAKKVNKNFSVVSKAPDD 875
+ +LELRWSGV++E+WWR++QFWVIG VSAHLFAV Q L+ LA V+ NF+V +KA DD
Sbjct: 1629TGVLELRWSGVAIEDWWRNEQFWVIGGVSAHLFAVFQGLLKVLAG-VDTNFTVTAKAADD 1805
Query: 876 EEFHELYTIRWTALLVPPTTIIIINLIGVVAGFTDAINSGAHSYGALLGKLFFSLWVIAH 935
EF ELY +WT LL+PPTT+II+N++GVVAG +DAINSG+ S+G L GKLFF+ WVI H
Sbjct: 1806AEFGELYLFKWTTLLIPPTTLIILNIVGVVAGVSDAINSGSGSWGPLFGKLFFAFWVIVH 1985
Query: 936 LYPFLKGLMGRQNRTPTLIVIWSVLLASIFSLVWVRLDPFVLKTKGPDVKQCGISC 991
LYPFLKGLMG+QNRTPT++V+WS+LLASIFSL+WVR+DPF+ K GP +KQCG+ C
Sbjct: 1986LYPFLKGLMGKQNRTPTIVVLWSILLASIFSLIWVRIDPFLPKQTGPILKQCGVEC 2153
>TC89697 similar to GP|4886756|gb|AAD32031.1| cellulose synthase catalytic
subunit {Arabidopsis thaliana}, partial (26%)
Length = 1011
Score = 478 bits (1230), Expect(2) = e-149
Identities = 243/312 (77%), Positives = 260/312 (82%)
Frame = +2
Query: 1 MEASTRLFAGSHNSNELVVIQGNDEPKQVKNLDGQLCEICGDSVGLTVDGDLFVACEECG 60
MEA + LFAGS NSNELVVIQ +EPK VKNLDGQ CEICGDSVG TV+GDLFVACEECG
Sbjct: 5 MEAKSGLFAGSLNSNELVVIQKQNEPKAVKNLDGQDCEICGDSVGRTVEGDLFVACEECG 184
Query: 61 FPVCRPCYEYERREGTQGCPQCHTRYKRIKGSPRVSGDEDEDDVDDIEQEFKMEEEKYKL 120
FPVCRPCYEYER+EG+Q CPQCHTRYKRIKGSPRV GDEDE+DVDDIEQEFKMEEEKYKL
Sbjct: 185 FPVCRPCYEYERKEGSQNCPQCHTRYKRIKGSPRVEGDEDEEDVDDIEQEFKMEEEKYKL 364
Query: 121 KQEEMLQGKMKHGDDDENAKPLLVNGELPISSYSIVEPAGGEKLDDKEKTDDWKLNQGNL 180
M Q M DDD+ E P+ S+SI E G KLD+KEKTD+WK QGNL
Sbjct: 365 ----MHQDNMNSIDDDDTKYR-----EQPLYSHSIGENYGA-KLDNKEKTDEWK-QQGNL 511
Query: 181 WPETAAPVDPEKNMNDETRQPLSRKVAIPSGRLSPYRMMVVTRLLLLLLFIQYRIFHPVP 240
ET A VDPEK M DETRQPLSRKVAIPSGRLSPYRMMVV RL+LLLLF +YRI HPVP
Sbjct: 512 LIETDA-VDPEKAMKDETRQPLSRKVAIPSGRLSPYRMMVVARLILLLLFFEYRISHPVP 688
Query: 241 DAIGLWLISVVCEIWLTLSWIVDQLPKWFPIDRETYLDRLSIRFEPENKPNMLSPVDIFV 300
DAIGLW ISV CEIWL LSWIVDQ+PKWFPIDRETYLDRLS+RFEPENKPNM SP+DIF+
Sbjct: 689 DAIGLWFISVSCEIWLALSWIVDQIPKWFPIDRETYLDRLSVRFEPENKPNMPSPIDIFI 868
Query: 301 TTVDPIKEPPLV 312
TT DPIKEPPLV
Sbjct: 869 TTADPIKEPPLV 904
Score = 70.9 bits (172), Expect(2) = e-149
Identities = 35/37 (94%), Positives = 36/37 (96%)
Frame = +3
Query: 311 LVTANTVLSILALDYPAHKISCYVSDDGASMLTFEAL 347
L TANTVLSILALDYPA+KISCYVSDDGASMLTFEAL
Sbjct: 900 LCTANTVLSILALDYPANKISCYVSDDGASMLTFEAL 1010
>TC79809 homologue to GP|6446577|gb|AAD39534.2| cellulose synthase catalytic
subunit {Gossypium hirsutum}, partial (29%)
Length = 1139
Score = 452 bits (1163), Expect = e-127
Identities = 207/316 (65%), Positives = 263/316 (82%), Gaps = 1/316 (0%)
Frame = +1
Query: 677 GYEVGLSYGSIAADVLTSLKLHSRGWRSVYCMPKRAAFRGTAPINLTERLNQVLRWAVGS 736
G E+G YGS+ D+LT K+H+RGWRS+YCMPKR AF+G+APINL++RLNQVLRWA+GS
Sbjct: 19 GTEIGWIYGSVTEDILTGFKMHARGWRSIYCMPKRPAFKGSAPINLSDRLNQVLRWALGS 198
Query: 737 LEILFSRHCPIWYGFKEGRLKGLQRIAYINSTVYPFSSIPLLIYCLIPAICLLTDKFITP 796
+EILFSRHCPIWYG+ GRLK L+R AYIN+T+YP ++IPLL+YC +PA+CLLT+KFI P
Sbjct: 199 VEILFSRHCPIWYGYG-GRLKWLERFAYINTTIYPVTAIPLLMYCTLPAVCLLTNKFIIP 375
Query: 797 SVDTFASMIIISLFISIFGSAILELRWSGVSLEEWWRSQQFWVIGSVSAHLFAVAQALMG 856
+ AS+ ISLF+SIF + ILE+RWSGV ++EWWR++QFWVIG VSAHLFAV Q L+
Sbjct: 376 QISNLASIWFISLFLSIFATGILEMRWSGVGIDEWWRNEQFWVIGGVSAHLFAVFQGLLK 555
Query: 857 GLAKKVNKNFSVVSKAPDDE-EFHELYTIRWTALLVPPTTIIIINLIGVVAGFTDAINSG 915
LA ++ NF+V SKA D++ + ELY I+WT LL+PPTT++IINL+GVVAG + AINSG
Sbjct: 556 VLAG-IDTNFTVTSKASDEDGDSAELYMIKWTTLLIPPTTLLIINLVGVVAGISYAINSG 732
Query: 916 AHSYGALLGKLFFSLWVIAHLYPFLKGLMGRQNRTPTLIVIWSVLLASIFSLVWVRLDPF 975
S+G L GKLFF+ WVI HLYPFLKGLMGRQNRTPT++V+WS+LLASIFSL+WVR+DPF
Sbjct: 733 YQSWGPLFGKLFFAFWVIVHLYPFLKGLMGRQNRTPTIVVVWSILLASIFSLLWVRVDPF 912
Query: 976 VLKTKGPDVKQCGISC 991
+ GP ++CGI+C
Sbjct: 913 TTRVTGPKAEECGINC 960
>BM780101 similar to GP|9622890|gb cellulose synthase-9 {Zea mays}, partial
(20%)
Length = 709
Score = 401 bits (1031), Expect = e-112
Identities = 188/236 (79%), Positives = 214/236 (90%), Gaps = 1/236 (0%)
Frame = +1
Query: 724 ERLNQVLRWAVGSLEILFSRHCPIWYGFKEGRLKGLQRIAYINSTVYPFSSIPLLIYCLI 783
ERLNQVLRWAVGSLEILFS HCPIWYGFKEGRLK LQRIAYINSTVYPFS++PL+IYC++
Sbjct: 1 ERLNQVLRWAVGSLEILFSHHCPIWYGFKEGRLKLLQRIAYINSTVYPFSALPLIIYCIV 180
Query: 784 PAICLLTDKFITPSVDTFASMIIISLFISIFGSAILELRWSGVSLEEWWRSQQFWVIGSV 843
PA+CLLTDKFITPSV TFAS++ ISLFISIF S+ILELRWSGVSLEEWWR+QQFWVIGS+
Sbjct: 181 PAVCLLTDKFITPSVGTFASLVFISLFISIFASSILELRWSGVSLEEWWRNQQFWVIGSI 360
Query: 844 SAHLFAVAQALMGGLAKKVNKNFSVVSKAPDDE-EFHELYTIRWTALLVPPTTIIIINLI 902
SAHLFA+ Q LMG + N +F++VSKAPDD+ EF+ELYTIRWT LL+PPTT+ I N+I
Sbjct: 361 SAHLFAIVQGLMGRFLGRFNAHFNIVSKAPDDDGEFNELYTIRWTVLLIPPTTVTIFNII 540
Query: 903 GVVAGFTDAINSGAHSYGALLGKLFFSLWVIAHLYPFLKGLMGRQNRTPTLIVIWS 958
G+VAGFTDAINSG H +GAL+GKLFFS WVIAHLYPF GLMGRQNRTPTL+VIW+
Sbjct: 541 GIVAGFTDAINSGEHEWGALIGKLFFSSWVIAHLYPFP*GLMGRQNRTPTLVVIWA 708
>TC86230 homologue to GP|6446577|gb|AAD39534.2| cellulose synthase catalytic
subunit {Gossypium hirsutum}, partial (29%)
Length = 984
Score = 222 bits (565), Expect(3) = e-108
Identities = 109/179 (60%), Positives = 136/179 (75%), Gaps = 9/179 (5%)
Frame = +2
Query: 329 KISCYVSDDGASML-------TFEALQETAEFARKWVPFCKKFSAEPRAPERYFSQKIDF 381
+ SCYVSDDGA+ML +LQE + K K++ EPRAPE YF+QKID+
Sbjct: 452 RCSCYVSDDGAAMLII*SSC*NITSLQENGFHSPK------KYNIEPRAPEWYFAQKIDY 613
Query: 382 LKDTLQSTYVKERRTMKREYEEFKVRINALVAKSLRVPPEGWTLKDETPWPGNNTKDHPS 441
LKD +Q+++VK+RR MKREYEEFK+RIN LVAK+ +VP EGW ++D TPWPGNN +DHP
Sbjct: 614 LKDKVQTSFVKDRRAMKREYEEFKIRINGLVAKATKVPEEGWVMQDGTPWPGNNVRDHPG 793
Query: 442 MIQILLGHSEG--HEGNELPCLIYISREKRPAFQHHSKAGAMNALLRVSAVLSNAPFVL 498
MIQ+ LG S G +GNELP L+Y+SREKRP FQHH KAGAMNAL+RVSAVL+N PF++
Sbjct: 794 MIQVFLGQSGGLDTDGNELPRLVYVSREKRPGFQHHKKAGAMNALVRVSAVLTNGPFLI 970
Score = 189 bits (481), Expect(3) = e-108
Identities = 82/129 (63%), Positives = 112/129 (86%)
Frame = +3
Query: 194 MNDETRQPLSRKVAIPSGRLSPYRMMVVTRLLLLLLFIQYRIFHPVPDAIGLWLISVVCE 253
+NDE RQPLSRKV+IPS R++PYR+++V RL++L +F+ YR+ +PV +A LWL+SV+CE
Sbjct: 45 LNDEARQPLSRKVSIPSSRINPYRLVIVLRLVVLCIFLHYRLTNPVRNAYALWLVSVICE 224
Query: 254 IWLTLSWIVDQLPKWFPIDRETYLDRLSIRFEPENKPNMLSPVDIFVTTVDPIKEPPLVT 313
IW +SWI+DQ PKW P++RETYLDRL++R++ E +P+ L+ VDIFV+TVDP+KEPPLVT
Sbjct: 225 IWFAISWILDQFPKWLPVNRETYLDRLALRYDREGEPSQLAAVDIFVSTVDPLKEPPLVT 404
Query: 314 ANTVLSILA 322
ANTVLSIL+
Sbjct: 405 ANTVLSILS 431
Score = 21.2 bits (43), Expect(3) = e-108
Identities = 7/8 (87%), Positives = 8/8 (99%)
Frame = +3
Query: 495 PFVLNLDC 502
PF+LNLDC
Sbjct: 960 PFLLNLDC 983
>TC80467 homologue to GP|18875454|gb|AAK11588.2 cellulose synthase CesA-1
{Zinnia elegans}, partial (26%)
Length = 810
Score = 306 bits (784), Expect(2) = e-105
Identities = 142/202 (70%), Positives = 171/202 (84%), Gaps = 2/202 (0%)
Frame = +3
Query: 270 PIDRETYLDRLSIRFEPENKPNMLSPVDIFVTTVDPIKEPPLVTANTVLSILALDYPAHK 329
PI+R + DRLS RFE E +P L+PVD FV+TVDP+KEPPL+TANTVLSILA+DYP K
Sbjct: 6 PINRVAFTDRLSARFEREGEPCQLAPVDFFVSTVDPLKEPPLITANTVLSILAVDYPVEK 185
Query: 330 ISCYVSDDGASMLTFEALQETAEFARKWVPFCKKFSAEPRAPERYFSQKIDFLKDTLQST 389
+SCYVSDDGA+MLTFE+L ETA+FARKWVPFCKKF EPRAPE YFSQKID+LKD +Q +
Sbjct: 186 VSCYVSDDGAAMLTFESLVETADFARKWVPFCKKFEIEPRAPEFYFSQKIDYLKDKVQPS 365
Query: 390 YVKERRTMKREYEEFKVRINALVAKSLRVPPEGWTLKDETPWPGNNTKDHPSMIQILLGH 449
+VKERR MKREYEE+KVR+NALVAK+ + P EGWT++D T WPGNN++DHP MIQ+ LGH
Sbjct: 366 FVKERRAMKREYEEYKVRVNALVAKAQKTPDEGWTMQDGTSWPGNNSRDHPGMIQVFLGH 545
Query: 450 SEGH--EGNELPCLIYISREKR 469
S H EGNELP L+Y+SR ++
Sbjct: 546 SGAHDIEGNELPRLVYVSRREK 611
Score = 95.1 bits (235), Expect(2) = e-105
Identities = 42/52 (80%), Positives = 48/52 (91%)
Frame = +1
Query: 467 EKRPAFQHHSKAGAMNALLRVSAVLSNAPFVLNLDCNHYVNNSKVVREAMCF 518
EKRP +QHH KAGA NAL+RVSAVL+NAP++LNLDC+HYVNNSK VREAMCF
Sbjct: 604 EKRPGYQHHKKAGAENALVRVSAVLTNAPYILNLDCDHYVNNSKAVREAMCF 759
>TC87399 similar to GP|17381168|gb|AAL36396.1 putative cellulose synthase
catalytic subunit {Arabidopsis thaliana}, partial (53%)
Length = 1829
Score = 355 bits (910), Expect = 6e-98
Identities = 207/563 (36%), Positives = 311/563 (54%), Gaps = 9/563 (1%)
Frame = +3
Query: 245 LWLISVVCEIWLTLSWIVDQLPKWFPIDRETYLDRLSIRFEPENKPNMLSPVDIFVTTVD 304
+W + E+W W + Q +W + R+ + DRLS R+E +ML VDIFV T D
Sbjct: 216 VWFGMLAAELWFGFYWFLTQAFRWNLVFRQPFKDRLSQRYE-----HMLPEVDIFVCTAD 380
Query: 305 PIKEPPLVTANTVLSILALDYPAHKISCYVSDDGASMLTFEALQETAEFARKWVPFCKKF 364
P EPP++ NTVLS++A DYP+ K+S Y+SDDG S +TF AL E A FA+ W+PFCK+F
Sbjct: 381 PEIEPPMMVINTVLSVMAFDYPSEKLSVYLSDDGGSEITFYALLEAATFAKHWLPFCKRF 560
Query: 365 SAEPRAPERYFSQKIDFLKDTLQSTYVKERRTMKREYEEFKVRINALVAKSLRVPPEGWT 424
EPR+P YF+ +KDT E +K+ Y E + RI A L+ P+
Sbjct: 561 KVEPRSPAAYFNG----IKDT---NIANELVAIKKLYNEMEKRIED--ATKLKRGPQEAR 713
Query: 425 LKDE--TPWPGNNTK-DHPSMIQILLGHSEGHEGNE------LPCLIYISREKRPAFQHH 475
LK + + W ++K DH +++QILL H + H+ ++ LP L+Y++REKRP + H+
Sbjct: 714 LKHKGFSQWDSYSSKRDHDTILQILL-HKKDHDNSKDVHGFMLPTLVYLAREKRPQYHHN 890
Query: 476 SKAGAMNALLRVSAVLSNAPFVLNLDCNHYVNNSKVVREAMCFFMDIQFGNSIGFVQFPL 535
KAGAMN+LLRVS+++SN +LN+DC+ Y NNS+ +R+++C+FMD + G+ I FVQ P
Sbjct: 891 YKAGAMNSLLRVSSIISNGKVILNVDCDMYSNNSESIRDSLCYFMDEEKGHEIAFVQSPQ 1070
Query: 536 RFDSLDRNDRYANKNTVLFDINLRCQDGLQGPAYIGSACIFRRKALNGFDPPKASKRQRE 595
F+++ +ND YA+ + ++ DG GP YIG+ C +R++L G
Sbjct: 1071AFENVTKNDLYASALLAIAEVEFHGADGCGGPLYIGTGCFHKRESLCGM----------- 1217
Query: 596 VQVHSKQDESGEDGSIKEATDEDKQLLKSHMNVENKFGNSTLFMNSSLTEEGGVDPSSSQ 655
+ +DE + KS N+ TEE +
Sbjct: 1218-----------------KFSDEYRHNWKSEDNLS--------------TEE-------TL 1283
Query: 656 EALLKEAIHVLSCRYEDRTLWGYEVGLSYGSIAADVLTSLKLHSRGWRSVYCMPKRAAFR 715
L +++ + SC YE+ T WG E+GL YG DV+T L + S GW+SVY P R AF
Sbjct: 1284HELEEKSKGLASCSYEENTQWGKEMGLKYGCPVEDVITGLSIQSNGWKSVYYNPARKAFL 1463
Query: 716 GTAPINLTERLNQVLRWAVGSLEILFSRHCPIWYGFKEGRLKGLQRIAYINSTVYPFSSI 775
G AP +L + L Q RW+ G +ILFS++ P WY F G++ ++ Y ++ + +
Sbjct: 1464GVAPTSLLQVLIQHKRWSEGDFQILFSKYSPAWYAF--GKINLSLQMGYCAYCLWAPNCL 1637
Query: 776 PLLIYCLIPAICLLTDKFITPSV 798
L Y +IP++ LL + P V
Sbjct: 1638ATLFYSIIPSLYLLKGIPLFPKV 1706
>TC85440 homologue to GP|6934298|gb|AAF31705.1| heat-shock protein 80
{Euphorbia esula}, partial (53%)
Length = 3855
Score = 349 bits (896), Expect = 3e-96
Identities = 225/747 (30%), Positives = 372/747 (49%), Gaps = 13/747 (1%)
Frame = -1
Query: 217 RMMVVTRLLLLLLFIQYRIFHPVPDAIGLWLISVVCEIWLTLSWIVDQLPKWFPIDRETY 276
R+ + + +L YRI + + W++ + E+ L++ W +Q +W P+ R
Sbjct: 3210 RLHIFFHFICVLFLFYYRINNFIISYP--WILMTLAELILSVLWFFNQAYRWRPVSRSVM 3037
Query: 277 LDRLSIRFEPENKPNMLSPVDIFVTTVDPIKEPPLVTANTVLSILALDYPAHKISCYVSD 336
+++L L +DIFV T+DP KEP + NTV+S +A+DYP++K+S Y+SD
Sbjct: 3036 VEKLPA-------DEKLPGLDIFVCTIDPEKEPTVEVMNTVVSAIAMDYPSNKLSIYLSD 2878
Query: 337 DGASMLTFEALQETAEFARKWVPFCKKFSAEPRAPERYFSQKIDFLKDTLQSTYVKERRT 396
DGAS +T ++E +FA+ WVPFCKK+ + R P+ +FS + + ER
Sbjct: 2877 DGASAITLFGIKEATQFAKVWVPFCKKYGVKSRCPKVFFSPMAEDEHVLRTQEFEAERDQ 2698
Query: 397 MKREYEEFKVRINALVA--KSLRVPPEGWTLKDETPWPGNNTKDHPSMIQILLGHSEGHE 454
+K +YE+ + I + K+LR+ D PS I+I+ +E
Sbjct: 2697 IKVKYEKMEKNIEKFGSDPKNLRM-----------------VTDRPSRIEII------NE 2587
Query: 455 GNELPCLIYISREKRPAFQHHSKAGAMNALLRVSAVLSNAPFVLNLDCNHYVNNSKVVRE 514
E+P ++Y+SRE+RP+ H K GA+N LLRVS ++SN P+VL +DC+ Y N+ ++
Sbjct: 2586 EPEIPRVVYVSRERRPSLPHKFKGGALNTLLRVSGLISNGPYVLAVDCDMYCNDPSSAKQ 2407
Query: 515 AMCFFMDIQFGNSIGFVQFPLRFDSLDRNDRYANKNTVLFDINLRCQDGLQGPAYIGSAC 574
AMCFF+D + I FVQFP F +L + D Y N++ F + DGL+GP G+
Sbjct: 2406 AMCFFLDPETSKYIAFVQFPQMFHNLSKKDIYDNQSRTAFKTMWQGMDGLRGPGLSGTGN 2227
Query: 575 IFRRKALNGFDPPKASKRQREVQVHSKQDESGEDGSIKEATDEDKQLLKSHMNVENKFGN 634
R AL P +D LL + +N FG
Sbjct: 2226 YLNRSALLFGSP----------------------------VQKDDYLL----DAQNYFGK 2143
Query: 635 STLFMNS--SLTEEGGVDPSSSQEALLKEAIHVLSCRYEDRTLWGYEVGLSYGSIAADVL 692
ST ++ S ++ + + + S+E +L+EA V S YE T WG E+G SYG + +
Sbjct: 2142 STTYIESLKAIRGQQTIKKNLSKEEILREAQVVASSSYESNTKWGTEIGFSYGILLESTI 1963
Query: 693 TSLKLHSRGWRSVYCMPKRAAFRGTAPINLTERLNQVLRWAVGSLEILFSRHCPIWYGFK 752
T LHSRGW+S Y PK F G AP ++ E + Q+++W S++ P YGF
Sbjct: 1962 TGYLLHSRGWKSAYLYPKTPCFLGCAPTDIKEGMLQLVKWLSELCLFAVSKYSPFTYGF- 1786
Query: 753 EGRLKGLQRIAYINSTVYPFSSIPLLIYCLIPAICLLTDKFITPSVDTFASMIIISLFIS 812
R+ + Y ++ +I ++Y ++P +C L + P V + L+++
Sbjct: 1785 -SRMSAIHNFTYCFMSISSIYAIGFILYGIVPQVCFLKGIPVFPKVTDPWFAVFAFLYVA 1609
Query: 813 IFGSAILELRWSGVSLEEWWRSQQFWVIGSVSAHLFAVAQALMG--GLAKKVNKNFSVVS 870
++E+ S+ WW Q+ W++ SV++ LFA+ +A+ GL K K F++ +
Sbjct: 1608 TQIQHLIEVISGDGSVSMWWDEQRIWILKSVTS-LFAMTEAVKKWFGLNK---KKFNLSN 1441
Query: 871 KAPD-DEEFHELY-----TIRWTALLVPP-TTIIIINLIGVVAGFTDAINSGAHSYGALL 923
KA D D+E + Y + AL + P ++I+N I G N+ +
Sbjct: 1440 KAIDTDKEKIKKYEQGRFDFQGAALYMSPMVVLLIVNTICFFGGLWRLFNT--RDIEDMF 1267
Query: 924 GKLFFSLWVIAHLYPFLKGLMGRQNRT 950
G+LF +V+A YP +G++ ++++
Sbjct: 1266 GQLFLVSYVMALSYPIFEGIITMKSKS 1186
>TC77822 weakly similar to GP|21536834|gb|AAM61166.1 unknown {Arabidopsis
thaliana}, partial (33%)
Length = 2391
Score = 348 bits (893), Expect = 6e-96
Identities = 234/770 (30%), Positives = 385/770 (49%), Gaps = 25/770 (3%)
Frame = +2
Query: 217 RMMVVTRLLLLLLFIQYRIFHPVPDA------IGLWLISVVCEIWLTLSWIVDQLPKWFP 270
R+ + + L + YR+ D+ + +L+ EI L+ WI DQ +W P
Sbjct: 74 RLHTILHSIALCFLVYYRLCFFFQDSKTRETPLLPYLLVFSSEIVLSFIWIFDQAFRWNP 253
Query: 271 IDRETYLDRLSIRFEPENKPNMLSPVDIFVTTVDPIKEPPLVTANTVLSILALDYPAHKI 330
I R + +RL PEN + L +D+F+ T DP KEP L NTVLS +A+DYP K+
Sbjct: 254 IKRTVFPERL-----PEN--DKLPNIDVFICTADPTKEPTLDVMNTVLSAMAMDYPPEKL 412
Query: 331 SCYVSDDGASMLTFEALQETAEFARKWVPFCKKFSAEPRAPERYFSQKIDFLKDTLQST- 389
YVSDDG S +T ++E +FA+ W+PFC ++ R PE YFS + D ++
Sbjct: 413 HVYVSDDGGSPITLNGMKEAWKFAKWWIPFCTRYRISCRCPEAYFSDSQNDGDDFSENVE 592
Query: 390 YVKERRTMKREYEEFKVRINALVAKSLRVPPEGWTLKDETPWPGNNTKDHPSMIQILLGH 449
++ ++R +K +YE FK I +RV +D+ G ++HPS I+++ +
Sbjct: 593 FIADKRMIKEKYEAFKEGI-------MRVK------EDQNHTTGITGQNHPSTIEVIQEN 733
Query: 450 SEGH-EGNELPCLIYISREKRPAFQHHSKAGAMNALLRVSAVLSNAPFVLNLDCNHYVNN 508
G E +LP L+Y+SREK+P+ HH KAGA+N L RVSAV+SN+P++L LDC+ +
Sbjct: 734 CSGEIEQVKLPLLVYVSREKKPSHPHHFKAGALNVLYRVSAVISNSPYLLVLDCDMFCGE 913
Query: 509 SKVVREAMCFFMDIQFGNSIGFVQFPLRFDSLDRNDRYANKNTVLFDINLRCQDGLQGPA 568
R+AMCF +D + S+ FVQFP +F ++ +ND Y +++ + + + DG+ GP
Sbjct: 914 PASARQAMCFHLDPKKSPSLAFVQFPQKFHNISKNDIYDSQHRSTYTVLWQGMDGITGPL 1093
Query: 569 YIGSACIFRRKALNGFDPPKASKRQREVQVHSKQDESGEDGSIKEATDEDKQLLKSHMNV 628
G+ +R+AL G + IK+ + L+ ++
Sbjct: 1094LSGTGFYMKREALYG------------------------NYKIKDTDFK----LQEYVGT 1189
Query: 629 ENKFGNSTLFMNSS--LTEEGGVDPSSSQEALLKEAIHVLSCRYEDRTLWGYEVGLSYGS 686
N+F S L N S + +G P +KE + + SC YE T WG EVG YG+
Sbjct: 1190SNEFIKS-LKQNCSPNIVTDGNALP-------IKETLLLTSCNYEIGTKWGKEVGFMYGT 1345
Query: 687 IAADVLTSLKLHSRGWRSVYCMPKRAAFRGTAPINLTERLNQVLRWAVGSLEILFSRHCP 746
+ DV TS+ L GW SVYC P + F G + NL + Q RW+ G LE ++ CP
Sbjct: 1346VCEDVHTSIMLSCNGWNSVYCDPPKPQFLGNSATNLNDLFIQGTRWSSGLLESGLTKVCP 1525
Query: 747 IWYGFKEGRLKGLQRIAYINSTVYPFSSIPLLIYCLIPAICLLTDKFITPSVDTFASMII 806
+ R+ L R T +P +P + ++P ICLL+ + P V I
Sbjct: 1526LIN--CPLRMSLLLRFCLTYITCFPLHCLPFWCFAIVPQICLLSGVSLYPKVSEPFFFIY 1699
Query: 807 ISLFISIFGSAILELRWSGVSLEEWWRSQQFWVIGSVSAHLFAVAQALMGGLAKKVNKNF 866
+++S + E +G + Q+ ++ S++ HL+ + LM + +F
Sbjct: 1700AFIYLSAQTKHLFEALSTGGTFRTMIIEQRMRMMRSITCHLYGLLDCLMKEFGLR-EASF 1876
Query: 867 SVVSKAPDDEEF----HELYTIRW-TALLVPPTTIIIINLIGVVAGFTDAINSGAHSYGA 921
+K D+E+ + Y R LVP +I+IN+ + G ++ G
Sbjct: 1877MPTNKVKDEEQTMLYQMDKYDFRIPNMFLVPMVALIMINISCFIGGIYRVLSLGE----- 2041
Query: 922 LLGKLFFSLWVIAHL----YPFLKGLMGRQNR---TPTLIV---IWSVLL 961
L K+F ++++AH+ YP ++G++ R+++ +P+++V +W+ +L
Sbjct: 2042-LDKMFIQIYLMAHIILVNYPIIEGIVIRKDKGRISPSVVVTSNVWATIL 2188
>TC82247 similar to PIR|T10797|T10797 cellulose synthase (EC 2.4.1.-)
catalytic chain celA1 - upland cotton, partial (25%)
Length = 965
Score = 323 bits (829), Expect = 2e-88
Identities = 152/242 (62%), Positives = 195/242 (79%)
Frame = +1
Query: 749 YGFKEGRLKGLQRIAYINSTVYPFSSIPLLIYCLIPAICLLTDKFITPSVDTFASMIIIS 808
YGF GRLK LQR+AYIN+ VYPF+S+PL+ YC +PAICLLT KFI P++ AS + +
Sbjct: 1 YGFAGGRLKLLQRLAYINTIVYPFTSLPLVAYCTLPAICLLTGKFIIPTLSNIASALFLG 180
Query: 809 LFISIFGSAILELRWSGVSLEEWWRSQQFWVIGSVSAHLFAVAQALMGGLAKKVNKNFSV 868
LFISI +++LELRWSGV++E+ WR++QFWVIG VSAHLFAV Q + LA V+ NF+V
Sbjct: 181 LFISIILTSVLELRWSGVTIEDLWRNEQFWVIGGVSAHLFAVFQGFLKMLAG-VDTNFTV 357
Query: 869 VSKAPDDEEFHELYTIRWTALLVPPTTIIIINLIGVVAGFTDAINSGAHSYGALLGKLFF 928
+KA DD EF ELY I+WT LL+PPT++IIINL+GVVAGF+DA+N G S+G L+GK+FF
Sbjct: 358 TAKAADDAEFGELYMIKWTTLLIPPTSLIIINLVGVVAGFSDALNGGYESWGPLIGKVFF 537
Query: 929 SLWVIAHLYPFLKGLMGRQNRTPTLIVIWSVLLASIFSLVWVRLDPFVLKTKGPDVKQCG 988
+ WVI HLYPFLKGLMGRQNRTPT++++WSVLLAS+FSL+WV+++PFV K + Q
Sbjct: 538 AFWVIFHLYPFLKGLMGRQNRTPTIVILWSVLLASVFSLIWVKINPFVSKVDSSAISQTC 717
Query: 989 IS 990
IS
Sbjct: 718 IS 723
>BG644386 similar to PIR|T05351|T0 cellulose synthase (EC 2.4.1.-) catalytic
chain RSW1 - Arabidopsis thaliana, partial (19%)
Length = 658
Score = 307 bits (786), Expect = 1e-83
Identities = 137/218 (62%), Positives = 182/218 (82%)
Frame = +1
Query: 210 SGRLSPYRMMVVTRLLLLLLFIQYRIFHPVPDAIGLWLISVVCEIWLTLSWIVDQLPKWF 269
S L+PYR++++ RL++L F+QYR+ HPV DA LWL+SV+CE+W LSW++DQ PKW
Sbjct: 4 SSHLTPYRVVIILRLIILGFFMQYRLTHPVNDAYPLWLVSVICEVWFALSWLLDQFPKWS 183
Query: 270 PIDRETYLDRLSIRFEPENKPNMLSPVDIFVTTVDPIKEPPLVTANTVLSILALDYPAHK 329
P++RET+LDRL++R + E +P+ L+PVD+FV+TVDP+KEPPL+TANTVLSILA+DYP K
Sbjct: 184 PVNRETFLDRLALRHDREGEPSQLAPVDVFVSTVDPLKEPPLITANTVLSILAVDYPVDK 363
Query: 330 ISCYVSDDGASMLTFEALQETAEFARKWVPFCKKFSAEPRAPERYFSQKIDFLKDTLQST 389
+SCYVSDDG++MLTFEAL ETAEFAR+WVPFCKKFS EPRAPE YF+QKID+LKD +Q +
Sbjct: 364 VSCYVSDDGSAMLTFEALSETAEFARRWVPFCKKFSIEPRAPEFYFAQKIDYLKDKVQPS 543
Query: 390 YVKERRTMKREYEEFKVRINALVAKSLRVPPEGWTLKD 427
+VKER+ MKREYEEFK ++ L+ + + WT++D
Sbjct: 544 FVKERKAMKREYEEFKY-VSMLLLQKHKNA*RSWTMQD 654
>TC86229 homologue to GP|6446577|gb|AAD39534.2| cellulose synthase catalytic
subunit {Gossypium hirsutum}, partial (22%)
Length = 739
Score = 282 bits (721), Expect(2) = 6e-82
Identities = 128/192 (66%), Positives = 159/192 (82%), Gaps = 2/192 (1%)
Frame = +1
Query: 375 FSQKIDFLKDTLQSTYVKERRTMKREYEEFKVRINALVAKSLRVPPEGWTLKDETPWPGN 434
F +K+ LKD +Q+++VK+RR MKREYEEFK+R+ LVAK+++VP EGW ++D TPWPGN
Sbjct: 19 FQRKLTTLKDKVQASFVKDRRAMKREYEEFKIRVXGLVAKAVKVPEEGWVMQDGTPWPGN 198
Query: 435 NTKDHPSMIQILLGHSEG--HEGNELPCLIYISREKRPAFQHHSKAGAMNALLRVSAVLS 492
NT+DHP MIQ+ LG S G +GNELP L+Y+SREKRP FQHH KAGAMNAL+RVSAVL+
Sbjct: 199 NTRDHPGMIQVFLGQSGGLDTDGNELPRLVYVSREKRPGFQHHKKAGAMNALVRVSAVLT 378
Query: 493 NAPFVLNLDCNHYVNNSKVVREAMCFFMDIQFGNSIGFVQFPLRFDSLDRNDRYANKNTV 552
N PF+LNLDC+HY+NNSK +REAMCF MD G ++ +VQFP RFD +DRNDRYAN+NTV
Sbjct: 379 NGPFLLNLDCDHYINNSKALREAMCFMMDPNLGKNVCYVQFPQRFDGIDRNDRYANRNTV 558
Query: 553 LFDINLRCQDGL 564
FDINLR DG+
Sbjct: 559 FFDINLRGLDGI 594
Score = 42.0 bits (97), Expect(2) = 6e-82
Identities = 17/36 (47%), Positives = 23/36 (63%)
Frame = +2
Query: 565 QGPAYIGSACIFRRKALNGFDPPKASKRQREVQVHS 600
QGP Y+G+ C+F R AL G+DPP K ++ V S
Sbjct: 596 QGPVYVGTGCVFNRTALYGYDPPIKPKHKKPSLVSS 703
>TC90826 homologue to GP|6446577|gb|AAD39534.2| cellulose synthase catalytic
subunit {Gossypium hirsutum}, partial (20%)
Length = 702
Score = 271 bits (692), Expect = 1e-72
Identities = 122/202 (60%), Positives = 159/202 (78%), Gaps = 1/202 (0%)
Frame = +2
Query: 649 VDPSSSQEALLKEAIHVLSCRYEDRTLWGYEVGLSYGSIAADVLTSLKLHSRGWRSVYCM 708
V S++ E LLKEAIHV+SC YED++ WG E+G YGS+ D+LT K+H+RGWRS+YCM
Sbjct: 95 VPQSATPETLLKEAIHVISCGYEDKSEWGTEIGWIYGSVTEDILTGFKMHARGWRSIYCM 274
Query: 709 PKRAAFRGTAPINLTERLNQVLRWAVGSLEILFSRHCPIWYGFKEGRLKGLQRIAYI-NS 767
PK AAF+G+APINL++RLNQVLRWA+GS+EIL SRHCPIWYG+ GRLK L +
Sbjct: 275 PKLAAFKGSAPINLSDRLNQVLRWALGSVEILLSRHCPIWYGY-SGRLKWL*EVLRT*TP 451
Query: 768 TVYPFSSIPLLIYCLIPAICLLTDKFITPSVDTFASMIIISLFISIFGSAILELRWSGVS 827
+YP +SIPLL+YC +PA+CLLT+KFI P + AS+ +ISLF+SIF + IL ++WSGV
Sbjct: 452 PIYPITSIPLLMYCTLPAVCLLTNKFIIPQIXNIASIWVISLFLSIFATGILXMKWSGVG 631
Query: 828 LEEWWRSQQFWVIGSVSAHLFA 849
++EWW+++QFW IG S HLFA
Sbjct: 632 IDEWWKNEQFWXIGGGSXHLFA 697
>TC89444 similar to GP|12619788|gb|AAG60543.1 cellulose synthase-like CSLD3
{Arabidopsis thaliana}, partial (24%)
Length = 966
Score = 233 bits (594), Expect(2) = 1e-66
Identities = 133/302 (44%), Positives = 177/302 (58%), Gaps = 45/302 (14%)
Frame = +2
Query: 429 TPWPGNNTKDHPSMIQILL------------GHSEGHEGNE----LPCLIYISREKRPAF 472
TP P ++ DH S+IQ++L S G E LP L+Y+SREKRP +
Sbjct: 5 TPAPEHSRGDHSSIIQVMLKPPSDEPLTGPESDSNGMNLTEVDIRLPMLVYVSREKRPGY 184
Query: 473 QHHSKAGAMNALLRVSAVLSNAPFVLNLDCNHYVNNSKVVREAMCFFMDIQFGNSIGFVQ 532
H+ KAGAMNAL+R SAV+SN PF+LNLDC+HY+ NS+ +RE MC+ MD + G+ I +VQ
Sbjct: 185 DHNKKAGAMNALVRASAVMSNGPFILNLDCDHYIYNSEAIREGMCYMMD-RDGDKISYVQ 361
Query: 533 FPLRFDSLDRNDRYANKNTVLFDINLRCQDGLQGPAYIGSACIFRRKALNGFDPPK---- 588
FP RF+ +D +DRYAN NTV FD+N+R DG+QGP Y+G+ C+FRR AL GFDPP+
Sbjct: 362 FPQRFEGIDPSDRYANHNTVFFDVNMRALDGIQGPVYVGTGCLFRRTALYGFDPPRVQEE 541
Query: 589 -----ASKRQREVQVHSKQDESGEDGSIKEATDEDKQLLKSHMNVENKFGNSTLFMNSSL 643
SK++ V S D ED S++ D++ L S + + KFGNSTLF++S
Sbjct: 542 ATGWFGSKKKNSSTVASVPDV--EDQSLRNGGSIDEEELSSAL-IPKKFGNSTLFVDSIR 712
Query: 644 TEE-------------GGVDPSS-------SQEALLKEAIHVLSCRYEDRTLWGYEVGLS 683
E G P + A + EAI V+SC YED+T WG VG
Sbjct: 713 VAEFQGRPLADHPSIKNGRQPGALTLPRDLLDAATIAEAISVISCWYEDKTEWGDRVGWI 892
Query: 684 YG 685
YG
Sbjct: 893 YG 898
Score = 39.7 bits (91), Expect(2) = 1e-66
Identities = 14/23 (60%), Positives = 19/23 (81%)
Frame = +3
Query: 685 GSIAADVLTSLKLHSRGWRSVYC 707
GS+ DV+T ++H+RGWRSVYC
Sbjct: 897 GSVTEDVVTGYRMHNRGWRSVYC 965
>AW773751 similar to PIR|D86157|D8 hypothetical protein AAF02892.1 [imported]
- Arabidopsis thaliana, partial (16%)
Length = 600
Score = 224 bits (572), Expect = 1e-58
Identities = 110/196 (56%), Positives = 141/196 (71%), Gaps = 5/196 (2%)
Frame = +3
Query: 197 ETRQPLSRKVAIPSGRLSPYRMMVVTRLLLLLLFIQYRIFHPVPDAIGLWLISVVCEIWL 256
++R PL+RKV + + LSPYR+++V RL L LF+ +RI HP +A+ LW +SV CE+W
Sbjct: 3 KSRTPLTRKVGVSAAILSPYRLLMVMRLAALGLFLTWRILHPNHEAMWLWAMSVTCELWF 182
Query: 257 TLSWIVDQLPKWFPIDRETYLDRLSIRFEPENKPNM-----LSPVDIFVTTVDPIKEPPL 311
SW++DQLPK P++R T L L RFE N N L +D+FV+T DP KEPPL
Sbjct: 183 AFSWLLDQLPKLCPVNRVTDLSVLKERFESPNLRNPKGRSDLPGIDVFVSTADPEKEPPL 362
Query: 312 VTANTVLSILALDYPAHKISCYVSDDGASMLTFEALQETAEFARKWVPFCKKFSAEPRAP 371
VTANT+LSILA+DYP K +CY+SDDG ++LTFEAL ETA FAR WVPFC+K EPR P
Sbjct: 363 VTANTILSILAVDYPVEKDACYLSDDGGALLTFEALAETASFARFWVPFCRKHQIEPRNP 542
Query: 372 ERYFSQKIDFLKDTLQ 387
E YFSQK DFLK+ ++
Sbjct: 543 EAYFSQKRDFLKNKVR 590
>AW688489 similar to PIR|T10797|T10 cellulose synthase (EC 2.4.1.-) catalytic
chain celA1 - upland cotton, partial (17%)
Length = 577
Score = 219 bits (559), Expect = 3e-57
Identities = 98/160 (61%), Positives = 127/160 (79%)
Frame = +2
Query: 201 PLSRKVAIPSGRLSPYRMMVVTRLLLLLLFIQYRIFHPVPDAIGLWLISVVCEIWLTLSW 260
PLS + I +L+PYR +++ RL++L LF YR+ +PV A LWL S++CEIW SW
Sbjct: 98 PLSVLMPIVKSKLAPYRTVIIVRLVILGLFFHYRVTNPVESAFPLWLTSIICEIWFAFSW 277
Query: 261 IVDQLPKWFPIDRETYLDRLSIRFEPENKPNMLSPVDIFVTTVDPIKEPPLVTANTVLSI 320
++DQ PKW P++R TY++ LS RFE E +P+ L+ VD FV+TVDP+KEPPL+TANTVLSI
Sbjct: 278 VLDQFPKWSPVNRHTYIENLSARFEREGEPSGLASVDFFVSTVDPLKEPPLITANTVLSI 457
Query: 321 LALDYPAHKISCYVSDDGASMLTFEALQETAEFARKWVPF 360
LA+DYP K+SCYVSDDGA+MLTFE+L ETAEFA+KWVPF
Sbjct: 458 LAVDYPVDKVSCYVSDDGAAMLTFESLVETAEFAKKWVPF 577
>TC79459 homologue to GP|6446577|gb|AAD39534.2| cellulose synthase catalytic
subunit {Gossypium hirsutum}, partial (14%)
Length = 781
Score = 214 bits (545), Expect = 1e-55
Identities = 101/158 (63%), Positives = 127/158 (79%), Gaps = 1/158 (0%)
Frame = +1
Query: 835 QQFWVIGSVSAHLFAVAQALMGGLAKKVNKNFSVVSKAPDDE-EFHELYTIRWTALLVPP 893
+QFWVIG VSAHLFAV Q L+ LA ++ NF+V SKA D++ + ELY +WT LL+PP
Sbjct: 1 EQFWVIGGVSAHLFAVFQGLLKVLAG-IDTNFTVTSKASDEDGDSAELYMFKWTTLLIPP 177
Query: 894 TTIIIINLIGVVAGFTDAINSGAHSYGALLGKLFFSLWVIAHLYPFLKGLMGRQNRTPTL 953
TT++IINL+GVVAG + A+NSG S+G L GKLFF+ WVI HLYPFLKGLMGRQNRTPT+
Sbjct: 178 TTLLIINLVGVVAGISYAVNSGYQSWGPLFGKLFFAFWVIIHLYPFLKGLMGRQNRTPTI 357
Query: 954 IVIWSVLLASIFSLVWVRLDPFVLKTKGPDVKQCGISC 991
+V+WS+LLASIFSL+WVR+DPF + GP + CGI+C
Sbjct: 358 VVVWSILLASIFSLLWVRIDPFTTRVTGPKSEMCGINC 471
>BG452180 similar to PIR|T10800|T10 cellulose synthase (EC 2.4.1.-) catalytic
chain celA2 - upland cotton (fragment), partial (24%)
Length = 586
Score = 174 bits (442), Expect(3) = 5e-53
Identities = 77/123 (62%), Positives = 105/123 (84%)
Frame = +2
Query: 726 LNQVLRWAVGSLEILFSRHCPIWYGFKEGRLKGLQRIAYINSTVYPFSSIPLLIYCLIPA 785
L+QVL+WA+G+ +I FS +CP+WYG+ G+LK LQR+AY+N+ VYPFSSIPLL+YC IPA
Sbjct: 2 LHQVLKWALGATQIFFSGYCPLWYGYS-GKLKWLQRLAYMNAIVYPFSSIPLLVYCTIPA 178
Query: 786 ICLLTDKFITPSVDTFASMIIISLFISIFGSAILELRWSGVSLEEWWRSQQFWVIGSVSA 845
ICLLT KFI P++ + AS+ +++LFISI + +LELRWS V++++WWR++QFWVIG VSA
Sbjct: 179 ICLLTGKFILPTLTSLASIWLMTLFISIILTCVLELRWSKVNIQDWWRNEQFWVIGGVSA 358
Query: 846 HLF 848
HLF
Sbjct: 359 HLF 367
Score = 42.4 bits (98), Expect(3) = 5e-53
Identities = 20/50 (40%), Positives = 33/50 (66%)
Frame = +3
Query: 848 FAVAQALMGGLAKKVNKNFSVVSKAPDDEEFHELYTIRWTALLVPPTTII 897
FAV Q L+ + + +F+V +K+ DD F +L+ +WT LL+PPT+I+
Sbjct: 366 FAVFQGLLKVAGR--DTSFAVRTKSADDTAFGQLHLFKWTTLLIPPTSIV 509
Score = 31.2 bits (69), Expect(3) = 5e-53
Identities = 13/25 (52%), Positives = 20/25 (80%)
Frame = +1
Query: 898 IINLIGVVAGFTDAINSGAHSYGAL 922
I+N+IG+ A ++AINSG +S+G L
Sbjct: 511 ILNMIGIXAXVSEAINSGYNSWGLL 585
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.321 0.138 0.423
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,919,388
Number of Sequences: 36976
Number of extensions: 525628
Number of successful extensions: 3079
Number of sequences better than 10.0: 98
Number of HSP's better than 10.0 without gapping: 2883
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2979
length of query: 991
length of database: 9,014,727
effective HSP length: 106
effective length of query: 885
effective length of database: 5,095,271
effective search space: 4509314835
effective search space used: 4509314835
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)
Lotus: description of TM0037a.5