KMC004106A_c01

Fasta Sequence
>KMC004106A_C01 KMC004106A_c01
atattgtcacattttataacatttcatgaaagcatgttcgttaggatacattgcacatat
atatggactacatatatgatAATATGACAATTGTTTTGAATTGCATTACCAATATCATAG
TACTAACAATGTGATTGATATATCTTTGCCTAGGAAACATTAGAAACAAAGTCTACATAG
AACCAAGAATTTGATAGTTTAATTGAAGACCTTCCAAAAGAGACATCATGAGCAGATAGA
CCAGAGACAACAAAAAGCCTGGGAATAAGAAAGTTATCTCAATTCACCATCAAACGTTCC
TGATTCCAGGGATTCACTATCTGAACAAAATTCATAAGCTCAATAAACAGAGTAATATTT
AGGTCCATTCACTATTATCATCATGTTGATGACTTAAGAGGGAAAAAGAATTTTACACAA
AAAACAAAAGAAAGAAATTGTCATATGACAAAAAGAATGAGCTATCATGGGAACTGGAAT
GAACCACCTATCTCGTTGGCACCTTGAGGGGGATTGATTGCAACCCAAAGGAGTGATATT
GTGATTGCTATGAGACCTGACCACACATAAACAATGGTAGGTGTCCTCCCTCTTCTTCCC
ATCAAACCTTTAGCAAAAGGGTAGAGATGAGTCAGCACCCAAAAGCTGAAGAAAACACCA
CCTAGCAAACGGCTCCACTGAGGTATGGTGCTGTATATGGTCCTGCTCACTCCGACTGCA
ATTGCTATCAAGTTAACCATCATGATTGTGATGGGTGGTATCATGAGC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004106A_C01 KMC004106A_c01
         (768 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAO03579.1| cellulose synthase-like protein D4 [Populus tremu...   192  6e-48
ref|NP_186955.1| putative cellulose synthase catalytic subunit; ...   189  5e-47
ref|NP_197193.1| cellulose synthase catalytic subunit -like prot...   186  3e-46
dbj|BAA93027.1| Similar to Arabidopsis thaliana DNA chromosome 4...   182  6e-45
gb|AAL58185.1|AC027037_7 putative cellulose synthase [Oryza sativa]   179  3e-44

>gb|AAO03579.1| cellulose synthase-like protein D4 [Populus tremuloides]
          Length = 1104

 Score =  192 bits (487), Expect = 6e-48
 Identities = 92/100 (92%), Positives = 94/100 (94%)
 Frame = -2

Query: 767  LMIPPITIMMVNLIAIAVGVSRTIYSTIPQWSRLLGGVFFSFWVLTHLYPFAKGLMGRRG 588
            LMIPPITIMMVNLIAIAVG SRTIYS IPQWSRLLGGVFFSFWVL HLYPFAKGLMGRRG
Sbjct: 1005 LMIPPITIMMVNLIAIAVGFSRTIYSVIPQWSRLLGGVFFSFWVLAHLYPFAKGLMGRRG 1064

Query: 587  RTPTIVYVWSGLIAITISLLWVAINPPQGANEIGGSFQFP 468
            RTPTIV+VWSGLIAITISLLWVAINPP G  +IGGSFQFP
Sbjct: 1065 RTPTIVFVWSGLIAITISLLWVAINPPSGTTQIGGSFQFP 1104
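
The alignment above is reported in Frame = -2, meaning the query is reverse-complemented and read from its second base (query position 767 of 768). The following is a minimal sketch of that translation, not the BLASTX implementation; the function names are illustrative, and only the last 48 bases of the FASTA record are used as input.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_minus_frame(seq, frame):
    """Translate reverse frame -1, -2 or -3 (pass frame as 1, 2 or 3)."""
    rc = reverse_complement(seq)
    start = frame - 1  # frame -2 skips one base of the reverse complement
    codons = (rc[i:i + 3] for i in range(start, len(rc) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record (query positions 721-768).
tail = "ATTGCTATCAAGTTAACCATCATGATTGTGATGGGTGGTATCATGAGC"
peptide = translate_minus_frame(tail, 2)
print(peptide)
```

The result can be compared by eye with the first residues of the Query line starting at position 767.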

>ref|NP_186955.1| putative cellulose synthase catalytic subunit; protein id:
            At3g03050.1, supported by cDNA: gi_12619787, supported by
            cDNA: gi_13430535 [Arabidopsis thaliana]
            gi|6714431|gb|AAF26119.1|AC012328_22 putative cellulose
            synthase catalytic subunit [Arabidopsis thaliana]
            gi|12619788|gb|AAG60543.1|AF232907_1 cellulose
            synthase-like CSLD3 [Arabidopsis thaliana]
            gi|13430536|gb|AAK25890.1|AF360180_1 putative cellulose
            synthase catalytic subunit [Arabidopsis thaliana]
            gi|14532744|gb|AAK64073.1| putative cellulose synthase
            catalytic subunit [Arabidopsis thaliana]
            gi|25136916|emb|CAC82909.1| cellulose synthase-like
            protein [Arabidopsis thaliana]
          Length = 1145

 Score =  189 bits (479), Expect = 5e-47
 Identities = 89/100 (89%), Positives = 94/100 (94%)
 Frame = -2

Query: 767  LMIPPITIMMVNLIAIAVGVSRTIYSTIPQWSRLLGGVFFSFWVLTHLYPFAKGLMGRRG 588
            LMIPPITIMMVNLIAIAVG SRTIYS IPQWS+L+GGVFFSFWVL HLYPFAKGLMGRRG
Sbjct: 1046 LMIPPITIMMVNLIAIAVGFSRTIYSVIPQWSKLIGGVFFSFWVLAHLYPFAKGLMGRRG 1105

Query: 587  RTPTIVYVWSGLIAITISLLWVAINPPQGANEIGGSFQFP 468
            RTPTIVYVWSGL+AITISLLWVAINPP G+ +IGGSF FP
Sbjct: 1106 RTPTIVYVWSGLVAITISLLWVAINPPAGSTQIGGSFTFP 1145

>ref|NP_197193.1| cellulose synthase catalytic subunit -like protein; protein id:
            At5g16910.1 [Arabidopsis thaliana]
            gi|11357224|pir||T51546 cellulose synthase catalytic
            subunit-like protein - Arabidopsis thaliana
            gi|9755692|emb|CAC01704.1| cellulose synthase catalytic
            subunit-like protein [Arabidopsis thaliana]
          Length = 1145

 Score =  186 bits (472), Expect = 3e-46
 Identities = 87/100 (87%), Positives = 93/100 (93%)
 Frame = -2

Query: 767  LMIPPITIMMVNLIAIAVGVSRTIYSTIPQWSRLLGGVFFSFWVLTHLYPFAKGLMGRRG 588
            LMIPPITI+MVNLIAIAVG SRTIYS +PQWS+L+GGVFFSFWVL HLYPFAKGLMGRRG
Sbjct: 1046 LMIPPITIIMVNLIAIAVGFSRTIYSVVPQWSKLIGGVFFSFWVLAHLYPFAKGLMGRRG 1105

Query: 587  RTPTIVYVWSGLIAITISLLWVAINPPQGANEIGGSFQFP 468
            RTPTIVYVWSGL+AITISLLWVAINPP G  EIGG+F FP
Sbjct: 1106 RTPTIVYVWSGLVAITISLLWVAINPPAGNTEIGGNFSFP 1145

>dbj|BAA93027.1| Similar to Arabidopsis thaliana DNA chromosome 4, BAC clone F20D10;
            putative protein. (AL035538) [Oryza sativa (japonica
            cultivar-group)]
          Length = 1170

 Score =  182 bits (461), Expect = 6e-45
 Identities = 87/100 (87%), Positives = 92/100 (92%)
 Frame = -2

Query: 767  LMIPPITIMMVNLIAIAVGVSRTIYSTIPQWSRLLGGVFFSFWVLTHLYPFAKGLMGRRG 588
            LMIPPI IMMVNLIAIAVG SRTIYS IPQWS+LLGGVFFSFWVL HLYPFAKGLMGRRG
Sbjct: 1071 LMIPPIVIMMVNLIAIAVGFSRTIYSEIPQWSKLLGGVFFSFWVLAHLYPFAKGLMGRRG 1130

Query: 587  RTPTIVYVWSGLIAITISLLWVAINPPQGANEIGGSFQFP 468
            RTPTIV+VWSGL+AITISLLWVAINPP   ++IGGSF FP
Sbjct: 1131 RTPTIVFVWSGLLAITISLLWVAINPPSQNSQIGGSFTFP 1170

>gb|AAL58185.1|AC027037_7 putative cellulose synthase [Oryza sativa]
          Length = 1127

 Score =  179 bits (455), Expect = 3e-44
 Identities = 84/102 (82%), Positives = 94/102 (91%), Gaps = 2/102 (1%)
 Frame = -2

Query: 767  LMIPPITIMMVNLIAIAVGVSRTIYSTIPQWSRLLGGVFFSFWVLTHLYPFAKGLMGRRG 588
            LMIPP+TI+M+NL+AIAVG SRTIYSTIPQWS+LLGGVFFSFWVL HLYPFAKGLMGRRG
Sbjct: 1026 LMIPPLTIIMINLVAIAVGFSRTIYSTIPQWSKLLGGVFFSFWVLAHLYPFAKGLMGRRG 1085

Query: 587  RTPTIVYVWSGLIAITISLLWVAINPP--QGANEIGGSFQFP 468
            RTPTIVYVWSGL+AITISLLW+AI PP  Q  +++GGSF FP
Sbjct: 1086 RTPTIVYVWSGLVAITISLLWIAIKPPSAQANSQLGGSFSFP 1127

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 651,385,951
Number of Sequences: 1393205
Number of extensions: 14251178
Number of successful extensions: 31159
Number of sequences better than 10.0: 81
Number of HSP's better than 10.0 without gapping: 29862
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31135
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 37534933228
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
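
The Expect values in the hit list can be related to the bit scores and the search-space figures above through the standard relation E = (search space) x 2^(-bit score). The sketch below is an illustration under that assumption, with hypothetical function names; because the reported bit scores are rounded, recomputed values may differ slightly from the printed ones for some hits.

```python
def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length after subtracting the effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


def expect_from_bits(bit_score, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bit_score)


# Figures copied from the report footer.
eff_db = effective_db_length(448_689_247, 1_393_205, 121)
e_top = expect_from_bits(192, 37_534_933_228)

print(eff_db)           # compare with "effective length of database"
print(f"{e_top:.0e}")   # compare with the Expect value of the top hit
```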


EST assemble image


No.  Clone        Accession  Start  End
1    MRL020h12_f  BP084767       1  482
2    MRL037d12_f  BP085543     180  310
3    MR005d08_f   BP076313     262  612
4    MRL019c05_f  BP084680     265  702
5    MRL031h12_f  BP085281     265  631
6    MRL017f03_f  BP084589     396  768
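
Assuming the two position values of each clone are its start and end coordinates on the assembled contig, the table can be checked for gaps by merging the intervals. This is a small illustrative sketch; `merge_intervals` is a hypothetical helper, not part of any assembly tool.

```python
# (clone, start, end) rows copied from the table above.
clones = [
    ("MRL020h12_f", 1, 482),
    ("MRL037d12_f", 180, 310),
    ("MR005d08_f", 262, 612),
    ("MRL019c05_f", 265, 702),
    ("MRL031h12_f", 265, 631),
    ("MRL017f03_f", 396, 768),
]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(span) for span in merged]


covered = merge_intervals([(start, end) for _, start, end in clones])
print(covered)
```

A single merged interval spanning positions 1 to 768 would mean the six ESTs tile the full 768-letter contig without gaps.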




Lotus japonicus
Kazusa DNA Research Institute