KMC004770A_c01

Fasta Sequence
>KMC004770A_C01 KMC004770A_c01
AACTAAGAAGTTGCACCTTTTATCCATCATATGATTTGGCATAGACTAAGGAAAACTAAT
AAATAAAGGACCCACGAAATAAATTACTATAAAAATCTCAGCAAAACGATCTATGCAAAC
AAAACCACACTGTCTAAACAAGTTTGCTAAAGGTTATATAATTCAAGCGATTTTTGATAC
ATCTTCTTGGTTGTCCTTAAGCCTGTGTATTGCCAATTCAAATCCTAACAATAACTGCAC
ACTGTCTGTGATTTGTAGTATCTTGATCTATGATAACAGAATCTCGAAAAGAACTTCTCC
TTTCCGAAATGAAAATGACGATCTGCTCTAACTAGCTGTAATAGATTTTGGTGCAGCAAG
TTGGCAGAGGAGAAACTAGAAAACTATTAACACGAACTTAAGTAACTCGTAGATGTCGCA
GAGCTTTACACGATGGTTCCAACATAGCCAACTCCAGCAATGGTGAAGGATAAGGTCTGA
AGGAAGAGGTAGATGAAGTAGTTGTGCTTCCCATGGACATAGTCATAGCACCCGCACAAG
AACAGGAAGCCAGCGAATGCCAACTCCAGGTAATTAAGTCTCTCCAGAAATTTGGATTTG
GTTTTCTTGGTAGCTTTGACATTAGTCTTCTTAGCAGCATCTCCTGTTTTCTTATTATTA
TTGGCGACGGAATCTCCAAGTTTTTCAGTGACAACCCATTCATTGGCCCTTCCATATTCT
AGTAGACCTATGATGGTTGCTTTGGTACGGTGCAGCGACATCACATTCTCGAAAAGGATC
CAGTAGAACAGAAGATGAATGGACCTTGGTGTTCCAACTGAATTGAGGATAGTGATGACA
GAAGGGATATAAACAGCACCCCATATGGGAACATGAACCTCAGGGACCAAAATTGTGAGG
GGAATCACGACGC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004770A_C01 KMC004770A_c01
         (913 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_197666.1| glucosyltransferase-like protein; protein id: A...   227  2e-58
dbj|BAA82375.1| Similar to glycogenin glucosyltransferase (EC 2....   199  4e-50
gb|AAL25128.1|AF432499_1 cellulose synthase-like protein OsCslA9...   198  8e-50
ref|NP_195996.1| putative protein; protein id: At5g03760.1, supp...   191  1e-47
ref|NP_173762.2| hypothetical protein; protein id: At1g23480.1, ...   189  5e-47

>ref|NP_197666.1| glucosyltransferase-like protein; protein id: At5g22740.1,
           supported by cDNA: gi_16648763, supported by cDNA:
           gi_16648964, supported by cDNA: gi_20259889 [Arabidopsis
           thaliana] gi|10178248|dbj|BAB11680.1|
           glucosyltransferase-like protein [Arabidopsis thaliana]
           gi|16648764|gb|AAL25573.1| AT5g22740/MDJ22_16
           [Arabidopsis thaliana] gi|16648965|gb|AAL24334.1|
           glucosyltransferase-like protein [Arabidopsis thaliana]
           gi|20259890|gb|AAM13292.1| glucosyltransferase-like
           protein [Arabidopsis thaliana]
          Length = 534

 Score =  227 bits (579), Expect = 2e-58
 Identities = 116/163 (71%), Positives = 132/163 (80%), Gaps = 2/163 (1%)
 Frame = -3

Query: 911 VVIPLTILVPEVHVPIWGAVYIPSVITILNSVGTPRSIHLLFYWILFENVMSLHRTKATI 732
           VV+PLTILVPEV VPIWG+VYIPS+ITILNSVGTPRSIHLLFYWILFENVMSLHRTKAT+
Sbjct: 380 VVLPLTILVPEVKVPIWGSVYIPSIITILNSVGTPRSIHLLFYWILFENVMSLHRTKATL 439

Query: 731 IGLLEYGRANEWVVTEKLGDSVANNNKKTGDAAKKTNVKATKKTKS--KFLERLNYLELA 558
           IGL E GRANEWVVT KLG         +G +A K N K  K+     K  +RLN LEL 
Sbjct: 440 IGLFEAGRANEWVVTAKLG---------SGQSA-KGNTKGIKRFPRIFKLPDRLNTLELG 489

Query: 557 FAGFLFLCGCYDYVHGKHNYFIYLFLQTLSFTIAGVGYVGTIV 429
           FA FLF+CGCYD+VHGK+NYFIYLFLQT+SF I+G+G++GT V
Sbjct: 490 FAAFLFVCGCYDFVHGKNNYFIYLFLQTMSFFISGLGWIGTYV 532
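
The "Frame = -3" line and the descending query coordinates (911 down to 429) indicate that the aligned protein is read from the reverse complement of the query, starting at base 911. The sketch below is a minimal illustration, assuming the standard genetic code and using only the last 73 bases of the contig (positions 841-913, copied from the Fasta Sequence section). The function names are hypothetical and are not part of BLAST.

```python
# Last 73 bases of KMC004770A_c01 (positions 841-913 of the 913-letter query).
TAIL = (
    "GAAGGGATATAAACAGCACCCCATATGGGAACATGAACCTCAGGGACCAAAATTGTGAGG"
    "GGAATCACGACGC"
)
QUERY_LENGTH = 913

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def minus_frame(query_length, highest_coord):
    """Minus-strand frame implied by the highest query coordinate of a hit."""
    return -((query_length - highest_coord) % 3 + 1)


def translate_minus_frame(seq, frame):
    """Translate the reverse complement of seq in frame -1, -2 or -3.

    seq must end at the 3' end of the query so that the frame offset,
    counted from the last base, is the same as for the full sequence.
    """
    rc = reverse_complement(seq)
    offset = -frame - 1
    codons = (rc[i:i + 3] for i in range(offset, len(rc) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


def aligned_residues(high_coord, low_coord):
    """Number of query residues spanned by a translated alignment."""
    return (high_coord - low_coord + 1) // 3


if __name__ == "__main__":
    frame = minus_frame(QUERY_LENGTH, 911)
    print(frame)
    print(translate_minus_frame(TAIL, frame))
    print(aligned_residues(911, 429))
```

The translated tail should begin with VVIPLTILVPEV, matching the first Query line of the alignment above. The span 911-429 covers 161 query residues, which together with the 2 gap positions in the query gives the 163 alignment columns reported in the Identities line.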

>dbj|BAA82375.1| Similar to glycogenin glucosyltransferase (EC 2.4.1.186). (Z97341)
           [Oryza sativa (japonica cultivar-group)]
          Length = 355

 Score =  199 bits (507), Expect = 4e-50
 Identities = 98/161 (60%), Positives = 119/161 (73%)
 Frame = -3

Query: 911 VVIPLTILVPEVHVPIWGAVYIPSVITILNSVGTPRSIHLLFYWILFENVMSLHRTKATI 732
           ++IP TI VPEV +P WG VYIP++IT+LNSVGTPRS HLLF+WILFENVMSLHRTKAT+
Sbjct: 201 LIIPATIFVPEVRIPKWGCVYIPTIITLLNSVGTPRSFHLLFFWILFENVMSLHRTKATL 260

Query: 731 IGLLEYGRANEWVVTEKLGDSVANNNKKTGDAAKKTNVKATKKTKSKFLERLNYLELAFA 552
           IGLLE GRANEWVVTEKLG+++           K ++  + KK+  +  +RLN  EL  A
Sbjct: 261 IGLLEAGRANEWVVTEKLGNAL---------KMKSSSKSSAKKSFMRVWDRLNVTELGVA 311

Query: 551 GFLFLCGCYDYVHGKHNYFIYLFLQTLSFTIAGVGYVGTIV 429
            FLF CG YD   GK ++FIYLF Q  +F I G+GYVGTIV
Sbjct: 312 AFLFSCGWYDLAFGKDHFFIYLFFQGAAFFIVGIGYVGTIV 352

>gb|AAL25128.1|AF432499_1 cellulose synthase-like protein OsCslA9 [Oryza sativa]
          Length = 527

 Score =  198 bits (504), Expect = 8e-50
 Identities = 93/161 (57%), Positives = 122/161 (75%)
 Frame = -3

Query: 911 VVIPLTILVPEVHVPIWGAVYIPSVITILNSVGTPRSIHLLFYWILFENVMSLHRTKATI 732
           +V+P T+L+PEV +P WG VY+PS++TILNS+GTPRS+HLL +W+LFENVMSLHRTKAT+
Sbjct: 375 LVVPATVLIPEVEIPRWGYVYLPSIVTILNSIGTPRSLHLLIFWVLFENVMSLHRTKATL 434

Query: 731 IGLLEYGRANEWVVTEKLGDSVANNNKKTGDAAKKTNVKATKKTKSKFLERLNYLELAFA 552
           IGLLE GR NEWVVTEKLGD++            K   KA ++ + +  +R+N LEL F+
Sbjct: 435 IGLLETGRVNEWVVTEKLGDAL----------KLKLPGKAFRRPRMRIGDRVNALELGFS 484

Query: 551 GFLFLCGCYDYVHGKHNYFIYLFLQTLSFTIAGVGYVGTIV 429
            +L  CGCYD  +GK  Y ++LFLQ+++F I GVGYVGTIV
Sbjct: 485 AYLSFCGCYDIAYGKGYYSLFLFLQSITFFIIGVGYVGTIV 525

>ref|NP_195996.1| putative protein; protein id: At5g03760.1, supported by cDNA:
           gi_16974551 [Arabidopsis thaliana]
           gi|11357494|pir||T48403 hypothetical protein F17C15.180
           - Arabidopsis thaliana gi|7340661|emb|CAB82941.1|
           putative protein [Arabidopsis thaliana]
           gi|9758004|dbj|BAB08601.1| glucosyltransferase-like
           protein [Arabidopsis thaliana]
           gi|16974552|gb|AAL31192.1| AT5g03760/F17C15_180
           [Arabidopsis thaliana] gi|23506155|gb|AAN31089.1|
           At5g03760/F17C15_180 [Arabidopsis thaliana]
          Length = 533

 Score =  191 bits (485), Expect = 1e-47
 Identities = 94/161 (58%), Positives = 118/161 (72%)
 Frame = -3

Query: 911 VVIPLTILVPEVHVPIWGAVYIPSVITILNSVGTPRSIHLLFYWILFENVMSLHRTKATI 732
           V++P T+LVPEV VP WGAVYIPSVIT+LN+VGTPRS+HL+ +WILFENVMSLHRTKAT 
Sbjct: 380 VILPATVLVPEVTVPKWGAVYIPSVITLLNAVGTPRSLHLMVFWILFENVMSLHRTKATF 439

Query: 731 IGLLEYGRANEWVVTEKLGDSVANNNKKTGDAAKKTNVKATKKTKSKFLERLNYLELAFA 552
           IGLLE GR NEW+VTEKLGD  A +  KT          + K  + +F +R++ LEL   
Sbjct: 440 IGLLEGGRVNEWIVTEKLGDVKAKSATKT----------SKKVIRFRFGDRIHVLELGVG 489

Query: 551 GFLFLCGCYDYVHGKHNYFIYLFLQTLSFTIAGVGYVGTIV 429
            +L   GCYD   GK++Y++YLF Q ++F IAG G +GTIV
Sbjct: 490 MYLLFVGCYDAFFGKNHYYLYLFAQAIAFFIAGFGQIGTIV 530

>ref|NP_173762.2| hypothetical protein; protein id: At1g23480.1, supported by cDNA:
           gi_20466605 [Arabidopsis thaliana]
           gi|8778578|gb|AAF79586.1|AC007945_6 F28C11.11
           [Arabidopsis thaliana] gi|20466606|gb|AAM20620.1|
           unknown protein [Arabidopsis thaliana]
           gi|23197990|gb|AAN15522.1| unknown protein [Arabidopsis
           thaliana]
          Length = 556

 Score =  189 bits (480), Expect = 5e-47
 Identities = 88/161 (54%), Positives = 117/161 (72%)
 Frame = -3

Query: 911 VVIPLTILVPEVHVPIWGAVYIPSVITILNSVGTPRSIHLLFYWILFENVMSLHRTKATI 732
           +++P T+L PE+ VP W  VY P+ ITILN++ TPRS+HLL +WILFENVMS+HRTKAT 
Sbjct: 403 LILPTTVLFPELQVPKWATVYFPTTITILNAIATPRSLHLLVFWILFENVMSMHRTKATF 462

Query: 731 IGLLEYGRANEWVVTEKLGDSVANNNKKTGDAAKKTNVKATKKTKSKFLERLNYLELAFA 552
           IGLLE GR NEWVVTEKLGD++ +          K   KAT K  ++F +RLN+ EL   
Sbjct: 463 IGLLEAGRVNEWVVTEKLGDTLKS----------KLIGKATTKLYTRFGQRLNWRELVVG 512

Query: 551 GFLFLCGCYDYVHGKHNYFIYLFLQTLSFTIAGVGYVGTIV 429
            ++F CGCYD+ +G   +++YLFLQ+ +F +AGVGY+GT V
Sbjct: 513 LYIFFCGCYDFAYGGSYFYVYLFLQSCAFFVAGVGYIGTFV 553

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 787,033,908
Number of Sequences: 1393205
Number of extensions: 17553037
Number of successful extensions: 52924
Number of sequences better than 10.0: 54
Number of HSP's better than 10.0 without gapping: 50541
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 52855
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 49918505760
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
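
The search statistics above can be cross-checked with a few lines of arithmetic. The sketch below assumes the usual Karlin-Altschul relations (bit score = (lambda * raw - ln K) / ln 2, and E = search space * 2^-bits) and takes all constants from this report. The implied effective query length is derived here by division and is not printed by BLAST. Because the report rounds lambda, K and the bit scores, the recomputed values are expected to agree only approximately.

```python
import math

# Values copied from the report footer.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 123
EFFECTIVE_SEARCH_SPACE = 49_918_505_760
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_db_length(letters, n_sequences, hsp_length):
    """Database length after removing one effective HSP length per sequence."""
    return letters - n_sequences * hsp_length


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
    print("effective database length:", eff_db)
    # Implied effective query length (in residues) for this BLASTX search.
    print("implied effective query length:", EFFECTIVE_SEARCH_SPACE // eff_db)
    print("bits for raw score 579:", round(bit_score(579, GAPPED_LAMBDA, GAPPED_K), 1))
    print("E for 227 bits:", expect_value(227, EFFECTIVE_SEARCH_SPACE))
```

The recomputed effective database length matches the 277,325,032 printed in the report, and the E-value for the top hit comes out at the same order of magnitude as the reported 2e-58.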


EST assembly


no.  clone         accession  start   end
1    MFBL004b09_f  BP041461       1   593
2    MPDL080d01_f  AV780650      25   614
3    MPDL047f05_f  AV778891      78   749
4    MWL049f07_f   AV769413      82   657
5    MPDL053d07_f  AV779185      87   684
6    MF003c05_f    BP028381      94   298
7    MR096c07_f    BP083350     107   711
8    MPD093c01_f   AV776090     108   657
9    MPDL052f09_f  AV779150     109   756
10   MWM063d01_f   AV765703     112   450
11   MPDL001h04_f  AV776593     123   815
12   MWL006b09_f   AV768671     175   474
13   SPD027d01_f   BP046133     199   824
14   MPDL020g03_f  AV777525     532   999
15   MF014h03_f    BP029004     540  1037
16   MF007c12_f    BP028587     548   780
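
The member table can be summarised as a coverage profile. The sketch below is a minimal illustration that copies the 16 rows into a list and counts how many ESTs overlap a given position. The helper names are hypothetical. The positions are treated as inclusive layout coordinates, which is an assumption: they run to 1037, beyond the 913-letter consensus, so they may not map one-to-one onto consensus bases.

```python
# (clone, accession, start, end) copied from the EST assembly table.
MEMBERS = [
    ("MFBL004b09_f", "BP041461", 1, 593),
    ("MPDL080d01_f", "AV780650", 25, 614),
    ("MPDL047f05_f", "AV778891", 78, 749),
    ("MWL049f07_f", "AV769413", 82, 657),
    ("MPDL053d07_f", "AV779185", 87, 684),
    ("MF003c05_f", "BP028381", 94, 298),
    ("MR096c07_f", "BP083350", 107, 711),
    ("MPD093c01_f", "AV776090", 108, 657),
    ("MPDL052f09_f", "AV779150", 109, 756),
    ("MWM063d01_f", "AV765703", 112, 450),
    ("MPDL001h04_f", "AV776593", 123, 815),
    ("MWL006b09_f", "AV768671", 175, 474),
    ("SPD027d01_f", "BP046133", 199, 824),
    ("MPDL020g03_f", "AV777525", 532, 999),
    ("MF014h03_f", "BP029004", 540, 1037),
    ("MF007c12_f", "BP028587", 548, 780),
]


def layout_span(members):
    """Smallest start and largest end over all member ESTs."""
    return min(m[2] for m in members), max(m[3] for m in members)


def depth_at(members, position):
    """Number of member ESTs whose inclusive interval covers `position`."""
    return sum(1 for _, _, start, end in members if start <= position <= end)


if __name__ == "__main__":
    print("members:", len(MEMBERS))
    print("layout span:", layout_span(MEMBERS))
    for pos in (1, 200, 600, 1000):
        print(pos, depth_at(MEMBERS, pos))
```

Under these assumptions, coverage is deepest in the first few hundred positions, where 13 of the 16 ESTs overlap position 200, and drops to a single EST near the right-hand end of the layout.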




Lotus japonicus
Kazusa DNA Research Institute