KMC000345A_c01

Fasta Sequence
>KMC000345A_C01 KMC000345A_c01
ggtaaGAGATCATACGATGGGACTCTTATTTCCATAAATATGAAAAAATAACCGTCTCCA
AGGTTTTTAAAATTGCAATACATATATTATGTTCCCTGAGGAAGCCATAGTTTCCTTGGC
CTTGGTCACATCTGCAATGCAAAGTATTTTTGACAAAAGATGGCATGGGAGTAAAGAAAA
TGAAAATTAGTAGAGAACTTTACCAGATCAATAAAAGCATGGTCTTGGTTTAATATACAT
GCAAATGTAGTTCATTAGAATTTCACTGATTTGCTTTCAGTCCACTATTCCAGTGCAATC
TTGGTCTCAGGGTAGCTCAAGTTCAGGTATTTGTTGAATAGCCACCTAGAAGTGTTTAGA
GCATCACCTCTACTTTCCACAGGAAATATATTCCTACTGCTTTGCCAATCATTAGTAAGC
TTTATCCACTCCCTCCTCCATTCCTTTAGCTTGAAATCCTCACCTCTGTCTAAGCTTTCT
CTTAAATATTTGAAATAAATAGCAGCTCTTGGGCCATAGTAATCATGAAGAACCCCGCTC
CAGTACTTGTTTCCATAGTCACGGAGCAAGCTGGGTTCCTCCTCTGTGTngtcaaaccac
atggttattngtgttctagcattccactcaaac


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000345A_C01 KMC000345A_c01
         (633 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]     131  8e-30
ref|NP_196873.1| alpha-N-acetylglucosaminidase; protein id: At5g...   121  7e-27
gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus] gi|2...    60  3e-08
ref|NP_038820.1| alpha-N-acetylglucosaminidase (Sanfilippo disea...    60  3e-08
ref|XP_220983.1| similar to alpha-N-acetylglucosaminidase [Mus m...    58  1e-07

>emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
          Length = 811

 Score =  131 bits (329), Expect = 8e-30
 Identities = 58/104 (55%), Positives = 75/104 (71%)
 Frame = -2

Query: 632  FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
            +EWNART ITMWFD T+   S L DY NK+WSG+L  YY PRA+IYF+ L +SL    DF
Sbjct: 708  YEWNARTQITMWFDNTKYNQSQLHDYANKFWSGLLEAYYLPRASIYFELLSKSLKEKVDF 767

Query: 452  KLKEWRREWIKLTNDWQSSRNIFPVESRGDALNTSRWLFNKYLN 321
            KL+EWR+EWI  +N WQ S  ++PV+++GDAL  +  LF KY +
Sbjct: 768  KLEEWRKEWIAYSNKWQESTELYPVKAQGDALAIATALFEKYFS 811

>ref|NP_196873.1| alpha-N-acetylglucosaminidase; protein id: At5g13690.1, supported by
            cDNA: gi_19423947 [Arabidopsis thaliana]
            gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase
            [Arabidopsis thaliana] gi|19423948|gb|AAL87291.1|
            putative alpha-N-acetylglucosaminidase [Arabidopsis
            thaliana] gi|21436231|gb|AAM51254.1| putative
            alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
          Length = 806

 Score =  121 bits (304), Expect = 7e-27
 Identities = 53/103 (51%), Positives = 75/103 (72%), Gaps = 1/103 (0%)
 Frame = -2

Query: 632  FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
            +EWNART +TMW+D  +   S L DY NK+WSG+L DYY PRA +YF  + +SL   + F
Sbjct: 702  YEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIF 761

Query: 452  KLKEWRREWIKLTNDW-QSSRNIFPVESRGDALNTSRWLFNKY 327
            K+++WRREWI +++ W QSS  ++PV+++GDAL  SR L +KY
Sbjct: 762  KVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 804

>gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
           gi|20385160|gb|AAM21194.1|AF363242_1
           N-acetyl-glucosaminidase [Mus musculus]
          Length = 739

 Score = 59.7 bits (143), Expect = 3e-08
 Identities = 34/102 (33%), Positives = 56/102 (54%)
 Frame = -2

Query: 632 FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
           +E N+R  IT+W      E ++L DY NK  +G++ DYY PR  ++   L  SL RG  F
Sbjct: 636 YEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPF 690

Query: 452 KLKEWRREWIKLTNDWQSSRNIFPVESRGDALNTSRWLFNKY 327
           +  E+ +    L   +  ++  +P + RGD ++ S+ +F KY
Sbjct: 691 QQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732

>ref|NP_038820.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB);
           alpha-N-acetylglucosaminidase, lysosomal [Mus musculus]
           gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
          Length = 739

 Score = 59.7 bits (143), Expect = 3e-08
 Identities = 34/102 (33%), Positives = 56/102 (54%)
 Frame = -2

Query: 632 FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
           +E N+R  IT+W      E ++L DY NK  +G++ DYY PR  ++   L  SL RG  F
Sbjct: 636 YEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPF 690

Query: 452 KLKEWRREWIKLTNDWQSSRNIFPVESRGDALNTSRWLFNKY 327
           +  E+ +    L   +  ++  +P + RGD ++ S+ +F KY
Sbjct: 691 QQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732

>ref|XP_220983.1| similar to alpha-N-acetylglucosaminidase [Mus musculus] [Rattus
           norvegicus]
          Length = 633

 Score = 57.8 bits (138), Expect = 1e-07
 Identities = 31/102 (30%), Positives = 58/102 (56%)
 Frame = -2

Query: 632 FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
           +E N+R  IT+W      E ++L DY NK  +G++ DYY PR  ++   L  SL RG  F
Sbjct: 530 YEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGIPF 584

Query: 452 KLKEWRREWIKLTNDWQSSRNIFPVESRGDALNTSRWLFNKY 327
           +  ++ +    L   + +++  +P++ +GD ++ S+ +F K+
Sbjct: 585 QQHQFEKSVFPLEQAFINNKKRYPIQPQGDTVDLSKKIFLKF 626

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 549,990,255
Number of Sequences: 1393205
Number of extensions: 11829336
Number of successful extensions: 23129
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 22535
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 23120
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 26154777244
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
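
The bit scores and E values in the hit list can be approximately reproduced from the gapped statistics above using the standard Karlin-Altschul relations, S' = (Lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). The sketch below is only a consistency check under that assumption: the constants are copied from this report as printed (and therefore rounded), and the function names are illustrative.

```python
import math

# Gapped statistics as printed in the report (rounded values).
LAMBDA = 0.267
K = 0.0410
SEARCH_SPACE = 26154777244  # "effective search space used"


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(raw_score, search_space=SEARCH_SPACE):
    """Expected number of chance hits with at least this score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


def effective_db_length(total_letters, num_sequences, hsp_length):
    """Database length after subtracting the effective HSP length per sequence."""
    return total_letters - num_sequences * hsp_length


# Raw scores (in parentheses in the report) of the hits listed above.
for raw in (329, 304, 143, 138):
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.0e}")

print(effective_db_length(448689247, 1393205, 118))
```

Because Lambda and K are rounded in the report, the computed values agree with the listed scores only approximately. The effective database length, by contrast, is integer arithmetic and reproduces the reported 284,291,057 exactly.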


EST assembly


No.  Clone        Accession  Start  End
1    MPD026a04_f  AV771747       1  449
2    GENLf034a12  BP064109       6  488
3    GENLf013c12  BP063021      10  492
4    GENLf061b12  BP065598     181  636
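
The assembly table can be read programmatically to see how many ESTs support each region of the contig. This short Python sketch assumes the two numbers in each row are the 1-based start and end positions of the clone within the assembly; the helper names are hypothetical.

```python
# Rows copied from the table above: index, clone, accession, start, end.
ASSEMBLY = """\
1 MPD026a04_f AV771747 1 449
2 GENLf034a12 BP064109 6 488
3 GENLf013c12 BP063021 10 492
4 GENLf061b12 BP065598 181 636
"""


def parse_assembly(text):
    """Return a list of (clone, accession, start, end) tuples."""
    clones = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 5:
            continue  # skip blank or malformed lines
        _, clone, accession, start, end = fields
        clones.append((clone, accession, int(start), int(end)))
    return clones


def depth_at(clones, position):
    """Number of clones whose interval covers the given position."""
    return sum(1 for _, _, start, end in clones if start <= position <= end)


clones = parse_assembly(ASSEMBLY)
span = (min(c[2] for c in clones), max(c[3] for c in clones))
print("assembly span:", span)
for pos in (3, 200, 600):
    print(pos, depth_at(clones, pos))
```

Under that reading, all four clones overlap in the middle of the contig, while each end is supported by a single EST.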




Lotus japonicus
Kazusa DNA Research Institute