Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000345A_C01 KMC000345A_c01
(633 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum] 131 8e-30
ref|NP_196873.1| alpha-N-acetylglucosaminidase; protein id: At5g... 121 7e-27
gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus] gi|2... 60 3e-08
ref|NP_038820.1| alpha-N-acetylglucosaminidase (Sanfilippo disea... 60 3e-08
ref|XP_220983.1| similar to alpha-N-acetylglucosaminidase [Mus m... 58 1e-07
>emb|CAA77084.1| alpha-N-acetylglucosaminidase [Nicotiana tabacum]
Length = 811
Score = 131 bits (329), Expect = 8e-30
Identities = 58/104 (55%), Positives = 75/104 (71%)
Frame = -2
Query: 632 FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
+EWNART ITMWFD T+ S L DY NK+WSG+L YY PRA+IYF+ L +SL DF
Sbjct: 708 YEWNARTQITMWFDNTKYNQSQLHDYANKFWSGLLEAYYLPRASIYFELLSKSLKEKVDF 767
Query: 452 KLKEWRREWIKLTNDWQSSRNIFPVESRGDALNTSRWLFNKYLN 321
KL+EWR+EWI +N WQ S ++PV+++GDAL + LF KY +
Sbjct: 768 KLEEWRKEWIAYSNKWQESTELYPVKAQGDALAIATALFEKYFS 811
>ref|NP_196873.1| alpha-N-acetylglucosaminidase; protein id: At5g13690.1, supported by
cDNA: gi_19423947 [Arabidopsis thaliana]
gi|9758035|dbj|BAB08696.1| alpha-N-acetylglucosaminidase
[Arabidopsis thaliana] gi|19423948|gb|AAL87291.1|
putative alpha-N-acetylglucosaminidase [Arabidopsis
thaliana] gi|21436231|gb|AAM51254.1| putative
alpha-N-acetylglucosaminidase [Arabidopsis thaliana]
Length = 806
Score = 121 bits (304), Expect = 7e-27
Identities = 53/103 (51%), Positives = 75/103 (72%), Gaps = 1/103 (0%)
Frame = -2
Query: 632 FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
+EWNART +TMW+D + S L DY NK+WSG+L DYY PRA +YF + +SL + F
Sbjct: 702 YEWNARTQVTMWYDSNDVNQSKLHDYANKFWSGLLEDYYLPRARLYFNEMLKSLRDKKIF 761
Query: 452 KLKEWRREWIKLTNDW-QSSRNIFPVESRGDALNTSRWLFNKY 327
K+++WRREWI +++ W QSS ++PV+++GDAL SR L +KY
Sbjct: 762 KVEKWRREWIMMSHKWQQSSSEVYPVKAKGDALAISRHLLSKY 804
>gb|AAC26842.1| alpha-N-acetylglucosaminidase [Mus musculus]
gi|20385160|gb|AAM21194.1|AF363242_1
N-acetyl-glucosaminidase [Mus musculus]
Length = 739
Score = 59.7 bits (143), Expect = 3e-08
Identities = 34/102 (33%), Positives = 56/102 (54%)
Frame = -2
Query: 632 FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
+E N+R IT+W E ++L DY NK +G++ DYY PR ++ L SL RG F
Sbjct: 636 YEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPF 690
Query: 452 KLKEWRREWIKLTNDWQSSRNIFPVESRGDALNTSRWLFNKY 327
+ E+ + L + ++ +P + RGD ++ S+ +F KY
Sbjct: 691 QQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732
>ref|NP_038820.1| alpha-N-acetylglucosaminidase (Sanfilippo disease IIIB);
alpha-N-acetylglucosaminidase, lysosomal [Mus musculus]
gi|2660688|gb|AAB88084.1| Naglu [Mus musculus]
Length = 739
Score = 59.7 bits (143), Expect = 3e-08
Identities = 34/102 (33%), Positives = 56/102 (54%)
Frame = -2
Query: 632 FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
+E N+R IT+W E ++L DY NK +G++ DYY PR ++ L SL RG F
Sbjct: 636 YEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGVPF 690
Query: 452 KLKEWRREWIKLTNDWQSSRNIFPVESRGDALNTSRWLFNKY 327
+ E+ + L + ++ +P + RGD ++ S+ +F KY
Sbjct: 691 QQHEFEKNVFPLEQAFVYNKKRYPSQPRGDTVDLSKKIFLKY 732
>ref|XP_220983.1| similar to alpha-N-acetylglucosaminidase [Mus musculus] [Rattus
norvegicus]
Length = 633
Score = 57.8 bits (138), Expect = 1e-07
Identities = 31/102 (30%), Positives = 58/102 (56%)
Frame = -2
Query: 632 FEWNARTXITMWFDXTEEEPSLLRDYGNKYWSGVLHDYYGPRAAIYFKYLRESLDRGEDF 453
+E N+R IT+W E ++L DY NK +G++ DYY PR ++ L SL RG F
Sbjct: 530 YEQNSRYQITLW----GPEGNIL-DYANKQLAGLVADYYQPRWCLFLGTLAHSLARGIPF 584
Query: 452 KLKEWRREWIKLTNDWQSSRNIFPVESRGDALNTSRWLFNKY 327
+ ++ + L + +++ +P++ +GD ++ S+ +F K+
Sbjct: 585 QQHQFEKSVFPLEQAFINNKKRYPIQPQGDTVDLSKKIFLKF 626
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 549,990,255
Number of Sequences: 1393205
Number of extensions: 11829336
Number of successful extensions: 23129
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 22535
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 23120
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 26154777244
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)