Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004724A_C01 KMC004724A_c01
(447 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
ref|NP_195107.1| glycosyl hydrolase family 10; protein id: At4g3...    94   5e-19
pir||B85398 hypothetical protein AT4g33820 [imported] - Arabidop...    91   2e-18
ref|NP_680761.1| glycosyl hydrolase family 10; protein id: At4g3...    91   2e-18
ref|NP_179076.1| glycosyl hydrolase family 10; protein id: At2g1...    87   5e-17
pir||T05212 hypothetical protein F17I5.30 - Arabidopsis thaliana...    87   6e-17
>ref|NP_195107.1| glycosyl hydrolase family 10; protein id: At4g33810.1 [Arabidopsis
thaliana] gi|7487091|pir||T04998 hypothetical protein
T16L1.300 - Arabidopsis thaliana
gi|3549683|emb|CAA20594.1| beta-xylan endohydrolase-like
protein [Arabidopsis thaliana]
gi|7270330|emb|CAB80098.1| beta-xylan endohydrolase-like
protein [Arabidopsis thaliana]
Length = 536
Score = 93.6 bits (231), Expect = 5e-19
Identities = 55/123 (44%), Positives = 77/123 (61%), Gaps = 6/123 (4%)
Frame = -2
Query: 446 HQEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWG 267
+QE Y++ ILRE Y+HPAV+GII+F GP +GF K LAD+ F NT GDV+D L++EW
Sbjct: 414 NQEVYIEEILREAYSHPAVKGIIIFAGPEVSGFDKLTLADKYFNNTATGDVIDKLLKEWQ 473
Query: 266 TG---PHEAKADSRG-IVDISLHHGDYDVIVTHP-QKQISKTLNLSVRKGSPQ-QTIQVK 105
P DS ++SL HG Y+V V+HP K +S + +L V K Q Q ++V
Sbjct: 474 QSSEIPKIFMTDSENDEEEVSLLHGHYNVNVSHPWMKNMSTSFSLEVTKEMGQRQVVRVV 533
Query: 104 MHA 96
++A
Sbjct: 534 INA 536
>pir||B85398 hypothetical protein AT4g33820 [imported] - Arabidopsis thaliana
gi|7270331|emb|CAB80099.1| putative protein [Arabidopsis
thaliana]
Length = 546
Score = 91.3 bits (225), Expect = 2e-18
Identities = 54/124 (43%), Positives = 75/124 (59%), Gaps = 7/124 (5%)
Frame = -2
Query: 446 HQEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWG 267
+Q YV+ ILRE Y+HPAV+GII+F GP +GF K LAD++F NT GDV+D L++EW
Sbjct: 423 NQAQYVEDILREAYSHPAVKGIIIFGGPEVSGFDKLTLADKDFNNTQTGDVIDKLLKEWQ 482
Query: 266 TGPHEAKADSRGIVD-----ISLHHGDYDVIVTHPQ-KQISKTLNLSVRKGSPQ-QTIQV 108
E + + D +SL HG Y+V V+HP +S + +L V K Q Q I+V
Sbjct: 483 QKSSEIQTNFTADSDNEEEEVSLLHGHYNVNVSHPWIANLSTSFSLEVTKEMDQDQVIRV 542
Query: 107 KMHA 96
+ A
Sbjct: 543 VISA 546
>ref|NP_680761.1| glycosyl hydrolase family 10; protein id: At4g33820.1 [Arabidopsis
thaliana] gi|27754330|gb|AAO22618.1| putative glycosyl
hydrolase family 10 protein [Arabidopsis thaliana]
Length = 570
Score = 91.3 bits (225), Expect = 2e-18
Identities = 54/124 (43%), Positives = 75/124 (59%), Gaps = 7/124 (5%)
Frame = -2
Query: 446 HQEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWG 267
+Q YV+ ILRE Y+HPAV+GII+F GP +GF K LAD++F NT GDV+D L++EW
Sbjct: 447 NQAQYVEDILREAYSHPAVKGIIIFGGPEVSGFDKLTLADKDFNNTQTGDVIDKLLKEWQ 506
Query: 266 TGPHEAKADSRGIVD-----ISLHHGDYDVIVTHPQ-KQISKTLNLSVRKGSPQ-QTIQV 108
E + + D +SL HG Y+V V+HP +S + +L V K Q Q I+V
Sbjct: 507 QKSSEIQTNFTADSDNEEEEVSLLHGHYNVNVSHPWIANLSTSFSLEVTKEMDQDQVIRV 566
Query: 107 KMHA 96
+ A
Sbjct: 567 VISA 570
>ref|NP_179076.1| glycosyl hydrolase family 10; protein id: At2g14690.1 [Arabidopsis
thaliana] gi|25411580|pir||C84520 1,4-beta-xylan
endohydrolase [imported] - Arabidopsis thaliana
gi|3810591|gb|AAC69373.1| 1,4-beta-xylan endohydrolase
[Arabidopsis thaliana]
Length = 552
Score = 87.0 bits (214), Expect = 5e-17
Identities = 53/130 (40%), Positives = 76/130 (57%), Gaps = 14/130 (10%)
Frame = -2
Query: 443 QEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWGT 264
Q Y++ ILRE Y+HPAV+ II++ GP +GF K LAD++FKNT GD++D L++EW
Sbjct: 423 QVKYMEDILREAYSHPAVKAIILYGGPEVSGFDKLTLADKDFKNTQAGDLIDKLLQEWKQ 482
Query: 263 GP-------HEAKADSRGIV-----DISLHHGDYDVIVTHP-QKQISKTLNLSVRKGSPQ 123
P HE + G + +ISL HG Y V VT+P K +S ++ V K S
Sbjct: 483 EPVEIPIQHHEHNDEEGGRIIGFSPEISLLHGHYRVTVTNPSMKNLSTRFSVEVTKESGH 542
Query: 122 -QTIQVKMHA 96
Q +Q+ + A
Sbjct: 543 LQEVQLVIDA 552
>pir||T05212 hypothetical protein F17I5.30 - Arabidopsis thaliana
gi|3297808|emb|CAA19866.1| putative protein [Arabidopsis
thaliana] gi|7270333|emb|CAB80101.1| putative protein
[Arabidopsis thaliana]
Length = 669
Score = 86.7 bits (213), Expect = 6e-17
Identities = 41/101 (40%), Positives = 60/101 (58%), Gaps = 2/101 (1%)
Frame = -2
Query: 437 DYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDVVDMLIREWG--T 264
+Y + +LREG+AHP V G++M+ G +G + L D NFKN P GDVVD L+REWG
Sbjct: 551 NYFEQVLREGHAHPKVNGMVMWTGYSPSGCYRMCLTDGNFKNLPTGDVVDKLLREWGGLR 610
Query: 263 GPHEAKADSRGIVDISLHHGDYDVIVTHPQKQISKTLNLSV 141
D+ G+ + L HGDYD+ ++HP + N ++
Sbjct: 611 SQTTGVTDANGLFEAPLFHGDYDLRISHPLTNSKASYNFTL 651
Score = 48.1 bits (113), Expect = 2e-05
Identities = 21/50 (42%), Positives = 33/50 (66%)
Frame = -2
Query: 443 QEDYVDWILREGYAHPAVQGIIMFMGPVQAGFKKAPLADENFKNTPIGDV 294
Q Y + +LR+G+AHP V+G++++ G +G + L D NF+N P GDV
Sbjct: 60 QAKYFEQVLRDGHAHPQVKGMVVWGGYSPSGCYRMCLTDGNFRNLPTGDV 109
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 375,685,478
Number of Sequences: 1393205
Number of extensions: 7805549
Number of successful extensions: 16530
Number of sequences better than 10.0: 47
Number of HSP's better than 10.0 without gapping: 16189
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 16515
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 6622363848
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
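
Note on the scores: each "Score =" line above gives a bit score followed by the raw score in parentheses. The two are related by the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2. The sketch below is a minimal illustration, assuming the gapped Lambda and K printed in the footer (0.267 and 0.0410) are the ones that apply to these alignments; the function and constant names are hypothetical and not part of BLAST.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Gapped statistical parameters as printed in the report footer
# (assumed here to be the ones used for the reported alignments).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

# Raw scores taken from the parentheses on the "Score =" lines above.
for raw in (231, 225, 214, 213, 113):
    print(raw, round(bit_score(raw, GAPPED_LAMBDA, GAPPED_K), 1))
```

Under that assumption the computed values round to the bit scores shown in the report (93.6, 91.3, 87.0, 86.7 and 48.1).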