Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC005240A_C01 KMC005240A_c01
(566 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value
dbj|BAC43014.1| unknown protein [Arabidopsis thaliana] gi|290290...   191  4e-48
ref|NP_568978.1| glycosyl hydrolase family 35 (beta-galactosidas...   159  2e-38
emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]    159  2e-38
dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]             159  2e-38
ref|NP_177866.1| glycosyl hydrolase family 35 (beta-galactosidas...   154  7e-37
>dbj|BAC43014.1| unknown protein [Arabidopsis thaliana] gi|29029060|gb|AAO64909.1|
At1g77410 [Arabidopsis thaliana]
Length = 820
Score = 191 bits (486), Expect = 4e-48
Identities = 97/154 (62%), Positives = 110/154 (70%), Gaps = 6/154 (3%)
Frame = -2
Query: 565 AYLERRLAGLRSVKV----QARDVTNPSWGYQIGLLGEKLQIYTASGSSKVQWESFQSS- 401
A+LERR+ G RSVK+ N SWGYQ+GL GEK +YT GS+KVQW+ ++ S
Sbjct: 545 AHLERRVVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRDSK 604
Query: 400 T*PLTWYQTTFDAPEGNHPVVLNLGSMGKGITWVNGQGIGRYWVSFHTPDGTSSQNWYHI 221
+ PLTWY+ +FD PEG PV LNLGSMGKG WVNGQ IGRYWVSFHT G SQ WYHI
Sbjct: 605 SQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSFHTYKGNPSQIWYHI 664
Query: 220 PRSILKSTGNLLVILEEE-SGNPLEITLDTVYTT 122
PRS LK NLLVILEEE GNPL IT+DTV T
Sbjct: 665 PRSFLKPNSNLLVILEEEREGNPLGITIDTVSVT 698
>ref|NP_568978.1| glycosyl hydrolase family 35 (beta-galactosidase); protein id:
At5g63800.1, supported by cDNA: gi_16649044, supported
by cDNA: gi_20260007 [Arabidopsis thaliana]
gi|16649045|gb|AAL24374.1| beta-galactosidase
[Arabidopsis thaliana] gi|20260008|gb|AAM13351.1|
beta-galactosidase [Arabidopsis thaliana]
Length = 420
Score = 159 bits (403), Expect = 2e-38
Identities = 77/153 (50%), Positives = 104/153 (67%), Gaps = 8/153 (5%)
Frame = -2
Query: 565 AYLERRLAGLRSVKVQAR-----DVTNPSWGYQIGLLGEKLQIYTASGSSKVQWESFQSS 401
AY+ERR GL V++ D++ WGY +GLLGEK+++Y ++V+W ++
Sbjct: 255 AYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKAG 314
Query: 400 ---T*PLTWYQTTFDAPEGNHPVVLNLGSMGKGITWVNGQGIGRYWVSFHTPDGTSSQNW 230
PL WY+TTFD P G+ PV L++ SMGKG WVNG+ IGRYWVSF TP G SQ+
Sbjct: 315 LIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPAGQPSQSI 374
Query: 229 YHIPRSILKSTGNLLVILEEESGNPLEITLDTV 131
YHIPR+ LK +GNLLV+ EEE G+PL I+L+T+
Sbjct: 375 YHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTI 407
>emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 718
Score = 159 bits (403), Expect = 2e-38
Identities = 77/153 (50%), Positives = 104/153 (67%), Gaps = 8/153 (5%)
Frame = -2
Query: 565 AYLERRLAGLRSVKVQAR-----DVTNPSWGYQIGLLGEKLQIYTASGSSKVQWESFQSS 401
AY+ERR GL V++ D++ WGY +GLLGEK+++Y ++V+W ++
Sbjct: 553 AYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKAG 612
Query: 400 ---T*PLTWYQTTFDAPEGNHPVVLNLGSMGKGITWVNGQGIGRYWVSFHTPDGTSSQNW 230
PL WY+TTFD P G+ PV L++ SMGKG WVNG+ IGRYWVSF TP G SQ+
Sbjct: 613 LIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPAGQPSQSI 672
Query: 229 YHIPRSILKSTGNLLVILEEESGNPLEITLDTV 131
YHIPR+ LK +GNLLV+ EEE G+PL I+L+T+
Sbjct: 673 YHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTI 705
>dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]
Length = 718
Score = 159 bits (403), Expect = 2e-38
Identities = 77/153 (50%), Positives = 104/153 (67%), Gaps = 8/153 (5%)
Frame = -2
Query: 565 AYLERRLAGLRSVKVQAR-----DVTNPSWGYQIGLLGEKLQIYTASGSSKVQWESFQSS 401
AY+ERR GL V++ D++ WGY +GLLGEK+++Y ++V+W ++
Sbjct: 553 AYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKAG 612
Query: 400 ---T*PLTWYQTTFDAPEGNHPVVLNLGSMGKGITWVNGQGIGRYWVSFHTPDGTSSQNW 230
PL WY+TTFD P G+ PV L++ SMGKG WVNG+ IGRYWVSF TP G SQ+
Sbjct: 613 LIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPAGQPSQSI 672
Query: 229 YHIPRSILKSTGNLLVILEEESGNPLEITLDTV 131
YHIPR+ LK +GNLLV+ EEE G+PL I+L+T+
Sbjct: 673 YHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTI 705
>ref|NP_177866.1| glycosyl hydrolase family 35 (beta-galactosidase); protein id:
At1g77410.1 [Arabidopsis thaliana]
gi|25326165|pir||D96803 probable beta-galactosidase
[imported] - Arabidopsis thaliana
gi|11079481|gb|AAG29193.1|AC078898_3 beta-galactosidase,
putative [Arabidopsis thaliana]
Length = 780
Score = 154 bits (389), Expect = 7e-37
Identities = 85/154 (55%), Positives = 99/154 (64%), Gaps = 6/154 (3%)
Frame = -2
Query: 565 AYLERRLAGLRSVKV----QARDVTNPSWGYQIGLLGEKLQIYTASGSSKVQWESFQSS- 401
A+LERR+ G RSVK+ N SWGYQ+GL GEK +YT GS+KVQW+ ++ S
Sbjct: 521 AHLERRVVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRDSK 580
Query: 400 T*PLTWYQTTFDAPEGNHPVVLNLGSMGKGITWVNGQGIGRYWVSFHTPDGTSSQNWYHI 221
+ PLTWY+ +FD PEG PV LNLGSMGKG WVNGQ I + S YHI
Sbjct: 581 SQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIAMF-----------SYFRYHI 629
Query: 220 PRSILKSTGNLLVILEEE-SGNPLEITLDTVYTT 122
PRS LK NLLVILEEE GNPL IT+DTV T
Sbjct: 630 PRSFLKPNSNLLVILEEEREGNPLGITIDTVSVT 663
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda     K        H
 0.318     0.135    0.401
Gapped
Lambda     K        H
 0.267     0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 511,677,349
Number of Sequences: 1393205
Number of extensions: 11697382
Number of successful extensions: 33193
Number of sequences better than 10.0: 160
Number of HSP's better than 10.0 without gapping: 28904
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31805
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20669577624
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
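
Note on the statistics: the bit scores, E-values and effective lengths in this report can be cross-checked against the Lambda/K values in the footer using the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = (search space) * 2^(-bits). The sketch below is only an illustration, not part of the BLAST output. All constants are copied from the report. The function names are made up for this example, and two points are assumptions inferred from the numbers rather than stated in the report: the translated query length is taken as 566 // 3 residues, and the report appears to print the integer part of the bit score.

```python
import math

# Constants copied from the report footer.
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410
LAMBDA_UNGAPPED, K_UNGAPPED = 0.318, 0.135
QUERY_NT = 566            # "(566 letters)"
DB_LETTERS = 448_689_247  # "length of database"
DB_SEQS = 1_393_205       # "Number of Sequences"
EFF_HSP_LEN = 116         # "effective HSP length"


def bit_score(raw, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def xdrop_bits(raw, lam):
    """Convert a raw X-drop value to bits (no K term)."""
    return lam * raw / math.log(2)


def effective_db_length():
    # Each database sequence is shortened by the effective HSP length.
    return DB_LETTERS - DB_SEQS * EFF_HSP_LEN


def effective_search_space():
    # Assumption: BLASTX uses the translated query length (nt // 3).
    eff_query = QUERY_NT // 3 - EFF_HSP_LEN
    return eff_query * effective_db_length()


def expect(raw):
    """E-value for a gapped alignment with the given raw score."""
    return effective_search_space() * 2.0 ** (-bit_score(raw))


if __name__ == "__main__":
    for raw in (486, 403, 389):  # raw scores of the reported hits
        print(raw, int(bit_score(raw)), "%.0e" % expect(raw))
```

Under these assumptions the computed values agree with the figures printed in the report, including the effective database length and the effective search space.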