Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC014061A_C01 KMC014061A_c01
(676 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_172460.1| hypothetical protein; protein id: At1g09890.1 [... 82 3e-25
pir||C86233 hypothetical protein [imported] - Arabidopsis thalia... 82 3e-25
ref|NP_195516.1| LG127/30 like gene; protein id: At4g38030.1 [Ar... 75 5e-23
gb|AAO42144.1| unknown protein [Arabidopsis thaliana] 67 5e-22
emb|CAA76417.1| MYST1 [Arabidopsis thaliana] 67 5e-22
>ref|NP_172460.1| hypothetical protein; protein id: At1g09890.1 [Arabidopsis
thaliana]
Length = 477
Score = 82.0 bits (201), Expect(2) = 3e-25
Identities = 35/46 (76%), Positives = 38/46 (82%)
Frame = +1
Query: 535 QSYQFWTRTDENGNFKIENIVPGDYNLYAWIPGFIGDYKYNDIITI 672
+ YQFWTRTDE G F I I PG YNLYAWIPGFIGDYKY+D+ITI
Sbjct: 376 KEYQFWTRTDEEGFFYISGIRPGQYNLYAWIPGFIGDYKYDDVITI 421
Score = 55.5 bits (132), Expect(2) = 3e-25
Identities = 31/95 (32%), Positives = 40/95 (41%)
Frame = +3
Query: 150 EVRSWPYDFPQSQDFFPSKQRGQVTGQLQVRDGYIHVNV*CTYNLWLR*Q*ITYMAS*IL 329
E SWPY FP S D+ ++QRG V G+L V+D Y+
Sbjct: 314 EAESWPYSFPASDDYVKTEQRGNVVGRLLVQDRYV------------------------- 348
Query: 330 GLVLFIRGKTSVYPSNAYIGLPFPGEAGSWQTEGK 434
K + + Y+GL PG AGSWQ E K
Sbjct: 349 -------DKDFIAANRGYVGLAVPGAAGSWQRECK 376
>pir||C86233 hypothetical protein [imported] - Arabidopsis thaliana
gi|2160179|gb|AAB60742.1| F21M12.28 gene product
[Arabidopsis thaliana]
Length = 447
Score = 82.0 bits (201), Expect(2) = 3e-25
Identities = 35/46 (76%), Positives = 38/46 (82%)
Frame = +1
Query: 535 QSYQFWTRTDENGNFKIENIVPGDYNLYAWIPGFIGDYKYNDIITI 672
+ YQFWTRTDE G F I I PG YNLYAWIPGFIGDYKY+D+ITI
Sbjct: 346 KEYQFWTRTDEEGFFYISGIRPGQYNLYAWIPGFIGDYKYDDVITI 391
Score = 55.5 bits (132), Expect(2) = 3e-25
Identities = 31/95 (32%), Positives = 40/95 (41%)
Frame = +3
Query: 150 EVRSWPYDFPQSQDFFPSKQRGQVTGQLQVRDGYIHVNV*CTYNLWLR*Q*ITYMAS*IL 329
E SWPY FP S D+ ++QRG V G+L V+D Y+
Sbjct: 284 EAESWPYSFPASDDYVKTEQRGNVVGRLLVQDRYV------------------------- 318
Query: 330 GLVLFIRGKTSVYPSNAYIGLPFPGEAGSWQTEGK 434
K + + Y+GL PG AGSWQ E K
Sbjct: 319 -------DKDFIAANRGYVGLAVPGAAGSWQRECK 346
>ref|NP_195516.1| LG127/30 like gene; protein id: At4g38030.1 [Arabidopsis thaliana]
gi|7485845|pir||T05630 hypothetical protein F20D10.150 -
Arabidopsis thaliana gi|4467109|emb|CAB37543.1| LG127/30
like gene [Arabidopsis thaliana]
gi|7270786|emb|CAB80468.1| LG127/30 like gene
[Arabidopsis thaliana]
Length = 649
Score = 75.5 bits (184), Expect(2) = 5e-23
Identities = 27/48 (56%), Positives = 39/48 (81%)
Frame = +1
Query: 529 HTQSYQFWTRTDENGNFKIENIVPGDYNLYAWIPGFIGDYKYNDIITI 672
+T+ YQFWT+T+E G F IEN+ PG YNLY W+PGFIGD++Y +++ +
Sbjct: 388 NTKGYQFWTKTNETGYFTIENVRPGTYNLYGWVPGFIGDFRYQNLVNV 435
Score = 54.3 bits (129), Expect(2) = 5e-23
Identities = 33/95 (34%), Positives = 43/95 (44%)
Frame = +3
Query: 150 EVRSWPYDFPQSQDFFPSKQRGQVTGQLQVRDGYIHVNV*CTYNLWLR*Q*ITYMAS*IL 329
EV++WPYDF S D+ ++RG VTG+L V D ++
Sbjct: 332 EVKAWPYDFVASSDYLSRRERGSVTGRLLVNDRFL------------------------- 366
Query: 330 GLVLFIRGKTSVYPSNAYIGLPFPGEAGSWQTEGK 434
GK +AY+GL PGEAGSWQT K
Sbjct: 367 -----TPGK------SAYVGLAPPGEAGSWQTNTK 390
>gb|AAO42144.1| unknown protein [Arabidopsis thaliana]
Length = 678
Score = 67.4 bits (163), Expect(2) = 5e-22
Identities = 29/46 (63%), Positives = 34/46 (73%)
Frame = +1
Query: 535 QSYQFWTRTDENGNFKIENIVPGDYNLYAWIPGFIGDYKYNDIITI 672
+ YQFWTR D+ G F I N+ PG Y+LYAW+ GFIGDYKY ITI
Sbjct: 416 KGYQFWTRADKMGMFTIANVRPGTYSLYAWVSGFIGDYKYVRDITI 461
Score = 58.9 bits (141), Expect(2) = 5e-22
Identities = 36/96 (37%), Positives = 45/96 (46%)
Frame = +3
Query: 147 NEVRSWPYDFPQSQDFFPSKQRGQVTGQLQVRDGYIHVNV*CTYNLWLR*Q*ITYMAS*I 326
+EV+SWPYDF +S D+ QRG V GQL V D Y
Sbjct: 352 SEVQSWPYDFVKSVDYPLHHQRGTVKGQLFVIDRY------------------------- 386
Query: 327 LGLVLFIRGKTSVYPSNAYIGLPFPGEAGSWQTEGK 434
I+ T ++ A++GL PGEAGSWQTE K
Sbjct: 387 ------IKNVTYLFGQFAFVGLALPGEAGSWQTENK 416
>emb|CAA76417.1| MYST1 [Arabidopsis thaliana]
Length = 435
Score = 67.4 bits (163), Expect(2) = 5e-22
Identities = 29/46 (63%), Positives = 34/46 (73%)
Frame = +1
Query: 535 QSYQFWTRTDENGNFKIENIVPGDYNLYAWIPGFIGDYKYNDIITI 672
+ YQFWTR D+ G F I N+ PG Y+LYAW+ GFIGDYKY ITI
Sbjct: 173 KGYQFWTRADKMGMFTIANVRPGTYSLYAWVSGFIGDYKYVRDITI 218
Score = 58.9 bits (141), Expect(2) = 5e-22
Identities = 36/96 (37%), Positives = 45/96 (46%)
Frame = +3
Query: 147 NEVRSWPYDFPQSQDFFPSKQRGQVTGQLQVRDGYIHVNV*CTYNLWLR*Q*ITYMAS*I 326
+EV+SWPYDF +S D+ QRG V GQL V D Y
Sbjct: 109 SEVQSWPYDFVKSVDYPLHHQRGTVKGQLFVIDRY------------------------- 143
Query: 327 LGLVLFIRGKTSVYPSNAYIGLPFPGEAGSWQTEGK 434
I+ T ++ A++GL PGEAGSWQTE K
Sbjct: 144 ------IKNVTYLFGQFAFVGLALPGEAGSWQTENK 173
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 594,829,621
Number of Sequences: 1393205
Number of extensions: 12803468
Number of successful extensions: 28734
Number of sequences better than 10.0: 20
Number of HSP's better than 10.0 without gapping: 27911
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28731
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 29704274460
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)