Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC002573A_C01 KMC002573A_c01
(644 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_563729.1| expressed protein; protein id: At1g05140.1, sup... 164 1e-39
gb|AAM98118.1| unknown protein [Arabidopsis thaliana] 164 1e-39
ref|NP_565745.1| expressed protein; protein id: At2g32480.1, sup... 161 6e-39
ref|ZP_00073812.1| hypothetical protein [Trichodesmium erythraeu... 104 1e-21
ref|NP_441081.1| hypothetical protein [Synechocystis sp. PCC 680... 103 2e-21
>ref|NP_563729.1| expressed protein; protein id: At1g05140.1, supported by cDNA:
gi_15010609, supported by cDNA: gi_17065221 [Arabidopsis
thaliana] gi|25331649|pir||F86185 hypothetical protein
[imported] - Arabidopsis thaliana
gi|2388583|gb|AAB71464.1| Similar to Synechocystis
hypothetical protein (gb|D90908). [Arabidopsis thaliana]
gi|17065222|gb|AAL32765.1| Unknown protein [Arabidopsis
thaliana]
Length = 441
Score = 164 bits (414), Expect = 1e-39
Identities = 85/95 (89%), Positives = 91/95 (95%)
Frame = -2
Query: 643 IIAVGAEVARSNIDGLYQFAAILNINLAVINLLPLPALDGGSLALILVEAARGGRKLPLE 464
IIAVGAEVARSN DGLYQFAA+LN+NLAVINLLPLPALDGG+LALIL+EA RGGRKLPLE
Sbjct: 347 IIAVGAEVARSNADGLYQFAALLNLNLAVINLLPLPALDGGTLALILLEAVRGGRKLPLE 406
Query: 463 VEQRIMSSGIMLVLLLGLYLIVRDTLNLDFIKEML 359
VEQ IMSSGIMLVL LGL+LIV+DTLNLDFIKEML
Sbjct: 407 VEQGIMSSGIMLVLFLGLFLIVKDTLNLDFIKEML 441
>gb|AAM98118.1| unknown protein [Arabidopsis thaliana]
Length = 441
Score = 164 bits (414), Expect = 1e-39
Identities = 85/95 (89%), Positives = 91/95 (95%)
Frame = -2
Query: 643 IIAVGAEVARSNIDGLYQFAAILNINLAVINLLPLPALDGGSLALILVEAARGGRKLPLE 464
IIAVGAEVARSN DGLYQFAA+LN+NLAVINLLPLPALDGG+LALIL+EA RGGRKLPLE
Sbjct: 347 IIAVGAEVARSNADGLYQFAALLNLNLAVINLLPLPALDGGTLALILLEAVRGGRKLPLE 406
Query: 463 VEQRIMSSGIMLVLLLGLYLIVRDTLNLDFIKEML 359
VEQ IMSSGIMLVL LGL+LIV+DTLNLDFIKEML
Sbjct: 407 VEQGIMSSGIMLVLFLGLFLIVKDTLNLDFIKEML 441
>ref|NP_565745.1| expressed protein; protein id: At2g32480.1, supported by cDNA:
18983., supported by cDNA: gi_14423491 [Arabidopsis
thaliana] gi|7487438|pir||T02547 hypothetical protein
At2g32480 [imported] - Arabidopsis thaliana
gi|3298536|gb|AAC25930.1| expressed protein [Arabidopsis
thaliana] gi|14423492|gb|AAK62428.1|AF386983_1 Unknown
protein [Arabidopsis thaliana]
gi|21553979|gb|AAM63060.1| unknown [Arabidopsis
thaliana]
Length = 447
Score = 161 bits (408), Expect = 6e-39
Identities = 83/95 (87%), Positives = 92/95 (96%)
Frame = -2
Query: 643 IIAVGAEVARSNIDGLYQFAAILNINLAVINLLPLPALDGGSLALILVEAARGGRKLPLE 464
IIAVGAEVARSNIDGLYQFAA+LNINLAVINLLPLPALDGG+LALIL+EA RGG+KLP+E
Sbjct: 353 IIAVGAEVARSNIDGLYQFAALLNINLAVINLLPLPALDGGTLALILLEAVRGGKKLPVE 412
Query: 463 VEQRIMSSGIMLVLLLGLYLIVRDTLNLDFIKEML 359
VEQ IMSSGIMLV+ LGL+LIV+DTL+LDFIKEML
Sbjct: 413 VEQGIMSSGIMLVIFLGLFLIVKDTLSLDFIKEML 447
>ref|ZP_00073812.1| hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 364
Score = 104 bits (259), Expect = 1e-21
Identities = 46/94 (48%), Positives = 76/94 (79%)
Frame = -2
Query: 643 IIAVGAEVARSNIDGLYQFAAILNINLAVINLLPLPALDGGSLALILVEAARGGRKLPLE 464
I+ +GA++A+ N+ L++F A+++INLAVIN+LPLPALDGG LA +++E R G+ LPL
Sbjct: 269 IVDMGAKIAQDNVGELFKFGALISINLAVINILPLPALDGGQLAFLVIEGVR-GKPLPLR 327
Query: 463 VEQRIMSSGIMLVLLLGLYLIVRDTLNLDFIKEM 362
+++ +M +G++L+L LG++LI+RDT NLD ++ +
Sbjct: 328 IQENVMQTGLVLLLGLGVFLIIRDTANLDGVRSI 361
>ref|NP_441081.1| hypothetical protein [Synechocystis sp. PCC 6803]
gi|2496803|sp|P73714|YI21_SYNY3 Hypothetical zinc
metalloprotease slr1821 gi|7470513|pir||S77203
hypothetical protein slr1821 - Synechocystis sp. (strain
PCC 6803) gi|1652842|dbj|BAA17761.1|
ORF_ID:slr1821~hypothetical protein [Synechocystis sp.
PCC 6803]
Length = 366
Score = 103 bits (257), Expect = 2e-21
Identities = 50/95 (52%), Positives = 72/95 (75%)
Frame = -2
Query: 643 IIAVGAEVARSNIDGLYQFAAILNINLAVINLLPLPALDGGSLALILVEAARGGRKLPLE 464
I+ GA +ARS+ L+QF A+++INLAVIN+LPLPALDGG L +L+E G+ LP +
Sbjct: 266 IVEYGANIARSDASNLFQFGALISINLAVINILPLPALDGGQLVFLLIEGLL-GKPLPEK 324
Query: 463 VEQRIMSSGIMLVLLLGLYLIVRDTLNLDFIKEML 359
+ +M +G++L+L LG++LIVRDTLNL F++E L
Sbjct: 325 FQMGVMQTGLVLLLSLGVFLIVRDTLNLTFVQEFL 359
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 518,272,093
Number of Sequences: 1393205
Number of extensions: 10907170
Number of successful extensions: 26045
Number of sequences better than 10.0: 151
Number of HSP's better than 10.0 without gapping: 25232
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26029
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27291941472
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)