Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003917A_C01 KMC003917A_c01
(622 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_187801.1| hypothetical protein; protein id: At3g11950.1 [... 162 3e-39
dbj|BAB03104.1| dbj|BAA17774.1~gene_id:MEC18.5~similar to unknow... 117 1e-25
ref|NP_179485.2| hypothetical protein; protein id: At2g18950.1, ... 77 1e-13
ref|ZP_00070982.1| hypothetical protein [Trichodesmium erythraeu... 76 4e-13
gb|ZP_00107768.1| hypothetical protein [Nostoc punctiforme] 74 2e-12
>ref|NP_187801.1| hypothetical protein; protein id: At3g11950.1 [Arabidopsis thaliana]
Length = 970
Score = 162 bits (411), Expect = 3e-39
Identities = 83/124 (66%), Positives = 96/124 (76%), Gaps = 16/124 (12%)
Frame = -3
Query: 620 ITKDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMPQ-------- 465
ITKDLPDVEGDRK+QIST ATKLGVRNIAFLGSG+LL+NY+ +I A YMPQ
Sbjct: 847 ITKDLPDVEGDRKFQISTLATKLGVRNIAFLGSGLLLVNYVSAISLAFYMPQYAALKRPT 906
Query: 464 --------AFRRWLMIPAHMILASSLIYQVKILEQANHTKEAISGFYQFIWNLFYAEYAL 309
FR LMIPAH+ILAS LI+Q +LE+AN+TKEAISG+Y+FIWNLFYAEY L
Sbjct: 907 LLSFNNEQVFRGSLMIPAHVILASGLIFQTWVLEKANYTKEAISGYYRFIWNLFYAEYLL 966
Query: 308 FPFI 297
FPF+
Sbjct: 967 FPFL 970
>dbj|BAB03104.1| dbj|BAA17774.1~gene_id:MEC18.5~similar to unknown protein
[Arabidopsis thaliana]
Length = 441
Score = 117 bits (293), Expect = 1e-25
Identities = 63/100 (63%), Positives = 73/100 (73%), Gaps = 16/100 (16%)
Frame = -3
Query: 620 ITKDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMP--------- 468
ITKDLPDVEGDRK+QIST ATKLGVRNIAFLGSG+LL+NY+ +I A YMP
Sbjct: 295 ITKDLPDVEGDRKFQISTLATKLGVRNIAFLGSGLLLVNYVSAISLAFYMPQYAALKRPT 354
Query: 467 -------QAFRRWLMIPAHMILASSLIYQVKILEQANHTK 369
Q FR LMIPAH+ILAS LI+Q +LE+AN+TK
Sbjct: 355 LLSFNNEQVFRGSLMIPAHVILASGLIFQTWVLEKANYTK 394
>ref|NP_179485.2| hypothetical protein; protein id: At2g18950.1, supported by cDNA:
gi_17104827, supported by cDNA: gi_17380873, supported
by cDNA: gi_20384918, supported by cDNA: gi_21281071
[Arabidopsis thaliana]
gi|17104828|gb|AAL35412.1|AF324344_1 tocopherol
polyprenyltransferase [Arabidopsis thaliana]
gi|17380874|gb|AAL36249.1| unknown protein [Arabidopsis
thaliana] gi|20384919|gb|AAM10489.1| homogentisate
phytylprenyltransferase [Arabidopsis thaliana]
gi|21281072|gb|AAM45041.1| unknown protein [Arabidopsis
thaliana]
Length = 393
Score = 77.4 bits (189), Expect = 1e-13
Identities = 39/106 (36%), Positives = 65/106 (60%)
Frame = -3
Query: 614 KDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMPQAFRRWLMIPA 435
KD+PD+EGD+ + I +F+ LG + + + +L M Y V+IL P + + + +
Sbjct: 289 KDIPDIEGDKIFGIRSFSVTLGQKRVFWTCVTLLQMAYAVAILVGATSPFIWSKVISVVG 348
Query: 434 HMILASSLIYQVKILEQANHTKEAISGFYQFIWNLFYAEYALFPFI 297
H+ILA++L + K ++ ++ T+ I+ Y FIW LFYAEY L PF+
Sbjct: 349 HVILATTLWARAKSVDLSSKTE--ITSCYMFIWKLFYAEYLLLPFL 392
>ref|ZP_00070982.1| hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 349
Score = 75.9 bits (185), Expect = 4e-13
Identities = 36/106 (33%), Positives = 62/106 (57%)
Frame = -3
Query: 620 ITKDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMPQAFRRWLMI 441
I KD+PD+EGDR+Y I+TF KLG + L +L Y+ ++ + + + ++
Sbjct: 241 IFKDIPDIEGDRQYNINTFTIKLGAFAVFNLARWVLTFCYLGMVMVGVVWLASVNLFFLV 300
Query: 440 PAHMILASSLIYQVKILEQANHTKEAISGFYQFIWNLFYAEYALFP 303
+H++ + + + ++ H K+AI+ FYQFIW LF+ EY +FP
Sbjct: 301 ISHLLALGIMWWFSQRVDL--HDKKAIADFYQFIWKLFFLEYLIFP 344
>gb|ZP_00107768.1| hypothetical protein [Nostoc punctiforme]
Length = 322
Score = 73.9 bits (180), Expect = 2e-12
Identities = 42/117 (35%), Positives = 64/117 (53%)
Frame = -3
Query: 620 ITKDLPDVEGDRKYQISTFATKLGVRNIAFLGSGILLMNYIVSILAAIYMPQAFRRWLMI 441
I KD+PD+EGDR Y I+TF KLG + + L ++ + Y+ IL + + +I
Sbjct: 213 IFKDIPDIEGDRLYNITTFTIKLGSQAVFNLALWVITVCYLGIILVGVLRIASVNPIFLI 272
Query: 440 PAHMILASSLIYQVKILEQANHTKEAISGFYQFIWNLFYAEYALFPFI**TAAQCFL 270
AH+ L + ++ ++ + K AI+ FYQFIW LF+ EY +FP CFL
Sbjct: 273 TAHLALLVWMWWRSLAVDLQD--KSAIAQFYQFIWKLFFIEYLIFPI------ACFL 321
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 483,835,859
Number of Sequences: 1393205
Number of extensions: 9611271
Number of successful extensions: 19573
Number of sequences better than 10.0: 32
Number of HSP's better than 10.0 without gapping: 19154
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 19565
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25017613016
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)