Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC002949A_C01 KMC002949A_c01
(568 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Lycopers... 219 2e-56
gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] 212 2e-54
ref|NP_200219.1| homogentisate 1,2-dioxygenase; protein id: At5g... 212 2e-54
gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana] 211 7e-54
dbj|BAC22220.1| putative homogentisate 1,2-dioxygenase [Oryza sa... 182 3e-45
>gb|AAF73132.1|AF149017_1 homogentisate 1,2-dioxygenase [Lycopersicon esculentum]
Length = 477
Score = 219 bits (557), Expect = 2e-56
Identities = 94/112 (83%), Positives = 105/112 (92%)
Frame = -3
Query: 566 VAEHTFRPPYYHRNCMSEFMGLIHGGYEAKVDGFLPGGASLHSCMTPHGPDTKSYEATIA 387
VAEHTFRPPYYHRNCMSEFMGLI+GGYEAK DGF PGGASLHSCMTPHGPDTK++EATIA
Sbjct: 330 VAEHTFRPPYYHRNCMSEFMGLIYGGYEAKADGFHPGGASLHSCMTPHGPDTKTFEATIA 389
Query: 386 RGNDVGPNKITDTMAFMFESCLIPRISRWALESPFLDHDYYQCWIGLRSHFN 231
GN+ GP++I DTMAFMFESCL+PR+ WALESPF+DHDYYQCWIGL+SHF+
Sbjct: 390 LGNEAGPHRIADTMAFMFESCLVPRVCPWALESPFMDHDYYQCWIGLKSHFS 441
>gb|AAM65958.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
Length = 461
Score = 212 bits (540), Expect = 2e-54
Identities = 94/112 (83%), Positives = 102/112 (90%)
Frame = -3
Query: 566 VAEHTFRPPYYHRNCMSEFMGLIHGGYEAKVDGFLPGGASLHSCMTPHGPDTKSYEATIA 387
VAEHTFRPPYYHRNCMSEFMGLI+G YEAK DGFLPGGASLHSCMTPHGPDT +YEATIA
Sbjct: 330 VAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTTYEATIA 389
Query: 386 RGNDVGPNKITDTMAFMFESCLIPRISRWALESPFLDHDYYQCWIGLRSHFN 231
R N + P+K+T TMAFMFES LIPR+ WALESPFLDHDYYQCWIGL+SHF+
Sbjct: 390 RVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHFS 441
>ref|NP_200219.1| homogentisate 1,2-dioxygenase; protein id: At5g54080.1, supported
by cDNA: 6599., supported by cDNA: gi_4098646
[Arabidopsis thaliana] gi|13432134|sp|Q9ZRA2|HGD_ARATH
Homogentisate 1,2-dioxygenase (Homogentisicase)
(Homogentisate oxygenase) (Homogentisic acid oxidase)
gi|7108615|gb|AAF36499.1|AF130845_1 homogentisate
1,2-dioxygenase [Arabidopsis thaliana]
gi|8809579|dbj|BAA97130.1| homogentisate 1,2-dioxygenase
[Arabidopsis thaliana] gi|22655252|gb|AAM98216.1|
homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
Length = 461
Score = 212 bits (540), Expect = 2e-54
Identities = 94/112 (83%), Positives = 102/112 (90%)
Frame = -3
Query: 566 VAEHTFRPPYYHRNCMSEFMGLIHGGYEAKVDGFLPGGASLHSCMTPHGPDTKSYEATIA 387
VAEHTFRPPYYHRNCMSEFMGLI+G YEAK DGFLPGGASLHSCMTPHGPDT +YEATIA
Sbjct: 330 VAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTTYEATIA 389
Query: 386 RGNDVGPNKITDTMAFMFESCLIPRISRWALESPFLDHDYYQCWIGLRSHFN 231
R N + P+K+T TMAFMFES LIPR+ WALESPFLDHDYYQCWIGL+SHF+
Sbjct: 390 RVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHDYYQCWIGLKSHFS 441
>gb|AAD00360.1| homogentisate 1,2-dioxygenase [Arabidopsis thaliana]
Length = 461
Score = 211 bits (536), Expect = 7e-54
Identities = 93/112 (83%), Positives = 102/112 (91%)
Frame = -3
Query: 566 VAEHTFRPPYYHRNCMSEFMGLIHGGYEAKVDGFLPGGASLHSCMTPHGPDTKSYEATIA 387
VAEHTFRPPYYHRNCMSEFMGLI+G YEAK DGFLPGGASLHSCMTPHGPDT +YEATIA
Sbjct: 330 VAEHTFRPPYYHRNCMSEFMGLIYGAYEAKADGFLPGGASLHSCMTPHGPDTTTYEATIA 389
Query: 386 RGNDVGPNKITDTMAFMFESCLIPRISRWALESPFLDHDYYQCWIGLRSHFN 231
R N + P+K+T TMAFMFES LIPR+ WALESPFLDH+YYQCWIGL+SHF+
Sbjct: 390 RVNAMAPSKLTGTMAFMFESALIPRVCHWALESPFLDHEYYQCWIGLKSHFS 441
>dbj|BAC22220.1| putative homogentisate 1,2-dioxygenase [Oryza sativa (japonica
cultivar-group)]
Length = 485
Score = 182 bits (461), Expect = 3e-45
Identities = 85/125 (68%), Positives = 100/125 (80%), Gaps = 13/125 (10%)
Frame = -3
Query: 566 VAEHTFRPPYYHRNCMSEFMGLIHGGYE-------------AKVDGFLPGGASLHSCMTP 426
VAE+TFRPPYYHRNCMSEFMGLI+G YE AK DGFLPGGASLHSCMTP
Sbjct: 347 VAENTFRPPYYHRNCMSEFMGLIYGIYELVMIRTHLSELMQAKADGFLPGGASLHSCMTP 406
Query: 425 HGPDTKSYEATIARGNDVGPNKITDTMAFMFESCLIPRISRWALESPFLDHDYYQCWIGL 246
HGPDTK+YEATI+R + P++++ T+AFMFES LIPR+ +WAL+SP D DYYQCWIGL
Sbjct: 407 HGPDTKTYEATISRPDANEPSRLSGTLAFMFESALIPRVCQWALDSPSRDLDYYQCWIGL 466
Query: 245 RSHFN 231
+SHF+
Sbjct: 467 KSHFS 471
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 503,524,918
Number of Sequences: 1393205
Number of extensions: 11336231
Number of successful extensions: 25019
Number of sequences better than 10.0: 61
Number of HSP's better than 10.0 without gapping: 24067
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 24906
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20669577624
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)