Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC005302A_C01 KMC005302A_c01
(521 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_194356.1| putative protein; protein id: At4g26260.1 [Arab... 173 2e-42
gb|AAN13052.1| unknown protein [Arabidopsis thaliana] 173 2e-42
ref|NP_565459.1| expressed protein; protein id: At2g19800.1, sup... 169 1e-41
gb|AAM63498.1| unknown [Arabidopsis thaliana] 167 7e-41
gb|AAF43953.1|AC012188_30 Strong similarity to an unknown protei... 162 2e-39
>ref|NP_194356.1| putative protein; protein id: At4g26260.1 [Arabidopsis thaliana]
gi|7487428|pir||T06010 hypothetical protein T25K17.70 -
Arabidopsis thaliana gi|4539422|emb|CAB38955.1| putative
protein [Arabidopsis thaliana]
gi|7269477|emb|CAB79481.1| putative protein [Arabidopsis
thaliana]
Length = 318
Score = 173 bits (438), Expect(2) = 2e-42
Identities = 80/106 (75%), Positives = 91/106 (85%)
Frame = +3
Query: 204 DINSYGKSFRDYYGESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECCE 383
++N++G+ FRDY ESERQK VEE YRLQHINQT +F K MR EYGKL+K M IWECCE
Sbjct: 50 EMNAFGRQFRDYDVESERQKGVEEFYRLQHINQTVDFVKKMRAEYGKLDKMVMSIWECCE 109
Query: 384 LLDEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
LL+E+VD SDPDL+E QI+H LQSAEAIRKDYPNEDWLHLTALIHD
Sbjct: 110 LLNEVVDESDPDLDEPQIQHLLQSAEAIRKDYPNEDWLHLTALIHD 155
Score = 21.2 bits (43), Expect(2) = 2e-42
Identities = 10/21 (47%), Positives = 14/21 (66%), Gaps = 2/21 (9%)
Frame = +2
Query: 146 LDGGFPLPKPFSS--AGFIAP 202
LDGGF +PK ++ F+AP
Sbjct: 29 LDGGFSMPKMDTNDDEAFLAP 49
>gb|AAN13052.1| unknown protein [Arabidopsis thaliana]
Length = 317
Score = 173 bits (438), Expect(2) = 2e-42
Identities = 80/106 (75%), Positives = 91/106 (85%)
Frame = +3
Query: 204 DINSYGKSFRDYYGESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECCE 383
++N++G+ FRDY ESERQK VEE YRLQHINQT +F K MR EYGKL+K M IWECCE
Sbjct: 49 EMNAFGRQFRDYDVESERQKGVEEFYRLQHINQTVDFVKKMRAEYGKLDKMVMSIWECCE 108
Query: 384 LLDEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
LL+E+VD SDPDL+E QI+H LQSAEAIRKDYPNEDWLHLTALIHD
Sbjct: 109 LLNEVVDESDPDLDEPQIQHLLQSAEAIRKDYPNEDWLHLTALIHD 154
Score = 21.2 bits (43), Expect(2) = 2e-42
Identities = 10/21 (47%), Positives = 14/21 (66%), Gaps = 2/21 (9%)
Frame = +2
Query: 146 LDGGFPLPKPFSS--AGFIAP 202
LDGGF +PK ++ F+AP
Sbjct: 28 LDGGFSMPKMDTNDDEAFLAP 48
>ref|NP_565459.1| expressed protein; protein id: At2g19800.1, supported by cDNA:
254633. [Arabidopsis thaliana]
gi|20197290|gb|AAC62136.2| expressed protein
[Arabidopsis thaliana]
Length = 317
Score = 169 bits (429), Expect = 1e-41
Identities = 79/107 (73%), Positives = 92/107 (85%), Gaps = 1/107 (0%)
Frame = +3
Query: 204 DINSYGKSFRDYY-GESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECC 380
D+N G SFRDY GESERQ+ VEE YR+QHI+QTY+F K MR+EYGKLNK EM IWECC
Sbjct: 48 DMNFLGHSFRDYENGESERQQGVEEFYRMQHIHQTYDFVKKMRKEYGKLNKMEMSIWECC 107
Query: 381 ELLDEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
ELL+ +VD SDPDL+E QI+H LQ+AEAIR+DYP+EDWLHLTALIHD
Sbjct: 108 ELLNNVVDESDPDLDEPQIQHLLQTAEAIRRDYPDEDWLHLTALIHD 154
>gb|AAM63498.1| unknown [Arabidopsis thaliana]
Length = 317
Score = 167 bits (423), Expect = 7e-41
Identities = 78/107 (72%), Positives = 91/107 (84%), Gaps = 1/107 (0%)
Frame = +3
Query: 204 DINSYGKSFRDYYG-ESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECC 380
D+N G SFRDY ESERQ+ VEE YR+QHI+QTY+F K MR+EYGKLNK EM IWECC
Sbjct: 48 DMNFLGHSFRDYENDESERQQGVEEFYRMQHIHQTYDFVKKMRKEYGKLNKMEMSIWECC 107
Query: 381 ELLDEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
ELL+ +VD SDPDL+E QI+H LQ+AEAIR+DYP+EDWLHLTALIHD
Sbjct: 108 ELLNNVVDESDPDLDEPQIQHLLQTAEAIRRDYPDEDWLHLTALIHD 154
>gb|AAF43953.1|AC012188_30 Strong similarity to an unknown protein from Arabidopsis thaliana
gb|AL049171.1
Length = 422
Score = 162 bits (410), Expect = 2e-39
Identities = 72/104 (69%), Positives = 87/104 (83%)
Frame = +3
Query: 210 NSYGKSFRDYYGESERQKSVEELYRLQHINQTYEFAKTMREEYGKLNKGEMGIWECCELL 389
NS+G++FRDY ESER++ VEE YR+ HI QT +F + MREEY KLN+ EM IWECCELL
Sbjct: 43 NSFGRTFRDYDAESERRRGVEEFYRVNHIGQTVDFVRKMREEYEKLNRTEMSIWECCELL 102
Query: 390 DEIVDASDPDLEESQIKHALQSAEAIRKDYPNEDWLHLTALIHD 521
+E +D SDPDL+E QI+H LQ+AEAIRKDYP+EDWLHLT LIHD
Sbjct: 103 NEFIDESDPDLDEPQIEHLLQTAEAIRKDYPDEDWLHLTGLIHD 146
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 487,817,737
Number of Sequences: 1393205
Number of extensions: 11254940
Number of successful extensions: 28694
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 27791
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28663
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 16731298976
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)