Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000536A_C01 KMC000536A_c01
(984 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_201121.1| putative protein; protein id: At5g63160.1 [Arab... 115 1e-24
ref|NP_566902.1| putative protein; protein id: At3g48360.1, supp... 111 2e-23
pir||T06706 hypothetical protein T29H11.120 - Arabidopsis thalia... 111 2e-23
ref|NP_201549.1| putative protein; protein id: At5g67480.1, supp... 98 2e-19
dbj|BAB92571.1| P0497A05.15 [Oryza sativa (japonica cultivar-gro... 85 2e-15
>ref|NP_201121.1| putative protein; protein id: At5g63160.1 [Arabidopsis thaliana]
gi|10177297|dbj|BAB10558.1| contains similarity to
unknown protein~gene_id:MDC12.13~pir||T06706
[Arabidopsis thaliana]
Length = 365
Score = 115 bits (287), Expect = 1e-24
Identities = 57/102 (55%), Positives = 73/102 (70%), Gaps = 7/102 (6%)
Frame = -2
Query: 983 LDHICTEGCTLVGPHHVEVDRKS------GPCNKFATCQGVQQLIRHYATCTKRMGG-GC 825
++HICTEGCTLVGP +D KS GPC+ F+TC G+Q LIRH+A C KR+ G GC
Sbjct: 217 IEHICTEGCTLVGPSS-NLDNKSTCQAKPGPCSAFSTCYGLQLLIRHFAVCKKRVDGKGC 275
Query: 824 LRCKRMWQLFRLHSYGCEHADSCNVPLCRPFQYQNADRENEK 699
+RCKRM QL RLHS C+ ++SC VPLCR QY+N +++K
Sbjct: 276 VRCKRMIQLLRLHSSICDQSESCRVPLCR--QYKNRGEKDKK 315
>ref|NP_566902.1| putative protein; protein id: At3g48360.1, supported by cDNA:
gi_14532781, supported by cDNA: gi_19310816 [Arabidopsis
thaliana] gi|14532782|gb|AAK64172.1| unknown protein
[Arabidopsis thaliana] gi|19310817|gb|AAL85139.1|
unknown protein [Arabidopsis thaliana]
gi|23397078|gb|AAN31824.1| unknown protein [Arabidopsis
thaliana]
Length = 364
Score = 111 bits (277), Expect = 2e-23
Identities = 55/94 (58%), Positives = 65/94 (68%), Gaps = 9/94 (9%)
Frame = -2
Query: 983 LDHICTEGCTLVGPHHVEVDR--------KSGPCNKFATCQGVQQLIRHYATCTKRMGG- 831
++HICT+GCTLVGP +V VD KS PC F+TC G+Q LIRH+A C +R
Sbjct: 227 IEHICTQGCTLVGPSNV-VDNNKKSMTAEKSEPCKAFSTCYGLQLLIRHFAVCKRRNNDK 285
Query: 830 GCLRCKRMWQLFRLHSYGCEHADSCNVPLCRPFQ 729
GCLRCKRM QLFRLHS C+ DSC VPLCR F+
Sbjct: 286 GCLRCKRMLQLFRLHSLICDQPDSCRVPLCRQFR 319
>pir||T06706 hypothetical protein T29H11.120 - Arabidopsis thaliana
gi|4678352|emb|CAB41162.1| putative protein [Arabidopsis
thaliana]
Length = 367
Score = 111 bits (277), Expect = 2e-23
Identities = 55/94 (58%), Positives = 65/94 (68%), Gaps = 9/94 (9%)
Frame = -2
Query: 983 LDHICTEGCTLVGPHHVEVDR--------KSGPCNKFATCQGVQQLIRHYATCTKRMGG- 831
++HICT+GCTLVGP +V VD KS PC F+TC G+Q LIRH+A C +R
Sbjct: 230 IEHICTQGCTLVGPSNV-VDNNKKSMTAEKSEPCKAFSTCYGLQLLIRHFAVCKRRNNDK 288
Query: 830 GCLRCKRMWQLFRLHSYGCEHADSCNVPLCRPFQ 729
GCLRCKRM QLFRLHS C+ DSC VPLCR F+
Sbjct: 289 GCLRCKRMLQLFRLHSLICDQPDSCRVPLCRQFR 322
>ref|NP_201549.1| putative protein; protein id: At5g67480.1, supported by cDNA:
gi_15529177, supported by cDNA: gi_17386119 [Arabidopsis
thaliana] gi|9757869|dbj|BAB08456.1|
gene_id:K9I9.4~pir||T04718~strong similarity to unknown
protein [Arabidopsis thaliana]
gi|15529178|gb|AAK97683.1| AT5g67480/K9I9_4 [Arabidopsis
thaliana] gi|17386120|gb|AAL38606.1|AF446873_1
AT5g67480/K9I9_4 [Arabidopsis thaliana]
Length = 372
Score = 97.8 bits (242), Expect = 2e-19
Identities = 42/95 (44%), Positives = 60/95 (62%)
Frame = -2
Query: 983 LDHICTEGCTLVGPHHVEVDRKSGPCNKFATCQGVQQLIRHYATCTKRMGGGCLRCKRMW 804
L HIC +GC +GPH + CN + C+G++ LIRH+A C R+ GGC+ CKRMW
Sbjct: 250 LVHICRDGCKTIGPHDKDFKPNHATCN-YEACKGLESLIRHFAGCKLRVPGGCVHCKRMW 308
Query: 803 QLFRLHSYGCEHADSCNVPLCRPFQYQNADRENEK 699
QL LHS C +D C VPLCR + + +++++K
Sbjct: 309 QLLELHSRVCAGSDQCRVPLCRNLK-EKMEKQSKK 342
>dbj|BAB92571.1| P0497A05.15 [Oryza sativa (japonica cultivar-group)]
gi|20804925|dbj|BAB92604.1| P0456E05.3 [Oryza sativa
(japonica cultivar-group)]
Length = 347
Score = 85.1 bits (209), Expect = 2e-15
Identities = 45/97 (46%), Positives = 59/97 (60%), Gaps = 2/97 (2%)
Frame = -2
Query: 983 LDHICTEGCTLVGPHHVEVDRKSGPCNKFAT-CQGVQQLIRHYATCTKRMGGGCLRCKRM 807
L HICTEGCT VGP V + PC +AT C+G+Q LIRH++ C + C RC+RM
Sbjct: 217 LSHICTEGCTEVGP--VGRAPAAAPCPAYATACRGLQLLIRHFSRCHRT---SCPRCQRM 271
Query: 806 WQLFRLHSYGCEHADS-CNVPLCRPFQYQNADRENEK 699
WQL RLH+ C+ D CN PLC F+ + ++ K
Sbjct: 272 WQLLRLHAALCDLPDGHCNTPLCMQFRRKEEEKAAAK 308
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 865,487,493
Number of Sequences: 1393205
Number of extensions: 19273477
Number of successful extensions: 57077
Number of sequences better than 10.0: 49
Number of HSP's better than 10.0 without gapping: 53514
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 57011
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 56574306528
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)