Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC008331A_C02 KMC008331A_c02
(697 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
ref|NP_568280.1| putative protein; protein id: At5g12470.1, supp...   165  5e-40
dbj|BAB92870.1| OJ1294_F06.10 [Oryza sativa (japonica cultivar-g...    84  2e-15
ref|NP_191173.2| chloroplast lumen common protein family; protei...    78  1e-13
pir||T47731 hypothetical protein F18O21.100 - Arabidopsis thalia...    78  1e-13
ref|NP_565930.1| chloroplast lumen common protein family; protei...    76  4e-13
>ref|NP_568280.1| putative protein; protein id: At5g12470.1, supported by cDNA:
gi_20268751, supported by cDNA: gi_21281148 [Arabidopsis
thaliana] gi|14586377|emb|CAC42908.1| putative protein
[Arabidopsis thaliana] gi|20268752|gb|AAM14079.1|
unknown protein [Arabidopsis thaliana]
gi|21281149|gb|AAM45049.1| unknown protein [Arabidopsis
thaliana] gi|27311697|gb|AAO00814.1| putative protein
[Arabidopsis thaliana]
Length = 386
Score = 165 bits (418), Expect = 5e-40
Identities = 78/104 (75%), Positives = 92/104 (88%)
Frame = -2
Query: 696 ASLIGTGVTNGLINARKVVDKSFADEAEDIPIVSTSIAYGVYMAVSSNLRYQILAGVIEQ 517
           +SL+GT +TN  I ARK VD++   E E +PIVSTS+AYGVYMAVSSNLRYQI+AGVIEQ
Sbjct: 281 SSLVGTAITNAFIKARKAVDQNSEGEVETVPIVSTSVAYGVYMAVSSNLRYQIVAGVIEQ 340

Query: 516 RILEPLLHKHKLMLSAICFAVRTGNTFLGSLLWVDYARWVGVQK 385
           R+LEP+LH+HKL LSA+CFAVRTGNTFLGSLLWVDYAR +G+QK
Sbjct: 341 RLLEPMLHQHKLALSALCFAVRTGNTFLGSLLWVDYARLIGIQK 384
>dbj|BAB92870.1| OJ1294_F06.10 [Oryza sativa (japonica cultivar-group)]
Length = 784
Score = 83.6 bits (205), Expect = 2e-15
Identities = 42/62 (67%), Positives = 48/62 (76%)
Frame = -2
Query: 696 ASLIGTGVTNGLINARKVVDKSFADEAEDIPIVSTSIAYGVYMAVSSNLRYQILAGVIEQ 517
           ASLIGTGVTN LI ARK VDK   DE EDIP++STS+AYGVYMAVSSNLR      +++Q
Sbjct: 219 ASLIGTGVTNALIKARKAVDKELDDEVEDIPVLSTSVAYGVYMAVSSNLRRPSFPPLVQQ 278

Query: 516 RI 511
            I
Sbjct: 279 PI 280
>ref|NP_191173.2| chloroplast lumen common protein family; protein id: At3g56140.1,
supported by cDNA: gi_20260423 [Arabidopsis thaliana]
gi|20260424|gb|AAM13110.1| putative protein [Arabidopsis
thaliana]
Length = 745
Score = 78.2 bits (191), Expect = 1e-13
Identities = 41/105 (39%), Positives = 66/105 (62%), Gaps = 2/105 (1%)
Frame = -2
Query: 696 ASLIGTGVTNGLINARKVVDKSF--ADEAEDIPIVSTSIAYGVYMAVSSNLRYQILAGVI 523
           +S    G +N L  ARKV+      A++ +  P++ T++ YG ++  S+NLRYQI+AG+I
Sbjct: 605 SSFAAVGASNALNIARKVIKPELVVAEKPKRSPLLKTAMVYGGFLGTSANLRYQIIAGLI 664

Query: 522 EQRILEPLLHKHKLMLSAICFAVRTGNTFLGSLLWVDYARWVGVQ 388
           E R+ +  L    L+++AI F VRT N++ G+  W+D AR  G+Q
Sbjct: 665 EHRLSDE-LSSQPLLVNAISFVVRTLNSYFGTQQWIDLARSTGLQ 708
>pir||T47731 hypothetical protein F18O21.100 - Arabidopsis thaliana
gi|7572912|emb|CAB87413.1| putative protein [Arabidopsis
thaliana]
Length = 755
Score = 78.2 bits (191), Expect = 1e-13
Identities = 41/105 (39%), Positives = 66/105 (62%), Gaps = 2/105 (1%)
Frame = -2
Query: 696 ASLIGTGVTNGLINARKVVDKSF--ADEAEDIPIVSTSIAYGVYMAVSSNLRYQILAGVI 523
           +S    G +N L  ARKV+      A++ +  P++ T++ YG ++  S+NLRYQI+AG+I
Sbjct: 615 SSFAAVGASNALNIARKVIKPELVVAEKPKRSPLLKTAMVYGGFLGTSANLRYQIIAGLI 674

Query: 522 EQRILEPLLHKHKLMLSAICFAVRTGNTFLGSLLWVDYARWVGVQ 388
           E R+ +  L    L+++AI F VRT N++ G+  W+D AR  G+Q
Sbjct: 675 EHRLSDE-LSSQPLLVNAISFVVRTLNSYFGTQQWIDLARSTGLQ 718
>ref|NP_565930.1| chloroplast lumen common protein family; protein id: At2g40400.1,
supported by cDNA: gi_15294187, supported by cDNA:
gi_20857081 [Arabidopsis thaliana]
gi|25344247|pir||A84829 hypothetical protein At2g40400
[imported] - Arabidopsis thaliana
gi|4586056|gb|AAD25674.1| chloroplast lumen common
protein family [Arabidopsis thaliana]
gi|15294188|gb|AAK95271.1|AF410285_1 At2g40400/T3G21.17
[Arabidopsis thaliana] gi|20857082|gb|AAM26698.1|
At2g40400/T3G21.17 [Arabidopsis thaliana]
Length = 735
Score = 76.3 bits (186), Expect = 4e-13
Identities = 39/105 (37%), Positives = 63/105 (59%), Gaps = 2/105 (1%)
Frame = -2
Query: 696 ASLIGTGVTNGLINARKVV--DKSFADEAEDIPIVSTSIAYGVYMAVSSNLRYQILAGVI 523
           +S    G +N L   RK +  +    ++A+  P++ T++ YG Y+  SSN+RYQI+AG+I
Sbjct: 596 SSFAAVGSSNALYAIRKFIKPELGVGEQAKRSPMLKTALVYGGYLGTSSNIRYQIIAGLI 655

Query: 522 EQRILEPLLHKHKLMLSAICFAVRTGNTFLGSLLWVDYARWVGVQ 388
           E RI +  L    L+++ I F VR  N++ G+  W+D AR  G+Q
Sbjct: 656 EHRISDE-LSSQPLLVNMISFVVRVANSYFGTQQWIDLARSTGLQ 699
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda     K      H
   0.318    0.135    0.401
Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 624,456,915
Number of Sequences: 1393205
Number of extensions: 14013111
Number of successful extensions: 35191
Number of sequences better than 10.0: 18
Number of HSP's better than 10.0 without gapping: 33375
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 35171
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 31684559424
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)