Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003757A_C01 KMC003757A_c01
(572 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_190813.1| putative protein; protein id: At3g52450.1 [Arab... 137 1e-32
gb|AAO64764.1| At2g35930 [Arabidopsis thaliana] 134 2e-31
ref|NP_181137.1| unknown protein; protein id: At2g35930.1 [Arabi... 134 2e-31
dbj|BAC01203.1| P0505D12.10 [Oryza sativa (japonica cultivar-gro... 116 6e-26
ref|NP_566402.1| expressed protein; protein id: At3g11840.1, sup... 100 1e-20
>ref|NP_190813.1| putative protein; protein id: At3g52450.1 [Arabidopsis thaliana]
gi|7486004|pir||T08454 hypothetical protein F22O6.170 -
Arabidopsis thaliana gi|4886282|emb|CAB43434.1| putative
protein [Arabidopsis thaliana]
Length = 435
Score = 137 bits (344), Expect(2) = 1e-32
Identities = 67/94 (71%), Positives = 83/94 (88%)
Frame = -2
Query: 571 LLDLLCQCAEGRAELLSHAAGLAVVSKKILRVSTVANDRAVRILLSVCRFSASPSVLQEM 392
+LD+LCQCAEGRAE L+H A +AVVSKKILRVS + ++RAVR+LLSV RF A+PS+LQEM
Sbjct: 324 VLDMLCQCAEGRAEFLNHGAAIAVVSKKILRVSQITSERAVRVLLSVGRFCATPSLLQEM 383
Query: 391 LKLGVVAKLCLVLQVDSGSQAKEKAREILKLHGK 290
L+LGVVAKLCLVLQV G++ KEKA+E+LKLH +
Sbjct: 384 LQLGVVAKLCLVLQVSCGNKTKEKAKELLKLHAR 417
Score = 24.3 bits (51), Expect(2) = 1e-32
Identities = 7/10 (70%), Positives = 8/10 (80%)
Frame = -3
Query: 291 RAWRHSPCIP 262
R WR SPC+P
Sbjct: 417 RVWRESPCVP 426
>gb|AAO64764.1| At2g35930 [Arabidopsis thaliana]
Length = 411
Score = 134 bits (337), Expect(2) = 2e-31
Identities = 67/94 (71%), Positives = 80/94 (84%)
Frame = -2
Query: 571 LLDLLCQCAEGRAELLSHAAGLAVVSKKILRVSTVANDRAVRILLSVCRFSASPSVLQEM 392
+LDLLCQCAEGRAE L+H A +AVV KKILRVS A+DRAVR+LLSV RF A+P++L EM
Sbjct: 300 VLDLLCQCAEGRAEFLNHGAAIAVVCKKILRVSQTASDRAVRVLLSVGRFCATPALLHEM 359
Query: 391 LKLGVVAKLCLVLQVDSGSQAKEKAREILKLHGK 290
L+LGVVAKLCLVLQV G + KEKA+E+LKLH +
Sbjct: 360 LQLGVVAKLCLVLQVSCGGKTKEKAKELLKLHAR 393
Score = 23.5 bits (49), Expect(2) = 2e-31
Identities = 7/15 (46%), Positives = 9/15 (59%)
Frame = -3
Query: 291 RAWRHSPCIPSTFAL 247
R W+ SPC+P L
Sbjct: 393 RVWKDSPCLPKNMIL 407
>ref|NP_181137.1| unknown protein; protein id: At2g35930.1 [Arabidopsis thaliana]
gi|25408431|pir||G84774 hypothetical protein At2g35930
[imported] - Arabidopsis thaliana
gi|4510376|gb|AAD21464.1| unknown protein [Arabidopsis
thaliana]
Length = 406
Score = 134 bits (337), Expect(2) = 2e-31
Identities = 67/94 (71%), Positives = 80/94 (84%)
Frame = -2
Query: 571 LLDLLCQCAEGRAELLSHAAGLAVVSKKILRVSTVANDRAVRILLSVCRFSASPSVLQEM 392
+LDLLCQCAEGRAE L+H A +AVV KKILRVS A+DRAVR+LLSV RF A+P++L EM
Sbjct: 295 VLDLLCQCAEGRAEFLNHGAAIAVVCKKILRVSQTASDRAVRVLLSVGRFCATPALLHEM 354
Query: 391 LKLGVVAKLCLVLQVDSGSQAKEKAREILKLHGK 290
L+LGVVAKLCLVLQV G + KEKA+E+LKLH +
Sbjct: 355 LQLGVVAKLCLVLQVSCGGKTKEKAKELLKLHAR 388
Score = 23.5 bits (49), Expect(2) = 2e-31
Identities = 7/15 (46%), Positives = 9/15 (59%)
Frame = -3
Query: 291 RAWRHSPCIPSTFAL 247
R W+ SPC+P L
Sbjct: 388 RVWKDSPCLPKNMIL 402
>dbj|BAC01203.1| P0505D12.10 [Oryza sativa (japonica cultivar-group)]
Length = 462
Score = 116 bits (291), Expect(2) = 6e-26
Identities = 59/95 (62%), Positives = 76/95 (79%), Gaps = 1/95 (1%)
Frame = -2
Query: 571 LLDLLCQCAEGRAELLSHAAGLAVVSKKILRVSTVANDRAVRILLSVCRFSASPSVLQEM 392
+LD LC CAEGRAEL++HAAG+AVV KK+LRVS A++RAVR+L SV R +A+P+VLQEM
Sbjct: 350 VLDRLCTCAEGRAELVAHAAGVAVVGKKVLRVSEAASERAVRVLRSVARHAATPAVLQEM 409
Query: 391 LKLGVVAKLCLVLQVDS-GSQAKEKAREILKLHGK 290
+ GVV KLCL L+ + G + KEKA E+LKLH +
Sbjct: 410 AQCGVVGKLCLALRSEQCGVKTKEKAHEVLKLHSR 444
Score = 22.3 bits (46), Expect(2) = 6e-26
Identities = 7/13 (53%), Positives = 9/13 (68%)
Frame = -3
Query: 291 RAWRHSPCIPSTF 253
R WR SPC+ +F
Sbjct: 444 RVWRASPCLSPSF 456
>ref|NP_566402.1| expressed protein; protein id: At3g11840.1, supported by cDNA:
100676. [Arabidopsis thaliana]
Length = 470
Score = 100 bits (249), Expect = 1e-20
Identities = 50/92 (54%), Positives = 67/92 (72%)
Frame = -2
Query: 571 LLDLLCQCAEGRAELLSHAAGLAVVSKKILRVSTVANDRAVRILLSVCRFSASPSVLQEM 392
+L LC CA GRAE+L+H G+AVV+K++LRVS A+DRA+ IL +V +FS V++EM
Sbjct: 351 VLSRLCCCANGRAEILAHRGGIAVVTKRLLRVSPAADDRAISILTTVSKFSPENMVVEEM 410
Query: 391 LKLGVVAKLCLVLQVDSGSQAKEKAREILKLH 296
+ +G V KLC VL +D G KEKA+EILK H
Sbjct: 411 VNVGTVEKLCSVLGMDCGLNLKEKAKEILKDH 442
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 510,207,595
Number of Sequences: 1393205
Number of extensions: 12167473
Number of successful extensions: 117339
Number of sequences better than 10.0: 2398
Number of HSP's better than 10.0 without gapping: 68914
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 97903
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21243732558
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)