Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC001773A_C01 KMC001773A_c01
(580 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAG13433.1|AC051634_14 unknown protein [Oryza sativa (japonic... 278 4e-74
ref|NP_192817.1| hypothetical protein; protein id: At4g10790.1 [... 266 1e-70
pir||T01898 hypothetical protein T12H20.9 - Arabidopsis thaliana... 266 1e-70
ref|NP_567675.1| putative protein; protein id: At4g23040.1, supp... 97 1e-19
pir||T05136 hypothetical protein F7H19.230 - Arabidopsis thalian... 94 9e-19
>gb|AAG13433.1|AC051634_14 unknown protein [Oryza sativa (japonica cultivar-group)]
Length = 466
Score = 278 bits (711), Expect = 4e-74
Identities = 141/181 (77%), Positives = 163/181 (89%)
Frame = -1
Query: 580 EESSPTLVAARLDAEERRNNIRLREEQDAAYRAALEADQARERQRREEQERLAREAAEAE 401
EE S +LVAAR+DAEER NN RLREEQDAAYRAALEADQARERQRREEQE+ REAAEAE
Sbjct: 284 EECSASLVAARIDAEERLNNQRLREEQDAAYRAALEADQARERQRREEQEKREREAAEAE 343
Query: 400 RKRKEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGPNVTQVLVRFPTGERKER 221
RKRKEEEEA+ER A+EAAEK+AALA+ RQ+KA +LG EP KGP+VT+VL+RFPTGERKER
Sbjct: 344 RKRKEEEEAQERAAQEAAEKEAALARRRQEKAMALGAEPEKGPDVTRVLIRFPTGERKER 403
Query: 220 RFNNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGLHPQASLFVE 41
RFN++ TI S+YDYVDSL CL+A+ YSLVSNFPRV YG EKL+ +L+EAGLHPQASLF+E
Sbjct: 404 RFNSSTTITSLYDYVDSLDCLKAEKYSLVSNFPRVTYGPEKLSQTLEEAGLHPQASLFIE 463
Query: 40 L 38
+
Sbjct: 464 I 464
>ref|NP_192817.1| hypothetical protein; protein id: At4g10790.1 [Arabidopsis
thaliana] gi|25407518|pir||H85112 hypothetical protein
AT4g10790 [imported] - Arabidopsis thaliana
gi|7267777|emb|CAB81180.1| predicted protein of unknown
function [Arabidopsis thaliana]
Length = 480
Score = 266 bits (681), Expect = 1e-70
Identities = 132/182 (72%), Positives = 160/182 (87%)
Frame = -1
Query: 580 EESSPTLVAARLDAEERRNNIRLREEQDAAYRAALEADQARERQRREEQERLAREAAEAE 401
E+SSPTLV AR++AEERR N+RLREEQDAAYRAALEADQARE+QR+EE+ERL REAAEAE
Sbjct: 299 EDSSPTLVTARVEAEERRTNLRLREEQDAAYRAALEADQAREQQRQEEKERLEREAAEAE 358
Query: 400 RKRKEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGPNVTQVLVRFPTGERKER 221
RK KEEEEARER AREA E+QAA ++RQ+KA +LGEEP KGP+VTQVLVRFP GERK R
Sbjct: 359 RKLKEEEEARERAAREAEERQAARVRMRQEKALALGEEPEKGPDVTQVLVRFPNGERKGR 418
Query: 220 RFNNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGLHPQASLFVE 41
F + IQ++YDYVDSLG L+ + YSL++NFPR VYG++K ++SLK+AGLHPQASLF+E
Sbjct: 419 MFKSETKIQTLYDYVDSLGLLDTEEYSLITNFPRTVYGRDKESMSLKDAGLHPQASLFIE 478
Query: 40 LS 35
++
Sbjct: 479 IN 480
>pir||T01898 hypothetical protein T12H20.9 - Arabidopsis thaliana
gi|3600032|gb|AAC35520.1| contains similarity to
tropomyosin (Pfam: Tropomyosin.hmm, score: 14.57) and
ATP synthase (Pfam: ATP-synt_B.hmm, score: 10.89)
[Arabidopsis thaliana]
Length = 466
Score = 266 bits (681), Expect = 1e-70
Identities = 132/182 (72%), Positives = 160/182 (87%)
Frame = -1
Query: 580 EESSPTLVAARLDAEERRNNIRLREEQDAAYRAALEADQARERQRREEQERLAREAAEAE 401
E+SSPTLV AR++AEERR N+RLREEQDAAYRAALEADQARE+QR+EE+ERL REAAEAE
Sbjct: 285 EDSSPTLVTARVEAEERRTNLRLREEQDAAYRAALEADQAREQQRQEEKERLEREAAEAE 344
Query: 400 RKRKEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGPNVTQVLVRFPTGERKER 221
RK KEEEEARER AREA E+QAA ++RQ+KA +LGEEP KGP+VTQVLVRFP GERK R
Sbjct: 345 RKLKEEEEARERAAREAEERQAARVRMRQEKALALGEEPEKGPDVTQVLVRFPNGERKGR 404
Query: 220 RFNNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGLHPQASLFVE 41
F + IQ++YDYVDSLG L+ + YSL++NFPR VYG++K ++SLK+AGLHPQASLF+E
Sbjct: 405 MFKSETKIQTLYDYVDSLGLLDTEEYSLITNFPRTVYGRDKESMSLKDAGLHPQASLFIE 464
Query: 40 LS 35
++
Sbjct: 465 IN 466
>ref|NP_567675.1| putative protein; protein id: At4g23040.1, supported by cDNA:
gi_13430703 [Arabidopsis thaliana]
gi|13430704|gb|AAK25974.1|AF360264_1 unknown protein
[Arabidopsis thaliana] gi|23296844|gb|AAN13184.1|
unknown protein [Arabidopsis thaliana]
Length = 525
Score = 97.4 bits (241), Expect = 1e-19
Identities = 67/180 (37%), Positives = 97/180 (53%), Gaps = 2/180 (1%)
Frame = -1
Query: 571 SPTLVAARLDAEERRNNIRLREEQDAAYRAALEADQARERQRREEQERLAREAAEAERKR 392
SP+L A RL +RE+QD Y A+LEAD+ + RR E+E EA E E KR
Sbjct: 362 SPSLTAQRL----------IREQQDDEYLASLEADRVKAEARRLEEEAARVEAIE-EAKR 410
Query: 391 KEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGP-NVTQVLVRFPTGERKERRF 215
KEEE R+ E + E+Q K SL +EP G N + VR P G R RRF
Sbjct: 411 KEEEARRKVEEEQELERQLV------SKEASLPQEPPAGEENAITLQVRLPDGTRHGRRF 464
Query: 214 NNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGL-HPQASLFVEL 38
+ +QS++D++D ++ ++Y LV +PR +G + + +L + GL Q +LF+EL
Sbjct: 465 FKSDKLQSLFDFIDICRVVKPNTYRLVRPYPRRAFGDGECSSTLNDIGLTSKQEALFLEL 524
>pir||T05136 hypothetical protein F7H19.230 - Arabidopsis thaliana
gi|3292830|emb|CAA19820.1| putative protein [Arabidopsis
thaliana] gi|7269151|emb|CAB79259.1| putative protein
[Arabidopsis thaliana]
Length = 577
Score = 94.4 bits (233), Expect = 9e-19
Identities = 66/187 (35%), Positives = 100/187 (53%), Gaps = 9/187 (4%)
Frame = -1
Query: 571 SPTLVAARLDAEERRNN-------IRLREEQDAAYRAALEADQARERQRREEQERLAREA 413
SP+L A RL E++ + ++ + QD Y A+LEAD+ + RR E+E EA
Sbjct: 397 SPSLTAQRLIREQQDTDDDEFLFLLKCKLVQDDEYLASLEADRVKAEARRLEEEAARVEA 456
Query: 412 AEAERKRKEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGP-NVTQVLVRFPTG 236
E E KRKEEE R+ E + E+Q K SL +EP G N + VR P G
Sbjct: 457 IE-EAKRKEEEARRKVEEEQELERQLV------SKEASLPQEPPAGEENAITLQVRLPDG 509
Query: 235 ERKERRFNNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGL-HPQ 59
R RRF + +QS++D++D ++ ++Y LV +PR +G + + +L + GL Q
Sbjct: 510 TRHGRRFFKSDKLQSLFDFIDICRVVKPNTYRLVRPYPRRAFGDGECSSTLNDIGLTSKQ 569
Query: 58 ASLFVEL 38
+LF+EL
Sbjct: 570 EALFLEL 576
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 474,285,816
Number of Sequences: 1393205
Number of extensions: 10008772
Number of successful extensions: 105540
Number of sequences better than 10.0: 5053
Number of HSP's better than 10.0 without gapping: 54788
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 82245
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21426319650
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)