Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC002287A_C01 KMC002287A_c01
(759 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAO42013.1| unknown protein [Arabidopsis thaliana] 66 5e-10
ref|NP_197871.1| putative protein; protein id: At5g24890.1, supp... 65 1e-09
gb|AAO53442.1| putative KID-containing protein [Brassica napus] 55 1e-06
gb|AAL06484.1|AF411794_1 At2g24550/F25P17.15 [Arabidopsis thalia... 42 0.007
gb|EAA31256.1| predicted protein [Neurospora crassa] 40 0.036
>gb|AAO42013.1| unknown protein [Arabidopsis thaliana]
Length = 240
Score = 66.2 bits (160), Expect = 5e-10
Identities = 44/116 (37%), Positives = 60/116 (50%), Gaps = 6/116 (5%)
Frame = -3
Query: 754 IASKWSRKSSFYSWSNPQSMPLFPVTEDLDDDYEEEEEEEDAEKARKVPSASSSSSSSLA 575
I +K +RKS FYSW NP+SMPL PV ED DDD EE++EE+ S
Sbjct: 149 ICNKLARKS-FYSWQNPKSMPLLPVNEDEDDDDEEDDEED--------------LKSGFD 193
Query: 574 EEKKQEDPVQMRHNRIPESYAAHMRLRLGSFKSRS------FSLADLQEHDDEEED 425
E K D E + +R GSFK+R+ F+L+DL E +D+++D
Sbjct: 194 ENKSSSD----------EEGVKKVVVRKGSFKNRAYKSRSCFALSDLIEEEDDDDD 239
>ref|NP_197871.1| putative protein; protein id: At5g24890.1, supported by cDNA:
42528. [Arabidopsis thaliana] gi|21593751|gb|AAM65718.1|
unknown [Arabidopsis thaliana]
Length = 240
Score = 65.1 bits (157), Expect = 1e-09
Identities = 43/116 (37%), Positives = 60/116 (51%), Gaps = 6/116 (5%)
Frame = -3
Query: 754 IASKWSRKSSFYSWSNPQSMPLFPVTEDLDDDYEEEEEEEDAEKARKVPSASSSSSSSLA 575
I +K +RKS FYSW NP+SMPL PV ED DDD E+++EE+ S
Sbjct: 149 ICNKLARKS-FYSWQNPKSMPLLPVNEDEDDDDEDDDEED--------------LKSGFD 193
Query: 574 EEKKQEDPVQMRHNRIPESYAAHMRLRLGSFKSRS------FSLADLQEHDDEEED 425
E K D E + +R GSFK+R+ F+L+DL E +D+++D
Sbjct: 194 ENKSSSD----------EEGVKKVVVRKGSFKNRAYKSRSCFALSDLIEEEDDDDD 239
>gb|AAO53442.1| putative KID-containing protein [Brassica napus]
Length = 215
Score = 54.7 bits (130), Expect = 1e-06
Identities = 42/117 (35%), Positives = 59/117 (49%), Gaps = 6/117 (5%)
Frame = -3
Query: 754 IASKWSRKSSFYSWSNPQSMPLFPVTEDLDDDYEEEEEEEDAEKARKVPSASSSSSSSLA 575
I +K +RKS FYSW NP+SMPL PV ED DD EE +D + L+
Sbjct: 136 IYNKLARKS-FYSWQNPKSMPLLPVHEDNDD-----EEGDDGD---------------LS 174
Query: 574 EEKKQEDPVQMRHNRIPESYAAHMRLRLGSFKSRS------FSLADLQEHDDEEEDD 422
+E++ D + R SFK+R+ F+L+DLQE ++EEED+
Sbjct: 175 DEERGGDVLARRP----------------SFKNRALKSMSCFALSDLQEEEEEEEDE 215
>gb|AAL06484.1|AF411794_1 At2g24550/F25P17.15 [Arabidopsis thaliana]
gi|20466816|gb|AAM20725.1| unknown protein [Arabidopsis
thaliana] gi|23198218|gb|AAN15636.1| unknown protein
[Arabidopsis thaliana]
Length = 245
Score = 42.4 bits (98), Expect = 0.007
Identities = 31/123 (25%), Positives = 51/123 (41%), Gaps = 11/123 (8%)
Frame = -3
Query: 757 LIASKWSRK------SSFYSWSNPQSMPLFPVTEDLDDDY-----EEEEEEEDAEKARKV 611
+IA+K R+ S+FYSW NP SMPL + E ++D+ + E+++ D + RK+
Sbjct: 149 VIANKLRRRGRSMSASNFYSWQNPNSMPLLALQEPNEEDHHIHNDDYEDDDGDGDDHRKI 208
Query: 610 PSASSSSSSSLAEEKKQEDPVQMRHNRIPESYAAHMRLRLGSFKSRSFSLADLQEHDDEE 431
+ +A+ + F L+ LQE DD +
Sbjct: 209 MMMMKNKKELMAQTRS------------------------------CFCLSSLQEEDDGD 238
Query: 430 EDD 422
DD
Sbjct: 239 GDD 241
>gb|EAA31256.1| predicted protein [Neurospora crassa]
Length = 336
Score = 40.0 bits (92), Expect = 0.036
Identities = 22/82 (26%), Positives = 38/82 (45%)
Frame = -3
Query: 667 DDDYEEEEEEEDAEKARKVPSASSSSSSSLAEEKKQEDPVQMRHNRIPESYAAHMRLRLG 488
DDD ++EEE E++E+ + P + S+++ KK + P + + P +
Sbjct: 116 DDDEDDEEEAEESEEPEERPRKRAKSAANKKPAKKAKSPKRKNKKKAPNKKKKASNKKKA 175
Query: 487 SFKSRSFSLADLQEHDDEEEDD 422
S K S A E + EEE +
Sbjct: 176 SNKKASKKKAKESEDESEEESE 197
Score = 34.3 bits (77), Expect = 2.0
Identities = 24/79 (30%), Positives = 35/79 (43%)
Frame = -3
Query: 664 DDYEEEEEEEDAEKARKVPSASSSSSSSLAEEKKQEDPVQMRHNRIPESYAAHMRLRLGS 485
DD EEE EE AE A + + AEE+ +E+ + + E AA R +
Sbjct: 53 DDSEEEPEEVPAE-------APAEEAEEEAEEEAEEEAEEEAEEEVEEEPAAKRRKTTKA 105
Query: 484 FKSRSFSLADLQEHDDEEE 428
+ +D + DDEEE
Sbjct: 106 AGGKRKRASDDDDEDDEEE 124
Score = 33.9 bits (76), Expect = 2.6
Identities = 15/52 (28%), Positives = 27/52 (51%)
Frame = -3
Query: 676 EDLDDDYEEEEEEEDAEKARKVPSASSSSSSSLAEEKKQEDPVQMRHNRIPE 521
E+ +++ EEE EEE A K RK A+ +++ ++D + + PE
Sbjct: 81 EEAEEEAEEEVEEEPAAKRRKTTKAAGGKRKRASDDDDEDDEEEAEESEEPE 132
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 589,591,295
Number of Sequences: 1393205
Number of extensions: 12243809
Number of successful extensions: 128419
Number of sequences better than 10.0: 536
Number of HSP's better than 10.0 without gapping: 55632
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 94081
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 37158613404
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)