Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC002424A_C01 KMC002424A_c01
(423 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_566902.1| putative protein; protein id: At3g48360.1, supp... 159 6e-39
pir||T06706 hypothetical protein T29H11.120 - Arabidopsis thalia... 156 5e-38
ref|NP_201121.1| putative protein; protein id: At5g63160.1 [Arab... 156 6e-38
ref|NP_172060.1| hypothetical protein; protein id: At1g05690.1 [... 107 4e-23
ref|NP_568031.1| putative protein; protein id: At4g37610.1, supp... 105 1e-22
>ref|NP_566902.1| putative protein; protein id: At3g48360.1, supported by cDNA:
gi_14532781, supported by cDNA: gi_19310816 [Arabidopsis
thaliana] gi|14532782|gb|AAK64172.1| unknown protein
[Arabidopsis thaliana] gi|19310817|gb|AAL85139.1|
unknown protein [Arabidopsis thaliana]
gi|23397078|gb|AAN31824.1| unknown protein [Arabidopsis
thaliana]
Length = 364
Score = 159 bits (403), Expect = 6e-39
Identities = 82/126 (65%), Positives = 101/126 (80%), Gaps = 3/126 (2%)
Frame = +3
Query: 54 PEPDIFIHTSDGTRIPAHSNILASMSPVLESMIDRP-RKHRS--SERIIQIHGVPGDAVT 224
P D+ I TSD RIPAHS +LAS SPVL +++ +P R++R S+R+I+I GVP DAV+
Sbjct: 32 PTSDVEIVTSDNRRIPAHSGVLASASPVLMNIMKKPMRRYRGCGSKRVIKILGVPCDAVS 91
Query: 225 AFLTFLYSRRCTEDEMDRYGMHLLALSHVYMVPHLKQRCTKGLSQRVNTENVVDMLQLAR 404
F+ FLYS TEDEM+RYG+HLLALSHVYMV LKQRC+KG+ QR+ TENVVD+LQLAR
Sbjct: 92 VFIKFLYSSSLTEDEMERYGIHLLALSHVYMVTQLKQRCSKGVVQRLTTENVVDVLQLAR 151
Query: 405 LCDAPD 422
LCDAPD
Sbjct: 152 LCDAPD 157
>pir||T06706 hypothetical protein T29H11.120 - Arabidopsis thaliana
gi|4678352|emb|CAB41162.1| putative protein [Arabidopsis
thaliana]
Length = 367
Score = 156 bits (395), Expect = 5e-38
Identities = 83/129 (64%), Positives = 102/129 (78%), Gaps = 6/129 (4%)
Frame = +3
Query: 54 PEPDIFIHTSDGTRIPAHSNILASMSPVLESMIDRP-RKHRS--SERIIQIHGVPGDAVT 224
P D+ I TSD RIPAHS +LAS SPVL +++ +P R++R S+R+I+I GVP DAV+
Sbjct: 32 PTSDVEIVTSDNRRIPAHSGVLASASPVLMNIMKKPMRRYRGCGSKRVIKILGVPCDAVS 91
Query: 225 AFLTFLYSRRC---TEDEMDRYGMHLLALSHVYMVPHLKQRCTKGLSQRVNTENVVDMLQ 395
F+ FLYS R TEDEM+RYG+HLLALSHVYMV LKQRC+KG+ QR+ TENVVD+LQ
Sbjct: 92 VFIKFLYSSRLVCLTEDEMERYGIHLLALSHVYMVTQLKQRCSKGVVQRLTTENVVDVLQ 151
Query: 396 LARLCDAPD 422
LARLCDAPD
Sbjct: 152 LARLCDAPD 160
>ref|NP_201121.1| putative protein; protein id: At5g63160.1 [Arabidopsis thaliana]
gi|10177297|dbj|BAB10558.1| contains similarity to
unknown protein~gene_id:MDC12.13~pir||T06706
[Arabidopsis thaliana]
Length = 365
Score = 156 bits (394), Expect = 6e-38
Identities = 80/124 (64%), Positives = 99/124 (79%), Gaps = 2/124 (1%)
Frame = +3
Query: 57 EPDIFIHTSDGTRIPAHSNILASMSPVLESMIDRPRKHR--SSERIIQIHGVPGDAVTAF 230
E D+ I TS IPAHS ILAS+SPVL ++I++PRK SS+++I+I GVP DAV+ F
Sbjct: 24 ETDVEIITSGRRSIPAHSGILASVSPVLTNIIEKPRKIHGGSSKKVIKILGVPCDAVSVF 83
Query: 231 LTFLYSRRCTEDEMDRYGMHLLALSHVYMVPHLKQRCTKGLSQRVNTENVVDMLQLARLC 410
+ FLYS TE+EM++YG+HLLALSHVYMV LKQRCTKG+ +RV ENVVD+LQLARLC
Sbjct: 84 VRFLYSPSVTENEMEKYGIHLLALSHVYMVTQLKQRCTKGVGERVTAENVVDILQLARLC 143
Query: 411 DAPD 422
DAPD
Sbjct: 144 DAPD 147
>ref|NP_172060.1| hypothetical protein; protein id: At1g05690.1 [Arabidopsis
thaliana] gi|25367421|pir||B86191 hypothetical protein
[imported] - Arabidopsis thaliana
gi|4836923|gb|AAD30625.1|AC007153_17 Hypothetical
protein [Arabidopsis thaliana]
Length = 322
Score = 107 bits (266), Expect = 4e-23
Identities = 53/118 (44%), Positives = 81/118 (67%), Gaps = 1/118 (0%)
Frame = +3
Query: 63 DIFIHTSDGTRIPAHSNILASMSPVLESMIDRPRKHRSSERIIQIHGVPGDAVTAFLTFL 242
D ++ T + + PAHS++LA+ SPV+ +++++ R ++ ++IHGVP +AV F+ FL
Sbjct: 55 DTYVETDNKSHFPAHSSVLAAASPVIATLLNQSRD-KNGNTYLKIHGVPCEAVYMFIRFL 113
Query: 243 YSRRCTEDEMDRYGMHLLALSHVYMVPHLKQRCTKGLSQR-VNTENVVDMLQLARLCD 413
YS E+EM ++ +HLL LSH Y VP LK+ C + L Q +N ENV+D+LQLAR CD
Sbjct: 114 YSSCYEEEEMKKFVLHLLVLSHCYSVPSLKRLCVEILDQGWINKENVIDVLQLARNCD 171
>ref|NP_568031.1| putative protein; protein id: At4g37610.1, supported by cDNA:
122670. [Arabidopsis thaliana]
Length = 368
Score = 105 bits (262), Expect = 1e-22
Identities = 55/120 (45%), Positives = 75/120 (61%), Gaps = 1/120 (0%)
Frame = +3
Query: 63 DIFIHTSDGTRIPAHSNILASMSPVLESMIDRPRKHRSSERIIQIHGVPGDAVTAFLTFL 242
D+ IHT D I AHSN++ S V+ M+ + K +S + I I GVP A+ F+ FL
Sbjct: 56 DVLIHTDDNGLIYAHSNVIGMASDVIRGMM-KQHKRKSHRKSISILGVPHHALRVFIRFL 114
Query: 243 YSRRCTEDEMDRYGMHLLALSHVYMVPHLKQRCTKGL-SQRVNTENVVDMLQLARLCDAP 419
YS + +M+ + +HLL LSHVY+VPHLK+ C S +N ENV+D+ QLA LCDAP
Sbjct: 115 YSSCYEKQDMEDFAIHLLVLSHVYVVPHLKRVCESEFESSLLNKENVIDVFQLALLCDAP 174
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 429,951,800
Number of Sequences: 1393205
Number of extensions: 10206527
Number of successful extensions: 51902
Number of sequences better than 10.0: 442
Number of HSP's better than 10.0 without gapping: 42868
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 50976
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 6889859208
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)