Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC017731A_C01 KMC017731A_c01
(543 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_193738.1| putative protein; protein id: At4g20050.1 [Arab... 101 5e-21
ref|NP_567595.1| putative protein; protein id: At4g20040.1, supp... 70 1e-11
pir||T04888 hypothetical protein F18F4.140 - Arabidopsis thalian... 70 1e-11
ref|NP_615744.1| proteophosphoglycan [Methanosarcina acetivorans... 37 0.12
emb|CAA64965.1| cDNA6 [Brugia pahangi] 36 0.26
>ref|NP_193738.1| putative protein; protein id: At4g20050.1 [Arabidopsis thaliana]
gi|7485729|pir||T04889 hypothetical protein F18F4.150 -
Arabidopsis thaliana gi|2827659|emb|CAA16613.1| putative
protein [Arabidopsis thaliana]
gi|7268800|emb|CAB79005.1| putative protein [Arabidopsis
thaliana]
Length = 481
Score = 101 bits (252), Expect = 5e-21
Identities = 50/82 (60%), Positives = 65/82 (78%), Gaps = 1/82 (1%)
Frame = -1
Query: 540 GMKLKATVAKKSMQGNGTSWSVDFNDVLLFPNLIKNVQYSL-SSTGSTFPNHALRNVSEN 364
GM K+TVA+ S+ GNGTSW+VDFN VLLFP+LI +VQY+L +S FP HALRNVS+N
Sbjct: 400 GMVEKSTVARGSVDGNGTSWTVDFNPVLLFPDLINHVQYTLVASEAGVFPLHALRNVSDN 459
Query: 363 RVVIETNEAVSANVFITVDQSM 298
RVV+ETN V+ V++TV+Q +
Sbjct: 460 RVVVETNAPVTGTVYVTVNQGV 481
>ref|NP_567595.1| putative protein; protein id: At4g20040.1, supported by cDNA:
gi_16226215 [Arabidopsis thaliana]
gi|16226216|gb|AAL16105.1|AF428273_1 AT4g20040/F18F4_140
[Arabidopsis thaliana] gi|22137222|gb|AAM91456.1|
AT4g20040/F18F4_140 [Arabidopsis thaliana]
Length = 483
Score = 70.5 bits (171), Expect = 1e-11
Identities = 35/79 (44%), Positives = 47/79 (59%)
Frame = -1
Query: 540 GMKLKATVAKKSMQGNGTSWSVDFNDVLLFPNLIKNVQYSLSSTGSTFPNHALRNVSENR 361
GM LK+T K + NGT W DF+ VL+FPN I + Q+S + P +A+ NVS N
Sbjct: 405 GMMLKSTTGKAMVSANGTRWIADFSPVLVFPNRINHYQHSFFAQSGQIPANAVTNVSNNM 464
Query: 360 VVIETNEAVSANVFITVDQ 304
VV+ET+ AV+ V I Q
Sbjct: 465 VVVETDRAVTGTVSIIAYQ 483
>pir||T04888 hypothetical protein F18F4.140 - Arabidopsis thaliana
gi|2827658|emb|CAA16612.1| putative protein [Arabidopsis
thaliana] gi|7268799|emb|CAB79004.1| putative protein
[Arabidopsis thaliana]
Length = 453
Score = 70.5 bits (171), Expect = 1e-11
Identities = 35/79 (44%), Positives = 47/79 (59%)
Frame = -1
Query: 540 GMKLKATVAKKSMQGNGTSWSVDFNDVLLFPNLIKNVQYSLSSTGSTFPNHALRNVSENR 361
GM LK+T K + NGT W DF+ VL+FPN I + Q+S + P +A+ NVS N
Sbjct: 375 GMMLKSTTGKAMVSANGTRWIADFSPVLVFPNRINHYQHSFFAQSGQIPANAVTNVSNNM 434
Query: 360 VVIETNEAVSANVFITVDQ 304
VV+ET+ AV+ V I Q
Sbjct: 435 VVVETDRAVTGTVSIIAYQ 453
>ref|NP_615744.1| proteophosphoglycan [Methanosarcina acetivorans str. C2A]
gi|19914595|gb|AAM04224.1| proteophosphoglycan
[Methanosarcina acetivorans str. C2A]
Length = 146
Score = 37.4 bits (85), Expect = 0.12
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 4/60 (6%)
Frame = -2
Query: 182 SHIFILHFSVSLFFYHIFY*----IFYFFLSLSFLSLFLLTKVYTSLKDVKHFYSPYIII 15
S F+ +F VS FF+ F+ + YFF+S F+S F ++ + V HF+ Y +
Sbjct: 20 SSFFVSNFFVSYFFFSSFFVSNFFVSYFFVSYFFVSYFFVSHFFVGYFFVSHFFVGYFFV 79
Score = 35.4 bits (80), Expect = 0.45
Identities = 18/53 (33%), Positives = 28/53 (51%)
Frame = -2
Query: 182 SHIFILHFSVSLFFYHIFY*IFYFFLSLSFLSLFLLTKVYTSLKDVKHFYSPY 24
S F+ +F VS FF F+ + YFF+S F+ F ++ + V HF+ Y
Sbjct: 35 SSFFVSNFFVSYFFVSYFF-VSYFFVSHFFVGYFFVSHFFVGYFFVSHFFVGY 86
>emb|CAA64965.1| cDNA6 [Brugia pahangi]
Length = 175
Score = 36.2 bits (82), Expect = 0.26
Identities = 30/94 (31%), Positives = 48/94 (50%), Gaps = 7/94 (7%)
Frame = -2
Query: 353 LKQMKQYLQM-FLSLWIKAWQVEEV*SWRHKCLFN---KS*LISSDCRGNCAYLMKEL-- 192
LK +++YL++ +L W+ WR K + +S L D +C +++ +
Sbjct: 51 LKILQRYLRVEYLWTWLLGRTCAT--RWRRKLSLSDGFRSLLNEKDYNFSCLFIIDDYSR 108
Query: 191 -CILSHIFILHFSVSLFFYHIFY*IFYFFLSLSF 93
CI S F+L SLFF+ +F IF+FF SL F
Sbjct: 109 TCIKSSAFLL--VSSLFFFLLFLHIFFFFFSLFF 140
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 434,109,771
Number of Sequences: 1393205
Number of extensions: 9133609
Number of successful extensions: 46683
Number of sequences better than 10.0: 54
Number of HSP's better than 10.0 without gapping: 39229
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 45732
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18750593680
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)