Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC016293A_C01 KMC016293A_c01
(575 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|NP_566576.1| expressed protein; protein id: At3g17380.1, sup...    92   6e-18
gb|AAM61614.1| unknown [Arabidopsis thaliana]                          92   6e-18
dbj|BAB02742.1| gb|AAD28294.1~gene_id:MGD8.22~similar to unknown...    92   6e-18
ref|NP_189462.1| unknown protein; protein id: At3g28220.1, suppo...    71   1e-11
gb|AAM97014.1| expressed protein [Arabidopsis thaliana]                71   1e-11
>ref|NP_566576.1| expressed protein; protein id: At3g17380.1, supported by cDNA:
125943. [Arabidopsis thaliana]
Length = 309
Score = 91.7 bits (226), Expect = 6e-18
Identities = 48/130 (36%), Positives = 74/130 (56%), Gaps = 2/130 (1%)
Frame = -2
Query: 559 HSWKFNKFSQ-SMEKHESESFFGGDYKWKLVLYPNGIVEGKGNSMSLFLVL-DVSTLPPN 386
H WK FS+ E ++S +FF GD KWK+ YP G +G G +S++L L D T+
Sbjct: 176 HVWKIENFSKLDKESYDSNAFFAGDRKWKIEFYPTGTKQGTGTHLSIYLTLVDPETISDG 235
Query: 385 TKLVVDCILRAKDQLREQHAIQKFCRKFSESASVWGSRRLVALAKVRDLKNGLLLGDSCI 206
TK+ V+ +R DQL+ +H K + FS S+S G + V++ +GLLL D C+
Sbjct: 236 TKIFVEFTIRIFDQLQGRHIAGKVTKWFSRSSSEHGWVKYVSMVYFTQPNSGLLLKDVCL 295
Query: 205 LEAEFTILGL 176
+EA+ + G+
Sbjct: 296 VEADVCVHGI 305
Score = 63.9 bits (154), Expect = 1e-09
Identities = 40/115 (34%), Positives = 63/115 (54%), Gaps = 3/115 (2%)
Frame = -2
Query: 529 SMEKHESESFFGGDYKWKLVLYPNG-IVEGKGNSMSLFLVL-DVSTLPPNTKLVVDCILR 356
++E++E+ESF G YKWKLVLYPNG + + +S++L L D S+L P ++ L
Sbjct: 36 AIERYETESFEAGGYKWKLVLYPNGNKSKNTKDHVSVYLSLADSSSLSPGWEVYAVFRLY 95
Query: 355 AKDQLREQHAI-QKFCRKFSESASVWGSRRLVALAKVRDLKNGLLLGDSCILEAE 194
DQ ++ + I Q R+F WG + + D NG L+ D+C+ A+
Sbjct: 96 LLDQNKDNYLILQGNERRFHSVKREWGFDKFIPTGTFSDASNGYLMEDTCMFGAD 150
>gb|AAM61614.1| unknown [Arabidopsis thaliana]
Length = 309
Score = 91.7 bits (226), Expect = 6e-18
Identities = 48/130 (36%), Positives = 74/130 (56%), Gaps = 2/130 (1%)
Frame = -2
Query: 559 HSWKFNKFSQ-SMEKHESESFFGGDYKWKLVLYPNGIVEGKGNSMSLFLVL-DVSTLPPN 386
H WK FS+ E ++S +FF GD KWK+ YP G +G G +S++L L D T+
Sbjct: 176 HVWKIENFSKLDKESYDSNAFFAGDRKWKIEFYPTGTKQGTGTHLSIYLTLVDPETISDG 235
Query: 385 TKLVVDCILRAKDQLREQHAIQKFCRKFSESASVWGSRRLVALAKVRDLKNGLLLGDSCI 206
TK+ V+ +R DQL+ +H K + FS S+S G + V++ +GLLL D C+
Sbjct: 236 TKIFVEFTIRIFDQLQGRHIAGKVTKWFSRSSSEHGWVKYVSMVYFTQPNSGLLLKDVCL 295
Query: 205 LEAEFTILGL 176
+EA+ + G+
Sbjct: 296 VEADVCVHGI 305
Score = 64.3 bits (155), Expect = 1e-09
Identities = 40/115 (34%), Positives = 63/115 (54%), Gaps = 3/115 (2%)
Frame = -2
Query: 529 SMEKHESESFFGGDYKWKLVLYPNG-IVEGKGNSMSLFLVL-DVSTLPPNTKLVVDCILR 356
++E++E+ESF G YKWKLVLYPNG + + +S++L L D S+L P ++ L
Sbjct: 36 AIERYETESFEAGGYKWKLVLYPNGNKSKNTKDHVSVYLALADSSSLSPGWEVYAVFRLY 95
Query: 355 AKDQLREQHAI-QKFCRKFSESASVWGSRRLVALAKVRDLKNGLLLGDSCILEAE 194
DQ ++ + I Q R+F WG + + D NG L+ D+C+ A+
Sbjct: 96 LLDQNKDNYLILQGNERRFHSVKREWGFDKFIPTGTFSDSSNGYLMEDTCMFGAD 150
>dbj|BAB02742.1| gb|AAD28294.1~gene_id:MGD8.22~similar to unknown protein
[Arabidopsis thaliana]
Length = 304
Score = 91.7 bits (226), Expect = 6e-18
Identities = 48/130 (36%), Positives = 74/130 (56%), Gaps = 2/130 (1%)
Frame = -2
Query: 559 HSWKFNKFSQ-SMEKHESESFFGGDYKWKLVLYPNGIVEGKGNSMSLFLVL-DVSTLPPN 386
H WK FS+ E ++S +FF GD KWK+ YP G +G G +S++L L D T+
Sbjct: 171 HVWKIENFSKLDKESYDSNAFFAGDRKWKIEFYPTGTKQGTGTHLSIYLTLVDPETISDG 230
Query: 385 TKLVVDCILRAKDQLREQHAIQKFCRKFSESASVWGSRRLVALAKVRDLKNGLLLGDSCI 206
TK+ V+ +R DQL+ +H K + FS S+S G + V++ +GLLL D C+
Sbjct: 231 TKIFVEFTIRIFDQLQGRHIAGKVTKWFSRSSSEHGWVKYVSMVYFTQPNSGLLLKDVCL 290
Query: 205 LEAEFTILGL 176
+EA+ + G+
Sbjct: 291 VEADVCVHGI 300
Score = 63.9 bits (154), Expect = 1e-09
Identities = 40/115 (34%), Positives = 63/115 (54%), Gaps = 3/115 (2%)
Frame = -2
Query: 529 SMEKHESESFFGGDYKWKLVLYPNG-IVEGKGNSMSLFLVL-DVSTLPPNTKLVVDCILR 356
++E++E+ESF G YKWKLVLYPNG + + +S++L L D S+L P ++ L
Sbjct: 31 AIERYETESFEAGGYKWKLVLYPNGNKSKNTKDHVSVYLSLADSSSLSPGWEVYAVFRLY 90
Query: 355 AKDQLREQHAI-QKFCRKFSESASVWGSRRLVALAKVRDLKNGLLLGDSCILEAE 194
DQ ++ + I Q R+F WG + + D NG L+ D+C+ A+
Sbjct: 91 LLDQNKDNYLILQGNERRFHSVKREWGFDKFIPTGTFSDASNGYLMEDTCMFGAD 145
>ref|NP_189462.1| unknown protein; protein id: At3g28220.1, supported by cDNA:
gi_13937241 [Arabidopsis thaliana]
gi|11994584|dbj|BAB02639.1|
dbj|BAA87936.1~gene_id:T19D11.3~similar to unknown
protein [Arabidopsis thaliana]
gi|13937242|gb|AAK50113.1|AF372976_1 AT3g28220/T19D11_3
[Arabidopsis thaliana] gi|22137146|gb|AAM91418.1|
AT3g28220/T19D11_3 [Arabidopsis thaliana]
Length = 370
Score = 70.9 bits (172), Expect = 1e-11
Identities = 45/126 (35%), Positives = 72/126 (56%), Gaps = 3/126 (2%)
Frame = -2
Query: 559 HSWKFNKFSQSMEK--HESESFFGGDYKWKLVLYPNGIVEGKGNSMSLFLV-LDVSTLPP 389
+ W FS S+EK + S+ F G W L +YP+G EG+GNS+SL++V +DV P
Sbjct: 239 YKWTLPNFS-SLEKQYYVSDKFVIGGRSWALKVYPSGDGEGQGNSLSLYVVAVDVK---P 294
Query: 388 NTKLVVDCILRAKDQLREQHAIQKFCRKFSESASVWGSRRLVALAKVRDLKNGLLLGDSC 209
K+ + LR +Q +H ++K +S+ A+ WG ++ V A ++D GLL+ D+
Sbjct: 295 YDKIYLKAKLRIINQRDSKH-MEKKVESWSDQANSWGFQKFVPFADLKDTSKGLLVNDTL 353
Query: 208 ILEAEF 191
+E EF
Sbjct: 354 KMEIEF 359
Score = 41.2 bits (95), Expect = 0.009
Identities = 27/62 (43%), Positives = 37/62 (59%), Gaps = 4/62 (6%)
Frame = -2
Query: 547 FNKFSQS--MEKHESESFFGGDYKWKLVLYPNG-IVEGKG-NSMSLFLVLDVSTLPPNTK 380
F KF+ S EK+ES F G Y W L++YP G I EG N +S+++ +D STL + K
Sbjct: 90 FIKFATSPNAEKYESRPFESGGYNWTLIVYPKGNIKEGAPLNYVSMYVQIDNSTLLNSPK 149
Query: 379 LV 374
V
Sbjct: 150 EV 151
>gb|AAM97014.1| expressed protein [Arabidopsis thaliana]
Length = 290
Score = 70.9 bits (172), Expect = 1e-11
Identities = 39/121 (32%), Positives = 66/121 (54%), Gaps = 1/121 (0%)
Frame = -2
Query: 553 WKFNKFS-QSMEKHESESFFGGDYKWKLVLYPNGIVEGKGNSMSLFLVLDVSTLPPNTKL 377
W+ KFS + ++ + S+SF G W L +YPNG+ GNS+SL+L+ D S N K
Sbjct: 150 WRLTKFSTRFLDSYTSDSFSSGGRNWALKVYPNGVGNATGNSLSLYLLSDQS----NDKG 205
Query: 376 VVDCILRAKDQLREQHAIQKFCRKFSESASVWGSRRLVALAKVRDLKNGLLLGDSCILEA 197
V+ LR DQ++ + +K + + + WG R ++ A +++ G L+ D+ LE
Sbjct: 206 YVEAKLRVIDQIQSNNFEKKVAAWPNATENGWGFDRFLSFADIKNTSKGFLVNDTLKLEV 265
Query: 196 E 194
+
Sbjct: 266 Q 266
Score = 32.3 bits (72), Expect = 4.4
Identities = 18/48 (37%), Positives = 27/48 (55%), Gaps = 1/48 (2%)
Frame = -2
Query: 520 KHESESFFGGDYKWKLVLYP-NGIVEGKGNSMSLFLVLDVSTLPPNTK 380
K+ES F G Y W L++YP I G +S+++ +D S+L N K
Sbjct: 11 KYESRPFSVGGYNWTLLIYPVIYIPTDSGGYVSIYVRVDNSSLITNPK 58
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda     K      H
   0.318    0.135    0.401

Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 496,961,157
Number of Sequences: 1393205
Number of extensions: 10842945
Number of successful extensions: 27454
Number of sequences better than 10.0: 142
Number of HSP's better than 10.0 without gapping: 26379
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 27401
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21530810025
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
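
Note on the statistics above: the bit scores and E-values in this report can be approximately recovered from the gapped Lambda and K values and the effective search space, using the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'). The sketch below is an illustration only, not part of the BLAST output. The function names are hypothetical, and the assumption that the translated query length is 575 // 3 = 191 residues is inferred from the numbers in this report (it reproduces the printed search space exactly). Because Lambda and K are printed to only three significant figures, the computed E-values are close to, but not always identical to, the printed ones.

```python
import math

# Values copied from the statistics footer of this report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 116
QUERY_NUCLEOTIDES = 575  # "(575 letters)" in the report header


def effective_db_length():
    """Database letters minus one effective HSP length per sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


def effective_search_space():
    """Effective query length times effective database length.

    Assumption: for a translated (BLASTX) query, the query length in
    residues is taken as the nucleotide length divided by 3.
    """
    query_residues = QUERY_NUCLEOTIDES // 3
    return (query_residues - EFFECTIVE_HSP_LENGTH) * effective_db_length()


def bit_score(raw_score):
    """Convert a raw alignment score to bits: (Lambda*S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score):
    """Approximate E-value: search space * 2^(-bit score)."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print("effective length of database:", effective_db_length())
    print("effective search space:", effective_search_space())
    # Raw scores taken from the "Score = ... bits (raw)" lines above.
    for raw in (226, 172, 154, 95):
        print(f"raw {raw}: {bit_score(raw):.1f} bits, E ~ {expect_value(raw):.1e}")
```

Running the script prints the effective database length and search space, followed by the bit score and approximate E-value for four of the raw scores reported in the alignments above, so they can be compared against the corresponding "Score" and "Expect" lines.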