Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003226A_C01 KMC003226A_c01
(516 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_565238.1| expressed protein; protein id: At1g80440.1, sup... 102 2e-21
gb|AAM64770.1| unknown [Arabidopsis thaliana] 101 4e-21
ref|NP_563979.1| expressed protein; protein id: At1g15670.1, sup... 98 5e-20
ref|XP_172388.2| similar to gastric mucin (clone PGM-2A) - pig (... 45 6e-04
emb|CAC95146.1| period 1 protein [Spalax judaei] 44 0.001
>ref|NP_565238.1| expressed protein; protein id: At1g80440.1, supported by cDNA:
32885., supported by cDNA: gi_17528985 [Arabidopsis
thaliana] gi|25406664|pir||B96836 unknown protein
T21F11.23 [imported] - Arabidopsis thaliana
gi|6730740|gb|AAF27130.1|AC018849_18 unknown protein;
76867-75803 [Arabidopsis thaliana]
gi|17528986|gb|AAL38703.1| unknown protein [Arabidopsis
thaliana] gi|21436099|gb|AAM51296.1| unknown protein
[Arabidopsis thaliana]
Length = 354
Score = 102 bits (255), Expect = 2e-21
Identities = 58/137 (42%), Positives = 79/137 (57%), Gaps = 7/137 (5%)
Frame = -3
Query: 514 QGGFERTAEAFDAAEGKWGQVEENFL---GCATCPRTVVDGGDGDESGYMCSGGGDLMAK 344
QG F +TAE+FD + +WG + E+FL G P T V GGD Y C GGD+M
Sbjct: 226 QGQFSKTAESFDVSTWEWGPLTEDFLDDTGDTVSPPTCVAGGDL----YAC-WGGDVMMF 280
Query: 343 RGDTWQKVARVPGEICNVAYVGALDGSVLLIGSN----GYGGVHMGYVFDVKSCSWRKLE 176
D WQKV ++P ++ NV YV G +++IG+ GYG +GY+ D+ S W KLE
Sbjct: 281 LNDKWQKVGQIPADVYNVTYVAVRPGMLIVIGNGKALAGYGEATVGYICDLSSSRWVKLE 340
Query: 175 SPEGFHGHVQSGCILEI 125
+ GHVQ+GC LE+
Sbjct: 341 T---HGGHVQAGCFLEV 354
>gb|AAM64770.1| unknown [Arabidopsis thaliana]
Length = 354
Score = 101 bits (252), Expect = 4e-21
Identities = 57/137 (41%), Positives = 78/137 (56%), Gaps = 7/137 (5%)
Frame = -3
Query: 514 QGGFERTAEAFDAAEGKWGQVEENFL---GCATCPRTVVDGGDGDESGYMCSGGGDLMAK 344
QG F +TAE+FD + +WG + E+FL G P T GGD Y C GGD+M
Sbjct: 226 QGQFSKTAESFDVSTWEWGPLTEDFLDDTGDTVSPPTCAAGGDL----YAC-WGGDVMMF 280
Query: 343 RGDTWQKVARVPGEICNVAYVGALDGSVLLIGSN----GYGGVHMGYVFDVKSCSWRKLE 176
D WQKV ++P ++ NV YV G +++IG+ GYG +GY+ D+ S W KLE
Sbjct: 281 LNDKWQKVGQIPADVYNVTYVAVRPGKLIVIGNGKALAGYGEATVGYICDLSSSRWVKLE 340
Query: 175 SPEGFHGHVQSGCILEI 125
+ GHVQ+GC LE+
Sbjct: 341 T---HGGHVQAGCFLEV 354
>ref|NP_563979.1| expressed protein; protein id: At1g15670.1, supported by cDNA:
gi_12083253, supported by cDNA: gi_15146331, supported
by cDNA: gi_16323267 [Arabidopsis thaliana]
gi|25518687|pir||G86290 hypothetical protein F7H2.1 -
Arabidopsis thaliana gi|8927646|gb|AAF82137.1|AC034256_1
Contains similarity to Keap1 from Mus musculus
gb|AB020063 and contains two Kelch PF|01344 motifs.
ESTs gb|BE038279, gb|N38284, gb|T23017, gb|T21823,
gb|T45708, gb|T46757, gb|Z33921, gb|Z25966, gb|AI995282,
gb|AI100737 come from this gene. [Arabidopsis thaliana]
gi|12083254|gb|AAG48786.1|AF332423_1 unknown protein
[Arabidopsis thaliana] gi|15146332|gb|AAK83649.1|
At1g15670/F7H2_1 [Arabidopsis thaliana]
gi|16323268|gb|AAL15368.1| At1g15670/F7H2_1 [Arabidopsis
thaliana]
Length = 359
Score = 98.2 bits (243), Expect = 5e-20
Identities = 54/133 (40%), Positives = 75/133 (55%), Gaps = 3/133 (2%)
Frame = -3
Query: 514 QGGFERTAEAFDAAEGKWGQVEENFLGCATC--PRTVVDGGDGDESGYMCSGGGDLMAKR 341
QG F +TAE+FD +W E FL P G +GD Y C DLM +
Sbjct: 230 QGQFSKTAESFDVTTWRWSPQGEEFLSSEMTMWPPICAAGENGDL--YACCRR-DLMMMK 286
Query: 340 GDTWQKVARVPGEICNVAYVGAL-DGSVLLIGSNGYGGVHMGYVFDVKSCSWRKLESPEG 164
DTW KV +P ++CNV+YV G++++IGS YG +GY +D+ + W KLE+ +
Sbjct: 287 DDTWYKVGNLPADVCNVSYVAIRRSGNLVVIGSARYGEPSVGYNWDMSNSRWLKLETHDK 346
Query: 163 FHGHVQSGCILEI 125
+ GHVQ+GC LEI
Sbjct: 347 YEGHVQAGCFLEI 359
>ref|XP_172388.2| similar to gastric mucin (clone PGM-2A) - pig (fragment) [Homo
sapiens]
Length = 375
Score = 44.7 bits (104), Expect = 6e-04
Identities = 40/122 (32%), Positives = 50/122 (40%)
Frame = +1
Query: 151 HVHGTLLDSPIFSNYNSSHQTHNPYELHHTHSILSTTLTHPTLQHTPRCISHPAPSPPSA 330
H HG DSP+ + SHQ TH S T Q PR S PA PP+
Sbjct: 86 HSHGP--DSPL--SQRQSHQL-TATATARTHLCPSVRATSSQPQPRPRLTSVPASEPPAH 140
Query: 331 TCPPSSPSNPRRHCTCTRSHLHRRRPQPSSDTWRNPRSSLPPGPTSPPPHQTPLRSSQNL 510
+ P SP + R+ T + R PS R S PG TS PP + P S +
Sbjct: 141 SHGPDSPLSQRQSHQLTATATARTHLCPSV---RATSSQPRPGLTSVPPSEPPAHSHSHG 197
Query: 511 PE 516
P+
Sbjct: 198 PD 199
Score = 38.9 bits (89), Expect = 0.035
Identities = 32/112 (28%), Positives = 45/112 (39%), Gaps = 3/112 (2%)
Frame = +1
Query: 151 HVHGTLLDSPIFSNYNSSHQTHNPYELHHTHSILSTTLTHPTLQHTPRCISHPAPSPPSA 330
H H DSP+ S Q+H + L ++ P+ Q P S PA PP+
Sbjct: 32 HSHSHSPDSPL-----SQRQSHQLTATATARTHLCPSVRAPSSQPRPGLTSVPASEPPAH 86
Query: 331 TCPPSSPSNPRRHCTCTRSHLHRRRPQPS---SDTWRNPRSSLPPGPTSPPP 477
+ P SP + R+ T + R PS + + PR L P S PP
Sbjct: 87 SHGPDSPLSQRQSHQLTATATARTHLCPSVRATSSQPQPRPRLTSVPASEPP 138
Score = 37.4 bits (85), Expect = 0.10
Identities = 32/112 (28%), Positives = 46/112 (40%), Gaps = 3/112 (2%)
Frame = +1
Query: 151 HVHGTLLDSPIFSNYNSSHQTHNPYELHHTHSILSTTLTHPTLQHTPRCISHPAPSPPSA 330
H HG DSP+ S HQ+H + L ++ + Q P S PA PP+
Sbjct: 194 HSHGP--DSPL-----SQHQSHQLTATATARTHLCPSVRATSSQPRPGLTSVPASEPPAH 246
Query: 331 TCPPSSPSNPRRHCTCTRSHLHRRRPQPS---SDTWRNPRSSLPPGPTSPPP 477
+ P SP + R+ T + R PS + + P+ L P S PP
Sbjct: 247 SHGPDSPLSQRQSHQLTATATARTHLCPSVRATSSQPQPQPGLTSVPASEPP 298
Score = 37.4 bits (85), Expect = 0.10
Identities = 35/112 (31%), Positives = 44/112 (39%), Gaps = 3/112 (2%)
Frame = +1
Query: 151 HVHGTLLDSPIFSNYNSSHQTHNPYELHHTHSILSTTLTHPTLQHTPRCISHPAPSPPSA 330
H HG DSP+ + SHQ TH S T Q P S PA PP+
Sbjct: 246 HSHGP--DSPL--SQRQSHQL-TATATARTHLCPSVRATSSQPQPQPGLTSVPASEPPAH 300
Query: 331 TCPPSSPSNPRRHCTCTRSHLHRRRPQPS---SDTWRNPRSSLPPGPTSPPP 477
+ P SP + R+ T + R PS + + PR L P S PP
Sbjct: 301 SHSPDSPLSHRQSPQLTATATARTHLCPSVRATSSQPQPRPGLTSVPASEPP 352
>emb|CAC95146.1| period 1 protein [Spalax judaei]
Length = 1285
Score = 43.5 bits (101), Expect = 0.001
Identities = 37/124 (29%), Positives = 45/124 (35%), Gaps = 5/124 (4%)
Frame = +1
Query: 133 RCNHFGHVHGTLLDSPIFSNYNSSHQTHNPYELHHTHSILSTTLTHPTLQHTPR-----C 297
R G + G S S H P HH S + H HTPR C
Sbjct: 793 RFKDLGRLRGLDSSSATPSAPGCHHGPVPPGRRHHCRSKAKRSRHH----HTPRAEAPCC 848
Query: 298 ISHPAPSPPSATCPPSSPSNPRRHCTCTRSHLHRRRPQPSSDTWRNPRSSLPPGPTSPPP 477
+SHP+P PPS PP + P + + P P P+ LPP PT PP
Sbjct: 849 VSHPSPVPPSGPWPPPPSTTP-------FPAVVQPYPLPVFSARGGPQ-PLPPAPTPMPP 900
Query: 478 HQTP 489
P
Sbjct: 901 ATFP 904
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 588,390,139
Number of Sequences: 1393205
Number of extensions: 16563633
Number of successful extensions: 114593
Number of sequences better than 10.0: 1760
Number of HSP's better than 10.0 without gapping: 74808
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 100954
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 16154357632
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)