Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003302A_C01 KMC003302A_c01
(591 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_201433.1| GATA zinc finger protein; protein id: At5g66320... 149 3e-35
ref|NP_190677.1| GATA zinc finger protein; protein id: At3g51080... 145 5e-34
gb|AAK98698.1|AC069158_10 Putative GATA-1 zinc finger protein [O... 137 1e-31
ref|NP_195347.1| GATA zinc finger protein; protein id: At4g36240... 132 3e-30
pir||T05288 GATA-binding transcription factor homolog 3 [importe... 131 6e-30
>ref|NP_201433.1| GATA zinc finger protein; protein id: At5g66320.1 [Arabidopsis
thaliana] gi|10177426|dbj|BAB10711.1| GATA-binding
transcription factor-like protein [Arabidopsis thaliana]
gi|22531223|gb|AAM97115.1| GATA-binding transcription
factor-like protein [Arabidopsis thaliana]
Length = 339
Score = 149 bits (375), Expect = 3e-35
Identities = 74/113 (65%), Positives = 88/113 (77%), Gaps = 5/113 (4%)
Frame = -1
Query: 552 QPPMKKQKKKPEAQDHTGGP----QFQRRCSHCQVQKTPQWRAGPLGPKTLCNACGVRFK 385
+PP K+ KK A+ G Q QR+CSHC VQKTPQWRAGP+G KTLCNACGVR+K
Sbjct: 222 RPPFPKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYK 281
Query: 384 SGRLFPEYRPACSPTFSGEIHSNSHRKVLEMRRRKE-TAEPVSGLDQARMVQS 229
SGRL PEYRPACSPTFS E+HSN HRKV+EMRR+KE T++ +GL+Q +VQS
Sbjct: 282 SGRLLPEYRPACSPTFSSELHSNHHRKVIEMRRKKEPTSDNETGLNQ--LVQS 332
>ref|NP_190677.1| GATA zinc finger protein; protein id: At3g51080.1, supported by
cDNA: gi_17381183 [Arabidopsis thaliana]
gi|11358891|pir||T45739 transcription factor-like
protein - Arabidopsis thaliana
gi|6562260|emb|CAB62630.1| transcription factor-like
protein [Arabidopsis thaliana]
gi|17381184|gb|AAL36404.1| putative transcription factor
[Arabidopsis thaliana] gi|21436205|gb|AAM51390.1|
putative transcription factor [Arabidopsis thaliana]
Length = 312
Score = 145 bits (365), Expect = 5e-34
Identities = 74/127 (58%), Positives = 89/127 (69%), Gaps = 11/127 (8%)
Frame = -1
Query: 573 LSGEGFRQPPMKKQKKKPEAQDHTGGPQFQ-----RRCSHCQVQKTPQWRAGPLGPKTLC 409
L+ F PM K +KK + + G Q Q R+C HC VQKTPQWRAGPLG KTLC
Sbjct: 186 LASGQFLDEPMTKTQKKKKVWKNAGQTQTQTQTQTRQCGHCGVQKTPQWRAGPLGAKTLC 245
Query: 408 NACGVRFKSGRLFPEYRPACSPTFSGEIHSNSHRKVLEMRRRKETAEPV--SGLDQ---- 247
NACGVR+KSGRL PEYRPACSPTFS E+HSN H KV+EMRR+KET++ +GL+Q
Sbjct: 246 NACGVRYKSGRLLPEYRPACSPTFSSELHSNHHSKVIEMRRKKETSDGAEETGLNQPVQT 305
Query: 246 ARMVQSF 226
++V SF
Sbjct: 306 VQVVSSF 312
>gb|AAK98698.1|AC069158_10 Putative GATA-1 zinc finger protein [Oryza sativa]
Length = 418
Score = 137 bits (344), Expect = 1e-31
Identities = 73/125 (58%), Positives = 84/125 (66%), Gaps = 24/125 (19%)
Frame = -1
Query: 549 PPMKKQKK--KP----------------EAQDHTGG-----PQFQRRCSHCQVQKTPQWR 439
PPMKK+KK KP +A GG P RRC+HCQ++KTPQWR
Sbjct: 288 PPMKKKKKAKKPAAPAAASDAEADADAADADYEEGGALALPPGTVRRCTHCQIEKTPQWR 347
Query: 438 AGPLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSGEIHSNSHRKVLEMRRR-KETAEPV 262
AGPLGPKTLCNACGVR+KSGRLFPEYRPA SPTF IHSNSH+KV+EMR++ TA+P
Sbjct: 348 AGPLGPKTLCNACGVRYKSGRLFPEYRPAASPTFMPSIHSNSHKKVVEMRQKATRTADPS 407
Query: 261 SGLDQ 247
L Q
Sbjct: 408 CDLLQ 412
>ref|NP_195347.1| GATA zinc finger protein; protein id: At4g36240.1, supported by
cDNA: gi_18252998 [Arabidopsis thaliana]
gi|7486030|pir||T04593 hypothetical protein F23E13.130 -
Arabidopsis thaliana gi|2961383|emb|CAA18130.1| putative
protein [Arabidopsis thaliana]
gi|7270577|emb|CAB80295.1| putative protein [Arabidopsis
thaliana] gi|18252999|gb|AAL62426.1| putative protein
[Arabidopsis thaliana] gi|21389681|gb|AAM48039.1|
putative protein [Arabidopsis thaliana]
Length = 238
Score = 132 bits (333), Expect = 3e-30
Identities = 68/108 (62%), Positives = 75/108 (68%)
Frame = -1
Query: 588 SMAPPLSGEGFRQPPMKKQKKKPEAQDHTGGPQFQRRCSHCQVQKTPQWRAGPLGPKTLC 409
S +P LS R+ +QK Q +R CSHC VQKTPQWR GPLG KTLC
Sbjct: 129 SPSPLLSTAVARRKKRGRQKVDASYGGVVQQQQLRRCCSHCGVQKTPQWRMGPLGAKTLC 188
Query: 408 NACGVRFKSGRLFPEYRPACSPTFSGEIHSNSHRKVLEMRRRKETAEP 265
NACGVRFKSGRL PEYRPACSPTF+ EIHSNSHRKVLE+R K A+P
Sbjct: 189 NACGVRFKSGRLLPEYRPACSPTFTNEIHSNSHRKVLELRLMK-VADP 235
>pir||T05288 GATA-binding transcription factor homolog 3 [imported] -
Arabidopsis thaliana
Length = 269
Score = 131 bits (330), Expect = 6e-30
Identities = 59/74 (79%), Positives = 62/74 (83%)
Frame = -1
Query: 489 FQRRCSHCQVQKTPQWRAGPLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSGEIHSNSH 310
FQRRCSHC TPQWR GP+GPKTLCNACGVRFKSGRL PEYRPA SPTFS EIHSN H
Sbjct: 178 FQRRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFKSGRLCPEYRPADSPTFSNEIHSNLH 237
Query: 309 RKVLEMRRRKETAE 268
RKVLE+R+ KE E
Sbjct: 238 RKVLELRKSKELGE 251
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 540,571,912
Number of Sequences: 1393205
Number of extensions: 12833082
Number of successful extensions: 46438
Number of sequences better than 10.0: 275
Number of HSP's better than 10.0 without gapping: 43101
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 46167
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 22569056698
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)