KMC004527A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004527A_C01 KMC004527A_c01
AGTTTAAAAAAACTATATTAGATTAACCAAATCAACAAAGCTAGTGGATACAAACAATCA
AATCAGTGCCTTCCTAATAATATCCGTTACCTGCCTAGCTCAGCTAGCTGATTAGCACTA
ATTAAACCACTAAAATATACTATTATACAAAAACACGAACCGGGTTCAAACCGGTTGGTG
GGGGTGTACCTAATCTACTACTATTTCCCCTAATCTAGATGGAAAATTTTCAGCATGTGG
GAACCATTTGGGTCGGAGTCAAACCGGCATCCGGTCCAACCACCTCCTTCCTCCTCCTTA
TCTCCAACACTTTCCGGTGGCTGTTGGAGTGAATTTCACCGCAGAAGGTGGGGCTGCACG
CCGGTCTATACTCCGAACAAAGCCGACCGGACTTGTAGCGAACCCCGCACGCGTTGCAGA
GAGTCTTGGCGCCTAGCGGACCGGTCCTCCACTGCGGCGTCTTCTGCACCTGGCAATGAC
TGCACCGCCGCTGCGACTGCGCACCGGCGCCTTCCACCTCCTCCGCCTTCCTCTTCAGCT
TCTTCGCCGGCG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004527A_C01 KMC004527A_c01
         (552 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_201433.1| GATA zinc finger protein; protein id: At5g66320...   139  2e-32
ref|NP_190677.1| GATA zinc finger protein; protein id: At3g51080...   130  1e-29
ref|NP_195347.1| GATA zinc finger protein; protein id: At4g36240...   128  5e-29
pir||T05288 GATA-binding transcription factor homolog 3 [importe...   127  9e-29
ref|NP_195194.1| GATA transcription factor 3; protein id: At4g34...   127  9e-29

>ref|NP_201433.1| GATA zinc finger protein; protein id: At5g66320.1 [Arabidopsis
           thaliana] gi|10177426|dbj|BAB10711.1| GATA-binding
           transcription factor-like protein [Arabidopsis thaliana]
           gi|22531223|gb|AAM97115.1| GATA-binding transcription
           factor-like protein [Arabidopsis thaliana]
          Length = 339

 Score =  139 bits (350), Expect = 2e-32
 Identities = 66/92 (71%), Positives = 72/92 (77%), Gaps = 3/92 (3%)
 Frame = -3

Query: 550 PAKKLKRKAEEV---EGAGAQSQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRL 380
           P K  KR AE V   E    Q QR+CSHC VQKTPQWR GP+GAKTLCNACGVRYKSGRL
Sbjct: 226 PKKHKKRSAESVFSGELQQLQPQRKCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRL 285

Query: 379 CSEYRPACSPTFCGEIHSNSHRKVLEIRRRKE 284
             EYRPACSPTF  E+HSN HRKV+E+RR+KE
Sbjct: 286 LPEYRPACSPTFSSELHSNHHRKVIEMRRKKE 317

>ref|NP_190677.1| GATA zinc finger protein; protein id: At3g51080.1, supported by
           cDNA: gi_17381183 [Arabidopsis thaliana]
           gi|11358891|pir||T45739 transcription factor-like
           protein - Arabidopsis thaliana
           gi|6562260|emb|CAB62630.1| transcription factor-like
           protein [Arabidopsis thaliana]
           gi|17381184|gb|AAL36404.1| putative transcription factor
           [Arabidopsis thaliana] gi|21436205|gb|AAM51390.1|
           putative transcription factor [Arabidopsis thaliana]
          Length = 312

 Score =  130 bits (326), Expect = 1e-29
 Identities = 58/87 (66%), Positives = 67/87 (76%)
 Frame = -3

Query: 544 KKLKRKAEEVEGAGAQSQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLCSEYR 365
           KK+ + A + +       R+C HC VQKTPQWR GPLGAKTLCNACGVRYKSGRL  EYR
Sbjct: 203 KKVWKNAGQTQTQTQTQTRQCGHCGVQKTPQWRAGPLGAKTLCNACGVRYKSGRLLPEYR 262

Query: 364 PACSPTFCGEIHSNSHRKVLEIRRRKE 284
           PACSPTF  E+HSN H KV+E+RR+KE
Sbjct: 263 PACSPTFSSELHSNHHSKVIEMRRKKE 289

>ref|NP_195347.1| GATA zinc finger protein; protein id: At4g36240.1, supported by
           cDNA: gi_18252998 [Arabidopsis thaliana]
           gi|7486030|pir||T04593 hypothetical protein F23E13.130 -
           Arabidopsis thaliana gi|2961383|emb|CAA18130.1| putative
           protein [Arabidopsis thaliana]
           gi|7270577|emb|CAB80295.1| putative protein [Arabidopsis
           thaliana] gi|18252999|gb|AAL62426.1| putative protein
           [Arabidopsis thaliana] gi|21389681|gb|AAM48039.1|
           putative protein [Arabidopsis thaliana]
          Length = 238

 Score =  128 bits (321), Expect = 5e-29
 Identities = 62/89 (69%), Positives = 68/89 (75%), Gaps = 3/89 (3%)
 Frame = -3

Query: 544 KKLKRKAEEVEGAGAQSQ---RRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLCS 374
           K+ ++K +   G   Q Q   R CSHC VQKTPQWR GPLGAKTLCNACGVR+KSGRL  
Sbjct: 143 KRGRQKVDASYGGVVQQQQLRRCCSHCGVQKTPQWRMGPLGAKTLCNACGVRFKSGRLLP 202

Query: 373 EYRPACSPTFCGEIHSNSHRKVLEIRRRK 287
           EYRPACSPTF  EIHSNSHRKVLE+R  K
Sbjct: 203 EYRPACSPTFTNEIHSNSHRKVLELRLMK 231

>pir||T05288 GATA-binding transcription factor homolog 3 [imported] -
           Arabidopsis thaliana
          Length = 269

 Score =  127 bits (319), Expect = 9e-29
 Identities = 59/85 (69%), Positives = 66/85 (77%)
 Frame = -3

Query: 493 QRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLCSEYRPACSPTFCGEIHSNSHR 314
           QRRCSHC    TPQWRTGP+G KTLCNACGVR+KSGRLC EYRPA SPTF  EIHSN HR
Sbjct: 179 QRRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFKSGRLCPEYRPADSPTFSNEIHSNLHR 238

Query: 313 KVLEIRRRKEVVGPDAGLTPTQMVP 239
           KVLE+R+ KE +G + G   T+  P
Sbjct: 239 KVLELRKSKE-LGEETGEASTKSDP 262

>ref|NP_195194.1| GATA transcription factor 3; protein id: At4g34680.1, supported by
           cDNA: gi_20466647 [Arabidopsis thaliana]
           gi|25352341|pir||H85408 GATA transcription factor 3
           [imported] - Arabidopsis thaliana
           gi|2959734|emb|CAA74001.1| homologous to GATA-binding
           transcription factors [Arabidopsis thaliana]
           gi|5678627|emb|CAA18847.2| GATA transcription factor 3
           [Arabidopsis thaliana] gi|7270419|emb|CAB80185.1| GATA
           transcription factor 3 [Arabidopsis thaliana]
          Length = 269

 Score =  127 bits (319), Expect = 9e-29
 Identities = 59/85 (69%), Positives = 66/85 (77%)
 Frame = -3

Query: 493 QRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLCSEYRPACSPTFCGEIHSNSHR 314
           QRRCSHC    TPQWRTGP+G KTLCNACGVR+KSGRLC EYRPA SPTF  EIHSN HR
Sbjct: 179 QRRCSHCGTNNTPQWRTGPVGPKTLCNACGVRFKSGRLCPEYRPADSPTFSNEIHSNLHR 238

Query: 313 KVLEIRRRKEVVGPDAGLTPTQMVP 239
           KVLE+R+ KE +G + G   T+  P
Sbjct: 239 KVLELRKSKE-LGEETGEASTKSDP 262

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 534,918,826
Number of Sequences: 1393205
Number of extensions: 13319886
Number of successful extensions: 63752
Number of sequences better than 10.0: 350
Number of HSP's better than 10.0 without gapping: 51550
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 61912
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 19234190289
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MR053f10_f BP080101 1 388
2 MFBL053c10_f BP043966 1 497
3 MWL046a02_f AV769346 6 486
4 SPD015h02_f BP045215 9 537
5 SPD011a07_f BP044845 25 552




Lotus japonicus
Kazusa DNA Research Institute