KMC000443A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC000443A_C01 KMC000443A_c01
gttttgGAACGACACAACCTAAACAGATATTACATTTAACAGCTTACATATTCAATACAT
CTGGTATGGGGTATAAGTCTTTACAAGCTCCTCGTTGCTACTTCCACTAGATCTTTCTTT
GCCTGACGTAATCGGAGGCAAATAAAGTATTAGTGGAAGTAGCAAAAAGAAGCTAAGGGT
AAGCACCCCCTCCATCTTTATGATAAAAAGAGAAAAGTTAATAAGAATGAAAATGTACTA
CTATCCCCTTTCATCTTCTGTACTACCTCTAACATTGAATAATACACAACACAAGATGGC
TCCAGATGGTTCACCATTTAGCTAGAGTAATGCCAATGACAACTCTCAACTCACACAAAA
AACAACTGTGCCTCTCTACTAATGATTTGTCTTCTGACTCTTTTTTAGTTCAACAATGTC
ATCTTGTCCCAGCCACTTTTTCTTCTGGATCCTGAACTTGCCCGGATCAACAATCTCAAG
AGCACGAAGAATGCATTTGCGGTATGAAAAATCTTCAACACTTTCATCACTTCGTATTAT
ATGGAAGCAGCGGCTATCCTTATGTA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000443A_C01 KMC000443A_c01
         (566 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_176490.1| RNA polymerase IIA large subunit, putative; pro...    52  7e-06
pir||D96655 hypothetical protein F16M19.19 [imported] - Arabidop...    52  7e-06
gb|AAM21312.1|AF371327_1 EMB514 [Arabidopsis thaliana] gi|215934...    38  0.10
ref|NP_201050.1| putative protein; protein id: At5g62440.1 [Arab...    38  0.10
ref|NP_181533.1| unknown protein; protein id: At2g40040.1 [Arabi...    33  2.5

>ref|NP_176490.1| RNA polymerase IIA large subunit, putative; protein id: At1g63020.1
            [Arabidopsis thaliana]
          Length = 1432

 Score = 51.6 bits (122), Expect = 7e-06
 Identities = 22/45 (48%), Positives = 29/45 (63%)
 Frame = -3

Query: 564  HKDSRCFHIIRSDESVEDFSYRKCILRALEIVDPGKFRIQKKKWL 430
            H DS CF ++R D + EDFSY KC+L A +I+ P K    K K+L
Sbjct: 1374 HGDSCCFEVVRIDGTFEDFSYHKCVLGATKIIAPKKMNFYKSKYL 1418

>pir||D96655 hypothetical protein F16M19.19 [imported] - Arabidopsis thaliana
            gi|8493592|gb|AAF75815.1|AC011000_18 Contains similarity
            to DNA-directed RNA Polymerase alpha-subunit from
            Methanococcus jannaschii gi|3915860 and contains
            Ribosomal protein L36 PF|00444 and RNA-Polymerase
            alpha-subunit PF|00623 domains.  EST gb|AA597311 comes
            from this gene. [Arabidopsis thaliana]
            gi|12323252|gb|AAG51604.1|AC010795_8 RNA polymerase IIA
            largest subunit, putative; 12353-6556 [Arabidopsis
            thaliana]
          Length = 1453

 Score = 51.6 bits (122), Expect = 7e-06
 Identities = 22/45 (48%), Positives = 29/45 (63%)
 Frame = -3

Query: 564  HKDSRCFHIIRSDESVEDFSYRKCILRALEIVDPGKFRIQKKKWL 430
            H DS CF ++R D + EDFSY KC+L A +I+ P K    K K+L
Sbjct: 1395 HGDSCCFEVVRIDGTFEDFSYHKCVLGATKIIAPKKMNFYKSKYL 1439

>gb|AAM21312.1|AF371327_1 EMB514 [Arabidopsis thaliana] gi|21593438|gb|AAM65405.1| unknown
           [Arabidopsis thaliana]
          Length = 202

 Score = 37.7 bits (86), Expect = 0.10
 Identities = 13/28 (46%), Positives = 22/28 (78%)
 Frame = -3

Query: 555 SRCFHIIRSDESVEDFSYRKCILRALEI 472
           SRCF ++R D++ +DFS+RKC+ + L +
Sbjct: 137 SRCFFLVREDDTADDFSFRKCVDQILPL 164

>ref|NP_201050.1| putative protein; protein id: At5g62440.1 [Arabidopsis thaliana]
           gi|10178075|dbj|BAB11494.1| gene_id:K19B1.5~similar to
           unknown protein~sp|Q42463 [Arabidopsis thaliana]
          Length = 100

 Score = 37.7 bits (86), Expect = 0.10
 Identities = 13/28 (46%), Positives = 22/28 (78%)
 Frame = -3

Query: 555 SRCFHIIRSDESVEDFSYRKCILRALEI 472
           SRCF ++R D++ +DFS+RKC+ + L +
Sbjct: 35  SRCFFLVREDDTADDFSFRKCVDQILPL 62

>ref|NP_181533.1| unknown protein; protein id: At2g40040.1 [Arabidopsis thaliana]
           gi|25408660|pir||E84824 hypothetical protein At2g40040
           [imported] - Arabidopsis thaliana
           gi|2088657|gb|AAB95289.1| unknown protein [Arabidopsis
           thaliana]
          Length = 839

 Score = 33.1 bits (74), Expect = 2.5
 Identities = 12/23 (52%), Positives = 18/23 (78%)
 Frame = -3

Query: 558 DSRCFHIIRSDESVEDFSYRKCI 490
           DSRCF ++ +D + +DFSYRK +
Sbjct: 668 DSRCFFVVSTDGAKQDFSYRKSL 690

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 462,980,199
Number of Sequences: 1393205
Number of extensions: 9136697
Number of successful extensions: 20171
Number of sequences better than 10.0: 15
Number of HSP's better than 10.0 without gapping: 19534
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 20168
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20669577624
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 GNLf001a05 BP074826 1 334
2 GENLf060h02 BP065581 7 505
3 SPDL023h12_f BP053466 7 540
4 GENLf016h01 BP063235 8 494
5 SPDL043b10_f BP054693 18 578




Lotus japonicus
Kazusa DNA Research Institute