KMC002718A_c01

Fasta Sequence
>KMC002718A_C01 KMC002718A_c01
AGTTAAACGGGGTGCATGTCATTCACTACTCCAAAATAAACATTCAAGTAAACAAACATC
CAGGAACCGAACTGATATATACACCACAGAGTTCACAAGTTTTAGGTGAAAGAGCTAGAA
ATGCTACATGTTAAACAGTTGTTGTCTCATCTAGCAAAGTTCAGTTTCAAATTTGAGATT
AAGGCCCATGCTAAAAAGCTCTGGGCTCAATAACTTAGCTCCATAAGGGACATTGGCTAC
CACAAGGTCATCAGCAGATTCACAGAAGCGGCAATAGGGACCTCGTATTTTACGGCCACC
AGGCCACCGGCCTTAAGATCACATTTGCAACATTTTTGGCATTTACGGCAAATGTGTATC
TGGGACGAGTCACTCAGAGTGAAGAGGCGTTCATACAGGTTTGCTGATGCGGCCATGTGC
TATGAGGCAATCACGCTCCATTTCACCAAACTTGATACCTCCAAATCGCTTCCGGTCAGC
AACAGGCTGTCGGGTTAGCGGGT
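
The BLASTX hits reported for this contig are all on the reverse strand (Frame = -3 and Frame = -1). The sketch below shows one way to reproduce those translations from the FASTA sequence. It assumes the standard genetic code and the usual BLAST frame convention, in which frame -1 begins at the last nucleotide of the query, -2 at the second to last, and -3 at the third to last. The names `translate_frame` and `reverse_complement` are illustrative helpers, not part of any BLAST tool.

```python
# Contig sequence copied from the FASTA record above (503 nt).
SEQ = (
    "AGTTAAACGGGGTGCATGTCATTCACTACTCCAAAATAAACATTCAAGTAAACAAACATC"
    "CAGGAACCGAACTGATATATACACCACAGAGTTCACAAGTTTTAGGTGAAAGAGCTAGAA"
    "ATGCTACATGTTAAACAGTTGTTGTCTCATCTAGCAAAGTTCAGTTTCAAATTTGAGATT"
    "AAGGCCCATGCTAAAAAGCTCTGGGCTCAATAACTTAGCTCCATAAGGGACATTGGCTAC"
    "CACAAGGTCATCAGCAGATTCACAGAAGCGGCAATAGGGACCTCGTATTTTACGGCCACC"
    "AGGCCACCGGCCTTAAGATCACATTTGCAACATTTTTGGCATTTACGGCAAATGTGTATC"
    "TGGGACGAGTCACTCAGAGTGAAGAGGCGTTCATACAGGTTTGCTGATGCGGCCATGTGC"
    "TATGAGGCAATCACGCTCCATTTCACCAAACTTGATACCTCCAAATCGCTTCCGGTCAGC"
    "AACAGGCTGTCGGGTTAGCGGGT"
)

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq, frame):
    """Translate seq in a BLAST-style frame (+1..+3 or -1..-3); '*' marks stops."""
    strand = seq if frame > 0 else reverse_complement(seq)
    start = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(start, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    print(len(SEQ))                        # 503
    print(translate_frame(SEQ, -3)[:17])   # PLTRQPVADRKRFGGIK
    print(translate_frame(SEQ, -1)[31:36]) # ASANL
```

The frame -3 translation starts at query position 501, and residue 31 (0-based) of the frame -1 translation corresponds to query position 410, which matches the start coordinates of the alignments in the report.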


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002718A_C01 KMC002718A_c01
         (503 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB01854.1| DNA-directed RNA polymerase, subunit B [Arabidop...   117  7e-26
ref|NP_189020.1| DNA-directed RNA polymerase subunit, putative; ...   117  7e-26
dbj|BAB02021.1| DNA-dependent RNA polymerase II [Arabidopsis tha...   116  1e-25
ref|NP_188437.1| DNA-directed RNA polymerase II second largest c...   116  1e-25
sp|Q42877|RPB2_LYCES DNA-directed RNA polymerase II 135 kDa poly...    58  7e-08

>dbj|BAB01854.1| DNA-directed RNA polymerase, subunit B [Arabidopsis thaliana]
          Length = 1172

 Score =  117 bits (293), Expect = 7e-26
 Identities = 67/120 (55%), Positives = 74/120 (60%), Gaps = 4/120 (3%)
 Frame = -3

Query: 501  PLTRQPVADRKRFGGIKFGEMERDCLIAHGRISKPV*TPLHSE*LV----PDTHLP*MPK 334
            PLTRQPVADRKRFGGIKFGEMERDCLIAHG  +      LH            H+    K
Sbjct: 1058 PLTRQPVADRKRFGGIKFGEMERDCLIAHGASAN-----LHERLFTLSDSSQMHICRKCK 1112

Query: 333  MLQM*S*GRWPGGRKIRGPYCRFCESADDLVVANVPYGAKLLSPELFSMGLNLKFETELC 154
                        GRKIRGPYCR C S+D +V   VPYGAKLL  ELFSMG+ L F+T+LC
Sbjct: 1113 TYANVIERTPSSGRKIRGPYCRVCVSSDHVVRVYVPYGAKLLCQELFSMGITLNFDTKLC 1172

 Score = 50.1 bits (118), Expect = 1e-05
 Identities = 22/25 (88%), Positives = 25/25 (100%)
 Frame = -1

Query: 410  ASANLYERLFTLSDSSQIHICRKCQ 336
            ASANL+ERLFTLSDSSQ+HICRKC+
Sbjct: 1088 ASANLHERLFTLSDSSQMHICRKCK 1112

>ref|NP_189020.1| DNA-directed RNA polymerase subunit, putative; protein id:
            At3g23780.1 [Arabidopsis thaliana]
          Length = 946

 Score =  117 bits (293), Expect = 7e-26
 Identities = 67/120 (55%), Positives = 74/120 (60%), Gaps = 4/120 (3%)
 Frame = -3

Query: 501  PLTRQPVADRKRFGGIKFGEMERDCLIAHGRISKPV*TPLHSE*LV----PDTHLP*MPK 334
            PLTRQPVADRKRFGGIKFGEMERDCLIAHG  +      LH            H+    K
Sbjct: 832  PLTRQPVADRKRFGGIKFGEMERDCLIAHGASAN-----LHERLFTLSDSSQMHICRKCK 886

Query: 333  MLQM*S*GRWPGGRKIRGPYCRFCESADDLVVANVPYGAKLLSPELFSMGLNLKFETELC 154
                        GRKIRGPYCR C S+D +V   VPYGAKLL  ELFSMG+ L F+T+LC
Sbjct: 887  TYANVIERTPSSGRKIRGPYCRVCVSSDHVVRVYVPYGAKLLCQELFSMGITLNFDTKLC 946

 Score = 50.1 bits (118), Expect = 1e-05
 Identities = 22/25 (88%), Positives = 25/25 (100%)
 Frame = -1

Query: 410 ASANLYERLFTLSDSSQIHICRKCQ 336
           ASANL+ERLFTLSDSSQ+HICRKC+
Sbjct: 862 ASANLHERLFTLSDSSQMHICRKCK 886

>dbj|BAB02021.1| DNA-dependent RNA polymerase II [Arabidopsis thaliana]
          Length = 1119

 Score =  116 bits (291), Expect = 1e-25
 Identities = 66/120 (55%), Positives = 74/120 (61%), Gaps = 4/120 (3%)
 Frame = -3

Query: 501  PLTRQPVADRKRFGGIKFGEMERDCLIAHGRISKPV*TPLHSE*LV----PDTHLP*MPK 334
            PLTRQPVADRKRFGGI+FGEMERDCLIAHG  +      LH            H+    K
Sbjct: 1005 PLTRQPVADRKRFGGIRFGEMERDCLIAHGASAN-----LHERLFTLSDSSQMHICRKCK 1059

Query: 333  MLQM*S*GRWPGGRKIRGPYCRFCESADDLVVANVPYGAKLLSPELFSMGLNLKFETELC 154
                        GRKIRGPYCR C S+D +V   VPYGAKLL  ELFSMG+ L F+T+LC
Sbjct: 1060 TYANVIERTPSSGRKIRGPYCRVCASSDHVVRVYVPYGAKLLCQELFSMGITLNFDTKLC 1119

 Score = 50.1 bits (118), Expect = 1e-05
 Identities = 22/25 (88%), Positives = 25/25 (100%)
 Frame = -1

Query: 410  ASANLYERLFTLSDSSQIHICRKCQ 336
            ASANL+ERLFTLSDSSQ+HICRKC+
Sbjct: 1035 ASANLHERLFTLSDSSQMHICRKCK 1059

>ref|NP_188437.1| DNA-directed RNA polymerase II second largest chain, putative;
            protein id: At3g18090.1 [Arabidopsis thaliana]
          Length = 1038

 Score =  116 bits (291), Expect = 1e-25
 Identities = 66/120 (55%), Positives = 74/120 (61%), Gaps = 4/120 (3%)
 Frame = -3

Query: 501  PLTRQPVADRKRFGGIKFGEMERDCLIAHGRISKPV*TPLHSE*LV----PDTHLP*MPK 334
            PLTRQPVADRKRFGGI+FGEMERDCLIAHG  +      LH            H+    K
Sbjct: 924  PLTRQPVADRKRFGGIRFGEMERDCLIAHGASAN-----LHERLFTLSDSSQMHICRKCK 978

Query: 333  MLQM*S*GRWPGGRKIRGPYCRFCESADDLVVANVPYGAKLLSPELFSMGLNLKFETELC 154
                        GRKIRGPYCR C S+D +V   VPYGAKLL  ELFSMG+ L F+T+LC
Sbjct: 979  TYANVIERTPSSGRKIRGPYCRVCASSDHVVRVYVPYGAKLLCQELFSMGITLNFDTKLC 1038

 Score = 50.1 bits (118), Expect = 1e-05
 Identities = 22/25 (88%), Positives = 25/25 (100%)
 Frame = -1

Query: 410  ASANLYERLFTLSDSSQIHICRKCQ 336
            ASANL+ERLFTLSDSSQ+HICRKC+
Sbjct: 954  ASANLHERLFTLSDSSQMHICRKCK 978

>sp|Q42877|RPB2_LYCES DNA-directed RNA polymerase II 135 kDa polypeptide (RNA polymerase II
            subunit 2) gi|2129929|pir||S65068 DNA-directed RNA
            polymerase (EC 2.7.7.6) II second largest chain - tomato
            gi|1049068|gb|AAC49273.1| RNA polymerase II subunit 2
          Length = 1191

 Score = 57.8 bits (138), Expect = 7e-08
 Identities = 37/106 (34%), Positives = 54/106 (50%)
 Frame = -3

Query: 498  LTRQPVADRKRFGGIKFGEMERDCLIAHGRISKPV*TPLHSE*LVPDTHLP*MPKMLQM* 319
            LTRQP   R R GG++FGEMERDC+IAHG  +  +   L  +      H+     ++ + 
Sbjct: 1074 LTRQPAEGRSRDGGLRFGEMERDCMIAHG-AAHFLKERLFDQSDAYRVHVCERCGLIAI- 1131

Query: 318  S*GRWPGGRKIRGPYCRFCESADDLVVANVPYGAKLLSPELFSMGL 181
                     K     CR C++  D+V  ++PY  KLL  EL +M +
Sbjct: 1132 ------ANLKKNSFECRGCKNKTDIVQVHIPYACKLLFQELMAMAI 1171

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 410,912,021
Number of Sequences: 1393205
Number of extensions: 7837415
Number of successful extensions: 21951
Number of sequences better than 10.0: 337
Number of HSP's better than 10.0 without gapping: 20846
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 21945
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 15362785481
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
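
The search-space figures and scores in this report can be cross-checked with the standard Karlin-Altschul relations. The sketch below is a consistency check under stated assumptions, not a description of BLAST's internal code: it assumes the query length is counted in translated residues (503 // 3 = 167), that the effective HSP length is subtracted once from the query and once per database sequence, and that the gapped Lambda and K apply to the reported alignments. Because Lambda and K are printed in rounded form, the bit scores and E-values it gives are only approximate.

```python
import math

# Values copied from the BLASTX report above.
QUERY_NT = 503              # query length in nucleotides
DB_LETTERS = 448_689_247    # total letters in nr
DB_SEQS = 1_393_205         # number of sequences in nr
EFF_HSP_LEN = 114           # "effective HSP length"
LAMBDA_GAPPED = 0.267       # gapped Lambda (rounded in the report)
K_GAPPED = 0.0410           # gapped K (rounded in the report)


def effective_search_space():
    """Return (effective query length, effective database length, product)."""
    # Assumption: query length is measured in translated residues.
    eff_query = QUERY_NT // 3 - EFF_HSP_LEN
    eff_db = DB_LETTERS - DB_SEQS * EFF_HSP_LEN
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw score S to bits: S' = (Lambda * S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(raw_score):
    """Expected number of chance hits: E = search_space * 2 ** (-S')."""
    _, _, space = effective_search_space()
    return space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print(effective_search_space())  # (53, 289863877, 15362785481)
    # Raw scores shown in parentheses in the alignments above.
    for raw in (293, 291, 138, 118):
        print(raw, round(bit_score(raw), 1), f"{expect(raw):.0e}")
```

The first line of output reproduces the "effective length of database" and "effective search space used" values exactly. The per-hit bit scores and E-values agree with the report only to within the rounding of Lambda and K.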


EST assemble image


    clone          accession   position
1   MPDL047d01_f   AV778874      1  165
2   SPDL035b04_f   BP054176     33  465
3   SPDL091d08_f   BP057707     33  155
4   MRL018d03_f    BP084632     41  539
5   GNLf004d11     BP075020     47  351




Lotus japonicus
Kazusa DNA Research Institute