KCC003102A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KCC003102A_C01 KCC003102A_c01
aggcgatggaggacatgGTGGTGCGGTACGGGGGCACGGTGCGCAACAGCCAGGGCGATG
TGGTGCAGTTCCTGTATGGCGAGGACGGCATGGACGCCGTGCGCATCGAGGGCCAGATGT
TCGAGTACCTGAAGTGGGACCCCGCCAAGCTGGACAAGGCGTACCGCATCGACACCACCC
GCGACATGCCGCCCGACTGGCTGTCCGCGGAGGAGTACGAGGCGCTACGGACCGACCCCG
CCGTGGAGCAGGCGATGCGTGACGAGATGGCCCAGATCAAGGAGGACTTGCGCGTGCTGC
GCGAGGAGGTGCTGACCAACGGTGATGAGAAGGTCAACATCCCGCTCAACCTAGCGCGAC
TCATCTGGAATGCCCAGACCAAGTTCAACTGCAAGCCGCACAGGCCCGGGTGGACGGGGC
TGCAGGTCAAGGAGGTCATCACCAAGGTGCGGGAGCTGTGCGAGCGGCTGGTGGTGGTGA
TTGGCAGCGACGGGCTGTCGGTGGAGGCGCAGCGGAACGCCACCATCATGTTCCACTCGC
TGGTGCGCATGCA


Nr search


BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KCC003102A_C01 KCC003102A_c01
         (553 letters)

Database: nr 
           1,537,769 sequences; 498,525,298 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM45153.1|AF395835_1 RNA polymerase II largest subunit [Cerc...   158  5e-38
gb|AAG48836.1|AC084218_6 similar to Arabidopsis thaliana DNA-dir...   147  1e-34
pir||S14183 DNA-directed RNA polymerase (EC 2.7.7.6) largest cha...   143  1e-33
gb|AAC49712.1| RNA polymerase II largest subunit [Spirogyra sp.]      141  7e-33
ref|NP_195305.1| DNA-directed RNA polymerase (EC 2.7.7.6) II lar...   137  7e-32

>gb|AAM45153.1|AF395835_1 RNA polymerase II largest subunit [Cercomonas ATCC50319]
          Length = 1014

 Score =  158 bits (399), Expect = 5e-38
 Identities = 79/182 (43%), Positives = 122/182 (66%)
 Frame = +3

Query: 3    AMEDMVVRYGGTVRNSQGDVVQFLYGEDGMDAVRIEGQMFEYLKWDPAKLDKAYRIDTTR 182
            AMED++++Y  TVRNS GD++QF YGEDGMD V IE Q  + LK D A+  K Y++D +R
Sbjct: 778  AMEDVMIKYDATVRNSLGDIIQFAYGEDGMDGVYIEKQKLDSLKMDNARFLKTYQLDVSR 837

Query: 183  DMPPDWLSAEEYEALRTDPAVEQAMRDEMAQIKEDLRVLREEVLTNGDEKVNIPLNLARL 362
                D++ AE  E L   P    A+  E+ Q+++D ++LR+++   G++ V++P+NL RL
Sbjct: 838  PEQLDFMDAEVREHLLRTPEAHDALETELKQLRDDRQLLRDKIQPTGEDMVHLPVNLKRL 897

Query: 363  IWNAQTKFNCKPHRPGWTGLQVKEVITKVRELCERLVVVIGSDGLSVEAQRNATIMFHSL 542
            IWNAQ +F+    R   + ++ ++VI  V+ L ERL+VV G+D LS++AQ NAT +F  +
Sbjct: 898  IWNAQKRFHV--DRFSKSNIEPQQVIADVKSLAERLIVVRGTDPLSLQAQTNATTLFKIM 955

Query: 543  VR 548
            +R
Sbjct: 956  LR 957

>gb|AAG48836.1|AC084218_6 similar to Arabidopsis thaliana DNA-directed RNA polymerase (EC
            2.7.7.6) II largest chain (JDMU1) [Oryza sativa]
          Length = 1741

 Score =  147 bits (370), Expect = 1e-34
 Identities = 79/183 (43%), Positives = 113/183 (61%), Gaps = 1/183 (0%)
 Frame = +3

Query: 3    AMEDMVVRYGGTVRNSQGDVVQFLYGEDGMDAVRIEGQMFEYLKWDPAKLDKAYRID-TT 179
            AMED++V+Y GTVRNS GDV+QFLYGEDGMDA+ IE Q  + LK   A+ D  +R +   
Sbjct: 810  AMEDIMVKYDGTVRNSLGDVIQFLYGEDGMDAIWIESQKLDSLKMKKAEFDNVFRYELDD 869

Query: 180  RDMPPDWLSAEEYEALRTDPAVEQAMRDEMAQIKEDLRVLREEVLTNGDEKVNIPLNLAR 359
             +  P++LS +  E L+T   +      E+ +++ D   L  E+ T GD    +P+NL R
Sbjct: 870  ENWKPNYLSTQHAEDLKTISEIRNVFEAEVQKLEADRFQLGTEIATTGDNTWPMPVNLKR 929

Query: 360  LIWNAQTKFNCKPHRPGWTGLQVKEVITKVRELCERLVVVIGSDGLSVEAQRNATIMFHS 539
            LIWNAQ  F     RP  + +   E++  + +L ERL VV G D +S+EAQ+NAT+ F+ 
Sbjct: 930  LIWNAQKTFKIDLRRP--SDMHPMEIVDAIDKLQERLKVVPGDDDISIEAQKNATLFFNI 987

Query: 540  LVR 548
            L+R
Sbjct: 988  LLR 990

>pir||S14183 DNA-directed RNA polymerase (EC 2.7.7.6) largest chain  (isoform C)
           - soybean (fragment) gi|18736|emb|CAA36736.1|
           DNA-directed RNA polymerase [Glycine max]
          Length = 977

 Score =  143 bits (361), Expect = 1e-33
 Identities = 77/183 (42%), Positives = 113/183 (61%), Gaps = 1/183 (0%)
 Frame = +3

Query: 3   AMEDMVVRYGGTVRNSQGDVVQFLYGEDGMDAVRIEGQMFEYLKWDPAKLDKAYRID-TT 179
           AMED++++Y GTVRNS GDV+QFLYGEDGMDA+ IE Q  + LK    + D+ +R +   
Sbjct: 54  AMEDIMLKYDGTVRNSLGDVIQFLYGEDGMDAIWIETQKLDTLKMKKTEFDRVFRYEFDE 113

Query: 180 RDMPPDWLSAEEYEALRTDPAVEQAMRDEMAQIKEDLRVLREEVLTNGDEKVNIPLNLAR 359
            +  P+++  E  E L+T          E+ +++ D   L  E+ +NGD  + +P+NL R
Sbjct: 114 ENWKPNYMLQEPVEDLKTIREFRNVFEAEVQKLEADRHQLAIEIASNGDNSLPLPVNLKR 173

Query: 360 LIWNAQTKFNCKPHRPGWTGLQVKEVITKVRELCERLVVVIGSDGLSVEAQRNATIMFHS 539
           LIWNAQ  F     RP  + +   E++  + +L ERL VV G D LS EAQ+NAT++F+ 
Sbjct: 174 LIWNAQKTFKVDFRRP--SDMHPMEIVEAIDKLQERLKVVPGEDALSQEAQKNATLLFNI 231

Query: 540 LVR 548
           L+R
Sbjct: 232 LLR 234

>gb|AAC49712.1| RNA polymerase II largest subunit [Spirogyra sp.]
          Length = 613

 Score =  141 bits (355), Expect = 7e-33
 Identities = 77/183 (42%), Positives = 112/183 (61%), Gaps = 1/183 (0%)
 Frame = +3

Query: 3   AMEDMVVRYGGTVRNSQGDVVQFLYGEDGMDAVRIEGQMFEYLKWDPAKLDKAYRIDTTR 182
           AMED++V+Y GTVRNS GDV+QFLYGEDGMDAV IE Q    +K + +  D  YR +  +
Sbjct: 372 AMEDVMVKYDGTVRNSLGDVIQFLYGEDGMDAVWIESQSLPSMKMNKSTFDATYRYEIDQ 431

Query: 183 -DMPPDWLSAEEYEALRTDPAVEQAMRDEMAQIKEDLRVLREEVLTNGDEKVNIPLNLAR 359
            D  PD++  +  +  +      Q M  E+ Q+++D R L  E+   GD    +P+N+ R
Sbjct: 432 EDWSPDYMDPQFAKDAKIVAEFRQVMDAEVLQLEQDRRTLGLEIAPTGDSSWPLPVNIKR 491

Query: 360 LIWNAQTKFNCKPHRPGWTGLQVKEVITKVRELCERLVVVIGSDGLSVEAQRNATIMFHS 539
           LIWNAQ  F     +P  + +   +V+  + +L ERL VV+G D +S EAQ+NAT+ F+ 
Sbjct: 492 LIWNAQKIFKIDLRKP--SDMNPMDVVDGMDKLQERLKVVVGDDHISREAQKNATLFFNC 549

Query: 540 LVR 548
           L+R
Sbjct: 550 LLR 552

>ref|NP_195305.1| DNA-directed RNA polymerase (EC 2.7.7.6) II largest chain
            [Arabidopsis thaliana] gi|12644149|sp|P18616|RPB1_ARATH
            DNA-directed RNA polymerase II largest subunit
            gi|25288404|pir||G85422 hypothetical protein AT4g35800
            [imported] - Arabidopsis thaliana
            gi|4883421|emb|CAA21466.2| DNA-directed RNA polymerase
            (EC 2.7.7.6) II largest chain [Arabidopsis thaliana]
            gi|7270532|emb|CAB81489.1| DNA-directed RNA polymerase
            (EC 2.7.7.6) II largest chain [Arabidopsis thaliana]
          Length = 1840

 Score =  137 bits (346), Expect = 7e-32
 Identities = 78/183 (42%), Positives = 112/183 (60%), Gaps = 1/183 (0%)
 Frame = +3

Query: 3    AMEDMVVRYGGTVRNSQGDVVQFLYGEDGMDAVRIEGQMFEYLKWDPAKLDKAYRID-TT 179
            AMED++V+Y GTVRNS GDV+QFLYGEDGMDAV IE Q  + LK   ++ D+ ++ +   
Sbjct: 864  AMEDIMVKYDGTVRNSLGDVIQFLYGEDGMDAVWIESQKLDSLKMKKSEFDRTFKYEIDD 923

Query: 180  RDMPPDWLSAEEYEALRTDPAVEQAMRDEMAQIKEDLRVLREEVLTNGDEKVNIPLNLAR 359
             +  P +LS E  E L+    +      E ++++ D   L  E+ TNGD    +P+N+ R
Sbjct: 924  ENWNPTYLSDEHLEDLKGIRELRDVFDAEYSKLETDRFQLGTEIATNGDSTWPLPVNIKR 983

Query: 360  LIWNAQTKFNCKPHRPGWTGLQVKEVITKVRELCERLVVVIGSDGLSVEAQRNATIMFHS 539
             IWNAQ  F     +   + +   E++  V +L ERL+VV G D LSVEAQ+NAT+ F+ 
Sbjct: 984  HIWNAQKTFKIDLRK--ISDMHPVEIVDAVDKLQERLLVVPGDDALSVEAQKNATLFFNI 1041

Query: 540  LVR 548
            L+R
Sbjct: 1042 LLR 1044



EST assemble image


clone accession position
1 LCL077a05_r AV630308 1 330
2 LCL013e04_r AV626675 18 553
3 LCL064f10_r AV629727 62 315




Chlamydomonas reinhardtii
Kazusa DNA Research Institute