KMC004514A_c02

Fasta Sequence
>KMC004514A_C02 KMC004514A_c02
gtactaaaatgctccttttaaataaaggggaccaaaatgcacaaggggTATATATCAGGG
ATGAAAAGTGCAATTAAGCCAATAATGAATAATCTTTAAAATATGCATTGAATCATTCAA
GCAGCGACCACTTTCTTCTAAGCACCGAGCATAAACTTGCAATGAATACATCACTTGACA
AAGGTTTAACTTTCTTCAATTTGCGTTTCCTCGTCCTCAACCTCTTCTGTTGGCCACCAA
GAAGCAGGGCCCAGACCATCAAAATCTATTGATCCAGTGATCAACATAGGGGCAAATGGC
TTGGAGTCTTTTGGAAGTTCAACTGGAATCGGTGGGTAACTCACACTCTGAATTTGGAAC
TTGATCTCCTCATTAGTATCATCAATAGGAAAAGGTTGATCCCCATAATTCCAATACCAT
GTGCCTTTCTTGCTATTTATTGGTTCAGCCTCAAAGTGGTTTGGGCTGGGCAGGTTGTGA
GCAGGAATATAAATATCATCGAAGAATCCAAGGGACAAGCGCAAGCCGTCTTCATCAGAT
GAAAGAAGCTTTGCAGTAATAATCTCCCCTTCAAATGGACGAAACATAATCAAATTGAAT
ACCACCTTATAAGTCGGAGCACCATCA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004514A_C02 KMC004514A_c02
         (627 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_172164.1| RNA polymerase, putative; protein id: At1g06790...   166  2e-40
ref|NP_505625.1| DNA directed RNA polymerase III like [Caenorhab...    92  6e-18
ref|NP_084505.2| RIKEN cDNA 5031409G22 [Mus musculus] gi|2766296...    83  3e-15
ref|NP_612211.1| RNA polymerase III subunit RPC8 [Homo sapiens] ...    83  3e-15
dbj|BAB33335.1| KIAA1665 protein [Homo sapiens]                        83  3e-15

>ref|NP_172164.1| RNA polymerase, putative; protein id: At1g06790.1, supported by
           cDNA: gi_17529107 [Arabidopsis thaliana]
           gi|25312598|pir||G86202 hypothetical protein [imported]
           - Arabidopsis thaliana
           gi|7523702|gb|AAF63141.1|AC011001_11 hypothetical
           protein [Arabidopsis thaliana]
           gi|17529108|gb|AAL38764.1| putative RNA polymerase
           [Arabidopsis thaliana] gi|23296904|gb|AAN13199.1|
           putative RNA polymerase [Arabidopsis thaliana]
          Length = 204

 Score =  166 bits (421), Expect = 2e-40
 Identities = 75/142 (52%), Positives = 103/142 (71%), Gaps = 2/142 (1%)
 Frame = -2

Query: 626 DGAPTYKVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHFEA 447
           DGA TYKV   +++FRPF GE+I AK   SD +GLRL+LGFFDDIY+PA  +P PN  E 
Sbjct: 63  DGAATYKVGLRIVVFRPFVGEVIAAKFKESDANGLRLTLGFFDDIYVPAPLMPKPNRCEP 122

Query: 446 EPINSKKGTWYWNYGD--QPFPIDDTNEEIKFQIQSVSYPPIPVELPKDSKPFAPMLITG 273
           +P N K+  W W YG+  + + +DD   +IKF+++S+SYP +P E  +D+KPFAPM++TG
Sbjct: 123 DPYNRKQMIWVWEYGEPKEDYIVDDAC-QIKFRVESISYPSVPTERAEDAKPFAPMVVTG 181

Query: 272 SIDFDGLGPASWWPTEEVEDEE 207
           ++D DGLGP SWW + E  D+E
Sbjct: 182 NMDDDGLGPVSWWDSYEQVDQE 203
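
Frame = -2 means the query was translated from its reverse complement, starting one base in from the 3' end, which is why the Query coordinates run downward from 626. The sketch below reproduces the first residues of the Query line from the last 87 bases of the FASTA record (its final two sequence lines). The helper names `revcomp` and `translate` are illustrative and not part of BLAST, and the standard genetic code is assumed.

```python
# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def revcomp(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate(seq, offset=0):
    """Translate seq from the given 0-based offset, dropping a trailing partial codon."""
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)

QUERY_LEN = 627        # "(627 letters)" in the report
ALIGN_START = 626      # first Query coordinate of the top hit

# Last 87 bases of the query (positions 541-627), copied from the FASTA record.
TAIL = (
    "GAAAGAAGCTTTGCAGTAATAATCTCCCCTTCAAATGGACGAAACATAATCAAATTGAAT"
    "ACCACCTTATAAGTCGGAGCACCATCA"
)

# On the minus strand the reading frame depends only on the distance to the 3' end.
offset = QUERY_LEN - ALIGN_START      # 1 base skipped
frame = -(offset % 3 + 1)             # -2
peptide = translate(revcomp(TAIL), offset)
print(frame, peptide)                 # -2 DGAPTYKVVFNLIMFRPFEGEIITAKLL
```

The printed peptide matches the start of the first Query line of the alignment (DGAPTYKVVFNLIMFRPFEGEIITAKLL...).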

>ref|NP_505625.1| DNA directed RNA polymerase III like [Caenorhabditis elegans]
           gi|7511361|pir||T28049 hypothetical protein ZK856.10 -
           Caenorhabditis elegans gi|3881812|emb|CAA94858.1|
           Hypothetical protein ZK856.10 [Caenorhabditis elegans]
          Length = 239

 Score = 92.0 bits (227), Expect = 6e-18
 Identities = 50/145 (34%), Positives = 80/145 (54%), Gaps = 6/145 (4%)
 Frame = -2

Query: 608 KVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHFEAEPINSK 429
           +V F +I+FRPF  E+I AK++ S   GL L++ FF+DI++PA  LP P+ FE E     
Sbjct: 69  RVKFRMIVFRPFVDEVIEAKVIGSSRQGLCLTIQFFEDIFVPAEKLPEPHVFEEE----- 123

Query: 428 KGTWYWNY----GDQPFPI-DDTNEEIKFQIQSVSYPPIPVELP-KDSKPFAPMLITGSI 267
              WYW Y    G+ P  +  D  + ++F++  + +  +  EL  ++ K    M I G++
Sbjct: 124 GQVWYWEYAQEDGEPPAKLYMDPGKIVRFRVTEIIFKDLKPELTHEERKTEKSMEIKGTM 183

Query: 266 DFDGLGPASWWPTEEVEDEETQIEE 192
              GLG   WW  E+ +DE  + E+
Sbjct: 184 ASTGLGCIGWWAAEDEDDEAVEDEQ 208

>ref|NP_084505.2| RIKEN cDNA 5031409G22 [Mus musculus] gi|27662966|ref|XP_216998.1|
           similar to RIKEN cDNA 5031409G22 [Mus musculus] [Rattus
           norvegicus] gi|14789799|gb|AAH10793.1| RIKEN cDNA
           5031409G22 gene [Mus musculus]
           gi|26387053|dbj|BAB31893.2| unnamed protein product [Mus
           musculus]
          Length = 204

 Score = 83.2 bits (204), Expect = 3e-15
 Identities = 55/145 (37%), Positives = 70/145 (47%), Gaps = 14/145 (9%)
 Frame = -2

Query: 626 DGAPTYKVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHF-E 450
           DGA   KV F  ++F PF  EI+  K+     +G+ +SLGFFDDI IP  +L  P  F E
Sbjct: 63  DGASHTKVHFRYVVFHPFLDEILIGKIKGCSPEGVHVSLGFFDDILIPPESLQQPAKFDE 122

Query: 449 AEPINSKKGTWYWNYGDQPFPID---DTNEEIKFQIQSVSY------PPIPVELPKDS-- 303
           AE +      W W Y  +    D   DT EEI+F++   S+       P   E    S  
Sbjct: 123 AEQV------WVWEYETEEGAHDLYMDTGEEIRFRVVDESFVDTSPTGPSSAEAASSSEE 176

Query: 302 --KPFAPMLITGSIDFDGLGPASWW 234
             K  AP  + GSI   GLG  SWW
Sbjct: 177 LPKKEAPYTLVGSISEPGLGLLSWW 201

>ref|NP_612211.1| RNA polymerase III subunit RPC8 [Homo sapiens]
           gi|24429623|gb|AAM18217.1| RNA polymerase III subunit
           RPC8 [Homo sapiens]
          Length = 204

 Score = 83.2 bits (204), Expect = 3e-15
 Identities = 57/148 (38%), Positives = 72/148 (48%), Gaps = 17/148 (11%)
 Frame = -2

Query: 626 DGAPTYKVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHF-E 450
           DGA   KV F  ++F PF  EI+  K+     +G+ +SLGFFDDI IP  +L  P  F E
Sbjct: 63  DGASHTKVHFRCVVFHPFLDEILIGKIKGCSPEGVHVSLGFFDDILIPPESLQQPAKFDE 122

Query: 449 AEPINSKKGTWYWNYGDQPFPID---DTNEEIKFQIQSVSY----PPIP---------VE 318
           AE +      W W Y  +    D   DT EEI+F++   S+    P  P          E
Sbjct: 123 AEQV------WVWEYETEEGAHDLYMDTGEEIRFRVVDESFVDTSPTGPSSADATTSSEE 176

Query: 317 LPKDSKPFAPMLITGSIDFDGLGPASWW 234
           LPK     AP  + GSI   GLG  SWW
Sbjct: 177 LPKKE---APYTLVGSISEPGLGLLSWW 201

>dbj|BAB33335.1| KIAA1665 protein [Homo sapiens]
          Length = 217

 Score = 83.2 bits (204), Expect = 3e-15
 Identities = 57/148 (38%), Positives = 72/148 (48%), Gaps = 17/148 (11%)
 Frame = -2

Query: 626 DGAPTYKVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHF-E 450
           DGA   KV F  ++F PF  EI+  K+     +G+ +SLGFFDDI IP  +L  P  F E
Sbjct: 76  DGASHTKVHFRCVVFHPFLDEILIGKIKGCSPEGVHVSLGFFDDILIPPESLQQPAKFDE 135

Query: 449 AEPINSKKGTWYWNYGDQPFPID---DTNEEIKFQIQSVSY----PPIP---------VE 318
           AE +      W W Y  +    D   DT EEI+F++   S+    P  P          E
Sbjct: 136 AEQV------WVWEYETEEGAHDLYMDTGEEIRFRVVDESFVDTSPTGPSSADATTSSEE 189

Query: 317 LPKDSKPFAPMLITGSIDFDGLGPASWW 234
           LPK     AP  + GSI   GLG  SWW
Sbjct: 190 LPKKE---APYTLVGSISEPGLGLLSWW 214

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 552,375,963
Number of Sequences: 1393205
Number of extensions: 12123482
Number of successful extensions: 34377
Number of sequences better than 10.0: 46
Number of HSP's better than 10.0 without gapping: 33227
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 34341
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25586195130
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
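
The statistics above can be cross-checked against the hit list. The sketch below assumes the usual Karlin-Altschul relation E = (search space) x 2^(-bit score), and that the effective database length is the raw length minus one effective HSP length per sequence. The function name `expect` is illustrative. Because the report rounds bit scores, the computed E values should only agree approximately with the printed ones.

```python
# Figures copied from the report footer.
DB_LETTERS = 448_689_247      # "length of database"
DB_SEQS = 1_393_205           # "Number of Sequences"
HSP_LEN = 118                 # "effective HSP length"
SEARCH_SPACE = 25_586_195_130 # "effective search space used"

# Effective database length: each sequence loses one effective HSP length.
eff_db = DB_LETTERS - DB_SEQS * HSP_LEN
print(eff_db)                 # 284291057, as reported

# Implied effective query length (search space / effective database length).
eff_query = SEARCH_SPACE // eff_db
print(eff_query)              # 90

def expect(bit_score, search_space=SEARCH_SPACE):
    """Approximate E value for a bit score under the assumed relation."""
    return search_space * 2.0 ** (-bit_score)

# (bit score, E value printed in the hit list)
for bits, reported in [(166, 2e-40), (92.0, 6e-18), (83.2, 3e-15)]:
    print(f"{bits:>6} bits: E ~ {expect(bits):.1e} (reported {reported:.0e})")
```

The computed values land within a small factor of the reported ones, which is what rounding of the bit score would be expected to produce.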


EST assembly


No.  Clone        Accession  Start  End
1    SPD076c08_f  BP050066       1  426
2    SPD077c05_f  BP050147      32  627
3    MR051c08_f   BP079935      62  422
4    MR069e05_f   BP081308      74  428
5    MWM209g10_f  AV767964      76  623
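
The clone positions above describe how the five ESTs tile the 627-base contig. The sketch below computes per-base coverage depth, assuming the two numbers on each row are 1-based inclusive start and end coordinates on the contig. That reading is an assumption, supported only by the largest end value matching the query length. The name `depth_profile` is illustrative.

```python
CONTIG_LEN = 627  # query length from the BLASTX report

# Clone name -> (start, end), copied from the table; coordinates assumed 1-based inclusive.
CLONES = {
    "SPD076c08_f": (1, 426),
    "SPD077c05_f": (32, 627),
    "MR051c08_f": (62, 422),
    "MR069e05_f": (74, 428),
    "MWM209g10_f": (76, 623),
}

def depth_profile(clones, length):
    """Return a list where element i is the number of clones covering position i + 1."""
    depth = [0] * (length + 1)  # index 0 unused so positions stay 1-based
    for start, end in clones.values():
        for pos in range(start, end + 1):
            depth[pos] += 1
    return depth[1:]

depth = depth_profile(CLONES, CONTIG_LEN)
print("min depth:", min(depth))                 # 1
print("max depth:", max(depth))                 # 5
print("bases covered by all clones:", depth.count(5))  # 347 (positions 76-422)
```

Under that assumption every base of the contig is covered by at least one EST, with single-clone support only at the two ends (positions 1-31 and 624-627).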




Lotus japonicus
Kazusa DNA Research Institute