Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004514A_C02 KMC004514A_c02
(627 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_172164.1| RNA polymerase, putative; protein id: At1g06790... 166 2e-40
ref|NP_505625.1| DNA directed RNA polymerase III like [Caenorhab... 92 6e-18
ref|NP_084505.2| RIKEN cDNA 5031409G22 [Mus musculus] gi|2766296... 83 3e-15
ref|NP_612211.1| RNA polymerase III subunit RPC8 [Homo sapiens] ... 83 3e-15
dbj|BAB33335.1| KIAA1665 protein [Homo sapiens] 83 3e-15
>ref|NP_172164.1| RNA polymerase, putative; protein id: At1g06790.1, supported by
cDNA: gi_17529107 [Arabidopsis thaliana]
gi|25312598|pir||G86202 hypothetical protein [imported]
- Arabidopsis thaliana
gi|7523702|gb|AAF63141.1|AC011001_11 hypothetical
protein [Arabidopsis thaliana]
gi|17529108|gb|AAL38764.1| putative RNA polymerase
[Arabidopsis thaliana] gi|23296904|gb|AAN13199.1|
putative RNA polymerase [Arabidopsis thaliana]
Length = 204
Score = 166 bits (421), Expect = 2e-40
Identities = 75/142 (52%), Positives = 103/142 (71%), Gaps = 2/142 (1%)
Frame = -2
Query: 626 DGAPTYKVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHFEA 447
DGA TYKV +++FRPF GE+I AK SD +GLRL+LGFFDDIY+PA +P PN E
Sbjct: 63 DGAATYKVGLRIVVFRPFVGEVIAAKFKESDANGLRLTLGFFDDIYVPAPLMPKPNRCEP 122
Query: 446 EPINSKKGTWYWNYGD--QPFPIDDTNEEIKFQIQSVSYPPIPVELPKDSKPFAPMLITG 273
+P N K+ W W YG+ + + +DD +IKF+++S+SYP +P E +D+KPFAPM++TG
Sbjct: 123 DPYNRKQMIWVWEYGEPKEDYIVDDAC-QIKFRVESISYPSVPTERAEDAKPFAPMVVTG 181
Query: 272 SIDFDGLGPASWWPTEEVEDEE 207
++D DGLGP SWW + E D+E
Sbjct: 182 NMDDDGLGPVSWWDSYEQVDQE 203
>ref|NP_505625.1| DNA directed RNA polymerase III like [Caenorhabditis elegans]
gi|7511361|pir||T28049 hypothetical protein ZK856.10 -
Caenorhabditis elegans gi|3881812|emb|CAA94858.1|
Hypothetical protein ZK856.10 [Caenorhabditis elegans]
Length = 239
Score = 92.0 bits (227), Expect = 6e-18
Identities = 50/145 (34%), Positives = 80/145 (54%), Gaps = 6/145 (4%)
Frame = -2
Query: 608 KVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHFEAEPINSK 429
+V F +I+FRPF E+I AK++ S GL L++ FF+DI++PA LP P+ FE E
Sbjct: 69 RVKFRMIVFRPFVDEVIEAKVIGSSRQGLCLTIQFFEDIFVPAEKLPEPHVFEEE----- 123
Query: 428 KGTWYWNY----GDQPFPI-DDTNEEIKFQIQSVSYPPIPVELP-KDSKPFAPMLITGSI 267
WYW Y G+ P + D + ++F++ + + + EL ++ K M I G++
Sbjct: 124 GQVWYWEYAQEDGEPPAKLYMDPGKIVRFRVTEIIFKDLKPELTHEERKTEKSMEIKGTM 183
Query: 266 DFDGLGPASWWPTEEVEDEETQIEE 192
GLG WW E+ +DE + E+
Sbjct: 184 ASTGLGCIGWWAAEDEDDEAVEDEQ 208
>ref|NP_084505.2| RIKEN cDNA 5031409G22 [Mus musculus] gi|27662966|ref|XP_216998.1|
similar to RIKEN cDNA 5031409G22 [Mus musculus] [Rattus
norvegicus] gi|14789799|gb|AAH10793.1| RIKEN cDNA
5031409G22 gene [Mus musculus]
gi|26387053|dbj|BAB31893.2| unnamed protein product [Mus
musculus]
Length = 204
Score = 83.2 bits (204), Expect = 3e-15
Identities = 55/145 (37%), Positives = 70/145 (47%), Gaps = 14/145 (9%)
Frame = -2
Query: 626 DGAPTYKVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHF-E 450
DGA KV F ++F PF EI+ K+ +G+ +SLGFFDDI IP +L P F E
Sbjct: 63 DGASHTKVHFRYVVFHPFLDEILIGKIKGCSPEGVHVSLGFFDDILIPPESLQQPAKFDE 122
Query: 449 AEPINSKKGTWYWNYGDQPFPID---DTNEEIKFQIQSVSY------PPIPVELPKDS-- 303
AE + W W Y + D DT EEI+F++ S+ P E S
Sbjct: 123 AEQV------WVWEYETEEGAHDLYMDTGEEIRFRVVDESFVDTSPTGPSSAEAASSSEE 176
Query: 302 --KPFAPMLITGSIDFDGLGPASWW 234
K AP + GSI GLG SWW
Sbjct: 177 LPKKEAPYTLVGSISEPGLGLLSWW 201
>ref|NP_612211.1| RNA polymerase III subunit RPC8 [Homo sapiens]
gi|24429623|gb|AAM18217.1| RNA polymerase III subunit
RPC8 [Homo sapiens]
Length = 204
Score = 83.2 bits (204), Expect = 3e-15
Identities = 57/148 (38%), Positives = 72/148 (48%), Gaps = 17/148 (11%)
Frame = -2
Query: 626 DGAPTYKVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHF-E 450
DGA KV F ++F PF EI+ K+ +G+ +SLGFFDDI IP +L P F E
Sbjct: 63 DGASHTKVHFRCVVFHPFLDEILIGKIKGCSPEGVHVSLGFFDDILIPPESLQQPAKFDE 122
Query: 449 AEPINSKKGTWYWNYGDQPFPID---DTNEEIKFQIQSVSY----PPIP---------VE 318
AE + W W Y + D DT EEI+F++ S+ P P E
Sbjct: 123 AEQV------WVWEYETEEGAHDLYMDTGEEIRFRVVDESFVDTSPTGPSSADATTSSEE 176
Query: 317 LPKDSKPFAPMLITGSIDFDGLGPASWW 234
LPK AP + GSI GLG SWW
Sbjct: 177 LPKKE---APYTLVGSISEPGLGLLSWW 201
>dbj|BAB33335.1| KIAA1665 protein [Homo sapiens]
Length = 217
Score = 83.2 bits (204), Expect = 3e-15
Identities = 57/148 (38%), Positives = 72/148 (48%), Gaps = 17/148 (11%)
Frame = -2
Query: 626 DGAPTYKVVFNLIMFRPFEGEIITAKLLSSDEDGLRLSLGFFDDIYIPAHNLPSPNHF-E 450
DGA KV F ++F PF EI+ K+ +G+ +SLGFFDDI IP +L P F E
Sbjct: 76 DGASHTKVHFRCVVFHPFLDEILIGKIKGCSPEGVHVSLGFFDDILIPPESLQQPAKFDE 135
Query: 449 AEPINSKKGTWYWNYGDQPFPID---DTNEEIKFQIQSVSY----PPIP---------VE 318
AE + W W Y + D DT EEI+F++ S+ P P E
Sbjct: 136 AEQV------WVWEYETEEGAHDLYMDTGEEIRFRVVDESFVDTSPTGPSSADATTSSEE 189
Query: 317 LPKDSKPFAPMLITGSIDFDGLGPASWW 234
LPK AP + GSI GLG SWW
Sbjct: 190 LPKKE---APYTLVGSISEPGLGLLSWW 214
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 552,375,963
Number of Sequences: 1393205
Number of extensions: 12123482
Number of successful extensions: 34377
Number of sequences better than 10.0: 46
Number of HSP's better than 10.0 without gapping: 33227
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 34341
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25586195130
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)