Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003811A_C01 KMC003811A_c01
(692 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_188772.2| hypothetical protein; protein id: At3g21350.1, ... 171 9e-42
gb|EAA14057.1| agCP8654 [Anopheles gambiae str. PEST] 53 5e-06
ref|NP_731403.2| CG9473-PB [Drosophila melanogaster] gi|28573125... 50 3e-05
ref|NP_081489.2| RIKEN cDNA 1500012F11 [Mus musculus] gi|1534183... 49 7e-05
gb|AAC26869.1| RNA polymerase transcriptional regulation mediato... 49 9e-05
>ref|NP_188772.2| hypothetical protein; protein id: At3g21350.1, supported by cDNA:
gi_20258922 [Arabidopsis thaliana]
gi|9294682|dbj|BAB03048.1| contains similarity to RNA
polymerase transcriptional regulation
mediator~gene_id:MHC9.3 [Arabidopsis thaliana]
gi|17979205|gb|AAL49841.1| unknown protein [Arabidopsis
thaliana] gi|20258923|gb|AAM14177.1| unknown protein
[Arabidopsis thaliana]
Length = 257
Score = 171 bits (433), Expect = 9e-42
Identities = 92/140 (65%), Positives = 100/140 (70%)
Frame = -1
Query: 692 LAYYILDGSIYQAPQLCHVFAARIGRALYYIEKAFTTAASKLEKIGYAGPVDSETETASL 513
L YYILDGSIYQAPQLC VFAAR+ R +Y I KAFT AASKLE I VD+E +
Sbjct: 122 LTYYILDGSIYQAPQLCSVFAARVSRTIYNISKAFTDAASKLETIRQ---VDTENQNEPA 178
Query: 512 ESKVAKETIDLKEVKRVDHILASLQRKLPPAPPPPPFPEGYVPPSTAETEKGTETQESAE 333
ESK A ET+DLKE+KRVD IL SL RKL P PPPPPFPEGYV ++ TQ E
Sbjct: 179 ESKPASETVDLKEMKRVDVILTSLYRKLAPPPPPPPFPEGYVSQEALGEKEELGTQ-GGE 237
Query: 332 PQAPQVDPIIDQGPAKRMKF 273
Q PQVDPIIDQGPAKRMKF
Sbjct: 238 SQPPQVDPIIDQGPAKRMKF 257
>gb|EAA14057.1| agCP8654 [Anopheles gambiae str. PEST]
Length = 283
Score = 52.8 bits (125), Expect = 5e-06
Identities = 40/124 (32%), Positives = 58/124 (46%), Gaps = 19/124 (15%)
Frame = -1
Query: 686 YYILDGSIYQAPQLCHVFAARIGRALYYIEKAFTTAAS-----------------KLEKI 558
YYI+ G++YQAP L VF +RI +++++ AF A+S K
Sbjct: 112 YYIIAGTVYQAPDLASVFNSRILSTVHHLQTAFDEASSYSRYHPSKGYSWDFSSNKASTH 171
Query: 557 GYAGPVDSETET-ASLESKVAKETIDLKEVKRVDHILASLQRKLP-PAPPPPPFPEGYVP 384
A V +T+T E+ V +E + + +RVD +L L RK P P P P G P
Sbjct: 172 AMAEQVAEKTKTQTKKEAPVKEEPSSIFQRQRVDMLLGDLLRKFPLPLPQMTNNPTGGNP 231
Query: 383 PSTA 372
TA
Sbjct: 232 SDTA 235
>ref|NP_731403.2| CG9473-PB [Drosophila melanogaster] gi|28573125|ref|NP_651988.2|
CG9473-PA [Drosophila melanogaster]
gi|21428452|gb|AAM49886.1| LD15729p [Drosophila
melanogaster] gi|25012576|gb|AAN71388.1| RE38674p
[Drosophila melanogaster] gi|28381219|gb|AAF54445.2|
CG9473-PA [Drosophila melanogaster]
gi|28381220|gb|AAN13445.2| CG9473-PB [Drosophila
melanogaster]
Length = 249
Score = 50.1 bits (118), Expect = 3e-05
Identities = 39/127 (30%), Positives = 58/127 (44%), Gaps = 11/127 (8%)
Frame = -1
Query: 686 YYILDGSIYQAPQLCHVFAARIGRALYYIEKAFTTAASKLE---KIGYAGPVDSE---TE 525
YYI+ G++Y+AP L +V +RI + ++ AF A+S GY S ++
Sbjct: 98 YYIIGGTVYKAPDLANVINSRILNTVVNLQSAFEEASSYARYHPNKGYTWDFSSNKVFSD 157
Query: 524 TASLESKVAKETID-----LKEVKRVDHILASLQRKLPPAPPPPPFPEGYVPPSTAETEK 360
+ + K A D L + +RVD +LA L RK P PP PP + P A +
Sbjct: 158 RSKSDKKDANSAKDENSGTLFQKQRVDMLLAELLRKFP--PPIPPMLQNLQQPPPAGDDL 215
Query: 359 GTETQES 339
T S
Sbjct: 216 NTARNAS 222
>ref|NP_081489.2| RIKEN cDNA 1500012F11 [Mus musculus] gi|15341839|gb|AAH13096.1|
Unknown (protein for MGC:12137) [Mus musculus]
Length = 195
Score = 48.9 bits (115), Expect = 7e-05
Identities = 46/147 (31%), Positives = 68/147 (45%), Gaps = 10/147 (6%)
Frame = -1
Query: 686 YYILDGSIYQAPQLCHVFAARIGRALYYIEKAFTTAASKLEKIGYAG-----PVDSETET 522
YYI+ G IYQAP L V +R+ A++ I+ AF A S G E E
Sbjct: 47 YYIIAGVIYQAPDLGSVINSRVLTAVHGIQSAFDEAMSYCRYHPSKGYWWHFKDHEEQEK 106
Query: 521 ASLESKVAKETIDLKEVKRVDHILASLQRKLPPAPPPPPFPEGYVPPSTA--ETEKGTET 348
++K +E + + +RVD +L L++K PP E VP A E E ET
Sbjct: 107 VKPKAKRKEEPSSIFQRQRVDALLIDLRQKFPPRFVQQKSGEKPVPVDQAKKEAEPLPET 166
Query: 347 QESAEPQAPQ--VDPIIDQG-PAKRMK 276
+S E ++ + + +G P KRM+
Sbjct: 167 VKSEEKESTKNIQQTVSTKGPPEKRMR 193
>gb|AAC26869.1| RNA polymerase transcriptional regulation mediator [Homo sapiens]
Length = 246
Score = 48.5 bits (114), Expect = 9e-05
Identities = 43/149 (28%), Positives = 67/149 (44%), Gaps = 12/149 (8%)
Frame = -1
Query: 686 YYILDGSIYQAPQLCHVFAARIGRALYYIEKAFTTAASKLEKIGYAG-----PVDSETET 522
YYI+ G IYQAP L V +R+ A++ I+ AF A S G V E +
Sbjct: 98 YYIIAGVIYQAPDLGSVINSRVLTAVHGIQSAFDEAMSYCRYHPSKGYWWHFKVHEEQDK 157
Query: 521 ASLESKVAKETIDLKEVKRVDHILASLQRKLPPAPPPPPFPEGYVPPSTAETEKGTE-TQ 345
++K +E + + +RVD +L L++K P P G P +T+K E
Sbjct: 158 VRPKAKRKEEPSSIFQRQRVDALLLDLRQKFP--PKFVQLKPGEKPVPVDQTKKEAEPIP 215
Query: 344 ESAEPQAPQVDPIIDQ------GPAKRMK 276
E+ +P+ + + Q P KRM+
Sbjct: 216 ETVKPEEKETTKNVQQTVSAKGPPEKRMR 244
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 582,744,058
Number of Sequences: 1393205
Number of extensions: 13080239
Number of successful extensions: 147254
Number of sequences better than 10.0: 454
Number of HSP's better than 10.0 without gapping: 70137
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 119378
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 31401661572
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)