Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004674A_C01 KMC004674A_c01
(540 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAC22308.1| hypothetical protein~similar to Arabidopsis thal... 118 4e-26
ref|NP_199590.1| unknown protein; protein id: At5g47790.1, suppo... 95 5e-19
ref|NP_012897.1| Large subunit of transcription factor tfIIE; Tf... 39 0.040
ref|NP_175322.1| nucleolin, putative; protein id: At1g48920.1 [A... 37 0.12
ref|NP_776728.1| cylicin, basic protein of sperm head cytoskelet... 37 0.20
>dbj|BAC22308.1| hypothetical protein~similar to Arabidopsis thaliana chromosome 5,
At5g47790 [Oryza sativa (japonica cultivar-group)]
Length = 421
Score = 118 bits (296), Expect = 4e-26
Identities = 67/123 (54%), Positives = 84/123 (67%)
Frame = -2
Query: 539 VGVKEGSLVGKYESLVQITVIPKGKEQSSVKEEADSSQKGVTDKLQEVMKKIKTPVKTGI 360
VGVKEGSLVGKYESLVQ+TVIPKGKEQ S KE A S GVTDKL++V+ K+K+ K GI
Sbjct: 309 VGVKEGSLVGKYESLVQVTVIPKGKEQPSPKESASPS--GVTDKLKQVLTKVKSTAKGGI 366
Query: 359 YDDLYGESFSVKVGSSWAYSTSSTGERASSDKEDEKGNTLSGKSDSKPSNADGDEEDDDD 180
YDDLYG++ +G SWAY + E+ + DEK + SG D+ + +D+DD
Sbjct: 367 YDDLYGDTVPQLLGPSWAYRSDDQAEKVKA--ADEKKS--SGNMDTNSA------DDNDD 416
Query: 179 LFG 171
LFG
Sbjct: 417 LFG 419
>ref|NP_199590.1| unknown protein; protein id: At5g47790.1, supported by cDNA:
gi_18700140 [Arabidopsis thaliana]
gi|10177915|dbj|BAB11326.1| gene_id:MCA23.11~unknown
protein [Arabidopsis thaliana]
gi|18700141|gb|AAL77682.1| AT5g47790/MCA23_11
[Arabidopsis thaliana]
Length = 369
Score = 95.1 bits (235), Expect = 5e-19
Identities = 62/127 (48%), Positives = 78/127 (60%), Gaps = 4/127 (3%)
Frame = -2
Query: 539 VGVKEGSLVGKYESLVQITVIPKGKEQSSVKEE---ADSSQKGVTDKLQEVMKKIKTPVK 369
+ VKEGSLVGKYESLV++T+IPKGK VKEE ++ GVTD+LQE M +K K
Sbjct: 266 INVKEGSLVGKYESLVRVTLIPKGK----VKEEKAFTGGTRGGVTDRLQEAMNMLKRGPK 321
Query: 368 TGIYDDLY-GESFSVKVGSSWAYSTSSTGERASSDKEDEKGNTLSGKSDSKPSNADGDEE 192
TGIYDDLY G+S + VG+SWA S + A+ E E G G+E+
Sbjct: 322 TGIYDDLYGGDSLAKAVGTSWA----SVSQPAA---ETECGGV-------------GEED 361
Query: 191 DDDDLFG 171
D+DDLFG
Sbjct: 362 DNDDLFG 368
>ref|NP_012897.1| Large subunit of transcription factor tfIIE; Tfa1p [Saccharomyces
cerevisiae] gi|549038|sp|P36100|T2EA_YEAST Transcription
initiation factor IIE, alpha subunit (TFIIE-alpha)
(Transcription factor A large subunit) (Factor A 66 kDa
subunit) gi|539375|pir||S37845 transcription initiation
factor IIE chain TFA1 - yeast (Saccharomyces
cerevisiae) gi|486027|emb|CAA81863.1| ORF YKL028w
[Saccharomyces cerevisiae] gi|607958|gb|AAA62665.1|
transcription factor TFIIE, large subunit
Length = 482
Score = 38.9 bits (89), Expect = 0.040
Identities = 27/102 (26%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Frame = -2
Query: 467 KEQSSVKEEADSSQKGVTDKLQEVMKKIKTPVKTGIYDDLYGESFSVKVGSSWAYSTSST 288
KE+ +EE + ++ +++++VM + +D + E + G++ S +S
Sbjct: 373 KEEEEEEEEEEDEEEEEEEEMEDVMDDNDETARENALEDEF-EDVTDTAGTAKTESNTSN 431
Query: 287 GERASS--DKEDEKGN---TLSGKS-DSKPSNADGDEEDDDD 180
+ S DK ++ N T SG S ++KP++ D D++DDDD
Sbjct: 432 DVKQESINDKTEDAVNATATASGPSANAKPNDGDDDDDDDDD 473
>ref|NP_175322.1| nucleolin, putative; protein id: At1g48920.1 [Arabidopsis thaliana]
gi|25405286|pir||A96527 probable nuM1 protein [imported]
- Arabidopsis thaliana
gi|11094815|gb|AAG29744.1|AC084414_12 nuM1 protein,
putative [Arabidopsis thaliana]
gi|28973759|gb|AAO64195.1| putative nucleolin
[Arabidopsis thaliana]
Length = 557
Score = 37.4 bits (85), Expect = 0.12
Identities = 33/117 (28%), Positives = 50/117 (42%), Gaps = 15/117 (12%)
Frame = -2
Query: 485 TVIPKGKEQSSVKEEADSSQK-GVTDKLQEVMKK--IKTPVKTGIYDDLYGESF-----S 330
TV K K+ SS ++ S ++ VT K K +K ++ DD E +
Sbjct: 114 TVAKKSKDDSSSSDDDSSDEEVAVTKKPAAAAKNGSVKAKKESSSEDDSSSEDEPAKKPA 173
Query: 329 VKVGSSWAYSTSSTGERASSDKEDEKGNT-------LSGKSDSKPSNADGDEEDDDD 180
K+ A +SS+ + + D EDEK T S S S+ D DEE +D+
Sbjct: 174 AKIAKPAAKDSSSSDDDSDEDSEDEKPATKKAAPAAAKAASSSDSSDEDSDEESEDE 230
>ref|NP_776728.1| cylicin, basic protein of sperm head cytoskeleton 2 [Bos taurus]
gi|2498277|sp|Q28092|CYL2_BOVIN CYLICIN II
(MULTIPLE-BAND POLYPEPTIDE II) gi|2136733|pir||I46014
cylicin II - bovine gi|757754|emb|CAA86753.1| cylicin II
[Bos taurus]
Length = 488
Score = 36.6 bits (83), Expect = 0.20
Identities = 31/111 (27%), Positives = 45/111 (39%), Gaps = 14/111 (12%)
Frame = -2
Query: 473 KGKEQSSVKEEADSSQKGVTDKLQEVMKKIKTPVKTGIY--------------DDLYGES 336
KGK+ S +E+ + +G ++ KK K K G DD G+
Sbjct: 274 KGKKGSKKGKESATESEGEKGDAKKDDKKGKKGSKKGKESATESEGEKGDAKKDDKKGKK 333
Query: 335 FSVKVGSSWAYSTSSTGERASSDKEDEKGNTLSGKSDSKPSNADGDEEDDD 183
S K S S G+ DK+ +KG+ +SDSK GD + DD
Sbjct: 334 GSKKGKESATESEGEKGDAKKDDKKGKKGSKKGKESDSKAEGDKGDAKKDD 384
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 498,322,373
Number of Sequences: 1393205
Number of extensions: 11285660
Number of successful extensions: 62737
Number of sequences better than 10.0: 241
Number of HSP's better than 10.0 without gapping: 45196
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 57104
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18462123008
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)