Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003202A_C01 KMC003202A_c01
(519 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAB92191.1| putative TATA binding protein-associated factor ... 73 2e-12
ref|NP_171987.1| TATA binding protein-associated factor, putativ... 59 4e-08
pir||C96585 hypothetical protein F20D21.18 [imported] - Arabidop... 38 0.080
gb|AAL32535.1| Very similar to TATA binding protein-associated f... 38 0.080
ref|NP_175838.2| hypothetical protein; protein id: At1g54360.1, ... 38 0.080
>dbj|BAB92191.1| putative TATA binding protein-associated factor [Oryza sativa
(japonica cultivar-group)]
Length = 541
Score = 72.8 bits (177), Expect = 2e-12
Identities = 45/118 (38%), Positives = 65/118 (54%), Gaps = 3/118 (2%)
Frame = -3
Query: 508 MQQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQL--SND 335
+ QPPLKK+ TD AMNSM+ M G+ G+ST + ++ +S QL S
Sbjct: 429 LSTSQPPLKKMTTDG-----AMNSMTSAPMPGTMDGFSTQLPNPSMTQTSSSGQLVESTA 483
Query: 334 HHMPGREVAGQQS-KASAILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
+ R+ + + S +L AWK+D +AG LLSS+ E+FGE + SF+ PE FL
Sbjct: 484 SGVIRRDQGSNHTQRVSTVLRLAWKEDQNAGHLLSSLYEVFGEAIFSFVQPPEISFFL 541
>ref|NP_171987.1| TATA binding protein-associated factor, putative; protein id:
At1g04950.1, supported by cDNA: gi_15293056, supported
by cDNA: gi_20259030 [Arabidopsis thaliana]
gi|25350689|pir||A86183 hypothetical protein [imported]
- Arabidopsis thaliana
gi|7211972|gb|AAF40443.1|AC004809_1 Strong similarity to
the TATA binding protein-associated factor from A.
thaliana gb|Y13673. ESTs gb|N38153 and gb|W43450 come
from this gene. [Arabidopsis thaliana]
gi|15293057|gb|AAK93639.1| putative TATA binding
protein-associated factor [Arabidopsis thaliana]
gi|20259031|gb|AAM14231.1| putative TATA binding
protein-associated factor [Arabidopsis thaliana]
Length = 549
Score = 58.5 bits (140), Expect = 4e-08
Identities = 40/120 (33%), Positives = 59/120 (48%), Gaps = 6/120 (5%)
Frame = -3
Query: 505 QQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQLSNDHHM 326
+ Q P + I D P GV + + MQ + ++V SS Q S+ +
Sbjct: 431 ENQSPQKRLITMDGPDGVHSQDQSGSAPMQVDNPVENDNPPQNSVQPSS-SEQASDANES 489
Query: 325 PGREVAGQQSKAS------AILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
R ++S S AIL Q WKDD+D+G+LL + EL+G+R+L FIP E +FL
Sbjct: 490 ESRNGKVKESGRSRAITMKAILDQIWKDDLDSGRLLVKLHELYGDRILPFIPSTEMSVFL 549
>pir||C96585 hypothetical protein F20D21.18 [imported] - Arabidopsis thaliana
gi|4585980|gb|AAD25616.1|AC005287_18 Very similar to
TATA binding protein-associated factor [Arabidopsis
thaliana]
Length = 491
Score = 37.7 bits (86), Expect = 0.080
Identities = 37/118 (31%), Positives = 48/118 (40%)
Frame = -3
Query: 517 DNLMQQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQLSN 338
DNL Q PPLKKIA GG+I M+S + M G + V S +
Sbjct: 400 DNLTHQ--PPLKKIAV---GGIIQMSSTQMQ-----------MRGTTTVPQQSHTDADAR 443
Query: 337 DHHMPGREVAGQQSKASAILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
H+ P +A + S A+ D L + E FGE +L F P E FL
Sbjct: 444 HHNSPST-IAPKTSAAAGT---------DVDNYLFPLFEYFGESMLMFTPTHELSFFL 491
>gb|AAL32535.1| Very similar to TATA binding protein-associated factor [Arabidopsis
thaliana] gi|28059031|gb|AAO29980.1| Very similar to
TATA binding protein-associated factor [Arabidopsis
thaliana]
Length = 466
Score = 37.7 bits (86), Expect = 0.080
Identities = 37/118 (31%), Positives = 48/118 (40%)
Frame = -3
Query: 517 DNLMQQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQLSN 338
DNL Q PPLKKIA GG+I M+S + M G + V S +
Sbjct: 375 DNLTHQ--PPLKKIAV---GGIIQMSSTQMQ-----------MRGTTTVPQQSHTDADAR 418
Query: 337 DHHMPGREVAGQQSKASAILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
H+ P +A + S A+ D L + E FGE +L F P E FL
Sbjct: 419 HHNSPST-IAPKTSAAAGT---------DVDNYLFPLFEYFGESMLMFTPTHELSFFL 466
>ref|NP_175838.2| hypothetical protein; protein id: At1g54360.1, supported by cDNA:
gi_17064761 [Arabidopsis thaliana]
Length = 447
Score = 37.7 bits (86), Expect = 0.080
Identities = 37/118 (31%), Positives = 48/118 (40%)
Frame = -3
Query: 517 DNLMQQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQLSN 338
DNL Q PPLKKIA GG+I M+S + M G + V S +
Sbjct: 356 DNLTHQ--PPLKKIAV---GGIIQMSSTQMQ-----------MRGTTTVPQQSHTDADAR 399
Query: 337 DHHMPGREVAGQQSKASAILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
H+ P +A + S A+ D L + E FGE +L F P E FL
Sbjct: 400 HHNSPST-IAPKTSAAAGT---------DVDNYLFPLFEYFGESMLMFTPTHELSFFL 447
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 453,122,783
Number of Sequences: 1393205
Number of extensions: 9808067
Number of successful extensions: 26676
Number of sequences better than 10.0: 27
Number of HSP's better than 10.0 without gapping: 25674
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26646
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 16442828304
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)