KMC003202A_c01

Fasta Sequence
>KMC003202A_C01 KMC003202A_c01
ATAGAGAGAATCTATGATAAATAAATCATGCAAGACCAAAGAACACAAGAAAGAAGTAGG
GTGTTCACACTAGTTTTTTCATGGGTTTAAATTTTAAAGACATCAAAGTAAAGGTTTCTA
TTCTAGCCTATAACGATACAAAAATTCAGTGAACTCAGCTTTACAAAAATATACATGCTT
CAGGTTTTGGAATAAATGACAACACTCTTTCGCCAAATAATTCAACCACTGATGACAGCA
ACTGTCCTGCATCGATGTCATCCTTCCAAGCTTGAGCAAGAATGGCAGATGCTTTCGATT
GTTGACCCGCAACCTCTCTTCCTGGCATGTGATGGTCATTGGATAATTGACGACCCATTG
ATGACATGCCAACATTGGAAACCCCCATCATGGTGGAATATCCAGCTGTTGAACCTTGCA
TGTCGTCCACTGACATGGAGTTCATTGCTATTACACCCCCAGGAGCATCTGTTGCAATTT
TCTTCAGTGGAGGTTGCTGCTGCTGCATTAGATTGTCAG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003202A_C01 KMC003202A_c01
         (519 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB92191.1| putative TATA binding protein-associated factor ...    73  2e-12
ref|NP_171987.1| TATA binding protein-associated factor, putativ...    59  4e-08
pir||C96585 hypothetical protein F20D21.18 [imported] - Arabidop...    38  0.080
gb|AAL32535.1| Very similar to TATA binding protein-associated f...    38  0.080
ref|NP_175838.2| hypothetical protein; protein id: At1g54360.1, ...    38  0.080

>dbj|BAB92191.1| putative TATA binding protein-associated factor [Oryza sativa
           (japonica cultivar-group)]
          Length = 541

 Score = 72.8 bits (177), Expect = 2e-12
 Identities = 45/118 (38%), Positives = 65/118 (54%), Gaps = 3/118 (2%)
 Frame = -3

Query: 508 MQQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQL--SND 335
           +   QPPLKK+ TD      AMNSM+   M G+  G+ST +   ++  +S   QL  S  
Sbjct: 429 LSTSQPPLKKMTTDG-----AMNSMTSAPMPGTMDGFSTQLPNPSMTQTSSSGQLVESTA 483

Query: 334 HHMPGREVAGQQS-KASAILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
             +  R+     + + S +L  AWK+D +AG LLSS+ E+FGE + SF+  PE   FL
Sbjct: 484 SGVIRRDQGSNHTQRVSTVLRLAWKEDQNAGHLLSSLYEVFGEAIFSFVQPPEISFFL 541

>ref|NP_171987.1| TATA binding protein-associated factor, putative; protein id:
           At1g04950.1, supported by cDNA: gi_15293056, supported
           by cDNA: gi_20259030 [Arabidopsis thaliana]
           gi|25350689|pir||A86183 hypothetical protein [imported]
           - Arabidopsis thaliana
           gi|7211972|gb|AAF40443.1|AC004809_1 Strong similarity to
           the TATA binding protein-associated factor from A.
           thaliana gb|Y13673.  ESTs gb|N38153 and gb|W43450 come
           from this gene. [Arabidopsis thaliana]
           gi|15293057|gb|AAK93639.1| putative TATA binding
           protein-associated factor [Arabidopsis thaliana]
           gi|20259031|gb|AAM14231.1| putative TATA binding
           protein-associated factor [Arabidopsis thaliana]
          Length = 549

 Score = 58.5 bits (140), Expect = 4e-08
 Identities = 40/120 (33%), Positives = 59/120 (48%), Gaps = 6/120 (5%)
 Frame = -3

Query: 505 QQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQLSNDHHM 326
           + Q P  + I  D P GV + +      MQ      +     ++V  SS   Q S+ +  
Sbjct: 431 ENQSPQKRLITMDGPDGVHSQDQSGSAPMQVDNPVENDNPPQNSVQPSS-SEQASDANES 489

Query: 325 PGREVAGQQSKAS------AILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
             R    ++S  S      AIL Q WKDD+D+G+LL  + EL+G+R+L FIP  E  +FL
Sbjct: 490 ESRNGKVKESGRSRAITMKAILDQIWKDDLDSGRLLVKLHELYGDRILPFIPSTEMSVFL 549

>pir||C96585 hypothetical protein F20D21.18 [imported] - Arabidopsis thaliana
           gi|4585980|gb|AAD25616.1|AC005287_18 Very similar to
           TATA binding protein-associated factor [Arabidopsis
           thaliana]
          Length = 491

 Score = 37.7 bits (86), Expect = 0.080
 Identities = 37/118 (31%), Positives = 48/118 (40%)
 Frame = -3

Query: 517 DNLMQQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQLSN 338
           DNL  Q  PPLKKIA    GG+I M+S  +            M G + V   S     + 
Sbjct: 400 DNLTHQ--PPLKKIAV---GGIIQMSSTQMQ-----------MRGTTTVPQQSHTDADAR 443

Query: 337 DHHMPGREVAGQQSKASAILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
            H+ P   +A + S A+           D    L  + E FGE +L F P  E   FL
Sbjct: 444 HHNSPST-IAPKTSAAAGT---------DVDNYLFPLFEYFGESMLMFTPTHELSFFL 491

>gb|AAL32535.1| Very similar to TATA binding protein-associated factor [Arabidopsis
           thaliana] gi|28059031|gb|AAO29980.1| Very similar to
           TATA binding protein-associated factor [Arabidopsis
           thaliana]
          Length = 466

 Score = 37.7 bits (86), Expect = 0.080
 Identities = 37/118 (31%), Positives = 48/118 (40%)
 Frame = -3

Query: 517 DNLMQQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQLSN 338
           DNL  Q  PPLKKIA    GG+I M+S  +            M G + V   S     + 
Sbjct: 375 DNLTHQ--PPLKKIAV---GGIIQMSSTQMQ-----------MRGTTTVPQQSHTDADAR 418

Query: 337 DHHMPGREVAGQQSKASAILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
            H+ P   +A + S A+           D    L  + E FGE +L F P  E   FL
Sbjct: 419 HHNSPST-IAPKTSAAAGT---------DVDNYLFPLFEYFGESMLMFTPTHELSFFL 466

>ref|NP_175838.2| hypothetical protein; protein id: At1g54360.1, supported by cDNA:
           gi_17064761 [Arabidopsis thaliana]
          Length = 447

 Score = 37.7 bits (86), Expect = 0.080
 Identities = 37/118 (31%), Positives = 48/118 (40%)
 Frame = -3

Query: 517 DNLMQQQQPPLKKIATDAPGGVIAMNSMSVDDMQGSTAGYSTMMGVSNVGMSSMGRQLSN 338
           DNL  Q  PPLKKIA    GG+I M+S  +            M G + V   S     + 
Sbjct: 356 DNLTHQ--PPLKKIAV---GGIIQMSSTQMQ-----------MRGTTTVPQQSHTDADAR 399

Query: 337 DHHMPGREVAGQQSKASAILAQAWKDDIDAGQLLSSVVELFGERVLSFIPKPEACIFL 164
            H+ P   +A + S A+           D    L  + E FGE +L F P  E   FL
Sbjct: 400 HHNSPST-IAPKTSAAAGT---------DVDNYLFPLFEYFGESMLMFTPTHELSFFL 447
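All five alignments above are reported in Frame = -3, that is, the query was translated from its reverse strand starting at the third base of the reverse complement. The following is a minimal sketch of that translation step, assuming the standard genetic code. The helper names `reverse_complement` and `translate_frame` are illustrative and are not part of BLAST. It translates the final 39 bases of the contig (the last line of the FASTA record).

```python
# Illustrative sketch only: helper names are hypothetical, not BLAST functions.
# Assumes the standard genetic code (NCBI translation table 1).
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons enumerated in TCAG order line up with the amino-acid string above.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate_frame(dna, frame):
    """Translate dna in a BLAST-style frame (+1..+3 or -1..-3).

    Negative frames read the reverse complement; frame -3 skips the
    first two bases of the reverse-complemented sequence.
    """
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +/-1, +/-2, +/-3")
    strand = dna.upper() if frame > 0 else reverse_complement(dna)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    # Codons containing ambiguous bases are rendered as 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 39 bases of KMC003202A_c01 (positions 481-519 of the 519-letter query).
tail = "TCTTCAGTGGAGGTTGCTGCTGCTGCATTAGATTGTCAG"
print(translate_frame(tail, -3))  # DNLMQQQQPPLK
```

The printed peptide agrees with the start of the Query lines that begin at position 517 (DNLMQQQQPPLKK...); the final K of that stretch needs two bases from the preceding line of the sequence.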

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 453,122,783
Number of Sequences: 1393205
Number of extensions: 9808067
Number of successful extensions: 26676
Number of sequences better than 10.0: 27
Number of HSP's better than 10.0 without gapping: 25674
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26646
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 16442828304
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
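
The length and search-space figures above can be related to the reported Expect values with the standard bit-score relation E = (effective search space) x 2^(-bit score). The sketch below is a rough consistency check under that assumption, not a reimplementation of BLAST. The function names are hypothetical, and the factor 57 is not printed in the report: it is inferred here by dividing the reported search space by the effective database length.

```python
# Rough consistency check of the reported statistics.
# Function names are hypothetical; the factor 57 is inferred, not reported.

def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length after subtracting the effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


def expect_value(bit_score, search_space):
    """Expected number of chance hits for a bit score: E = N * 2**(-S')."""
    return search_space * 2.0 ** (-bit_score)


eff_db = effective_db_length(448_689_247, 1_393_205, 115)
print(eff_db)       # 288470672, the reported effective length of database
print(eff_db * 57)  # 16442828304, the reported effective search space

for bits in (72.8, 58.5, 37.7):
    print(bits, expect_value(bits, 16_442_828_304))
```

For the two strongest hits this gives approximately 2e-12 and 4e-08, in line with the report. For the 37.7-bit hits it gives roughly 0.07 rather than the reported 0.080; the gap may reflect rounding of the bit score or details of the program's length adjustment that this sketch does not model.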


EST assemble image


No.  Clone        Accession  Start  End
1    GNf026g05    BP069266   1      521
2    MR014g11_f   BP077063   1      378
3    MWL055b10_f  AV769521   1      388




Lotus japonicus
Kazusa DNA Research Institute