KMC009324A_c01

Fasta Sequence
>KMC009324A_C01 KMC009324A_c01
gcatccaataattgtgtccatggcaTATTTCATATTTATAGAAATGGGGGCAAATGTTTC
CCATTTTACATACCTACTAACATTACTCATTCTTTTTTTTATCAAAGGGAAGGGGGAAGA
GCCAGAAAGCACAGGACCTAGCTGGATGCTACAATCATTCTAATAACTGGATATCAATAT
GAAGTAATATGAAGGTATTAACACAATCTTTTTCTACCTGCAAGCCATCAATATGAAGTA
ATATGAAGGTATTAACACATAATCTTTTTCTACCTGCAAGCCACTGTTGCAAACCAGTGA
TATTGTCCACAAAATTCAGATGTGGTCCATTTACCGGAAAAGCCTAGGGACTGATCTCTT
AAAGACTTTCCCTTCAAGCTTTAGTCGTCTATATGCTTCTCTCAGATGACAAGGCCGGAT
TGGTCCAGATTCCTTCCTCTCTTTGATAACTATTCTAGCTGTTTCCACAACTTCACCAAC
AAACATTTTTGCAATCCCTGATACTACAATTGTTAT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC009324A_C01 KMC009324A_c01
         (516 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_193761.1| putative protein; protein id: At4g20280.1, supp...   103  1e-21
pir||D86333 T20H2.22 protein - Arabidopsis thaliana gi|8778998|g...   100  2e-20
ref|NP_173429.1| hypothetical protein; protein id: At1g20000.1 [...   100  2e-20
pir||T05336 hypothetical protein F1C12.195 - Arabidopsis thalian...    95  4e-19
ref|NP_584650.1| TRANSCRIPTION INITIATION FACTOR TFIID 28kDa SUB...    59  4e-08

>ref|NP_193761.1| putative protein; protein id: At4g20280.1, supported by cDNA:
           gi_13877756, supported by cDNA: gi_15293300, supported
           by cDNA: gi_7638154 [Arabidopsis thaliana]
           gi|7638155|gb|AAF65405.1|AF238326_1 putative TATA
           binding protein associated factor 24kDa subunit
           [Arabidopsis thaliana]
           gi|13877757|gb|AAK43956.1|AF370141_1 unknown protein
           [Arabidopsis thaliana] gi|15293301|gb|AAK93761.1|
           unknown protein [Arabidopsis thaliana]
          Length = 210

 Score =  103 bits (257), Expect = 1e-21
 Identities = 50/59 (84%), Positives = 55/59 (92%)
 Frame = -1

Query: 510 IVVSGIAKMFVGEVVETARIVIKERKESGPIRPCHLREAYRRLKLEGKVFKRSVPRLFR 334
           IV  GIAKMFVGE+VETAR+V+ ERKESGPIRPCH+RE+YRRLKLEGKV KRSVPRLFR
Sbjct: 152 IVACGIAKMFVGELVETARVVMAERKESGPIRPCHIRESYRRLKLEGKVPKRSVPRLFR 210

>pir||D86333 T20H2.22 protein - Arabidopsis thaliana
           gi|8778998|gb|AAF79913.1|AC022472_22 Contains similarity
           to PRO2134 mRNA from Homo sapiens gb|AF118094.
           [Arabidopsis thaliana]
          Length = 233

 Score = 99.8 bits (247), Expect = 2e-20
 Identities = 47/61 (77%), Positives = 57/61 (93%)
 Frame = -1

Query: 516 ITIVVSGIAKMFVGEVVETARIVIKERKESGPIRPCHLREAYRRLKLEGKVFKRSVPRLF 337
           + IVV GIAKMFVG++VETAR+V++ERKESGPIRPCH+RE+YRRLKL+GKV +RSV RLF
Sbjct: 173 MNIVVRGIAKMFVGDLVETARVVMRERKESGPIRPCHIRESYRRLKLQGKVPQRSVQRLF 232

Query: 336 R 334
           R
Sbjct: 233 R 233

>ref|NP_173429.1| hypothetical protein; protein id: At1g20000.1 [Arabidopsis
           thaliana]
          Length = 204

 Score = 99.8 bits (247), Expect = 2e-20
 Identities = 47/61 (77%), Positives = 57/61 (93%)
 Frame = -1

Query: 516 ITIVVSGIAKMFVGEVVETARIVIKERKESGPIRPCHLREAYRRLKLEGKVFKRSVPRLF 337
           + IVV GIAKMFVG++VETAR+V++ERKESGPIRPCH+RE+YRRLKL+GKV +RSV RLF
Sbjct: 144 MNIVVRGIAKMFVGDLVETARVVMRERKESGPIRPCHIRESYRRLKLQGKVPQRSVQRLF 203

Query: 336 R 334
           R
Sbjct: 204 R 204

>pir||T05336 hypothetical protein F1C12.195 - Arabidopsis thaliana
           gi|2982445|emb|CAA18253.1| putative protein [Arabidopsis
           thaliana] gi|7268823|emb|CAB79028.1| putative protein
           [Arabidopsis thaliana]
          Length = 221

 Score = 95.1 bits (235), Expect = 4e-19
 Identities = 50/70 (71%), Positives = 55/70 (78%), Gaps = 11/70 (15%)
 Frame = -1

Query: 510 IVVSGIAKMFVGEVVET-----------ARIVIKERKESGPIRPCHLREAYRRLKLEGKV 364
           IV  GIAKMFVGE+VET           AR+V+ ERKESGPIRPCH+RE+YRRLKLEGKV
Sbjct: 152 IVACGIAKMFVGELVETGHLLNLNTLSVARVVMAERKESGPIRPCHIRESYRRLKLEGKV 211

Query: 363 FKRSVPRLFR 334
            KRSVPRLFR
Sbjct: 212 PKRSVPRLFR 221

>ref|NP_584650.1| TRANSCRIPTION INITIATION FACTOR TFIID 28kDa SUBUNIT
           [Encephalitozoon cuniculi] gi|19068686|emb|CAD25154.1|
           TRANSCRIPTION INITIATION FACTOR TFIID 28kDa SUBUNIT
           [Encephalitozoon cuniculi]
          Length = 148

 Score = 58.5 bits (140), Expect = 4e-08
 Identities = 29/56 (51%), Positives = 40/56 (70%), Gaps = 3/56 (5%)
 Frame = -1

Query: 510 IVVSGIAKMFVGEVVETARIVIKERKESGPIRPCHLREAYRRLKL---EGKVFKRS 352
           I V G+AK+FVGE++E A+ V +ER+E GP+ P H+ EAYRRL       KVFK++
Sbjct: 89  IAVCGLAKVFVGEMIEIAKAVQEERREEGPLLPSHIHEAYRRLYKRIPNTKVFKKA 144
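
Every alignment above reports Frame = -1, meaning the query was reverse-complemented and translated starting from its last nucleotide (position 516). The Python sketch below is not part of the original record; it only illustrates that convention using the final 36 letters of the contig. The function names are made up for this example, and the standard genetic code is assumed.

```python
# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str) -> str:
    """Translate DNA codon by codon; a trailing partial codon is ignored.

    Ambiguous bases such as N are not handled and raise ValueError.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        first, second, third = dna[i:i + 3]
        index = 16 * BASES.index(first) + 4 * BASES.index(second) + BASES.index(third)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


def translate_minus_frame(dna: str, frame: int) -> str:
    """Translate a reverse-strand frame (-1, -2 or -3) of the query."""
    if frame not in (-1, -2, -3):
        raise ValueError("frame must be -1, -2 or -3")
    offset = abs(frame) - 1
    return translate(reverse_complement(dna)[offset:])


# Last 36 nucleotides of KMC009324A_c01 (positions 481-516).
contig_tail = "AAACATTTTTGCAATCCCTGATACTACAATTGTTAT"
print(translate_minus_frame(contig_tail, -1))  # ITIVVSGIAKMF
```

The printed peptide is the start of the query line in the alignments that begin at position 516 (ITIVVSGIAKMF...).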

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 446,855,055
Number of Sequences: 1393205
Number of extensions: 9164706
Number of successful extensions: 18680
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 18309
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 18673
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 16154357632
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
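
The numbers in the statistics block can be cross-checked against the hit list with the standard Karlin-Altschul relations: bit score S' = (Lambda * S - ln K) / ln 2, and E = (effective search space) * 2^(-S'). The Python sketch below is not part of the original report. It uses the gapped Lambda and K as printed (rounded), so agreement is expected only to the printed precision, and this BLAST version may apply corrections that are not modeled here. Function names are illustrative.

```python
import math

# Values copied from the report's statistics block.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 115
EFFECTIVE_SEARCH_SPACE = 16_154_357_632


def effective_db_length(letters: int, sequences: int, hsp_length: int) -> int:
    """Database length after removing one effective HSP length per sequence."""
    return letters - sequences * hsp_length


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to bits using the gapped Lambda and K."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score: int) -> float:
    """Expected number of chance hits scoring at least raw_score."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


db_length = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
print(db_length)                            # 288470672
print(EFFECTIVE_SEARCH_SPACE // db_length)  # 56, the implied effective query length

# Raw scores (in parentheses in the report) for the five hits.
for raw in (257, 247, 235, 140):
    print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

The computed database length matches the reported effective length of database (288,470,672), and the E-values round to the reported 1e-21, 2e-20, 4e-19 and 4e-08.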


EST assemble image


No.  Clone        Accession  Start  End
1    GNf088f11    BP073886       1  363
2    MFB057h11_f  BP038162      26  516




Lotus japonicus
Kazusa DNA Research Institute