KMC016537A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC016537A_C01 KMC016537A_c01
aacatcctTAAACTCATCAGTTTTACATTCCATATAATGAATGGGTCATGGTAATCATCA
AGAAATGAATATAGGAACCAGTGAGCCACCAATAACTGGCATAAAGGAAAACTTAAATGA
GATTACATATTTTTCCAGATCAGCGTAAGACATAAAAGGATACTAGATGAATGCTAAACA
GTACCATATTCAAACACAATAACTTTGAACCCGAATCACAAGTCTTTCTAATTTATATTA
TTTGCTTGGGAATTGGAACTTTAAGCACTAAATGGAATCCCCACAATCTGAAATGAAACT
CATACTATTATTATTATCTTCATGCAAAATTTCAGTTGTCACTCCTGCTAGGTTTGTCTG
AATTGAAAACATCCTCAGTGGAAGGAGCAGGAACCCATTCGTTAGTTGCTGGATTTCCCT
TGGAAGAGTGGTGGTGGCCAGTGTCATTATCTGGTTCTATGTGGAAGCCTGTGCCACTTT
TGTCTACTTGGATAAACCCTTTTCCTGTCGTATGGTGGTGTTTGTCCACATTGTTGAGAT
CTTTGTAATACTCTGTTTCCACAAATTCCTCTGCCACTTGATTTCCATTGTAAACCAACT
TCTTCTCCTGAGGATCATTCTCAAGGCGTTGAAACTGCAACATTTCTGGTGTAGAATCAT
TGCTAACTCCATTGATGTTGTCCACATATTGAGAAGAATACTCACTGCGTTGAGGCTTTG
TGTGGCGCTTTGGTGCCCCTTCTTCAAAATACTCTCTGGTCTGTGCTTC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC016537A_C01 KMC016537A_c01
         (769 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_195447.1| putative protein; protein id: At4g37300.1, supp...    94  3e-18
gb|EAA21835.1| CCAAT-box DNA binding protein subunit B [Plasmodi...    39  0.11
gb|AAO51932.1| similar to Derepression of GCN4 expression; Gcn2p...    35  0.90
gb|EAA17720.1| ring-infested erythrocyte surface antigen, putati...    34  2.0
ref|NP_701379.1| hypothetical protein [Plasmodium falciparum 3D7...    34  2.0

>ref|NP_195447.1| putative protein; protein id: At4g37300.1, supported by cDNA:
           gi_13877858, supported by cDNA: gi_14517511, supported
           by cDNA: gi_15809769, supported by cDNA: gi_17065601
           [Arabidopsis thaliana] gi|25372664|pir||G85440
           hypothetical protein AT4g37300 [imported] - Arabidopsis
           thaliana gi|2464852|emb|CAB16754.1| putative protein
           [Arabidopsis thaliana] gi|7270713|emb|CAB80396.1|
           putative protein [Arabidopsis thaliana]
           gi|13877859|gb|AAK44007.1|AF370192_1 unknown protein
           [Arabidopsis thaliana] gi|14517512|gb|AAK62646.1|
           AT4g37300/C7A10_60 [Arabidopsis thaliana]
           gi|15809770|gb|AAL06813.1| AT4g37300/C7A10_60
           [Arabidopsis thaliana] gi|17065602|gb|AAL33781.1|
           unknown protein [Arabidopsis thaliana]
          Length = 173

 Score = 93.6 bits (231), Expect = 3e-18
 Identities = 63/152 (41%), Positives = 83/152 (54%), Gaps = 9/152 (5%)
 Frame = -1

Query: 763 QTREYFEEGAPKRHTKPQRSEYSSQYVDNINGVSNDSTPEMLQFQRLENDPQEKKLVY-N 587
           Q R  F+  APKR TKP RSE       + +    D  PE  +FQ L++    K L   +
Sbjct: 24  QIRADFDSLAPKRPTKPTRSEPGFPGSFSASDKITDH-PEADKFQSLQSQTHGKVLGEGD 82

Query: 586 GNQVAEEFVETEYYKDLNNVDKHHHTTGKGFIQVDKSGTGFHIE-----PDNDTGHHHSS 422
            + V +EF+ETEYY +L  +DK HHTTG GFI V K   G   E        D G     
Sbjct: 83  SSAVQDEFLETEYYSNLTAIDKQHHTTGSGFINVVKEDNGEESEAVTAAAIGDGGEKAVY 142

Query: 421 KGNPATNEWVPAPSTEDVFNSD---KPSRSDN 335
           + NPATNEW+PA  TE+ F+S+   KP+RS++
Sbjct: 143 RSNPATNEWIPA--TEEDFDSESSSKPNRSES 172

>gb|EAA21835.1| CCAAT-box DNA binding protein subunit B [Plasmodium yoelii yoelii]
          Length = 998

 Score = 38.5 bits (88), Expect = 0.11
 Identities = 38/133 (28%), Positives = 55/133 (40%), Gaps = 22/133 (16%)
 Frame = -1

Query: 745 EEGAPKRHTKPQRSEYSSQYVDNINGV--SNDSTPEMLQFQRLENDPQEKKLV------- 593
           E+ + K + K +RS  SS +  N N     ND   E  + Q +END ++KK         
Sbjct: 339 EDRSKKEYRKNERSN-SSNFSQNGNEFCKENDEKKEKYETQSVENDEKKKKYETQSVEND 397

Query: 592 --------YNGNQVAEEFVETEYYKDLNN---VDKHHHTTGKGFIQVDKS--GTGFHIEP 452
                   +N +   E      Y +D NN   +DK +H  G+  I  D S      H E 
Sbjct: 398 EKKKNSDHFNDSGKGETTNHNNYKQDHNNNYYMDKEYH-DGRKLINKDYSIYDEDIHFEN 456

Query: 451 DNDTGHHHSSKGN 413
           D    +H S KG+
Sbjct: 457 DKKYNNHTSDKGS 469

>gb|AAO51932.1| similar to Derepression of GCN4 expression; Gcn2p [Saccharomyces
            cerevisiae] [Dictyostelium discoideum]
          Length = 1700

 Score = 35.4 bits (80), Expect = 0.90
 Identities = 27/97 (27%), Positives = 46/97 (46%)
 Frame = -1

Query: 694  SQYVDNINGVSNDSTPEMLQFQRLENDPQEKKLVYNGNQVAEEFVETEYYKDLNNVDKHH 515
            S    NIN VSN +   +L    +++  Q  +L ++     + F+ETE+  +L +  KHH
Sbjct: 1164 STSTSNINKVSNKTGKSIL----MDDSGQLLELRHDLRVSFKSFIETEFL-NLGDHYKHH 1218

Query: 514  HTTGKGFIQVDKSGTGFHIEPDNDTGHHHSSKGNPAT 404
            H   +  ++ D S  G     +++  HH   K N  T
Sbjct: 1219 HQQ-QNDVRHDNSSNGNSNNNNSNDRHHDQDKSNTTT 1254

>gb|EAA17720.1| ring-infested erythrocyte surface antigen, putative [Plasmodium
           yoelii yoelii]
          Length = 515

 Score = 34.3 bits (77), Expect = 2.0
 Identities = 18/77 (23%), Positives = 37/77 (47%)
 Frame = -1

Query: 706 SEYSSQYVDNINGVSNDSTPEMLQFQRLENDPQEKKLVYNGNQVAEEFVETEYYKDLNNV 527
           +E S +Y +NIN        E +    +  +  E +L  + ++  EE++E   Y++  N 
Sbjct: 290 NEMSEKYDENINENEMGEKEENINENEMSEEYDENQLEDSYSKDNEEYIEKNVYENNRNR 349

Query: 526 DKHHHTTGKGFIQVDKS 476
            ++H+   K  I  +K+
Sbjct: 350 IEYHNNINKNIISFEKN 366

>ref|NP_701379.1| hypothetical protein [Plasmodium falciparum 3D7]
            gi|23496545|gb|AAN36103.1|AE014844_14 hypothetical
            protein [Plasmodium falciparum 3D7]
          Length = 1129

 Score = 34.3 bits (77), Expect = 2.0
 Identities = 27/137 (19%), Positives = 57/137 (40%), Gaps = 7/137 (5%)
 Frame = -1

Query: 718  KPQRSEYSSQYVDNINGVSNDSTPEMLQFQRLENDPQEKKLVYNGNQVAEEFVETEYYKD 539
            K + +E +++ ++ +N   N+   EM +    EN+ +  +     N      V  E   +
Sbjct: 844  KEENNEVNNEEMNEVNNEVNNE--EMNEVNNEENNEENNEENNEENNEEMNEVNNEENNE 901

Query: 538  LNNVDKHH-------HTTGKGFIQVDKSGTGFHIEPDNDTGHHHSSKGNPATNEWVPAPS 380
            +NN + +            KG  +++  G     E +N+T    ++KGN  TN+      
Sbjct: 902  VNNEENNEVNNETNKEENNKGNDEINNEGNN---EENNETNKEENNKGNDETNKEENNKG 958

Query: 379  TEDVFNSDKPSRSDN*N 329
             +++ N      +D+ N
Sbjct: 959  NDEINNEGNNKGNDDMN 975

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 696,758,650
Number of Sequences: 1393205
Number of extensions: 15978851
Number of successful extensions: 43686
Number of sequences better than 10.0: 21
Number of HSP's better than 10.0 without gapping: 41828
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 43622
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 37534933228
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPDL006f06_f BP052363 1 466
2 MFB048d10_f BP037487 9 246
3 MFB012a08_f BP034764 13 523
4 SPDL065c02_f BP056025 103 650
5 MFB021c06_f BP035485 207 771




Lotus japonicus
Kazusa DNA Research Institute