KMC004522A_c01

Fasta Sequence
>KMC004522A_C01 KMC004522A_c01
ataaaaaatgatttcatacaagagcgttaaaacaaacaaagaaaccattgtattgtatgt
taaacacttaaacctATCCAAAATTCTCCTTCTCCATCTTTCCACTTGTATATAAGACAA
GTAATCCTGCAAGTGTGTGTGTAAAAGTCTGTCTGTACAATATAAAAATTCACGGGCATC
AACATTTTCTTGAATGCATTCCTCCACAGCAGTAAGGAAAAGAAGAAAATAAAATAAAGG
AGAACTAGTATAGACAGACAAGTGGTATAAGGTTTTAGTGAGTATTTGGACTCTCTTGAG
ATCAATGCCGATCCATATTGAACTCAACCTGACAAAAGTTCTGTTCCGTGTGAGCATTGA
GATGTGCATTTTATTCAGTTCCAGAGGAAAGCCAAACACACTAGTACCATGGCTAGGGGT
AGATAGTGCAGAGGTCATAGCCAGAGATTAATGAGCTAACAGCAAAGGCAATGAAAGCAG
GGAAGGCCAATCCAATGGAAGCACTAGCCATCTCAGTGAATCTCATCTTTTCCCCAATTG
GATTGCCAGTCATCAAGCCGGGTGGCTGCAGATGATGCTGTTGATATCAAAAGATATGCT
AGAACCTGATCCATGAAGAAATCGAAGTGAAAGTGCGGGTGGTGATTGATTAGTTGTCTC
CCAGCATATAGTTGATAGCCCAGATCACTTGCTTGAAAAGCTGCATATG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004522A_C01 KMC004522A_c01
         (709 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAO22748.1| unknown protein [Arabidopsis thaliana]                  80  2e-22
ref|NP_201088.1| putative protein; protein id: At5g62820.1 [Arab...    68  8e-17
ref|NP_181174.1| hypothetical protein; protein id: At2g36330.1 [...    57  2e-15
ref|NP_198846.1| putative protein; protein id: At5g40300.1 [Arab...    61  3e-14
dbj|BAB10851.1| gene_id:MQB2.14~unknown protein [Arabidopsis tha...    45  3e-10

>gb|AAO22748.1| unknown protein [Arabidopsis thaliana]
          Length = 283

 Score = 80.5 bits (197), Expect(2) = 2e-22
 Identities = 36/62 (58%), Positives = 47/62 (75%)
 Frame = -3

Query: 707 YAAFQASDLGYQLYAGRQLINHHPHFHFDFFMDQVLAYLLISTASSAATRLDDWQSNWGK 528
           Y++FQA DL Y L   + LI+HH    F+F +DQVLAYLL+S +++A TR+DDW SNWGK
Sbjct: 186 YSSFQACDLAYHLVKEKHLISHHLRPLFEFIIDQVLAYLLMSASTAAVTRVDDWVSNWGK 245

Query: 527 DE 522
           DE
Sbjct: 246 DE 247

 Score = 47.8 bits (112), Expect(2) = 2e-22
 Identities = 24/33 (72%), Positives = 28/33 (84%)
 Frame = -1

Query: 529 KMRFTEMASASIGLAFPAFIAFAVSSLISGYDL 431
           K  FTEMASASI ++F AF+AFA SSLISGY+L
Sbjct: 245 KDEFTEMASASIAMSFLAFLAFAFSSLISGYNL 277

>ref|NP_201088.1| putative protein; protein id: At5g62820.1 [Arabidopsis thaliana]
          Length = 297

 Score = 67.8 bits (164), Expect(2) = 8e-17
 Identities = 34/62 (54%), Positives = 42/62 (66%)
 Frame = -3

Query: 707 YAAFQASDLGYQLYAGRQLINHHPHFHFDFFMDQVLAYLLISTASSAATRLDDWQSNWGK 528
           Y+AF+A D    +     +IN   H  F F MDQ+LAYLL+S +S AATR+DDW SNWGK
Sbjct: 200 YSAFEACDAACYIAKESYMINCGFHDLFVFSMDQLLAYLLMSASSCAATRVDDWVSNWGK 259

Query: 527 DE 522
           DE
Sbjct: 260 DE 261

 Score = 41.2 bits (95), Expect(2) = 8e-17
 Identities = 22/35 (62%), Positives = 27/35 (76%)
 Frame = -1

Query: 529 KMRFTEMASASIGLAFPAFIAFAVSSLISGYDLCT 425
           K  FT+MA+ASI ++F AF AFAVS+LIS Y L T
Sbjct: 259 KDEFTQMATASIAVSFLAFGAFAVSALISSYRLFT 293

>ref|NP_181174.1| hypothetical protein; protein id: At2g36330.1 [Arabidopsis
           thaliana] gi|25408462|pir||D84779 hypothetical protein
           At2g36330 [imported] - Arabidopsis thaliana
           gi|4510344|gb|AAD21433.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 431

 Score = 56.6 bits (135), Expect(2) = 2e-15
 Identities = 28/60 (46%), Positives = 36/60 (59%)
 Frame = -3

Query: 701 AFQASDLGYQLYAGRQLINHHPHFHFDFFMDQVLAYLLISTASSAATRLDDWQSNWGKDE 522
           +FQA DL Y L   + LI+HH    F+F +DQ          ++A TR+DDW SNWGKDE
Sbjct: 346 SFQACDLAYHLVKEKHLISHHLRPLFEFIIDQ----------ATAVTRVDDWVSNWGKDE 395

 Score = 47.8 bits (112), Expect(2) = 2e-15
 Identities = 24/33 (72%), Positives = 28/33 (84%)
 Frame = -1

Query: 529 KMRFTEMASASIGLAFPAFIAFAVSSLISGYDL 431
           K  FTEMASASI ++F AF+AFA SSLISGY+L
Sbjct: 393 KDEFTEMASASIAMSFLAFLAFAFSSLISGYNL 425

>ref|NP_198846.1| putative protein; protein id: At5g40300.1 [Arabidopsis thaliana]
           gi|10178139|dbj|BAB11584.1| gene_id:MPO12.1~unknown
           protein [Arabidopsis thaliana]
          Length = 270

 Score = 60.8 bits (146), Expect(2) = 3e-14
 Identities = 28/62 (45%), Positives = 39/62 (62%)
 Frame = -3

Query: 707 YAAFQASDLGYQLYAGRQLINHHPHFHFDFFMDQVLAYLLISTASSAATRLDDWQSNWGK 528
           Y+ F   DL Y L    +   H+     +F +DQ+LAYLL S ++SA+ R+DDWQSNWG 
Sbjct: 173 YSGFMICDLVYLLSTSIRRSRHNLRHFLEFGLDQMLAYLLASASTSASIRVDDWQSNWGA 232

Query: 527 DE 522
           D+
Sbjct: 233 DK 234

 Score = 39.3 bits (90), Expect(2) = 3e-14
 Identities = 16/34 (47%), Positives = 25/34 (73%)
 Frame = -1

Query: 523 RFTEMASASIGLAFPAFIAFAVSSLISGYDLCTI 422
           +F ++A AS+ L++ +F+AFA  SL SGY LC +
Sbjct: 234 KFPDLARASVALSYVSFVAFAFCSLASGYALCAL 267

>dbj|BAB10851.1| gene_id:MQB2.14~unknown protein [Arabidopsis thaliana]
          Length = 273

 Score = 45.4 bits (106), Expect(2) = 3e-10
 Identities = 27/59 (45%), Positives = 31/59 (51%)
 Frame = -3

Query: 698 FQASDLGYQLYAGRQLINHHPHFHFDFFMDQVLAYLLISTASSAATRLDDWQSNWGKDE 522
           F+A D    +     +IN   H  F F MDQV          SAATR+DDW SNWGKDE
Sbjct: 189 FEACDAACYIAKESYMINCGFHDLFVFSMDQV----------SAATRVDDWVSNWGKDE 237

 Score = 41.2 bits (95), Expect(2) = 3e-10
 Identities = 22/35 (62%), Positives = 27/35 (76%)
 Frame = -1

Query: 529 KMRFTEMASASIGLAFPAFIAFAVSSLISGYDLCT 425
           K  FT+MA+ASI ++F AF AFAVS+LIS Y L T
Sbjct: 235 KDEFTQMATASIAVSFLAFGAFAVSALISSYRLFT 269

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 609,386,857
Number of Sequences: 1393205
Number of extensions: 13157675
Number of successful extensions: 33292
Number of sequences better than 10.0: 19
Number of HSP's better than 10.0 without gapping: 32243
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 33272
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 32373034405
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
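
The bit scores in the alignments above can be reproduced from the raw scores in parentheses and the gapped Lambda and K values printed in this report, using the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2. The sketch below is illustrative only: the function and variable names are hypothetical, and the effective query length is merely inferred from the ratio of the two printed figures, not taken from any BLAST source.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def bit_score(raw_score):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


# Raw scores taken from the HSP headers (the numbers in parentheses).
for raw in (197, 112, 164, 95, 135, 146, 90, 106):
    print(f"raw {raw:>3} -> {bit_score(raw):.1f} bits")

# Database figures as printed in the report.
db_letters = 448_689_247
db_sequences = 1_393_205
effective_hsp_length = 120
search_space = 32_373_034_405

# Subtracting one effective HSP length per sequence reproduces the
# printed "effective length of database".
effective_db_length = db_letters - db_sequences * effective_hsp_length

# Inferred, not printed: the effective query length implied by the
# reported search space.
implied_query_length = search_space // effective_db_length
print(effective_db_length, implied_query_length)
```

With these inputs, a raw score of 197 converts to about 80.5 bits and 112 to about 47.8 bits, in line with the first hit, and the computed effective database length equals the 281,504,647 reported above.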


EST assemble image


No.  Clone        Accession  Start  End
1    MWL015e04_f  AV768836       1  299
2    MR052f09_f   BP080028      76  442
3    MPD074b05_f  AV774833     164  710
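
A minimal sketch of how the three clone positions listed above tile the contig, assuming the two numbers on each row are 1-based, inclusive start and end coordinates. The helper name `merge_intervals` is hypothetical. Note that the last clone ends at position 710 while the BLASTX query is reported as 709 letters; the sketch takes the listed coordinates at face value.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or abuts the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Clone positions copied from the EST assembly listing.
est_positions = {
    "MWL015e04_f": (1, 299),
    "MR052f09_f": (76, 442),
    "MPD074b05_f": (164, 710),
}

contig_span = merge_intervals(est_positions.values())
print(contig_span)

# Overlap between consecutive clones, in bases.
ordered = sorted(est_positions.values())
overlaps = [prev[1] - cur[0] + 1 for prev, cur in zip(ordered, ordered[1:])]
print(overlaps)
```

The three clones merge into a single uninterrupted span, with each consecutive pair overlapping by a few hundred bases.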




Lotus japonicus
Kazusa DNA Research Institute