KMC001773A_c01

Fasta Sequence
>KMC001773A_C01 KMC001773A_c01
GAATTGAGAAGGGAAACACCAGAGTGTCCCTCTAACTGAGCTCCACAAACAGGCTGGCCT
GAGGATGCAATCCTGCTTCCTTCAATGACAATGTCAGTTTTTCTTGTCCATAAACGACCC
GCGGAAAATTGGAGACCAGGCTGTAGCTATCAGCTTCTAAACAACCCAATGAGTCAACAT
AGTCATATACAGACTGAATTGTCACTGTGTTGTTGAACCTCCTTTCCTTGCGTTCACCAG
TTGGAAATCGTACCAAAACCTGTGTAACATTAGGCCCTTTTGCAGGTTCTTCACCAAGGG
ACTCAGCTTTTTGTTGACGGATTTTAGCTAATGCAGCTTGTTTCTCTGCAGCTTCACGTG
CTTCTCTTTCACGAGCCTCCTCTTCCTCCTTGCGCTTCCTCTCAGCTTCAGCAGCTTCCC
TTGCAAGACGTTCTTGTTCTTCTCTTCTCTGCCGTTCCCTAGCTTGATCAGCTTCAAGTG
CAGCTCTGTATGCAGCATCTTGCTCCTCCCTTAACCGGATATTATTTCTTCTTTCTTCTG
CATCAAGCCTTGCTGCAACAAGAGTTGGGGAGCTTTCTTC
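
The BLASTX hits in the Nr search section are all reported with Frame = -1, meaning the query was translated from its reverse complement starting at the last nucleotide (position 580). The following is a minimal sketch of that translation, assuming the standard genetic code; the function names `reverse_complement` and `translate_frame_minus1` are illustrative and not part of any BLAST tool. Only the final 40 nucleotides of the contig are used as input.

```python
# Sketch: translate a nucleotide sequence in BLAST frame -1
# (reverse complement, reading from the last base of the query).
# Assumes the standard genetic code (NCBI translation table 1).

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon table in TCAG order: first, second, then third base.
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate_frame_minus1(seq):
    """Translate the reverse strand starting at the last base of seq.

    Incomplete trailing codons are dropped; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    rc = reverse_complement(seq.upper())
    codons = (rc[i:i + 3] for i in range(0, len(rc) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 40 nucleotides of the contig (positions 541-580).
tail = "CATCAAGCCTTGCTGCAACAAGAGTTGGGGAGCTTTCTTC"
print(translate_frame_minus1(tail))  # EESSPTLVAARLD
```

The printed peptide corresponds to the start of the Query line at position 580 in the first alignment of the BLASTX report.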


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001773A_C01 KMC001773A_c01
         (580 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAG13433.1|AC051634_14 unknown protein [Oryza sativa (japonic...   278  4e-74
ref|NP_192817.1| hypothetical protein; protein id: At4g10790.1 [...   266  1e-70
pir||T01898 hypothetical protein T12H20.9 - Arabidopsis thaliana...   266  1e-70
ref|NP_567675.1| putative protein; protein id: At4g23040.1, supp...    97  1e-19
pir||T05136 hypothetical protein F7H19.230 - Arabidopsis thalian...    94  9e-19

>gb|AAG13433.1|AC051634_14 unknown protein [Oryza sativa (japonica cultivar-group)]
          Length = 466

 Score =  278 bits (711), Expect = 4e-74
 Identities = 141/181 (77%), Positives = 163/181 (89%)
 Frame = -1

Query: 580 EESSPTLVAARLDAEERRNNIRLREEQDAAYRAALEADQARERQRREEQERLAREAAEAE 401
           EE S +LVAAR+DAEER NN RLREEQDAAYRAALEADQARERQRREEQE+  REAAEAE
Sbjct: 284 EECSASLVAARIDAEERLNNQRLREEQDAAYRAALEADQARERQRREEQEKREREAAEAE 343

Query: 400 RKRKEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGPNVTQVLVRFPTGERKER 221
           RKRKEEEEA+ER A+EAAEK+AALA+ RQ+KA +LG EP KGP+VT+VL+RFPTGERKER
Sbjct: 344 RKRKEEEEAQERAAQEAAEKEAALARRRQEKAMALGAEPEKGPDVTRVLIRFPTGERKER 403

Query: 220 RFNNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGLHPQASLFVE 41
           RFN++ TI S+YDYVDSL CL+A+ YSLVSNFPRV YG EKL+ +L+EAGLHPQASLF+E
Sbjct: 404 RFNSSTTITSLYDYVDSLDCLKAEKYSLVSNFPRVTYGPEKLSQTLEEAGLHPQASLFIE 463

Query: 40  L 38
           +
Sbjct: 464 I 464

>ref|NP_192817.1| hypothetical protein; protein id: At4g10790.1 [Arabidopsis
           thaliana] gi|25407518|pir||H85112 hypothetical protein
           AT4g10790 [imported] - Arabidopsis thaliana
           gi|7267777|emb|CAB81180.1| predicted protein of unknown
           function [Arabidopsis thaliana]
          Length = 480

 Score =  266 bits (681), Expect = 1e-70
 Identities = 132/182 (72%), Positives = 160/182 (87%)
 Frame = -1

Query: 580 EESSPTLVAARLDAEERRNNIRLREEQDAAYRAALEADQARERQRREEQERLAREAAEAE 401
           E+SSPTLV AR++AEERR N+RLREEQDAAYRAALEADQARE+QR+EE+ERL REAAEAE
Sbjct: 299 EDSSPTLVTARVEAEERRTNLRLREEQDAAYRAALEADQAREQQRQEEKERLEREAAEAE 358

Query: 400 RKRKEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGPNVTQVLVRFPTGERKER 221
           RK KEEEEARER AREA E+QAA  ++RQ+KA +LGEEP KGP+VTQVLVRFP GERK R
Sbjct: 359 RKLKEEEEARERAAREAEERQAARVRMRQEKALALGEEPEKGPDVTQVLVRFPNGERKGR 418

Query: 220 RFNNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGLHPQASLFVE 41
            F +   IQ++YDYVDSLG L+ + YSL++NFPR VYG++K ++SLK+AGLHPQASLF+E
Sbjct: 419 MFKSETKIQTLYDYVDSLGLLDTEEYSLITNFPRTVYGRDKESMSLKDAGLHPQASLFIE 478

Query: 40  LS 35
           ++
Sbjct: 479 IN 480

>pir||T01898 hypothetical protein T12H20.9 - Arabidopsis thaliana
           gi|3600032|gb|AAC35520.1| contains similarity to
           tropomyosin (Pfam: Tropomyosin.hmm, score: 14.57) and
           ATP synthase (Pfam: ATP-synt_B.hmm, score: 10.89)
           [Arabidopsis thaliana]
          Length = 466

 Score =  266 bits (681), Expect = 1e-70
 Identities = 132/182 (72%), Positives = 160/182 (87%)
 Frame = -1

Query: 580 EESSPTLVAARLDAEERRNNIRLREEQDAAYRAALEADQARERQRREEQERLAREAAEAE 401
           E+SSPTLV AR++AEERR N+RLREEQDAAYRAALEADQARE+QR+EE+ERL REAAEAE
Sbjct: 285 EDSSPTLVTARVEAEERRTNLRLREEQDAAYRAALEADQAREQQRQEEKERLEREAAEAE 344

Query: 400 RKRKEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGPNVTQVLVRFPTGERKER 221
           RK KEEEEARER AREA E+QAA  ++RQ+KA +LGEEP KGP+VTQVLVRFP GERK R
Sbjct: 345 RKLKEEEEARERAAREAEERQAARVRMRQEKALALGEEPEKGPDVTQVLVRFPNGERKGR 404

Query: 220 RFNNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGLHPQASLFVE 41
            F +   IQ++YDYVDSLG L+ + YSL++NFPR VYG++K ++SLK+AGLHPQASLF+E
Sbjct: 405 MFKSETKIQTLYDYVDSLGLLDTEEYSLITNFPRTVYGRDKESMSLKDAGLHPQASLFIE 464

Query: 40  LS 35
           ++
Sbjct: 465 IN 466

>ref|NP_567675.1| putative protein; protein id: At4g23040.1, supported by cDNA:
           gi_13430703 [Arabidopsis thaliana]
           gi|13430704|gb|AAK25974.1|AF360264_1 unknown protein
           [Arabidopsis thaliana] gi|23296844|gb|AAN13184.1|
           unknown protein [Arabidopsis thaliana]
          Length = 525

 Score = 97.4 bits (241), Expect = 1e-19
 Identities = 67/180 (37%), Positives = 97/180 (53%), Gaps = 2/180 (1%)
 Frame = -1

Query: 571 SPTLVAARLDAEERRNNIRLREEQDAAYRAALEADQARERQRREEQERLAREAAEAERKR 392
           SP+L A RL          +RE+QD  Y A+LEAD+ +   RR E+E    EA E E KR
Sbjct: 362 SPSLTAQRL----------IREQQDDEYLASLEADRVKAEARRLEEEAARVEAIE-EAKR 410

Query: 391 KEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGP-NVTQVLVRFPTGERKERRF 215
           KEEE  R+ E  +  E+Q         K  SL +EP  G  N   + VR P G R  RRF
Sbjct: 411 KEEEARRKVEEEQELERQLV------SKEASLPQEPPAGEENAITLQVRLPDGTRHGRRF 464

Query: 214 NNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGL-HPQASLFVEL 38
             +  +QS++D++D    ++ ++Y LV  +PR  +G  + + +L + GL   Q +LF+EL
Sbjct: 465 FKSDKLQSLFDFIDICRVVKPNTYRLVRPYPRRAFGDGECSSTLNDIGLTSKQEALFLEL 524

>pir||T05136 hypothetical protein F7H19.230 - Arabidopsis thaliana
           gi|3292830|emb|CAA19820.1| putative protein [Arabidopsis
           thaliana] gi|7269151|emb|CAB79259.1| putative protein
           [Arabidopsis thaliana]
          Length = 577

 Score = 94.4 bits (233), Expect = 9e-19
 Identities = 66/187 (35%), Positives = 100/187 (53%), Gaps = 9/187 (4%)
 Frame = -1

Query: 571 SPTLVAARLDAEERRNN-------IRLREEQDAAYRAALEADQARERQRREEQERLAREA 413
           SP+L A RL  E++  +       ++ +  QD  Y A+LEAD+ +   RR E+E    EA
Sbjct: 397 SPSLTAQRLIREQQDTDDDEFLFLLKCKLVQDDEYLASLEADRVKAEARRLEEEAARVEA 456

Query: 412 AEAERKRKEEEEAREREAREAAEKQAALAKIRQQKAESLGEEPAKGP-NVTQVLVRFPTG 236
            E E KRKEEE  R+ E  +  E+Q         K  SL +EP  G  N   + VR P G
Sbjct: 457 IE-EAKRKEEEARRKVEEEQELERQLV------SKEASLPQEPPAGEENAITLQVRLPDG 509

Query: 235 ERKERRFNNTVTIQSVYDYVDSLGCLEADSYSLVSNFPRVVYGQEKLTLSLKEAGL-HPQ 59
            R  RRF  +  +QS++D++D    ++ ++Y LV  +PR  +G  + + +L + GL   Q
Sbjct: 510 TRHGRRFFKSDKLQSLFDFIDICRVVKPNTYRLVRPYPRRAFGDGECSSTLNDIGLTSKQ 569

Query: 58  ASLFVEL 38
            +LF+EL
Sbjct: 570 EALFLEL 576

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 474,285,816
Number of Sequences: 1393205
Number of extensions: 10008772
Number of successful extensions: 105540
Number of sequences better than 10.0: 5053
Number of HSP's better than 10.0 without gapping: 54788
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 82245
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21426319650
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
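
The gapped Lambda and K values above relate the raw scores (shown in parentheses in each alignment header) to the reported bit scores through the Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2, and the expectation value is approximately the search space times 2^(-S'). The sketch below applies those formulas to the raw scores in this report. The function names are hypothetical, and because the printed parameters are rounded, the computed values should be read as approximations of the reported ones rather than an exact reproduction of BLAST's arithmetic.

```python
import math

# Values copied from the statistics block of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 21426319650  # "effective search space used"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE):
    """Approximate E-value: search space times 2^(-bit score)."""
    return search_space * 2.0 ** (-bit_score(raw_score))


# Raw scores of the five reported alignments.
for raw in (711, 681, 241, 233):
    print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

With these rounded parameters the computed bit scores fall within one bit of the values in the hit list; the E-values agree in order of magnitude but not necessarily in the leading digit.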


EST assemble image


no.  clone         accession  start  end
1    MPD035h02_f   AV772425   1      523
2    MR041c10_f    BP079159   6      264
3    MF038d08_f    BP030286   6      487
4    GNf096e10     BP074492   6      454
5    MWM245b01_f   AV768474   6      592
6    MF054g12_f    BP031160   6      381
7    GENf014f11    BP058946   8      383
8    MR023d10_f    BP077750   9      196
9    SPD020g03_f   BP045598   9      545
10   GNf053d11     BP071316   10     490
11   MFB046h09_f   BP037390   11     482
12   GNf054d05     BP071389   11     423
13   GNf079d03     BP073196   14     305
14   SPD056a11_f   BP048433   14     548




Lotus japonicus
Kazusa DNA Research Institute