KMC011613A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC011613A_C01 KMC011613A_c01
aagctagtcaaaaaagctataagctagctgaaataacatgtcaaacacagacACATACAT
AACATCACTCTGATATACAGAATTACAAATGCTAAGCCATTAATTTCCCCCTAATATAAA
ACTTATTAATTCAGTCCAAGATCACATCACATAACAACAAGTGGATTCACATATGTAGTA
GGATCCACATATAGTTGCTTTTATAAAGATGAATCCACTTCTAATGCTTCCATAATTGAA
ACCAAATTAACATGCATAACAATTAATACAAACAGTGACACACGTAGTGATAAAATGACT
ACTTTTTGAAGAAACCACCAGCCAACTTAGTAAAATCTCCACCCAAACCACCTCCACCAG
CAGATTGAGCAGCATGCTCACCTTGAGATTCTTCTGGTTTGGAAGGAGCAGGAGCAGGAG
CAGCACCACCAGGCTTATAATCGTGGAGGTAATCCGCAGCCTTATCAACGTACTGTCCTA
CGCCCTTCTGTTCATCCAGCTTGGCGTACTGACTAGCTGCATCAAGAAGATCACCAGCTG
CATCTGCCGCCTTGGCTTGGTCCACCTGGTCAGATTCGTTCCTCAACGCCGCTGGTGATG
CCTCGCTCACCACCTTGGCGCTAGCTAAAAGCTCGCTAGTTGA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC011613A_C01 KMC011613A_c01
         (643 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_178443.1| unknown protein; protein id: At2g03440.1 [Arabi...   106  2e-22
ref|NP_563934.1| expressed protein; protein id: At1g13930.1, sup...   106  3e-22
gb|AAO42832.1| At2g03440 [Arabidopsis thaliana]                       106  3e-22
gb|AAM62639.1| unknown [Arabidopsis thaliana]                         103  3e-21
pir||S71562 drought-induced protein SDi-6 - common sunflower (fr...   100  2e-20

>ref|NP_178443.1| unknown protein; protein id: At2g03440.1 [Arabidopsis thaliana]
           gi|25411006|pir||E84448 hypothetical protein At2g03440
           [imported] - Arabidopsis thaliana
           gi|4335755|gb|AAD17432.1| unknown protein [Arabidopsis
           thaliana] gi|22531277|gb|AAM97142.1| unknown protein
           [Arabidopsis thaliana]
          Length = 187

 Score =  106 bits (265), Expect = 2e-22
 Identities = 61/132 (46%), Positives = 80/132 (60%), Gaps = 19/132 (14%)
 Frame = -1

Query: 643 STSELLASAKVVSEASPAALRNESDQVDQAKAADAAGDLLDAASQYAKLDEQKGVGQYVD 464
           + +EL+ASAK+V+EA+ AA R+ESD++D+AK A A  D+LDAAS+Y KLDE+ GVGQY++
Sbjct: 64  TNAELMASAKIVAEAAQAAARHESDKLDKAKVAGATADILDAASRYGKLDEKSGVGQYLE 123

Query: 463 KAADYLHDYKP-------------------GGAAPAPAPSKPEESQGEHAAQSAGGGGLG 341
           KA  YLH Y+                    GG A APA  K +E  G       GG G  
Sbjct: 124 KAEQYLHKYETSHSHSSTGGTGSHGNVGGHGGGAGAPAAKKEDEKSG-------GGHGF- 175

Query: 340 GDFTKLAGGFFK 305
           GD+ K+A GF K
Sbjct: 176 GDYAKMAQGFMK 187

>ref|NP_563934.1| expressed protein; protein id: At1g13930.1, supported by cDNA:
           1505., supported by cDNA: gi_15010677, supported by
           cDNA: gi_16323297 [Arabidopsis thaliana]
           gi|25372939|pir||D86272 hypothetical protein F7A19.2
           [imported] - Arabidopsis thaliana
           gi|5080769|gb|AAD39279.1|AC007576_2 Unknown protein
           [Arabidopsis thaliana]
           gi|8778391|gb|AAF79399.1|AC068197_9 F16A14.14
           [Arabidopsis thaliana] gi|15010678|gb|AAK73998.1|
           At1g13930/F16A14.27 [Arabidopsis thaliana]
           gi|16323298|gb|AAL15404.1| At1g13930/F16A14.27
           [Arabidopsis thaliana]
          Length = 155

 Score =  106 bits (264), Expect = 3e-22
 Identities = 61/121 (50%), Positives = 80/121 (65%), Gaps = 8/121 (6%)
 Frame = -1

Query: 643 STSELLASAKVVSEASPAALRNESDQVDQAKAADAAGDLLDAASQYAKLDEQKGVGQYVD 464
           + +EL+ASAKVV+EA+ AA RNESD++D+ K A A+ D+LDAA +Y K DE+   GQY+D
Sbjct: 36  TNAELMASAKVVAEAAQAAARNESDKLDKGKVAGASADILDAAEKYGKFDEKSSTGQYLD 95

Query: 463 KAADYLHDYK----PGGAAPAPAPSKPE-ESQGEHAAQ---SAGGGGLGGDFTKLAGGFF 308
           KA  YL+DY+     G   P P  S+ E  SQ E AA+      GGGLGG + K+A GF 
Sbjct: 96  KAEKYLNDYESSHSTGAGGPPPPTSQAEPASQPEPAAKKDDEESGGGLGG-YAKMAQGFL 154

Query: 307 K 305
           K
Sbjct: 155 K 155

>gb|AAO42832.1| At2g03440 [Arabidopsis thaliana]
          Length = 187

 Score =  106 bits (264), Expect = 3e-22
 Identities = 60/132 (45%), Positives = 80/132 (60%), Gaps = 19/132 (14%)
 Frame = -1

Query: 643 STSELLASAKVVSEASPAALRNESDQVDQAKAADAAGDLLDAASQYAKLDEQKGVGQYVD 464
           + +EL+ASAK+++EA+ AA R+ESD++D+AK A A  D+LDAAS+Y KLDE+ GVGQY++
Sbjct: 64  TNAELMASAKIIAEAAQAAARHESDKLDKAKVAGATADILDAASRYGKLDEKSGVGQYLE 123

Query: 463 KAADYLHDYKP-------------------GGAAPAPAPSKPEESQGEHAAQSAGGGGLG 341
           KA  YLH Y+                    GG A APA  K +E  G       GG G  
Sbjct: 124 KAEQYLHKYETSHSHSSTGGTGSHGNVGGHGGGAGAPAAKKEDEKSG-------GGHGF- 175

Query: 340 GDFTKLAGGFFK 305
           GD+ K+A GF K
Sbjct: 176 GDYAKMAQGFMK 187

>gb|AAM62639.1| unknown [Arabidopsis thaliana]
          Length = 155

 Score =  103 bits (256), Expect = 3e-21
 Identities = 59/121 (48%), Positives = 79/121 (64%), Gaps = 8/121 (6%)
 Frame = -1

Query: 643 STSELLASAKVVSEASPAALRNESDQVDQAKAADAAGDLLDAASQYAKLDEQKGVGQYVD 464
           + +EL+ASAKVV+EA+ AA RNESD++D+ K A A+ D+LDA+ +Y K DE+   G Y+D
Sbjct: 36  TNAELMASAKVVAEAAQAAARNESDKLDKGKVAGASADILDASEKYGKFDEKSSTGHYLD 95

Query: 463 KAADYLHDYK----PGGAAPAPAPSKPE-ESQGEHAAQ---SAGGGGLGGDFTKLAGGFF 308
           KA  YL+DY+     G   P P  S+ E  SQ E AA+      GGGLGG + K+A GF 
Sbjct: 96  KAEKYLNDYESSHSTGAGGPPPPTSQAEPASQPEPAAKKDDEESGGGLGG-YAKMAQGFL 154

Query: 307 K 305
           K
Sbjct: 155 K 155

>pir||S71562 drought-induced protein SDi-6 - common sunflower (fragment)
          Length = 168

 Score =  100 bits (248), Expect = 2e-20
 Identities = 56/125 (44%), Positives = 79/125 (62%), Gaps = 12/125 (9%)
 Frame = -1

Query: 643 STSELLASAKVVSEASPAALRNESDQVDQAKAADAAGDLLDAASQYAKLDEQKGVGQYVD 464
           ++++LL+SAK+V+EA+ +A  N++DQ+D+ K A A  DLLD++ +Y K DE +GVGQY+ 
Sbjct: 44  TSTDLLSSAKLVAEAAQSAATNKTDQIDKQKVAGATADLLDSSKEYGKFDESQGVGQYIK 103

Query: 463 KAADYLHDYKPGGAA----PAPAPSKPEESQGE--HAAQSAGG-----GGLG-GDFTKLA 320
           +A DYLH Y+  GA     PA AP   EE + E     +  GG      G+G GD  K A
Sbjct: 104 QADDYLHKYEKSGATGATPPAEAPLVTEEKKAEAPPGVEEKGGKDESESGIGAGDAIKAA 163

Query: 319 GGFFK 305
           G FFK
Sbjct: 164 GSFFK 168

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 552,238,625
Number of Sequences: 1393205
Number of extensions: 13426490
Number of successful extensions: 76967
Number of sequences better than 10.0: 134
Number of HSP's better than 10.0 without gapping: 53672
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 71901
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27007650415
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFBL021b08_f BP042298 1 450
2 MFB007b02_f BP034389 53 557
3 MPD030h10_f AV772090 59 494
4 SPD047e06_f BP047750 118 242
5 MFB030c09_f BP036184 132 643




Lotus japonicus
Kazusa DNA Research Institute