KMC012234A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC012234A_C01 KMC012234A_c01
agtcacataaatcaTAAGTGAATGCACTTGTGCACAAGAAATCTGTCTGGTCCAAATTTA
CATTTCCTTTGGGGTCTTCCCAAAAGATTTAGTTTTCTTTTTGGGGACTAGTACATTCTA
GCAAGCTGAATTTTGCTTTGCCAAAACCACCTGAAATTGTGAGCTGTGGGAAGAAATGGG
ATATGAACAAATCCTCCAAGTTATTGATACATTTCAAAGAAGAAAAGCAAAAAGAGCTTG
GAAGATGGACTTTGGGCTCCGACACTTTTCAGGAGGACAATCTCTGCTTATTAAACCTTG
GATCTGATCCGCTTGTTTTCCCTCCATTGCATCCTTTAGAGGCCCCCAACATTCCATTCT
GTTGTATTTTTTTGAGCTTGAATGAAAATAAATTCGGACTATGGAATCTAGTAATTCCGG
GTGCTCATGGAAAAGTACAGAATCTCCATATTGAATTGCTTTGAGTACCAAAGGCATTTC
CTTGGAGAATATGTGATATCTAAATTCAGCAGCCAGATCCAGGAAGGGATTGGGACCACT
GACAAAGCAATGAACATGGAGACACATTTCATTTTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC012234A_C01 KMC012234A_c01
         (576 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564489.1| expressed protein; protein id: At1g44000.1, sup...   145  3e-34
gb|AAF79680.1|AC022314_21 F9C16.20 [Arabidopsis thaliana]             135  4e-31
ref|NP_192928.1| putative protein; protein id: At4g11910.1 [Arab...   109  3e-23
ref|NP_567673.1| expressed protein; protein id: At4g22920.1, sup...   101  6e-21
gb|AAM64476.1| unknown [Arabidopsis thaliana]                         101  6e-21

>ref|NP_564489.1| expressed protein; protein id: At1g44000.1, supported by cDNA:
           gi_15028026, supported by cDNA: gi_20259312 [Arabidopsis
           thaliana] gi|15028027|gb|AAK76544.1| unknown protein
           [Arabidopsis thaliana] gi|20259313|gb|AAM14392.1|
           unknown protein [Arabidopsis thaliana]
          Length = 260

 Score =  145 bits (367), Expect = 3e-34
 Identities = 71/119 (59%), Positives = 89/119 (74%)
 Frame = -1

Query: 573 NEMCLHVHCFVSGPNPFLDLAAEFRYHIFSKEMPLVLKAIQYGDSVLFHEHPELLDSIVR 394
           +E+ LH+HC VSG +   D+AAE RYHIFSKE+PLVLKA+ +GDSV+F E+PEL+D+ V 
Sbjct: 143 DELRLHIHCCVSGMSLLQDVAAELRYHIFSKELPLVLKAVVHGDSVMFRENPELMDAYVW 202

Query: 393 IYFHSSSKKYNRMECWGPLKDAMEGKQADQIQGLISRDCPPEKCRSPKSIFQALFAFLL 217
           +YFHSS+ KYNR+ECWGPLKDA +GKQ    QG +S     +  R  KSIF  LF FLL
Sbjct: 203 VYFHSSTPKYNRIECWGPLKDAAKGKQQGNHQGFLSSTTSRKLIRH-KSIFHTLFTFLL 260

>gb|AAF79680.1|AC022314_21 F9C16.20 [Arabidopsis thaliana]
          Length = 299

 Score =  135 bits (340), Expect = 4e-31
 Identities = 61/96 (63%), Positives = 78/96 (80%)
 Frame = -1

Query: 573 NEMCLHVHCFVSGPNPFLDLAAEFRYHIFSKEMPLVLKAIQYGDSVLFHEHPELLDSIVR 394
           +E+ LH+HC VSG +   D+AAE RYHIFSKE+PLVLKA+ +GDSV+F E+PEL+D+ V 
Sbjct: 143 DELRLHIHCCVSGMSLLQDVAAELRYHIFSKELPLVLKAVVHGDSVMFRENPELMDAYVW 202

Query: 393 IYFHSSSKKYNRMECWGPLKDAMEGKQADQIQGLIS 286
           +YFHSS+ KYNR+ECWGPLKDA +GKQ    QG +S
Sbjct: 203 VYFHSSTPKYNRIECWGPLKDAAKGKQQGNHQGFLS 238

>ref|NP_192928.1| putative protein; protein id: At4g11910.1 [Arabidopsis thaliana]
           gi|7487452|pir||T09350 hypothetical protein T26M18.120 -
           Arabidopsis thaliana gi|5002526|emb|CAB44329.1| putative
           protein [Arabidopsis thaliana]
           gi|7267892|emb|CAB78234.1| putative protein [Arabidopsis
           thaliana]
          Length = 466

 Score =  109 bits (272), Expect = 3e-23
 Identities = 54/114 (47%), Positives = 76/114 (66%), Gaps = 6/114 (5%)
 Frame = -1

Query: 576 KNEMCLHVHCFVSGPNPFLDLAAEFRYHIFSKEMPLVLKAIQYGDSVLFHEHPELLDSIV 397
           K +M LHVHC +SG + FL+L A+ RY+IF KE+P+VL+A  +GD  L + HPEL +S V
Sbjct: 122 KGKMSLHVHCHISGGHFFLNLIAKLRYYIFCKELPVVLEAFAHGDEYLLNNHPELQESPV 181

Query: 396 RIYFHSSSKKYNRMECWGPLKDAM-----EGKQADQIQGLISRDCPPE-KCRSP 253
            +YFHS+  +YN++ECWGPL +AM     +G+   + + L    CP E KC  P
Sbjct: 182 WVYFHSNIPEYNKVECWGPLWEAMSQHQHDGRTHKKSETLPELPCPDECKCCFP 235

 Score = 97.4 bits (241), Expect = 1e-19
 Identities = 43/83 (51%), Positives = 60/83 (71%)
 Frame = -1

Query: 576 KNEMCLHVHCFVSGPNPFLDLAAEFRYHIFSKEMPLVLKAIQYGDSVLFHEHPELLDSIV 397
           K+ M LHVHC +SG +  LDL AE RY IF KE+P+VLKA  +GD  + + +PEL ++ V
Sbjct: 315 KSNMSLHVHCHISGDHFLLDLIAELRYFIFCKELPMVLKAFVHGDENMLNNYPELHEAFV 374

Query: 396 RIYFHSSSKKYNRMECWGPLKDA 328
            +YFHS+  K+N++ECWG L +A
Sbjct: 375 WVYFHSNIPKFNKVECWGRLCEA 397

>ref|NP_567673.1| expressed protein; protein id: At4g22920.1, supported by cDNA:
           29391., supported by cDNA: gi_17380887, supported by
           cDNA: gi_20465936 [Arabidopsis thaliana]
           gi|7486565|pir||T05123 hypothetical protein F7H19.100 -
           Arabidopsis thaliana gi|3292817|emb|CAA19807.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|7269139|emb|CAB79247.1| hypothetical protein
           [Arabidopsis thaliana] gi|17380888|gb|AAL36256.1|
           unknown protein [Arabidopsis thaliana]
           gi|20465937|gb|AAM20154.1| unknown protein [Arabidopsis
           thaliana]
          Length = 268

 Score =  101 bits (252), Expect = 6e-21
 Identities = 44/84 (52%), Positives = 64/84 (75%)
 Frame = -1

Query: 576 KNEMCLHVHCFVSGPNPFLDLAAEFRYHIFSKEMPLVLKAIQYGDSVLFHEHPELLDSIV 397
           K +M LHVHC +SG +  LDL A+FRY IF KE+P+VLKA  +GD  L + +PEL +++V
Sbjct: 126 KGKMSLHVHCHISGGHFLLDLFAKFRYFIFCKELPVVLKAFVHGDGNLLNNYPELQEALV 185

Query: 396 RIYFHSSSKKYNRMECWGPLKDAM 325
            +YFHS+  ++N++ECWGPL +A+
Sbjct: 186 WVYFHSNVNEFNKVECWGPLWEAV 209

>gb|AAM64476.1| unknown [Arabidopsis thaliana]
          Length = 268

 Score =  101 bits (252), Expect = 6e-21
 Identities = 44/84 (52%), Positives = 64/84 (75%)
 Frame = -1

Query: 576 KNEMCLHVHCFVSGPNPFLDLAAEFRYHIFSKEMPLVLKAIQYGDSVLFHEHPELLDSIV 397
           K +M LHVHC +SG +  LDL A+FRY IF KE+P+VLKA  +GD  L + +PEL +++V
Sbjct: 126 KGKMSLHVHCHISGGHFLLDLFAKFRYFIFCKELPVVLKAFVHGDGNLLNNYPELQEALV 185

Query: 396 RIYFHSSSKKYNRMECWGPLKDAM 325
            +YFHS+  ++N++ECWGPL +A+
Sbjct: 186 WVYFHSNVNEFNKVECWGPLWEAV 209

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 547,093,798
Number of Sequences: 1393205
Number of extensions: 12132664
Number of successful extensions: 33248
Number of sequences better than 10.0: 28
Number of HSP's better than 10.0 without gapping: 32369
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 33232
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21530810025
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPD069f01_f BP049533 1 582
2 SPD018f04_f BP045429 2 554
3 MPD066f02_f AV774403 35 472
4 SPD089h04_f BP051151 35 575
5 SPD091e07_f BP051276 35 508




Lotus japonicus
Kazusa DNA Research Institute