KMC004910A_c01

FASTA Sequence
>KMC004910A_C01 KMC004910A_c01
CAAAGCACATTACCGATATAAATAAGCTTGTAACAGGAAATCTAATGACAATAAGTACCA
AAAATACAACTATCCAAGGACCATCAAGGGGACTAGATTATATTCTAAGGTTCTTTCTAA
GAATTTTGTAATGGTATGATCCTTCTAAATACCAATTACGTAATATATTGTTGACAAAAA
AAAGTGCATAAAATATGGGTAAAGAAGTGGAAGCCACCAAGAGGTTAACCAAGCAATAAA
TCAGAAACCAATTCATTAATCAAAGAAGCTAGAATGTCCTGCTCAACCTCTGAACCCTCT
TCAAATGCTTCAATGTCAAAGTCAAGCCATTTCCCACATCCAGTGCTCATATCCTTGCTC
ACAAGCTCATCCACCATCACCTCTTCCATGTTTCTAAAACCAAACATCTCCTTGTACAAT
TCTTCTGCCAACCACCTCTTCCTCTGAACTGATGTCACCCATCTAGGCCATGATTTGCAC
CTTCCCACAAAAGCTTGCGTAAATCTCAACTCTAGGCACTCACTCACACAGTCAAATAAA
ACCTTCCTTTCAAGCTTTGAGTACTCATCTCCATAGTTTTCTGTTCCACTACTGnCCTGA
TTTTCCAATAGATCAAAGAGATTCGGCATTATGACTGTATCAG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004910A_C01 KMC004910A_c01
         (643 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_198043.1| putative protein; protein id: At5g26910.1 [Arab...   106  3e-22
pir||T01769 hypothetical protein A_IG002P16.18 - Arabidopsis tha...   106  3e-22
ref|NP_187226.1| hypothetical protein; protein id: At3g05750.1 [...   101  7e-21
ref|NP_191424.1| putative protein; protein id: At3g58650.1 [Arab...   100  1e-20
dbj|BAC43056.1| unknown protein [Arabidopsis thaliana] gi|290290...   100  2e-20

>ref|NP_198043.1| putative protein; protein id: At5g26910.1 [Arabidopsis thaliana]
          Length = 900

 Score =  106 bits (264), Expect = 3e-22
 Identities = 55/137 (40%), Positives = 91/137 (66%), Gaps = 1/137 (0%)
 Frame = -3

Query: 635  VIMPNLFDLLENQXSSGTENYGDEYSKLERKVLFDCVSECLELRFTQAFVGRCKSW-PRW 459
            V+  +LFD +E +            +K++RK LFD V++CL LR  Q F+G C+    + 
Sbjct: 746  VLPASLFDEMEGRGEVTA-------AKIKRKTLFDFVNKCLALRCEQMFMGSCRGLLGKG 798

Query: 458  VTSVQRKRWLAEELYKEMFGFRNMEEVMVDELVSKDMSTGCGKWLDFDIEAFEEGSEVEQ 279
                +++ WLAEEL +E+ G + M E+M+DELV K+MS+  G+WLDF+ E +EEG ++E 
Sbjct: 799  GFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERETYEEGIDIEG 858

Query: 278  DILASLINELVSDLLLG 228
            +I+++L+++LV+DL+ G
Sbjct: 859  EIVSTLVDDLVNDLVSG 875
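
The query coordinates in the alignment above run from 635 down to 228, so the hit lies on the reverse strand of the 643-letter query. The following is a minimal sketch for cross-checking the reported frame and alignment length from those coordinates. The helper names are hypothetical, and the frame rule (frame -1 begins at the last base of the query) is an assumption about how this BLASTX version numbers minus-strand frames.

```python
# Hypothetical helpers; the numbers are copied from the report above.
QUERY_LENGTH = 643  # from "(643 letters)"


def minus_strand_frame(query_length: int, start: int) -> int:
    """Return the reading frame (-1, -2 or -3) for a minus-strand hit.

    `start` is the 1-based query coordinate where the first aligned codon
    begins, i.e. the higher of the two query coordinates.
    Assumption: frame -1 starts at the last base of the query.
    """
    return -(((query_length - start) % 3) + 1)


def codons_spanned(start: int, end: int) -> int:
    """Count the complete codons between two 1-based query coordinates."""
    span = abs(start - end) + 1
    if span % 3 != 0:
        raise ValueError("coordinates do not span whole codons")
    return span // 3


frame = minus_strand_frame(QUERY_LENGTH, 635)
codons = codons_spanned(635, 228)
print(frame, codons)
```

Under these assumptions the sketch gives frame -3 and 136 query codons. Adding the single gap character in the query line gives 137 alignment columns, which agrees with the denominator in "Identities = 55/137".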

>pir||T01769 hypothetical protein A_IG002P16.18 - Arabidopsis thaliana
            gi|2191167|gb|AAB61053.1| Hypothetical protein F2P16.18
            [Arabidopsis thaliana]
          Length = 912

 Score =  106 bits (264), Expect = 3e-22
 Identities = 55/137 (40%), Positives = 91/137 (66%), Gaps = 1/137 (0%)
 Frame = -3

Query: 635  VIMPNLFDLLENQXSSGTENYGDEYSKLERKVLFDCVSECLELRFTQAFVGRCKSW-PRW 459
            V+  +LFD +E +            +K++RK LFD V++CL LR  Q F+G C+    + 
Sbjct: 758  VLPASLFDEMEGRGEVTA-------AKIKRKTLFDFVNKCLALRCEQMFMGSCRGLLGKG 810

Query: 458  VTSVQRKRWLAEELYKEMFGFRNMEEVMVDELVSKDMSTGCGKWLDFDIEAFEEGSEVEQ 279
                +++ WLAEEL +E+ G + M E+M+DELV K+MS+  G+WLDF+ E +EEG ++E 
Sbjct: 811  GFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERETYEEGIDIEG 870

Query: 278  DILASLINELVSDLLLG 228
            +I+++L+++LV+DL+ G
Sbjct: 871  EIVSTLVDDLVNDLVSG 887

>ref|NP_187226.1| hypothetical protein; protein id: At3g05750.1 [Arabidopsis thaliana]
            gi|6714388|gb|AAF26077.1|AC012393_3 hypothetical protein
            [Arabidopsis thaliana]
          Length = 798

 Score =  101 bits (252), Expect = 7e-21
 Identities = 52/129 (40%), Positives = 83/129 (64%), Gaps = 1/129 (0%)
 Frame = -3

Query: 614  DLLENQXSSGTENYGDEYSKLERKVLFDCVSECLELRFTQAFVGRCKS-WPRWVTSVQRK 438
            D+L       TE   D   K+ERK LFD V++ L L+  Q F+G CK    +    ++R+
Sbjct: 668  DILPLSLFDETEGKRDARGKIERKTLFDLVNQWLTLKCEQMFMGTCKGVLGKQDIFLERR 727

Query: 437  RWLAEELYKEMFGFRNMEEVMVDELVSKDMSTGCGKWLDFDIEAFEEGSEVEQDILASLI 258
              LA+++ KE  G + M E+M+DELV  DMS+  GKWLD+  E +EEG E+E++I++ L+
Sbjct: 728  EILADQVLKEAQGLKKMREMMMDELVDNDMSSCEGKWLDYMRETYEEGIEIEEEIVSELV 787

Query: 257  NELVSDLLL 231
            ++L++DL++
Sbjct: 788  DDLINDLIM 796

>ref|NP_191424.1| putative protein; protein id: At3g58650.1 [Arabidopsis thaliana]
            gi|11292204|pir||T45685 hypothetical protein F14P22.240 -
            Arabidopsis thaliana gi|6735382|emb|CAB68203.1| putative
            protein [Arabidopsis thaliana]
          Length = 820

 Score =  100 bits (250), Expect = 1e-20
 Identities = 51/137 (37%), Positives = 87/137 (63%), Gaps = 1/137 (0%)
 Frame = -3

Query: 641  DTVIMPNLFDLLENQXSSGTENYGDEYSKLERKVLFDCVSECLELRFTQAFVGRCKSWPR 462
            ++++  +LFD +E    + T        K ERK LFDCV++CL ++F +  +G CK    
Sbjct: 680  ESLLPSSLFDEMERSRGAATS------MKTERKALFDCVNQCLAVKFERMLIGSCKGMMM 733

Query: 461  -WVTSVQRKRWLAEELYKEMFGFRNMEEVMVDELVSKDMSTGCGKWLDFDIEAFEEGSEV 285
                 ++ +  LAEE+ +E+ G + M E+M+DELV  DMS   G+W+ ++ E FEEG ++
Sbjct: 734  SGGILLEHRDLLAEEVNREVKGLKKMREMMIDELVDHDMSCFEGRWIGYEREMFEEGIDM 793

Query: 284  EQDILASLINELVSDLL 234
            E +I+++L+++LVSD+L
Sbjct: 794  EGEIVSALVDDLVSDIL 810

>dbj|BAC43056.1| unknown protein [Arabidopsis thaliana] gi|29029000|gb|AAO64879.1|
           At3g58650 [Arabidopsis thaliana]
          Length = 660

 Score =  100 bits (249), Expect = 2e-20
 Identities = 50/137 (36%), Positives = 87/137 (63%), Gaps = 1/137 (0%)
 Frame = -3

Query: 641 DTVIMPNLFDLLENQXSSGTENYGDEYSKLERKVLFDCVSECLELRFTQAFVGRCKSWPR 462
           ++++  +LFD +E    + T        K ERK LFDCV++CL ++F +  +G CK    
Sbjct: 520 ESLLPSSLFDEMERSRGAATS------MKTERKALFDCVNQCLAVKFERMLIGSCKGMMM 573

Query: 461 -WVTSVQRKRWLAEELYKEMFGFRNMEEVMVDELVSKDMSTGCGKWLDFDIEAFEEGSEV 285
                ++ +  LAEE+ +E+ G + M E+M+DELV  DMS   G+W+ ++ E FEEG ++
Sbjct: 574 SGGILLEHRDLLAEEVNREVKGLKKMREMMIDELVDHDMSCFEGRWIGYEREMFEEGIDM 633

Query: 284 EQDILASLINELVSDLL 234
           E +I+++L+++L+SD+L
Sbjct: 634 EGEIVSALVDDLISDIL 650

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 539,431,968
Number of Sequences: 1393205
Number of extensions: 11576976
Number of successful extensions: 33272
Number of sequences better than 10.0: 65
Number of HSP's better than 10.0 without gapping: 31355
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 33086
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27007650415
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
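
The gapped Lambda and K values and the effective search space listed above tie each raw score to its bit score and Expect value through the usual Karlin-Altschul relations, S' = (Lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'). The following is a minimal sketch of that arithmetic. The function names are hypothetical, and because the report prints Lambda and K rounded to three significant figures, the results are approximate.

```python
import math

# Values copied from the report: the "Gapped" Lambda and K, and the
# "effective search space used" line. The printed values are rounded,
# so the results below are approximations.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 27007650415


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score: int) -> float:
    """Expected number of chance hits scoring at least `raw_score`."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


# Raw scores of the first and third hits in the report.
for raw in (264, 252):
    print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.0e}")
```

For the raw score 264 this gives roughly 106.3 bits and an Expect value near 3e-22, in line with the "106 bits (264), Expect = 3e-22" reported for the top hit.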


EST assembly


No.  Clone         Accession  Start  End
  1  SPDL029g07_f  BP053829       1  560
  2  MRL021g09_f   BP084813       6  363
  3  GENLf033e04   BP064076       6  514
  4  SPDL064b12_f  BP055965      29  593
  5  MFBL035e05_f  BP043018      30  481
  6  MFBL018b05_f  BP042147      30  257
  7  SPDL075c05_f  BP056633      39  487
  8  SPDL075b12_f  BP056629      39  535
  9  MRL026d11_f   BP085052      40  495
 10  SPDL012e07_f  BP052732      51  580
 11  SPDL094h03_f  BP057931      86  605
 12  MPDL056g10_f  AV779357     111  665
 13  MPDL050f05_f  AV779050     140  597




Lotus japonicus
Kazusa DNA Research Institute