KMC003820A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC003820A_C01 KMC003820A_c01
gaaagagggacaTACTGTCTTAAAGTGGAGAACTATCAATACCCAGCATCGTACCCAGAT
TCCGGCGACTCGTCGCCGCGGTCGAGGGAGATCGACTTCGAGAATCCGACGCCATGGGAG
GACCAGCAGAGTCCACACAACTACAAAGCCAAGTTCCTCTGCAGCTACGGCGGGAAGATC
CAGCCACGCACCCACGACAACCAGCTCTCCTACGTCGGCGGCGGACAGCCAAGATCCTCG
CCGTCGACCGGAACACCAAGTGTCCCAACGTGCCTCTCCAAGCTCGCCGCCCTCTGCGAC
GCTGCACCGCAAGAGCTCACCTTCAAGTACCAGCTCCCCGGCGAGGATCTCGACGCCCTC
ATCTCCGTCACCAACGACGACGACCTCGAGCACTTGATGCATGAGTACGATCGCCTCTAT
CGGCCTGCTTCGAAACCCGTCAGGATGAGGCTCTTCCTCTTCTCTGCACCGAATCCGGGT
CCTCTATCTCAACAACCCGACCCGCTTAAGCCACAACCCAACGTTGACTTCCTCTTTGGC
CTCGAGAA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003820A_C01 KMC003820A_c01
         (548 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_196524.1| putative protein; protein id: At5g09620.1 [Arab...   195  3e-49
ref|NP_201248.1| putative protein; protein id: At5g64430.1, supp...   190  1e-47
ref|NP_565256.1| expressed protein; protein id: At2g01190.1, sup...   110  9e-24
gb|AAM62991.1| unknown [Arabidopsis thaliana]                         108  4e-23
ref|NP_567290.1| putative protein; protein id: At4g05150.1, supp...    92  5e-18

>ref|NP_196524.1| putative protein; protein id: At5g09620.1 [Arabidopsis thaliana]
           gi|11357503|pir||T49936 hypothetical protein F17I14.190
           - Arabidopsis thaliana gi|7671427|emb|CAB89368.1|
           putative protein [Arabidopsis thaliana]
           gi|9758990|dbj|BAB09517.1|
           gb|AAD14519.1~gene_id:MTH16.3~similar to unknown protein
           [Arabidopsis thaliana]
          Length = 531

 Score =  195 bits (495), Expect = 3e-49
 Identities = 106/189 (56%), Positives = 126/189 (66%), Gaps = 23/189 (12%)
 Frame = +1

Query: 49  SYPDSGDSSPRSREIDFENPTPWEDQQSPHNYKAKFLCSYGGKIQPRTHDNQLSYVGGGQ 228
           SYPDS +SSPRSR+++FENP+PWEDQQ   NYK K +CSYGGKIQPR HDNQL+YV G  
Sbjct: 8   SYPDSAESSPRSRDVEFENPSPWEDQQQ-QNYKVKLMCSYGGKIQPRPHDNQLTYVNGDT 66

Query: 229 PRSSPSTGTPSVPTCLSKLAALCDAAPQ--ELTFKYQLPGEDLDALISVTNDDDLEHLMH 402
              S   G    P  +SKL+A+C       E++FKYQLPGEDLDALISVTND+DLEH+MH
Sbjct: 67  KIMSVDRGI-RFPALVSKLSAVCSGGGDGGEISFKYQLPGEDLDALISVTNDEDLEHMMH 125

Query: 403 EYDRLYRPASKPVRMRLFLF-SAPNPG----------------PLSQQPDPLK----PQP 519
           EYDRL R ++KP RMRLFLF S+P  G                P+  +P+  K    P  
Sbjct: 126 EYDRLLRLSTKPARMRLFLFPSSPISGGFGSEGSTKSDRDTLNPIPSRPESEKSVTAPPN 185

Query: 520 NVDFLFGLE 546
           N DFLFG E
Sbjct: 186 NADFLFGSE 194

>ref|NP_201248.1| putative protein; protein id: At5g64430.1, supported by cDNA:
           gi_17064915, supported by cDNA: gi_20259929, supported
           by cDNA: gi_20260125 [Arabidopsis thaliana]
           gi|10178224|dbj|BAB11604.1| contains similarity to
           unknown protein~gb|AAD14519.1~gene_id:T12B11.2
           [Arabidopsis thaliana] gi|17064916|gb|AAL32612.1|
           Unknown protein [Arabidopsis thaliana]
           gi|20259930|gb|AAM13312.1| unknown protein [Arabidopsis
           thaliana] gi|20260126|gb|AAM12961.1| unknown protein
           [Arabidopsis thaliana]
          Length = 513

 Score =  190 bits (482), Expect = 1e-47
 Identities = 109/203 (53%), Positives = 124/203 (60%), Gaps = 29/203 (14%)
 Frame = +1

Query: 25  VENYQYPASYPDSGDSSPRSREIDFENPTP-WEDQ---QSPHNYKAKFLCSYGGKIQPRT 192
           +E + Y  SYPDS DSSPRSREI+F+NP P W+DQ   Q  H+YK KF+CSYGGKIQPR 
Sbjct: 1   MEKFSYN-SYPDSTDSSPRSREIEFDNPPPPWDDQNQNQQQHSYKVKFMCSYGGKIQPRP 59

Query: 193 HDNQLSYVGGGQPRSSPSTGTPSVPTCLSKLAALC---DAAPQELTFKYQLPGEDLDALI 363
           HDNQL+YV G     S   G    P   SKL+ +C   D    E+TFKYQLPGEDLDALI
Sbjct: 60  HDNQLTYVNGETKILSVDRGI-RFPVLASKLSTVCGGGDGGGGEVTFKYQLPGEDLDALI 118

Query: 364 SVTNDDDLEHLMHEYDRLYRPASKPVRMRLFLFSAPN----------------------P 477
           SVTNDDDLEH+MHEYDRL R +SKP RMRLFLF A +                      P
Sbjct: 119 SVTNDDDLEHMMHEYDRLLRLSSKPARMRLFLFPASSGFGSQSSTQSDRDRFVEALNTVP 178

Query: 478 GPLSQQPDPLKPQPNVDFLFGLE 546
                +     P  N DFLFG E
Sbjct: 179 RLSESEKSVTAPPNNADFLFGSE 201

>ref|NP_565256.1| expressed protein; protein id: At2g01190.1, supported by cDNA:
           17996., supported by cDNA: gi_17979118 [Arabidopsis
           thaliana] gi|17979119|gb|AAL49817.1| unknown protein
           [Arabidopsis thaliana] gi|20197587|gb|AAD14519.2|
           expressed protein [Arabidopsis thaliana]
           gi|21436179|gb|AAM51377.1| unknown protein [Arabidopsis
           thaliana]
          Length = 720

 Score =  110 bits (276), Expect = 9e-24
 Identities = 67/155 (43%), Positives = 93/155 (59%), Gaps = 14/155 (9%)
 Frame = +1

Query: 46  ASYPDSGDSSPRSREIDFENPTPWEDQQSPH-----------NYKAKFLCSYGGKIQPRT 192
           +SYP+S DSSPRSR  D      W+D  +P            + K +F+CSYGG I PR 
Sbjct: 35  SSYPESLDSSPRSRTTD-----GWDDLPAPSGGGGGGGGSAVSSKLRFMCSYGGHILPRP 89

Query: 193 HDNQLSYVGGGQPRSSPSTGTPSVPTCLSKLA-ALCDAAPQELTFKYQLPGEDLDALISV 369
           HD  L Y+GG   R        S+P+ +++L+  L D   +  T KYQLP EDLD+LISV
Sbjct: 90  HDKSLCYMGG-DTRIVVVDRNSSLPSLIARLSNTLLDG--RSFTLKYQLPSEDLDSLISV 146

Query: 370 TNDDDLEHLMHEYDRLYRP--ASKPVRMRLFLFSA 468
           T D+DL++++ EYDR      ++KP R+RLFLF++
Sbjct: 147 TTDEDLDNMIEEYDRTISASNSTKPSRLRLFLFTS 181

>gb|AAM62991.1| unknown [Arabidopsis thaliana]
          Length = 720

 Score =  108 bits (270), Expect = 4e-23
 Identities = 66/155 (42%), Positives = 92/155 (58%), Gaps = 14/155 (9%)
 Frame = +1

Query: 46  ASYPDSGDSSPRSREIDFENPTPWEDQQSPH-----------NYKAKFLCSYGGKIQPRT 192
           +SYP+S DSSPRSR  D      W+D  +P            + K + +CSYGG I PR 
Sbjct: 35  SSYPESLDSSPRSRTTD-----GWDDLPAPSGGGGGGGGSAVSSKLRLMCSYGGHILPRP 89

Query: 193 HDNQLSYVGGGQPRSSPSTGTPSVPTCLSKLA-ALCDAAPQELTFKYQLPGEDLDALISV 369
           HD  L Y+GG   R        S+P+ +++L+  L D   +  T KYQLP EDLD+LISV
Sbjct: 90  HDKSLCYMGG-DTRIVVVDRNSSLPSLIARLSNTLLDG--RSFTLKYQLPSEDLDSLISV 146

Query: 370 TNDDDLEHLMHEYDRLYRP--ASKPVRMRLFLFSA 468
           T D+DL++++ EYDR      ++KP R+RLFLF++
Sbjct: 147 TTDEDLDNMIEEYDRTISASNSTKPSRLRLFLFTS 181

>ref|NP_567290.1| putative protein; protein id: At4g05150.1, supported by cDNA:
           gi_15809939 [Arabidopsis thaliana]
           gi|15809940|gb|AAL06897.1| AT4g05150/C17L7_70
           [Arabidopsis thaliana]
          Length = 477

 Score = 91.7 bits (226), Expect = 5e-18
 Identities = 59/137 (43%), Positives = 82/137 (59%), Gaps = 1/137 (0%)
 Frame = +1

Query: 58  DSGDSSPRSREIDFENPTPWEDQQSPHNYKAKFLCSYGGKIQPRTHDNQLSYVGGGQPRS 237
           DS  SSPRS           E    P   + +F+C++GG+I PR  DNQL YVGG     
Sbjct: 43  DSLASSPRS-----------EYDSQP---RVRFMCTFGGRILPRPPDNQLCYVGGDNRMV 88

Query: 238 SPSTGTPSVPTCLSKLAALCDAAPQELTFKYQLPGEDLDALISVTNDDDLEHLMHEYDRL 417
           +    T +  + LSKLA L  +    ++ KYQLP EDLDALISV+ D+D+E++M EYDR+
Sbjct: 89  AVHRHT-TFASLLSKLAKL--SGKSNISVKYQLPNEDLDALISVSTDEDVENMMDEYDRV 145

Query: 418 YRPAS-KPVRMRLFLFS 465
            +  + +  R+RLFLF+
Sbjct: 146 AQNQNPRASRLRLFLFT 162

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.314    0.136    0.419 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 553,951,036
Number of Sequences: 1393205
Number of extensions: 14297277
Number of successful extensions: 111119
Number of sequences better than 10.0: 4231
Number of HSP's better than 10.0 without gapping: 68258
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 97699
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 18947112822
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)


EST assemble image


clone accession position
1 MWM098f12_f AV766326 1 187
2 MFB079h08_f BP039811 10 548
3 MFB087f08_f BP040373 46 122
4 GNf076g09 BP073012 62 428




Lotus japonicus
Kazusa DNA Research Institute