KMC001676A_c01

Fasta Sequence
>KMC001676A_C01 KMC001676A_c01
aaaaaagtATGAAATCACGGCTAAGAATATAATTAAAATTAGATGTTAACCACTTAAAAT
AACTTTTAAGTTTCATCCTTACATGTGCAGCTTATAAGAAATTAAATAATGCCAAAGCAA
ACATATCATTACATTATGAAGTTGAAGCAAATCCATAACAGAGATTAAAAGTTATTGACT
GTTTCCTAAGAAAACACACTTGTGAGTGGCTGCACTTGGGACTTCACACTCAATTTAGCA
CTGGATCTGAACATGTTTCACCCTGAAGCAGCAGGGAAGAGCTAAATGTTAGAATTGAAG
TGCTCAGATCAAACTCCAAAATCTTATCTTCCAGTTGACGCCCACCAAGAACAACTGCAT
TTTTGGCCTTCTTACCTCCATCCACAAACGCCAAACAGAGCACCTTCTCTTTCACCTCAA
CCATGGTATTGTGACCAAACATTTCATATTTCAATCCGCCATCAAACAACAATTCGATAA
CCGGCACAGCTGGTCCCATATCAAGATCATCAATGGAACCAGCATCAAAGCATGCTTCAA
ATGGTGCCACTGATTTCACTCTCTTAATCTTTCTGGCTGAAGCCTTCTTCACAAAATCTC
TAACAAAGGGCTTGTAGATGGAG
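
The record above is a standard FASTA entry, and the BLASTX report further down gives its length as 623 letters. A minimal sketch of how such an entry could be parsed and its length checked follows; the function name `read_fasta` and the small demo record are hypothetical and are not part of this database.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a {header: sequence} dict."""
    records, header = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]          # drop the leading '>'
            records[header] = []
        elif header is not None:
            records[header].append(line)
    # Join the wrapped sequence lines of each record into one string.
    return {h: "".join(parts) for h, parts in records.items()}


# Hypothetical demo record (not the real contig), wrapped over two lines.
example = """>demo_contig hypothetical record
aaaaaagtATGAAATC
ACGG
"""
seqs = read_fasta(example)
lengths = {h: len(s) for h, s in seqs.items()}
print(lengths)
```

Applied to the full record above, the reported length should agree with the `(623 letters)` line of the BLASTX query header.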


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001676A_C01 KMC001676A_c01
         (623 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAC10209.1| putative extracellular dermal glycoprotein [Cice...   138  5e-32
ref|NP_171821.1| unknown protein; protein id: At1g03220.1, suppo...   117  9e-26
gb|AAM61574.1| EDGP precursor [Arabidopsis thaliana]                  116  2e-25
ref|NP_197412.1| dermal glycoprotein - like; protein id: At5g191...   115  4e-25
ref|NP_563679.1| expressed protein; protein id: At1g03230.1, sup...   112  3e-24

>emb|CAC10209.1| putative extracellular dermal glycoprotein [Cicer arietinum]
          Length = 369

 Score =  138 bits (348), Expect = 5e-32
 Identities = 68/130 (52%), Positives = 100/130 (76%), Gaps = 5/130 (3%)
 Frame = -2

Query: 622 SIYKPFVRDFVKKASARKIKRVKSVAPFEACFDAGSIDDLDMGPAVPVIELLFDGGLKYE 443
           S+YKPF+RDF+KKAS RK+K+V+SVAPFEACF++ +I++     ++P I+L+  GG+++ 
Sbjct: 245 SVYKPFIRDFLKKASDRKLKKVESVAPFEACFESTNIEN-----SLPRIDLVLQGGVQWS 299

Query: 442 MFGHNTMVEVKEKVLCLAFVDGGKK-----AKNAVVLGGRQLEDKILEFDLSTSILTFSS 278
           ++G+N MV VK+ V CL FVDGG +     AK ++V+GG QLED +L FDL++S L+FSS
Sbjct: 300 IYGNNLMVNVKKNVACLGFVDGGTEPRMSFAKASIVIGGHQLEDNLLVFDLNSSKLSFSS 359

Query: 277 SLLLQGETCS 248
           SLL+   +CS
Sbjct: 360 SLLVHNASCS 369
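
All alignments in this report are in Frame = -2, meaning the query was reverse-complemented and translated starting one base in from its 3' end (hence query coordinates that run downward from 622). The sketch below illustrates that translation with the standard genetic code; the helper names are hypothetical, and it is not BLAST's own implementation.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq, frame):
    """Translate seq in frame +1..+3 or -1..-3 (negative = reverse strand)."""
    seq = seq.upper()
    if frame < 0:
        seq = reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    # Codons with ambiguous bases are reported as 'X'.
    return "".join(CODON_TABLE.get(c, "X") for c in codons)


# Last 23 bases of the contig shown in the Fasta Sequence section.
tail = "TAACAAAGGGCTTGTAGATGGAG"
peptide = translate_frame(tail, -2)
print(peptide)  # SIYKPFV
```

The result matches the first seven residues of the query line at position 622 in the alignment above.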

>ref|NP_171821.1| unknown protein; protein id: At1g03220.1, supported by cDNA:
           12454., supported by cDNA: gi_13272442, supported by
           cDNA: gi_14334705, supported by cDNA: gi_16323419
           [Arabidopsis thaliana] gi|25296172|pir||F86163
           hypothetical protein F15K9.17 - Arabidopsis thaliana
           gi|3850579|gb|AAC72119.1| Strong similarity to gb|D14550
           extracellular dermal glycoprotein (EDGP) precursor from
           Daucus carota.  ESTs gb|H37281, gb|T44167, gb|T21813,
           gb|N38437, gb|Z26470, gb|R65072, gb|N76373, gb|F15470,
           gb|Z35182, gb|H76373, gb|Z34678 and gb|Z35387 come from
           this gene. [Arabidopsis thaliana]
           gi|13272443|gb|AAK17160.1|AF325092_1 unknown protein
           [Arabidopsis thaliana] gi|14334706|gb|AAK59531.1|
           unknown protein [Arabidopsis thaliana]
           gi|16323420|gb|AAL15204.1| unknown protein [Arabidopsis
           thaliana]
          Length = 433

 Score =  117 bits (294), Expect = 9e-26
 Identities = 62/127 (48%), Positives = 86/127 (66%), Gaps = 1/127 (0%)
 Frame = -2

Query: 622 SIYKPFVRDFVKKASARKIKRVKSVAPFEACFDAGSIDDLDMGPAVPVIEL-LFDGGLKY 446
           SIY  F  +FVK+A+AR IKRV SV PF ACF   ++    +G AVP IEL L    + +
Sbjct: 300 SIYNAFTSEFVKQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIELVLHSKDVVW 359

Query: 445 EMFGHNTMVEVKEKVLCLAFVDGGKKAKNAVVLGGRQLEDKILEFDLSTSILTFSSSLLL 266
            +FG N+MV V + V+CL FVDGG  A+ +VV+GG QLED ++EFDL+++   FSS+LL 
Sbjct: 360 RIFGANSMVSVSDDVICLGFVDGGVNARTSVVIGGFQLEDNLIEFDLASNKFGFSSTLLG 419

Query: 265 QGETCSD 245
           +   C++
Sbjct: 420 RQTNCAN 426

>gb|AAM61574.1| EDGP precursor [Arabidopsis thaliana]
          Length = 433

 Score =  116 bits (291), Expect = 2e-25
 Identities = 62/127 (48%), Positives = 85/127 (66%), Gaps = 1/127 (0%)
 Frame = -2

Query: 622 SIYKPFVRDFVKKASARKIKRVKSVAPFEACFDAGSIDDLDMGPAVPVIEL-LFDGGLKY 446
           SIY  F  +FVK+A AR IKRV SV PF ACF   ++    +G AVP IEL L    + +
Sbjct: 300 SIYNAFTSEFVKQALARSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIELVLHSKDVVW 359

Query: 445 EMFGHNTMVEVKEKVLCLAFVDGGKKAKNAVVLGGRQLEDKILEFDLSTSILTFSSSLLL 266
            +FG N+MV V + V+CL FVDGG  A+ +VV+GG QLED ++EFDL+++   FSS+LL 
Sbjct: 360 RIFGANSMVSVSDDVICLGFVDGGVNARTSVVIGGFQLEDNLIEFDLASNRFGFSSTLLG 419

Query: 265 QGETCSD 245
           +   C++
Sbjct: 420 RQTNCAN 426

>ref|NP_197412.1| dermal glycoprotein - like; protein id: At5g19110.1 [Arabidopsis
           thaliana]
          Length = 405

 Score =  115 bits (289), Expect = 4e-25
 Identities = 62/128 (48%), Positives = 84/128 (65%), Gaps = 4/128 (3%)
 Frame = -2

Query: 619 IYKPFVRDFVKKASARKIKRVKSVAPFEACFDAGSID-DLDMGPAVPVIELLFDGGL--- 452
           IY    + F  KA A  I +V SVAPF+ CFD+ +   +L  GP VPVIE+   G +   
Sbjct: 272 IYNALAQSFTLKAKAMGIAKVPSVAPFKHCFDSRTAGKNLTAGPNVPVIEIGLPGRIGEV 331

Query: 451 KYEMFGHNTMVEVKEKVLCLAFVDGGKKAKNAVVLGGRQLEDKILEFDLSTSILTFSSSL 272
           K+  +G NT+V+VKE V+CLAF+DGGK  K+ +V+G  QL+D +LEFD S ++L FS SL
Sbjct: 332 KWGFYGANTVVKVKETVMCLAFIDGGKTPKDLMVIGTHQLQDHMLEFDFSGTVLAFSESL 391

Query: 271 LLQGETCS 248
           LL   +CS
Sbjct: 392 LLHNTSCS 399

>ref|NP_563679.1| expressed protein; protein id: At1g03230.1, supported by cDNA:
           gi_12083229 [Arabidopsis thaliana]
           gi|25296173|pir||G86163 hypothetical protein F15K9.16 -
           Arabidopsis thaliana gi|3850580|gb|AAC72120.1| Strong
           similarity to gb|D14550 extracellular dermal
           glycoprotein (EDGP) precursor from Daucus carota.  ESTs
           gb|84105 and gb|AI100071 come from this gene.
           [Arabidopsis thaliana]
           gi|12083230|gb|AAG48774.1|AF332411_1 unknown protein
           [Arabidopsis thaliana]
          Length = 434

 Score =  112 bits (281), Expect = 3e-24
 Identities = 59/127 (46%), Positives = 85/127 (66%), Gaps = 1/127 (0%)
 Frame = -2

Query: 622 SIYKPFVRDFVKKASARKIKRVKSVAPFEACFDAGSIDDLDMGPAVPVIEL-LFDGGLKY 446
           SIYK F  +F+++A+AR IKRV SV PF ACF   ++    +G AVP I+L L    + +
Sbjct: 301 SIYKAFTSEFIRQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPEIQLVLHSKDVVW 360

Query: 445 EMFGHNTMVEVKEKVLCLAFVDGGKKAKNAVVLGGRQLEDKILEFDLSTSILTFSSSLLL 266
            +FG N+MV V + V+CL FVDGG     +VV+GG QLED ++EFDL+++   FSS+LL 
Sbjct: 361 RIFGANSMVSVSDDVICLGFVDGGVNPGASVVIGGFQLEDNLIEFDLASNKFGFSSTLLG 420

Query: 265 QGETCSD 245
           +   C++
Sbjct: 421 RQTNCAN 427

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 515,420,033
Number of Sequences: 1393205
Number of extensions: 11100044
Number of successful extensions: 25921
Number of sequences better than 10.0: 66
Number of HSP's better than 10.0 without gapping: 24848
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 25848
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25301904073
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
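
The gapped Lambda and K values above relate each raw score in parentheses to its bit score and Expect value through the usual Karlin-Altschul formulas, S' = (Lambda*S - ln K) / ln 2 and E = (search space) * K * exp(-Lambda*S). The sketch below applies them to the raw scores in this report. Because Lambda and K are printed rounded, the results agree only approximately, and the observation that truncating the bit score reproduces the printed values is an assumption drawn from these five hits, not a documented rule.

```python
import math


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(raw, lam, k, search_space):
    """Expected number of chance hits scoring at least `raw`."""
    return search_space * k * math.exp(-lam * raw)


# Values copied from the "Gapped" statistics and footer of this report.
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410
SEARCH_SPACE = 25301904073  # "effective search space used"

raw_scores = [348, 294, 291, 289, 281]
bits = [bit_score(s, LAMBDA_GAPPED, K_GAPPED) for s in raw_scores]
truncated = [int(b) for b in bits]
top_e = expect(348, LAMBDA_GAPPED, K_GAPPED, SEARCH_SPACE)
print(truncated, f"{top_e:.1e}")
```

For the top hit this gives roughly 138.7 bits and an Expect value on the order of 5e-32, in line with the reported `138 bits (348), Expect = 5e-32`.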


EST assemble image


no.  clone         accession  start  end
1    MR032b12_f    BP078445   1      426
2    MWM061a07_f   AV765656   9      541
3    MR047h02_f    BP079672   31     484
4    MR028f10_f    BP078179   31     406
5    MR025f08_f    BP077930   31     391
6    MFB012g11_f   BP034819   31     388
7    MWL073f09_f   AV769899   31     628
8    GENf092g01    BP062210   31     403
9    MR070c08_f    BP081374   32     427
10   MR072b01_f    BP081511   34     541
11   MR021g03_f    BP077618   36     412
12   MR069b09_f    BP081280   46     525
13   GENf022c08    BP059288   48     403
14   GENf009c10    BP058715   54     496
15   MR036c04_f    BP078772   62     329
16   MWM042f06_f   AV765337   75     319
17   GNf045e11     BP070694   104    508
18   MR008b03_f    BP076533   108    511
19   MR012d01_f    BP076860   108    503
20   MR091g07_f    BP083029   108    477
21   MR091h07_f    BP083041   112    256
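
The table lists where each EST clone sits in the assembly. A small sketch of how per-position read depth could be derived from such spans follows; it assumes the two position values are 1-based, inclusive start and end coordinates (the page does not state this), and the function name `coverage_depth` is hypothetical.

```python
def coverage_depth(spans, length=None):
    """Return a list where item i is the number of spans covering position i+1.

    spans: iterable of (start, end) pairs, assumed 1-based and inclusive,
    with every end no greater than `length` when `length` is given.
    """
    spans = list(spans)
    if length is None:
        length = max(end for _, end in spans)
    # Difference array: +1 where a span starts, -1 just after it ends.
    diff = [0] * (length + 2)
    for start, end in spans:
        diff[start] += 1
        diff[end + 1] -= 1
    depth, running = [], 0
    for pos in range(1, length + 1):
        running += diff[pos]
        depth.append(running)
    return depth


# First three rows of the table above.
spans = [(1, 426), (9, 541), (31, 484)]
depth = coverage_depth(spans)
print(depth[0], depth[30], depth[484])  # depth at positions 1, 31 and 485
```

Running it over all 21 rows would show how many clones support each region of the contig.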




Lotus japonicus
Kazusa DNA Research Institute