KMC001209A_c01

Fasta Sequence
>KMC001209A_C01 KMC001209A_c01
gGCGAATGGGTACGGGCCCCCCCGGAAACTACCGACTCTTTCTCCCTCTGATTTCTCATC
CAAACAACAACAACAACACCCTTCTCCGATTAGGGTTAGGGTTTCCATCGCTTCCCTTCG
TCGTTCGTCGCTTCCGTTCTCAGAGCTATGGAGTCTTCACTCGTGATTAAGGTGAAATAT
GGTGATACACTTCGGCGGTTTAATGCGCCTGTGGCTGAGAATAATAGGTTGGATCTTAAC
ATGGCTGGGCTGAGGGCCAAGATATGTTCTATTTTCGATTTCGCTGCTGATGCGAAGCTG
ATTTTGACATACGTTGATGAAGATGGTGACTCGGTGACTCTTGTGGATGATGAGGACTTG
CTTGATGTGATGAGGCAGAAATTGAAATTCTTGAAAATTGATGTGAAAATGGTTAATGAC
AATGTTGGTAATTCAAATTTTGGGTCTAGTGGAAGTTCCACCCCTTTAAGATCTCCTCCT
GTCTCAAACCCATTTGCTGGTGGAGATTTTGGTAAAGCTGATGTTTTTGATGCTTTGCCA
GAGCCATTACGTGAGGCTCTTTGCTCATCACTTTCAAAAGCTGCCTCTTCAAGTCCAGTG
CTGGCTACTCTTGCTGAT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001209A_C01 KMC001209A_c01
         (618 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_194200.1| putative protein; protein id: At4g24690.1, supp...   133  2e-30
ref|NP_493462.1| Human TFG related TFG-1 (49.7 kD) (tfg-1) [Caen...    44  0.002
ref|NP_594221.1| mating and morphogenesis protein Scd1p. [Schizo...    42  0.006
gb|AAO47735.1| MUC6 [Mus musculus]                                     39  0.070
emb|CAD54411.1| secreted gel-forming mucin [Mus musculus]              39  0.070

>ref|NP_194200.1| putative protein; protein id: At4g24690.1, supported by cDNA:
           gi_17065501 [Arabidopsis thaliana]
           gi|7452446|pir||T05565 hypothetical protein F22K18.110 -
           Arabidopsis thaliana gi|4220521|emb|CAA22994.1| putative
           protein [Arabidopsis thaliana]
           gi|7269320|emb|CAB79379.1| putative protein [Arabidopsis
           thaliana] gi|17065502|gb|AAL32905.1| putative protein
           [Arabidopsis thaliana] gi|22136162|gb|AAM91159.1|
           putative protein [Arabidopsis thaliana]
           gi|22655264|gb|AAM98222.1| unknown protein [Arabidopsis
           thaliana]
          Length = 704

 Score =  133 bits (334), Expect = 2e-30
 Identities = 79/166 (47%), Positives = 104/166 (62%), Gaps = 6/166 (3%)
 Frame = +1

Query: 139 LRAMESSLVIKVKYGDTLRRFNAPVAENNRLDLNMAGLRAKICSIFDFAADAKLILTYVD 318
           + +  ++LV+KV YG  LRRF  PV  N +LDL MAGL+ KI ++F+ +ADA+L LTY D
Sbjct: 1   MESTANALVVKVSYGGVLRRFRVPVKANGQLDLEMAGLKEKIAALFNLSADAELSLTYSD 60

Query: 319 EDGDSVTLVDDEDLLDVMRQKLKFLKIDVKMVNDNVGNSNFG--SSGSSTPLRSPPVSNP 492
           EDGD V LVDD DL DV  Q+LKFLKI+   VN  V  ++    SSGSSTP   P   NP
Sbjct: 61  EDGDVVALVDDNDLFDVTNQRLKFLKIN---VNAGVSTNSAAPESSGSSTPAGMP---NP 114

Query: 493 FAGGDFGKADVFDALPEPLREAL----CSSLSKAASSSPVLATLAD 618
            +    G  DV  A+P P+R+ +        SKA++SSPV+  + D
Sbjct: 115 VSKIQKGINDVLMAVPNPMRDTISKVYMDLASKASTSSPVVGEMLD 160
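
The Frame = +1 query coordinates can be checked directly against the FASTA record. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `contig_slice` and `translate` are illustrative helpers, not part of BLAST. It translates contig positions 139 to 318 and should reproduce the first query line of the alignment above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the contig is nuclear plant sequence, so table 1 applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Contig positions 121-360, copied from the FASTA record (1-based coordinates).
REGION_START = 121
REGION = (
    "TCGTTCGTCGCTTCCGTTCTCAGAGCTATGGAGTCTTCACTCGTGATTAAGGTGAAATAT"
    "GGTGATACACTTCGGCGGTTTAATGCGCCTGTGGCTGAGAATAATAGGTTGGATCTTAAC"
    "ATGGCTGGGCTGAGGGCCAAGATATGTTCTATTTTCGATTTCGCTGCTGATGCGAAGCTG"
    "ATTTTGACATACGTTGATGAAGATGGTGACTCGGTGACTCTTGTGGATGATGAGGACTTG"
)


def contig_slice(start, end):
    """Return contig bases start..end (1-based, inclusive) taken from REGION."""
    return REGION[start - REGION_START : end - REGION_START + 1]


def translate(dna):
    """Translate a DNA string codon by codon; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i : i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


# Position 139 lies in forward frame 1 because (139 - 1) is divisible by 3.
frame = (139 - 1) % 3 + 1
protein = translate(contig_slice(139, 318))
print(frame, protein)
```

The same two helpers can be pointed at any other forward-frame coordinate pair in the report, provided the embedded region is extended to cover it.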

>ref|NP_493462.1| Human TFG related TFG-1 (49.7 kD) (tfg-1) [Caenorhabditis elegans]
           gi|6580329|emb|CAB63398.1| C. elegans TFG-1 protein
           (corresponding sequence Y63D3A.5) [Caenorhabditis
           elegans]
          Length = 486

 Score = 43.9 bits (102), Expect = 0.002
 Identities = 23/70 (32%), Positives = 38/70 (53%)
 Frame = +1

Query: 154 SSLVIKVKYGDTLRRFNAPVAENNRLDLNMAGLRAKICSIFDFAADAKLILTYVDEDGDS 333
           +S ++K ++ D +R+ +   A     DL +  L   +  +    +DA  +L Y DE+GD 
Sbjct: 9   TSTILKARHADVVRKTSLHHAN----DLTLIDLVLNVQRLLALPSDANFVLKYKDEEGDL 64

Query: 334 VTLVDDEDLL 363
           VTL +D DLL
Sbjct: 65  VTLAEDSDLL 74

>ref|NP_594221.1| mating and morphogenesis protein Scd1p. [Schizosaccharomyces pombe]
           gi|6094245|sp|P40995|SCD1_SCHPO Protein scd1
           gi|7492166|pir||T37789 Scd1 protein - fission yeast
           (Schizosaccharomyces pombe) gi|2330697|emb|CAB11037.1|
           mating and morphogenesis protein Scd1p.
           [Schizosaccharomyces pombe] gi|5296001|gb|AAA50556.2|
           Scd1 protein [Schizosaccharomyces pombe]
          Length = 872

 Score = 42.0 bits (97), Expect = 0.006
 Identities = 27/75 (36%), Positives = 36/75 (48%)
 Frame = +1

Query: 139 LRAMESSLVIKVKYGDTLRRFNAPVAENNRLDLNMAGLRAKICSIFDFAADAKLILTYVD 318
           LR  E SLV+ V +  T     A V             + K+C I   A   ++ L YVD
Sbjct: 779 LRLHEVSLVLVVAHDITFDELLAKVEH-----------KIKLCGILKQAVPFRVRLKYVD 827

Query: 319 EDGDSVTLVDDEDLL 363
           EDGD +T+  DED+L
Sbjct: 828 EDGDFITITSDEDVL 842

>gb|AAO47735.1| MUC6 [Mus musculus]
          Length = 2850

 Score = 38.5 bits (88), Expect = 0.070
 Identities = 22/82 (26%), Positives = 40/82 (47%)
 Frame = -2

Query: 443  PKFELPTLSLTIFTSIFKNFNFCLITSSKSSSSTRVTESPSSSTYVKISFASAAKSKIEH 264
            P F  PT + T+ +S     +     SS+ +SST +  +  ++++V    + ++KS   H
Sbjct: 2617 PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSHVPTFSSFSSKSTTAH 2676

Query: 263  ILALSPAMLRSNLLFSATGALN 198
            + +L+     S LL S  G  N
Sbjct: 2677 LTSLTTQAATSGLLSSTMGMTN 2698

>emb|CAD54411.1| secreted gel-forming mucin [Mus musculus]
          Length = 1414

 Score = 38.5 bits (88), Expect = 0.070
 Identities = 22/82 (26%), Positives = 40/82 (47%)
 Frame = -2

Query: 443  PKFELPTLSLTIFTSIFKNFNFCLITSSKSSSSTRVTESPSSSTYVKISFASAAKSKIEH 264
            P F  PT + T+ +S     +     SS+ +SST +  +  ++++V    + ++KS   H
Sbjct: 1182 PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSHVPTFSSFSSKSTTAH 1241

Query: 263  ILALSPAMLRSNLLFSATGALN 198
            + +L+     S LL S  G  N
Sbjct: 1242 LTSLTTQAATSGLLSSTMGMTN 1263

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 545,683,727
Number of Sequences: 1393205
Number of extensions: 12087194
Number of successful extensions: 55264
Number of sequences better than 10.0: 91
Number of HSP's better than 10.0 without gapping: 46486
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 52708
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 24733321959
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
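
The footer statistics are related to the scores reported for each hit. The following is a minimal sketch, assuming the usual Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, and E = search space * 2^(-bit score)) with the gapped Lambda and K printed above; the function names are illustrative. It recomputes the bit scores from the raw scores in parentheses (334, 102, 97, 88) and the effective database length from the database size and effective HSP length. The E values it prints agree closely with the reported ones for the top hits; small differences are possible, for example because the printed parameters are rounded.

```python
import math

# Values copied from the BLASTX footer above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 118
SEARCH_SPACE = 24_733_321_959


def bit_score(raw_score):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(raw_score):
    """Estimate the E value as search space * 2 ** (-bit score)."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


# Effective database length: each sequence is shortened by the HSP length.
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
# Effective query length (in residues) implied by the reported search space.
effective_query_length = SEARCH_SPACE // effective_db_length

for raw in (334, 102, 97, 88):
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.1e}")
print(effective_db_length, effective_query_length)
```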


EST assembly


no. clone accession start end
1 SPDL030c09_f BP053864 1 580
2 MPDL048h03_f AV778948 2 412
3 MPDL042h04_f AV778650 19 539
4 MPDL044h01_f AV778745 22 415
5 MPDL030f12_f AV778000 24 374
6 MPDL004f11_f AV776733 24 306
7 MPDL036c03_f AV778290 27 619
8 MPDL059a07_f AV779481 33 557
9 MRL042b04_f BP085749 34 494
10 MFL006c09_f BP033641 35 195
11 GENLf065h11 BP065886 36 570
12 MFBL046c06_f BP043588 43 525
13 MFBL037a04_f BP043098 45 607
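
The member table can be summarised programmatically. The following is a minimal sketch, assuming that the last two columns of each row are the start and end positions of the EST on the contig (this interpretation is an assumption, as are the names `EstMember`, `parse_members` and `depth_at`). It parses the rows above, reports the span they cover, and counts how many ESTs overlap a given position.

```python
from typing import NamedTuple


class EstMember(NamedTuple):
    clone: str
    accession: str
    start: int  # assumed: first contig position covered by the EST
    end: int    # assumed: last contig position covered by the EST


# Rows copied from the table above: index, clone, accession, start, end.
TABLE = """\
1 SPDL030c09_f BP053864 1 580
2 MPDL048h03_f AV778948 2 412
3 MPDL042h04_f AV778650 19 539
4 MPDL044h01_f AV778745 22 415
5 MPDL030f12_f AV778000 24 374
6 MPDL004f11_f AV776733 24 306
7 MPDL036c03_f AV778290 27 619
8 MPDL059a07_f AV779481 33 557
9 MRL042b04_f BP085749 34 494
10 MFL006c09_f BP033641 35 195
11 GENLf065h11 BP065886 36 570
12 MFBL046c06_f BP043588 43 525
13 MFBL037a04_f BP043098 45 607
"""


def parse_members(text):
    """Parse whitespace-separated rows; skip lines without five fields."""
    members = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 5:
            continue
        _, clone, accession, start, end = fields
        members.append(EstMember(clone, accession, int(start), int(end)))
    return members


def depth_at(members, position):
    """Count the ESTs whose interval contains the given contig position."""
    return sum(m.start <= position <= m.end for m in members)


members = parse_members(TABLE)
span = (min(m.start for m in members), max(m.end for m in members))
print(len(members), span, depth_at(members, 300))
```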




Lotus japonicus
Kazusa DNA Research Institute