KMC000314A_c01

Fasta Sequence
>KMC000314A_C01 KMC000314A_c01
aacaaatagataatagcaatAACAAAAAACAGAAGTTATGCTATAGTAGCTTCATATAAA
AAATGGGCACGTGACAAATACTGTCAGGTTAAATTACAATGATTTGAAAGAGCCAGCTTC
AGATATGTTAAGGATTACAAAAATCAAATGTGACCAATGGCTATGTGGAATCCAAAATCA
TGGGTGCAAGTAAGCCACAGGATGCTCCCTTCTTACAATACCCACTTTCATAAAATTTAC
AAACTCGCTGTCCTTGGGGAGAAGGCCTATATGAGCCTCCATTCCCAACACCAAACAATG
ATGGTCTGTTCGAAGCAGTTCTGACTCTACCAAAACCTGATTCCCTGCCAGGACCAGGAA
AACTCCGATCTCTGGGCACAGAATAACGATCGCTGCCATATCTTGTTTGGCCATCCCCAA
GGCCCGGAGCTACTACAGAAGTGTATGAATTTGTGCTTCTGTTCTCATGCACTTCCATCT
GCCCTACTCCCCAACTCGTGTTTGTCTCCCGCTGATCCATTCCACCCCATCCCAAGTGTG
TATTTTCCAACCCAAGTCTCCAAGTCGTGTCTATCGCCATGCCCA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000314A_C01 KMC000314A_c01
         (585 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_190681.1| putative protein; protein id: At3g51120.1 [Arab...    55  8e-07
ref|NP_194274.1| putative protein; protein id: At4g25440.1 [Arab...    39  0.037
emb|CAC19847.1| zfwd1 protein [Arabidopsis thaliana]                   39  0.037
emb|CAC19848.1| zfwd2 protein [Arabidopsis thaliana]                   37  0.24
ref|NP_200011.1| putative protein; protein id: At5g51980.1 [Arab...    37  0.24

>ref|NP_190681.1| putative protein; protein id: At3g51120.1 [Arabidopsis thaliana]
            gi|11290564|pir||T45743 hypothetical protein F24M12.160 -
            Arabidopsis thaliana gi|6562264|emb|CAB62634.1| putative
            protein [Arabidopsis thaliana]
          Length = 1247

 Score = 54.7 bits (130), Expect = 8e-07
 Identities = 47/135 (34%), Positives = 60/135 (43%), Gaps = 6/135 (4%)
 Frame = -3

Query: 568  TTWRLGLENTHLGWGGMDQRETNTSWGVGQMEVHENRSTNSYTSVVAPGLGDGQTRYGSD 389
            T  RLG E T     G  +R   +  GV        RS +S+ S  A G  +   R    
Sbjct: 1117 TALRLGSETTVEA--GTVERLPKSVLGVSSEP--SPRSLSSHDSSSARGSTERSPRVSQP 1172

Query: 388  RYSV--PRDRSFPGPGRESGFGRVRTASNRP--SLFGVGNG-GSYRPSP-QGQRVCKFYE 227
            + S    RDR +   G  S F         P  +  G  +G GSY   P +G ++CKFYE
Sbjct: 1173 KRSSGHSRDRQWLNNGHNSSFNNSHNNRQWPYSNSHGYDHGSGSYAAHPPKGLKICKFYE 1232

Query: 226  SGYCKKGASCGLLAP 182
            SGYCK+GASC    P
Sbjct: 1233 SGYCKRGASCSFWHP 1247
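
The "Frame = -3" line and the descending query coordinates (for example 226 down to 182) mean the protein-level query is a translation of the reverse strand. The following is a minimal sketch of that relationship, not the BLAST implementation itself. The helper names (reverse_complement, translate, minus_frame) are hypothetical, the 45-nt literal is positions 182-226 of the FASTA sequence above, and the standard genetic code is assumed.

```python
# Sketch: reproduce one reverse-strand (Frame = -3) query segment by hand.
# Helper names are illustrative; the standard genetic code is assumed.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate DNA codon by codon; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def minus_frame(query_length: int, start: int) -> int:
    """Reverse-strand frame for a segment whose first (highest) query
    coordinate is `start`, using 1-based coordinates."""
    return -((query_length - start) % 3 + 1)


# Positions 182-226 of the 585-letter query, forward strand.
SEGMENT_182_226 = "GGGTGCAAGTAAGCCACAGGATGCTCCCTTCTTACAATACCCACT"

if __name__ == "__main__":
    print(translate(reverse_complement(SEGMENT_182_226)))
    print(minus_frame(585, 568))
```

Translating the reverse complement of positions 182-226 gives SGYCKKGASCGLLAP, the last query line of the first alignment, and the start coordinates 568, 526 and 463 used in the report all map to frame -3 under this formula.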

>ref|NP_194274.1| putative protein; protein id: At4g25440.1 [Arabidopsis thaliana]
           gi|7486814|pir||T05803 hypothetical protein M7J2.190 -
           Arabidopsis thaliana gi|2980806|emb|CAA18182.1| putative
           protein [Arabidopsis thaliana]
          Length = 668

 Score = 39.3 bits (90), Expect = 0.037
 Identities = 35/120 (29%), Positives = 45/120 (37%), Gaps = 7/120 (5%)
 Frame = -3

Query: 526 GGMDQRETNTS-------WGVGQMEVHENRSTNSYTSVVAPGLGDGQTRYGSDRYSVPRD 368
           GG   R T  S       W  G+     NR    Y     PG G G     S++  V  +
Sbjct: 17  GGGSNRPTTDSNQKVCFHWRAGRC----NRYPCPYLHRELPGPGSGPVAASSNK-RVADE 71

Query: 367 RSFPGPGRESGFGRVRTASNRPSLFGVGNGGSYRPSPQGQRVCKFYESGYCKKGASCGLL 188
             F GP    G G   TA+N       G  G  R   + +++CKF+  G C  G  C  L
Sbjct: 72  SGFAGPSHRRGPGFSGTANNW------GRFGGNRTVTKTEKLCKFWVDGNCPYGDKCRYL 125

>emb|CAC19847.1| zfwd1 protein [Arabidopsis thaliana]
          Length = 430

 Score = 39.3 bits (90), Expect = 0.037
 Identities = 35/120 (29%), Positives = 45/120 (37%), Gaps = 7/120 (5%)
 Frame = -3

Query: 526 GGMDQRETNTS-------WGVGQMEVHENRSTNSYTSVVAPGLGDGQTRYGSDRYSVPRD 368
           GG   R T  S       W  G+     NR    Y     PG G G     S++  V  +
Sbjct: 17  GGGSNRPTTDSNQKVCFHWRAGRC----NRYPCPYLHRELPGPGSGPVAASSNK-RVADE 71

Query: 367 RSFPGPGRESGFGRVRTASNRPSLFGVGNGGSYRPSPQGQRVCKFYESGYCKKGASCGLL 188
             F GP    G G   TA+N       G  G  R   + +++CKF+  G C  G  C  L
Sbjct: 72  SGFAGPSHRRGPGFSGTANNW------GRFGGNRTVTKTEKLCKFWVDGNCPYGDKCRYL 125

>emb|CAC19848.1| zfwd2 protein [Arabidopsis thaliana]
          Length = 443

 Score = 36.6 bits (83), Expect = 0.24
 Identities = 30/93 (32%), Positives = 38/93 (40%), Gaps = 1/93 (1%)
 Frame = -3

Query: 463 NRSTNSYTSVVAPGLGDGQTRY-GSDRYSVPRDRSFPGPGRESGFGRVRTASNRPSLFGV 287
           NRS   Y     PG G GQ +  G     V  +  F GP    G G    +S+    FG 
Sbjct: 45  NRSPCPYLHRELPGPGPGQGQGPGYTNKRVAEESGFAGPSHRRGPGFNGNSSSSWGRFG- 103

Query: 286 GNGGSYRPSPQGQRVCKFYESGYCKKGASCGLL 188
           GN    R   + ++VC F+  G C  G  C  L
Sbjct: 104 GN----RTVTKTEKVCNFWVDGNCTYGDKCRYL 132

>ref|NP_200011.1| putative protein; protein id: At5g51980.1 [Arabidopsis thaliana]
           gi|10177733|dbj|BAB11046.1| contains similarity to
           myosin heavy chain kinase~gene_id:MSG15.6 [Arabidopsis
           thaliana]
          Length = 437

 Score = 36.6 bits (83), Expect = 0.24
 Identities = 30/93 (32%), Positives = 38/93 (40%), Gaps = 1/93 (1%)
 Frame = -3

Query: 463 NRSTNSYTSVVAPGLGDGQTRY-GSDRYSVPRDRSFPGPGRESGFGRVRTASNRPSLFGV 287
           NRS   Y     PG G GQ +  G     V  +  F GP    G G    +S+    FG 
Sbjct: 45  NRSPCPYLHRELPGPGPGQGQGPGYTNKRVAEESGFAGPSHRRGPGFNGNSSSSWGRFG- 103

Query: 286 GNGGSYRPSPQGQRVCKFYESGYCKKGASCGLL 188
           GN    R   + ++VC F+  G C  G  C  L
Sbjct: 104 GN----RTVTKTEKVCNFWVDGNCTYGDKCRYL 132

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 589,235,831
Number of Sequences: 1393205
Number of extensions: 15038293
Number of successful extensions: 40696
Number of sequences better than 10.0: 49
Number of HSP's better than 10.0 without gapping: 38681
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 40647
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21997688174
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
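
The statistics above can be tied back to the scores in the hit list with the usual Karlin-Altschul relations. This is a rough sketch, assuming bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). The function names are hypothetical, and because the report prints lambda, K and the bit scores in rounded form, the computed E-values only approximate the reported ones.

```python
import math

# Values copied from the BLASTX report above (gapped parameters).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 117
SEARCH_SPACE = 21_997_688_174


def bit_score(raw_score: int, lam: float = LAMBDA_GAPPED, k: float = K_GAPPED) -> float:
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits: float, search_space: int = SEARCH_SPACE) -> float:
    """Approximate E-value for a bit score in the given search space."""
    return search_space * 2.0 ** (-bits)


def effective_db_length(letters: int = DB_LETTERS,
                        sequences: int = DB_SEQUENCES,
                        hsp_length: int = EFFECTIVE_HSP_LENGTH) -> int:
    """Database length after subtracting the effective HSP length per sequence."""
    return letters - sequences * hsp_length


if __name__ == "__main__":
    for raw in (130, 90, 83):
        bits = bit_score(raw)
        print(raw, round(bits, 1), f"{expect_value(bits):.1e}")
    print(effective_db_length())
```

With the printed parameters, raw scores 130, 90 and 83 give bit scores of 54.7, 39.3 and 36.6, and the effective database length comes out as 285,684,262, matching the report. The E-values land close to, but not exactly on, the reported 8e-07, 0.037 and 0.24 because of the rounding noted above.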


EST assemble image


 #  clone         accession  position
 1  MFL020e10_f   BP033872     1  296
 2  GNLf017b02    BP075771    21  392
 3  GENLf071b08   BP066164    21  447
 4  MPDL004g05_f  AV776736    27  313
 5  MWM195c01_f   AV767716    27  594
 6  SPDL048b10_f  BP054993    27  558
 7  MRL040e05_f   BP085674    28  407
 8  GENLf030h01   BP063942    30  508
 9  SPDL057g06_f  BP055595    31  577
10  GENLf080h07   BP066720    32  558
11  GENLf011g05   BP062945    34  546
12  GENf048b09    BP060366    38  232
13  GENLf067h09   BP065982    38  478
14  GENf058a07    BP060798    38  405
15  MRL033e10_f   BP085352    43  400
16  MWL051b06_f   AV769445    47  453
17  GENLf082h11   BP066845    48  558
18  GENLf022c10   BP063500   111  575
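
The clone list can be read into records for further processing. The sketch below assumes the two numbers in the position column are the start and end of each EST within the assembly. The names EstPlacement, parse_rows and assembly_span are hypothetical and not part of any Kazusa tool.

```python
from typing import List, NamedTuple, Tuple

# Rows copied from the EST assemble table above.
ROWS = """\
1 MFL020e10_f BP033872 1 296
2 GNLf017b02 BP075771 21 392
3 GENLf071b08 BP066164 21 447
4 MPDL004g05_f AV776736 27 313
5 MWM195c01_f AV767716 27 594
6 SPDL048b10_f BP054993 27 558
7 MRL040e05_f BP085674 28 407
8 GENLf030h01 BP063942 30 508
9 SPDL057g06_f BP055595 31 577
10 GENLf080h07 BP066720 32 558
11 GENLf011g05 BP062945 34 546
12 GENf048b09 BP060366 38 232
13 GENLf067h09 BP065982 38 478
14 GENf058a07 BP060798 38 405
15 MRL033e10_f BP085352 43 400
16 MWL051b06_f AV769445 47 453
17 GENLf082h11 BP066845 48 558
18 GENLf022c10 BP063500 111 575
"""


class EstPlacement(NamedTuple):
    clone: str
    accession: str
    start: int  # assumed: first position covered in the assembly
    end: int    # assumed: last position covered in the assembly


def parse_rows(text: str) -> List[EstPlacement]:
    """Parse whitespace-separated rows; skip headers and blank lines."""
    records = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 5 or not fields[0].isdigit():
            continue
        _, clone, accession, start, end = fields
        records.append(EstPlacement(clone, accession, int(start), int(end)))
    return records


def assembly_span(records: List[EstPlacement]) -> Tuple[int, int]:
    """Smallest start and largest end over all ESTs."""
    return min(r.start for r in records), max(r.end for r in records)


if __name__ == "__main__":
    ests = parse_rows(ROWS)
    print(len(ests), assembly_span(ests))
```

Splitting on whitespace means the same parser works whether the rows are single-space separated or column-aligned.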




Lotus japonicus
Kazusa DNA Research Institute