KMC002634A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC002634A_C01 KMC002634A_c01
gggtcgggcccccccttcatgctcttttcatgaagttaggttcaccaccatttttgaaac
cttacatttcccacctccaaAGCTTGTTAAGACCTCTGGGATCAACATGGGTAAGCATCT
TTGCTACTGGGTTTCACTCTTCAGCTCCTGGTTGGCACCCTCTCAGTCTCAGCTGCACCC
TCCACCACTTCCCCTGCAAAAATTGTGAGTGGGTTCGTCTCTAATGCTGTGCCTGCTTTC
ACCAAATGGGTATGGTCACTCAAGGCCACCACCAGAACGGCGGTTTCCAGCAGGTCGATG
ATGAAGTTCGAGAGTGGGTACAGTGTGGAGACTGTGTTTGATGGAAGCAAGCTCGGAGTT
GAGCCTTATGCTGTGGAGGTGTTGCCTAATGGGGAGCTTCTGATTCTGGATTCTGCTAAT
AGTAACCTTTACAGAATCTCCTCCTCACTTTCTCTGTATAGCAGACCAAAGCTGGTGGCT
GGATCAGCTGAAGGATATTCTGGACATGTGGATGGGAAGCTTAGAGAGGCTAGAATGAAC
CAGCCAAAGGGAATAACTGTTGATGACCGAGGAAATATTTATGTTGCAGATACTATGAAT
ATGGCAATTAGGAAAATTAGTGATTCAGGGATCACAACGATTGCTGGAGGAAAATGGAGC
CGTGGAGGGGGCCATGTTGATGGACCAAGTGAAGAAGCTAAATTTTCTGATGACTTTGAT
GTGGTTTATGTTGGAAGCAGTTGTTCTCTACTCATTGTAGATAGAGGAAACCGAGCTATC
AGAGAGGT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002634A_C01 KMC002634A_c01
         (788 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_173800.2| unknown protein; protein id: At1g23880.1, suppo...   325  4e-88
pir||T01495 hypothetical protein F17O7.19 - Arabidopsis thaliana...   288  8e-77
gb|AAF87138.1|AC002423_3 T23E23.5 [Arabidopsis thaliana]              283  3e-75
ref|NP_177185.2| unknown protein; protein id: At1g70280.1, suppo...   280  2e-74
ref|NP_196993.1| putative protein; protein id: At5g14890.1 [Arab...   261  8e-69

>ref|NP_173800.2| unknown protein; protein id: At1g23880.1, supported by cDNA:
           gi_18700090, supported by cDNA: gi_20856012 [Arabidopsis
           thaliana] gi|18700091|gb|AAL77657.1| At1g23880/T23E23_8
           [Arabidopsis thaliana] gi|20856013|gb|AAM26643.1|
           At1g23880/T23E23_8 [Arabidopsis thaliana]
          Length = 545

 Score =  325 bits (834), Expect = 4e-88
 Identities = 157/216 (72%), Positives = 193/216 (88%), Gaps = 2/216 (0%)
 Frame = +1

Query: 145 LLVGTLSVSAAPSTTSPAKIVSGFVSNAVPAFTKWVWSL--KATTRTAVSSRSMMKFESG 318
           +L+ +  V++APS+TSPAKIV+ F+SN   +  KW+WSL  K TT+TAV ++SM+KFE+G
Sbjct: 72  ILLFSAFVASAPSSTSPAKIVNSFISNHGTSLLKWLWSLSFKTTTKTAVPTKSMVKFENG 131

Query: 319 YSVETVFDGSKLGVEPYAVEVLPNGELLILDSANSNLYRISSSLSLYSRPKLVAGSAEGY 498
           YSVETV DGSKLG+EPY+++VL NGELLILDS NSN+Y+ISSSLSLYSRP+LV GS EGY
Sbjct: 132 YSVETVLDGSKLGIEPYSIQVLSNGELLILDSQNSNIYQISSSLSLYSRPRLVTGSPEGY 191

Query: 499 SGHVDGKLREARMNQPKGITVDDRGNIYVADTMNMAIRKISDSGITTIAGGKWSRGGGHV 678
            GHVDG+LR+AR+N PKG+TVDDRGNIYVADT+N AIRKIS++G+TTIAGGK  RGGGHV
Sbjct: 192 PGHVDGRLRDARLNNPKGLTVDDRGNIYVADTVNNAIRKISEAGVTTIAGGKMVRGGGHV 251

Query: 679 DGPSEEAKFSDDFDVVYVGSSCSLLIVDRGNRAIRE 786
           DGPSE+AKFS+DFDVVY+GSSCSLL++DRGN+AIRE
Sbjct: 252 DGPSEDAKFSNDFDVVYLGSSCSLLVIDRGNQAIRE 287

>pir||T01495 hypothetical protein F17O7.19 - Arabidopsis thaliana
           gi|3176691|gb|AAC18814.1| Contains homology to
           serine/threonine protein kinase gb|X99618 from
           Mycobacterium tuberculosis.  ESTs gb|F14403, gb|F14404,
           and gb|N96730 come from this gene. [Arabidopsis
           thaliana]
          Length = 493

 Score =  288 bits (736), Expect = 8e-77
 Identities = 147/222 (66%), Positives = 174/222 (78%)
 Frame = +1

Query: 121 LLLGFTLQLLVGTLSVSAAPSTTSPAKIVSGFVSNAVPAFTKWVWSLKATTRTAVSSRSM 300
           L+L   + LL G   VS+APS  SPA                            +++RSM
Sbjct: 6   LVLSILILLLSGI--VSSAPSANSPA---------------------------TIATRSM 36

Query: 301 MKFESGYSVETVFDGSKLGVEPYAVEVLPNGELLILDSANSNLYRISSSLSLYSRPKLVA 480
           +KFE+GYSVETVFDGSKLG+EPY++EVLPNGELLILDS NSN+Y+ISSSLSLYSRP+LV 
Sbjct: 37  VKFENGYSVETVFDGSKLGIEPYSIEVLPNGELLILDSENSNIYKISSSLSLYSRPRLVT 96

Query: 481 GSAEGYSGHVDGKLREARMNQPKGITVDDRGNIYVADTMNMAIRKISDSGITTIAGGKWS 660
           GS EGY GHVDG+LR+A++N PKG+TVDDRGNIYVADT+N AIRKIS+ G+TTIAGGK  
Sbjct: 97  GSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKTV 156

Query: 661 RGGGHVDGPSEEAKFSDDFDVVYVGSSCSLLIVDRGNRAIRE 786
           R GGHVDGPSE+AKFS+DFDVVYVGSSCSLL++DRGN+AIRE
Sbjct: 157 RNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIRE 198

>gb|AAF87138.1|AC002423_3 T23E23.5 [Arabidopsis thaliana]
          Length = 493

 Score =  283 bits (723), Expect = 3e-75
 Identities = 138/192 (71%), Positives = 169/192 (87%), Gaps = 1/192 (0%)
 Frame = +1

Query: 214 FVSNAVPAFTKWVWSLKATTRTA-VSSRSMMKFESGYSVETVFDGSKLGVEPYAVEVLPN 390
           F+   +  F+ +V S  ++T  A V ++SM+KFE+GYSVETV DGSKLG+EPY+++VL N
Sbjct: 7   FLGIIILLFSAFVASAPSSTSPATVPTKSMVKFENGYSVETVLDGSKLGIEPYSIQVLSN 66

Query: 391 GELLILDSANSNLYRISSSLSLYSRPKLVAGSAEGYSGHVDGKLREARMNQPKGITVDDR 570
           GELLILDS NSN+Y+ISSSLSLYSRP+LV GS EGY GHVDG+LR+AR+N PKG+TVDDR
Sbjct: 67  GELLILDSQNSNIYQISSSLSLYSRPRLVTGSPEGYPGHVDGRLRDARLNNPKGLTVDDR 126

Query: 571 GNIYVADTMNMAIRKISDSGITTIAGGKWSRGGGHVDGPSEEAKFSDDFDVVYVGSSCSL 750
           GNIYVADT+N AIRKIS++G+TTIAGGK  RGGGHVDGPSE+AKFS+DFDVVY+GSSCSL
Sbjct: 127 GNIYVADTVNNAIRKISEAGVTTIAGGKMVRGGGHVDGPSEDAKFSNDFDVVYLGSSCSL 186

Query: 751 LIVDRGNRAIRE 786
           L++DRGN+AIRE
Sbjct: 187 LVIDRGNQAIRE 198

>ref|NP_177185.2| unknown protein; protein id: At1g70280.1, supported by cDNA:
           gi_17065223 [Arabidopsis thaliana]
           gi|17065224|gb|AAL32766.1| Unknown protein [Arabidopsis
           thaliana] gi|21387163|gb|AAM47985.1| unknown protein
           [Arabidopsis thaliana]
          Length = 447

 Score =  280 bits (715), Expect = 2e-74
 Identities = 132/163 (80%), Positives = 153/163 (92%)
 Frame = +1

Query: 298 MMKFESGYSVETVFDGSKLGVEPYAVEVLPNGELLILDSANSNLYRISSSLSLYSRPKLV 477
           M+KFE+GYSVETVFDGSKLG+EPY++EVLPNGELLILDS NSN+Y+ISSSLSLYSRP+LV
Sbjct: 1   MVKFENGYSVETVFDGSKLGIEPYSIEVLPNGELLILDSENSNIYKISSSLSLYSRPRLV 60

Query: 478 AGSAEGYSGHVDGKLREARMNQPKGITVDDRGNIYVADTMNMAIRKISDSGITTIAGGKW 657
            GS EGY GHVDG+LR+A++N PKG+TVDDRGNIYVADT+N AIRKIS+ G+TTIAGGK 
Sbjct: 61  TGSPEGYPGHVDGRLRDAKLNHPKGLTVDDRGNIYVADTVNNAIRKISEGGVTTIAGGKT 120

Query: 658 SRGGGHVDGPSEEAKFSDDFDVVYVGSSCSLLIVDRGNRAIRE 786
            R GGHVDGPSE+AKFS+DFDVVYVGSSCSLL++DRGN+AIRE
Sbjct: 121 VRNGGHVDGPSEDAKFSNDFDVVYVGSSCSLLVIDRGNKAIRE 163

>ref|NP_196993.1| putative protein; protein id: At5g14890.1 [Arabidopsis thaliana]
           gi|11357718|pir||T51434 hypothetical protein F2G14_10 -
           Arabidopsis thaliana gi|9755656|emb|CAC01808.1| putative
           protein [Arabidopsis thaliana]
          Length = 733

 Score =  261 bits (667), Expect = 8e-69
 Identities = 128/204 (62%), Positives = 165/204 (80%), Gaps = 8/204 (3%)
 Frame = +1

Query: 199 KIVSGFVSNAVPAFTKWVWSLKA------TTRTAVSSRSMMKFESGYSVETVFDGSKLGV 360
           +IVSG V+N      KW+WSL+       TT++ VSSRSM+K+ESGY++ETVFDGSKLG+
Sbjct: 11  EIVSGLVTNVASILWKWLWSLQTSTTTTTTTKSGVSSRSMVKYESGYNMETVFDGSKLGI 70

Query: 361 EPYAVEVLPNG-ELLILDSANSNLYRISSSLSLYSRPKLVAGSAEGYSGHVDGKLREARM 537
           EPYA+EV PNG EL++LDS NSN+++IS  LS Y +PKL++GS EGY+GHVDGKL+EARM
Sbjct: 71  EPYAIEVSPNGGELIVLDSENSNIHKISMPLSRYGKPKLLSGSQEGYTGHVDGKLKEARM 130

Query: 538 NQPKGITVDDRGNIYVADTMNMAIRKISDSGITTI-AGGKWSRGGGHVDGPSEEAKFSDD 714
           N+P+G+ +DDRGNIYVADT+NMAIRKISD G++TI AGG+WS G        E  +FSDD
Sbjct: 131 NRPRGLAMDDRGNIYVADTINMAIRKISDDGVSTIAAGGRWSGG-----SKEESMRFSDD 185

Query: 715 FDVVYVGSSCSLLIVDRGNRAIRE 786
           FD++YV SSCSLL++DRGN+ I+E
Sbjct: 186 FDLIYVSSSCSLLVIDRGNQLIKE 209

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 764,924,592
Number of Sequences: 1393205
Number of extensions: 18803564
Number of successful extensions: 55400
Number of sequences better than 10.0: 139
Number of HSP's better than 10.0 without gapping: 51626
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 55311
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 39495713322
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPDL079b04_f AV780570 1 555
2 GENf088d04 BP062073 305 788




Lotus japonicus
Kazusa DNA Research Institute