KMC004118A_c01

Fasta Sequence
>KMC004118A_C01 KMC004118A_c01
aattCTGCAAAGACTACACATTAGTAGCATTTTGCTGAAAACATTTAGGCGCATGTGTTC
ATTTATACCAACACCCTCTTCTTACACATATAAATAGAATAGAATGCCGTCATAAAATTA
GAAGAAATGGCCGAGAGATATAATATGAACGCAAGCAGGTAGGTAATTGAAGAAAACTGT
ACATGTGTATATAAAAACAAATGGGAAATGGCCCTTGTGATTCATTGATCCTAGCTAGCT
ATCATCTATTTCATTAACCTCTTGTCAGTGATGACCACGCCTAAGGGCTGACAGAAATCC
TCCCGATTTGTCCCTGCGCCGATCCTCTCCATCATCGCCAACCTCCTTAGCTAGCTTCTG
CAGGATGTCAATAATTTCAGAGAAATCGGGTCTCAATGTTGAATCTTGCTGCCAAGACCT
CTCAAGAAGCTCAACAAACTTTGGATGAGTGTTCTTGGGAATGGTGGGCCGCAGGCCCTT
TTGAACCACTCCTATAGCTGCCTGCAGAGGA
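
The record above can be checked against the BLASTX report that follows with a short Python sketch. It counts the bases and translates the reverse strand one base in, which is the usual reading of BLAST frame -2. The helper names (`reverse_complement`, `translate`) are illustrative only, the codon table is the standard genetic code, and the lowercase leading bases are simply upper-cased; none of this is taken from the Kazusa pipeline.

```python
from itertools import product

# Contig sequence copied from the FASTA record above.
SEQ = (
    "aattCTGCAAAGACTACACATTAGTAGCATTTTGCTGAAAACATTTAGGCGCATGTGTTC"
    "ATTTATACCAACACCCTCTTCTTACACATATAAATAGAATAGAATGCCGTCATAAAATTA"
    "GAAGAAATGGCCGAGAGATATAATATGAACGCAAGCAGGTAGGTAATTGAAGAAAACTGT"
    "ACATGTGTATATAAAAACAAATGGGAAATGGCCCTTGTGATTCATTGATCCTAGCTAGCT"
    "ATCATCTATTTCATTAACCTCTTGTCAGTGATGACCACGCCTAAGGGCTGACAGAAATCC"
    "TCCCGATTTGTCCCTGCGCCGATCCTCTCCATCATCGCCAACCTCCTTAGCTAGCTTCTG"
    "CAGGATGTCAATAATTTCAGAGAAATCGGGTCTCAATGTTGAATCTTGCTGCCAAGACCT"
    "CTCAAGAAGCTCAACAAACTTTGGATGAGTGTTCTTGGGAATGGTGGGCCGCAGGCCCTT"
    "TTGAACCACTCCTATAGCTGCCTGCAGAGGA"
).upper()

# Standard genetic code, codons enumerated in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), AMINO)}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str, offset: int = 0) -> str:
    """Translate complete codons starting at `offset`; unknown codons become X."""
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Frame -2: reverse strand, skipping its first base (assumed BLAST convention).
protein = translate(reverse_complement(SEQ), offset=1)
print(len(SEQ))
print(protein[:6])
```

The base count should agree with the "(511 letters)" query length in the report, and the translation should begin with the same residues as the query lines that start at position 510.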


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004118A_C01 KMC004118A_c01
         (511 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_568041.1| protein kinase like protein; protein id: At4g38...   117  1e-25
ref|NP_195303.2| putative protein; protein id: At4g35780.1, supp...   113  1e-24
pir||T04683 hypothetical protein F8D20.290 - Arabidopsis thalian...   102  3e-21
pir||T05675 hypothetical protein F20M13.30 - Arabidopsis thalian...    99  2e-20
pir||T04688 hypothetical protein F4B14.50 - Arabidopsis thaliana       83  2e-15

>ref|NP_568041.1| protein kinase like protein; protein id: At4g38470.1 [Arabidopsis
           thaliana]
          Length = 575

 Score =  117 bits (292), Expect = 1e-25
 Identities = 57/86 (66%), Positives = 70/86 (81%), Gaps = 5/86 (5%)
 Frame = -2

Query: 510 PLQAAIGVVQKGLRPTIPKNTHPKFVELLERSWQQDSTLRPDFSEIIDILQKLAKEVGDD 331
           PLQAA+GVVQKGLRPTIPKNTHPK  ELLER W+ DST RPDFSEII+ LQ++AKEVG++
Sbjct: 490 PLQAAVGVVQKGLRPTIPKNTHPKLAELLERLWEHDSTQRPDFSEIIEQLQEIAKEVGEE 549

Query: 330 GEDRRRDKS---GGFLSALRRG--HH 268
           GE++++  +   GG  +ALRR   HH
Sbjct: 550 GEEKKKSSTGLGGGIFAALRRSTTHH 575

>ref|NP_195303.2| putative protein; protein id: At4g35780.1, supported by cDNA:
           gi_20260235 [Arabidopsis thaliana]
           gi|20260236|gb|AAM13016.1| putative protein [Arabidopsis
           thaliana] gi|22136520|gb|AAM91338.1| putative protein
           [Arabidopsis thaliana]
          Length = 570

 Score =  113 bits (282), Expect = 1e-24
 Identities = 52/80 (65%), Positives = 65/80 (81%)
 Frame = -2

Query: 510 PLQAAIGVVQKGLRPTIPKNTHPKFVELLERSWQQDSTLRPDFSEIIDILQKLAKEVGDD 331
           PLQAA+GVVQKGLRP IPK THPK  ELLE+ WQQD  LRP+F+EII++L +L +EVGDD
Sbjct: 492 PLQAAVGVVQKGLRPKIPKETHPKLTELLEKCWQQDPALRPNFAEIIEMLNQLIREVGDD 551

Query: 330 GEDRRRDKSGGFLSALRRGH 271
             +R +DK GG+ S L++GH
Sbjct: 552 --ERHKDKHGGYFSGLKKGH 569

>pir||T04683 hypothetical protein F8D20.290 - Arabidopsis thaliana
           gi|3367596|emb|CAA20048.1| putative protein [Arabidopsis
           thaliana] gi|7270530|emb|CAB81487.1| putative protein
           [Arabidopsis thaliana]
          Length = 553

 Score =  102 bits (254), Expect = 3e-21
 Identities = 49/80 (61%), Positives = 61/80 (76%)
 Frame = -2

Query: 510 PLQAAIGVVQKGLRPTIPKNTHPKFVELLERSWQQDSTLRPDFSEIIDILQKLAKEVGDD 331
           PLQAA+GVVQKGLRP IPK THPK  ELLE+ WQQD  LRP+F+EII++L +L +EV D 
Sbjct: 475 PLQAAVGVVQKGLRPKIPKETHPKLTELLEKCWQQDPALRPNFAEIIEMLNQLIREVID- 533

Query: 330 GEDRRRDKSGGFLSALRRGH 271
                +DK GG+ S L++GH
Sbjct: 534 -LSLHKDKHGGYFSGLKKGH 552

>pir||T05675 hypothetical protein F20M13.30 - Arabidopsis thaliana
           gi|4467134|emb|CAB37503.1| protein kinase like protein
           [Arabidopsis thaliana] gi|7270830|emb|CAB80511.1|
           protein kinase like protein [Arabidopsis thaliana]
          Length = 545

 Score = 99.4 bits (246), Expect = 2e-20
 Identities = 48/76 (63%), Positives = 60/76 (78%), Gaps = 5/76 (6%)
 Frame = -2

Query: 480 KGLRPTIPKNTHPKFVELLERSWQQDSTLRPDFSEIIDILQKLAKEVGDDGEDRRRDKS- 304
           KGLRPTIPKNTHPK  ELLER W+ DST RPDFSEII+ LQ++AKEVG++GE++++  + 
Sbjct: 470 KGLRPTIPKNTHPKLAELLERLWEHDSTQRPDFSEIIEQLQEIAKEVGEEGEEKKKSSTG 529

Query: 303 --GGFLSALRRG--HH 268
             GG  +ALRR   HH
Sbjct: 530 LGGGIFAALRRSTTHH 545

>pir||T04688 hypothetical protein F4B14.50 - Arabidopsis thaliana
          Length = 509

 Score = 83.2 bits (204), Expect = 2e-15
 Identities = 39/70 (55%), Positives = 51/70 (72%)
 Frame = -2

Query: 480 KGLRPTIPKNTHPKFVELLERSWQQDSTLRPDFSEIIDILQKLAKEVGDDGEDRRRDKSG 301
           +GLRP IPK THPK  ELLE+ WQQD  LRP+F+EII++L +L +EV D      +DK G
Sbjct: 441 EGLRPKIPKETHPKLTELLEKCWQQDPALRPNFAEIIEMLNQLIREVID--LSLHKDKHG 498

Query: 300 GFLSALRRGH 271
           G+ S L++GH
Sbjct: 499 GYFSGLKKGH 508
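
The percentages on the Identities, Positives and Gaps lines above look truncated rather than rounded (60/76 is printed as 78%, not 79%). A minimal Python sketch that reproduces them under that assumption follows; `blast_percent` is a hypothetical helper name, not a BLAST function.

```python
def blast_percent(count: int, length: int) -> int:
    """Integer percentage, truncated (assumed from the figures in the report)."""
    return count * 100 // length


# (identities, positives, alignment length) for the five hits above.
hits = [(57, 70, 86), (52, 65, 80), (49, 61, 80), (48, 60, 76), (39, 51, 70)]

for identities, positives, length in hits:
    print(
        f"Identities = {identities}/{length} ({blast_percent(identities, length)}%), "
        f"Positives = {positives}/{length} ({blast_percent(positives, length)}%)"
    )
```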

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 
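
The gapped Lambda and K values relate the raw scores (in parentheses on each "Score" line) to the bit scores through the standard Karlin-Altschul normalisation, S' = (Lambda * S - ln K) / ln 2. The Python sketch below applies that formula to the five raw scores in this report; the function name is illustrative, and because Lambda and K are printed to only three significant digits the results are approximate.

```python
import math

LAMBDA_GAPPED = 0.267  # "Gapped" Lambda from the report
K_GAPPED = 0.0410      # "Gapped" K from the report


def bit_score(raw_score: int, lam: float = LAMBDA_GAPPED, k: float = K_GAPPED) -> float:
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the five hits, as printed in parentheses after the bit scores.
for raw in (292, 282, 254, 246, 204):
    print(raw, round(bit_score(raw), 1))
```

The values come out close to the reported 117, 113, 102, 99.4 and 83.2 bits.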

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 419,755,495
Number of Sequences: 1393205
Number of extensions: 8612770
Number of successful extensions: 22728
Number of sequences better than 10.0: 434
Number of HSP's better than 10.0 without gapping: 22186
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 22720
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 15942513235
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
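
The length figures in the statistics above are related by simple arithmetic, sketched here in Python. The effective database length is the raw length minus one effective HSP length per sequence, and the search space divides evenly by it. The E-value line uses the usual approximation E = search space * 2^(-bit score); since the printed bit scores and E values are rounded, agreement with the report is only approximate. All constant names are illustrative.

```python
DB_LETTERS = 448_689_247        # "length of database"
DB_SEQUENCES = 1_393_205        # "Number of Sequences"
EFFECTIVE_HSP_LENGTH = 114      # "effective HSP length"
SEARCH_SPACE = 15_942_513_235   # "effective search space used"

# Each database sequence is shortened by the effective HSP length.
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
print(effective_db_length)

# Effective query length implied by the reported search space.
implied_query_length = SEARCH_SPACE // effective_db_length
print(implied_query_length)


def expect(bits: float, search_space: int = SEARCH_SPACE) -> float:
    """Approximate E value for a given bit score."""
    return search_space * 2.0 ** (-bits)


print(f"{expect(117):.0e}")
```

The first value should match the reported effective database length of 289,863,877, and the E value for 117 bits lands near the 1e-25 listed for the top hit.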


EST assemble image


no.  clone        accession  start  end
1    MR006c09_f   BP076385       1  381
2    MWM063f05_f  AV765709       5  358
3    MR083d01_f   BP082376      26  512
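
With the assembly image unavailable, the clone positions above can still be summarised as per-base coverage. The Python sketch below assumes the two position numbers are 1-based, inclusive start and end coordinates on the contig; that reading is not stated in the record. Note that the third clone ends at 512, one past the 511-letter query length reported by BLASTX, and the sketch takes the coordinates at face value.

```python
from collections import Counter

# Clone name -> (start, end), copied from the table above.
clones = {
    "MR006c09_f": (1, 381),
    "MWM063f05_f": (5, 358),
    "MR083d01_f": (26, 512),
}

# Count how many clones cover each position (inclusive coordinates assumed).
depth = Counter()
for start, end in clones.values():
    for position in range(start, end + 1):
        depth[position] += 1

span = (min(depth), max(depth))
positions_by_depth = Counter(depth.values())

print("covered span:", span)
for n_clones in sorted(positions_by_depth):
    print(f"{positions_by_depth[n_clones]} positions covered by {n_clones} clone(s)")
```

Under these assumptions, the region covered by all three ESTs is positions 26 to 358.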




Lotus japonicus
Kazusa DNA Research Institute