KMC021196A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC021196A_C01 KMC021196A_c01
acgTTGAAATGATTTTTTATATTTTATTTAACAATTTATTTATAAATAATAATGTTGGCC
TATTAGAGTAGAGTTAAATAATCTCATGGTTCACTTAGCCAACCCATAATAAGTGATGCT
ACATACACATACACAGTACAAACCAGTTTGGATTGAACTTCCAAACAATGTTTAAAATCA
GAATGAAACATTGAAAGCGTGGCTTTGAAATAGGAGACCTTTTCCAAATAGTTCATGCAA
CAAAATGGGTAAGATAGGCCTCAGCCAGCATGTGACCCAACTTAACTTGAGCCTCCAAGG
TAAGATGAAGATTATCTTCCTTCAACTGAAATCCCTTTGCATCCACGCAAATCACATTTG
GAAGTTCAATCCCTTTTTGTGCTTCCCTCACTTTCTCCATGTACTCAAACCCTGAGGCTA
TTGCAACCTGAATGATGGGAAGAGATGGAAGATTGAGGTCTTGACGAACATTGTGGATGA
GCGTTTCCATGTTGACCTTGTAAGCCTCTGCATCATGCTCACTGGAAGTGTCACTCTCTC
CCTGGAACCACAGCAATGCTTTGATCTCGCCACCGTGGTCGCCGGTCACGCTGAATTTGG
CTCTCTTCACCATGTTTTCATACAACTCCTCCCCACGCGCCCACTCCTTCATGGGGGTGC
CACCCACGGCACACGGGACAAGGCCGAACTCGCCGCCCACGCGCCGCCGCACCGCGTTGG
CGAAGCACATGCCGGGACCCACGCCGCAGACTTTCTTAGTGTCGATGTCGGTGTGGAGAG
GCTCGTGAGCTGGCTCCCATCGGAGGGCGGCGCTGAGGCGGAGGATGGAAGGGTCAGGGT
GGCACTCCGGTGGAACGACGCCGTCCCAGCGCTTGTTTGGGTGGTGGTGGGGGTTCTTAA
TCACGCCGCCTCGTCCGgccatgttgctttggccggagagaatgaagatctgtcgtttgg
ttttgggtggatgctcgtttggatctgcatgtgtggc


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC021196A_C01 KMC021196A_c01
         (997 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM65927.1| unknown [Arabidopsis thaliana]                         322  6e-87
ref|NP_567960.1| Expressed protein; protein id: At4g34215.1, sup...   317  2e-85
pir||T05414 protein kinase homolog F28A23.20 - Arabidopsis thali...   238  1e-61
ref|NP_190869.1| putative protein; protein id: At3g53010.1 [Arab...   176  5e-43
ref|NP_347168.1| Acetylxylan esterase related enzyme [Clostridiu...    89  1e-16

>gb|AAM65927.1| unknown [Arabidopsis thaliana]
          Length = 260

 Score =  322 bits (825), Expect = 6e-87
 Identities = 167/250 (66%), Positives = 190/250 (75%), Gaps = 3/250 (1%)
 Frame = -1

Query: 982 PNEHPPKTKRQIFILSGQSNMAGRGGVIKNPHHHPNKRWDGVVPPECHPDPSILRLSAAL 803
           P    P    QIFILSGQSNMAGRGGV+K+ HHH    WD ++PPEC P+ SILRLSA L
Sbjct: 13  PEIQSPIPPNQIFILSGQSNMAGRGGVVKD-HHHNRWVWDKILPPECAPNSSILRLSADL 71

Query: 802 RWEPAHEPLHTDIDTKKVCGVGPGMCFANAVRRRVGGE---FGLVPCAVGGTPMKEWARG 632
           RWE AHEPLH DIDT KVCGVGPGM FANAV+ RV  +    GLVPCA GGT +KEW RG
Sbjct: 72  RWEEAHEPLHVDIDTGKVCGVGPGMAFANAVKNRVETDSAVIGLVPCASGGTAIKEWERG 131

Query: 631 EELYENMVKRAKFSVTGDHGGEIKALLWFQGESDTSSEHDAEAYKVNMETLIHNVRQDLN 452
             LYE MVKR + S     GGEIKA+LW+QGESD    HDAE+Y  NM+ LI N+R DLN
Sbjct: 132 SHLYERMVKRTEES--RKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLN 189

Query: 451 LPSLPIIQVAIASGFEYMEKVREAQKGIELPNVICVDAKGFQLKEDNLHLTLEAQVKLGH 272
           LPSLPIIQVAIASG  Y++KVREAQ G++L NV+CVDAKG  LK DNLHLT EAQV+LG 
Sbjct: 190 LPSLPIIQVAIASGGGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQVQLGL 249

Query: 271 MLAEAYLTHF 242
            LA+AYL++F
Sbjct: 250 SLAQAYLSNF 259

>ref|NP_567960.1| Expressed protein; protein id: At4g34215.1, supported by cDNA:
           6401. [Arabidopsis thaliana]
          Length = 260

 Score =  317 bits (812), Expect = 2e-85
 Identities = 165/250 (66%), Positives = 189/250 (75%), Gaps = 3/250 (1%)
 Frame = -1

Query: 982 PNEHPPKTKRQIFILSGQSNMAGRGGVIKNPHHHPNKRWDGVVPPECHPDPSILRLSAAL 803
           P    P    QIFILSGQSNMAGRGGV K+ HH+    WD ++PPEC P+ SILRLSA L
Sbjct: 13  PEIQSPIPPNQIFILSGQSNMAGRGGVFKD-HHNNRWVWDKILPPECAPNSSILRLSADL 71

Query: 802 RWEPAHEPLHTDIDTKKVCGVGPGMCFANAVRRRVGGE---FGLVPCAVGGTPMKEWARG 632
           RWE AHEPLH DIDT KVCGVGPGM FANAV+ R+  +    GLVPCA GGT +KEW RG
Sbjct: 72  RWEEAHEPLHVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWERG 131

Query: 631 EELYENMVKRAKFSVTGDHGGEIKALLWFQGESDTSSEHDAEAYKVNMETLIHNVRQDLN 452
             LYE MVKR + S     GGEIKA+LW+QGESD    HDAE+Y  NM+ LI N+R DLN
Sbjct: 132 SHLYERMVKRTEES--RKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLN 189

Query: 451 LPSLPIIQVAIASGFEYMEKVREAQKGIELPNVICVDAKGFQLKEDNLHLTLEAQVKLGH 272
           LPSLPIIQVAIASG  Y++KVREAQ G++L NV+CVDAKG  LK DNLHLT EAQV+LG 
Sbjct: 190 LPSLPIIQVAIASGGGYIDKVREAQLGLKLSNVVCVDAKGLPLKSDNLHLTTEAQVQLGL 249

Query: 271 MLAEAYLTHF 242
            LA+AYL++F
Sbjct: 250 SLAQAYLSNF 259

>pir||T05414 protein kinase homolog F28A23.20 - Arabidopsis thaliana
            gi|2911040|emb|CAA17550.1| receptor protein kinase-like
            protein [Arabidopsis thaliana] gi|7270372|emb|CAB80139.1|
            receptor protein kinase-like protein [Arabidopsis
            thaliana]
          Length = 980

 Score =  238 bits (606), Expect = 1e-61
 Identities = 123/192 (64%), Positives = 140/192 (72%), Gaps = 3/192 (1%)
 Frame = -1

Query: 982  PNEHPPKTKRQIFILSGQSNMAGRGGVIKNPHHHPNKRWDGVVPPECHPDPSILRLSAAL 803
            P    P    QIFILSGQSNMAGRGGV K+ HH+    WD ++PPEC P+ SILRLSA L
Sbjct: 788  PEIQSPIPPNQIFILSGQSNMAGRGGVFKD-HHNNRWVWDKILPPECAPNSSILRLSADL 846

Query: 802  RWEPAHEPLHTDIDTKKVCGVGPGMCFANAVRRRV---GGEFGLVPCAVGGTPMKEWARG 632
            RWE AHEPLH DIDT KVCGVGPGM FANAV+ R+       GLVPCA GGT +KEW RG
Sbjct: 847  RWEEAHEPLHVDIDTGKVCGVGPGMAFANAVKNRLETDSAVIGLVPCASGGTAIKEWERG 906

Query: 631  EELYENMVKRAKFSVTGDHGGEIKALLWFQGESDTSSEHDAEAYKVNMETLIHNVRQDLN 452
              LYE MVKR + S     GGEIKA+LW+QGESD    HDAE+Y  NM+ LI N+R DLN
Sbjct: 907  SHLYERMVKRTEES--RKCGGEIKAVLWYQGESDVLDIHDAESYGNNMDRLIKNLRHDLN 964

Query: 451  LPSLPIIQVAIA 416
            LPSLPIIQV+++
Sbjct: 965  LPSLPIIQVSLS 976

>ref|NP_190869.1| putative protein; protein id: At3g53010.1 [Arabidopsis thaliana]
           gi|11357879|pir||T47558 hypothetical protein F8J2.180 -
           Arabidopsis thaliana gi|7529725|emb|CAB86905.1| putative
           protein [Arabidopsis thaliana]
          Length = 169

 Score =  176 bits (446), Expect = 5e-43
 Identities = 88/166 (53%), Positives = 110/166 (66%)
 Frame = -1

Query: 922 MAGRGGVIKNPHHHPNKRWDGVVPPECHPDPSILRLSAALRWEPAHEPLHTDIDTKKVCG 743
           MAGRGGV  +   +    WDGV+PPEC  +PSILRL++ L W+ A EPLH DID  K  G
Sbjct: 1   MAGRGGVYNDTATNTTV-WDGVIPPECRSNPSILRLTSKLEWKEAKEPLHVDIDINKTNG 59

Query: 742 VGPGMCFANAVRRRVGGEFGLVPCAVGGTPMKEWARGEELYENMVKRAKFSVTGDHGGEI 563
           VGPGM FAN V  R G + GLVPC++GGT + +W +GE LYE  VKRAK ++    GG  
Sbjct: 60  VGPGMPFANRVVNRFG-QVGLVPCSIGGTKLSQWQKGEFLYEETVKRAKAAMASGGGGSY 118

Query: 562 KALLWFQGESDTSSEHDAEAYKVNMETLIHNVRQDLNLPSLPIIQV 425
           +A+LW+QGESDT    DA  YK  +     ++R DL  P+LPIIQV
Sbjct: 119 RAVLWYQGESDTVDMVDASVYKKRLVKFFSDLRNDLQHPNLPIIQV 164

>ref|NP_347168.1| Acetylxylan esterase related enzyme [Clostridium acetobutylicum]
           gi|25495842|pir||A96965 acetylxylan esterase related
           enzyme [imported] - Clostridium acetobutylicum
           gi|15023393|gb|AAK78508.1|AE007568_2 Acetylxylan
           esterase related enzyme [Clostridium acetobutylicum]
          Length = 282

 Score = 89.0 bits (219), Expect = 1e-16
 Identities = 72/243 (29%), Positives = 115/243 (46%), Gaps = 12/243 (4%)
 Frame = -1

Query: 946 FILSGQSNMAGRGGVIKNPHHHPNKRWDGVVPPECHPDPSILRLSAALRWEPAHEPLHTD 767
           F++ GQSNMAGRG + + P  + N+R               +++    RW+   EP++ D
Sbjct: 5   FLMLGQSNMAGRGFINEVPMIY-NER---------------IQMLRNGRWQMMTEPINYD 48

Query: 766 IDTKKVCGVGPGMCFANAVRRRVGGEF-GLVPCAVGGTPMKEWARGEELYENMVKRAKFS 590
              + V G+     FA+A  ++   +  GL+PCA GG+ + EWA    L+ + +  AKF+
Sbjct: 49  ---RPVSGISLAGSFADAWSQKNQEDIIGLIPCAEGGSSIDEWALDGVLFRHALTEAKFA 105

Query: 589 VTGDHGGEIKALLWFQGESDTSSEHDAEAYKVNMETLIHNVRQDLNLPSLPIIQVAIASG 410
           +      E+  +LW QGESD S   + + Y   +  +I  +R++LN+P +PII   +   
Sbjct: 106 M---ESSELTGILWHQGESD-SLNGNYKVYYKKLLLIIEALRKELNVPDIPIIIGGLGDF 161

Query: 409 F----------EYMEKVREAQK-GIELPNVICVDAKGFQLKEDNLHLTLEAQVKLGHMLA 263
                      EY    +E QK   E  N   V A G     D +H+   +Q K G    
Sbjct: 162 LGKERFGKGCTEYNFINKELQKFAFEQDNCYFVTASGLTCNPDGIHIDAISQRKFGLRYF 221

Query: 262 EAY 254
           EA+
Sbjct: 222 EAF 224

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,003,811,331
Number of Sequences: 1393205
Number of extensions: 26906236
Number of successful extensions: 122251
Number of sequences better than 10.0: 378
Number of HSP's better than 10.0 without gapping: 93268
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 117294
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 57117888189
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MF048b04_f BP030801 1 542
2 MF054a05_f BP031122 3 471
3 MF030a06_f BP029829 6 560
4 MF044d10_f BP030615 15 495
5 MFBL050d02_f BP043813 428 998




Lotus japonicus
Kazusa DNA Research Institute