KMC003300A_c01

Fasta Sequence
>KMC003300A_C01 KMC003300A_c01
gaataaagagttgttaataacaactcctccttgcttaccaaaaaaaaacaactcctaaaG
GGGATGCCTGCTAAAAAGGCAATGACCAGATCAATTTATTCAACTCAACAAATATTATTA
CATTGCCACCAAGGTTTTTGGTGTATCAAATACAGAATGCAGTGAGGTTCTTCTTTTGCA
AGAGTTCAGTTCTCCAATTACAATTCTTATCATCTGATATATCTATGGGAGCAACTATGT
ATTCTGCAATTTTAGGAACAGGCCTAGACCAGAACACACCCTTCCATCACCGTAACAGCC
TTCCCACGCAAGAACACTCGTGGGTTCTCCTCATCAAGATGAATATGGAGAATTCCCCCT
CTGGGTGATGCCTGATTAGCAATGAAATCACACTTCCCTAGCTTCTTACTCCAGTAGGAT
GCTAAGGCACAATGTGCAGTCCCACAAACAGGATCCTCATCGACTCCAAATTTTGGGCAG
AAGAATCGACTATAGAAGTCAAATCCCGACTCTGGAGGAGCAATCGCTGAGACAATTACC
CCCCTTCCAGGAAATTTAACCATTGCATCAAGTTGTGGCTGAACTGCTATGACATCTTCT
CTTGATGCGAGTCAACCAGGAAGTCCTCGCTATTTTGTGACTCTTTATATCATTATGGGA
GCACCAT
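
The BLASTX hits reported below are all in frame -3, that is, on the reverse complement of this sequence. The following is a minimal sketch of how that reading frame can be reproduced, using only the last 67 nucleotides of the contig (the final two FASTA lines) and the standard genetic code. The function names are illustrative and not part of any Kazusa or NCBI tool, and the frame offset used here assumes the fragment ends at the 3' end of the contig.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Last 67 nt of KMC003300A_c01 (final two lines of the FASTA record).
CONTIG_TAIL = (
    "CTTGATGCGAGTCAACCAGGAAGTCCTCGCTATTTTGTGACTCTTTATATCATTATGGGA"
    "GCACCAT"
)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (case-insensitive)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate_minus_frame(seq: str, frame: int) -> str:
    """Translate the minus strand in frame 1, 2 or 3 (i.e. -1, -2, -3).

    Assumes `seq` extends to the 3' end of the query, so that frame -k
    starts k-1 bases into the reverse complement.
    """
    rc = reverse_complement(seq)
    offset = frame - 1
    codons = (rc[i:i + 3] for i in range(offset, len(rc) - 2, 3))
    # Unknown codons (e.g. containing N) are rendered as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    print(translate_minus_frame(CONTIG_TAIL, 3))
```

The translated fragment should begin with the same residues as the first Query line of the alignments below (GAPIMI*RVTK*...), where `*` marks a stop codon.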


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003300A_C01 KMC003300A_c01
         (667 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_192195.1| putative protein; protein id: At4g02860.1, supp...   149  3e-35
gb|AAM20496.1| putative protein [Arabidopsis thaliana]                149  4e-35
ref|NP_171820.1| unknown protein; protein id: At1g03210.1, suppo...   142  4e-33
ref|NP_192194.1| putative protein; protein id: At4g02850.1 [Arab...   138  8e-32
dbj|BAA88529.1| unnamed protein product [Oryza sativa (japonica ...   135  5e-31

>ref|NP_192195.1| putative protein; protein id: At4g02860.1, supported by cDNA:
           gi_20466357 [Arabidopsis thaliana]
           gi|25354020|pir||D85036 hypothetical protein AT4g02860
           [imported] - Arabidopsis thaliana
           gi|4263517|gb|AAD15343.1| similar to PHZF, catalyzing
           the hydroxylation of phenazine-1-carboxylic acid to
           2-hydroxy-phenazine-1-carboxylic acid [Arabidopsis
           thaliana] gi|7269771|emb|CAB77771.1| putative protein
           [Arabidopsis thaliana]
          Length = 294

 Score =  149 bits (376), Expect = 3e-35
 Identities = 76/133 (57%), Positives = 94/133 (70%)
 Frame = -3

Query: 665 GAPIMI*RVTK*RGLPG*LASREDVIAVQPQLDAMVKFPGRGVIVSAIAPPESGFDFYSR 486
           GA I+  + T    +   L S+E V  +QP++D ++K P  G+IV+A     S +DFYSR
Sbjct: 162 GATIVDIKATATNNILVVLPSKESVTELQPRMDDILKCPCDGIIVTAAGSTGSSYDFYSR 221

Query: 485 FFCPKFGVDEDPVCGTAHCALASYWSKKLGKCDFIANQASPRGGILHIHLDEENPRVFLR 306
           +F PKFGVDEDPVCG+AHCALA YWS K+ K DF+A QAS R G + IHLD+E  RV LR
Sbjct: 222 YFAPKFGVDEDPVCGSAHCALAHYWSIKMNKFDFLAYQASSRSGTIRIHLDKEKQRVLLR 281

Query: 305 GKAVTVMEGCVLV 267
           GKAVTVMEG VLV
Sbjct: 282 GKAVTVMEGHVLV 294

>gb|AAM20496.1| putative protein [Arabidopsis thaliana]
          Length = 294

 Score =  149 bits (375), Expect = 4e-35
 Identities = 72/115 (62%), Positives = 87/115 (75%)
 Frame = -3

Query: 611 LASREDVIAVQPQLDAMVKFPGRGVIVSAIAPPESGFDFYSRFFCPKFGVDEDPVCGTAH 432
           L S+E V  +QP++D ++K P  G+IV+A     S +DFYSR+F PKFGVDEDPVCG+AH
Sbjct: 180 LPSKESVTELQPRMDDILKCPCDGIIVTAAGSTGSSYDFYSRYFAPKFGVDEDPVCGSAH 239

Query: 431 CALASYWSKKLGKCDFIANQASPRGGILHIHLDEENPRVFLRGKAVTVMEGCVLV 267
           CALA YWS K+ K DF+A QAS R G + IHLD+E  RV LRGKAVTVMEG VLV
Sbjct: 240 CALAHYWSIKMNKFDFLAYQASSRSGTIRIHLDKEKQRVLLRGKAVTVMEGHVLV 294

>ref|NP_171820.1| unknown protein; protein id: At1g03210.1, supported by cDNA:
           gi_12083231 [Arabidopsis thaliana]
           gi|25354022|pir||D86163 F15K9.19 protein - Arabidopsis
           thaliana gi|3850584|gb|AAC72124.1| ESTs gb|H37641 and
           gb|AA651422 come from this gene. [Arabidopsis thaliana]
           gi|12083232|gb|AAG48775.1|AF332412_1 unknown protein
           [Arabidopsis thaliana]
          Length = 286

 Score =  142 bits (358), Expect = 4e-33
 Identities = 75/133 (56%), Positives = 96/133 (71%)
 Frame = -3

Query: 665 GAPIMI*RVTK*RGLPG*LASREDVIAVQPQLDAMVKFPGRGVIVSAIAPPESGFDFYSR 486
           GA I+  + TK + L   L+S E VI ++P+LD + K P  G++V+A A   S +DF SR
Sbjct: 155 GATILDVKATK-KDLLVVLSSWEAVIDLKPRLDEISKCPCEGMMVTAAASDGSTYDFCSR 213

Query: 485 FFCPKFGVDEDPVCGTAHCALASYWSKKLGKCDFIANQASPRGGILHIHLDEENPRVFLR 306
           +F P+FG++EDPV G+AHCALA YWS ++ KCDF A QAS RGG L +HLD+E  RV LR
Sbjct: 214 YFAPRFGINEDPVTGSAHCALAHYWSLRMNKCDFFAYQASSRGGTLKVHLDKEKQRVLLR 273

Query: 305 GKAVTVMEGCVLV 267
           GKAVTVMEG VLV
Sbjct: 274 GKAVTVMEGYVLV 286

>ref|NP_192194.1| putative protein; protein id: At4g02850.1 [Arabidopsis thaliana]
           gi|25354018|pir||C85036 hypothetical protein AT4g02850
           [imported] - Arabidopsis thaliana
           gi|4263518|gb|AAD15344.1| similar to PHZF, catalyzing
           the hydroxylation of phenazine-1-carboxylic acid to
           2-hydroxy-phenazine-1-carboxylic acid [Arabidopsis
           thaliana] gi|7269770|emb|CAB77770.1| putative protein
           [Arabidopsis thaliana]
          Length = 313

 Score =  138 bits (347), Expect = 8e-32
 Identities = 67/115 (58%), Positives = 84/115 (72%)
 Frame = -3

Query: 611 LASREDVIAVQPQLDAMVKFPGRGVIVSAIAPPESGFDFYSRFFCPKFGVDEDPVCGTAH 432
           L+S E VI  QP++D +VK PG+ +IV+A AP  S FDF SR F PK G++ED VCG+AH
Sbjct: 199 LSSWESVIEFQPRVDDIVKCPGKVMIVTAAAPQGSPFDFCSRLFAPKLGLNEDSVCGSAH 258

Query: 431 CALASYWSKKLGKCDFIANQASPRGGILHIHLDEENPRVFLRGKAVTVMEGCVLV 267
           C+LA YWS K+ KCDF+A  AS R G L +H D+E  RV L GKAVTVM+G +LV
Sbjct: 259 CSLAHYWSLKMNKCDFVAFAASQRSGTLKVHYDKEKQRVLLTGKAVTVMKGSILV 313

>dbj|BAA88529.1| unnamed protein product [Oryza sativa (japonica cultivar-group)]
           gi|6815075|dbj|BAA90362.1| unnamed protein product
           [Oryza sativa (japonica cultivar-group)]
          Length = 309

 Score =  135 bits (340), Expect = 5e-31
 Identities = 65/114 (57%), Positives = 83/114 (72%)
 Frame = -3

Query: 611 LASREDVIAVQPQLDAMVKFPGRGVIVSAIAPPESGFDFYSRFFCPKFGVDEDPVCGTAH 432
           L+S ++V  + P ++ + K  GRGVIV+  AP  S +DF+SRFFCPKFG+DEDPVCG+AH
Sbjct: 195 LSSGKEVADIIPNINEIKKCDGRGVIVTGPAPAGSDYDFFSRFFCPKFGIDEDPVCGSAH 254

Query: 431 CALASYWSKKLGKCDFIANQASPRGGILHIHLDEENPRVFLRGKAVTVMEGCVL 270
           C LA YW  KLGK    A QASPR G L++ LD EN RV ++G+AVTVM G +L
Sbjct: 255 CVLAPYWGGKLGKQKLTAFQASPRSGTLYLELDGENRRVRIQGEAVTVMAGTLL 308

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 602,555,773
Number of Sequences: 1393205
Number of extensions: 13663995
Number of successful extensions: 29009
Number of sequences better than 10.0: 77
Number of HSP's better than 10.0 without gapping: 27997
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28987
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 28855580904
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
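
The scores and E values in the hit list can be cross-checked against the statistics printed above. The sketch below applies the Karlin-Altschul relations, bit score = (lambda * S - ln K) / ln 2 and E = K * N * exp(-lambda * S), with the gapped lambda and K and the "effective search space used" as N. The division by (1 - 0.1) is an assumption: it treats the decay constant on the "frameshift window, decay const" line as a single-HSP correction. The function names are illustrative only.

```python
import math

# Values copied from the report above (gapped parameters).
LAMBDA = 0.267
K = 0.0410
SEARCH_SPACE = 28_855_580_904   # "effective search space used"
DECAY = 0.1                     # assumed single-HSP gap decay correction

# Raw scores (in parentheses in the report) with the printed bits and E value.
REPORTED = {
    376: (149, "3e-35"),
    375: (149, "4e-35"),
    358: (142, "4e-33"),
    347: (138, "8e-32"),
    340: (135, "5e-31"),
}


def bit_score(raw: int) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA * raw - math.log(K)) / math.log(2)


def expect(raw: int, decay: float = DECAY) -> float:
    """Expected number of chance hits with at least this raw score.

    Dividing by (1 - decay) is an assumption, not something the report states.
    """
    return K * SEARCH_SPACE * math.exp(-LAMBDA * raw) / (1.0 - decay)


if __name__ == "__main__":
    for raw, (bits, evalue) in REPORTED.items():
        print(f"raw={raw}  bits={bit_score(raw):.2f} (reported {bits})  "
              f"E={expect(raw):.1e} (reported {evalue})")
```

Under these assumptions, the integer part of each computed bit score matches the printed value, and the computed E values round to the printed ones.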


EST assemble image


No.  Clone        Accession  Position (start, end)
1    MFB051g02_f  BP037720     1  397
2    MPD037f08_f  AV772544    55  566
3    MWM104d02_f  AV766414    66  487
4    MR057f05_f   BP080394    72  418
5    GNf087c12    BP073778    73  494
6    GNf084a12    BP073533    73  546
7    SPD006c03_f  BP044458    76  557
8    GNf032h03    BP069725    85  440
9    GNf043d03    BP070527    85  481
10   MF006a06_f   BP028523    95  227
11   SPD043h09_f  BP047465    98  459
12   MFB021b06_f  BP035476    98  676
13   MFB038h01_f  BP036818   101  568
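
The clone table above stands in for the assembly image, so the layout of the ESTs can be recovered from the positions alone. The following is a rough sketch, assuming the two numbers in the position column are 1-based inclusive start and end coordinates on the assembly. The helper names are hypothetical.

```python
from typing import List, NamedTuple


class Est(NamedTuple):
    clone: str
    accession: str
    start: int
    end: int


# Rows copied from the table above: clone, accession, start, end.
EST_ROWS = """\
MFB051g02_f BP037720 1 397
MPD037f08_f AV772544 55 566
MWM104d02_f AV766414 66 487
MR057f05_f BP080394 72 418
GNf087c12 BP073778 73 494
GNf084a12 BP073533 73 546
SPD006c03_f BP044458 76 557
GNf032h03 BP069725 85 440
GNf043d03 BP070527 85 481
MF006a06_f BP028523 95 227
SPD043h09_f BP047465 98 459
MFB021b06_f BP035476 98 676
MFB038h01_f BP036818 101 568
"""


def parse_ests(text: str) -> List[Est]:
    """Parse whitespace-separated rows into Est records."""
    ests = []
    for line in text.splitlines():
        clone, accession, start, end = line.split()
        ests.append(Est(clone, accession, int(start), int(end)))
    return ests


def depth_at(ests: List[Est], position: int) -> int:
    """Number of ESTs covering a position (coordinates assumed inclusive)."""
    return sum(1 for est in ests if est.start <= position <= est.end)


ESTS = parse_ests(EST_ROWS)

if __name__ == "__main__":
    span = (min(e.start for e in ESTS), max(e.end for e in ESTS))
    print("ESTs:", len(ESTS), "span:", span)
    for pos in (1, 100, 300, 600):
        print(f"depth at {pos}: {depth_at(ESTS, pos)}")
```

Sampling the depth at a few positions shows where the assembly is supported by many reads and where it rests on a single clone.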




Lotus japonicus
Kazusa DNA Research Institute