KMC001786A_c01

Fasta Sequence
>KMC001786A_C01 KMC001786A_c01
AAACTAATTGAATCATGGCTTGAATTACTGACATAATCCCAGCTTGAACAGGAAACATCA
GCATATGATACAGAAGTCCATTTATTTCAAACTTTATTTGTGGCATAAAGCCAATGACTT
CTGGGATCTGTTACAGAAGCATGTGATTTCTAGCTTAACATGGATCATGATGTGGCACCA
GGATTCTTAGTAATGGGCAAGTTCCCATCATAAGTGGATTTGAAGACATCTGGTTTCCTC
ACAAGGCGAGGTCCAGTTGGTACCTTGTTATTTTTAGGACGGATATCCGTGGTCAAATAT
AGCTGTTGGGATGGCCAGAGTGGCACAAGCATTGGGGGCATCCACTATTGAAGAGATTCT
TCCTTCACAAGGAATGCAAGACAGCAGAAGATACACCTGTTCTTTGGAGTAACCAAACTT
GGATATGTAGTCAATGGCATTGAGCACTGCTCGCTTGAACGCAACAGTTGCATCTAGGTA
GTGCTGCTTCCCTCTTTCATCCACACTGATGCCCTCAAACACCAGCCACTCTGAGAAGCT
TGGTTCAACAGG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001786A_C01 KMC001786A_c01
         (552 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_568028.1| formamidase - like protein; protein id: At4g375...   160  5e-54
pir||T04712 probable formamidase (EC 3.5.1.49) F19F18.40 - Arabi...   160  5e-54
gb|AAM64380.1| formamidase-like protein [Arabidopsis thaliana]        158  2e-53
ref|NP_568029.1| formamidase - like protein; protein id: At4g375...   160  1e-51
gb|AAK59505.1| putative formamidase [Arabidopsis thaliana]            160  5e-51

>ref|NP_568028.1| formamidase - like protein; protein id: At4g37550.1, supported by
           cDNA: 23732. [Arabidopsis thaliana]
          Length = 452

 Score =  160 bits (405), Expect(2) = 5e-54
 Identities = 77/87 (88%), Positives = 81/87 (92%)
 Frame = -1

Query: 552 PVEPSFSEWLVFEGISVDERGKQHYLDATVAFKRAVLNAIDYISKFGYSKEQVYLLLSCI 373
           PVEP FSEWLVFEGISVDE GKQHYLDATVA+KRAVLNAIDY+ KFGYSKEQVYLLLSC 
Sbjct: 325 PVEPRFSEWLVFEGISVDESGKQHYLDATVAYKRAVLNAIDYLFKFGYSKEQVYLLLSCC 384

Query: 372 PCEGRISSIVDAPNACATLAIPTAIFD 292
           PCEGR+S IVD+PNA ATLAIPTAIFD
Sbjct: 385 PCEGRLSGIVDSPNAVATLAIPTAIFD 411

 Score = 72.4 bits (176), Expect(2) = 5e-54
 Identities = 32/40 (80%), Positives = 35/40 (87%)
 Frame = -2

Query: 287 DIRPKNNKVPTGPRLVRKPDVFKSTYDGNLPITKNPGATS 168
           DIRPKN KVP GPR+VRKPDV KSTYDG LPITKNP ++S
Sbjct: 413 DIRPKNRKVPVGPRVVRKPDVLKSTYDGKLPITKNPSSSS 452

>pir||T04712 probable formamidase (EC 3.5.1.49) F19F18.40 - Arabidopsis thaliana
           gi|4468980|emb|CAB38294.1| formamidase-like protein
           [Arabidopsis thaliana] gi|7270737|emb|CAB80420.1|
           formamidase-like protein [Arabidopsis thaliana]
          Length = 432

 Score =  160 bits (405), Expect(2) = 5e-54
 Identities = 77/87 (88%), Positives = 81/87 (92%)
 Frame = -1

Query: 552 PVEPSFSEWLVFEGISVDERGKQHYLDATVAFKRAVLNAIDYISKFGYSKEQVYLLLSCI 373
           PVEP FSEWLVFEGISVDE GKQHYLDATVA+KRAVLNAIDY+ KFGYSKEQVYLLLSC 
Sbjct: 305 PVEPRFSEWLVFEGISVDESGKQHYLDATVAYKRAVLNAIDYLFKFGYSKEQVYLLLSCC 364

Query: 372 PCEGRISSIVDAPNACATLAIPTAIFD 292
           PCEGR+S IVD+PNA ATLAIPTAIFD
Sbjct: 365 PCEGRLSGIVDSPNAVATLAIPTAIFD 391

 Score = 72.4 bits (176), Expect(2) = 5e-54
 Identities = 32/40 (80%), Positives = 35/40 (87%)
 Frame = -2

Query: 287 DIRPKNNKVPTGPRLVRKPDVFKSTYDGNLPITKNPGATS 168
           DIRPKN KVP GPR+VRKPDV KSTYDG LPITKNP ++S
Sbjct: 393 DIRPKNRKVPVGPRVVRKPDVLKSTYDGKLPITKNPSSSS 432

>gb|AAM64380.1| formamidase-like protein [Arabidopsis thaliana]
          Length = 452

 Score =  158 bits (400), Expect(2) = 2e-53
 Identities = 76/87 (87%), Positives = 81/87 (92%)
 Frame = -1

Query: 552 PVEPSFSEWLVFEGISVDERGKQHYLDATVAFKRAVLNAIDYISKFGYSKEQVYLLLSCI 373
           PVEP FSEWLVFEGISVDE GKQHYLDATVA+KRAVLNAIDY+ KFGYSKEQVYLLLSC 
Sbjct: 325 PVEPRFSEWLVFEGISVDESGKQHYLDATVAYKRAVLNAIDYLFKFGYSKEQVYLLLSCC 384

Query: 372 PCEGRISSIVDAPNACATLAIPTAIFD 292
           PCEGR+S IVD+P+A ATLAIPTAIFD
Sbjct: 385 PCEGRLSGIVDSPSAVATLAIPTAIFD 411

 Score = 72.4 bits (176), Expect(2) = 2e-53
 Identities = 32/40 (80%), Positives = 35/40 (87%)
 Frame = -2

Query: 287 DIRPKNNKVPTGPRLVRKPDVFKSTYDGNLPITKNPGATS 168
           DIRPKN KVP GPR+VRKPDV KSTYDG LPITKNP ++S
Sbjct: 413 DIRPKNRKVPVGPRVVRKPDVLKSTYDGKLPITKNPSSSS 452

>ref|NP_568029.1| formamidase - like protein; protein id: At4g37560.1, supported by
           cDNA: gi_14334653 [Arabidopsis thaliana]
           gi|23297225|gb|AAN12921.1| putative formamidase
           [Arabidopsis thaliana]
          Length = 452

 Score =  160 bits (404), Expect(2) = 1e-51
 Identities = 77/87 (88%), Positives = 81/87 (92%)
 Frame = -1

Query: 552 PVEPSFSEWLVFEGISVDERGKQHYLDATVAFKRAVLNAIDYISKFGYSKEQVYLLLSCI 373
           PVEP FSEWLVFEGISVDE G+QHYLDATVA+KRAVLNAIDY+ KFGYSKEQVYLLLSC 
Sbjct: 325 PVEPRFSEWLVFEGISVDESGRQHYLDATVAYKRAVLNAIDYLFKFGYSKEQVYLLLSCC 384

Query: 372 PCEGRISSIVDAPNACATLAIPTAIFD 292
           PCEGRIS IVD+PNA ATLAIPTAIFD
Sbjct: 385 PCEGRISGIVDSPNAVATLAIPTAIFD 411

 Score = 65.1 bits (157), Expect(2) = 1e-51
 Identities = 29/40 (72%), Positives = 33/40 (82%)
 Frame = -2

Query: 287 DIRPKNNKVPTGPRLVRKPDVFKSTYDGNLPITKNPGATS 168
           DIRPK  KVPTG R+V+KPDV KSTYDG LPITKN  ++S
Sbjct: 413 DIRPKTRKVPTGARIVKKPDVMKSTYDGKLPITKNSSSSS 452

>gb|AAK59505.1| putative formamidase [Arabidopsis thaliana]
          Length = 452

 Score =  160 bits (404), Expect(2) = 5e-51
 Identities = 77/87 (88%), Positives = 81/87 (92%)
 Frame = -1

Query: 552 PVEPSFSEWLVFEGISVDERGKQHYLDATVAFKRAVLNAIDYISKFGYSKEQVYLLLSCI 373
           PVEP FSEWLVFEGISVDE G+QHYLDATVA+KRAVLNAIDY+ KFGYSKEQVYLLLSC 
Sbjct: 325 PVEPRFSEWLVFEGISVDESGRQHYLDATVAYKRAVLNAIDYLFKFGYSKEQVYLLLSCC 384

Query: 372 PCEGRISSIVDAPNACATLAIPTAIFD 292
           PCEGRIS IVD+PNA ATLAIPTAIFD
Sbjct: 385 PCEGRISGIVDSPNAVATLAIPTAIFD 411

 Score = 62.8 bits (151), Expect(2) = 5e-51
 Identities = 28/40 (70%), Positives = 32/40 (80%)
 Frame = -2

Query: 287 DIRPKNNKVPTGPRLVRKPDVFKSTYDGNLPITKNPGATS 168
           DIRPK  KVPTG R+V+KPDV KSTYDG LPI KN  ++S
Sbjct: 413 DIRPKTRKVPTGARIVKKPDVMKSTYDGKLPIIKNSSSSS 452
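
The "Frame = -1" and "Frame = -2" lines in the alignments above indicate that BLASTX translated the reverse complement of the query, starting at the last base (frame -1) or one base in from it (frame -2). The following is a minimal Python sketch of that translation, assuming the standard genetic code (NCBI translation table 1). The function names `reverse_complement` and `translate` are illustrative and are not part of BLAST.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq, frame):
    """Translate seq in a BLAST-style frame (+1..+3 or -1..-3).

    Negative frames are read from the reverse complement; codons that
    contain anything other than A, C, G, T are translated as 'X'.
    """
    seq = seq.upper()
    if frame < 0:
        seq = reverse_complement(seq)
    start = abs(frame) - 1
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(start, len(seq) - 2, 3)
    )


if __name__ == "__main__":
    # Last 24 nucleotides of the contig (query positions 529-552).
    tail = "CTCTGAGAAGCTTGGTTCAACAGG"
    print(translate(tail, -1))  # PVEPSFSE
```

The eight residues printed for the contig's 3' end match the start of the frame -1 query line (PVEPSFSE...), which begins at query position 552.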

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 
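
The gapped Lambda and K values relate the raw scores shown in parentheses in the alignments above to the reported bit scores through the Karlin-Altschul normalization S' = (Lambda * S - ln K) / ln 2. A small sketch of that conversion follows; the function name `bit_score` is illustrative, and the constants are the rounded values printed in this report.

```python
import math

# Gapped Karlin-Altschul parameters as printed in this report (rounded).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores of the frame -2 HSPs in this report.
    for raw in (176, 157, 151):
        print(raw, round(bit_score(raw), 1))
```

For raw scores 176, 157 and 151 this gives 72.4, 65.1 and 62.8 bits, as reported. The larger scores (raw 405, 404, 400) are printed in the report as whole numbers, so agreement there is only to within about one bit.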

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 496,379,959
Number of Sequences: 1393205
Number of extensions: 11014656
Number of successful extensions: 28290
Number of sequences better than 10.0: 32
Number of HSP's better than 10.0 without gapping: 27667
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28284
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 19234190289
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
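
The length figures in the summary above can be cross-checked with simple arithmetic: the effective database length equals the total letters minus the effective HSP length for each database sequence. The sketch below redoes that calculation with the numbers from this report. The effective query length is not printed in the report; the value 67 is inferred here by dividing the reported search space by the effective database length, and reading it as a translated query length of 183 residues minus 116 is an assumption.

```python
# Values copied from the BLASTX summary in this record.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 116
REPORTED_EFFECTIVE_DB_LENGTH = 287_077_467
REPORTED_SEARCH_SPACE = 19_234_190_289


def effective_db_length(letters, sequences, hsp_length):
    """Subtract the HSP length correction once per database sequence."""
    return letters - sequences * hsp_length


eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)

# Inferred (not reported): effective query length implied by the search space.
eff_query, remainder = divmod(REPORTED_SEARCH_SPACE, eff_db)

print(eff_db == REPORTED_EFFECTIVE_DB_LENGTH)  # True
print(eff_query, remainder)                    # 67 0
```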


EST assemble image


No.  clone         accession  position (start  end)
1    GENf015d12    BP058979       1  466
2    MR032h09_f    BP078501       1  400
3    MR013g02_f    BP076975       1  390
4    GENf069g07    BP061336       2  311
5    GNf083b02     BP073466       2  394
6    GENf071e01    BP061404       2  504
7    SPD024h11_f   BP045934       7  563




Lotus japonicus
Kazusa DNA Research Institute