KMC000391A_c01

Fasta Sequence
>KMC000391A_C01 KMC000391A_c01
agtggcatttcacaaatATAACAAGTATATGAGTACTTAGTTTAAAGTTATGTAACCAAA
TAAAAAGTTTAAAGTTCACACATATAAGGGTGATACAAACTTTAAGAGATATAGAATAGC
AAGGCACATTTTATTGTCTGATACAACTAGTCTGCTAGGGCAATACAGACAGCTTACTCG
AAACTATTTGGTGTTATTTGCTTAAGAGGAGACTGATGTGAAAGCTATCATTTCCTCTGA
GCTAATGGTTTGATTAAGTCTGGAAGAGAGAGCTCTAGCCTTACACTCTGAATCCACCTT
GCTTTGATACGGATCTGCTCAATGCTTAAGTTCAAGTTCATGATGATGGAAGGGTTCGCG
TGACTGGCAAAAAATGCTTCTATTCCATTTGCTTCCTCGTTGCTGTGGGTCAGTGGAACT
ATTGGGCTTATGAAATGGGTGAGCAAGAGTCCAGCACCATATTTGGCCAAAA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000391A_C01 KMC000391A_c01
         (472 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T05189 glutamyl aminopeptidase homolog F4I10.20 - Arabidops...    61  5e-09
ref|NP_195035.2| aminopeptidase- like protein; protein id: At4g3...    61  5e-09
gb|EAA10722.1| ebiP4374 [Anopheles gambiae str. PEST]                  38  0.061
ref|NP_496210.1| ATPase Associated with diverse cellular Activit...    33  2.0
ref|NP_391851.1| alternate gene name: yxdE~myo-inositol cataboli...    32  2.6

>pir||T05189 glutamyl aminopeptidase homolog F4I10.20 - Arabidopsis thaliana
            gi|4455323|emb|CAB36783.1| aminopeptidase-like protein
            [Arabidopsis thaliana] gi|7270256|emb|CAB80026.1|
            aminopeptidase-like protein [Arabidopsis thaliana]
          Length = 873

 Score = 61.2 bits (147), Expect = 5e-09
 Identities = 29/74 (39%), Positives = 47/74 (63%)
 Frame = -3

Query: 461  YGAGLLLTHFISPIVPLTHSNEEANGIEAFFASHANPSIIMNLNLSIEQIRIKARWIQSV 282
            +G+G L+T FIS +V    S E+A  +E FFA+ + PS+   L  SIE++ I A W++S+
Sbjct: 798  WGSGFLITRFISAVVSPFASFEKAKEVEEFFATRSKPSMARTLKQSIERVHINANWVESI 857

Query: 281  RLELSLPDLIKPLA 240
            + E +L  L+  L+
Sbjct: 858  KKEDNLTQLVAQLS 871
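The "Frame = -3" line means the query residues come from the reverse-complement strand, reading from the third base in from the 3' end, which is why the Query coordinates run downward (461 to 240). A minimal Python sketch of that translation follows, assuming the standard genetic code; the helper names reverse_complement and translate_frame are illustrative and are not part of BLAST.

```python
# Contig sequence copied from the FASTA record above (472 nt).
SEQ = (
    "agtggcatttcacaaatATAACAAGTATATGAGTACTTAGTTTAAAGTTATGTAACCAAA"
    "TAAAAAGTTTAAAGTTCACACATATAAGGGTGATACAAACTTTAAGAGATATAGAATAGC"
    "AAGGCACATTTTATTGTCTGATACAACTAGTCTGCTAGGGCAATACAGACAGCTTACTCG"
    "AAACTATTTGGTGTTATTTGCTTAAGAGGAGACTGATGTGAAAGCTATCATTTCCTCTGA"
    "GCTAATGGTTTGATTAAGTCTGGAAGAGAGAGCTCTAGCCTTACACTCTGAATCCACCTT"
    "GCTTTGATACGGATCTGCTCAATGCTTAAGTTCAAGTTCATGATGATGGAAGGGTTCGCG"
    "TGACTGGCAAAAAATGCTTCTATTCCATTTGCTTCCTCGTTGCTGTGGGTCAGTGGAACT"
    "ATTGGGCTTATGAAATGGGTGAGCAAGAGTCCAGCACCATATTTGGCCAAAA"
)

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq: str, frame: int) -> str:
    """Translate seq in a BLAST-style frame (+1..+3 or -1..-3)."""
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1  # frame -3 skips two bases of the minus strand
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    # First residues of the frame -3 translation (query position 470 downward).
    print(translate_frame(SEQ, -3)[:17])
```

The first residues printed correspond to query position 470 downward, so the "YGAGLLL" that opens this alignment at position 461 appears from the fourth residue onward.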

>ref|NP_195035.2| aminopeptidase- like protein; protein id: At4g33090.1, supported by
            cDNA: gi_17473510 [Arabidopsis thaliana]
            gi|17473511|gb|AAL38379.1| AT4g33090/F4I10_20
            [Arabidopsis thaliana] gi|24209879|gb|AAN41401.1|
            aminopeptidase M [Arabidopsis thaliana]
            gi|29028734|gb|AAO64746.1| At4g33090/F4I10_20
            [Arabidopsis thaliana]
          Length = 879

 Score = 61.2 bits (147), Expect = 5e-09
 Identities = 29/74 (39%), Positives = 47/74 (63%)
 Frame = -3

Query: 461  YGAGLLLTHFISPIVPLTHSNEEANGIEAFFASHANPSIIMNLNLSIEQIRIKARWIQSV 282
            +G+G L+T FIS +V    S E+A  +E FFA+ + PS+   L  SIE++ I A W++S+
Sbjct: 804  WGSGFLITRFISAVVSPFASFEKAKEVEEFFATRSKPSMARTLKQSIERVHINANWVESI 863

Query: 281  RLELSLPDLIKPLA 240
            + E +L  L+  L+
Sbjct: 864  KKEDNLTQLVAQLS 877

>gb|EAA10722.1| ebiP4374 [Anopheles gambiae str. PEST]
          Length = 809

 Score = 37.7 bits (86), Expect = 0.061
 Identities = 20/60 (33%), Positives = 28/60 (46%)
 Frame = -3

Query: 470 LAKYGAGLLLTHFISPIVPLTHSNEEANGIEAFFASHANPSIIMNLNLSIEQIRIKARWI 291
           L +Y  G LL   I  +     + E A  +E FF  H  P     ++ SIE IR+ A W+
Sbjct: 750 LNQYEGGFLLARLIKYLTENFSTEERAKEVEQFFREHDFPGTERTVSQSIETIRLNADWM 809

>ref|NP_496210.1| ATPase Associated with diverse cellular Activities [Caenorhabditis
           elegans] gi|7504131|pir||T22612 hypothetical protein
           F54B3.3 - Caenorhabditis elegans
           gi|3877493|emb|CAA88471.1| Hypothetical protein F54B3.3
           [Caenorhabditis elegans]
          Length = 610

 Score = 32.7 bits (73), Expect = 2.0
 Identities = 19/47 (40%), Positives = 30/47 (63%), Gaps = 1/47 (2%)
 Frame = -2

Query: 366 QSREPFHHHELELKH*ADPYQ-SKVDSECKARALSSRLNQTISSEEM 229
           Q R+    HEL LKH    Y+  K+D+E +ARA ++R N+ ++ E+M
Sbjct: 180 QLRKQTIEHELALKH---KYELEKIDAETRARAKAARDNRDVNLEQM 223

>ref|NP_391851.1| alternate gene name: yxdE~myo-inositol catabolism [Bacillus
           subtilis] gi|1176989|sp|P42416|IOLE_BACSU IolE protein
           gi|7451104|pir||E69645 myo-inositol catabolism iolE -
           Bacillus subtilis gi|709985|dbj|BAA03294.1| hypothetical
           protein [Bacillus subtilis] gi|2636518|emb|CAB16008.1|
           iolE [Bacillus subtilis subsp. subtilis str. 168]
          Length = 297

 Score = 32.3 bits (72), Expect = 2.6
 Identities = 23/74 (31%), Positives = 31/74 (41%)
 Frame = -3

Query: 470 LAKYGAGLLLTHFISPIVPLTHSNEEANGIEAFFASHANPSIIMNLNLSIEQIRIKARWI 291
           + + GAG  L H +S IV       E  G   FF   A    I+N  L +  +RI  +W 
Sbjct: 20  MPEIGAGNTLQHLLSDIVVARFQGTEVGG---FFPEPA----ILNKELKLRNLRIAGKWF 72

Query: 290 QSVRLELSLPDLIK 249
            S  L   L +  K
Sbjct: 73  SSFILRDGLGEAAK 86

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 372,117,486
Number of Sequences: 1393205
Number of extensions: 6868119
Number of successful extensions: 13900
Number of sequences better than 10.0: 17
Number of HSP's better than 10.0 without gapping: 13700
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 13899
length of database: 448,689,247
effective HSP length: 113
effective length of database: 291,257,082
effective search space used: 12524054526
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
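
The statistics above are enough to roughly reproduce the reported scores. The sketch below assumes the usual Karlin-Altschul relations: bit score S' = (Lambda * S - ln K) / ln 2, and E = search space * 2^(-S'). It uses the gapped Lambda and K printed above. The function names are illustrative. Results should land near the reported values but not exactly on them, since the printed parameters are rounded and BLAST may apply further corrections.

```python
import math

# Values copied from the report footer above.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 113
SEARCH_SPACE = 12_524_054_526


def effective_db_length(letters: int, sequences: int, hsp_len: int) -> int:
    """Database length after subtracting the HSP length once per sequence."""
    return letters - sequences * hsp_len


def bit_score(raw_score: int, lam: float, k: float) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits: float, search_space: int) -> float:
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
    print("effective database length:", eff_db)
    # Effective query length implied by the reported search space (derived here).
    print("implied effective query length:", SEARCH_SPACE // eff_db)
    for raw in (147, 86, 73, 72):  # raw scores shown in parentheses in the hits
        bits = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
        print(raw, round(bits, 1), f"{expect_value(bits, SEARCH_SPACE):.2g}")
```

The effective database length computed this way equals the 291,257,082 printed in the footer, and dividing the search space by it gives the effective query length the search used.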


EST assemble image


   clone        accession  position
1  GENLf014g09  BP063107     1  472
2  GENLf039h12  BP064434    18  493




Lotus japonicus
Kazusa DNA Research Institute