KMC016274A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC016274A_C01 KMC016274A_c01
tgaatgaaaccgagacatcctgctgaaatgaattgaaactgagtcatgcaAATCCTTCTG
GAATGACAATATTTGAACTGAAATGGGAAACACAAGCCCTTCTCAAATTTACAATAAGTT
ACATTCTAAAGCAAAGATTCTGAGATTCTGAGACATAATGAGAAGACAGGAAAATATGTC
TTTTCAAGATATTTCTTCCCTTGCAAAATATCTAGCCACTTTAGACAATTTTTGAGCAGC
AAGTGTTATCATTTTGTATATAAAACAAGTATCCTCCATCGGCAGTACTATTCGAGCACA
TCCACATCCTTGTGGGGCAATAGAGTTTTTTTCCATACACAAGTCTCTCATACCAAAGAC
TGCTTAAGGCTAGCTAGCTCTCCCACTTAAATCAAACATCCGATTACCATAAAAACAGAG
AAAAAAAGCTTTAAATACAAAAATGATCTAATTCCGATTACAAATCAAACTCCAGGGGAT
CAGTCCTTGCAACCAGTAAGAGATTCAGATAAGCAACAAGAGTCTTTTCTTCCTTGTTTG
CAAGCAACTAAGGTAAAAATGGCAGGTCTGCCAATGCTAAGCATTTTCAAAACCGAAAAT
TCGAAAAACACCTTCATCAGCCTGATACTTGTTTTTGTCATCACCAGCATATGCAAGTAA
ATTGTATTTAGGATTCCACTCAACGCTGTTCATGGCAGCTCTACAAGGAATCTGATGCAC
TGTTCG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC016274A_C01 KMC016274A_c01
         (726 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_200424.1| putative protein; protein id: At5g56130.1, supp...    97  3e-19
gb|EAA03873.1| agCP2291 [Anopheles gambiae str. PEST]                  37  0.37
ref|NP_115737.1| hypothetical protein MGC5469 [Homo sapiens] gi|...    36  0.48
ref|NP_082873.1| RIKEN cDNA 2410044K02 [Mus musculus] gi|1284627...    36  0.48
gb|AAH19603.1| Similar to hypothetical protein MGC5469 [Mus musc...    36  0.48

>ref|NP_200424.1| putative protein; protein id: At5g56130.1, supported by cDNA:
           gi_20260441 [Arabidopsis thaliana]
           gi|9758633|dbj|BAB09295.1|
           gb|AAF54217.1~gene_id:MDA7.19~similar to unknown protein
           [Arabidopsis thaliana] gi|20260442|gb|AAM13119.1|
           unknown protein [Arabidopsis thaliana]
           gi|23197932|gb|AAN15493.1| unknown protein [Arabidopsis
           thaliana]
          Length = 315

 Score = 96.7 bits (239), Expect = 3e-19
 Identities = 45/50 (90%), Positives = 47/50 (94%), Gaps = 1/50 (2%)
 Frame = -1

Query: 726 RTVHQIPCRAAMNSVEWNPKYNLLAYAGDDKN-KYQADEGVFRIFGFENA 580
           RTVHQIPCRAAMNSVEWNPKYNLLAYAGDDKN KY  DEGVFRIFGFE++
Sbjct: 266 RTVHQIPCRAAMNSVEWNPKYNLLAYAGDDKNPKYNTDEGVFRIFGFESS 315

>gb|EAA03873.1| agCP2291 [Anopheles gambiae str. PEST]
          Length = 268

 Score = 36.6 bits (83), Expect = 0.37
 Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
 Frame = -1

Query: 720 VHQIPCRAAMNSVEWNPKYNLLAYAGDDK--NKYQADEGVFRIFGF 589
           V  I   AA  +V W+PK  +LAYA DDK  N  + D G  +++GF
Sbjct: 221 VADISVDAATFTVAWHPKQYILAYACDDKDANDRRRDAGSLKVWGF 266

>ref|NP_115737.1| hypothetical protein MGC5469 [Homo sapiens]
           gi|13905124|gb|AAH06849.1|AAH06849 Similar to RIKEN cDNA
           2410044K02 gene [Homo sapiens]
          Length = 351

 Score = 36.2 bits (82), Expect = 0.48
 Identities = 18/47 (38%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
 Frame = -1

Query: 714 QIPCRAAMNSVEWNPKYNLLAYAGDDKN-KYQA--DEGVFRIFGFEN 583
           ++ C +   +V W+PK  LLA+A DDK+ KY +  + G  ++FG  N
Sbjct: 303 EVQCESPTFTVAWHPKRPLLAFACDDKDGKYDSSREAGTVKLFGLPN 349

>ref|NP_082873.1| RIKEN cDNA 2410044K02 [Mus musculus] gi|12846277|dbj|BAB27103.1|
           unnamed protein product [Mus musculus]
          Length = 351

 Score = 36.2 bits (82), Expect = 0.48
 Identities = 18/47 (38%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
 Frame = -1

Query: 714 QIPCRAAMNSVEWNPKYNLLAYAGDDKN-KYQA--DEGVFRIFGFEN 583
           ++ C +   +V W+PK  LLA+A DDK+ KY +  + G  ++FG  N
Sbjct: 303 EVQCESPTFTVAWHPKRPLLAFACDDKDGKYDSSREAGTVKLFGLPN 349

>gb|AAH19603.1| Similar to hypothetical protein MGC5469 [Mus musculus]
          Length = 351

 Score = 36.2 bits (82), Expect = 0.48
 Identities = 18/47 (38%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
 Frame = -1

Query: 714 QIPCRAAMNSVEWNPKYNLLAYAGDDKN-KYQA--DEGVFRIFGFEN 583
           ++ C +   +V W+PK  LLA+A DDK+ KY +  + G  ++FG  N
Sbjct: 303 EVQCESPTFTVAWHPKRPLLAFACDDKDGKYDSSREAGTVKLFGLPN 349

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 584,401,225
Number of Sequences: 1393205
Number of extensions: 12175443
Number of successful extensions: 26392
Number of sequences better than 10.0: 28
Number of HSP's better than 10.0 without gapping: 25508
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26347
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 34062062287
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFB002g10_f BP034067 1 533
2 MF017b07_f BP029132 51 597
3 SPD014b08_f BP045094 148 699
4 MWM238c01_f AV768363 178 726
5 MF078h06_f BP032449 246 623




Lotus japonicus
Kazusa DNA Research Institute