KMC015794A_c01

Fasta Sequence
>KMC015794A_C01 KMC015794A_c01
ttccatcaattctatggagcatcagcatataagtcttccaagcacaaaggatatcagtcc
atgtactaaactcatagtacAAAAAATCTACTATGGGAAGTTGGAATCACTTATATCTCA
TATCATCTGATAAAGCAAGACAGCAAAATTGTAACATTCTGATGTGAGGAGCTTAAAAAA
AATAAATTACATACAGTGGCCTCTAAATACAGTACTAAGATCAAGAATGTGATCGTGACC
GTACAAATTCTTCTAGCCTATCCCACACGTCCAATGCTGCGGCCCTGTCCTGGTGTCGCT
TGTTTTTGCTAATCGAGATAATCTTTACATTCCACTGCTTCACAGTCTTAACTGACTCCA
TACTATCGTCTTCAAAACGTATGAAAAATCCCATAATTTTATTAAAAATCTCAACATGAT
CCTTGAAAGGGCATTCCTTAAACTG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC015794A_C01 KMC015794A_c01
         (445 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_188898.1| unknown protein; protein id: At3g22590.1, suppo...   139  6e-33
ref|NP_649863.1| CG11990-PA [Drosophila melanogaster] gi|7299132...    54  4e-07
gb|EAA14889.1| agCP4958 [Anopheles gambiae str. PEST]                  53  7e-07
ref|NP_500465.2| Putative nuclear protein of eukaryotic origin [...    47  4e-05
ref|NP_666103.1| cDNA sequence, BC027756; hypothetical protein M...    47  7e-05
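The summary table above can be read programmatically. The following is a minimal Python sketch, assuming each row ends with the bit score and the E value as its last two whitespace-separated fields; the function name parse_hit_line is hypothetical and not part of BLAST.

```python
def parse_hit_line(line):
    """Split one summary row into (description, bit score, E value)."""
    # Assumption: the last two whitespace-separated fields are the score
    # and the E value; everything before them is the truncated description.
    description, bits, evalue = line.rsplit(None, 2)
    return description.strip(), float(bits), float(evalue)


# The five rows of the summary table, copied from the report.
summary = """\
ref|NP_188898.1| unknown protein; protein id: At3g22590.1, suppo...   139  6e-33
ref|NP_649863.1| CG11990-PA [Drosophila melanogaster] gi|7299132...    54  4e-07
gb|EAA14889.1| agCP4958 [Anopheles gambiae str. PEST]                  53  7e-07
ref|NP_500465.2| Putative nuclear protein of eukaryotic origin [...    47  4e-05
ref|NP_666103.1| cDNA sequence, BC027756; hypothetical protein M...    47  7e-05
"""

hits = [parse_hit_line(row) for row in summary.splitlines() if row.strip()]

for description, bits, evalue in hits:
    # The first token of the description is the database identifier.
    print(f"{bits:6.1f}  {evalue:8.0e}  {description.split()[0]}")
```

Splitting from the right keeps the sketch independent of how the description is truncated, since only the two trailing numeric columns have a fixed meaning.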

>ref|NP_188898.1| unknown protein; protein id: At3g22590.1, supported by cDNA:
           gi_17529301 [Arabidopsis thaliana]
           gi|11994291|dbj|BAB01474.1| gene_id:F16J14.15~unknown
           protein [Arabidopsis thaliana]
           gi|17529302|gb|AAL38878.1| unknown protein [Arabidopsis
           thaliana] gi|23296828|gb|AAN13180.1| unknown protein
           [Arabidopsis thaliana]
          Length = 415

 Score =  139 bits (351), Expect = 6e-33
 Identities = 65/74 (87%), Positives = 72/74 (97%)
 Frame = -1

Query: 445 QFKECPFKDHVEIFNKIMGFFIRFEDDSMESVKTVKQWNVKIISISKNKRHQDRAAALDV 266
           QFK+ PFKDHVEIFNKI+GFF+RFEDDS+ES KTVKQWNVKIISISKNKRHQDRAAAL+V
Sbjct: 342 QFKDWPFKDHVEIFNKIIGFFLRFEDDSIESAKTVKQWNVKIISISKNKRHQDRAAALEV 401

Query: 265 WDRLEEFVRSRSHS 224
           W++LEEFVRSRSHS
Sbjct: 402 WEKLEEFVRSRSHS 415
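Frame = -1 means the query was translated from its reverse complement, starting at the last base (position 445), which is why the query coordinates count downward. The Python sketch below illustrates this on the final 25 bases of the FASTA record only; the helper names are hypothetical, and the standard genetic code is assumed.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame_minus1(seq):
    """Translate the reverse strand, starting at the last base of seq."""
    rc = reverse_complement(seq)
    usable = len(rc) - len(rc) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(rc[i:i + 3], "X") for i in range(0, usable, 3))


# Last line of the FASTA record (bases 421 to 445 of the query).
tail = "CCTTGAAAGGGCATTCCTTAAACTG"
print(translate_frame_minus1(tail))  # QFKECPFK
```

The result matches the first eight residues of the query line in the alignment above (QFKECPFK, query positions 445 down to 422).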

>ref|NP_649863.1| CG11990-PA [Drosophila melanogaster] gi|7299132|gb|AAF54331.1|
           CG11990-PA [Drosophila melanogaster]
           gi|16769708|gb|AAL29073.1| LD47989p [Drosophila
           melanogaster]
          Length = 538

 Score = 53.9 bits (128), Expect = 4e-07
 Identities = 25/69 (36%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
 Frame = -1

Query: 445 QFKECPFKDH-VEIFNKIMGFFIRFEDDSMESVKTVKQWNVKIISISKNKRHQDRAAALD 269
           QFK  P++ + V+IF+KI  F + F +  ++S   V++W+V ++ +S+NKRH DRA    
Sbjct: 463 QFKGWPWEGNPVDIFSKICAFHLCFSEMKLDS--NVERWSVTLLRLSQNKRHMDRAVLSK 520

Query: 268 VWDRLEEFV 242
            W+ L++++
Sbjct: 521 FWETLDKYI 529

>gb|EAA14889.1| agCP4958 [Anopheles gambiae str. PEST]
          Length = 543

 Score = 53.1 bits (126), Expect = 7e-07
 Identities = 25/69 (36%), Positives = 46/69 (66%), Gaps = 1/69 (1%)
 Frame = -1

Query: 445 QFKECPFKDH-VEIFNKIMGFFIRFEDDSMESVKTVKQWNVKIISISKNKRHQDRAAALD 269
           QFK  P+  + VEIF+KI  F +R++D  +++   V +W V +++IS+ KRH D+A  + 
Sbjct: 468 QFKGWPWDGNPVEIFSKIAAFHLRYDDLKLDA--NVAKWAVTVLNISRTKRHLDKACLMA 525

Query: 268 VWDRLEEFV 242
            W++L+ ++
Sbjct: 526 FWEKLDLYM 534

>ref|NP_500465.2| Putative nuclear protein of eukaryotic origin [Caenorhabditis
           elegans] gi|16950402|gb|AAF39795.2| Hypothetical protein
           F35F11.1 [Caenorhabditis elegans]
          Length = 554

 Score = 47.4 bits (111), Expect = 4e-05
 Identities = 24/58 (41%), Positives = 31/58 (53%)
 Frame = -1

Query: 412 EIFNKIMGFFIRFEDDSMESVKTVKQWNVKIISISKNKRHQDRAAALDVWDRLEEFVR 239
           +IF  I  F    + D  + V  V QWNV  I +S  KRH D+A    VW+ +E FVR
Sbjct: 487 DIFTHIPAFHFHVDQD--KPVAQVMQWNVHKIPVSATKRHMDKARFSQVWETIENFVR 542

>ref|NP_666103.1| cDNA sequence, BC027756; hypothetical protein MGC36559 [Mus
           musculus] gi|20379598|gb|AAH27756.1| cDNA sequence,
           BC027756 [Mus musculus] gi|21411055|gb|AAH31127.1|
           Unknown (protein for MGC:36559) [Mus musculus]
           gi|26348817|dbj|BAC38048.1| unnamed protein product [Mus
           musculus]
          Length = 531

 Score = 46.6 bits (109), Expect = 7e-05
 Identities = 25/78 (32%), Positives = 46/78 (58%), Gaps = 5/78 (6%)
 Frame = -1

Query: 445 QFKECPFK----DHVEIFNKIMGFFIRFEDDSMESVKTVKQWNVKIISISKNKRHQDRAA 278
           QFK  P+       V+IF KI  F +++++  ++    V++W+V ++ +S +KRH DR  
Sbjct: 453 QFKGWPWLLPDGSPVDIFAKIKAFHLKYDEVRLDP--NVQKWDVTVLELSYHKRHLDRPV 510

Query: 277 ALDVWDRLEEF-VRSRSH 227
            L  W+ L+ + V+ +SH
Sbjct: 511 FLRFWETLDRYMVKHKSH 528

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 347,913,734
Number of Sequences: 1393205
Number of extensions: 6351606
Number of successful extensions: 15971
Number of sequences better than 10.0: 28
Number of HSP's better than 10.0 without gapping: 15754
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 15967
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 6655800768
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
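The bit scores and E values reported for each alignment can be approximately recovered from the raw scores (the numbers in parentheses after the bit scores) using the gapped Lambda and K and the effective search space listed above. The sketch below assumes the usual Karlin-Altschul relations, bits = (Lambda*S - ln K) / ln 2 and E = K * N * exp(-Lambda*S), with N the effective search space. Because the report prints Lambda and K rounded to three significant figures, the computed values are only close to, not identical with, the reported ones.

```python
import math

# Gapped statistical parameters and search space, copied from the report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 6655800768  # "effective search space used"


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return K_GAPPED * SEARCH_SPACE * math.exp(-LAMBDA_GAPPED * raw_score)


# Raw scores of the five reported alignments.
for raw in (351, 128, 126, 111, 109):
    print(f"raw={raw:4d}  bits={bit_score(raw):6.1f}  E={expect_value(raw):.1e}")
```

For the top hit (raw score 351) this gives a bit score just under 140 and an E value of the same order as the reported 6e-33.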


EST assemble image


No.  Clone        Accession  Start  End
1    MFB064f07_f  BP038657       1  445
2    MWM157b10_f  AV767155     124  435




Lotus japonicus
Kazusa DNA Research Institute