KMC002118A_c01

Fasta Sequence
>KMC002118A_C01 KMC002118A_c01
gggcccccctcgattttttcATGAAAGGATGGAGGCATAGATCTTCTTCCACTCTAAGTT
GAATTAATATACATAAGATTCCCAGTACTGACTTTCAGCATAAGTTACATAAATCATAGA
TGATATAATTAACAGATCAGTTGCATCATAATGAATGTTCAGGCCTCGAATGTGCAAACC
AATTAAAGATTCCAACAGTGAGAAATAGGACAAGCGAGCTCAATGCCAAACTCAGCATCA
AAAAATAAATATCACAACTCTTTTCAGTGAGTTGGCTTTGACAACCTGAGGAAGAGATTG
GAGAAGGGACAGGGATGGCTTTCGAAGCGTGTGATTAAGCAGTAACATAAATGTCGGAAG
TGTTAAGGCTCCAGCTGGATGTGCAGTAGCCAGTGAAACAGGTACATATGAAAGAAGTGT
TGAAACCCCCAATGTGACCTGAAGAGCTGCCATGCCAACTATGCTTCCAATCACAGATCG
TACGGCAGGGTGTATGTCCAGCTTTCTTGTCGCCCCCCACAAAGCAGCCACGGAAATTAG
AGTTGCAGTGGCAAGGATACGGTGATCGAGCTGTACAGTCGATGTATTCTCAAAGAAATT
TCGAATCAG
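
All BLASTX alignments reported for this contig are in frame -1, meaning the protein is read from the reverse-complement strand starting at the last base of the query (position 609). The sketch below illustrates that reading on the final 30 bases of the sequence above. It assumes the standard genetic code; the function names are illustrative and are not part of any BLAST tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame_minus1(seq):
    """Translate frame -1: codons are read on the reverse strand,
    starting from the last base of the input sequence."""
    rc = reverse_complement(seq)
    return "".join(
        CODON_TABLE.get(rc[i:i + 3], "X")  # "X" for any codon with non-ACGT symbols
        for i in range(0, len(rc) - 2, 3)
    )


# Last 30 bases of KMC002118A_c01 (query positions 580-609).
tail = "CGATGTATTCTCAAAGAAATTTCGAATCAG"
print(translate_frame_minus1(tail))
```

The printed peptide should correspond to the first ten residues of the query line that starts at position 609 in the top alignment.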


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002118A_C01 KMC002118A_c01
         (609 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_200420.1| putative protein; protein id: At5g56090.1 [Arab...   172  2e-42
ref|NP_611855.1| CG3803-PA [Drosophila melanogaster] gi|7291690|...    90  2e-17
gb|EAA31207.1| hypothetical protein [Neurospora crassa]                83  3e-15
gb|EAA00923.1| agCP11935 [Anopheles gambiae str. PEST]                 82  7e-15
gb|AAH46017.1| Similar to RIKEN cDNA 2900026G05 gene [Danio rerio]     77  2e-13

>ref|NP_200420.1| putative protein; protein id: At5g56090.1 [Arabidopsis thaliana]
           gi|9758629|dbj|BAB09291.1| contains similarity to
           cytochrome oxidase assembly factor~gene_id:MDA7.15
           [Arabidopsis thaliana] gi|26450079|dbj|BAC42159.1|
           unknown protein [Arabidopsis thaliana]
           gi|28827330|gb|AAO50509.1| unknown protein [Arabidopsis
           thaliana]
          Length = 457

 Score =  172 bits (437), Expect = 2e-42
 Identities = 86/113 (76%), Positives = 100/113 (88%)
 Frame = -1

Query: 609 LIRNFFENTSTVQLDHRILATATLISVAALWGATRKLDIHPAVRSVIGSIVGMAALQVTL 430
           L+RNFFENT+TVQLDHR+LAT TLI++  +W  TRKLDIHPAV+++IGS VGM A+QVTL
Sbjct: 343 LLRNFFENTATVQLDHRLLATTTLIAIGTMWWFTRKLDIHPAVKALIGSTVGMTAVQVTL 402

Query: 429 GVSTLLSYVPVSLATAHPAGALTLPTFMLLLNHTLRKPSLSLLQSLPQVVKAN 271
           GVSTLLSYVPVSL +AH AGALTL T MLLLNHTLR+PS SLL+SLPQV K+N
Sbjct: 403 GVSTLLSYVPVSLGSAHQAGALTLLTLMLLLNHTLRRPSPSLLKSLPQVAKSN 455

>ref|NP_611855.1| CG3803-PA [Drosophila melanogaster] gi|7291690|gb|AAF47112.1|
           CG3803-PA [Drosophila melanogaster]
           gi|17946096|gb|AAL49090.1| RE54691p [Drosophila
           melanogaster]
          Length = 393

 Score = 90.1 bits (222), Expect = 2e-17
 Identities = 49/106 (46%), Positives = 68/106 (63%)
 Frame = -1

Query: 603 RNFFENTSTVQLDHRILATATLISVAALWGATRKLDIHPAVRSVIGSIVGMAALQVTLGV 424
           +N  EN +TVQ +HRIL  +T+    ALW  TR++ +       I ++V MA  Q TLGV
Sbjct: 293 KNITENPTTVQFNHRILGISTVTLTTALWLVTRRMQLPKRANWAINAVVAMAWTQATLGV 352

Query: 423 STLLSYVPVSLATAHPAGALTLPTFMLLLNHTLRKPSLSLLQSLPQ 286
           +TLL+YVPV LATAH +G+L L +F L L+H +R     LL+ LP+
Sbjct: 353 TTLLNYVPVPLATAHQSGSLILLSFALWLSHEVR-----LLKYLPK 393

>gb|EAA31207.1| hypothetical protein [Neurospora crassa]
          Length = 492

 Score = 82.8 bits (203), Expect = 3e-15
 Identities = 49/104 (47%), Positives = 62/104 (59%), Gaps = 4/104 (3%)
 Frame = -1

Query: 603 RNFFENTSTVQLDHRILATATLISVAALWGATR----KLDIHPAVRSVIGSIVGMAALQV 436
           RN  EN S VQLDHRILA  T  +V AL   +R    K  + PA R  I  ++ + +LQV
Sbjct: 369 RNMLENPSLVQLDHRILAMTTFTAVCALVAYSRSGRVKAALPPAARKGITGLLHLVSLQV 428

Query: 435 TLGVSTLLSYVPVSLATAHPAGALTLPTFMLLLNHTLRKPSLSL 304
            LG+STL+  VP+ LA AH AGAL L T +L+    LR P  +L
Sbjct: 429 ALGISTLIYMVPIPLAAAHQAGALALLTGVLVAGQRLRIPKATL 472

>gb|EAA00923.1| agCP11935 [Anopheles gambiae str. PEST]
          Length = 443

 Score = 81.6 bits (200), Expect = 7e-15
 Identities = 42/107 (39%), Positives = 66/107 (61%)
 Frame = -1

Query: 606 IRNFFENTSTVQLDHRILATATLISVAALWGATRKLDIHPAVRSVIGSIVGMAALQVTLG 427
           +RNF EN +TVQ DHR+L TATL  +  ++  +R+  + P       ++  M  +QV LG
Sbjct: 342 LRNFTENPTTVQFDHRVLGTATLTLITGMFLLSRRRLLPPRAYKAATAVAAMGWMQVALG 401

Query: 426 VSTLLSYVPVSLATAHPAGALTLPTFMLLLNHTLRKPSLSLLQSLPQ 286
           ++TLL+YVPV LA +H +G+L L +  + L H L+     L++ LP+
Sbjct: 402 ITTLLTYVPVPLAASHQSGSLVLLSLAIWLTHELK-----LVKRLPK 443

>gb|AAH46017.1| Similar to RIKEN cDNA 2900026G05 gene [Danio rerio]
          Length = 399

 Score = 76.6 bits (187), Expect = 2e-13
 Identities = 39/96 (40%), Positives = 58/96 (59%)
 Frame = -1

Query: 606 IRNFFENTSTVQLDHRILATATLISVAALWGATRKLDIHPAVRSVIGSIVGMAALQVTLG 427
           ++N FEN +TVQ DHRIL   +L ++  L+  +R++ +    +  I  +  MA  QV LG
Sbjct: 301 LKNVFENPTTVQFDHRILGIGSLTAITGLYLFSRRMILPRRAKMAISLLTAMAYTQVVLG 360

Query: 426 VSTLLSYVPVSLATAHPAGALTLPTFMLLLNHTLRK 319
           +STLL YVP  LA  H +G++ L TF + +   LRK
Sbjct: 361 ISTLLLYVPTPLAATHQSGSVALLTFAIWVLAELRK 396

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 524,058,588
Number of Sequences: 1393205
Number of extensions: 11060558
Number of successful extensions: 30156
Number of sequences better than 10.0: 72
Number of HSP's better than 10.0 without gapping: 29018
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 30125
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 24283162270
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
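
The bit scores and E-values in the report can be cross-checked from the footer statistics using the standard Karlin-Altschul relations: S' = (lambda * S - ln K) / ln 2 and E = N * 2^(-S'), where S is the raw score in parentheses and N is the effective search space. The sketch below assumes the gapped lambda and K values apply to these alignments. Because the printed parameters are rounded, the results are expected to agree with the report only approximately.

```python
import math

# Values copied from the report footer (gapped statistics).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 24283162270


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Raw scores taken from the "Score = ... bits (raw)" lines of the first two hits.
for name, raw in [("NP_200420.1", 437), ("NP_611855.1", 222)]:
    bits = bit_score(raw)
    print(f"{name}: {bits:.1f} bits, E = {expect_value(bits):.0e}")
```

For the top hit this gives a bit score just under 173 and an E-value on the order of 2e-42, in line with the reported 172 bits and 2e-42.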


EST assemble image


No.  clone        accession  position (start, end)
1    MPD030d05_f  AV772049   1   612
2    MFB012f09_f  BP034810   25  614




Lotus japonicus
Kazusa DNA Research Institute