KMC000951A_c01

Fasta Sequence
>KMC000951A_C01 KMC000951A_c01
gctaaccgggtgcataatatATCAATAACCAGAAGTGTGATAAGTGACAAAATTCGGAAG
TCAAGACTACAGCAATAACTATATATGTAACAAAAGTTGGAAAAAAAAAGATTAAAAAAG
ACAAACGGGCCTCAAGCCTAGCCAGTTCATGCATCCAATCCTCTCCTTCCAAAGGCTCGA
GAATGGATGGGGCTGTTGCATTGCAGGAAAGTCACAGTCAAACACTTGATGCCAATTTCT
CCGATTCTAGGTGGCAATATCACACCTGCTGCTGTTTCGGTTAGATATCTACACAGGAAA
AAACTCATCTTCAAGAGGCAATCTGTGGAAGCAGATTTCATACAACCTCTGATCACTTGA
TCAACTCGGAAACGTAGAGGCACATTGGTTGCCTCTATCTCTGCTTGTATTCTTCATGGC
TCAACACTCAATTTATCCCCAGTTTTTCTCACATTTTTTATAAAAGCTCCTCTAAAGAAT
CTATGCTGAAGAGCCACGATCCTACTGCCATGAGGGGTGTTTGTAACTGAATCTATCAGT
TCCATCCCAACCACCATCAGGTTCTTTTCTTTTCAATGCACCT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC000951A_C01 KMC000951A_c01
         (583 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_190388.1| putative protein; protein id: At3g48050.1 [Arab...    38  0.11
emb|CAC69851.1| hypothetical protein [Nicotiana tabacum]               33  2.0
ref|NP_190389.1| putative protein; protein id: At3g48060.1 [Arab...    33  2.6
gb|EAA11905.1| ebiP4537 [Anopheles gambiae str. PEST]                  33  3.4
gb|AAH26761.1| similar to Dystrophia myotonica-containing WD rep...    32  5.8

>ref|NP_190388.1| putative protein; protein id: At3g48050.1 [Arabidopsis thaliana]
            gi|7487129|pir||T06678 hypothetical protein T17F15.80 -
            Arabidopsis thaliana gi|4678323|emb|CAB41134.1| putative
            protein [Arabidopsis thaliana]
          Length = 1613

 Score = 37.7 bits (86), Expect = 0.11
 Identities = 16/26 (61%), Positives = 18/26 (68%)
 Frame = -2

Query: 582  GALKRKEPDGGWDGTDRFSYKHPSWQ 505
            G LKRKEP+GGWDG     Y+  SWQ
Sbjct: 1593 GVLKRKEPEGGWDG-----YRQSSWQ 1613

>emb|CAC69851.1| hypothetical protein [Nicotiana tabacum]
          Length = 305

 Score = 33.5 bits (75), Expect = 2.0
 Identities = 15/25 (60%), Positives = 19/25 (76%)
 Frame = -2

Query: 582 GALKRKEPDGGWDGTDRFSYKHPSW 508
           G LKRKEP+GGWD ++ F +K  SW
Sbjct: 282 GVLKRKEPEGGWD-SENFRFKQ-SW 304

>ref|NP_190389.1| putative protein; protein id: At3g48060.1 [Arabidopsis thaliana]
            gi|7487128|pir||T06677 hypothetical protein T17F15.70 -
            Arabidopsis thaliana gi|4678322|emb|CAB41133.1| putative
            protein [Arabidopsis thaliana]
          Length = 1611

 Score = 33.1 bits (74), Expect = 2.6
 Identities = 12/14 (85%), Positives = 13/14 (92%)
 Frame = -2

Query: 582  GALKRKEPDGGWDG 541
            G LKRKEP+GGWDG
Sbjct: 1593 GVLKRKEPEGGWDG 1606

>gb|EAA11905.1| ebiP4537 [Anopheles gambiae str. PEST]
          Length = 1015

 Score = 32.7 bits (73), Expect = 3.4
 Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 12/81 (14%)
 Frame = -3

Query: 401 EIEATNVPLR------FRVDQVIRGCMKSASTDC------LLKMSFFLCRYLTETAAGVI 258
           +++A + PL       FRV+  + G +K+ STDC      LL M   LC+ LT  +  ++
Sbjct: 397 KLDANSAPLTMMLKFVFRVESPLAGILKALSTDCSKVQDALLAM---LCQVLTGNSLDLV 453

Query: 257 LPPRIGEIGIKCLTVTFLQCN 195
           L     E   K      L+CN
Sbjct: 454 LSVASVEGKFKAFISGLLKCN 474

>gb|AAH26761.1| similar to Dystrophia myotonica-containing WD repeat [Mus musculus]
          Length = 539

 Score = 32.0 bits (71), Expect = 5.8
 Identities = 25/78 (32%), Positives = 37/78 (47%)
 Frame = +3

Query: 126 RASSLASSCIQSSPSKGSRMDGAVALQESHSQTLDANFSDSRWQYHTCCCFG*ISTQEKT 305
           R++SL  S + ++ SKGS MDGA+A   S   TL  +  D + ++H           EK 
Sbjct: 401 RSNSLPHSAVSNAASKGSVMDGAIASGVSKFATL--SLHDRKERHH-----------EKD 447

Query: 306 HLQEAICGSRFHTTSDHL 359
           H +    G     +SD L
Sbjct: 448 HKRNHSMGHISSKSSDKL 465
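
The "Frame" values in the alignments above indicate which of the six reading frames of the query was translated. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and the usual BLAST convention that frame -2 starts one base in from the 3' end of the query, the following reproduces the first 14 residues of the frame -2 query line from the last row of the Fasta sequence. The helper names reverse_complement and translate_frame are hypothetical and are not part of any BLAST tool.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna):
    """Return the reverse complement of an A/C/G/T string (hypothetical helper)."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(dna, frame):
    """Translate dna in a BLAST-style frame: +1..+3 (forward) or -1..-3 (reverse)."""
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +1, +2, +3, -1, -2, -3")
    seq = dna.upper() if frame > 0 else reverse_complement(dna)
    offset = abs(frame) - 1
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    # Codons containing anything other than A/C/G/T are reported as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last row of the Fasta sequence above (query positions 541-583).
tail = "TCCATCCCAACCACCATCAGGTTCTTTTCTTTTCAATGCACCT"
print(translate_frame(tail, -2))  # GALKRKEPDGGWDG
```

The printed peptide matches the query segment 582-541 shown in the alignment to NP_190389.1.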

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 504,551,739
Number of Sequences: 1393205
Number of extensions: 10651351
Number of successful extensions: 27012
Number of sequences better than 10.0: 14
Number of HSP's better than 10.0 without gapping: 26360
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 27003
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21712003912
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
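
The statistics above are enough to recompute the reported scores. As a rough sketch, assuming the usual Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, expected hits = K * search space * exp(-lambda * S)) and the gapped Lambda and K from this report, the raw scores in parentheses convert to the listed bit scores. The function names are hypothetical. The effective database length formula (letters minus sequences times effective HSP length) is also an assumption, although it reproduces the figure reported here. E-values estimated this way come out close to the reported ones (about 0.1 for the top hit) but not identical, so treat them as an approximation of what this BLAST version computes.

```python
import math

# Gapped Karlin-Altschul parameters and search space copied from the report above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 21712003912


def bit_score(raw_score):
    """Convert a raw alignment score to a bit score."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(raw_score, search_space=SEARCH_SPACE):
    """Approximate E-value for a raw score over the given search space."""
    return GAPPED_K * search_space * math.exp(-GAPPED_LAMBDA * raw_score)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Assumed relation: total letters minus one effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


for raw in (86, 75, 74, 73, 71):  # raw scores of the five hits listed above
    print(raw, round(bit_score(raw), 1), f"{expect(raw):.2g}")

print(effective_db_length(448_689_247, 1_393_205, 117))  # 285684262
```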


EST assembly


No.  Clone        Accession  Start  End
1    GENLf047b12  BP064824       1  492
2    MWM180e02_f  AV767509      21  583
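
The two position values per clone can be read as where each EST sits on the contig. A small sketch, assuming the positions are 1-based inclusive start and end coordinates on the contig (the record itself does not state this), shows that the two ESTs together span positions 1 to 583, which agrees with the 583-letter query length in the BLASTX report. The function names are hypothetical.

```python
# (clone, accession, start, end) rows copied from the table above.
ests = [
    ("GENLf047b12", "BP064824", 1, 492),
    ("MWM180e02_f", "AV767509", 21, 583),
]


def contig_span(rows):
    """Return the (first, last) contig position covered by any EST."""
    return min(r[2] for r in rows), max(r[3] for r in rows)


def overlap_length(a, b):
    """Number of contig positions covered by both ESTs (0 if they do not overlap)."""
    start = max(a[2], b[2])
    end = min(a[3], b[3])
    return max(0, end - start + 1)


print(contig_span(ests))               # (1, 583)
print(overlap_length(ests[0], ests[1]))  # 472
```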




Lotus japonicus
Kazusa DNA Research Institute