KMC001733A_c01

Fasta Sequence
>KMC001733A_C01 KMC001733A_c01
TTCGGCACGAGGATTTCTCTCCAATTCTCTCTCTCTTTTCTCTCCAATTCCAAACTCTCT
TGTTTTTCCTTCATAGTCCTATAAAATTTGGCCATGTAATCCTGTAGTAGAACAACAAAA
AGAGACTATAGAGTGTGTGTGTAAGAAAGATAATTGGTCACATCTTCTGTTGTTTATAAT
TTGTTAGTTGTTTGTGTGTGTTGGAATTGTTAGATGGGTGTTGAAGAGGGAGAAGAAGTG
TTCAGAAAGAAGGGACACATGGAAGTTAGTGGAGATACATGGTTGTATGCTAGCTGCCTT
AAGAACTGTGCAAACTGGCGCTGTTTAAGATACCAATCTCCATCTTCAATCACTACTACT
CCCTTTAAGTAAAAACAAACAACAACCTTGTCAAGTTGTGATCCTCTCACTTTTTCTTAA
TTACATTTTAAGGGACCAAAACTTAAGTACCTTGATCTAACAAGATCTGTGATTTTCCAC
TCATGAAGAGACTTGGAAGATCTGATTCTGTACGTGCTCTCATGAAGA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001733A_C01 KMC001733A_c01
         (528 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_705048.1| hypothetical protein [Plasmodium falciparum 3D7...    35  0.54
ref|NP_567205.1| contains similarity to gag proteins; protein id...    33  2.1
dbj|BAC42619.1| unknown protein [Arabidopsis thaliana] gi|290288...    33  2.1
gb|ZP_00111270.1| hypothetical protein [Nostoc punctiforme]            31  7.9
ref|NP_501784.1| Putative membrane protein family member, with a...    31  7.9

>ref|NP_705048.1| hypothetical protein [Plasmodium falciparum 3D7]
            gi|23615293|emb|CAD52284.1| hypothetical protein
            [Plasmodium falciparum 3D7]
          Length = 1205

 Score = 35.0 bits (79), Expect = 0.54
 Identities = 17/52 (32%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
 Frame = -1

Query: 264  FHVSLLSEHFFSLFNTHLTIP---THTNN*QIINNRRCDQLSFLHTHSIVSF 118
            +++ LL+EHF S   +H+ I     ++NN  I NN +  Q + ++TH  + F
Sbjct: 1110 YNIKLLAEHFKSYVESHIKIKINKNYSNNNVISNNNKTTQNNTINTHEYLKF 1161
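
The Query line above can be reproduced from the contig itself. As a minimal sketch (not the BLAST implementation), the Python below assumes the standard genetic code and that "Frame = -1" with Query coordinates 264 to 118 means those contig positions read on the reverse strand. The helper names (reverse_complement, translate_minus_strand) are illustrative only and do not come from BLAST.

```python
# Contig sequence copied from the "Fasta Sequence" section of this record.
CONTIG = (
    "TTCGGCACGAGGATTTCTCTCCAATTCTCTCTCTCTTTTCTCTCCAATTCCAAACTCTCT"
    "TGTTTTTCCTTCATAGTCCTATAAAATTTGGCCATGTAATCCTGTAGTAGAACAACAAAA"
    "AGAGACTATAGAGTGTGTGTGTAAGAAAGATAATTGGTCACATCTTCTGTTGTTTATAAT"
    "TTGTTAGTTGTTTGTGTGTGTTGGAATTGTTAGATGGGTGTTGAAGAGGGAGAAGAAGTG"
    "TTCAGAAAGAAGGGACACATGGAAGTTAGTGGAGATACATGGTTGTATGCTAGCTGCCTT"
    "AAGAACTGTGCAAACTGGCGCTGTTTAAGATACCAATCTCCATCTTCAATCACTACTACT"
    "CCCTTTAAGTAAAAACAAACAACAACCTTGTCAAGTTGTGATCCTCTCACTTTTTCTTAA"
    "TTACATTTTAAGGGACCAAAACTTAAGTACCTTGATCTAACAAGATCTGTGATTTTCCAC"
    "TCATGAAGAGACTTGGAAGATCTGATTCTGTACGTGCTCTCATGAAGA"
)

# Standard genetic code in TCAG order (assumption: no alternative code table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an uppercase ACGT string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_minus_strand(seq, high, low):
    """Translate 1-based contig positions high..low read on the reverse strand."""
    segment = reverse_complement(seq[low - 1:high])
    return "".join(
        CODON_TABLE[segment[i:i + 3]] for i in range(0, len(segment) - 2, 3)
    )


# Coordinates taken from the Query line of the first alignment.
peptide = translate_minus_strand(CONTIG, 264, 118)
print(len(CONTIG), peptide)
```

The printed peptide corresponds to the Query line with its three alignment gap characters removed; the "*" marks a stop codon in that reading frame.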

>ref|NP_567205.1| contains similarity to gag proteins; protein id: At4g00980.1
           [Arabidopsis thaliana] gi|7485280|pir||T01549
           hypothetical protein A_TM018A10.1 - Arabidopsis thaliana
           gi|2252849|gb|AAB62847.1| contains similarity to gag
           proteins [Arabidopsis thaliana]
           gi|7267595|emb|CAB80907.1| contains similarity to gag
           proteins~similarity to~Contains TonB-dependent receptor
           proteins signatures AA1-6~contains EST gb:AI994181.1,
           T45472, AI994919.1 [Arabidopsis thaliana]
          Length = 462

 Score = 33.1 bits (74), Expect = 2.1
 Identities = 29/88 (32%), Positives = 39/88 (43%), Gaps = 1/88 (1%)
 Frame = -2

Query: 344 DGDWYLK-QRQFAQFLRQLAYNHVSPLTSMCPFFLNTSSPSSTPI*QFQHTQTTNKL*TT 168
           DG  YL    Q   FL+QL   +V  L+  CP   ++  P + P  +      T K    
Sbjct: 173 DGKSYLYWASQMELFLKQLKLTYV--LSEPCPSIGSSQGPETNPR-EITRADATGKKWLR 229

Query: 167 EDVTNYLSYTHTL*SLFVVLLQDYMAKF 84
           +D   YL YTH + SL   L + Y  KF
Sbjct: 230 DD---YLCYTHLMNSLSDHLYRRYSQKF 254

>dbj|BAC42619.1| unknown protein [Arabidopsis thaliana] gi|29028884|gb|AAO64821.1|
           At4g00980 [Arabidopsis thaliana]
          Length = 488

 Score = 33.1 bits (74), Expect = 2.1
 Identities = 29/88 (32%), Positives = 39/88 (43%), Gaps = 1/88 (1%)
 Frame = -2

Query: 344 DGDWYLK-QRQFAQFLRQLAYNHVSPLTSMCPFFLNTSSPSSTPI*QFQHTQTTNKL*TT 168
           DG  YL    Q   FL+QL   +V  L+  CP   ++  P + P  +      T K    
Sbjct: 199 DGKSYLYWASQMELFLKQLKLTYV--LSEPCPSIGSSQGPETNPR-EITRADATGKKWLR 255

Query: 167 EDVTNYLSYTHTL*SLFVVLLQDYMAKF 84
           +D   YL YTH + SL   L + Y  KF
Sbjct: 256 DD---YLCYTHLMNSLSDHLYRRYSQKF 280

>gb|ZP_00111270.1| hypothetical protein [Nostoc punctiforme]
          Length = 210

 Score = 31.2 bits (69), Expect = 7.9
 Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 9/73 (12%)
 Frame = -2

Query: 368 LKGVVVIEDGDWYLKQRQFAQFLR---------QLAYNHVSPLTSMCPFFLNTSSPSSTP 216
           L+G+ VI   D Y   +Q  Q +          Q+AYNH+S     CP    ++    TP
Sbjct: 125 LQGIKVIRSKDTYFSSQQPRQQVTLAPGKQASFQIAYNHISSPQENCPI---SNKIEITP 181

Query: 215 I*QFQHTQTTNKL 177
              +QH   T ++
Sbjct: 182 PNAYQHLTVTEEI 194

>ref|NP_501784.1| Putative membrane protein family member, with a transmembrane
           domain, of bilaterial origin [Caenorhabditis elegans]
           gi|6137288|sp|O45435|YV4Q_CAEEL Hypothetical protein
           F32B6.9 in chromosome IV gi|7500332|pir||T21644
           hypothetical protein F32B6.9 - Caenorhabditis elegans
           gi|3876567|emb|CAB03043.1| Hypothetical protein F32B6.9
           [Caenorhabditis elegans]
          Length = 413

 Score = 31.2 bits (69), Expect = 7.9
 Identities = 18/48 (37%), Positives = 28/48 (57%), Gaps = 3/48 (6%)
 Frame = -2

Query: 374 FYLKGVVVIED--GDWYLKQRQFAQFLRQL-AYNHVSPLTSMCPFFLN 240
           FYLKG+ +I+D   D    +R F  F RQ  +Y  + PLT +  F+++
Sbjct: 41  FYLKGIDLIDDDEDDRLKMRRMFETFCRQCDSYTRLIPLTFLLGFYVS 88

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 435,695,482
Number of Sequences: 1393205
Number of extensions: 9110512
Number of successful extensions: 21969
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 21210
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 21917
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 17308240320
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
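
The length figures in the statistics above are related by simple arithmetic. The sketch below assumes the effective database length is the total letter count minus one effective HSP length per database sequence, which reproduces the value printed in this report. The effective query length is not printed; the value derived here is only what the reported search space implies. All constant and function names are illustrative.

```python
# Values copied from the statistics block of this report.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 115
REPORTED_EFFECTIVE_DB_LENGTH = 288_470_672
REPORTED_SEARCH_SPACE = 17_308_240_320


def effective_db_length(letters, sequences, hsp_length):
    """Assumed relation: subtract one HSP length for every database sequence."""
    return letters - sequences * hsp_length


eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)

# The report does not state the effective query length, so infer it from
# search space = effective query length * effective database length.
implied_query_length, remainder = divmod(REPORTED_SEARCH_SPACE, eff_db)

print("effective database length:", eff_db)
print("matches report:", eff_db == REPORTED_EFFECTIVE_DB_LENGTH)
print("implied effective query length:", implied_query_length)
print("division is exact:", remainder == 0)
```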


EST assemble image


no.  clone          accession   start   end
1    GNf067e07      BP072347        1   492
2    GENf012c03     BP058847        4   512
3    MPD017h04_f    AV771198       18   531
4    MFBL021a07_f   BP042289       52   501




Lotus japonicus
Kazusa DNA Research Institute