KMC003192A_c01

Fasta Sequence
>KMC003192A_C01 KMC003192A_c01
atttatattcatgcgtagatcaaaagctatgagAAATAAGTTTTAAATTTCTCTAAAATT
AAAAGAAAGAAATTAGTAATAACAATGCTTAAAGCATGAGTTCATTTGGAGATAGCGCGC
GCTAATAACAACAAACGAAGATCAATCCTAAAAGAAGTAGTTCCTTATTGATACATGAAA
TGAACAACTTGAACTAATGTACAGACTTAAAGTCGGTCTATGACAAAGGTGGAACCATTC
AAACAAAATTATTTTCATCTTAACCATCAGTTCTTCCTCATCTAACTAAAGGCATAAGAA
GGTACTCTGCAAATAAGAGCTTCCAGGCAAGCATGTAGAAGGATGACAAGGAAGCTTGGC
TTTTCAAATCTACAGATTTTGCATTGTGCCAAAGAACTGAAGCCAGAACAACATTTCCCA
TACCCGTGACAATTTTACTCCACATGAAAGATGATGTTGCTCCCACCCCAAGAGCGATAC
CAAAAGCTATTTCAAAAAGGGAAACACAAAACCAAAATACCGGTTTTTGACCAAAACGTG
TTGCAAAAGTTGAGATTCCATGTTTTTTATCTCCTTCAATGTCAGGTAAGTCCTGGAACA
ATGCTATTCCTATCGAGTACAAGCTCATGAACACCATTACAAAATTCAATGA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003192A_C01 KMC003192A_c01
         (652 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_179485.2| hypothetical protein; protein id: At2g18950.1, ...   146  2e-34
pir||T01623 hypothetical protein At2g18950 [imported] - Arabidop...   114  9e-25
gb|ZP_00107768.1| hypothetical protein [Nostoc punctiforme]            77  3e-13
ref|NP_441094.1| unknown protein [Synechocystis sp. PCC 6803] gi...    74  2e-12
ref|NP_487488.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...    72  5e-12

>ref|NP_179485.2| hypothetical protein; protein id: At2g18950.1, supported by cDNA:
           gi_17104827, supported by cDNA: gi_17380873, supported
           by cDNA: gi_20384918, supported by cDNA: gi_21281071
           [Arabidopsis thaliana]
           gi|17104828|gb|AAL35412.1|AF324344_1 tocopherol
           polyprenyltransferase [Arabidopsis thaliana]
           gi|17380874|gb|AAL36249.1| unknown protein [Arabidopsis
           thaliana] gi|20384919|gb|AAM10489.1| homogentisate
           phytylprenyltransferase [Arabidopsis thaliana]
           gi|21281072|gb|AAM45041.1| unknown protein [Arabidopsis
           thaliana]
          Length = 393

 Score =  146 bits (369), Expect = 2e-34
 Identities = 66/123 (53%), Positives = 93/123 (74%)
 Frame = -1

Query: 649 LNFVMVFMSLYSIGIALFQDLPDIEGDKKHGISTFATRFGQKPVFWFCVSLFEIAFGIAL 470
           L F   FMS +S+ IALF+D+PDIEGDK  GI +F+   GQK VFW CV+L ++A+ +A+
Sbjct: 271 LIFATAFMSFFSVVIALFKDIPDIEGDKIFGIRSFSVTLGQKRVFWTCVTLLQMAYAVAI 330

Query: 469 GVGATSSFMWSKIVTGMGNVVLASVLWHNAKSVDLKSQASLSSFYMLAWKLLFAEYLLMP 290
            VGATS F+WSK+++ +G+V+LA+ LW  AKSVDL S+  ++S YM  WKL +AEYLL+P
Sbjct: 331 LVGATSPFIWSKVISVVGHVILATTLWARAKSVDLSSKTEITSCYMFIWKLFYAEYLLLP 390

Query: 289 LVR 281
            ++
Sbjct: 391 FLK 393

>pir||T01623 hypothetical protein At2g18950 [imported] - Arabidopsis thaliana
           gi|3004556|gb|AAC09029.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 210

 Score =  114 bits (286), Expect = 9e-25
 Identities = 55/112 (49%), Positives = 80/112 (71%), Gaps = 1/112 (0%)
 Frame = -1

Query: 643 FVMVFMSLYSIGIAL-FQDLPDIEGDKKHGISTFATRFGQKPVFWFCVSLFEIAFGIALG 467
           F  +F+S + +G A    D+PDIEGDK  GI +F+   GQK VFW CV+L ++A+ +A+ 
Sbjct: 97  FWALFVS-FMLGTAYSINDIPDIEGDKIFGIRSFSVTLGQKRVFWTCVTLLQMAYAVAIL 155

Query: 466 VGATSSFMWSKIVTGMGNVVLASVLWHNAKSVDLKSQASLSSFYMLAWKLLF 311
           VGATS F+WSK+++ +G+V+LA+ LW  AKSVDL S+  ++S YM  WK+ F
Sbjct: 156 VGATSPFIWSKVISVVGHVILATTLWARAKSVDLSSKTEITSCYMFIWKVRF 207

>gb|ZP_00107768.1| hypothetical protein [Nostoc punctiforme]
          Length = 322

 Score = 76.6 bits (187), Expect = 3e-13
 Identities = 35/116 (30%), Positives = 67/116 (57%)
 Frame = -1

Query: 634 VFMSLYSIGIALFQDLPDIEGDKKHGISTFATRFGQKPVFWFCVSLFEIAFGIALGVGAT 455
           +F+ +++  IA+F+D+PDIEGD+ + I+TF  + G + VF   + +  + +   + VG  
Sbjct: 202 LFILVFTFAIAIFKDIPDIEGDRLYNITTFTIKLGSQAVFNLALWVITVCYLGIILVGVL 261

Query: 454 SSFMWSKIVTGMGNVVLASVLWHNAKSVDLKSQASLSSFYMLAWKLLFAEYLLMPL 287
                + I     ++ L   +W  + +VDL+ +++++ FY   WKL F EYL+ P+
Sbjct: 262 RIASVNPIFLITAHLALLVWMWWRSLAVDLQDKSAIAQFYQFIWKLFFIEYLIFPI 317

>ref|NP_441094.1| unknown protein [Synechocystis sp. PCC 6803] gi|7470486|pir||S74813
           hypothetical protein slr1736 - Synechocystis sp. (strain
           PCC 6803) gi|1652856|dbj|BAA17774.1|
           ORF_ID:slr1736~unknown protein [Synechocystis sp. PCC
           6803]
          Length = 308

 Score = 73.6 bits (179), Expect = 2e-12
 Identities = 37/117 (31%), Positives = 71/117 (60%), Gaps = 1/117 (0%)
 Frame = -1

Query: 634 VFMSLYSIGIALFQDLPDIEGDKKHGISTFATRFGQKPVFWFCVSLFEIAFGIALGV-GA 458
           +F+ ++++ IA+F+D+PD+EGD++  I T   + G++ VF   + L    + +A+ + G 
Sbjct: 181 LFILVFTVAIAIFKDVPDMEGDRQFKIQTLTLQIGKQNVFRGTLILLTGCY-LAMAIWGL 239

Query: 457 TSSFMWSKIVTGMGNVVLASVLWHNAKSVDLKSQASLSSFYMLAWKLLFAEYLLMPL 287
            ++   +     + ++ L ++LW  ++ V L+S+  ++SFY   WKL F EYLL PL
Sbjct: 240 WAAMPLNTAFLIVSHLCLLALLWWRSRDVHLESKTEIASFYQFIWKLFFLEYLLYPL 296

>ref|NP_487488.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25535404|pir||AI2236
           hypothetical protein alr3448 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17132581|dbj|BAB75147.1|
           ORF_ID:alr3448~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 318

 Score = 72.4 bits (176), Expect = 5e-12
 Identities = 31/116 (26%), Positives = 66/116 (56%)
 Frame = -1

Query: 634 VFMSLYSIGIALFQDLPDIEGDKKHGISTFATRFGQKPVFWFCVSLFEIAFGIALGVGAT 455
           VF+ +++  IA+F+D+PD+EGD+ + I+T   + G + VF   + +  + +   + +G  
Sbjct: 198 VFILIFTFAIAIFKDIPDMEGDRLYNITTLTIQLGPQAVFNLAMWVLTVCYLGMVIIGVL 257

Query: 454 SSFMWSKIVTGMGNVVLASVLWHNAKSVDLKSQASLSSFYMLAWKLLFAEYLLMPL 287
                + +   + ++V+   +W  + +VD+  + +++ FY   WKL F EYL+ P+
Sbjct: 258 RLGTINSVFLVVTHLVILCWMWMQSLAVDIHDKTAIAQFYQFIWKLFFLEYLMFPI 313

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 525,706,355
Number of Sequences: 1393205
Number of extensions: 11136723
Number of successful extensions: 26874
Number of sequences better than 10.0: 52
Number of HSP's better than 10.0 without gapping: 26284
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26866
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27860523586
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
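
The raw scores shown in parentheses in each alignment header can be related to the bit scores through the gapped Lambda and K values listed above, using the usual Karlin-Altschul normalisation S' = (Lambda * S - ln K) / ln 2. The Python sketch below is an illustration under that assumption, not BLAST's own code; the function name is hypothetical, and the constants are the rounded values printed in this report.

```python
import math

# Rounded values copied from the "Gapped" Lambda/K line of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the five alignments listed in this report.
for raw in (369, 286, 187, 179, 176):
    print(raw, round(bit_score(raw), 2))
```

With these rounded constants, the raw scores 187, 179 and 176 come out within 0.1 bit of the reported 76.6, 73.6 and 72.4 bits. The two largest hits are listed as whole numbers (146 and 114), so only agreement to within about one bit can be expected there.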


EST assembly


No.  Clone        Accession  Start  End
1    MWM217b10_f  AV768059       1  360
2    GNf097h12    BP074594      34  512
3    MR067f11_f   BP081176      57  450
4    GNf073h01    BP072795      63  564
5    MR087g10_f   BP082710      71  169
6    GNf025f12    BP069193      75  369
7    GNf065b03    BP072172      75  492
8    MR098d12_f   BP083509     150  654
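
The clone positions above can be summarised as an overall span and a per-position coverage depth. The Python sketch below assumes that the two numbers given for each clone are 1-based, inclusive start and end coordinates on the assembly; that reading is an assumption, and the function names are hypothetical.

```python
# Clone name, start, end, copied from the table above.
# Assumption: coordinates are 1-based and inclusive on the assembly.
CLONES = [
    ("MWM217b10_f", 1, 360),
    ("GNf097h12", 34, 512),
    ("MR067f11_f", 57, 450),
    ("GNf073h01", 63, 564),
    ("MR087g10_f", 71, 169),
    ("GNf025f12", 75, 369),
    ("GNf065b03", 75, 492),
    ("MR098d12_f", 150, 654),
]


def assembly_span(clones):
    """Return (lowest start, highest end) over all clones."""
    return min(s for _, s, _ in clones), max(e for _, _, e in clones)


def depth_at(clones, position):
    """Count the clones whose interval contains the given position."""
    return sum(1 for _, s, e in clones if s <= position <= e)


print(assembly_span(CLONES))  # (1, 654)
print(depth_at(CLONES, 160))  # 8
print(depth_at(CLONES, 600))  # 1
```

Under that reading, all eight clones overlap around position 160, while the region beyond position 564 is supported by a single clone (MR098d12_f).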




Lotus japonicus
Kazusa DNA Research Institute