KMC007123A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC007123A_C01 KMC007123A_c01
caaagtAACAAAAAAAACTGGTTTCTTTAATATCTTTCAATCTGCTGAAACCTTATCATC
AGTTTCAAATTATGTCCTTCTACCCAAAAGGAAAAAAATGTATGCTCACATCCTCATAAT
ATCAATTGAATGAGGAAACTTGGAAGAAAAACAAGAATAGCACTCAGCAATGAGCACAAA
AAATCAAAAATAGAGTGGAGAAAACCATTTGCGCTAACCTACGATCAAACTTCTTATAAA
TTAAATTGCTTTGTGGTGTCCACTAATGAAGCAGATGAGCATCTTGAGAAAAATCTAGTC
AATCAAGGAAGATTCGAGTCATAATCTTGGTCTCGGGCCTACTGAACTCCTATCCAACGA
GCATAGTCCAACCACAATAATGAACCCAAATATGTATTGGCGGTTCGAACAGCAAAGCAA
AGTGGGCTAAGCACAAGCTTGTGCTGGTGCAGTAAAG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC007123A_C01 KMC007123A_c01
         (457 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_568280.1| putative protein; protein id: At5g12470.1, supp...    69  3e-11
ref|NP_565930.1| chloroplast lumen common protein family; protei...    37  0.12
ref|NP_191173.2| chloroplast lumen common protein family; protei...    35  0.28
pir||T47731 hypothetical protein F18O21.100 - Arabidopsis thalia...    35  0.28
ref|NP_507780.1| Predicted CDS, DNA polymerase type B, organella...    32  3.0

>ref|NP_568280.1| putative protein; protein id: At5g12470.1, supported by cDNA:
           gi_20268751, supported by cDNA: gi_21281148 [Arabidopsis
           thaliana] gi|14586377|emb|CAC42908.1| putative protein
           [Arabidopsis thaliana] gi|20268752|gb|AAM14079.1|
           unknown protein [Arabidopsis thaliana]
           gi|21281149|gb|AAM45049.1| unknown protein [Arabidopsis
           thaliana] gi|27311697|gb|AAO00814.1| putative protein
           [Arabidopsis thaliana]
          Length = 386

 Score = 68.6 bits (166), Expect = 3e-11
 Identities = 30/38 (78%), Positives = 34/38 (88%)
 Frame = -3

Query: 455 LLHQHKLVLSPLCFAVRTANTYLGSLLWLDYARWIGVQ 342
           +LHQHKL LS LCFAVRT NT+LGSLLW+DYAR IG+Q
Sbjct: 346 MLHQHKLALSALCFAVRTGNTFLGSLLWVDYARLIGIQ 383

>ref|NP_565930.1| chloroplast lumen common protein family; protein id: At2g40400.1,
           supported by cDNA: gi_15294187, supported by cDNA:
           gi_20857081 [Arabidopsis thaliana]
           gi|25344247|pir||A84829 hypothetical protein At2g40400
           [imported] - Arabidopsis thaliana
           gi|4586056|gb|AAD25674.1| chloroplast lumen common
           protein family [Arabidopsis thaliana]
           gi|15294188|gb|AAK95271.1|AF410285_1 At2g40400/T3G21.17
           [Arabidopsis thaliana] gi|20857082|gb|AAM26698.1|
           At2g40400/T3G21.17 [Arabidopsis thaliana]
          Length = 735

 Score = 36.6 bits (83), Expect = 0.12
 Identities = 22/73 (30%), Positives = 36/73 (49%), Gaps = 4/73 (5%)
 Frame = -3

Query: 452 LHQHKLVLSPLCFAVRTANTYLGSLLWLDYARWIGVQ*ARDQDYDSNLP*L----TRFFS 285
           L    L+++ + F VR AN+Y G+  W+D AR  G+Q  +     + +P +    T  +S
Sbjct: 663 LSSQPLLVNMISFVVRVANSYFGTQQWIDLARSTGLQTQKSVTTSNQIPEVASQSTVEYS 722

Query: 284 RCSSASLVDTTKQ 246
               AS+ D   Q
Sbjct: 723 TTEEASMDDLKNQ 735

>ref|NP_191173.2| chloroplast lumen common protein family; protein id: At3g56140.1,
           supported by cDNA: gi_20260423 [Arabidopsis thaliana]
           gi|20260424|gb|AAM13110.1| putative protein [Arabidopsis
           thaliana]
          Length = 745

 Score = 35.4 bits (80), Expect = 0.28
 Identities = 15/37 (40%), Positives = 23/37 (61%)
 Frame = -3

Query: 452 LHQHKLVLSPLCFAVRTANTYLGSLLWLDYARWIGVQ 342
           L    L+++ + F VRT N+Y G+  W+D AR  G+Q
Sbjct: 672 LSSQPLLVNAISFVVRTLNSYFGTQQWIDLARSTGLQ 708

>pir||T47731 hypothetical protein F18O21.100 - Arabidopsis thaliana
           gi|7572912|emb|CAB87413.1| putative protein [Arabidopsis
           thaliana]
          Length = 755

 Score = 35.4 bits (80), Expect = 0.28
 Identities = 15/37 (40%), Positives = 23/37 (61%)
 Frame = -3

Query: 452 LHQHKLVLSPLCFAVRTANTYLGSLLWLDYARWIGVQ 342
           L    L+++ + F VRT N+Y G+  W+D AR  G+Q
Sbjct: 682 LSSQPLLVNAISFVVRTLNSYFGTQQWIDLARSTGLQ 718

>ref|NP_507780.1| Predicted CDS, DNA polymerase type B, organellar and viral family
           member [Caenorhabditis elegans] gi|7496497|pir||T19459
           hypothetical protein C25F9.2 - Caenorhabditis elegans
           gi|3874466|emb|CAB03918.1| Hypothetical protein C25F9.2
           [Caenorhabditis elegans]
          Length = 1469

 Score = 32.0 bits (71), Expect = 3.0
 Identities = 20/75 (26%), Positives = 32/75 (42%)
 Frame = +1

Query: 67  KLCPSTQKEKNVCSHPHNIN*MRKLGRKTRIALSNEHKKSKIEWRKPFALTYDQTSYKLN 246
           K+C    ++ +VC HP                 + + KK K E +K    TY    Y + 
Sbjct: 561 KICKEKLEKNHVCEHP---------------LPTEKDKKKKREKQK----TYKVIVYDME 601

Query: 247 CFVVSTNEADEHLEK 291
           C V ++ E  EH+E+
Sbjct: 602 CIVANSGEYTEHVER 616

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 363,890,212
Number of Sequences: 1393205
Number of extensions: 6803269
Number of successful extensions: 14170
Number of sequences better than 10.0: 15
Number of HSP's better than 10.0 without gapping: 13884
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 14162
length of database: 448,689,247
effective HSP length: 112
effective length of database: 292,650,287
effective search space used: 11413361193
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 GENf031b09 BP059652 1 357
2 MFBL044h05_f BP043524 7 457




Lotus japonicus
Kazusa DNA Research Institute