KMC005782A_c01

Fasta Sequence
>KMC005782A_C01 KMC005782A_c01
gctAGTACAAAATGCTTGATTGAATTGCATACTGAAAGGATATACAATATACATAATCTG
AGGAGCAAGAGACATAAGAGAGGAATGAAATAATGAAGTATTCCATTTTGCTCATAAAGA
CAAATATAAGAATCCTCTCTTGCTCATAAGAATCCTACACTGTAGGAAGGCACAAATCTG
TATTTATATAGGAAAGATGGCTGTGGGGAAATTTCTCCTTCATCATGAGCATTTCTTTAT
ATGAACTTAAATGATGTGACCGCCTTCTGAATTTCGGAAGCATACTTGTCTGTTTCCTCT
TCCACATACTGTCCTGTCACAGTATACAGTCTGTTATACCAACCATTTGTTGCCATCCCT
ATAGCTGAATATAGATGTCTGCGACTCTCACCAGGATTTTGCAACGAGTACTCGATATAA
TAAATTCCTTTAGATGATTTACTATTTATGAGTTTAGCAGCTACACCAGGCGGTTTTCTC
CAGCTTCTGTCCAGCCCACTAACCAGAGTCTCAGCAAACTCATCAACCTTGCCAAACGAT
TCCATCTTAGTGAAATCCGGCCC
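
The BLASTX alignments reported for this contig are in frame -1, meaning the protein is read from the reverse complement of the nucleotide sequence, starting at its last base (query position 563). A minimal Python sketch of that translation follows. It uses only the final 83 bases of the sequence above (positions 481 to 563), and the helper names are illustrative: they are not part of this record or of BLAST.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Last 83 bases of KMC005782A_c01 (positions 481-563), copied from the FASTA record.
TAIL = (
    "CAGCTTCTGTCCAGCCCACTAACCAGAGTCTCAGCAAACTCATCAACCTTGCCAAACGAT"
    "TCCATCTTAGTGAAATCCGGCCC"
)


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame_minus1(seq):
    """Translate frame -1: reverse-complement, then read codons from the first base."""
    rc = reverse_complement(seq)
    # Unknown codons (e.g. containing N) become 'X'; a trailing partial codon is dropped.
    return "".join(CODON_TABLE.get(rc[i:i + 3], "X") for i in range(0, len(rc) - 2, 3))


print(translate_frame_minus1(TAIL))  # GPDFTKMESFGKVDEFAETLVSGLDRS
```

The printed peptide matches the start of the Query line (position 563 downward) in the top-scoring alignments of the Nr search.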


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005782A_C01 KMC005782A_c01
         (563 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM61552.1| thylakoid lumen protein, chloroplast precursor [A...   181  7e-45
ref|NP_565131.1| oxygen-evolving complex-23 related protein; pro...   180  1e-44
pir||C96792 unknown protein F14G6.5 [imported] - Arabidopsis tha...   180  1e-44
ref|XP_127166.3| similar to Cdc42-binding protein kinase beta [R...    34  1.4
dbj|BAC65833.1| mKIAA1757 protein [Mus musculus]                       34  1.4

>gb|AAM61552.1| thylakoid lumen protein, chloroplast precursor [Arabidopsis
           thaliana]
          Length = 247

 Score =  181 bits (458), Expect = 7e-45
 Identities = 83/108 (76%), Positives = 99/108 (90%)
 Frame = -1

Query: 563 GPDFTKMESFGKVDEFAETLVSGLDRSWRKPPGVAAKLINSKSSKGIYYIEYSLQNPGES 384
           GPDFT+MESFGKV+ FAETLVSGLDRSW+KP GV AKLI+S++SKG YYIEY+LQNPGE+
Sbjct: 140 GPDFTRMESFGKVEAFAETLVSGLDRSWQKPVGVTAKLIDSRASKGFYYIEYTLQNPGEA 199

Query: 383 RRHLYSAIGMATNGWYNRLYTVTGQYVEEETDKYASEIQKAVTSFKFI 240
           R+HLYSAIGMATNGWYNRLYTVTGQ+ +EE+ + +S+IQK V SF+FI
Sbjct: 200 RKHLYSAIGMATNGWYNRLYTVTGQFTDEESSEQSSKIQKTVKSFRFI 247

>ref|NP_565131.1| oxygen-evolving complex-23 related protein; protein id:
           At1g76450.1, supported by cDNA: 123862. [Arabidopsis
           thaliana] gi|18203439|sp|Q9S720|THL1_ARATH Unknown
           thylakoid lumen protein, chloroplast precursor
          Length = 247

 Score =  180 bits (456), Expect = 1e-44
 Identities = 83/108 (76%), Positives = 99/108 (90%)
 Frame = -1

Query: 563 GPDFTKMESFGKVDEFAETLVSGLDRSWRKPPGVAAKLINSKSSKGIYYIEYSLQNPGES 384
           GPDFT+MESFGKV+ FAETLVSGLDRSW+KP GV AKLI+S++SKG YYIEY+LQNPGE+
Sbjct: 140 GPDFTRMESFGKVEAFAETLVSGLDRSWQKPVGVTAKLIDSRASKGFYYIEYTLQNPGEA 199

Query: 383 RRHLYSAIGMATNGWYNRLYTVTGQYVEEETDKYASEIQKAVTSFKFI 240
           R+HLYSAIGMATNGWYNRLYTVTGQ+ +EE+ + +S+IQK V SF+FI
Sbjct: 200 RKHLYSAIGMATNGWYNRLYTVTGQFTDEESAEQSSKIQKTVKSFRFI 247

>pir||C96792 unknown protein F14G6.5 [imported] - Arabidopsis thaliana
           gi|6554474|gb|AAF16656.1|AC012394_5 unknown protein;
           20843-19352 [Arabidopsis thaliana]
           gi|12323974|gb|AAG51945.1|AC015450_6 unknown protein;
           20920-22411 [Arabidopsis thaliana]
          Length = 220

 Score =  180 bits (456), Expect = 1e-44
 Identities = 83/108 (76%), Positives = 99/108 (90%)
 Frame = -1

Query: 563 GPDFTKMESFGKVDEFAETLVSGLDRSWRKPPGVAAKLINSKSSKGIYYIEYSLQNPGES 384
           GPDFT+MESFGKV+ FAETLVSGLDRSW+KP GV AKLI+S++SKG YYIEY+LQNPGE+
Sbjct: 113 GPDFTRMESFGKVEAFAETLVSGLDRSWQKPVGVTAKLIDSRASKGFYYIEYTLQNPGEA 172

Query: 383 RRHLYSAIGMATNGWYNRLYTVTGQYVEEETDKYASEIQKAVTSFKFI 240
           R+HLYSAIGMATNGWYNRLYTVTGQ+ +EE+ + +S+IQK V SF+FI
Sbjct: 173 RKHLYSAIGMATNGWYNRLYTVTGQFTDEESAEQSSKIQKTVKSFRFI 220

>ref|XP_127166.3| similar to Cdc42-binding protein kinase beta [Rattus norvegicus]
           [Mus musculus]
          Length = 1463

 Score = 33.9 bits (76), Expect = 1.4
 Identities = 18/47 (38%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
 Frame = -2

Query: 259 SHHLSSYKEMLMMKEKFPHSHLS-YINTDLCLPTV*DSYEQERILIF 122
           SH L+  KE+LM+K+K   S    +   +  + TV D YE+ER ++F
Sbjct: 474 SHQLALQKEVLMLKDKLEKSKRERHSEMEEAIGTVKDKYERERAMLF 520

>dbj|BAC65833.1| mKIAA1757 protein [Mus musculus]
          Length = 1450

 Score = 33.9 bits (76), Expect = 1.4
 Identities = 17/54 (31%), Positives = 29/54 (53%)
 Frame = -1

Query: 350 TNGWYNRLYTVTGQYVEEETDKYASEIQKAVTSFKFI*RNAHDEGEISPQPSFL 189
           T GW NR+   T QY E  T++Y++++Q A+       ++ H   E   +P+ L
Sbjct: 212 TEGWENRIRLWTDQYEEAFTNQYSADVQNALE------QHLHSNKEFVGKPAIL 259

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 479,509,690
Number of Sequences: 1393205
Number of extensions: 10349116
Number of successful extensions: 30343
Number of sequences better than 10.0: 26
Number of HSP's better than 10.0 without gapping: 29538
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 30336
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20382500157
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
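
The statistics above are enough to reproduce the reported scores approximately. The Python sketch below applies the standard Karlin-Altschul relations, bit score S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'), using the gapped Lambda and K from this report. The derivation of the effective lengths (query length in codons minus the effective HSP length, and database letters minus the HSP length times the number of sequences) is an assumption that reproduces the numbers printed for this run; it is not a description of the BLAST 2.2.2 implementation. Function names are illustrative.

```python
import math

# Values copied from the report above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_NT = 563            # query length in nucleotides
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
HSP_LEN = 116             # "effective HSP length"


def effective_search_space():
    """Assumed derivation: (codons - HSP length) * (db letters - HSP length * db sequences)."""
    eff_query = QUERY_NT // 3 - HSP_LEN
    eff_db = DB_LETTERS - HSP_LEN * DB_SEQUENCES
    return eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits using the gapped Lambda and K."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


space = effective_search_space()
print(space)                      # 20382500157, as reported
for raw in (458, 456, 76):        # raw scores shown in parentheses in the hit list
    bits = bit_score(raw)
    print(raw, round(bits, 1), f"{expect_value(bits, space):.1e}")
```

For the raw scores 458, 456 and 76 this gives bit scores that round to the reported 181, 180 and 33.9, with E-values of the same order as those in the report; small differences are expected because the report prints rounded Lambda and K.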


EST assemble image


   clone         accession  start  end
1  MWM179b03_f   AV767491       1  563
2  MFBL053d01_f  BP043968       4  512
3  MWM200d11_f   AV767804      16  530




Lotus japonicus
Kazusa DNA Research Institute