KMC003652A_c01

Fasta Sequence
>KMC003652A_C01 KMC003652A_c01
ggctttAAAGACACTAGAAGGTCATCATCCCTTATCACTATTAGAATGGATGATAAATAT
CTGAAAACATGGTGGAAAGACAAGGCTCCCACAATAGCAGTTTGGTATATGATGTATTAG
AGCAATACCAGTAAGAAATTACATCTTATAAGTTATTTCAATGATGGATGAAAAGTGAAA
GCACAAGCAACAATAGCCAGTGCCAAGTTTTCACTAAAGTGTAGTAGTGGAGTTCAATAG
AACACTCCATTCACATCCTTATTTCTCTAACCTCTATCATTTTCACTAATCATACAACTA
TCACCCACCCATTATTAAAGAGTGCCAAAGTAAAAGAATTCAACTCAGGTAGAATCACCT
TTGTGTTTGCATCCATGATCCATCCATCAATCCTTATTTTATTCCAATTGCCAAAAAGAC
CTCAAATTAACCTTCACCTATCATTGATCTGTACCAAAAACTTTGATCCTAGACAATGGG
CTCCTCAAGTAACCATCAACCTCAAACTTCTTGGGTGACAACTCTCTGTACTTAGCGAAC
TGCACCTTCGCTTCATCATTCTTATCAAGCAAACTATAAATCATTCCCCTG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003652A_C01 KMC003652A_c01
         (591 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_566609.1| expressed protein; protein id: At3g18420.1, sup...    83  3e-15
ref|NP_568104.1| chloroplast lumen common protein family; protei...    39  0.064
pir||T48280 hypothetical protein T22P11.180 - Arabidopsis thalia...    37  0.19
gb|AAM66986.1| unknown [Arabidopsis thaliana]                          36  0.42
ref|NP_565860.1| chloroplast lumen common protein family; protei...    36  0.42

>ref|NP_566609.1| expressed protein; protein id: At3g18420.1, supported by cDNA:
           23733., supported by cDNA: gi_14335157, supported by
           cDNA: gi_18655374 [Arabidopsis thaliana]
           gi|11994105|dbj|BAB01108.1|
           gb|AAC98059.1~gene_id:MYF24.14~similar to unknown
           protein [Arabidopsis thaliana]
           gi|14335158|gb|AAK59859.1| AT3g18420/MYF24_13
           [Arabidopsis thaliana] gi|18655375|gb|AAL76143.1|
           AT3g18420/MYF24_13 [Arabidopsis thaliana]
           gi|21592430|gb|AAM64381.1| unknown [Arabidopsis
           thaliana]
          Length = 316

 Score = 82.8 bits (203), Expect = 3e-15
 Identities = 39/46 (84%), Positives = 44/46 (94%)
 Frame = -2

Query: 590 RGMIYSLLDKNDEAKVQFAKYRELSPKKFEVDGYLRSPLSRIKVFG 453
           RGMIYSLLDKN EAK QFAKYRELSPKKFEV+GYLR+PLS++K+FG
Sbjct: 266 RGMIYSLLDKNVEAKEQFAKYRELSPKKFEVEGYLRTPLSKMKLFG 311
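"Frame = -2" in the alignment above indicates that the query was translated from its reverse-complement strand, starting one base in from the 3' end (query position 590). The following is a minimal Python sketch of that translation, assuming the standard genetic code. The function names are illustrative and are not part of BLAST.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement; unknown bases become N."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp.get(base, "N") for base in reversed(seq.upper()))


def translate_frame(seq, frame):
    """Translate seq in frame +1..+3 or -1..-3; unknown codons become X."""
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +1, +2, +3, -1, -2, -3")
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 51 bases of the contig (the final line of the FASTA record).
tail = "TGCACCTTCGCTTCATCATTCTTATCAAGCAAACTATAAATCATTCCCCTG"
peptide = translate_frame(tail, -2)
print(peptide)
```

Because negative frames are counted from the 3' end, translating only the tail of the contig is enough to reproduce the start of the Query line.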

>ref|NP_568104.1| chloroplast lumen common protein family; protein id: At5g02590.1,
           supported by cDNA: 13930. [Arabidopsis thaliana]
           gi|21553360|gb|AAM62453.1| unknown [Arabidopsis
           thaliana]
          Length = 326

 Score = 38.5 bits (88), Expect = 0.064
 Identities = 20/50 (40%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
 Frame = -2

Query: 590 RGMIYSLLDKNDEAKVQFAKYRELSPKKFEVDGYL-RSPLSRIKVFGTDQ 444
           +G+IY+L+ K DEA+ QFA++R L P+      YL  + L+  K+F  +Q
Sbjct: 277 QGLIYTLMKKKDEAEKQFAEFRRLVPENHPYKEYLDANVLNTNKLFAKNQ 326

>pir||T48280 hypothetical protein T22P11.180 - Arabidopsis thaliana
           gi|7413648|emb|CAB85996.1| putative protein [Arabidopsis
           thaliana]
          Length = 407

 Score = 37.0 bits (84), Expect = 0.19
 Identities = 16/35 (45%), Positives = 24/35 (67%)
 Frame = -2

Query: 590 RGMIYSLLDKNDEAKVQFAKYRELSPKKFEVDGYL 486
           +G+IY+L+ K DEA+ QFA++R L P+      YL
Sbjct: 277 QGLIYTLMKKKDEAEKQFAEFRRLVPENHPYKEYL 311

>gb|AAM66986.1| unknown [Arabidopsis thaliana]
          Length = 333

 Score = 35.8 bits (81), Expect = 0.42
 Identities = 17/45 (37%), Positives = 26/45 (57%)
 Frame = -2

Query: 590 RGMIYSLLDKNDEAKVQFAKYRELSPKKFEVDGYLRSPLSRIKVF 456
           +G+IY++L K +EA+ QF K+R L PK      Y    +   K+F
Sbjct: 279 QGIIYTVLKKENEAEKQFEKFRRLVPKNHPYREYFMDNMVASKLF 323

>ref|NP_565860.1| chloroplast lumen common protein family; protein id: At2g37400.1,
           supported by cDNA: 9001. [Arabidopsis thaliana]
           gi|25408548|pir||C84792 hypothetical protein At2g37400
           [imported] - Arabidopsis thaliana
           gi|4056493|gb|AAC98059.1| chloroplast lumen common
           protein family [Arabidopsis thaliana]
          Length = 333

 Score = 35.8 bits (81), Expect = 0.42
 Identities = 17/45 (37%), Positives = 26/45 (57%)
 Frame = -2

Query: 590 RGMIYSLLDKNDEAKVQFAKYRELSPKKFEVDGYLRSPLSRIKVF 456
           +G+IY++L K +EA+ QF K+R L PK      Y    +   K+F
Sbjct: 279 QGIIYTVLKKENEAEKQFEKFRRLVPKNHPYREYFMDNMVASKLF 323

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 489,218,139
Number of Sequences: 1393205
Number of extensions: 10191132
Number of successful extensions: 18779
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 18369
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 18775
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 22569056698
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
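
The scores and E-values in the report can be checked against the statistics above using the Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). The sketch below assumes the gapped Lambda and K values apply to the reported alignments; the function names are illustrative. The report prints rounded values, so agreement is only approximate.

```python
import math


def bits_from_raw(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_from_bits(bit_score, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bit_score)


# Values copied from the report (gapped Lambda and K, effective search space).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 22569056698

# Raw scores of the reported hits: 203, 88, 84, 81.
for raw in (203, 88, 84, 81):
    bits = bits_from_raw(raw, GAPPED_LAMBDA, GAPPED_K)
    expect = expect_from_bits(bits, SEARCH_SPACE)
    print(f"raw={raw}  bits={bits:.1f}  E={expect:.1e}")
```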


EST assembly


no.  clone         accession  start  end
1    MF019d01_f    BP029252       1  443
2    MF045f11_f    BP030682       7  568
3    MR074a10_f    BP081660      24  412
4    GNf058c01     BP071666      74  507
5    MF040h06_f    BP030411     100  374
6    MFB035d04_f   BP036563     117  597
7    MFB064d10_f   BP038640     122  593
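
The clone positions above can be summarised as a coverage depth along the assembly. This is a minimal Python sketch that assumes the two position values are 1-based, inclusive start and end coordinates. The record does not state this, and the helper names are illustrative.

```python
# (clone, accession, start, end) copied from the table above.
ESTS = [
    ("MF019d01_f", "BP029252", 1, 443),
    ("MF045f11_f", "BP030682", 7, 568),
    ("MR074a10_f", "BP081660", 24, 412),
    ("GNf058c01", "BP071666", 74, 507),
    ("MF040h06_f", "BP030411", 100, 374),
    ("MFB035d04_f", "BP036563", 117, 597),
    ("MFB064d10_f", "BP038640", 122, 593),
]


def contig_span(ests):
    """Smallest start and largest end over all member ESTs."""
    return min(e[2] for e in ests), max(e[3] for e in ests)


def depth_at(ests, pos):
    """Number of ESTs covering a position (inclusive coordinates assumed)."""
    return sum(1 for _, _, start, end in ests if start <= pos <= end)


first, last = contig_span(ESTS)
depths = [depth_at(ESTS, p) for p in range(first, last + 1)]
print("span:", first, last)
print("min depth:", min(depths), "max depth:", max(depths))
```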




Lotus japonicus
Kazusa DNA Research Institute