KMC002089A_c04

Fasta Sequence
>KMC002089A_C04 KMC002089A_c04
cagatttCGTTAACCAATCTTTTTATTAGTTAAGCAATCGTTTACATCGCATTTCAGACT
CTTATATTAAACTTGGTTTATTATATTTTAAAAAATTAACGTATTAATTTACGTATAAAA
TACGACATATGTAGGTAAAAAAACACTAGTATGTTGGAGAAAAAACCAAAAATCTATTGA
GAGAGCATGCTCCCCCACCCCCTTTCTGATATTTAGGTCTTCTTTACAGCTATGGTAATC
GCTTCCTCTATAGCTGGGAAAAAAGTTCCAATCTGGTATGTGAAAAAACCCACTAGCATT
GGTATCAATTCTAAGTGCATAGCTCCAAATTCTGGAACAAGAATCGCATTCCAGCGGTTA
TAAACCATGACTAATATAACAGGAACCAGTAACCTGGGCTGCCCAATGGCTCCCTTGACA
AAGCCCTTTCCACCATTGGTTCTTAGTGCGTCCACGCTACTTCCCAGCATGCGAATATAT
GCCAAAGAACCAAGCAAGCCAGCACCAAAGCTAGCTGCAATCTCAGGAGAATAAGAAACA
TAAGATGAAACCAGACCAACACCACCTATGCCCAATGTAAGGATTTGCATCTTGTTCTTT
AACTTATCATACTGCTCCTCAGCTTGAGCAGCCAAAACTTCTGGATTTTCACGAGGGCGC
CTCAGTTCCCGAAACATCACATC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002089A_C04 KMC002089A_c04
         (683 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_565711.1| expressed protein; protein id: At2g31040.1, sup...   237  1e-61
gb|AAM64960.1| unknown [Arabidopsis thaliana]                         237  1e-61
dbj|BAB84384.1| P0007F06.23 [Oryza sativa (japonica cultivar-gro...   192  3e-48
ref|NP_484055.1| ATP synthase subunit 1 [Nostoc sp. PCC 7120] gi...    55  9e-07
ref|ZP_00074541.1| hypothetical protein [Trichodesmium erythraeu...    55  1e-06
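
The hit table above is fixed-layout text: identifier, truncated description, bit score, E-value. The following is a minimal Python sketch for extracting those fields, assuming every row ends with an integer bit score followed by an E-value that Python's float() can read. The names HIT_LINE and parse_hit_line are illustrative and not part of BLAST.

```python
import re

# Layout assumed from the rows above: identifier, free-text description,
# then bit score and E-value as the last two whitespace-separated fields.
HIT_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+(\S+)$")


def parse_hit_line(line):
    """Return (identifier, description, bit_score, e_value), or None if no match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    identifier, description, score, e_value = match.groups()
    # float() handles values such as "1e-61"; other spellings would need
    # extra handling that this sketch does not attempt.
    return identifier, description, int(score), float(e_value)


# Rows copied from the summary table of this report.
rows = [
    "ref|NP_565711.1| expressed protein; protein id: At2g31040.1, sup...   237  1e-61",
    "gb|AAM64960.1| unknown [Arabidopsis thaliana]                         237  1e-61",
    "dbj|BAB84384.1| P0007F06.23 [Oryza sativa (japonica cultivar-gro...   192  3e-48",
    "ref|NP_484055.1| ATP synthase subunit 1 [Nostoc sp. PCC 7120] gi...    55  9e-07",
    "ref|ZP_00074541.1| hypothetical protein [Trichodesmium erythraeu...    55  1e-06",
]
hits = [parse_hit_line(row) for row in rows]

for identifier, description, score, e_value in hits:
    print(identifier, score, e_value)
```

The description group is lazy so that the trailing score and E-value are captured separately even when the description itself contains digits.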

>ref|NP_565711.1| expressed protein; protein id: At2g31040.1, supported by cDNA:
           35095., supported by cDNA: gi_15215860, supported by
           cDNA: gi_19699267 [Arabidopsis thaliana]
           gi|25370752|pir||G84715 hypothetical protein At2g31040
           [imported] - Arabidopsis thaliana
           gi|3746067|gb|AAC63842.1| expressed protein [Arabidopsis
           thaliana] gi|15215861|gb|AAK91474.1| At2g31040/T16B12.15
           [Arabidopsis thaliana] gi|19699268|gb|AAL91000.1|
           At2g31040/T16B12.15 [Arabidopsis thaliana]
           gi|20197222|gb|AAM14978.1| expressed protein
           [Arabidopsis thaliana]
          Length = 350

 Score =  237 bits (604), Expect = 1e-61
 Identities = 118/155 (76%), Positives = 137/155 (88%)
 Frame = -1

Query: 683 DVMFRELRRPRENPEVLAAQAEEQYDKLKNKMQILTLGIGGVGLVSSYVSYSPEIAASFG 504
           DVMFRELRRPR +PEV AA+  EQY KLKNK+Q+LTLGIGGVGLVS+Y+SY+PEIA SFG
Sbjct: 185 DVMFRELRRPRGDPEVQAAKDREQYFKLKNKIQVLTLGIGGVGLVSAYISYTPEIALSFG 244

Query: 503 AGLLGSLAYIRMLGSSVDALRTNGGKGFVKGAIGQPRLLVPVILVMVYNRWNAILVPEFG 324
           AGLLGSLAY+RMLG+SVDA+  +G +G  KGA  QPRLLVPV+LVM++NRWNAILVPE+G
Sbjct: 245 AGLLGSLAYMRMLGNSVDAM-ADGARGVAKGAANQPRLLVPVVLVMIFNRWNAILVPEYG 303

Query: 323 AMHLELIPMLVGFFTYQIGTFFPAIEEAITIAVKK 219
            MHLELIPMLVGFFTY+I TFF AIEEAI+I  +K
Sbjct: 304 FMHLELIPMLVGFFTYKIATFFQAIEEAISITTQK 338
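
"Frame = -1" means the query was translated from its reverse complement, starting at the last nucleotide (position 683) and reading toward position 1. The sketch below illustrates this on the final 48 nucleotides of the FASTA record, assuming the standard genetic code. The helper names are illustrative, and this is not how BLASTX itself is implemented.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Complement every base, then reverse the strand."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate_frame_minus1(dna):
    """Translate the reverse complement, starting at its first base."""
    rc = reverse_complement(dna)
    return "".join(CODON_TABLE[rc[i:i + 3]] for i in range(0, len(rc) - 2, 3))


# Last 48 nucleotides of the query (positions 636-683), copied from the
# FASTA record at the top of this page.
tail = "AACTTCTGGATTTTCACGAGGGCGCCTCAGTTCCCGAAACATCACATC"
peptide = translate_frame_minus1(tail)
print(peptide)  # DVMFRELRRPRENPEV
```

The printed peptide matches the first 16 residues of the "Query: 683" line in the alignment above.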

>gb|AAM64960.1| unknown [Arabidopsis thaliana]
          Length = 350

 Score =  237 bits (604), Expect = 1e-61
 Identities = 118/155 (76%), Positives = 137/155 (88%)
 Frame = -1

Query: 683 DVMFRELRRPRENPEVLAAQAEEQYDKLKNKMQILTLGIGGVGLVSSYVSYSPEIAASFG 504
           DVMFRELRRPR +PEV AA+  EQY KLKNK+Q+LTLGIGGVGLVS+Y+SY+PEIA SFG
Sbjct: 185 DVMFRELRRPRGDPEVQAAKDREQYFKLKNKIQVLTLGIGGVGLVSAYISYTPEIALSFG 244

Query: 503 AGLLGSLAYIRMLGSSVDALRTNGGKGFVKGAIGQPRLLVPVILVMVYNRWNAILVPEFG 324
           AGLLGSLAY+RMLG+SVDA+  +G +G  KGA  QPRLLVPV+LVM++NRWNAILVPE+G
Sbjct: 245 AGLLGSLAYMRMLGNSVDAM-ADGARGVAKGAANQPRLLVPVVLVMIFNRWNAILVPEYG 303

Query: 323 AMHLELIPMLVGFFTYQIGTFFPAIEEAITIAVKK 219
            MHLELIPMLVGFFTY+I TFF AIEEAI+I  +K
Sbjct: 304 FMHLELIPMLVGFFTYKIATFFQAIEEAISITTQK 338

>dbj|BAB84384.1| P0007F06.23 [Oryza sativa (japonica cultivar-group)]
           gi|21644622|dbj|BAC01181.1| P0485G01.15 [Oryza sativa
           (japonica cultivar-group)]
          Length = 311

 Score =  192 bits (489), Expect = 3e-48
 Identities = 101/152 (66%), Positives = 124/152 (81%), Gaps = 3/152 (1%)
 Frame = -1

Query: 683 DVMFREL--RRPRENPEVLAAQAEEQYDKLKNKMQILTLGIGGVGLVSSYVSYSPEIAAS 510
           DV+ RE   R  + +PEVLAA++ EQY +LK ++Q+ TLGIGG+GLVS+Y SYSPEIAAS
Sbjct: 154 DVILRESKSRGQQGDPEVLAAKSREQYLELKQRLQLFTLGIGGIGLVSAYFSYSPEIAAS 213

Query: 509 FGAGLLGSLAYIRMLGSSVDALRTNGGKG-FVKGAIGQPRLLVPVILVMVYNRWNAILVP 333
           FGAGL+GS+ Y+RMLG+SVD+L   GG G  VK A  QPRLL+PV LVM+YNRWN ILVP
Sbjct: 214 FGAGLIGSVLYLRMLGTSVDSLA--GGTGETVKSAAAQPRLLIPVALVMMYNRWNEILVP 271

Query: 332 EFGAMHLELIPMLVGFFTYQIGTFFPAIEEAI 237
           ++G MHLELIPMLVGFFTY+I TF  AI+E+I
Sbjct: 272 DYGFMHLELIPMLVGFFTYKIATFAQAIQESI 303

>ref|NP_484055.1| ATP synthase subunit 1 [Nostoc sp. PCC 7120]
           gi|20141204|sp|P12403|ATPZ_ANASP ATP synthase protein I
           gi|25296872|pir||AC1808 ATP synthase chain 1 [imported]
           - Nostoc sp. (strain PCC 7120)
           gi|17134989|dbj|BAB77535.1| ATP synthase subunit 1
           [Nostoc sp. PCC 7120]
          Length = 122

 Score = 55.1 bits (131), Expect = 9e-07
 Identities = 31/115 (26%), Positives = 63/115 (53%)
 Frame = -1

Query: 617 EQYDKLKNKMQILTLGIGGVGLVSSYVSYSPEIAASFGAGLLGSLAYIRMLGSSVDALRT 438
           +++ +L  ++ ++TL + GV  +S ++ YS  IA ++  G    + Y+RML   V+ L  
Sbjct: 2   QEFYQLYQELVLITLVLTGVVFISVWIFYSLNIALNYLLGACTGVVYLRMLAKDVERL-- 59

Query: 437 NGGKGFVKGAIGQPRLLVPVILVMVYNRWNAILVPEFGAMHLELIPMLVGFFTYQ 273
               G  K ++ + RL + + L+++ +RWN           L+++P+ +GF TY+
Sbjct: 60  ----GREKQSLSKTRLALLMALILLASRWN----------QLQIMPIFLGFLTYK 100

>ref|ZP_00074541.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 146

 Score = 54.7 bits (130), Expect = 1e-06
 Identities = 33/115 (28%), Positives = 63/115 (54%)
 Frame = -1

Query: 617 EQYDKLKNKMQILTLGIGGVGLVSSYVSYSPEIAASFGAGLLGSLAYIRMLGSSVDALRT 438
           ++Y KL+ ++ ++TL I G+  +  +V YS  IA ++  G   S+ Y+RML   V+ +  
Sbjct: 32  KEYYKLQEELYVITLTITGIIFIFVWVFYSLNIALNYLIGATTSVVYLRMLAKDVERI-- 89

Query: 437 NGGKGFVKGAIGQPRLLVPVILVMVYNRWNAILVPEFGAMHLELIPMLVGFFTYQ 273
               G  KG++ + RL + V L+++  + N           L+++P+ +GF TY+
Sbjct: 90  ----GREKGSLSKTRLAILVGLIILAAQLN----------ELKILPIFLGFLTYK 130

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 602,393,495
Number of Sequences: 1393205
Number of extensions: 14125347
Number of successful extensions: 38410
Number of sequences better than 10.0: 49
Number of HSP's better than 10.0 without gapping: 37030
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 38374
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 30552968016
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
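
Several of the figures above can be re-derived from one another. The sketch below uses the Karlin-Altschul relations S' = (lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S') with the gapped Lambda and K printed in this report. The way the effective query length is obtained here (translated length minus effective HSP length) is an assumption that happens to reproduce the reported search space, not a documented description of BLAST internals. Because the printed Lambda and K are rounded, the results agree with the report only approximately.

```python
import math

# Values copied from this report.
db_letters = 448_689_247
db_sequences = 1_393_205
query_nt = 683
effective_hsp_length = 119
gapped_lambda, gapped_k = 0.267, 0.0410

# Effective lengths: one HSP length is subtracted per database sequence
# and (assumption) once from the translated query length.
effective_db_length = db_letters - db_sequences * effective_hsp_length
effective_query_length = query_nt // 3 - effective_hsp_length
search_space = effective_db_length * effective_query_length


def bit_score(raw_score, lam=gapped_lambda, k=gapped_k):
    """Normalize a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def e_value(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


print(effective_db_length)          # 282897852
print(search_space)                 # 30552968016
print(round(bit_score(131), 1))     # 55.1
print(f"{e_value(604):.1e}")
```

With these inputs the top hit (raw score 604) comes out near 237 bits with an E-value on the order of 1e-61, in line with the table of significant alignments.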


EST assembly


no.  clone         accession  start  end
1    SPDL034c04_f  BP054112       1  468
2    GENf087f08    BP062049      10  386
3    GENf036a09    BP059866      55  501
4    MWM107c12_f   AV766461      64  384
5    MPD008c06_f   AV770516      75  559
6    GNf002b06     BP067497     108  506
7    MF029d05_f    BP029794     229  683
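
The clone positions can be turned into a per-base coverage profile of the contig. The sketch below assumes that the two numbers in each row are 1-based, inclusive start and end coordinates on the 683-nucleotide consensus. That reading is consistent with the table, but the page does not state it explicitly.

```python
# (clone, accession, start, end), copied from the table above.
clones = [
    ("SPDL034c04_f", "BP054112", 1, 468),
    ("GENf087f08", "BP062049", 10, 386),
    ("GENf036a09", "BP059866", 55, 501),
    ("MWM107c12_f", "AV766461", 64, 384),
    ("MPD008c06_f", "AV770516", 75, 559),
    ("GNf002b06", "BP067497", 108, 506),
    ("MF029d05_f", "BP029794", 229, 683),
]
contig_length = 683

# depth[pos] is the number of ESTs covering position pos; index 0 is unused
# so that list indices line up with 1-based coordinates.
depth = [0] * (contig_length + 1)
for _clone, _accession, start, end in clones:
    for pos in range(start, end + 1):
        depth[pos] += 1

min_depth = min(depth[1:])
max_depth = max(depth[1:])
print(min_depth, max_depth)  # 1 7
```

Under that assumption every position of the contig is covered by at least one EST, and positions 229 to 384 are covered by all seven.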




Lotus japonicus
Kazusa DNA Research Institute