KMC012378A_c01

Fasta Sequence
>KMC012378A_C01 KMC012378A_c01
ttctaagtgactggctcaacctcataacaattcaaaagttcacagggcaggtggactaca
catttttggtggaAATGGAATTTCATGAATACCAGGTAGAGAGAAATGGTTGTCTTTGAC
AAAATTAAACCTTGAGAATACATCACTCAGTGGTGGTTGCAGATCTATTTATTTACAAGA
AGGAACTTCATCTTTCCTTTATCTTGGTTGTCCTTTCTTATCTTCAGGTGAAGTAGGGAA
GCCAAGCTTGAGGACTGTTCACAATTCCTACTGGATGGGTGACTGTAGGAAAATCAGAGA
TGATAAGTAGCAATGTAGCATCATCAACCTTCCGTTATGGCTGTTAGTCCTGCAATTTCT
TTCAGTGGCTTCCACATAAACTTTCTCCAGTTTCCACATTCAGCAGTGACCCTCACATTA
GGCCTAAGGGTAGGAAGCAGAGCAAGGAACTGCAGTTCTGCTTTAGATAGAACCTTGGCT
TCAATTATACACACCACCAGTCCTTTTGCATTTGGTGGCAATGCTTCAAGTCTGTAATCA
ACTTCTGGTGGGAAATACCATCTTGCTATTGGCTGCTTGCCAACATGAGGAGGGCAAACA
GGGTGAATCTTGGTGAGTGTGGTTTCAGGATCACCAAGACCACGAGCTGAAAGGTCTAGA
AAATAGTAACCTTGGCGCCTGAGACGAGACAAGTAGCGGCCTTCATAGCCTCCCTCATGA
GGAGAGTAAATTGCAAGTGCTTTGTGCTCTTCAACATCTGAAAGCCAACGTCCAAGATCA
GGTTTCAACAAGTCTCCCCCTATGAAGTCTCCTAACCCAACTCCACTGCATCTTACCACT
CCTACAACATCCTTTTTGCTTGTCTCATAATGTCTTCTTCCTCCCCAACCAGTCATCATC
CCAATGGTGTTGTTCCTTGTTTGTTGCTGTCTCTTTGTGACATGGAAGCATGTTGGTCCT
CCATAACCCAAGCTGGTTGCTgctgacatttttctgatgaatttttgttttggtgctgaa
gctgtactaattcctcactaaaatgtgagaggttattagag


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC012378A_C01 KMC012378A_c01
         (1061 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_200634.1| similar to unknown protein (pir||S75584); prote...   286  5e-76
dbj|BAB90365.1| B1065E10.16 [Oryza sativa (japonica cultivar-gro...   285  1e-75
ref|ZP_00074126.1| hypothetical protein [Trichodesmium erythraeu...   132  7e-30
gb|ZP_00106138.1| hypothetical protein [Nostoc punctiforme]           130  5e-29
ref|NP_488256.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...   129  1e-28

>ref|NP_200634.1| similar to unknown protein (pir||S75584); protein id: At5g58260.1,
           supported by cDNA: 3488., supported by cDNA:
           gi_19347862, supported by cDNA: gi_21280986 [Arabidopsis
           thaliana] gi|8777327|dbj|BAA96917.1|
           gene_id:MCK7.13~pir||S75584~similar to unknown protein
           [Arabidopsis thaliana] gi|19347863|gb|AAL85990.1|
           unknown protein [Arabidopsis thaliana]
           gi|21280987|gb|AAM45047.1| unknown protein [Arabidopsis
           thaliana]
          Length = 209

 Score =  286 bits (731), Expect = 5e-76
 Identities = 139/207 (67%), Positives = 164/207 (79%), Gaps = 1/207 (0%)
 Frame = -1

Query: 956 PTCFHVTKRQQQTRNNTIG-MMTGWGGRRHYETSKKDVVGVVRCSGVGLGDFIGGDLLKP 780
           P CF  +   Q  +  T+G  +     +R   T        V+CS +   D+IGGDL+KP
Sbjct: 13  PPCFEAS---QVKKIKTVGSFLVNTRSKRRRSTG-------VKCSSIA--DYIGGDLVKP 60

Query: 779 DLGRWLSDVEEHKALAIYSPHEGGYEGRYLSRLRRQGYYFLDLSARGLGDPETTLTKIHP 600
           D+G+WL DVEEHKA+AIY+PHEGGYEGRYL+RL+ QGYYFLD+SARGLGDPETTL K +P
Sbjct: 61  DIGQWLQDVEEHKAIAIYAPHEGGYEGRYLNRLKMQGYYFLDISARGLGDPETTLLKNYP 120

Query: 599 VCPPHVGKQPIARWYFPPEVDYRLEALPPNAKGLVVCIIEAKVLSKAELQFLALLPTLRP 420
           VCP H+GKQPIARWY+PPEVDYRL ALPP+AKGLVV ++EAKVLSK+ELQFLALLP+LRP
Sbjct: 121 VCPAHLGKQPIARWYYPPEVDYRLAALPPSAKGLVVWVLEAKVLSKSELQFLALLPSLRP 180

Query: 419 NVRVTAECGNWRKFMWKPLKEIAGLTA 339
           NVRV AECGNWRKF+WKPL EIA L A
Sbjct: 181 NVRVIAECGNWRKFVWKPLAEIANLAA 207

>dbj|BAB90365.1| B1065E10.16 [Oryza sativa (japonica cultivar-group)]
          Length = 211

 Score =  285 bits (728), Expect = 1e-75
 Identities = 130/158 (82%), Positives = 145/158 (91%)
 Frame = -1

Query: 818 GLGDFIGGDLLKPDLGRWLSDVEEHKALAIYSPHEGGYEGRYLSRLRRQGYYFLDLSARG 639
           GL DF+GGDL+KPD+GRWL DVE+HK+LAIY PHEGGYEGRYLSRL  QGYYFLDLSARG
Sbjct: 46  GLWDFVGGDLVKPDMGRWLDDVEKHKSLAIYPPHEGGYEGRYLSRLSYQGYYFLDLSARG 105

Query: 638 LGDPETTLTKIHPVCPPHVGKQPIARWYFPPEVDYRLEALPPNAKGLVVCIIEAKVLSKA 459
           LGDPETTLTKIHPVCPP +G+QP+ARWYFPPEVDYRL  L P+AKGLVV ++EAKVLSKA
Sbjct: 106 LGDPETTLTKIHPVCPPSLGRQPVARWYFPPEVDYRLSLLHPDAKGLVVWVMEAKVLSKA 165

Query: 458 ELQFLALLPTLRPNVRVTAECGNWRKFMWKPLKEIAGL 345
           ELQFLA+LP +RP VRV AECGNWRKF+WKPLK+IAGL
Sbjct: 166 ELQFLAILPDIRPKVRVIAECGNWRKFVWKPLKQIAGL 203

>ref|ZP_00074126.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 158

 Score =  132 bits (333), Expect = 7e-30
 Identities = 66/142 (46%), Positives = 91/142 (63%), Gaps = 7/142 (4%)
 Frame = -1

Query: 764 LSDVEEHKALAIYSPHEGGYEGRYLSRLRRQGYYFLDLSARGLGDPETTLTKIHPVCPPH 585
           + D+E+  +LA+Y P EGG+EGRY  RLR  GY    ++ARGLGD    LT +H V PPH
Sbjct: 11  IRDLEKSGSLAVYPPLEGGFEGRYQRRLRASGYVSESITARGLGDLAMYLTGVHGVRPPH 70

Query: 584 VGKQPIAR-------WYFPPEVDYRLEALPPNAKGLVVCIIEAKVLSKAELQFLALLPTL 426
           +GK+ +         +Y PP V+Y+LE LPP AKGLV+ I+E ++LS  E+++L +LP  
Sbjct: 71  LGKKTVGNGPAVGYVYYVPPIVNYKLEHLPPKAKGLVLWIMEGQILSSQEIEYLTVLPKS 130

Query: 425 RPNVRVTAECGNWRKFMWKPLK 360
            P V+V  E G  R F W PL+
Sbjct: 131 EPRVKVIVEMGGDRFFRWTPLQ 152

>gb|ZP_00106138.1| hypothetical protein [Nostoc punctiforme]
          Length = 158

 Score =  130 bits (326), Expect = 5e-29
 Identities = 68/143 (47%), Positives = 88/143 (60%), Gaps = 7/143 (4%)
 Frame = -1

Query: 764 LSDVEEHKALAIYSPHEGGYEGRYLSRLRRQGYYFLDLSARGLGDPETTLTKIHPVCPPH 585
           + D+E+  AL +Y P EGGYEGRY  RLR  GY  L ++A+GLGD    LT+IH V PPH
Sbjct: 11  IRDLEKFGALGVYVPLEGGYEGRYQRRLRAAGYTTLHITAKGLGDVAAYLTRIHGVRPPH 70

Query: 584 VGKQPIAR-------WYFPPEVDYRLEALPPNAKGLVVCIIEAKVLSKAELQFLALLPTL 426
           +GK+           +Y PP +D  LE LPP +KGLV+ IIE  +LS  EL++L  LP L
Sbjct: 71  LGKKSTGSGAAVGQVYYLPPILDSHLEQLPPKSKGLVLWIIEGHILSNEELEYLTNLPQL 130

Query: 425 RPNVRVTAECGNWRKFMWKPLKE 357
            P V+V  E G  R F W  L++
Sbjct: 131 EPRVKVVIERGGDRAFRWTSLEK 153

>ref|NP_488256.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25332975|pir||AI2332
           hypothetical protein alr4216 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17133351|dbj|BAB75915.1|
           ORF_ID:alr4216~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 162

 Score =  129 bits (323), Expect = 1e-28
 Identities = 67/143 (46%), Positives = 88/143 (60%), Gaps = 7/143 (4%)
 Frame = -1

Query: 764 LSDVEEHKALAIYSPHEGGYEGRYLSRLRRQGYYFLDLSARGLGDPETTLTKIHPVCPPH 585
           + D+E+  ++ +Y P EGG+EGRY  RLR  GY  L  +ARGLGD    LT +H V PPH
Sbjct: 11  IRDLEKFGSVGVYVPLEGGFEGRYRRRLRAAGYTTLQFTARGLGDVAAYLTGVHGVRPPH 70

Query: 584 VGKQPIAR-------WYFPPEVDYRLEALPPNAKGLVVCIIEAKVLSKAELQFLALLPTL 426
           +GK+           +Y PP V  +LE LPP +KGLV+ IIE  +LS  E++FL  LP+L
Sbjct: 71  LGKKSSGNGAAVGNVYYLPPIVGSQLEHLPPKSKGLVLWIIEGHILSDQEVEFLTSLPSL 130

Query: 425 RPNVRVTAECGNWRKFMWKPLKE 357
            P V+V  E G  R F WK LK+
Sbjct: 131 EPRVKVVVERGGDRTFRWKALKD 153
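
A quick consistency check on the alignments above, offered as a sketch rather than as part of the BLAST report. In a translated (BLASTX) alignment the query coordinates count nucleotides, so each query span should cover three times the number of aligned query residues. The tuples below are copied by hand from the five HSPs. `query_gaps` is the number of `-` characters in the Query lines, which is an assumption about how to count them, not the reported "Gaps" field. The function name `translated_residues` is hypothetical.

```python
# (query_from, query_to, alignment_length, query_gaps), copied from the HSPs above.
# query_gaps = '-' characters in the Query lines only (assumption, see lead-in).
hsps = {
    "NP_200634.1": (956, 339, 207, 1),
    "BAB90365.1": (818, 345, 158, 0),
    "ZP_00074126.1": (764, 360, 142, 7),
    "ZP_00106138.1": (764, 357, 143, 7),
    "NP_488256.1": (764, 357, 143, 7),
}


def translated_residues(q_from, q_to):
    """Number of codons covered by a nucleotide span (either orientation)."""
    span = abs(q_from - q_to) + 1
    if span % 3 != 0:
        raise ValueError(f"span of {span} nt is not a whole number of codons")
    return span // 3


for name, (q_from, q_to, aln_len, query_gaps) in hsps.items():
    residues = translated_residues(q_from, q_to)
    # Aligned columns = translated query residues + gap characters in the query.
    status = "ok" if residues + query_gaps == aln_len else "MISMATCH"
    print(f"{name}: {residues} residues + {query_gaps} gaps vs {aln_len} columns: {status}")
```

The query coordinates decrease in every HSP (for example 956 down to 339), which is what a negative frame such as `Frame = -1` implies: the hit lies on the reverse-complement strand of the query.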

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 940,883,948
Number of Sequences: 1393205
Number of extensions: 22012931
Number of successful extensions: 72666
Number of sequences better than 10.0: 17
Number of HSP's better than 10.0 without gapping: 64362
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 72378
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 63188388383
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
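
The statistics above are enough to reproduce the reported scores approximately. The sketch below uses the standard Karlin-Altschul relations, bit score S' = (lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'), with the gapped Lambda and K and the "effective search space used" value from this report. Because Lambda and K are printed with only three significant digits, the results should be expected to agree with the report only roughly. The constant and function names are hypothetical.

```python
import math

# Values copied from the "Gapped" statistics block and the search summary above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 63188388383  # "effective search space used"


def bit_score(raw_score):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(raw_score):
    """Approximate E-value for a raw score in the search space above."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


# raw score -> (reported bits, reported E-value), taken from the five hits.
reported = {
    731: (286, 5e-76),
    728: (285, 1e-75),
    333: (132, 7e-30),
    326: (130, 5e-29),
    323: (129, 1e-28),
}

for raw, (bits, e_value) in reported.items():
    print(f"raw {raw}: {bit_score(raw):.2f} bits (reported {bits}), "
          f"E ~ {expect(raw):.1e} (reported {e_value:.0e})")
```

With these rounded parameters, the integer part of each computed bit score equals the reported value, and each computed E-value lands within a factor of two of the reported one.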


EST assemble image


no.  clone        accession  start  end
1    MPD074b12_f  AV774838   1      439
2    MFB037a09_f  BP036676   74     630
3    MWM192f05_f  AV767664   504    1061
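
Since the assembly image itself is not reproduced here, the following sketch checks from the listed positions that the three ESTs tile the whole 1061-letter contig and reports how much neighbouring reads overlap. It assumes the two position numbers are 1-based, inclusive start and end coordinates on the contig. The function names `uncovered` and `overlaps` are hypothetical.

```python
# (clone, accession, start, end), copied from the table above.
# Coordinates are assumed to be 1-based and inclusive.
reads = [
    ("MPD074b12_f", "AV774838", 1, 439),
    ("MFB037a09_f", "BP036676", 74, 630),
    ("MWM192f05_f", "AV767664", 504, 1061),
]
CONTIG_LENGTH = 1061  # "(1061 letters)" in the BLASTX query header


def uncovered(reads, length):
    """Return contig intervals (start, end) not covered by any read."""
    gaps = []
    next_pos = 1  # first position not yet covered
    for _clone, _acc, start, end in sorted(reads, key=lambda r: r[2]):
        if start > next_pos:
            gaps.append((next_pos, start - 1))
        next_pos = max(next_pos, end + 1)
    if next_pos <= length:
        gaps.append((next_pos, length))
    return gaps


def overlaps(reads):
    """Overlap length between each pair of reads adjacent by start position."""
    ordered = sorted(reads, key=lambda r: r[2])
    return [max(0, a[3] - b[2] + 1) for a, b in zip(ordered, ordered[1:])]


print("uncovered intervals:", uncovered(reads, CONTIG_LENGTH))
print("overlaps between neighbouring reads:", overlaps(reads))
```

Under these assumptions the reads leave no uncovered interval, and adjacent reads overlap by 366 and 127 positions.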




Lotus japonicus
Kazusa DNA Research Institute