KMC006474A_c01

Fasta Sequence
>KMC006474A_C01 KMC006474A_c01
gagttttttttttttttttttacagaaaatattctattgataaaataacacaaggaaaaT
TTGCAACAATGTATCATTATGAAAATGAATTTCATATTTTCCACATGATAATATCATTTG
AAGATAAAGATAAATACAAAAACTTTAAATAGCTGGGACCTTGTCAAATGAATACACGTA
TTGTATGAGCATCTCCTAGCATGTTTATCCTCTTACAGAAATTAGAAGACAATTTGGAAA
CAGTAAGAGATTGAAGTCAGGAGTTAAGGAGGAAAGGTAGATCTTATGGATTCCATCTGC
AGATTCTTCTCATATTCATCCAGGCTCATGAAGATCAATCAGGAGAAGACTAGGAACTTC
ATTCAAGCGTATATACTGATTCATATACCCGCATTTGCATGCAAACAATTCAAAAATTCT
AGGCGCTTTCACGGGATGATTTGTATATCCGGTATGCAGCTATGCACATCGTGGCATTAC
CTACAAGAGTCAGTGCCGCTTGAAGAGCCACCAATACCTCTAGAGACTCATCGTTATAAA
AGAAATGCCATGTGCAAGCACAGAATGC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC006474A_C01 KMC006474A_c01
         (568 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAC43411.1| unknown protein [Arabidopsis thaliana] gi|289734...    80  1e-14
ref|NP_488548.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2...    62  5e-09
ref|NP_442291.1| hypothetical protein [Synechocystis sp. PCC 680...    61  1e-08
ref|ZP_00072807.1| hypothetical protein [Trichodesmium erythraeu...    60  2e-08
ref|ZP_00115571.1| hypothetical protein [Synechococcus sp. WH 8102]    51  9e-06

>dbj|BAC43411.1| unknown protein [Arabidopsis thaliana] gi|28973427|gb|AAO64038.1|
           unknown protein [Arabidopsis thaliana]
          Length = 193

 Score = 80.5 bits (197), Expect = 1e-14
 Identities = 37/45 (82%), Positives = 40/45 (88%)
 Frame = -1

Query: 568 AFCACTWHFFYNDESLEVLVALQAALTLVGNATMCIAAYRIYKSS 434
           A CACTWHFFYNDESLEVLVALQAALT+ GN T+CIAA+RI K S
Sbjct: 140 ALCACTWHFFYNDESLEVLVALQAALTVFGNITLCIAAFRINKLS 184

>ref|NP_488548.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25354814|pir||AD2369
           hypothetical protein all4508 [imported] - Nostoc sp.
           (strain PCC 7120) gi|17133644|dbj|BAB76207.1|
           ORF_ID:all4508~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 102

 Score = 62.0 bits (149), Expect = 5e-09
 Identities = 27/46 (58%), Positives = 36/46 (77%)
 Frame = -1

Query: 568 AFCACTWHFFYNDESLEVLVALQAALTLVGNATMCIAAYRIYKSSR 431
           A CACTWH+F N ESLE +V LQA +TLVGN T+  AA+ I++S++
Sbjct: 52  AMCACTWHYFDNSESLEWIVTLQATMTLVGNFTLLAAAWLIWRSAK 97

>ref|NP_442291.1| hypothetical protein [Synechocystis sp. PCC 6803]
           gi|6136533|sp|Q55720|Y49L_SYNY3 Ycf49-like protein
           gi|7469614|pir||S76515 hypothetical protein -
           Synechocystis sp. (strain PCC 6803)
           gi|1001630|dbj|BAA10361.1| ORF_ID:sll0608~hypothetical
           protein [Synechocystis sp. PCC 6803]
          Length = 104

 Score = 60.8 bits (146), Expect = 1e-08
 Identities = 26/47 (55%), Positives = 34/47 (72%)
 Frame = -1

Query: 568 AFCACTWHFFYNDESLEVLVALQAALTLVGNATMCIAAYRIYKSSRE 428
           A CACTWHFF N   L+ LV LQA  T++GN T+C+AA+ IY+ S +
Sbjct: 52  ATCACTWHFFDNASQLDWLVTLQALTTVIGNITLCLAAWWIYRQSAQ 98

>ref|ZP_00072807.1| hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 105

 Score = 60.1 bits (144), Expect = 2e-08
 Identities = 26/47 (55%), Positives = 36/47 (76%)
 Frame = -1

Query: 568 AFCACTWHFFYNDESLEVLVALQAALTLVGNATMCIAAYRIYKSSRE 428
           A CACTWH+F ND +LE LV LQA++TL+GN T+  AA+ I+  S++
Sbjct: 52  AMCACTWHYFDNDPNLEWLVTLQASMTLLGNFTLLAAAWWIFSESKK 98

>ref|ZP_00115571.1| hypothetical protein [Synechococcus sp. WH 8102]
          Length = 92

 Score = 51.2 bits (121), Expect = 9e-06
 Identities = 24/48 (50%), Positives = 30/48 (62%)
 Frame = -1

Query: 568 AFCACTWHFFYNDESLEVLVALQAALTLVGNATMCIAAYRIYKSSRES 425
           A  ACTWH F N E+L  LV LQAALTL+GN  +  AA+ + +    S
Sbjct: 45  AMAACTWHLFDNSEALRPLVTLQAALTLIGNMVLAWAAWSLLQRRETS 92
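
The alignments above are all reported in Frame = -1, meaning the query was reverse-complemented and translated starting from its last base. A minimal sketch of that step, assuming the standard genetic code and using only the last three lines of the FASTA record; the helper names are illustrative and not part of BLAST:

```python
from itertools import product

# Last three lines of the FASTA record above (the 3' end of the query).
TAIL = (
    "AGGCGCTTTCACGGGATGATTTGTATATCCGGTATGCAGCTATGCACATCGTGGCATTAC"
    "CTACAAGAGTCAGTGCCGCTTGAAGAGCCACCAATACCTCTAGAGACTCATCGTTATAAA"
    "AGAAATGCCATGTGCAAGCACAGAATGC"
)

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons from the first base; ignore a trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Frame -1: reverse-complement, then read codons from the first base.
frame_minus_1 = translate(reverse_complement(TAIL))
print(frame_minus_1)
```

The printed peptide should begin with the residues shown on the Query lines of the alignments (AFCACTWHFFYNDESLEVLVALQAALTLVGNATMCIAAYRIYKSSRES).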

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 456,217,294
Number of Sequences: 1393205
Number of extensions: 9248069
Number of successful extensions: 17745
Number of sequences better than 10.0: 19
Number of HSP's better than 10.0 without gapping: 17251
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 17730
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20669577624
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
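
The bit scores and E-values in the hit list can be related to the parameters printed above through the standard Karlin-Altschul formulas: bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). The sketch below plugs in the gapped lambda and K and the "effective search space used" value from this report. It is an approximation only: the program may adjust the search space per subject sequence (an assumption here), so the computed E-values agree with the reported ones closely for the top hit and only to within rounding of the leading digit for the weaker hits.

```python
import math

# Values copied from the report above (gapped statistics).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 20669577624  # "effective search space used"


def bit_score(raw_score):
    """Convert a raw alignment score to a bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(bits):
    """Approximate E-value for a bit score over the whole search space."""
    return SEARCH_SPACE * 2.0 ** (-bits)


# Raw scores are the parenthesised numbers on the "Score =" lines.
for raw in (197, 149, 146, 144, 121):
    bits = bit_score(raw)
    print(f"raw {raw:3d}  bits {bits:5.1f}  E ~ {expect(bits):.1e}")
```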


EST assemble image


    clone         accession  position
1   SPDL015e10_f  BP052939     1  545
2   SPDL069e08_f  BP056282    25  569
3   SPDL047c09_f  BP054935    65  567
4   GENLf060f09   BP065573    79  364




Lotus japonicus
Kazusa DNA Research Institute