KMC001427A_c01

Fasta Sequence
>KMC001427A_C01 KMC001427A_c01
cataaaggagaggaacaccacacaagtccctttacataaaattatcagctgcctcataaa
aggataaaagttcaaagccgGACAGATTAACACTGGAAACTGAAGCTAAAGCATGATGCT
AAAATATGATACTGGATAGATTCAAGGATATGCTTTACATTCCTCTAACTTACAATAGCA
GCTCCCCCTTCAGGAGTGCTTATGATAGAGTCAACAAAGTGGTATAAATCGAAGATTCAT
ATAAAACAGTGCAACCTAAATGTCCTCTATAATATTACATAGCCCACAGGTGACTAATAT
CACTTAAGCATAAAAATAAAAAAAATAAGAATTTTAGCCAGGGCCTGGTTTAACCTTGGC
TTTGGAAGCACCTCCCGTACTAGACACCATATATGCTCAAGAACAGCCATGAAAATGAGA
CAGACCCATCATTTAAGTCTGTATGCTGCATTATACTTAAGTTTAACCCCTACTATTTAC
TTGAATTCGTGGTTGTGAAAAGAAGAGTTGGAATGTTTTCTCCTTTTCGCTTCAGGTTCT
CCTAAATGCAAGGAAACAAAATTATCTTCATCAGGGCCATTTCGAAGTTCAGAAAGACTT
TTAAATTCCAAATCTTCTTTTGGGTAGCGCTGACGAACT
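
As a rough cross-check, a record like the one above can be read with a few lines of Python. The sketch below assumes the FASTA text is available as a string; read_fasta and case_counts are hypothetical helper names, not part of any tool referenced on this page. Applied to the full record, the sequence length should agree with the 639 letters reported in the BLASTX query header. Lowercase and uppercase bases are counted separately; what the lowercase run at the start of the contig marks is not stated in this record.

```python
def read_fasta(text):
    """Return (header, sequence) for the first record in FASTA-formatted text."""
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                break  # a second record starts here; stop after the first
            header = line[1:]
        elif header is not None:
            chunks.append(line)
    return header, "".join(chunks)


def case_counts(sequence):
    """Return (lowercase_count, uppercase_count) for a sequence string."""
    lower = sum(1 for base in sequence if base.islower())
    return lower, len(sequence) - lower


# Shortened, illustrative excerpt only; paste the full record to check all 639 letters.
example = ">KMC001427A_C01 KMC001427A_c01\ncataaaggag\nGACAGATTAA\n"
header, sequence = read_fasta(example)
print(header, len(sequence), case_counts(sequence))
```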


Nr Search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001427A_C01 KMC001427A_c01
         (639 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T17712 hypothetical protein A222R - Chlorella virus PBCV-1        33  4.2
ref|NP_048569.2| similar to Aquifex cellulose synthase, correspo...    33  4.2
gb|AAL24122.1| unknown protein [Arabidopsis thaliana]                  32  5.5
ref|NP_565798.1| expressed protein; protein id: At2g35155.1, sup...    32  5.5
gb|AAM20488.1| putative protein [Arabidopsis thaliana] gi|250840...    32  9.3

>pir||T17712 hypothetical protein A222R - Chlorella virus PBCV-1
          Length = 333

 Score = 32.7 bits (73), Expect = 4.2
 Identities = 18/42 (42%), Positives = 24/42 (56%), Gaps = 2/42 (4%)
 Frame = +1

Query: 415 MRQTHHLSLYAALY--LSLTPTIYLNSWL*KEELECFLLFAS 534
           M+  + L LY  LY  LS+ PT++ NSW   E+  C L F S
Sbjct: 182 MQMWYSLQLYFTLYGWLSIGPTVFFNSWCSSED-TCVLTFGS 222
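
The start of the Query line above can be reproduced by translating the contig in frame +1 from nucleotide 415. The following is a minimal sketch using the standard genetic code, not the BLASTX implementation; translate is an illustrative helper, and the input is the 42-nucleotide stretch at positions 415 to 456 copied from the Fasta Sequence section.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(nucleotides):
    """Translate a DNA string codon by codon; unknown codons become 'X'."""
    seq = nucleotides.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


# Positions 415-456 of the contig (frame +1, since 415 = 1 + 3 * 138).
region = "ATGAGACAGACCCATCATTTAAGTCTGTATGCTGCATTATAC"
print(translate(region))  # MRQTHHLSLYAALY
```

The printed peptide matches the first 14 residues of the Query line, before the two-residue gap.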

>ref|NP_048569.2| similar to Aquifex cellulose synthase, corresponds to GenBank
           Accession Number AE000738 [Paramecium bursaria Chlorella
           virus 1] gi|11612647|gb|AAC96590.2| similar to Aquifex
           cellulose synthase, corresponds to GenBank Accession
           Number AE000738 [Paramecium bursaria Chlorella virus 1]
          Length = 432

 Score = 32.7 bits (73), Expect = 4.2
 Identities = 18/42 (42%), Positives = 24/42 (56%), Gaps = 2/42 (4%)
 Frame = +1

Query: 415 MRQTHHLSLYAALY--LSLTPTIYLNSWL*KEELECFLLFAS 534
           M+  + L LY  LY  LS+ PT++ NSW   E+  C L F S
Sbjct: 182 MQMWYSLQLYFTLYGWLSIGPTVFFNSWCSSED-TCVLTFGS 222

>gb|AAL24122.1| unknown protein [Arabidopsis thaliana]
          Length = 579

 Score = 32.3 bits (72), Expect = 5.5
 Identities = 17/39 (43%), Positives = 24/39 (60%)
 Frame = -2

Query: 632 QRYPKEDLEFKSLSELRNGPDEDNFVSLHLGEPEAKRRK 516
           Q  PK D    +L  L+N  +E+  +SLHLGEP+ K+ K
Sbjct: 543 QEIPKLD----NLMALKNSSEEEVNISLHLGEPKLKKPK 577

>ref|NP_565798.1| expressed protein; protein id: At2g35155.1, supported by cDNA:
           gi_16604658 [Arabidopsis thaliana]
           gi|20197214|gb|AAM14975.1| expressed protein
           [Arabidopsis thaliana] gi|23297468|gb|AAN12976.1|
           unknown protein [Arabidopsis thaliana]
          Length = 579

 Score = 32.3 bits (72), Expect = 5.5
 Identities = 17/39 (43%), Positives = 24/39 (60%)
 Frame = -2

Query: 632 QRYPKEDLEFKSLSELRNGPDEDNFVSLHLGEPEAKRRK 516
           Q  PK D    +L  L+N  +E+  +SLHLGEP+ K+ K
Sbjct: 543 QEIPKLD----NLMALKNSSEEEVNISLHLGEPKLKKPK 577

>gb|AAM20488.1| putative protein [Arabidopsis thaliana] gi|25084087|gb|AAN72171.1|
           putative protein [Arabidopsis thaliana]
          Length = 607

 Score = 31.6 bits (70), Expect = 9.3
 Identities = 17/39 (43%), Positives = 24/39 (60%), Gaps = 1/39 (2%)
 Frame = -2

Query: 617 EDLEFKSLSELRNGPDEDNF-VSLHLGEPEAKRRKHSNS 504
           E+LE K+LS L+     D    SL LGE + K+RK ++S
Sbjct: 560 ENLESKNLSSLKTSSSGDEIGFSLQLGESDTKKRKRTDS 598

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 508,900,792
Number of Sequences: 1393205
Number of extensions: 10098255
Number of successful extensions: 18692
Number of sequences better than 10.0: 11
Number of HSP's better than 10.0 without gapping: 18210
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 18682
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 26723359358
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
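
Several figures in this report can be rechecked from the parameters listed above. Bit scores follow from raw scores via S' = (lambda * S - ln K) / ln 2, and the effective database length equals the database length minus the number of sequences times the effective HSP length. The sketch below recomputes those values; the function names are illustrative, and all numbers are copied from this report.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length reduced by the effective HSP length for every sequence."""
    return db_letters - n_sequences * hsp_length


# Gapped parameters apply to the reported alignment scores.
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410
for raw in (73, 72, 70):
    print(raw, round(bit_score(raw, GAPPED_LAMBDA, GAPPED_K), 1))
# 73 32.7
# 72 32.3
# 70 31.6

# Ungapped parameters apply to the S1 threshold.
print(round(bit_score(41, 0.318, 0.135), 1))  # 21.7

print(effective_db_length(448_689_247, 1_393_205, 118))  # 284291057
```

The reported effective search space, 26,723,359,358, is exactly 94 times this effective database length, which would correspond to an effective query length of 94 residues; how that query length was derived is not shown in the report.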


EST assemble image


no.  clone          accession  start  end
1    MPDL071h03_f   AV780163       1  508
2    MWM209f04_f    AV767958     102  423
3    MPD051a08_f    AV773439     111  599
4    MFB072f10_f    BP039260     115  547
5    MR014c07_f     BP077018     137  493
6    GENLf088a08    BP067123     138  645
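
The table above lists each EST clone with its accession and its position in the assembly. As a sketch of how such rows might be used, the snippet below computes each clone's span and the per-position read depth. It assumes the two trailing numbers are 1-based, inclusive start and end coordinates, which the page does not state explicitly; parse_rows and depth_profile are hypothetical helper names.

```python
ROWS = """\
1 MPDL071h03_f AV780163 1 508
2 MWM209f04_f AV767958 102 423
3 MPD051a08_f AV773439 111 599
4 MFB072f10_f BP039260 115 547
5 MR014c07_f BP077018 137 493
6 GENLf088a08 BP067123 138 645
"""


def parse_rows(text):
    """Parse 'index clone accession start end' rows into tuples."""
    clones = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 5:
            continue  # skip blank or malformed lines
        _, clone, accession, start, end = fields
        clones.append((clone, accession, int(start), int(end)))
    return clones


def depth_profile(clones):
    """Return a list where depth[pos] is the number of clones covering pos."""
    last = max(end for *_, end in clones)
    depth = [0] * (last + 1)  # index 0 is unused (coordinates are 1-based)
    for _, _, start, end in clones:
        for pos in range(start, end + 1):
            depth[pos] += 1
    return depth


clones = parse_rows(ROWS)
depth = depth_profile(clones)
for clone, accession, start, end in clones:
    print(clone, accession, end - start + 1)
print("max depth:", max(depth))
```

Note that the largest end coordinate (645) exceeds the 639 letters of the contig; the coordinates may refer to a gapped assembly alignment, but the page does not say.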




Lotus japonicus
Kazusa DNA Research Institute