KMC002467A_c01

Fasta Sequence
>KMC002467A_C01 KMC002467A_c01
atCAACCAAAGTTGCTTCAAATTAGAATAAAAAATGGCAATTTTGCCTCGAAGCATTATT
CCATTCTCTCTTCTAGGAGTGAGCACTTGCTTAATATTCAAAATTTCAAATCCTTTTTTA
CACTTGAATTGCAATTTGGCCTTTATTGGGAAGAATAGTCGAATGCAAACAAATAATTAA
ACTTAATCTACATCATTGGCCAGTCTTGATGCTGTTCATAACTGATCTTGAAGCTTTAAC
AAAGCCACATCGCCGCCACCTAAGCTGGGGAGAGAGCATTGCATAATGTCTTGTATGCAT
TTACACACAGTTTGAACTTCTCTTCAGCCATTGCCTGGGACGAGCCTTGATGCTTATCAG
GATGCCATTTTAAAGCTGATAACCGGAAGGCATTTTTAACATCTTCTATCTTCAATGGAC
CTGTTGAAGGCAGACCAAGGATTGTTCTATCAGAGCTTGATCCTACACAACAGGAACCAT
GATCCTCATTATCAGACTCATCATCACTTGCAGTTCTC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002467A_C01 KMC002467A_c01
         (518 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_671865.1| unknown protein; protein id: At2g18465.1 [Arabi...   102  2e-21
gb|AAM97127.1| unknown protein [Arabidopsis thaliana] gi|2489965...   102  2e-21
ref|NP_181738.1| unknown protein; protein id: At2g42080.1 [Arabi...    86  3e-16
gb|AAL87325.1| unknown protein [Arabidopsis thaliana]                  82  4e-15
emb|CAB53495.1| CAA303722.1 protein [Oryza sativa]                     67  1e-10
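
The summary table above can be read programmatically. The following is a minimal sketch, assuming each summary line ends with two whitespace-separated fields (bit score, then E value) as in this report; `parse_hits` and the dictionary keys are hypothetical names chosen for illustration, not part of BLAST.

```python
# Summary lines copied verbatim from the table above.
SUMMARY = """\
ref|NP_671865.1| unknown protein; protein id: At2g18465.1 [Arabi...   102  2e-21
gb|AAM97127.1| unknown protein [Arabidopsis thaliana] gi|2489965...   102  2e-21
ref|NP_181738.1| unknown protein; protein id: At2g42080.1 [Arabi...    86  3e-16
gb|AAL87325.1| unknown protein [Arabidopsis thaliana]                  82  4e-15
emb|CAB53495.1| CAA303722.1 protein [Oryza sativa]                     67  1e-10
"""


def parse_hits(text):
    """Split each non-empty line into identifier, description, score, E value."""
    hits = []
    for line in text.splitlines():
        if not line.strip():
            continue
        # Assumption: the last two fields are always the score and the E value.
        description, bits, evalue = line.rsplit(None, 2)
        hits.append(
            {
                "id": description.split()[0],
                "description": description,
                "bits": float(bits),
                "evalue": float(evalue),
            }
        )
    return hits


hits = parse_hits(SUMMARY)
for hit in hits:
    print(hit["id"], hit["bits"], hit["evalue"])
```

The descriptions are truncated by BLAST itself ("[Arabi..."), so the full definition lines have to be taken from the alignment section rather than from this table.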

>ref|NP_671865.1| unknown protein; protein id: At2g18465.1 [Arabidopsis thaliana]
          Length = 257

 Score =  102 bits (255), Expect = 2e-21
 Identities = 45/62 (72%), Positives = 55/62 (88%)
 Frame = -2

Query: 454 SDRTILGLPSTGPLKIEDVKNAFRLSALKWHPDKHQGSSQAMAEEKFKLCVNAYKTLCNA 275
           S+R +LGLP  GP+K++DVKNAFR SALKWHPDKHQG SQ  A+EKFKLCV+AYK+LC+A
Sbjct: 196 SERIVLGLPLDGPIKVDDVKNAFRSSALKWHPDKHQGPSQVAAQEKFKLCVDAYKSLCSA 255

Query: 274 LS 269
           L+
Sbjct: 256 LA 257
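
The Query coordinates in the alignment above run from 454 down to 269 because the match lies in Frame = -2, that is, on the reverse complement of the contig. The sketch below shows one way to recover the aligned query peptide from the Fasta sequence. It assumes the standard genetic code and 1-based inclusive coordinates; the helper names (`reverse_complement`, `translate_minus_strand`) are illustrative and not taken from any BLAST tool.

```python
# Contig sequence copied from the "Fasta Sequence" section (518 letters).
QUERY = (
    "atCAACCAAAGTTGCTTCAAATTAGAATAAAAAATGGCAATTTTGCCTCGAAGCATTATT"
    "CCATTCTCTCTTCTAGGAGTGAGCACTTGCTTAATATTCAAAATTTCAAATCCTTTTTTA"
    "CACTTGAATTGCAATTTGGCCTTTATTGGGAAGAATAGTCGAATGCAAACAAATAATTAA"
    "ACTTAATCTACATCATTGGCCAGTCTTGATGCTGTTCATAACTGATCTTGAAGCTTTAAC"
    "AAAGCCACATCGCCGCCACCTAAGCTGGGGAGAGAGCATTGCATAATGTCTTGTATGCAT"
    "TTACACACAGTTTGAACTTCTCTTCAGCCATTGCCTGGGACGAGCCTTGATGCTTATCAG"
    "GATGCCATTTTAAAGCTGATAACCGGAAGGCATTTTTAACATCTTCTATCTTCAATGGAC"
    "CTGTTGAAGGCAGACCAAGGATTGTTCTATCAGAGCTTGATCCTACACAACAGGAACCAT"
    "GATCCTCATTATCAGACTCATCATCACTTGCAGTTCTC"
).upper()

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_minus_strand(seq, high, low):
    """Translate forward-strand positions high..low (1-based) on the minus strand."""
    rc = reverse_complement(seq)
    # Forward position p maps to 0-based index len(seq) - p on the reverse strand.
    region = rc[len(seq) - high : len(seq) - low + 1]
    return "".join(
        CODON_TABLE.get(region[i : i + 3], "X")
        for i in range(0, len(region) - 2, 3)
    )


peptide = translate_minus_strand(QUERY, 454, 269)
print(peptide)
```

Position 454 maps to offset 64 on the reverse strand, and 64 mod 3 = 1, which corresponds to the second reading frame of that strand and agrees with the reported Frame = -2.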

>gb|AAM97127.1| unknown protein [Arabidopsis thaliana] gi|24899655|gb|AAN65042.1|
           unknown protein [Arabidopsis thaliana]
          Length = 268

 Score =  102 bits (255), Expect = 2e-21
 Identities = 45/62 (72%), Positives = 55/62 (88%)
 Frame = -2

Query: 454 SDRTILGLPSTGPLKIEDVKNAFRLSALKWHPDKHQGSSQAMAEEKFKLCVNAYKTLCNA 275
           S+R +LGLP  GP+K++DVKNAFR SALKWHPDKHQG SQ  A+EKFKLCV+AYK+LC+A
Sbjct: 207 SERIVLGLPLDGPIKVDDVKNAFRSSALKWHPDKHQGPSQVAAQEKFKLCVDAYKSLCSA 266

Query: 274 LS 269
           L+
Sbjct: 267 LA 268

>ref|NP_181738.1| unknown protein; protein id: At2g42080.1 [Arabidopsis thaliana]
           gi|25408783|pir||F84849 hypothetical protein At2g42080
           [imported] - Arabidopsis thaliana
           gi|1871176|gb|AAB63536.1| unknown protein [Arabidopsis
           thaliana] gi|22531201|gb|AAM97104.1| unknown protein
           [Arabidopsis thaliana] gi|25083942|gb|AAN72139.1|
           unknown protein [Arabidopsis thaliana]
          Length = 263

 Score = 85.5 bits (210), Expect = 3e-16
 Identities = 42/85 (49%), Positives = 57/85 (66%), Gaps = 6/85 (7%)
 Frame = -2

Query: 505 DDESDNEDHGSCCVGSSSD------RTILGLPSTGPLKIEDVKNAFRLSALKWHPDKHQG 344
           D++ + ED+ S    S S+      R  LGL  +GPL ++DVK+A+R  ALKWHPD+HQG
Sbjct: 177 DEDEEEEDYTSDSSDSESEPNQVSHRQALGLSPSGPLNLKDVKHAYRTCALKWHPDRHQG 236

Query: 343 SSQAMAEEKFKLCVNAYKTLCNALS 269
           S++  AE KFKLC  AY++LC  LS
Sbjct: 237 STKEAAEAKFKLCSVAYQSLCEKLS 261

>gb|AAL87325.1| unknown protein [Arabidopsis thaliana]
          Length = 254

 Score = 82.0 bits (201), Expect = 4e-15
 Identities = 42/90 (46%), Positives = 56/90 (61%), Gaps = 7/90 (7%)
 Frame = -2

Query: 517 RTASDDESDNEDHGSCCVGSS-------SDRTILGLPSTGPLKIEDVKNAFRLSALKWHP 359
           R   ++E + E++     G S       S R  LGL S+GPL +EDVK A+R  ALKWHP
Sbjct: 163 RLDEEEEEEEEEYEYSSTGVSDTEPNQESHRQTLGLSSSGPLNLEDVKIAYRACALKWHP 222

Query: 358 DKHQGSSQAMAEEKFKLCVNAYKTLCNALS 269
           D+H  S++  AEEKFKLC  AY++LC  L+
Sbjct: 223 DRHHTSTKNEAEEKFKLCTVAYQSLCEKLA 252

>emb|CAB53495.1| CAA303722.1 protein [Oryza sativa]
          Length = 308

 Score = 67.0 bits (162), Expect = 1e-10
 Identities = 39/83 (46%), Positives = 49/83 (58%), Gaps = 1/83 (1%)
 Frame = -2

Query: 508 SDDESDNEDHGSCCVGSSSDRTILGLPSTGPLKIEDVKNAFRLSALKWHPDKHQGSSQ-A 332
           SDDES++E      +GS + R ILGLP+ GPL ++ VK A       W         + A
Sbjct: 232 SDDESEDETTN---IGSHAHRAILGLPACGPLTLDAVKTA---QCTDWSVTNQCAKQRMA 285

Query: 331 MAEEKFKLCVNAYKTLCNALSPA 263
           +AEEKFKLCVNAY +LCN L  A
Sbjct: 286 VAEEKFKLCVNAYNSLCNVLKAA 308

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 421,375,620
Number of Sequences: 1393205
Number of extensions: 8397384
Number of successful extensions: 21424
Number of sequences better than 10.0: 622
Number of HSP's better than 10.0 without gapping: 20424
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 21276
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 16442828304
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
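
The figures in the statistics block above are related to one another and to the reported scores. The sketch below reproduces them under two assumptions that are not stated in the report itself: the translated query length is taken as 518 // 3 residues, and bit scores and E values follow the usual Karlin-Altschul relations S' = (lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'). Variable names are illustrative, and lambda and K are used as printed (rounded), so the results are approximate.

```python
import math

# Values copied from the report.
QUERY_LETTERS = 518            # nucleotides
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 115
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

# Assumption: the query length is counted in translated residues.
effective_query_length = QUERY_LETTERS // 3 - EFFECTIVE_HSP_LENGTH
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
search_space = effective_query_length * effective_db_length


def bit_score(raw_score):
    """Convert a raw alignment score to bits using the gapped parameters."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2 ** (-bit_score(raw_score))


print(effective_db_length)   # compare with "effective length of database"
print(search_space)          # compare with "effective search space used"
print(bit_score(255), expect(255))  # top hit, raw score 255
print(bit_score(210))               # third hit, raw score 210
```

With these inputs the effective database length and search space match the reported values exactly, the raw score of 210 converts to about 85.5 bits, and the top hit comes out near 102.8 bits with an E value of roughly 2e-21, close to the figures printed in the report.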


EST assembly


no.  clone         accession  position
1    MF087f12_f    BP032891     1  411
2    GENf069e04    BP061321     3  324
3    MFB067c02_f   BP038848     3  497
4    MWM109e11_f   AV766476     3  382
5    MPD016f03_f   AV771107     3  466
6    MPDL018c03_f  AV777394     5  519
7    MFB024g08_f   BP035759    15  515
8    SPD046e12_f   BP047675    17  259
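
The clone table above can be used to estimate how many ESTs support each part of the contig. This is a minimal sketch, assuming the two position values are 1-based, inclusive start and end coordinates on the assembly (the page does not say so explicitly); `depth_at` and `clone_length` are hypothetical helper names.

```python
# (clone, accession, start, end) copied from the table above.
# Assumption: start and end are 1-based, inclusive assembly coordinates.
ESTS = [
    ("MF087f12_f", "BP032891", 1, 411),
    ("GENf069e04", "BP061321", 3, 324),
    ("MFB067c02_f", "BP038848", 3, 497),
    ("MWM109e11_f", "AV766476", 3, 382),
    ("MPD016f03_f", "AV771107", 3, 466),
    ("MPDL018c03_f", "AV777394", 5, 519),
    ("MFB024g08_f", "BP035759", 15, 515),
    ("SPD046e12_f", "BP047675", 17, 259),
]


def clone_length(start, end):
    """Number of assembly positions spanned by one clone."""
    return end - start + 1


def depth_at(position):
    """Count the clones whose span includes the given assembly position."""
    return sum(start <= position <= end for _, _, start, end in ESTS)


for clone, accession, start, end in ESTS:
    print(clone, accession, clone_length(start, end))

# Depth across the region matched by the top BLASTX hit (query 269..454).
print([depth_at(p) for p in (269, 300, 454)])
```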




Lotus japonicus
Kazusa DNA Research Institute