KMC001998A_c02

Fasta Sequence
>KMC001998A_C02 KMC001998A_c02
ATTTGGAACAGTACGTCACACCATATATTGCACACAACAAAGGAAACGGAAAGGAAAAAC
TACATATAAACACCATAGACCAATATACAAATAAAAGTCAGAAGTATCAGAATGGACAAC
ACGATAAGAATTATAATCTCATGTCAGAAATCAAGAAACTGGGGAGGCTAATGCAAATTA
AGTAAATCATTCTATCTACGACATTTCACCACCCACGCATGAGGTGTTTGACATAGCTGC
AATGGTACGAAGTTGAAGGAAACATATTACTGAAGCAAGTCAATGGGCGCAGTAGACCAA
CACAGAGCATATGCTAAATGAATCACCTAGAGTAGCCAGAAGGAGGATATCCTGGTGGAG
GATAGGCAGCAGGGGGGTAACCAGTAGCAGGATAGCCCTGAGCAGGTGGTGGGGCGGGTG
GATAACCGTAGGGCTGCCCGTATGGTGCTGGCTGTGGTGCGGGCTGTCCATATGGTGCTG
GCTGTGGCGCATACCCAACTGAAGGAGGAACCGGCTGATCTATGCGTGACATCTGCTGAG
CAGGGGGAACTGCCATTGCAGGTTGGGGTCCAAACATCCCATCACGCTTGTCCATTTCGA
CCTTGTGCTGAATCTGCATGCATGCGCAG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001998A_C02 KMC001998A_c02
         (629 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAN74842.1| Unknown protein [Oryza sativa (japonica cultivar-...   129  3e-29
ref|NP_194078.2| expressed protein; protein id: At4g23470.1, sup...   117  1e-25
gb|AAM63607.1| unknown [Arabidopsis thaliana]                         117  1e-25
ref|NP_176568.1| unknown protein; protein id: At1g63830.1, suppo...   108  4e-23
ref|NP_194079.1| putative protein [Arabidopsis thaliana] gi|7485...    99  5e-20

>gb|AAN74842.1| Unknown protein [Oryza sativa (japonica cultivar-group)]
          Length = 241

 Score =  129 bits (324), Expect = 3e-29
 Identities = 69/102 (67%), Positives = 72/102 (69%), Gaps = 7/102 (6%)
 Frame = -2

Query: 619 MQIQHKVEMDKRDGMFGPQPAMAVPPAQQMSRIDQPVPPSVGYAPQPAPYGQPAPQPAPY 440
           +Q QHK+EMDKRDG FGPQP MAVPP QQMSRIDQP+PP VGY PQ   YGQ      PY
Sbjct: 134 VQTQHKIEMDKRDGKFGPQP-MAVPPMQQMSRIDQPIPPPVGYTPQQPAYGQ------PY 186

Query: 439 GQPYGYPPAPPPAQGYPATGYPPA------AYPPPG-YPPSG 335
           G   GYPPA PPAQGYP   YPPA      AYPPPG YPP G
Sbjct: 187 G---GYPPA-PPAQGYPPAAYPPAGYPQGGAYPPPGSYPPPG 224

>ref|NP_194078.2| expressed protein; protein id: At4g23470.1, supported by cDNA:
           25694., supported by cDNA: gi_17065515, supported by
           cDNA: gi_20148522 [Arabidopsis thaliana]
           gi|17065516|gb|AAL32912.1| Unknown protein [Arabidopsis
           thaliana] gi|20148523|gb|AAM10152.1| unknown protein
           [Arabidopsis thaliana]
          Length = 255

 Score =  117 bits (294), Expect = 1e-25
 Identities = 63/100 (63%), Positives = 68/100 (68%), Gaps = 5/100 (5%)
 Frame = -2

Query: 628 CACMQIQHKVEMDKRDGMFGPQPAMAVPPAQQMSRIDQPVPPSVGYAPQ----PAPYGQP 461
           CACMQ QHK+EMDKRDG FGPQP MAVPPAQQMSR DQ  PP+VGY PQ    P+ Y Q 
Sbjct: 162 CACMQTQHKMEMDKRDGKFGPQP-MAVPPAQQMSRFDQATPPAVGYPPQQGYPPSGYPQH 220

Query: 460 APQPAPYGQPYGYPPAPPPAQGYPATGYPPAAY-PPPGYP 344
            PQ  P   P GYP  PPP+     + YPP AY PPP YP
Sbjct: 221 PPQGYP---PSGYPQNPPPS---AYSQYPPGAYPPPPAYP 254

>gb|AAM63607.1| unknown [Arabidopsis thaliana]
          Length = 247

 Score =  117 bits (294), Expect = 1e-25
 Identities = 63/100 (63%), Positives = 68/100 (68%), Gaps = 5/100 (5%)
 Frame = -2

Query: 628 CACMQIQHKVEMDKRDGMFGPQPAMAVPPAQQMSRIDQPVPPSVGYAPQ----PAPYGQP 461
           CACMQ QHK+EMDKRDG FGPQP MAVPPAQQMSR DQ  PP+VGY PQ    P+ Y Q 
Sbjct: 154 CACMQTQHKMEMDKRDGKFGPQP-MAVPPAQQMSRFDQATPPAVGYPPQQGYPPSGYPQH 212

Query: 460 APQPAPYGQPYGYPPAPPPAQGYPATGYPPAAY-PPPGYP 344
            PQ  P   P GYP  PPP+     + YPP AY PPP YP
Sbjct: 213 PPQGYP---PSGYPQNPPPS---AYSQYPPGAYPPPPAYP 246

>ref|NP_176568.1| unknown protein; protein id: At1g63830.1, supported by cDNA:
           gi_19424092 [Arabidopsis thaliana]
           gi|25404446|pir||D96663 unknown protein, 55304-53614
           [imported] - Arabidopsis thaliana
           gi|12325014|gb|AAG52456.1|AC010852_13 unknown protein;
           55304-53614 [Arabidopsis thaliana]
           gi|19424093|gb|AAL87329.1| unknown protein [Arabidopsis
           thaliana] gi|21436183|gb|AAM51379.1| unknown protein
           [Arabidopsis thaliana]
          Length = 232

 Score =  108 bits (271), Expect = 4e-23
 Identities = 56/95 (58%), Positives = 59/95 (61%)
 Frame = -2

Query: 628 CACMQIQHKVEMDKRDGMFGPQPAMAVPPAQQMSRIDQPVPPSVGYAPQPAPYGQPAPQP 449
           CACMQ QHK+EMDKRDG+FG QP M VPPAQQMSR DQPVPP VGY              
Sbjct: 164 CACMQTQHKLEMDKRDGVFGSQP-MGVPPAQQMSRFDQPVPPPVGY-------------- 208

Query: 448 APYGQPYGYPPAPPPAQGYPATGYPPAAYPPPGYP 344
                P  YPP        PA GYPPA+YPPPGYP
Sbjct: 209 -----PQSYPP--------PAQGYPPASYPPPGYP 230

>ref|NP_194079.1| putative protein [Arabidopsis thaliana] gi|7485554|pir||T05386
           hypothetical protein F16G20.180 - Arabidopsis thaliana
           gi|3451073|emb|CAA20469.1| putative protein [Arabidopsis
           thaliana] gi|7269196|emb|CAB79303.1| putative protein
           [Arabidopsis thaliana]
          Length = 85

 Score = 99.0 bits (245), Expect = 5e-20
 Identities = 55/91 (60%), Positives = 60/91 (65%), Gaps = 5/91 (5%)
 Frame = -2

Query: 601 VEMDKRDGMFGPQPAMAVPPAQQMSRIDQPVPPSVGYAPQ----PAPYGQPAPQPAPYGQ 434
           +EMDKRDG FGPQP MAVPPAQQMSR DQ  PP+VGY PQ    P+ Y Q  PQ  P   
Sbjct: 1   MEMDKRDGKFGPQP-MAVPPAQQMSRFDQATPPAVGYPPQQGYPPSGYPQHPPQGYP--- 56

Query: 433 PYGYPPAPPPAQGYPATGYPPAAY-PPPGYP 344
           P GYP  PPP+     + YPP AY PPP YP
Sbjct: 57  PSGYPQNPPPS---AYSQYPPGAYPPPPAYP 84
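
Every alignment above is reported in Frame = -2, meaning the protein query was read from the reverse complement of the contig, starting one base in from its 3' end. The following is a minimal sketch of that translation, not the code BLASTX itself runs. It assumes the standard genetic code; the names `translate_frame`, `reverse_complement` and `TAIL` are made up for this illustration. `TAIL` holds only the last 89 nt of the FASTA record (its final two lines), which is enough to recover the start of the aligned query.

```python
# Standard genetic code, built from the usual TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate_frame(seq, frame):
    """Translate seq in frame +1..+3 or -1..-3.

    Negative frames are read from the reverse complement; frame -k
    skips k-1 bases from the 3' end of the forward strand.
    """
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +/-1, +/-2, +/-3")
    strand = seq if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    # Codons containing anything other than A/C/G/T become "X".
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last 89 nt of KMC001998A_c02 (positions 541-629 of the FASTA record).
TAIL = (
    "CAGGGGGAACTGCCATTGCAGGTTGGGGTCCAAACATCCCATCACGCTTGTCCATTTCGA"
    "CCTTGTGCTGAATCTGCATGCATGCGCAG"
)

if __name__ == "__main__":
    print(translate_frame(TAIL, -2))
```

The printed peptide can be compared with the Query lines that begin at position 628 in the alignments above.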

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 640,424,059
Number of Sequences: 1393205
Number of extensions: 18217675
Number of successful extensions: 246942
Number of sequences better than 10.0: 6554
Number of HSP's better than 10.0 without gapping: 83939
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 158276
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 25870486187
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
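
The length and score figures in this report are related by the usual Karlin-Altschul formulas, and the arithmetic can be checked directly. The sketch below is an illustration under stated assumptions: it uses the gapped Lambda and K as printed (so they are rounded), and it assumes the query length used for the search space is the translated length, 629 // 3 residues. All function and constant names are hypothetical.

```python
import math

# Values copied from the BLASTX report.
QUERY_NT = 629              # query length in nucleotides
DB_LETTERS = 448_689_247    # length of database
DB_SEQS = 1_393_205         # number of sequences in database
HSP_LEN = 118               # effective HSP length
LAMBDA_GAPPED = 0.267       # gapped Lambda, rounded as printed
K_GAPPED = 0.0410           # gapped K, rounded as printed


def effective_db_length():
    """Database length minus one HSP length per database sequence."""
    return DB_LETTERS - DB_SEQS * HSP_LEN


def effective_search_space():
    """Effective query length (in residues) times effective database length."""
    query_aa = QUERY_NT // 3  # assumption: translated query length
    return (query_aa - HSP_LEN) * effective_db_length()


def bit_score(raw_score):
    """Convert a raw alignment score to bits: (Lambda*S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect(raw_score):
    """Expected number of chance hits: search space * 2**(-bit score)."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print(effective_db_length())
    print(effective_search_space())
    print(round(bit_score(324), 1), expect(324))
```

The two lengths match the reported "effective length of database" and "effective search space used" exactly. Bit scores and E-values agree with the printed ones only approximately, because Lambda and K are rounded in the report and the printed bit scores are themselves rounded.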


EST assemble image


no. clone accession start end
1 MR055b08_f BP080218 1 414
2 MF084f06_f BP032738 1 556
3 GNf092g01 BP074187 42 478
4 MPD015d12_f AV771024 42 533
5 MR033h01_f BP078575 43 443
6 GNf077d06 BP073056 43 531
7 MF009a10_f BP028675 43 609
8 MWM113h02_f AV766539 43 552
9 MWM237g09_f AV768358 43 485
10 MF052h08_f BP031056 43 510
11 MR061h09_f BP080707 43 460
12 MWM102h01_f AV766395 44 520
13 GENf029e12 BP059585 45 424
14 GNf039d01 BP070223 45 555
15 MF002c10_f BP028326 46 590
16 GNf051f01 BP071164 47 513
17 SPD073e03_f BP049838 48 606
18 MR072g11_f BP081567 48 478
19 SPD067b03_f BP049320 57 558
20 MFBL031h04_f BP042830 60 606
21 MFB062f12_f BP038518 62 608
22 SPD010c07_f BP044780 62 608
23 MFB072a07_f BP039209 62 609
24 SPD094g08_f BP051538 62 581
25 MFB022f09_f BP035603 65 649
26 MPD093h05_f AV776129 66 187
27 MR080c05_f BP082143 68 283
28 MPD083d07_f AV775448 83 582
29 MR098g12_f BP083535 87 461
30 MWM080a09_f AV766018 106 644
31 GNf001h02 BP067473 115 218




Lotus japonicus
Kazusa DNA Research Institute