KMC001857A_c01

Fasta Sequence
>KMC001857A_C01 KMC001857A_c01
ccatactatcatATGGCAATCAGCACCATTACTATAATATATAAAAGCTCATGATGCTAG
TTACATTGTTCTCCACAAGCCAGTTAATTACTAGATGAACGGATACAACAACAGCAGTAA
TGGTTCAGACTAAATCACGTCAAAGATATAAAGAAGCACATAAATTCCATACTATCATAT
GGCAATCAGCACAATTACCAGTGACATACATGTATATTATTGAAAACACACATTCGATTG
AAAAATCTATAGCTGAATGCCCCATACCAGTAGCCGCATTACAAACCATTCGTTTACACT
CTAGAAACGACATGAATGTTTGTTAGCACAACCCACTAAAACAAAAGAGCCAGCAATCTT
CTCAACTTAAAGATAATCCATCTTGAAAGCCATTTTCAGTTCCGTAAATTGTACTCAGTA
GGGAATGATCTTCCATCGCTGGGTTATCACCTTTGTTCCATTGCCAGAGGACTATAGTAG
TGCCATCATGCACACCTCCAGAATTCTTATCACCATGATAAGCATCCATATTAAGATGAA
TATTATTCACCATCCTTACAGATCTGTAGCTATCACCCAGGTCCTTGCTTTCAGTCCACA
GAATAGACTCATCGAGATAGTCTGGCTTGTAGGGTATCAGCCGAACAGGATGACTTTCAC
CAATGGAATGCTTAATGGCCTCCCCAGTGGCCTTGTTGACCAAAGAAAATGCAGGGCAAC
GCTCTGCATCCTTCACCCGGGTACTGTACTTCTCATC
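
Note on reading frame: the BLASTX alignments in the Nr search section report Frame = -1 and begin at query position 757, which means the protein is read from the reverse complement of this sequence, starting at its last base. The following is a minimal sketch of that translation for the final FASTA line only (positions 721-757). It assumes the standard genetic code; the helper names `reverse_complement` and `translate` are illustrative and are not part of the Kazusa or BLAST tooling.

```python
# Standard genetic code, built from the conventional TCAG-ordered amino acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence (case-insensitive)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons from the first base; unknown codons become 'X'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# Last line of the FASTA record above (query positions 721-757).
query_tail = "GCTCTGCATCCTTCACCCGGGTACTGTACTTCTCATC"

# Frame -1: reverse-complement, then read codons starting at the first base.
peptide = translate(reverse_complement(query_tail))
print(peptide)  # DEKYSTRVKDAE
```

The printed peptide matches the first twelve residues of the Query line in each alignment (DEKYSTRVKDAE...). The aligned span 757 to 443 covers 315 nucleotides, that is 105 codons, which agrees with the 105 aligned positions reported under Identities.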


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001857A_C01 KMC001857A_c01
         (757 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_565899.1| expressed protein; protein id: At2g39050.1, sup...   175  7e-43
pir||E84812 hypothetical protein At2g39050 [imported] - Arabidop...   175  7e-43
dbj|BAB16331.1| putative r40c1 protein [Oryza sativa (japonica c...   167  3e-40
pir||T03960 r40g2 protein - rice (fragment) gi|1658313|emb|CAA70...   162  4e-39
pir||T03962 r40g3 protein - rice gi|1658315|emb|CAA70175.1| osr4...   162  4e-39

>ref|NP_565899.1| expressed protein; protein id: At2g39050.1, supported by cDNA:
           39558., supported by cDNA: gi_15724196, supported by
           cDNA: gi_20147402 [Arabidopsis thaliana]
           gi|15724197|gb|AAL06490.1|AF411801_1 At2g39050/T7F6.22
           [Arabidopsis thaliana] gi|20147403|gb|AAM10411.1|
           At2g39050/T7F6.22 [Arabidopsis thaliana]
           gi|20197446|gb|AAC79615.2| expressed protein
           [Arabidopsis thaliana] gi|21593493|gb|AAM65460.1|
           unknown [Arabidopsis thaliana]
          Length = 317

 Score =  175 bits (443), Expect = 7e-43
 Identities = 79/105 (75%), Positives = 89/105 (84%)
 Frame = -1

Query: 757 DEKYSTRVKDAERCPAFSLVNKATGEAIKHSIGESHPVRLIPYKPDYLDESILWTESKDL 578
           DEKYST+VKDA+  P F+LVNKATGEA+KHS+G +HPV LI Y PD LDES+LWTESKD 
Sbjct: 205 DEKYSTKVKDADGHPCFALVNKATGEAMKHSVGATHPVHLIRYVPDKLDESVLWTESKDF 264

Query: 577 GDSYRSVRMVNNIHLNMDAYHGDKNSGGVHDGTTIVLWQWNKGDN 443
           GD YR++RMVNN  LN+DAYHGD  SGGV DGTTIVLW WNKGDN
Sbjct: 265 GDGYRTIRMVNNTRLNVDAYHGDSKSGGVRDGTTIVLWDWNKGDN 309

>pir||E84812 hypothetical protein At2g39050 [imported] - Arabidopsis thaliana
          Length = 326

 Score =  175 bits (443), Expect = 7e-43
 Identities = 79/105 (75%), Positives = 89/105 (84%)
 Frame = -1

Query: 757 DEKYSTRVKDAERCPAFSLVNKATGEAIKHSIGESHPVRLIPYKPDYLDESILWTESKDL 578
           DEKYST+VKDA+  P F+LVNKATGEA+KHS+G +HPV LI Y PD LDES+LWTESKD 
Sbjct: 205 DEKYSTKVKDADGHPCFALVNKATGEAMKHSVGATHPVHLIRYVPDKLDESVLWTESKDF 264

Query: 577 GDSYRSVRMVNNIHLNMDAYHGDKNSGGVHDGTTIVLWQWNKGDN 443
           GD YR++RMVNN  LN+DAYHGD  SGGV DGTTIVLW WNKGDN
Sbjct: 265 GDGYRTIRMVNNTRLNVDAYHGDSKSGGVRDGTTIVLWDWNKGDN 309

>dbj|BAB16331.1| putative r40c1 protein [Oryza sativa (japonica cultivar-group)]
          Length = 268

 Score =  167 bits (422), Expect(2) = 3e-40
 Identities = 73/105 (69%), Positives = 90/105 (85%)
 Frame = -1

Query: 757 DEKYSTRVKDAERCPAFSLVNKATGEAIKHSIGESHPVRLIPYKPDYLDESILWTESKDL 578
           D KYSTRVKD E  PA +LVNKATG+A+KHSIG+SHPVRL+ Y P+Y+DES+LWTES+D+
Sbjct: 156 DMKYSTRVKDEEGYPAMALVNKATGDALKHSIGQSHPVRLVRYNPEYMDESVLWTESRDV 215

Query: 577 GDSYRSVRMVNNIHLNMDAYHGDKNSGGVHDGTTIVLWQWNKGDN 443
           G  +R +RMVNNI+LN DA HGDK+ GGV DGTT+VLW+W +GDN
Sbjct: 216 GSGFRCIRMVNNIYLNFDALHGDKDHGGVRDGTTLVLWEWCEGDN 260

 Score = 20.8 bits (42), Expect(2) = 3e-40
 Identities = 6/8 (75%), Positives = 8/8 (100%)
 Frame = -2

Query: 441 QRWKIIPY 418
           QRWKI+P+
Sbjct: 261 QRWKIVPW 268

>pir||T03960 r40g2 protein - rice (fragment) gi|1658313|emb|CAA70174.1| osr40g2
           [Oryza sativa (indica cultivar-group)]
          Length = 343

 Score =  162 bits (411), Expect = 4e-39
 Identities = 69/105 (65%), Positives = 89/105 (84%)
 Frame = -1

Query: 757 DEKYSTRVKDAERCPAFSLVNKATGEAIKHSIGESHPVRLIPYKPDYLDESILWTESKDL 578
           D ++ST+VKD E  PAF+LVNKATG A+KHS+G+SHPV+L+P+ P+Y D S+LWTESKD+
Sbjct: 58  DMRFSTKVKDGEGMPAFALVNKATGLAVKHSLGQSHPVKLVPFNPEYEDASVLWTESKDV 117

Query: 577 GDSYRSVRMVNNIHLNMDAYHGDKNSGGVHDGTTIVLWQWNKGDN 443
           G  +R +RMVNN  LN+DA+HGDK+ GGV DGTT+VLW+W KGDN
Sbjct: 118 GKGFRCIRMVNNTRLNLDAFHGDKDHGGVRDGTTVVLWEWCKGDN 162

 Score =  157 bits (397), Expect = 2e-37
 Identities = 66/105 (62%), Positives = 86/105 (81%)
 Frame = -1

Query: 757 DEKYSTRVKDAERCPAFSLVNKATGEAIKHSIGESHPVRLIPYKPDYLDESILWTESKDL 578
           D ++S +++D E  PAF+LVNK TGEAIKHS G+ HPV+L+PY P+Y DES+LW ESK +
Sbjct: 231 DMRHSNKIRDEEGYPAFALVNKVTGEAIKHSTGQGHPVKLVPYNPEYQDESVLWRESKHV 290

Query: 577 GDSYRSVRMVNNIHLNMDAYHGDKNSGGVHDGTTIVLWQWNKGDN 443
           G  +R +RMVNNI+LN DA+HGDK+ GG+HDGT IVLW+W +GDN
Sbjct: 291 GKGFRCIRMVNNIYLNFDAFHGDKDHGGIHDGTEIVLWKWCEGDN 335

>pir||T03962 r40g3 protein - rice gi|1658315|emb|CAA70175.1| osr40g3 [Oryza
           sativa (indica cultivar-group)]
          Length = 204

 Score =  162 bits (411), Expect = 4e-39
 Identities = 70/105 (66%), Positives = 90/105 (85%)
 Frame = -1

Query: 757 DEKYSTRVKDAERCPAFSLVNKATGEAIKHSIGESHPVRLIPYKPDYLDESILWTESKDL 578
           D ++ST +KD E  PAF+LVNKATG+AIKHS+G+SHPVRL+PY P+ +DES+LWTES+D+
Sbjct: 91  DMRWSTSIKDEEGYPAFALVNKATGQAIKHSLGQSHPVRLVPYNPEVMDESVLWTESRDV 150

Query: 577 GDSYRSVRMVNNIHLNMDAYHGDKNSGGVHDGTTIVLWQWNKGDN 443
           G+ +R +RMVNNI+LN DA+HGDK  GGV DGT IVLW+W +GDN
Sbjct: 151 GNGFRCIRMVNNIYLNFDAFHGDKYHGGVRDGTDIVLWKWCEGDN 195

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 664,935,863
Number of Sequences: 1393205
Number of extensions: 14474868
Number of successful extensions: 30650
Number of sequences better than 10.0: 22
Number of HSP's better than 10.0 without gapping: 29137
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 30515
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 36877108757
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
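
Note on the effective lengths: the figures in the statistics block above are consistent with each other under simple arithmetic. The sketch below checks this, assuming the usual BLAST edge correction in which one effective HSP length is subtracted per database sequence. The effective query length it derives is inferred from the reported numbers and is not printed in the report itself.

```python
# Values copied from the BLASTX statistics block above.
db_letters = 448_689_247
db_sequences = 1_393_205
effective_hsp_length = 120
reported_effective_db_length = 281_504_647
reported_search_space = 36_877_108_757

# Assumed edge correction: each database sequence loses one effective HSP length.
effective_db_length = db_letters - db_sequences * effective_hsp_length

# Effective query length implied by the reported search space (an inference).
implied_query_length, remainder = divmod(reported_search_space, effective_db_length)

print(effective_db_length == reported_effective_db_length)
print(implied_query_length, remainder)
```

Under this assumption the computed effective database length equals the reported 281,504,647, and the reported search space divides evenly by it, giving an implied effective query length of 131 residues.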


EST assembly


no.  clone         accession  start  end
1    MR020f12_f    BP077542   1      480
2    MF029b11_f    BP029781   13     175
3    MF054h05_f    BP031163   13     517
4    MR092g08_f    BP083107   13     370
5    SPD015a04_f   BP045157   13     510
6    MF036c04_f    BP030195   13     499
7    MF037c07_f    BP030244   13     573
8    MF002c09_f    BP028325   15     566
9    SPD027c10_f   BP046130   15     453
10   SPD060d03_f   BP048765   18     466
11   MPD025g05_f   AV771729   20     119
12   MR095c02_f    BP083279   20     503
13   MFB082c12_f   BP039994   39     587
14   SPD082d11_f   BP050547   68     640
15   SPD049a12_f   BP047879   129    260
16   MF068c09_f    BP031914   140    583
17   MFB060g12_f   BP038386   142    458
18   GENf051c08    BP060513   153    654
19   MR012d05_f    BP076864   156    573
20   MFB037g10_f   BP036736   158    685
21   GENf020b01    BP059200   158    590
22   MR025g02_f    BP077935   163    548
23   SPD032d02_f   BP046533   166    646
24   MR041e03_f    BP079171   194    545
25   MFB061d02_f   BP038424   252    772




Lotus japonicus
Kazusa DNA Research Institute