KMC001850A_c01

Fasta Sequence
>KMC001850A_C01 KMC001850A_c01
tggttattcccaattatattaaactaagtttgatGCATAAAATCAACTCCATATGGAACT
TGATATACATTATAGGCCTACAACAAGTACACAAAGCTAACATAACTGGTTAACATACAT
AAGCTACACATGACGTGAACATAGGCTAACATGGATAAACCATATTGAATTTTTTTCTTG
AAAAGGACATCAAGATCACTTTTTTTTTTCAGCCAAAAGCATGGGACATGTTTCGACAGC
TAGTGGATACTCATAATTGTCCATGAGCTTTTAACTAACTAATAACTAATAGATGATATT
ATTTACCCTTCCTCATCATAATCTTCTTCATCCTCTTCTATGTGCGTACCAAGGTTTGCA
AGCTGCACAGAGTCAATAGCCATCTTGTCCACATCTGATAGTGACCAACCCGGAATTTCT
TCATGCTGGCTCTCTGTTGGAGGTTCTGATATCTCCTGTTTAGCTGTTGTGCTTGCCTTT
TTGTCCTCGTTTTCGTTGGTGTTGGCATTCAGATCAAGATTATGGAACTCTGCGCTTTCA
GCAGGAACTGCAGCGTTCTCCTTTGGTAACTCCTTTGTCTCTGAACTATCATGTATTGTC
ATGTCTGTATTTGTGTCATGCTGGATGCCCTGCTGAACAGAAGCACATGGCTCAGGTTCA
ACTGCCTGATGGGGTGTCTCTCGCACAGctgttcggcctcgaccacgccctcttcctcta
cctcggcctctacccctaccagtactgccagtgtggctcactcgggct


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001850A_C01 KMC001850A_c01
         (768 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB03155.1| gene_id:MQC3.30~unknown protein [Arabidopsis tha...    85  1e-15
ref|NP_187854.1| unknown protein; protein id: At3g12480.1 [Arabi...    85  1e-15
ref|NP_197450.1| putative protein; protein id: At5g19490.1 [Arab...    46  7e-04
gb|AAL73487.1|AF464904_1 repressor protein [Oryza sativa]              45  0.001
pir||T03744 myoD protein inhibitor - trichina gi|3057072|gb|AAC3...    43  0.006

>dbj|BAB03155.1| gene_id:MQC3.30~unknown protein [Arabidopsis thaliana]
           gi|18087597|gb|AAL58929.1|AF462841_1 At3g12480/MQC3.32
           [Arabidopsis thaliana] gi|23505853|gb|AAN28786.1|
           At3g12480/MQC3.32 [Arabidopsis thaliana]
          Length = 293

 Score = 85.1 bits (209), Expect = 1e-15
 Identities = 62/160 (38%), Positives = 82/160 (50%), Gaps = 11/160 (6%)
 Frame = -2

Query: 752 TGSTGRGRGRGRGRGRGRGRTAVRETPHQAVEPEPCASVQQGIQHDTNTDMTIHDSSETK 573
           +G  GRGRGRGRGRG    + A RE  ++ +E E   S Q     + N  M   +SS  +
Sbjct: 140 SGRGGRGRGRGRGRGGRAAKAAEREGLNREMEVEAANSGQP--PPEDNVKMHASESSPQE 197

Query: 572 ELPK--ENAAVPAESAEFH--------NLDLNANTNE-NEDKKASTTAKQEISEPPTESQ 426
           +  K  +  A   E  + H        + DLNA + + NE K A  T     +   T+S 
Sbjct: 198 DEKKGIDGTAASNEDTKQHLQSPKEGIDFDLNAESLDLNETKLAPATGTTTTTTAATDS- 256

Query: 425 HEEIPGWSLSDVDKMAIDSVQLANLGTHIEEDEEDYDEEG 306
            EE  GW + D+ KM  D  QLA+LG  I+EDEEDYDEEG
Sbjct: 257 -EEYSGWPMMDISKM--DPAQLASLGKRIDEDEEDYDEEG 293

>ref|NP_187854.1| unknown protein; protein id: At3g12480.1 [Arabidopsis thaliana]
           gi|12321975|gb|AAG51032.1|AC069474_31 unknown protein;
           69004-67516 [Arabidopsis thaliana]
          Length = 297

 Score = 85.1 bits (209), Expect = 1e-15
 Identities = 62/160 (38%), Positives = 82/160 (50%), Gaps = 11/160 (6%)
 Frame = -2

Query: 752 TGSTGRGRGRGRGRGRGRGRTAVRETPHQAVEPEPCASVQQGIQHDTNTDMTIHDSSETK 573
           +G  GRGRGRGRGRG    + A RE  ++ +E E   S Q     + N  M   +SS  +
Sbjct: 144 SGRGGRGRGRGRGRGGRAAKAAEREGLNREMEVEAANSGQP--PPEDNVKMHASESSPQE 201

Query: 572 ELPK--ENAAVPAESAEFH--------NLDLNANTNE-NEDKKASTTAKQEISEPPTESQ 426
           +  K  +  A   E  + H        + DLNA + + NE K A  T     +   T+S 
Sbjct: 202 DEKKGIDGTAASNEDTKQHLQSPKEGIDFDLNAESLDLNETKLAPATGTTTTTTAATDS- 260

Query: 425 HEEIPGWSLSDVDKMAIDSVQLANLGTHIEEDEEDYDEEG 306
            EE  GW + D+ KM  D  QLA+LG  I+EDEEDYDEEG
Sbjct: 261 -EEYSGWPMMDISKM--DPAQLASLGKRIDEDEEDYDEEG 297

>ref|NP_197450.1| putative protein; protein id: At5g19490.1 [Arabidopsis thaliana]
          Length = 236

 Score = 45.8 bits (107), Expect = 7e-04
 Identities = 46/153 (30%), Positives = 63/153 (41%), Gaps = 2/153 (1%)
 Frame = -2

Query: 761 VSHTGSTGRGR-GRGRGRGRGRGRTAVRETPHQAVEPEPCASVQQGIQHDTNTDMTIHDS 585
           V HT S GRGR GRGRGR  GR  + +     + +E     S +             HD 
Sbjct: 98  VKHT-SCGRGRRGRGRGRSSGRTGSGLSLKFEEDLED---GSPESSRTPSPENGSLSHDD 153

Query: 584 SETKELPKENAAVPAES-AEFHNLDLNANTNENEDKKASTTAKQEISEPPTESQHEEIPG 408
           +  K++   N    + S  +  N DLN   +EN D                E+Q E  P 
Sbjct: 154 TSWKKVASHNNHHSSNSEVKVRNFDLNVELDENGDNATW-----------LETQLERSPD 202

Query: 407 WSLSDVDKMAIDSVQLANLGTHIEEDEEDYDEE 309
           + L ++++M ID             DEEDYDEE
Sbjct: 203 YPL-EINEMKIDPDDQQQASA---SDEEDYDEE 231

>gb|AAL73487.1|AF464904_1 repressor protein [Oryza sativa]
          Length = 258

 Score = 45.1 bits (105), Expect = 0.001
 Identities = 39/146 (26%), Positives = 57/146 (38%), Gaps = 3/146 (2%)
 Frame = -2

Query: 737 RGRGRGRGRGRGRGRTAVRETPHQAVEPEPCASVQQG--IQHDTNTDMTIHDSSETKELP 564
           RGRGRGRGRGRGR  T  +E  +   E E      QG  +  +     TIH +       
Sbjct: 135 RGRGRGRGRGRGRPPTKRKEVGYVQFEDESSMFADQGEALPGEETVPETIHGTESVPPST 194

Query: 563 KENAAVPAESAEFHNLDLNANTNENEDKKASTTAKQEISEPPTESQHEEIPGWSLSD-VD 387
              A  P+ +AE    +      +N+D +                     P W + D + 
Sbjct: 195 HPPAEAPS-AAEIPAPNPKVEEAKNDDHQ---------------------PDWPMPDAIG 232

Query: 386 KMAIDSVQLANLGTHIEEDEEDYDEE 309
            + +      +L   ++ED EDYD E
Sbjct: 233 NIGVGPSGFGHLTVQVDED-EDYDNE 257

>pir||T03744 myoD protein inhibitor - trichina gi|3057072|gb|AAC38989.1| TsJ5
           [Trichinella spiralis]
          Length = 675

 Score = 42.7 bits (99), Expect = 0.006
 Identities = 36/146 (24%), Positives = 70/146 (47%), Gaps = 2/146 (1%)
 Frame = -2

Query: 740 GRGRGRGRGRGRGRGRTAVRETPHQAVEPEPCASVQQGIQHDTN-TDMTIHDSSETKELP 564
           GRG    RG+ RG   T++++  H A   E  A +    +++++  D  + D  E +E+ 
Sbjct: 93  GRGISIPRGKSRGNLSTSLKKKVHFADPIEERAKILSQSENESDMDDFVLKDDLEDEEMF 152

Query: 563 KE-NAAVPAESAEFHNLDLNANTNENEDKKASTTAKQEISEPPTESQHEEIPGWSLSDVD 387
           +E N+    E +E      N ++ ++ED+K       E  E   +   +E      +D D
Sbjct: 153 EEGNSDEDIEESE------NKDSEDDEDEKFLNMIDDEAEEDDDDEDKDE------ADDD 200

Query: 386 KMAIDSVQLANLGTHIEEDEEDYDEE 309
            + +DS ++      I+ED++D D++
Sbjct: 201 DLLLDSDEM-----DIDEDDDDEDDD 221

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 664,868,172
Number of Sequences: 1393205
Number of extensions: 14733921
Number of successful extensions: 93311
Number of sequences better than 10.0: 377
Number of HSP's better than 10.0 without gapping: 57966
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 83753
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 37534933228
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
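
The statistics above are enough to approximately reproduce the reported scores and E-values. The sketch below uses the standard Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 and E = search_space * 2^(-S'), with the gapped lambda and K from this report. The translated query length of 255 residues is an assumption, inferred from the fact that 37534933228 / 280111442 = 134 = 255 - 121; the function names are hypothetical. Small differences from the reported values are to be expected, since the report rounds its scores and may apply corrections that are not modeled here.

```python
import math


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective search space: both lengths are reduced by the HSP length."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Values taken from the report; 255 is the assumed translated query length.
space = effective_search_space(255, 448_689_247, 1_393_205, 121)
top_bits = bit_score(209, 0.267, 0.0410)   # raw score of the top hit
top_e = expect(top_bits, space)

print(space)
print(round(top_bits, 1))
print(f"{top_e:.0e}")
```

Under these assumptions the search space matches the reported 37534933228 exactly, and the top hit comes out close to the reported 85.1 bits and 1e-15.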


EST assembly

No.  Clone          Accession  Start  End
1    SPD074c05_f    BP049892       1  522
2    MFB070b01_f    BP039064      35  601
3    GNf038g08      BP070177      39  432
4    MF022f05_f     BP029438      53  542
5    SPD094c10_f    BP051497      55  601
6    MF069d03_f     BP031962      66  221
7    MFB081h04_f    BP039960     114  555
8    MPDL050g02_f   AV779054     224  776
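
The clone positions above can be summarized as an overall span and a per-position read depth. The following is a minimal sketch, assuming the last two columns are 1-based inclusive start and end coordinates on the assembled contig; the helper names are hypothetical.

```python
# Rows copied from the clone table: number, clone, accession, start, end.
ROWS = """\
1 SPD074c05_f BP049892 1 522
2 MFB070b01_f BP039064 35 601
3 GNf038g08 BP070177 39 432
4 MF022f05_f BP029438 53 542
5 SPD094c10_f BP051497 55 601
6 MF069d03_f BP031962 66 221
7 MFB081h04_f BP039960 114 555
8 MPDL050g02_f AV779054 224 776
"""


def parse_clones(text):
    """Parse whitespace-separated rows into (clone, accession, start, end)."""
    clones = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines and any header row that does not start with a number.
        if len(fields) != 5 or not fields[0].isdigit():
            continue
        _, clone, accession, start, end = fields
        clones.append((clone, accession, int(start), int(end)))
    return clones


def depth_at(clones, pos):
    """Number of clones whose interval covers position `pos` (inclusive)."""
    return sum(1 for _, _, start, end in clones if start <= pos <= end)


clones = parse_clones(ROWS)
span = (min(c[2] for c in clones), max(c[3] for c in clones))
print(span, depth_at(clones, 100), depth_at(clones, 700))
```

Positions are used exactly as listed in the table; no attempt is made to reconcile them with the 768-letter query length reported by BLAST.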




Lotus japonicus
Kazusa DNA Research Institute