KMC002091A_c01

Fasta Sequence
>KMC002091A_C01 KMC002091A_c01
gCCTAATATATAGATATGTTTATTAAAAAAAAAAGAAAAAGTAAAGTAAAACCGGTGGTA
ATTTCGTCTACCACAAAATTTAGTAATAATTTACACTAGAATCTCTTCTCACTTAGGTAG
AAGAATGCTGTTGCTGCTGCTGAATATTCATCAACTCAAGCTCCTGGTGGCGAAGTAAAA
GAATCATTCTTTCAATTTCCAACCTGCTCCGTTCATTTTCAAGCTGGTGTCTCTCCATTT
CCCTCTCCTTCTGGCTGCTAAACCTTGCCCATTTCAACCTTGGCTTTTCCAGCTCAAACG
GCCTCGGCTCTATAACTCACTTGTTGCTCATCCAACTGCACCATCCTCTTCTTCATCCAC
TGCTTTTTCTCCCAAGCACTCTTCCCCCCATCTTGCAACACACCACTCACCTCACTACTC
AACTGTnGCATCACCTGTGATGGCATAACCCCACTTTTCCTTGCCCTCTTCCTCACAACA
TCATTGTTCCAATTC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC002091A_C01 KMC002091A_c01
         (495 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM13849.2| unknown protein [Arabidopsis thaliana]                  69  1e-20
ref|NP_187615.1| unknown protein; protein id: At3g10040.1 [Arabi...    69  1e-20
gb|AAK69274.1| unknown [Glycine max]                                   99  3e-20
dbj|BAB62605.1| B1153F04.12 [Oryza sativa (japonica cultivar-gro...    60  3e-13
ref|NP_564136.1| expressed protein; protein id: At1g21200.1, sup...    54  7e-10

>gb|AAM13849.2| unknown protein [Arabidopsis thaliana]
          Length = 431

 Score = 68.6 bits (166), Expect(2) = 1e-20
 Identities = 31/52 (59%), Positives = 42/52 (80%)
 Frame = -3

Query: 295 ELEKPRLKWARFSSQKEREMERHQLENERSRLEIERMILLLRHQELELMNIQ 140
           E+EK R+KW R+ S+KEREME+ +L+N+R RLE ERMIL+LR  E+EL  +Q
Sbjct: 367 EMEKQRVKWMRYRSKKEREMEKAKLDNQRRRLETERMILMLRRSEIELNELQ 418
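
The negative frame and the descending query coordinates (295 down to 140) indicate that this HSP lies on the reverse-complement strand of the contig. As a minimal sketch, assuming the standard genetic code, the query line can be re-derived from the FASTA sequence above. The helper names `revcomp` and `translate` are illustrative and are not part of the BLAST output.

```python
# Contig sequence copied from the FASTA record above (495 nt, one ambiguous 'n').
SEQ = (
    "gCCTAATATATAGATATGTTTATTAAAAAAAAAAGAAAAAGTAAAGTAAAACCGGTGGTA"
    "ATTTCGTCTACCACAAAATTTAGTAATAATTTACACTAGAATCTCTTCTCACTTAGGTAG"
    "AAGAATGCTGTTGCTGCTGCTGAATATTCATCAACTCAAGCTCCTGGTGGCGAAGTAAAA"
    "GAATCATTCTTTCAATTTCCAACCTGCTCCGTTCATTTTCAAGCTGGTGTCTCTCCATTT"
    "CCCTCTCCTTCTGGCTGCTAAACCTTGCCCATTTCAACCTTGGCTTTTCCAGCTCAAACG"
    "GCCTCGGCTCTATAACTCACTTGTTGCTCATCCAACTGCACCATCCTCTTCTTCATCCAC"
    "TGCTTTTTCTCCCAAGCACTCTTCCCCCCATCTTGCAACACACCACTCACCTCACTACTC"
    "AACTGTnGCATCACCTGTGATGGCATAACCCCACTTTTCCTTGCCCTCTTCCTCACAACA"
    "TCATTGTTCCAATTC"
).upper()

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def revcomp(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGTN", "TGCAN"))[::-1]


def translate(dna):
    """Translate codon by codon; codons with an ambiguous base become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


# HSP query coordinates are 1-based and inclusive.
hsp_low, hsp_high = 140, 295
peptide = translate(revcomp(SEQ[hsp_low - 1:hsp_high]))

# Minus frames are counted from the 3' end of the query.
frame = -((len(SEQ) - hsp_high) % 3 + 1)

print(frame, peptide)
```

The printed peptide can be compared residue by residue with the Query line of the alignment above; a stop codon would appear as `*`, as it does in the frame -2 alignments.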

 Score = 52.0 bits (123), Expect(2) = 1e-20
 Identities = 20/56 (35%), Positives = 39/56 (68%)
 Frame = -2

Query: 440 SQVMXQLSSEVSGVLQDGGKSAWEKKQWMKKRMVQLDEQQVSYRAEAV*AGKAKVE 273
           S  + +L  E + V++D GKS WEKK+W++++M++++E+++ Y  E V   K +V+
Sbjct: 319 STAVKRLREEAASVVEDVGKSVWEKKEWIRRKMLEIEEKKIGYEWEGVEMEKQRVK 374
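
The Identities and Positives counts reported for the two HSPs above can be recovered from the middle line of each alignment, where a letter marks an identical residue and `+` marks a substitution with a positive BLOSUM62 score. The following is a small sketch under that reading of the midline; the dictionary keys are labels chosen here for illustration.

```python
# Midlines copied from the two HSPs of the first Arabidopsis hit.
midlines = {
    "frame -3": "E+EK R+KW R+ S+KEREME+ +L+N+R RLE ERMIL+LR  E+EL  +Q",
    "frame -2": "S  + +L  E + V++D GKS WEKK+W++++M++++E+++ Y  E V   K +V+",
}

counts = {}
for label, midline in midlines.items():
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    positives = identities + midline.count("+")       # identities also count as positives
    counts[label] = (identities, positives)
    print(label, "identities:", identities, "positives:", positives)
```

Whitespace in the midline does not affect the counts, so the sketch is insensitive to how the alignment was copied.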

>ref|NP_187615.1| unknown protein; protein id: At3g10040.1 [Arabidopsis thaliana]
           gi|6143872|gb|AAF04419.1|AC010927_12 unknown protein
           [Arabidopsis thaliana]
          Length = 418

 Score = 68.6 bits (166), Expect(2) = 1e-20
 Identities = 31/52 (59%), Positives = 42/52 (80%)
 Frame = -3

Query: 295 ELEKPRLKWARFSSQKEREMERHQLENERSRLEIERMILLLRHQELELMNIQ 140
           E+EK R+KW R+ S+KEREME+ +L+N+R RLE ERMIL+LR  E+EL  +Q
Sbjct: 354 EMEKQRVKWMRYRSKKEREMEKAKLDNQRRRLETERMILMLRRSEIELNELQ 405

 Score = 52.0 bits (123), Expect(2) = 1e-20
 Identities = 20/56 (35%), Positives = 39/56 (68%)
 Frame = -2

Query: 440 SQVMXQLSSEVSGVLQDGGKSAWEKKQWMKKRMVQLDEQQVSYRAEAV*AGKAKVE 273
           S  + +L  E + V++D GKS WEKK+W++++M++++E+++ Y  E V   K +V+
Sbjct: 306 STAVKRLREEAASVVEDVGKSVWEKKEWIRRKMLEIEEKKIGYEWEGVEMEKQRVK 361

>gb|AAK69274.1| unknown [Glycine max]
          Length = 408

 Score = 98.6 bits (244), Expect = 3e-20
 Identities = 53/98 (54%), Positives = 67/98 (68%), Gaps = 10/98 (10%)
 Frame = -3

Query: 394 KMGGRVLGRKSSG*RRGW--CSWMSNK*V--------IEPRPFELEKPRLKWARFSSQKE 245
           ++ G V G    G +  W    WM  + V         + + FELEK RLKWARFSS+KE
Sbjct: 291 QLSGEVSGVLQDGGKSAWEKKQWMKKRVVQLEEQQVSYQMQAFELEKQRLKWARFSSKKE 350

Query: 244 REMERHQLENERSRLEIERMILLLRHQELELMNIQQQQ 131
           REME+ +L+NER RLEIERM+LLLRH+ELEL+N+QQQQ
Sbjct: 351 REMEKDKLQNERRRLEIERMVLLLRHKELELVNVQQQQ 388

 Score = 95.9 bits (237), Expect = 2e-19
 Identities = 49/77 (63%), Positives = 60/77 (77%), Gaps = 3/77 (3%)
 Frame = -2

Query: 485 NDVVRKRARKSG---VMPSQVMXQLSSEVSGVLQDGGKSAWEKKQWMKKRMVQLDEQQVS 315
           NDV+R+RAR  G   V  SQ+M QLS EVSGVLQDGGKSAWEKKQWMKKR+VQL+EQQVS
Sbjct: 268 NDVMRRRARNKGGFGVSSSQMMQQLSGEVSGVLQDGGKSAWEKKQWMKKRVVQLEEQQVS 327

Query: 314 YRAEAV*AGKAKVEMGK 264
           Y+ +A    K +++  +
Sbjct: 328 YQMQAFELEKQRLKWAR 344

>dbj|BAB62605.1| B1153F04.12 [Oryza sativa (japonica cultivar-group)]
           gi|21104858|dbj|BAB93442.1| P0028G04.23 [Oryza sativa
           (japonica cultivar-group)]
          Length = 547

 Score = 60.1 bits (144), Expect(2) = 3e-13
 Identities = 27/52 (51%), Positives = 39/52 (74%)
 Frame = -3

Query: 310 EPRPFELEKPRLKWARFSSQKEREMERHQLENERSRLEIERMILLLRHQELE 155
           E R + LE+ RLKW RF + KER+MER +L N+R R++  RM+LLLR ++L+
Sbjct: 452 EVRAYHLERQRLKWERFRANKERDMERARLRNDRLRIDGRRMLLLLRQKDLD 503

 Score = 35.4 bits (80), Expect(2) = 3e-13
 Identities = 18/48 (37%), Positives = 31/48 (64%)
 Frame = -2

Query: 443 PSQVMXQLSSEVSGVLQDGGKSAWEKKQWMKKRMVQLDEQQVSYRAEA 300
           PS V  QL SE++  +  GG    + +QW+++R V+++EQQV++   A
Sbjct: 410 PSAVQ-QLQSELAAAVAGGGDPQ-QVRQWVRRRTVEVEEQQVAHEVRA 455

>ref|NP_564136.1| expressed protein; protein id: At1g21200.1, supported by cDNA:
           gi_15027986, supported by cDNA: gi_20259202 [Arabidopsis
           thaliana] gi|25372951|pir||C86345 hypothetical protein
           F16F4.11 - Arabidopsis thaliana
           gi|8920640|gb|AAF81362.1|AC036104_11 Contains weak
           similarity to DNA-binding protein (GT-1a) from Nicotiana
           tabacum gb|M93436. [Arabidopsis thaliana]
           gi|15027987|gb|AAK76524.1| unknown protein [Arabidopsis
           thaliana] gi|20259203|gb|AAM14317.1| unknown protein
           [Arabidopsis thaliana]
          Length = 443

 Score = 53.9 bits (128), Expect(2) = 7e-10
 Identities = 25/52 (48%), Positives = 38/52 (73%)
 Frame = -3

Query: 313 IEPRPFELEKPRLKWARFSSQKEREMERHQLENERSRLEIERMILLLRHQEL 158
           I+    ELEK R +W RFS ++++E+ER ++ENER +LE +RM L L+ +EL
Sbjct: 388 IQVELLELEKQRFRWQRFSKKRDQELERMRMENERMKLENDRMGLELKQREL 439

 Score = 30.4 bits (67), Expect(2) = 7e-10
 Identities = 11/30 (36%), Positives = 21/30 (69%)
 Frame = -2

Query: 392 DGGKSAWEKKQWMKKRMVQLDEQQVSYRAE 303
           + G++   +KQWM+ R +QL+EQ++  + E
Sbjct: 362 ESGRAGSVQKQWMESRTLQLEEQKLQIQVE 391

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 445,583,218
Number of Sequences: 1393205
Number of extensions: 9959005
Number of successful extensions: 38451
Number of sequences better than 10.0: 146
Number of HSP's better than 10.0 without gapping: 34313
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 37959
length of database: 448,689,247
effective HSP length: 114
effective length of database: 289,863,877
effective search space used: 14493193850
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
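
The statistics above are enough to cross-check some of the reported numbers. The sketch below assumes the usual relation between bit score and expectation for a single HSP, E = (effective search space) x 2^(-bit score). It is applied only to the two Glycine max HSPs, which are reported with a plain Expect value; hits marked Expect(2) combine several HSPs and are not expected to follow this simple formula. The function name `expect` is illustrative.

```python
# Values copied from the search statistics above.
db_letters = 448_689_247
db_sequences = 1_393_205
effective_hsp_length = 114
effective_search_space = 14_493_193_850

# Effective database length: total letters minus one HSP length per sequence.
effective_db_length = db_letters - db_sequences * effective_hsp_length
print("effective length of database:", effective_db_length)


def expect(bit_score, search_space=effective_search_space):
    """Expected number of chance HSPs scoring at least bit_score."""
    return search_space * 2.0 ** (-bit_score)


for bits in (98.6, 95.9):  # the two HSPs against gb|AAK69274.1|
    print(f"{bits} bits -> E = {expect(bits):.0e}")
```

Under these assumptions the computed effective database length agrees with the reported 289,863,877, and the two E-values round to the reported 3e-20 and 2e-19.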


EST assemble image


     clone       accession  position
1    GENf036b11  BP059874   1    521
2    GENf046a09  BP060280   2    385
3    GENf055e05  BP060697   43   421




Lotus japonicus
Kazusa DNA Research Institute