KMC004475A_c01

Fasta Sequence
>KMC004475A_C01 KMC004475A_c01
aagtccacaaatgAAAAGTATAATTATAAAGGAAAATCATACAGAAATATATCATAGCTT
GGTGTGTAAACAATTCGATTTTCCCAACCAAGAGAGACATTCATCTCCATTCACTGTCTT
AGCTAACCAGCTTTCAGAATTTAGATATTAAAAATACTTTAACATGACAGGCAATCTCCA
AGAAGCCTCTGCAATGGGACAAGGGACAGACTAGCTATATCATAGTGTGTTCAGCAACCA
ATTTAGCTAAAGCTAAAACTTAACAGACCCATGAATAAATGAGCACAATAGAAGATAGAA
TGTCCAGTAATCAGGACGCAGGATCTTCCAGGGGCTAAGTTGAACCCTCAAACCACTTTA
CAAGATAGCGAAAACAGCAAGTTTTTATGCTGTTTCCACGATGATCCCAATGGGGCCCTG
CAGTGAAGTTTGTGATAAGGTAAAAGAATTTGTATCAGCTACATCACATGGTATTCTAAT
TGGAGCTCCTCATAAAAAACGCTATAATGCATCCCAATAAGCCACCGGCAACAACCTGAA
GTGGAGTGTGACCAAGTGAATCGCGCAGAGGTCTGACAGTAGACAAAGGATGTTCTGGAG
GCAGCTCACACACAATTTGATTCAGCAATTCTGCTTGCCGACCTGCATGAAGTCTTACTC
CTGAGGCATCATACATAACAATACATGACAAGACGACAGCAATAGCAAAAGCCGGCGACC
CTGCTCCTTCTTGGAGACCTATAGCCAGTGCAAGAGCTGACACCGTTGCAGAATGTGATG
AAGGCATTCCGCCAGAATCAAGCATCCTCTTGGAATCCCATCTCTTTTCCTTATACCAGG
TGGTGAAGATCTTGAGGATCTGAGCGATAGCGAAGGCGAGGAAAGCTGAGAGAAGAGGAG
CGTTGAATGGAAGGGTGGAAGAGGAAGATGAAGGCGCTGAGCGCGTGCTTGCTGTAACAT
CAGCcattgttacacttcgtccatggaagccaaccatcactgcaacaacaacaacacaac
tgcactgaaaacagcttcacttcc


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004475A_C01 KMC004475A_c01
         (1044 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM91571.1| unknown protein [Arabidopsis thaliana] gi|2319827...   251  1e-65
dbj|BAB02356.1| gb|AAB61516.1~gene_id:MIL23.18~similar to unknow...   224  1e-57
ref|NP_188798.1| unknown protein; protein id: At3g21610.1 [Arabi...   223  4e-57
ref|NP_176927.1| F12A21.27; protein id: At1g67600.1, supported b...   201  2e-50
ref|NP_564215.1| expressed protein; protein id: At1g24350.1, sup...   200  3e-50

>gb|AAM91571.1| unknown protein [Arabidopsis thaliana] gi|23198278|gb|AAN15666.1|
           unknown protein [Arabidopsis thaliana]
          Length = 174

 Score =  251 bits (641), Expect = 1e-65
 Identities = 117/149 (78%), Positives = 135/149 (90%)
 Frame = -3

Query: 925 SSSTLPFNAPLLSAFLAFAIAQILKIFTTWYKEKRWDSKRMLDSGGMPSSHSATVSALAL 746
           S +  P N P+ SAFLAFA+AQ LK+FT WYKEKRWDSKRM+ SGGMPSSHSATV+ALA+
Sbjct: 26  SHNLFPHNLPIFSAFLAFALAQFLKVFTNWYKEKRWDSKRMISSGGMPSSHSATVTALAV 85

Query: 745 AIGLQEGAGSPAFAIAVVLSCIVMYDASGVRLHAGRQAELLNQIVCELPPEHPLSTVRPL 566
           AIG +EGAG+PAFAIAVVL+C+VMYDASGVRLHAGRQAELLNQIVCE PPEHPLSTVRPL
Sbjct: 86  AIGFEEGAGAPAFAIAVVLACVVMYDASGVRLHAGRQAELLNQIVCEFPPEHPLSTVRPL 145

Query: 565 RDSLGHTPLQVVAGGLLGCIIAFFMRSSN 479
           R+ LGHTP+QV AGG+LGC++A+ MRSS+
Sbjct: 146 RELLGHTPIQVAAGGILGCVVAYLMRSSS 174

>dbj|BAB02356.1| gb|AAB61516.1~gene_id:MIL23.18~similar to unknown protein
           [Arabidopsis thaliana]
          Length = 169

 Score =  224 bits (572), Expect = 1e-57
 Identities = 106/131 (80%), Positives = 119/131 (89%)
 Frame = -3

Query: 925 SSSTLPFNAPLLSAFLAFAIAQILKIFTTWYKEKRWDSKRMLDSGGMPSSHSATVSALAL 746
           S +  P N P+ SAFLAFA+AQ LK+FT WYKEKRWDSKRM+ SGGMPSSHSATV+ALA+
Sbjct: 22  SHNLFPHNLPIFSAFLAFALAQFLKVFTNWYKEKRWDSKRMISSGGMPSSHSATVTALAV 81

Query: 745 AIGLQEGAGSPAFAIAVVLSCIVMYDASGVRLHAGRQAELLNQIVCELPPEHPLSTVRPL 566
           AIG +EGAG+PAFAIAVVL+C+VMYDASGVRLHAGRQAELLNQIVCE PPEHPLSTVRPL
Sbjct: 82  AIGFEEGAGAPAFAIAVVLACVVMYDASGVRLHAGRQAELLNQIVCEFPPEHPLSTVRPL 141

Query: 565 RDSLGHTPLQV 533
           R+ LGHTP+QV
Sbjct: 142 RELLGHTPIQV 152

>ref|NP_188798.1| unknown protein; protein id: At3g21610.1 [Arabidopsis thaliana]
          Length = 194

 Score =  223 bits (568), Expect = 4e-57
 Identities = 105/130 (80%), Positives = 118/130 (90%)
 Frame = -3

Query: 925 SSSTLPFNAPLLSAFLAFAIAQILKIFTTWYKEKRWDSKRMLDSGGMPSSHSATVSALAL 746
           S +  P N P+ SAFLAFA+AQ LK+FT WYKEKRWDSKRM+ SGGMPSSHSATV+ALA+
Sbjct: 26  SHNLFPHNLPIFSAFLAFALAQFLKVFTNWYKEKRWDSKRMISSGGMPSSHSATVTALAV 85

Query: 745 AIGLQEGAGSPAFAIAVVLSCIVMYDASGVRLHAGRQAELLNQIVCELPPEHPLSTVRPL 566
           AIG +EGAG+PAFAIAVVL+C+VMYDASGVRLHAGRQAELLNQIVCE PPEHPLSTVRPL
Sbjct: 86  AIGFEEGAGAPAFAIAVVLACVVMYDASGVRLHAGRQAELLNQIVCEFPPEHPLSTVRPL 145

Query: 565 RDSLGHTPLQ 536
           R+ LGHTP+Q
Sbjct: 146 RELLGHTPIQ 155

>ref|NP_176927.1| F12A21.27; protein id: At1g67600.1, supported by cDNA: 28118.
           [Arabidopsis thaliana] gi|25303765|pir||E96699 protein
           F12A21.27 [imported] - Arabidopsis thaliana
           gi|11072018|gb|AAG28897.1|AC008113_13 F12A21.27
           [Arabidopsis thaliana] gi|21555489|gb|AAM63870.1|
           unknown [Arabidopsis thaliana]
          Length = 163

 Score =  201 bits (511), Expect = 2e-50
 Identities = 102/145 (70%), Positives = 119/145 (81%), Gaps = 2/145 (1%)
 Frame = -3

Query: 940 SAPSSSSSTLPF--NAPLLSAFLAFAIAQILKIFTTWYKEKRWDSKRMLDSGGMPSSHSA 767
           S  SSSS  +    N PL+SA LAF IAQ +K FT+WYKE+RWD KR++ SGGMPSSHSA
Sbjct: 4   SVASSSSHYISIFTNYPLISAVLAFTIAQFIKFFTSWYKERRWDLKRLVGSGGMPSSHSA 63

Query: 766 TVSALALAIGLQEGAGSPAFAIAVVLSCIVMYDASGVRLHAGRQAELLNQIVCELPPEHP 587
           TV+ALALA+GLQEG G   FAIA+VL+ IVMYDA+GVRLHAGRQAE+LNQIV ELP EHP
Sbjct: 64  TVTALALAVGLQEGFGGSHFAIALVLTTIVMYDATGVRLHAGRQAEVLNQIVYELPAEHP 123

Query: 586 LSTVRPLRDSLGHTPLQVVAGGLLG 512
           L+  RPLR+ LGHTP QV+AGG+LG
Sbjct: 124 LAETRPLRELLGHTPPQVIAGGMLG 148

>ref|NP_564215.1| expressed protein; protein id: At1g24350.1, supported by cDNA:
           27548. [Arabidopsis thaliana]
          Length = 168

 Score =  200 bits (509), Expect = 3e-50
 Identities = 99/156 (63%), Positives = 125/156 (79%)
 Frame = -3

Query: 967 MADVTASTRSAPSSSSSTLPFNAPLLSAFLAFAIAQILKIFTTWYKEKRWDSKRMLDSGG 788
           M D TA+  S+ S+   ++  N PL+SA  +F IAQ +K+FT+WY+E+RWD K+++ SGG
Sbjct: 1   MEDSTATATSSSSTHYFSIFTNYPLISAVTSFTIAQFIKLFTSWYRERRWDLKQLIGSGG 60

Query: 787 MPSSHSATVSALALAIGLQEGAGSPAFAIAVVLSCIVMYDASGVRLHAGRQAELLNQIVC 608
           MPSSHSATV+ALA+AIGLQEG G   FAIA++L+ +VMYDA+GVRLHAGRQAE+LNQIV 
Sbjct: 61  MPSSHSATVTALAVAIGLQEGFGGSHFAIALILASVVMYDATGVRLHAGRQAEVLNQIVY 120

Query: 607 ELPPEHPLSTVRPLRDSLGHTPLQVVAGGLLGCIIA 500
           ELP EHPL+  RPLR+ LGHTP QVVAGG+LG   A
Sbjct: 121 ELPAEHPLAESRPLRELLGHTPPQVVAGGMLGSATA 156

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 909,672,110
Number of Sequences: 1393205
Number of extensions: 19997165
Number of successful extensions: 57707
Number of sequences better than 10.0: 50
Number of HSP's better than 10.0 without gapping: 54123
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 57522
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 61532797421
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
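
The bit scores and E-values in the alignments above can be related to the gapped Lambda and K values and the effective search space listed in this statistics block. The following is a minimal sketch, assuming the standard Karlin-Altschul relations (bit score S' = (Lambda * S - ln K) / ln 2 and E = search space * 2^-S'). The function names are hypothetical and are not part of BLAST. Because the report prints rounded parameters, the results only approximate the printed values for the top hit (raw score 641, reported as 251 bits, Expect = 1e-65).

```python
import math

def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)

def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)

# Values copied from the report: gapped Lambda and K, and the
# "effective search space used" line.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 61532797421

# Top hit gb|AAM91571.1|: raw score 641 (the number in parentheses).
bits = bit_score(641, LAMBDA_GAPPED, K_GAPPED)
e_value = expect_value(bits, SEARCH_SPACE)
print(f"{bits:.1f} bits, E = {e_value:.1e}")
```

The same two functions can be applied to the other hits by substituting their raw scores (572, 568, 511, 509).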


EST assemble image


No.  Clone          Accession  Start   End
1    MR072b09_f     BP081519       1   112
2    MR044d11_f     BP079407      14   453
3    MWM216f09_f    AV768052      26   478
4    MR084b11_f     BP082447      28   382
5    MPD004f06_f    AV770274      82   386
6    MFBL039a06_f   BP043208      83   570
7    SPD024h03_f    BP045927     122   693
8    SPD073h02_f    BP049864     135   709
9    MF059b10_f     BP031397     139   622
10   SPD074b12_f    BP049889     523  1049




Lotus japonicus
Kazusa DNA Research Institute