KMC004002A_c01

Fasta Sequence
>KMC004002A_C01 KMC004002A_c01
ggaTCAATAGCTGTAAGCCTATGTAAGACAATTTTCAGTTGAAAGAAATCACATAACACA
TAAGTCAAACGAATTTAATTAGAGAAACAATTTTACATCCTCTTTTTTCCTACAAAATTG
GTTGGATTTTCTTAACTCAGGCATACGTAAGTTATACTCTCTATGTGCTTCCATGAGAAA
GGAAGAATGTATAATAATTGGTTATTACCAAGTTACGACAATGGTCCTCTTTGCTACCTC
GAGAAAATTACTAACTTGTAGTCAATACATCTGCTGACATGTAAGGCGATATGCATCCCT
GGACCTGCAATAAACAAATGGGAATTTGGATATAACCTTGTAGCAAGAGGTTGACTTTAT
CCAGAGAACTATGTGGCACAGGTGTAATCACATAGAGCACACCTTTGATTGTGTCAATAC
CCCTCACAATCCCAAGACCAAGACACCAAGGTAAATTTTCAGGTCCTTCAGAATCAACTG
CTAAACCAACAATGCTAGCATTCAAACTGTAGAACATCTCAGAGCTTGGGACCTCACGAT
GAAGATGTTGAATCTTTATGCTTGCAATGGGAACTTCATAAGGA


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004002A_C01 KMC004002A_c01
         (584 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAL49875.1| unknown protein [Arabidopsis thaliana] gi|2046598...   150  1e-35
pir||T50810 hypothetical protein T30N20_280 - Arabidopsis thalia...   150  1e-35
gb|AAH12439.1|AAH12439 Unknown (protein for IMAGE:3859950) [Homo...    75  6e-13
ref|NP_078930.1| hypothetical protein FLJ23323 [Homo sapiens] gi...    75  6e-13
ref|NP_083003.1| RIKEN cDNA 4632412I24 [Mus musculus] gi|1285250...    74  1e-12

>gb|AAL49875.1| unknown protein [Arabidopsis thaliana] gi|20465981|gb|AAM20212.1|
           unknown protein [Arabidopsis thaliana]
          Length = 368

 Score =  150 bits (379), Expect = 1e-35
 Identities = 71/110 (64%), Positives = 94/110 (84%)
 Frame = -2

Query: 583 PYEVPIASIKIQHLHREVPSSEMFYSLNASIVGLAVDSEGPENLPWCLGLGIVRGIDTIK 404
           PYEVPI+S+ I HLH ++PSSE++YSLNASIVGL + +E  E+LP C+GLGIVRGIDT +
Sbjct: 259 PYEVPISSLTINHLHCQIPSSEVYYSLNASIVGLGISTEVFEDLPSCVGLGIVRGIDTER 318

Query: 403 GVLYVITPVPHSSLDKVNLLLQGYIQIPICLLQVQGCISPYMSADVLTTS 254
           G+LYVITPVP + ++KV+LLLQGYIQ+P CLL+V+   SPY+SA+VL ++
Sbjct: 319 GILYVITPVPENLVEKVDLLLQGYIQLPTCLLEVKDYRSPYLSANVLAST 368

>pir||T50810 hypothetical protein T30N20_280 - Arabidopsis thaliana
           gi|8979735|emb|CAB96856.1| putative protein [Arabidopsis
           thaliana]
          Length = 380

 Score =  150 bits (379), Expect = 1e-35
 Identities = 71/110 (64%), Positives = 94/110 (84%)
 Frame = -2

Query: 583 PYEVPIASIKIQHLHREVPSSEMFYSLNASIVGLAVDSEGPENLPWCLGLGIVRGIDTIK 404
           PYEVPI+S+ I HLH ++PSSE++YSLNASIVGL + +E  E+LP C+GLGIVRGIDT +
Sbjct: 271 PYEVPISSLTINHLHCQIPSSEVYYSLNASIVGLGISTEVFEDLPSCVGLGIVRGIDTER 330

Query: 403 GVLYVITPVPHSSLDKVNLLLQGYIQIPICLLQVQGCISPYMSADVLTTS 254
           G+LYVITPVP + ++KV+LLLQGYIQ+P CLL+V+   SPY+SA+VL ++
Sbjct: 331 GILYVITPVPENLVEKVDLLLQGYIQLPTCLLEVKDYRSPYLSANVLAST 380

>gb|AAH12439.1|AAH12439 Unknown (protein for IMAGE:3859950) [Homo sapiens]
          Length = 259

 Score = 75.1 bits (183), Expect = 6e-13
 Identities = 47/120 (39%), Positives = 65/120 (54%), Gaps = 15/120 (12%)
 Frame = -2

Query: 583 PYEVPIASIKIQHLHREVPSSEMFYSLNASIVGLAV---DSEGPENLPW---------CL 440
           PY+VP  ++ ++  H +V  + + Y++NAS VGL     D  G  N P          CL
Sbjct: 104 PYQVPFNAVALRITHSDVAPTHILYAVNASWVGLCKIQDDVRGYTNGPILLAQTPICDCL 163

Query: 439 GLGIVRGIDTIKGVLYVITPVPHSSLDKVNLLLQGYIQIPICLLQVQGCIS---PYMSAD 269
           G GI RGID  K + +++TPVP   L  VN LL G I IP C+L+ Q  I    PY++ D
Sbjct: 164 GFGICRGIDMEKRLYHILTPVPPEELRTVNCLLVGAIAIPHCVLKCQRGIEGTVPYVTTD 223

>ref|NP_078930.1| hypothetical protein FLJ23323 [Homo sapiens]
           gi|10439969|dbj|BAB15611.1| unnamed protein product
           [Homo sapiens] gi|14349355|gb|AAH09257.1|AAH09257
           hypothetical protein FLJ23323 [Homo sapiens]
          Length = 332

 Score = 75.1 bits (183), Expect = 6e-13
 Identities = 47/120 (39%), Positives = 65/120 (54%), Gaps = 15/120 (12%)
 Frame = -2

Query: 583 PYEVPIASIKIQHLHREVPSSEMFYSLNASIVGLAV---DSEGPENLPW---------CL 440
           PY+VP  ++ ++  H +V  + + Y++NAS VGL     D  G  N P          CL
Sbjct: 177 PYQVPFNAVALRITHSDVAPTHILYAVNASWVGLCKIQDDVRGYTNGPILLAQTPICDCL 236

Query: 439 GLGIVRGIDTIKGVLYVITPVPHSSLDKVNLLLQGYIQIPICLLQVQGCIS---PYMSAD 269
           G GI RGID  K + +++TPVP   L  VN LL G I IP C+L+ Q  I    PY++ D
Sbjct: 237 GFGICRGIDMEKRLYHILTPVPPEELRTVNCLLVGAIAIPHCVLKCQRGIEGTVPYVTTD 296

>ref|NP_083003.1| RIKEN cDNA 4632412I24 [Mus musculus] gi|12852502|dbj|BAB29433.1|
           unnamed protein product [Mus musculus]
          Length = 671

 Score = 74.3 bits (181), Expect = 1e-12
 Identities = 45/107 (42%), Positives = 61/107 (56%), Gaps = 12/107 (11%)
 Frame = -2

Query: 583 PYEVPIASIKIQHLHREVPSSEMFYSLNASIVGLA--VD-----SEGPENLPW-----CL 440
           PY+VP +++ I+ LH +V  + + Y++NAS VGL   VD     + GP  L       CL
Sbjct: 564 PYQVPFSAVAIRVLHADVAPTHILYAVNASWVGLCRIVDDMKGYTRGPILLAQNPICDCL 623

Query: 439 GLGIVRGIDTIKGVLYVITPVPHSSLDKVNLLLQGYIQIPICLLQVQ 299
           G GI RGID  K   +++TP+P   L  VN LL G I IP C+ Q Q
Sbjct: 624 GFGICRGIDMDKRTYHILTPLPPEELKTVNCLLVGSISIPHCIFQNQ 670
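
Every alignment above is reported in Frame = -2 with the query numbered from 583 downwards: the protein is read from the reverse complement of the contig, starting one base in from its 3' end. The sketch below illustrates that reading with the last line of the FASTA record; the function names are illustrative and are not part of BLAST, and the standard genetic code (NCBI translation table 1) is assumed.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (case-insensitive)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq, frame):
    """Translate seq in a BLASTX-style frame (+1..+3 or -1..-3).

    Negative frames are read from the reverse complement; the frame's
    absolute value minus one is the number of leading bases skipped.
    Codons containing unexpected characters are written as 'X'.
    """
    strand = seq.upper() if frame > 0 else reverse_complement(seq)
    offset = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(offset, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record above (the 3' end of the 584-base contig).
TAIL = "GAAGATGTTGAATCTTTATGCTTGCAATGGGAACTTCATAAGGA"

print(translate_frame(TAIL, -2))
```

The printed peptide should correspond to the start of the Query line at position 583 (PYEVPIASIKIQHL...). Passing the full 584-base sequence instead of the tail would extend the same reading frame across the whole contig.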

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 491,639,369
Number of Sequences: 1393205
Number of extensions: 10555064
Number of successful extensions: 22094
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 21580
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 22085
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 21997688174
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
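
The bit scores in the alignments can be related to the raw scores in parentheses through the gapped Lambda and K values listed above, using the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2. The following is a minimal sketch of that arithmetic; because the report prints Lambda and K to only a few digits, the results are approximate, and the function name is illustrative.

```python
import math

# Gapped statistical parameters as printed in the report above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score: (lam*S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the Homo sapiens (183) and Mus musculus (181) hits.
for raw in (183, 181):
    print(raw, round(bit_score(raw), 1))
```

For the raw scores 183 and 181 this gives values close to the 75.1 and 74.3 bits shown in the report.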


EST assemble image


No.  Clone        Accession  Start  End
1    MFB012b01_f  BP034768       1  521
2    MFB012b02_f  BP034769       4  544
3    SPD012g02_f  BP044978      30  173
4    MR003h06_f   BP076193      55  273
5    MFB056g11_f  BP038086      64  586
6    GNf095a04    BP074372      66  480




Lotus japonicus
Kazusa DNA Research Institute