KMC008028A_c01

Fasta Sequence
>KMC008028A_C01 KMC008028A_c01
ttttttttttttttttaaagttaaaacatatagattAGACAAACAATAATCTTTATAGTT
GAATAAGCATCTAAACAATTGAAGAGGGCATTTGATTGTATATACAGTCAAGAAAAATAT
AGACTTAGATCCGAATCATTGAATGAAAAACCTCACAAGGACAACATGGAGTGAATCAAT
CTACTCTTCTCTCCCTATTGATATCATTTTTGGAACATTCTCGAACCTCTAGACTGGTAA
AGCTCCAACCTACTCTTGGACTCATGCAACAATGCCTCAAGCGTTTCATCTTGGGTCATG
TTAGGTCTCTTGGATGCCCATTCAAGCACTGCGAGCACCTCAGCCCGAGCAAGCTCCTCA
CTGTCAGCAACATGCAGAGCAATGTACGAGAGAAGAAACAATGCTGGAACTTGAACTGTG
TGCTCTCCAAGATACACCAGCTGGACCAGGTGTTTTGCACCACCTGCAGTTATAATCGCC
TTGGAGTGATCAATGTGGAGGTAGTTCTCGGTGCAGGCGAATTTCATGAGGGAGATTGTA
GCCTCTCTTGTCAACTCTGGCTCTCTTTCATCCAGAAGCCGAAGCAAGGGACCGATGATC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC008028A_C01 KMC008028A_c01
         (600 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_201421.1| putative protein; protein id: At5g66200.1 [Arab...   171  9e-42
ref|NP_195220.1| putative protein; protein id: At4g34940.1 [Arab...   170  1e-41
gb|AAK91880.1|AC091665_6 Unknown protein [Oryza sativa]               158  6e-38
ref|NP_195327.1| putative protein; protein id: At4g36030.1 [Arab...   156  2e-37
ref|NP_189292.1| unknown protein; protein id: At3g26600.1 [Arabi...    72  7e-12

>ref|NP_201421.1| putative protein; protein id: At5g66200.1 [Arabidopsis thaliana]
           gi|10177135|dbj|BAB10425.1|
           gene_id:K2A18.28~pir||T10240~strong similarity to
           unknown protein [Arabidopsis thaliana]
           gi|22531060|gb|AAM97034.1| putative protein [Arabidopsis
           thaliana] gi|23198098|gb|AAN15576.1| putative protein
           [Arabidopsis thaliana]
          Length = 651

 Score =  171 bits (432), Expect = 9e-42
 Identities = 88/127 (69%), Positives = 103/127 (80%)
 Frame = -2

Query: 599 IIGPLLRLLDEREPELTREATISLMKFACTENYLHIDHSKAIITAGGAKHLVQLVYLGEH 420
           +IGPL++LLDEREPE+T EA  +L KFACT NYLH DHS+ II AGG KHLVQL Y GE 
Sbjct: 520 MIGPLVKLLDEREPEVTGEAAAALTKFACTANYLHKDHSRGIIEAGGGKHLVQLAYFGEG 579

Query: 419 TVQVPALFLLSYIALHVADSEELARAEVLAVLEWASKRPNMTQDETLEALLHESKSRLEL 240
            VQ+PAL LL YIAL+V DSE+LA+ EVLAVLEWASK+  +TQ E+LEALL E+K  L+L
Sbjct: 580 GVQIPALELLCYIALNVPDSEQLAKDEVLAVLEWASKQSWVTQLESLEALLQEAKRGLDL 639

Query: 239 YQSRGSR 219
           YQ RGSR
Sbjct: 640 YQQRGSR 646

>ref|NP_195220.1| putative protein; protein id: At4g34940.1 [Arabidopsis thaliana]
           gi|7486861|pir||T10240 hypothetical protein T11I11.180 -
           Arabidopsis thaliana gi|5123711|emb|CAB45455.1| putative
           protein [Arabidopsis thaliana]
           gi|7270445|emb|CAB80211.1| putative protein [Arabidopsis
           thaliana]
          Length = 664

 Score =  170 bits (431), Expect = 1e-41
 Identities = 84/129 (65%), Positives = 106/129 (82%)
 Frame = -2

Query: 599 IIGPLLRLLDEREPELTREATISLMKFACTENYLHIDHSKAIITAGGAKHLVQLVYLGEH 420
           IIGPL++LLDERE E+  EA ++L+KF+CTEN+L  +HSKAII AGGAKHL+QLVY GE 
Sbjct: 535 IIGPLVKLLDEREAEIAMEAAVALIKFSCTENFLRDNHSKAIIAAGGAKHLIQLVYFGEQ 594

Query: 419 TVQVPALFLLSYIALHVADSEELARAEVLAVLEWASKRPNMTQDETLEALLHESKSRLEL 240
            VQVPAL LL YIAL+V DSE LA+ EVL VLEW++K+ ++ +  T++ +L E+KSRLEL
Sbjct: 595 MVQVPALMLLCYIALNVPDSETLAQEEVLVVLEWSTKQAHLVEAPTIDEILPEAKSRLEL 654

Query: 239 YQSRGSRMF 213
           YQSRGSR F
Sbjct: 655 YQSRGSRGF 663

>gb|AAK91880.1|AC091665_6 Unknown protein [Oryza sativa]
          Length = 666

 Score =  158 bits (399), Expect = 6e-38
 Identities = 79/127 (62%), Positives = 103/127 (80%)
 Frame = -2

Query: 599 IIGPLLRLLDEREPELTREATISLMKFACTENYLHIDHSKAIITAGGAKHLVQLVYLGEH 420
           +I PL+ LLDEREP + +EA ++L KFAC EN+LH++H KAI+ +GGA+HLVQLVYLG+ 
Sbjct: 535 VIAPLVELLDEREPPVIKEAVLALTKFACNENHLHVNHCKAIVDSGGARHLVQLVYLGDE 594

Query: 419 TVQVPALFLLSYIALHVADSEELARAEVLAVLEWASKRPNMTQDETLEALLHESKSRLEL 240
            VQ+ AL LL +IALHV +SEELA+A VLAVL WASK+ +M QD  ++ALL ++K RLEL
Sbjct: 595 -VQIEALILLCFIALHVPESEELAQAGVLAVLLWASKQAHMIQDMRVDALLPDAKGRLEL 653

Query: 239 YQSRGSR 219
           +QSR SR
Sbjct: 654 FQSRASR 660

>ref|NP_195327.1| putative protein; protein id: At4g36030.1 [Arabidopsis thaliana]
           gi|7487180|pir||T05495 hypothetical protein T19K4.160 -
           Arabidopsis thaliana gi|3036807|emb|CAA18497.1| putative
           protein [Arabidopsis thaliana]
           gi|7270555|emb|CAB81512.1| putative protein [Arabidopsis
           thaliana] gi|26449953|dbj|BAC42097.1| unknown protein
           [Arabidopsis thaliana] gi|28827220|gb|AAO50454.1|
           unknown protein [Arabidopsis thaliana]
          Length = 670

 Score =  156 bits (394), Expect = 2e-37
 Identities = 77/129 (59%), Positives = 103/129 (79%)
 Frame = -2

Query: 599 IIGPLLRLLDEREPELTREATISLMKFACTENYLHIDHSKAIITAGGAKHLVQLVYLGEH 420
           +I PL++LLD+ EP+L  E  I+L KFA  +N+L  +HS+ II AGG+K LVQL Y GE+
Sbjct: 540 MIVPLVKLLDDGEPDLAAEVAIALAKFATEDNFLGKEHSRTIIEAGGSKLLVQLAYFGEN 599

Query: 419 TVQVPALFLLSYIALHVADSEELARAEVLAVLEWASKRPNMTQDETLEALLHESKSRLEL 240
             Q+PA+ LLSY+A++V DSE+LA+ EVL VLEW+SK+ N+ +DE +EALL+E+KSRLEL
Sbjct: 600 GAQIPAMVLLSYVAMNVPDSEQLAKDEVLTVLEWSSKQANVLEDEDMEALLYEAKSRLEL 659

Query: 239 YQSRGSRMF 213
           YQSRGSR F
Sbjct: 660 YQSRGSRGF 668

>ref|NP_189292.1| unknown protein; protein id: At3g26600.1 [Arabidopsis thaliana]
           gi|1402879|emb|CAA66810.1| unknown [Arabidopsis
           thaliana] gi|1495247|emb|CAA66220.1| orf 05 [Arabidopsis
           thaliana] gi|9293939|dbj|BAB01842.1|
           emb|CAA66810.1~gene_id:MFE16.13~strong similarity to
           unknown protein [Arabidopsis thaliana]
          Length = 615

 Score = 71.6 bits (174), Expect = 7e-12
 Identities = 43/123 (34%), Positives = 71/123 (56%)
 Frame = -2

Query: 599 IIGPLLRLLDEREPELTREATISLMKFACTENYLHIDHSKAIITAGGAKHLVQLVYLGEH 420
           +I PL+  L     E+   A ISL KF C EN+L  +HSK II  G    L++L+   E 
Sbjct: 482 MIKPLVEKLGSSNQEVAITAVISLQKFVCPENFLCAEHSKNIIEYGAIPLLMKLIRNVEQ 541

Query: 419 TVQVPALFLLSYIALHVADSEELARAEVLAVLEWASKRPNMTQDETLEALLHESKSRLEL 240
            +Q+  L LL Y++++ ++ ++L +A+VL VLE A +   + Q+  L  L+ ++  +L L
Sbjct: 542 QMQLQCLALLCYLSVNASNHQQLEQAKVLTVLEGAERLAGL-QNMELRELVSKAIYQLSL 600

Query: 239 YQS 231
           Y +
Sbjct: 601 YNA 603

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 512,765,299
Number of Sequences: 1393205
Number of extensions: 11103279
Number of successful extensions: 32486
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 31105
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 32410
length of database: 448,689,247
effective HSP length: 117
effective length of database: 285,684,262
effective search space used: 23426109484
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
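
The bit scores and the effective database length reported above can be cross-checked from the printed parameters. The following is a minimal sketch, assuming the standard Karlin-Altschul conversion S' = (lambda * S - ln K) / ln 2 with the gapped Lambda and K values from this report, and assuming the effective database length is the database length minus one effective HSP length per sequence. The function names are hypothetical, and because Lambda and K are printed in rounded form, the computed bit scores are only expected to agree with the report to within display rounding.

```python
import math

# Gapped statistical parameters copied from the report (BLOSUM62, gap 11/1).
# These are the rounded values as printed, so results are approximate.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(db_letters, n_sequences, hsp_length):
    """Subtract one effective HSP length per database sequence."""
    return db_letters - n_sequences * hsp_length


# Raw scores of the five hits, in the order they are listed in the report.
raw_scores = [432, 431, 399, 394, 174]
bits = [bit_score(s) for s in raw_scores]

# Database size, sequence count and effective HSP length from the report.
eff_db = effective_db_length(448_689_247, 1_393_205, 117)

for s, b in zip(raw_scores, bits):
    print(f"raw score {s:>3} -> {b:6.1f} bits")
print("effective length of database:", eff_db)
```

Under these assumptions the computed effective database length equals the reported 285,684,262, and each computed bit score falls within one bit of the value printed for the corresponding hit.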


EST assemble image


No.  Clone         Accession  Position (start, end)
1    SPDL031f10_f  BP053953     1  514
2    MFBL046g12_f  BP043623    37  431
3    GNLf020f11    BP075941    42  602
4    SPD042h04_f   BP047383   105  444



Lotus japonicus
Kazusa DNA Research Institute