KMC016124A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC016124A_C01 KMC016124A_c01
atggaaaacgcaattcaagttttacattggtgatggatgattctttcaaatgactaaccc
atctagcattgtatggactcACTGTGTTGAATTTGCTTAAAAGTTCCAAGCACAGATAAT
AGAGCACAAGGCACAATCAGAGTCTAGCCATAACCTTTTTCCTTGGGTGGGATATAGCCA
TAACTTTAAAACTGTTTCTTATTATATATCGAGTGGCGGTACTACAAACCATGTAATAAT
TTAAAATCAGCAGAGATCATATCCTCCACTGTGTTGTTTAAAGAAGTGATTCCGTGCTGC
GTAGAGGAGCTAATACTGAATCTGCGGGCTCATAGCTTGCTTTTTGGAACACCAATTGGT
TGATTTCTGGATCTCTTTTCTATTGCTCGCGATGCCTGCTTCACTGCCTCTTTTGGTAAA
TGTCGAGGCCGGCTCAGCCAAAGTCCTCCATCCACTATGAGAGTGTCGCCATTAATGTAT
TTTCCTGCATCTGATACCAGGAAGAGAGCAGCCATGGCAATATCCCACTTCTCCCCTTCT
TTATAAAGAGGCATGTAATCTTTGGATTnGTTATTTATTTCATCAGGAGCTAGTTTACTC
ATGCCAGGAGTGCCACTTATGGGACCCGGTGCGATCCCATTGACTCTAATATCATAGTCT
GTTCCCCATTCT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC016124A_C01 KMC016124A_c01
         (672 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB02424.1| oxidoreductase, short-chain dehydrogenase/reduct...   192  3e-48
ref|NP_187885.1| unknown protein; protein id: At3g12790.1 [Arabi...   192  3e-48
ref|NP_178765.1| unknown protein; protein id: At2g07640.1 [Arabi...   145  6e-34
pir||G86486 hypothetical protein F28J9.2 - Arabidopsis thaliana ...   137  2e-31
ref|NP_174871.1| unknown protein; protein id: At1g36580.1 [Arabi...   105  4e-22

>dbj|BAB02424.1| oxidoreductase, short-chain dehydrogenase/reductase family-like
           protein [Arabidopsis thaliana]
           gi|18252923|gb|AAL62388.1| unknown protein [Arabidopsis
           thaliana] gi|21389641|gb|AAM48019.1| unknown protein
           [Arabidopsis thaliana]
          Length = 298

 Score =  192 bits (489), Expect = 3e-48
 Identities = 87/113 (76%), Positives = 102/113 (89%)
 Frame = -2

Query: 671 EWGTDYDIRVNGIAPGPISGTPGMSKLAPDEINNXSKDYMPLYKEGEKWDIAMAALFLVS 492
           EWGTDYDIRVNGIAPGPI GTPGMSKL P+EI N +++YMPLYK GEKWDIAMAAL+L  
Sbjct: 186 EWGTDYDIRVNGIAPGPIGGTPGMSKLVPEEIENKTREYMPLYKVGEKWDIAMAALYLSC 245

Query: 491 DAGKYINGDTLIVDGGLWLSRPRHLPKEAVKQASRAIEKRSRNQPIGVPKSKL 333
           D+GKY++G T++VDGGLWLS+PRHLPKEAVKQ SRA+EKRSR +P+G+P SKL
Sbjct: 246 DSGKYVSGLTMVVDGGLWLSKPRHLPKEAVKQLSRAVEKRSRAKPVGLPTSKL 298

>ref|NP_187885.1| unknown protein; protein id: At3g12790.1 [Arabidopsis thaliana]
          Length = 172

 Score =  192 bits (489), Expect = 3e-48
 Identities = 87/113 (76%), Positives = 102/113 (89%)
 Frame = -2

Query: 671 EWGTDYDIRVNGIAPGPISGTPGMSKLAPDEINNXSKDYMPLYKEGEKWDIAMAALFLVS 492
           EWGTDYDIRVNGIAPGPI GTPGMSKL P+EI N +++YMPLYK GEKWDIAMAAL+L  
Sbjct: 60  EWGTDYDIRVNGIAPGPIGGTPGMSKLVPEEIENKTREYMPLYKVGEKWDIAMAALYLSC 119

Query: 491 DAGKYINGDTLIVDGGLWLSRPRHLPKEAVKQASRAIEKRSRNQPIGVPKSKL 333
           D+GKY++G T++VDGGLWLS+PRHLPKEAVKQ SRA+EKRSR +P+G+P SKL
Sbjct: 120 DSGKYVSGLTMVVDGGLWLSKPRHLPKEAVKQLSRAVEKRSRAKPVGLPTSKL 172

>ref|NP_178765.1| unknown protein; protein id: At2g07640.1 [Arabidopsis thaliana]
           gi|25411311|pir||B84487 hypothetical protein At2g07640
           [imported] - Arabidopsis thaliana
           gi|5001455|gb|AAD37022.1| unknown protein [Arabidopsis
           thaliana]
          Length = 156

 Score =  145 bits (365), Expect = 6e-34
 Identities = 66/88 (75%), Positives = 77/88 (87%)
 Frame = -2

Query: 671 EWGTDYDIRVNGIAPGPISGTPGMSKLAPDEINNXSKDYMPLYKEGEKWDIAMAALFLVS 492
           EWGTDYDIRVNGIA GPI GTPGMSKL P+EI N +++YMPLYK GEKWDIAMAAL+L  
Sbjct: 60  EWGTDYDIRVNGIATGPIGGTPGMSKLVPEEIENKTREYMPLYKLGEKWDIAMAALYLSC 119

Query: 491 DAGKYINGDTLIVDGGLWLSRPRHLPKE 408
           D+GKY++G T++VDGGL LS+PRHL KE
Sbjct: 120 DSGKYMSGLTMVVDGGLCLSKPRHLAKE 147

>pir||G86486 hypothetical protein F28J9.2 - Arabidopsis thaliana
           gi|6272372|gb|AAF06078.1|AC007918_2 Contains PF|00678
           Short chain dehydrogenase/reductase C-terminus domain.
           [Arabidopsis thaliana]
          Length = 186

 Score =  137 bits (344), Expect = 2e-31
 Identities = 67/102 (65%), Positives = 80/102 (77%)
 Frame = -2

Query: 671 EWGTDYDIRVNGIAPGPISGTPGMSKLAPDEINNXSKDYMPLYKEGEKWDIAMAALFLVS 492
           EWGTDYDIRVN IAPGPI        + P+EI N +++YMPLYK GEKWDIAMAAL+L  
Sbjct: 45  EWGTDYDIRVNRIAPGPIG-------VVPEEIENKTREYMPLYKLGEKWDIAMAALYLSC 97

Query: 491 DAGKYINGDTLIVDGGLWLSRPRHLPKEAVKQASRAIEKRSR 366
           D+GKY++G TL+VD  L LS+PRHL KEAVKQ SRA+ K+SR
Sbjct: 98  DSGKYVSGLTLVVDAELCLSKPRHLAKEAVKQLSRAVAKKSR 139

>ref|NP_174871.1| unknown protein; protein id: At1g36580.1 [Arabidopsis thaliana]
          Length = 237

 Score =  105 bits (263), Expect = 4e-22
 Identities = 57/102 (55%), Positives = 65/102 (62%)
 Frame = -2

Query: 671 EWGTDYDIRVNGIAPGPISGTPGMSKLAPDEINNXSKDYMPLYKEGEKWDIAMAALFLVS 492
           EWGTDYDIRVN IAPGPI                           GEKWDIAMAAL+L  
Sbjct: 31  EWGTDYDIRVNRIAPGPI---------------------------GEKWDIAMAALYLSC 63

Query: 491 DAGKYINGDTLIVDGGLWLSRPRHLPKEAVKQASRAIEKRSR 366
           D+GKY++G TL+VD  L LS+PRHL KEAVKQ SRA+ K+SR
Sbjct: 64  DSGKYVSGLTLVVDAELCLSKPRHLAKEAVKQLSRAVAKKSR 105

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 593,657,402
Number of Sequences: 1393205
Number of extensions: 12959389
Number of successful extensions: 34784
Number of sequences better than 10.0: 1376
Number of HSP's better than 10.0 without gapping: 33391
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 34535
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 29421376608
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFB048d11_f BP037488 1 473
2 MWM213b05_f AV768008 182 523
3 SPD040b01_f BP047156 187 672




Lotus japonicus
Kazusa DNA Research Institute