KMC004699A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC004699A_C01 KMC004699A_c01
cactacttttataagtaatTCTATTAATTAATCTTGATCTTATTTAACCATATACATTTC
CATTATAATCATTTTTAAGCAATACATCATTCATTCACATCATTCAAGATAGGTCTCCGT
GAAGAGGGACCTATCTCAAATGTTTATCTGAACTCATCTAACTTAAGATGGGTGAACATA
GAAATGGATGTAATAATCACTTATAATTAAGCCAATATTACATGTGAGCACAAGAACAAT
ATCTAATAAGTGACACGCACCCCTGGGGTACCTGAGAATTTCCCCAATGAGCTCTCCTTT
CCCAGAACCCACATACCTCCCTCGAATCTGACCATCTCTGATGAACAAAAACGTTGGCAC
CTGCACCACCTCCATGTCCCTCAAGAATTGCATACAGCTGTCGTTCTCATCCCCGTTCAT
CCGCGCAAACACCACCGTGTCCGCCATCGACCTCGACAGCTTAATCACCGTAGGGTAAAC
CTTAACGCAGGGGCCACAGTGCTTTAGTCCCACATCGAGCACCACGAGCTTGCTGTCAAC
CTTATGGTCCTCAATCAGCTTCTCCACATCCTCCCTGCTGTGCAGCTGCACCACCGCAGA
GTGGCTGTCACCGTAGTACAGAACATCTCCGACGAGCATGTCAGGGCCGATCGCATCCTC
TTCATGAATCTTCTCCCTGCTCTTGTAGAAGGTGAAGTGGGGAACCTTGTCCACTTTCTC
TCTTTTACAGAGCTCTTTTGTCTTGTCTGACTCGTCGCCCATTACGAGAAGGAACTCGAC
GTCGTTGCAGGTGCGGCTGAGCTCCACCAAGAAAGGGTAGATTTCGCTGCTCTCGGAACT
GTCGCTGAGGGCGTACTCCACCACTACGAGCTTGTCTTTGGCGGATAGGAGCGCAGAGTC
GAACTCTTCAATGCTGTGGACTCGCTGGACTCTTTCATCTCTTTCTGCTTTCTTGGTGCC
GGAAGAAGCTGCTGTGGCTTTTGTGATGTGAGAGTGAGATGCTCTGAGTTGAATTGGGGT
TTTATTTTTGGTGAGAGTAGAAGTGATAGATAAGGAACTGGAGCGAAGAAGGAGTTTGCG
GTGAGTGGAAGGAGAGAGAGGGGGTTTGCATATGAAATTGCTGGTGGTGATTGTGGCCAT
GGCGGGGTTGAAGGAACAAAG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC004699A_C01 KMC004699A_c01
         (1161 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

pir||T07367 thioredoxin-like protein CDSP32, chloroplast - potat...   400  e-110
ref|NP_177735.1| chloroplast drought-induced stress protein, put...   376  e-103
gb|AAM63182.1| chloroplast drought-induced stress protein, putat...   374  e-102
gb|AAF14217.1|AF107490_1 thioredoxin [Fasciola hepatica]               59  1e-07
emb|CAB96931.1| thioredoxin h [Triticum aestivum] gi|9502274|gb|...    59  1e-07

>pir||T07367 thioredoxin-like protein CDSP32, chloroplast - potato
            gi|2582822|emb|CAA71103.1| CDSP32 protein (Chloroplast
            Drought-induced Stress Protein of 32kDa) [Solanum
            tuberosum]
          Length = 296

 Score =  400 bits (1028), Expect = e-110
 Identities = 203/299 (67%), Positives = 235/299 (77%), Gaps = 8/299 (2%)
 Frame = -1

Query: 1140 MATITTSNFICKPP--------LSPSTHRKLLLRSSSLSITSTLTKNKTPIQLRASHSHI 985
            MAT+T  NF+ KP         +SPS +       S  SI   L  NK  +        I
Sbjct: 1    MATLT--NFLLKPSPNLASITKISPSLYSNFPFEKSKQSIFKNLKTNKPLL--------I 50

Query: 984  TKATAASSGTKKAERDERVQRVHSIEEFDSALLSAKDKLVVVEYALSDSSESSEIYPFLV 805
            TKATAA    KK  + ERVQ+V+S+EE D AL  AK++LVVVE+A  DS  S  IYPF+V
Sbjct: 51   TKATAAPDVEKKVAKSERVQKVNSMEELDEALKKAKNRLVVVEFAGKDSERSKNIYPFMV 110

Query: 804  ELSRTCNDVEFLLVMGDESDKTKELCKREKVDKVPHFTFYKSREKIHEEDAIGPDMLVGD 625
             LS+TCNDV+FLLV+GDE++KTK LC+REK+DKVPHF FYKS EKIHEE+ IGPD+L GD
Sbjct: 111  NLSKTCNDVDFLLVIGDETEKTKALCRREKIDKVPHFNFYKSMEKIHEEEGIGPDLLAGD 170

Query: 624  VLYYGDSHSAVVQLHSREDVEKLIEDHKVDSKLVVLDVGLKHCGPCVKVYPTVIKLSRSM 445
            VLYYGDSHS VVQLHSREDVEK+I+DHK+D KL+VLDVGLKHCGPCVKVYPTVIKLS+ M
Sbjct: 171  VLYYGDSHSEVVQLHSREDVEKVIQDHKIDKKLIVLDVGLKHCGPCVKVYPTVIKLSKQM 230

Query: 444  ADTVVFARMNGDENDSCMQFLRDMEVVQVPTFLFIRDGQIRGRYVGSGKGELIGEILRY 268
            ADTVVFARMNGDENDSCMQFL+DM+V++VPTFLFIRDG+I GRYVGSGKGELIGEILRY
Sbjct: 231  ADTVVFARMNGDENDSCMQFLKDMDVIEVPTFLFIRDGEICGRYVGSGKGELIGEILRY 289

>ref|NP_177735.1| chloroplast drought-induced stress protein, putative; protein id:
            At1g76080.1, supported by cDNA: 20321. [Arabidopsis
            thaliana] gi|25405001|pir||A96789 protein T23E18.2
            [imported] - Arabidopsis thaliana
            gi|6573731|gb|AAF17651.1|AC009978_27 T23E18.2
            [Arabidopsis thaliana] gi|14270247|emb|CAC39419.1|
            plastid thioredoxin [Arabidopsis thaliana]
          Length = 302

 Score =  376 bits (965), Expect = e-103
 Identities = 192/294 (65%), Positives = 232/294 (78%), Gaps = 7/294 (2%)
 Frame = -1

Query: 1128 TTSNFICKPPLSPSTHRKLLLRSSSLSI-----TSTLTKNKTPIQLRASHSHITKATAAS 964
            T +NF+ KP  +        + S+S  +     T+ L + K  +  R   +   KA AAS
Sbjct: 3    TVANFLAKPISTVVPRPSSAVASTSSFVFFNHKTNPLFRRKN-LPKRLFSAVKIKAGAAS 61

Query: 963  SGT--KKAERDERVQRVHSIEEFDSALLSAKDKLVVVEYALSDSSESSEIYPFLVELSRT 790
             G        DE+VQ++HS EEFD AL +AK KLVV E+A S S +S++IYPF+VELSRT
Sbjct: 62   PGKVGTPPANDEKVQKIHSGEEFDVALKNAKSKLVVAEFATSKSDQSNKIYPFMVELSRT 121

Query: 789  CNDVEFLLVMGDESDKTKELCKREKVDKVPHFTFYKSREKIHEEDAIGPDMLVGDVLYYG 610
            CNDV FLLVMGDESDKT+ELC+REK++KVPHF+FYKS EKIHEE+ I PD L+GDVLYYG
Sbjct: 122  CNDVVFLLVMGDESDKTRELCRREKIEKVPHFSFYKSMEKIHEEEGIEPDQLMGDVLYYG 181

Query: 609  DSHSAVVQLHSREDVEKLIEDHKVDSKLVVLDVGLKHCGPCVKVYPTVIKLSRSMADTVV 430
            D+HSAVVQLH R DVEKLI++++   KL+VLDVGLKHCGPCVKVYPTV+KLSRSM++TVV
Sbjct: 182  DNHSAVVQLHGRPDVEKLIDENRTGGKLIVLDVGLKHCGPCVKVYPTVLKLSRSMSETVV 241

Query: 429  FARMNGDENDSCMQFLRDMEVVQVPTFLFIRDGQIRGRYVGSGKGELIGEILRY 268
            FARMNGDENDSCM+FL+DM V++VPTFLFIRDG+IRGRYVGSGKGELIGEILRY
Sbjct: 242  FARMNGDENDSCMEFLKDMNVIEVPTFLFIRDGEIRGRYVGSGKGELIGEILRY 295

>gb|AAM63182.1| chloroplast drought-induced stress protein, putative [Arabidopsis
            thaliana]
          Length = 302

 Score =  374 bits (960), Expect = e-102
 Identities = 190/294 (64%), Positives = 231/294 (77%), Gaps = 7/294 (2%)
 Frame = -1

Query: 1128 TTSNFICKPPLSPSTHRKLLLRSSSLSI-----TSTLTKNKTPIQLRASHSHITKATAAS 964
            T +NF+ KP  +        + S+S  +     T+ L + K  +  R   +   KA AAS
Sbjct: 3    TVANFLAKPISTVVPRPSSAVASTSSFVFFNHKTNPLFRRKN-LPKRLFSAVKIKAGAAS 61

Query: 963  SGT--KKAERDERVQRVHSIEEFDSALLSAKDKLVVVEYALSDSSESSEIYPFLVELSRT 790
             G        DE+VQ++HS EEFD AL +AK +LVV E+A S S +S++IYPF+VELSRT
Sbjct: 62   PGKVGTPPANDEKVQKIHSGEEFDEALKNAKSRLVVAEFATSKSDQSNKIYPFMVELSRT 121

Query: 789  CNDVEFLLVMGDESDKTKELCKREKVDKVPHFTFYKSREKIHEEDAIGPDMLVGDVLYYG 610
            CNDV FLLVMGDESDKT+ELC+REK++KVPHF+FYKS EKIHEE+ I PD L+GDVLYYG
Sbjct: 122  CNDVVFLLVMGDESDKTRELCRREKIEKVPHFSFYKSMEKIHEEEGIEPDQLMGDVLYYG 181

Query: 609  DSHSAVVQLHSREDVEKLIEDHKVDSKLVVLDVGLKHCGPCVKVYPTVIKLSRSMADTVV 430
            D+HS VVQLH R DVEKLI++++   KL+VLDVGLKHCGPCVKVYPTV+KLSRSM++TVV
Sbjct: 182  DNHSTVVQLHGRPDVEKLIDENRTGGKLIVLDVGLKHCGPCVKVYPTVLKLSRSMSETVV 241

Query: 429  FARMNGDENDSCMQFLRDMEVVQVPTFLFIRDGQIRGRYVGSGKGELIGEILRY 268
            FARMNGDENDSCM+FL+DM V++VPTFLFIRDG+IRGRYVGSGKGELIGEILRY
Sbjct: 242  FARMNGDENDSCMEFLKDMNVIEVPTFLFIRDGEIRGRYVGSGKGELIGEILRY 295

>gb|AAF14217.1|AF107490_1 thioredoxin [Fasciola hepatica]
          Length = 104

 Score = 59.3 bits (142), Expect = 1e-07
 Identities = 35/106 (33%), Positives = 62/106 (58%)
 Frame = -1

Query: 585 LHSREDVEKLIEDHKVDSKLVVLDVGLKHCGPCVKVYPTVIKLSRSMADTVVFARMNGDE 406
           L +  D+EKLI ++K   +L+V+D   + CGPC  + P V  L++ + + V FA+++ D+
Sbjct: 4   LRTAADLEKLINENK--GRLIVVDFFAQWCGPCRNIAPKVEALAKEIPE-VEFAKVDVDQ 60

Query: 405 NDSCMQFLRDMEVVQVPTFLFIRDGQIRGRYVGSGKGELIGEILRY 268
           N+          V  +PTF+FI+DG+   R+ G+ + +L   I R+
Sbjct: 61  NEEAAA---KYSVTAMPTFVFIKDGKEVDRFSGANETKLRETITRH 103

>emb|CAB96931.1| thioredoxin h [Triticum aestivum] gi|9502274|gb|AAF88067.1|
           thioredoxin H [Triticum aestivum]
          Length = 125

 Score = 58.9 bits (141), Expect = 1e-07
 Identities = 35/106 (33%), Positives = 55/106 (51%)
 Frame = -1

Query: 594 VVQLHSREDVEKLIEDHKVDSKLVVLDVGLKHCGPCVKVYPTVIKLSRSMADTVVFARMN 415
           V+ +HS E     IE+     KLVV+D     CGPC  + P    L++      VF +++
Sbjct: 18  VISVHSLEQWTMQIEEANAAKKLVVIDFTASWCGPCRIMAPIFADLAKKF-PAAVFLKVD 76

Query: 414 GDENDSCMQFLRDMEVVQVPTFLFIRDGQIRGRYVGSGKGELIGEI 277
            DE  S  +      V  +PTFLF+++G ++ R VG+ K EL  ++
Sbjct: 77  VDELKSIAE---QFSVEAMPTFLFMKEGDVKDRVVGAIKEELTNKV 119

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,053,940,884
Number of Sequences: 1393205
Number of extensions: 25924978
Number of successful extensions: 120165
Number of sequences better than 10.0: 792
Number of HSP's better than 10.0 without gapping: 98701
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 116917
length of database: 448,689,247
effective HSP length: 125
effective length of database: 274,538,622
effective search space used: 71654580342
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPD014c11_f AV770948 1 546
2 MFB100d08_f BP041261 20 540
3 MPD044f08_f AV773004 20 483
4 MR081f08_f BP082250 37 424
5 MPD015e06_f AV771027 43 499
6 MPD076b09_f AV774974 43 499
7 MPD041a04_f AV772773 43 574
8 SPD100d09_f BP051981 43 631
9 MFB011g08_f BP034743 43 555
10 MWM235c05_f AV768313 43 551
11 SPD019e05_f BP045499 43 522
12 MPD019a03_f AV771281 43 591
13 MPD068f02_f AV774531 44 528
14 MPD018e03_f AV771245 44 592
15 MPD086e08_f AV775653 44 500
16 SPD065a04_f BP049151 64 477
17 SPD019g12_f BP045525 70 366
18 MPD097e10_f AV776355 82 268
19 MFB075c01_f BP039446 92 640
20 SPD024h02_f BP045926 107 544
21 SPD086c02_f BP050849 107 659
22 MF077d04_f BP032374 111 624
23 MWM005f04_f AV764730 184 541
24 SPD089b11_f BP051091 196 724
25 MFB004a04_f BP034153 623 1212
26 MWM138c10_f AV766889 639 1178




Lotus japonicus
Kazusa DNA Research Institute