KMC017909A_c01

Fasta Sequence
>KMC017909A_C01 KMC017909A_c01
aaggtgagaacatttcaggaacttcatgacatgtacccacaggaacgagctgagctgctt
attcaaacaggtggtttatcTTCTGCTGAAGTGCAAGACATTGAGGTGGTACTGGATATG
ATGCCTTCCATTACACTTGACGTAACTTGTGAGACTGAAGGTGAAGAGGGTATGCAAGAG
GGTGACATTGTGACTTTACATGCTTGGATAAATGTTAAGAGGGGTAATGGCCTGATCGGT
GCCCTTCCGCATGCCCCCTACTACCCATTTCACAAGGAAGAGAATTTCTGGTTTTTGCTT
GCGGATTCTGTTTCGAATAATGTGTGGTTTTTCCAGAAGGTTAGTTTCTTGGATGAAGCT
GCTGCTATAACTGCTGCATCTAAGGCAATTGAGGAATCTAAGGAGGGGTCAGGGGCAACT
GTGAAGGAGACCAGCAAGGCAGTTGCAGAAGCAGTTGAGAAGGTGAAGGCGGGGTCTAGA
TTGGTAATGGGCAAGTTCCAGGCCCCATCAGAGGGTAACTACAATTTGACTTGCTATTTA
TTGTGTGACTCTTGGTTGGGTTGTGACAGAAGGACAAATGTAAAGCTCAAAATTGTGAAA
CGGACTCGGGCTGGCACCAGGGGGGCTGTTTTGGCTGACGAAGGACCTATCATGGAGGAT
GGGGTTGAGGAGGACGAGGATAATGAGGATGAAGAGTACGATGATGACTATGAGAGTGAG
TACAGTGAAGATGAAGAAGATGATCAGAACTCAAAAAATAAGCATCAAGCTACCAATGGC
ACTGCGAAAAAACATGGTCAAGCTGCTGAAAGTTCAGGCTCGGATGAAGAATGACCAGTC
TTTTTTGACGTGATAAAACCATTATCTGATCTTTTTCCTCTCTTGAATGGCAGAAGAACT
ATCAATCGTAGAACTAGTCATCTATACTGTCCATTTCAGTTTATCAGTTGGATTGCATTG
ATACTTGGGACTTttgatctcatcttagttccgctgataccatgattacttgtcatctta
tctgataattttgaagtatggaaaatgagtata
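
The BLASTX alignments in the Nr search section report this contig in frame +1. As a minimal sketch (not part of the original record or of any Kazusa tooling), the Python below translates the first 60 nucleotides of the sequence with the standard genetic code, so the result can be compared with the start of the first Query line. The helper name `translate` and the use of "X" for unrecognised codons are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str, frame: int = 1) -> str:
    """Translate a DNA string in forward frame 1, 2 or 3 (hypothetical helper)."""
    seq = nucleotides.upper()[frame - 1:]
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks codons with non-ACGT letters
        for i in range(0, usable, 3)
    )


# First 60-base line of the contig, copied from the FASTA record above.
first_line = "aaggtgagaacatttcaggaacttcatgacatgtacccacaggaacgagctgagctgctt"
print(translate(first_line))  # prints KVRTFQELHDMYPQERAELL
```

The printed peptide matches the first 20 residues of the Query line in the alignments (KVRTFQELHDMYPQERAELL...), which is consistent with the reported Frame = +1.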


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC017909A_C01 KMC017909A_c01
         (1053 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_178112.1| putative DnaJ protein; protein id: At1g79940.1 ...   387  e-106
pir||F96830 hypothetical protein F18B13.2 [imported] - Arabidops...   387  e-106
gb|AAK92727.2| unknown protein [Arabidopsis thaliana]                 305  1e-81
pir||T04949 hypothetical protein F7J7.120 - Arabidopsis thaliana...   305  1e-81
ref|NP_567621.1| putative protein; protein id: At4g21180.1 [Arab...   305  1e-81

>ref|NP_178112.1| putative DnaJ protein; protein id: At1g79940.1 [Arabidopsis thaliana]
            gi|12324575|gb|AAG52236.1|AC011717_4 putative DnaJ
            protein; 34157-30943 [Arabidopsis thaliana]
          Length = 702

 Score =  387 bits (995), Expect = e-106
 Identities = 187/280 (66%), Positives = 234/280 (82%), Gaps = 3/280 (1%)
 Frame = +1

Query: 1    KVRTFQELHDMYPQERAELLIQTGGLSSAEVQDIEVVLDMMPSITLDVTCETEGEEGMQE 180
            KV++FQ+L +M  ++R+ELL Q  GLS+ +V+DIE VL+MMPSIT+D+TCETEGEEG+QE
Sbjct: 424  KVKSFQDLQEMRLEDRSELLTQVAGLSATDVEDIEKVLEMMPSITVDITCETEGEEGIQE 483

Query: 181  GDIVTLHAWINVKRGNGLIGALPHAPYYPFHKEENFWFLLADSVSNNVWFFQKVSFLDEA 360
            GDIVTL AW+ +KR NGL+GALPHAPY+PFHKEEN+W LLADSVSNNVWF QKVSFLDE 
Sbjct: 484  GDIVTLQAWVTLKRPNGLVGALPHAPYFPFHKEENYWVLLADSVSNNVWFSQKVSFLDEG 543

Query: 361  AAITAASKAIEESKEGSGATVKETSKAVAEAVEKVKAGSRLVMGKFQAPSEGNYNLTCYL 540
             AITAASKAI ES EGSGA VKET+ AV EA+EKVK GSRLVMGK QAP+EG YNLTC+ 
Sbjct: 544  GAITAASKAISESMEGSGAGVKETNDAVREAIEKVKGGSRLVMGKLQAPAEGTYNLTCFC 603

Query: 541  LCDSWLGCDRRTNVKLKIVKRTRAGTRGAVLADEGPIMEDGVEEDEDNEDEEYDDDYESE 720
            LCD+W+GCD++  +K+K++KRTRAGTRG V +DEG I E+G+EE+++ E+E+YDDDYESE
Sbjct: 604  LCDTWIGCDKKQALKVKVLKRTRAGTRGLV-SDEGAIAEEGMEEEDEIEEEDYDDDYESE 662

Query: 721  YSEDEED--DQNSKNKHQATNGTAK-KHGQAAESSGSDEE 831
            YSEDE++  D + K   +  NG+ K K   ++E SGS+EE
Sbjct: 663  YSEDEDEKKDMDEKRGSKKANGSVKQKKESSSEESGSEEE 702

>pir||F96830 hypothetical protein F18B13.2 [imported] - Arabidopsis thaliana
            gi|5902360|gb|AAD55462.1|AC009322_2 Hypothetical protein
            [Arabidopsis thaliana]
          Length = 719

 Score =  387 bits (995), Expect = e-106
 Identities = 187/280 (66%), Positives = 234/280 (82%), Gaps = 3/280 (1%)
 Frame = +1

Query: 1    KVRTFQELHDMYPQERAELLIQTGGLSSAEVQDIEVVLDMMPSITLDVTCETEGEEGMQE 180
            KV++FQ+L +M  ++R+ELL Q  GLS+ +V+DIE VL+MMPSIT+D+TCETEGEEG+QE
Sbjct: 441  KVKSFQDLQEMRLEDRSELLTQVAGLSATDVEDIEKVLEMMPSITVDITCETEGEEGIQE 500

Query: 181  GDIVTLHAWINVKRGNGLIGALPHAPYYPFHKEENFWFLLADSVSNNVWFFQKVSFLDEA 360
            GDIVTL AW+ +KR NGL+GALPHAPY+PFHKEEN+W LLADSVSNNVWF QKVSFLDE 
Sbjct: 501  GDIVTLQAWVTLKRPNGLVGALPHAPYFPFHKEENYWVLLADSVSNNVWFSQKVSFLDEG 560

Query: 361  AAITAASKAIEESKEGSGATVKETSKAVAEAVEKVKAGSRLVMGKFQAPSEGNYNLTCYL 540
             AITAASKAI ES EGSGA VKET+ AV EA+EKVK GSRLVMGK QAP+EG YNLTC+ 
Sbjct: 561  GAITAASKAISESMEGSGAGVKETNDAVREAIEKVKGGSRLVMGKLQAPAEGTYNLTCFC 620

Query: 541  LCDSWLGCDRRTNVKLKIVKRTRAGTRGAVLADEGPIMEDGVEEDEDNEDEEYDDDYESE 720
            LCD+W+GCD++  +K+K++KRTRAGTRG V +DEG I E+G+EE+++ E+E+YDDDYESE
Sbjct: 621  LCDTWIGCDKKQALKVKVLKRTRAGTRGLV-SDEGAIAEEGMEEEDEIEEEDYDDDYESE 679

Query: 721  YSEDEED--DQNSKNKHQATNGTAK-KHGQAAESSGSDEE 831
            YSEDE++  D + K   +  NG+ K K   ++E SGS+EE
Sbjct: 680  YSEDEDEKKDMDEKRGSKKANGSVKQKKESSSEESGSEEE 719

>gb|AAK92727.2| unknown protein [Arabidopsis thaliana]
          Length = 297

 Score =  305 bits (780), Expect = 1e-81
 Identities = 155/277 (55%), Positives = 204/277 (72%)
 Frame = +1

Query: 1   KVRTFQELHDMYPQERAELLIQTGGLSSAEVQDIEVVLDMMPSITLDVTCETEGEEGMQE 180
           +V++FQ+  ++   ER++LL +   LS  +VQDIE VL+M+PS+ ++VTC+TEGEEG+QE
Sbjct: 41  QVKSFQKFQELSLAERSKLLREVVSLSETDVQDIEKVLEMIPSLKINVTCKTEGEEGIQE 100

Query: 181 GDIVTLHAWINVKRGNGLIGALPHAPYYPFHKEENFWFLLADSVSNNVWFFQKVSFLDEA 360
           GDI+T+ AWI +KR NGLIGA+PH+PY+PFHKEENFW LLAD  SN+VWFFQKV F+DEA
Sbjct: 101 GDIMTVQAWITLKRPNGLIGAIPHSPYFPFHKEENFWVLLAD--SNHVWFFQKVKFMDEA 158

Query: 361 AAITAASKAIEESKEGSGATVKETSKAVAEAVEKVKAGSRLVMGKFQAPSEGNYNLTCYL 540
            AI AAS  I E+ E  GA+VKET+ AV EAVEKVK+GSRLVMG+  AP EG YNLTC+ 
Sbjct: 159 GAIAAASNTITETMEPLGASVKETNDAVKEAVEKVKSGSRLVMGRLLAPGEGTYNLTCFC 218

Query: 541 LCDSWLGCDRRTNVKLKIVKRTRAGTRGAVLADEGPIMEDGVEEDEDNEDEEYDDDYESE 720
           L D+W+GCD++T++K++++KRTR          EG   E+G+EE++D  +EE   DYESE
Sbjct: 219 LSDTWIGCDQKTSLKVEVLKRTR--------DVEGENAEEGLEEEDDEIEEE---DYESE 267

Query: 721 YSEDEEDDQNSKNKHQATNGTAKKHGQAAESSGSDEE 831
           YSEDEED +    K             ++E SGSDEE
Sbjct: 268 YSEDEEDKKRGSKK-------KVNKESSSEESGSDEE 297

>pir||T04949 hypothetical protein F7J7.120 - Arabidopsis thaliana
            gi|2911075|emb|CAA17537.1| putative protein [Arabidopsis
            thaliana] gi|7268915|emb|CAB79118.1| putative protein
            [Arabidopsis thaliana]
          Length = 648

 Score =  305 bits (780), Expect = 1e-81
 Identities = 155/277 (55%), Positives = 204/277 (72%)
 Frame = +1

Query: 1    KVRTFQELHDMYPQERAELLIQTGGLSSAEVQDIEVVLDMMPSITLDVTCETEGEEGMQE 180
            +V++FQ+  ++   ER++LL +   LS  +VQDIE VL+M+PS+ ++VTC+TEGEEG+QE
Sbjct: 392  QVKSFQKFQELSLAERSKLLREVVSLSETDVQDIEKVLEMIPSLKINVTCKTEGEEGIQE 451

Query: 181  GDIVTLHAWINVKRGNGLIGALPHAPYYPFHKEENFWFLLADSVSNNVWFFQKVSFLDEA 360
            GDI+T+ AWI +KR NGLIGA+PH+PY+PFHKEENFW LLAD  SN+VWFFQKV F+DEA
Sbjct: 452  GDIMTVQAWITLKRPNGLIGAIPHSPYFPFHKEENFWVLLAD--SNHVWFFQKVKFMDEA 509

Query: 361  AAITAASKAIEESKEGSGATVKETSKAVAEAVEKVKAGSRLVMGKFQAPSEGNYNLTCYL 540
             AI AAS  I E+ E  GA+VKET+ AV EAVEKVK+GSRLVMG+  AP EG YNLTC+ 
Sbjct: 510  GAIAAASNTITETMEPLGASVKETNDAVKEAVEKVKSGSRLVMGRLLAPGEGTYNLTCFC 569

Query: 541  LCDSWLGCDRRTNVKLKIVKRTRAGTRGAVLADEGPIMEDGVEEDEDNEDEEYDDDYESE 720
            L D+W+GCD++T++K++++KRTR          EG   E+G+EE++D  +EE   DYESE
Sbjct: 570  LSDTWIGCDQKTSLKVEVLKRTR--------DVEGENAEEGLEEEDDEIEEE---DYESE 618

Query: 721  YSEDEEDDQNSKNKHQATNGTAKKHGQAAESSGSDEE 831
            YSEDEED +    K             ++E SGSDEE
Sbjct: 619  YSEDEEDKKRGSKK-------KVNKESSSEESGSDEE 648

>ref|NP_567621.1| putative protein; protein id: At4g21180.1 [Arabidopsis thaliana]
          Length = 661

 Score =  305 bits (780), Expect = 1e-81
 Identities = 155/277 (55%), Positives = 204/277 (72%)
 Frame = +1

Query: 1    KVRTFQELHDMYPQERAELLIQTGGLSSAEVQDIEVVLDMMPSITLDVTCETEGEEGMQE 180
            +V++FQ+  ++   ER++LL +   LS  +VQDIE VL+M+PS+ ++VTC+TEGEEG+QE
Sbjct: 405  QVKSFQKFQELSLAERSKLLREVVSLSETDVQDIEKVLEMIPSLKINVTCKTEGEEGIQE 464

Query: 181  GDIVTLHAWINVKRGNGLIGALPHAPYYPFHKEENFWFLLADSVSNNVWFFQKVSFLDEA 360
            GDI+T+ AWI +KR NGLIGA+PH+PY+PFHKEENFW LLAD  SN+VWFFQKV F+DEA
Sbjct: 465  GDIMTVQAWITLKRPNGLIGAIPHSPYFPFHKEENFWVLLAD--SNHVWFFQKVKFMDEA 522

Query: 361  AAITAASKAIEESKEGSGATVKETSKAVAEAVEKVKAGSRLVMGKFQAPSEGNYNLTCYL 540
             AI AAS  I E+ E  GA+VKET+ AV EAVEKVK+GSRLVMG+  AP EG YNLTC+ 
Sbjct: 523  GAIAAASNTITETMEPLGASVKETNDAVKEAVEKVKSGSRLVMGRLLAPGEGTYNLTCFC 582

Query: 541  LCDSWLGCDRRTNVKLKIVKRTRAGTRGAVLADEGPIMEDGVEEDEDNEDEEYDDDYESE 720
            L D+W+GCD++T++K++++KRTR          EG   E+G+EE++D  +EE   DYESE
Sbjct: 583  LSDTWIGCDQKTSLKVEVLKRTR--------DVEGENAEEGLEEEDDEIEEE---DYESE 631

Query: 721  YSEDEEDDQNSKNKHQATNGTAKKHGQAAESSGSDEE 831
            YSEDEED +    K             ++E SGSDEE
Sbjct: 632  YSEDEEDKKRGSKK-------KVNKESSSEESGSDEE 661

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 916,366,727
Number of Sequences: 1393205
Number of extensions: 21468702
Number of successful extensions: 244155
Number of sequences better than 10.0: 2507
Number of HSP's better than 10.0 without gapping: 101974
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 173933
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 62360592902
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
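
The statistics above can be cross-checked with a little arithmetic. The sketch below assumes the usual relations, namely that the effective database length is the raw length minus one effective HSP length per sequence, and that the expect value is the effective search space multiplied by 2 raised to minus the bit score. The function name `expect_from_bits` is an assumption introduced here, and because the report rounds bit scores to whole numbers, the results only approximate the printed E values.

```python
import math

# Values copied from the BLASTX report above.
db_letters = 448_689_247
db_sequences = 1_393_205
effective_hsp_length = 124
effective_search_space = 62_360_592_902

# Effective database length: subtract one effective HSP length per sequence.
effective_db_length = db_letters - db_sequences * effective_hsp_length
print(effective_db_length)  # prints 275931827, as reported


def expect_from_bits(bit_score: float, search_space: int) -> float:
    """Approximate E value from a bit score: E = search_space * 2**(-bits)."""
    return search_space * math.pow(2.0, -bit_score)


for bits in (387, 305):
    print(bits, f"{expect_from_bits(bits, effective_search_space):.1e}")
```

For the two bit scores in the hit list, this gives values on the order of 1e-106 and 1e-81, in line with the E values shown for the corresponding alignments.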


EST assembly

no.  clone         accession  start   end
1    SPD047h05_f   BP047779       1   573
2    SPDL098d04_f  BP058155     468  1053
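
As a small worked example, the positions listed for the two EST clones can be used to check how they tile the contig. The sketch assumes the positions are 1-based and inclusive, which the record does not state explicitly. The helper name `overlap_length` is hypothetical.

```python
# (clone, start, end) copied from the clone table above.
# Assumption: coordinates are 1-based and inclusive.
clones = [
    ("SPD047h05_f", 1, 573),
    ("SPDL098d04_f", 468, 1053),
]


def overlap_length(a, b):
    """Number of contig positions shared by two clones (0 if they do not overlap)."""
    start = max(a[1], b[1])
    end = min(a[2], b[2])
    return max(0, end - start + 1)


contig_start = min(start for _, start, _ in clones)
contig_end = max(end for _, _, end in clones)
contig_length = contig_end - contig_start + 1

print(overlap_length(clones[0], clones[1]))  # prints 106
print(contig_length)                         # prints 1053
```

Under that assumption the two clones overlap over 106 positions (468 to 573), and together they span 1053 positions, the same length as the query reported by BLASTX.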




Lotus japonicus
Kazusa DNA Research Institute