KMC003735A_c01

Fasta Sequence
>KMC003735A_C01 KMC003735A_c01
atgtcaaatacgcagcgctgtttcagaaaatgtaGCAACGCTAGAAGTTCCTTTTACTTG
TAGCTGCATCATAAATTTTCTGTCTTCTAAACCAAGTTTAAGAAGGGGAAAAAGCTTCTT
GCTAAGAATCAATTAACAAGAATGAATATCTTAGAGATGATTAAATGAGTAGGAATGTAT
AAACTATAGAATCCAACTTTGAAACTATAGAAAGCCACTTCGACATAATTAGGCCCTCAT
TTCAGCACAGGAATATGCTTCTAGAGAAAGAATGTTCCTTGAAATGCTACTATGTTGTTG
TATTTTTTGTAGTTGAGGAATAGGCTTCTGGAGAAACAATGTTCCCTTAAATGCCATTAT
GTTGTTGTATTTTTGGTAGTTGTTTATCTTCTCTCTACCCAGCCTTGAGAACCCAAATTT
GGTGGTCCATAGCGACTTAGCTTCCCTCGTCGCTGGTAGGACAAAGTTTTTCACCTTTAA
ACATCCTAACATCCTTTCAATGCAAGAGAACAGGGACCGAAAGTAACCCTTTCTTTGGTA
ATCTTTACGAGTTGCCAGTAAGGGAAGCTCTGCTACTTCTTGCCCAAACACACGAAATAG
ACAAGCACAAACCACCATTGGGTTGATAGTGAGCACTGCACAATACATTCCGCGAAAATC
ATGATCCTTGACTTTCTTT


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003735A_C01 KMC003735A_c01
         (679 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_181288.1| unknown protein; protein id: At2g37520.1 [Arabi...   138  6e-32
ref|NP_180365.1| hypothetical protein; protein id: At2g27980.1 [...   138  6e-32
ref|NP_190936.1| putative protein; protein id: At3g53680.1 [Arab...   135  4e-31
ref|NP_181210.2| putative PHD-type zinc finger protein; protein ...   124  2e-27
pir||H84783 probable PHD-type zinc finger protein [imported] - A...   124  2e-27

>ref|NP_181288.1| unknown protein; protein id: At2g37520.1 [Arabidopsis thaliana]
            gi|7485433|pir||T02518 hypothetical protein At2g37520
            [imported] - Arabidopsis thaliana
            gi|3236235|gb|AAC23623.1| unknown protein [Arabidopsis
            thaliana] gi|20197471|gb|AAM15090.1| unknown protein
            [Arabidopsis thaliana]
          Length = 825

 Score =  138 bits (348), Expect = 6e-32
 Identities = 59/122 (48%), Positives = 88/122 (71%)
 Frame = -2

Query: 678  KKVKDHDFRGMYCAVLTINPMVVCACLFRVFGQEVAELPLLATRKDYQRKGYFRSLFSCI 499
            + +   +F GMYC VL +N +VV A L R+FGQEVAELP++AT ++YQ +GYF+ L++C+
Sbjct: 692  RNISGQEFGGMYCLVLIVNSLVVSAALLRIFGQEVAELPIVATSREYQGRGYFQGLYACV 751

Query: 498  ERMLGCLKVKNFVLPATREAKSLWTTKFGFSRLGREKINNYQKYNNIMAFKGTLFLQKPI 319
            E +L  L V+N VLPA  EA+S+WT KFGF+++  +++  YQK   +  FKGT  L+K +
Sbjct: 752  ENLLSSLNVENLVLPAAEEAESIWTKKFGFTKMSDQQLQEYQKEVQLTIFKGTSMLEKKV 811

Query: 318  PQ 313
            P+
Sbjct: 812  PK 813

>ref|NP_180365.1| hypothetical protein; protein id: At2g27980.1 [Arabidopsis thaliana]
            gi|25407937|pir||C84679 hypothetical protein At2g27980
            [imported] - Arabidopsis thaliana
            gi|4510418|gb|AAD21504.1| hypothetical protein
            [Arabidopsis thaliana]
          Length = 1008

 Score =  138 bits (348), Expect = 6e-32
 Identities = 63/121 (52%), Positives = 89/121 (73%)
 Frame = -2

Query: 678  KKVKDHDFRGMYCAVLTINPMVVCACLFRVFGQEVAELPLLATRKDYQRKGYFRSLFSCI 499
            ++ K  DF GMYC +L ++ ++V   +FRVFG E+AELPL+AT KD Q +GYF+ LF+CI
Sbjct: 874  RQTKAQDFSGMYCTMLAVDEVIVSVGIFRVFGSELAELPLVATSKDCQGQGYFQCLFACI 933

Query: 498  ERMLGCLKVKNFVLPATREAKSLWTTKFGFSRLGREKINNYQKYNNIMAFKGTLFLQKPI 319
            ER+LG L VK+ VLPA  EAKS+WT KFGF+++  E++  Y+K  ++M F GT  L+K +
Sbjct: 934  ERLLGFLNVKHIVLPAADEAKSIWTDKFGFTKMTDEEVKEYRKDYSVMIFHGTSMLRKSV 993

Query: 318  P 316
            P
Sbjct: 994  P 994

>ref|NP_190936.1| putative protein; protein id: At3g53680.1 [Arabidopsis thaliana]
            gi|11282417|pir||T45908 hypothetical protein F4P12.380 -
            Arabidopsis thaliana gi|6729519|emb|CAB67675.1| putative
            protein [Arabidopsis thaliana]
          Length = 839

 Score =  135 bits (341), Expect = 4e-31
 Identities = 57/125 (45%), Positives = 87/125 (69%)
 Frame = -2

Query: 678  KKVKDHDFRGMYCAVLTINPMVVCACLFRVFGQEVAELPLLATRKDYQRKGYFRSLFSCI 499
            + +   +F GMYC VL +N +VV A L R+FGQ+VAELP++AT ++YQ +GYF+ LF+C+
Sbjct: 710  RNISGQEFGGMYCLVLMVNSLVVSAALLRIFGQKVAELPIVATSREYQGRGYFQGLFACV 769

Query: 498  ERMLGCLKVKNFVLPATREAKSLWTTKFGFSRLGREKINNYQKYNNIMAFKGTLFLQKPI 319
            E +L  L V+N +LPA  EA+S+WT KFGF+++   ++  YQ+   +  FKGT  L+K +
Sbjct: 770  ENLLSSLNVENLLLPAAEEAESIWTNKFGFTKMTEHRLQRYQREVQLTIFKGTSMLEKKV 829

Query: 318  PQLQK 304
            P   +
Sbjct: 830  PSFSE 834

>ref|NP_181210.2| putative PHD-type zinc finger protein; protein id: At2g36720.1,
            supported by cDNA: gi_20260433 [Arabidopsis thaliana]
            gi|20260434|gb|AAM13115.1| putative PHD-type zinc finger
            protein [Arabidopsis thaliana]
          Length = 1007

 Score =  124 bits (310), Expect = 2e-27
 Identities = 64/129 (49%), Positives = 87/129 (66%), Gaps = 1/129 (0%)
 Frame = -2

Query: 678  KKVKDHDFRGMYCAVLTINPMVVCACLFRVFGQEVAELPLLATRKDYQRKGYFRSLFSCI 499
            K ++  D+ G+ CAVLT+N  VV A L RVFG+EVAELPL+ATR   + KGYF+ LFSCI
Sbjct: 853  KTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCI 912

Query: 498  ERMLGCLKVKNFVLPATREAKSLWTTKFGFSRLGREKINNYQKY-NNIMAFKGTLFLQKP 322
            E++L  L V++ V+PA  EA+ LW  KFGF +L  E+++ Y K    ++ FKG   LQKP
Sbjct: 913  EKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKP 972

Query: 321  IPQLQKIQQ 295
            +   Q I +
Sbjct: 973  VDSHQIIDK 981

>pir||H84783 probable PHD-type zinc finger protein [imported] - Arabidopsis
            thaliana gi|4415917|gb|AAD20148.1| putative PHD-type zinc
            finger protein [Arabidopsis thaliana]
          Length = 958

 Score =  124 bits (310), Expect = 2e-27
 Identities = 64/129 (49%), Positives = 87/129 (66%), Gaps = 1/129 (0%)
 Frame = -2

Query: 678  KKVKDHDFRGMYCAVLTINPMVVCACLFRVFGQEVAELPLLATRKDYQRKGYFRSLFSCI 499
            K ++  D+ G+ CAVLT+N  VV A L RVFG+EVAELPL+ATR   + KGYF+ LFSCI
Sbjct: 804  KTMQGQDYGGICCAVLTVNATVVSAGLLRVFGREVAELPLVATRMCSREKGYFQLLFSCI 863

Query: 498  ERMLGCLKVKNFVLPATREAKSLWTTKFGFSRLGREKINNYQKY-NNIMAFKGTLFLQKP 322
            E++L  L V++ V+PA  EA+ LW  KFGF +L  E+++ Y K    ++ FKG   LQKP
Sbjct: 864  EKLLSSLNVESIVVPAAEEAEPLWMNKFGFRKLAPEQLSKYIKICYQMVRFKGASMLQKP 923

Query: 321  IPQLQKIQQ 295
            +   Q I +
Sbjct: 924  VDSHQIIDK 932
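
Each alignment above is reported with Frame = -2 and with Query coordinates that run downward from 678. The following is a minimal sketch of how such a frame can be reproduced, assuming the usual convention that a negative frame -k means: reverse-complement the query, skip k-1 bases, then translate with the standard genetic code. The function name translate_frame is hypothetical and is not part of BLAST. The input is only the last 19 bases of the query (positions 661-679), which is sufficient because a negative frame is anchored at the 3' end.

```python
from itertools import product

# Standard genetic code, built in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate_frame(seq, frame):
    """Translate seq in a BLAST-style frame (+1..+3 or -1..-3).

    Assumed convention: negative frames are read from the reverse
    complement; abs(frame) - 1 leading bases are skipped.
    """
    seq = seq.upper()
    if frame < 0:
        seq = seq.translate(COMPLEMENT)[::-1]  # reverse complement
    offset = abs(frame) - 1
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    # Codons containing anything other than A/C/G/T are reported as X.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# Last line of the FASTA record above (query positions 661-679).
tail = "ATGATCCTTGACTTTCTTT"
print(translate_frame(tail, -2))  # KKVKDH
```

The printed residues match the start of the Query line at position 678 (KKVKDH...), which suggests the assumed frame convention agrees with this report.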

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 537,674,857
Number of Sequences: 1393205
Number of extensions: 11300948
Number of successful extensions: 24391
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 23721
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 24384
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 29987172312
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
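
The length figures in the summary above can be cross-checked with simple arithmetic. The sketch below assumes that the effective database length is the raw database length minus one effective HSP length per database sequence, and that the effective search space is the product of the effective database length and an effective query length. Both relations are assumptions about how this BLAST version derives its numbers; they are used here only because they reproduce the reported values exactly. The function name is hypothetical.

```python
def effective_db_length(db_letters, n_sequences, hsp_length):
    """Assumed relation: remove one effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length


# Values copied from the report above.
DB_LETTERS = 448_689_247
N_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 119
SEARCH_SPACE = 29_987_172_312

eff_db = effective_db_length(DB_LETTERS, N_SEQUENCES, EFFECTIVE_HSP_LENGTH)
print(eff_db)  # 282897852, the reported effective length of database

# If the search space is (effective query length) x (effective db length),
# the quotient is the effective query length in residues.
eff_query, remainder = divmod(SEARCH_SPACE, eff_db)
print(eff_query, remainder)  # 106 0
```

The quotient of 106 with no remainder is consistent with a translated query of roughly 679 / 3 residues shortened by the effective HSP length of 119, although the exact rounding BLAST applies is not stated in the report.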


EST assemble image


no.  clone          accession  position (start  end)
1    GNf066d02      BP072262     1  420
2    SPDL086e09_f   BP057397    35  572
3    MPDL053e10_f   AV779194   157  680




Lotus japonicus
Kazusa DNA Research Institute