KMC014240A_c01

Fasta Sequence
>KMC014240A_C01 KMC014240A_c01
aaatcccatcagccaacgGAAAATTTTCATTGTTCTGGTTTGGTCAAATTGGCTTGAGAG
TGGTAATTCAATATATGCATGGAAGCAAGAGAAGGTATCAGTTCTGGGGTTACAGTGATA
GGAGCAGAAGCTCCATCAGCTTATCATGTGGCACCGAGGAGTGAAGCTCCGAACCAAGTT
CATGTCCCTGCACCAGCATCAGCAGGTACTGCTGCAGCTATGGTTGGCTCACCGGTGAGT
GTAGGGTTGGATGTTACAATTAAGAAGAAACGGGGTAGACCAAGGAAATATGGACCAGAT
GGGCCTGTCTCCATGGCATTGTCACCATTGCCAATCTCATCTTCAGTTCCGCCTTCTAAT
GACTATGCATCTGGGAAACGAGGGAAGCCACGCGGGATGGAATACAAGCAGTCGAAAGAG
ATGGGGTTGGATCATCATCTAGGTGACTTGAATGCATGCTCTGATGGTACAAGCTTTATG
CCGCATATCATCACTGTCAATGCTGGTGAGGACAT
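
BLASTX compares conceptual translations of this nucleotide sequence against the protein database. As a rough illustration, the sketch below translates part of reading frame +1 using the standard genetic code. Only the first 120 nucleotides of the record are embedded, and the helper name `translate` is a hypothetical choice made for this example, not part of the original record.

```python
from itertools import product

# First 120 nt of KMC014240A_c01, copied from the FASTA record above.
SEQ = (
    "aaatcccatcagccaacgGAAAATTTTCATTGTTCTGGTTTGGTCAAATTGGCTTGAGAG"
    "TGGTAATTCAATATATGCATGGAAGCAAGAGAAGGTATCAGTTCTGGGGTTACAGTGATA"
).upper()

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt, start):
    """Translate nt from 1-based position `start`; a trailing partial codon is dropped."""
    s = nt[start - 1:]
    usable = len(s) - len(s) % 3
    # Codons containing unexpected characters are reported as 'X'.
    return "".join(CODON_TABLE.get(s[i:i + 3], "X") for i in range(0, usable, 3))


print(translate(SEQ, 79))  # MEAREGISSGVTVI
```

Position 79 lies in frame +1 (78 is a multiple of 3), and the translated residues agree with the start of the Query lines that begin at position 79 in the Nr search alignments.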


Nr Search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC014240A_C01 KMC014240A_c01
         (515 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_192945.2| putative DNA-binding protein; protein id: At4g1...    99  3e-20
pir||T06615 hypothetical protein F16J13.150 - Arabidopsis thalia...    97  1e-19
ref|NP_194262.1| putative protein; protein id: At4g25320.1, supp...    92  4e-18
ref|NP_194008.1| putative DNA binding protein; protein id: At4g2...    91  6e-18
ref|NP_199972.1| putative protein; protein id: At5g51590.1 [Arab...    84  7e-16

>ref|NP_192945.2| putative DNA-binding protein; protein id: At4g12080.1, supported by
           cDNA: gi_17979484 [Arabidopsis thaliana]
           gi|17979485|gb|AAL50079.1| AT4g12080/F16J13_150
           [Arabidopsis thaliana] gi|23506149|gb|AAN31086.1|
           At4g12080/F16J13_150 [Arabidopsis thaliana]
          Length = 356

 Score = 99.0 bits (245), Expect = 3e-20
 Identities = 75/172 (43%), Positives = 95/172 (54%), Gaps = 32/172 (18%)
 Frame = +1

Query: 94  GISSGVTVIGAEAPSAYHVAPRSEAPNQVHV------PAPASAGTA-------------- 213
           G   G+TV+ ++APS +HVA RSE+ NQ         P P+S  TA              
Sbjct: 17  GNDGGITVVRSDAPSDFHVAQRSESSNQSPTSVTPPPPQPSSHHTAPPPLQISTVTTTTT 76

Query: 214 -AAMVGSPVSVGLDVTIKKKRGRPRKYGPDGPVSMALSPLPISSSVPPSN---------D 363
            AAM G  +S GL   +KKKRGRPRKYGPDG V +ALSP PISS+  PS+         D
Sbjct: 77  TAAMEG--ISGGL---MKKKRGRPRKYGPDGTV-VALSPKPISSAPAPSHLPPPSSHVID 130

Query: 364 Y-ASGKRGKPRGM-EYKQSKEMGLDHHLGDLNACSDGTSFMPHIITVNAGED 513
           + AS KR K +    + ++K      +LG+   CS G +F PHIITVN GED
Sbjct: 131 FSASEKRSKVKPTNSFNRTKYHHQVENLGEWAPCSVGGNFTPHIITVNTGED 182

>pir||T06615 hypothetical protein F16J13.150 - Arabidopsis thaliana
           gi|4586113|emb|CAB40949.1| putative DNA-binding protein
           [Arabidopsis thaliana] gi|7267909|emb|CAB78251.1|
           putative DNA-binding protein [Arabidopsis thaliana]
          Length = 365

 Score = 96.7 bits (239), Expect = 1e-19
 Identities = 74/171 (43%), Positives = 94/171 (54%), Gaps = 32/171 (18%)
 Frame = +1

Query: 94  GISSGVTVIGAEAPSAYHVAPRSEAPNQVHV------PAPASAGTA-------------- 213
           G   G+TV+ ++APS +HVA RSE+ NQ         P P+S  TA              
Sbjct: 13  GNDGGITVVRSDAPSDFHVAQRSESSNQSPTSVTPPPPQPSSHHTAPPPLQISTVTTTTT 72

Query: 214 -AAMVGSPVSVGLDVTIKKKRGRPRKYGPDGPVSMALSPLPISSSVPPSN---------D 363
            AAM G  +S GL   +KKKRGRPRKYGPDG V +ALSP PISS+  PS+         D
Sbjct: 73  TAAMEG--ISGGL---MKKKRGRPRKYGPDGTV-VALSPKPISSAPAPSHLPPPSSHVID 126

Query: 364 Y-ASGKRGKPRGM-EYKQSKEMGLDHHLGDLNACSDGTSFMPHIITVNAGE 510
           + AS KR K +    + ++K      +LG+   CS G +F PHIITVN GE
Sbjct: 127 FSASEKRSKVKPTNSFNRTKYHHQVENLGEWAPCSVGGNFTPHIITVNTGE 177

>ref|NP_194262.1| putative protein; protein id: At4g25320.1, supported by cDNA:
           gi_20466212 [Arabidopsis thaliana]
           gi|7486058|pir||T05553 hypothetical protein F24A6.160 -
           Arabidopsis thaliana gi|4454020|emb|CAA23073.1| putative
           protein [Arabidopsis thaliana]
           gi|7269383|emb|CAB81343.1| putative protein [Arabidopsis
           thaliana] gi|20466213|gb|AAM20424.1| putative protein
           [Arabidopsis thaliana] gi|28059577|gb|AAO30071.1|
           putative protein [Arabidopsis thaliana]
          Length = 404

 Score = 92.0 bits (227), Expect = 4e-18
 Identities = 66/178 (37%), Positives = 88/178 (49%), Gaps = 33/178 (18%)
 Frame = +1

Query: 79  MEAREGISSGVTVIGAEAPSAYHVA------------PRSEAPNQVHVP---APASAGTA 213
           ME REG +    +  +      H A            PR E PN   VP    PA+A  A
Sbjct: 1   MEEREGTNINNNITSSFGLKQQHEAAASDGGYSMDPPPRPENPNPFLVPPTTVPAAATVA 60

Query: 214 AAMV---GSPVSVGLDVT------IKKKRGRPRKYGPDGPVSMALSPLPISSSVPPSNDY 366
           AA+     +P S+ +         +KKKRGRPRKY PDG + + LSP+PISSSVP ++++
Sbjct: 61  AAVTENAATPFSLTMPTENTSAEQLKKKRGRPRKYNPDGTLVVTLSPMPISSSVPLTSEF 120

Query: 367 ASGKRGKPRGME---YKQSKEMGLDHHLGDLNACSDGT------SFMPHIITVNAGED 513
              KRG+ RG      K+S+    D    D N    GT      +F PH++ VNAGED
Sbjct: 121 PPRKRGRGRGKSNRWLKKSQMFQFDRSPVDTNLAGVGTADFVGANFTPHVLIVNAGED 178

>ref|NP_194008.1| putative DNA binding protein; protein id: At4g22770.1, supported by
           cDNA: 12041. [Arabidopsis thaliana]
           gi|7486882|pir||T04572 hypothetical protein T12H17.160 -
           Arabidopsis thaliana gi|2827554|emb|CAA16562.1| putative
           DNA binding protein [Arabidopsis thaliana]
           gi|7269124|emb|CAB79232.1| putative DNA binding protein
           [Arabidopsis thaliana] gi|21537115|gb|AAM61456.1|
           putative DNA binding protein [Arabidopsis thaliana]
          Length = 334

 Score = 91.3 bits (225), Expect = 6e-18
 Identities = 69/156 (44%), Positives = 89/156 (56%), Gaps = 16/156 (10%)
 Frame = +1

Query: 94  GISSGVTVIGAEAPSAYHVAPRSEA----PNQVHVPAPA----SAGTAAAMVGSPVSVGL 249
           G   GVTV+ + APS +H+APRSE     PN V  P P     S   +AAM G   S G 
Sbjct: 13  GSDGGVTVVRSNAPSDFHMAPRSETSNTPPNSVAPPPPPPPQNSFTPSAAMDG--FSSG- 69

Query: 250 DVTIKKKRGRPRKYGPDGPVSMALSPLPISSSVPPSN---DYA--SGKRGKPRGMEYKQS 414
              IKK+RGRPRKYG DG  ++ LSP PISS+ P ++   D++  S KRGK +      S
Sbjct: 70  --PIKKRRGRPRKYGHDG-AAVTLSPNPISSAAPTTSHVIDFSTTSEKRGKMKPATPTPS 126

Query: 415 KEMGLDH---HLGDLNACSDGTSFMPHIITVNAGED 513
             +   +   +LG+ +  S   +F PHIITVNAGED
Sbjct: 127 SFIRPKYQVENLGEWSPSSAAANFTPHIITVNAGED 162

>ref|NP_199972.1| putative protein; protein id: At5g51590.1 [Arabidopsis thaliana]
           gi|9758201|dbj|BAB08675.1| contains similarity to
           DNA-binding protein~gene_id:K17N15.14 [Arabidopsis
           thaliana]
          Length = 419

 Score = 84.3 bits (207), Expect = 7e-16
 Identities = 66/192 (34%), Positives = 92/192 (47%), Gaps = 47/192 (24%)
 Frame = +1

Query: 79  MEAREG--ISSGVTVIGAEA------PSAYHVAPRSEAPNQVHVPAPASAGTAAAM---- 222
           ME REG  I++  T  G +       P  Y   PRSE PN   V   +++  AAA+    
Sbjct: 1   MEEREGTNINNIPTSFGLKQHETPLPPPGY--PPRSENPNLFPVGQSSTSSAAAAVKPSE 58

Query: 223 -VGSPVSVGLDVT-----IKKKRGRPRKYGPDGPVSMALSPLPISSSVPPSNDYASGKRG 384
            V  P S+ + V      +KKKRGRPRKY PDG +++ LSP+PISSSVP ++++ S KRG
Sbjct: 59  NVAPPFSLTMPVENSSSELKKKRGRPRKYNPDGSLAVTLSPMPISSSVPLTSEFGSRKRG 118

Query: 385 KPRGMEYKQSKEMGLDHHLGDLNACSDGT-----------------------------SF 477
           + RG    + +  G      + N  ++                               SF
Sbjct: 119 RGRGRGRGRGRGRGQGQGSREPNNNNNDNNWLKNPQMFEFNNNTPTSGGGGPAEIVSPSF 178

Query: 478 MPHIITVNAGED 513
            PH++TVNAGED
Sbjct: 179 TPHVLTVNAGED 190

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 498,270,512
Number of Sequences: 1393205
Number of extensions: 12100415
Number of successful extensions: 40394
Number of sequences better than 10.0: 183
Number of HSP's better than 10.0 without gapping: 36621
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 40021
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 16154357632
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
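
The reported bit scores and E-values can be approximately reproduced from the parameters listed above. The sketch below rests on three assumptions that the output itself does not state:

- the usual Karlin-Altschul relations apply: bit score = (lambda * S - ln K) / ln 2, and E = search space * 2^(-bit score);
- the gapped lambda and K are the ones that apply;
- the effective query length is 515 // 3 residues minus the effective HSP length.

The function names are hypothetical, and because lambda and K are printed in rounded form the results agree only to within rounding.

```python
import math

# Values copied from the BLASTX report above.
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410
DB_LETTERS, DB_SEQS = 448_689_247, 1_393_205
QUERY_NT, EFF_HSP_LEN = 515, 115


def bit_score(raw):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (LAMBDA_GAPPED * raw - math.log(K_GAPPED)) / math.log(2)


def search_space():
    """Effective search space under the assumptions named in the lead-in."""
    eff_query = QUERY_NT // 3 - EFF_HSP_LEN      # assumption: translated query length
    eff_db = DB_LETTERS - DB_SEQS * EFF_HSP_LEN  # one HSP length removed per sequence
    return eff_query * eff_db


def e_value(raw):
    """Expected number of chance hits scoring at least `raw`."""
    return search_space() * 2.0 ** (-bit_score(raw))


# Raw scores of the five reported hits (the numbers in parentheses after "bits").
for raw in (245, 239, 227, 225, 207):
    print(raw, round(bit_score(raw), 1), f"{e_value(raw):.0e}")
```

Under these assumptions `search_space()` returns 16154357632, the same figure as the "effective search space used" line, and a raw score of 245 gives a bit score close to the reported 99.0.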


EST assemble image


   clone         accession  position (start, end)
1  MF026c01_f    BP029624    1  516
2  MPDL088e02_f  AV781104    6  502
3  MF083b01_f    BP032662   19  177




Lotus japonicus
Kazusa DNA Research Institute