KMC001415A_c01

Fasta Sequence
>KMC001415A_C01 KMC001415A_c01
agtAGTAGTAGCTGAACAACATCACATTAACTGCTAATATGGGGACAAGTTTCTGAAGGG
CTTAATTGATAATCTCATACCCACAAGAAAAAAGCCCTTCTAAGCCCCTTGTTACATAGC
ATAAAATCCAAACCTAAAAATGTAACACAAGTATCTCAACATACCCAAGTCCAGATAACT
TGATAACCCCCATAAAAACACCAACTTAAAGCAAAATAACCAAACCAAAACCAAAAAAGG
AAGAGTAAAAAATAAGGGGGTACTATAAAAGCAGTGAAGTTGATTAATCACTTCTTGAGA
CTACCGCGAACGCCAGGGCGAGGCTTGGTGGAGGACTCAGCAACGACGGTGGCGTGGAGG
TTGACGACCTTGTCCTCCTCAAAAGGAATCTTGGGATGACGCGCTTCATGGTGAATCTGC
ATCGATTTCACATCAGGCGCTGTGACTTTGCAGTGAGGGCACTCCCACTTGGCATGGCCA
CCTTTCTCAAGACCCGTTCGGTCCTTCATACCCGCTTTTCCGCCGCCGCGGTTTGTCGTC
GCAGCATCCAGCTTCGCCGCCAGCTCCTTCGCCGTGTGCTTCTTCGGCTTCGCTTTCCCC
GTCATGGTTAACTTTTCCCCCTTCTCTTCTTCTGGAAAATTGATTAAGGGTTTGTTGTTT
GATTCCCTTTGGGATTATGCGATTGATGATCGGGGTGATGAGAAATtggaaccaagattg
gatattggttggggttgaaagggttgatcggaaaccctaatttggttctaccactactac
taccaa


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC001415A_C01 KMC001415A_c01
         (786 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_566182.1| expressed protein; protein id: At3g02790.1, sup...   172  6e-42
ref|NP_197151.1| putative protein; protein id: At5g16470.1 [Arab...   164  2e-39
emb|CAC33094.1| hypothetical protein [Rhodomonas sp. CS24]             77  2e-13
gb|EAA11520.1| agCP6191 [Anopheles gambiae str. PEST]                  38  0.15
gb|AAA35584.1| basonuclin                                              37  0.25

>ref|NP_566182.1| expressed protein; protein id: At3g02790.1, supported by cDNA:
           gi_13605546, supported by cDNA: gi_16323295 [Arabidopsis
           thaliana] gi|6728983|gb|AAF26981.1|AC018363_26 unknown
           protein [Arabidopsis thaliana]
           gi|13605547|gb|AAK32767.1|AF361599_1 AT3g02790/F13E7_27
           [Arabidopsis thaliana] gi|16323296|gb|AAL15403.1|
           AT3g02790/F13E7_27 [Arabidopsis thaliana]
          Length = 105

 Score =  172 bits (435), Expect = 6e-42
 Identities = 76/105 (72%), Positives = 92/105 (87%)
 Frame = -2

Query: 605 MTGKAKPKKHTAKELAAKLDAATTNRGGGKAGMKDRTGLEKGGHAKWECPHCKVTAPDVK 426
           MTGKAKPKKHTAKE+ AK+DAA TNRGGGKAG+ DRTG EKGGHAK+ECPHCK+TAP +K
Sbjct: 1   MTGKAKPKKHTAKEIQAKIDAALTNRGGGKAGIADRTGKEKGGHAKYECPHCKITAPGLK 60

Query: 425 SMQIHHEARHPKIPFEEDKVVNLHATVVAESSTKPRPGVRGSLKK 291
           +MQIHHE++HP I +EE K+VNLHA +   + +KP+PG+RGSLKK
Sbjct: 61  TMQIHHESKHPNIIYEESKLVNLHAVLAPVAESKPKPGIRGSLKK 105
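
The Query lines of this alignment are a conceptual translation of the contig, not the nucleotides themselves. The sketch below shows one way to reproduce that translation from the FASTA record above with the standard genetic code. The helper names (reverse_complement, translate, minus_frame, hsp_query_protein) are illustrative and are not part of BLAST. The frame numbering assumes the usual BLASTX convention that frame -1 starts at the last base of the query.

```python
# Contig sequence copied from the FASTA record above (786 nt).
SEQ = (
    "agtAGTAGTAGCTGAACAACATCACATTAACTGCTAATATGGGGACAAGTTTCTGAAGGG"
    "CTTAATTGATAATCTCATACCCACAAGAAAAAAGCCCTTCTAAGCCCCTTGTTACATAGC"
    "ATAAAATCCAAACCTAAAAATGTAACACAAGTATCTCAACATACCCAAGTCCAGATAACT"
    "TGATAACCCCCATAAAAACACCAACTTAAAGCAAAATAACCAAACCAAAACCAAAAAAGG"
    "AAGAGTAAAAAATAAGGGGGTACTATAAAAGCAGTGAAGTTGATTAATCACTTCTTGAGA"
    "CTACCGCGAACGCCAGGGCGAGGCTTGGTGGAGGACTCAGCAACGACGGTGGCGTGGAGG"
    "TTGACGACCTTGTCCTCCTCAAAAGGAATCTTGGGATGACGCGCTTCATGGTGAATCTGC"
    "ATCGATTTCACATCAGGCGCTGTGACTTTGCAGTGAGGGCACTCCCACTTGGCATGGCCA"
    "CCTTTCTCAAGACCCGTTCGGTCCTTCATACCCGCTTTTCCGCCGCCGCGGTTTGTCGTC"
    "GCAGCATCCAGCTTCGCCGCCAGCTCCTTCGCCGTGTGCTTCTTCGGCTTCGCTTTCCCC"
    "GTCATGGTTAACTTTTCCCCCTTCTCTTCTTCTGGAAAATTGATTAAGGGTTTGTTGTTT"
    "GATTCCCTTTGGGATTATGCGATTGATGATCGGGGTGATGAGAAATtggaaccaagattg"
    "gatattggttggggttgaaagggttgatcggaaaccctaatttggttctaccactactac"
    "taccaa"
).upper()

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna):
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate complete codons from the first base; '*' marks a stop."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def minus_frame(query_length, start):
    """Frame label (-1, -2 or -3) of a minus-strand HSP starting at `start`."""
    return -((query_length - start) % 3 + 1)


def hsp_query_protein(seq, start, end):
    """Translate a minus-strand HSP given 1-based coordinates with start > end."""
    return translate(reverse_complement(seq[end - 1:start]))


protein = hsp_query_protein(SEQ, 605, 291)
print(minus_frame(len(SEQ), 605))
print(protein)
```

The printed protein can be compared residue by residue with the two Query lines of the alignment above (positions 605 to 291).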

>ref|NP_197151.1| putative protein; protein id: At5g16470.1 [Arabidopsis thaliana]
           gi|9759129|dbj|BAB09614.1|
           gb|AAF26981.1~gene_id:MQK4.20~similar to unknown protein
           [Arabidopsis thaliana] gi|27808636|gb|AAO24598.1|
           At5g16470 [Arabidopsis thaliana]
          Length = 104

 Score =  164 bits (414), Expect = 2e-39
 Identities = 76/106 (71%), Positives = 91/106 (85%), Gaps = 1/106 (0%)
 Frame = -2

Query: 605 MTGKAKPKKHTAKELAAKLDAATTNRGGGKAGMKDRTGLEKGGHAKWECPHCKVTAPDVK 426
           MTGKAKPKKHTAKEL AK DAA TNRGGGKAG+ DRTG EKGGHAK+ECPHCK+T PD+K
Sbjct: 1   MTGKAKPKKHTAKELQAKADAALTNRGGGKAGLADRTGKEKGGHAKYECPHCKITVPDLK 60

Query: 425 SMQIHHEARHPKIPFEEDKVVNLHATVVAES-STKPRPGVRGSLKK 291
           +MQIHHE++HPK+ +EE +  NLH  + A + S+KP+PG+RGSLKK
Sbjct: 61  TMQIHHESKHPKLTYEEPR--NLHEALAAPAESSKPKPGIRGSLKK 104

>emb|CAC33094.1| hypothetical protein [Rhodomonas sp. CS24]
          Length = 124

 Score = 77.4 bits (189), Expect = 2e-13
 Identities = 36/69 (52%), Positives = 49/69 (70%)
 Frame = -2

Query: 596 KAKPKKHTAKELAAKLDAATTNRGGGKAGMKDRTGLEKGGHAKWECPHCKVTAPDVKSMQ 417
           K    KHT+ E+A+K   AT N GGGKAG++DR G  K GHAK+ CP CK+ A  +K+MQ
Sbjct: 17  KGVAAKHTSGEVASKTALATRNAGGGKAGLQDRKG-GKAGHAKFICPECKMQAASMKNMQ 75

Query: 416 IHHEARHPK 390
            H++++HPK
Sbjct: 76  DHYDSKHPK 84

>gb|EAA11520.1| agCP6191 [Anopheles gambiae str. PEST]
          Length = 751

 Score = 38.1 bits (87), Expect = 0.15
 Identities = 26/83 (31%), Positives = 33/83 (39%), Gaps = 1/83 (1%)
 Frame = -2

Query: 632 EEEKGEKLTMTGKAKPKKHTAKELAAKLDAATTNRGGGKA-GMKDRTGLEKGGHAKWECP 456
           +E+ G K     K   KK  AK+  A+ D  TT+ G  K    K  T        K  C 
Sbjct: 246 DEKDGSKQQTPTKKSAKKEDAKDSNAEDDRKTTDDGSDKKRSTKINTKYRTSNFIKLTCT 305

Query: 455 HCKVTAPDVKSMQIHHEARHPKI 387
           HCK+     K  Q H   R  K+
Sbjct: 306 HCKLKCVTFKEYQTHLYTRTHKM 328

>gb|AAA35584.1| basonuclin
          Length = 993

 Score = 37.4 bits (85), Expect = 0.25
 Identities = 37/131 (28%), Positives = 61/131 (46%), Gaps = 19/131 (14%)
 Frame = -2

Query: 725  ISNLGSNFSSPRSSIA*SQRESNNKP-LINFPEE--EKGEKLTMTGKAKPK-------KH 576
            I + G+   +P  +   S+RE+ + P LI  P E  + G +   T   +P+       + 
Sbjct: 614  IESSGAISQTPEQATHNSERETEHTPALIMVPREVEDGGHEHYFTPGMEPQVPFSDYMEL 673

Query: 575  TAKELAAKLDAATTNRGGGKAGMKDRTGLEK-GGHA--------KWECPHCKVTAPDVKS 423
              + LA  L +A +NRG     ++D   LE  G HA        +++C  CK T  +  S
Sbjct: 674  QQRLLAGGLFSALSNRGMAFPCLEDSKELEHVGQHALARQIEENRFQCDICKKTFKNACS 733

Query: 422  MQIHHEARHPK 390
            ++IHH+  H K
Sbjct: 734  VKIHHKNMHVK 744

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 740,327,411
Number of Sequences: 1393205
Number of extensions: 18388503
Number of successful extensions: 88895
Number of sequences better than 10.0: 90
Number of HSP's better than 10.0 without gapping: 74544
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 88100
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 39215601880
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
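
The footer values above are enough to cross-check the reported scores. The sketch below uses the standard Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 with the gapped lambda and K, and E = (effective search space) * 2^(-S'). The function names are illustrative. Because lambda and K are printed in rounded form, and because this BLAST version may apply further adjustments that the report does not show, the results should be read as approximate agreement rather than an exact reproduction.

```python
import math

# Values copied from the BLASTX report above.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 121
SEARCH_SPACE = 39_215_601_880
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

# Effective database length: each sequence is shortened by the effective HSP length.
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH

# Effective query length implied by the reported search space.
implied_query_length = SEARCH_SPACE // effective_db_length


def bit_score(raw_score):
    """Convert a raw alignment score to bits using the gapped lambda and K."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(bits):
    """Approximate E-value for a bit score over the reported search space."""
    return SEARCH_SPACE * 2.0 ** (-bits)


print(effective_db_length, implied_query_length)
# Raw scores are the values in parentheses on each "Score =" line.
for accession, raw in [("NP_566182.1", 435), ("NP_197151.1", 414),
                       ("CAC33094.1", 189), ("EAA11520.1", 87),
                       ("AAA35584.1", 85)]:
    bits = bit_score(raw)
    print(f"{accession}: {bits:.1f} bits, E ~ {expect_value(bits):.1e}")
```

The effective database length computed this way equals the "effective length of database" line in the footer. The computed bit scores agree with the reported ones to within rounding, and the E-values agree closely for the three strongest hits and somewhat less closely for the two weakest.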


EST assemble image


No.  Clone         Accession  Start  End
1    MPD023d12_f   AV771582       1  528
2    MWM119h02_f   AV766643       4  338
3    MF065g04_f    BP031776       4  282
4    SPD082b12_f   BP050528       7  504
5    GNf054g10     BP071421      10  425
6    GENLf086g03   BP067052      12  513
7    MPD031f04_f   AV772139      50  589
8    MWM021b07_f   AV764941     386  789




Lotus japonicus
Kazusa DNA Research Institute