KMC007688A_c01

Fasta Sequence
>KMC007688A_C01 KMC007688A_c01
aacaatgacaataccaaaactatattcccattctcattatcacccacaattaagatttcg
tagaactaatccataagaatAAGAATCATTAAGTCTAAAAGGCATAAAAATAAGCACTAC
TACCCGAGATGGGTCCGTATAAAAGAAGGCACTTGGTAAGAAATAAGAATGATGAGGTAC
ATGGCTAACAACCATATGCAAAATTGGTACTAACAATACAAATTACATGGCGATTGGCTG
AGTTGGGCATCTTGAAGCTGCTGACAGAACGGTTATAGCTCTGTAATGACAGAAAAATAA
CCATGCATCTTGGGGGATAACCATGCATCTTGGGGAACGCAGCTAACTTGACCAGATTAT
TTTTTTGTTTTCTTGTGTTTCTGTTTCTGTTTCTTTTGGGTAGGTGCTTTCTCATTTTGG
TCAACATTATCCATTAGATGCTCAAATTCCTCATAACTTGCAAATGGAGACGCAGCACTC
TTGCCACCACTTTTACGTTTTCTTTTTCTTTTACTAACTTGATCCTCCTCTTCCTCATCA
GAGGCATCATCAACATCCCCAATTTGAATATCATTGTCATCATCACTACCTTCAACTTCA
TCATCACTCCCTTCATCACCACTTTGAAGAGCATCTATGT


Nr Search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC007688A_C01 KMC007688A_c01
         (640 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_177388.1| hypothetical protein; protein id: At1g72440.1 [...    54  2e-06
gb|AAM44363.1| hypothetical protein [Dictyostelium discoideum] g...    53  4e-06
sp|P04929|HRPX_PLALO HISTIDINE-RICH GLYCOPROTEIN PRECURSOR gi|72...    50  2e-05
pir||A54523 histidine-rich protein - Plasmodium lophurae (fragme...    47  2e-04
ref|NP_732342.2| CG31122-PA [Drosophila melanogaster] gi|2838135...    47  2e-04

>ref|NP_177388.1| hypothetical protein; protein id: At1g72440.1 [Arabidopsis thaliana]
            gi|25350651|pir||E96748 hypothetical protein T10D10.9
            [imported] - Arabidopsis thaliana
            gi|12325271|gb|AAG52578.1|AC016529_9 hypothetical
            protein; 39633-44904 [Arabidopsis thaliana]
          Length = 1056

 Score = 53.5 bits (127), Expect = 2e-06
 Identities = 29/86 (33%), Positives = 48/86 (55%)
 Frame = -3

Query: 617  DEGSDDEVEGSDDDNDIQIGDVDDASDEEEEDQVSKRKRKRKSGGKSAASPFASYEEFEH 438
            D   D ++   +DDN++      D  D++ +    + K+K+K   K   SPFAS EE++H
Sbjct: 965  DTDMDMDLIDDEDDNNVDDDGTGDGGDDDSDGDDGRSKKKKKEKRKRK-SPFASLEEYKH 1023

Query: 437  LMDNVDQNEKAPTQKKQKQKHKKTKK 360
            L   +DQ+EK  ++ K+K   + TKK
Sbjct: 1024 L---IDQDEKEDSKTKRKATSEPTKK 1046

>gb|AAM44363.1| hypothetical protein [Dictyostelium discoideum]
           gi|28828387|gb|AAM09303.2| similar to Plasmodium
           lophurae. Histidine-rich glycoprotein precursor
           [Dictyostelium discoideum]
          Length = 233

 Score = 52.8 bits (125), Expect = 4e-06
 Identities = 21/61 (34%), Positives = 26/61 (42%)
 Frame = +2

Query: 452 HNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYLQLHHHSLHHHFEE 631
           H+      H  HH +     +    P  H  HH HP   +H HHH+   HHH  HHH   
Sbjct: 75  HHHHHHHHHHHHHHHHHHHHHHPHHPHHHPHHHHHPHHHHHHHHHHHHHHHHHHHHHHHH 134

Query: 632 H 634
           H
Sbjct: 135 H 135

 Score = 52.0 bits (123), Expect = 7e-06
 Identities = 23/74 (31%), Positives = 28/74 (37%)
 Frame = +2

Query: 413 HFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYL 592
           H   H+P     PH+      H  HH +             H  HH H    +H HHH+ 
Sbjct: 98  HHPHHHPHHHHHPHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHH 144

Query: 593 QLHHHSLHHHFEEH 634
             HHH  HHH   H
Sbjct: 145 HHHHHHHHHHPHHH 158

 Score = 50.1 bits (118), Expect = 3e-05
 Identities = 23/74 (31%), Positives = 29/74 (39%)
 Frame = +2

Query: 413 HFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYL 592
           H   H+P     PH+      H  HH +     +       H  HH H    +H HHH+ 
Sbjct: 91  HHHHHHP---HHPHHHPHHHHHPHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHH 140

Query: 593 QLHHHSLHHHFEEH 634
             HHH  HHH   H
Sbjct: 141 HHHHHHHHHHHHHH 154

 Score = 48.5 bits (114), Expect = 7e-05
 Identities = 24/65 (36%), Positives = 29/65 (43%)
 Frame = +2

Query: 428 YPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYLQLHHH 607
           Y LD   PHN      H+ H+          +P  PH  HH H    +H HHH+   HHH
Sbjct: 40  YQLDVNNPHNPN-NNPHNPHNPNN-------NPHHPHHLHHHHHHHHHHHHHHHHHHHHH 91

Query: 608 SLHHH 622
             HHH
Sbjct: 92  HHHHH 96

 Score = 34.7 bits (78), Expect = 1.1
 Identities = 17/45 (37%), Positives = 18/45 (39%), Gaps = 7/45 (15%)
 Frame = +2

Query: 527 PLPHQRHHQHPQFEYHCHHH-------YLQLHHHSLHHHFEEHLC 640
           P PH   H HP    H HHH       +   H H  HHH E   C
Sbjct: 173 PNPHPHPHPHPHPHPHPHHHPNPNPHPHPHPHPHHHHHHQEASEC 217

>sp|P04929|HRPX_PLALO HISTIDINE-RICH GLYCOPROTEIN PRECURSOR gi|72400|pir||KGZQHL
           histidine-rich glycoprotein precursor - Plasmodium
           lophurae gi|9999|emb|CAA25698.1| histidine-rich protein
           [Plasmodium lophurae] gi|224316|prf||1101401A
           protein,His rich
          Length = 351

 Score = 50.4 bits (119), Expect = 2e-05
 Identities = 23/70 (32%), Positives = 31/70 (43%)
 Frame = +2

Query: 425 HYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYLQLHH 604
           H+P +   PH+ +    H   H       +    P PH  HH HP   +H   H+   HH
Sbjct: 64  HHPEEHHEPHHEEHHHHHPEEHHEPHHEEHHHHHPHPHHHHHHHPPHHHHHLGHHHHHHH 123

Query: 605 HSLHHHFEEH 634
            + HHH EEH
Sbjct: 124 AAHHHHHEEH 133

 Score = 49.3 bits (116), Expect = 4e-05
 Identities = 23/75 (30%), Positives = 31/75 (40%)
 Frame = +2

Query: 410 SHFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHY 589
           +H   H+  DA   H+   +  H  HH +     +       H  HH H   + H HHH+
Sbjct: 254 AHHHHHHHHDAHHHHHHHHDAHHHHHHHHDAHHHHHHHHDAHHHHHHHH---DAHHHHHH 310

Query: 590 LQLHHHSLHHHFEEH 634
              HHH  HHH   H
Sbjct: 311 HDAHHHHHHHHDAHH 325

 Score = 47.8 bits (112), Expect = 1e-04
 Identities = 22/75 (29%), Positives = 29/75 (38%)
 Frame = +2

Query: 410 SHFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHY 589
           +H   H+  DA   H+   +  H  HH +                HH H    +H HHH+
Sbjct: 274 AHHHHHHHHDAHHHHHHHHDAHHHHHHHH-------------DAHHHHHHHDAHHHHHHH 320

Query: 590 LQLHHHSLHHHFEEH 634
              HHH  HHH   H
Sbjct: 321 HDAHHHHHHHHDAHH 335

 Score = 47.4 bits (111), Expect = 2e-04
 Identities = 23/75 (30%), Positives = 29/75 (38%)
 Frame = +2

Query: 410 SHFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHY 589
           +H   H+  DA   H+   +  H  HH               H  HH H    +H HHH+
Sbjct: 284 AHHHHHHHHDAHHHHHHHHDAHHHHHHHDA------------HHHHHHHHDAHHHHHHHH 331

Query: 590 LQLHHHSLHHHFEEH 634
              HHH  HHH   H
Sbjct: 332 -DAHHHHHHHHDAHH 345

 Score = 45.1 bits (105), Expect = 8e-04
 Identities = 23/73 (31%), Positives = 31/73 (41%), Gaps = 3/73 (4%)
 Frame = +2

Query: 425 HYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LD---PPLPHQRHHQHPQFEYHCHHHYLQ 595
           H+   A   H+ +    H   H + +F  + L       PH  HH H    +H HHH+  
Sbjct: 135 HHHHAAHHHHHEEHHHHHHAAHHHPWFHHHHLGYHHHHAPHHHHHHHHAPHHHHHHHHAP 194

Query: 596 LHHHSLHHHFEEH 634
            HHH  HHH   H
Sbjct: 195 HHHH--HHHHAPH 205

 Score = 45.1 bits (105), Expect = 8e-04
 Identities = 21/70 (30%), Positives = 27/70 (38%)
 Frame = +2

Query: 425 HYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYLQLHH 604
           H+   A   H+      H  HH +     +      PH  HH H    +H HHH+   HH
Sbjct: 178 HHHHHAPHHHHHHHHAPHHHHHHHHAPHHHHHHHHAPHHHHHHHHGHHHHHHHHHGHHHH 237

Query: 605 HSLHHHFEEH 634
           H  HH    H
Sbjct: 238 HHHHHGHHHH 247

 Score = 44.7 bits (104), Expect = 0.001
 Identities = 22/74 (29%), Positives = 27/74 (35%)
 Frame = +2

Query: 413 HFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYL 592
           H G H+       H+      H  HH +             H  HH H    +H HHH+ 
Sbjct: 221 HHGHHH-------HHHHHHGHHHHHHHHH-----------GHHHHHHHHHDAHHHHHHHH 262

Query: 593 QLHHHSLHHHFEEH 634
             HHH  HHH   H
Sbjct: 263 DAHHHHHHHHDAHH 276

 Score = 42.7 bits (99), Expect = 0.004
 Identities = 23/76 (30%), Positives = 28/76 (36%), Gaps = 2/76 (2%)
 Frame = +2

Query: 413 HFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYL 592
           H   H+P     PH+      H  HH               H+ HH H    +H HH   
Sbjct: 102 HHHHHHP-----PHHHHHLGHHHHHHHAAHHHH--------HEEHHHHHHAAHHHHHEEH 148

Query: 593 QLHHHSLHHH--FEEH 634
             HHH+ HHH  F  H
Sbjct: 149 HHHHHAAHHHPWFHHH 164

 Score = 42.4 bits (98), Expect = 0.005
 Identities = 21/74 (28%), Positives = 28/74 (37%)
 Frame = +2

Query: 413 HFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYL 592
           H   H+       H+      H  HH +     +       H  HH H    +H HHH+ 
Sbjct: 211 HHAPHHHHHHHHGHHHHHHHHHGHHHHHHHHHGH-------HHHHHHHHDAHHHHHHHHD 263

Query: 593 QLHHHSLHHHFEEH 634
             HHH  HHH + H
Sbjct: 264 AHHHH--HHHHDAH 275

 Score = 37.4 bits (85), Expect = 0.17
 Identities = 14/33 (42%), Positives = 16/33 (48%)
 Frame = +2

Query: 536 HQRHHQHPQFEYHCHHHYLQLHHHSLHHHFEEH 634
           H+ HH H   E+H  HH    HHH   HH   H
Sbjct: 58  HEEHHHHHPEEHHEPHHEEHHHHHPEEHHEPHH 90

>pir||A54523 histidine-rich protein - Plasmodium lophurae (fragment)
           gi|552196|gb|AAA29616.1| histidine-rich protein
           [Plasmodium lophurae]
          Length = 140

 Score = 47.4 bits (111), Expect = 2e-04
 Identities = 23/79 (29%), Positives = 30/79 (37%), Gaps = 5/79 (6%)
 Frame = +2

Query: 413 HFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQ-----HPQFEYHC 577
           H   H+      PH+         HH + +F  +   P   H  HH      H   + H 
Sbjct: 36  HAPHHHHHHHHAPHHHHHHPWFHHHHHHPWFHHHHHHPWFHHHHHHDAHHHHHHHHDAHH 95

Query: 578 HHHYLQLHHHSLHHHFEEH 634
           HHH+   HHH  HHH   H
Sbjct: 96  HHHHHDAHHHHHHHHDAHH 114

 Score = 46.6 bits (109), Expect = 3e-04
 Identities = 23/73 (31%), Positives = 29/73 (39%)
 Frame = +2

Query: 416 FGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYLQ 595
           F  H+  DA   H+   +  H  HH               H  HH H    +H HHH+  
Sbjct: 75  FHHHHHHDAHHHHHHHHDAHHHHHHHDA------------HHHHHHHHDAHHHHHHHHDA 122

Query: 596 LHHHSLHHHFEEH 634
            HHH  HHH + H
Sbjct: 123 HHHH--HHHHDAH 133

 Score = 43.5 bits (101), Expect = 0.002
 Identities = 20/61 (32%), Positives = 24/61 (38%)
 Frame = +2

Query: 452 HNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYLQLHHHSLHHHFEE 631
           H+      H  HH +            PH  HH H    +H HHH+   HHH  HHH   
Sbjct: 2   HHHHHAPHHHHHHHHA-----------PHHHHHHHHAPHHHHHHHHAPHHHH--HHHHAP 48

Query: 632 H 634
           H
Sbjct: 49  H 49

 Score = 43.5 bits (101), Expect = 0.002
 Identities = 25/76 (32%), Positives = 31/76 (39%), Gaps = 2/76 (2%)
 Frame = +2

Query: 413 HFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYL 592
           H   H+      PH+         HH +  +F +    P  H  HH HP F +H HHH  
Sbjct: 26  HAPHHHHHHHHAPHHHHHHHHAPHHHHHHPWFHHHHHHPWFHHHHH-HPWFHHH-HHHDA 83

Query: 593 QLHHHSLH--HHFEEH 634
             HHH  H  HH   H
Sbjct: 84  HHHHHHHHDAHHHHHH 99

 Score = 43.1 bits (100), Expect = 0.003
 Identities = 21/74 (28%), Positives = 27/74 (36%)
 Frame = +2

Query: 413 HFGQHYPLDAQIPHNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYL 592
           H   H+P      H+   +  H  HH +                HH H    +H HHH+ 
Sbjct: 67  HHHHHHPW---FHHHHHHDAHHHHHHHH-------------DAHHHHHHHDAHHHHHHHH 110

Query: 593 QLHHHSLHHHFEEH 634
             HHH  HHH   H
Sbjct: 111 DAHHHHHHHHDAHH 124

 Score = 40.8 bits (94), Expect = 0.015
 Identities = 14/30 (46%), Positives = 16/30 (52%)
 Frame = +2

Query: 545 HHQHPQFEYHCHHHYLQLHHHSLHHHFEEH 634
           HH H    +H HHH+   HHH  HHH   H
Sbjct: 1   HHHHHHAPHHHHHHHHAPHHHHHHHHAPHH 30

 Score = 40.4 bits (93), Expect = 0.020
 Identities = 19/63 (30%), Positives = 24/63 (37%), Gaps = 2/63 (3%)
 Frame = +2

Query: 452 HNLQMETQHSCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYL--QLHHHSLHHHF 625
           H+      H  HH +     +      PH  HH H    +H HHH+     HHH   HH 
Sbjct: 1   HHHHHHAPHHHHHHHHAPHHHHHHHHAPHHHHHHHHAPHHHHHHHHAPHHHHHHPWFHHH 60

Query: 626 EEH 634
             H
Sbjct: 61  HHH 63

>ref|NP_732342.2| CG31122-PA [Drosophila melanogaster] gi|28381352|gb|AAF55560.3|
           CG31122-PA [Drosophila melanogaster]
          Length = 642

 Score = 47.0 bits (110), Expect = 2e-04
 Identities = 18/36 (50%), Positives = 19/36 (52%)
 Frame = +2

Query: 533 PHQRHHQHPQFEYHCHHHYLQLHHHSLHHHFEEHLC 640
           P  RHH HP   +  HHH    HHHS HHH   H C
Sbjct: 593 PQARHHHHPATSHRHHHHQHHSHHHS-HHHHHHHQC 627

 Score = 32.3 bits (72), Expect = 5.5
 Identities = 17/55 (30%), Positives = 23/55 (40%), Gaps = 3/55 (5%)
 Frame = +2

Query: 452 HNLQMETQH---SCHHFYVFFFFY*LDPPLPHQRHHQHPQFEYHCHHHYLQLHHH 607
           H  Q +TQ    + HH +         P   H+ HH      +H HHH+   HHH
Sbjct: 583 HMQQQQTQQPPQARHHHH---------PATSHRHHHHQHHSHHHSHHHH---HHH 625

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 572,963,765
Number of Sequences: 1393205
Number of extensions: 13290025
Number of successful extensions: 123661
Number of sequences better than 10.0: 1786
Number of HSP's better than 10.0 without gapping: 57182
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 95011
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 26723359358
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
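
Note on the statistics above: the figures in this footer can be cross-checked against each other and against the first hit (raw score 127, 53.5 bits, Expect 2e-06). The sketch below is only a consistency check, not a description of how BLASTX 2.2.2 computes these values internally. It assumes the standard Karlin-Altschul relations S' = (lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'), and it uses the gapped Lambda and K. The function names are illustrative. Because the report rounds its values, recomputed numbers agree only approximately.

```python
import math

# Values copied from the statistics footer of the report.
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
EFFECTIVE_HSP_LENGTH = 118
SEARCH_SPACE = 26_723_359_358
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_db_length(letters, sequences, hsp_length):
    """Database length after trimming one effective HSP length per sequence."""
    return letters - sequences * hsp_length


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (assumed Karlin-Altschul form)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


db_len = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
top_bits = bit_score(127)           # raw score of the At1g72440 hit
top_expect = expect_value(top_bits)

print(db_len)                       # compare with "effective length of database"
print(f"{top_bits:.1f}")            # compare with "Score = 53.5 bits (127)"
print(f"{top_expect:.0e}")          # compare with "Expect = 2e-06"
print(SEARCH_SPACE // db_len)       # effective query length implied by the footer
```

Dividing the reported search space by the effective database length gives the effective query length implied by the footer, in amino acids.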


EST assemble image


No.  Clone         Accession  Position
1    GNLf003g06    BP074981   1-218
2    MFBL052f07_f  BP043933   139-640
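
Note on the clone positions above: the two placements can be checked for overlap and for total coverage of the 640-letter contig. This is a minimal sketch that assumes the positions are 1-based, inclusive coordinates on the contig, which the record does not state. The helper names are illustrative.

```python
# (clone, accession, start, end) as listed in the table;
# coordinates are assumed to be 1-based and inclusive.
PLACEMENTS = [
    ("GNLf003g06", "BP074981", 1, 218),
    ("MFBL052f07_f", "BP043933", 139, 640),
]


def overlap(a, b):
    """Return the (start, end) shared by two placements, or None."""
    start = max(a[2], b[2])
    end = min(a[3], b[3])
    return (start, end) if start <= end else None


def covered_length(placements):
    """Count contig positions covered by at least one placement."""
    total = 0
    covered_to = 0  # highest position counted so far
    for _, _, start, end in sorted(placements, key=lambda p: p[2]):
        first_new = max(start, covered_to + 1)
        if end >= first_new:
            total += end - first_new + 1
            covered_to = end
    return total


shared = overlap(PLACEMENTS[0], PLACEMENTS[1])
shared_length = shared[1] - shared[0] + 1 if shared else 0
print(shared, shared_length)
print(covered_length(PLACEMENTS))
```

Under that assumption the two ESTs share positions 139 to 218 and together span the full query length reported by BLASTX.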




Lotus japonicus
Kazusa DNA Research Institute