KMC015266A_c01

Fasta Sequence
>KMC015266A_C01 KMC015266A_c01
tgggtacgggccccccttctgctccaaaaccTAATCCCAAATCACCATGGCACAGAAGCA
GAAGCGCCCTTCTCCTCTCGAAGATCCACCCACCGCGTCTTCCTCCTCCGAATCAGAGGA
GGAAGATGACCAACCTCTCTCTCAAGTAAAACACGCAGCAGCAGAAGAAGAAGAACTTTC
CTCAGAAGAAGAAGGTTCCTCTGAGGAGGAGGAGGAGGATGAAACCACCGCTCCAGCCCC
TCCGGCAGCCACAACCTCCCACACCAAACCTCCTCAACCCGAACCCGAATCCGATTCCGC
CACCCAATCCGATTCTGAGTCCGACACCGACTCCGACCAGGCCCCCTCCGCATCCGCTCC
CACTCCTAACCCTAAAGTCAAACCCCTCGCCACCAAGCCCATGGACCAGACCCAGACCCA
CAAGCCCAAGGCTCAACCCTCCCCGGCTCCGGCCAAATCGGCAGCCAAGCGCGCCGCCGA
GAGCAATGCCAACGCCGGAGACTCCAAACGGGCTAAGAAGAAGGCAGTTGACTCGGCCCC
CGCCGCCGCCGCTGGTTCCGATGAGGAGATGGAGGAGGACGGGAAGAAGTCCGGGAACGA
CTCCAAGAAGCAGTTTACGAGATTGTGGTCCGATGAGGATGAGATCGCCATTCTCAAGGG
GCTTGCTGATTTCATTTCGAAAACTGGGAATGACCCGTTGAAGTACCCTGATGCTTTTTA
CGATTTCGTTATCAGGTCGCTTCAAGCTGATGCTACCCGCACTCAGGTGAAGGATAAGGT
TCGAAGGCTGAAGAAGAAGTTTCAGACCCTTGCAGGCAAAGGGAAGAATGGAGAGACCCC
CAAATTCCCCAAAGTCCATGATCAGAAAACTTTTGAATTGGCCAAGAAGGTATGGGGGAA
GGAGGCCAATGAAGCAGCAGCCGCGGAAGAGAAGCC


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC015266A_C01 KMC015266A_c01
         (936 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAM65376.1| unknown [Arabidopsis thaliana]                         141  1e-32
ref|NP_564784.1| expressed protein; protein id: At1g61730.1, sup...   140  2e-32
emb|CAC39398.1| storekeeper protein [Solanum tuberosum]               132  6e-30
ref|NP_194251.1| putative protein; protein id: At4g25210.1, supp...   130  2e-29
ref|NP_191949.1| putative protein; protein id: At4g00390.1, supp...   130  3e-29

>gb|AAM65376.1| unknown [Arabidopsis thaliana]
          Length = 377

 Score =  141 bits (356), Expect = 1e-32
 Identities = 96/281 (34%), Positives = 145/281 (51%), Gaps = 2/281 (0%)
 Frame = +2

Query: 62  KRPSPLEDPPTASSSSESEEEDDQPLSQVKHAAAEEEELSSEEEGSSEE--EEEDETTAP 235
           K+ +PLEDPPTA+SS E + E  +         A ++  SSEE+   +   +    TTA 
Sbjct: 3   KKLNPLEDPPTATSSDEDDVETSEA------GEASDDSSSSEEDAPIKIRIKSPSATTAA 56

Query: 236 APPAATTSHTKPPQPEPESDSATQSDSESDTDSDQAPSASAPTPNPKVKPLATKPMDQTQ 415
           APPA +T+ +       +SDS ++++++SD++S   P++ +            K  D T 
Sbjct: 57  APPAKSTAVSAAA----DSDSGSETETDSDSESTNPPNSGSGKTIALNAVNLKKKEDPTS 112

Query: 416 THKPKAQPSPAPAKSAAKRAAESNANAGDSKRAKKKAVDSAPAAAAGSDEEMEEDGKKSG 595
           +    A P+    KS  KR A   A    +KR KK                 EE  KK G
Sbjct: 113 SSATLALPA---VKSGTKRPASEAAATTSTKRVKKD----------------EETVKKPG 153

Query: 596 NDSKKQFTRLWSDEDEIAILKGLADFISKTGNDPLKYPDAFYDFVIRSLQADATRTQVKD 775
                 F RLWS+EDEI +L+G+ DF + TG  P    +AFYDF+ +S+  + ++ Q  D
Sbjct: 154 G-----FQRLWSEEDEILVLQGMIDFKADTGKSPYVDTNAFYDFLKKSISFEVSKNQFMD 208

Query: 776 KVRRLKKKFQTLAGKGKNGETPKFPKVHDQKTFELAKKVWG 898
           K+R L+KK+  +  +G+N   P F K HD+K FEL+K +WG
Sbjct: 209 KIRSLRKKY--IGKEGRN--EPSFVKAHDKKAFELSKFIWG 245

>ref|NP_564784.1| expressed protein; protein id: At1g61730.1, supported by cDNA:
           38650. [Arabidopsis thaliana] gi|25350270|pir||A96643
           hypothetical protein T13M11.9 [imported] - Arabidopsis
           thaliana gi|4508074|gb|AAD21418.1| 45341
          Length = 376

 Score =  140 bits (354), Expect = 2e-32
 Identities = 97/282 (34%), Positives = 143/282 (50%), Gaps = 3/282 (1%)
 Frame = +2

Query: 62  KRPSPLEDPPTASSSSESEEEDDQPLSQVKHAAAEEEELSSEEEGSSEEEEEDETTAPAP 241
           K+ +PLEDPPTA+SS    +EDD   S+   A        S++  SSEE+   +    +P
Sbjct: 3   KKLNPLEDPPTATSS----DEDDVETSEAGEA--------SDDSSSSEEDVPIKIRIKSP 50

Query: 242 PAATTSHTKPPQPEPESDSATQSDSESDTDSDQAPSASAPTPNPKVKPLATKPMDQTQTH 421
            A T +   PP       +A  SDS S+T++D    ++ P  +   K +A   ++  +  
Sbjct: 51  SATTAA--APPAKSTAVSTAADSDSGSETETDSDSESTNPPNSGSGKTIALNTVNLKKKE 108

Query: 422 KPKAQPSPA--PA-KSAAKRAAESNANAGDSKRAKKKAVDSAPAAAAGSDEEMEEDGKKS 592
            P +  +    PA KS  KR A   A    +KR KK                 EE  KK 
Sbjct: 109 DPTSSSATLALPAMKSGTKRPASEAAATTSTKRVKKD----------------EESVKKP 152

Query: 593 GNDSKKQFTRLWSDEDEIAILKGLADFISKTGNDPLKYPDAFYDFVIRSLQADATRTQVK 772
           G      F RLWS+EDEI +L+G+ DF + TG  P    +AFYDF+ +S+  + ++ Q  
Sbjct: 153 GG-----FQRLWSEEDEILVLQGMIDFKADTGKSPYVDTNAFYDFLKKSISFEVSKNQFM 207

Query: 773 DKVRRLKKKFQTLAGKGKNGETPKFPKVHDQKTFELAKKVWG 898
           DK+R L+KK+  +  +G+N   P F K HD+K FEL+K +WG
Sbjct: 208 DKIRSLRKKY--IGKEGRN--EPSFVKAHDKKAFELSKFIWG 245

>emb|CAC39398.1| storekeeper protein [Solanum tuberosum]
          Length = 399

 Score =  132 bits (333), Expect = 6e-30
 Identities = 101/292 (34%), Positives = 146/292 (49%), Gaps = 8/292 (2%)
 Frame = +2

Query: 47  MAQKQKRPSPLEDPPTASSSSESEEEDDQPLSQVKHAAAEEEELSSEEEG---SSEEEEE 217
           MA K K  S L D P ++SSSE +E        V+ +  EEE+ S EEEG   S EE EE
Sbjct: 1   MAPKTK--SRLVDQPPSASSSEEQE-------LVEESQEEEEQQSREEEGEEESGEETEE 51

Query: 218 DETTAPAPPAATTSHTK-----PPQPEPESDSATQSDSESDTDSDQAPSASAPTPNPKVK 382
           DE    A P      ++     P +P+  S+S +++ S SD++++   S     P+P   
Sbjct: 52  DEEPKTAHPVVKKPISQKLVQTPQKPQFSSESGSENGSGSDSEAESGNSL----PSPSAS 107

Query: 383 PLATKPMDQTQTHKPKAQPSPAPAKSAAKRAAESNANAGDSKRAKKKAVDSAPAAAAGSD 562
               KP          A  +  P+K AAKR  E+    G  K          P  A    
Sbjct: 108 DFTVKPN--------VAAKAATPSKPAAKRPQEAQKEKGKKK----------PKIA---- 145

Query: 563 EEMEEDGKKSGNDSKKQFTRLWSDEDEIAILKGLADFISKTGNDPLKYPDAFYDFVIRSL 742
              EE+ KKS    +     LWSD+D++A+LKG+ ++ +  G +P     AF++F+   L
Sbjct: 146 ---EEEEKKSPATPRS----LWSDDDQLALLKGILEYKTVKGMEPSADMSAFHEFIRGKL 198

Query: 743 QADATRTQVKDKVRRLKKKFQTLAGKGKNGETPKFPKVHDQKTFELAKKVWG 898
           QA+ +++Q+ DKVRRLKKKF T     K+GE P F K  D   FE +K++WG
Sbjct: 199 QAEVSKSQISDKVRRLKKKFLT---NVKDGEEPVFKKGQDFLIFEHSKRIWG 247

>ref|NP_194251.1| putative protein; protein id: At4g25210.1, supported by cDNA:
           gi_13272448, supported by cDNA: gi_14423517 [Arabidopsis
           thaliana] gi|7486062|pir||T05542 hypothetical protein
           F24A6.50 - Arabidopsis thaliana
           gi|4454009|emb|CAA23062.1| putative protein [Arabidopsis
           thaliana] gi|7269371|emb|CAB79430.1| putative protein
           [Arabidopsis thaliana]
           gi|13272449|gb|AAK17163.1|AF325095_1 putative protein
           [Arabidopsis thaliana]
          Length = 368

 Score =  130 bits (328), Expect = 2e-29
 Identities = 90/252 (35%), Positives = 132/252 (51%), Gaps = 7/252 (2%)
 Frame = +2

Query: 164 EEEELSSEEEGSSEEEEEDETTAPAPPAATTSHTKPPQPEPESDSATQSDSESDTDSDQA 343
           E   +SSEEE S    EE E++A  P    +S       +PESDS  +S+SES +  +  
Sbjct: 11  ESPPVSSEEEESGSSGEESESSAEVPKKVESSQ------KPESDSEGESESESSSGPEPE 64

Query: 344 PSASAPTPNPKVKPLATKPMDQTQTHK---PKAQPSPAPAKSAAKRAAESNANAGDSKRA 514
              S P    K+KP+ TKP+ +T       P++  +  P K AA  A +    + D++  
Sbjct: 65  ---SEPAKTIKLKPVGTKPIPETSGSAATVPESSTAKRPLKEAAPEAIKKQKTS-DTEHV 120

Query: 515 KKKAVDSAPAAAAGSDEEMEEDGKKSGNDSKKQFTRLWSDEDEIAILKGLADFISKTGND 694
           KK   +             +E  K S  D+KK F RL+S+ DEIA+L+G+ DF S  G D
Sbjct: 121 KKPITN-------------DEVKKISSEDAKKMFQRLFSETDEIALLQGIIDFTSTKG-D 166

Query: 695 PLKYPDAFYDFVIRSLQADATRTQVKDKVRRLKKKFQTLA----GKGKNGETPKFPKVHD 862
           P +  DAF  +V + +  DAT+ Q+  K++RLKKKF         KGK  +  +F K  +
Sbjct: 167 PYEDIDAFCIYVKKLIDFDATKNQIVTKLQRLKKKFNNAVKNSLKKGKTEDDIEFAKDLE 226

Query: 863 QKTFELAKKVWG 898
           QK FEL++K+WG
Sbjct: 227 QKGFELSRKIWG 238

>ref|NP_191949.1| putative protein; protein id: At4g00390.1, supported by cDNA:
           157614. [Arabidopsis thaliana] gi|7485269|pir||T01532
           hypothetical protein A_IG005I10.6 - Arabidopsis thaliana
           gi|2252829|gb|AAB62828.1| A_IG005I10.6 gene product
           [Arabidopsis thaliana]
           gi|6049871|gb|AAF02786.1|AF195115_6 F5I10.6 gene product
           [Arabidopsis thaliana] gi|7267126|emb|CAB80797.1|
           putative protein [Arabidopsis thaliana]
          Length = 364

 Score =  130 bits (327), Expect = 3e-29
 Identities = 92/274 (33%), Positives = 140/274 (50%), Gaps = 2/274 (0%)
 Frame = +2

Query: 83  DPPTASSSSESEEEDDQPLSQVKHAAAEEEE-LSSEEEGSSEEEEEDETTAPAPPAATTS 259
           DPPTA SS    +EDD   S+   +++EE+E + S    ++    +    + A PA +TS
Sbjct: 6   DPPTAPSS----DEDDVETSEDDSSSSEEDEPIKSLPATTAAAPAKSTAVSAATPAKSTS 61

Query: 260 -HTKPPQPEPESDSATQSDSESDTDSDQAPSASAPTPNPKVKPLATKPMDQTQTHKPKAQ 436
                P       +A  SDS S++++D    ++ P  +   K +A+K  D   +    A 
Sbjct: 62  VSAAAPSKSTAVSAAADSDSGSESETDSDSESTDPPKSGSGKTIASKKKDDPSSST--AT 119

Query: 437 PSPAPAKSAAKRAAESNANAGDSKRAKKKAVDSAPAAAAGSDEEMEEDGKKSGNDSKKQF 616
            +    KS AKRAA S A    +KR KK                 EE  KK        F
Sbjct: 120 LALPAVKSGAKRAA-SEAATTSTKRVKKD----------------EESVKKPA-----LF 157

Query: 617 TRLWSDEDEIAILKGLADFISKTGNDPLKYPDAFYDFVIRSLQADATRTQVKDKVRRLKK 796
            RLWSD+DEI++L+G+ D+ + TG  P    +AFY+F  +S+  + +++Q  DKVR L+K
Sbjct: 158 QRLWSDDDEISMLQGMIDYHADTGKSPSADTNAFYEFQKKSISFEVSKSQFSDKVRSLRK 217

Query: 797 KFQTLAGKGKNGETPKFPKVHDQKTFELAKKVWG 898
           K++   GK    + P+F K HD+K FEL+K +WG
Sbjct: 218 KYRAKEGK----DEPRFVKAHDKKAFELSKFIWG 247

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,045,729,686
Number of Sequences: 1393205
Number of extensions: 34419330
Number of successful extensions: 747714
Number of sequences better than 10.0: 18797
Number of HSP's better than 10.0 without gapping: 189148
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 423212
length of database: 448,689,247
effective HSP length: 123
effective length of database: 277,325,032
effective search space used: 52137106016
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
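
The footer values tie the raw scores in parentheses to the reported bit scores and E-values through the usual Karlin-Altschul relations, S' = (lambda * S - ln K) / ln 2 and E = N * 2^(-S'), where N is the effective search space. The sketch below applies them using the gapped Lambda and K and the "effective search space used" from this report. The function names are illustrative. Because the printed parameters are rounded, agreement with the reported E-values should only be expected to be approximate, and the report appears to print the integer part of the bit score.

```python
import math

# Gapped statistical parameters and search space, copied from the report footer.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 52137106016  # "effective search space used"


def bit_score(raw_score):
    """Normalize a raw alignment score to bits."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


# Raw scores of the five hits listed in this report.
for raw in (356, 354, 333, 328, 327):
    print(raw, f"{bit_score(raw):.2f} bits", f"E ~ {expect_value(raw):.1e}")
```

For the top hit (raw score 356) this gives a bit score a little under 142 and an E-value close to the reported 1e-32.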


EST assemble image


No.  Clone          Accession  Start  End
1    MFB020b07_f    BP035398       1  586
2    MFBL025h08_f   BP042537      32  554
3    MWM066g12_f    AV765774     207  614
4    SPD024c02_f    BP045879     466  936
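
The clone positions listed above can be checked against the 936-letter contig by merging the intervals and counting how many ESTs cover each position. This is a small sketch, not the assembly method used by the database. It assumes the two numbers per clone are 1-based inclusive start and end coordinates on the contig; `merge_intervals` and `depth_at` are illustrative names.

```python
# (clone, start, end) copied from the clone list; coordinates are assumed
# to be 1-based and inclusive.
CLONES = [
    ("MFB020b07_f", 1, 586),
    ("MFBL025h08_f", 32, 554),
    ("MWM066g12_f", 207, 614),
    ("SPD024c02_f", 466, 936),
]
CONTIG_LENGTH = 936  # "(936 letters)" in the BLASTX query header


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def depth_at(position):
    """Number of clones whose interval contains the given position."""
    return sum(1 for _, start, end in CLONES if start <= position <= end)


covered = merge_intervals([(start, end) for _, start, end in CLONES])
print("covered regions:", covered)
print("minimum depth:", min(depth_at(p) for p in range(1, CONTIG_LENGTH + 1)))
```

Under that assumption the four clones tile the contig without gaps, with the 3' region past position 614 supported by a single EST.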




Lotus japonicus
Kazusa DNA Research Institute