KMC016381A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC016381A_C01 KMC016381A_c01
aTGAATGAATAAATAAATAAATTCAAATTACTTCTCACTTTTACAACTTTAACATTAATT
AAATTAAAATCATACAAATGATTAAAAAAAACTTCTTGGAATGATCACTCACAAATCACA
TTTTATCTATCGAACCCAACATGCCTTTGTCTTTGAGCAGTTTGAGGCACTTGAGCAGCC
AAACAGTCCCAATCATCATGCCCTCTGAGCTGTGTTCTTTCTGCATCTCAGGAGAACCAA
TTAATCATAGAATGAATTTCTCTTGTGTTATTATTTCCAATAATTCTCCAACACTATCAA
TCATTTATATGTGCAAGAATGCTTGAGGGGGATTGGAGCACTGAGCTTCTTGGGATCCAA
GGTGATGGAACATTGAATCCTCTTGTAGTACTTGGGCTTCACCAATTTCCCTAGGACATA
AGCTCTGGATCGCAGCACAAAGTTCAGGTTCAATGGCACTGGCACAGTTGGCATACCAGT
TGTGCTGCTCAAGCTAGCACCACTTCCATACAGAGGGATCTTGTTGCCCATCACTGCCAC
ACTCACCAACCTGTGACTCCTTCTATGTTGATAAAACTCCTTCATATTCCCTGCAGCAAT
CACAATTTCTGAATAGGACAGTTCTAAGGGTGTAGATGCAACATGAACCCCAAAGAATGT
GCCAGTGTTACGGTATGTGAATTTCAAAGTAGAGTTCATGGAGATCATATCAGTAGCCAC
CCCAGTGGAATCTGAACCAGCTTGGACTTGAACATGATCAAACTTTATGCTCTTGATAAA
AATCTTGGGTTTCATGGGTCTGCTAGCACCCCAGAGAATAAGCGAAAACAGTGTGAAGAG
GAGAAGAAATCCCAGAAGAAAAATGAGGAAGTAGCAGCGACGAGAGAGAGTTCTGTCACG
ATCTTCCCCTTGGAGAAGCCCTTCTTCCTCAATGACATCGATCTGCTTCCATGGCTTGAG
ACTGTGGTGGTGGTGGTGGTTGTCTTTCTTGCGGTGAGGAGCAGAGAACCGAGTGGAGGA
TGAAGAATGAGGAGGGGAAGCGTTGGGGCTGAGAACAGGAGTGGAGTGGAAGGAAGTGGT
GACGGTTTTCTCGCCGTCGTGAGAGTCCCTTGAGGGGCTCTGAACAAAGTAGAGAGGACG
GCGAGGTGGGGATCTTGCAGGGGATGATGCAGAAATgctggtcacctctgagtctgtctt
ggcatgcatttttccttctttgatggaacctgtccttgaaagaatgtgaatgacag


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC016381A_C01 KMC016381A_c01
         (1256 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_564495.1| expressed protein; protein id: At1g45688.1, sup...   348  1e-94
ref|NP_199100.1| putative protein; protein id: At5g42860.1 [Arab...   343  4e-93
gb|AAO42869.1| At5g42860 [Arabidopsis thaliana]                       341  2e-92
emb|CAB53482.1| CAA30379.1 protein [Oryza sativa]                     337  2e-91
ref|NP_181730.1| unknown protein; protein id: At2g41990.1 [Arabi...   205  1e-51

>ref|NP_564495.1| expressed protein; protein id: At1g45688.1, supported by cDNA: 8255.,
            supported by cDNA: gi_20466719 [Arabidopsis thaliana]
            gi|25405173|pir||A96511 unknown protein [imported] -
            Arabidopsis thaliana
            gi|12321012|gb|AAG50630.1|AC083835_15 unknown protein
            [Arabidopsis thaliana] gi|20466720|gb|AAM20677.1| unknown
            protein [Arabidopsis thaliana] gi|21595730|gb|AAM66126.1|
            unknown [Arabidopsis thaliana] gi|23198230|gb|AAN15642.1|
            unknown protein [Arabidopsis thaliana]
          Length = 342

 Score =  348 bits (893), Expect = 1e-94
 Identities = 189/340 (55%), Positives = 236/340 (68%), Gaps = 40/340 (11%)
 Frame = -3

Query: 1209 MHAKTDSEVTSISASSPARSPPRRPLYFVQSPSRDSHDGEKTVTTSFHSTPVLSPNASPP 1030
            MHAKTDSEVTS++ASSPARSP RRP+Y+VQSPSRDSHDGEKT T SFHSTPVLSP  SPP
Sbjct: 1    MHAKTDSEVTSLAASSPARSP-RRPVYYVQSPSRDSHDGEKTAT-SFHSTPVLSPMGSPP 58

Query: 1029 HS----------SSSTRFSA---PHRKKDNHHH------HHSLKPWKQIDVIEEEGLLQG 907
            HS          SSS+RFS    P  +K N +       H   K WK+  VIEEEGLL  
Sbjct: 59   HSHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 118

Query: 906  EDRDRTLSRRCYFLIFLLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDS 727
             DRD  + RRCY L F++GF +LF  FSLIL+GA++PMKPKI +KSI F+ +++QAG D+
Sbjct: 119  GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 178

Query: 726  TGVATDMISMNSTLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRL 547
             GV TDMI+MN+TL+  YRNTGTFFGVHV STP++LS+S+I I +G++K+FYQ R+S R 
Sbjct: 179  GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSERT 238

Query: 546  VSVAVMGNKIPLYGSGASLSSTT---------------------GMPTVPVPLNLNFVLR 430
            V V V+G KIPLYGSG++L                           P  PVP+ L+FV+R
Sbjct: 239  VLVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVR 298

Query: 429  SRAYVLGKLVKPKYYKRIQCSITLDPKKLSAPIPLKHSCT 310
            SRAYVLGKLV+PK+YK+I+C I  + K L+  I +  +CT
Sbjct: 299  SRAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKNCT 338

>ref|NP_199100.1| putative protein; protein id: At5g42860.1 [Arabidopsis thaliana]
            gi|9758574|dbj|BAB09187.1|
            emb|CAB53482.1~gene_id:MBD2.5~similar to unknown protein
            [Arabidopsis thaliana]
          Length = 320

 Score =  343 bits (879), Expect = 4e-93
 Identities = 188/323 (58%), Positives = 230/323 (71%), Gaps = 23/323 (7%)
 Frame = -3

Query: 1209 MHAKTDSEVTSISASSPARSPPRRPLYFVQSPSRDSHDGEKTVTTSFHSTPVL-SPNASP 1033
            MHAKTDSEVTS+SASSP RSP RRP YFVQSPSRDSHDGEKT T SFHSTPVL SP  SP
Sbjct: 1    MHAKTDSEVTSLSASSPTRSP-RRPAYFVQSPSRDSHDGEKTAT-SFHSTPVLTSPMGSP 58

Query: 1032 PHS-SSSTRFSAPHRKKDNHHHHHSLKPWKQIDVIEEEGLLQGEDRDR-TLSRRCYFLIF 859
            PHS SSS+RFS  +  K   H        KQ  +IEEEGLL   DR++  L RRCY L F
Sbjct: 59   PHSHSSSSRFSKINGSKRKGHAGE-----KQFAMIEEEGLLDDGDREQEALPRRCYVLAF 113

Query: 858  LLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKF 679
            ++GF LLF  FSLIL+ A++P KPKI +KSI F+ ++VQAG D+ G+ TDMI+MN+TL+ 
Sbjct: 114  IVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMITMNATLRM 173

Query: 678  TYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSG 499
             YRNTGTFFGVHV S+P++LS+S+I I +G++K+FYQ R+S R V V V+G+KIPLYGSG
Sbjct: 174  LYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGDKIPLYGSG 233

Query: 498  ASLSS--------------------TTGMPTVPVPLNLNFVLRSRAYVLGKLVKPKYYKR 379
            ++L                          P  PVP+ LNF +RSRAYVLGKLV+PK+YKR
Sbjct: 234  STLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAYVLGKLVQPKFYKR 293

Query: 378  IQCSITLDPKKLSAPIPLKHSCT 310
            I C I  + KKLS  IP+ ++CT
Sbjct: 294  IVCLINFEHKKLSKHIPITNNCT 316

>gb|AAO42869.1| At5g42860 [Arabidopsis thaliana]
          Length = 320

 Score =  341 bits (874), Expect = 2e-92
 Identities = 187/323 (57%), Positives = 229/323 (70%), Gaps = 23/323 (7%)
 Frame = -3

Query: 1209 MHAKTDSEVTSISASSPARSPPRRPLYFVQSPSRDSHDGEKTVTTSFHSTPVL-SPNASP 1033
            MHAKTDSEVTS+SASSP RSP RRP YFVQSPSRDSHDGEKT T SFHSTPVL SP  SP
Sbjct: 1    MHAKTDSEVTSLSASSPTRSP-RRPAYFVQSPSRDSHDGEKTAT-SFHSTPVLTSPMGSP 58

Query: 1032 PHS-SSSTRFSAPHRKKDNHHHHHSLKPWKQIDVIEEEGLLQGEDRDR-TLSRRCYFLIF 859
            PHS SSS+RFS  +  K   H        KQ  +IEEEGLL   DR++  L RRCY L F
Sbjct: 59   PHSHSSSSRFSKINGSKRKGHAGE-----KQFAMIEEEGLLDDGDREQEALPRRCYVLAF 113

Query: 858  LLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMNSTLKF 679
            ++GF LLF  FSLIL+ A++P KPKI +KSI F+ ++VQAG D+ G+ TDMI+MN+TL+ 
Sbjct: 114  IVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMITMNATLRM 173

Query: 678  TYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIPLYGSG 499
             YRNTGTFFG HV S+P++LS+S+I I +G++K+FYQ R+S R V V V+G+KIPLYGSG
Sbjct: 174  LYRNTGTFFGXHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGDKIPLYGSG 233

Query: 498  ASLSS--------------------TTGMPTVPVPLNLNFVLRSRAYVLGKLVKPKYYKR 379
            ++L                          P  PVP+ LNF +RSRAYVLGKLV+PK+YKR
Sbjct: 234  STLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAYVLGKLVQPKFYKR 293

Query: 378  IQCSITLDPKKLSAPIPLKHSCT 310
            I C I  + KKLS  IP+ ++CT
Sbjct: 294  IVCLINFEHKKLSKHIPITNNCT 316

>emb|CAB53482.1| CAA30379.1 protein [Oryza sativa]
          Length = 835

 Score =  337 bits (864), Expect = 2e-91
 Identities = 180/319 (56%), Positives = 228/319 (71%), Gaps = 14/319 (4%)
 Frame = -3

Query: 1221 KEGKMHAKTDSEVTSISASSPARSPPRR---PLYFVQSPSRDSHDGEKTVTTSFHSTPVL 1051
            K  KMHAKTDSEVTS++ SSP RSP  R   P+Y+VQSPSRDSHDGEKT T S HSTP L
Sbjct: 517  KTRKMHAKTDSEVTSLAPSSPPRSPTSRGGRPVYYVQSPSRDSHDGEKTAT-SVHSTPAL 575

Query: 1050 SPNASPPHS----SSSTRFSA-PHRKKDNHHHHHSLKP----WKQIDVIEEEGLLQGEDR 898
            SP  SP HS    SSS+RFS  P RK D         P    W++I VIEEEGLL  ED 
Sbjct: 576  SPMGSPRHSVGRDSSSSRFSGHPKRKGDKSSSGRKGAPAGKGWQEIGVIEEEGLLDDEDE 635

Query: 897  DRTLSRRC-YFLIFLLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDSTG 721
             R + +RC YFLIF+LGF++LF+ F+L+LWGASR  KP+I IKSI F++  +QAG+D++ 
Sbjct: 636  RRGIPKRCKYFLIFVLGFVVLFSFFALVLWGASRSQKPQIVIKSITFENFIIQAGTDASL 695

Query: 720  VATDMISMNSTLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVS 541
            V TDM + NST+K TYRNTGTFFG+HV + P  LSYS++ +A+G++ +FYQ R S R VS
Sbjct: 696  VPTDMATTNSTVKLTYRNTGTFFGIHVTADPFTLSYSQLTLASGDLNKFYQARSSRRTVS 755

Query: 540  VAVMGNKIPLYGSGASLSSTTGMPTV-PVPLNLNFVLRSRAYVLGKLVKPKYYKRIQCSI 364
            V VMGNK+PLYG G +L++  G  ++ PVP+ L   + SRAYVLG LVKPK+ + I+C +
Sbjct: 756  VGVMGNKVPLYGGGPTLTAGKGSGSMAPVPMILRTTVHSRAYVLGALVKPKFTRAIECKV 815

Query: 363  TLDPKKLSAPIPLKHSCTY 307
             ++P KL+ PI L  SC Y
Sbjct: 816  LMNPAKLNKPISLDKSCIY 834

>ref|NP_181730.1| unknown protein; protein id: At2g41990.1 [Arabidopsis thaliana]
            gi|25408769|pir||F84848 hypothetical protein At2g41990
            [imported] - Arabidopsis thaliana
            gi|1871184|gb|AAB63544.1| unknown protein [Arabidopsis
            thaliana]
          Length = 297

 Score =  205 bits (521), Expect = 1e-51
 Identities = 128/307 (41%), Positives = 173/307 (55%), Gaps = 8/307 (2%)
 Frame = -3

Query: 1209 MHAKTDSEVTSISASSPARSPPR---RPLYFVQSPSRDSHDGEKTVTTSFHSTPVLSPNA 1039
            MHAKTDSE TSI A+  A SPPR   RPLY+VQSPS  +HD EK    SF S   L  + 
Sbjct: 1    MHAKTDSEATSIDAA--ALSPPRSAIRPLYYVQSPS--NHDVEKM---SFGSGCSLMGSP 53

Query: 1038 SPPHSSSSTRFSAPHRKKDNHHHHHSLKPWKQID-----VIEEEGLLQGEDRDRTLSRRC 874
            + PH    +          +     +L  +K I      + + +    G D D       
Sbjct: 54   THPHYYHCSPIHHSRESSTSRFSDRALLSYKSIRERRRYINDGDDKTDGGDDDDPFRNVR 113

Query: 873  YFLIFLLGFLLLFTLFSLILWGASRPMKPKIFIKSIKFDHVQVQAGSDSTGVATDMISMN 694
             ++  LL  + LFT+FSLILWGAS+   PK+ +K +    + +QAG+D +GV TDM+S+N
Sbjct: 114  LYVWLLLSVIFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDMLSLN 173

Query: 693  STLKFTYRNTGTFFGVHVASTPLELSYSEIVIAAGNMKEFYQHRRSHRLVSVAVMGNKIP 514
            ST++  YRN  TFF VHV ++PL L YS +++++G M +F   R     V   V G++IP
Sbjct: 174  STVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVVQGHQIP 233

Query: 513  LYGSGASLSSTTGMPTVPVPLNLNFVLRSRAYVLGKLVKPKYYKRIQCSITLDPKKLSAP 334
            LYG G S      + T+ +PLNL  VL S+AY+LG+LV  K+Y RI CS TLD   L   
Sbjct: 234  LYG-GVSFH----LDTLSLPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDANHLPKS 288

Query: 333  IPLKHSC 313
            I L  SC
Sbjct: 289  ISLLRSC 295

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,141,837,794
Number of Sequences: 1393205
Number of extensions: 27942702
Number of successful extensions: 126781
Number of sequences better than 10.0: 111
Number of HSP's better than 10.0 without gapping: 97870
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 120699
length of database: 448,689,247
effective HSP length: 126
effective length of database: 273,145,417
effective search space used: 79758461764
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 SPD042a06_f BP047307 1 581
2 SPD029g01_f BP046321 62 632
3 MF058g11_f BP031380 479 969
4 MF017c04_f BP029136 777 1317




Lotus japonicus
Kazusa DNA Research Institute