KMC005748A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC005748A_C01 KMC005748A_c01
ACCAATACCAATCAATCATTTTGCTTTAAAAGTGAATAAATATATACCAATTCCTTTACA
TCACTCATATCCTCAAGAACTCCATTCCAAGTACCAGAAATATCAGAAAACATCCGATCT
GCATGATCAAAATTTCCGCCTTGTAATTTGATTGCAAGGGTGGTGAAAGGTTCAACTCTA
ACAAGGTAATATAAGACTGTTCCTGCACTTGAGTAGTGTGATCCATAATGGAATTTGGGG
ATAACTGGATCATCAAAGCTAGCATATCTCTCTTGAAATTTTTTCAGACGATCTGGGTTC
AGTGCACCAACAGGCTTTGAAAGATCTCGATAAGAAGAAGGATTTGAAAGATCCAAACTC
TCTGAACTGTAATCAGAAAGAATCCAGGGAAAAACAGGATACTGCGCTATATCATTATAA
CTAGGCTCAGCCAGTGTATTGAATTGCATTAGATACTCAAAATTACTGATCTTGTTTGGA
TCTGAAGCAGAGTGTGCACAGTGGAAGAAATCGAAGAAACCAAACCCAGGAAACCACTCG
A


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005748A_C01 KMC005748A_c01
         (541 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_182078.1| unknown protein; protein id: At2g45540.1 [Arabi...   260  7e-69
ref|NP_191651.1| putative protein; protein id: At3g60920.1 [Arab...   232  2e-60
gb|AAN38986.1| LvsC [Dictyostelium discoideum]                        200  1e-50
pir||H96615 hypothetical protein F16M22.8 [imported] - Arabidops...   195  3e-49
gb|EAA06488.1| agCP13347 [Anopheles gambiae str. PEST]                194  7e-49

>ref|NP_182078.1| unknown protein; protein id: At2g45540.1 [Arabidopsis thaliana]
            gi|7485640|pir||T00867 hypothetical protein At2g45540
            [imported] - Arabidopsis thaliana
            gi|2979554|gb|AAC06163.1| unknown protein [Arabidopsis
            thaliana]
          Length = 2946

 Score =  260 bits (665), Expect = 7e-69
 Identities = 123/143 (86%), Positives = 133/143 (92%)
 Frame = -3

Query: 473  KISNFEYLMQFNTLAEPSYNDIAQYPVFPWILSDYSSESLDLSNPSSYRDLSKPVGALNP 294
            +ISNFEYLMQ NTLA  SYNDI QYPVFPWI+SD SSESLDLSNPS++RDLSKP+GALNP
Sbjct: 2239 EISNFEYLMQLNTLAGRSYNDITQYPVFPWIISDNSSESLDLSNPSTFRDLSKPIGALNP 2298

Query: 293  DRLKKFQERYASFDDPVIPKFHYGSHYSSAGTVLYYLVRVEPFTTLAIKLQGGNFDHADR 114
            +RLKKFQERY+SF+DPVIPKFHYGSHYSSAG VLYYL RVEPFTTL+I+LQGG FDHADR
Sbjct: 2299 ERLKKFQERYSSFEDPVIPKFHYGSHYSSAGAVLYYLARVEPFTTLSIQLQGGKFDHADR 2358

Query: 113  MFSDISGTWNGVLEDMSDVKELV 45
            MFSD  GTWNGVLEDMSDVKELV
Sbjct: 2359 MFSDFPGTWNGVLEDMSDVKELV 2381

>ref|NP_191651.1| putative protein; protein id: At3g60920.1 [Arabidopsis thaliana]
            gi|11358248|pir||T50513 hypothetical protein T27I15_10 -
            Arabidopsis thaliana gi|8388608|emb|CAB94128.1| putative
            protein [Arabidopsis thaliana]
          Length = 1857

 Score =  232 bits (592), Expect = 2e-60
 Identities = 110/143 (76%), Positives = 128/143 (88%), Gaps = 1/143 (0%)
 Frame = -3

Query: 473  KISNFEYLMQFNTLAEPSYNDIAQYPVFPWILSDYSSESLDLSNPSSYRDLSK-PVGALN 297
            +ISNFEYLMQ NTLA  SYNDI QYP+FPWIL DY SE LDLSNPS+YRDLSK P+GALN
Sbjct: 1602 EISNFEYLMQLNTLAGRSYNDITQYPIFPWILCDYVSEILDLSNPSNYRDLSKVPIGALN 1661

Query: 296  PDRLKKFQERYASFDDPVIPKFHYGSHYSSAGTVLYYLVRVEPFTTLAIKLQGGNFDHAD 117
            P+RLKKFQE+++SF+DPVIPKFHYGSHYSSAG VL+YL RVEPFTTL+I+LQG  FD AD
Sbjct: 1662 PERLKKFQEKHSSFEDPVIPKFHYGSHYSSAGAVLHYLARVEPFTTLSIQLQGRKFDRAD 1721

Query: 116  RMFSDISGTWNGVLEDMSDVKEL 48
            ++FSDI+ TW GVL+DM++VKEL
Sbjct: 1722 QIFSDIAATWKGVLQDMNNVKEL 1744

>gb|AAN38986.1| LvsC [Dictyostelium discoideum]
          Length = 2402

 Score =  200 bits (508), Expect = 1e-50
 Identities = 92/156 (58%), Positives = 117/156 (74%)
 Frame = -3

Query: 512  DFFHCAHSASDPNKISNFEYLMQFNTLAEPSYNDIAQYPVFPWILSDYSSESLDLSNPSS 333
            D    A S     +ISNF+YLM  NT+A  +YND+ QYPVFPW+++DY+S  LDL+   +
Sbjct: 1853 DMLKKATSEWQARRISNFDYLMTLNTIAGRTYNDLTQYPVFPWVIADYTSPVLDLNKAET 1912

Query: 332  YRDLSKPVGALNPDRLKKFQERYASFDDPVIPKFHYGSHYSSAGTVLYYLVRVEPFTTLA 153
            +RDLSKP+GALN  RL+ F++RY SFDDPVIPKF+YGSHYSSAG VL+YL+R+EPFTT  
Sbjct: 1913 FRDLSKPIGALNEKRLEIFKDRYESFDDPVIPKFYYGSHYSSAGIVLFYLIRLEPFTTQF 1972

Query: 152  IKLQGGNFDHADRMFSDISGTWNGVLEDMSDVKELV 45
            + LQGG FDH DRMF  I+  W+  L   +DVKEL+
Sbjct: 1973 LNLQGGRFDHPDRMFDSIALAWDNSLTSSTDVKELI 2008

>pir||H96615 hypothetical protein F16M22.8 [imported] - Arabidopsis thaliana
            gi|12321834|gb|AAG50953.1|AC073943_3 hypothetical protein
            [Arabidopsis thaliana]
          Length = 1224

 Score =  195 bits (496), Expect = 3e-49
 Identities = 90/143 (62%), Positives = 114/143 (78%)
 Frame = -3

Query: 473  KISNFEYLMQFNTLAEPSYNDIAQYPVFPWILSDYSSESLDLSNPSSYRDLSKPVGALNP 294
            +I+NFEYLM  NTLA  SYND+ QYPVFPW+++DYSSE+LD S  S++RDLSKPVGAL+ 
Sbjct: 604  EITNFEYLMILNTLAGRSYNDLTQYPVFPWVVADYSSETLDFSKASTFRDLSKPVGALDT 663

Query: 293  DRLKKFQERYASFDDPVIPKFHYGSHYSSAGTVLYYLVRVEPFTTLAIKLQGGNFDHADR 114
             R + F++RY SF DP IP F+YGSHYSS G+VLYYL+R+EPFT+L   LQGG FDHADR
Sbjct: 664  RRFEIFEDRYHSFSDPDIPSFYYGSHYSSMGSVLYYLLRLEPFTSLHRSLQGGKFDHADR 723

Query: 113  MFSDISGTWNGVLEDMSDVKELV 45
            +F  + G++   L + SDVKEL+
Sbjct: 724  LFQSVEGSFRNCLSNTSDVKELI 746

>gb|EAA06488.1| agCP13347 [Anopheles gambiae str. PEST]
          Length = 1308

 Score =  194 bits (492), Expect = 7e-49
 Identities = 86/143 (60%), Positives = 112/143 (78%)
 Frame = -3

Query: 473  KISNFEYLMQFNTLAEPSYNDIAQYPVFPWILSDYSSESLDLSNPSSYRDLSKPVGALNP 294
            +ISNFEYLM  NT+A  +YND+ QYPVFPW++++Y S  LDLS PS+YRDLSKP+GALNP
Sbjct: 654  EISNFEYLMFLNTIAGRTYNDLNQYPVFPWVITNYESRELDLSQPSNYRDLSKPIGALNP 713

Query: 293  DRLKKFQERYASFDDPVIPKFHYGSHYSSAGTVLYYLVRVEPFTTLAIKLQGGNFDHADR 114
             R + F+ERY ++D P IP FHYG+HYS+A  VL +L+R+EPFTT+ + LQGG FDH DR
Sbjct: 714  SRREYFEERYETWDTPGIPPFHYGTHYSTAAFVLNWLIRIEPFTTMFLALQGGKFDHPDR 773

Query: 113  MFSDISGTWNGVLEDMSDVKELV 45
            +FS ++ +W     D SDVKEL+
Sbjct: 774  LFSSVALSWKNCQRDTSDVKELI 796

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 458,724,178
Number of Sequences: 1393205
Number of extensions: 9362191
Number of successful extensions: 25792
Number of sequences better than 10.0: 112
Number of HSP's better than 10.0 without gapping: 24881
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 25715
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18462123008
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MWM053g02_f AV765538 1 427
2 MWM170g11_f AV767354 1 541




Lotus japonicus
Kazusa DNA Research Institute