KMC003539A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC003539A_C01 KMC003539A_c01
GACAAACAACAAGAAGGAAAAATCATATTTCACGGGCTACGGTGTTATATTTTATAATTG
TTCTTCATAAAGACAATAATCCACTGTCAACTGAGGCTAAAACTATACAGGAAAAAGAAA
GAAAGGCATCAACCTGTAAATGCAATTCACTGCAGCAACTCCAAGCTTGCACAGAATATC
AAGTGCATTTTTGTATGCTTCCCGTTCAATTAGTAGGAAAGAAAAAGAAAATAAAAAAGA
AGTAGATGCGGGATTCGACCAAGTAATGATCGGCATCATACAAGATATGTTTATTTATAA
AAGCAATCAGTAGCAGCATTAAGAGTCACAGGTTGTATATACATTGAATCACTTCCACTT
CCTGAATAATCCATGCTTGCCGAACTTGCCATGCTTAAATTTCCCATGCTTGCCGAACTT
TCCATGCTTGAACTTGCCATGCTTGAATTTCCCATGTGGCATAAAACCACCATGCCCATA
ACCGCCATGTCCATAGTGGCCATGGGAAACATGGTGAGCCCCATATGCAGCAGCTGCGGC
AGCAGCACCGCCAGCAAGCATTGCTCCCATACCGCCATGCCCTGATGACAAAGCAGAGAA
GTTAGTTGTCATCAAGCACAGTACAAAAACAATTAAACAGGCAATGCCCAAGTCTACTTT
GTTTATAAAG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC003539A_C01 KMC003539A_c01
         (670 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_193655.1| putative protein; protein id: At4g19200.1, supp...   104  9e-22
gb|AAH05782.1| Unknown (protein for MGC:12025) [Mus musculus]         100  2e-20
ref|NP_197267.1| glycine/proline-rich protein; protein id: At5g1...   100  2e-20
emb|CAB61840.1| putative glycine and proline-rich protein [Sporo...    97  2e-19
ref|NP_568642.1| expressed protein; protein id: At5g45350.1, sup...    86  5e-16

>ref|NP_193655.1| putative protein; protein id: At4g19200.1, supported by cDNA: 8188.
           [Arabidopsis thaliana] gi|25407569|pir||A85217
           hypothetical protein AT4g19200 [imported] - Arabidopsis
           thaliana gi|7268715|emb|CAB78922.1| putative protein
           [Arabidopsis thaliana] gi|21595622|gb|AAM66118.1|
           unknown [Arabidopsis thaliana]
           gi|24417344|gb|AAN60282.1| unknown [Arabidopsis
           thaliana] gi|27311843|gb|AAO00887.1| Unknown protein
           [Arabidopsis thaliana]
          Length = 179

 Score =  104 bits (260), Expect = 9e-22
 Identities = 59/96 (61%), Positives = 62/96 (64%), Gaps = 18/96 (18%)
 Frame = -2

Query: 585 SGH--GGMGAMLAGGAAAAAAAYGAHHVSH------GH-YGHGGYGHG-----GFMPHGK 448
           SGH  GG+G M+AG A AAAAAYGAHHV H      GH  GHGGYGH      G   HGK
Sbjct: 86  SGHSGGGLGGMIAGAAGAAAAAYGAHHVGHASHNPYGHAVGHGGYGHAPAHGFGHGGHGK 145

Query: 447 FKH----GKFKHGKFGKHGKFKHGKFGKHGLFRKWK 352
           FKH    GKFKHGK GKHG  KHG FG  G F+KWK
Sbjct: 146 FKHGKHGGKFKHGKHGKHG--KHGMFGGGGKFKKWK 179

>gb|AAH05782.1| Unknown (protein for MGC:12025) [Mus musculus]
          Length = 198

 Score =  100 bits (248), Expect = 2e-20
 Identities = 57/91 (62%), Positives = 60/91 (65%), Gaps = 12/91 (13%)
 Frame = -2

Query: 588 SSGHGGMGAMLAGGAAAAAAAYGAHHV---SHGHYGHGG-YGHGGFMPHGKFKHGKFKHG 421
           S G G MG MLAGGAAAAAAAYG HH+   SHGH+G GG  GH G   HG    GKFKH 
Sbjct: 115 SGGMGAMGGMLAGGAAAAAAAYGVHHLTSGSHGHHGGGGPLGHFGGGHHG----GKFKH- 169

Query: 420 KFGKHGKFKHGKFGKHG--------LFRKWK 352
             GKHGKFKHGKFGKHG         F+KWK
Sbjct: 170 --GKHGKFKHGKFGKHGGGMFGGGKKFKKWK 198

>ref|NP_197267.1| glycine/proline-rich protein; protein id: At5g17650.1 [Arabidopsis
           thaliana] gi|11357316|pir||T51469 glycine/proline-rich
           protein - Arabidopsis thaliana
           gi|9755790|emb|CAC01909.1| glycine/proline-rich protein
           [Arabidopsis thaliana]
          Length = 173

 Score =  100 bits (248), Expect = 2e-20
 Identities = 55/82 (67%), Positives = 63/82 (76%), Gaps = 6/82 (7%)
 Frame = -2

Query: 579 HGGMGAMLAGGAAAAAAAYGAHHVSH--GHYG-HGGYGHG-GFMPHGKFKHGKFKHGKFG 412
           HGG+GA++AGG AAAA   GAHH+SH  GHYG H G+G+G G+  HGKFKHGKFKHGKFG
Sbjct: 100 HGGIGAIIAGGVAAAA---GAHHMSHHHGHYGHHHGHGYGYGYHGHGKFKHGKFKHGKFG 156

Query: 411 KHGKF-KH-GKFGKHGLFRKWK 352
           KHG F KH GKF     F+KWK
Sbjct: 157 KHGMFGKHKGKF-----FKKWK 173

>emb|CAB61840.1| putative glycine and proline-rich protein [Sporobolus stapfianus]
          Length = 197

 Score = 97.1 bits (240), Expect = 2e-19
 Identities = 59/99 (59%), Positives = 64/99 (64%), Gaps = 20/99 (20%)
 Frame = -2

Query: 588 SSGHGG--MGAMLAGGAAAAAAAYGAHHVSHGH------------YGHGGYGHGGFMPHG 451
           SS HGG  MG +LAGGAAAAAAAYGAH +SHGH            Y  GGYGH G+  HG
Sbjct: 102 SSSHGGGNMG-LLAGGAAAAAAAYGAHKLSHGHSGGHGFPGGHGGYAVGGYGH-GYGGHG 159

Query: 450 KFKHGKFKHGKFGK-HGKFKHGKFGKHGL-----FRKWK 352
           KFKHG   HGKF   HGKFKHGK G HG+     F+KWK
Sbjct: 160 KFKHGHGGHGKFKHGHGKFKHGKHG-HGMFGGGKFKKWK 197

>ref|NP_568642.1| expressed protein; protein id: At5g45350.1, supported by cDNA:
           22538., supported by cDNA: gi_15529251 [Arabidopsis
           thaliana] gi|2129603|pir||S65780 glycine/proline-rich
           protein GPRP - Arabidopsis thaliana
           gi|1465364|emb|CAA59059.1| GPRP [Arabidopsis thaliana]
           gi|9758725|dbj|BAB09163.1| gene_id:MFC19.1~unknown
           protein [Arabidopsis thaliana]
           gi|15529252|gb|AAK97720.1| AT5g45350/MFC19_1
           [Arabidopsis thaliana] gi|16974403|gb|AAL31127.1|
           AT5g45350/MFC19_1 [Arabidopsis thaliana]
           gi|21592344|gb|AAM64295.1| unknown [Arabidopsis
           thaliana]
          Length = 177

 Score = 85.9 bits (211), Expect = 5e-16
 Identities = 56/95 (58%), Positives = 61/95 (63%), Gaps = 12/95 (12%)
 Frame = -2

Query: 600 FSALSSGH-GGMGAMLAGGAAAAAAAYGAHHV---SHGHYGHGGYGHG-------GF-MP 457
           + A  SGH GG+G M+AG    AAAAYGAHHV   SHG YGH  YGHG       G+   
Sbjct: 93  YPAHHSGHAGGIGGMIAG----AAAAYGAHHVAHSSHGPYGHAAYGHGFGHGHGYGYGHG 148

Query: 456 HGKFKHGKFKHGKFGKHGKFKHGKFGKHGLFRKWK 352
           HGKFKHG  KHGKF KHG  KHG FG  G F+KWK
Sbjct: 149 HGKFKHG--KHGKF-KHG--KHGMFG-GGKFKKWK 177

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 611,900,513
Number of Sequences: 1393205
Number of extensions: 15023101
Number of successful extensions: 85953
Number of sequences better than 10.0: 528
Number of HSP's better than 10.0 without gapping: 57887
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 77753
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 29138478756
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MPDL045d07_f AV778779 1 521
2 MFB039f04_f BP036868 1 484
3 MRL018e03_f BP084641 18 522
4 MFBL030a08_f BP042746 170 671




Lotus japonicus
Kazusa DNA Research Institute