KMC020468A_c01
[Fasta Sequence]   [Nr Search]   [EST assemble image]  

Fasta Sequence
>KMC020468A_C01 KMC020468A_c01
caatctaaacTGAATCGTTTATCTTGTCAATGCCAAAAAGGTATCAAATCAAGCATAAGT
TAAGAAAACAAATACAGGGGCCAAGTATGATGGACTGACTTCATACAAAGTATGGTTTAT
TTATAAAGTAATAAGCACAAAGCAGCTGCATTTAAAATAGTCATAGCAGCATATATTCCT
CACTTCCACTTCTTGAACCCACCATGCTTTCCAAACTTTCCATGCTTATGCTTGCCAAAC
TTTCCATGCTTTCCATGCTTGAACTTCCCATGCTGCTTAAATTTTCCATGCCCCATGTGA
CCACCCTGTGCATAGCCACCATGTGCATAACCATGGGCGCCATGAGCGGCATGAGCGCCA
TGAGCGCCATAAGCAGCAGCTGCAGCAGCAGCACCCCCAGCAAGCATTGCACCCATGCCG
CCATGTCCTTGATGCCCACCTGGTGATAACAGCAAATAACTTAGTAGCTTCAAAGGAACA
CATCATAGAAAAGCAGAGAAAAATTTCAACCAGCAAACATTTCAATTAAGGATCAACTAA
GGGGG


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC020468A_C01 KMC020468A_c01
         (545 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_193655.1| putative protein; protein id: At4g19200.1, supp...   104  6e-22
gb|AAH05782.1| Unknown (protein for MGC:12025) [Mus musculus]         102  3e-21
emb|CAB61840.1| putative glycine and proline-rich protein [Sporo...   100  2e-20
ref|NP_568642.1| expressed protein; protein id: At5g45350.1, sup...    94  1e-18
ref|NP_197267.1| glycine/proline-rich protein; protein id: At5g1...    92  4e-18

>ref|NP_193655.1| putative protein; protein id: At4g19200.1, supported by cDNA: 8188.
           [Arabidopsis thaliana] gi|25407569|pir||A85217
           hypothetical protein AT4g19200 [imported] - Arabidopsis
           thaliana gi|7268715|emb|CAB78922.1| putative protein
           [Arabidopsis thaliana] gi|21595622|gb|AAM66118.1|
           unknown [Arabidopsis thaliana]
           gi|24417344|gb|AAN60282.1| unknown [Arabidopsis
           thaliana] gi|27311843|gb|AAO00887.1| Unknown protein
           [Arabidopsis thaliana]
          Length = 179

 Score =  104 bits (260), Expect = 6e-22
 Identities = 60/102 (58%), Positives = 66/102 (63%), Gaps = 14/102 (13%)
 Frame = -1

Query: 446 SPGGHQ-GH--GGMGAMLAGGAAAAAAAYGAHGA-HAAHGAHGYA--HGGYAQG-----G 300
           +PG H  GH  GG+G M+AG A AAAAAYGAH   HA+H  +G+A  HGGY        G
Sbjct: 80  APGAHHSGHSGGGLGGMIAGAAGAAAAAYGAHHVGHASHNPYGHAVGHGGYGHAPAHGFG 139

Query: 299 HMGHGKFKQH---GKFKHGKHGKFGKHKHGKFGKHGGFKKWK 183
           H GHGKFK     GKFKHGKHGK G  KHG FG  G FKKWK
Sbjct: 140 HGGHGKFKHGKHGGKFKHGKHGKHG--KHGMFGGGGKFKKWK 179

>gb|AAH05782.1| Unknown (protein for MGC:12025) [Mus musculus]
          Length = 198

 Score =  102 bits (254), Expect = 3e-21
 Identities = 58/94 (61%), Positives = 60/94 (63%), Gaps = 8/94 (8%)
 Frame = -1

Query: 440 GGHQGHGGMGAMLAGGAAAAAAAYGAHGAHAAHGAHGYAHGGYAQGGHMGHGKFKQHGKF 261
           GG  G G MG MLAGGAAAAAAAYG H  H   G+HG+ HGG    GH G G     GKF
Sbjct: 113 GGSGGMGAMGGMLAGGAAAAAAAYGVH--HLTSGSHGH-HGGGGPLGHFGGG--HHGGKF 167

Query: 260 KHGKHGKFGKHKHGKFGKHGG--------FKKWK 183
           KHGKHGKF   KHGKFGKHGG        FKKWK
Sbjct: 168 KHGKHGKF---KHGKFGKHGGGMFGGGKKFKKWK 198

 Score = 38.1 bits (87), Expect = 0.069
 Identities = 29/89 (32%), Positives = 31/89 (34%), Gaps = 8/89 (8%)
 Frame = -1

Query: 440 GGHQGHGGMGAMLAG--GAAAAAAAYGAHGAHAAHGAHGYAHGGY------AQGGHMGHG 285
           GGH GHG          GA      Y   G    HG +   HGGY       QGG+   G
Sbjct: 25  GGHGGHGYPPGQYPPPPGAYPPQQGYPPQGYPPQHGGYPPQHGGYPPSGYPPQGGYPPSG 84

Query: 284 KFKQHGKFKHGKHGKFGKHKHGKFGKHGG 198
              Q G    G  G  G H  G    H G
Sbjct: 85  YPPQAGYPPGGYPGAHGSHSGGHGSHHAG 113

 Score = 32.0 bits (71), Expect = 5.0
 Identities = 21/65 (32%), Positives = 26/65 (39%), Gaps = 10/65 (15%)
 Frame = -1

Query: 365 AHGAHAAHGAHGYAHGGY--------AQGGHMGHGKFKQHGKF--KHGKHGKFGKHKHGK 216
           AHG    HG HGY  G Y         Q G+   G   QHG +  +HG +   G    G 
Sbjct: 20  AHGLAGGHGGHGYPPGQYPPPPGAYPPQQGYPPQGYPPQHGGYPPQHGGYPPSGYPPQGG 79

Query: 215 FGKHG 201
           +   G
Sbjct: 80  YPPSG 84

>emb|CAB61840.1| putative glycine and proline-rich protein [Sporobolus stapfianus]
          Length = 197

 Score = 99.8 bits (247), Expect = 2e-20
 Identities = 67/108 (62%), Positives = 71/108 (65%), Gaps = 21/108 (19%)
 Frame = -1

Query: 443 PGG-HQG-----HGG--MGAMLAGGAAAAAAAYGAHG-AHAAHGAHGY--AHGGYAQG-- 303
           PGG HQG     HGG  MG +LAGGAAAAAAAYGAH  +H   G HG+   HGGYA G  
Sbjct: 93  PGGSHQGGHSSSHGGGNMG-LLAGGAAAAAAAYGAHKLSHGHSGGHGFPGGHGGYAVGGY 151

Query: 302 --GHMGHGKFKQ----HGKFKHGKHGKF--GKHKHGKFGKHGGFKKWK 183
             G+ GHGKFK     HGKFKHG HGKF  GKH HG FG  G FKKWK
Sbjct: 152 GHGYGGHGKFKHGHGGHGKFKHG-HGKFKHGKHGHGMFG-GGKFKKWK 197

>ref|NP_568642.1| expressed protein; protein id: At5g45350.1, supported by cDNA:
           22538., supported by cDNA: gi_15529251 [Arabidopsis
           thaliana] gi|2129603|pir||S65780 glycine/proline-rich
           protein GPRP - Arabidopsis thaliana
           gi|1465364|emb|CAA59059.1| GPRP [Arabidopsis thaliana]
           gi|9758725|dbj|BAB09163.1| gene_id:MFC19.1~unknown
           protein [Arabidopsis thaliana]
           gi|15529252|gb|AAK97720.1| AT5g45350/MFC19_1
           [Arabidopsis thaliana] gi|16974403|gb|AAL31127.1|
           AT5g45350/MFC19_1 [Arabidopsis thaliana]
           gi|21592344|gb|AAM64295.1| unknown [Arabidopsis
           thaliana]
          Length = 177

 Score = 93.6 bits (231), Expect = 1e-18
 Identities = 51/89 (57%), Positives = 59/89 (65%), Gaps = 2/89 (2%)
 Frame = -1

Query: 443 PGGHQGH-GGMGAMLAGGAAAAAAAYGAHG-AHAAHGAHGYAHGGYAQGGHMGHGKFKQH 270
           P  H GH GG+G M+AG    AAAAYGAH  AH++HG +G+A  G+  G   G+G    H
Sbjct: 94  PAHHSGHAGGIGGMIAG----AAAAYGAHHVAHSSHGPYGHAAYGHGFGHGHGYGYGHGH 149

Query: 269 GKFKHGKHGKFGKHKHGKFGKHGGFKKWK 183
           GKFKHGKHGKF   KHG FG  G FKKWK
Sbjct: 150 GKFKHGKHGKFKHGKHGMFG-GGKFKKWK 177

>ref|NP_197267.1| glycine/proline-rich protein; protein id: At5g17650.1 [Arabidopsis
           thaliana] gi|11357316|pir||T51469 glycine/proline-rich
           protein - Arabidopsis thaliana
           gi|9755790|emb|CAC01909.1| glycine/proline-rich protein
           [Arabidopsis thaliana]
          Length = 173

 Score = 92.0 bits (227), Expect = 4e-18
 Identities = 55/91 (60%), Positives = 60/91 (65%), Gaps = 3/91 (3%)
 Frame = -1

Query: 446 SPGGHQGHGGMGAMLAGGAAAAAAAYGAHGAHAAHGAHGYAHG-GYAQGGHMGHGKFKQH 270
           S  GH  HGG+GA++AGG AAAA   GAH     HG +G+ HG GY  G H GHGKFK H
Sbjct: 94  SHSGHH-HGGIGAIIAGGVAAAA---GAHHMSHHHGHYGHHHGHGYGYGYH-GHGKFK-H 147

Query: 269 GKFKHGKHGKFGKHKHGKFGKHGG--FKKWK 183
           GKFKHGK G     KHG FGKH G  FKKWK
Sbjct: 148 GKFKHGKFG-----KHGMFGKHKGKFFKKWK 173

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 503,883,885
Number of Sequences: 1393205
Number of extensions: 12653752
Number of successful extensions: 111339
Number of sequences better than 10.0: 1754
Number of HSP's better than 10.0 without gapping: 59389
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 90568
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 18660035355
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)


EST assemble image


clone accession position
1 MFBL017e09_f BP042116 1 546
2 MF014b08_f BP028966 11 461




Lotus japonicus
Kazusa DNA Research Institute