KMC005571A_c01

Fasta Sequence
>KMC005571A_C01 KMC005571A_c01
aaaactgaagctgagcctgaatatgatctaaccaagaagctaagagttaCAAAAGCACAA
AATCAATCATAAAATAACTGGTTCAAGCCACAAAGCAGCAATCAATAATATCCAGCTAGT
TGAGAGTAAATTAACTGCTTGAACCAAAACAGCATGGGGTCAATTCAAGGAAAAACCTTG
CTGACTGATTAAACATAACTAGAGACATTATTATATGGGCGATTTCATTGCTGCTGAGAC
CCTGACGCTGATGTTCCAGTTTCGGCTGGTGCAGGCCCAGCAGGTGCAGACCCCGTCTGT
GAAGACCCTGGTTGCATAGAATGCTGATAGGGATGATTTGCAGCTGGAGGCTGGGATGGA
GCCACGACAGCATTGTATTGTGGTGGATACTGTTGATAAGGGGGAACTGGTTGTTGCATA
TAACCACCATATGGAGGATAATACTGATGATGATGATATTGGCCAGGTGGAGGCGGCATC
ATTGGACGGGGATAATGTTGCATTGGCTGCTTCTCTGGCCCGGGTTTGTTCTCCCCTGAG
CCAGAAGGTCCGCCAGGAGGACTGTCTTG
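
The BLASTX hits reported below for this contig are all in frame -1, that is, on the reverse strand read from the last base of the query. As a minimal sketch, assuming the standard genetic code (the function names here are illustrative and not part of any Kazusa or NCBI tool), the final line of the sequence above can be reverse-complemented and translated to recover the start of that reading frame:

```python
# Standard genetic code with codons enumerated in T, C, A, G order.
# Assumption: the search used the standard code (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq.upper()))


def translate(seq):
    """Translate complete codons from position 0; trailing bases are ignored."""
    seq = seq.upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# Last 29 bases of the contig (positions 541-569), copied from the record above.
tail = "CCAGAAGGTCCGCCAGGAGGACTGTCTTG"
frame_minus_1 = translate(reverse_complement(tail))
print(frame_minus_1)
```

The first codon covers query positions 569-567, so the second residue onward corresponds to the alignments that start at query position 566.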


Nr search

BLASTX 2.2.2 [Dec-14-2001]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= KMC005571A_C01 KMC005571A_c01
         (569 letters)

Database: nr 
           1,393,205 sequences; 448,689,247 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAA42066.1| salivary proline-rich protein                           62  7e-09
pir||A39066 proline-rich protein 4 - rat                               61  9e-09
dbj|BAA95888.1| ESTs AU082563(S20379),D15187(C0226), AU082476(C0...    60  1e-08
ref|NP_180518.1| RRM-containing RNA-binding protein, putative; p...    59  6e-08
ref|NP_062603.1| proline-rich protein 15; proline-rich salivary ...    59  6e-08

>gb|AAA42066.1| salivary proline-rich protein
          Length = 202

 Score = 61.6 bits (148), Expect = 7e-09
 Identities = 50/136 (36%), Positives = 60/136 (43%), Gaps = 14/136 (10%)
 Frame = -1

Query: 563 SPPGGPS------GSGENKP---GPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQ 411
           +PPGGP       G+ +  P   GP+++P Q   +P  PPPPG     Q  PP  G  Q 
Sbjct: 70  TPPGGPQQKPPQPGNQQGPPPPGGPQQKPPQP-EKPQGPPPPG---GPQQRPPQPGNQQG 125

Query: 410 PVPP--YQQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP--APAETGTSAS 243
           P PP   QQ PPQ      P  PP    P Q   QPG  Q    P GP   P + G   S
Sbjct: 126 PPPPGGPQQKPPQPE---KPQGPPPPGGPQQKPPQPGKPQGPPPPGGPQQRPPQPGNQQS 182

Query: 242 GSQ-QQ*NRPYNNVSS 198
             Q  Q +RP  +  S
Sbjct: 183 PPQGPQLDRPQGSFQS 198

 Score = 58.2 bits (139), Expect = 7e-08
 Identities = 36/98 (36%), Positives = 41/98 (41%), Gaps = 2/98 (2%)
 Frame = -1

Query: 560 PPGGPSGSGENKPGPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPP--YQQY 387
           PP  P+   +  P P+  P Q  P+P  P  P      Q  PP  G  Q P PP   QQ 
Sbjct: 38  PPRPPANGSQQGPPPQGGPQQKPPQPGKPQGPTPPGGPQQKPPQPGNQQGPPPPGGPQQK 97

Query: 386 PPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
           PPQ      P  PP    P Q   QPG+ Q    P GP
Sbjct: 98  PPQPE---KPQGPPPPGGPQQRPPQPGNQQGPPPPGGP 132

 Score = 44.3 bits (103), Expect = 0.001
 Identities = 38/103 (36%), Positives = 41/103 (38%), Gaps = 5/103 (4%)
 Frame = -1

Query: 566 DSPPGGPSGSGENKPGPEKQPMQHYPRP---MMPPPPGQYHHHQYYPPYGGYMQQPVPP- 399
           + P G P   G         P Q  P+P     PPPPG     Q  PP     Q P PP 
Sbjct: 102 EKPQGPPPPGG---------PQQRPPQPGNQQGPPPPG---GPQQKPPQPEKPQGPPPPG 149

Query: 398 -YQQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
             QQ PPQ      P  PP    P Q   QPG+ Q  S P GP
Sbjct: 150 GPQQKPPQPG---KPQGPPPPGGPQQRPPQPGNQQ--SPPQGP 187

 Score = 40.8 bits (94), Expect = 0.012
 Identities = 26/81 (32%), Positives = 31/81 (38%)
 Frame = -1

Query: 515 EKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPPYQQYPPQYNAVVAPSQPPAAN 336
           ++ P Q  P P  PP P      Q  PP GG  Q+P  P +           P  P    
Sbjct: 25  DQTPNQKPPPPGFPPRPPANGSQQGPPPQGGPQQKPPQPGK-----------PQGPTPPG 73

Query: 335 HPYQHSMQPGSSQTGSAPAGP 273
            P Q   QPG+ Q    P GP
Sbjct: 74  GPQQKPPQPGNQQGPPPPGGP 94

 Score = 36.2 bits (82), Expect = 0.30
 Identities = 26/78 (33%), Positives = 32/78 (40%), Gaps = 17/78 (21%)
 Frame = -1

Query: 560 PPGGPSGSGENKPGPEKQ--------PMQHYP---RPMMPPPPGQYHH------HQYYPP 432
           PPGGP    +  P PEK         P Q  P   +P  PPPPG          +Q  PP
Sbjct: 128 PPGGPQ---QKPPQPEKPQGPPPPGGPQQKPPQPGKPQGPPPPGGPQQRPPQPGNQQSPP 184

Query: 431 YGGYMQQPVPPYQQYPPQ 378
            G  + +P   +Q   PQ
Sbjct: 185 QGPQLDRPQGSFQSLGPQ 202

>pir||A39066 proline-rich protein 4 - rat
          Length = 204

 Score = 61.2 bits (147), Expect = 9e-09
 Identities = 50/136 (36%), Positives = 60/136 (43%), Gaps = 14/136 (10%)
 Frame = -1

Query: 563 SPPGGPS------GSGENKP---GPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQ 411
           +PPGGP       G+ +  P   GP+++P Q   +P  PPPPG     Q  PP  G  Q 
Sbjct: 72  TPPGGPQQKPPQPGNQQGPPPPGGPQQKPPQP-GKPQGPPPPG---GPQQRPPQPGNQQG 127

Query: 410 PVPP--YQQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP--APAETGTSAS 243
           P PP   QQ PPQ      P  PP    P Q   QPG  Q    P GP   P + G   S
Sbjct: 128 PPPPGGPQQKPPQPG---KPQGPPPPGGPQQKPPQPGKPQGPPPPGGPQQRPPQPGNQQS 184

Query: 242 GSQ-QQ*NRPYNNVSS 198
             Q  Q +RP  +  S
Sbjct: 185 PPQGPQLDRPQGSFQS 200

 Score = 58.2 bits (139), Expect = 7e-08
 Identities = 36/98 (36%), Positives = 41/98 (41%), Gaps = 2/98 (2%)
 Frame = -1

Query: 560 PPGGPSGSGENKPGPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPP--YQQY 387
           PP  P+   +  P P+  P Q  P+P  P  P      Q  PP  G  Q P PP   QQ 
Sbjct: 40  PPRPPANGSQQGPPPQGGPQQKPPQPGKPQGPTPPGGPQQKPPQPGNQQGPPPPGGPQQK 99

Query: 386 PPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
           PPQ      P  PP    P Q   QPG+ Q    P GP
Sbjct: 100 PPQPG---KPQGPPPPGGPQQRPPQPGNQQGPPPPGGP 134

 Score = 43.9 bits (102), Expect = 0.001
 Identities = 35/101 (34%), Positives = 37/101 (35%), Gaps = 16/101 (15%)
 Frame = -1

Query: 527 KPGPEKQ-----PMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPP-----------Y 396
           +PG E Q     P Q  P P  PP P      Q  PP GG  Q+P  P            
Sbjct: 18  EPGDELQILDQTPNQKPPPPGFPPRPPANGSQQGPPPQGGPQQKPPQPGKPQGPTPPGGP 77

Query: 395 QQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
           QQ PPQ         PP    P Q   QPG  Q    P GP
Sbjct: 78  QQKPPQPG---NQQGPPPPGGPQQKPPQPGKPQGPPPPGGP 115

 Score = 37.0 bits (84), Expect = 0.17
 Identities = 25/76 (32%), Positives = 34/76 (43%), Gaps = 15/76 (19%)
 Frame = -1

Query: 560 PPGGPS------GSGENKP---GPEKQPMQHYPRPMMPPPPGQYHH------HQYYPPYG 426
           PPGGP       G  +  P   GP+++P Q   +P  PPPPG          +Q  PP G
Sbjct: 130 PPGGPQQKPPQPGKPQGPPPPGGPQQKPPQP-GKPQGPPPPGGPQQRPPQPGNQQSPPQG 188

Query: 425 GYMQQPVPPYQQYPPQ 378
             + +P   +Q   PQ
Sbjct: 189 PQLDRPQGSFQSLGPQ 204

>dbj|BAA95888.1| ESTs AU082563(S20379),D15187(C0226),
           AU082476(C0226),AU082563(S20379) correspond to a region
           of the predicted gene.~Similar to Arabidopsis thaliana
           chromosome 2 BAC F16P2; putative RNA-binding protein.
           (AC004561) [Oryza sativa (japonica cultivar-group)]
          Length = 482

 Score = 60.5 bits (145), Expect = 1e-08
 Identities = 39/104 (37%), Positives = 50/104 (47%), Gaps = 8/104 (7%)
 Frame = -1

Query: 545 SGSGENKPGPEK--QPMQHYPRPMMPPPPGQYHHHQY---YPPYGGYM---QQPVPPYQQ 390
           S  G+ KPGP++  Q           P P QY+H QY   YPPYGGYM   + P PP  Q
Sbjct: 382 SQEGDGKPGPQQAAQAQASSSSGQSYPMPPQYYHGQYPPYYPPYGGYMPPPRMPYPPPPQ 441

Query: 389 YPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGPAPAET 258
           YPP    +  P+Q  A++     S QP  +    A   P P +T
Sbjct: 442 YPPYQPMLATPAQSQASS-----SQQPAPATLHQAQV-PPPQQT 479

>ref|NP_180518.1| RRM-containing RNA-binding protein, putative; protein id:
           At2g29580.1, supported by cDNA: gi_16226862 [Arabidopsis
           thaliana] gi|25408035|pir||A84698 probable RNA-binding
           protein [imported] - Arabidopsis thaliana
           gi|3980378|gb|AAC95181.1| putative RNA-binding protein
           [Arabidopsis thaliana]
           gi|16226863|gb|AAL16284.1|AF428354_1 At2g29580/F16P2.4
           [Arabidopsis thaliana] gi|27363236|gb|AAO11537.1|
           At2g29580/F16P2.4 [Arabidopsis thaliana]
          Length = 483

 Score = 58.5 bits (140), Expect = 6e-08
 Identities = 37/87 (42%), Positives = 42/87 (47%), Gaps = 5/87 (5%)
 Frame = -1

Query: 473 PPPGQYHHHQYY-PP-YGGYMQQPVPPYQQYPPQYNAVVAPSQPPAANHPYQHSMQPGSS 300
           PP G Y  HQ Y PP YGGYMQ   PPYQQYPP ++          A+H Y     PGS 
Sbjct: 393 PPHGHYPQHQPYPPPSYGGYMQ---PPYQQYPPYHH-----GHSQQADHDYPQQPGPGSR 444

Query: 299 QTGSAP---AGPAPAETGTSASGSQQQ 228
                P   + P P     + SGS QQ
Sbjct: 445 PNPPHPSSVSAPPPDSVSAAPSGSSQQ 471

 Score = 33.1 bits (74), Expect = 2.5
 Identities = 24/90 (26%), Positives = 39/90 (42%), Gaps = 6/90 (6%)
 Frame = -1

Query: 488 RPMMPPPPGQYHHHQYYPPYGGYMQQPVPPYQQYPP----QYNAVVAPSQPPAANHPYQH 321
           RP +P P     + Q    + G + + V   QQ  P    QY     P QPP  + P+  
Sbjct: 300 RPQVPKPDQDGSNQQGSVAHSGLLPRAVISQQQNQPPPMLQYYMHPPPPQPPHQDRPFYP 359

Query: 320 SMQPG--SSQTGSAPAGPAPAETGTSASGS 237
           SM P    + + S  +G + ++   ++S S
Sbjct: 360 SMDPQRMGAVSSSKESGSSTSDNRGASSSS 389

>ref|NP_062603.1| proline-rich protein 15; proline-rich salivary protein;
           proline-rich protein B, salivary [Mus musculus]
           gi|91204|pir||A29149 proline-rich protein - mouse
           gi|200539|gb|AAA40000.1| 15-kDa proline-rich salivary
           protein
          Length = 147

 Score = 58.5 bits (140), Expect = 6e-08
 Identities = 40/101 (39%), Positives = 45/101 (43%), Gaps = 5/101 (4%)
 Frame = -1

Query: 560 PPGGPSGSGENKPGPEKQPMQHYP---RPMMPPPPGQYHHHQYYPPYGGYMQQPVPP--Y 396
           PP  P+   +  P P+  P Q  P   +P  PPPPG     Q  PP  G  Q P PP   
Sbjct: 40  PPRPPANGSQQGPPPQGGPQQKPPQPGKPQGPPPPG---GPQQKPPQPGNQQGPPPPGGP 96

Query: 395 QQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
           QQ PPQ      P  PP    P Q   QPG+ Q  S P GP
Sbjct: 97  QQKPPQSG---KPQGPPPPGGPQQRPPQPGNQQ--SPPQGP 132

 Score = 47.0 bits (110), Expect = 2e-04
 Identities = 36/120 (30%), Positives = 47/120 (39%), Gaps = 7/120 (5%)
 Frame = -1

Query: 539 SGENKPGPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPPYQQYPPQYNAVVA 360
           +G+     ++ P Q  P P  PP P      Q  PP GG  Q+P  P +           
Sbjct: 19  AGDELQSLDQTPNQKPPPPGFPPRPPANGSQQGPPPQGGPQQKPPQPGK----------- 67

Query: 359 PSQPPAANHPYQHSMQPGSSQTGSAPAGP--APAETG-----TSASGSQQQ*NRPYNNVS 201
           P  PP    P Q   QPG+ Q    P GP   P ++G         G QQ+  +P N  S
Sbjct: 68  PQGPPPPGGPQQKPPQPGNQQGPPPPGGPQQKPPQSGKPQGPPPPGGPQQRPPQPGNQQS 127

 Score = 42.7 bits (99), Expect = 0.003
 Identities = 32/81 (39%), Positives = 34/81 (41%), Gaps = 5/81 (6%)
 Frame = -1

Query: 557 PGGPSGSGENKPGPEKQPMQHYPRP---MMPPPPGQYHHHQYYPPYGGYMQQPVPP--YQ 393
           PG P G     P P   P Q  P+P     PPPPG     Q  PP  G  Q P PP   Q
Sbjct: 65  PGKPQG-----PPPPGGPQQKPPQPGNQQGPPPPG---GPQQKPPQSGKPQGPPPPGGPQ 116

Query: 392 QYPPQYNAVVAPSQPPAANHP 330
           Q PPQ     +P Q P    P
Sbjct: 117 QRPPQPGNQQSPPQGPQFGRP 137

  Database: nr
    Posted date:  Apr 1, 2003  2:05 AM
  Number of letters in database: 448,689,247
  Number of sequences in database:  1,393,205
  
Lambda     K      H
   0.318    0.135    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 549,271,650
Number of Sequences: 1393205
Number of extensions: 14631059
Number of successful extensions: 108986
Number of sequences better than 10.0: 4043
Number of HSP's better than 10.0 without gapping: 62640
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 88258
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20956655091
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
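
The bit scores in the alignments above can be related to the raw scores shown in parentheses through the gapped Lambda and K values reported in this block. The following is a small sketch of the usual Karlin-Altschul normalization, S' = (lambda * S - ln K) / ln 2; because the report rounds Lambda and K, the results should only be expected to agree with the printed bit scores to within rounding.

```python
import math

# Gapped statistical parameters copied from the report above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the first three HSPs of gb|AAA42066.1|.
for raw in (148, 139, 103):
    print(raw, round(bit_score(raw), 1))
```

For the three raw scores used here, the report lists 61.6, 58.2 and 44.3 bits respectively.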


EST assemble image


no.  clone         accession  start  end
1    MFB044f06_f   BP037233       1  392
2    MWL047e12_f   AV769372      50  525
3    MPDL037c05_f  AV778358      56  569
4    MWM066h04_f   AV765775      61  508
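
Since the assembly image itself is not reproduced here, the clone positions can be turned into a per-base coverage depth instead. This is a sketch only, assuming the two position values are 1-based inclusive start and end coordinates on the 569-letter contig; the variable and function names are illustrative.

```python
CONTIG_LENGTH = 569  # query length reported by BLASTX above

# clone name -> (start, end), assumed 1-based inclusive contig coordinates
CLONES = {
    "MFB044f06_f": (1, 392),
    "MWL047e12_f": (50, 525),
    "MPDL037c05_f": (56, 569),
    "MWM066h04_f": (61, 508),
}


def coverage_depth(clones, length):
    """Return a list with the number of clones covering each contig position."""
    depth = [0] * length
    for start, end in clones.values():
        for pos in range(start, end + 1):
            depth[pos - 1] += 1
    return depth


depth = coverage_depth(CLONES, CONTIG_LENGTH)
print("min depth:", min(depth), "max depth:", max(depth))
```

Under that assumption every position is covered by at least one clone, and positions 61 to 392 are covered by all four.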




Lotus japonicus
Kazusa DNA Research Institute