Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC005571A_C01 KMC005571A_c01
(569 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value
gb|AAA42066.1| salivary proline-rich protein                           62   7e-09
pir||A39066 proline-rich protein 4 - rat                               61   9e-09
dbj|BAA95888.1| ESTs AU082563(S20379),D15187(C0226), AU082476(C0...    60   1e-08
ref|NP_180518.1| RRM-containing RNA-binding protein, putative; p...    59   6e-08
ref|NP_062603.1| proline-rich protein 15; proline-rich salivary ...    59   6e-08
>gb|AAA42066.1| salivary proline-rich protein
Length = 202
Score = 61.6 bits (148), Expect = 7e-09
Identities = 50/136 (36%), Positives = 60/136 (43%), Gaps = 14/136 (10%)
Frame = -1
Query: 563 SPPGGPS------GSGENKP---GPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQ 411
+PPGGP G+ + P GP+++P Q +P PPPPG Q PP G Q
Sbjct: 70 TPPGGPQQKPPQPGNQQGPPPPGGPQQKPPQP-EKPQGPPPPG---GPQQRPPQPGNQQG 125
Query: 410 PVPP--YQQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP--APAETGTSAS 243
P PP QQ PPQ P PP P Q QPG Q P GP P + G S
Sbjct: 126 PPPPGGPQQKPPQPE---KPQGPPPPGGPQQKPPQPGKPQGPPPPGGPQQRPPQPGNQQS 182
Query: 242 GSQ-QQ*NRPYNNVSS 198
Q Q +RP + S
Sbjct: 183 PPQGPQLDRPQGSFQS 198
Score = 58.2 bits (139), Expect = 7e-08
Identities = 36/98 (36%), Positives = 41/98 (41%), Gaps = 2/98 (2%)
Frame = -1
Query: 560 PPGGPSGSGENKPGPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPP--YQQY 387
PP P+ + P P+ P Q P+P P P Q PP G Q P PP QQ
Sbjct: 38 PPRPPANGSQQGPPPQGGPQQKPPQPGKPQGPTPPGGPQQKPPQPGNQQGPPPPGGPQQK 97
Query: 386 PPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
PPQ P PP P Q QPG+ Q P GP
Sbjct: 98 PPQPE---KPQGPPPPGGPQQRPPQPGNQQGPPPPGGP 132
Score = 44.3 bits (103), Expect = 0.001
Identities = 38/103 (36%), Positives = 41/103 (38%), Gaps = 5/103 (4%)
Frame = -1
Query: 566 DSPPGGPSGSGENKPGPEKQPMQHYPRP---MMPPPPGQYHHHQYYPPYGGYMQQPVPP- 399
+ P G P G P Q P+P PPPPG Q PP Q P PP
Sbjct: 102 EKPQGPPPPGG---------PQQRPPQPGNQQGPPPPG---GPQQKPPQPEKPQGPPPPG 149
Query: 398 -YQQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
QQ PPQ P PP P Q QPG+ Q S P GP
Sbjct: 150 GPQQKPPQPG---KPQGPPPPGGPQQRPPQPGNQQ--SPPQGP 187
Score = 40.8 bits (94), Expect = 0.012
Identities = 26/81 (32%), Positives = 31/81 (38%)
Frame = -1
Query: 515 EKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPPYQQYPPQYNAVVAPSQPPAAN 336
++ P Q P P PP P Q PP GG Q+P P + P P
Sbjct: 25 DQTPNQKPPPPGFPPRPPANGSQQGPPPQGGPQQKPPQPGK-----------PQGPTPPG 73
Query: 335 HPYQHSMQPGSSQTGSAPAGP 273
P Q QPG+ Q P GP
Sbjct: 74 GPQQKPPQPGNQQGPPPPGGP 94
Score = 36.2 bits (82), Expect = 0.30
Identities = 26/78 (33%), Positives = 32/78 (40%), Gaps = 17/78 (21%)
Frame = -1
Query: 560 PPGGPSGSGENKPGPEKQ--------PMQHYP---RPMMPPPPGQYHH------HQYYPP 432
PPGGP + P PEK P Q P +P PPPPG +Q PP
Sbjct: 128 PPGGPQ---QKPPQPEKPQGPPPPGGPQQKPPQPGKPQGPPPPGGPQQRPPQPGNQQSPP 184
Query: 431 YGGYMQQPVPPYQQYPPQ 378
G + +P +Q PQ
Sbjct: 185 QGPQLDRPQGSFQSLGPQ 202
>pir||A39066 proline-rich protein 4 - rat
Length = 204
Score = 61.2 bits (147), Expect = 9e-09
Identities = 50/136 (36%), Positives = 60/136 (43%), Gaps = 14/136 (10%)
Frame = -1
Query: 563 SPPGGPS------GSGENKP---GPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQ 411
+PPGGP G+ + P GP+++P Q +P PPPPG Q PP G Q
Sbjct: 72 TPPGGPQQKPPQPGNQQGPPPPGGPQQKPPQP-GKPQGPPPPG---GPQQRPPQPGNQQG 127
Query: 410 PVPP--YQQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP--APAETGTSAS 243
P PP QQ PPQ P PP P Q QPG Q P GP P + G S
Sbjct: 128 PPPPGGPQQKPPQPG---KPQGPPPPGGPQQKPPQPGKPQGPPPPGGPQQRPPQPGNQQS 184
Query: 242 GSQ-QQ*NRPYNNVSS 198
Q Q +RP + S
Sbjct: 185 PPQGPQLDRPQGSFQS 200
Score = 58.2 bits (139), Expect = 7e-08
Identities = 36/98 (36%), Positives = 41/98 (41%), Gaps = 2/98 (2%)
Frame = -1
Query: 560 PPGGPSGSGENKPGPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPP--YQQY 387
PP P+ + P P+ P Q P+P P P Q PP G Q P PP QQ
Sbjct: 40 PPRPPANGSQQGPPPQGGPQQKPPQPGKPQGPTPPGGPQQKPPQPGNQQGPPPPGGPQQK 99
Query: 386 PPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
PPQ P PP P Q QPG+ Q P GP
Sbjct: 100 PPQPG---KPQGPPPPGGPQQRPPQPGNQQGPPPPGGP 134
Score = 43.9 bits (102), Expect = 0.001
Identities = 35/101 (34%), Positives = 37/101 (35%), Gaps = 16/101 (15%)
Frame = -1
Query: 527 KPGPEKQ-----PMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPP-----------Y 396
+PG E Q P Q P P PP P Q PP GG Q+P P
Sbjct: 18 EPGDELQILDQTPNQKPPPPGFPPRPPANGSQQGPPPQGGPQQKPPQPGKPQGPTPPGGP 77
Query: 395 QQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
QQ PPQ PP P Q QPG Q P GP
Sbjct: 78 QQKPPQPG---NQQGPPPPGGPQQKPPQPGKPQGPPPPGGP 115
Score = 37.0 bits (84), Expect = 0.17
Identities = 25/76 (32%), Positives = 34/76 (43%), Gaps = 15/76 (19%)
Frame = -1
Query: 560 PPGGPS------GSGENKP---GPEKQPMQHYPRPMMPPPPGQYHH------HQYYPPYG 426
PPGGP G + P GP+++P Q +P PPPPG +Q PP G
Sbjct: 130 PPGGPQQKPPQPGKPQGPPPPGGPQQKPPQP-GKPQGPPPPGGPQQRPPQPGNQQSPPQG 188
Query: 425 GYMQQPVPPYQQYPPQ 378
+ +P +Q PQ
Sbjct: 189 PQLDRPQGSFQSLGPQ 204
>dbj|BAA95888.1| ESTs AU082563(S20379),D15187(C0226),
AU082476(C0226),AU082563(S20379) correspond to a region
of the predicted gene.~Similar to Arabidopsis thaliana
chromosome 2 BAC F16P2; putative RNA-binding protein.
(AC004561) [Oryza sativa (japonica cultivar-group)]
Length = 482
Score = 60.5 bits (145), Expect = 1e-08
Identities = 39/104 (37%), Positives = 50/104 (47%), Gaps = 8/104 (7%)
Frame = -1
Query: 545 SGSGENKPGPEK--QPMQHYPRPMMPPPPGQYHHHQY---YPPYGGYM---QQPVPPYQQ 390
S G+ KPGP++ Q P P QY+H QY YPPYGGYM + P PP Q
Sbjct: 382 SQEGDGKPGPQQAAQAQASSSSGQSYPMPPQYYHGQYPPYYPPYGGYMPPPRMPYPPPPQ 441
Query: 389 YPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGPAPAET 258
YPP + P+Q A++ S QP + A P P +T
Sbjct: 442 YPPYQPMLATPAQSQASS-----SQQPAPATLHQAQV-PPPQQT 479
>ref|NP_180518.1| RRM-containing RNA-binding protein, putative; protein id:
At2g29580.1, supported by cDNA: gi_16226862 [Arabidopsis
thaliana] gi|25408035|pir||A84698 probable RNA-binding
protein [imported] - Arabidopsis thaliana
gi|3980378|gb|AAC95181.1| putative RNA-binding protein
[Arabidopsis thaliana]
gi|16226863|gb|AAL16284.1|AF428354_1 At2g29580/F16P2.4
[Arabidopsis thaliana] gi|27363236|gb|AAO11537.1|
At2g29580/F16P2.4 [Arabidopsis thaliana]
Length = 483
Score = 58.5 bits (140), Expect = 6e-08
Identities = 37/87 (42%), Positives = 42/87 (47%), Gaps = 5/87 (5%)
Frame = -1
Query: 473 PPPGQYHHHQYY-PP-YGGYMQQPVPPYQQYPPQYNAVVAPSQPPAANHPYQHSMQPGSS 300
PP G Y HQ Y PP YGGYMQ PPYQQYPP ++ A+H Y PGS
Sbjct: 393 PPHGHYPQHQPYPPPSYGGYMQ---PPYQQYPPYHH-----GHSQQADHDYPQQPGPGSR 444
Query: 299 QTGSAP---AGPAPAETGTSASGSQQQ 228
P + P P + SGS QQ
Sbjct: 445 PNPPHPSSVSAPPPDSVSAAPSGSSQQ 471
Score = 33.1 bits (74), Expect = 2.5
Identities = 24/90 (26%), Positives = 39/90 (42%), Gaps = 6/90 (6%)
Frame = -1
Query: 488 RPMMPPPPGQYHHHQYYPPYGGYMQQPVPPYQQYPP----QYNAVVAPSQPPAANHPYQH 321
RP +P P + Q + G + + V QQ P QY P QPP + P+
Sbjct: 300 RPQVPKPDQDGSNQQGSVAHSGLLPRAVISQQQNQPPPMLQYYMHPPPPQPPHQDRPFYP 359
Query: 320 SMQPG--SSQTGSAPAGPAPAETGTSASGS 237
SM P + + S +G + ++ ++S S
Sbjct: 360 SMDPQRMGAVSSSKESGSSTSDNRGASSSS 389
>ref|NP_062603.1| proline-rich protein 15; proline-rich salivary protein;
proline-rich protein B, salivary [Mus musculus]
gi|91204|pir||A29149 proline-rich protein - mouse
gi|200539|gb|AAA40000.1| 15-kDa proline-rich salivary
protein
Length = 147
Score = 58.5 bits (140), Expect = 6e-08
Identities = 40/101 (39%), Positives = 45/101 (43%), Gaps = 5/101 (4%)
Frame = -1
Query: 560 PPGGPSGSGENKPGPEKQPMQHYP---RPMMPPPPGQYHHHQYYPPYGGYMQQPVPP--Y 396
PP P+ + P P+ P Q P +P PPPPG Q PP G Q P PP
Sbjct: 40 PPRPPANGSQQGPPPQGGPQQKPPQPGKPQGPPPPG---GPQQKPPQPGNQQGPPPPGGP 96
Query: 395 QQYPPQYNAVVAPSQPPAANHPYQHSMQPGSSQTGSAPAGP 273
QQ PPQ P PP P Q QPG+ Q S P GP
Sbjct: 97 QQKPPQSG---KPQGPPPPGGPQQRPPQPGNQQ--SPPQGP 132
Score = 47.0 bits (110), Expect = 2e-04
Identities = 36/120 (30%), Positives = 47/120 (39%), Gaps = 7/120 (5%)
Frame = -1
Query: 539 SGENKPGPEKQPMQHYPRPMMPPPPGQYHHHQYYPPYGGYMQQPVPPYQQYPPQYNAVVA 360
+G+ ++ P Q P P PP P Q PP GG Q+P P +
Sbjct: 19 AGDELQSLDQTPNQKPPPPGFPPRPPANGSQQGPPPQGGPQQKPPQPGK----------- 67
Query: 359 PSQPPAANHPYQHSMQPGSSQTGSAPAGP--APAETG-----TSASGSQQQ*NRPYNNVS 201
P PP P Q QPG+ Q P GP P ++G G QQ+ +P N S
Sbjct: 68 PQGPPPPGGPQQKPPQPGNQQGPPPPGGPQQKPPQSGKPQGPPPPGGPQQRPPQPGNQQS 127
Score = 42.7 bits (99), Expect = 0.003
Identities = 32/81 (39%), Positives = 34/81 (41%), Gaps = 5/81 (6%)
Frame = -1
Query: 557 PGGPSGSGENKPGPEKQPMQHYPRP---MMPPPPGQYHHHQYYPPYGGYMQQPVPP--YQ 393
PG P G P P P Q P+P PPPPG Q PP G Q P PP Q
Sbjct: 65 PGKPQG-----PPPPGGPQQKPPQPGNQQGPPPPG---GPQQKPPQSGKPQGPPPPGGPQ 116
Query: 392 QYPPQYNAVVAPSQPPAANHP 330
Q PPQ +P Q P P
Sbjct: 117 QRPPQPGNQQSPPQGPQFGRP 137
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 549,271,650
Number of Sequences: 1393205
Number of extensions: 14631059
Number of successful extensions: 108986
Number of sequences better than 10.0: 4043
Number of HSP's better than 10.0 without gapping: 62640
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 88258
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 20956655091
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
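
The gapped Lambda and K values reported above are the Karlin-Altschul parameters that relate a raw alignment score (the number in parentheses on each "Score =" line) to its bit score. The following is a minimal sketch, assuming the standard conversion bits = (Lambda * S - ln K) / ln 2. The function name bit_score and the constant names are hypothetical and are not part of BLAST. Because the report prints Lambda and K rounded, results may differ from the reported values in the last digit.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the report trailer
# (rounded values, so results are approximate).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to a bit score.

    Assumes the standard formula: (lambda * S - ln K) / ln 2.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the "Score = ... bits (N)" lines of the report.
    for raw in (148, 147, 139, 82):
        print(raw, round(bit_score(raw), 1))
```

For the first HSP (raw score 148) this gives about 61.6 bits, in agreement with the "Score = 61.6 bits (148)" line of the report.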