Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC001839A_C01 KCC001839A_c01
(515 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_862417.1| putative proline-rich extensin-like protein [Mi... 58 6e-08
ref|NP_742062.1| proline-rich proteoglycan 2 [Rattus norvegicus]... 57 1e-07
ref|NP_492875.2| pre-mRNA splicing SR protein related (68.2 kD) ... 56 2e-07
pir||A24264 proline-rich protein MP2 - mouse (fragment) 56 3e-07
ref|ZP_00057775.1| COG1205: Distinct helicase family with a uniq... 55 4e-07
>ref|NP_862417.1| putative proline-rich extensin-like protein [Micrococcus sp. 28]
gi|18025405|gb|AAK62513.1| putative proline-rich
extensin-like protein [Micrococcus sp. 28]
Length = 249
Score = 58.2 bits (139), Expect = 6e-08
Identities = 54/177 (30%), Positives = 67/177 (37%), Gaps = 20/177 (11%)
Frame = -2
Query: 514 RCPQRHPQCRRSCS------------PPTGPRRCSPAHQT----QCSRWRGRRPQGQARP 383
+ P P RR C+ P +GP R S A + P +R
Sbjct: 81 KSPPAGPARRRGCALRAVRAAWSQPVPSSGPSRSSGAGALGPGPSAATSAPAAPDSPSRC 140
Query: 382 APRSSWQPPQPRGQR----RQPPRGHRGQPRRSRPSGSSHRSAGW*YRP*RPRPWPSRRR 215
PR+ W+PP R R PP+ G+ R +RP HR W RP R RP P+RR
Sbjct: 141 RPRAHWRPPHARPPSPLVPRWPPQRALGRTRDARP---GHRCPTWAARPSR-RPRPARRA 196
Query: 214 PRSPQWSPAHQRGQQTPAAGRSWRPGRPRESSARRSWPEYAAERRPSRPTGPDGPRG 44
P +P AH GQ P A + WP R P GP PRG
Sbjct: 197 PATPPPPCAH--GQPPP--------------PAVQPWPRGGQSRLPPPRDGPGRPRG 237
Score = 37.0 bits (84), Expect = 0.15
Identities = 44/136 (32%), Positives = 52/136 (37%), Gaps = 5/136 (3%)
Frame = +2
Query: 23 CSLLTLLA--SRPVRPGRP*RSSLSSVLRPRPPRRRLSRPAWSPAPSCCRRLLPALMSWT 196
C+L + A S+PV P RSS + L P P S PA +PS CR W
Sbjct: 93 CALRAVRAAWSQPVPSSGPSRSSGAGALGPGPSAAT-SAPAAPDSPSRCR----PRAHWR 147
Query: 197 PLWRAWSAP*RPRARSSRPVSP---PCRAV*RS*RTATTWLPPMPARWLTPLAPWLWRLP 367
P P AR P+ P P RA+ R+ AR W R
Sbjct: 148 P----------PHARPPSPLVPRWPPQRALGRT----------RDARPGHRCPTWAARPS 187
Query: 368 RRPWSWPRLPLRPPPP 415
RRP R P PPPP
Sbjct: 188 RRPRPARRAPATPPPP 203
Score = 36.6 bits (83), Expect = 0.19
Identities = 31/93 (33%), Positives = 37/93 (39%)
Frame = -2
Query: 457 RRCSPAHQTQCSRWRGRRPQGQARPAPRSSWQPPQPRGQRRQPPRGHRGQPRRSRPSGSS 278
R P H+ C W R P + RPA R+ PP P + PP + PR G S
Sbjct: 171 RDARPGHR--CPTWAAR-PSRRPRPARRAPATPPPPCAHGQPPPPAVQPWPR----GGQS 223
Query: 277 HRSAGW*YRP*RPRPWPSRRRPRSPQWSPAHQR 179
R PR P R R R +PAH R
Sbjct: 224 --------RLPPPRDGPGRPRGRLAAPTPAHAR 248
>ref|NP_742062.1| proline-rich proteoglycan 2 [Rattus norvegicus]
gi|1083764|pir||B48013 proline-rich proteoglycan 2
precursor, parotid - rat gi|310200|gb|AAA03074.1|
proline-rich proteoglycan
Length = 295
Score = 57.0 bits (136), Expect = 1e-07
Identities = 51/161 (31%), Positives = 67/161 (40%), Gaps = 8/161 (4%)
Frame = -2
Query: 508 PQRHPQ--CRRSCSPPTGPRRCSPAHQTQCSRWRGRRPQG--QARPAPRSSWQPPQPRGQ 341
PQR PQ + PP GP++ P +G PQG Q P P S PP P G
Sbjct: 106 PQRPPQPGSPQGPPPPGGPQQRPP---------QGPPPQGGPQRPPQPGSPQGPPPPGGP 156
Query: 340 RRQPPRG--HRGQPRRSRPSGSSHRSAGW*YRP*RPRPWPSRRRPRSPQWSPAHQRGQQT 167
+++PP+G +G P+R GS P P P P + R+PQ P Q+
Sbjct: 157 QQRPPQGPPPQGGPQRPPQPGS----------PQGPPP-PGGPQQRAPQGPPPQGGPQRP 205
Query: 166 PAAGRSWRPGRPRESSAR--RSWPEYAAERRPSRPTGPDGP 50
P G P P R + P +RP +P P GP
Sbjct: 206 PQPGSPQGPPPPGGPQQRPPQGPPPQGGPQRPPQPGSPQGP 246
Score = 47.4 bits (111), Expect = 1e-04
Identities = 44/140 (31%), Positives = 61/140 (43%), Gaps = 6/140 (4%)
Frame = -2
Query: 508 PQRHPQ--CRRSCSPPTGPRRCSPAHQTQCSRWRGRRPQG--QARPAPRSSWQPPQPRGQ 341
PQR PQ + PP GP++ +P +G PQG Q P P S PP P G
Sbjct: 170 PQRPPQPGSPQGPPPPGGPQQRAP---------QGPPPQGGPQRPPQPGSPQGPPPPGGP 220
Query: 340 RRQPPRG--HRGQPRRSRPSGSSHRSAGW*YRP*RPRPWPSRRRPRSPQWSPAHQRGQQT 167
+++PP+G +G P+R GS P P P P + R PQ P Q+
Sbjct: 221 QQRPPQGPPPQGGPQRPPQPGS----------PQGPPP-PGGPQQRPPQGPPPQGGPQRP 269
Query: 166 PAAGRSWRPGRPRESSARRS 107
P G G P++ ++S
Sbjct: 270 PQPGNP--QGPPQQGGQQQS 287
>ref|NP_492875.2| pre-mRNA splicing SR protein related (68.2 kD) (rsr-1)
[Caenorhabditis elegans] gi|19571645|emb|CAB04214.3| C.
elegans RSR-1 protein (corresponding sequence F28D9.1)
[Caenorhabditis elegans]
Length = 601
Score = 56.2 bits (134), Expect = 2e-07
Identities = 50/158 (31%), Positives = 66/158 (41%), Gaps = 13/158 (8%)
Frame = -2
Query: 481 SCSPPTGPRRCSPAHQTQCSRWRGRRPQGQARPAPRSSWQPPQPRGQRRQPPRGHRGQPR 302
S SPP P+R ++ R + P + R +P +S PP PR +RR P + P+
Sbjct: 354 SKSPPPAPKR---------AKSRSKSPPARRRRSPSASKSPPAPR-RRRSPSKSRSPAPK 403
Query: 301 RSRPSGSSHRSAGW*YRP*RPR---------PWPSRRRPRSPQWSPAHQRGQ---QTPAA 158
R P RS P P+ P P RRR S SPA +R + ++P A
Sbjct: 404 REIPPARRRRSPSASKSPPAPKRAKSRSKSPPAPRRRRSPSQSKSPAPRRRRSPSKSPQA 463
Query: 157 GRSWR-PGRPRESSARRSWPEYAAERRPSRPTGPDGPR 47
R R P + S RR AA RR P PR
Sbjct: 464 PRRRRSPSGSKSRSPRRRRSPAAAPRRRQSPQRRRSPR 501
Score = 52.0 bits (123), Expect = 4e-06
Identities = 52/173 (30%), Positives = 68/173 (39%), Gaps = 12/173 (6%)
Frame = -2
Query: 484 RSCSPPTGPRRCSPAHQTQCSRWRGRRPQGQARPAPRSSWQPPQPR---GQRRQPPRGHR 314
RS SPP RR A ++ + R R P PAP+ P + R + PP R
Sbjct: 367 RSKSPPARRRRSPSASKSPPAPRRRRSPSKSRSPAPKREIPPARRRRSPSASKSPPAPKR 426
Query: 313 GQ--------PRRSRPSGSSHRSAGW*YRP*RPRPWPSRRRPRSPQWSPAHQ-RGQQTPA 161
+ PRR R S A R P RRR RSP S + R +++PA
Sbjct: 427 AKSRSKSPPAPRRRRSPSQSKSPAPRRRRSPSKSPQAPRRR-RSPSGSKSRSPRRRRSPA 485
Query: 160 AGRSWRPGRPRESSARRSWPEYAAERRPSRPTGPDGPRGEER*QAAFSIYGEE 2
A R R S RR ++ R S P P PR + QA ++ E
Sbjct: 486 AAPRRRQSPQRRRSPRRRRSPSSSSRSRSPPPPPRRPRQDSEQQAPVAVKSPE 538
Score = 48.1 bits (113), Expect = 6e-05
Identities = 51/169 (30%), Positives = 67/169 (39%), Gaps = 15/169 (8%)
Frame = -2
Query: 508 PQRHPQCRRSCSPPTGPRRCSPAHQT----QCSRWRGRRPQGQARP--APRSSWQPP--- 356
P+R S SPP RR SP+ + ++ R + P AR +P +S PP
Sbjct: 302 PRRRRSPSASKSPPPARRRRSPSQSKSPAPKRAKSRSKSPPAPARRRRSPSASKSPPPAP 361
Query: 355 -QPRGQRRQPPRGHRGQPRRSR--PSGSSHRSAGW*YRP*RPRPWPSRRRPRSPQWS--- 194
+ + + + PP R P S+ P+ RS P R P RR RSP S
Sbjct: 362 KRAKSRSKSPPARRRRSPSASKSPPAPRRRRSPSKSRSPAPKREIPPARRRRSPSASKSP 421
Query: 193 PAHQRGQQTPAAGRSWRPGRPRESSARRSWPEYAAERRPSRPTGPDGPR 47
PA +R A RS P PR + A RR S P PR
Sbjct: 422 PAPKR-----AKSRSKSPPAPRRRRSPSQSKSPAPRRRRSPSKSPQAPR 465
Score = 41.2 bits (95), Expect = 0.008
Identities = 34/134 (25%), Positives = 56/134 (41%), Gaps = 11/134 (8%)
Frame = -2
Query: 415 RGRRPQGQARPAPRSSWQPPQPRGQRRQPPRGHRGQPRRSR------PSGSSHRSAGW*Y 254
R + + R +P +S PP P +RR P + P+R++ P+ + R +
Sbjct: 296 RAKSGSPRRRRSPSASKSPP-PARRRRSPSQSKSPAPKRAKSRSKSPPAPARRRRSPSAS 354
Query: 253 RP*RPRPWPSRRRPRSPQWSPAHQR-----GQQTPAAGRSWRPGRPRESSARRSWPEYAA 89
+ P P ++ R +SP PA +R + PA R P + R + +R P
Sbjct: 355 KSPPPAPKRAKSRSKSP---PARRRRSPSASKSPPAPRRRRSPSKSRSPAPKREIPPARR 411
Query: 88 ERRPSRPTGPDGPR 47
R PS P P+
Sbjct: 412 RRSPSASKSPPAPK 425
Score = 39.7 bits (91), Expect = 0.023
Identities = 42/160 (26%), Positives = 57/160 (35%), Gaps = 15/160 (9%)
Frame = -2
Query: 514 RCPQRHPQCRRSCSPPTGPRRCSPAHQTQCSRWRGRRPQGQARPAPRSSWQP-------- 359
+ P + + P PRR Q++ R RR ++ APR P
Sbjct: 419 KSPPAPKRAKSRSKSPPAPRRRRSPSQSKSPAPRRRRSPSKSPQAPRRRRSPSGSKSRSP 478
Query: 358 -----PQPRGQRRQPPRGHRGQPRRSRPSGSSHRSAGW*YRP*RPRPWPSRRRPRSPQWS 194
P +RRQ P+ R RR PS SS + P P RRPR
Sbjct: 479 RRRRSPAAAPRRRQSPQRRRSPRRRRSPSSSSRSRS----------PPPPPRRPR----- 523
Query: 193 PAHQRGQQTPAAGRS--WRPGRPRESSARRSWPEYAAERR 80
QQ P A +S + RP ++ + S A E R
Sbjct: 524 --QDSEQQAPVAVKSPEMKKRRPNDTDSDVSVDSEAEEER 561
Score = 35.0 bits (79), Expect = 0.57
Identities = 32/93 (34%), Positives = 41/93 (43%)
Frame = +2
Query: 62 PGRP*RSSLSSVLRPRPPRRRLSRPAWSPAPSCCRRLLPALMSWTPLWRAWSAP*RPRAR 241
P R RS +S P P RRR + SPAP R + PA +P A +P P+
Sbjct: 372 PARRRRSPSASKSPPAPRRRRSPSKSRSPAPK--REIPPARRRRSP--SASKSPPAPKRA 427
Query: 242 SSRPVSPPCRAV*RS*RTATTWLPPMPARWLTP 340
SR SPP R R+ + P P R +P
Sbjct: 428 KSRSKSPPAP---RRRRSPSQSKSPAPRRRRSP 457
Score = 33.9 bits (76), Expect = 1.3
Identities = 39/132 (29%), Positives = 54/132 (40%), Gaps = 8/132 (6%)
Frame = +2
Query: 53 PVRPGRP*RSSLSSVLRPRPPRRRLSRPAWSPAPSCCRRLLPALMSWTPLWRAWSAP*RP 232
P P R RS +S P P+R SR S +P RR P+ P R +P +
Sbjct: 341 PPAPARRRRSPSASKSPPPAPKRAKSR---SKSPPARRRRSPSASKSPPAPRRRRSPSKS 397
Query: 233 RARSSRPVSPPCRAV*RS*RTATTWLPPMP----ARWLTPLAPWLWRLPRRPWS----WP 388
R+ + + PP R R + + PP P +R +P AP R P + S
Sbjct: 398 RSPAPKREIPPAR---RRRSPSASKSPPAPKRAKSRSKSPPAPRRRRSPSQSKSPAPRRR 454
Query: 389 RLPLRPPPPPSR 424
R P + P P R
Sbjct: 455 RSPSKSPQAPRR 466
Score = 32.3 bits (72), Expect = 3.7
Identities = 37/130 (28%), Positives = 48/130 (36%), Gaps = 11/130 (8%)
Frame = +2
Query: 59 RPGRP*RSSLSSVLRPRPPRRRLSRPAWSPAPSCCRRLLPALMSWTPLWR------AWSA 220
+ G P R S + PP RR P+ S +P+ R + P R + S
Sbjct: 298 KSGSPRRRRSPSASKSPPPARRRRSPSQSKSPAPKRAKSRSKSPPAPARRRRSPSASKSP 357
Query: 221 P*RPRARSSRPVSPPCRAV*RS*RTATTWLPPMPARWLTPL-----APWLWRLPRRPWSW 385
P P+ SR SPP R R + + PP P R +P AP P R
Sbjct: 358 PPAPKRAKSRSKSPPAR---RRRSPSASKSPPAPRRRRSPSKSRSPAPKREIPPARRRRS 414
Query: 386 PRLPLRPPPP 415
P PP P
Sbjct: 415 PSASKSPPAP 424
>pir||A24264 proline-rich protein MP2 - mouse (fragment)
Length = 240
Score = 55.8 bits (133), Expect = 3e-07
Identities = 46/157 (29%), Positives = 62/157 (39%), Gaps = 3/157 (1%)
Frame = -2
Query: 508 PQRHPQCRRSCSPPTGPRRCSPAHQTQCSRWRGRRPQGQARPA---PRSSWQPPQPRGQR 338
PQ+ P + PP GP+ P + R PQG PA PR PP P G +
Sbjct: 93 PQQRPP--QGPPPPGGPQPRPPQGPPPPGGPQLRPPQGPPPPAGPQPRPPQGPPPPAGPQ 150
Query: 337 RQPPRGHRGQPRRSRPSGSSHRSAGW*YRP*RPRPWPSRRRPRSPQWSPAHQRGQQTPAA 158
+PP+G + RP+ + G RP + P P +PR PQ P Q +P
Sbjct: 151 PRPPQGPPTTGPQPRPTQGPPPTGGPQQRPPQGPPPPGGPQPRPPQGPPPPGGPQPSPTQ 210
Query: 157 GRSWRPGRPRESSARRSWPEYAAERRPSRPTGPDGPR 47
G P + + P A + P P GPR
Sbjct: 211 G-------PPPTGGPQQTPPLAGNTQGPPPGRPQGPR 240
Score = 51.2 bits (121), Expect = 8e-06
Identities = 46/150 (30%), Positives = 61/150 (40%), Gaps = 5/150 (3%)
Frame = -2
Query: 472 PPTGPRRCSPAHQTQCSRWRGRRPQGQARPA---PRSSWQPPQPRGQRRQPPRG--HRGQ 308
PP GP+ P + R PQG P PR PP P G +++PP+G G
Sbjct: 47 PPGGPQPRPPQGPPPPGGPQPRPPQGPPPPGGPQPRPPQGPPPPGGPQQRPPQGPPPPGG 106
Query: 307 PRRSRPSGSSHRSAGW*YRP*RPRPWPSRRRPRSPQWSPAHQRGQQTPAAGRSWRPGRPR 128
P+ P G G RP + P P+ +PR PQ P Q P G +PR
Sbjct: 107 PQPRPPQGPP-PPGGPQLRPPQGPPPPAGPQPRPPQGPPPPAGPQPRPPQGPPTTGPQPR 165
Query: 127 ESSARRSWPEYAAERRPSRPTGPDGPRGEE 38
+ P ++RP P GP P G +
Sbjct: 166 PTQGPP--PTGGPQQRP--PQGPPPPGGPQ 191
Score = 46.2 bits (108), Expect = 2e-04
Identities = 44/145 (30%), Positives = 58/145 (39%)
Frame = -2
Query: 472 PPTGPRRCSPAHQTQCSRWRGRRPQGQARPAPRSSWQPPQPRGQRRQPPRGHRGQPRRSR 293
PP+G + P + +Q +G P G P PR PP P G + +PP+G
Sbjct: 14 PPSGFQPRPPVNGSQ----QGPPPPGG--PQPRPPQGPPPPGGPQPRPPQG-------PP 60
Query: 292 PSGSSHRSAGW*YRP*RPRPWPSRRRPRSPQWSPAHQRGQQTPAAGRSWRPGRPRESSAR 113
P G RP + P P +PR PQ P QQ P G PG P+ +
Sbjct: 61 PPGGPQP------RPPQGPPPPGGPQPRPPQGPPPPGGPQQRPPQGPP-PPGGPQPRPPQ 113
Query: 112 RSWPEYAAERRPSRPTGPDGPRGEE 38
P + RP P GP P G +
Sbjct: 114 GPPPPGGPQLRP--PQGPPPPAGPQ 136
>ref|ZP_00057775.1| COG1205: Distinct helicase family with a unique C-terminal domain
including a metal-binding cysteine cluster [Thermobifida
fusca]
Length = 1958
Score = 55.5 bits (132), Expect = 4e-07
Identities = 61/200 (30%), Positives = 76/200 (37%), Gaps = 40/200 (20%)
Frame = -2
Query: 502 RHPQCRRSCSPPTGPRRCSPAHQTQCSRWRGRRPQGQARPAPRSSWQPPQPR--GQRRQP 329
RH + RR GP R PA + R RR + R PR P+ R G RRQ
Sbjct: 87 RHRRDRRPAVAGRGPGR--PARLRRRGPRRRRRLCPRPRRTPRHRAARPRLRRTGPRRQR 144
Query: 328 PRGHRGQPRRSRPSGSSHRSAGW*Y------------------RP*RPRPWPSRRRPRSP 203
P HR +P R P+ HR G P RPRP +R R R+
Sbjct: 145 PHPHRQRPGRRTPARPGHRGRGGPAVAAAARRGRPGHQRPAAPHPHRPRPAAARAR-RTA 203
Query: 202 QWSPAHQRGQQTPAAGRSWRPGRPRES--------------------SARRSWPEYAAER 83
+ A R ++ AGR RP RPR + RRS AA R
Sbjct: 204 GGAGAQLRARRRATAGRRRRPARPRRPLRRLRRTPRPHARGSGQGRVAGRRSGAPGAAGR 263
Query: 82 RPSRPTGPDGPRGEER*QAA 23
R P P RG ++ +AA
Sbjct: 264 RQPLPGRPRPVRGAQQHRAA 283
Score = 50.8 bits (120), Expect = 1e-05
Identities = 54/159 (33%), Positives = 63/159 (38%), Gaps = 33/159 (20%)
Frame = -2
Query: 460 PRRCSPAHQTQCSRWRGRRPQ-GQARPAPRSSWQPPQPRGQRRQPPRGHR---------- 314
P R PAH + R R RRP+ G A P PR RG RR+P HR
Sbjct: 48 PARRQPAHLPRARRPRLRRPRGGAALPGPR--------RGGRRRPAARHRRDRRPAVAGR 99
Query: 313 --GQPRRSRPSG--------------SSHRSAGW*YR---P*RPRPWPSRRRP-RSPQWS 194
G+P R R G HR+A R P R RP P R+RP R
Sbjct: 100 GPGRPARLRRRGPRRRRRLCPRPRRTPRHRAARPRLRRTGPRRQRPHPHRQRPGRRTPAR 159
Query: 193 PAH--QRGQQTPAAGRSWRPGRPRESSARRSWPEYAAER 83
P H + G AA R RPG R ++ P AA R
Sbjct: 160 PGHRGRGGPAVAAAARRGRPGHQRPAAPHPHRPRPAAAR 198
Score = 37.4 bits (85), Expect = 0.11
Identities = 47/144 (32%), Positives = 51/144 (34%), Gaps = 6/144 (4%)
Frame = -2
Query: 472 PPTGPRRCSPAHQTQCSRWRGRRPQGQ-----ARPAPRSSWQPPQPRGQRRQPPRGHRGQ 308
P GP R PA + RGR G ARPA R P+ R R + PRG
Sbjct: 17 PARGPARRRPAGG---AAGRGRPVVGGGVARLARPARRQPAHLPRARRPRLRRPRGGAAL 73
Query: 307 PRRSRPSGSSHRSAGW*YRP*RPRPWPSRRRPRSPQWSPAHQRGQQTPAAGRSWRPGRPR 128
P R R RP RR R P RG PA R P R R
Sbjct: 74 PGPRRGG--------------RRRPAARHRRDRRP---AVAGRGPGRPARLRRRGPRRRR 116
Query: 127 ESSAR-RSWPEYAAERRPSRPTGP 59
R R P + A R R TGP
Sbjct: 117 RLCPRPRRTPRHRAARPRLRRTGP 140