Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003543A_C01 KMC003543A_c01
(650 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
pir||S23737 proline-rich protein precursor - kidney bean gi|2104... 221 9e-57
emb|CAC16734.1| arabinogalactan protein [Daucus carota] 156 2e-37
gb|AAD30268.1|AF117607_1 putative hybrid proline-rich protein PR... 148 5e-35
ref|NP_174150.1| prolin-rich protein, putative; protein id: At1g... 136 3e-31
ref|NP_180935.1| putative proline-rich protein; protein id: At2g... 130 1e-29
>pir||S23737 proline-rich protein precursor - kidney bean
gi|21046|emb|CAA42942.1| proline-rich protein [Phaseolus
vulgaris]
Length = 297
Score = 221 bits (562), Expect = 9e-57
Identities = 104/139 (74%), Positives = 118/139 (84%), Gaps = 2/139 (1%)
Frame = -3
Query: 627 PAPVHPPV--HPFPRSFVAVQGVVFTKSCKYAGVDTLLGATPISGAVVKLECNNTRYRLV 454
P PVHPPV HPF RSFVAVQGVV+ KSCKYAGVDTLLGATP+ GAVVK++CNNT+Y+LV
Sbjct: 155 PVPVHPPVPVHPFRRSFVAVQGVVYVKSCKYAGVDTLLGATPLLGAVVKVQCNNTKYKLV 214
Query: 453 QKVKTDHNGYFYLQAPKTITTFGAHKCKVFLVSAPAGLKPSNLHGGVIGSGLRPVKPFLY 274
+ K+D NGYFY +APK+ITT+GAHKC V LV AP GLKPSNLH GV G+ LRP KPFL
Sbjct: 215 ETSKSDKNGYFYFEAPKSITTYGAHKCNVVLVGAPYGLKPSNLHSGVTGAVLRPEKPFLS 274
Query: 273 QKLPFLLYNVGPLAFEPTC 217
+KLPF+LY VGPLAFEP C
Sbjct: 275 KKLPFVLYTVGPLAFEPKC 293
Score = 32.0 bits (71), Expect = 7.4
Identities = 12/19 (63%), Positives = 14/19 (73%), Gaps = 2/19 (10%)
Frame = -3
Query: 648 HHHYPPTPAP--VHPPVHP 598
HHH+PP PAP +HPP P
Sbjct: 48 HHHHPPAPAPAPLHPPSPP 66
>emb|CAC16734.1| arabinogalactan protein [Daucus carota]
Length = 242
Score = 156 bits (395), Expect = 2e-37
Identities = 84/148 (56%), Positives = 102/148 (68%), Gaps = 5/148 (3%)
Frame = -3
Query: 636 PPTPAPVHPPVHPFPRSFVAVQGVVFTKSCKYAGVDTLLGATPISGAVVKLECNNTRYRL 457
PP AP H P R VAVQGVV+ K C Y GV+TLLGATP+ GAVVKL+CNNT+Y L
Sbjct: 92 PPVKAPSHAPTPLPARKLVAVQGVVYCKPCNYTGVETLLGATPLLGAVVKLQCNNTKYPL 151
Query: 456 VQKVKTDHNGYFYLQAPKTITTFGAHKCKVFLVSAPAGL--KPSNLHGGVIGSGL-RPVK 286
V + KTD NGYF L APKTITT+G HKC+VF+VS+P KP+NL GV G+ L + K
Sbjct: 152 VVQGKTDKNGYFSLNAPKTITTYGVHKCRVFVVSSPEKKCDKPTNLRYGVKGAILEKSTK 211
Query: 285 PFLYQKLP--FLLYNVGPLAFEPTCPGP 208
P + K P F +++VGP AFEP+ P
Sbjct: 212 PPVSTKTPATFEMFSVGPFAFEPSTKKP 239
>gb|AAD30268.1|AF117607_1 putative hybrid proline-rich protein PRP1 [Trifolium subterraneum]
Length = 97
Score = 148 bits (374), Expect = 5e-35
Identities = 65/96 (67%), Positives = 82/96 (84%)
Frame = -3
Query: 501 GAVVKLECNNTRYRLVQKVKTDHNGYFYLQAPKTITTFGAHKCKVFLVSAPAGLKPSNLH 322
GAVVKL+CNNT+Y+LVQ +TD NGYF+++ PK+IT++ AHKC V LVSAP GLKPSNLH
Sbjct: 1 GAVVKLQCNNTKYKLVQTHETDKNGYFFIEGPKSITSYAAHKCNVVLVSAPNGLKPSNLH 60
Query: 321 GGVIGSGLRPVKPFLYQKLPFLLYNVGPLAFEPTCP 214
GG+ G+GLRP KP++ + LPF++Y VGPLAFEP CP
Sbjct: 61 GGLTGAGLRPGKPYVSKGLPFIVYTVGPLAFEPKCP 96
>ref|NP_174150.1| prolin-rich protein, putative; protein id: At1g28290.1 [Arabidopsis
thaliana] gi|25513461|pir||B86409 F3H9.6 protein -
Arabidopsis thaliana gi|9795608|gb|AAF98426.1|AC021044_5
Unknown protein [Arabidopsis thaliana]
Length = 359
Score = 136 bits (342), Expect = 3e-31
Identities = 75/150 (50%), Positives = 94/150 (62%), Gaps = 9/150 (6%)
Frame = -3
Query: 636 PPTPAPVHPPVHP--FPRSFVAVQGVVFTKSCKYAGVDTLLGATPISGAVVKLECNNTRY 463
PPT PV PPV+P F RS VAV+G V+ KSCKYA +TLLGA PI GA VKL C + +
Sbjct: 210 PPTKPPVTPPVYPPKFNRSLVAVRGTVYCKSCKYAAFNTLLGAKPIEGATVKLVCKSKK- 268
Query: 462 RLVQKVKTDHNGYFYLQAPKTITTFGAHKCKVFLVSAP--AGLKPSNLHGGVIGSGLRPV 289
+ + TD NGYF L APKT+T FG C+V+LV + K S L GG +G+ L+P
Sbjct: 269 NITAETTTDKNGYFLLLAPKTVTNFGFRGCRVYLVKSKDYKCSKVSKLFGGDVGAELKPE 328
Query: 288 KPF-----LYQKLPFLLYNVGPLAFEPTCP 214
K + KL + L+NVGP AF P+CP
Sbjct: 329 KKLGKSTVVVNKLVYGLFNVGPFAFNPSCP 358
>ref|NP_180935.1| putative proline-rich protein; protein id: At2g33790.1 [Arabidopsis
thaliana] gi|25408310|pir||F84749 probable proline-rich
protein [imported] - Arabidopsis thaliana
gi|1707022|gb|AAC69131.1| putative proline-rich protein
[Arabidopsis thaliana] gi|27754471|gb|AAO22683.1|
putative proline-rich protein [Arabidopsis thaliana]
Length = 239
Score = 130 bits (328), Expect = 1e-29
Identities = 72/150 (48%), Positives = 96/150 (64%), Gaps = 9/150 (6%)
Frame = -3
Query: 636 PPTPAPVHPPVHP--FPRSFVAVQGVVFTKSCKYAGVDTLLGATPISGAVVKLECNNTRY 463
PP PV PPV+P + ++ VAV+GVV+ K+CKYAGV+ + GA P+ AVV+L C N +
Sbjct: 90 PPIKPPVLPPVYPPKYNKTLVAVRGVVYCKACKYAGVNNVQGAKPVKDAVVRLVCKNKK- 148
Query: 462 RLVQKVKTDHNGYFYLQAPKTITTFGAHKCKVFLVSAP--AGLKPSNLHGGVIGSGLRPV 289
+ + KTD NGYF L APKT+T + C+ FLV +P K S+LH G GS L+PV
Sbjct: 149 NSISETKTDKNGYFMLLAPKTVTNYDIKGCRAFLVKSPDTKCSKVSSLHDGGKGSVLKPV 208
Query: 288 -KP----FLYQKLPFLLYNVGPLAFEPTCP 214
KP + + + +YNVGP AFEPTCP
Sbjct: 209 LKPGFSSTIMRWFKYSVYNVGPFAFEPTCP 238
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 689,904,857
Number of Sequences: 1393205
Number of extensions: 18258547
Number of successful extensions: 82468
Number of sequences better than 10.0: 112
Number of HSP's better than 10.0 without gapping: 64674
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 81593
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27860523586
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)