Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC006174A_C01 KMC006174A_c01
(532 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_177041.2| arabinogalactan-protein, putative (AGP19); This... 100 2e-20
pir||H96711 hypothetical protein F14K14.17 [imported] - Arabidop... 88 6e-17
dbj|BAA81686.1| expressed in cucumber hypocotyls [Cucumis sativus] 72 5e-12
ref|NP_568027.1| arabinogalactan-protein (AGP18); protein id: At... 65 5e-10
gb|AAL06470.1|AF411780_1 AT4g37450/F6G17_100 [Arabidopsis thalia... 65 5e-10
>ref|NP_177041.2| arabinogalactan-protein, putative (AGP19); This gene structure is
inaccurate, likely due to discrepancies within
overlapping bac sequences. This will be resolved asap.
In the meantime, an either full or partial translation
is provided. [Arabidopsis thaliana]
Length = 247
Score = 99.8 bits (247), Expect = 2e-20
Identities = 58/106 (54%), Positives = 69/106 (64%), Gaps = 1/106 (0%)
Frame = -1
Query: 532 PSPVPSPPTQAPTPVTEAPAPAPVSPKHKKKGHKHKHRRHHAPAPAPTVISKSPPAPPTD 353
P+PV PP QAP+P++ PAPAP KHK+K HKHK R HHAPAPAP I SPP+PP
Sbjct: 148 PAPVSPPPVQAPSPISLPPAPAPAPTKHKRK-HKHK-RHHHAPAPAP--IPPSPPSPPV- 202
Query: 352 STADADTAPAPSPSLDLNGAPSNHLKGKHIL-ATAGLAIAVLLAVT 218
T DTAPAPSP + G N LKG+ ++ GL I LLA+T
Sbjct: 203 LTDPQDTAPAPSP--NTGGNALNQLKGRAVMWLNTGLVILFLLAMT 246
Score = 35.4 bits (80), Expect = 0.42
Identities = 29/90 (32%), Positives = 31/90 (34%), Gaps = 9/90 (10%)
Frame = -1
Query: 529 SPVPS-----PPTQAPTPVTEAPAPAPVSPKHKKKGHKHKHRRHHAPA----PAPTVISK 377
SPV S PPT A P T AP P +P PA P P V
Sbjct: 30 SPVTSTTTAPPPTTAAPPTTAAPPPTTTTPPVSAA---------QPPASPVTPPPAVTPT 80
Query: 376 SPPAPPTDSTADADTAPAPSPSLDLNGAPS 287
SPPAP T P P AP+
Sbjct: 81 SPPAPKVAPVISPATPPPQPPQSPPASAPT 110
Score = 34.3 bits (77), Expect = 0.94
Identities = 28/84 (33%), Positives = 36/84 (42%), Gaps = 10/84 (11%)
Frame = -1
Query: 532 PSPVPSPPTQAPTPVTEAPAPA-PVSPK---HKKKGHKHKHRRHHAPAPAPTVISKSPPA 365
P+ PPT PV+ A PA PV+P K +PA P +SPPA
Sbjct: 47 PTTAAPPPTTTTPPVSAAQPPASPVTPPPAVTPTSPPAPKVAPVISPATPPPQPPQSPPA 106
Query: 364 ------PPTDSTADADTAPAPSPS 311
PP S A T+P P+P+
Sbjct: 107 SAPTVSPPPVSPPPAPTSPPPTPA 130
>pir||H96711 hypothetical protein F14K14.17 [imported] - Arabidopsis thaliana
gi|5734705|gb|AAD49970.1|AC008075_3 F24J5.4 [Arabidopsis
thaliana] gi|12324144|gb|AAG52045.1|AC011914_15
hypothetical protein; 88190-87522 [Arabidopsis thaliana]
Length = 222
Score = 88.2 bits (217), Expect = 6e-17
Identities = 46/74 (62%), Positives = 53/74 (71%)
Frame = -1
Query: 532 PSPVPSPPTQAPTPVTEAPAPAPVSPKHKKKGHKHKHRRHHAPAPAPTVISKSPPAPPTD 353
P+PV PP QAP+P++ PAPAP KHK+K HKHK R HHAPAPAP I SPP+PP
Sbjct: 148 PAPVSPPPVQAPSPISLPPAPAPAPTKHKRK-HKHK-RHHHAPAPAP--IPPSPPSPPV- 202
Query: 352 STADADTAPAPSPS 311
T DTAPAPSP+
Sbjct: 203 LTDPQDTAPAPSPN 216
Score = 45.1 bits (105), Expect = 5e-04
Identities = 34/95 (35%), Positives = 40/95 (41%), Gaps = 6/95 (6%)
Frame = -1
Query: 532 PSPVPSPPTQAPT--PVTEAPAPAPVSPKHKKKGHKHKHRRHHAPAPAPTVISKSPPAP- 362
P P SPP APT P +P PAP SP APA P + PPAP
Sbjct: 98 PQPPQSPPASAPTVSPPPVSPPPAPTSPPPTPASPPP------APASPPPAPASPPPAPV 151
Query: 361 ---PTDSTADADTAPAPSPSLDLNGAPSNHLKGKH 266
P + + PAP+P AP+ H K KH
Sbjct: 152 SPPPVQAPSPISLPPAPAP------APTKH-KRKH 179
Score = 35.4 bits (80), Expect = 0.42
Identities = 29/90 (32%), Positives = 31/90 (34%), Gaps = 9/90 (10%)
Frame = -1
Query: 529 SPVPS-----PPTQAPTPVTEAPAPAPVSPKHKKKGHKHKHRRHHAPA----PAPTVISK 377
SPV S PPT A P T AP P +P PA P P V
Sbjct: 30 SPVTSTTTAPPPTTAAPPTTAAPPPTTTTPPVSAA---------QPPASPVTPPPAVTPT 80
Query: 376 SPPAPPTDSTADADTAPAPSPSLDLNGAPS 287
SPPAP T P P AP+
Sbjct: 81 SPPAPKVAPVISPATPPPQPPQSPPASAPT 110
Score = 34.3 bits (77), Expect = 0.94
Identities = 28/84 (33%), Positives = 36/84 (42%), Gaps = 10/84 (11%)
Frame = -1
Query: 532 PSPVPSPPTQAPTPVTEAPAPA-PVSPK---HKKKGHKHKHRRHHAPAPAPTVISKSPPA 365
P+ PPT PV+ A PA PV+P K +PA P +SPPA
Sbjct: 47 PTTAAPPPTTTTPPVSAAQPPASPVTPPPAVTPTSPPAPKVAPVISPATPPPQPPQSPPA 106
Query: 364 ------PPTDSTADADTAPAPSPS 311
PP S A T+P P+P+
Sbjct: 107 SAPTVSPPPVSPPPAPTSPPPTPA 130
>dbj|BAA81686.1| expressed in cucumber hypocotyls [Cucumis sativus]
Length = 243
Score = 71.6 bits (174), Expect = 5e-12
Identities = 40/100 (40%), Positives = 53/100 (53%)
Frame = -1
Query: 532 PSPVPSPPTQAPTPVTEAPAPAPVSPKHKKKGHKHKHRRHHAPAPAPTVISKSPPAPPTD 353
P+P SPP +P E PAPAP K KK H APAP+P ++ PPAPP++
Sbjct: 151 PAPESSPPAPVASPPVEVPAPAPSKKKSKK---------HRAPAPSPALL--GPPAPPSE 199
Query: 352 STADADTAPAPSPSLDLNGAPSNHLKGKHILATAGLAIAV 233
+ A ++ PAPSPSL+ +K LA A+AV
Sbjct: 200 APAGSEEGPAPSPSLEDKSGAEALMKVAGSLALGWAAVAV 239
Score = 37.0 bits (84), Expect = 0.15
Identities = 24/76 (31%), Positives = 31/76 (40%), Gaps = 3/76 (3%)
Frame = -1
Query: 532 PSPVPSPPTQAP---TPVTEAPAPAPVSPKHKKKGHKHKHRRHHAPAPAPTVISKSPPAP 362
P+PV +PP AP PV PA P + PA +P S P +P
Sbjct: 68 PAPVSTPPASAPPAVAPVASPPASTPPTAS--------------VPASSPPAASVPPSSP 113
Query: 361 PTDSTADADTAPAPSP 314
P +T A + P P P
Sbjct: 114 PA-ATVPASSPPVPVP 128
>ref|NP_568027.1| arabinogalactan-protein (AGP18); protein id: At4g37450.1, supported
by cDNA: gi_11935087, supported by cDNA: gi_15724155
[Arabidopsis thaliana]
gi|11935088|gb|AAG41964.1|AF305940_1 arabinogalactan
protein AGP18 [Arabidopsis thaliana]
Length = 209
Score = 65.1 bits (157), Expect = 5e-10
Identities = 44/104 (42%), Positives = 55/104 (52%), Gaps = 2/104 (1%)
Frame = -1
Query: 532 PSPVP-SPPTQAPTPVTEAPAPAPVSPKHKKKGHKHKHRRHHAPAPAPTVISKSPPAPPT 356
P+PV SPP PV + PAPAP KHKK K K + APAPAP ++ PPAPPT
Sbjct: 108 PAPVADSPPAPVAAPVADVPAPAP--SKHKKTTKKSK-KHQAAPAPAPELL--GPPAPPT 162
Query: 355 DSTADADTAPAPSPSL-DLNGAPSNHLKGKHILATAGLAIAVLL 227
+S A +P PS D +GA S + + A AVL+
Sbjct: 163 ESPGPNSDAFSPGPSADDQSGAASTRVLRNVAVGAVATAWAVLV 206
Score = 37.7 bits (86), Expect = 0.085
Identities = 24/80 (30%), Positives = 33/80 (41%)
Frame = -1
Query: 529 SPVPSPPTQAPTPVTEAPAPAPVSPKHKKKGHKHKHRRHHAPAPAPTVISKSPPAPPTDS 350
+P +P A +PV +PAPVS +P P P V SPP P
Sbjct: 54 APAKTPTASASSPVESPKSPAPVSES--------------SPPPTP-VPESSPPVPAPMV 98
Query: 349 TADADTAPAPSPSLDLNGAP 290
++ + P P+P D AP
Sbjct: 99 SSPVSSPPVPAPVADSPPAP 118
>gb|AAL06470.1|AF411780_1 AT4g37450/F6G17_100 [Arabidopsis thaliana]
gi|20334856|gb|AAM16184.1| AT4g37450/F6G17_100
[Arabidopsis thaliana]
Length = 113
Score = 65.1 bits (157), Expect = 5e-10
Identities = 44/104 (42%), Positives = 55/104 (52%), Gaps = 2/104 (1%)
Frame = -1
Query: 532 PSPVP-SPPTQAPTPVTEAPAPAPVSPKHKKKGHKHKHRRHHAPAPAPTVISKSPPAPPT 356
P+PV SPP PV + PAPAP KHKK K K + APAPAP ++ PPAPPT
Sbjct: 12 PAPVADSPPAPVAAPVADVPAPAP--SKHKKTTKKSK-KHQAAPAPAPELL--GPPAPPT 66
Query: 355 DSTADADTAPAPSPSL-DLNGAPSNHLKGKHILATAGLAIAVLL 227
+S A +P PS D +GA S + + A AVL+
Sbjct: 67 ESPGPNSDAFSPGPSADDQSGAASTRVLRNVAVGAVATAWAVLV 110
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 485,212,898
Number of Sequences: 1393205
Number of extensions: 12484130
Number of successful extensions: 173911
Number of sequences better than 10.0: 4078
Number of HSP's better than 10.0 without gapping: 75069
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 129910
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 17596710992
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)