Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC003249A_C01 KCC003249A_c01
(403 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAK30597.1|AF350268_1 major ampullate spidroin 2-like protein... 51 3e-06
ref|NP_506284.2| COLlagen structural gene (30.3 kD) (col-160) [C... 50 8e-06
sp|P33479|IE18_PRVKA IMMEDIATE-EARLY PROTEIN IE180 gi|418707|pir... 50 8e-06
pir||T22827 hypothetical protein F57B1.4 - Caenorhabditis elegans 50 8e-06
gb|AAC04503.1| spidroin 2 [Araneus bicentenarius] 50 8e-06
>gb|AAK30597.1|AF350268_1 major ampullate spidroin 2-like protein [Argiope trifasciata]
Length = 120
Score = 51.2 bits (121), Expect = 3e-06
Identities = 43/117 (36%), Positives = 52/117 (43%), Gaps = 4/117 (3%)
Frame = +2
Query: 26 GPYGPQQPTTATSTAAAAVAAGEAG-GASGRQLQPAPAPPTAASRYPWNMGWGAGASGAA 202
G YG P A++ AAAA A G G G SG P PP P G A A+ AA
Sbjct: 8 GGYGTSGPGGASAAAAAAAAGGPGGQGPSG------PGPPGPGGYGPSGPGAAAAAAAAA 61
Query: 203 GRENT--PLRQLPP-FPPPQPELAALPLHRAQATGTVATGTAGPGEQGPQGYQQRSG 364
G + P +Q P + P P A+ A A G G+ GPG+QGP Q G
Sbjct: 62 GGPGSQGPGQQGPGGYGPSGPGGASAAAAAAAAGGPGGQGSYGPGQQGPGAGQYGPG 118
Score = 32.0 bits (71), Expect = 2.1
Identities = 37/114 (32%), Positives = 42/114 (36%), Gaps = 9/114 (7%)
Frame = -2
Query: 339 GPCSPGPAVPVATVPVACARCSGRAASSGCGG---------GKGGSWRSGVFSLPAAPLA 187
GP GP + P + + AA+ G GG G GG SG AA A
Sbjct: 1 GPGQQGPGGYGTSGPGGASAAAAAAAAGGPGGQGPSGPGPPGPGGYGPSGP-GAAAAAAA 59
Query: 186 PAPQPMFHG*RLAAVGGAGAGWSCLPLAPPASPAATAAAAVEVAVVGCWGPYGP 25
A P G GG G P P + AA AAAA G G YGP
Sbjct: 60 AAGGPGSQGPGQQGPGGYG------PSGPGGASAAAAAAAA--GGPGGQGSYGP 105
>ref|NP_506284.2| COLlagen structural gene (30.3 kD) (col-160) [Caenorhabditis
elegans] gi|24817333|emb|CAB01508.2| C. elegans COL-160
protein (corresponding sequence F57B1.4) [Caenorhabditis
elegans]
Length = 317
Score = 50.1 bits (118), Expect = 8e-06
Identities = 41/110 (37%), Positives = 49/110 (44%), Gaps = 1/110 (0%)
Frame = +2
Query: 20 PLGPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGAGASGA 199
P GP GP + A A G++GGAS P PA P S P + G AGA GA
Sbjct: 175 PAGPPGPSGAPGQKGPSGAPGAPGQSGGAS-LPGPPGPAGPPGPSGQPGSNG-NAGAPGA 232
Query: 200 AGR-ENTPLRQLPPFPPPQPELAALPLHRAQATGTVATGTAGPGEQGPQG 346
G+ + P P PP P A P Q +G+A PG GPQG
Sbjct: 233 PGQVVDVPGTPGPAGPPGPPGPAGAPGQPGQ------SGSAQPGGPGPQG 276
>sp|P33479|IE18_PRVKA IMMEDIATE-EARLY PROTEIN IE180 gi|418707|pir||A45344 immediate-early
protein - suid herpesvirus 1 (strain Kaplan)
gi|334071|gb|AAA47470.1| immediate-early protein
Length = 1446
Score = 50.1 bits (118), Expect = 8e-06
Identities = 45/124 (36%), Positives = 51/124 (40%), Gaps = 6/124 (4%)
Frame = +2
Query: 35 GPQQPTTATSTAAAAVAAGEAGGASGRQLQPA-----PAPPTAASRYPWNMGWGAGASGA 199
GP+ PT A AA A A G G +S PA P P A R+ G G G
Sbjct: 144 GPRPPTPAALAAAEAGAPGGPGRSSPSAASPASSSGSPGPSAAPRRWSPARGDPVGEPGP 203
Query: 200 AGRENTPLRQLPPFPPPQP-ELAALPLHRAQATGTVATGTAGPGEQGPQGYQQRSGGGDT 376
A R TP PP QP +AA P R A + A+ AGP P G S GGD
Sbjct: 204 AARPRTPA------PPAQPAAVAAAPARRGPA--SPASPAAGP-VSAPGGGGAPSAGGDR 254
Query: 377 TEAH 388
H
Sbjct: 255 GRHH 258
Score = 32.7 bits (73), Expect = 1.2
Identities = 31/111 (27%), Positives = 45/111 (39%), Gaps = 3/111 (2%)
Frame = +2
Query: 14 AEP---LGPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGA 184
AEP L P P+QP A A AAG G G +PA ++S + +
Sbjct: 804 AEPAPGLPPLWPEQPGLVVPAPAPA-AAGAPSGLPGSG-PSSPASTKSSSSTKSSSSTKS 861
Query: 185 GASGAAGRENTPLRQLPPFPPPQPELAALPLHRAQATGTVATGTAGPGEQG 337
G SG++G ++P P P + + P R G G +G +G
Sbjct: 862 GLSGSSGYASSPAAGPDPAPERRKKKRRAPGARRPGDGEEDEGLSGAALRG 912
Score = 30.4 bits (67), Expect = 6.2
Identities = 24/68 (35%), Positives = 29/68 (42%)
Frame = -2
Query: 378 VVSPPPLRCWYPCGPCSPGPAVPVATVPVACARCSGRAASSGCGGGKGGSWRSGVFSLPA 199
V +P P P G GP+ P +T S +SS G GS SG S PA
Sbjct: 822 VPAPAPAAAGAPSGLPGSGPSSPAST-----KSSSSTKSSSSTKSGLSGS--SGYASSPA 874
Query: 198 APLAPAPQ 175
A PAP+
Sbjct: 875 AGPDPAPE 882
Score = 30.0 bits (66), Expect = 8.0
Identities = 32/92 (34%), Positives = 36/92 (38%), Gaps = 2/92 (2%)
Frame = -2
Query: 339 GPCSPGPAVPVATVPVACARCSGRAASSGCGGGKGGSWRSGVF--SLPAAPLAPAPQPMF 166
GP S + P P A A AA +G GG G S S S +P A +
Sbjct: 136 GPRSRAGSGPRPPTPAALA-----AAEAGAPGGPGRSSPSAASPASSSGSPGPSAAPRRW 190
Query: 165 HG*RLAAVGGAGAGWSCLPLAPPASPAATAAA 70
R VG G APPA PAA AAA
Sbjct: 191 SPARGDPVGEPGPAARPRTPAPPAQPAAVAAA 222
Score = 30.0 bits (66), Expect = 8.0
Identities = 37/133 (27%), Positives = 46/133 (33%), Gaps = 13/133 (9%)
Frame = +2
Query: 11 VAEPLGPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGAGA 190
V EP P+ P AA A A G AS PA P A G A
Sbjct: 198 VGEPGPAARPRTPAPPAQPAAVAAAPARRGPAS-------PASPAAGPVSAPGGGGAPSA 250
Query: 191 SGAAGR-----------ENTPLRQLPPFP-PPQPELAALPLHRAQATGTVATGTAGPG-E 331
G GR E R+L P P + +++ P + +T TVA T G E
Sbjct: 251 GGDRGRHHHQHREPLLDEPAAARRLDPRPLGARSPVSSNPNSNSNSTTTVAVETVARGPE 310
Query: 332 QGPQGYQQRSGGG 370
+ G GG
Sbjct: 311 KDEDGLGLAGDGG 323
>pir||T22827 hypothetical protein F57B1.4 - Caenorhabditis elegans
Length = 356
Score = 50.1 bits (118), Expect = 8e-06
Identities = 41/110 (37%), Positives = 49/110 (44%), Gaps = 1/110 (0%)
Frame = +2
Query: 20 PLGPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGAGASGA 199
P GP GP + A A G++GGAS P PA P S P + G AGA GA
Sbjct: 214 PAGPPGPSGAPGQKGPSGAPGAPGQSGGAS-LPGPPGPAGPPGPSGQPGSNG-NAGAPGA 271
Query: 200 AGR-ENTPLRQLPPFPPPQPELAALPLHRAQATGTVATGTAGPGEQGPQG 346
G+ + P P PP P A P Q +G+A PG GPQG
Sbjct: 272 PGQVVDVPGTPGPAGPPGPPGPAGAPGQPGQ------SGSAQPGGPGPQG 315
>gb|AAC04503.1| spidroin 2 [Araneus bicentenarius]
Length = 236
Score = 50.1 bits (118), Expect = 8e-06
Identities = 43/129 (33%), Positives = 54/129 (41%), Gaps = 9/129 (6%)
Frame = +2
Query: 26 GPYGPQQPTTATSTAAAAVAAGEAGGASGRQLQPAPAPPTAASRYPWNMGWGAGASGAAG 205
GPYGP AAAA AAG G SG+Q P + P+ G A A+ A G
Sbjct: 2 GPYGP-------GAAAAAAAAGGYGPGSGQQ---GPGQQGPGQQGPYGPGAAAAAAAAGG 51
Query: 206 R-----ENTPLRQLP----PFPPPQPELAALPLHRAQATGTVATGTAGPGEQGPQGYQQR 358
+ P++Q P P+ P AA +G G GPG+QGP G Q
Sbjct: 52 YGPGSGQQGPVQQGPGQQGPYGPGASAAAAAAGGYGPGSGQQGPGQQGPGQQGP-GQQGP 110
Query: 359 SGGGDTTEA 385
G G + A
Sbjct: 111 YGAGASAAA 119
Score = 32.0 bits (71), Expect = 2.1
Identities = 32/109 (29%), Positives = 43/109 (39%), Gaps = 4/109 (3%)
Frame = +2
Query: 26 GPYGPQQPTTATSTAAAAVAAGEAGGASGRQ--LQPAPAPPTAASRYPWNMGWGAGASGA 199
G GP P +AAA AAG G SG+Q Q P + P+ G A A+ A
Sbjct: 67 GQQGPYGP----GASAAAAAAGGYGPGSGQQGPGQQGPGQQGPGQQGPYGAGASAAAAAA 122
Query: 200 AGRENTPLRQLP--PFPPPQPELAALPLHRAQATGTVATGTAGPGEQGP 340
G +Q P P AA L + A+ V++ + GP
Sbjct: 123 GGYGPGSGQQGPGVRVAAPVASAAASRLSSSAASSRVSSAVSSLVSSGP 171