Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC000350A_C01 KCC000350A_c01
(475 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_498814.1| COLlagen structural gene (col-91) [Caenorhabdit... 56 2e-07
ref|XP_325094.1| hypothetical protein [Neurospora crassa] gi|289... 54 7e-07
ref|NP_499982.1| COLlagen structural gene (33.8 kD) (col-103) [C... 54 1e-06
ref|NP_505486.1| COLlagen structural gene (27.2 kD) (col-147) [C... 53 2e-06
ref|NP_505484.1| COLlagen structural gene (27.4 kD) (col-146) [C... 53 2e-06
>ref|NP_498814.1| COLlagen structural gene (col-91) [Caenorhabditis elegans]
gi|465855|sp|P34391|YLS6_CAEEL Putative cuticle collagen
F09G8.6 gi|630599|pir||S44796 F09G8.6 protein -
Caenorhabditis elegans gi|156286|gb|AAA28008.1|
Hypothetical protein F09G8.6 [Caenorhabditis elegans]
Length = 278
Score = 55.8 bits (133), Expect = 2e-07
Identities = 43/122 (35%), Positives = 52/122 (42%), Gaps = 6/122 (4%)
Frame = +2
Query: 107 VELPSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAESGRGPQAASGVAPANPNQSGN 286
++ P+G+ G GQP A G A G A GP +G AP P GN
Sbjct: 132 IQCPAGEAGPAGAPGAPGPAGPDGQPGADGQGGAPGPA-GPEGPAGDAG-APGAPGAPGN 189
Query: 287 SGRA--NGHVDGGARHRAAAAGPQPSGTGPNHAAGRPAS----GKPGPAVAPGANFNGGV 448
G+ NG G A A GPQ GP + G+P S G PGPA APG + G
Sbjct: 190 DGQPGQNGQRSTGTPGAAGAPGPQ----GPVGSDGQPGSAGAPGAPGPAGAPGVDGQPGA 245
Query: 449 LG 454
G
Sbjct: 246 NG 247
Score = 38.1 bits (87), Expect = 0.053
Identities = 30/78 (38%), Positives = 35/78 (44%), Gaps = 3/78 (3%)
Frame = +2
Query: 233 GPQAASGVAPANPNQSGNSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPASGKPGP 412
GP A G A P G++G A +DG A A+AAG A A G PGP
Sbjct: 94 GPPGAPGAA-GEPGVDGDAGAAG--IDGVAIQFASAAGGACIQCPAGEAGPAGAPGAPGP 150
Query: 413 A---VAPGANFNGGVLGP 457
A PGA+ GG GP
Sbjct: 151 AGPDGQPGADGQGGAPGP 168
Score = 36.6 bits (83), Expect = 0.15
Identities = 32/107 (29%), Positives = 42/107 (38%), Gaps = 11/107 (10%)
Frame = +2
Query: 188 ASSHGAASGRAESGRGPQAASGVAPANPNQSGNSGRANGHVDGGARHRAAAAGP------ 349
AS+ G A + +G +A AP P +G G+ GGA A GP
Sbjct: 124 ASAAGGACIQCPAG---EAGPAGAPGAPGPAGPDGQPGADGQGGAPGPAGPEGPAGDAGA 180
Query: 350 -----QPSGTGPNHAAGRPASGKPGPAVAPGANFNGGVLGPRVAAGA 475
P G G+ ++G PG A APG G G +AGA
Sbjct: 181 PGAPGAPGNDGQPGQNGQRSTGTPGAAGAPGPQGPVGSDGQPGSAGA 227
>ref|XP_325094.1| hypothetical protein [Neurospora crassa] gi|28926533|gb|EAA35504.1|
hypothetical protein [Neurospora crassa]
Length = 1168
Score = 54.3 bits (129), Expect = 7e-07
Identities = 39/147 (26%), Positives = 61/147 (40%), Gaps = 4/147 (2%)
Frame = +2
Query: 47 KPAKNGQRNDTTSTKTTSNAVELPS----GKRIVTGTSDVVAKRDNGGQPSASSHGAASG 214
+P+ N RN S ++ P+ K++ G S K +P SS S
Sbjct: 707 QPSSNANRNRNESQAPVESSDSRPTTSAKDKQLSQGPS--APKDAYNNKPPVSSRRPPSS 764
Query: 215 RAESGRGPQAASGVAPANPNQSGNSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPA 394
+ G P + VAP P++ G N ++ G AA A Q TG + +GRP+
Sbjct: 765 QRNGGNAPTTGNAVAPPRPSRDGRPTADNQYLSG-----AAGAPTQRPTTGGSMQSGRPS 819
Query: 395 SGKPGPAVAPGANFNGGVLGPRVAAGA 475
+P P AN +G + P +G+
Sbjct: 820 YAQPAPPEVADANVHGRIQQPSKGSGS 846
>ref|NP_499982.1| COLlagen structural gene (33.8 kD) (col-103) [Caenorhabditis
elegans] gi|25385141|pir||E88633 protein F56B3.1
[imported] - Caenorhabditis elegans
gi|13559616|gb|AAK29827.1| Collagen protein 103
[Caenorhabditis elegans]
Length = 371
Score = 53.5 bits (127), Expect = 1e-06
Identities = 46/158 (29%), Positives = 62/158 (39%), Gaps = 18/158 (11%)
Frame = +2
Query: 47 KPAKNGQRNDTTSTKTTSNAVEL-PSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAE 223
+P NG +++ ++ + P+G G + + N GQP A S G G A
Sbjct: 162 QPGSNGGAGSNGASEGSAGGCKTCPAGPPGPPGPAGQAGRPGNDGQPGAPSFGGGVG-AP 220
Query: 224 SGRGPQAASGVAPANPNQSGNSGRANGHVDGGARHRAA---AAGPQPSGT---------- 364
GP +G +P P G GR + GG+ A P P G
Sbjct: 221 GAPGPAGDAG-SPGQPGAPGQPGRPGKNAQGGSSRPGPPGPAGPPGPPGNNGAPGGGYGV 279
Query: 365 ---GPNHAAGRP-ASGKPGPAVAPGANFNGGVLGPRVA 466
GP +GRP A G+PGP PGA N G G A
Sbjct: 280 GPPGPPGPSGRPGAPGQPGPDGQPGAPGNDGTPGTDAA 317
Score = 43.5 bits (101), Expect = 0.001
Identities = 34/116 (29%), Positives = 44/116 (37%), Gaps = 2/116 (1%)
Frame = +2
Query: 116 PSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAE-SGRGPQAASGVAPANPNQSGNSG 292
P G R G + + GQP ++ ++G +E S G + P P +G +G
Sbjct: 141 PPGPRGPPGQAGLDGLPGAPGQPGSNGGAGSNGASEGSAGGCKTCPAGPPGPPGPAGQAG 200
Query: 293 RANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPAS-GKPGPAVAPGANFNGGVLGP 457
R GA G P GP AG P G PG PG N GG P
Sbjct: 201 RPGNDGQPGAPSFGGGVG-APGAPGPAGDAGSPGQPGAPGQPGRPGKNAQGGSSRP 255
Score = 40.8 bits (94), Expect = 0.008
Identities = 39/102 (38%), Positives = 50/102 (48%), Gaps = 7/102 (6%)
Frame = +2
Query: 173 GGQPSASSHGAASGRAESGRGPQAASGV-----APANPNQSGNSGRANGHVDGGARH-RA 334
G Q S SS+ G RGP +G+ AP P +G +G +NG +G A +
Sbjct: 130 GCQCSPSSNTCPPGP----RGPPGQAGLDGLPGAPGQPGSNGGAG-SNGASEGSAGGCKT 184
Query: 335 AAAGPQPSGTGPNHAAGRPAS-GKPGPAVAPGANFNGGVLGP 457
AGP P GP AGRP + G+PG AP +F GGV P
Sbjct: 185 CPAGP-PGPPGPAGQAGRPGNDGQPG---AP--SFGGGVGAP 220
Score = 37.0 bits (84), Expect = 0.12
Identities = 35/116 (30%), Positives = 39/116 (33%), Gaps = 15/116 (12%)
Frame = +2
Query: 170 NGGQPS-ASSHGAASGRAESGRGPQAASGVAPANPNQSGNSGRANGHVDGGARHRAAAAG 346
NGG S +S G+A G GP G A + GN G+ GG A G
Sbjct: 166 NGGAGSNGASEGSAGGCKTCPAGPPGPPGPA-GQAGRPGNDGQPGAPSFGGGVGAPGAPG 224
Query: 347 P-----QPSGTGPNHAAGRPASG---------KPGPAVAPGANFNGGVLGPRVAAG 472
P P G GRP PGPA PG N G G G
Sbjct: 225 PAGDAGSPGQPGAPGQPGRPGKNAQGGSSRPGPPGPAGPPGPPGNNGAPGGGYGVG 280
>ref|NP_505486.1| COLlagen structural gene (27.2 kD) (col-147) [Caenorhabditis
elegans] gi|7507298|pir||T24586 hypothetical protein
T06E4.4 - Caenorhabditis elegans
gi|3879547|emb|CAA94788.1| C. elegans COL-147 protein
(corresponding sequence T06E4.4) [Caenorhabditis
elegans]
Length = 290
Score = 53.1 bits (126), Expect = 2e-06
Identities = 46/148 (31%), Positives = 54/148 (36%), Gaps = 5/148 (3%)
Frame = +2
Query: 47 KPAKNGQRNDTTSTKTTSNAVELPSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAES 226
KP NG T + P+G G + G P + G G A
Sbjct: 111 KPGANGVTIGLTGGN--GPCITCPAGAPGPAGAPGAPGPQGPSGAPGQDAVGGGPGPA-- 166
Query: 227 GRGPQAASGVAPANPNQSGNSGRANGHVDGGARHR-----AAAAGPQPSGTGPNHAAGRP 391
GPQ +G A A P Q+G G GG R R + A GPQ GP
Sbjct: 167 --GPQGPAGDAGA-PGQAGAPGHPGAPGQGGQRSRGTPGPSGAPGPQGPAGGPGQPGQSG 223
Query: 392 ASGKPGPAVAPGANFNGGVLGPRVAAGA 475
+G PGPA APGA G G GA
Sbjct: 224 GAGAPGPAGAPGAPGGPGNAGTPGTPGA 251
Score = 40.0 bits (92), Expect = 0.014
Identities = 35/110 (31%), Positives = 43/110 (38%), Gaps = 4/110 (3%)
Frame = +2
Query: 116 PSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAESGR----GPQAASGVAPANPNQSG 283
P+G + G + + G P A G R G GPQ +G P P QSG
Sbjct: 165 PAGPQGPAGDAGAPGQAGAPGHPGAPGQGGQRSRGTPGPSGAPGPQGPAG-GPGQPGQSG 223
Query: 284 NSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPASGKPGPAVAPGAN 433
+G G A A GP +GT G P G PG A APG +
Sbjct: 224 GAG-----APGPAGAPGAPGGPGNAGT-----PGTP--GAPGNAGAPGGD 261
Score = 33.5 bits (75), Expect = 1.3
Identities = 28/71 (39%), Positives = 30/71 (41%)
Frame = +2
Query: 260 PANPNQSGNSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPASGKPGPAVAPGANFN 439
P P Q G G A GH G + A G TG N +G PGPA APGA
Sbjct: 91 PGPPGQPGAQGEA-GHA--GEAGKPGANGVTIGLTGGNGPCITCPAGAPGPAGAPGA--- 144
Query: 440 GGVLGPRVAAG 472
G GP A G
Sbjct: 145 PGPQGPSGAPG 155
>ref|NP_505484.1| COLlagen structural gene (27.4 kD) (col-146) [Caenorhabditis
elegans] gi|7507300|pir||T24590 hypothetical protein
T06E4.6 - Caenorhabditis elegans
gi|3879551|emb|CAA94792.1| C. elegans COL-146 protein
(corresponding sequence T06E4.6) [Caenorhabditis
elegans]
Length = 290
Score = 53.1 bits (126), Expect = 2e-06
Identities = 47/148 (31%), Positives = 54/148 (35%), Gaps = 5/148 (3%)
Frame = +2
Query: 47 KPAKNGQRNDTTSTKTTSNAVELPSGKRIVTGTSDVVAKRDNGGQPSASSHGAASGRAES 226
KP NG T + P+G G + G P + G G A
Sbjct: 111 KPGANGVTIGLTGGN--GPCITCPAGAPGPAGAPGAPGPQGPSGAPGQDAVGEGPGPA-- 166
Query: 227 GRGPQAASGVAPANPNQSGNSGRANGHVDGGARHR-----AAAAGPQPSGTGPNHAAGRP 391
GPQ +G A A P Q+G G GG R R A A GPQ GP
Sbjct: 167 --GPQGPAGDAGA-PGQAGAPGHPGAPGQGGQRSRGTPGPAGAPGPQGPAGGPGQPGQSG 223
Query: 392 ASGKPGPAVAPGANFNGGVLGPRVAAGA 475
+G PGPA APGA G G GA
Sbjct: 224 GAGAPGPAGAPGAPGGPGQPGQDGQPGA 251
Score = 33.5 bits (75), Expect = 1.3
Identities = 28/71 (39%), Positives = 30/71 (41%)
Frame = +2
Query: 260 PANPNQSGNSGRANGHVDGGARHRAAAAGPQPSGTGPNHAAGRPASGKPGPAVAPGANFN 439
P P Q G G A GH G + A G TG N +G PGPA APGA
Sbjct: 91 PGPPGQPGAQGEA-GHA--GEAGKPGANGVTIGLTGGNGPCITCPAGAPGPAGAPGA--- 144
Query: 440 GGVLGPRVAAG 472
G GP A G
Sbjct: 145 PGPQGPSGAPG 155