Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC000438A_C01 KCC000438A_c01
(578 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
ref|NP_491194.1| COLlagen structural gene (col-50) [Caenorhabdit...    54  2e-06
ref|NP_739264.1| hypothetical protein [Corynebacterium efficiens...    49  5e-05
gb|AAM61027.1| unknown [Arabidopsis thaliana]                          49  5e-05
ref|NP_569011.1| arabinogalactan-protein (AGP7) [Arabidopsis tha...    49  5e-05
ref|NP_630576.1| putative membrane protein [Streptomyces coelico...    49  7e-05
>ref|NP_491194.1| COLlagen structural gene (col-50) [Caenorhabditis elegans]
gi|7508684|pir||T15142 hypothetical protein T28F2.6 -
Caenorhabditis elegans gi|2047346|gb|AAB53052.1|
Collagen protein 50 [Caenorhabditis elegans]
Length = 418
Score = 53.9 bits (128), Expect = 2e-06
Identities = 41/111 (36%), Positives = 48/111 (42%), Gaps = 18/111 (16%)
Frame = +1
Query: 298 PGTIMHTGAY--GADEPATA-AGDHAGGQNVT-QGLPGGLAVPGPAPDPVLAASPRG--- 456
P + GAY G D A A AG + GG + P A P PAP P AA+P G
Sbjct: 301 PPRTLGAGAYPEGGDAAAAAPAGGYDGGAGAAPEAAPAAAAAPQPAPAPAAAAAPAGGYQ 360
Query: 457 ----TSPAAAPPAPTTAQPPAAAPAPT-------LPGVSPSNSPLKKKQRQ 576
AA PPAP A P APAP G SP+ +KK R+
Sbjct: 361 GGAAAGAAAPPPAPAAAAAPEPAPAPAAAPPPAPAAGGSPTGGYRRKKVRR 411
Score = 32.0 bits (71), Expect = 6.4
Identities = 25/75 (33%), Positives = 28/75 (37%), Gaps = 10/75 (13%)
Frame = -1
Query: 542 GDTPGRVGAGAAAGGCAVVGAGGAAAGEVPR------GEAARTG---SGAGPGTASPPGR 390
GD AG GG AA P+ AA G GA G A+PP
Sbjct: 314 GDAAAAAPAGGYDGGAGAAPEAAPAAAAAPQPAPAPAAAAAPAGGYQGGAAAGAAAPPPA 373
Query: 389 PCVTFCP-PA*SPAA 348
P P PA +PAA
Sbjct: 374 PAAAAAPEPAPAPAA 388
Score = 31.6 bits (70), Expect = 8.3
Identities = 29/84 (34%), Positives = 30/84 (35%), Gaps = 5/84 (5%)
Frame = +1
Query: 298 PGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPA----PDPVLAASPRGTSP 465
PG GA G D A G + Q G PG PGPA D SP GT
Sbjct: 209 PGPDGQPGAPGPDGQPGAGGTTSTNQ---PGPPGPAGPPGPAGPAGEDAYAQPSPAGTPG 265
Query: 466 AAAPPAPT-TAQPPAAAPAPTLPG 534
PP A P A AP G
Sbjct: 266 PPGPPGKDGEAGPDGPAGAPGTDG 289
>ref|NP_739264.1| hypothetical protein [Corynebacterium efficiens YS-314]
gi|23494498|dbj|BAC19464.1| hypothetical protein
[Corynebacterium efficiens YS-314]
Length = 609
Score = 48.9 bits (115), Expect = 5e-05
Identities = 37/104 (35%), Positives = 41/104 (38%), Gaps = 7/104 (6%)
Frame = +1
Query: 265 GATRPRPHTMDPGTIMHTGAYGADEP-ATAAGDHAGGQNVTQGLPGGLAVPG------PA 423
GA+ P P P T A GA P AT G PG A PG PA
Sbjct: 152 GASIPTPGAAMPTPGTATPAPGAAAPGATIPGSAVPAPGGAPAAPGAPAAPGAAAPRTPA 211
Query: 424 PDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSNSP 555
P AA P +P +A P P P AAP P LP P +P
Sbjct: 212 PG---AAIPGAVAPGSAVPTPGAISAPGAAPPPGLPAPGPPGAP 252
Score = 39.3 bits (90), Expect = 0.040
Identities = 33/93 (35%), Positives = 37/93 (39%), Gaps = 3/93 (3%)
Frame = +1
Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPG-GLAVPGPAPDPVLA 441
GA P P T P A G+ PA A +P G AVP P A
Sbjct: 96 GAAVPAPATPTP-----PAAPGSAIPAPGAATPTAVPTPGSAIPTPGAAVPAPGVATPSA 150
Query: 442 ASPRGTSPAAAPPAPTTAQPP--AAAPAPTLPG 534
+P AA P P TA P AAAP T+PG
Sbjct: 151 PGASIPTPGAAMPTPGTATPAPGAAAPGATIPG 183
Score = 38.9 bits (89), Expect = 0.052
Identities = 32/83 (38%), Positives = 37/83 (44%), Gaps = 2/83 (2%)
Frame = -1
Query: 536 TPGRV--GAGAAAGGCAVVGAGGAAAGEVPRGEAARTGSGAGPGTASPPGRPCVTFCPPA 363
TPG GAAA G + G+ A G P A G+ A PG A+P T P A
Sbjct: 164 TPGTATPAPGAAAPGATIPGSAVPAPGGAP----AAPGAPAAPGAAAPR-----TPAPGA 214
Query: 362 *SPAAVAGSSAP*APVCIIVPGS 294
P AVA SA P I PG+
Sbjct: 215 AIPGAVAPGSAVPTPGAISAPGA 237
Score = 36.6 bits (83), Expect = 0.26
Identities = 33/84 (39%), Positives = 37/84 (43%), Gaps = 8/84 (9%)
Frame = +1
Query: 298 PGTIMHTGAYGADEPATAA---GDHAGGQNVTQGLPGGLAVPGPAPDPVLAA-----SPR 453
PG GA PA A G A G V PG ++ PG AP P L A +P
Sbjct: 196 PGAPAAPGAAAPRTPAPGAAIPGAVAPGSAVPT--PGAISAPGAAPPPGLPAPGPPGAPG 253
Query: 454 GTSPAAAPPAPTTAQPPAAAPAPT 525
AAP AP + AAAPAPT
Sbjct: 254 APGIPAAPGAPGS----AAAPAPT 273
Score = 35.4 bits (80), Expect = 0.58
Identities = 30/90 (33%), Positives = 38/90 (41%), Gaps = 1/90 (1%)
Frame = +1
Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAA 444
G+ P P P T+ + G PA A A + PG A+P P A
Sbjct: 35 GSAVPAPGGAVPPTVTN-GPTPQAPPAPGAAVPAPATPIPPAAPGS-AIPAPG-----AV 87
Query: 445 SPRGT-SPAAAPPAPTTAQPPAAAPAPTLP 531
+P +P AA PAP T PP AAP +P
Sbjct: 88 TPTAVPTPGAAVPAPATPTPP-AAPGSAIP 116
Score = 35.0 bits (79), Expect = 0.75
Identities = 32/94 (34%), Positives = 38/94 (40%), Gaps = 9/94 (9%)
Frame = +1
Query: 265 GATRPR--------PHTMDPGTIMHT-GAYGADEPATAAGDHAGGQNVTQGLPGGLAVPG 417
GA PR P + PG+ + T GA A A G A G G PG A PG
Sbjct: 203 GAAAPRTPAPGAAIPGAVAPGSAVPTPGAISAPGAAPPPGLPAPGPPGAPGAPGIPAAPG 262
Query: 418 PAPDPVLAASPRGTSPAAAPPAPTTAQPPAAAPA 519
AP A +P +AAP A T P + A
Sbjct: 263 -APGSAAAPAPTSVPRSAAPVAADTDTRPKGSTA 295
Score = 34.3 bits (77), Expect = 1.3
Identities = 31/105 (29%), Positives = 37/105 (34%), Gaps = 10/105 (9%)
Frame = -1
Query: 548 LDGDTPGRVGAGAAAGGCAVVGAGGAAA-----GEVPRGEAARTGSGAGPGTASPPGRPC 384
+DG+ P G AV GGA G P+ A + P T PP P
Sbjct: 19 MDGNQPPNPTTSPPPPGSAVPAPGGAVPPTVTNGPTPQAPPAPGAAVPAPATPIPPAAPG 78
Query: 383 VTF-CPPA*SPAAV----AGSSAP*APVCIIVPGSMV*GLGRVAP 264
P A +P AV A AP P PGS + G P
Sbjct: 79 SAIPAPGAVTPTAVPTPGAAVPAPATPTPPAAPGSAIPAPGAATP 123
Score = 33.5 bits (75), Expect = 2.2
Identities = 29/97 (29%), Positives = 35/97 (35%)
Frame = +1
Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAA 444
GAT P PG A A A A G + + G AVP P A
Sbjct: 178 GATIPGSAVPAPGGAPAAPGAPAAPGAAAPRTPAPGAAIPGAVAPGSAVPTPGAISAPGA 237
Query: 445 SPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSNSP 555
+P PA PP A P AP PG + + +P
Sbjct: 238 APPPGLPAPGPPGAPGA--PGIPAAPGAPGSAAAPAP 272
Score = 33.1 bits (74), Expect = 2.9
Identities = 28/92 (30%), Positives = 35/92 (37%)
Frame = +1
Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAA 444
G+ P P + P + GA PAT A G + G A P P P A
Sbjct: 78 GSAIPAPGAVTPTAVPTPGA-AVPAPATPTPPAAPGSAIPAP---GAATPTAVPTPGSAI 133
Query: 445 SPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVS 540
+P AA PAP A P A + PG +
Sbjct: 134 P----TPGAAVPAPGVATPSAPGASIPTPGAA 161
Score = 32.3 bits (72), Expect = 4.9
Identities = 23/72 (31%), Positives = 27/72 (36%)
Frame = +1
Query: 259 LSGATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVL 438
+ GA P PG I GA A P A G G+P PG A P
Sbjct: 216 IPGAVAPGSAVPTPGAISAPGA--APPPGLPAPGPPGAPGAP-GIPAAPGAPGSAAAPAP 272
Query: 439 AASPRGTSPAAA 474
+ PR +P AA
Sbjct: 273 TSVPRSAAPVAA 284
Score = 31.6 bits (70), Expect = 8.3
Identities = 34/90 (37%), Positives = 38/90 (41%), Gaps = 12/90 (13%)
Frame = -1
Query: 533 PGRVGAGAAAGGCAVVGAGGAAA-----------GEVPRGEAART-GSGAGPGTASPPGR 390
P GA AA G A A GAAA G V G A T G+ + PG A PPG
Sbjct: 187 PAPGGAPAAPGAPA---APGAAAPRTPAPGAAIPGAVAPGSAVPTPGAISAPGAAPPPGL 243
Query: 389 PCVTFCPPA*SPAAVAGSSAP*APVCIIVP 300
P PP +P A +AP AP P
Sbjct: 244 PAPG--PPG-APGAPGIPAAPGAPGSAAAP 270
Score = 31.6 bits (70), Expect = 8.3
Identities = 33/102 (32%), Positives = 39/102 (37%), Gaps = 5/102 (4%)
Frame = +1
Query: 265 GATRPRPHTMDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAA 444
GA P P T P A G+ PA G T G AVP PA P A
Sbjct: 62 GAAVPAPATPIP-----PAAPGSAIPAP------GAVTPTAVPTPGAAVPAPAT-PTPPA 109
Query: 445 SPRGTSPAAAPPAPTTAQPPAAA-PAP----TLPGVSPSNSP 555
+P PA PT P +A P P PGV+ ++P
Sbjct: 110 APGSAIPAPGAATPTAVPTPGSAIPTPGAAVPAPGVATPSAP 151
>gb|AAM61027.1| unknown [Arabidopsis thaliana]
Length = 130
Score = 48.9 bits (115), Expect = 5e-05
Identities = 23/48 (47%), Positives = 27/48 (55%)
Frame = +1
Query: 412 PGPAPDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSNSP 555
P P+P + P T P AA PAPTT PPA +PAPT S + SP
Sbjct: 24 PAPSPTTTVTPPPVATPPPAATPAPTTTPPPAVSPAPTSSPPSSAPSP 71
Score = 40.4 bits (93), Expect = 0.018
Identities = 22/60 (36%), Positives = 34/60 (56%), Gaps = 8/60 (13%)
Frame = +1
Query: 394 PGGLAVPGPA--------PDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSN 549
P +A P PA P P ++ +P + P++A P+P++ P A+ PAP PGVSP +
Sbjct: 34 PPPVATPPPAATPAPTTTPPPAVSPAPTSSPPSSA-PSPSSDAPTASPPAPEGPGVSPGD 92
Score = 33.5 bits (75), Expect = 2.2
Identities = 21/55 (38%), Positives = 25/55 (45%), Gaps = 5/55 (9%)
Frame = +1
Query: 418 PAPD---PVLAASPRGTSPAAAPPAP--TTAQPPAAAPAPTLPGVSPSNSPLKKK 567
PAP P A SP +P A+PPAP P AP P+ P N+ L K
Sbjct: 58 PAPTSSPPSSAPSPSSDAPTASPPAPEGPGVSPGDLAPTPSDASAPPPNAALTNK 112
Score = 33.5 bits (75), Expect = 2.2
Identities = 17/40 (42%), Positives = 20/40 (49%), Gaps = 3/40 (7%)
Frame = +1
Query: 436 LAASPRGTSPAAAPPAPTTAQPPAAAPAPTL---PGVSPS 546
LA +P + P P PPAA PAPT P VSP+
Sbjct: 20 LAQAPAPSPTTTVTPPPVATPPPAATPAPTTTPPPAVSPA 59
>ref|NP_569011.1| arabinogalactan-protein (AGP7) [Arabidopsis thaliana]
gi|9759619|dbj|BAB11561.1| gene_id:MNA5.12~unknown
protein [Arabidopsis thaliana]
gi|15215666|gb|AAK91378.1| AT5g65390/MNA5_12
[Arabidopsis thaliana] gi|20334898|gb|AAM16205.1|
AT5g65390/MNA5_12 [Arabidopsis thaliana]
Length = 130
Score = 48.9 bits (115), Expect = 5e-05
Identities = 23/48 (47%), Positives = 27/48 (55%)
Frame = +1
Query: 412 PGPAPDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSPSNSP 555
P P+P + P T P AA PAPTT PPA +PAPT S + SP
Sbjct: 24 PAPSPTTTVTPPPVATPPPAATPAPTTTPPPAVSPAPTSSPPSSAPSP 71
Score = 40.0 bits (92), Expect = 0.023
Identities = 22/58 (37%), Positives = 33/58 (55%), Gaps = 8/58 (13%)
Frame = +1
Query: 394 PGGLAVPGPA--------PDPVLAASPRGTSPAAAPPAPTTAQPPAAAPAPTLPGVSP 543
P +A P PA P P ++ +P + P++A P+P++ P A+ PAP PGVSP
Sbjct: 34 PPPVATPPPAATPAPTTTPPPAVSPAPTSSPPSSA-PSPSSDAPTASPPAPEGPGVSP 90
Score = 33.9 bits (76), Expect = 1.7
Identities = 21/55 (38%), Positives = 25/55 (45%), Gaps = 5/55 (9%)
Frame = +1
Query: 418 PAPD---PVLAASPRGTSPAAAPPAP--TTAQPPAAAPAPTLPGVSPSNSPLKKK 567
PAP P A SP +P A+PPAP P AP P+ P N+ L K
Sbjct: 58 PAPTSSPPSSAPSPSSDAPTASPPAPEGPGVSPGELAPTPSDASAPPPNAALTNK 112
Score = 33.5 bits (75), Expect = 2.2
Identities = 17/40 (42%), Positives = 20/40 (49%), Gaps = 3/40 (7%)
Frame = +1
Query: 436 LAASPRGTSPAAAPPAPTTAQPPAAAPAPTL---PGVSPS 546
LA +P + P P PPAA PAPT P VSP+
Sbjct: 20 LAQAPAPSPTTTVTPPPVATPPPAATPAPTTTPPPAVSPA 59
>ref|NP_630576.1| putative membrane protein [Streptomyces coelicolor A3(2)]
gi|7480977|pir||T34724 probable membrane protein -
Streptomyces coelicolor gi|3861426|emb|CAA22031.1|
putative membrane protein [Streptomyces coelicolor
A3(2)]
Length = 205
Score = 48.5 bits (114), Expect = 7e-05
Identities = 46/147 (31%), Positives = 54/147 (36%), Gaps = 1/147 (0%)
Frame = +1
Query: 112 YLVSLLVVASALAVILLLMGRHKAPTSRAGRAIVEKEIEARKQSRITDLLSGATRPRPHT 291
Y + + +A LA+ L L G A + A A+V LL GA RP P
Sbjct: 75 YAGAAVALAVGLALALALPGWAAALITAALLAVVAY------------LLRGAARPHPSR 122
Query: 292 MDPGTIMHTGAYGADEPATAAGDHAGGQNVTQGLPGGLAVPGPAPDPVLAASPRGTSPAA 471
P G G D A DH G PGGL VP P PV +P G A
Sbjct: 123 PGPAP----GTAGHDH--VAGHDHVAGGGAPAAAPGGLGVPYPPMPPV---APGGVGGAT 173
Query: 472 APPAPTTAQPPAAAP-APTLPGVSPSN 549
P T P P AP + P N
Sbjct: 174 GAPGAGTPAPGGTGPAAPRQDDLDPEN 200