Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KCC001673A_C01 KCC001673A_c01
(939 letters)
Database: nr
1,537,769 sequences; 498,525,298 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
pir||T44768 antifreeze glycopeptide AFGP polyprotein precursor [... 49 2e-04
ref|ZP_00089621.1| COG0438: Glycosyltransferase [Azotobacter vin... 47 4e-04
ref|XP_310434.1| ENSANGP00000005649 [Anopheles gambiae] gi|21293... 46 0.001
ref|NP_295856.1| conserved hypothetical protein [Deinococcus rad... 45 0.001
pir||T43481 probable mucin DKFZp434C196.1 - human (fragment) gi|... 43 0.007
>pir||T44768 antifreeze glycopeptide AFGP polyprotein precursor [imported] -
Boreogadus saida gi|2078483|gb|AAC60129.1| antifreeze
glycopeptide AFGP polyprotein precursor
Length = 507
Score = 48.5 bits (114), Expect = 2e-04
Identities = 42/153 (27%), Positives = 61/153 (39%), Gaps = 3/153 (1%)
Frame = +1
Query: 295 ASPSGTTLAALLPRLARTTLTSRFPW-SLPA*RPRAARSPSPSSLCCLCFSSPVA--PSR 465
A+P+ AA A T T+ P + A P A +P+ ++ ++ A P+R
Sbjct: 71 ATPATAATAATTAATAATAATAATPARAARAATPATAATPATAATAATAATAATAETPAR 130
Query: 466 ASFTCRSTTPRCCGPPRSAATAAVLPAARTSRRSSAAR*TRTARAAAPPTRKWMEAGLLE 645
A+ + TP P +AATAA + T+ ++ A TA A P A
Sbjct: 131 AATPATAATPATAATPATAATAATAATSATAATAARAATPATAATPATPATAARAARAAT 190
Query: 646 QCERGAAAAAAWRQTGADDSGACCRAVPRAAVR 744
AA AA T A + A A P A R
Sbjct: 191 PATAATAATAATAATAATAATAATAATPARAAR 223
Score = 40.0 bits (92), Expect = 0.057
Identities = 51/184 (27%), Positives = 69/184 (36%), Gaps = 5/184 (2%)
Frame = +1
Query: 262 ARTSTLSMRLPASPSGTTLAALLPRLARTTLTSRFPWSLPA*RPRAARSPSPSSLCCLCF 441
AR +T + A+ T A A T T+ P RAAR+ +P++
Sbjct: 297 ARAATPATPATAATPATPATAATAATAATAATAATP-------ARAARAATPATAATPAT 349
Query: 442 SSPVAPSRASFTCRST-TPRCCGPPRSAATAAVLPAART-SRRSSAAR*TRTARAAAP-- 609
++ A + + T + TP + ATAA A T + ++AA R ARAA P
Sbjct: 350 AATAATAATAATAATAATPARAARAATPATAATAATAATAATAATAATPARAARAATPAT 409
Query: 610 PTRKWMEAGLLEQCERGAAAAAAWRQTGADDSGACCRAVP-RAAVRGWPTERCWRADAAN 786
P A AA AA T A + A P RAA P A A
Sbjct: 410 PATPATPATPATAATAATAATAATAATAATAATAATAPTPARAARAATPATGATPATAPT 469
Query: 787 GGKA 798
G A
Sbjct: 470 AGTA 473
Score = 38.5 bits (88), Expect = 0.16
Identities = 42/166 (25%), Positives = 59/166 (35%), Gaps = 7/166 (4%)
Frame = +1
Query: 262 ARTSTLSMRLPASPSGTTLAALLPRLARTTLTSRFPWSLPA*RPRAARSPSPSSLCCLCF 441
AR + + A+ T A A T T+ P RAAR+ +P++
Sbjct: 333 ARAARAATPATAATPATAATAATAATAATAATAATP-------ARAARAATPATAATAAT 385
Query: 442 SSPVA-------PSRASFTCRSTTPRCCGPPRSAATAAVLPAARTSRRSSAAR*TRTARA 600
++ A P+RA+ TP P + ATAA A T+ ++ A TA
Sbjct: 386 AATAATAATAATPARAARAATPATPATPATPATPATAATAATAATAATAATAATAATAAT 445
Query: 601 AAPPTRKWMEAGLLEQCERGAAAAAAWRQTGADDSGACCRAVPRAA 738
A P R A A A T A + A A P A
Sbjct: 446 APTPAR---AARAATPATGATPATAPTAGTAATAATAATAATPARA 488
Score = 37.0 bits (84), Expect = 0.48
Identities = 50/192 (26%), Positives = 71/192 (36%), Gaps = 9/192 (4%)
Frame = +1
Query: 196 ARMASSTSTWRRRSSTW*RTRRARTSTLSMRLPASPSGTTLAALLPRLARTTLTSRFPWS 375
A ++T+ ++T R RA T + A+P+ AA A +R
Sbjct: 79 AATTAATAATAATAATPARAARAATPATA----ATPATAATAATAATAATAETPARAATP 134
Query: 376 LPA*RPRAARSPSPSSLCCLCFSSPVAPSRASFTCRSTTPRCCGPPRSAATA-----AVL 540
A P A +P+ ++ +S A + A R+ TP P + ATA A
Sbjct: 135 ATAATPATAATPATAATAATAATSATAATAA----RAATPATAATPATPATAARAARAAT 190
Query: 541 PA----ARTSRRSSAAR*TRTARAAAPPTRKWMEAGLLEQCERGAAAAAAWRQTGADDSG 708
PA A T+ ++ A TA AA P R A A AA T A +
Sbjct: 191 PATAATAATAATAATAATAATAATAATPAR---AARAATPATAPTPATAATPATAATAAT 247
Query: 709 ACCRAVPRAAVR 744
A A P A R
Sbjct: 248 APTAATPARAAR 259
Score = 35.8 bits (81), Expect = 1.1
Identities = 48/182 (26%), Positives = 63/182 (34%), Gaps = 3/182 (1%)
Frame = +1
Query: 262 ARTSTLSMRLPASPSGTTLAALLP-RLARTTLTSRFPWSLPA*RPRAARSPS--PSSLCC 432
A +T + A+ + T A P R AR + P A P A + + P++
Sbjct: 195 ATAATAATAATAATAATAATAATPARAARAATPATAPTPATAATPATAATAATAPTAATP 254
Query: 433 LCFSSPVAPSRASFTCRSTTPRCCGPPRSAATAAVLPAARTSRRSSAAR*TRTARAAAPP 612
+ P+ A+ + TP P +AAT A A T R AA A AA P
Sbjct: 255 ARAARAATPATAATLATAATPATPATPATAATDATAATAATPAR--AATPATPATAATPA 312
Query: 613 TRKWMEAGLLEQCERGAAAAAAWRQTGADDSGACCRAVPRAAVRGWPTERCWRADAANGG 792
T AA AA T A + A A P A P A AA
Sbjct: 313 T----------PATAATAATAATAATAATPARAARAATPATAAT--PATAATAATAATAA 360
Query: 793 KA 798
A
Sbjct: 361 TA 362
Score = 35.0 bits (79), Expect = 1.8
Identities = 35/124 (28%), Positives = 44/124 (35%)
Frame = +1
Query: 433 LCFSSPVAPSRASFTCRSTTPRCCGPPRSAATAAVLPAARTSRRSSAAR*TRTARAAAPP 612
L + P A +RA+ + TP P +AATAA A T+ + A TA AA
Sbjct: 23 LLVARPAAAARAATPATAATPATAATPATAATAATEATAATAATPATAATPATAATAAT- 81
Query: 613 TRKWMEAGLLEQCERGAAAAAAWRQTGADDSGACCRAVPRAAVRGWPTERCWRADAANGG 792
AA AA T A + A A P A P A AA
Sbjct: 82 ----------------TAATAATAATAATPARAARAATPATAAT--PATAATAATAATAA 123
Query: 793 KARS 804
A +
Sbjct: 124 TAET 127
Score = 35.0 bits (79), Expect = 1.8
Identities = 51/226 (22%), Positives = 76/226 (33%), Gaps = 1/226 (0%)
Frame = +1
Query: 124 PASLRSLSSSIYTRRLRTCQAWRRARMASSTSTWRRRSSTW*RTRRARTSTLSMRLPASP 303
PA+ + +++ T RA A++ +T ++ A +T + A+
Sbjct: 233 PATAATPATAATAATAPTAATPARAARAATPATAATLATAATPATPATPATAATDATAAT 292
Query: 304 SGTTLAALLPRLARTTLTSRFPWSLPA*RPRAARSPSPSSLCCLCFSSPVAPSRASFTCR 483
+ T A P T T P A + + ++ ++P +RA+
Sbjct: 293 AATPARAATPATPATAATPATP---------ATAATAATAATAATAATPARAARAATPAT 343
Query: 484 STTPRCCGPPRSAATAAVLPAARTSRRSSAAR*TRTARAAAPPTRKWMEAGLLEQCERGA 663
+ TP +AATAA A T R ARAA P T
Sbjct: 344 AATPATAATAATAATAATAATAATP--------ARAARAATPAT----------AATAAT 385
Query: 664 AAAAAWRQTGADDSGACCRAVPRA-AVRGWPTERCWRADAANGGKA 798
AA AA T A + A A P A P A AA A
Sbjct: 386 AATAATAATAATPARAARAATPATPATPATPATPATAATAATAATA 431
>ref|ZP_00089621.1| COG0438: Glycosyltransferase [Azotobacter vinelandii]
Length = 623
Score = 47.4 bits (111), Expect = 4e-04
Identities = 41/127 (32%), Positives = 52/127 (40%), Gaps = 2/127 (1%)
Frame = +2
Query: 200 GWPVLHQRGGGEVLRGEGPGEQEHLLCQ*GYQRAPVARPSRPCSQGWREQR*HHDSPGAS 379
G P + QR G ++R G +RA V RPSR + R HH PGA
Sbjct: 3 GRPPVPQRPGRRLVRRAA-----------GDRRAGVPRPSRKDGRAQRPDN-HHGEPGAH 50
Query: 380 QPEGQGQHAHRRLRRYAACASLRRWHLRAHHLPADPLPRD--AVAHQGAPRPPPCFRRQE 553
+P+ GQH LRR+ +H R H L RD A + G P P P +
Sbjct: 51 RPQQTGQHPRPGLRRHL-------FHHRLLHGDPQRLARDRPADSRGGRPSPHPARHLDQ 103
Query: 554 QAAEAAR 574
QA R
Sbjct: 104 QARRLPR 110
>ref|XP_310434.1| ENSANGP00000005649 [Anopheles gambiae] gi|21293904|gb|EAA06049.1|
ENSANGP00000005649 [Anopheles gambiae str. PEST]
Length = 327
Score = 45.8 bits (107), Expect = 0.001
Identities = 62/222 (27%), Positives = 87/222 (38%), Gaps = 2/222 (0%)
Frame = +1
Query: 160 TRRLRTCQAWRRARMASSTSTWRRRSSTW*RTRRARTSTLSMRLPASPSGTTLAALLPRL 339
T +T + R++ +STS RR+ST RART + S T R
Sbjct: 26 TSAKKTTSSNSRSKKTTSTSNNTRRTSTSVNRTRARTGPNVWTRSTTTSATASRGTPART 85
Query: 340 ARTTLTSRFPWSLPA*RPRAARSPSPSSLCCLCFSSPVAPSRASFTCRSTTPRCCGPPRS 519
AR TS + P A S +P++ C SPV P T RS R P +
Sbjct: 86 ARR--TSMSAKASPVCTVAHACS-APTTACTSRRRSPV-PRDCRCTLRSRF-RTRSRPVT 140
Query: 520 AATAAVLPAARTSRRSSAAR*TRTARAAAPPTRKWMEAGLLEQCERGAAAAAAWRQ--TG 693
+ +A ART+RR+S + AR A R W + + A+ R+ T
Sbjct: 141 SVSACPARWARTARRTSTSARAIRARTA----RAWTASATIRASATRASKGRTARRTLTS 196
Query: 694 ADDSGACCRAVPRAAVRGWPTERCWRADAANGGKARSWRPSA 819
+G C A R W A A G AR+ RP++
Sbjct: 197 VSSTGRACTA------RAWMAATTTSATATGCGAARTARPTS 232
Score = 42.7 bits (99), Expect = 0.009
Identities = 67/241 (27%), Positives = 94/241 (38%), Gaps = 38/241 (15%)
Frame = +1
Query: 130 SLRSLSSSIYTRRLRTCQAWRRARMASSTSTWRRRSSTW*RTRR---ARTST-LSMRLPA 297
S ++ S+S TRR T + R R + + W R ++T R ART+ SM A
Sbjct: 38 SKKTTSTSNNTRRTST--SVNRTRARTGPNVWTRSTTTSATASRGTPARTARRTSMSAKA 95
Query: 298 SPSGTTLAAL--------------LPRLARTTLTSRFPW---------SLPA*RPRAARS 408
SP T A +PR R TL SRF + PA R AR
Sbjct: 96 SPVCTVAHACSAPTTACTSRRRSPVPRDCRCTLRSRFRTRSRPVTSVSACPARWARTARR 155
Query: 409 PSPSSLCCLCFSSPVAPSRASF-----------TCRSTTPRCCGPPRSAATAAVLPAART 555
S S+ ++ + A+ T R T R+ A + A T
Sbjct: 156 TSTSARAIRARTARAWTASATIRASATRASKGRTARRTLTSVSSTGRACTARAWMAATTT 215
Query: 556 SRRSSAAR*TRTARAAAPPTRKWMEAGLLEQCERGAAAAAAWRQTGADDSGACCRAVPRA 735
S ++ RTAR PT + A C A++AAWR + + SG CCRA R+
Sbjct: 216 SATATGCGAARTAR----PTSNTLPA----PCR---ASSAAWRTSSSTGSG-CCRARSRS 263
Query: 736 A 738
+
Sbjct: 264 S 264
>ref|NP_295856.1| conserved hypothetical protein [Deinococcus radiodurans]
gi|7471338|pir||B75310 conserved hypothetical protein -
Deinococcus radiodurans (strain R1)
gi|6459930|gb|AAF11681.1|AE002048_1 conserved
hypothetical protein [Deinococcus radiodurans]
Length = 528
Score = 45.4 bits (106), Expect = 0.001
Identities = 54/152 (35%), Positives = 68/152 (44%), Gaps = 28/152 (18%)
Frame = +1
Query: 379 PA*RPR-AARSPS-------PSSLCCLCFSSPVAPSRASFTCRS------TTPRCCGPPR 516
PA RP + R PS PS+ C S+P RA+ CR+ T+PR CGP
Sbjct: 368 PATRPSPSGRRPSTPVTGWWPSATGCR-LSAPRRCRRATKRCRTATGCGRTSPR-CGPSS 425
Query: 517 SAATAAVLPAARTS-RRSSAAR*TR---------TARAAAP--PTRK--WMEAGLLEQCE 654
+ AA + RTS RR+ A+R +R +A AA P PTRK W G C
Sbjct: 426 GSCRAATRRSPRTSPRRARASRASRPTIPAPAANSASAAPPNSPTRKTNWSTPG---WCP 482
Query: 655 RGAAAAAAWRQTGADDSGACCRAVPRAAVRGW 750
R AA+ + R GA P A RGW
Sbjct: 483 RSAASTPSSRSPGAPPPRVGPGPEPTARRRGW 514
>pir||T43481 probable mucin DKFZp434C196.1 - human (fragment)
gi|6599134|emb|CAB63715.1| hypothetical protein [Homo
sapiens]
Length = 580
Score = 43.1 bits (100), Expect = 0.007
Identities = 50/171 (29%), Positives = 65/171 (37%), Gaps = 4/171 (2%)
Frame = +1
Query: 106 RCACQGPASLRSL----SSSIYTRRLRTCQAWRRARMASSTSTWRRRSSTW*RTRRARTS 273
R + G +S SL S + TR + R MAS T T R S T R + T
Sbjct: 394 RASLTGTSSTASLTRTPSRASLTRTQSSSSLTRTPSMASLTRTPPRASLTRTPPRASLTR 453
Query: 274 TLSMRLPASPSGTTLAALLPRLARTTLTSRFPWSLPA*RPRAARSPSPSSLCCLCFSSPV 453
T P +L PR + T S R R+PS +SL +
Sbjct: 454 T--------PPRASLTRTPPRASLTRTPSMVSLKRSPSRASLTRTPSRASLT-------M 498
Query: 454 APSRASFTCRSTTPRCCGPPRSAATAAVLPAARTSRRSSAAR*TRTARAAA 606
PSRAS T +T G P +A+ P A +R A TRT A+
Sbjct: 499 TPSRASLTRTPSTASLTGTPPTASLTRTPPTASLTRSPPTASLTRTPSTAS 549
Score = 42.0 bits (97), Expect = 0.015
Identities = 56/205 (27%), Positives = 74/205 (35%)
Frame = +1
Query: 139 SLSSSIYTRRLRTCQAWRRARMASSTSTWRRRSSTW*RTRRARTSTLSMRLPASPSGTTL 318
S S + TR RR AS T T R S T +R + T T L
Sbjct: 1 SPSRASLTRTPPRASLMRRPSTASLTRTPSRASPTRMPSRASLKMTPFRASLTKMESTAL 60
Query: 319 AALLPRLARTTLTSRFPWSLPA*RPRAARSPSPSSLCCLCFSSPVAPSRASFTCRSTTPR 498
LPR + +R SL PRA+ + P +SP PSRAS T R
Sbjct: 61 LRTLPRASLMRTPTRA--SLMRTPPRASPTRKPPR------ASPRTPSRASPTRRLPRAS 112
Query: 499 CCGPPRSAATAAVLPAARTSRRSSAAR*TRTARAAAPPTRKWMEAGLLEQCERGAAAAAA 678
G P A+ P A + S A T T +A+P G + A
Sbjct: 113 PMGSPHRASPMRTPPRASPTGTPSTASPTGTPSSASP-------TGTPPRASPTGTPPRA 165
Query: 679 WRQTGADDSGACCRAVPRAAVRGWP 753
W T + + + R RA++ WP
Sbjct: 166 W-ATRSPSTASLTRTPSRASLTRWP 189
Score = 32.7 bits (73), Expect = 9.0
Identities = 42/139 (30%), Positives = 55/139 (39%), Gaps = 6/139 (4%)
Frame = +1
Query: 208 SSTSTWRRRSSTW*RTRRARTSTLSMRLPASPSGTTLAALLPRLART-TLTSRFPWSLPA 384
+S T R S T +R + T T S ASP+ T A L ++ T ++T P + P
Sbjct: 279 ASPRTPPRASPTTTPSRASLTRTPSW---ASPTTTPSRASLMKMESTVSITRTPPRASPT 335
Query: 385 *RPRAARSPSPSSLCCLCFSSPVA-----PSRASFTCRSTTPRCCGPPRSAATAAVLPAA 549
P A S L S A PSRAS + G P A+ P A
Sbjct: 336 GTPSRASPTGTPSRASLTGSPSRASLTGTPSRASLIGTPSRASLIGTPSRASLTGTPPRA 395
Query: 550 RTSRRSSAAR*TRTARAAA 606
+ SS A TRT A+
Sbjct: 396 SLTGTSSTASLTRTPSRAS 414