[1][TOP] >UniRef100_B3NX29 GG19316 n=1 Tax=Drosophila erecta RepID=B3NX29_DROER Length = 906 Score = 65.1 bits (157), Expect = 3e-09 Identities = 60/175 (34%), Positives = 67/175 (38%), Gaps = 18/175 (10%) Frame = +1 Query: 4 QQPP------SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSY-----AP 150 QQPP YG P + GG G ++G P PG Y +G GG P S P Sbjct: 559 QQPPPGPPQSQYGPPPPQNFAGGPPPMG-YAGYPPNPGQYGQAGAGGGPPPSGYWPPPPP 617 Query: 151 SSSASLP-------QGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQM 309 +SSA P Q A G GAPP Y PTS AP Q Sbjct: 618 TSSAQSPYQAYQQQQQAAAGGGAGAPPG--SSYPGGPPTSGAAPPPPPGGAYSTTAPSQT 675 Query: 310 PPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA 474 PPP G A GG T NGP + S P GGG GP+ P GA Sbjct: 676 PPPQGGGG--AGGGNT---------NPNGPNAQQSTPPPQGGAGGGAGPSGPGGA 719 [2][TOP] >UniRef100_UPI0001791D37 PREDICTED: similar to lim domain binding protein n=1 Tax=Acyrthosiphon pisum RepID=UPI0001791D37 Length = 722 Score = 64.7 bits (156), Expect = 4e-09 Identities = 50/157 (31%), Positives = 57/157 (36%), Gaps = 2/157 (1%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPY--APGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAP 207 P S+VGG S G G PG Y G PG H GG P Sbjct: 9 PSSMVGGPSGPGGGGGRRGYGGPGGYGGGPPGHH----------------------GGGP 46 Query: 208 PSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHG 387 P GG GP + G P+ PP+GP PH GG HG P HHG Sbjct: 47 PGHHGGSVLGGPHGGPPGHLGGGVHHSGPSGHHGGPPSGP-PHHGGGGPPGHHGGPPHHG 105 Query: 388 ANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTA 498 GP P + GGG P + G + P S A Sbjct: 106 --GPPGSGPHGGPPHPHGGGGPPHHGAGVPLHPHSGA 140 [3][TOP] >UniRef100_Q9W3G1 CG10555 n=1 Tax=Drosophila melanogaster RepID=Q9W3G1_DROME Length = 926 Score = 64.7 bits (156), Expect = 4e-09 Identities = 59/177 (33%), Positives = 67/177 (37%), Gaps = 20/177 (11%) Frame = +1 Query: 4 QQPP------SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSY-----AP 150 QQPP YG P + GG G ++G P PG Y +G GG P S P Sbjct: 571 QQPPPGPPQSQYGPPPPQNSAGGPPPMG-YAGYPPNPGQYGQAGAGGGPPPSGYWPPPPP 629 Query: 151 
SSSASLP---------QGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPP 303 +SSA P Q A G GAPP Y PTS AP Sbjct: 630 TSSAQSPYQAYQQQQQQQAAAGGGAGAPPG--SSYPGGPPTSGAAPPPPPGGAYSTTAPS 687 Query: 304 QMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA 474 Q PPP G GG + P NGP + S P GGG GP+ P GA Sbjct: 688 QTPPPQGG------GGAGGGNNNP-----NGPNAQQSTPPPQGGAGGGAGPSGPGGA 733 [4][TOP] >UniRef100_A1UQ37 Putative methyl-accepting chemotaxis sensory transducer n=1 Tax=Mycobacterium sp. KMS RepID=A1UQ37_MYCSK Length = 845 Score = 64.3 bits (155), Expect = 5e-09 Identities = 69/200 (34%), Positives = 88/200 (44%), Gaps = 22/200 (11%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSG---PPYAPGVYAGSGPGGHPASSYAPSS--SAS 165 A PP GS GG+S+ GS PP A G+ + +G GG SS + ++ S+S Sbjct: 312 AMTPPMTPVSSGGS--GGASSLGSIGSGFKPPSASGL-SSAGTGGLSPSSLSSNAGLSSS 368 Query: 166 LPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAH 345 LP S GG P + AGG GA G +AT S+ S F + +PPP P P Sbjct: 369 LPSSVSPAS-GGLPSAAAGGGGAPG--AATSSDFSRGFNAGLGTGSVLPPPVAPPPAQPL 425 Query: 346 GGVTAAHGVPRHHG------ANGPASLNS----AALPAYATGGGNGPAYPPGAIVSPAS- 492 T A VP G A GPA + S A PA G P PP A +PA Sbjct: 426 SSTTGASSVPVSAGPAPVSAAGGPAHVASPTPAAGAPAGHMGSMGAPMMPPAA--APAGP 483 Query: 493 TATFN------RLSPAAAAA 534 FN +++PA A A Sbjct: 484 LPPFNSDLQPRQVAPAGAGA 503 [5][TOP] >UniRef100_UPI000186E27B hypothetical protein Phum_PHUM355640 n=1 Tax=Pediculus humanus corporis RepID=UPI000186E27B Length = 844 Score = 63.2 bits (152), Expect = 1e-08 Identities = 65/210 (30%), Positives = 85/210 (40%), Gaps = 37/210 (17%) Frame = +1 Query: 7 QPPSYGSHVPGSVVGGSSAAGSF------SGPPYAPGVYAGSGPGGHPASSYAPSSSASL 168 Q PS S G G S +GSF SGP + G SGP G S PSSS S Sbjct: 551 QGPSGPSGSFGGSQGPSGPSGSFDGSQGPSGPSFGGGNQGPSGPSGSFGGSQGPSSSVSF 610 Query: 169 ---------PQGAHLGSRG--------GAPPSVAGGYGASGPTSATFSNESGSF---QSL 288 P G+ GS+G GAP +G G+ G + S SGSF + Sbjct: 611 GGGNQGPSGPSGSFGGSQGPSGPSGSYGAPQGPSGSTGSFGGSQRPSSPSSGSFGGPGNQ 670 Query: 289 QPAPP--QMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAY 462 
P+PP PP+GPS + GG G G GP+ ++ + A G +GP+ Sbjct: 671 GPSPPSGSYGPPSGPSG--SFGGSQGPSGPSFGGGNQGPSGPSTPSGSYGAPQGSSGPSV 728 Query: 463 P---------PGAIVSPASTATFNRLSPAA 525 P SP ++TF +P A Sbjct: 729 SFVGQQGSRVPVTGGSPGPSSTFGPTTPTA 758 Score = 56.2 bits (134), Expect = 1e-06 Identities = 57/175 (32%), Positives = 72/175 (41%), Gaps = 12/175 (6%) Frame = +1 Query: 16 SYGSHVPGSVVGGS------SAAGSFSGPPYAPGVYAGS-GPGGHPASSYAPSSSASLPQ 174 S G P GGS S G GP G + GS GP G P+ S+ S S P Sbjct: 512 SQGPSGPSGSFGGSQGPSGPSFGGGNQGPSGPSGSFGGSQGPSG-PSGSFGGSQGPSGPS 570 Query: 175 GAHLGSRGGAPPSVAGG-YGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGG 351 G+ GS+G + PS GG G SGP SGSF Q GPS ++ GG Sbjct: 571 GSFDGSQGPSGPSFGGGNQGPSGP--------SGSFGGSQ----------GPSSSVSFGG 612 Query: 352 VTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP----ASTATF 504 G GP+ P+ + GG GP+ P G+ +P ST +F Sbjct: 613 -----------GNQGPSG------PSGSFGGSQGPSGPSGSYGAPQGPSGSTGSF 650 Score = 55.8 bits (133), Expect = 2e-06 Identities = 55/172 (31%), Positives = 68/172 (39%), Gaps = 18/172 (10%) Frame = +1 Query: 13 PSYGSHVP------GSVVGGSSAAGSFSGPPY---APGVYAGS----GPGGHP---ASSY 144 PS+G P GS G + A FSG P +PG S G GG+P +SS+ Sbjct: 66 PSFGPSPPSSRPDFGSQSGSTPAGNGFSGRPSGSSSPGSGYPSAGQGGQGGYPGSSSSSF 125 Query: 145 APSSSASLPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTG 324 P G GS+G +P + GGY + GP TFS+ G P G Sbjct: 126 GPGYQGGSGGGGRPGSQGSSPGTSNGGYPSGGP---TFSSGVGG----SSGPGYQGGAGG 178 Query: 325 PSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAY--ATGGGNGPAYPPGA 474 S GG G GA G + P Y GGG+GP Y GA Sbjct: 179 GSGPGYQGGAGGGSGPGYQGGAGGGSG------PGYQGGAGGGSGPGYQGGA 224 Score = 54.3 bits (129), Expect = 5e-06 Identities = 48/162 (29%), Positives = 61/162 (37%), Gaps = 8/162 (4%) Frame = +1 Query: 13 PSY-GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGP------GGHPASSYAPSSSASLP 171 P Y G GS G AG SGP Y G GSGP GG Y + Sbjct: 194 PGYQGGAGGGSGPGYQGGAGGGSGPGYQGGAGGGSGPGYQGGAGGGSGPGYQGGAGGGSG 253 Query: 172 QGAHLGSRGGAPPSVAGGYGASG-PTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG 348 G G+ GG+ P GG G G P S S + +Q +GPS G Sbjct: 254 
PGYQGGAGGGSGPGFQGGAGGGGRPGSQGGSGGNSGYQG----------GSGPS---FQG 300 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA 474 G +G G+ GP + P + GG+GP + G+ Sbjct: 301 GAGGGNGPSSQGGSGGPGFQGGSGGPGFQ--GGSGPGFQGGS 340 [6][TOP] >UniRef100_UPI0000222BCC Hypothetical protein CBG04553 n=1 Tax=Caenorhabditis briggsae AF16 RepID=UPI0000222BCC Length = 723 Score = 62.8 bits (151), Expect = 2e-08 Identities = 54/179 (30%), Positives = 69/179 (38%), Gaps = 17/179 (9%) Frame = +1 Query: 49 GGSSAAGSFSGPPYAPGV-YAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGG 225 GG ++G ++ PP G YA G GG YA + G + GG S GG Sbjct: 86 GGGGSSGGYAKPPGGGGGGYASGGGGGGGGGGYASGGGGGVSSGGYAKPSGGGGGSSGGG 145 Query: 226 YGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPAS 405 Y + G +S G + PAP P P P+P A G A+ G G++G Sbjct: 146 YSSGGGSSG--GGGGGGYSQSAPAPAAAPAP-APAPAPAPSGGYASSG--GGGGSSGGGY 200 Query: 406 LNSAALPA----------YATGG------GNGPAYPPGAIVSPASTATFNRLSPAAAAA 534 SA PA YA+GG G G Y A P A +PA A A Sbjct: 201 SQSAPAPAPAPAPAPSGGYASGGGAGGSSGGGGGYSQSAPPPPPQPAPAPEPAPAPAPA 259 Score = 59.3 bits (142), Expect = 2e-07 Identities = 47/164 (28%), Positives = 68/164 (41%), Gaps = 1/164 (0%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 A +P + P S G SG Y+ G + G GG + Y+ S+ P A Sbjct: 249 APEPAPAPAPAPSGGYASSGGGGGSSGGGYSSGGGSSGGGGGGSSGGYSQSAPPPPPAPA 308 Query: 181 HLGSRGGAP-PSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVT 357 + AP P+ +GGY +SG S+ G + Q APP P + P+P A G Sbjct: 309 PAPAPAPAPAPAPSGGYASSGGGSS--GGGGGGYS--QSAPPPPAPESAPAPAPAPSGGY 364 Query: 358 AAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPA 489 A+ G G G +S +S + GGG G Y + P+ Sbjct: 365 ASSGGGESSG--GGSSASSGGYASSGGGGGGGGGYASASAPPPS 406 Score = 56.6 bits (135), Expect = 1e-06 Identities = 53/164 (32%), Positives = 63/164 (38%), Gaps = 15/164 (9%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAG---SFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLP 171 A P G + G GGSS G S S PP P P PA + AP+ P Sbjct: 210 APAPAPSGGYASGGGAGGSSGGGGGYSQSAPPPPPQ----PAPAPEPAPAPAPA-----P 260 
Query: 172 QGAHLGSRGGAPPSVAGGYGASGPTS-ATFSNESGSFQSLQPAPPQMPPPT---GPSPHL 339 G + S GG S GGY + G +S SG + P PP P P P+P Sbjct: 261 SGGYASSGGGGG-SSGGGYSSGGGSSGGGGGGSSGGYSQSAPPPPPAPAPAPAPAPAPAP 319 Query: 340 AHGGVTAAHGVPRHHGANG--------PASLNSAALPAYATGGG 447 A G A+ G G G P + SA PA A GG Sbjct: 320 APSGGYASSGGGSSGGGGGGYSQSAPPPPAPESAPAPAPAPSGG 363 Score = 55.8 bits (133), Expect = 2e-06 Identities = 57/201 (28%), Positives = 78/201 (38%), Gaps = 28/201 (13%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSS-------ASLPQ 174 S G + S GG S+ G +S G G G GG+ S+ AP+++ A P Sbjct: 127 SSGGYAKPSGGGGGSSGGGYSS---GGGSSGGGGGGGYSQSAPAPAAAPAPAPAPAPAPS 183 Query: 175 GAHLGSRGG-----------------AP-PSVAGGYGASGPTSATFSNESGSFQSLQPAP 300 G + S GG AP P+ +GGY ASG + S G + P P Sbjct: 184 GGYASSGGGGGSSGGGYSQSAPAPAPAPAPAPSGGY-ASGGGAGGSSGGGGGYSQSAPPP 242 Query: 301 PQMPPPT---GPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPG 471 P P P P+P A G A+ G G + +S + GGG+ Y Sbjct: 243 PPQPAPAPEPAPAPAPAPSGGYASSG---GGGGSSGGGYSSGGGSSGGGGGGSSGGYSQS 299 Query: 472 AIVSPASTATFNRLSPAAAAA 534 A P + A +PA A A Sbjct: 300 APPPPPAPAPAPAPAPAPAPA 320 Score = 55.5 bits (132), Expect = 2e-06 Identities = 47/164 (28%), Positives = 62/164 (37%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 A P G + G GG S+ G G AP P P + AP+ S+ Sbjct: 482 APAPAPSGGYSSGGGGGGGSSGGYSGGSAPAPASEPAPAPAPEPEPAPAPAPSS------ 535 Query: 181 HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTA 360 G G S GG G+SG +S +S GS P PP P P+P A G + Sbjct: 536 --GGYSGGSSSGGGGGGSSGGSSGGYS--GGSAAPPPPPPPAPEPAPAPAPAPAPSGGYS 591 Query: 361 AHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPAS 492 + G G + + PA A+ PA P +PAS Sbjct: 592 SEG---GGGGGSSGGYSGGSAPAPASEPAPAPAPEPEPAPAPAS 632 Score = 53.9 bits (128), Expect = 7e-06 Identities = 48/170 (28%), Positives = 60/170 (35%), Gaps = 7/170 (4%) Frame = +1 Query: 19 YGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG 198 Y S GS GG + PP AP + P P+ YA S G S G Sbjct: 325 
YASSGGGSSGGGGGGYSQSAPPPPAPE--SAPAPAPAPSGGYASSGGGESSGGGSSASSG 382 Query: 199 GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTA------ 360 G S GG G G SA+ SG A PPP P+P A A Sbjct: 383 GYASSGGGGGGGGGYASASAPPPSGGGGGGYSASAAPPPPPPPAPEPAPAPAPAPAPSRG 442 Query: 361 -AHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFN 507 + G G++G S SA PA P P +PA + ++ Sbjct: 443 YSSGGGGGGGSSGGYSGGSAPAPASEPAPAPAPEQAPAPAPAPAPSGGYS 492 [7][TOP] >UniRef100_Q5NT95 Type 1 collagen alpha 2 n=1 Tax=Paralichthys olivaceus RepID=Q5NT95_PAROL Length = 1352 Score = 62.8 bits (151), Expect = 2e-08 Identities = 53/170 (31%), Positives = 65/170 (38%), Gaps = 19/170 (11%) Frame = +1 Query: 22 GSHVPGSVVG--GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSR 195 G+ P G GS +G PG G G ++ AP +G H G Sbjct: 598 GARGPAGAPGPDGSKGEPGITGAAGGPGHQGPGGMPGERGAAGAPGGKGEKGEGGHRGPE 657 Query: 196 GGAPPSVAGGY-GASGPTSATFSN----ESGSFQSLQPAPPQMPP----PTGPSPHLAHG 348 G A A G G +GP T +N ESGSF PA P+ GP+ Sbjct: 658 GNAGRDGARGMPGPAGPPGPTGANGDKGESGSFGPAGPAGPRGASGERGEVGPAGAPGFA 717 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATG--------GGNGPAYPPGA 474 G A G P G GPA + P+ +G G NGPA PPGA Sbjct: 718 GPPGADGQPGARGERGPAGIKGEVGPSGPSGPAGQSGPAGPNGPAGPPGA 767 [8][TOP] >UniRef100_Q16988 Fibroin-4 (Fragment) n=1 Tax=Araneus diadematus RepID=Q16988_ARADI Length = 410 Score = 62.8 bits (151), Expect = 2e-08 Identities = 59/179 (32%), Positives = 77/179 (43%), Gaps = 16/179 (8%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGP-------PYAPGVYAGSGPGGHPASSYAPSSSASL 168 P +YG P S ++AAGS G P PG Y GPGG +S+ A +++AS Sbjct: 27 PVAYGPGGPVSSAAAAAAAGSGPGGYGPENQGPSGPGGY---GPGGSGSSAAAAAAAASG 83 Query: 169 PQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG 348 P G GS+G + P +GGYG G A+ G+ + A P G P Sbjct: 84 PGGYGPGSQGPSGPGGSGGYG-PGSQGASGPGGPGASAAAAAAAAAASGPGGYGP----- 137 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGP---------AYPPGAIVSPASTA 498 G G P +G GP S +AA A + GG GP Y PG S A+ A Sbjct: 138 GSQGPSG-PGAYGPGGPGSSAAAAAAAASGPGGYGPGSQGPSGPGVYGPGGPGSSAAAA 195 Score = 57.8 
bits (138), Expect = 5e-07 Identities = 52/157 (33%), Positives = 69/157 (43%), Gaps = 8/157 (5%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGP-------PYAPGVYAGSGPGGHPASSYAPSSSASL 168 P +YG PGS ++AA S G P PGVY GPG +S+ A +++ S Sbjct: 145 PGAYGPGGPGSSAAAAAAAASGPGGYGPGSQGPSGPGVYGPGGPG---SSAAAAAAAGSG 201 Query: 169 PQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG 348 P G ++G PS GGYG G SGS + A P GP G Sbjct: 202 PGGYGPENQG---PSGPGGYGPGG---------SGSSAAAAAAAASGPGGYGPGSQGPSG 249 Query: 349 -GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGP 456 G + +G P G +GP + +AA A + GG GP Sbjct: 250 PGGSGGYG-PGSQGGSGPGASAAAAAAAASGPGGYGP 285 Score = 57.4 bits (137), Expect = 6e-07 Identities = 48/166 (28%), Positives = 73/166 (43%), Gaps = 13/166 (7%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGP---------PYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 PG+ ++AA + SGP P PG Y GPGG +S+ A +++AS P G Sbjct: 116 PGASAAAAAAAAAASGPGGYGPGSQGPSGPGAY---GPGGPGSSAAAAAAAASGPGGYGP 172 Query: 187 GSRGGAPPSV--AGGYGASGPTSATFSNESGSF--QSLQPAPPQMPPPTGPSPHLAHGGV 354 GS+G + P V GG G+S +A + G + ++ P+ P P G A Sbjct: 173 GSQGPSGPGVYGPGGPGSSAAAAAAAGSGPGGYGPENQGPSGPGGYGPGGSGSSAAAAAA 232 Query: 355 TAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPAS 492 A+ G+ GP+ + + GG+GP A + AS Sbjct: 233 AASGPGGYGPGSQGPSGPGGSGGYGPGSQGGSGPGASAAAAAAAAS 278 Score = 57.0 bits (136), Expect = 8e-07 Identities = 50/169 (29%), Positives = 74/169 (43%), Gaps = 19/169 (11%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGP-------PYAPGVYAGSGPGGHPASSYAPSSSASL 168 P YG PGS ++AAGS G P PG Y GPGG +S+ A +++AS Sbjct: 180 PGVYGPGGPGSSAAAAAAAGSGPGGYGPENQGPSGPGGY---GPGGSGSSAAAAAAAASG 236 Query: 169 PQGAHLGSRGGAPPSVAGGY----------GASGPTSATFSNESGSF--QSLQPAPPQMP 312 P G GS+G + P +GGY GAS +A ++ G + S P+ P Sbjct: 237 PGGYGPGSQGPSGPGGSGGYGPGSQGGSGPGASAAAAAAAASGPGGYGPGSQGPSGPGYQ 296 Query: 313 PPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPA 459 P+GP + +A+ + ++SA ++G NG A Sbjct: 297 GPSGPGAYGPSPSASASVAASVYLRLQPRLEVSSAVSSLVSSGPTNGAA 345 Score = 56.2 bits (134), Expect = 1e-06 
Identities = 58/189 (30%), Positives = 78/189 (41%), Gaps = 32/189 (16%) Frame = +1 Query: 4 QQPPSYGSHVP-GSVVGGSSAAGSFSGP-----------------PYAPGVYAGSGPGGH 129 Q P G + P GS ++AA + SGP Y PG SGPGG Sbjct: 57 QGPSGPGGYGPGGSGSSAAAAAAAASGPGGYGPGSQGPSGPGGSGGYGPGSQGASGPGGP 116 Query: 130 PAS--SYAPSSSASLPQGAHLGSRGGAPPSV--AGGYGASGPTSATFSNESGSF--QSLQ 291 AS + A +++AS P G GS+G + P GG G+S +A ++ G + S Sbjct: 117 GASAAAAAAAAAASGPGGYGPGSQGPSGPGAYGPGGPGSSAAAAAAAASGPGGYGPGSQG 176 Query: 292 PAPPQMPPPTGPSPHLA--------HGGVTAAHGVPRHHGANGPASLNSAALPAYATGGG 447 P+ P + P GP A GG + P G GP S+A A A G Sbjct: 177 PSGPGVYGPGGPGSSAAAAAAAGSGPGGYGPENQGPSGPGGYGPGGSGSSAAAAAAAASG 236 Query: 448 NGPAYPPGA 474 G Y PG+ Sbjct: 237 PG-GYGPGS 244 Score = 53.5 bits (127), Expect = 9e-06 Identities = 51/172 (29%), Positives = 72/172 (41%), Gaps = 2/172 (1%) Frame = +1 Query: 25 SHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGA 204 S + GS G + P P Y GPGG +S+ A +++ S P G ++G Sbjct: 4 SAAAAAAASGSGGYGPENQGPSGPVAY---GPGGPVSSAAAAAAAGSGPGGYGPENQG-- 58 Query: 205 PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG-GVTAAHGVPRH 381 PS GGYG G SGS + A P GP G G + +G P Sbjct: 59 -PSGPGGYGPGG---------SGSSAAAAAAAASGPGGYGPGSQGPSGPGGSGGYG-PGS 107 Query: 382 HGANGPASLNSAALPAYATGGGNGP-AYPPGAIVSPASTATFNRLSPAAAAA 534 GA+GP ++A A A +GP Y PG+ P+ + P ++AA Sbjct: 108 QGASGPGGPGASAAAAAAAAAASGPGGYGPGS-QGPSGPGAYGPGGPGSSAA 158 [9][TOP] >UniRef100_C4XYJ5 Predicted protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4XYJ5_CLAL4 Length = 953 Score = 62.4 bits (150), Expect = 2e-08 Identities = 66/188 (35%), Positives = 77/188 (40%), Gaps = 13/188 (6%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGP-----PYAPGVYAGSGPGGHPASSYAPSSSASLPQ 174 P S GS PGS G S A GS P P +PG SG G P S +PSS + P Sbjct: 616 PGSPGS--PGSP-GASGAPGSPGSPGSPGSPGSPGSPGASGSPGSPGSPGSPSSPSGSPG 672 Query: 175 GAHLGSRGGA-----PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHL 339 S GA P G GASG + S S S S P P P G S Sbjct: 673 SPSSPSSPGASGSPGSPGSPGSPGASGSPGSPGSPGSPSSPSGSPGSPSSPSSPGAS--- 
729 Query: 340 AHGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYP--PGAIVSPASTATFNR 510 G + G P GA+G P S S P+ +G P+ P PGA SP S + Sbjct: 730 ---GSPGSPGSPGSPGASGSPGSPGSPGSPSSPSGSPGSPSSPSSPGASGSPGSPGSPG- 785 Query: 511 LSPAAAAA 534 SP A+ A Sbjct: 786 -SPGASGA 792 Score = 58.2 bits (139), Expect = 4e-07 Identities = 59/180 (32%), Positives = 71/180 (39%), Gaps = 8/180 (4%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFS-GPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 PS S PGS SS S S G P +PG SG G P S +PSS + P Sbjct: 664 PSSPSGSPGSPSSPSSPGASGSPGSPGSPGSPGASGSPGSPGSPGSPSSPSGSPGSPSSP 723 Query: 190 SRGGA-----PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGV 354 S GA P G GASG + S S S S P P P G S G Sbjct: 724 SSPGASGSPGSPGSPGSPGASGSPGSPGSPGSPSSPSGSPGSPSSPSSPGAS------GS 777 Query: 355 TAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYP--PGAIVSPASTATFNRLSPAAA 528 + G P GA+G A G P P PG+ SP S+ + + SP+A+ Sbjct: 778 PGSPGSPGSPGASG------------APGAPGSPGSPGSPGSPSSPGSSESGSPSSPSAS 825 Score = 56.6 bits (135), Expect = 1e-06 Identities = 56/165 (33%), Positives = 70/165 (42%), Gaps = 4/165 (2%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 P S GS PGS SS +GS G P +PG SG G P S +P +S + G Sbjct: 582 PGSPGS--PGSPGSPSSPSGS-PGSPSSPGSPGASGSPGSPGSPGSPGASGAPGSPGSPG 638 Query: 190 SRGG-APPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAH 366 S G P G G+ G + S S S P+ P P +G SP + Sbjct: 639 SPGSPGSPGSPGASGSPGSPGSPGSPSSPSGSPGSPSSPSSPGASG-SP--------GSP 689 Query: 367 GVPRHHGANG-PASLNSAALPAYATGGGNGPAYP--PGAIVSPAS 492 G P GA+G P S S P+ +G P+ P PGA SP S Sbjct: 690 GSPGSPGASGSPGSPGSPGSPSSPSGSPGSPSSPSSPGASGSPGS 734 Score = 55.8 bits (133), Expect = 2e-06 Identities = 56/170 (32%), Positives = 66/170 (38%), Gaps = 15/170 (8%) Frame = +1 Query: 37 GSVVGGSSAAGS--------FSGPPYAPGVYAGSGPGG--HPASSYAPSSSASLPQGAHL 186 G+ G S +GS SG P APG +GP G PA PS A P Sbjct: 362 GNGSGNGSGSGSPGSPGSPGASGAPGAPGAPGPAGPAGPAGPAGPAGPSGPAGSPGSPGS 421 Query: 187 GSRGGAP--PSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTA 360 G+P 
P G GASG + S S S S P P P G S G Sbjct: 422 PGASGSPESPGSPGSPGASGAPGSPGSPGSPSSPSGAPGSPGSPGSPGASGSPGSPGSPG 481 Query: 361 AHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYP--PGAIVSPASTAT 501 + G P GA G P S S P A+G P P PG+ SP S ++ Sbjct: 482 SPGSPGASGAPGSPGSPGSPGSPG-ASGAPGSPGSPGSPGSPGSPGSPSS 530 Score = 55.1 bits (131), Expect = 3e-06 Identities = 62/180 (34%), Positives = 76/180 (42%), Gaps = 5/180 (2%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 P S GS PGS G S A GS G P +PG SG G P S +P S S Sbjct: 477 PGSPGS--PGSP-GASGAPGS-PGSPGSPGSPGASGAPGSPGSPGSPGSPGS-------- 524 Query: 190 SRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMP--PPTGPSPHLAHGGVTAA 363 G+P S +G G+ G A SGS P P P P + SP + G + Sbjct: 525 --PGSPSSPSGSPGSPGSPGA-----SGS-----PGSPGSPGSPGSPSSPGSSESGSPGS 572 Query: 364 HGVPRHHGANG-PASLNSAALPAYATGGGNGPAYP--PGAIVSPASTATFNRLSPAAAAA 534 G P G+ G P S S P+ +G P+ P PGA SP S + SP A+ A Sbjct: 573 PGSPGASGSPGSPGSPGSPGSPSSPSGSPGSPSSPGSPGASGSPGSPGSPG--SPGASGA 630 Score = 53.9 bits (128), Expect = 7e-06 Identities = 63/182 (34%), Positives = 76/182 (41%), Gaps = 21/182 (11%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSG-PG--GHPASSYAPSSSASLPQGA 180 P S GS PGS SS +GS G P +PG G PG G P S +P SS S G+ Sbjct: 516 PGSPGS--PGSPGSPSSPSGS-PGSPGSPGASGSPGSPGSPGSPGSPSSPGSSESGSPGS 572 Query: 181 HLGSRG--------------GAPPSVAGGYGA-SGPTSATFSNESGSFQSLQPAPPQMPP 315 GS G G+P S +G G+ S P S S GS P P P Sbjct: 573 P-GSPGASGSPGSPGSPGSPGSPSSPSGSPGSPSSPGSPGASGSPGS-----PGSPGSPG 626 Query: 316 PTGPSPHLAHGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYP--PGAIVSP 486 +G G + G P GA+G P S S P+ +G P+ P PGA SP Sbjct: 627 ASGAPGSPGSPGSPGSPGSPGSPGASGSPGSPGSPGSPSSPSGSPGSPSSPSSPGASGSP 686 Query: 487 AS 492 S Sbjct: 687 GS 688 Score = 53.5 bits (127), Expect = 9e-06 Identities = 52/157 (33%), Positives = 60/157 (38%), Gaps = 1/157 (0%) Frame = +1 Query: 25 SHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGA 204 S PGS G + S SG P +PG G G P S +P S S GS G Sbjct: 440 
SGAPGSP-GSPGSPSSPSGAPGSPGSPGSPGASGSPGSPGSPGSPGSPGASGAPGSPGS- 497 Query: 205 PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMP-PPTGPSPHLAHGGVTAAHGVPRH 381 P G GASG + S S P P P P+ PS G A G P Sbjct: 498 -PGSPGSPGASGAPGSPGSPGS-------PGSPGSPGSPSSPSGSPGSPGSPGASGSPGS 549 Query: 382 HGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPAS 492 G+ P S S + P + G G PGA SP S Sbjct: 550 PGS--PGSPGSPSSPGSSESGSPGSPGSPGASGSPGS 584 [10][TOP] >UniRef100_Q692G1 Major ampullate spidroin 2 (Fragment) n=1 Tax=Nephila clavipes RepID=Q692G1_NEPCL Length = 332 Score = 62.0 bits (149), Expect = 3e-08 Identities = 64/200 (32%), Positives = 82/200 (41%), Gaps = 23/200 (11%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGP------PYAPGVY--AGSGPGGHPASSYAPSSS 159 Q P YG G GS+AA + +GP Y PG G GPG Y P S+ Sbjct: 36 QGPGGYGPGQQGPSGAGSAAAAAAAGPGQQGLGGYGPGQQGPGGYGPGQQGPGGYGPGSA 95 Query: 160 ASLPQGAHLGSR--GGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAP-PQMPPPTGPS 330 ++ A G + GG P G G ++A + G + Q P P GPS Sbjct: 96 SAAAAAAGPGQQGPGGYGPGQQGPSGPGSASAAAAAAGPGGYGPGQQGPGGYAPGQQGPS 155 Query: 331 -PHLAHGGVTAAHGVPRHHG--ANGPASLNSAALPAYATGGGNGPA------YPPGAIVS 483 P A AA P +G GP+ AA A A GG GPA Y PG+ V+ Sbjct: 156 GPGSAAAAAAAARAGPGGYGPAQQGPSGPGIAASAASAGPGGYGPAQQGPAGYGPGSAVA 215 Query: 484 P---ASTATFNRLSPAAAAA 534 A +A + S A+AAA Sbjct: 216 ASAGAGSAGYGPGSQASAAA 235 Score = 54.3 bits (129), Expect = 5e-06 Identities = 56/192 (29%), Positives = 66/192 (34%), Gaps = 30/192 (15%) Frame = +1 Query: 7 QPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 +P G PGS ++AA G Y PG GPGG+ PS + S A Sbjct: 6 RPGQQGPSGPGSAAAAAAAAAGPGG--YGPGQ---QGPGGYGPGQQGPSGAGSAAAAAAA 60 Query: 187 G----SRGGAPPSVAG--GYGASGPTSATFSNESGSFQSLQPAPPQMPP---------PT 321 G GG P G GYG + S S + P Q P P+ Sbjct: 61 GPGQQGLGGYGPGQQGPGGYGPGQQGPGGYGPGSASAAAAAAGPGQQGPGGYGPGQQGPS 120 Query: 322 GPSPHLAHGGVTAAHG-----------VPRHHGANGPASLNSAALPAYATGGGNGPAYP- 465 GP A G P G +GP S +AA A A GG GPA Sbjct: 121 GPGSASAAAAAAGPGGYGPGQQGPGGYAPGQQGPSGPGSAAAAAAAARAGPGGYGPAQQG 180 Query: 466 ---PGAIVSPAS 492 PG S AS 
Sbjct: 181 PSGPGIAASAAS 192 [11][TOP] >UniRef100_B0F656 Major ampullate spidroin 2 (Fragment) n=1 Tax=Latrodectus geometricus RepID=B0F656_9ARAC Length = 388 Score = 62.0 bits (149), Expect = 3e-08 Identities = 50/174 (28%), Positives = 64/174 (36%), Gaps = 4/174 (2%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGP----PYAPGVYAGSGPGGHPASSYAPSSSASLPQG 177 P G+ + GGS G GP P PG G GPGG A+S A ++++S P G Sbjct: 199 PGGSGAAAAAAATGGSGPGGYGQGPASYAPSGPGGQQGYGPGGSGAASAAAAAASSGPGG 258 Query: 178 AHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVT 357 G+ G G YG SGP + + + P GP A Sbjct: 259 YGPGASG------PGSYGPSGP-----GGSGAAAAAAAASAPGGQQGYGPGGSGAAAAAA 307 Query: 358 AAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSP 519 A P A GP +AA A G G Y PG + A+ A P Sbjct: 308 AGGAGPGSQQAYGPGGSGAAAAAAAGPGSGGQQGYGPGGSAAAAAAAAAGGSGP 361 Score = 57.4 bits (137), Expect = 6e-07 Identities = 53/169 (31%), Positives = 69/169 (40%), Gaps = 6/169 (3%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 P +YG PG G S+AA + + PG GPGG A++ A ++ S P G G Sbjct: 166 PGAYGPSAPG---GPSAAAAAAASGGAGPGRQQSYGPGGSGAAAAAAATGGSGPGGYGQG 222 Query: 190 SRGGAPPSVAG--GYGASGPTSATFSNESGSFQSLQPAPPQMPPPT-GPSPHLAHGGVTA 360 AP G GYG G +A+ + + S P P + GPS G A Sbjct: 223 PASYAPSGPGGQQGYGPGGSGAASAAAAAASSGPGGYGPGASGPGSYGPSGPGGSGAAAA 282 Query: 361 AHGVPRHHGANGPASLNSAALPAYATGG---GNGPAYPPGAIVSPASTA 498 A G G S A A A GG G+ AY PG + A+ A Sbjct: 283 AAAASAPGGQQGYGPGGSGAAAAAAAGGAGPGSQQAYGPGGSGAAAAAA 331 [12][TOP] >UniRef100_Q9BIT8 Major ampullate spidroin 2 (Fragment) n=1 Tax=Latrodectus geometricus RepID=Q9BIT8_9ARAC Length = 399 Score = 61.6 bits (148), Expect = 3e-08 Identities = 53/164 (32%), Positives = 71/164 (43%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 P SYG PG G ++AA + SGP G G GPGG AS+ A +++ G + Sbjct: 3 PGSYGPSGPGGS-GAAAAAAAASGP----GGQQGYGPGGPGASAAAAAAAGGSGPGGY-- 55 Query: 190 SRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 G 
PS GYG SGP + G S A +GP GG Sbjct: 56 ---GQGPS---GYGPSGPGAQQGYGPGGQGGSGAAAAAAAAAGSGP------GGYGPGAA 103 Query: 370 VPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTAT 501 P ++G +GP +AA A A+G G Y PG + A+ A+ Sbjct: 104 GPGNYGPSGPGGSGAAASAAAASGPGGQQGYGPGGSGAAAAAAS 147 Score = 57.0 bits (136), Expect = 8e-07 Identities = 51/154 (33%), Positives = 63/154 (40%), Gaps = 3/154 (1%) Frame = +1 Query: 49 GGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSS-SASLPQGAHLGSRGGAPPSVAGG 225 GGS AA + + PG G GPGG A++ A ++ S P G G G P GG Sbjct: 137 GGSGAAAAAASGGAGPGRQQGYGPGGSGAAAAAAAAXGGSGPGGYGQGPXGYGP----GG 192 Query: 226 YGASGPTSATFSNESGSFQSLQP--APPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGP 399 G SG +A + S P A P P+GP A AA G G Sbjct: 193 QGGSGGAAAAAAAASSGPXGYGPGAAGPGNYGPSGPGGSGAAAAAAAASGPGGQQGYGPG 252 Query: 400 ASLNSAALPAYATGGGNGPAYPPGAIVSPASTAT 501 S SAA A G G AY PG + A+ A+ Sbjct: 253 GSGASAAAAAGGAGXGRQQAYGPGGSGAAAAAAS 286 Score = 56.6 bits (135), Expect = 1e-06 Identities = 60/208 (28%), Positives = 75/208 (36%), Gaps = 43/208 (20%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGH-----PASSYAPS----- 153 Q P YG PG+ G SG A AGSGPGG+ +Y PS Sbjct: 57 QGPSGYGPSGPGAQQGYGPGGQGGSGAAAAAAAAAGSGPGGYGPGAAGPGNYGPSGPGGS 116 Query: 154 ------SSASLPQG-----------AHLGSRGGAPPSVAGGY--GASGPTSATFSNESGS 276 ++AS P G A + GGA P GY G SG +A + GS Sbjct: 117 GAAASAAAASGPGGQQGYGPGGSGAAAAAASGGAGPGRQQGYGPGGSGAAAAAAAAXGGS 176 Query: 277 FQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGV--------------PRHHGANGPASLNS 414 Q P GP GG AA P ++G +GP + Sbjct: 177 GPG---GYGQGPXGYGPGGQGGSGGAAAAAAAASSGPXGYGPGAAGPGNYGPSGPGGSGA 233 Query: 415 AALPAYATGGGNGPAYPPGAIVSPASTA 498 AA A A+G G Y PG + A+ A Sbjct: 234 AAAAAAASGPGGQQGYGPGGSGASAAAA 261 [13][TOP] >UniRef100_B3N0G2 GF21726 n=1 Tax=Drosophila ananassae RepID=B3N0G2_DROAN Length = 947 Score = 61.2 bits (147), Expect = 4e-08 Identities = 59/189 (31%), Positives = 74/189 (39%), Gaps = 31/189 (16%) Frame = +1 Query: 10 PPS--YGSHVPGSVVGGSSAAGSFSG-PPYAPGVYAGSGPGGHPA-SSYAP-----SSSA 162 PP YG P + GG S++G PP G Y +G GG P SY P 
+SSA Sbjct: 597 PPQSQYGPPPPQNTAGGPPPPMSYAGYPPNPVGQYGQAGAGGGPPPGSYGPPPPVPTSSA 656 Query: 163 SLPQGAHLGSRGGAPPSVAGGYGASGPTSATFS---NESGSFQSLQPAPPQMPPP-TGPS 330 P A+ + GGA + G GP ++ G++ S AP Q PPP G + Sbjct: 657 QSPYQAYQTAAGGATGAPPGSSYPGGPPTSVAGPPPPPGGAYSSSTTAPSQTPPPQAGGA 716 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPA----------------- 459 A GG G P NGP + S P GG G A Sbjct: 717 GGGAGGGGAGGSGNP-----NGPNAQQSTPPPQGGAGGAAGGAGGAPQQYAGPPPQQQQQ 771 Query: 460 -YPPGAIVS 483 PPG +VS Sbjct: 772 QQPPGVVVS 780 [14][TOP] >UniRef100_UPI0000E46467 PREDICTED: similar to MGC139263 protein n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E46467 Length = 589 Score = 60.8 bits (146), Expect = 6e-08 Identities = 61/191 (31%), Positives = 74/191 (38%), Gaps = 30/191 (15%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGP---GGHP---ASSYAPSSSASLP 171 PP+ G++ P G AG PP A G G+ P GG P A Y P+ A P Sbjct: 23 PPAPGTYPPAGGAPGYPPAGVPGYPPAAAGYPTGAAPPPAGGQPYGAAPGYPPAGGAGYP 82 Query: 172 QGAHLGSRG---------GAPPSVAGGYGASG--PTSATFSNE---SGSFQSLQPAPP-- 303 G GAPP A GY +G P + + + + + QP P Sbjct: 83 PAPGYGGYPSAQPPAPGYGAPPGGAPGYPPAGGYPAAGGYPGQQPPAAGYPGQQPPPAAG 142 Query: 304 ---QMPPPT----GPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNG-PA 459 Q PPP G P A G A G P+ A A AA AYA GG G P+ Sbjct: 143 YPGQQPPPAAGYPGQQPPPAGYGQPPAAGYPQQPPA---AGYPGAAPAAYAAGGAPGYPS 199 Query: 460 YPPGAIVSPAS 492 P GA P S Sbjct: 200 QPAGAQPPPPS 210 Score = 54.7 bits (130), Expect = 4e-06 Identities = 54/157 (34%), Positives = 62/157 (39%), Gaps = 4/157 (2%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAPGVY--AGSGPGGHPASSYA-PSSSASLPQGAHL 186 SY + P G A G P APG Y AG PG PA P ++A P GA Sbjct: 2 SYPGYPPAGAPGYPPAGQP--GYPPAPGTYPPAGGAPGYPPAGVPGYPPAAAGYPTGAAP 59 Query: 187 GSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQ-PAPPQMPPPTGPSPHLAHGGVTAA 363 GG P A GY +G + G + S Q PAP PP G + GG AA Sbjct: 60 PPAGGQPYGAAPGYPPAGGAGYPPAPGYGGYPSAQPPAPGYGAPPGGAPGYPPAGGYPAA 119 Query: 364 HGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA 474 G P G PA+ P A G G PP A Sbjct: 120 
GGYP---GQQPPAAGYPGQQPPPA-AGYPGQQPPPAA 152 [15][TOP] >UniRef100_B6CM01 Putative uncharacterized protein n=1 Tax=Mycobacterium liflandii 128FXT RepID=B6CM01_9MYCO Length = 795 Score = 60.8 bits (146), Expect = 6e-08 Identities = 59/185 (31%), Positives = 77/185 (41%), Gaps = 14/185 (7%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS--- 192 G V GG +A + GP ++ GV AG+G G P PS+ P A GS Sbjct: 323 GLPVSAPAAGGQAAQAAQLGPAFSRGVSAGAGLGSLP-----PSTGIGTPAAAQTGSAPA 377 Query: 193 ----RGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTG---PSPHLAHGG 351 GG P+ GA+ T + AP M PP G P+ +A GG Sbjct: 378 AGLASGGVAPTGVAAAGATPVTVTPAGAGVATGSGTAHAPAMMLPPPGLGAPAAPVAAGG 437 Query: 352 VTAAHGVPRHHGANGPASLNSAALPAYATGG---GNGPA-YPPGAIVSPASTATFNRLSP 519 AA G A A+ + +A PA TGG G+G A P ++VS +T SP Sbjct: 438 --AAGGAAAVTPAGSSATPSGSAGPAGPTGGSPAGSGAAMVVPASVVSAGTTNRSRAESP 495 Query: 520 AAAAA 534 AAA Sbjct: 496 ELAAA 500 [16][TOP] >UniRef100_Q9BIT9 Major ampullate spidroin 2 (Fragment) n=1 Tax=Latrodectus geometricus RepID=Q9BIT9_9ARAC Length = 373 Score = 60.8 bits (146), Expect = 6e-08 Identities = 54/193 (27%), Positives = 72/193 (37%), Gaps = 27/193 (13%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSS---SASLPQ 174 Q P YG PG+ G SG A AGSGPGG+ + P S S Sbjct: 40 QGPSGYGPSGPGAQQGYGPGGQGGSGAAAAAAAAAGSGPGGYGPGAAGPGSYGPSGPGGS 99 Query: 175 GAHLGSRGGAPPSVAGGYGASGP--TSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG 348 GA + + P GYG GP ++A + GS P P+GP +G Sbjct: 100 GAAAAAAAASGPGGQQGYGPGGPGASAAAAAAAGGSGPGGYGQGPSGYGPSGPGAQQGYG 159 Query: 349 -------GVTAAHGV---------------PRHHGANGPASLNSAALPAYATGGGNGPAY 462 G AA P ++G +GP +AA A A+G G Y Sbjct: 160 PGGQGGSGAAAAAAAAAGSGRGGYGPGAAGPGNYGPSGPGGSGAAASAAAASGPGGQQGY 219 Query: 463 PPGAIVSPASTAT 501 PG + A+ A+ Sbjct: 220 GPGGSGAAAAAAS 232 Score = 58.5 bits (140), Expect = 3e-07 Identities = 46/151 (30%), Positives = 62/151 (41%) Frame = +1 Query: 49 GGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGY 228 GGS AA + + PG G GPGG A++ A +++ G + 
G P GG Sbjct: 222 GGSGAAAAAASGGAGPGRQQGYGPGGSGAAAAAAAAAGGSGPGGYGQGPAGYGPGGQGGS 281 Query: 229 GASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASL 408 G + +A S+ G + A P P+GP A AA G G S Sbjct: 282 GGAAAAAAAASSGPGGY-GPGAAGPGNYGPSGPGGSGAAAAAAAASGPGGQQGYGPGGSG 340 Query: 409 NSAALPAYATGGGNGPAYPPGAIVSPASTAT 501 SAA A G G AY PG + A+ A+ Sbjct: 341 ASAAAAAGGAGPGRQQAYGPGGSGAAAAAAS 371 Score = 56.6 bits (135), Expect = 1e-06 Identities = 62/208 (29%), Positives = 79/208 (37%), Gaps = 43/208 (20%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVG-------GSSAAGSFS----------GPPYA-PGVYAGSGPGGH 129 Q P YG PG+ G GS AA + + GP A PG Y SGPGG Sbjct: 142 QGPSGYGPSGPGAQQGYGPGGQGGSGAAAAAAAAAGSGRGGYGPGAAGPGNYGPSGPGGS 201 Query: 130 PASSYAPSSSASLPQ---------GAHLGSRGGAPPSVAGGY--GASGPTSATFSNESGS 276 A++ A ++S Q A + GGA P GY G SG +A + GS Sbjct: 202 GAAASAAAASGPGGQQGYGPGGSGAAAAAASGGAGPGRQQGYGPGGSGAAAAAAAAAGGS 261 Query: 277 FQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGV--------------PRHHGANGPASLNS 414 Q P GP GG AA P ++G +GP + Sbjct: 262 GPG---GYGQGPAGYGPGGQGGSGGAAAAAAAASSGPGGYGPGAAGPGNYGPSGPGGSGA 318 Query: 415 AALPAYATGGGNGPAYPPGAIVSPASTA 498 AA A A+G G Y PG + A+ A Sbjct: 319 AAAAAAASGPGGQQGYGPGGSGASAAAA 346 [17][TOP] >UniRef100_B4KC52 GI21960 n=1 Tax=Drosophila mojavensis RepID=B4KC52_DROMO Length = 725 Score = 60.8 bits (146), Expect = 6e-08 Identities = 62/192 (32%), Positives = 85/192 (44%), Gaps = 17/192 (8%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPP---YAPGVYAGSGPGGHPAS--------SYAPSS 156 PPS P S S++ SFS P AP A SG G +PA+ S PSS Sbjct: 274 PPSSSYGAPSSSSSSHSSSSSFSAPSSSYSAPSPSANSG-GSYPAAPSKSYGAPSSGPSS 332 Query: 157 SASLPQ-GAHLGSRGGAPPSVAGGYGASGPTS-----ATFSNESGSFQSLQPAPPQMPPP 318 S S P A++G A PS + G +SGP+S + +N GS+ PA P Sbjct: 333 SYSAPSPSANVGGSYPAAPSSSYGAPSSGPSSSYSAPSPSANRGGSY----PAAPS-SSY 387 Query: 319 TGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTA 498 + PSP GG A + P+S SA P+ A GG+ PA P + +P+ A Sbjct: 388 SAPSPGANSGGPYPAAPSSSYGAPAAPSSSYSAPSPS-ANSGGSYPAAPTSSYSAPSPGA 446 Query: 499 TFNRLSPAAAAA 
534 P+A ++ Sbjct: 447 NSGGPYPSAPSS 458 Score = 59.7 bits (143), Expect = 1e-07 Identities = 60/182 (32%), Positives = 83/182 (45%), Gaps = 10/182 (5%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAG---SFSGPPYAPGV-YAGSGPGGHPASSY--APSSSASLPQ- 174 SY + P + VGGS A S+ P P Y+ P + SY APSSS S P Sbjct: 333 SYSAPSPSANVGGSYPAAPSSSYGAPSSGPSSSYSAPSPSANRGGSYPAAPSSSYSAPSP 392 Query: 175 GAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQS--LQPAPPQMPPPTGPSPHLAHG 348 GA+ G G P + + YGA S+++S S S S PA P + PSP G Sbjct: 393 GANSG--GPYPAAPSSSYGAPAAPSSSYSAPSPSANSGGSYPAAPTSSY-SAPSPGANSG 449 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPA-YATGGGNGPAYPPGAIVSPASTATFNRLSPAA 525 G + +GA S NS + P+ A GG+ PA P + +PAS + + +P Sbjct: 450 GPYPS-APSSSYGAPSSGSSNSYSAPSPSANSGGSYPAAPSSSYGAPASAPSSSYSAPNP 508 Query: 526 AA 531 +A Sbjct: 509 SA 510 Score = 55.5 bits (132), Expect = 2e-06 Identities = 55/182 (30%), Positives = 87/182 (47%), Gaps = 12/182 (6%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGS-------GPGGH---PASSYAPSSSAS 165 SYG+ P S G+ + SF P AP G+ G G+ P++ PSSS Sbjct: 202 SYGAPAPPSSSYGAPSVSSFVPLPSAPSTNYGAPSKTQVLGSNGYTSGPSAPAPPSSSYG 261 Query: 166 LPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAH 345 P + R +PPS YGA P+S++ S+ S S S + P P+ S Sbjct: 262 APSSSS-SFRPISPPS--SSYGA--PSSSSSSHSSSSSFSAPSSSYSAPSPSANSGGSYP 316 Query: 346 GGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPAS--TATFNRLSP 519 + ++G P ++GP+S SA P+ A GG+ PA P + +P+S +++++ SP Sbjct: 317 AAPSKSYGAP----SSGPSSSYSAPSPS-ANVGGSYPAAPSSSYGAPSSGPSSSYSAPSP 371 Query: 520 AA 525 +A Sbjct: 372 SA 373 [18][TOP] >UniRef100_UPI0001B53F45 hypothetical protein StAA4_02603 n=1 Tax=Streptomyces sp. 
AA4 RepID=UPI0001B53F45 Length = 1500 Score = 60.5 bits (145), Expect = 8e-08 Identities = 57/173 (32%), Positives = 69/173 (39%), Gaps = 18/173 (10%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 P + G G GGS + G +G P PG G G G P A S A P G Sbjct: 311 PGAGGPGAGGPGAGGSGSGGPGAGGPGGPGTAGGPGAAGGPGGPGAGSPGAGGPSSGGPG 370 Query: 190 SRG-GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAH 366 + G G +V G GA+ P+ N G + P P P GP+ A G AH Sbjct: 371 AGGPGGVGAVGGPGGAAAPSGPGGPNGPGGAGA--PGGPAAGGPGGPNGVGAPGDGFDAH 428 Query: 367 GVPR-----------HHGANG----PASLNSAALPAYATG--GGNGPAYPPGA 474 G HGA G A L +A L A A G GG+GPA PG+ Sbjct: 429 GPASTGPGADSPGSGGHGAAGVAAAAAGLGAAGLGAAALGAAGGSGPADGPGS 481 [19][TOP] >UniRef100_UPI0001797576 PREDICTED: collagen, type XI, alpha 2 n=1 Tax=Equus caballus RepID=UPI0001797576 Length = 1627 Score = 60.5 bits (145), Expect = 8e-08 Identities = 56/178 (31%), Positives = 61/178 (34%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P S H Sbjct: 738 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPSGKDGLPGHP 797 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 798 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 850 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 851 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 908 [20][TOP] >UniRef100_Q636W5 Collagen-like triple helix repeat protein, glycine-rich n=1 Tax=Bacillus cereus E33L RepID=Q636W5_BACCZ Length = 748 Score = 60.5 bits (145), Expect = 8e-08 Identities = 50/154 (32%), Positives = 65/154 (42%), Gaps = 3/154 (1%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGG 201 G+ P G + G+ +GP A G +GP G A A ++ A+ PQGA + Sbjct: 326 GATGPQGAQGPAGVTGA-TGPQGAQGNTGATGPQG--AQGPAGATGATGPQGAQGNTGAT 382 Query: 202 
APPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPP-PTGPSPHLAHGGVTAAHGVPR 378 P + G GA+G T + +G P PP PTGP + G T GV Sbjct: 383 GPQGIQGNTGATGATGIGVTGPTG--------PSGGPPGPTGPQGNTGATGATGPQGVQG 434 Query: 379 HHGANGPASLNSAALPAYATG--GGNGPAYPPGA 474 + GA G PA ATG G GPA GA Sbjct: 435 NTGATGATGPQGVQGPAGATGPQGAQGPAGATGA 468 [21][TOP] >UniRef100_C1ENE5 Collagen triple helix repeat protein n=1 Tax=Bacillus cereus 03BB102 RepID=C1ENE5_BACC3 Length = 1191 Score = 60.5 bits (145), Expect = 8e-08 Identities = 50/156 (32%), Positives = 65/156 (41%), Gaps = 5/156 (3%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGG 201 G+ P G + A G+ +GP A G +GP G A A ++ A+ PQGA + Sbjct: 212 GATGPQGAQGPAGATGA-TGPQGAQGNTGATGPQG--AQGPAGATGATGPQGAQGNTGAT 268 Query: 202 APPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRH 381 P + G GA+G T + +G P PTGP + G T GV + Sbjct: 269 GPQGIQGNTGATGATGIGVTGPTGP-----------PGPTGPQGNTGATGATGPQGVQGN 317 Query: 382 HGANGPASLNSAALPAYATG-----GGNGPAYPPGA 474 GA G A PA ATG G GPA GA Sbjct: 318 TGATGATGPQGAQGPAGATGATGPQGVQGPAGATGA 353 [22][TOP] >UniRef100_Q22260 Protein T06E4.6, confirmed by transcript evidence n=1 Tax=Caenorhabditis elegans RepID=Q22260_CAEEL Length = 290 Score = 60.5 bits (145), Expect = 8e-08 Identities = 47/161 (29%), Positives = 59/161 (36%), Gaps = 2/161 (1%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYG 231 G+ + G +GPP PG G GH + P ++ G +G GG P + G Sbjct: 80 GAQSNGCPAGPPGPPGQPGAQGEAGHAGEAGKPGAN-----GVTIGLTGGNGPCITCPAG 134 Query: 232 ASGPTSATFSN--ESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPAS 405 A GP A + + S Q A + P P GP G G P H GA G Sbjct: 135 APGPAGAPGAPGPQGPSGAPGQDAVGEGPGPAGPQGPAGDAGAPGQAGAPGHPGAPGQGG 194 Query: 406 LNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 S P A G GP P G P + PA A Sbjct: 195 QRSRGTPGPA--GAPGPQGPAGGPGQPGQSGGAGAPGPAGA 233 [23][TOP] >UniRef100_Q0Q5Z0 Tropoelastin 2 n=1 Tax=Danio rerio RepID=Q0Q5Z0_DANRE Length = 2054 Score = 60.1 bits (144), Expect = 1e-07 Identities = 
56/168 (33%), Positives = 70/168 (41%), Gaps = 19/168 (11%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPG--VYAGSGPGGHPASSYAPSSSASLPQGAHLGSR 195 G + PG V G G Y PG G GPGG PA Y P +P G + Sbjct: 1283 GGYRPGGVPAGGYGPGGVPAGGYGPGGVPAGGYGPGGVPAGGYGP---GGVPAGGY--GP 1337 Query: 196 GGAPPSVAGGYGASGPTSATFSNESGSF----QSLQPAPP----------QMPPPTGPSP 333 GG P AGGYG G + F SG++ ++L+ P Q TGP+ Sbjct: 1338 GGVP---AGGYGPGGVPAGGFGPGSGAYPGGAKALKYGPGGSGGIPGLGLQGQVGTGPAG 1394 Query: 334 HLAH--GGVTAAHGVPRHHGANGPASLNSAALPAYATG-GGNGPAYPP 468 L + G A +G+P GA L + ALP TG GG G A P Sbjct: 1395 GLGYGPGSKAAKYGLPGFGGA-----LGTGALPGAGTGAGGYGGAQKP 1437 [24][TOP] >UniRef100_Q4DW77 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4DW77_TRYCR Length = 364 Score = 60.1 bits (144), Expect = 1e-07 Identities = 58/179 (32%), Positives = 73/179 (40%), Gaps = 6/179 (3%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAA-GSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 S G PG V GGS+A+ G SG PG AGS P G + S S G G Sbjct: 104 SAGGPGPGGVAGGSAASSGDSSGAVAPPGASAGSSPDGGSGGGVSSGSGGS--SGTPTGD 161 Query: 193 RGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGV 372 +G S AGG G G + + S PA PPP P+ Sbjct: 162 QGTGDVSSAGGGGGGGSGDGSTGGDGTGSVSSAPAAAPAPPPVSPA-------------- 207 Query: 373 PRHHGANGPA-SLNSAALPAY--ATGGGNGPAYPPGAIVSPAS--TATFNRLSPAAAAA 534 GPA +L S A P G +G A PG+ +S + + T N+ +PAAAAA Sbjct: 208 -------GPAVALPSDAPPGVDPPAGSSDGKAGSPGSNLSDTTGDSQTGNQ-TPAAAAA 258 [25][TOP] >UniRef100_UPI0000F1F788 PREDICTED: similar to Galectin-3 (Galactose-specific lectin 3) (Mac-2 antigen) (IgE-binding protein) (35 kDa lectin) (Carbohydrate-binding protein 35) (CBP 35) (Laminin-binding protein) (Lectin L-29) (L-34 galactoside-binding lectin) n=1 Tax=Danio rerio RepID=UPI0000F1F788 Length = 368 Score = 59.7 bits (143), Expect = 1e-07 Identities = 49/162 (30%), Positives = 58/162 (35%), Gaps = 4/162 (2%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAG--SGPGGHPASSYAPSSSASLP--QGA 180 P PGS G A G F G P APG + G + PGG+P P 
P G Sbjct: 64 PQTWPSAPGSFPPGPGAPGQFPGAPAAPGQFPGAPAAPGGYPPGPGVPGQFPPNPGAPGQ 123 Query: 181 HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTA 360 G PP GG P F + G+ Q P P P P+GP Sbjct: 124 FPSMPGQFPP---GGAPMPYPVPGQFPSPPGAPQGPNPNVPYPPGPSGPG---------- 170 Query: 361 AHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 +G GP + P Y GGG P PPG+ P Sbjct: 171 ------MYGPGGPGAFPPDGGPGY--GGGMFPPVPPGSWGQP 204 [26][TOP] >UniRef100_Q9RKR9 Putative multi-domain regulatory protein n=1 Tax=Streptomyces coelicolor RepID=Q9RKR9_STRCO Length = 1334 Score = 59.7 bits (143), Expect = 1e-07 Identities = 53/154 (34%), Positives = 61/154 (39%), Gaps = 1/154 (0%) Frame = +1 Query: 37 GSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSV 216 G+ G A+G SGP APG G PG PA AP SS + P S Sbjct: 288 GAASGPDPASGPASGPAVAPGSGGGPAPGWWPAPGTAPGSSTAPPHDT---------ASA 338 Query: 217 AGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAH-GGVTAAHGVPRHHGAN 393 A A GPTSA + + + P P T +P A G T A G G Sbjct: 339 ADTAPAPGPTSAPGTAPAAGTAAPAPGTAGPAPGTSYAPGTAPVAGTTPAPGTAPAPGTA 398 Query: 394 GPASLNSAALPAYATGGGNGPAYPPGAIVSPAST 495 GPA S A P A G PA PG +P ST Sbjct: 399 GPARDTSYA-PGTAPVAGTTPA--PGTAPAPGST 429 [27][TOP] >UniRef100_A3Q0W3 Putative uncharacterized protein n=1 Tax=Mycobacterium sp. 
JLS RepID=A3Q0W3_MYCSJ Length = 946 Score = 59.7 bits (143), Expect = 1e-07 Identities = 52/173 (30%), Positives = 68/173 (39%), Gaps = 17/173 (9%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG+ VG S G+ + PP P S G P + AP + P + A + Sbjct: 193 PGAPVGASGGVGAPAAPPAVPAGVVDSSSGVTPPAPAAPPAGVVQPAAGAVPPAPRAVGA 252 Query: 214 VAGGYGASG-------PTSATFSNESGSFQSLQPAPPQ--------MPPPTGPSPHLAHG 348 AGG G +G P +A +G+ PAPP PP P+P A Sbjct: 253 PAGGSGGAGAPAAPPAPPAAVVEPAAGATPPAPPAPPAAVVEPAAGATPPAPPAPPAA-- 310 Query: 349 GVTAAHGV--PRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTAT 501 V A GV P GPA + A+ GP PP A+V PA+ AT Sbjct: 311 VVEPASGVTPPAPPAPGGPAGGSGGAVTP------PGPPAPPAAVVEPAAGAT 357 [28][TOP] >UniRef100_Q283I7 Fibrillar collagen (Fragment) n=1 Tax=Saccoglossus kowalevskii RepID=Q283I7_SACKO Length = 454 Score = 59.7 bits (143), Expect = 1e-07 Identities = 53/163 (32%), Positives = 63/163 (38%), Gaps = 11/163 (6%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-GAPPSVAGGY 228 GS SGP APG +GP G P PS P+GA GSRG P +AG Sbjct: 25 GSPGPAGMSGPMGAPGPSGEAGPQG-PTGDPGPSGPVG-PEGAR-GSRGPSGEPGIAGAP 81 Query: 229 GASGPTSATFSNESGSFQSLQPAPPQMPPP-----TGPSPHLAHGGVTAAHGVPRHHGAN 393 G +G A + F LQ P M P TGP G T G P +G + Sbjct: 82 GDAGIQGARGAKGHRGFPGLQGIPGSMGVPGEDGMTGPPGPNGPRGATGPRGSPGLNGKD 141 Query: 394 GPASLNSAALPAYATG-----GGNGPAYPPGAIVSPASTATFN 507 GP P + G G +GP PPG P F+ Sbjct: 142 GPMGQPGPEGPRGSRGDRGDSGTSGPPGPPGPPGPPGDAQGFD 184 [29][TOP] >UniRef100_Q26052 Alpha collagen type 1 (Fragment) n=1 Tax=Paracentrotus lividus RepID=Q26052_PARLI Length = 730 Score = 59.7 bits (143), Expect = 1e-07 Identities = 56/164 (34%), Positives = 62/164 (37%), Gaps = 10/164 (6%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGS--FSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAH 183 P + G G S A GS GP APG SGPGG S AP S GAH Sbjct: 307 PGAQGPRGEKGDTGASGANGSPGAPGPIGAPGPAGASGPGGDTGSVGAPGPPGS--TGAH 364 Query: 184 LGSRGGA----PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG- 348 GS G A P AG G +GP 
LQ P M P P G Sbjct: 365 -GSTGPAGPAGPAGPAGERGETGPAGHKGHTGVPGLPGLQGTPGPMGEPGAPGEQGQQGT 423 Query: 349 -GVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 G+ A G + G GP + P G GG+GP PPG Sbjct: 424 RGLPGARGSNGNDGPAGPRGFDGPEGPRGPRGESGGSGPPGPPG 467 [30][TOP] >UniRef100_Q206M1 Major ampullate spidroin 2 (Fragment) n=1 Tax=Latrodectus hesperus RepID=Q206M1_9ARAC Length = 1198 Score = 59.7 bits (143), Expect = 1e-07 Identities = 55/176 (31%), Positives = 72/176 (40%), Gaps = 13/176 (7%) Frame = +1 Query: 13 PSYGS---HVPGSVVGGSSAAGSFSGPP---YAPGVYAGSGPGGHPASSYAPSSSASLPQ 174 P YG + PG G ++AA + +GP Y PG SGPGG A++ A ++ S P Sbjct: 103 PGYGGQQGYGPGGA-GAAAAAAAAAGPGPSGYGPGTAGPSGPGGAGAAAAAAAAGGSGPG 161 Query: 175 GAHLGSRGGAPPSVAG----GYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLA 342 G G G P G G G SG +A + G+ Q G P + Sbjct: 162 GYGQGPSGYGPSGSGGQQGFGPGGSGAAAAAAAAAGGAGPGRQ---------QGYGPGSS 212 Query: 343 HGGVTAAHGVPRHHGANGPASLNSAALPAYATGG---GNGPAYPPGAIVSPASTAT 501 AA G P + G G + A A A GG G AY PG + A+ AT Sbjct: 213 GAAAAAAAGGPGYGGQQGYGPGGAGAAAAAAAGGAGPGTQQAYGPGGSGAAAAAAT 268 Score = 55.5 bits (132), Expect = 2e-06 Identities = 54/194 (27%), Positives = 69/194 (35%), Gaps = 22/194 (11%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 P YG G GG+ AA + + PG GPGG A++ A +++ P G G+ Sbjct: 223 PGYGGQ-QGYGPGGAGAAAAAAAGGAGPGTQQAYGPGGSGAAAAAATAAGPGPSGYGPGA 281 Query: 193 RGGAPPSVAG---------------------GYGASGPTSATFSNESGSFQSLQPAPPQM 309 G + P AG GYG SGP GS + A Sbjct: 282 AGPSGPGGAGAAAAAAAAGGSGPGGYGQGPSGYGPSGPGGQQGYGPGGSGAAAAAAAAAG 341 Query: 310 PPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPA-YPPGAIVSP 486 G G AA G P + G G + A A A G GP+ Y PGA S Sbjct: 342 GAGPGRQQGYGQGSSGAAAGGPGYGGQQGYGPGGAGAAAAAAAAAGPGPSGYGPGAAGSS 401 Query: 487 ASTATFNRLSPAAA 528 + AAA Sbjct: 402 GPGGAGAAAAAAAA 415 Score = 55.5 bits (132), Expect = 2e-06 Identities = 56/173 (32%), Positives = 70/173 (40%), Gaps = 11/173 (6%) Frame = +1 Query: 13 
PSYGS---HVPGSVVGGSSAAGSFSGPP---YAPGVYAGSGPGGHPASSYAPSSSASLPQ 174 P YG + PG G ++AA + +GP Y PG SGPGG A++ A ++ S P Sbjct: 363 PGYGGQQGYGPGGA-GAAAAAAAAAGPGPSGYGPGAAGSSGPGGAGAAAAAAAAGGSGPG 421 Query: 175 GAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGG- 351 G G PSV YG SGP GS + A G GG Sbjct: 422 GY------GQGPSV---YGPSGPGGQQGYGPGGSGAAAAAAAAAGGAGPGRQQGYGPGGA 472 Query: 352 -VTAAHGVPRHHGANGPASLNSAALPAYATGG---GNGPAYPPGAIVSPASTA 498 AA G P + G G + A A A GG G AY PG + A+ A Sbjct: 473 AAAAAAGGPGYGGQQGYGPGGAGAAAAAAAGGAGPGRQQAYGPGGSGAAAAAA 525 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/174 (31%), Positives = 73/174 (41%), Gaps = 1/174 (0%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 P YG G GG+ AA + + PG GPGG A++ A +++ S P G + Sbjct: 962 PGYGGQ-QGFGPGGAGAAAAAAAGGAGPGRQQAYGPGGSGAAAAAAAAAGSGPSGYGPSA 1020 Query: 193 RGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGV 372 G PS GG GA+ +A S GSF Q P GPS Sbjct: 1021 AG---PSGPGGSGAAAAAAAGGSG-PGSF-------GQGPTGYGPSG------------- 1056 Query: 373 PRHHGANGPASLNSAALPAYATGGGNGPA-YPPGAIVSPASTATFNRLSPAAAA 531 P GP + +AA A + GG GP+ Y P ++ S A++A SP A Sbjct: 1057 PGGQQGYGPGASGAAAAAAASGSGGYGPSQYVPSSVASSAASAASALSSPTTHA 1110 Score = 54.3 bits (129), Expect = 5e-06 Identities = 54/181 (29%), Positives = 72/181 (39%), Gaps = 18/181 (9%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGP----PYAPGVYAGSGPGGHPASSYAPSSSASLPQG 177 P G+ + GGS G GP P PG G GPGG A++ A +++ G Sbjct: 403 PGGAGAAAAAAAAGGSGPGGYGQGPSVYGPSGPGGQQGYGPGGSGAAAAAAAAAGGAGPG 462 Query: 178 AHLG-SRGGAPPSVAG---------GYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGP 327 G GGA + A GYG G +A + G+ Q A P +G Sbjct: 463 RQQGYGPGGAAAAAAAGGPGYGGQQGYGPGGAGAAAAAAAGGAGPGRQQA--YGPGGSGA 520 Query: 328 SPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGP----AYPPGAIVSPAST 495 + A G + +G GA GP +AA A GG GP AY PG + A+ Sbjct: 521 AAAAAAGSGPSGYG----PGAAGPGGAGAAAA---AAAGGAGPGRQQAYGPGGSGAAAAA 573 Query: 496 A 498 A Sbjct: 574 A 574 [31][TOP] >UniRef100_UPI000023F34A 
hypothetical protein FG00916.1 n=1 Tax=Gibberella zeae PH-1 RepID=UPI000023F34A Length = 1758 Score = 59.3 bits (142), Expect = 2e-07 Identities = 57/192 (29%), Positives = 77/192 (40%), Gaps = 26/192 (13%) Frame = +1 Query: 22 GSHVPG---SVVGGSSAAGSFSGP--PYAPGVYAGSGPGGHPASSYAPSSSASL-PQGAH 183 GS P +V GG S + P PY G + P SS +P+S S P + Sbjct: 1532 GSDTPAGFDTVYGGGSVGFGGTTPMSPYNRGAAS-------PFSSTSPTSPFSYSPTSPN 1584 Query: 184 LGSRGGAPPSVAGGYGASGPTSATFSNESGSFQ----SLQPAPPQMPPPTGPSPHLAHGG 351 +G +P GG G GPTS +FS S SF L+P P P + SP + Sbjct: 1585 MGYSPTSPLIDGGGMGRYGPTSPSFSPSSPSFSPTSPMLRPTSPASPSYSPTSPSYS--- 1641 Query: 352 VTAAHGVPRHHGANGPASLNSAALPAYA---------------TGGGNGPAYPPGA-IVS 483 + PRH+ PA NS P+Y+ GG P+Y P + S Sbjct: 1642 -PTSPSSPRHYSPTSPAQFNSPTSPSYSPASPNYSPTSPNVHGAGGPTSPSYSPASPSWS 1700 Query: 484 PASTATFNRLSP 519 P S ++ SP Sbjct: 1701 PTSPEAYSPTSP 1712 [32][TOP] >UniRef100_A1CEV2 Extracellular threonine rich protein, putative n=1 Tax=Aspergillus clavatus RepID=A1CEV2_ASPCL Length = 893 Score = 59.3 bits (142), Expect = 2e-07 Identities = 61/182 (33%), Positives = 73/182 (40%), Gaps = 6/182 (3%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAG--SFSGPPYAPGVYAGSGPGGH---PASSYAPSSSAS 165 A PP G+ P + G A G +GPP G A +GP G PA++ P ++ Sbjct: 205 ATGPP--GATGPPAATGPPGATGPPGATGPPPETGPPAATGPPGATGPPAATGPPGATG- 261 Query: 166 LPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPP-PTGPSPHLA 342 P GA PP+ G GA+GP +AT G + P PP TGP P Sbjct: 262 -PPGATGPPPETGPPAATGPPGATGPPAAT-----GPPAATGPPGATGPPGATGPPPET- 314 Query: 343 HGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPA 522 G AA G P G G P ATG G PPGA PA T P Sbjct: 315 --GPPAATGPPAATGPPGATGPPPETGPPAATGPPPGATGPPGATGPPAPTGPGAPTCPP 372 Query: 523 AA 528 AA Sbjct: 373 AA 374 Score = 53.9 bits (128), Expect = 7e-06 Identities = 59/175 (33%), Positives = 70/175 (40%), Gaps = 3/175 (1%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGH---PASSYAPSSSASLPQGAH 183 P G+ PG+ G AA +GPP G A +GP G PA++ P ++ P GA Sbjct: 178 
PPAGTGPPGAT--GPPAA---TGPPPETGPPAATGPPGATGPPAATGPPGATG--PPGAT 230 Query: 184 LGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAA 363 PP+ G GA+GP +AT P P TGP P G AA Sbjct: 231 GPPPETGPPAATGPPGATGPPAATG----------PPGATGPPGATGPPPET---GPPAA 277 Query: 364 HGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 G P GA GP AA A G G PPGA P T P AA Sbjct: 278 TGPP---GATGP----PAATGPPAATGPPGATGPPGATGPPPETGPPAATGPPAA 325 [33][TOP] >UniRef100_Q0RG05 Putative serine/threonine-protein kinase n=1 Tax=Frankia alni ACN14a RepID=Q0RG05_FRAAA Length = 933 Score = 58.9 bits (141), Expect = 2e-07 Identities = 50/158 (31%), Positives = 65/158 (41%), Gaps = 2/158 (1%) Frame = +1 Query: 7 QPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 QPP G+ GSV G AAG S P G SG G A AP +A+ Sbjct: 340 QPP--GTAGAGSVTGSEGAAGR-SAPGRFTGSAGASGSGRSVAPHAAPGGAATDAPAGSF 396 Query: 187 GSRGGA--PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTA 360 G R PP AGG +G AT + + S P P + PPP P P G + Sbjct: 397 GGRPATAVPPPTAGGGPPAGAMPATQMSPA-PLASPPPVPSRTPPPGNPPPGGLPPGAVS 455 Query: 361 AHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA 474 V + P++ + ++P A G G+ A P GA Sbjct: 456 PGAVSGSVPSAAPSASSPGSVPPRAQGPGDAYAPPGGA 493 [34][TOP] >UniRef100_C7IJQ8 Collagen triple helix repeat protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IJQ8_9CLOT Length = 466 Score = 58.9 bits (141), Expect = 2e-07 Identities = 53/157 (33%), Positives = 68/157 (43%), Gaps = 10/157 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHP-ASSYAPSSSASLPQGAHLGSRGGAPP 210 P G + A G+ +GP A G +GP G A+ ++ A+ P GA G+ G P Sbjct: 176 PTGATGATGATGA-TGPAGATGATGATGPAGATGATGPVGATGATGPAGA-TGATG--PA 231 Query: 211 SVAGGYGASGPTSAT-FSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHG 387 G GA+GP AT + +G+ + PA TGP+ G T A G G Sbjct: 232 GATGATGATGPAGATGATGPAGATGATGPAGA-----TGPTGPAGATGATGATGATGATG 286 Query: 388 ANGPASLNSAALPAYATG--------GGNGPAYPPGA 474 A GPA A PA ATG G GPA GA Sbjct: 287 ATGPAGATGATGPAGATGATGATGATGATGPAGATGA 323 [35][TOP] >UniRef100_Q692F8 
Major ampullate spidroin 2 (Fragment) n=1 Tax=Nephila clavipes RepID=Q692F8_NEPCL Length = 296 Score = 58.9 bits (141), Expect = 2e-07 Identities = 63/202 (31%), Positives = 82/202 (40%), Gaps = 27/202 (13%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPP------YAPGVYA--GSGPGGHPASSYAPSSSAS 165 P YG G GS+AA + +GP Y PG G GPG Y P S+++ Sbjct: 1 PGGYGPGQQGPSGAGSAAAAAAAGPGQQGLGGYGPGQQGPGGYGPGQQGPGGYGPGSASA 60 Query: 166 LPQGAHLGSR--GGAPPSVAG--GYGASGPTSATFSNESGSFQSLQPAP----PQMPPPT 321 A G + GG P G G G++ +A + G + Q P P P+ Sbjct: 61 AAAAAGPGQQGPGGYGPGQQGPSGPGSASAAAAAAAAGPGGYGPGQQGPGGYAPGQQGPS 120 Query: 322 GPSPHLAHGGVTAAHG--VPRHHGANGPASLNSAALPAYATGGGNGPA------YPPGAI 477 GP A A G P G +GP AA A A GG GPA Y PG+ Sbjct: 121 GPGSAAAAAAAAAGPGGYGPAQQGPSGP---GIAASAASAGPGGYGPAQQGPAGYGPGSA 177 Query: 478 VSP---ASTATFNRLSPAAAAA 534 V+ A +A + S A+AAA Sbjct: 178 VAASAGAGSAGYGPGSQASAAA 199 [36][TOP] >UniRef100_B4Q0C4 GE15779 n=1 Tax=Drosophila yakuba RepID=B4Q0C4_DROYA Length = 920 Score = 58.9 bits (141), Expect = 2e-07 Identities = 55/173 (31%), Positives = 64/173 (36%), Gaps = 17/173 (9%) Frame = +1 Query: 4 QQPP------SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSY-----AP 150 QQPP YG P + GG G ++G P PG Y S GG P S P Sbjct: 567 QQPPPGPPQSQYGPPPPQNSAGGPPPMG-YAGYPPNPGQYGQSAAGGGPPPSGYWPPPPP 625 Query: 151 SSSASLPQGAH------LGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMP 312 +SSA P A+ + GGA Y PTS AP Q P Sbjct: 626 TSSAQSPYQAYQQQQQQAAAAGGAGAPPGSSYPGGPPTSGAAPPPPPGGAYSTTAPSQTP 685 Query: 313 PPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPG 471 PP A GG + P NGP + S P GGG GP+ P G Sbjct: 686 PPQ------AGGGAGGGNTNP-----NGPNAQQSTPPPQGGAGGGAGPSGPGG 727 [37][TOP] >UniRef100_UPI00005A264B PREDICTED: similar to collagen, type XI, alpha 2 isoform 2 preproprotein isoform 2 n=1 Tax=Canis lupus familiaris RepID=UPI00005A264B Length = 1647 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 
GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 758 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 817 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 818 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 870 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 871 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGLGLKGNEGPAGPPGPAGSP 928 [38][TOP] >UniRef100_Q5TJG3 Collagen type XI alpha 2 (Fragment) n=2 Tax=Canis lupus familiaris RepID=Q5TJG3_CANFA Length = 1009 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 120 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 179 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 180 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 232 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 233 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGLGLKGNEGPAGPPGPAGSP 290 [39][TOP] >UniRef100_UPI00004BBB4F PREDICTED: similar to collagen, type XI, alpha 2 isoform 1 preproprotein isoform 1 n=1 Tax=Canis lupus familiaris RepID=UPI00004BBB4F Length = 1733 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 844 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 903 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 904 
GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 956 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 957 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGLGLKGNEGPAGPPGPAGSP 1014 [40][TOP] >UniRef100_UPI0000EB2E40 Collagen type XI alpha 2 n=1 Tax=Canis lupus familiaris RepID=UPI0000EB2E40 Length = 1734 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 845 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 904 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 905 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 957 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 958 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGLGLKGNEGPAGPPGPAGSP 1015 [41][TOP] >UniRef100_UPI0000EB2C0B Collagen type XI alpha 2 n=1 Tax=Canis lupus familiaris RepID=UPI0000EB2C0B Length = 1813 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 907 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 966 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 967 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 1019 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 1020 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGLGLKGNEGPAGPPGPAGSP 1077 [42][TOP] >UniRef100_UPI0000EB2C0A Collagen type XI alpha 2 n=1 Tax=Canis lupus familiaris RepID=UPI0000EB2C0A Length = 1615 Score = 58.5 bits 
(140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 744 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 803 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 804 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 856 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 857 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGLGLKGNEGPAGPPGPAGSP 914 [43][TOP] >UniRef100_UPI000179D0F0 Proline/arginine-rich protein. n=1 Tax=Bos taurus RepID=UPI000179D0F0 Length = 1659 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 770 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 829 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 830 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 882 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 883 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 940 [44][TOP] >UniRef100_UPI000179D0EF Proline/arginine-rich protein. 
n=1 Tax=Bos taurus RepID=UPI000179D0EF Length = 1737 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 848 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 907 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 908 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 960 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 961 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 1018 [45][TOP] >UniRef100_A4RRF2 Predicted protein n=1 Tax=Ostreococcus lucimarinus CCE9901 RepID=A4RRF2_OSTLU Length = 1000 Score = 58.5 bits (140), Expect = 3e-07 Identities = 53/175 (30%), Positives = 74/175 (42%), Gaps = 2/175 (1%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 P ++G+ G G SS G F P AP + G+ P++ AP+SSA P G G Sbjct: 237 PSAFGAPSGGGAFG-SSPTGGFGAPAAAPSPFGGAAT---PSAFGAPASSA--PSGGLFG 290 Query: 190 SRGGAPPSVAGGYGASGPTSATFS--NESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAA 363 S GG+GAS P S+ F + + +F + P P SP A A Sbjct: 291 -------STTGGFGAS-PASSAFGAPSTTSAFGASAPTPGAFGATPSASPFGAAPSTPGA 342 Query: 364 HGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 G P A+ PA+ G G A P A +P+ST F +PA++ Sbjct: 343 FGAP-------------ASTPAFGASGAFGAAPTPSAFGAPSSTPAFG-AAPASS 383 Score = 55.8 bits (133), Expect = 2e-06 Identities = 48/163 (29%), Positives = 67/163 (41%), Gaps = 4/163 (2%) Frame = +1 Query: 55 SSAAGSFSGP----PYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAG 222 +++AG F P P+ GS GG ++ S++ + P GA G GA S G Sbjct: 170 ATSAGGFGAPAATSPFGGTTGGGSAFGGASGGAFGASATPASPFGAPSGGAFGASTSTPG 229 Query: 223 GYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPA 402 G+GAS SA + G P P PSP GG P GA PA Sbjct: 230 
GFGASAAPSAFGAPSGGGAFGSSPTGGFGAPAAAPSP---FGGA----ATPSAFGA--PA 280 Query: 403 SLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAAA 531 S + +T GG G + A +P++T+ F +P A Sbjct: 281 SSAPSGGLFGSTTGGFGASPASSAFGAPSTTSAFGASAPTPGA 323 [46][TOP] >UniRef100_Q5TJG0 Collagen type XI alpha 2 (Fragment) n=1 Tax=Canis lupus familiaris RepID=Q5TJG0_CANFA Length = 1596 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 758 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 817 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 818 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 870 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 871 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGLGLKGNEGPAGPPGPAGSP 928 [47][TOP] >UniRef100_A5D9K7 Collagen type XI alpha 2 n=1 Tax=Sus scrofa RepID=A5D9K7_PIG Length = 1651 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 761 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 820 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 821 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 873 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 874 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 931 [48][TOP] >UniRef100_Q5U0Y6 LD20133p n=1 Tax=Drosophila melanogaster RepID=Q5U0Y6_DROME Length = 840 Score = 58.5 bits (140), Expect = 3e-07 Identities = 53/155 (34%), Positives = 58/155 (37%), Gaps = 
13/155 (8%) Frame = +1 Query: 49 GGSSAAGSFSGPPYAPGVYAGSGPGGHPASSY----APSSSASLP---------QGAHLG 189 G + AA SG Y P AG GP P S Y P+SSA P Q A G Sbjct: 509 GPAGAATGASGHGYQPNAGAGQGP---PPSGYWPPPPPTSSAQSPYQAYQQQQQQQAAAG 565 Query: 190 SRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 GAPP Y PTS AP Q PPP G GG + Sbjct: 566 GGAGAPPG--SSYPGGPPTSGAAPPPPPGGAYSTTAPSQTPPPQGG------GGAGGGNN 617 Query: 370 VPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA 474 P NGP + S P GGG GP+ P GA Sbjct: 618 NP-----NGPNAQQSTPPPQGGAGGGAGPSGPGGA 647 [49][TOP] >UniRef100_B0CPK9 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0CPK9_LACBS Length = 584 Score = 58.5 bits (140), Expect = 3e-07 Identities = 59/193 (30%), Positives = 76/193 (39%), Gaps = 18/193 (9%) Frame = +1 Query: 10 PPSY---GSHVPGSVVG--GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQ 174 PP Y G + P S++ G S G+ GP P GS G P A SS + Sbjct: 189 PPVYSASGPNAPSSILAAPGPSPTGAQGGPAQDPQTPTGSNTPGGPLPPPASSSFPPVNG 248 Query: 175 GAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGV 354 A G++GG P + G++ PT N+ G PA +PP G +P A G Sbjct: 249 PAPTGAQGGPAPYSSPPTGSNAPTDP--ENQGGPV--APPASLTLPPVNGSAPTGAQGDP 304 Query: 355 TAAHGVP---------RHHGANGPASLNSAALPAYA---TGGGNGPA-YPPGAIVSPAST 495 T P + GA P +S+ P TGG GPA Y P S A T Sbjct: 305 TPNSPPPSGGDAPTDSKESGARPPPPASSSLPPVNGPAPTGGQGGPAPYSPPHTDSNAPT 364 Query: 496 ATFNRLSPAAAAA 534 N+ P A A Sbjct: 365 ELKNQGGPVAPPA 377 [50][TOP] >UniRef100_Q32S24 Collagen alpha-2(XI) chain n=1 Tax=Bos taurus RepID=COBA2_BOVIN Length = 1736 Score = 58.5 bits (140), Expect = 3e-07 Identities = 55/178 (30%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 847 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 906 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G + Sbjct: 907 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLTGT 959 Query: 
337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 960 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 1017 [51][TOP] >UniRef100_Q72Z02 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus ATCC 10987 RepID=Q72Z02_BACC1 Length = 1321 Score = 58.2 bits (139), Expect = 4e-07 Identities = 47/154 (30%), Positives = 62/154 (40%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P V G + G GPP G +GPGG P+ S ++ A+ GA + Sbjct: 192 PTGVTGPTGITGPSGGPPGPTGPTGATGPGGGPSGS-TGATGATGNTGATGSTGVTGSTG 250 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G++GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 251 VTGATGSTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 310 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G G GP + A P ATG G GP Sbjct: 311 IQGATGATGDQGPQGIQGAIGPQGATGATGDQGP 344 [52][TOP] >UniRef100_A1UCV0 Putative uncharacterized protein n=2 Tax=Mycobacterium RepID=A1UCV0_MYCSK Length = 816 Score = 58.2 bits (139), Expect = 4e-07 Identities = 51/161 (31%), Positives = 67/161 (41%), Gaps = 14/161 (8%) Frame = +1 Query: 49 GGSSAAGSFSGPPYAPGVYAGSG-PGGHPAS---SYAPSSSASLPQGAHLGSRGGAPPSV 216 GG GS G P PG GSG G +P S P ++ LP + + GGA P Sbjct: 288 GGGGGLGSGGGVPKMPGGLGGSGLSGSNPLSGGVGQMPGAAGWLPNSGAVSAAGGASPLS 347 Query: 217 AGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGG-----VTAAHG---- 369 + A AT + G S QP P P PSP L+ GG V+AA G Sbjct: 348 S----AFNQGMATTAGMGGGIPSTQP-----PAPASPSPALSAGGGHAAPVSAAPGGGVS 398 Query: 370 -VPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPA 489 G PA+ ++ ++GGG PPG++ PA Sbjct: 399 PAAAQPGMVAPAAPSALTGTGVSSGGGAPMMLPPGSMGPPA 439 [53][TOP] >UniRef100_C3CQG2 Collagen triple helix repeat domain protein n=2 Tax=Bacillus thuringiensis RepID=C3CQG2_BACTU Length = 1225 Score = 58.2 bits (139), Expect = 4e-07 Identities = 49/154 (31%), Positives = 64/154 (41%), Gaps = 13/154 (8%) Frame = +1 Query: 34 
PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S ++ A+ GA GS G Sbjct: 177 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGS-TGATGATGSTGA-TGSTG----- 229 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G +GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 230 VTGATGTTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGITGEQGIQGVQG 289 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G+ G GP + A P ATG G GP Sbjct: 290 IQGIMGATGDQGPQGIQGAIGPQGATGATGDQGP 323 [54][TOP] >UniRef100_Q868B4 Protein ZK643.8, partially confirmed by transcript evidence n=1 Tax=Caenorhabditis elegans RepID=Q868B4_CAEEL Length = 774 Score = 58.2 bits (139), Expect = 4e-07 Identities = 46/148 (31%), Positives = 55/148 (37%), Gaps = 3/148 (2%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG- 189 PS G G GGSS G ++ P G YA SG GG + SS G G Sbjct: 206 PSGGGGCGG---GGSSGGGGYASAPSGGGGYATSGGGGSGGYATGGSSGGGYSSGGSSGG 262 Query: 190 --SRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAA 363 S GG GG G G + + G S A P PPP P P A V++ Sbjct: 263 GYSTGGGGGYAGGGGGGGGSSGGYAGSSGGGGYSAPAAAPPPPPPPPPPP--APAPVSSG 320 Query: 364 HGVPRHHGANGPASLNSAALPAYATGGG 447 G G S S A ++ GG Sbjct: 321 GGYSEQSSGGGGGSSYSGGGEASSSSGG 348 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/176 (30%), Positives = 72/176 (40%), Gaps = 3/176 (1%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSR 195 S G S GG S+ G S + G Y+G G +SS + SS G G+ Sbjct: 365 SSGGDSSSSSGGGYSSGGDSSSSSSSSGGYSG---GSDSSSSSSSSSGGYSSGGGDAGAS 421 Query: 196 GGAPPSVAGGYGASGPT--SATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 G S AGGY S + A+ SG PAP P +G + AA Sbjct: 422 SGGESSSAGGYSGSSSSGGEASSGGYSGGSSEPAPAPEAAPASSGGYSGGSEAAPEAAPA 481 Query: 370 VPRHHGANGPASLNSAALPAYATGGGNG-PAYPPGAIVSPASTATFNRLSPAAAAA 534 P G +G + AA PA +GG +G A P A +P+ + + +P AA A Sbjct: 482 AP-SGGYSGSEAAPEAA-PAAPSGGYSGSEAAPEAAPAAPSGGYSGSEAAPEAAPA 535 Score = 53.9 bits (128), Expect = 7e-06 Identities = 48/175 
(27%), Positives = 68/175 (38%), Gaps = 17/175 (9%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAA---------GSFSGPPYAPGVYAGSGPGGHPASSYAPS 153 ++ P P GS AA G +SG AP + GG+ S AP Sbjct: 508 SEAAPEAAPAAPSGGYSGSEAAPEAAPAAPSGGYSGSEAAPEAAPAAPSGGYSGSEAAPE 567 Query: 154 SSASLPQGAHLGSRGGAPPSV-----AGGYGASG---PTSATFSNESGSFQSLQPAPPQM 309 ++ + P G + GS AP + +GGY G ++A SN SG ++ APP Sbjct: 568 AAPAAPSGGYSGSESSAPAAPEPAPSSGGYSGGGGDAGSAAGGSNYSGGGETAPAAPPPA 627 Query: 310 PPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA 474 P P + GA G ++AA PA + GG +G GA Sbjct: 628 PEP-----------------AQTYSGAGGE---SAAAAPAPSGGGYSGSGGAGGA 662 [55][TOP] >UniRef100_Q22256 Protein T06E4.4, confirmed by transcript evidence n=1 Tax=Caenorhabditis elegans RepID=Q22256_CAEEL Length = 290 Score = 58.2 bits (139), Expect = 4e-07 Identities = 46/161 (28%), Positives = 57/161 (35%), Gaps = 2/161 (1%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYG 231 G+ + G +GPP PG G GH + P ++ G +G GG P + G Sbjct: 80 GAQSNGCPAGPPGPPGQPGAQGEAGHAGEAGKPGAN-----GVTIGLTGGNGPCITCPAG 134 Query: 232 ASGPTSATFSN--ESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPAS 405 A GP A + + S Q A P P GP G G P H GA G Sbjct: 135 APGPAGAPGAPGPQGPSGAPGQDAVGGGPGPAGPQGPAGDAGAPGQAGAPGHPGAPGQGG 194 Query: 406 LNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 S P G GP P G P + PA A Sbjct: 195 QRSRGTP--GPSGAPGPQGPAGGPGQPGQSGGAGAPGPAGA 233 [56][TOP] >UniRef100_Q20739 Protein F54B11.2, partially confirmed by transcript evidence n=1 Tax=Caenorhabditis elegans RepID=Q20739_CAEEL Length = 304 Score = 58.2 bits (139), Expect = 4e-07 Identities = 54/169 (31%), Positives = 65/169 (38%), Gaps = 10/169 (5%) Frame = +1 Query: 52 GSSAAGSFSG--PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSV--- 216 G+S+ G G P PGV G G P AP + + +GA + PP Sbjct: 95 GASSGGQCEGCCNPGPPGVAGNPGKPGKPGKPGAPGNPGAPGKGAAVPCEAKTPPPCKPC 154 Query: 217 -AGGYGASGPTS----ATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRH 381 AG G GP A E+GS PA P P P GPS G A G P Sbjct: 155 
PAGPPGPPGPDGPAGPAGPDGEAGS-----PAAPSPPGPPGPSGPAGPAGNDGAAGTP-- 207 Query: 382 HGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 G +GPA ++ PA GPA PPG P +P AA Sbjct: 208 -GPDGPAGESTYPEPA-----APGPAGPPGPAGPPGPDGASPTAAPGAA 250 [57][TOP] >UniRef100_C5DX72 ZYRO0F02728p n=1 Tax=Zygosaccharomyces rouxii CBS 732 RepID=C5DX72_ZYGRC Length = 2302 Score = 58.2 bits (139), Expect = 4e-07 Identities = 56/163 (34%), Positives = 74/163 (45%), Gaps = 2/163 (1%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFS-GPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 PPS S S G SSA G+ S G P AP +GSG G +S A +S+S G+ Sbjct: 792 PPSTSSSA-SSTSGSSSAPGTSSTGSPSAP---SGSGNSGASGASGASGASSSEASGSGN 847 Query: 187 GSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG-GVTAA 363 + GA +G GASG +SA S SG+ + AP +G S A G G +A Sbjct: 848 SATSGA-SGASGASGASGASSAPSSGASGASGASSSAPTS---TSGASSSEASGSGNSAT 903 Query: 364 HGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPAS 492 G GA+G +S S+ + + P GA S AS Sbjct: 904 SGASGASGASGASSAPSSGASGASGASSSAPTSTSGASSSEAS 946 Score = 53.5 bits (127), Expect = 9e-06 Identities = 48/170 (28%), Positives = 71/170 (41%), Gaps = 5/170 (2%) Frame = +1 Query: 40 SVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVA 219 S G+S A SG A + G ASS AP+S++ GS A + Sbjct: 848 SATSGASGASGASGASGASSAPSSGASGASGASSSAPTSTSGASSSEASGSGNSATSGAS 907 Query: 220 GGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGP----SPHLAHGGVTAAHGVPRHHG 387 G GASG +SA S SG+ + AP + S + A G+ +A P Sbjct: 908 GASGASGASSAPSSGASGASGASSSAPTSTSGASSSEASGSGNSATSGIVSASSAP---S 964 Query: 388 ANGPASLNSAALPAYATG-GGNGPAYPPGAIVSPASTATFNRLSPAAAAA 534 NG ++ + A+ + A+G + P GA S AS + + S A A+ Sbjct: 965 GNGNSATSGASGASGASGASSSAPTSTSGASSSEASGSGNSATSGATGAS 1014 [58][TOP] >UniRef100_UPI00005029C8 Procollagen, type XI, alpha 2. 
n=1 Tax=Rattus norvegicus RepID=UPI00005029C8 Length = 1629 Score = 57.8 bits (138), Expect = 5e-07 Identities = 55/179 (30%), Positives = 61/179 (34%), Gaps = 24/179 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P + H Sbjct: 740 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPAGKDGLPGHP 799 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSF-QSLQPAPPQMPPPTGPSP 333 G RG PP V G GA+G ESG + P PP P G Sbjct: 800 GQRGEVGFQGKTGPPGPPGVVGPQGAAG--------ESGPMGERGHPGPPGPPGEQGLPG 851 Query: 334 HLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 852 TAGKDGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 910 [59][TOP] >UniRef100_UPI00005029C7 Procollagen, type XI, alpha 2. n=1 Tax=Rattus norvegicus RepID=UPI00005029C7 Length = 1650 Score = 57.8 bits (138), Expect = 5e-07 Identities = 55/179 (30%), Positives = 61/179 (34%), Gaps = 24/179 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P + H Sbjct: 761 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPAGKDGLPGHP 820 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSF-QSLQPAPPQMPPPTGPSP 333 G RG PP V G GA+G ESG + P PP P G Sbjct: 821 GQRGEVGFQGKTGPPGPPGVVGPQGAAG--------ESGPMGERGHPGPPGPPGEQGLPG 872 Query: 334 HLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 873 TAGKDGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 931 [60][TOP] >UniRef100_UPI00005029C6 Procollagen, type XI, alpha 2. 
n=1 Tax=Rattus norvegicus RepID=UPI00005029C6 Length = 1655 Score = 57.8 bits (138), Expect = 5e-07 Identities = 55/179 (30%), Positives = 61/179 (34%), Gaps = 24/179 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P + H Sbjct: 766 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPAGKDGLPGHP 825 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSF-QSLQPAPPQMPPPTGPSP 333 G RG PP V G GA+G ESG + P PP P G Sbjct: 826 GQRGEVGFQGKTGPPGPPGVVGPQGAAG--------ESGPMGERGHPGPPGPPGEQGLPG 877 Query: 334 HLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 878 TAGKDGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 936 [61][TOP] >UniRef100_UPI00005029C5 Procollagen, type XI, alpha 2. n=1 Tax=Rattus norvegicus RepID=UPI00005029C5 Length = 1689 Score = 57.8 bits (138), Expect = 5e-07 Identities = 55/179 (30%), Positives = 61/179 (34%), Gaps = 24/179 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P + H Sbjct: 800 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPAGKDGLPGHP 859 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSF-QSLQPAPPQMPPPTGPSP 333 G RG PP V G GA+G ESG + P PP P G Sbjct: 860 GQRGEVGFQGKTGPPGPPGVVGPQGAAG--------ESGPMGERGHPGPPGPPGEQGLPG 911 Query: 334 HLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 912 TAGKDGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 970 [62][TOP] >UniRef100_UPI00005029C4 Procollagen, type XI, alpha 2. 
n=1 Tax=Rattus norvegicus RepID=UPI00005029C4 Length = 1710 Score = 57.8 bits (138), Expect = 5e-07 Identities = 55/179 (30%), Positives = 61/179 (34%), Gaps = 24/179 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P + H Sbjct: 821 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPAGKDGLPGHP 880 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSF-QSLQPAPPQMPPPTGPSP 333 G RG PP V G GA+G ESG + P PP P G Sbjct: 881 GQRGEVGFQGKTGPPGPPGVVGPQGAAG--------ESGPMGERGHPGPPGPPGEQGLPG 932 Query: 334 HLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 933 TAGKDGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 991 [63][TOP] >UniRef100_UPI00005029C3 Procollagen, type XI, alpha 2. n=1 Tax=Rattus norvegicus RepID=UPI00005029C3 Length = 1715 Score = 57.8 bits (138), Expect = 5e-07 Identities = 55/179 (30%), Positives = 61/179 (34%), Gaps = 24/179 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P + H Sbjct: 826 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPAGKDGLPGHP 885 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSF-QSLQPAPPQMPPPTGPSP 333 G RG PP V G GA+G ESG + P PP P G Sbjct: 886 GQRGEVGFQGKTGPPGPPGVVGPQGAAG--------ESGPMGERGHPGPPGPPGEQGLPG 937 Query: 334 HLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 938 TAGKDGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 996 [64][TOP] >UniRef100_UPI00005029C2 Procollagen, type XI, alpha 2. 
n=1 Tax=Rattus norvegicus RepID=UPI00005029C2 Length = 1736 Score = 57.8 bits (138), Expect = 5e-07 Identities = 55/179 (30%), Positives = 61/179 (34%), Gaps = 24/179 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P + H Sbjct: 847 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPAGKDGLPGHP 906 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSF-QSLQPAPPQMPPPTGPSP 333 G RG PP V G GA+G ESG + P PP P G Sbjct: 907 GQRGEVGFQGKTGPPGPPGVVGPQGAAG--------ESGPMGERGHPGPPGPPGEQGLPG 958 Query: 334 HLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 959 TAGKDGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 1017 [65][TOP] >UniRef100_Q6MGB2 Procollagen, type XI, alpha 2 n=1 Tax=Rattus norvegicus RepID=Q6MGB2_RAT Length = 1617 Score = 57.8 bits (138), Expect = 5e-07 Identities = 55/179 (30%), Positives = 61/179 (34%), Gaps = 24/179 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P + H Sbjct: 746 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPAGKDGLPGHP 805 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSF-QSLQPAPPQMPPPTGPSP 333 G RG PP V G GA+G ESG + P PP P G Sbjct: 806 GQRGEVGFQGKTGPPGPPGVVGPQGAAG--------ESGPMGERGHPGPPGPPGEQGLPG 857 Query: 334 HLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GPA PPG SP Sbjct: 858 TAGKDGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPAGPPGPAGSP 916 [66][TOP] >UniRef100_C2XI26 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus F65185 RepID=C2XI26_BACCE Length = 1309 Score = 57.8 bits (138), Expect = 5e-07 Identities = 45/154 (29%), Positives = 61/154 (39%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S + + + G+ G Sbjct: 195 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGSTGATGAT-----GNTGATGST--G 247 Query: 214 
VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G++GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 248 VTGATGSTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 307 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G G GP + A P ATG G GP Sbjct: 308 IQGATGATGDQGPQGIQGAIGPQGATGATGDQGP 341 [67][TOP] >UniRef100_C2P552 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus 172560W RepID=C2P552_BACCE Length = 1325 Score = 57.8 bits (138), Expect = 5e-07 Identities = 45/154 (29%), Positives = 61/154 (39%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S + + + G+ G Sbjct: 195 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGSTGATGAT-----GNTGATGST--G 247 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G++GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 248 VTGATGSTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 307 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G G GP + A P ATG G GP Sbjct: 308 IQGAKGATGDQGPQGIQGAIGPQGATGATGDQGP 341 [68][TOP] >UniRef100_B5UNT5 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus AH1134 RepID=B5UNT5_BACCE Length = 1309 Score = 57.8 bits (138), Expect = 5e-07 Identities = 45/154 (29%), Positives = 61/154 (39%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S + + + G+ G Sbjct: 195 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGSTGATGAT-----GNTGATGST--G 247 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G++GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 248 VTGATGSTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 307 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G G GP + A P ATG G GP Sbjct: 308 IQGAKGATGDQGPQGIQGAIGPQGATGATGDQGP 341 [69][TOP] >UniRef100_Q7YXA3 Protein H06A10.2, partially confirmed by transcript evidence n=1 Tax=Caenorhabditis elegans RepID=Q7YXA3_CAEEL Length = 305 Score = 57.8 bits (138), 
Expect = 5e-07 Identities = 51/161 (31%), Positives = 59/161 (36%), Gaps = 1/161 (0%) Frame = +1 Query: 49 GGSSAAGSFSGPPYAPGVYAGSGPGGH-PASSYAPSSSASLPQGAHLGSRGGAPPSVAGG 225 GGS G P APG +G G P + P P G PP G Sbjct: 115 GGSPGKPGKPGKPGAPGAPGAAGKGASAPCEAKTPPPCQPCPAG---------PPGPPGP 165 Query: 226 YGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPAS 405 G +GP E+GS PA P P P GP G A G P G +GPA Sbjct: 166 DGPAGPAGP--DGEAGS-----PAAPSPPGPPGPPGPAGPAGNDGAAGTP---GPDGPAG 215 Query: 406 LNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 ++ PA G GPA PPG P +P AA Sbjct: 216 ESTYPEPA-----GPGPAGPPGPAGPPGPDGASPTAAPGAA 251 [70][TOP] >UniRef100_A8X4T9 C. briggsae CBR-COL-44 protein n=1 Tax=Caenorhabditis briggsae RepID=A8X4T9_CAEBR Length = 301 Score = 57.8 bits (138), Expect = 5e-07 Identities = 51/171 (29%), Positives = 65/171 (38%), Gaps = 7/171 (4%) Frame = +1 Query: 37 GSVVGGSSAAGSFSG--PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPP 210 G G++A G SG P PGV G G P AP S + +GA + PP Sbjct: 87 GGADAGAAAGGGCSGCCNPGPPGVAGNPGKPGKPGKPGAPGSPGAPGKGAAVPCEAKNPP 146 Query: 211 SV----AGGYGASGPTSATFSNESG-SFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVP 375 AG G GP + E+G + ++ PA P P P GP + G G P Sbjct: 147 PCQPCPAGPPGPPGPDGP--AGEAGPAGEAGAPAAPSPPGPPGPPGPPGNPGADGGAGTP 204 Query: 376 RHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 GA G ++ Y G GPA PPG P +P A Sbjct: 205 GPDGAGGEST--------YPEPAGPGPAGPPGPPGPPGPDGASPTAAPGEA 247 [71][TOP] >UniRef100_A0JM00 Collagen, type 1, alpha 2 n=1 Tax=Xenopus (Silurana) tropicalis RepID=A0JM00_XENTR Length = 1354 Score = 57.4 bits (137), Expect = 6e-07 Identities = 60/179 (33%), Positives = 72/179 (40%), Gaps = 22/179 (12%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPY---------------APGVYAGSGPGGHPA 135 AQ PP + G + A F G P APG + +GP G Sbjct: 521 AQGPPGLAGNTGDKGEQGPAGAPGFQGLPGPGGAAGELGKHGERGAPGDFGPAGPAGPRG 580 Query: 136 SSYAPSSS-ASLPQGAHLGSRG--GAPPS--VAGGYGASGPTSATFSNESGSFQSLQPAP 300 AP S A+ P GA LG RG GAP S G GA+G A + G + A Sbjct: 581 
ERGAPGESGAAGPLGA-LGPRGPTGAPGSDGAKGEPGAAGLNGALGPSGPGGIPGERGAA 639 Query: 301 PQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 +P P G H G +G P GA GPA + A PA A G G +GPA P G Sbjct: 640 G-VPGPKGEKGDAGHSG---EYGNPGRDGARGPAGASGAPGPAGAAGDRGESGPAGPSG 694 [72][TOP] >UniRef100_B7IKZ3 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus G9842 RepID=B7IKZ3_BACC2 Length = 951 Score = 57.4 bits (137), Expect = 6e-07 Identities = 49/154 (31%), Positives = 63/154 (40%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S ++ A+ GA GS G Sbjct: 192 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGS-TGATGATGDTGA-TGSTG----- 244 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G +GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 245 VTGATGTTGPTGSTGAQGLQGIQGIQGSIGPTGPEGPQGIQGIPGPTGITGEQGIQGVQG 304 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 GV G GP + A P TG G GP Sbjct: 305 IQGVTGATGDQGPQGIQGAIGPQGVTGATGDQGP 338 [73][TOP] >UniRef100_C2YXV7 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus AH1271 RepID=C2YXV7_BACCE Length = 924 Score = 57.4 bits (137), Expect = 6e-07 Identities = 44/154 (28%), Positives = 60/154 (38%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S + + + G+ G Sbjct: 192 PTGITGPTGITGPSGGPPGPTGATGATGPGGGPSGSTGATGAT-----GNTGATGST--G 244 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 + G G +GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 245 ITGAAGTTGPTGSTGAQGLQGIQGVQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 304 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 GV G GP + A P TG G GP Sbjct: 305 IQGVTGATGDQGPQGIQGAIGPQGVTGATGDQGP 338 [74][TOP] >UniRef100_A8WXW9 Putative uncharacterized protein n=1 Tax=Caenorhabditis briggsae RepID=A8WXW9_CAEBR Length = 1075 Score = 57.4 bits (137), Expect = 6e-07 Identities = 45/153 (29%), Positives = 64/153 (41%), 
Gaps = 1/153 (0%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAP-P 210 P S G SG Y+ G + G GG + Y+ S+ P A + AP P Sbjct: 612 PSGGYASSGGGGGSSGGGYSSGGGSSGGGGGGSSGGYSQSAPPPPPAPAPAPAPAPAPAP 671 Query: 211 SVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGA 390 + +GGY +SG S+ G + Q APP P + P+P A G A+ G G Sbjct: 672 APSGGYASSGGGSS--GGGGGGYS--QSAPPPPAPESAPAPAPAPSGGYASSGGGESSG- 726 Query: 391 NGPASLNSAALPAYATGGGNGPAYPPGAIVSPA 489 G +S +S + GGG G Y + P+ Sbjct: 727 -GGSSASSGGYASSGGGGGGGGGYASASAPPPS 758 Score = 55.5 bits (132), Expect = 2e-06 Identities = 47/164 (28%), Positives = 62/164 (37%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 A P G + G GG S+ G G AP P P + AP+ S+ Sbjct: 834 APAPAPSGGYSSGGGGGGGSSGGYSGGSAPAPASEPAPAPAPEPEPAPAPAPSS------ 887 Query: 181 HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTA 360 G G S GG G+SG +S +S GS P PP P P+P A G + Sbjct: 888 --GGYSGGSSSGGGGGGSSGGSSGGYS--GGSAAPPPPPPPAPEPAPAPAPAPAPSGGYS 943 Query: 361 AHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPAS 492 + G G + + PA A+ PA P +PAS Sbjct: 944 SEG---GGGGGSSGGYSGGSAPAPASEPAPAPAPEPEPAPAPAS 984 Score = 53.9 bits (128), Expect = 7e-06 Identities = 48/170 (28%), Positives = 60/170 (35%), Gaps = 7/170 (4%) Frame = +1 Query: 19 YGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG 198 Y S GS GG + PP AP + P P+ YA S G S G Sbjct: 677 YASSGGGSSGGGGGGYSQSAPPPPAPE--SAPAPAPAPSGGYASSGGGESSGGGSSASSG 734 Query: 199 GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTA------ 360 G S GG G G SA+ SG A PPP P+P A A Sbjct: 735 GYASSGGGGGGGGGYASASAPPPSGGGGGGYSASAAPPPPPPPAPEPAPAPAPAPAPSRG 794 Query: 361 -AHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFN 507 + G G++G S SA PA P P +PA + ++ Sbjct: 795 YSSGGGGGGGSSGGYSGGSAPAPASEPAPAPAPEQAPAPAPAPAPSGGYS 844 [75][TOP] >UniRef100_P46804 Spidroin-2 (Fragment) n=1 Tax=Nephila clavipes RepID=SPD2_NEPCL Length = 627 Score = 57.4 bits (137), Expect = 6e-07 Identities = 58/187 (31%), Positives 
= 71/187 (37%), Gaps = 10/187 (5%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGPP------YAPGVYA--GSGPGGHPASSYAPSSS 159 Q P YG G GS+AA + +GP Y PG G GPG Y P S+ Sbjct: 293 QGPGGYGPGQQGPSGAGSAAAAAAAGPGQQGLGGYGPGQQGPGGYGPGQQGPGGYGPGSA 352 Query: 160 ASLPQGAHLGSRGGAPPSVAGGYGAS--GPTSATFSNESGSFQSLQPAPPQMPPPTGPSP 333 ++ A G +G GGYG GP+ GS + A P GP Sbjct: 353 SAAAAAAGPGQQG------PGGYGPGQQGPSGP------GSASAAAAAAAAGPGGYGPGQ 400 Query: 334 HLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRL 513 GG P G +GP S ++AA A A GG GP +P Sbjct: 401 Q-GPGGYA-----PGQQGPSGPGSASAAAAAAAAGPGGYGPGQQGPGGYAPGQQGPSGPG 454 Query: 514 SPAAAAA 534 S AAAAA Sbjct: 455 SAAAAAA 461 Score = 55.8 bits (133), Expect = 2e-06 Identities = 63/207 (30%), Positives = 83/207 (40%), Gaps = 31/207 (14%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGPP------YAPGVYAGSGPGGHPASSYAPSSSAS 165 Q P YG G GS+AA + +GP Y PG SGPG S+ A +++A+ Sbjct: 228 QGPGGYGPGQQGLSGPGSAAAAAAAGPGQQGPGGYGPGQQGPSGPG----SAAAAAAAAA 283 Query: 166 LPQGAHLGSR--GGAPPSVAGGYGASGPTSATFSNES----GSFQSLQPAP----PQMPP 315 P G G + GG P G GA +A + G + Q P P Sbjct: 284 GPGGYGPGQQGPGGYGPGQQGPSGAGSAAAAAAAGPGQQGLGGYGPGQQGPGGYGPGQQG 343 Query: 316 PTGPSPHLAHGGVTAA---------HGVPRHHGANGPASLNSAALPAYATGGGNGP---- 456 P G P A AA +G P G +GP S ++AA A A GG GP Sbjct: 344 PGGYGPGSASAAAAAAGPGQQGPGGYG-PGQQGPSGPGSASAAAAAAAAGPGGYGPGQQG 402 Query: 457 --AYPPGAIVSPASTATFNRLSPAAAA 531 Y PG P+ + + + AAAA Sbjct: 403 PGGYAPGQ-QGPSGPGSASAAAAAAAA 428 [76][TOP] >UniRef100_UPI00015B5FE6 PREDICTED: similar to CG15920-PA n=1 Tax=Nasonia vitripennis RepID=UPI00015B5FE6 Length = 752 Score = 57.0 bits (136), Expect = 8e-07 Identities = 51/169 (30%), Positives = 65/169 (38%), Gaps = 7/169 (4%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSG--PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAH 183 PP+ G G GG+ F G P +PG + G G GG P+ SY P S G+ Sbjct: 261 PPAAGGGGFGGNAGGNGGGNGFGGGRPSGSPGGFGGQGGGGRPSDSYLPPSG-----GSG 315 Query: 184 LGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAA 363 G G P GG+G G A N G + +P+ 
PP G G Sbjct: 316 FGGGNGRQP---GGFGQQGGNGAGQQNGGGG--AGRPSSSYGPPSNGNGG--GFSGQNGG 368 Query: 364 HGVPRHHGANGPAS---LNSAALPAYATGGGN--GPAYPPGAIVSPAST 495 G P G G A +S PA +G GN G P + P S+ Sbjct: 369 RGSPSSGGGFGGAGGSPSSSYGPPAGGSGFGNNGGAGGRPSSSYGPPSS 417 [77][TOP] >UniRef100_Q1D888 General secretory system II protein E, N-terminal domain protein n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1D888_MYXXD Length = 2136 Score = 57.0 bits (136), Expect = 8e-07 Identities = 67/199 (33%), Positives = 80/199 (40%), Gaps = 21/199 (10%) Frame = +1 Query: 1 AQQPPSYGSHVP-GSVVGG--SSAAGSFSGPPYAPGVYAGSGPG--GHPASSYAPSSSAS 165 A+ PP+ G +P G V G S S G P PG PG G P SS A Sbjct: 803 ARPPPAPGLPMPHGPVPPGMMGSRPPSSPGLPAVPGGRGAKPPGMTGAPPSSVHRGPQAP 862 Query: 166 LPQGAHLGSRGGAPPSVAGGYGASGP----------TSATFSNESGSFQSLQPAPPQMPP 315 P G GAP + A G GA P T A F+ G + P PP P Sbjct: 863 GPHGTKPPGMTGAPFATAHG-GADAPVPPGTKPPGMTGAPFATAHGGADA--PVPPGTMP 919 Query: 316 P--TGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGP----AYPPGAI 477 P TG P AHG A P GA P ++ AA PA A GG + P A PPG + Sbjct: 920 PGMTGAPPATAHGVPDA----PVPPGAVPPGTM--AAQPAAAHGGPDTPVSPGAVPPGMM 973 Query: 478 VSPASTATFNRLSPAAAAA 534 +P + +P A A Sbjct: 974 GAPPPSVHGGPHAPVALGA 992 [78][TOP] >UniRef100_A4T238 Putative uncharacterized protein n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4T238_MYCGI Length = 811 Score = 57.0 bits (136), Expect = 8e-07 Identities = 58/165 (35%), Positives = 68/165 (41%), Gaps = 14/165 (8%) Frame = +1 Query: 37 GSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSA--SLPQ-GAHLGSRGGAP 207 G +VGG +G G P PG G G GG P P S +P G+ L S GG P Sbjct: 286 GGMVGGGMGSG---GAPKLPG---GLGSGGLPGMGSNPLGSGVDQMPSAGSGLPSAGGVP 339 Query: 208 PSVAGGYGASGPTSATFSNES--GSFQSLQPAPPQMPPPTGPSPHLAHGG------VTAA 363 G GA P A S G+ PA P P P PSP L+ G TA Sbjct: 340 ---GDGSGAGSPAVAFSQGMSTGGAIGGGMPAAP-APAPASPSPALSAGAQAAPVPATAG 395 Query: 364 HGVP---RHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPA 489 GVP G PA+ + A TGGG PPG++ PA Sbjct: 396 GGVPPAAAQSGLVAPAAPPTGA--GMGTGGGAPMMLPPGSMGPPA 438 Score = 55.1 bits 
(131), Expect = 3e-06 Identities = 62/185 (33%), Positives = 76/185 (41%), Gaps = 11/185 (5%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAP---SSSASLPQGAH 183 PS G VPG G S A +FS G+ G GG ++ AP S S +L GA Sbjct: 333 PSAGG-VPGDGSGAGSPAVAFS-----QGMSTGGAIGGGMPAAPAPAPASPSPALSAGAQ 386 Query: 184 LG-----SRGGAPPSVAGG---YGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHL 339 + GG PP+ A A+ PT A G+ L P M PP GP P Sbjct: 387 AAPVPATAGGGVPPAAAQSGLVAPAAPPTGAGMGTGGGAPMMLPPG--SMGPPAGPVPPP 444 Query: 340 AHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSP 519 A A G G+N P SAA PA A G GP P ++V+ TA Sbjct: 445 AATVPAGALGA----GSNAP----SAAPPAAAAGA--GPTLIPASVVAAGQTAAARERRE 494 Query: 520 AAAAA 534 +A AA Sbjct: 495 SADAA 499 [79][TOP] >UniRef100_A8I4M6 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8I4M6_CHLRE Length = 647 Score = 57.0 bits (136), Expect = 8e-07 Identities = 60/189 (31%), Positives = 69/189 (36%), Gaps = 41/189 (21%) Frame = +1 Query: 13 PSYGSHVPGS------VVGGSSAAG------SFSGPPYAPGVYAGSGPGG---------- 126 PSYGS +PGS V+G + A SF G G++ G G GG Sbjct: 175 PSYGSSLPGSGGTAAVVLGAGTGANVPAPSSSFLGGSLLSGLFGGRGGGGGGSAAGGAAG 234 Query: 127 -------------------HPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYGASGPTS 249 PA P+SSA L S G GG T Sbjct: 235 AAVTPDSSVHGPDSYYGVPEPAFGSLPTSSALLRARGLNASAGSILTKATGGL----KTQ 290 Query: 250 ATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPA 429 S SG+F SL P PP PPP + A GG A GVP GA A L+ AA Sbjct: 291 MKKSTSSGNFGSLWPQPPPPPPPAAAAQRNAGGG---AAGVPL--GAGPGAGLSGAA--- 342 Query: 430 YATGGGNGP 456 GGG P Sbjct: 343 ---GGGRAP 348 [80][TOP] >UniRef100_Q9N2N7 Fibrillar collagen alpha 120 and 140 chains (Fragment) n=1 Tax=Hemicentrotus pulcherrimus RepID=Q9N2N7_HEMPU Length = 632 Score = 57.0 bits (136), Expect = 8e-07 Identities = 54/164 (32%), Positives = 61/164 (37%), Gaps = 10/164 (6%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGS--FSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAH 183 P G+ G S A GS GP APG SGP G S+ AP P GA Sbjct: 207 
PGPQGARGEKGDTGASGANGSPGAPGPIGAPGAAGASGPRGETGSTGAPGPQG--PTGAR 264 Query: 184 LGSRGGAPPS----VAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG- 348 GS G A PS AG G +GP LQ P M P P G Sbjct: 265 -GSTGPAGPSGPAGPAGERGETGPAGHKGHPGVSGLPGLQGTPGPMGEPGAPGEQGQQGT 323 Query: 349 -GVTAAHGVPRHHGANGPASLNSAALP--AYATGGGNGPAYPPG 471 G+ A G + G GP + P GG +GP PPG Sbjct: 324 RGLPGARGSNGNDGPAGPRGFDGPEGPRGPRGEGGSSGPPGPPG 367 [81][TOP] >UniRef100_Q5QN39 Os01g0201600 protein n=2 Tax=Oryza sativa Japonica Group RepID=Q5QN39_ORYSJ Length = 301 Score = 56.6 bits (135), Expect = 1e-06 Identities = 45/136 (33%), Positives = 55/136 (40%), Gaps = 3/136 (2%) Frame = +1 Query: 67 GSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYGASGPT 246 G + P PG + G G G S S +LP +H GGA PS GGYGAS P Sbjct: 67 GGTTTPTPIPGHHGGGGSSGTTPSHGGGPSGGALPSPSH----GGAAPSHGGGYGASPPV 122 Query: 247 SATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGG---VTAAHGVPRHHGANGPASLNSA 417 + + G + PAP G SP GG T +HG + G PA+ Sbjct: 123 T---PSPGGGYGGGSPAPSHGGGAYGSSPSTPSGGGSSPTPSHGGGAYGGGGAPAT---- 175 Query: 418 ALPAYATGGGNGPAYP 465 PA G G P P Sbjct: 176 --PASHDGHGLIPTTP 189 [82][TOP] >UniRef100_UPI0000DB7202 PREDICTED: hypothetical protein n=1 Tax=Apis mellifera RepID=UPI0000DB7202 Length = 344 Score = 56.6 bits (135), Expect = 1e-06 Identities = 55/173 (31%), Positives = 74/173 (42%), Gaps = 19/173 (10%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSY-APSSSASLPQGAH-L 186 P GS++P S G+ G GP G +G G GG P+SSY APSS+ P + Sbjct: 29 PISGSYLPPSTSYGTPNLGG-GGPSSTYGAPSGGG-GGRPSSSYGAPSSTYGAPSSTYGA 86 Query: 187 GSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQ---MPPPTGPSPHLAHG--- 348 S GG PS YGA S+ G+ S AP P G P ++G Sbjct: 87 PSNGGGRPS--STYGAPSNGGGRPSSSYGAPSSSYGAPSSTYGAPSNGGGRPSSSYGAPS 144 Query: 349 -----------GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA 474 G++ ++G P G G S+ S++ A GGG GP+ GA Sbjct: 145 FGGGGGFGGGNGLSTSYGAPSRGGGGGGGSI-SSSYGAPTGGGGGGPSTTYGA 196 [83][TOP] >UniRef100_UPI0001AE7353 UPI0001AE7353 related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE7353 
Length = 1629 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 740 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 799 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 800 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 852 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 853 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 910 [84][TOP] >UniRef100_UPI0001AE734E UPI0001AE734E related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE734E Length = 1655 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 766 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 825 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 826 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 878 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 879 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 936 [85][TOP] >UniRef100_UPI0001AE734D UPI0001AE734D related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE734D Length = 1676 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 787 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 846 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G 
GA+G T E G P PP P G Sbjct: 847 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 899 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 900 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 957 [86][TOP] >UniRef100_UPI0001AE734C UPI0001AE734C related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE734C Length = 1689 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 800 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 859 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 860 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 912 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 913 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 970 [87][TOP] >UniRef100_UPI0001AE734B UPI0001AE734B related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE734B Length = 1710 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 821 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 880 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 881 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 933 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 934 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 991 [88][TOP] >UniRef100_UPI0001AE734A UPI0001AE734A related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE734A Length = 1715 Score = 56.6 
bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 826 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 885 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 886 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 938 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 939 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 996 [89][TOP] >UniRef100_UPI0001AE7349 UPI0001AE7349 related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE7349 Length = 1736 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 847 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 906 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 907 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 959 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 960 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 1017 [90][TOP] >UniRef100_UPI0001AE71E7 UPI0001AE71E7 related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE71E7 Length = 1655 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 766 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 825 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 
826 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 878 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 879 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 936 [91][TOP] >UniRef100_UPI0001AE71E6 UPI0001AE71E6 related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE71E6 Length = 1676 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 787 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 846 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 847 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 899 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 900 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 957 [92][TOP] >UniRef100_UPI0001AE71E5 UPI0001AE71E5 related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE71E5 Length = 1715 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 826 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 885 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 886 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 938 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 939 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 996 [93][TOP] >UniRef100_UPI0001AE71E4 UPI0001AE71E4 related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE71E4 Length = 1736 Score = 56.6 bits (135), Expect = 1e-06 
Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 847 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 906 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 907 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 959 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 960 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 1017 [94][TOP] >UniRef100_UPI000173A163 UPI000173A163 related cluster n=1 Tax=Homo sapiens RepID=UPI000173A163 Length = 1623 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 734 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 793 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 794 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 846 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 847 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 904 [95][TOP] >UniRef100_Q3TP88 Putative uncharacterized protein (Fragment) n=1 Tax=Mus musculus RepID=Q3TP88_MOUSE Length = 959 Score = 56.6 bits (135), Expect = 1e-06 Identities = 49/171 (28%), Positives = 61/171 (35%), Gaps = 17/171 (9%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGG 201 GS P G G +G APG SGPGG P A + G RG Sbjct: 200 GSRGPSGAPGPDGNKGE-AGAVGAPGSAGASGPGGLPGERGAAGIPGGKGEKGETGLRGD 258 Query: 202 APPS-------VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPH 336 + + G GA GP A+ E+G+ PA P+ P P GP+ Sbjct: 259 
TGNTGRDGARGIPGAVGAPGPAGASGDRGEAGAAGPSGPAGPRGSPGERGEVGPAGPNGF 318 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPGAIVS 483 G G G GP N P + G G +GP PPG + S Sbjct: 319 AGPAGAAGQPGAKEEKGTKGPKGENGIVGPTGSVGAAGPSGPNGPPGPVGS 369 [96][TOP] >UniRef100_B7H785 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus B4264 RepID=B7H785_BACC4 Length = 1297 Score = 56.6 bits (135), Expect = 1e-06 Identities = 48/154 (31%), Positives = 63/154 (40%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S ++ A+ GA GS G Sbjct: 195 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGS-TGATGATGSTGA-TGSTG----- 247 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-PAPPQ----------MPPPTGPSPHLAHGGVTA 360 V G G +GPT +T + Q +Q P P +P PTG + GV Sbjct: 248 VTGATGTTGPTGSTGAQGLQGIQGIQGPIGPTGSEGPQGIQGIPGPTGVTGEQGIQGVQG 307 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G+ G GP + A P ATG G GP Sbjct: 308 IQGITGATGDQGPQGIQGAIGPQGATGATGDQGP 341 [97][TOP] >UniRef100_C8RSP5 Ferredoxin, 4Fe-4S (Fragment) n=1 Tax=Corynebacterium jeikeium ATCC 43734 RepID=C8RSP5_CORJE Length = 1064 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/183 (29%), Positives = 76/183 (41%), Gaps = 5/183 (2%) Frame = +1 Query: 1 AQQPPSYGSHV----PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASL 168 A PS G+ PG+ ++ + +G P APG A G PA+ APS+ A Sbjct: 845 APSAPSAGTPAAPAAPGAPAAPAAPSAPSAGAPAAPGAPAAPAAPGAPAAPSAPSAGAPA 904 Query: 169 PQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGP-SPHLAH 345 GA P+ G A G +A + + + Q AP P P +P Sbjct: 905 APGA---------PAAPGAPAAPGAPAAPGAPAAPKSEDTQEAPKTSGAPAAPGAPSAPS 955 Query: 346 GGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAA 525 G AA G P PA+ + A P+ + G PA PGA +P++ A +PAA Sbjct: 956 AGAPAAPGAPA-----APAAPGAPAAPSAPSAG--APA-APGAPSAPSAGAPAAPGAPAA 1007 Query: 526 AAA 534 AA Sbjct: 1008 PAA 1010 [98][TOP] >UniRef100_C3DRK5 Collagen triple helix repeat domain protein n=1 Tax=Bacillus thuringiensis serovar sotto str. 
T04001 RepID=C3DRK5_BACTS Length = 951 Score = 56.6 bits (135), Expect = 1e-06 Identities = 49/154 (31%), Positives = 63/154 (40%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S ++ A+ GA GS G Sbjct: 192 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGS-TGATGATGDTGA-TGSTG----- 244 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G +GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 245 VTGETGTTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGITGEQGIQGVQG 304 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 GV G GP + P ATG G GP Sbjct: 305 IQGVTGATGDQGPQGIQGTIGPQGATGATGDQGP 338 [99][TOP] >UniRef100_Q8WSZ3 Dragline silk protein spidroin 2 (Fragment) n=1 Tax=Nephila clavata RepID=Q8WSZ3_NEPCV Length = 301 Score = 56.6 bits (135), Expect = 1e-06 Identities = 57/190 (30%), Positives = 73/190 (38%), Gaps = 25/190 (13%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGP-PYAPGV---------------YAGSGPGGHPA 135 Q P YG P G S+AA + +GP Y PG Y SGP G P Sbjct: 30 QGPGGYGPSGPSGPGGASAAAAAAAGPGGYGPGQQGPGQQGPGQQGPAGYGPSGPSG-PG 88 Query: 136 SSYAPSSSASLPQGAHLGSRG----GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAP- 300 + A +++A+ P G LG +G G GYG SG S G+ + P Sbjct: 89 GAAAAAAAAAGPGGYGLGQQGPGQQGPGQQGPAGYGPSG-----LSGPGGAAAAAAAGPG 143 Query: 301 ---PQMPPPTGPSPHLAHGGVTAAHGV-PRHHGANGPASLNSAALPAYATGGGNGPAYPP 468 P P+GP A G P G +GP S +AA A G G G P Sbjct: 144 GYGPGQQRPSGPGGAAAAAAAAGPGGYGPSQRGPSGPGSAAAAAAGAGPGGYGPGQKGPS 203 Query: 469 GAIVSPASTA 498 G + A+ A Sbjct: 204 GPGSAAAAAA 213 Score = 55.5 bits (132), Expect = 2e-06 Identities = 58/183 (31%), Positives = 69/183 (37%), Gaps = 6/183 (3%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGPP-YAPGVYAGS--GPGGHPASSYAPSSSASLPQ 174 Q P YG P G ++AA + +GP Y G GPG + Y PS Sbjct: 74 QGPAGYGPSGPSGPGGAAAAAAAAAGPGGYGLGQQGPGQQGPGQQGPAGYGPSG------ 127 Query: 175 GAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPP--QMPPPTGPS-PHLAH 345 L GGA + A G G GP S G+ + A P P GPS P A Sbjct: 128 
---LSGPGGAAAAAAAGPGGYGPGQQRPSGPGGAAAAAAAAGPGGYGPSQRGPSGPGSAA 184 Query: 346 GGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAA 525 A G GP+ SAA A A GG GP+ A P+ S AA Sbjct: 185 AAAAGAGPGGYGPGQKGPSGPGSAAAAAAAGPGGYGPSQQGPARYGPSGPG-----SAAA 239 Query: 526 AAA 534 AAA Sbjct: 240 AAA 242 [100][TOP] >UniRef100_Q4E3X8 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4E3X8_TRYCR Length = 325 Score = 56.6 bits (135), Expect = 1e-06 Identities = 44/134 (32%), Positives = 62/134 (46%), Gaps = 3/134 (2%) Frame = +1 Query: 49 GGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGY 228 GGS+ A SG P PG GSG G S+ + +S A +P+G S GG+ GG Sbjct: 80 GGSAGATGASGGP-GPGDAGGSG-GTQKNSNSSETSDAGVPRGGD--SDGGSAAGEKGGS 135 Query: 229 GASGPTSATFSNESGSFQS---LQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGP 399 G G ++T + +GS S PAP PP+ P T A GV G++G Sbjct: 136 GGGGSGTSTDGHGTGSVSSGLSAVPAPAPAAPPSAPGHSGGPSAPTDAPGVDPSAGSSGG 195 Query: 400 ASLNSAALPAYATG 441 ++ + P+ TG Sbjct: 196 TAVPPGSNPSNTTG 209 [101][TOP] >UniRef100_B9PJ47 Putative uncharacterized protein n=1 Tax=Toxoplasma gondii GT1 RepID=B9PJ47_TOXGO Length = 994 Score = 56.6 bits (135), Expect = 1e-06 Identities = 56/185 (30%), Positives = 69/185 (37%), Gaps = 11/185 (5%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSG--PGGHPASSYAPSSSASLPQGAH 183 PP+ + PG+ G AA + PP P A G PG PA++ + P G Sbjct: 630 PPAAAA--PGAPPGTPPAAAAPGAPPGTPPAAAAPGAPPGTPPATAATSGAPPGTPPGTP 687 Query: 184 LGSRG---GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPT----GPSPHLA 342 + G G+PP+ A GA + + SG APP PP T G SP Sbjct: 688 AAASGAPPGSPPATATAAGAPPGSPPATAAASG-------APPGSPPATATASGASPGTP 740 Query: 343 HGGVTAAHGVPRHHGANGPASLNSA--ALPAYATGGGNGPAYPPGAIVSPASTATFNRLS 516 G AA G P PA A P G A PP A +P A L Sbjct: 741 PGTPAAASGAPPGTPPGTPAEALGAVPGAPVATPGAAPTTATPPAAAGTPGVVAGGPGLV 800 Query: 517 PAAAA 531 PA A Sbjct: 801 PAVVA 805 Score = 53.9 bits (128), Expect = 7e-06 Identities = 54/183 (29%), Positives = 71/183 (38%), Gaps = 21/183 (11%) Frame = +1 Query: 34 
PGSVVGGSSAAGSFSGPPYAPGVYAGSG-PGGHPASSYAPSSSASLPQGAHLGSRGGAPP 210 P V GG +AA + P A V +G P G P ++ A ++ S GA + GAPP Sbjct: 555 PNLVAGGIAAAIPAAAFPQATMVAGSNGLPQGVPVAAPAVPTAPSAAAGAPAAAASGAPP 614 Query: 211 SVAGGYGASG------PTSATFSNESGS-FQSLQPAPPQMPPPTGPSPHLAHG---GVTA 360 ASG P +A G+ + P P PP +P G A Sbjct: 615 GTPSAAAASGAPPGTPPAAAAPGAPPGTPPAAAAPGAPPGTPPAAAAPGAPPGTPPATAA 674 Query: 361 AHGVPRHHGANGPASLNSA---ALPAYATGGGNGPAYPPGAIV-------SPASTATFNR 510 G P PA+ + A + PA AT G P PP SP +TAT + Sbjct: 675 TSGAPPGTPPGTPAAASGAPPGSPPATATAAGAPPGSPPATAAASGAPPGSPPATATASG 734 Query: 511 LSP 519 SP Sbjct: 735 ASP 737 [102][TOP] >UniRef100_B7PZI3 Smarca4, putative n=1 Tax=Ixodes scapularis RepID=B7PZI3_IXOSC Length = 434 Score = 56.6 bits (135), Expect = 1e-06 Identities = 44/126 (34%), Positives = 48/126 (38%), Gaps = 3/126 (2%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGS-FSGPPYAPGVYAGSGP--GGHPASSYAPSSSASLPQGAHLGS 192 G G GGSS A S GPP P + G P G P YAP P Sbjct: 37 GKPPAGGGSGGSSGAPSPIMGPPPVPQQHMGMPPEGGAPPHHGYAPQPHMG-PGAVQPQV 95 Query: 193 RGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGV 372 GG PP YG P +Q QP PPQ P G P L H GV +H Sbjct: 96 YGGPPPQQQPPYGGGAP-----------YQQQQPYPPQQGVPPGGGPPLQHQGVPPSH-- 142 Query: 373 PRHHGA 390 HHG+ Sbjct: 143 -PHHGS 147 [103][TOP] >UniRef100_B3NY10 GG17589 n=1 Tax=Drosophila erecta RepID=B3NY10_DROER Length = 2024 Score = 56.6 bits (135), Expect = 1e-06 Identities = 60/181 (33%), Positives = 78/181 (43%), Gaps = 14/181 (7%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGP-PYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAP- 207 P + SS AG+ +G + V +G G G S+ A S+ S QGA G+ GG+ Sbjct: 162 PATPKSSSSGAGASTGSGTSSAAVTSGPGSGSTKVSTAASSAQQSGLQGA-TGAGGGSSS 220 Query: 208 -PSVAGGYGASGPTSA-TFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGG----VTAAHG 369 P G GA G T+A S G+ S P +PP + PH G TA G Sbjct: 221 TPGTQTGSGAGGATAARPVSAMGGTVSSTAGGAPSIPPISTMPPHTVPGSTNTTTTALAG 280 Query: 370 VPRHHGANGP----ASLNSAALPAYATGGGNGPAYP--PGAIVSPASTATFNRLSPAAAA 531 GA GP A+ N+AAL A G AYP PG +S+ + AAA Sbjct: 281 
-----GAGGPGAAAANPNAAALMASLLSAGQTGAYPGAPGQTAVNSSSLLDGSTAAVAAA 335 Query: 532 A 534 A Sbjct: 336 A 336 [104][TOP] >UniRef100_Q5JP94 Collagen type XI alpha 2 n=4 Tax=Homo sapiens RepID=Q5JP94_HUMAN Length = 1650 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 761 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 820 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 821 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 873 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 874 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 931 [105][TOP] >UniRef100_C9J8W5 Putative uncharacterized protein ENSP00000410951 n=1 Tax=Homo sapiens RepID=C9J8W5_HUMAN Length = 1693 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 761 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 820 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 821 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 873 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 874 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 931 [106][TOP] >UniRef100_B0UXE9 Collagen, type XI, alpha 2 n=1 Tax=Homo sapiens RepID=B0UXE9_HUMAN Length = 1650 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G 
P G S A G+ G PP G+ GP G P P H Sbjct: 761 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 820 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 821 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 873 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 874 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 931 [107][TOP] >UniRef100_A6NI54 Putative uncharacterized protein ENSP00000363829 n=2 Tax=Homo sapiens RepID=A6NI54_HUMAN Length = 1693 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 761 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 820 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 821 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 873 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 874 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 931 [108][TOP] >UniRef100_Q9UMD9-2 Isoform 2 of Collagen alpha-1(XVII) chain n=1 Tax=Homo sapiens RepID=Q9UMD9-2 Length = 1415 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/173 (31%), Positives = 70/173 (40%), Gaps = 23/173 (13%) Frame = +1 Query: 34 PGSVVGGS-SAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL-------- 186 PG +V S+ + GPP PG GP G P P+ A LP + Sbjct: 805 PGKIVTSEGSSMLTVPGPPGPPGAMGPPGPPGAPG----PAGPAGLPGHQEVLNLQGPPG 860 Query: 187 --GSRGGAPPSVAGGYGASGPTS----------ATFSNESGSFQSLQPAPPQMPPPTGPS 330 G RG PS+ G G GP +F + S +F S PP P P GP Sbjct: 861 PPGPRGPPGPSIPGPPGPRGPPGEGLPGPPGPPGSFLSNSETFLS---GPPGPPGPPGPK 917 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPGAIVS 483 GV A G+P +GP+ S++ Y +G G GP 
PPG+I S Sbjct: 918 GDQGDPGVPGALGIP-----SGPSEGGSSS-TMYVSGPPGPPGPPGPPGSISS 964 [109][TOP] >UniRef100_P13942-5 Isoform 5 of Collagen alpha-2(XI) chain n=2 Tax=Homo sapiens RepID=P13942-5 Length = 1689 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 800 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 859 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 860 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 912 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 913 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 970 [110][TOP] >UniRef100_P13942-2 Isoform 2 of Collagen alpha-2(XI) chain n=2 Tax=Homo sapiens RepID=P13942-2 Length = 1710 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 821 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 880 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 881 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 933 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 934 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 991 [111][TOP] >UniRef100_P13942-7 Isoform 7 of Collagen alpha-2(XI) chain n=1 Tax=Homo sapiens RepID=P13942-7 Length = 1655 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ 
G PP G+ GP G P P H Sbjct: 766 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 825 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 826 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 878 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 879 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 936 [112][TOP] >UniRef100_P13942-3 Isoform 3 of Collagen alpha-2(XI) chain n=1 Tax=Homo sapiens RepID=P13942-3 Length = 1715 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 826 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 885 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 886 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 938 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 939 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 996 [113][TOP] >UniRef100_P13942-4 Isoform 4 of Collagen alpha-2(XI) chain n=1 Tax=Homo sapiens RepID=P13942-4 Length = 1676 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 787 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 846 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 847 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 899 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 900 
AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 957 [114][TOP] >UniRef100_P13942-8 Isoform 8 of Collagen alpha-2(XI) chain n=2 Tax=Homo sapiens RepID=P13942-8 Length = 1629 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 740 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 799 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 800 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 852 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 853 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 910 [115][TOP] >UniRef100_P13942 Collagen alpha-2(XI) chain n=1 Tax=Homo sapiens RepID=COBA2_HUMAN Length = 1736 Score = 56.6 bits (135), Expect = 1e-06 Identities = 54/178 (30%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 G P G S A G+ G PP G+ GP G P P H Sbjct: 847 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPPGPPGKDGLPGHP 906 Query: 187 GSRG----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 G RG PP V G GA+G T E G P PP P G Sbjct: 907 GQRGEVGFQGKTGPPGPPGVVGPQGAAGETGP--MGERG-----HPGPPGPPGEQGLPGT 959 Query: 337 LAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 960 AGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 1017 [116][TOP] >UniRef100_UPI0000E1F855 PREDICTED: similar to COL3A1 protein isoform 12 n=1 Tax=Pan troglodytes RepID=UPI0000E1F855 Length = 1457 Score = 56.2 bits (134), Expect = 1e-06 Identities = 62/204 (30%), Positives = 75/204 (36%), Gaps = 32/204 (15%) Frame = +1 Query: 13 PSYGSH--VPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHL 186 P Y S+ G VGG + 
+GPP PG G GHP S +P + Sbjct: 153 PQYDSYDVKSGVAVGGLAGYPGPAGPPGPPG---PPGTSGHPGSPGSPGYQGPPGEPGQA 209 Query: 187 GSRGGAPPSVAGGYGASGPTSATFSNESG----SFQSLQPAPPQMPPPTG--PSPHL-AH 345 G G PP G G SGP A ESG + P PP + P G P + H Sbjct: 210 GPSG--PPGPPGAIGPSGP--AGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGH 265 Query: 346 GGVTAA------HGVPRHHGANGPASLNSA-------ALPAYATGGGN----------GP 456 G T A +G+P +GA GP A LP A GN GP Sbjct: 266 RGETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRPGLPGAAGARGNDGARGSDGQPGP 325 Query: 457 AYPPGAIVSPASTATFNRLSPAAA 528 PPG P S + PA + Sbjct: 326 PGPPGTAGFPGSPGAKGEVGPAGS 349 [117][TOP] >UniRef100_UPI0000D9A866 PREDICTED: similar to alpha 2 type I collagen isoform 1 n=1 Tax=Macaca mulatta RepID=UPI0000D9A866 Length = 1248 Score = 56.2 bits (134), Expect = 1e-06 Identities = 51/156 (32%), Positives = 60/156 (38%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG + AG SGP PG +G G P + G+RG AP Sbjct: 624 PG-VVGAAGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 678 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 679 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 736 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G +GP PPG Sbjct: 737 AKGERGAKGPKGENGVVGPTGPVGAAGPSGPNGPPG 772 [118][TOP] >UniRef100_UPI0000D9A865 PREDICTED: similar to alpha 2 type I collagen isoform 2 n=1 Tax=Macaca mulatta RepID=UPI0000D9A865 Length = 1363 Score = 56.2 bits (134), Expect = 1e-06 Identities = 51/156 (32%), Positives = 60/156 (38%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG + AG SGP PG +G G P + G+RG AP Sbjct: 621 PG-VVGAAGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 675 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 676 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 733 Query: 370 
VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G +GP PPG Sbjct: 734 AKGERGAKGPKGENGVVGPTGPVGAAGPSGPNGPPG 769 [119][TOP] >UniRef100_UPI0000D9A864 PREDICTED: similar to alpha 2 type I collagen isoform 3 n=1 Tax=Macaca mulatta RepID=UPI0000D9A864 Length = 1366 Score = 56.2 bits (134), Expect = 1e-06 Identities = 51/156 (32%), Positives = 60/156 (38%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG + AG SGP PG +G G P + G+RG AP Sbjct: 624 PG-VVGAAGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 678 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 679 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 736 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G +GP PPG Sbjct: 737 AKGERGAKGPKGENGVVGPTGPVGAAGPSGPNGPPG 772 [120][TOP] >UniRef100_UPI00016E5ECF UPI00016E5ECF related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E5ECF Length = 1261 Score = 56.2 bits (134), Expect = 1e-06 Identities = 51/168 (30%), Positives = 58/168 (34%), Gaps = 4/168 (2%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 P G P G S G P A GV A GP G P P S P G Sbjct: 737 PGPAGPPGPAGAPGLSGPIGPAGLPGPAGGVSALPGPPGPPGPPGRPGDSRQGPPG---- 792 Query: 190 SRGGAPPSVAGGYGASGPT----SATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVT 357 P GYG GP FS+ SG+F + P PP P G S GG Sbjct: 793 ------PPGPPGYGRPGPKGDKGDPGFSSSSGTFYTGPPGPPGPAGPKGSSVATYSGG-- 844 Query: 358 AAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTAT 501 +G+P G GP G GP PPG P + A+ Sbjct: 845 --NGIPGPPGPPGPPGPQGFKGSISVASGPPGPPGPPGPAGRPGTFAS 890 [121][TOP] >UniRef100_B1JZ05 Putative uncharacterized protein n=1 Tax=Burkholderia cenocepacia MC0-3 RepID=B1JZ05_BURCC Length = 387 Score = 56.2 bits (134), Expect = 1e-06 Identities = 53/156 (33%), Positives = 68/156 (43%), Gaps = 5/156 (3%) Frame = +1 Query: 82 PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYGASGPTSATFS 261 P PGV 
GSG PA++ A ++ A++ A + G S AG AS P A+ S Sbjct: 226 PLSVPGVAPGSGANAVPAAASAVTAPAAMRAAAPAAASGSGTVSGAGAAPASAPAPAS-S 284 Query: 262 NESGSFQSLQPAPPQMP-----PPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALP 426 + AP P P T P+P A G A P A+ PA A P Sbjct: 285 GGPAPAPASAAAPASAPKPISGPATAPAPSSASGSTAAPVSAP----ASAPA---PATAP 337 Query: 427 AYATGGGNGPAYPPGAIVSPASTATFNRLSPAAAAA 534 A AT + PA P A +PAS + + SPA AAA Sbjct: 338 ATAT--PSSPA-PSSAASTPASASAPSSASPAPAAA 370 [122][TOP] >UniRef100_Q9NHW4 Flagelliform silk protein (Fragment) n=1 Tax=Nephila clavipes RepID=Q9NHW4_NEPCL Length = 2249 Score = 56.2 bits (134), Expect = 1e-06 Identities = 60/162 (37%), Positives = 65/162 (40%), Gaps = 8/162 (4%) Frame = +1 Query: 10 PPSYGSHVP-GSVVGGSSAAG---SFSGPPYAPGVYAGSGP-GGHPASSYAPSSSASLPQ 174 P G + P G GS A G S SGP +GSGP GG SS PS + P Sbjct: 3 PSGTGGYAPTGYAPSGSGAGGVRPSASGP-------SGSGPSGGSRPSSSGPSGTRPSPN 55 Query: 175 GAHLGSRGGAPP--SVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG 348 GA S GG P S +GG G SG T S SGS+ P GPS Sbjct: 56 GASGSSPGGIAPGGSNSGGAGVSGATGGPAS--SGSY-----GPGSTGGTYGPSGGSEPF 108 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGP-AYPPG 471 G A G P G GP A P GG GP Y PG Sbjct: 109 GPGVAGG-PYSPGGAGPGGAGGAYGPGGVGTGGAGPGGYGPG 149 Score = 53.9 bits (128), Expect = 7e-06 Identities = 55/169 (32%), Positives = 60/169 (35%), Gaps = 19/169 (11%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSF----SGPPYAPGVYAGSGPGG-----------HPASSYAPSS 156 G PG V G S G +G PY PG GSGPGG P +Y P Sbjct: 830 GGFGPGGVGPGGSGPGGVGPGGAGRPYGPG---GSGPGGAGGAGGTGGAYGPGGAYGPGG 886 Query: 157 SASLPQGAHLGSRGGAPPSVAGG-YGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSP 333 S P GA G GG P AGG YG G +G P P GP Sbjct: 887 SGG-PGGA--GGPGGEGPGGAGGPYGPGGAGGPYGPGGAGG----PYGPGGEGGPYGPGV 939 Query: 334 HLAHGGVTAAHGV--PRHHGANGPASLNSAALPAYATGGGNGP-AYPPG 471 GG +G P G GP P GG+GP Y PG Sbjct: 940 SYGPGGAGGPYGPGGPYGPGGEGPGGAGGPYGPGGVGPGGSGPGGYGPG 988 [123][TOP] >UniRef100_Q4G1Y1 Major ampullate spidroin 2 (Fragment) n=1 Tax=Latrodectus hesperus RepID=Q4G1Y1_9ARAC Length = 542 Score = 56.2 bits 
(134), Expect = 1e-06 Identities = 52/174 (29%), Positives = 71/174 (40%), Gaps = 1/174 (0%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 P YG G GG+ AA + + PG GPGG A++ A +++ S P G + Sbjct: 306 PGYGGQ-QGFGPGGAGAAAAAAAGGAGPGRQQAYGPGGSGAAAAAAAAAGSGPSGYGPSA 364 Query: 193 RGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGV 372 G PS GG GA+ +A S G Q PTG P Sbjct: 365 AG---PSGPGGSGAAAAAAAGGSGPGGFGQG----------PTGYGP-----------SG 400 Query: 373 PRHHGANGPASLNSAALPAYATGGGNGPA-YPPGAIVSPASTATFNRLSPAAAA 531 P GP + +AA A + GG GP+ Y P ++ S A++A SP A Sbjct: 401 PGGQQGYGPGASGAAAAAAASGSGGYGPSQYVPSSVASSAASAASALSSPTTHA 454 [124][TOP] >UniRef100_B6K9K2 Putative uncharacterized protein n=2 Tax=Toxoplasma gondii RepID=B6K9K2_TOXGO Length = 994 Score = 56.2 bits (134), Expect = 1e-06 Identities = 56/185 (30%), Positives = 69/185 (37%), Gaps = 11/185 (5%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSG--PGGHPASSYAPSSSASLPQGAH 183 PP+ + PG+ G AA + PP P A G PG PA++ + P G Sbjct: 630 PPAAAA--PGAPPGTPPAAAAPGAPPGTPPAAAAPGAPPGTPPATAATSGAPPGTPPGTP 687 Query: 184 LGSRG---GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPT----GPSPHLA 342 + G G+PP+ A GA + + SG APP PP T G SP Sbjct: 688 AAASGAPPGSPPATATAAGAPPGSPPATAAASG-------APPGSPPATATASGASPGTP 740 Query: 343 HGGVTAAHGVPRHHGANGPASLNSA--ALPAYATGGGNGPAYPPGAIVSPASTATFNRLS 516 G AA G P PA A P G A PP A +P A L Sbjct: 741 PGTPAAASGAPPGTPPGTPAEALGAVPGAPIATPGAAPTTATPPAAAGTPGVVAGGPGLV 800 Query: 517 PAAAA 531 PA A Sbjct: 801 PAVVA 805 Score = 53.9 bits (128), Expect = 7e-06 Identities = 54/183 (29%), Positives = 71/183 (38%), Gaps = 21/183 (11%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSG-PGGHPASSYAPSSSASLPQGAHLGSRGGAPP 210 P V GG +AA + P A V +G P G P ++ A ++ S GA + GAPP Sbjct: 555 PNLVAGGIAAAIPAAAFPQATMVAGSNGLPQGVPVAAPAVPTAPSAAAGAPAAAASGAPP 614 Query: 211 SVAGGYGASG------PTSATFSNESGS-FQSLQPAPPQMPPPTGPSPHLAHG---GVTA 360 ASG P +A G+ + P P PP +P G A Sbjct: 615 
GTPSAAAASGAPPGTPPAAAAPGAPPGTPPAAAAPGAPPGTPPAAAAPGAPPGTPPATAA 674 Query: 361 AHGVPRHHGANGPASLNSA---ALPAYATGGGNGPAYPPGAIV-------SPASTATFNR 510 G P PA+ + A + PA AT G P PP SP +TAT + Sbjct: 675 TSGAPPGTPPGTPAAASGAPPGSPPATATAAGAPPGSPPATAAASGAPPGSPPATATASG 734 Query: 511 LSP 519 SP Sbjct: 735 ASP 737 [125][TOP] >UniRef100_B2W108 Putative uncharacterized protein n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2W108_PYRTR Length = 842 Score = 56.2 bits (134), Expect = 1e-06 Identities = 47/156 (30%), Positives = 69/156 (44%) Frame = +1 Query: 19 YGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG 198 YG VPG + + +G+ S PP AG GG+ + S + S S S P GA Sbjct: 160 YGGDVPGVSMSSAVPSGAVSSPP------AGGYGGGYGSPSPSSSPSPSTPAGAV----- 208 Query: 199 GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPR 378 PP AGGYG + + S S + P +P T +P A ++A G Sbjct: 209 STPP--AGGYGGGYGGNVPGVSMSSVVPSGASSTPAIPAATTSTPAGAVS-TSSAGGYGG 265 Query: 379 HHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 +G N P S+ +P+ A+ G + P GA+ +P Sbjct: 266 GYGGNVPGVSMSSVVPSGASSGSPSASTPAGAVSTP 301 Score = 53.5 bits (127), Expect = 9e-06 Identities = 55/181 (30%), Positives = 75/181 (41%), Gaps = 20/181 (11%) Frame = +1 Query: 19 YGSHVPGSVVGGSSAAGSFSGPPYAP--GVYAGSGPG---------GHPASSYAPSSSAS 165 YGS P S S+ AG+ S PP G Y G+ PG G ++ P+++ S Sbjct: 190 YGSPSPSSSPSPSTPAGAVSTPPAGGYGGGYGGNVPGVSMSSVVPSGASSTPAIPAATTS 249 Query: 166 LPQGAHLGSRGGAPPSVAGGYGASGPTSATFS-----NESGSFQSLQPAPPQMPPPTGPS 330 P GA S G GGYG + P + S SGS + PA PP G Sbjct: 250 TPAGAVSTSSAGG---YGGGYGGNVPGVSMSSVVPSGASSGSPSASTPAGAVSTPPAGGY 306 Query: 331 PHLAHG---GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP-ASTA 498 G GV+ + VP + + + PA GG P+ PG V+P A+T+ Sbjct: 307 GGGYGGNVPGVSMSSVVPSGASSTPAIPAATTSTPAGGNGGYGSPSSTPGKPVTPDAATS 366 Query: 499 T 501 T Sbjct: 367 T 367 [126][TOP] >UniRef100_P02459 Collagen alpha-1(II) chain (Fragments) n=1 Tax=Bos taurus RepID=CO2A1_BOVIN Length = 747 Score = 56.2 bits (134), Expect = 1e-06 Identities = 51/173 (29%), Positives = 63/173 (36%), 
Gaps = 12/173 (6%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPAS--SYAPSSSASLPQGAH 183 PP G G+ A +GP A G GP G P + S P+ +A P Sbjct: 170 PPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGAAGNPGTDG 229 Query: 184 L-GSRGGA-PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVT 357 + G++G A P +AG G GP P P P GP G+ Sbjct: 230 IPGAKGSAGAPGIAGAPGFPGPRGP-------------PGPQGATGPLGPKGQTGEPGIA 276 Query: 358 AAHGVPRHHGANGPASLNSAALPAYATG--------GGNGPAYPPGAIVSPAS 492 G G GPA + A PA G GG GPA PPG +P S Sbjct: 277 GFKGEQGPKGEPGPAGVQGAPGPAGEEGKRGARGEPGGAGPAGPPGERGAPGS 329 [127][TOP] >UniRef100_Q3TU64 Putative uncharacterized protein n=2 Tax=Mus musculus RepID=Q3TU64_MOUSE Length = 1372 Score = 55.8 bits (133), Expect = 2e-06 Identities = 49/171 (28%), Positives = 61/171 (35%), Gaps = 17/171 (9%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGG 201 GS P G G +G APG SGPGG P A + G RG Sbjct: 613 GSRGPSGAPGPDGNKGE-AGAVGAPGSAGASGPGGLPGERGAAGIPGGKGEKGETGLRGD 671 Query: 202 APPS-------VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPH 336 + + G GA GP A+ E+G+ PA P+ P P GP+ Sbjct: 672 TGNTGRDGARGIPGAVGAPGPAGASGDRGEAGAAGPSGPAGPRGSPGERGEVGPAGPNGF 731 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPGAIVS 483 G G G GP N P + G G +GP PPG + S Sbjct: 732 AGPAGAAGQPGAKGEKGTKGPKGENGIVGPTGSVGAAGPSGPNGPPGPVGS 782 [128][TOP] >UniRef100_A9EYY3 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EYY3_SORC5 Length = 421 Score = 55.8 bits (133), Expect = 2e-06 Identities = 50/172 (29%), Positives = 67/172 (38%), Gaps = 16/172 (9%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 PPS + P + A+ + P AP +A GPG A+ AP ++ + H G Sbjct: 155 PPSQAAFQPAPITDRMGASATQPPAPAAPPGFASPGPGA--AAPAAPEAARAPMPSPHAG 212 Query: 190 S--RGGAPPSVAG-GYGASGP-TSATFSNESGSFQSLQPAPPQMPPPTGPSPHL--AHGG 351 APP G+GA+ P S S + PA PP P+P + A G Sbjct: 213 QPPAPAAPPGFGSPGFGAAAPAVSEAARTPMPSLHAGMPAQAGPPPAAAPAPAMSAAPGA 272 Query: 352 
VTAAHGVPRHHG----------ANGPASLNSAALPAYATGGGNGPAYPPGAI 477 AAHG P+ G A A L ALP+ P PP A+ Sbjct: 273 GAAAHGAPQAAGGWDGAAESPWATTSARLEMPALPSTFVQERPEPQKPPAAV 324 [129][TOP] >UniRef100_A0K683 Putative uncharacterized protein n=2 Tax=Burkholderia cenocepacia RepID=A0K683_BURCH Length = 383 Score = 55.8 bits (133), Expect = 2e-06 Identities = 52/154 (33%), Positives = 68/154 (44%), Gaps = 3/154 (1%) Frame = +1 Query: 82 PPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYGASGPTSATFS 261 P APGV GSG PA++ A ++ A++ A + G S A A P SA S Sbjct: 226 PLSAPGVAPGSGANAVPAAASAVAAPAAMRAAAPTAASGAGAVSGAAPASAPAPASAGGS 285 Query: 262 ---NESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAY 432 S + +L P P P T P+P G A A+ PA SA+ PA Sbjct: 286 APAPASAAAPALAPKPVS-GPVTAPAPSSTSGSTAAP--------ASAPA---SASAPAP 333 Query: 433 ATGGGNGPAYPPGAIVSPASTATFNRLSPAAAAA 534 AT + PA P A +PAS + + SPA A A Sbjct: 334 ATATPSSPA-PSSAASTPASASAPSSASPAPATA 366 [130][TOP] >UniRef100_Q2I6N4 Uncharacterized Gly-rich protein n=1 Tax=uncultured delta proteobacterium DeepAnt-1F12 RepID=Q2I6N4_9DELT Length = 784 Score = 55.8 bits (133), Expect = 2e-06 Identities = 56/177 (31%), Positives = 69/177 (38%), Gaps = 7/177 (3%) Frame = +1 Query: 22 GSHVPGSVVG--GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSR 195 G P VG G+ A +GP G +GPGG A P+ G+ Sbjct: 153 GEAGPQGAVGPAGADGAAGPAGPQGLQGERGPAGPGGGEAGPAGPA-----------GAD 201 Query: 196 GGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMP-----PPTGPSPHLAHGGVTA 360 G A P AG GA GP A +G+ + PA P P P GP+ G Sbjct: 202 GVAGP--AGADGADGPDGA--QGPAGADGAQGPAGPVGPGGGEAGPAGPAGADGVAGPAG 257 Query: 361 AHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAAA 531 A G GA GPA + A PA A G GP P G PA A + ++ A A Sbjct: 258 ADGADGPDGAQGPAGADGAQGPAGA-DGAQGPVGPGGGEAGPAGPAGADGVAGPAGA 313 Score = 53.9 bits (128), Expect = 7e-06 Identities = 49/163 (30%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASL--PQGAHLGSR 195 G+ P G + A G+ GP A G GPGG A P+ + + P GA Sbjct: 260 
GADGPDGAQGPAGADGA-QGPAGADGAQGPVGPGGGEAGPAGPAGADGVAGPAGADGADG 318 Query: 196 GGAPPSVAGGYGASGPTSATFSNES---GSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAH 366 AG GA GP A + G ++ P GP+ G A Sbjct: 319 PDGAQGPAGADGAQGPAGADGAQGPVGPGGGEAGPAGPAGADGVAGPAGADGADGPDGAQ 378 Query: 367 GVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA--IVSPA 489 G GA GPA + A P GG GPA P GA + PA Sbjct: 379 GPAGADGAQGPAGADGAQGPVGPGGGEAGPAGPAGADGVAGPA 421 Score = 53.9 bits (128), Expect = 7e-06 Identities = 49/163 (30%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASL--PQGAHLGSR 195 G+ P G + A G+ GP A G GPGG A P+ + + P GA Sbjct: 315 GADGPDGAQGPAGADGA-QGPAGADGAQGPVGPGGGEAGPAGPAGADGVAGPAGADGADG 373 Query: 196 GGAPPSVAGGYGASGPTSATFSNES---GSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAH 366 AG GA GP A + G ++ P GP+ G A Sbjct: 374 PDGAQGPAGADGAQGPAGADGAQGPVGPGGGEAGPAGPAGADGVAGPAGADGADGPDGAQ 433 Query: 367 GVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGA--IVSPA 489 G GA GPA + A P GG GPA P GA + PA Sbjct: 434 GPAGADGAQGPAGADGAQGPVGPGGGEAGPAGPAGADGVAGPA 476 [131][TOP] >UniRef100_C3RI07 Putative uncharacterized protein (Fragment) n=1 Tax=Mollicutes bacterium D7 RepID=C3RI07_9MOLU Length = 424 Score = 55.8 bits (133), Expect = 2e-06 Identities = 51/181 (28%), Positives = 63/181 (34%), Gaps = 13/181 (7%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSF-----SGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 S G+ P G + A GS +G A G +GP G A ++ S Sbjct: 49 STGAIGPTGPTGSTGATGSTGPTGATGEDGATGATGSTGPTGATGEDGATGATGSTGPTG 108 Query: 181 HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTA 360 G+ G P+ G GA+GPT AT E G+ PTGP+ G T Sbjct: 109 STGATGPTGPT--GATGATGPTGAT--GEDGA-----------TGPTGPTGATGEDGATG 153 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--------GGNGPAYPPGAIVSPASTATFNRLS 516 G GA GP P ATG G GP P GA +T Sbjct: 154 PTGATGEDGATGPTGATGPTGPTGATGEDGATGATGSTGPTGPTGATGEDGATGATGSTG 213 Query: 517 P 519 P Sbjct: 214 P 214 [132][TOP] >UniRef100_C2N7W6 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus ATCC 10876 
RepID=C2N7W6_BACCE Length = 1282 Score = 55.8 bits (133), Expect = 2e-06 Identities = 46/168 (27%), Positives = 63/168 (37%), Gaps = 22/168 (13%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S + + + G+ G Sbjct: 195 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGSTGATGAT-----GNTGATGST--G 247 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G++GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 248 VTGATGSTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 307 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG-----------GGNGPAYPPG 471 G G GP + P+ ATG G GP P G Sbjct: 308 IQGAKGATGDQGPQGIQGVPGPSGATGPQGVQGIQGPMGDIGPTGPEG 355 [133][TOP] >UniRef100_B4V7M7 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4V7M7_9ACTO Length = 269 Score = 55.8 bits (133), Expect = 2e-06 Identities = 49/163 (30%), Positives = 63/163 (38%), Gaps = 4/163 (2%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAP--GVYAGSGPGGHPASSYAPSSSASLP--QGAHLG 189 G+ PG G SG + P A +G G PASS + S+SAS P +GA Sbjct: 40 GAAAPGPERGAGENVAPRSGVEFQPLSAPDAPAGSTGSPASSASSSTSASAPGSEGAAGS 99 Query: 190 SRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 + G PP+ GG SAT +P P + P P GG A Sbjct: 100 TPGAGPPAAPGG-------SAT-----------RPGTSPAPGGSSPGPGAPSGGPAATQP 141 Query: 370 VPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTA 498 G GP + + P GG GPA P +SP + A Sbjct: 142 ATPRPGTPGPVTPTAPTTPP----GGGGPATPANLTLSPPARA 180 [134][TOP] >UniRef100_A8IZP2 Hydroxyproline-rich glycoprotein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IZP2_CHLRE Length = 585 Score = 55.8 bits (133), Expect = 2e-06 Identities = 48/160 (30%), Positives = 64/160 (40%), Gaps = 13/160 (8%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPY------APGVYAGSGPGGHPASSYAPSSSASLPQ 174 P YG PG+ G + A PPY AP YA + PG PA AP P Sbjct: 331 PPYGYAPPGAPPGAAGAP-----PPYGYALAGAPPPYAYAPPGAAPAPYGAPPPRPYAPA 385 Query: 175 GAHLGSR-------GGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSP 333 GA+ GS GA P AG Y G + ++++ P Q 
P+P Sbjct: 386 GAYPGSAPPGAYAPSGAGPGPAGAYQPPGTVAPAYASQ----------PVQGSAAGAPAP 435 Query: 334 HLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNG 453 AHGG + A G A+ ++ + A+G GNG Sbjct: 436 --AHGGAYGSSAAATGPAAAGAAAGGNSTVANNASGSGNG 473 [135][TOP] >UniRef100_Q4FX62 Proteophosphoglycan 5 n=1 Tax=Leishmania major strain Friedlin RepID=Q4FX62_LEIMA Length = 17392 Score = 55.8 bits (133), Expect = 2e-06 Identities = 48/176 (27%), Positives = 79/176 (44%), Gaps = 2/176 (1%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPA--SSYAPSSSASLPQGAHL 186 PS S P + SSA S S P A A S P+ SS APSSS+S A Sbjct: 8789 PSSSSSAPSA--SSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSSALSASS 8846 Query: 187 GSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAH 366 S + S +S P+S++ S S S S P + PS + +++ Sbjct: 8847 SSAPSSSSSAPSASSSSAPSSSSSSAPSASSSSA----PSSSSSSAPSASSSSAPSSSSS 8902 Query: 367 GVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAAAA 534 P ++ P+S +S+A PA+++ + + P A S A +++ + S ++++A Sbjct: 8903 SAPSASSSSAPSSSSSSAPPAFSSSAPSSSSSAPSASSSSAPSSSSSAPSASSSSA 8958 Score = 53.5 bits (127), Expect = 9e-06 Identities = 45/176 (25%), Positives = 79/176 (44%), Gaps = 2/176 (1%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPA--SSYAPSSSASLPQGAHL 186 PS S P S S+ + S S P A A S P+ SS APSSS+S P + Sbjct: 7609 PSSSSSAP-SASSSSAPSSSSSSAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSASSS 7667 Query: 187 GSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAH 366 + + S +S P+S++ S SGS S P + PS + +++ Sbjct: 7668 SAPSSSSSSAPSASSSSAPSSSSSSAPSGSSSSA----PSSSSSSAPSASSSSAPSSSSS 7723 Query: 367 GVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAAAA 534 P ++ P+S +SA + ++ + + P G+ S S+++ + S ++++A Sbjct: 7724 SAPSASSSSAPSSSSSAPSASSSSAPSSSSSAPSGSSSSAPSSSSSSAPSASSSSA 7779 [136][TOP] >UniRef100_B5DXL6 GA27145 n=1 Tax=Drosophila pseudoobscura pseudoobscura RepID=B5DXL6_DROPS Length = 875 Score = 55.8 bits (133), Expect = 2e-06 Identities = 50/171 (29%), Positives = 79/171 (46%), Gaps = 8/171 (4%) Frame = +1 Query: 34 
PGSVVGGSSAAGSFSGP-PYAPGVYAGSGPGGHPASSY-----APSSSASLPQGAH-LGS 192 P S S++ + GP P AP + + S P P+SSY PSSS S P ++ S Sbjct: 688 PSSSYSAPSSSSNSGGPYPAAPSI-SYSAPAAPPSSSYGAPATGPSSSYSAPSSSYGAPS 746 Query: 193 RGGAPPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 G + S + G G+S T+ +F S+ SGS P+ P S + G A G Sbjct: 747 SGSSSGSFSSGSGSSFSTAPSFGSSSSGSGSGGYPSAPSSSYSAPSSSY----GAPATGG 802 Query: 370 VPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPA 522 +GP+S S+A + ++ G+ P+ P + +PA + +N P+ Sbjct: 803 DSALSFPSGPSSSYSSAPASGSSSSGSYPSAPSSSYGAPAQDSGYNYSGPS 853 Score = 53.5 bits (127), Expect = 9e-06 Identities = 53/189 (28%), Positives = 81/189 (42%), Gaps = 16/189 (8%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAA--GSFSGP-----PYAP-GVYAGSGPGGHPASSY--APSSSAS 165 SYG+ GS G S+A S+ P P AP Y+ P + SY APSSS S Sbjct: 424 SYGAPSAGSSSGSFSSAPSSSYGAPSKGSFPSAPSSSYSAPSPSANSGGSYPSAPSSSYS 483 Query: 166 LPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAH 345 P + S G P + + Y A P+ +N GS+ + + P P+ S Sbjct: 484 APSPS-ANSGGSYPAAPSSSYSAPSPS----ANSGGSYPAAPSSSYSAPSPSANSGGSYP 538 Query: 346 GGVTAAHGVPRHHGANG------PASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFN 507 ++++ P +G P+S SA P A GG P+ P + +P+S++ Sbjct: 539 AAPSSSYSAPSPSANSGGSYPAAPSSSYSAPSPG-ANSGGPYPSAPSSSYSAPSSSSNSG 597 Query: 508 RLSPAAAAA 534 PAA ++ Sbjct: 598 GPYPAAPSS 606 Score = 53.5 bits (127), Expect = 9e-06 Identities = 50/178 (28%), Positives = 78/178 (43%), Gaps = 8/178 (4%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAP-GVYAGSGPGGHPASSY--APSSSASLPQGAHL 186 SY + S S +A S P AP Y+ PG + Y APSSS S P + Sbjct: 536 SYPAAPSSSYSAPSPSANSGGSYPAAPSSSYSAPSPGANSGGPYPSAPSSSYSAPSSSS- 594 Query: 187 GSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAH 366 S G P + + Y A P+ +N GS+ S + P P+ S ++++ Sbjct: 595 NSGGPYPAAPSSSYSAPSPS----ANSGGSYPSAPSSSYSAPSPSANSGGSYPSAPSSSY 650 Query: 367 GVP---RHHGANGPASLNSA--ALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAA 525 P + G + P++ +S+ A A + GGG PA P + +P+S++ PAA Sbjct: 651 SAPSPSANSGGSYPSAPSSSYGAPSASSNGGGPYPAAPSSSYSAPSSSSNSGGPYPAA 
708 [137][TOP] >UniRef100_B3MRJ3 GF20989 n=1 Tax=Drosophila ananassae RepID=B3MRJ3_DROAN Length = 907 Score = 55.8 bits (133), Expect = 2e-06 Identities = 50/158 (31%), Positives = 61/158 (38%) Frame = +1 Query: 61 AAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYGASG 240 AA S SG A G S GG A+ + +SSA G G+ G+ A G G Sbjct: 167 AASSGSGAG-ASGAGTVSSGGGSSANKVSAASSAQQLPGMATGAGAGSATPGAAGSGGGA 225 Query: 241 PTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAA 420 S S G+ S P +PP + PH G GA PA AA Sbjct: 226 TASRPVSAMGGTVSSTAGGAPSIPPISTMPPHTVPGSTNTTTTAMSGAGAAAPA----AA 281 Query: 421 LPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAAAA 534 L A G YP V+ AS N ++ AAAAA Sbjct: 282 LMASLLNPGQVGGYPGQTAVNNASLMDANSVTAAAAAA 319 [138][TOP] >UniRef100_Q01149 Collagen alpha-2(I) chain n=2 Tax=Mus musculus RepID=CO1A2_MOUSE Length = 1372 Score = 55.8 bits (133), Expect = 2e-06 Identities = 49/171 (28%), Positives = 61/171 (35%), Gaps = 17/171 (9%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGG 201 GS P G G +G APG SGPGG P A + G RG Sbjct: 613 GSRGPSGAPGPDGNKGE-AGAVGAPGSAGASGPGGLPGERGAAGIPGGKGEKGETGLRGD 671 Query: 202 APPS-------VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPH 336 + + G GA GP A+ E+G+ PA P+ P P GP+ Sbjct: 672 TGNTGRDGARGIPGAVGAPGPAGASGDRGEAGAAGPSGPAGPRGSPGERGEVGPAGPNGF 731 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPGAIVS 483 G G G GP N P + G G +GP PPG + S Sbjct: 732 AGPAGAAGQPGAKGEKGTKGPKGENGIVGPTGSVGAAGPSGPNGPPGPVGS 782 [139][TOP] >UniRef100_UPI0001B513DD multidomain-containing protein family n=1 Tax=Streptomyces lividans TK24 RepID=UPI0001B513DD Length = 413 Score = 55.5 bits (132), Expect = 2e-06 Identities = 51/153 (33%), Positives = 57/153 (37%) Frame = +1 Query: 37 GSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSV 216 G+ G A+G SGP APG G PG PA AP SS + P S Sbjct: 288 GAASGPDPASGPASGPAVAPGSGGGPAPGWWPAPGTAPGSSTAPPHDT---------ASA 338 Query: 217 AGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANG 396 A A GPTSA 
P P G +P A G + G G G Sbjct: 339 ADTAPAPGPTSA----------------PGTAPAPGSTPAPAPGTTGSTPGTSPAPGTAG 382 Query: 397 PASLNSAALPAYATGGGNGPAYPPGAIVSPAST 495 PA S A P A G PA PG +P ST Sbjct: 383 PARDTSYA-PGTAPVAGTTPA--PGTAPAPGST 412 [140][TOP] >UniRef100_UPI0001AE71B1 UPI0001AE71B1 related cluster n=1 Tax=Homo sapiens RepID=UPI0001AE71B1 Length = 1676 Score = 55.5 bits (132), Expect = 2e-06 Identities = 56/180 (31%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGA 180 G P G S A G+ G PP G+ GP G P P LP Sbjct: 787 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPLGPPGKDGLP--G 844 Query: 181 HLGSRGGA----------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 H G RG PP V G GA+G T E G P PP P G Sbjct: 845 HPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPM--GERG-----HPGPPGPPGEQGLP 897 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 898 GTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 957 [141][TOP] >UniRef100_UPI0000D60E9C UPI0000D60E9C related cluster n=1 Tax=Homo sapiens RepID=UPI0000D60E9C Length = 1629 Score = 55.5 bits (132), Expect = 2e-06 Identities = 56/180 (31%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGA 180 G P G S A G+ G PP G+ GP G P P LP Sbjct: 740 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPLGPPGKDGLP--G 797 Query: 181 HLGSRGGA----------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 H G RG PP V G GA+G T E G P PP P G Sbjct: 798 HPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPM--GERG-----HPGPPGPPGEQGLP 850 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 851 GTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 910 [142][TOP] >UniRef100_UPI0000D60E9B UPI0000D60E9B related cluster n=1 Tax=Homo sapiens RepID=UPI0000D60E9B Length = 1655 Score = 55.5 bits (132), Expect = 2e-06 Identities = 
56/180 (31%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGA 180 G P G S A G+ G PP G+ GP G P P LP Sbjct: 766 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPLGPPGKDGLP--G 823 Query: 181 HLGSRGGA----------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 H G RG PP V G GA+G T E G P PP P G Sbjct: 824 HPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPM--GERG-----HPGPPGPPGEQGLP 876 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 877 GTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 936 [143][TOP] >UniRef100_UPI0000D60E9A UPI0000D60E9A related cluster n=1 Tax=Homo sapiens RepID=UPI0000D60E9A Length = 1689 Score = 55.5 bits (132), Expect = 2e-06 Identities = 56/180 (31%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGA 180 G P G S A G+ G PP G+ GP G P P LP Sbjct: 800 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPLGPPGKDGLP--G 857 Query: 181 HLGSRGGA----------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 H G RG PP V G GA+G T E G P PP P G Sbjct: 858 HPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPM--GERG-----HPGPPGPPGEQGLP 910 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 911 GTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 970 [144][TOP] >UniRef100_UPI0000D60E99 UPI0000D60E99 related cluster n=1 Tax=Homo sapiens RepID=UPI0000D60E99 Length = 1710 Score = 55.5 bits (132), Expect = 2e-06 Identities = 56/180 (31%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGA 180 G P G S A G+ G PP G+ GP G P P LP Sbjct: 821 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPLGPPGKDGLP--G 878 Query: 181 HLGSRGGA----------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 H G RG PP V G GA+G T E G P PP P G Sbjct: 879 
HPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPM--GERG-----HPGPPGPPGEQGLP 931 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 932 GTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 991 [145][TOP] >UniRef100_UPI0000D60E98 UPI0000D60E98 related cluster n=1 Tax=Homo sapiens RepID=UPI0000D60E98 Length = 1715 Score = 55.5 bits (132), Expect = 2e-06 Identities = 56/180 (31%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGA 180 G P G S A G+ G PP G+ GP G P P LP Sbjct: 826 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPLGPPGKDGLP--G 883 Query: 181 HLGSRGGA----------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 H G RG PP V G GA+G T E G P PP P G Sbjct: 884 HPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPM--GERG-----HPGPPGPPGEQGLP 936 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 937 GTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 996 [146][TOP] >UniRef100_UPI0000D60E97 UPI0000D60E97 related cluster n=1 Tax=Homo sapiens RepID=UPI0000D60E97 Length = 1736 Score = 55.5 bits (132), Expect = 2e-06 Identities = 56/180 (31%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGA 180 G P G S A G+ G PP G+ GP G P P LP Sbjct: 847 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPLGPPGKDGLP--G 904 Query: 181 HLGSRGGA----------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 H G RG PP V G GA+G T E G P PP P G Sbjct: 905 HPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPM--GERG-----HPGPPGPPGEQGLP 957 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 958 GTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 1017 [147][TOP] >UniRef100_B9J1C9 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus Q1 RepID=B9J1C9_BACCQ Length = 1330 Score = 
55.5 bits (132), Expect = 2e-06 Identities = 42/154 (27%), Positives = 60/154 (38%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S + + + G+ G Sbjct: 195 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGSTGATGAT-----GNTGATGNT--G 247 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 + G G++GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 248 ITGATGSTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 307 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G+ G GP + P TG G GP Sbjct: 308 IQGITGATGDQGPQGIQGVIGPQGVTGATGDQGP 341 [148][TOP] >UniRef100_Q4MVJ1 Putative uncharacterized protein n=1 Tax=Bacillus cereus G9241 RepID=Q4MVJ1_BACCE Length = 1300 Score = 55.5 bits (132), Expect = 2e-06 Identities = 43/154 (27%), Positives = 59/154 (38%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S + + + G+ G Sbjct: 192 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGSTGATGAT-----GNTGATGST--G 244 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G +GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 245 VTGATGTTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 304 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G+ G GP + P TG G GP Sbjct: 305 IQGITGATGDQGPQGIQGVIGPQGVTGATGDQGP 338 [149][TOP] >UniRef100_C3GYK9 Putative uncharacterized protein n=1 Tax=Bacillus thuringiensis serovar huazhongensis BGSC 4BD1 RepID=C3GYK9_BACTU Length = 389 Score = 55.5 bits (132), Expect = 2e-06 Identities = 52/160 (32%), Positives = 64/160 (40%), Gaps = 8/160 (5%) Frame = +1 Query: 19 YGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASL-PQGAH--LG 189 YGS G GG++ A +GP G +GP G ++ A PQGA G Sbjct: 31 YGSGCLGG--GGATGATGATGPQGPAGATGATGPPGPAGATGATGPQGPQGPQGAQGPAG 88 Query: 190 SRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 ++G P G G GP AT + + Q +Q GP+ G T A G Sbjct: 89 
AQGATGPQ--GPQGIQGPAGATGATGATGAQGVQ----------GPAGATGATGATGAQG 136 Query: 370 VPRHHGANGPASLNSAALPAYATG-----GGNGPAYPPGA 474 V GA GP L PA ATG G GPA GA Sbjct: 137 VQGPAGATGPQGLQGIQGPAGATGPQGLQGIQGPAGATGA 176 [150][TOP] >UniRef100_C3ERC2 Collagen triple helix repeat domain protein n=1 Tax=Bacillus thuringiensis serovar kurstaki str. T03a001 RepID=C3ERC2_BACTK Length = 594 Score = 55.5 bits (132), Expect = 2e-06 Identities = 58/182 (31%), Positives = 70/182 (38%), Gaps = 13/182 (7%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAG--SFSGPPYAPGVYAGSGPGGH-----PASSYAPSSSASLPQGA 180 GS P G + A G +GP + G +GP G P S P+ S GA Sbjct: 127 GSTGPTGATGPTGATGPTGSTGPTGSTGPTGSTGPTGSTGSTGPTGSTGPTGSTG-STGA 185 Query: 181 HLGSRGGAPPSVA----GGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG 348 GS G P+ A G GA+GPT AT S +GS S TGP+ Sbjct: 186 -TGSTGSTGPTGATGPTGSTGATGPTGATGS--TGSTGS-----------TGPTGATGST 231 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPGAIVSPASTATFNRLSPA 522 GVT G GA GP + P ATG G GP G+ S ST P Sbjct: 232 GVTGPTGATGSTGATGPTGSTGSTGPTGATGPTGATGPTGSTGSTGSTGSTGPTGATGPT 291 Query: 523 AA 528 + Sbjct: 292 GS 293 [151][TOP] >UniRef100_C2V1W3 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus Rock3-28 RepID=C2V1W3_BACCE Length = 937 Score = 55.5 bits (132), Expect = 2e-06 Identities = 47/157 (29%), Positives = 62/157 (39%), Gaps = 16/157 (10%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S+ + P GA G+ G Sbjct: 206 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSG----STGVTGPTGA-TGNTG----- 255 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ----------PAPPQ----MPPPTGPSPHLAHGG 351 A G G +GPT +T + Q +Q P PQ +P PTG + G Sbjct: 256 -ATGQGLTGPTGSTGETGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQG 314 Query: 352 VTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 V G+ G GP + A P TG G GP Sbjct: 315 VQGIQGITGATGDQGPQGIQGAIGPQGVTGATGDQGP 351 [152][TOP] >UniRef100_B5H071 Putative uncharacterized protein (Fragment) n=1 Tax=Streptomyces clavuligerus ATCC 27064 
RepID=B5H071_STRCL Length = 1007 Score = 55.5 bits (132), Expect = 2e-06 Identities = 54/177 (30%), Positives = 65/177 (36%), Gaps = 13/177 (7%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPG-GHPASSYAPSSSASLPQGAHLGSRGGAPP 210 PG G+ AG+F P G G+GP G P A LP A GG P Sbjct: 773 PGQGRQGTGLAGAFGNRPPKNGSGRGTGPQQGGPGGPNAGDRGRQLPTPA----AGGPRP 828 Query: 211 SVAGGYGAS--GPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHH 384 + GG GA GP A P P P P H +GG+ G P+ Sbjct: 829 ELPGGPGAPQRGPNQA-------------PGPFGGPAADAPRGHEENGGLRGPGGSPQQG 875 Query: 385 GANGP-------ASLNSAALPAYATG---GGNGPAYPPGAIVSPASTATFNRLSPAA 525 G GP S A+ A G GG + PGA P +TA R+ P A Sbjct: 876 GPGGPFVRPDVFGSSQQCAVGGRAGGNPAGGPFASRNPGAEQDPTATAPMPRIDPGA 932 [153][TOP] >UniRef100_Q9BIT7 Major ampullate spidroin 2-like protein (Fragment) n=1 Tax=Nephila inaurata madagascariensis RepID=Q9BIT7_9ARAC Length = 1953 Score = 55.5 bits (132), Expect = 2e-06 Identities = 59/190 (31%), Positives = 73/190 (38%), Gaps = 15/190 (7%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPP-YAPG---------VYAGSGPGGHPASSYAPSSS 159 P YG G GGSSAA + +GP Y PG AGSGPGG+ P Sbjct: 1223 PGGYGPGQQGP--GGSSAAAAAAGPGRYGPGQQGPGAAAAAAAGSGPGGYGPGQQGPGGP 1280 Query: 160 ASLPQGAHLG-SRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 + A G GG P G G GP +A + G + Q P G + Sbjct: 1281 GAAAAAAAAGRGPGGYGP---GQQGPGGPGAAAAAAGPGGYGPGQQGP-------GAAAA 1330 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYP--PGAIVSPASTATF-- 504 A G +G P G GP + +AA G G G P PGA + A + Sbjct: 1331 AAAGSGPGGYG-PGQQGPGGPGAAAAAAAGRGPGGYGQGQQGPGGPGAAAAAAGPGGYGP 1389 Query: 505 NRLSPAAAAA 534 + P AAAA Sbjct: 1390 GQQGPGAAAA 1399 Score = 53.9 bits (128), Expect = 7e-06 Identities = 52/172 (30%), Positives = 64/172 (37%), Gaps = 11/172 (6%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPP-YAPG---------VYAGSGPGGHPASSYAPSSS 159 P YG G GG AA + +GP Y PG AG GPGG+ P S Sbjct: 816 PGGYGPGQQGP--GGPGAAAAAAGPGGYGPGQQGPGAAAAASAGRGPGGYGPGQQGPGGS 873 Query: 160 
ASLPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHL 339 + A G GG P G G +A G + Q P Q P + Sbjct: 874 GAAAAAAGRGP-GGYGPGQQGPGGPGAAAAAAAGRGPGGYGPGQQGPGQQGPGGSGAAAA 932 Query: 340 AHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYP-PGAIVSPAS 492 A G +G P G GP + +AA P GG GP PGA + A+ Sbjct: 933 AAGRGPGGYG-PGQQGPGGPGAAAAAAGP-----GGYGPGQQGPGAAAAAAA 978 [154][TOP] >UniRef100_Q2VLH2 Major ampullate spidroin 2-like (Fragment) n=1 Tax=Nephila inaurata madagascariensis RepID=Q2VLH2_9ARAC Length = 2069 Score = 55.5 bits (132), Expect = 2e-06 Identities = 59/190 (31%), Positives = 73/190 (38%), Gaps = 15/190 (7%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPP-YAPG---------VYAGSGPGGHPASSYAPSSS 159 P YG G GGSSAA + +GP Y PG AGSGPGG+ P Sbjct: 1356 PGGYGPGQQGP--GGSSAAAAAAGPGRYGPGQQGPGAAAAAAAGSGPGGYGPGQQGPGGP 1413 Query: 160 ASLPQGAHLG-SRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPH 336 + A G GG P G G GP +A + G + Q P G + Sbjct: 1414 GAAAAAAAAGRGPGGYGP---GQQGPGGPGAAAAAAGPGGYGPGQQGP-------GAAAA 1463 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYP--PGAIVSPASTATF-- 504 A G +G P G GP + +AA G G G P PGA + A + Sbjct: 1464 AAAGSGPGGYG-PGQQGPGGPGAAAAAAAGRGPGGYGQGQQGPGGPGAAAAAAGPGGYGP 1522 Query: 505 NRLSPAAAAA 534 + P AAAA Sbjct: 1523 GQQGPGAAAA 1532 [155][TOP] >UniRef100_Q26634 Alpha-1 collagen n=1 Tax=Strongylocentrotus purpuratus RepID=Q26634_STRPU Length = 1414 Score = 55.5 bits (132), Expect = 2e-06 Identities = 54/164 (32%), Positives = 61/164 (37%), Gaps = 10/164 (6%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGS--FSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAH 183 P GS G S A GS GP APG SGP G S+ AP P GA Sbjct: 989 PGPQGSRGEKGDTGASGANGSPGAPGPIGAPGAAGASGPRGETGSTGAPGPLG--PTGAR 1046 Query: 184 LGSRGGAPPS----VAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG- 348 GS G A PS AG G +GP LQ M P P G Sbjct: 1047 -GSTGPAGPSGPSGPAGERGETGPAGHKGHPGVSGLPGLQGTSGPMGEPGAPGEQGQQGT 1105 Query: 349 -GVTAAHGVPRHHGANGPASLNSAALP--AYATGGGNGPAYPPG 471 G+ A G + G +GP + P GG +GP PPG Sbjct: 1106 
RGLPGARGSNGNDGPSGPRGFDGPEGPRGPRGEGGSSGPPGPPG 1149 [156][TOP] >UniRef100_Q16985 Fibroin-1 (Fragment) n=1 Tax=Araneus diadematus RepID=Q16985_ARADI Length = 360 Score = 55.5 bits (132), Expect = 2e-06 Identities = 53/169 (31%), Positives = 69/169 (40%), Gaps = 7/169 (4%) Frame = +1 Query: 49 GGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGY 228 GG S AG+ Y G AGSG G A++ S+ A+ G G GA AGGY Sbjct: 129 GGGSGAGAGGAGGYGQGYGAGSGAGAGAAAAAGASAGAAGGYGGGAGVGAGAGAGAAGGY 188 Query: 229 GASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASL 408 G S + A +G+ + A G AA G +GA A Sbjct: 189 GQSYGSGAGAGAGAGAAAA------------------AGAGARAAGGYGGGYGAGAGAGA 230 Query: 409 NSAALPAYATGGGNGPAYPPGA---IVSPASTATF----NRLSPAAAAA 534 +AA + GG G Y GA V+ AS ++ NRLS A AA+ Sbjct: 231 GAAA--SAGASGGYGGGYGGGAGAGAVAGASAGSYGGAVNRLSSAGAAS 277 [157][TOP] >UniRef100_B4IJR5 GM13722 n=1 Tax=Drosophila sechellia RepID=B4IJR5_DROSE Length = 747 Score = 55.5 bits (132), Expect = 2e-06 Identities = 59/198 (29%), Positives = 71/198 (35%), Gaps = 25/198 (12%) Frame = +1 Query: 10 PPSYGSHVPGSVVGG------SSAAGSFSGPPYAPGVYAGSGP---------GGHPASS- 141 PP G H P + G + ++ GPP+ P GP GGHP Sbjct: 549 PPHMGPHQPPPGMSGLPPPPPHTGYANYGGPPHGPPPGPPGGPARPYYQPQYGGHPTPQP 608 Query: 142 -YAPSS----SASLPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQ 306 YAP S S P G+H S PP G G ++ E GS P PPQ Sbjct: 609 YYAPFSPYQQSYGPPPGSHYMSPRPPPPQHNGNLGHP------YAPEHGS----NPPPPQ 658 Query: 307 MPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYAT-GGGNGPAYPP---GA 474 P P H G P G G A+ + Y T G G GP PP GA Sbjct: 659 QQQQQQPPPGHLHEPSAGGPGAP--GGGAGAAAAAAPGAGVYPTPGAGAGPGAPPAAGGA 716 Query: 475 IVSPASTATFNRLSPAAA 528 + A+ A PA A Sbjct: 717 TLGEAAVAGGVAPPPATA 734 [158][TOP] >UniRef100_B3M1V5 GF17870 n=1 Tax=Drosophila ananassae RepID=B3M1V5_DROAN Length = 871 Score = 55.5 bits (132), Expect = 2e-06 Identities = 49/173 (28%), Positives = 64/173 (36%), Gaps = 4/173 (2%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSR 195 S+G+ P S G +A G P P 
+ P P+SSY + S Sbjct: 221 SFGTSAPSSSYGAQAAPSKSYGAPAPPPSKSYGAPAAPPSSSYGAPAPPS--------KS 272 Query: 196 GGAPPSVAGGYGASGPTSATFSNESGSFQS--LQPAPPQMP--PPTGPSPHLAHGGVTAA 363 GAPP+ + YGA SA S+ +S PAPP P PSP + Sbjct: 273 YGAPPAPSSSYGAPSAPSAPSSSYGSPSKSYGAPPAPPSQSYGAPAAPSP---------S 323 Query: 364 HGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPA 522 +G P PA S PA + PA P + +P N L PA Sbjct: 324 YGAP-------PAPSKSYGAPAPPSPSYGAPAPPSSSYGAPPQAPVSNYLPPA 369 [159][TOP] >UniRef100_Q5STP6 Collagen, type XI, alpha 2 n=1 Tax=Homo sapiens RepID=Q5STP6_HUMAN Length = 1650 Score = 55.5 bits (132), Expect = 2e-06 Identities = 56/180 (31%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGA 180 G P G S A G+ G PP G+ GP G P P LP Sbjct: 761 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPLGPPGKDGLP--G 818 Query: 181 HLGSRGGA----------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 H G RG PP V G GA+G T E G P PP P G Sbjct: 819 HPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPM--GERG-----HPGPPGPPGEQGLP 871 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 872 GTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 931 [160][TOP] >UniRef100_C9J3N1 Putative uncharacterized protein ENSP00000405291 n=1 Tax=Homo sapiens RepID=C9J3N1_HUMAN Length = 1693 Score = 55.5 bits (132), Expect = 2e-06 Identities = 56/180 (31%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSG-----PPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGA 180 G P G S A G+ G PP G+ GP G P P LP Sbjct: 761 GQRGPRGATGKSGAKGTSGGDGPHGPPGERGLPGPQGPNGFPGPKGPLGPPGKDGLP--G 818 Query: 181 HLGSRGGA----------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 H G RG PP V G GA+G T E G P PP P G Sbjct: 819 HPGQRGEVGFQGKTGPPGPPGVVGPQGAAGETGPM--GERG-----HPGPPGPPGEQGLP 871 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASL----NSAALPAYATG----GGNGPAYPPGAIVSP 486 G G P G +GPA L LP A G G GP+ PPG SP Sbjct: 872 
GTAGKEGTKGDPGPPGAPGKDGPAGLRGFPGERGLPGTAGGPGLKGNEGPSGPPGPAGSP 931 [161][TOP] >UniRef100_A0RUH1 Collagen type XI alpha 2 n=1 Tax=Cenarchaeum symbiosum RepID=A0RUH1_CENSY Length = 468 Score = 55.5 bits (132), Expect = 2e-06 Identities = 59/185 (31%), Positives = 66/185 (35%), Gaps = 29/185 (15%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGS-----FSGPPYAPGVYAGSGPGGHPASSYAPSSSASL 168 +QPP+ P S G AG GP APG + GP G P P S Sbjct: 120 EQPPA---EPPASSRGEKGPAGQPGERGDKGPAGAPGEHGDKGPIGPPGERGIPGSPG-- 174 Query: 169 PQG-----------AHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQP------- 294 PQG +G RG P AG G +GP + + G L P Sbjct: 175 PQGDKGPAGDKGITGDMGDRGDKGP--AGEPGETGPDGP--AGDKGDRGPLGPQGLPGER 230 Query: 295 --APPQMPP----PTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGP 456 A P PP PTG G T G P G GPA PA GG GP Sbjct: 231 GDAGPHGPPGDKGPTGERGPTGTKGETGPPGTPGDKGLQGPAGPEGGKGPA-GVEGGKGP 289 Query: 457 AYPPG 471 A PPG Sbjct: 290 AGPPG 294 [162][TOP] >UniRef100_UPI00016E6475 UPI00016E6475 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E6475 Length = 1741 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + +GPP G GP G P P LP H G RG Sbjct: 838 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 895 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 896 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 949 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G G+P G G P + A L GG GP PPG I SP Sbjct: 950 HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 993 [163][TOP] >UniRef100_UPI00016E6474 UPI00016E6474 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E6474 Length = 1792 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 
GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + +GPP G GP G P P LP H G RG Sbjct: 889 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 946 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 947 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 1000 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G G+P G G P + A L GG GP PPG I SP Sbjct: 1001 HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 1044 [164][TOP] >UniRef100_UPI00016E6473 UPI00016E6473 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E6473 Length = 1796 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + +GPP G GP G P P LP H G RG Sbjct: 893 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 950 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 951 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 1004 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G G+P G G P + A L GG GP PPG I SP Sbjct: 1005 HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 1048 [165][TOP] >UniRef100_UPI00016E644C UPI00016E644C related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E644C Length = 1799 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + +GPP G GP G P P LP H G RG Sbjct: 896 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 953 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 954 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 1007 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G 
G+P G G P + A L GG GP PPG I SP Sbjct: 1008 HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 1051 [166][TOP] >UniRef100_UPI00016E644B UPI00016E644B related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E644B Length = 1801 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + +GPP G GP G P P LP H G RG Sbjct: 898 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 955 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 956 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 1009 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G G+P G G P + A L GG GP PPG I SP Sbjct: 1010 HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 1053 [167][TOP] >UniRef100_UPI00016E644A UPI00016E644A related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E644A Length = 1812 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + +GPP G GP G P P LP H G RG Sbjct: 909 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 966 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 967 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 1020 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G G+P G G P + A L GG GP PPG I SP Sbjct: 1021 HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 1064 [168][TOP] >UniRef100_UPI00016E6426 UPI00016E6426 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E6426 Length = 1810 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + 
+GPP G GP G P P LP H G RG Sbjct: 907 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 964 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 965 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 1018 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G G+P G G P + A L GG GP PPG I SP Sbjct: 1019 HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 1062 [169][TOP] >UniRef100_UPI00016E6263 UPI00016E6263 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E6263 Length = 1729 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + +GPP G GP G P P LP H G RG Sbjct: 849 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 906 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 907 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 960 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G G+P G G P + A L GG GP PPG I SP Sbjct: 961 HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 1004 [170][TOP] >UniRef100_UPI00016E6262 UPI00016E6262 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E6262 Length = 1725 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + +GPP G GP G P P LP H G RG Sbjct: 845 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 902 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 903 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 956 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G G+P G G P + A L GG GP PPG I SP Sbjct: 957 
HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 1000 [171][TOP] >UniRef100_UPI00016E6261 UPI00016E6261 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E6261 Length = 1737 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/169 (31%), Positives = 59/169 (34%), Gaps = 24/169 (14%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASS--YAPSSSASLPQGAHLGSRGGA------- 204 G+S + +GPP G GP G P P LP H G RG Sbjct: 834 GTSGSDGPAGPPGERGPQGPQGPLGFPGPKGPNGPPGKDGLP--GHPGQRGETGFQGKTG 891 Query: 205 PPSVAGGYGASGPTSATF-SNESGSFQSLQPAPPQMPPPTG-------------PSPHLA 342 PP G G GPT T S E G P PP P G P P Sbjct: 892 PPGPGGVVGPQGPTGGTGPSGERG-----HPGPPGPPGEQGLPGAAGKEGGKGDPGPQ-G 945 Query: 343 HGGVTAAHGVPRHHGANG-PASLNSAALPAYATGGGNGPAYPPGAIVSP 486 H G G+P G G P + A L GG GP PPG I SP Sbjct: 946 HSGKAGPPGLPGFQGQRGLPGGMGPAGLK-----GGEGPQGPPGPIGSP 989 [172][TOP] >UniRef100_UPI00016E0385 UPI00016E0385 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E0385 Length = 1425 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/191 (28%), Positives = 68/191 (35%), Gaps = 21/191 (10%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGS----------GPGGHPASSYAPSSS 159 PP + SH PG + G+ A F G G+ +GS GP G P + A Sbjct: 105 PPGHPSH-PGGI--GAQMASGFDGKSGPQGMLSGSRGEAGTRGPPGPSGSPGQAGAQGPP 161 Query: 160 ASLPQGAHLGSRG-GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAP--PQMPPPTGPS 330 + H+GS G P + G G G +N F A P MP P G Sbjct: 162 GEVGDPGHMGSSGQRGPEGLMGKPGEDGEPGKPGNNGEMGFSGSPGARGFPGMPGPPGLK 221 Query: 331 PHLAHGGV---TAAHGVPRHHGANGPASLNSAALPAYATG-----GGNGPAYPPGAIVSP 486 H H G+ +G GA GP A P G G +GP+ PG P Sbjct: 222 GHKGHLGILGQKGENGAVGSKGATGPHGPMGAPGPMGPAGMPGERGRSGPSGTPGKRGVP 281 Query: 487 ASTATFNRLSP 519 S L P Sbjct: 282 GSVGKPGSLGP 292 [173][TOP] >UniRef100_UPI00016E0384 UPI00016E0384 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E0384 Length = 1435 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/191 (28%), Positives = 68/191 (35%), Gaps = 21/191 (10%) Frame = +1 Query: 10 
PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGS----------GPGGHPASSYAPSSS 159 PP + SH PG + G+ A F G G+ +GS GP G P + A Sbjct: 115 PPGHPSH-PGGI--GAQMASGFDGKSGPQGMLSGSRGEAGTRGPPGPSGSPGQAGAQGPP 171 Query: 160 ASLPQGAHLGSRG-GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAP--PQMPPPTGPS 330 + H+GS G P + G G G +N F A P MP P G Sbjct: 172 GEVGDPGHMGSSGQRGPEGLMGKPGEDGEPGKPGNNGEMGFSGSPGARGFPGMPGPPGLK 231 Query: 331 PHLAHGGV---TAAHGVPRHHGANGPASLNSAALPAYATG-----GGNGPAYPPGAIVSP 486 H H G+ +G GA GP A P G G +GP+ PG P Sbjct: 232 GHKGHLGILGQKGENGAVGSKGATGPHGPMGAPGPMGPAGMPGERGRSGPSGTPGKRGVP 291 Query: 487 ASTATFNRLSP 519 S L P Sbjct: 292 GSVGKPGSLGP 302 [174][TOP] >UniRef100_UPI00016E0382 UPI00016E0382 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E0382 Length = 1420 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/191 (28%), Positives = 68/191 (35%), Gaps = 21/191 (10%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGS----------GPGGHPASSYAPSSS 159 PP + SH PG + G+ A F G G+ +GS GP G P + A Sbjct: 100 PPGHPSH-PGGI--GAQMASGFDGKSGPQGMLSGSRGEAGTRGPPGPSGSPGQAGAQGPP 156 Query: 160 ASLPQGAHLGSRG-GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAP--PQMPPPTGPS 330 + H+GS G P + G G G +N F A P MP P G Sbjct: 157 GEVGDPGHMGSSGQRGPEGLMGKPGEDGEPGKPGNNGEMGFSGSPGARGFPGMPGPPGLK 216 Query: 331 PHLAHGGV---TAAHGVPRHHGANGPASLNSAALPAYATG-----GGNGPAYPPGAIVSP 486 H H G+ +G GA GP A P G G +GP+ PG P Sbjct: 217 GHKGHLGILGQKGENGAVGSKGATGPHGPMGAPGPMGPAGMPGERGRSGPSGTPGKRGVP 276 Query: 487 ASTATFNRLSP 519 S L P Sbjct: 277 GSVGKPGSLGP 287 [175][TOP] >UniRef100_UPI00016E035C UPI00016E035C related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E035C Length = 1427 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/191 (28%), Positives = 68/191 (35%), Gaps = 21/191 (10%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGS----------GPGGHPASSYAPSSS 159 PP + SH PG + G+ A F G G+ +GS GP G P + A Sbjct: 107 PPGHPSH-PGGI--GAQMASGFDGKSGPQGMLSGSRGEAGTRGPPGPSGSPGQAGAQGPP 163 Query: 160 
ASLPQGAHLGSRG-GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAP--PQMPPPTGPS 330 + H+GS G P + G G G +N F A P MP P G Sbjct: 164 GEVGDPGHMGSSGQRGPEGLMGKPGEDGEPGKPGNNGEMGFSGSPGARGFPGMPGPPGLK 223 Query: 331 PHLAHGGV---TAAHGVPRHHGANGPASLNSAALPAYATG-----GGNGPAYPPGAIVSP 486 H H G+ +G GA GP A P G G +GP+ PG P Sbjct: 224 GHKGHLGILGQKGENGAVGSKGATGPHGPMGAPGPMGPAGMPGERGRSGPSGTPGKRGVP 283 Query: 487 ASTATFNRLSP 519 S L P Sbjct: 284 GSVGKPGSLGP 294 [176][TOP] >UniRef100_UPI00016E0359 UPI00016E0359 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E0359 Length = 1419 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/191 (28%), Positives = 68/191 (35%), Gaps = 21/191 (10%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGS----------GPGGHPASSYAPSSS 159 PP + SH PG + G+ A F G G+ +GS GP G P + A Sbjct: 105 PPGHPSH-PGGI--GAQMASGFDGKSGPQGMLSGSRGEAGTRGPPGPSGSPGQAGAQGPP 161 Query: 160 ASLPQGAHLGSRG-GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAP--PQMPPPTGPS 330 + H+GS G P + G G G +N F A P MP P G Sbjct: 162 GEVGDPGHMGSSGQRGPEGLMGKPGEDGEPGKPGNNGEMGFSGSPGARGFPGMPGPPGLK 221 Query: 331 PHLAHGGV---TAAHGVPRHHGANGPASLNSAALPAYATG-----GGNGPAYPPGAIVSP 486 H H G+ +G GA GP A P G G +GP+ PG P Sbjct: 222 GHKGHLGILGQKGENGAVGSKGATGPHGPMGAPGPMGPAGMPGERGRSGPSGTPGKRGVP 281 Query: 487 ASTATFNRLSP 519 S L P Sbjct: 282 GSVGKPGSLGP 292 [177][TOP] >UniRef100_UPI00016E0334 UPI00016E0334 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E0334 Length = 1479 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/191 (28%), Positives = 68/191 (35%), Gaps = 21/191 (10%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGS----------GPGGHPASSYAPSSS 159 PP + SH PG + G+ A F G G+ +GS GP G P + A Sbjct: 159 PPGHPSH-PGGI--GAQMASGFDGKSGPQGMLSGSRGEAGTRGPPGPSGSPGQAGAQGPP 215 Query: 160 ASLPQGAHLGSRG-GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAP--PQMPPPTGPS 330 + H+GS G P + G G G +N F A P MP P G Sbjct: 216 GEVGDPGHMGSSGQRGPEGLMGKPGEDGEPGKPGNNGEMGFSGSPGARGFPGMPGPPGLK 275 Query: 331 PHLAHGGV---TAAHGVPRHHGANGPASLNSAALPAYATG-----GGNGPAYPPGAIVSP 486 
H H G+ +G GA GP A P G G +GP+ PG P Sbjct: 276 GHKGHLGILGQKGENGAVGSKGATGPHGPMGAPGPMGPAGMPGERGRSGPSGTPGKRGVP 335 Query: 487 ASTATFNRLSP 519 S L P Sbjct: 336 GSVGKPGSLGP 346 [178][TOP] >UniRef100_B2I413 Conserved hypothetical membrane protein n=2 Tax=Mycobacterium RepID=B2I413_MYCMM Length = 814 Score = 55.1 bits (131), Expect = 3e-06 Identities = 62/203 (30%), Positives = 73/203 (35%), Gaps = 28/203 (13%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 A P G+ + G GS G P PG G G GG P + P S GA Sbjct: 268 AASPLGGGAPSMSGLGSGGGGMGSGGGIPKMPG---GLGSGGMPGTGSNPLSGVGQMPGA 324 Query: 181 HLG--SRGGAPP-SVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSP------ 333 G + GG P S AGG + + +G + PAPP P P PSP Sbjct: 325 GSGLPNAGGLPTASNAGGASPLSAFNQGAAATAGMGGGIPPAPP--PAPASPSPAPSAGG 382 Query: 334 HLA------HGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGP------------- 456 H A GGV+ A P G PA+ SA GG P Sbjct: 383 HAAPAAAAPGGGVSPAAAQP---GVVAPAAPASAPTGVGVGAGGGAPMMLPPGSMGPPAA 439 Query: 457 AYPPGAIVSPASTATFNRLSPAA 525 A PP A PA T +PAA Sbjct: 440 AIPPPAATVPAGTVGSTNTAPAA 462 [179][TOP] >UniRef100_A9VS75 Collagen triple helix repeat n=1 Tax=Bacillus weihenstephanensis KBAB4 RepID=A9VS75_BACWK Length = 385 Score = 55.1 bits (131), Expect = 3e-06 Identities = 52/176 (29%), Positives = 64/176 (36%), Gaps = 14/176 (7%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPS--SSASLPQGAH 183 PP G P + G + +G GP G+ +GP G P P+ + A+ P G Sbjct: 86 PP--GPTGPTGITGATGPSGGPPGPTGPTGITGATGPSGGPPGPIGPTGITGATGPSGGP 143 Query: 184 LGSRG-----------GAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPP-PTGP 327 G G G PP G G +G T AT P PP PTGP Sbjct: 144 PGPTGPTGITGATGPSGGPPGPTGPTGITGATGAT-------------GPSGGPPGPTGP 190 Query: 328 SPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPAST 495 + G T G P G GP + A P+ G GP GA S ST Sbjct: 191 T---GITGATGPSGGP--PGPTGPTGITGATGPSGGPPGPTGPTGITGATGSTGST 241 [180][TOP] >UniRef100_A9EZ28 Protein kinase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EZ28_SORC5 Length = 721 Score 
= 55.1 bits (131), Expect = 3e-06 Identities = 58/198 (29%), Positives = 79/198 (39%), Gaps = 25/198 (12%) Frame = +1 Query: 13 PSYGSHV-PGSVVGGSSAAGSFS--GPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA- 180 P+ GS + PGS S A + + G P + A GGHP + A + + P A Sbjct: 344 PAIGSELGPGSSGASSWEAATMAAHGAPRGSAMDAAQAHGGHPGMAQAAPGAPNGPPSAL 403 Query: 181 ------HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLA 342 H G+ G APPS G GA + N + PA P GPS Sbjct: 404 HNTGAGHAGAHG-APPSWQGA-GAPHSAPVSLHNTGSGLHNTGPAYGAPAPAHGPS---- 457 Query: 343 HGGVTAAHGVPRHHGANGPASLNSAALPAYATGGG--------------NGPAYPPGAIV 480 A HG P H GP++ +SA + + TG G +GP+ P GA V Sbjct: 458 -----APHGAPAH----GPSAPHSAPVSLHNTGSGLHNAGPAYGALAPAHGPSAPHGAPV 508 Query: 481 SPASTAT-FNRLSPAAAA 531 S +T + + PA A Sbjct: 509 SLHNTGSGLHNAGPAYGA 526 [181][TOP] >UniRef100_A4T503 Conserved hypothetical alanine and proline rich protein n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4T503_MYCGI Length = 664 Score = 55.1 bits (131), Expect = 3e-06 Identities = 52/164 (31%), Positives = 64/164 (39%), Gaps = 6/164 (3%) Frame = +1 Query: 25 SHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGA 204 S S SS S GPP G +GS +S +PS+++ GA GA Sbjct: 202 SAASASTPSASSPMSSSGGPPSTGGASSGSPSASTSPASGSPSTASPTTSGA-----AGA 256 Query: 205 PPSVAGGYGAS----GPTSATFSNESGSFQSLQPA-PPQMPPPTGPSPHLAHGGVTAAHG 369 PS A GA+ P F N+S S PA P PP + P+P G A Sbjct: 257 QPSNASPAGAAKAQPSPIQQVF-NQSAPLASSAPAQSPAAPPSSAPAPTTPAGAAPTA-- 313 Query: 370 VPRHHGANGPASLNSAALP-AYATGGGNGPAYPPGAIVSPASTA 498 GA G S + P A A G PA PP + P S A Sbjct: 314 ---GTGAGGGLSTSGGPAPVAGAPAGAAPPAAPPVPLAPPTSPA 354 [182][TOP] >UniRef100_A4JMC1 Putative uncharacterized protein n=1 Tax=Burkholderia vietnamiensis G4 RepID=A4JMC1_BURVG Length = 715 Score = 55.1 bits (131), Expect = 3e-06 Identities = 42/138 (30%), Positives = 64/138 (46%) Frame = +1 Query: 121 GGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAP 300 G AS + +S+A+ GA ++ GA + G + PT++ S+ + PA Sbjct: 122 GAGAASGASAASAAAAGSGAAASAQHGASAAHPGSAAVAAPTASAVSSAP-----IAPAA 
176 Query: 301 PQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIV 480 P P + +P A G +++ G HGA+ A+ A PA GG +GP GAI Sbjct: 177 PAAPTSSANAP--AANGASSSAGASATHGASSAAT----AQPAAPVGGASGPHVWNGAIQ 230 Query: 481 SPASTATFNRLSPAAAAA 534 S S+A+ PAA A Sbjct: 231 SAPSSASEAAAQPAAGGA 248 [183][TOP] >UniRef100_C2X3H2 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus Rock4-18 RepID=C2X3H2_BACCE Length = 1289 Score = 55.1 bits (131), Expect = 3e-06 Identities = 47/157 (29%), Positives = 62/157 (39%), Gaps = 16/157 (10%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S+ + P GA G+ G Sbjct: 206 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSG----STGVTGPTGA-TGNTG----- 255 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ----------PAPPQ----MPPPTGPSPHLAHGG 351 A G G +GPT +T + Q +Q P PQ +P PTG + G Sbjct: 256 -ATGQGLTGPTGSTGETGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQG 314 Query: 352 VTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 V G+ G GP + A P TG G GP Sbjct: 315 VQGIQGIMGATGDQGPQGIQGAIGPQGVTGATGDQGP 351 [184][TOP] >UniRef100_C2VI92 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus Rock3-29 RepID=C2VI92_BACCE Length = 956 Score = 55.1 bits (131), Expect = 3e-06 Identities = 47/157 (29%), Positives = 62/157 (39%), Gaps = 16/157 (10%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S+ + P GA G+ G Sbjct: 195 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSG----STGMTGPTGA-TGNTG----- 244 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ----------PAPPQ----MPPPTGPSPHLAHGG 351 A G G +GPT +T + Q +Q P PQ +P PTG + G Sbjct: 245 -ATGQGLTGPTGSTGETGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQG 303 Query: 352 VTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 V G+ G GP + A P TG G GP Sbjct: 304 VQGIQGITGATGDQGPQGIQGAIGPQGVTGVTGDQGP 340 [185][TOP] >UniRef100_C2U3W8 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus Rock1-3 RepID=C2U3W8_BACCE Length = 926 Score = 55.1 bits (131), Expect = 
3e-06 Identities = 48/160 (30%), Positives = 62/160 (38%), Gaps = 19/160 (11%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S+ + P GA G+ G Sbjct: 195 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSG----STGVTGPTGA-TGNTG----- 244 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------------PAPPQ----MPPPTGPSPHLA 342 A G G +GPT +T + Q LQ P PQ +P PTG + Sbjct: 245 -ATGQGLTGPTGSTGETGAQGLQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQG 303 Query: 343 HGGVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 GV G+ G GP + A P TG G GP Sbjct: 304 IQGVQGIQGITGATGDQGPQGIQGAIGPQGVTGVTGDQGP 343 [186][TOP] >UniRef100_C2MS36 Collagen triple helix repeat domain protein n=1 Tax=Bacillus cereus m1293 RepID=C2MS36_BACCE Length = 1246 Score = 55.1 bits (131), Expect = 3e-06 Identities = 42/154 (27%), Positives = 59/154 (38%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S + + + G+ G Sbjct: 192 PTGITGPTGITGPSGGPPGPTGPTGATGPGGGPSGSTGATGAT-----GNTGATGNT--G 244 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 + G G +GPT +T + Q +Q P PQ +P PTG + GV Sbjct: 245 ITGATGTTGPTGSTGAQGLQGIQGIQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 304 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G+ G GP + P TG G GP Sbjct: 305 IQGITGATGDQGPQGIQGVIGPQGVTGATGDQGP 338 [187][TOP] >UniRef100_Q9BIU8 Flagelliform silk protein (Fragment) n=1 Tax=Argiope trifasciata RepID=Q9BIU8_ARGTR Length = 1002 Score = 55.1 bits (131), Expect = 3e-06 Identities = 56/179 (31%), Positives = 63/179 (35%), Gaps = 22/179 (12%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASL-----PQGAHLGSRG 198 P V GG AG G + PG AG GPGG P + P P G G G Sbjct: 385 PEGVGGGPGGAGP-GGAGFGPGGGAGFGPGGAPGAPGGPGGPGGPGGPGGPGGVGPGGAG 443 Query: 199 GAPPSVAGGYGASGPTSATFSNESGSFQ-------------SLQPAPPQMPPPTGPSPHL 339 G P AGG G +G T +G F PA P G P Sbjct: 444 GYGPGGAGGVGPAG-TGGFGPGGAGGFGPGGAGGFGPGGAGGFGPAGAGGYGPGGVGPGG 502 Query: 340 
AHG----GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATF 504 A G GV P G GP +++ A GGG G A P GA P A F Sbjct: 503 AGGFGPGGVGPGGSGPGGAGGEGPVTVDVDVSVGGAPGGGPGGAGPGGAGFGPGGGAGF 561 Score = 54.7 bits (130), Expect = 4e-06 Identities = 55/179 (30%), Positives = 62/179 (34%), Gaps = 22/179 (12%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASL-----PQGAHLGSRG 198 P V GG AG G + PG AG GPGG P + P P G G G Sbjct: 653 PEGVGGGPGGAGP-GGAGFGPGGGAGFGPGGAPGAPGGPGGPGGPGGPGGPGGVGPGGAG 711 Query: 199 GAPPSVAGGYGASGPTSATFSNESGSFQ-------------SLQPAPPQMPPPTGPSPHL 339 G P AGG+G G T +G F P P G P Sbjct: 712 GYGPGGAGGFGPGG-TGGFGPGGAGGFGPGGAGGFGPGGAGGFGPGGAGGYGPGGVGPGG 770 Query: 340 AHG----GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATF 504 A G GV P G GP +++ A GGG G A P GA P A F Sbjct: 771 AGGFGPGGVGPGGSGPGGAGGEGPVTVDVDVSVGGAPGGGPGGAGPGGAGFGPGGGAGF 829 Score = 53.5 bits (127), Expect = 9e-06 Identities = 55/182 (30%), Positives = 59/182 (32%), Gaps = 25/182 (13%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGP----PYAPGVYAGSGPGG---HPASSYAPSSS------ASLPQ 174 PG GG S G GP Y PG G GPGG A Y P + S P Sbjct: 839 PGGAAGGPSGPGGPGGPGGAGGYGPGGAGGYGPGGVGPGGAGGYGPGGAGGYGPGGSGPG 898 Query: 175 GAHLGSRGG------------APPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPP 318 GA G GG P V GG G +GP A F G+ AP P Sbjct: 899 GAGPGGAGGEGPVTVDVDVTVGPEGVGGGPGGAGPGGAGFGPGGGAGFGPGGAPGAPGGP 958 Query: 319 TGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTA 498 GP GG G GP + Y GG G V PA T Sbjct: 959 GGP------GG---------PGGPGGPGGVGPGGAGGYGPGGAGG--------VGPAGTG 995 Query: 499 TF 504 F Sbjct: 996 GF 997 [188][TOP] >UniRef100_B7QAA1 Alpha-1 collagen type III, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7QAA1_IXOSC Length = 507 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/179 (30%), Positives = 64/179 (35%), Gaps = 32/179 (17%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSS--------SASLPQG----AHLGSR 195 GS AG SGP Y PG + GG+P S AP S S P G G+ Sbjct: 20 
GSGGAGRPSGPAYRPG--SSGAAGGYPGSGGAPGSGGAGGYPGSGGYPGGGGAPGAAGAG 77 Query: 196 GGAPPSVAGGY------------------GASGPTSATFSNESGSFQSLQPAPPQMPPPT 321 GG P AGGY G +G + SG + P P Sbjct: 78 GGYPKPGAGGYPGSGGVGPGAPGSGGYGPGGAGKPGSGGKPGSGGYGGGYPGSGGYPGSG 137 Query: 322 GPSPHLAHGGVTAAHGVPRHHGA--NGPASLNSAALPAYATGGGNGPAYPPGAIVSPAS 492 G + GG + G P GA +GP S S Y GG G A PG+ P S Sbjct: 138 GSGGYPGSGGSSGPGGYPGPGGASSSGPGSYPSGGGGGYRPSGGTG-AGAPGSYGKPGS 195 [189][TOP] >UniRef100_A8XVD3 C. briggsae CBR-COL-147 protein n=1 Tax=Caenorhabditis briggsae RepID=A8XVD3_CAEBR Length = 290 Score = 55.1 bits (131), Expect = 3e-06 Identities = 45/161 (27%), Positives = 57/161 (35%), Gaps = 2/161 (1%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYG 231 G+ + G +GPP PG G GHP + P ++ G +G GG P + G Sbjct: 80 GAQSNGCPAGPPGPPGQPGAQGDAGHPGEAGKPGAN-----GVTIGLTGGNGPCITCPAG 134 Query: 232 ASGPTSATFS--NESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPAS 405 A GP A + + S Q A P P GP G G P G G Sbjct: 135 APGPAGAPGAPGPQGPSGAPGQDAVGGGPGPAGPQGPAGDAGAPGQPGAPGQPGNAGRGG 194 Query: 406 LNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 S P A G GP P G P + PA + Sbjct: 195 QRSRGTPGPA--GAPGPQGPAGGPGQPGQSGGAGAPGPAGS 233 [190][TOP] >UniRef100_A6YIY0 Major ampullate spidroin 2 n=1 Tax=Latrodectus hesperus RepID=A6YIY0_9ARAC Length = 3779 Score = 55.1 bits (131), Expect = 3e-06 Identities = 54/181 (29%), Positives = 70/181 (38%), Gaps = 18/181 (9%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 P + PGS G++AA + GP Y G G GPGG A++ A +++ P G G Sbjct: 2525 PDRQQGYGPGS--SGAAAAAAAGGPGY--GGQQGYGPGGAGAAAAAAAAAGPGPSGYGPG 2580 Query: 190 SRGGAPPSVA-------------GGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS 330 G A + A GYG SGP + GS + A G Sbjct: 2581 GAGAAAAAAAAGGSGPGGYGQGPSGYGPSGPGGQQGNGPGGSGAAAAAAAAAGGAGPGRQ 2640 Query: 331 PHLAHGG--VTAAHGVPRHHGANGPASLNSAALPAYATGG---GNGPAYPPGAIVSPAST 495 GG AA G P + G G + A A A GG G AY PG + A+ Sbjct: 2641 
QGYGPGGAAAAAAAGGPGYGGQQGYGPGGAGAAAAAAAGGAGPGRQQAYGPGGAGAAAAA 2700 Query: 496 A 498 A Sbjct: 2701 A 2701 Score = 55.1 bits (131), Expect = 3e-06 Identities = 51/174 (29%), Positives = 70/174 (40%), Gaps = 1/174 (0%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 P +G G GGS AA + + PG GPGG A++ A +++ S P G + Sbjct: 3548 PGFGGQ-QGYGPGGSGAAAAAAAGGAGPGRQQAYGPGGSGAAAAAAAAAGSGPSGYGPSA 3606 Query: 193 RGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGV 372 G + P +G G SGP G F Q P GPS Sbjct: 3607 AGPSGPGGSGAAGGSGP---------GGF-------GQGPAGYGPSG------------- 3637 Query: 373 PRHHGANGPASLNSAALPAYATGGGNGPA-YPPGAIVSPASTATFNRLSPAAAA 531 P GP + +AA A + GG GP+ Y P ++ S A++A SP A Sbjct: 3638 PGGQQGYGPGASGAAAAAAASGSGGYGPSQYVPSSVASSAASAASALSSPTTHA 3691 Score = 54.7 bits (130), Expect = 4e-06 Identities = 49/172 (28%), Positives = 69/172 (40%), Gaps = 8/172 (4%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGP----PYAPGVYAGSGPGGHPASSYAPSSSASLPQG 177 P G+ + GGS G GP P P G GPGG A++ A +++ S P G Sbjct: 231 PGGAGAAAGAAAAGGSGPGGYGQGPAAYGPSGPSGQQGYGPGGSGAAAAAAAAAGSGPSG 290 Query: 178 AHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG--- 348 G G P GG GA+ +A + G + Q + P+GPS +G Sbjct: 291 --YGPGAGGP----GGAGAAAAAAAAGGSGPGGYGQGQAS----YGPSGPSGQQGYGPGG 340 Query: 349 -GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTAT 501 G AA G +G +AA A + G G Y PG + A+ + Sbjct: 341 SGAAAAAAAAAGSGPSGYGPGAAAAAAAGSAGPGTQQGYGPGGSGAAAAAGS 392 Score = 54.7 bits (130), Expect = 4e-06 Identities = 51/169 (30%), Positives = 68/169 (40%), Gaps = 7/169 (4%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 P YG G GG+ AA + + PG GPGG A++ A +++ P G G+ Sbjct: 2657 PGYGGQ-QGYGPGGAGAAAAAAAGGAGPGRQQAYGPGGAGAAAAAAAAAGPGPSGYGPGA 2715 Query: 193 RGGAPPSVAGGYGASGPTSATFSNESGSF----QSLQPAPPQMPPPTGPSPHLAHGGVTA 360 G PS GG GA+ +A + G + P+ P GP A A Sbjct: 2716 SG---PSGTGGAGAAAAAAAAGGSGPGGYGQGASGYGPSGPGGQQGYGPGGSGAAAAAAA 2772 Query: 361 
AHGV--PRHHGANGPASLNSAALPAYATGGGNGP-AYPPGAIVSPASTA 498 A G P GP S +AA A G GP Y PG + A+ A Sbjct: 2773 AAGGAGPGRQQGYGPGSSGAAAAAAAGGPGYGGPQGYGPGGAGAAAAAA 2821 Score = 54.3 bits (129), Expect = 5e-06 Identities = 57/178 (32%), Positives = 71/178 (39%), Gaps = 5/178 (2%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASL---PQGAH 183 P YG G +G + AA + + PG GPGG A++ A + S P A Sbjct: 3137 PGYGGQ-QGYGLGVAGAAAAVAAGGAGPGRQQAYGPGGSGAAAAAAAGSGRSGYGPGAAG 3195 Query: 184 LGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAA 363 G G A + AGG G SG A SG+ + P G P A AA Sbjct: 3196 TGGAGAAAAAAAGGAG-SGRQQAYGPGGSGAAAASAAGGPGYGGQQGYGPGGAGAAAAAA 3254 Query: 364 HG--VPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAAA 531 G P A GP +AA A A+G G Y PGA P+ A + AAAA Sbjct: 3255 AGGAGPGTQQAYGPGGSGAAAAAAAASGPGPS-GYEPGA-AGPSGPAGAGAAAAAAAA 3310 Score = 53.9 bits (128), Expect = 7e-06 Identities = 55/182 (30%), Positives = 79/182 (43%), Gaps = 7/182 (3%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPP---YAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 P + ++ PG G ++AA + SGP Y PG SGP G A++ A ++ S P G Sbjct: 3260 PGTQQAYGPGGS-GAAAAAAAASGPGPSGYEPGAAGPSGPAGAGAAAAAAAAGGSGPGGY 3318 Query: 181 HLGSRGGAPPSVAG----GYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHG 348 G G P G G G SG +A + G+ Q Q +G + A G Sbjct: 3319 GQGPSGYGPSGPGGQQGYGPGGSGAAAAAAAAAGGAGPGRQQGYGQ--GSSGAAAAAAAG 3376 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 G +G + +G G + +AA+ A G G AY PG + A + + P AA Sbjct: 3377 G--PGYGGQQVYGPGGAGA--AAAVAAGGAGPGRQQAYGPGGSGAAAGSGP-SGYGPGAA 3431 Query: 529 AA 534 AA Sbjct: 3432 AA 3433 Score = 53.5 bits (127), Expect = 9e-06 Identities = 50/165 (30%), Positives = 64/165 (38%), Gaps = 8/165 (4%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVG------GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSAS 165 Q P YG PG G G++AA + + PG G GPGG A++ A Sbjct: 2601 QGPSGYGPSGPGGQQGNGPGGSGAAAAAAAAAGGAGPGRQQGYGPGGAAAAAAAGGPGYG 2660 Query: 166 LPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAH 345 QG G G A 
+ AGG +GP G+ + A P P+G P Sbjct: 2661 GQQGYGPGGAGAAAAAAAGG---AGPGRQQAYGPGGAGAAAAAAAAAGPGPSGYGP---- 2713 Query: 346 GGVTAAHGVPRHHGANGPASLNSA-ALPAYATGGGNGP-AYPPGA 474 GA+GP+ A A A A GG+GP Y GA Sbjct: 2714 -------------GASGPSGTGGAGAAAAAAAAGGSGPGGYGQGA 2745 [191][TOP] >UniRef100_A2R1W4 Differential expressed Arsa-7 from patent US2003215950-A1-Aspergillus niger n=1 Tax=Aspergillus niger CBS 513.88 RepID=A2R1W4_ASPNC Length = 406 Score = 55.1 bits (131), Expect = 3e-06 Identities = 59/150 (39%), Positives = 67/150 (44%), Gaps = 5/150 (3%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAP-GVYAGSGPGGHPASSY---APSSSASLPQGAH 183 S GS GS S + SF G AP GV G+GP P+ S+ APS A + Sbjct: 273 SQGSFEQGSSSEQGSGSSSFGGNGAAPSGVAGGNGPS--PSGSFGGAAPSGVAGGNGPSP 330 Query: 184 LGSRGGAPPS-VAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTA 360 GS GGA PS VAGG G S SGSF AP + GPSP + GG A Sbjct: 331 SGSFGGAAPSGVAGGNGPS---------PSGSFGGNGAAPSGVAGGNGPSPSGSFGGNGA 381 Query: 361 AHGVPRHHGANGPASLNSAALPAYATGGGN 450 A GA G A S A PA A G + Sbjct: 382 APS-----GAAGGAPAASGA-PAAAPSGAS 405 [192][TOP] >UniRef100_UPI000186E27C conserved hypothetical protein n=1 Tax=Pediculus humanus corporis RepID=UPI000186E27C Length = 607 Score = 54.7 bits (130), Expect = 4e-06 Identities = 52/174 (29%), Positives = 76/174 (43%), Gaps = 15/174 (8%) Frame = +1 Query: 34 PGSVVGGSSA----AGSFSG---PPYAPGVYAGS-GPGGHPASSYAPSSSASLPQGAHLG 189 PGS GG+ +GSF G P G + GS GP G P+ S+ + S P G+ G Sbjct: 424 PGSSFGGAQGPFGPSGSFGGSQGPSGPSGTFGGSQGPSG-PSESFGGNQGPSGPSGSFGG 482 Query: 190 SRGGAPPSVA--GGYGASGPTSATFSNESGSFQSLQPAP--PQMPP---PTGPSPHLAHG 348 S+G + PSV+ G G+ P + S +F P P P P G SP G Sbjct: 483 SQGTSGPSVSFVGQQGSRVPVTGGSPGPSSTFGPTTPTAGYPSASPTQRPGGYSPSGTSG 542 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNR 510 G T + G + ++ ++ P AT GP++ P + P S+ N+ Sbjct: 543 GYTPS-------GPSSTSAFGNSQRPVSAT--TTGPSFGPSSTFGPPSSRPNNQ 587 [193][TOP] >UniRef100_UPI0000DD8F95 Os04g0245000 n=1 Tax=Oryza sativa Japonica Group RepID=UPI0000DD8F95 Length = 1541 Score 
= 54.7 bits (130), Expect = 4e-06 Identities = 54/169 (31%), Positives = 60/169 (35%), Gaps = 6/169 (3%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 A PPS G+ P G +G P P + G GGH A P LP+G Sbjct: 1098 APPPPSIGAGAPPP----PPPPGGITGVPPPPPI---GGLGGHQAPPAPP-----LPEGI 1145 Query: 181 HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTG-----PSPHLAH 345 G PP GG G PP PPP G P P+ AH Sbjct: 1146 G----GVPPPPPVGGLGG---------------------PPAPPPPAGFRGGTPPPN-AH 1179 Query: 346 GGVTAAHGVPRHHGA-NGPASLNSAALPAYATGGGNGPAYPPGAIVSPA 489 GGV PR HG GP + A P G GP PPG PA Sbjct: 1180 GGVAPPPPPPRGHGGVGGPPTPPGAPAPPMPPGVPGGPPPPPGGRGLPA 1228 [194][TOP] >UniRef100_UPI00017B2D12 UPI00017B2D12 related cluster n=1 Tax=Tetraodon nigroviridis RepID=UPI00017B2D12 Length = 1568 Score = 54.7 bits (130), Expect = 4e-06 Identities = 57/187 (30%), Positives = 77/187 (41%), Gaps = 27/187 (14%) Frame = +1 Query: 10 PPSYGSHVPGSVVG---GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 PP H PG V G + GS SGP Y P G + SY S + S Sbjct: 120 PPVSPHHTPGGPVYPGMGPYSQGSPSGP-YGPQGSQYGHQGNYHRPSYGGSGATSYSGSN 178 Query: 181 HLGSRGGAPPSVAGGYGASGPTSATFSNESGS----FQSLQPAPPQMPPPTGPS---PHL 339 +LG G+P G G+S P ++ SGS + ++ P P MP P GP P L Sbjct: 179 NLGMNAGSPGL---GQGSSQPIPVRRNHGSGSQNRGYPAMAPISPSMPHPVGPGMGPPSL 235 Query: 340 A------HGGVTAA------HGVPRHHGANGPASLNSAALPAYATGG-----GNGPAYPP 468 A G AA HG + G + P+++ + + TG GNG A P Sbjct: 236 AASNRKPQEGTVAANSTQSRHGTYQGPGVSQPSTMATIVPYSQPTGNNSSDMGNGQA--P 293 Query: 469 GAIVSPA 489 G ++PA Sbjct: 294 GYTIAPA 300 [195][TOP] >UniRef100_Q1B7N1 Putative uncharacterized protein n=1 Tax=Mycobacterium sp. 
MCS RepID=Q1B7N1_MYCSS Length = 771 Score = 54.7 bits (130), Expect = 4e-06 Identities = 51/184 (27%), Positives = 70/184 (38%), Gaps = 28/184 (15%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG+ V S G+ + PP P S G P + AP + P + A + Sbjct: 18 PGAPVAASGGVGAPAAPPAVPAGVVDSSSGVTPPAPAAPPAGVVQPAAGAVPPAPSAVGA 77 Query: 214 VAGGYGASG-------PTSATFSNESGSFQSLQPAPPQ--------MPPPTGPSPHLAHG 348 AGG G +G P +A +G+ PAPP + PP P+P G Sbjct: 78 PAGGSGGAGAPAAPPAPPAAVVEPAAGATPPAPPAPPAAVVEPASGVTPPAPPAPGGPAG 137 Query: 349 GVTAAHGVPRH--------HGANG--PASLNSAALPAYATGGG---NGPAYPPGAIVSPA 489 G A P A+G P + + PA +GG GP PP A+V PA Sbjct: 138 GSGGAVTPPGPPAPPAAVVEPASGVTPPAPPAPGGPAGGSGGAVTPPGPPAPPAAVVEPA 197 Query: 490 STAT 501 + T Sbjct: 198 AGVT 201 [196][TOP] >UniRef100_A8LHL3 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LHL3_FRASN Length = 391 Score = 54.7 bits (130), Expect = 4e-06 Identities = 53/172 (30%), Positives = 73/172 (42%), Gaps = 7/172 (4%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGPPYA---PGVYAGSGPGGHPASSYAPSSSASLPQ 174 + PS G VP S GG+ S SGPP A P AG P H S+A S AS Sbjct: 11 ESSPSSGP-VP-SPAGGNPQPLSTSGPPQASTWPAPQAGGEPAPHATGSHAAGSGASQAP 68 Query: 175 GAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPP---TGPSPHLAH 345 G GS + P G A G S+ + G+ P P PPP +G P + Sbjct: 69 G-WTGSPAWSGPPPPGPGSAPGEVSSRAAASPGA-----PVPGVSPPPRVASGALPRWSL 122 Query: 346 GGVTAAHGVPRHHGANGPASLNSAALPAYATGGG-NGPAYPPGAIVSPASTA 498 G A + A++N+++L + GGG G P +++PA +A Sbjct: 123 GRTAVAGAIALALAVGAAAAVNASSLGSDGAGGGPGGLRGGPFQVMNPAGSA 174 [197][TOP] >UniRef100_Q01HL2 H0211F06-OSIGBa0153M17.6 protein n=1 Tax=Oryza sativa RepID=Q01HL2_ORYSA Length = 1510 Score = 54.7 bits (130), Expect = 4e-06 Identities = 54/169 (31%), Positives = 60/169 (35%), Gaps = 6/169 (3%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 A PPS G+ P G +G P P + G GGH A P LP+G Sbjct: 1098 APPPPSIGAGAPPP----PPPPGGITGVPPPPPI---GGLGGHQAPPAPP-----LPEGI 1145 Query: 181 
HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTG-----PSPHLAH 345 G PP GG G PP PPP G P P+ AH Sbjct: 1146 G----GVPPPPPVGGLGG---------------------PPAPPPPAGFRGGTPPPN-AH 1179 Query: 346 GGVTAAHGVPRHHGA-NGPASLNSAALPAYATGGGNGPAYPPGAIVSPA 489 GGV PR HG GP + A P G GP PPG PA Sbjct: 1180 GGVAPPPPPPRGHGGVGGPPTPPGAPTPPMPPGVPGGPPPPPGGRGLPA 1228 [198][TOP] >UniRef100_B9FE31 Putative uncharacterized protein n=1 Tax=Oryza sativa Japonica Group RepID=B9FE31_ORYSJ Length = 1980 Score = 54.7 bits (130), Expect = 4e-06 Identities = 54/169 (31%), Positives = 60/169 (35%), Gaps = 6/169 (3%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 A PPS G+ P G +G P P + G GGH A P LP+G Sbjct: 1409 APPPPSIGAGAPPP----PPPPGGITGVPPPPPI---GGLGGHQAPPAPP-----LPEGI 1456 Query: 181 HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTG-----PSPHLAH 345 G PP GG G PP PPP G P P+ AH Sbjct: 1457 G----GVPPPPPVGGLGG---------------------PPAPPPPAGFRGGTPPPN-AH 1490 Query: 346 GGVTAAHGVPRHHGA-NGPASLNSAALPAYATGGGNGPAYPPGAIVSPA 489 GGV PR HG GP + A P G GP PPG PA Sbjct: 1491 GGVAPPPPPPRGHGGVGGPPTPPGAPAPPMPPGVPGGPPPPPGGRGLPA 1539 [199][TOP] >UniRef100_B1B5J3 RHYTHM OF CHLOROPLAST 15 n=1 Tax=Chlamydomonas reinhardtii RepID=B1B5J3_CHLRE Length = 631 Score = 54.7 bits (130), Expect = 4e-06 Identities = 49/155 (31%), Positives = 64/155 (41%), Gaps = 3/155 (1%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAH 183 QQ PS G+ G S AA S +P V A + P S+ A +++ S P AH Sbjct: 485 QQRPSDGATAADGTAGCSPAAVS------SPAVAAAA-----PPSTAAAAATPSAPHSAH 533 Query: 184 LGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAA 363 S G S +GG G G S + + SGS + PP+ P+P A AA Sbjct: 534 KPSTHGQGSSGSGGSGCGGSGSGSGGHGSGSSARAGSKRSEPEPPSRPTPQRAVAVTEAA 593 Query: 364 HGVPRHHGANGPASLNSA---ALPAYATGGGNGPA 459 H + + NSA A A A GNG A Sbjct: 594 LASSAHPAGSSGSGRNSAGGSAAAATAAAAGNGVA 628 [200][TOP] >UniRef100_Q58MY1 Phage tail fiber-like protein n=1 Tax=Prochlorococcus phage P-SSM2 RepID=Q58MY1_BPPRM Length = 597 
Score = 54.7 bits (130), Expect = 4e-06 Identities = 53/180 (29%), Positives = 67/180 (37%), Gaps = 28/180 (15%) Frame = +1 Query: 76 SGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGG-------APPSVAGG--- 225 +GPP G+ SGP G P + PQG +G GG PP AGG Sbjct: 177 AGPPGPTGITGPSGPPGPSGPGGGPGPAG--PQG-DVGPSGGPGPTGPAGPPGPAGGPPG 233 Query: 226 ----YGASGPTSATFSNESGS-FQSLQPAPPQMPPPTGPSPHLAHGGVT----------- 357 G +GPT T +GS + P P P PTGP+ G T Sbjct: 234 PQGPQGDAGPTGPTGPPGTGSPGPAGPPGPSGGPGPTGPAGPTGPDGPTGPTGPAGGPPG 293 Query: 358 --AAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAAA 531 G P G +GPA + + P +GG GP+ PG P PA +A Sbjct: 294 PPGPSGPPGPSGGDGPAGPSGSPGPPGPSGGPPGPSGGPGPAGPPGPDGPSGPPGPAGSA 353 [201][TOP] >UniRef100_Q9BIU1 Major ampullate spidroin 2 (Fragment) n=1 Tax=Gasteracantha cancriformis RepID=Q9BIU1_GASCA Length = 342 Score = 54.7 bits (130), Expect = 4e-06 Identities = 57/199 (28%), Positives = 68/199 (34%), Gaps = 29/199 (14%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASS-----YAPSSSASLPQGAHL 186 G + PGS GG G SG PG GPG A++ Y P S QG Sbjct: 66 GGYGPGSGQGGPGQQGPGSGGQQGPGGQGPYGPGAAAAAAAAAGGYGPGSGQGGQQGPGS 125 Query: 187 GSRGGAPPSVAGGYGASGPTSATFSNESGSF--QSLQPAPPQMPPPTGP----------S 330 G GG G GP++A + G + + Q P Q P +G Sbjct: 126 QGPGSGGQQGPGGQGPYGPSAAAAAAAVGGYGPGAGQQGPGQQGPGSGGQRGPGGQGPYG 185 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGP------------AYPPGA 474 P A AA G G GP + GG GP Y PG+ Sbjct: 186 PGAAAAAAAAAGGYGPASGQQGPGQQGPGS-GGQRGPGGQGPYGPGAAAAASAGGYGPGS 244 Query: 475 IVSPASTATFNRLSPAAAA 531 SPAS A SP A A Sbjct: 245 GGSPASGAASRLSSPQAGA 263 [202][TOP] >UniRef100_C3XWW9 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3XWW9_BRAFL Length = 309 Score = 54.7 bits (130), Expect = 4e-06 Identities = 50/156 (32%), Positives = 59/156 (37%), Gaps = 8/156 (5%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGP-PYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPP 210 P VG A G P P PG AG+GP G P S +P + G G P Sbjct: 113 
PEGPVGPKGAEGERGAPGPPGPGGQAGTGPPGPPGSPGSPGEKGATGPAGPKGREG--PR 170 Query: 211 SVAGGYGASGPTSATFSNES-GSFQSLQPAPPQ----MPPPTGPSPHLAHGGVTAAHGVP 375 G G GP S S G ++ PA P+ P P GP+ G + G P Sbjct: 171 GPVGPQGLRGPVGPPGSPGSPGLKGAVGPAGPKGRGGPPGPRGPT------GPSGLPGSP 224 Query: 376 RHHGANGPASLNSAALPAYATG--GGNGPAYPPGAI 477 GA GPA P G G GP PPG + Sbjct: 225 GEKGATGPAGPKGGEGPLGPVGPQGRVGPPGPPGPV 260 [203][TOP] >UniRef100_B9PUT7 Protein transport protein sec13, putative n=2 Tax=Toxoplasma gondii RepID=B9PUT7_TOXGO Length = 654 Score = 54.7 bits (130), Expect = 4e-06 Identities = 54/181 (29%), Positives = 67/181 (37%), Gaps = 4/181 (2%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 A Q +GS P S S PP P A HP S PSS SLPQ Sbjct: 412 APQLQPHGSAAPLGAYPPSHPPSLSSSPPTHPAHGAS-----HPPLSSFPSSHPSLPQNP 466 Query: 181 HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTG---PSPHL-AHG 348 G PPS A GP + PPQ P G P+P A+ Sbjct: 467 APGPLSATPPSTAATPRPLGPAAG--------------QPPQGSPTPGVAFPAPGAPAYP 512 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAA 528 G A+ G+ P S PA+A G A+PP V PA T+ + +P+ A Sbjct: 513 GTPASAGLYGPPTPGAPGGAQSYPQPAFAAPYPQGSAFPPA--VQPAQTSLGGQQAPSPA 570 Query: 529 A 531 + Sbjct: 571 S 571 [204][TOP] >UniRef100_B4IKV7 GM11218 n=1 Tax=Drosophila sechellia RepID=B4IKV7_DROSE Length = 1272 Score = 54.7 bits (130), Expect = 4e-06 Identities = 61/182 (33%), Positives = 78/182 (42%), Gaps = 14/182 (7%) Frame = +1 Query: 31 VPGSVVGGSSAAGSFSGP-PYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGA- 204 VP + SS AG+ +G + V +G G G S A S+ S QGA G+ GG+ Sbjct: 161 VPATPKSSSSGAGASTGSGTSSAAVTSGPGSGSTKVSVAASSAQQSGLQGA-TGAGGGSS 219 Query: 205 ------PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGG----V 354 P S AGG A+ P SA G+ S P +PP + PH G Sbjct: 220 SAPGTQPGSGAGGAIAARPVSAM----GGTVSSTAGGAPSIPPISTMPPHTVPGSTNTTT 275 Query: 355 TAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYP--PGAIVSPASTATFNRLSPAAA 528 TA G GA G A+ N+AAL A G AYP PG +S+ + AA Sbjct: 276 
TAMAGGVGGPGAAG-ANPNAAALMASLLNAGQTGAYPGAPGQTAVNSSSLLDGSTAAVAA 334 Query: 529 AA 534 AA Sbjct: 335 AA 336 [205][TOP] >UniRef100_B3RLH9 Putative uncharacterized protein (Fragment) n=1 Tax=Trichoplax adhaerens RepID=B3RLH9_TRIAD Length = 181 Score = 54.7 bits (130), Expect = 4e-06 Identities = 48/164 (29%), Positives = 64/164 (39%), Gaps = 8/164 (4%) Frame = +1 Query: 55 SSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYGA 234 SS A SG Y P A S HP+++Y PSS+ P G A P + GY Sbjct: 10 SSTAYPPSGTAYPPSSTAQS----HPSTAYPPSSTGYPPSGTAYPPSSTAQPHPSTGYPP 65 Query: 235 SG---PTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPAS 405 SG P S+T + Q PP + H + G P A P+S Sbjct: 66 SGTAYPPSSTAQPHPSTAQPHPSTGTAYPPSSTAQKHPPTAQPPPSTGYPPSGAAYPPSS 125 Query: 406 L-----NSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPA 522 + P+ A + AYPP +P STAT+ +PA Sbjct: 126 TVYPPSGAVYPPSTAAYPPSTAAYPPSGTANPTSTATYPPSAPA 169 [206][TOP] >UniRef100_A9UV04 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UV04_MONBE Length = 237 Score = 54.7 bits (130), Expect = 4e-06 Identities = 48/188 (25%), Positives = 66/188 (35%), Gaps = 24/188 (12%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLG 189 PP S + SA G P P A S P ++AP S+ +P A Sbjct: 10 PPMPNSAAYATPPTQPSATGPVPSAPQGPSTSAQSAGSVFPGPNHAPQSAPPMPNSAAYA 69 Query: 190 S-------RGGAPPSVAGGYGASGPTSATFSNESGSFQSLQP--------APPQMPPPTG 324 + G AP + G ++ + F + QS P PP P TG Sbjct: 70 TPPTQPSATGPAPSAPQGSSTSAQSAGSVFPGPHHAPQSAPPMPNSAAYATPPTQPSATG 129 Query: 325 PSPHLAHGGVTAAHGV------PRHHGANGPASLNSAAL---PAYATGGGNGPAYPPGAI 477 P+P G T+A P H + P NSAA P + G P+ P G+ Sbjct: 130 PAPSAPQGSSTSAQSAGSVFPGPNHAPQSAPPMPNSAAYATPPTQPSATGPAPSAPQGSS 189 Query: 478 VSPASTAT 501 S S + Sbjct: 190 TSAQSAGS 197 [207][TOP] >UniRef100_C9SR21 DNA-directed RNA polymerase II subunit RPB1 n=1 Tax=Verticillium albo-atrum VaMs.102 RepID=C9SR21_9PEZI Length = 1756 Score = 54.7 bits (130), Expect = 4e-06 Identities = 51/176 (28%), Positives = 72/176 (40%), Gaps = 15/176 (8%) Frame 
= +1 Query: 43 VVGGSSAAGSFSGPPYAP--GVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSV 216 +VG S + G Y G + G+ PG A+S +S S P + G+ G +P S Sbjct: 1526 IVGAGSDDNTGFGTEYGGTYGGFGGASPGRAGATSPFTTSPTS-PFSSFAGAGGYSPTSP 1584 Query: 217 AGGYGASGP---------TSATFSNESGSFQSL----QPAPPQMPPPTGPSPHLAHGGVT 357 GGY + P TS FS S SF +P P P + SP + T Sbjct: 1585 GGGYSPTSPLMDGGARYATSPQFSPSSPSFSPTSPVHRPTSPASPNYSPTSPSYSPTSPT 1644 Query: 358 AAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAA 525 + PRH+ PA NS P+Y+ P+Y P + + T SPA+ Sbjct: 1645 S----PRHYSPTSPAQFNSPTSPSYSPA---SPSYSPTSPNLHGAGPTSPSYSPAS 1693 [208][TOP] >UniRef100_Q7XWS7 Formin-like protein 12 n=1 Tax=Oryza sativa Japonica Group RepID=FH12_ORYSJ Length = 1669 Score = 54.7 bits (130), Expect = 4e-06 Identities = 54/169 (31%), Positives = 60/169 (35%), Gaps = 6/169 (3%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGA 180 A PPS G+ P G +G P P + G GGH A P LP+G Sbjct: 1098 APPPPSIGAGAPPP----PPPPGGITGVPPPPPI---GGLGGHQAPPAPP-----LPEGI 1145 Query: 181 HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTG-----PSPHLAH 345 G PP GG G PP PPP G P P+ AH Sbjct: 1146 G----GVPPPPPVGGLGG---------------------PPAPPPPAGFRGGTPPPN-AH 1179 Query: 346 GGVTAAHGVPRHHGA-NGPASLNSAALPAYATGGGNGPAYPPGAIVSPA 489 GGV PR HG GP + A P G GP PPG PA Sbjct: 1180 GGVAPPPPPPRGHGGVGGPPTPPGAPAPPMPPGVPGGPPPPPGGRGLPA 1228 [209][TOP] >UniRef100_UPI0001868CED hypothetical protein BRAFLDRAFT_129955 n=1 Tax=Branchiostoma floridae RepID=UPI0001868CED Length = 703 Score = 54.3 bits (129), Expect = 5e-06 Identities = 58/175 (33%), Positives = 65/175 (37%), Gaps = 13/175 (7%) Frame = +1 Query: 34 PGSVVGGSSAAGSFS-GPPYAPGVYAGSGPGGHPASSYAPS----SSASLPQG-AHLGSR 195 P G AG S GPP PG GP G PAS P A P G +G Sbjct: 121 PPGPPGEKGPAGPVSVGPPGPPGEKGAMGPAG-PASVGPPGPPEEKGAMGPAGPVSVGPP 179 Query: 196 GGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVP 375 G PP G G +GP S G ++ PA P P GP G G P Sbjct: 180 G--PPGEKGAMGPAGPVSVGPPGPPGDKGAMGPAGPVSVGPPGPP---GEKGAMGPPGPP 234 Query: 376 
RHHGANGPA---SLNSAALPAYATG--GGNGPAYPPGAI--VSPASTATFNRLSP 519 GA GPA A P +G G GPA P G + PA +F R P Sbjct: 235 GEKGAMGPAGPPGEKGAMGPTGPSGEKGAVGPAGPLGKTGPIGPAGPVSFGRPGP 289 [210][TOP] >UniRef100_UPI00015B6358 PREDICTED: hypothetical protein n=1 Tax=Nasonia vitripennis RepID=UPI00015B6358 Length = 441 Score = 54.3 bits (129), Expect = 5e-06 Identities = 52/166 (31%), Positives = 72/166 (43%), Gaps = 7/166 (4%) Frame = +1 Query: 13 PSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSY-APSSSASLPQGAHLG 189 P+ G G G A S G P + G + GS GG P+SSY APS+ S GA Sbjct: 42 PNLGGGGGGGGGFGGGAPSSSYGAPSSGGGFGGSFGGGAPSSSYGAPSTGGSFGGGAPSS 101 Query: 190 SRGGAPPSVAGGYGAS-GPTSATFSNESGSFQSLQPAPPQMPPPTGPS-----PHLAHGG 351 S G PS G +G S G + + S + SF P+ P G S P ++G Sbjct: 102 SYGA--PSSGGSFGGSFGGGAPSSSYGAPSFGGNAPSSSYGAPSAGGSFGGGAPSNSYGP 159 Query: 352 VTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPA 489 ++++G P G+ G +S S GG G P + +PA Sbjct: 160 PSSSYGAPSAGGSFGGSSGGS-------FGGSFGGGAPSSSYGAPA 198 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/165 (32%), Positives = 72/165 (43%), Gaps = 6/165 (3%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSY-APSSSASLPQGAHLGS 192 SYG+ G GGS G+ S AP GS GG P+SSY APSS S G G Sbjct: 62 SYGAPSSGGGFGGSFGGGAPSSSYGAPST-GGSFGGGAPSSSYGAPSSGGSF--GGSFG- 117 Query: 193 RGGAPPSVAG--GYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTGPSPHLAHG--GVT 357 GGAP S G +G + P+S+ + + GSF P+ PP + A G G + Sbjct: 118 -GGAPSSSYGAPSFGGNAPSSSYGAPSAGGSFGGGAPSNSYGPPSSSYGAPSAGGSFGGS 176 Query: 358 AAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPAS 492 + G P+S A P+ + P+ P +P+S Sbjct: 177 SGGSFGGSFGGGAPSSSYGAPAPSRPSSNYGAPSRPSSNYGAPSS 221 [211][TOP] >UniRef100_UPI000056A77D collagen, type I, alpha 2 n=1 Tax=Danio rerio RepID=UPI000056A77D Length = 1352 Score = 54.3 bits (129), Expect = 5e-06 Identities = 50/170 (29%), Positives = 64/170 (37%), Gaps = 19/170 (11%) Frame = +1 Query: 22 GSHVPGSVVG--GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSR 195 G+ P G G+ GP APG +G G ++ P + + + G Sbjct: 597 
GARGPSGTPGPDGNKGEPGAVGPAGAPGPQGAAGMPGERGAAGTPGAKGEKGEAGYRGLE 656 Query: 196 GGAPPSVA-GGYGASGPTSATFSN----ESGSFQSLQPAPPQMPP----PTGPSPHLAHG 348 G A A G G SGP +N E+GSF PA P+ P +GP+ Sbjct: 657 GNAGKDGARGAPGPSGPPGPAGANGDKGETGSFGPPGPAGPRGAPGERGESGPAGPSGFA 716 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATG--------GGNGPAYPPGA 474 G A G G GPA A PA G G +GP PPGA Sbjct: 717 GPPGADGQTGPRGEKGPAGGKGDAGPAGPAGPAGNTGPLGPSGPVGPPGA 766 Score = 53.5 bits (127), Expect = 9e-06 Identities = 57/167 (34%), Positives = 65/167 (38%), Gaps = 13/167 (7%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSF-----SGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQ 174 P G P G GSF +GP APG SGP G PS A P Sbjct: 668 PGPSGPPGPAGANGDKGETGSFGPPGPAGPRGAPGERGESGPAG-------PSGFAG-PP 719 Query: 175 GA--HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPP----PTGPSPH 336 GA G RG P AGG G +GP + +G+ L P+ P PP +GP+ Sbjct: 720 GADGQTGPRGEKGP--AGGKGDAGPAGP--AGPAGNTGPLGPSGPVGPPGARGDSGPTGL 775 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 G G P G GPA L A G G GPA PPG Sbjct: 776 TGFPGAPGRVGPPGPAGIVGPAGLTGPAGKDGPRGPRGDVGPAGPPG 822 [212][TOP] >UniRef100_UPI00016E45BB UPI00016E45BB related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E45BB Length = 1632 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 756 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 815 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 816 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 866 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP Sbjct: 867 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 911 [213][TOP] >UniRef100_UPI00016E45BA UPI00016E45BA related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E45BA Length = 1724 Score = 54.3 bits (129), Expect = 
5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 838 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 897 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 898 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 948 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP Sbjct: 949 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 993 [214][TOP] >UniRef100_UPI00016E45B9 UPI00016E45B9 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E45B9 Length = 1732 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 843 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 902 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 903 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 953 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP Sbjct: 954 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 998 [215][TOP] >UniRef100_UPI00016E45B8 UPI00016E45B8 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E45B8 Length = 1743 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 867 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 926 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 927 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 977 
Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP Sbjct: 978 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 1022 [216][TOP] >UniRef100_UPI00016E4599 UPI00016E4599 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E4599 Length = 1789 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 890 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 949 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 950 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 1000 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP Sbjct: 1001 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 1045 [217][TOP] >UniRef100_UPI00016E4598 UPI00016E4598 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E4598 Length = 1803 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 904 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 963 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 964 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 1014 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP Sbjct: 1015 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 1059 [218][TOP] >UniRef100_UPI00016E4597 UPI00016E4597 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E4597 Length = 1813 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 
GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 914 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 973 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 974 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 1024 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP Sbjct: 1025 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 1069 [219][TOP] >UniRef100_UPI00016E4575 UPI00016E4575 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E4575 Length = 1763 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 864 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 923 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 924 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 974 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP Sbjct: 975 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 1019 [220][TOP] >UniRef100_UPI00016E4574 UPI00016E4574 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E4574 Length = 1686 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 787 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 846 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 847 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 897 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP 
Sbjct: 898 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 942 [221][TOP] >UniRef100_UPI00016E4573 UPI00016E4573 related cluster n=1 Tax=Takifugu rubripes RepID=UPI00016E4573 Length = 1815 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/166 (31%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Frame = +1 Query: 52 GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRG-------GAPP 210 G+S + SGPP G GP G P P H G RG PP Sbjct: 916 GTSGSDGPSGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLPGHPGQRGETGFQGKTGPP 975 Query: 211 SVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTG-------------PSPHLAHG 348 G G GPT T E G P PP P G P P Sbjct: 976 GPGGVVGPQGPTGETGPVGERG-----HPGPPGPPGEQGLPGSAGKEGAKGDPGPQ---- 1026 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSP 486 G + G P G G L AA PA GG GP PPG I SP Sbjct: 1027 GPSGKDGPPGLRGFPGERGLPGAAGPA-GLKGGEGPQGPPGPIGSP 1071 [222][TOP] >UniRef100_Q90YJ0 Procollagen type I alpha 2 chain n=1 Tax=Danio rerio RepID=Q90YJ0_DANRE Length = 1352 Score = 54.3 bits (129), Expect = 5e-06 Identities = 50/170 (29%), Positives = 64/170 (37%), Gaps = 19/170 (11%) Frame = +1 Query: 22 GSHVPGSVVG--GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSR 195 G+ P G G+ GP APG +G G ++ P + + + G Sbjct: 597 GARGPSGTPGPDGNKGEPGAVGPAGAPGPQGAAGMPGERGAAGTPEAKGEKGEAGYRGLE 656 Query: 196 GGAPPSVA-GGYGASGPTSATFSN----ESGSFQSLQPAPPQMPP----PTGPSPHLAHG 348 G A A G G SGP +N E+GSF PA P+ P +GP+ Sbjct: 657 GNAGKDGARGAPGPSGPPGPAGANGDKGETGSFGPPGPAGPRGAPGERGESGPAGPSGFA 716 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATG--------GGNGPAYPPGA 474 G A G G GPA A PA G G +GP PPGA Sbjct: 717 GPPGADGQTGPRGEKGPAGGKGDAGPAGPAGPAGNTGPLGPSGPVGPPGA 766 Score = 53.5 bits (127), Expect = 9e-06 Identities = 57/167 (34%), Positives = 65/167 (38%), Gaps = 13/167 (7%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSF-----SGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQ 174 P G P G GSF +GP APG SGP G PS A P Sbjct: 668 PGPSGPPGPAGANGDKGETGSFGPPGPAGPRGAPGERGESGPAG-------PSGFAG-PP 719 Query: 175 GA--HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPP----PTGPSPH 336 GA G 
RG P AGG G +GP + +G+ L P+ P PP +GP+ Sbjct: 720 GADGQTGPRGEKGP--AGGKGDAGPAGP--AGPAGNTGPLGPSGPVGPPGARGDSGPTGL 775 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 G G P G GPA L A G G GPA PPG Sbjct: 776 TGFPGAPGRVGPPGPAGIVGPAGLTGPAGKDGPRGPRGDVGPAGPPG 822 [223][TOP] >UniRef100_Q6IQX2 Collagen, type I, alpha 2 n=1 Tax=Danio rerio RepID=Q6IQX2_DANRE Length = 1352 Score = 54.3 bits (129), Expect = 5e-06 Identities = 50/170 (29%), Positives = 64/170 (37%), Gaps = 19/170 (11%) Frame = +1 Query: 22 GSHVPGSVVG--GSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSR 195 G+ P G G+ GP APG +G G ++ P + + + G Sbjct: 597 GARGPSGTPGPDGNKGEPGAVGPAGAPGPQGAAGMPGERGAAGTPGAKGEKGEAGYRGLE 656 Query: 196 GGAPPSVA-GGYGASGPTSATFSN----ESGSFQSLQPAPPQMPP----PTGPSPHLAHG 348 G A A G G SGP +N E+GSF PA P+ P +GP+ Sbjct: 657 GNAGKDGARGAPGPSGPPGPAGANGDKGETGSFGPPGPAGPRGAPGERGESGPAGPSGFA 716 Query: 349 GVTAAHGVPRHHGANGPASLNSAALPAYATG--------GGNGPAYPPGA 474 G A G G GPA A PA G G +GP PPGA Sbjct: 717 GPPGADGQTGPRGEKGPAGGKGDAGPAGPAGPAGNTGPLGPSGPVGPPGA 766 Score = 53.5 bits (127), Expect = 9e-06 Identities = 57/167 (34%), Positives = 65/167 (38%), Gaps = 13/167 (7%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAAGSF-----SGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQ 174 P G P G GSF +GP APG SGP G PS A P Sbjct: 668 PGPSGPPGPAGANGDKGETGSFGPPGPAGPRGAPGERGESGPAG-------PSGFAG-PP 719 Query: 175 GA--HLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPP----PTGPSPH 336 GA G RG P AGG G +GP + +G+ L P+ P PP +GP+ Sbjct: 720 GADGQTGPRGEKGP--AGGKGDAGPAGP--AGPAGNTGPLGPSGPVGPPGARGDSGPTGL 775 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 G G P G GPA L A G G GPA PPG Sbjct: 776 TGFPGAPGRVGPPGPAGIVGPAGLTGPAGKDGPRGPRGDVGPAGPPG 822 [224][TOP] >UniRef100_Q8K173 Col3a1 protein (Fragment) n=1 Tax=Mus musculus RepID=Q8K173_MOUSE Length = 1222 Score = 54.3 bits (129), Expect = 5e-06 Identities = 49/155 (31%), Positives = 58/155 (37%), Gaps = 10/155 (6%) Frame = +1 Query: 31 VPGSVVGGSSAAGSFSGPPY------APGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 +PG+ 
GG G P APG G G G P P +A +P G+ Sbjct: 400 IPGT--GGPPGENGKPGEPGPKGEVGAPGAPGGKGDSGAPGER-GPPGTAGIP-----GA 451 Query: 193 RGGA-PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 RGGA PP GG G +GP ++ S Q + P P GP G A G Sbjct: 452 RGGAGPPGPEGGKGPAGPPGPPGASGSPGLQGM-PGERGGPGSPGPKGEKGEPGGAGADG 510 Query: 370 VPRHHGANGPASLNSAALPAYATGG---GNGPAYP 465 VP G GPA PA G G P P Sbjct: 511 VPGKDGPRGPAGPIGPPGPAGQPGDKGEGGSPGLP 545 [225][TOP] >UniRef100_Q8BLW4 Putative uncharacterized protein n=1 Tax=Mus musculus RepID=Q8BLW4_MOUSE Length = 1464 Score = 54.3 bits (129), Expect = 5e-06 Identities = 49/155 (31%), Positives = 58/155 (37%), Gaps = 10/155 (6%) Frame = +1 Query: 31 VPGSVVGGSSAAGSFSGPPY------APGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 +PG+ GG G P APG G G G P P +A +P G+ Sbjct: 642 IPGT--GGPPGENGKPGEPGPKGEVGAPGAPGGKGDSGAPGER-GPPGTAGIP-----GA 693 Query: 193 RGGA-PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 RGGA PP GG G +GP ++ S Q + P P GP G A G Sbjct: 694 RGGAGPPGPEGGKGPAGPPGPPGASGSPGLQGM-PGERGGPGSPGPKGEKGEPGGAGADG 752 Query: 370 VPRHHGANGPASLNSAALPAYATGG---GNGPAYP 465 VP G GPA PA G G P P Sbjct: 753 VPGKDGPRGPAGPIGPPGPAGQPGDKGEGGSPGLP 787 [226][TOP] >UniRef100_Q7TT32 Collagen, type III, alpha 1 n=1 Tax=Mus musculus RepID=Q7TT32_MOUSE Length = 1464 Score = 54.3 bits (129), Expect = 5e-06 Identities = 49/155 (31%), Positives = 58/155 (37%), Gaps = 10/155 (6%) Frame = +1 Query: 31 VPGSVVGGSSAAGSFSGPPY------APGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 +PG+ GG G P APG G G G P P +A +P G+ Sbjct: 642 IPGT--GGPPGENGKPGEPGPKGEVGAPGAPGGKGDSGAPGER-GPPGTAGIP-----GA 693 Query: 193 RGGA-PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 RGGA PP GG G +GP ++ S Q + P P GP G A G Sbjct: 694 RGGAGPPGPEGGKGPAGPPGPPGASGSPGLQGM-PGERGGPGSPGPKGEKGEPGGAGADG 752 Query: 370 VPRHHGANGPASLNSAALPAYATGG---GNGPAYP 465 VP G GPA PA G G P P Sbjct: 753 VPGKDGPRGPAGPIGPPGPAGQPGDKGEGGSPGLP 787 [227][TOP] >UniRef100_P08121 Collagen alpha-1(III) chain n=3 Tax=Mus musculus RepID=CO3A1_MOUSE Length = 1464 
Score = 54.3 bits (129), Expect = 5e-06 Identities = 49/155 (31%), Positives = 58/155 (37%), Gaps = 10/155 (6%) Frame = +1 Query: 31 VPGSVVGGSSAAGSFSGPPY------APGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 +PG+ GG G P APG G G G P P +A +P G+ Sbjct: 642 IPGT--GGPPGENGKPGEPGPKGEVGAPGAPGGKGDSGAPGER-GPPGTAGIP-----GA 693 Query: 193 RGGA-PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 RGGA PP GG G +GP ++ S Q + P P GP G A G Sbjct: 694 RGGAGPPGPEGGKGPAGPPGPPGASGSPGLQGM-PGERGGPGSPGPKGEKGEPGGAGADG 752 Query: 370 VPRHHGANGPASLNSAALPAYATGG---GNGPAYP 465 VP G GPA PA G G P P Sbjct: 753 VPGKDGPRGPAGPIGPPGPAGQPGDKGEGGSPGLP 787 [228][TOP] >UniRef100_Q3UH72 Putative uncharacterized protein n=1 Tax=Mus musculus RepID=Q3UH72_MOUSE Length = 1464 Score = 54.3 bits (129), Expect = 5e-06 Identities = 49/155 (31%), Positives = 58/155 (37%), Gaps = 10/155 (6%) Frame = +1 Query: 31 VPGSVVGGSSAAGSFSGPPY------APGVYAGSGPGGHPASSYAPSSSASLPQGAHLGS 192 +PG+ GG G P APG G G G P P +A +P G+ Sbjct: 642 IPGT--GGPPGENGKPGEPGPKGEVGAPGAPGGKGDSGAPGER-GPPGTAGIP-----GA 693 Query: 193 RGGA-PPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHG 369 RGGA PP GG G +GP ++ S Q + P P GP G A G Sbjct: 694 RGGAGPPGPEGGKGPAGPPGPPGASGSPGLQGM-PGERGGPGSPGPKGEKGEPGGAGADG 752 Query: 370 VPRHHGANGPASLNSAALPAYATGG---GNGPAYP 465 VP G GPA PA G G P P Sbjct: 753 VPGKDGPRGPAGPIGPPGPAGQPGDKGEGGSPGLP 787 [229][TOP] >UniRef100_Q9L252 Putative uncharacterized protein SCO2669 n=1 Tax=Streptomyces coelicolor RepID=Q9L252_STRCO Length = 604 Score = 54.3 bits (129), Expect = 5e-06 Identities = 53/162 (32%), Positives = 63/162 (38%), Gaps = 10/162 (6%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGP--PYAPGVYAGSGPGGHPASSYAPSSSASLP----QG 177 S G PG GG G F P P PG + G G P S P+ + G Sbjct: 195 SGGPGAPGGP-GGPGGPGGFGSPDGPNRPGGFGGPGSPDGPGGSGGPNGAGGFGGPGGPG 253 Query: 178 AHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSL-QPAPPQMP-PPTGPSPHLAHGG 351 G G P+ AGG+G GP S SG F P P P P GP + GG Sbjct: 254 GPNGPGGPGGPNGAGGFG--GPGGPGGSGGSGGFGGPGGPGGPSGPNSPGGPGGYNGPGG 311 
Query: 352 VTAAHGVPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 +G P + G GP N P +G G +GP PPG Sbjct: 312 PGGPNG-PNNPG--GPGGYNGPGGPGGPSGPNGPSGPPAPPG 350 [230][TOP] >UniRef100_C3AC52 Collagen triple helix repeat domain protein n=1 Tax=Bacillus mycoides DSM 2048 RepID=C3AC52_BACMY Length = 922 Score = 54.3 bits (129), Expect = 5e-06 Identities = 45/154 (29%), Positives = 58/154 (37%), Gaps = 13/154 (8%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 P + G + G GPP G +GPGG P+ S + + + G G A Sbjct: 192 PTGITGPTGITGPSGGPPGPTGATGATGPGGGPSGSTGATGAT-----GNTGVTGSA--G 244 Query: 214 VAGGYGASGPTSATFSNESGSFQSLQ-------PAPPQ----MPPPTGPSPHLAHGGVTA 360 V G G SG T T + Q +Q P PQ +P PTG + GV Sbjct: 245 VTGNTGPSGSTGETGAQGLQGIQGVQGPIGPTGPEGPQGIQGIPGPTGVTGEQGIQGVQG 304 Query: 361 AHGVPRHHGANGPASLNSAALPAYATG--GGNGP 456 G+ G GP + A P TG G GP Sbjct: 305 IQGITGATGDQGPQGIQGAIGPQGITGATGDQGP 338 [231][TOP] >UniRef100_Q9LKA4 AT3G15010 protein n=1 Tax=Arabidopsis thaliana RepID=Q9LKA4_ARATH Length = 404 Score = 54.3 bits (129), Expect = 5e-06 Identities = 49/157 (31%), Positives = 66/157 (42%), Gaps = 5/157 (3%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSF--SGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQ 174 AQ S HV G +G AG + +G A G Y+G P H S+++ S Sbjct: 256 AQDGGSGHGHVHGEGMGMVRPAGPYGAAGGISAYGGYSGGPPAHHMNSTHSSMGVGSAGY 315 Query: 175 GAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGV 354 G H G GG P G YG G SG ++ PP P G P H G+ Sbjct: 316 GGHYGGYGG--PGGTGVYGGLGGGYGGPGTGSGQYR----MPPSSMPGGGGYPESGHYGL 369 Query: 355 TAAHGVP-RHHGANGPASLNSAALPAYATGG--GNGP 456 +++ G P +HH A G ++ +P GG NGP Sbjct: 370 SSSAGYPGQHHQAVG-----TSPVPRVPHGGMYPNGP 401 [232][TOP] >UniRef100_A8IZJ6 RWP-RK transcription factor n=1 Tax=Chlamydomonas reinhardtii RepID=A8IZJ6_CHLRE Length = 1428 Score = 54.3 bits (129), Expect = 5e-06 Identities = 45/157 (28%), Positives = 57/157 (36%), Gaps = 7/157 (4%) Frame = +1 Query: 22 GSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPAS-SYAPS-SSASLPQGAHLGSR 195 G G V GG G + GPP + GV 
GSGP G P S P +P A Sbjct: 540 GQQRGGGVRGGMPGDGGWIGPP-SGGVAGGSGPLGRPHSPDLGPHMGGGGMPLQALQSGG 598 Query: 196 GGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVP 375 G P+ +GGYG G G P P +G H ++G ++G Sbjct: 599 SGYGPAHSGGYGGPGGGGGDMGAGPG------PGPGHYNDMSGRGHHDSYGSAPGSYGPN 652 Query: 376 R-----HHGANGPASLNSAALPAYATGGGNGPAYPPG 471 + G G + Y GGG G Y PG Sbjct: 653 SASGGGYGGPGGGGGGQGGGMGGYGGGGGRGGGYGPG 689 [233][TOP] >UniRef100_B9QKW0 HECT domain-containing protein n=1 Tax=Toxoplasma gondii VEG RepID=B9QKW0_TOXGO Length = 11061 Score = 54.3 bits (129), Expect = 5e-06 Identities = 58/190 (30%), Positives = 78/190 (41%), Gaps = 13/190 (6%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAH 183 + PP +H PG+ + G+ G+ SGP +PG + SSSASLPQ Sbjct: 5198 RDPPRPSNH-PGTPLAGAGTGGA-SGPSVSPGF---------ASVPLLASSSASLPQNPE 5246 Query: 184 LGSRG----GAPPSVA-------GGYGASGPTSATFSNESGSFQSLQPAPPQMPPP--TG 324 L + G+ PS + GG G+ G +F S F QP P MP P +G Sbjct: 5247 LSASPNQLEGSVPSPSQRLQFRRGGLGSDG-WDGSFDASSTPFLRAQPVPTAMPMPALSG 5305 Query: 325 PSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATF 504 P+ + AA +PR +S +SA P GPA P IVSP T Sbjct: 5306 PASRPPVSSLPAAMSLPRGPAPPSGSSRDSALPPI-------GPA--PVQIVSPPLTPAL 5356 Query: 505 NRLSPAAAAA 534 P + A Sbjct: 5357 PLAGPVSGLA 5366 [234][TOP] >UniRef100_B6KP87 HECT-domain (Ubiquitin-transferase) containing protein n=1 Tax=Toxoplasma gondii ME49 RepID=B6KP87_TOXGO Length = 10999 Score = 54.3 bits (129), Expect = 5e-06 Identities = 58/190 (30%), Positives = 78/190 (41%), Gaps = 13/190 (6%) Frame = +1 Query: 4 QQPPSYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAH 183 + PP +H PG+ + G+ G+ SGP +PG + SSSASLPQ Sbjct: 5198 RDPPRPSNH-PGTPLAGAGTGGA-SGPSVSPGF---------ASVPLLASSSASLPQNPE 5246 Query: 184 LGSRG----GAPPSVA-------GGYGASGPTSATFSNESGSFQSLQPAPPQMPPP--TG 324 L + G+ PS + GG G+ G +F S F QP P MP P +G Sbjct: 5247 LSASPNQLEGSVPSPSQRLQFRRGGLGSDG-WDGSFDASSTPFLRAQPVPTAMPMPALSG 5305 Query: 325 
PSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATF 504 P+ + AA +PR +S +SA P GPA P IVSP T Sbjct: 5306 PASRPPVSSLPAAMSLPRGPAPPSGSSRDSALPPI-------GPA--PVQIVSPPLTPAL 5356 Query: 505 NRLSPAAAAA 534 P + A Sbjct: 5357 PLAGPVSGLA 5366 [235][TOP] >UniRef100_B4Q0N7 GE17489 n=1 Tax=Drosophila yakuba RepID=B4Q0N7_DROYA Length = 2036 Score = 54.3 bits (129), Expect = 5e-06 Identities = 57/177 (32%), Positives = 75/177 (42%), Gaps = 10/177 (5%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGP-PYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAP- 207 P + SS AG+ +G + V +G G G S+ A S+ S QGA G+ GG+ Sbjct: 162 PATPKSSSSGAGATTGSGTSSAAVTSGPGSGSTKVSAAASSAQQSGLQGA-TGAGGGSSS 220 Query: 208 -PSVAGGYGASGPTSA-TFSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRH 381 P G GA G T+A S G+ S P +PP + PH G T Sbjct: 221 TPGTQPGSGAGGATAARPVSAMGGTVSSTAGGAPSIPPISTMPPHTVPGS-TNTTTTAMA 279 Query: 382 HGANGP----ASLNSAALPAYATGGGNGPAYP--PGAIVSPASTATFNRLSPAAAAA 534 GA GP A+ N+ AL A G AYP PG +S+ + AAAA Sbjct: 280 GGAGGPGAAAANRNAEALMASLLNTGQTGAYPGAPGQTAVNSSSLLDGSTAAVAAAA 336 [236][TOP] >UniRef100_B4NI92 GK13553 n=1 Tax=Drosophila willistoni RepID=B4NI92_DROWI Length = 779 Score = 54.3 bits (129), Expect = 5e-06 Identities = 55/189 (29%), Positives = 81/189 (42%), Gaps = 13/189 (6%) Frame = +1 Query: 1 AQQPPSYGSHVPGSVVGGSSAAGSFSGP-PYAPGV-YAGSGPGGHPASSY---APSSSAS 165 A + SY + P S S+ + GP P AP Y+ P + SY APSSS S Sbjct: 385 ANRGGSYPAASPSSSYSAPSSGSNNGGPYPSAPSSSYSAPSPSANAGGSYPAAAPSSSYS 444 Query: 166 LPQGAHLGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPT-----GPS 330 P L S G P A S P+ + +N GS+ + P+ P + GP Sbjct: 445 APS---LDSSSGGPYRSAPSSSYSAPSPS--ANVGGSYPAATPSSSYSAPSSDSSRGGPY 499 Query: 331 PHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYATGGGNG---PAYPPGAIVSPASTAT 501 P A +A + G + PA+ S++ A ++G NG P+ P + +P+ +A Sbjct: 500 PS-APSSSYSAPSPSANRGGSYPAASPSSSYSAPSSGSNNGGPYPSAPSSSYSAPSPSAN 558 Query: 502 FNRLSPAAA 528 PAAA Sbjct: 559 VGGSYPAAA 567 [237][TOP] >UniRef100_C5DNK9 KLTH0G17886p n=1 Tax=Lachancea thermotolerans CBS 6340 RepID=C5DNK9_LACTC Length = 804 
Score = 54.3 bits (129), Expect = 5e-06 Identities = 58/178 (32%), Positives = 71/178 (39%), Gaps = 3/178 (1%) Frame = +1 Query: 10 PPSYGSHVPGSVVGGSSAA--GSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAH 183 PP+ +P S G+ A S PP P +A + P PASS APS S P A Sbjct: 205 PPASAPPLPSSNAPGTPAPLLPQSSAPPAPPVPFAAAPPA--PASS-APSVPKSSPSSAP 261 Query: 184 LGSRGGAPPSVAGGYGASGPTSATFSNESGSFQSLQPAPPQMPPPTGPS-PHLAHGGVTA 360 + P V G +S P + PAPP P P PS P L G Sbjct: 262 PAPPAPSAPPVPGLPKSSAPPAPPAP-------PAPPAPPAPPAPPVPSAPALPKSGAPP 314 Query: 361 AHGVPRHHGANGPASLNSAALPAYATGGGNGPAYPPGAIVSPASTATFNRLSPAAAAA 534 A VP P S A PA + P PP PAS+A R +P+AA+A Sbjct: 315 APPVPSAPAL--PKSGAPPAPPAPTLPKSSVPPAPPAPPALPASSAAPQRRAPSAASA 370 [238][TOP] >UniRef100_UPI0000F2C218 PREDICTED: similar to collagen, type XI, alpha 1, isoform 3 n=1 Tax=Monodelphis domestica RepID=UPI0000F2C218 Length = 1768 Score = 53.9 bits (128), Expect = 7e-06 Identities = 56/189 (29%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGG-----HPASSYAPSSSASLPQGA 180 S G+ P G +GS GPP PG GP G P P LP Sbjct: 852 SRGARGPTGKPGPKGTSGS-DGPPGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLP--G 908 Query: 181 HLGSRG-------GAPPSVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTGPSPH 336 H G RG PP G G GPT T E G P PP P G Sbjct: 909 HPGQRGETGFQGKTGPPGPGGVVGPQGPTGETGPIGERG-----HPGPPGPPGEQGLPGA 963 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNS--------AALPAYATGGGNGPAYPPGAIVSPAS 492 G G G +GPA L A A GG GP PPG + SP Sbjct: 964 AGKEGAKGDPGPQGVSGKDGPAGLRGFPGERGLPGAQGAPGLKGGEGPQGPPGPLGSPGE 1023 Query: 493 TATFNRLSP 519 + P Sbjct: 1024 RGSAGTAGP 1032 [239][TOP] >UniRef100_UPI0000F2C1FC PREDICTED: similar to collagen, type XI, alpha 1, isoform 2 n=1 Tax=Monodelphis domestica RepID=UPI0000F2C1FC Length = 1819 Score = 53.9 bits (128), Expect = 7e-06 Identities = 56/189 (29%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGG-----HPASSYAPSSSASLPQGA 180 S G+ P G +GS GPP PG GP G P P LP Sbjct: 903 
SRGARGPTGKPGPKGTSGS-DGPPGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLP--G 959 Query: 181 HLGSRG-------GAPPSVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTGPSPH 336 H G RG PP G G GPT T E G P PP P G Sbjct: 960 HPGQRGETGFQGKTGPPGPGGVVGPQGPTGETGPIGERG-----HPGPPGPPGEQGLPGA 1014 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNS--------AALPAYATGGGNGPAYPPGAIVSPAS 492 G G G +GPA L A A GG GP PPG + SP Sbjct: 1015 AGKEGAKGDPGPQGVSGKDGPAGLRGFPGERGLPGAQGAPGLKGGEGPQGPPGPLGSPGE 1074 Query: 493 TATFNRLSP 519 + P Sbjct: 1075 RGSAGTAGP 1083 [240][TOP] >UniRef100_UPI0000E215D9 PREDICTED: similar to prepro-alpha2(I) collagen isoform 1 n=1 Tax=Pan troglodytes RepID=UPI0000E215D9 Length = 1039 Score = 53.9 bits (128), Expect = 7e-06 Identities = 51/156 (32%), Positives = 58/156 (37%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG AG SGP PG +G G P + G+RG AP Sbjct: 624 PG-VVGAVGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 678 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 679 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 736 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G GP PPG Sbjct: 737 AKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPG 772 [241][TOP] >UniRef100_UPI0000E215D8 PREDICTED: similar to alpha2(I) collagen isoform 5 n=1 Tax=Pan troglodytes RepID=UPI0000E215D8 Length = 1201 Score = 53.9 bits (128), Expect = 7e-06 Identities = 51/156 (32%), Positives = 58/156 (37%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG AG SGP PG +G G P + G+RG AP Sbjct: 624 PG-VVGAVGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 678 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 679 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 736 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G GP PPG Sbjct: 737 
AKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPG 772 [242][TOP] >UniRef100_UPI0000E215D7 PREDICTED: alpha 2 type I collagen isoform 7 n=1 Tax=Pan troglodytes RepID=UPI0000E215D7 Length = 1300 Score = 53.9 bits (128), Expect = 7e-06 Identities = 51/156 (32%), Positives = 58/156 (37%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG AG SGP PG +G G P + G+RG AP Sbjct: 558 PG-VVGAVGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 612 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 613 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 670 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G GP PPG Sbjct: 671 AKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPG 706 [243][TOP] >UniRef100_UPI0000E215D6 PREDICTED: similar to prepro-alpha2(I) collagen isoform 2 n=1 Tax=Pan troglodytes RepID=UPI0000E215D6 Length = 1249 Score = 53.9 bits (128), Expect = 7e-06 Identities = 51/156 (32%), Positives = 58/156 (37%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG AG SGP PG +G G P + G+RG AP Sbjct: 624 PG-VVGAVGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 678 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 679 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 736 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G GP PPG Sbjct: 737 AKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPG 772 [244][TOP] >UniRef100_UPI0000E215D5 PREDICTED: alpha 2 type I collagen isoform 4 n=1 Tax=Pan troglodytes RepID=UPI0000E215D5 Length = 1363 Score = 53.9 bits (128), Expect = 7e-06 Identities = 51/156 (32%), Positives = 58/156 (37%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG AG SGP PG +G G P + G+RG AP Sbjct: 621 
PG-VVGAVGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 675 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 676 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 733 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G GP PPG Sbjct: 734 AKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPG 769 [245][TOP] >UniRef100_UPI0000E215D4 PREDICTED: similar to alpha2(I) collagen isoform 8 n=1 Tax=Pan troglodytes RepID=UPI0000E215D4 Length = 1312 Score = 53.9 bits (128), Expect = 7e-06 Identities = 51/156 (32%), Positives = 58/156 (37%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG AG SGP PG +G G P + G+RG AP Sbjct: 624 PG-VVGAVGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 678 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 679 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 736 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G GP PPG Sbjct: 737 AKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPG 772 [246][TOP] >UniRef100_UPI0000E215D3 PREDICTED: alpha 2 type I collagen isoform 3 n=1 Tax=Pan troglodytes RepID=UPI0000E215D3 Length = 1365 Score = 53.9 bits (128), Expect = 7e-06 Identities = 51/156 (32%), Positives = 58/156 (37%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG AG SGP PG +G G P + G+RG AP Sbjct: 624 PG-VVGAVGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 678 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 679 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 736 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G GP PPG Sbjct: 737 AKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPG 772 [247][TOP] >UniRef100_UPI0000E215D2 PREDICTED: alpha 2 type I collagen isoform 10 n=1 Tax=Pan 
troglodytes RepID=UPI0000E215D2 Length = 1366 Score = 53.9 bits (128), Expect = 7e-06 Identities = 51/156 (32%), Positives = 58/156 (37%), Gaps = 10/156 (6%) Frame = +1 Query: 34 PGSVVGGSSAAGSFSGPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPS 213 PG VVG AG SGP PG +G G P + G+RG AP Sbjct: 624 PG-VVGAVGTAGP-SGPSGLPGERGAAGIPGGKGEKGEPGLRGEIGNPGRDGARG-AP-- 678 Query: 214 VAGGYGASGPTSATFSN-ESGSFQSLQPAPPQMPP-------PTGPSPHLAHGGVTAAHG 369 G GA GP AT E+G+ PA P+ P P GP+ G G Sbjct: 679 --GAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPG 736 Query: 370 VPRHHGANGPASLNSAALPAYATG--GGNGPAYPPG 471 GA GP N P G G GP PPG Sbjct: 737 AKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPG 772 [248][TOP] >UniRef100_UPI00005E7048 PREDICTED: similar to collagen, type XI, alpha 1, isoform 1 n=1 Tax=Monodelphis domestica RepID=UPI00005E7048 Length = 1807 Score = 53.9 bits (128), Expect = 7e-06 Identities = 56/189 (29%), Positives = 61/189 (32%), Gaps = 21/189 (11%) Frame = +1 Query: 16 SYGSHVPGSVVGGSSAAGSFSGPPYAPGVYAGSGPGG-----HPASSYAPSSSASLPQGA 180 S G+ P G +GS GPP PG GP G P P LP Sbjct: 891 SRGARGPTGKPGPKGTSGS-DGPPGPPGERGPQGPQGPVGFPGPKGPPGPPGKDGLP--G 947 Query: 181 HLGSRG-------GAPPSVAGGYGASGPTSATFS-NESGSFQSLQPAPPQMPPPTGPSPH 336 H G RG PP G G GPT T E G P PP P G Sbjct: 948 HPGQRGETGFQGKTGPPGPGGVVGPQGPTGETGPIGERG-----HPGPPGPPGEQGLPGA 1002 Query: 337 LAHGGVTAAHGVPRHHGANGPASLNS--------AALPAYATGGGNGPAYPPGAIVSPAS 492 G G G +GPA L A A GG GP PPG + SP Sbjct: 1003 AGKEGAKGDPGPQGVSGKDGPAGLRGFPGERGLPGAQGAPGLKGGEGPQGPPGPLGSPGE 1062 Query: 493 TATFNRLSP 519 + P Sbjct: 1063 RGSAGTAGP 1071 [249][TOP] >UniRef100_UPI0000121787 Hypothetical protein CBG05354 n=1 Tax=Caenorhabditis briggsae AF16 RepID=UPI0000121787 Length = 299 Score = 53.9 bits (128), Expect = 7e-06 Identities = 41/132 (31%), Positives = 47/132 (35%) Frame = +1 Query: 79 GPPYAPGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVAGGYGASGPTSATF 258 GPP PG G G P S P A+ + G G PP G G G Sbjct: 151 GPPGPPGPPGPPGDSGEPGSPGLPGQDAAPGEPGPKGPPG--PPGAPGAPGTPGEPGVPA 208 Query: 259 
SNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHGANGPASLNSAALPAYAT 438 +E +P PP P P GP G + G P G NGP A Sbjct: 209 QSEP--LIPGEPGPPGEPGPQGPPGPPGQPGADGSPGQPGPKGPNGPDGQPGAD----GN 262 Query: 439 GGGNGPAYPPGA 474 G GPA PPG+ Sbjct: 263 PGAPGPAGPPGS 274 [250][TOP] >UniRef100_UPI00017B24B3 UPI00017B24B3 related cluster n=1 Tax=Tetraodon nigroviridis RepID=UPI00017B24B3 Length = 973 Score = 53.9 bits (128), Expect = 7e-06 Identities = 50/150 (33%), Positives = 59/150 (39%), Gaps = 10/150 (6%) Frame = +1 Query: 52 GSSAAGSFSGPPYA---PGVYAGSGPGGHPASSYAPSSSASLPQGAHLGSRGGAPPSVA- 219 GS F+GPP A PG+ G G + AP PQG G+ G A P+ Sbjct: 764 GSPGPAGFAGPPGADGQPGIKGEQGETGQKGDAGAPG-----PQGPS-GAPGPAGPTGVF 817 Query: 220 ---GGYGASGPTSAT-FSNESGSFQSLQPAPPQMPPPTGPSPHLAHGGVTAAHGVPRHHG 387 G GA GP AT F +G P P P P GP+ G G G Sbjct: 818 GPKGARGAQGPPGATGFPGAAGRVGP--PGPNGNPGPAGPAGSPGKDGPKGIRGDAGPPG 875 Query: 388 ANGPASLNSAALPAYATG--GGNGPAYPPG 471 G A L A P+ G G +GP PPG Sbjct: 876 RQGDAGLRGPAGPSGEKGDAGEDGPVGPPG 905