[1][TOP] >UniRef100_B6B8H0 Putative CheA Signal Transduction Histidine Kinase n=1 Tax=Rhodobacterales bacterium Y4I RepID=B6B8H0_9RHOB Length = 326 Score = 62.4 bits (150), Expect = 2e-08 Identities = 57/177 (32%), Positives = 76/177 (42%), Gaps = 15/177 (8%) Frame = +1 Query: 1 VKEATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINTKV 180 ++ A A E AP K+D VKA+TAA A T+ +P A ++++R+ A Sbjct: 58 LRAAPAAVETAPAAKAD--VKADTAAAPEADTAVKADPGKAPADKTARKAARKAATRKPA 115 Query: 181 VEAYLAEIRSVLAACKATDALKAEAEKAAAKADKRANK----AKPPTSPVTP-------- 324 A + A K A A A+ A A DK A K A P P Sbjct: 116 ARKAPARAAAANTAAKPEQAKPAAAQPAKAAEDKPAAKQGGAAGGPGMKEAPAMFRASPA 175 Query: 325 ---VSEDSGDAAKDGEATKAAAAAPAPAADAKLTAALARSSEAAAAHAESLTSSLSA 486 E S A + A AAA APAPAA + AA A++ EA AA E+ T + A Sbjct: 176 RKSAPEQSAKAEEKPAAKPAAAPAPAPAAAPQAKAAEAKAPEAKAAAPEAKTPAAKA 232 [2][TOP] >UniRef100_Q4E065 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4E065_TRYCR Length = 507 Score = 58.5 bits (140), Expect = 2e-07 Identities = 51/163 (31%), Positives = 71/163 (43%), Gaps = 3/163 (1%) Frame = +1 Query: 7 EATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINTKVVE 186 EA T A ++ AETA + +A + E+++ K E Sbjct: 220 EAAAKTAAAEAAAAEAKTSAETAKMATANAATAAAKAETETEKAAAAAAKEATTKAKAAE 279 Query: 187 AYLAEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPVTPVSEDSGDAAKDGEA 366 A E A KA K AE+A+AKA + A KAK +G AA+ EA Sbjct: 280 AAKDE------AAKAAATAKTAAEEASAKAAEAAAKAKAAAEAAETAKASAGKAAE--EA 331 Query: 367 TKAAAAAPAPAADAKL---TAALARSSEAAAAHAESLTSSLSA 486 KAAA A A AA+A T+A ++E AAA A++ T +A Sbjct: 332 AKAAAEAAATAAEAAAEAKTSAETANTETAAAKAKAETEKAAA 374 [3][TOP] >UniRef100_B5Q0R8 Prophage tail fibre N-family protein n=2 Tax=Salmonella enterica subsp. 
enterica serovar Hadar RepID=B5Q0R8_SALHA Length = 812 Score = 58.2 bits (139), Expect = 3e-07 Identities = 48/154 (31%), Positives = 70/154 (45%), Gaps = 1/154 (0%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALFAR 282 SA AA SEE A+ + A + A AAA+A A S A + S T E + Sbjct: 200 SAAAAKTSEENTDASRTAAGDSAAAAAASATAAQTSAARAGASETAAKTSETQAASSAGD 259 Query: 281 LSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP-*ACNGSS 105 A A A +AS +A+ + A+T +A +++T AT S + S A + +S Sbjct: 260 AGASATAAAASEKAATASAAEAKTSETNAATSASTSAASATAASSSASEASTHAAASDTS 319 Query: 104 SPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3 + L TAA +A T + D A +A V SL Sbjct: 320 ASLAAQSSTAAGAAATRAEDAAKRAEDIADVISL 353 [4][TOP] >UniRef100_Q57QV3 Gifsy-2 prophage probable tail fiber protein n=1 Tax=Salmonella enterica RepID=Q57QV3_SALCH Length = 812 Score = 57.4 bits (137), Expect = 5e-07 Identities = 48/154 (31%), Positives = 70/154 (45%), Gaps = 1/154 (0%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALFAR 282 SA AA SE A A+ + A + A AAA+A A S + S T E + Sbjct: 200 SAAAAKTSEANADASRTAAGDSAAAAAASATAAQTSAERAGASETAAKTSETQAASSAGD 259 Query: 281 LSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*AC-NGSS 105 A A A +AS +A+ + AA+T +A +++T AT S + S A + +S Sbjct: 260 AGASATAAAASEKAAAASAAAAKTSETNAATSASTAAASATAASSSASEASTHAAASDTS 319 Query: 104 SPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3 + L TAA +A T + D A +A V SL Sbjct: 320 ASLAAQSSTAAGAAATRAEDAAKRAEDIADVISL 353 [5][TOP] >UniRef100_Q57Q69 Side tail fiber protein n=1 Tax=Salmonella enterica RepID=Q57Q69_SALCH Length = 892 Score = 57.4 bits (137), Expect = 5e-07 Identities = 48/154 (31%), Positives = 70/154 (45%), Gaps = 1/154 (0%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALFAR 282 SA AA SE A A+ + A + A AAA+A A S + S T E + Sbjct: 200 SAAAAKTSEANADASRTAAGDSAAAAAASATAAQTSAERAGASETAAKTSETQAASSAGD 259 Query: 281 LSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*AC-NGSS 105 A A A +AS +A+ + AA+T +A +++T AT S 
+ S A + +S Sbjct: 260 AGASATAAAASEKAAAASAAAAKTSETNAATSASTAAASATAASSSASEASTHAAASDTS 319 Query: 104 SPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3 + L TAA +A T + D A +A V SL Sbjct: 320 ASLAAQSSTAAGAAATRAEDAAKRAEDIADVISL 353 [6][TOP] >UniRef100_C9XCW0 Prophage side tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Typhimurium str. D23580 RepID=C9XCW0_SALTY Length = 790 Score = 57.4 bits (137), Expect = 5e-07 Identities = 48/154 (31%), Positives = 70/154 (45%), Gaps = 1/154 (0%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALFAR 282 SA AA SE A A+ + A + A AAA+A A S + S T E + Sbjct: 200 SAAAAKTSEANADASRTAAGDSAAAAAASATAAQTSAERAGASETAAKTSETQAASSAGD 259 Query: 281 LSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*AC-NGSS 105 A A A +AS +A+ + AA+T +A +++T AT S + S A + +S Sbjct: 260 AGASATAAAASEKAAAASAAAAKTSETNAATSASTAAASATAASSSASEASTHAAASDTS 319 Query: 104 SPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3 + L TAA +A T + D A +A V SL Sbjct: 320 ASLAAQSSTAAGAAATRAEDAAKRAEDIADVISL 353 [7][TOP] >UniRef100_B5N398 Side tail fiber protein n=3 Tax=Salmonella enterica subsp. 
enterica RepID=B5N398_SALET Length = 812 Score = 57.4 bits (137), Expect = 5e-07 Identities = 48/154 (31%), Positives = 70/154 (45%), Gaps = 1/154 (0%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALFAR 282 SA AA SE A A+ + A + A AAA+A A S + S T E + Sbjct: 200 SAAAAKTSEANADASRTAAGDSAAAAAASATAAQTSAERAGASETAAKTSETQAASSAGD 259 Query: 281 LSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*AC-NGSS 105 A A A +AS +A+ + AA+T +A +++T AT S + S A + +S Sbjct: 260 AGASATAAAASEKAAAASAAAAKTSETNAATSASTAAASATAASSSASEASTHAAASDTS 319 Query: 104 SPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3 + L TAA +A T + D A +A V SL Sbjct: 320 ASLAAQSSTAAGAAATRAEDAAKRAEDIADVISL 353 [8][TOP] >UniRef100_Q4D4B2 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4D4B2_TRYCR Length = 437 Score = 56.6 bits (135), Expect = 8e-07 Identities = 58/177 (32%), Positives = 83/177 (46%), Gaps = 23/177 (12%) Frame = +1 Query: 7 EATDATEDAPEVKSDDDVKAETAAVTSAT-----TSGDDEPLHAHGERSSRETPSHVAIN 171 +AT E A + K+ + A+ AA +A T+ + A ++ E + VA Sbjct: 152 QATATAEAATKAKAAAEKAAKEAATAAAAEAVTATAAAEAAAEAVTATAAAEAAAAVAA- 210 Query: 172 TKVVEAYLAEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPV-TPVSEDSGDA 348 EA A ++ AA A +A K EAEKA +A+K A +AK + T + + +A Sbjct: 211 AATQEAAKATTKAAEAAKAAAEAAKEEAEKAKKEAEKAAEEAKAKAATAATAAAAAAAEA 270 Query: 349 AKDGEATK---------AAAAAPAPAADAKLTAALARSSEA--------AAAHAESL 468 AK EA K AAA A A A+AK A A+++EA AAA AESL Sbjct: 271 AKATEAAKEEAEKAAETAAAEAEAAEAEAKAAAEAAKAAEAKAKEAAEKAAAAAESL 327 [9][TOP] >UniRef100_B8ZMV0 Cell wall surface anchored protein n=1 Tax=Streptococcus pneumoniae ATCC 700669 RepID=B8ZMV0_STRPJ Length = 4433 Score = 56.6 bits (135), Expect = 8e-07 Identities = 45/158 (28%), Positives = 82/158 (51%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+A+ S S +AS +SE+ T Sbjct: 523 SESASTSASASASTSASASASTS-ASASASTSASASASTSASASASTSASESASTSASAS 581 Query: 299 
FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A ++++ SAS + A+ + SA +++T + S + + Sbjct: 582 ASTSASASASTSASASASTSASASTSASASASTSASESASTSASASASTSASASASTSAS 641 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S A + + SA TS+S+ S ++S ++ S Sbjct: 642 ASASTSASASASTSTSASASTSASESASTSASASASTS 679 [10][TOP] >UniRef100_B5NH04 Side tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Javiana str. GA_MM04042433 RepID=B5NH04_SALET Length = 813 Score = 56.6 bits (135), Expect = 8e-07 Identities = 54/156 (34%), Positives = 73/156 (46%), Gaps = 3/156 (1%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEV--GGFALF 288 SA AA SE A A+ + A + A AAA+A A S A + S T E A Sbjct: 200 SAAAAKTSEANADASRTAAGDSAAAAAASATAAQTSAARAGASETAAKTSETQAASSAGD 259 Query: 287 ARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*AC-NG 111 A +SA AAA S A +AS A AA+ +A +++T AT S + S A + Sbjct: 260 AGVSATAAAASEKAAAASAA--AAKISETNAATSASTAAASATAASSSASEASNHAAASD 317 Query: 110 SSSPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3 +S+ L TAA +A T + D A +A V SL Sbjct: 318 TSASLAAQSSTAAGAAATRAEDAAKRAEDIADVISL 353 [11][TOP] >UniRef100_C1FFK9 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1FFK9_9CHLO Length = 1470 Score = 56.2 bits (134), Expect = 1e-06 Identities = 52/158 (32%), Positives = 69/158 (43%), Gaps = 1/158 (0%) Frame = +1 Query: 16 DATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINTKVVEAYL 195 +A A + K + KAE AA A + D E A ++ E + A +A Sbjct: 490 EAKAQAEQAKKEAAAKAE-AAKAEAKAAADAEK--AAAAKAKEEAAAKAAAEKAEAQAKA 546 Query: 196 AEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPVTPVSEDSGDAAKDGE-ATK 372 A+ ++ A K A KAEAEKA AKA KA + E + AA D E A K Sbjct: 547 AQEKAAAEAAKKEAAAKAEAEKAEAKAKAEQEKAAAEAAK----KEAAAKAAADKEAAAK 602 Query: 373 AAAAAPAPAADAKLTAALARSSEAAAAHAESLTSSLSA 486 A A A A A AK AA +E A A A++ +A Sbjct: 603 AQAEAKAQAEQAKKEAAAKAEAEKAEAKAKAAQEKAAA 640 [12][TOP] >UniRef100_B5PKH3 Prophage tail fibre N-family protein n=1 Tax=Salmonella enterica subsp. 
enterica serovar Weltevreden str. HI_N05-537 RepID=B5PKH3_SALET Length = 892 Score = 56.2 bits (134), Expect = 1e-06 Identities = 47/154 (30%), Positives = 70/154 (45%), Gaps = 1/154 (0%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALFAR 282 SA A+ SE A A+ + A + A AAA+A A S A + S T E + Sbjct: 200 SAAASKTSEANADASRTAAGDSAAAAAASATAAQTSAARAGASETAAKTSETQAASSAGD 259 Query: 281 LSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*AC-NGSS 105 A A A +AS +A+ + A+T +A +++T AT S + S A + +S Sbjct: 260 AGASATAAAASEKAAAASAAEAKTSETNAATSASTSAASATAASSSASEASTHAAASDTS 319 Query: 104 SPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3 + L TAA +A T + D A +A V SL Sbjct: 320 ASLAAQSSTAAGAAATRAEDAAKRAEDIADVISL 353 [13][TOP] >UniRef100_Q4DYV1 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4DYV1_TRYCR Length = 527 Score = 55.8 bits (133), Expect = 1e-06 Identities = 51/179 (28%), Positives = 74/179 (41%), Gaps = 19/179 (10%) Frame = +1 Query: 7 EATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINTKVVE 186 EA T A ++ AETA + +A + A E+++ K E Sbjct: 216 EAAAKTAAAEAAAAEAKTSAETAKMATANAATAAAKAKAETEKAAAAAAKEATTKAKAAE 275 Query: 187 AYLAEIRSVLAA-------------CKATDALKAEAEKAAAKADKRANKAKPPTSPVTPV 327 A E AA KA K AE+A+AKA + A KAK Sbjct: 276 AAKDEAAKAAAAKAAEEATAAKDEAAKAAATAKTAAEEASAKAAEAAAKAKAAAEAAETA 335 Query: 328 SEDSGDAAKDG--EATKAAAAAPAPAADAKLTAALAR----SSEAAAAHAESLTSSLSA 486 +G AA++ A +AAA A AA+AK +A A+ ++E AAA A++ T +A Sbjct: 336 KASAGKAAEEAAKAAAEAAATAAEAAAEAKTSAETAKTATANTETAAAKAKAETEKAAA 394 [14][TOP] >UniRef100_Q4DQS6 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4DQS6_TRYCR Length = 376 Score = 55.8 bits (133), Expect = 1e-06 Identities = 47/158 (29%), Positives = 63/158 (39%), Gaps = 1/158 (0%) Frame = +1 Query: 10 ATDATEDAPEVKS-DDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINTKVVE 186 AT A A E K+ KA T +A T A +++ A T+ E Sbjct: 110 ATKAKAAAEEAKAVATKAKAATEEAKAAATKAKAAATAAEAAEAAKAAGKAAATATEAAE 169 Query: 187 
AYLAEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPVTPVSEDSGDAAKDGEA 366 A A + AA +A AEAE+AAA+AD AK +E + A E Sbjct: 170 AAKAAAEAAKAAAEAAGEAAAEAEEAAAEADAAITVAKSAVEAAKEAAEKAAKAKAAAET 229 Query: 367 TKAAAAAPAPAADAKLTAALARSSEAAAAHAESLTSSL 480 +A AAA AA K A A + AA E+ +L Sbjct: 230 AEAKAAAAEAAAAEKAAATKADAKATAAKTPEAAAEAL 267 [15][TOP] >UniRef100_B2ISC7 Cell wall surface anchor family protein n=1 Tax=Streptococcus pneumoniae CGSP14 RepID=B2ISC7_STRPS Length = 4695 Score = 55.1 bits (131), Expect = 2e-06 Identities = 48/160 (30%), Positives = 84/160 (52%), Gaps = 2/160 (1%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+A+ S S +AS +SE+ T Sbjct: 2201 SESASTSASASASTSASASASTS-ASASASTSASASASTSASASASTSASESASTSASAS 2259 Query: 299 FALFARLSAL--AAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP 126 + A SA A+A ++++ SAS + A+ + ISA +++T + S + Sbjct: 2260 ASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTS 2319 Query: 125 *ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + + S+S A +A+ SA TS+S S ++S ++ S Sbjct: 2320 ASASASTSASASASTSASASASTSASASASTSASASASTS 2359 Score = 54.7 bits (130), Expect = 3e-06 Identities = 48/173 (27%), Positives = 84/173 (48%), Gaps = 15/173 (8%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSL---------------ASAAGAGAAAAALVASPSLAA 345 SE SA A+A++ ASA+ S ASA+ + +A+A+ S S +A Sbjct: 579 SESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASA 638 Query: 344 SPLSSETGVTGEVGGFALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLM 165 S +SE+ T + A SA +A ++++ S S + A+ + SA +++T Sbjct: 639 STSASESASTSASESASTSASASASTSASASTSASVSASTSASESASTSASASASTSASA 698 Query: 164 ATCEGVSREDLSP*ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S + + + S+S V A +A+ SA TS+S TS ++S ++ AS Sbjct: 699 SASTSASASASTSASASASTSASVSASTSASASASTSASASTSASASASTSAS 751 Score = 54.7 bits (130), Expect = 3e-06 Identities = 48/173 (27%), Positives = 84/173 (48%), Gaps = 15/173 (8%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSL---------------ASAAGAGAAAAALVASPSLAA 
345 SE SA A+A++ ASA+ S ASA+ + +A+A+ S S +A Sbjct: 1343 SESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASA 1402 Query: 344 SPLSSETGVTGEVGGFALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLM 165 S +SE+ T + A SA +A ++++ S S + A+ + SA +++T Sbjct: 1403 STSASESASTSASESASTSASASASTSASASTSASVSASTSASESASTSASASASTSASA 1462 Query: 164 ATCEGVSREDLSP*ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S + + + S+S V A +A+ SA TS+S TS ++S ++ AS Sbjct: 1463 SASTSASASASTSASASASTSASVSASTSASASASTSASASTSASASASTSAS 1515 Score = 54.7 bits (130), Expect = 3e-06 Identities = 48/173 (27%), Positives = 84/173 (48%), Gaps = 15/173 (8%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSL---------------ASAAGAGAAAAALVASPSLAA 345 SE SA A+A++ ASA+ S ASA+ + +A+A+ S S +A Sbjct: 2713 SESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASA 2772 Query: 344 SPLSSETGVTGEVGGFALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLM 165 S +SE+ T + A SA +A ++++ S S + A+ + SA +++T Sbjct: 2773 STSASESASTSASESASTSASASASTSASASTSASVSASTSASESASTSASASASTSASA 2832 Query: 164 ATCEGVSREDLSP*ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S + + + S+S V A +A+ SA TS+S TS ++S ++ AS Sbjct: 2833 SASTSASASASTSASASASTSASVSASTSASASASTSASASTSASASASTSAS 2885 Score = 54.3 bits (129), Expect = 4e-06 Identities = 48/167 (28%), Positives = 85/167 (50%), Gaps = 9/167 (5%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAG--AAAAALVA-------SPSLAASPLSSE 327 SE SA A+A++ ASA+ S +++A A A+A+A + S S +AS +SE Sbjct: 2019 SESASTSASASASTSASASASTSASASASASTSASASASTSASESASTSASASASTSASE 2078 Query: 326 TGVTGEVGGFALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGV 147 + T + A SA +A ++++ S S + A+ + SA +++T + Sbjct: 2079 SASTSASESASTSASASASTSASASTSASVSASTSASESASTSASASASTSASASASTSA 2138 Query: 146 SREDLSP*ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 S + + + S+S V A +A+ SA TS+S TS ++S ++ AS Sbjct: 2139 SASASTSASASASTSASVSASTSASASASTSASASTSASASASTSAS 2185 [16][TOP] >UniRef100_C4EI11 Putative uncharacterized protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=C4EI11_STRRS 
Length = 386 Score = 54.7 bits (130), Expect = 3e-06 Identities = 53/172 (30%), Positives = 74/172 (43%), Gaps = 15/172 (8%) Frame = +1 Query: 7 EATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSS-RETPSHVAINTKVV 183 E A + E K+ + K E VT+ T EP A + + ET A TK Sbjct: 142 EPKPAEAETTETKAAE-TKQEAEPVTAETKPAGSEPAKATEPKPAVAETTEAKATETKAA 200 Query: 184 EAYLAEIRSV-LAACKATDALKAE-----AEKA-AAKADKRANKAKPPTSPVTPVSEDSG 342 E AE +++ + KAT+ + E AE A AAK + A AKP + TP + Sbjct: 201 EIKAAEAKTIEVTESKATETISVEVKTTEAEAAEAAKPETEAEAAKPEPAATTPAGSEPA 260 Query: 343 DAAKDGEATKAAAAA------PAPAA-DAKLTAALARSSEAAAAHAESLTSS 477 AA+ EAT+A A PAPAA ++T E A A +T + Sbjct: 261 KAARTTEATEATEATVSETEIPAPAAVQVEVTEVKVAKPETADAALVEVTET 312 [17][TOP] >UniRef100_Q0T6X6 Cell envelope integrity inner membrane protein TolA n=3 Tax=Shigella RepID=Q0T6X6_SHIF8 Length = 413 Score = 54.3 bits (129), Expect = 4e-06 Identities = 50/151 (33%), Positives = 67/151 (44%) Frame = +1 Query: 7 EATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINTKVVE 186 +A A E A + +D KAE A +A A ++ + + + + E Sbjct: 144 DAKAAEEAAKKAAADAKKKAEAEAAKAA----------AEAQKKAEVAAAALKKKAEAAE 193 Query: 187 AYLAEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPVTPVSEDSGDAAKDGEA 366 A AE R AA +A + KAEAEK AA A+K A K + + AA D A Sbjct: 194 AAAAEARK-KAATEAAEKAKAEAEKKAA-AEKAAADKKAAAEKAAADKKAAEKAAADKAA 251 Query: 367 TKAAAAAPAPAADAKLTAALARSSEAAAAHA 459 AAA AAD K AA A + +AAAA A Sbjct: 252 ADKKAAAEKAAADKKAAAAKAAAEKAAAAKA 282 [18][TOP] >UniRef100_Q4E210 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4E210_TRYCR Length = 419 Score = 54.3 bits (129), Expect = 4e-06 Identities = 49/171 (28%), Positives = 71/171 (41%), Gaps = 10/171 (5%) Frame = +1 Query: 4 KEATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINT-KV 180 ++ A E A + K+ + A+ AA T A T + A ET + A K Sbjct: 50 RQEQGAAEAAADAKAAAETAAKAAAATKAATEAAAKAKEAEAAAKEAETAAKEAETAAKE 109 Query: 181 VEAYLAEIRSVLAACKATD------ALKAEAEKAAAKA---DKRANKAKPPTSPVTPVSE 333 EA E + A KA D A+ A E AA KA + A KAK +E Sbjct: 110 
AEAAAKEAEAAAKAAKAVDTEEKAKAVAAATESAAKKATTASEAAAKAKAAAEEAKAAAE 169 Query: 334 DSGDAAKDGEATKAAAAAPAPAADAKLTAALARSSEAAAAHAESLTSSLSA 486 + A ++ +A AAA AA+A AA + A AA A + ++ +A Sbjct: 170 AAATATEEAKAAAEAAATATEAAEAAAEAAATATEAAKAAEAAAEATAEAA 220 [19][TOP] >UniRef100_Q97P71 Cell wall surface anchor family protein n=1 Tax=Streptococcus pneumoniae RepID=Q97P71_STRPN Length = 4776 Score = 54.3 bits (129), Expect = 4e-06 Identities = 48/160 (30%), Positives = 84/160 (52%), Gaps = 2/160 (1%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+A+ S S +AS +SE+ T Sbjct: 684 SESASTSASASASTSASASASTS-ASASASTSASASASTSASASASTSASESASTSASAS 742 Query: 299 FALFARLSAL--AAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP 126 + A SA A+A ++++ SAS + A+ + ISA +++T + S + Sbjct: 743 ASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTS 802 Query: 125 *ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + + S+S A +A+ SA TS+S S ++S ++ S Sbjct: 803 ASASASTSASESASTSASASASTSASASASTSASASASTS 842 Score = 54.3 bits (129), Expect = 4e-06 Identities = 44/158 (27%), Positives = 80/158 (50%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 S SA A+A++ ASA++S + +A A+ +A S S +AS +SE+ T Sbjct: 1770 SASASTSASASASTSASASASISASESASTSASESA-STSTSASASTSASESASTSASAS 1828 Query: 299 FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A ++++ SAS + A+ + SA +++T + S + + Sbjct: 1829 ASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSAS 1888 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S A +A+VSA TS+S S ++S ++ S Sbjct: 1889 ASASTSASASASTSASVSASTSASASASTSASASASTS 1926 Score = 54.3 bits (129), Expect = 4e-06 Identities = 48/160 (30%), Positives = 84/160 (52%), Gaps = 2/160 (1%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+A+ S S +AS +SE+ T Sbjct: 1928 SESASTSASASASTSASASASTS-ASASASTSASASASTSASASASTSASESASTSASAS 1986 Query: 299 
FALFARLSAL--AAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP 126 + A SA A+A ++++ SAS + A+ + ISA +++T + S + Sbjct: 1987 ASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTS 2046 Query: 125 *ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + + S+S A +A+ SA TS+S S ++S ++ S Sbjct: 2047 ASASASTSASESASTSASASASTSASASASTSASASASTS 2086 Score = 54.3 bits (129), Expect = 4e-06 Identities = 44/158 (27%), Positives = 80/158 (50%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 S SA A+A++ ASA++S + +A A+ +A S S +AS +SE+ T Sbjct: 2712 SASASTSASASASTSASASASISASESASTSASESA-STSTSASASTSASESASTSASAS 2770 Query: 299 FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A ++++ SAS + A+ + SA +++T + S + + Sbjct: 2771 ASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSAS 2830 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S A +A+VSA TS+S S ++S ++ S Sbjct: 2831 ASASTSASASASTSASVSASTSASASASTSASASASTS 2868 Score = 54.3 bits (129), Expect = 4e-06 Identities = 48/160 (30%), Positives = 84/160 (52%), Gaps = 2/160 (1%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+A+ S S +AS +SE+ T Sbjct: 3170 SESASTSASASASTSASASASTS-ASASASTSASASASTSASASASTSASESASTSASAS 3228 Query: 299 FALFARLSAL--AAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP 126 + A SA A+A ++++ SAS + A+ + ISA +++T + S + Sbjct: 3229 ASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTS 3288 Query: 125 *ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + + S+S A +A+ SA TS+S S ++S ++ S Sbjct: 3289 ASASASTSASESASTSASASASTSASASASTSASASASTS 3328 Score = 53.9 bits (128), Expect = 5e-06 Identities = 45/158 (28%), Positives = 81/158 (51%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+A+ S S +AS +S + T Sbjct: 2798 SESASTSASASASTSASASASTS-ASASASTSASASASTSASASASTSASVSASTSASAS 2856 Query: 299 
FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A +++ SAS + A+ + SA +++T + S + + Sbjct: 2857 ASTSASASASTSASESASTSASASTSASESASTSASASASTSASASASTSASASASTSAS 2916 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S A +A+ SA TS+S+ S ++S ++ S Sbjct: 2917 ESASTSASASASTSASASASTSASESASTSASASASTS 2954 Score = 53.5 bits (127), Expect = 7e-06 Identities = 44/158 (27%), Positives = 80/158 (50%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 S SA A+A++ ASA++S + +A A+ +A S S +AS +SE+ T Sbjct: 534 SASASTSASASASTSASASASISASESASTSASESA-STSTSASASTSASESASTSASAS 592 Query: 299 FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A ++++ SAS + A+ + SA +++T + S + + Sbjct: 593 ASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSAS 652 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S V A +A+ SA TS+S S ++S ++ S Sbjct: 653 ASASTSASVSASTSASASASTSASASASTSASESASTS 690 Score = 53.5 bits (127), Expect = 7e-06 Identities = 44/158 (27%), Positives = 80/158 (50%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 S SA A+A++ ASA++S + +A A+ +A S S +AS +SE+ T Sbjct: 3020 SASASTSASASASTSASASASISASESASTSASESA-STSTSASASTSASESASTSASAS 3078 Query: 299 FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A ++++ SAS + A+ + SA +++T + S + + Sbjct: 3079 ASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSAS 3138 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S V A +A+ SA TS+S S ++S ++ S Sbjct: 3139 ASASTSASVSASTSASASASTSASASASTSASESASTS 3176 Score = 53.1 bits (126), Expect = 9e-06 Identities = 44/158 (27%), Positives = 80/158 (50%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+ + S S +AS +SE+ T Sbjct: 1098 SESASTSASASASTSASASASTS-ASASASTSASESASTSTSASASTSASESASTSASAS 1156 Query: 299 FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A ++++ SAS + + + 
SA +++T + S + + Sbjct: 1157 ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSAS 1216 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S A +A+ SA TS+S S ++S ++ S Sbjct: 1217 ASASTSASASASTSASASASTSASASASTSASASASTS 1254 Score = 53.1 bits (126), Expect = 9e-06 Identities = 44/158 (27%), Positives = 80/158 (50%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+ + S S +AS +SE+ T Sbjct: 2342 SESASTSASASASTSASASASTS-ASASASTSASESASTSTSASASTSASESASTSASAS 2400 Query: 299 FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A ++++ SAS + + + SA +++T + S + + Sbjct: 2401 ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSAS 2460 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S A +A+ SA TS+S S ++S ++ S Sbjct: 2461 ASASTSASASASTSASASASTSASASASTSASASASTS 2498 Score = 53.1 bits (126), Expect = 9e-06 Identities = 44/158 (27%), Positives = 80/158 (50%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+ + S S +AS +SE+ T Sbjct: 3584 SESASTSASASASTSASASASTS-ASASASTSASESASTSTSASASTSASESASTSASAS 3642 Query: 299 FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A ++++ SAS + + + SA +++T + S + + Sbjct: 3643 ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSAS 3702 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S A +A+ SA TS+S S ++S ++ S Sbjct: 3703 ASASTSASASASTSASASASTSASASASTSASASASTS 3740 Score = 53.1 bits (126), Expect = 9e-06 Identities = 44/158 (27%), Positives = 80/158 (50%) Frame = -2 Query: 479 SEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGG 300 SE SA A+A++ ASA+ S ASA+ + +A+ + S S +AS +SE+ T Sbjct: 4142 SESASTSASASASTSASASASTS-ASASASTSASESASTSTSASASTSASESASTSASAS 4200 Query: 299 FALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*A 120 + A SA +A ++++ SAS + + + SA +++T + S + + Sbjct: 4201 
ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSAS 4260 Query: 119 CNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 + S+S A +A+ SA TS+S S ++S ++ S Sbjct: 4261 ASASTSASASASTSASASASTSASASASTSASASASTS 4298 [20][TOP] >UniRef100_A8IFL6 Flagellar associated protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IFL6_CHLRE Length = 2058 Score = 53.9 bits (128), Expect = 5e-06 Identities = 53/164 (32%), Positives = 77/164 (46%), Gaps = 5/164 (3%) Frame = +1 Query: 7 EATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRE-TPSHVAINTKVV 183 EA A E A ++ AE A +A + ++ A E ++ E T + + T Sbjct: 767 EAAAAAEAAAAGDAEAAPIAEAGAAAAAAAAAEEVAQAAADEAAAAEATATATSDETAEA 826 Query: 184 EAYLAEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPVTPVSEDSGDAAKDGE 363 EA A + A A A AE++ AAA K A +AK + ++ AA+ E Sbjct: 827 EAKAAAEAAAEAKAAAEAAAAAESQAAAAAEAKLAAEAKAAEAAAEAAEAEAKAAAETKE 886 Query: 364 ATKAAAAAPA---PAADAKLTA-ALARSSEAAAAHAESLTSSLS 483 A +AAAAA A AA+AK A A A + EAA A AE+ ++ S Sbjct: 887 AAEAAAAAEAEAKAAAEAKAAAEAAAAAEEAAKAAAEAAVAAAS 930 [21][TOP] >UniRef100_Q4E1Y4 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4E1Y4_TRYCR Length = 426 Score = 53.9 bits (128), Expect = 5e-06 Identities = 51/159 (32%), Positives = 70/159 (44%), Gaps = 1/159 (0%) Frame = +1 Query: 4 KEATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINTKVV 183 K+AT A E A + K+ + AE AA +A E + A ++ + A Sbjct: 135 KKATAAAEAATKAKAAAEKAAEEAATAAAA-----ETVTATAAAAAEAATAAAATQEAAT 189 Query: 184 EAYLAEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPVTPVSEDSGDAAKDGE 363 EA A KA A K AE+A A K A +AK + AA E Sbjct: 190 EA--------AEAAKAAAAAKGAAEEAKEAAKKAAEEAKAKAKEAAEAEAEKAAAATQ-E 240 Query: 364 ATKAAAAAPAPAADAKLTAA-LARSSEAAAAHAESLTSS 477 A KA AAA A AA A + AA A+++EAA A A+ ++ Sbjct: 241 AAKATAAAEAAAAAAAVAAAEAAKATEAAKAEAKKAAAA 279 [22][TOP] >UniRef100_UPI000180D19C PREDICTED: similar to GH18720 n=1 Tax=Ciona intestinalis RepID=UPI000180D19C Length = 754 Score = 53.9 bits (128), Expect = 5e-06 Identities = 56/156 (35%), Positives = 76/156 (48%), Gaps 
= 4/156 (2%) Frame = -2 Query: 461 SAWAAAASEER--ASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALF 288 SA A AS+ ASA+ + ASA+ AGA+A+A AS S A + S+ G A Sbjct: 96 SASVAGASDSMTGASASAAGASASAAGASASAAGASASAAGASASTAGASASAAGASASA 155 Query: 287 ARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP*ACNGS 108 A SA AA SASA AS A A +E ++ AST + +T G S S Sbjct: 156 AGASASAAGASASATWAS-ASTAGASESLAGASASTAWASASTA-GASAS-----TAGAS 208 Query: 107 SSPLVVALVTAAVSALT--SSSDLTSGASSVASVAS 6 +S + TA SA T +S+ T ++SV V+S Sbjct: 209 ASTAGASAFTAGASAATAGASASTTGASASVTCVSS 244 [23][TOP] >UniRef100_C7TJM8 Putative uncharacterized protein n=1 Tax=Lactobacillus rhamnosus Lc 705 RepID=C7TJM8_LACRL Length = 3390 Score = 53.9 bits (128), Expect = 5e-06 Identities = 49/159 (30%), Positives = 78/159 (49%), Gaps = 7/159 (4%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALFAR 282 S++AA+AS+ AA +SAA G AA S + AAS +++ A + Sbjct: 1405 SSYAASASK----AATEASSAADKGKNAATKALSEAYAASSAANDAASIAVAASTAASSL 1460 Query: 281 LSALAAAFSASALSASVALHAARTERISAR*AS----TTFVLMATCEGVSR---EDLSP* 123 S++ + +A++ AS A AAR+ + A AS + + +T V+ +D S Sbjct: 1461 ASSITSGNTAASDKASAASDAARSASVVASTASVKANSANAIASTASSVAASGYQDASNI 1520 Query: 122 ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 A +P + +L T A SA + ++DL ASS AS AS Sbjct: 1521 ATRYPGNPSLTSLATVASSANSETADLAKSASSDASAAS 1559 [24][TOP] >UniRef100_C7TCZ4 Putative cell surface protein n=1 Tax=Lactobacillus rhamnosus GG RepID=C7TCZ4_LACRG Length = 3275 Score = 53.9 bits (128), Expect = 5e-06 Identities = 49/159 (30%), Positives = 78/159 (49%), Gaps = 7/159 (4%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALFAR 282 S++AA+AS+ AA +SAA G AA S + AAS +++ A + Sbjct: 1405 SSYAASASK----AATEASSAADKGKNAATKALSEAYAASSAANDAASIAVAASTAASSL 1460 Query: 281 LSALAAAFSASALSASVALHAARTERISAR*AS----TTFVLMATCEGVSR---EDLSP* 123 S++ + +A++ AS A AAR+ + A AS + + +T V+ +D S Sbjct: 1461 ASSITSGNTAASDKASAASDAARSASVVASTASVKANSANAIASTASSVAASGYQDASNI 1520 Query: 122 
ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 A +P + +L T A SA + ++DL ASS AS AS Sbjct: 1521 ATRYPGNPSLTSLATVASSANSETADLAKSASSDASAAS 1559 [25][TOP] >UniRef100_B5QK50 Putative uncharacterized protein n=1 Tax=Lactobacillus rhamnosus HN001 RepID=B5QK50_LACRH Length = 3275 Score = 53.9 bits (128), Expect = 5e-06 Identities = 49/159 (30%), Positives = 78/159 (49%), Gaps = 7/159 (4%) Frame = -2 Query: 461 SAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVGGFALFAR 282 S++AA+AS+ AA +SAA G AA S + AAS +++ A + Sbjct: 1405 SSYAASASK----AATEASSAADKGKNAATKALSEAYAASSAANDAASIAVAASTAASSL 1460 Query: 281 LSALAAAFSASALSASVALHAARTERISAR*AS----TTFVLMATCEGVSR---EDLSP* 123 S++ + +A++ AS A AAR+ + A AS + + +T V+ +D S Sbjct: 1461 ASSITSGNTAASDKASAASDAARSASVVASTASVKANSANAIASTASSVAASGYQDASNI 1520 Query: 122 ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVAS 6 A +P + +L T A SA + ++DL ASS AS AS Sbjct: 1521 ATRYPGNPSLTSLATVASSANSETADLAKSASSDASAAS 1559 [26][TOP] >UniRef100_B4TDX9 Side tail fiber protein n=2 Tax=Salmonella enterica subsp. enterica serovar Heidelberg RepID=B4TDX9_SALHS Length = 791 Score = 53.9 bits (128), Expect = 5e-06 Identities = 57/168 (33%), Positives = 81/168 (48%), Gaps = 7/168 (4%) Frame = -2 Query: 485 ADSEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPS------LAASPLSSET 324 AD+ +A AAAA A+A S +AAG AAAAA A+ + AS +++T Sbjct: 190 ADTARTAAAASAAAAKTSEANADAS-RTAAGDSAAAAAASATAAQTSAERAGASETAAKT 248 Query: 323 GVTGEVGGFALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVS 144 T + A A SA AAA S A +AS A AA+T +A +++T AT S Sbjct: 249 SET-QAASSAGDAGASATAAAASEKAAAASAA--AAKTSETNAATSASTAAASATAASSS 305 Query: 143 REDLSP-*ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3 + S A + +S+ L TAA +A T + + A +A V SL Sbjct: 306 ASEASTHAAASDTSASLAAQSSTAAGAAATRAEEAAKRAEDIADVISL 353 [27][TOP] >UniRef100_Q28RH6 Mucin-associated surface protein n=1 Tax=Jannaschia sp. 
CCS1 RepID=Q28RH6_JANSC
Length = 304
Score = 53.5 bits (127), Expect = 7e-06
Identities = 49/172 (28%), Positives = 76/172 (44%), Gaps = 10/172 (5%)
Frame = +1

Query: 1   VKEATDATEDAP-----EVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVA 165
V++ T A ++A E + ++ AE AA A + + A ++ + A
Sbjct: 58  VEDVTGAADEAAAAAEAEAAAAEEAAAEAAAAAEAEAAEAAAAVEAEAAEAAAAAEAEAA 117

Query: 166 INTKVVEAYLAEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPVTPVSEDSGD 345
+EA A AA +A A +AEA +AAA A+ A +A +ED+
Sbjct: 118 EAAAAIEAEAAAAEE--AAAEAAAAAEAEAAEAAAAAEAEAAEAAAAVEAEAAAAEDAAT 175

Query: 346 ---AAKDGEATKAAAA--APAPAADAKLTAALARSSEAAAAHAESLTSSLSA 486
AA + EAT+AA A A A AA + AA + ++E A E T +A
Sbjct: 176 EAAAAVEAEATEAADAVEADAAAATEEAEAAASEAAETVEAATEEATGETAA 227

[28]
>UniRef100_Q4E457 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4E457_TRYCR
Length = 427
Score = 53.5 bits (127), Expect = 7e-06
Identities = 48/161 (29%), Positives = 72/161 (44%)
Frame = +1

Query: 4   KEATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINTKVV 183
K AT+AT+ A + + AE AA + E A +++ +
Sbjct: 75  KAATEATKTAAVKAEEAEAAAEAAAKAAEAAEAAAEEAKAAATEAAKAVDTEEKARAAAA 134

Query: 184 EAYLAEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPVTPVSEDSGDAAKDGE 363
A A ++ A+ AT A KA AE+A A A+ A KA+ + +E + A + E
Sbjct: 135 AAESAATKATTASKAATKA-KAAAEEAKAAAEAAAAKAEEAEAKAA--AEAAATATEAAE 191

Query: 364 ATKAAAAAPAPAADAKLTAALARSSEAAAAHAESLTSSLSA 486
A KAAA A A AA A ++EAAA A + T++ A
Sbjct: 192 AAKAAAEAAAEAA-----ATATEAAEAAATAANAATAAAKA 227

[29]
>UniRef100_Q4DVE5 Mucin-associated surface protein (MASP), putative n=1 Tax=Trypanosoma cruzi RepID=Q4DVE5_TRYCR
Length = 398
Score = 53.5 bits (127), Expect = 7e-06
Identities = 51/162 (31%), Positives = 68/162 (41%), Gaps = 17/162 (10%)
Frame = +1

Query: 31  APEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERSSRETPSHVAINTKVVEAYLAEIRS 210
A E +D AETAA +A T E E++ +E + EA +++
Sbjct: 55  AAEAAADAQAAAETAAKAAAATKAATEATKRAAEKA-KEAEAAAEAAKAAAEAAATAVKA 113

Query: 211 VLAACKA-----------------TDALKAEAEKAAAKADKRANKAKPPTSPVTPVSEDS 339
V A KA +A KA AE AAAKA A KA+ + +E +
Sbjct: 114 VDAEAKAKAAAAATESAATKAKAAAEAAKAAAE-AAAKAAAAAAKAEEAEAEAKAAAEAA 172

Query: 340 GDAAKDGEATKAAAAAPAPAADAKLTAALARSSEAAAAHAES 465
A + EA KAAA A A AA AA + AA A AE+
Sbjct: 173 AKAREAAEAAKAAAVAAAGAAAEAANAATLAAKSAAKASAEA 214

[30]
>UniRef100_C0PVT4 Gifsy-2 prophage probable tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594 RepID=C0PVT4_SALPC
Length = 735
Score = 53.5 bits (127), Expect = 7e-06
Identities = 53/161 (32%), Positives = 77/161 (47%), Gaps = 2/161 (1%)
Frame = -2

Query: 479 SEEVKDSAWAAAASEE-RASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVG 303
SE +++ A ASE+ + SA + SA A AA A AS + AAS +S G
Sbjct: 118 SEASRNATAAGQASEQAQTSAGQASESATAAVNAAGAAEASATQAASSAASAESSAGTAT 177

Query: 302 GFALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP* 123
A A SA +A + +A +AS A AA+T +A +++T AT S + S
Sbjct: 178 TKAGEASASAASADTARTAAAASAA--AAKTSETNAATSASTAAASATAASSSASEASTH 235

Query: 122 AC-NGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3
A + +S+ L TAA +A T + D A +A V SL
Sbjct: 236 AAASDTSASLAAQSSTAAGAAATRAEDAAKRAEDIADVISL 276

[31]
>UniRef100_C4Y9W6 Predicted protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y9W6_CLAL4
Length = 229
Score = 53.5 bits (127), Expect = 7e-06
Identities = 53/161 (32%), Positives = 80/161 (49%), Gaps = 1/161 (0%)
Frame = -2

Query: 482 DSEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAALVASPSLAASPLSSETGVTGEVG 303
D+ DSA AA+AS E SA+V+ A +A AAAAA + S+AA +SE V+ +
Sbjct: 36  DATSSFDSASAASASAE--SASVASAESASVAAAAAAAASLSSVAAVSAASEAAVSASIA 93

Query: 302 GFALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDLSP* 123
A ++L + SAS S + A A S AS V A+ VS E +
Sbjct: 94  S----AESASLVSVASASEASEAAAASIASASAASVSSASAASVSSASGASVSSESAA-- 147

Query: 122 ACNGSSSPLVVALVT-AAVSALTSSSDLTSGASSVASVASL 3
S S + AL + A+VS+L++ + +T+ VA +S+
Sbjct: 148 ----SVSSVSAALASEASVSSLSAKAHVTAYQGMVALNSSV 184

[32]
>UniRef100_UPI0001851573 protein TolA n=1 Tax=Escherichia coli O157:H7 str. EC4042 RepID=UPI0001851573
Length = 310
Score = 53.1 bits (126), Expect = 9e-06
Identities = 55/157 (35%), Positives = 68/157 (43%), Gaps = 7/157 (4%)
Frame = +1

Query: 10  ATDATEDAPEVKSDDDVKAETAAVTSATTS---GDDEPLHAHGERSSRETPSHVAINTKV 180
A A DA DD AE AA +A + + E A E + + A+ K
Sbjct: 138 AAKAAADAKAKAEADDKAAEEAAKKAAADAKKKAEAEAAKAAAEAQKKAEAAAAALKKKA 197

Query: 181 --VEAYLAEIRSVLAACKATDALKAEAEKAAA--KADKRANKAKPPTSPVTPVSEDSGDA 348
EA AE R AA KA KA AEKAAA KA ++A K + + A
Sbjct: 198 EAAEAAAAEARKKAAAEKAAADKKA-AEKAAADKKAAEKAAAEKAAAEKAAADKKAAEKA 256

Query: 349 AKDGEATKAAAAAPAPAADAKLTAALARSSEAAAAHA 459
A + A AAA AAD K AA A + +AAAA A
Sbjct: 257 AAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKA 293

[33]
>UniRef100_C6V1F2 Membrane anchored protein in TolA-TolQ-TolR complex n=4 Tax=Escherichia coli O157:H7 RepID=C6V1F2_ECO5T
Length = 424
Score = 53.1 bits (126), Expect = 9e-06
Identities = 55/157 (35%), Positives = 68/157 (43%), Gaps = 7/157 (4%)
Frame = +1

Query: 10  ATDATEDAPEVKSDDDVKAETAAVTSATTS---GDDEPLHAHGERSSRETPSHVAINTKV 180
A A DA DD AE AA +A + + E A E + + A+ K
Sbjct: 138 AAKAAADAKAKAEADDKAAEEAAKKAAADAKKKAEAEAAKAAAEAQKKAEAAAAALKKKA 197

Query: 181 --VEAYLAEIRSVLAACKATDALKAEAEKAAA--KADKRANKAKPPTSPVTPVSEDSGDA 348
EA AE R AA KA KA AEKAAA KA ++A K + + A
Sbjct: 198 EAAEAAAAEARKKAAAEKAAADKKA-AEKAAADKKAAEKAAAEKAAAEKAAADKKAAEKA 256

Query: 349 AKDGEATKAAAAAPAPAADAKLTAALARSSEAAAAHA 459
A + A AAA AAD K AA A + +AAAA A
Sbjct: 257 AAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKA 293

[34]
>UniRef100_C1VAP2 Putative uncharacterized protein n=1 Tax=Halogeometricum borinquense DSM 11551 RepID=C1VAP2_9EURY
Length = 726
Score = 53.1 bits (126), Expect = 9e-06
Identities = 42/150 (28%), Positives = 66/150 (44%), Gaps = 9/150 (6%)
Frame = +1

Query: 1   VKEATDATEDAPEVKSDDDVKAETAAVTSATTSGDDEPLHAHGERS--------SRETPS 156
VKE + + PE+++DD+ +A AA ++ + D+ P E S S PS
Sbjct: 167 VKEVDVSIVELPEIETDDEPEASAAATETSAEAPDETPTEPSEEASEASAAEVESSSEPS 226

Query: 157 HVAINTKVVEAYLAEIRSVLAACKATDALKAEAEKAAAKADKRANKAKPPTSPVTPVSED 336
+E S + A D+ K E+E A AD ++ +PP + +E
Sbjct: 227 STDSTAADAAEAASEAESAQSERAADDSSKPESE-TAEPADTEGDETEPPEADPATTTEP 285

Query: 337 SGDA-AKDGEATKAAAAAPAPAADAKLTAA 423
DA A DG+ + AAA +A A +AA
Sbjct: 286 KQDAGAPDGQTAEPTAAAADQSASADQSAA 315

[35]
>UniRef100_B4AB41 Side tail fiber protein n=1 Tax=Salmonella enterica subsp. enterica serovar Newport str. SL317 RepID=B4AB41_SALNE
Length = 812
Score = 53.1 bits (126), Expect = 9e-06
Identities = 52/164 (31%), Positives = 75/164 (45%), Gaps = 3/164 (1%)
Frame = -2

Query: 485 ADSEEVKDSAWAAAASEERASAAVSLASAAGAGAAAAA-LVASPSLAASPLSSETGV-TG 312
AD+ +A AAAA A+A VS +A + AAAAA A+ + A +SET T
Sbjct: 190 ADTARTAVAASAAAAKTSEANADVSRTAAGDSAAAAAASATAAQASAERAGASETAAKTS 249

Query: 311 EVGGFALFARLSALAAAFSASALSASVALHAARTERISAR*ASTTFVLMATCEGVSREDL 132
E + A A A +AS +A+ + A+T +A ++ T AT S
Sbjct: 250 ETQAASSAGDAGASATAAAASKKAAAASAAEAKTSETNAATSANTAAASATAASSSASAA 309

Query: 131 SP-*ACNGSSSPLVVALVTAAVSALTSSSDLTSGASSVASVASL 3
S A + +S+ L TAA +A T + D A +A V SL
Sbjct: 310 STHAAASDTSASLAAQSSTAAGAAATRAEDAAKRAEDIADVISL 353

[36]
>UniRef100_Q5ZDJ4 Putative uncharacterized protein P0686E09.25 n=1 Tax=Oryza sativa Japonica Group RepID=Q5ZDJ4_ORYSJ
Length = 284
Score = 53.1 bits (126), Expect = 9e-06
Identities = 50/151 (33%), Positives = 59/151 (39%), Gaps = 17/151 (11%)
Frame = +2

Query: 14  PTPRRTLLRSSLTTTSRPRPRL*LVRPPVVTMSR----------------CMLMARGPPV 145
P+PR +R S T +PRP L L PPV R L R PP
Sbjct: 148 PSPRA--VRPSAVTLYQPRPPLPLAPPPVRERERERERGRERRGRRRLHHPHLRHRRPPT 205

Query: 146 RRPRTWPSTRRWWRLTWR-KFVPFSPHARPQTRSKLRRRRRRPRRTSGRTRRNPPLRLSL 322
RPR P R+ R P P A P S+LR R R P + R RR PL
Sbjct: 206 VRPRALPLAASRLRVRLRAPAPPVHPRAPPLAASRLRVRPRAPPLATSRARRRSPL---- 261

Query: 323 PFQRTAATLPRTARRLRRPPPPPLRPLMPSS 415
AA++P RPPP P PS+
Sbjct: 262 ---AAAASVP-------RPPPSAAAPACPSA 282