Medicago
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= AC149581.7 + phase: 0 
         (392 letters)

Database: nr 
           2,540,612 sequences; 863,360,394 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAD46679.1| unknown protein [Oryza sativa (japonica cultivar...   481  e-134
ref|NP_180377.2| glycosyl hydrolase family 29 / alpha-L-fucosida...   408  e-112
dbj|BAC43615.1| unknown protein [Arabidopsis thaliana]                405  e-112
emb|CAD41073.2| OSJNBa0084K11.7 [Oryza sativa (japonica cultivar...   393  e-108
dbj|BAB81582.1| conserved hypothetical protein [Clostridium perf...   328  1e-88
ref|ZP_00523748.1| Coagulation factor 5/8 type, C-terminal [Soli...   280  7e-74
gb|AAS19690.1| FucA [Streptococcus gordonii]                          269  1e-70
ref|ZP_00403950.1| COG3669: Alpha-L-fucosidase [Streptococcus pn...   269  1e-70
ref|NP_359545.1| hypothetical protein spr1954 [Streptococcus pne...   267  4e-70
gb|AAO79818.1| conserved hypothetical protein [Bacteroides theta...   256  6e-67
gb|AAO79241.1| conserved hypothetical protein [Bacteroides theta...   254  3e-66
emb|CAH08778.1| putative exported fucosidase [Bacteroides fragil...   253  7e-66
ref|YP_100522.1| hypothetical protein BF3243 [Bacteroides fragil...   251  2e-65
gb|AAO77299.1| conserved hypothetical protein [Bacteroides theta...   251  4e-65
emb|CAH06409.1| conserved hypothetical exported protein [Bactero...   246  9e-64
gb|AAO76732.1| conserved hypothetical protein [Bacteroides theta...   244  4e-63
gb|AAQ66752.1| alpha-1,3/4-fucosidase, putative [Porphyromonas g...   243  6e-63
emb|CAH08896.1| putative lipoprotein [Bacteroides fragilis NCTC ...   243  6e-63
ref|YP_100649.1| hypothetical protein BF3371 [Bacteroides fragil...   243  6e-63
gb|EAA55123.1| hypothetical protein MG06780.4 [Magnaporthe grise...   229  1e-58

>dbj|BAD46679.1| unknown protein [Oryza sativa (japonica cultivar-group)]
          Length = 494

 Score =  481 bits (1237), Expect = e-134
 Identities = 226/390 (57%), Positives = 289/390 (73%), Gaps = 11/390 (2%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           ++L AKHHDGFCLWPS +T HSV +S W+ G+GDVV+EF +AA  +G+D+GIYLSPWDRH
Sbjct: 103 VVLVAKHHDGFCLWPSAHTAHSVRASPWRGGRGDVVREFADAARARGLDIGIYLSPWDRH 162

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGAKDPRAQNVTYYFSDWFSMVKELQS 120
           D RYG ++ YNEYYLAQL ELL  Y  V EIWFDGAK   A N+TY+F +WF  V++LQS
Sbjct: 163 DKRYGREVAYNEYYLAQLHELLTGYGSVSEIWFDGAKGKNATNMTYHFQEWFQTVRQLQS 222

Query: 121 SINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNTGDPKGTDWLPAE 180
           SINIFSD GPD+RWVGDE G+AG TCWSTINR+ ++IG + I +YLNTGDP+G DW+P E
Sbjct: 223 SINIFSDDGPDLRWVGDENGSAGSTCWSTINRSKITIGEAGIEKYLNTGDPRGKDWVPPE 282

Query: 181 CDVSIRPGWFWHKSESPKKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLISENDAHRLKEF 240
           CDVSIRPGWFWHK+E+ K L +LL++YY SVGRNCVLLLN PPNTTGL+   D  RL+EF
Sbjct: 283 CDVSIRPGWFWHKNETAKPLPELLEVYYNSVGRNCVLLLNAPPNTTGLVDAADIARLREF 342

Query: 241 RSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYWTPREDDKEK--DHW 298
           R+A+  IF  ++A     + SS+RGG+   F   N+LD     +YW P   + E    +W
Sbjct: 343 RTAVTAIFGTDLAAGSAARASSERGGR---FAAANVLDGRD-DTYWAPAAAEAEDGGGYW 398

Query: 299 IEIW--GNDGSLRFNVIRIQEAIGLGQRIERYEIYVD--GKSIIQGTTIGYKRLHRLDGD 354
           IE+    +  + +FNV+RIQE + +GQR+ER+E+YVD  G ++  GTT+G+KRLHRL G 
Sbjct: 399 IELRRPASAAARKFNVVRIQEHVAMGQRVERHEVYVDGGGAAVASGTTVGHKRLHRL-GA 457

Query: 355 VVHARVVRIRFIKARGVPLISSIGLHFDPF 384
            V  R VR+     RG PL+S++GLH DPF
Sbjct: 458 PVAGRTVRVWLASRRGPPLLSAVGLHLDPF 487


>ref|NP_180377.2| glycosyl hydrolase family 29 / alpha-L-fucosidase, putative
           [Arabidopsis thaliana]
          Length = 506

 Score =  408 bits (1049), Expect = e-112
 Identities = 202/390 (51%), Positives = 263/390 (66%), Gaps = 12/390 (3%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +ILTAKHHDGFCLWPS+YT +SV SS+W+NG GDVV E  +AA + GI +G+YLSPWDRH
Sbjct: 98  VILTAKHHDGFCLWPSEYTDYSVKSSQWRNGAGDVVAELASAAKEAGIGLGLYLSPWDRH 157

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGAKDPRAQNVTYYFSDWFSMVKELQS 120
           +  YG  L YNE+YL+Q+ ELL KY +++E+W DGAK    +++ Y+F  WFS++ +LQ 
Sbjct: 158 EQCYGKTLEYNEFYLSQMTELLTKYGEIKEVWLDGAKGDGEKDMEYFFDTWFSLIHQLQP 217

Query: 121 SINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNTGDPKGTDWLPAE 180
              IFSDAGPDVRW+GDE G AG TCWS  NRT+  IG +    Y   GD  G DW+PAE
Sbjct: 218 KAVIFSDAGPDVRWIGDEAGLAGSTCWSLFNRTNAKIGDTE-PSYSQEGDGYGQDWVPAE 276

Query: 181 CDVSIRPGWFWHKSESPKKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLISENDAHRLKEF 240
           CDVSIRPGWFWH SESPK    LLDIYY SVGRNC+ LLNVPPN++GLISE D   L+EF
Sbjct: 277 CDVSIRPGWFWHASESPKPAVQLLDIYYNSVGRNCLFLLNVPPNSSGLISEQDIKVLEEF 336

Query: 241 RSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYWTPREDDKEKDHWIE 300
               ++IF  N+A   +V  SS RG +   FGP+N+L+ + L  YW P E+  E   W+ 
Sbjct: 337 SEMKNSIFSNNLARKAFVNSSSIRGDQSSQFGPKNVLE-EGLDKYWAPEENQNE---WVL 392

Query: 301 IWGNDGSLRFNVIRIQEAIGLGQRIERYEIYV------DGKSIIQGTTIGYKRLHRLDGD 354
                  + FNV+ I+E I +GQRI  + +        + + ++ GTT+G KRL R   +
Sbjct: 393 YLEFKDLVSFNVLEIREPIHMGQRIASFHLETRKTGSGEWERVVSGTTVGNKRLLRF-LN 451

Query: 355 VVHARVVRIRFIKARGVPLISSIGLHFDPF 384
           VV +R +++   KAR  PLIS +GL+ D F
Sbjct: 452 VVESRSLKLVVDKARTDPLISYLGLYMDKF 481


>dbj|BAC43615.1| unknown protein [Arabidopsis thaliana]
          Length = 506

 Score =  405 bits (1042), Expect = e-112
 Identities = 201/390 (51%), Positives = 262/390 (66%), Gaps = 12/390 (3%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +ILTAKHHDGFCLWPS+YT +SV SS+W+NG GDVV    +AA + GI +G+YLSPWDRH
Sbjct: 98  VILTAKHHDGFCLWPSEYTDYSVKSSQWRNGAGDVVAGLASAAKEAGIGLGLYLSPWDRH 157

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGAKDPRAQNVTYYFSDWFSMVKELQS 120
           +  YG  L YNE+YL+Q+ ELL KY +++E+W DGAK    +++ Y+F  WFS++ +LQ 
Sbjct: 158 EQCYGKTLEYNEFYLSQMTELLTKYGEIKEVWLDGAKGDGEKDMEYFFDTWFSLIHQLQP 217

Query: 121 SINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNTGDPKGTDWLPAE 180
              IFSDAGPDVRW+GDE G AG TCWS  NRT+  IG +    Y   GD  G DW+PAE
Sbjct: 218 KAVIFSDAGPDVRWIGDEAGLAGSTCWSLFNRTNAKIGDTE-PSYSQEGDGYGQDWVPAE 276

Query: 181 CDVSIRPGWFWHKSESPKKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLISENDAHRLKEF 240
           CDVSIRPGWFWH SESPK    LLDIYY SVGRNC+ LLNVPPN++GLISE D   L+EF
Sbjct: 277 CDVSIRPGWFWHASESPKPAVQLLDIYYNSVGRNCLFLLNVPPNSSGLISEQDIKVLEEF 336

Query: 241 RSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYWTPREDDKEKDHWIE 300
               ++IF  N+A   +V  SS RG +   FGP+N+L+ + L  YW P E+  E   W+ 
Sbjct: 337 SEMKNSIFSNNLARKAFVNSSSIRGDQSSQFGPKNVLE-EGLDKYWAPEENQNE---WVL 392

Query: 301 IWGNDGSLRFNVIRIQEAIGLGQRIERYEIYV------DGKSIIQGTTIGYKRLHRLDGD 354
                  + FNV+ I+E I +GQRI  + +        + + ++ GTT+G KRL R   +
Sbjct: 393 YLEFKDLVSFNVLEIREPIHMGQRIASFHLETRKTGSGEWERVVSGTTVGNKRLLRF-LN 451

Query: 355 VVHARVVRIRFIKARGVPLISSIGLHFDPF 384
           VV +R +++   KAR  PLIS +GL+ D F
Sbjct: 452 VVESRSLKLVVDKARTDPLISYLGLYMDKF 481


>emb|CAD41073.2| OSJNBa0084K11.7 [Oryza sativa (japonica cultivar-group)]
           gi|50927915|ref|XP_473485.1| OSJNBa0084K11.7 [Oryza
           sativa (japonica cultivar-group)]
          Length = 517

 Score =  393 bits (1010), Expect = e-108
 Identities = 200/389 (51%), Positives = 260/389 (66%), Gaps = 13/389 (3%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           ++LTAKHHDGFCLWPS  T +SV +S W+ G GDVV E   AA  +GI +G+YLSPWDRH
Sbjct: 100 VVLTAKHHDGFCLWPSALTNYSVAASPWKGGAGDVVGELAAAARAEGIGLGLYLSPWDRH 159

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGAKDPRAQNVTYYFSDWFSMVKELQS 120
           +  YG  + YNE+YL Q+ ELL +Y DV E+W DGAK    +++ Y F  WF+++ +LQ 
Sbjct: 160 EPVYGDTVAYNEHYLGQMTELLTRYGDVEEVWLDGAKG-EGKDMDYMFDAWFALIHQLQQ 218

Query: 121 SINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNTGDPKGTDWLPAE 180
            + IFSDAGPD RWVGDE G AG TCWS  N+++++IG   I +Y   GDP G DW+PAE
Sbjct: 219 RVVIFSDAGPDTRWVGDEAGVAGYTCWSPFNKSTVTIGHI-IPEYSRCGDPFGQDWVPAE 277

Query: 181 CDVSIRPGWFWHKSESPKKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLISENDAHRLKEF 240
           CDVSIRPGWFWH SE PK  + LLDIYYKSVGRNC+L+LNVPPN++GLIS  D   L+EF
Sbjct: 278 CDVSIRPGWFWHASEKPKNATTLLDIYYKSVGRNCLLILNVPPNSSGLISTEDMQVLQEF 337

Query: 241 RSAIDTIFHKNIAENRYVKVSSQRGG-KEGGFGPENMLDSDHLWSYWTPREDDKEKDHWI 299
                TIF +N A N  V  S+ RGG     F P N+L  + ++SYW P E    +  W 
Sbjct: 338 TEIRQTIFSQNFAANATVTASTVRGGLGNQQFAPSNVL-QESIYSYWAPEEG---QSSWE 393

Query: 300 EIWGNDGSLRFNVIRIQEAIGLGQRIERY--EIYVD--GKSIIQGTTIGYKRLHRLDGDV 355
            ++    S  FNVI++QE I +GQR+ ++  EI VD   ++I++GTTIGYKRL +    V
Sbjct: 394 MLFDLGQSASFNVIQLQEPIQMGQRVIKFRVEILVDELWQTIVEGTTIGYKRLFQF--PV 451

Query: 356 VHARVVRIRFIKARGVPLISSIGLHFDPF 384
           V  + +++    AR  PLIS  G+  D F
Sbjct: 452 VEGQFLKLSIDGARADPLISFFGVFTDSF 480


>dbj|BAB81582.1| conserved hypothetical protein [Clostridium perfringens str. 13]
           gi|18310858|ref|NP_562792.1| hypothetical protein
           CPE1876 [Clostridium perfringens str. 13]
          Length = 750

 Score =  328 bits (842), Expect = 1e-88
 Identities = 179/391 (45%), Positives = 242/391 (61%), Gaps = 18/391 (4%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +ILTAKHHDGFCLW S YTKH V SS W+NGKGDVVKE   A     I  G+YLSPWD++
Sbjct: 108 VILTAKHHDGFCLWDSAYTKHDVASSPWKNGKGDVVKEVSEACAKYNIKFGVYLSPWDQN 167

Query: 61  DSRY--GHDLLYNEYYLAQLQELLKKYQDVREIWFDGAKDPRAQNVTYYFSDWFSMVKEL 118
              Y  G+   YNE+Y+ QL+ELL  Y  + E+W DGAK    +   Y F +WF+++KEL
Sbjct: 168 SEHYGEGNGGDYNEFYMNQLRELLTNYGPIAEVWMDGAKGSNVKQ-EYNFEEWFALIKEL 226

Query: 119 QSSINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNTGDPKGTDWLP 178
           Q    IFS  GPD+RW+G+E G AG+ CWSTI+   +     N T YLN G+  G DW+ 
Sbjct: 227 QPECLIFSPQGPDIRWIGNEKGYAGEPCWSTIDIEKMK-ERENPT-YLNNGEEGGPDWVV 284

Query: 179 AECDVSIRPGWFWHKSE--SPKKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLISENDAHR 236
            E DVSIRPGWF+H+S+    K L  ++DIY+KS+GRN VLLLNVPPN  G + END +R
Sbjct: 285 GESDVSIRPGWFYHESQDNEVKSLEKMMDIYFKSIGRNSVLLLNVPPNKEGKLHENDVNR 344

Query: 237 LKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYWTPREDDKEKD 296
           LKEF   I  +F+ ++A N+ V V      ++  +G   ++D D+  +YW P  D+  K 
Sbjct: 345 LKEFGETIKELFNDDLALNKEVIVDG-FANRDETYGANKIVDGDY-DTYWAP--DNSSKT 400

Query: 297 HWIEIWGNDGSLRFNVIRIQEAIGLGQRIERYEIYV----DGKSIIQGTTIGYKRLHRLD 352
             IEI   + S  F+VI +QE I LGQR+  + + V    +   + +G TIGYKRL R+ 
Sbjct: 401 GTIEIDLGE-SKEFDVISLQEYIPLGQRVSSFNVEVLQGENWNKVYEGKTIGYKRLVRI- 458

Query: 353 GDVVHARVVRIRFIKARGVPLISSIGLHFDP 383
                   +RI    +  VPLI+++G++  P
Sbjct: 459 -APTKGEKIRINITGSLEVPLINNVGVYKQP 488


>ref|ZP_00523748.1| Coagulation factor 5/8 type, C-terminal [Solibacter usitatus
           Ellin6076] gi|67862145|gb|EAM57168.1| Coagulation factor
           5/8 type, C-terminal [Solibacter usitatus Ellin6076]
          Length = 491

 Score =  280 bits (715), Expect = 7e-74
 Identities = 158/402 (39%), Positives = 227/402 (56%), Gaps = 31/402 (7%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +ILT KHHDGFCLWP++ T HSV  S+W+ G+GDVV+E   AA  + +  G+YLSPWDR+
Sbjct: 91  VILTCKHHDGFCLWPTRTTDHSVAQSRWRGGRGDVVREISQAAARRKMKFGVYLSPWDRN 150

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFD----------GAKDPRAQNVTYYFSD 110
            ++YG    Y + Y AQL ELL  Y  + E+W D          GAK+ R  +   Y+ D
Sbjct: 151 SAQYGTP-EYIKLYRAQLSELLTGYGPIFEVWHDGANGGDGYYGGAKEKRTIDKNTYY-D 208

Query: 111 W---FSMVKELQSSINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASN----IT 163
           W   + M++ +Q    +FSD GPDVRWVG+E G AGD CW   +      GA++     T
Sbjct: 209 WPRTWEMIRAMQPEAVVFSDVGPDVRWVGNERGVAGDPCWVAFDPVGEDGGAASPGNVRT 268

Query: 164 QYLNTGDPKGTDWLPAECDVSIRPGWFWHKSESP--KKLSDLLDIYYKSVGRNCVLLLNV 221
           +    G   G+ WLPAECDVSIRPGWFWH++E+   KK S L+++YY+SVGR   LLLNV
Sbjct: 269 KESGMGHRHGSKWLPAECDVSIRPGWFWHEAENARVKKASQLVNLYYESVGRGATLLLNV 328

Query: 222 PPNTTGLISENDAHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDH 281
           PPN  GLIS  D   LK     +   F +N+A       +  R G +  +GP  +L+   
Sbjct: 329 PPNRDGLISSEDVASLKGLGGYLSGTFAQNLAARAKTDATHVR-GNDKQYGPAQLLNGKP 387

Query: 282 LWSYWTPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERYEI----YVDGKSI 337
             ++W   +     D   ++      ++F VIR++EAI  GQR++ + +        + +
Sbjct: 388 -ETFWATDDGVTSADVTFDL---GRPVKFQVIRLREAIRFGQRVDAFTVERWQSDSWEQV 443

Query: 338 IQGTTIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGL 379
              T+IG +RL RLD  ++  R +R+R  +A   P +S   +
Sbjct: 444 ASSTSIGPRRLIRLDAPIIATR-LRLRVTQASASPALSEFAV 484


>gb|AAS19690.1| FucA [Streptococcus gordonii]
          Length = 576

 Score =  269 bits (688), Expect = 1e-70
 Identities = 155/395 (39%), Positives = 221/395 (55%), Gaps = 25/395 (6%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +++  KHHDGF L+PS+Y+ ++V +S W++GKGD++ E   +A+   +D+G+YLSPWD H
Sbjct: 81  VVVVVKHHDGFVLYPSRYSDYTVAASPWRDGKGDLLAEISRSASMYDMDLGLYLSPWDAH 140

Query: 61  DSRYGHDL--LYNEYYLAQLQELLKK-----YQDVREIWFDGAKDPRAQNVTYYFSDWFS 113
              Y  +    YNEYY  QL+E+L            EIW DGA+   AQ +TY F  WF 
Sbjct: 141 SPLYHIETQEAYNEYYQNQLEEILSNPLYGNKGKFVEIWMDGARGEGAQKLTYDFGSWFE 200

Query: 114 MVKELQSSINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNTGDPKG 173
            ++  Q    IFS    ++RW+G+E G AGD  W  I    LS   +    YL  GDP+G
Sbjct: 201 PIRHYQKDALIFSTEATELRWIGNERGRAGDPLWQKIRPEKLS--ENTPATYLCHGDPQG 258

Query: 174 TDWLPAECDVSIRPGWFWHKSESPKKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLISEND 233
           T +   E DVS+R GWF+H S+ PK L DLLDIY  SVGR   LLLNVPP   GL++E D
Sbjct: 259 TQYSLGEADVSLRSGWFYHASQHPKSLPDLLDIYMDSVGRGTPLLLNVPPTKEGLLAEED 318

Query: 234 AHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYWTPREDDK 293
             RL+EF   I  ++  N A    V  S+++ G      P + L      ++W P  D  
Sbjct: 319 VQRLQEFHQVISDLYTDNRAYQAKVSCSNEKEG-----FPSSHLTDGREDTFWAPASD-- 371

Query: 294 EKDHWIEIWGNDGSLR-FNVIRIQEAIGLGQRIERYEIYVDGKS----IIQGTTIGYKRL 348
           E  + +EI  + G  R FN++ I+E I  GQRI  + + V  ++       G TIGYKRL
Sbjct: 372 ESAYVLEI--DLGQTRSFNLVEIREPITKGQRIAGFTLEVKQENGWIVFAHGQTIGYKRL 429

Query: 349 HRLDGDVVHARVVRIRFIKARGVPLISSIGLHFDP 383
             L G +V AR +R+     + +PL++ + ++  P
Sbjct: 430 --LLGQMVEARYLRLILTDFQALPLLNKLAVYKTP 462


>ref|ZP_00403950.1| COG3669: Alpha-L-fucosidase [Streptococcus pneumoniae TIGR4]
           gi|15901959|ref|NP_346563.1| hypothetical protein SP2146
           [Streptococcus pneumoniae TIGR4]
           gi|14973659|gb|AAK76203.1| conserved hypothetical
           protein [Streptococcus pneumoniae TIGR4]
           gi|25389606|pir||B95251 conserved hypothetical protein
           SP2146 [imported] - Streptococcus pneumoniae (strain
           TIGR4)
          Length = 559

 Score =  269 bits (687), Expect = 1e-70
 Identities = 158/402 (39%), Positives = 222/402 (54%), Gaps = 40/402 (9%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +IL  KHHDGF L+P+ +T +SV  S W+ GKGD++ E   AAT+  +D+G+YLSPWD H
Sbjct: 71  LILVVKHHDGFVLYPTAHTDYSVKVSPWRRGKGDLLLEVSQAATEFDMDMGVYLSPWDAH 130

Query: 61  DSRYGHDLL--YNEYYLAQLQELLKKYQ-----DVREIWFDGAKDPRAQNVTYYFSDWFS 113
              Y  D    YN YYLAQL+E+L            E+W DGA+   AQ V Y F  WF 
Sbjct: 131 SPLYHVDREADYNAYYLAQLKEILSNPNYGNAGKFAEVWMDGARGEGAQKVNYEFEKWFE 190

Query: 114 MVKELQSSINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNTGDPKG 173
            +++LQ    IFS  G  +RW+G+E G AGD  W  +N   L   A     YL  GDP G
Sbjct: 191 TIRDLQGDCLIFSTEGTSIRWIGNERGYAGDPLWQKVNPDKLGTEAE--LNYLQHGDPSG 248

Query: 174 TDWLPAECDVSIRPGWFWHKSESPKKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLISEND 233
           T +   E DVSIRPGWF+H+ + PK L +L++IY+ SVGR   LLLN+PPN  GL    D
Sbjct: 249 TIFSIGEADVSIRPGWFYHEDQDPKSLEELVEIYFHSVGRGTPLLLNIPPNQAGLFDAKD 308

Query: 234 AHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYWTPREDDK 293
             RL EF +  + ++ +++A    V             GP   L +D    + T   D  
Sbjct: 309 IERLYEFATYRNELYKEDLALGAEVS------------GP--ALSADFACRHLT---DGL 351

Query: 294 EKDHW-------IEIWGNDGSLR-FNVIRIQEAIGLGQRIERYEIYVDGKSIIQ----GT 341
           E   W       I++  + GS + F+VI ++E + LGQRI  + + V+   + Q    G 
Sbjct: 352 ETSSWASDADLPIQLELDLGSPKTFDVIELREDLKLGQRIAAFHVQVEVDGVWQEFGSGH 411

Query: 342 TIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLHFDP 383
           T+GYKRL  L G VV A+ +R+   +++ +PL++ I L+  P
Sbjct: 412 TVGYKRL--LRGAVVEAQKIRVVITESQALPLLTKISLYKTP 451


>ref|NP_359545.1| hypothetical protein spr1954 [Streptococcus pneumoniae R6]
           gi|15459653|gb|AAL00756.1| Hypothetical protein
           [Streptococcus pneumoniae R6] gi|25509375|pir||G98115
           hypothetical protein spr1954 [imported] - Streptococcus
           pneumoniae (strain R6)
          Length = 583

 Score =  267 bits (683), Expect = 4e-70
 Identities = 153/395 (38%), Positives = 218/395 (54%), Gaps = 26/395 (6%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +IL  KHHDGF L+P+ +T +SV  S W+ GKGD++ E   AAT+  +D+G+YLSPWD H
Sbjct: 95  LILVVKHHDGFVLYPTAHTDYSVKVSPWRRGKGDLLLEVSQAATEFDMDMGVYLSPWDAH 154

Query: 61  DSRYGHDLL--YNEYYLAQLQELLKKYQ-----DVREIWFDGAKDPRAQNVTYYFSDWFS 113
              Y  D    YN YYLAQL+E+L            E+W DGA+   AQ V Y F  WF 
Sbjct: 155 SPLYHVDREADYNAYYLAQLKEILSNPNYGNAGKFAEVWMDGARGEGAQKVNYEFEKWFE 214

Query: 114 MVKELQSSINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNTGDPKG 173
            +++LQ    IFS  G  +RW+G+E G AGD  W  +N   L   A     YL  GDP G
Sbjct: 215 TIRDLQGDCLIFSTEGTSIRWIGNERGYAGDPLWQKVNLDKLGTEAE--LNYLQHGDPSG 272

Query: 174 TDWLPAECDVSIRPGWFWHKSESPKKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLISEND 233
           T +   E DVSIRPGWF+H+ + PK L +L++IY+ SVGR   LLLN+PPN  GL    D
Sbjct: 273 TIFSIGEADVSIRPGWFYHEDQDPKSLEELVEIYFHSVGRGTPLLLNIPPNQAGLFDAKD 332

Query: 234 AHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYWTPREDDK 293
             R  EF +  + ++ +++A           G +  G       D  HL           
Sbjct: 333 IERPYEFATYRNELYKEDLA----------LGAEVSGPALSADFDCRHLTDGLETSSWAS 382

Query: 294 EKDHWIEIWGNDGSLR-FNVIRIQEAIGLGQRIERYEIYVDGKSIIQ----GTTIGYKRL 348
           + D  I++  + GS + F+VI ++E + LGQRI  + + V+   + Q    G T+G+KRL
Sbjct: 383 DADLPIQLELDLGSPKTFDVIELREDLKLGQRIAAFHVQVEVDGVWQEFGRGFTVGHKRL 442

Query: 349 HRLDGDVVHARVVRIRFIKARGVPLISSIGLHFDP 383
             L G +V A+ VR+   +A+ +P+++ I L+  P
Sbjct: 443 --LRGPLVEAQKVRVMITEAQSIPVLTKISLYKTP 475


>gb|AAO79818.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482] gi|29350121|ref|NP_813624.1| hypothetical
           protein BT4713 [Bacteroides thetaiotaomicron VPI-5482]
          Length = 580

 Score =  256 bits (655), Expect = 6e-67
 Identities = 156/413 (37%), Positives = 216/413 (51%), Gaps = 32/413 (7%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +ILTAKHHDGFCLWP++ T++ + ++ +++GKGD+V+E  +A    GI   +YLSPWDRH
Sbjct: 48  VILTAKHHDGFCLWPTQLTEYCIRNTPYKDGKGDIVRELSDACKKYGIKFAVYLSPWDRH 107

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFD----------GAKDPRA-QNVTYY-F 108
            + YG    Y +Y+  QL ELL  Y DV EIWFD          GAKD R     TYY +
Sbjct: 108 QANYGTP-EYVDYFYKQLHELLTNYGDVFEIWFDGANGGDGWYGGAKDARTIDRKTYYDY 166

Query: 109 SDWFSMVKELQSSINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNT 168
              + M+ ELQ    IFSD GP  RWVG+E G AG T WS +    +  G     + L  
Sbjct: 167 PRAYKMIDELQPQAVIFSDGGPGCRWVGNENGFAGATNWSFLRAGEVYPGYPKYRE-LQY 225

Query: 169 GDPKGTDWLPAECDVSIRPGWFWHKSESP--KKLSDLLDIYYKSVGRNCVLLLNVPPNTT 226
           G   G  W+ AECDVSIRPGWF+H  E    K +  L D+YY+SVG N  LLLN P +  
Sbjct: 226 GHADGNQWVAAECDVSIRPGWFYHPEEDDKVKTVDQLTDLYYRSVGHNATLLLNFPVDRN 285

Query: 227 GLISENDAHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYW 286
           GLI   D+     F   +      N+  +  V    +RGG+   F    + D  +  +YW
Sbjct: 286 GLIHPTDSLNAVSFHQRVQKELADNLLSSAKVSAFDERGGQ---FKVRAVTDGKY-DTYW 341

Query: 287 TPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERYEI-YVDG------KSIIQ 339
              +     D            + N + IQE + LGQR++ + + Y +G      K   +
Sbjct: 342 ATNDGVTTADLTFTF---SQPTKMNRVMIQEYVPLGQRVKSFVVEYKEGDQWLSVKCNEE 398

Query: 340 GTTIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLHFDPFWHSRFTAA 392
            TT+GYKRL R   +++    +RIRF  AR    I+ +G ++ P     +T A
Sbjct: 399 TTTVGYKRLLRF--EMIETEELRIRFTDARACLCINEVGAYYAPDATENYTPA 449


>gb|AAO79241.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482] gi|29349544|ref|NP_813047.1| hypothetical
           protein BT4136 [Bacteroides thetaiotaomicron VPI-5482]
          Length = 605

 Score =  254 bits (649), Expect = 3e-66
 Identities = 154/403 (38%), Positives = 217/403 (53%), Gaps = 57/403 (14%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +ILTAKHHDGFCLWP+  TKHSV SS W+NG+GDVVKE   A    G+  G+YLSPWDR+
Sbjct: 111 VILTAKHHDGFCLWPTATTKHSVASSSWKNGQGDVVKELRKACKKYGMRFGLYLSPWDRN 170

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGA--KDPRAQNVTYYFSDWFSMVKEL 118
              YG    YN++++ QL ELL  Y +V E+WFDGA  + P  +   Y +   +  +  L
Sbjct: 171 AECYGDSPRYNKFFIRQLTELLTNYGEVHEVWFDGANGEGPNGKKQVYDWDTVYETIHRL 230

Query: 119 QSSINIFSDAGPDVRWVGDETGTAGDTCWST----------INRTSLSIGASNITQYLNT 168
           Q    + +  G D+RWVG+E+G   +T WST           ++ +  +G +  +  L +
Sbjct: 231 QPKA-VMAIMGDDIRWVGNESGLGRETEWSTTVLTPEIYARADKNNKKLGINGQSNDLGS 289

Query: 169 GD--PKGTD--WLPAECDVSIRPGWFWHKSE--SPKKLSDLLDIYYKSVGRNCVLLLNVP 222
                K T+  W P+E DVSIRPGWF+HK E    K L  L DIY++SVG N VLLLN+P
Sbjct: 290 RKMLEKATELFWYPSEVDVSIRPGWFYHKEEDNKVKSLKHLADIYFQSVGYNSVLLLNIP 349

Query: 223 PNTTGLISENDAHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHL 282
           P+  GLI E D  RLKEF +     + K +  + YVK    +G K               
Sbjct: 350 PDHRGLIHEADVQRLKEFAA-----YRKQVFADNYVK----KGKK--------------- 385

Query: 283 WSYWTPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERYEIYV----DGKSII 338
             YW     ++      +I+        NV+ +QE I  GQRIE + +        K + 
Sbjct: 386 --YWNTASGNE------KIYQLKAGSEINVVMLQEDITKGQRIEAFTVEALTDNSWKEVA 437

Query: 339 QGTTIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLHF 381
           +GTT+GYKRL R     + A  +RI+ +++R    IS +  ++
Sbjct: 438 KGTTVGYKRLLRF--PTIKADQLRIKILESRLNANISQVAAYY 478


>emb|CAH08778.1| putative exported fucosidase [Bacteroides fragilis NCTC 9343]
           gi|60682552|ref|YP_212696.1| putative exported
           fucosidase [Bacteroides fragilis NCTC 9343]
          Length = 605

 Score =  253 bits (646), Expect = 7e-66
 Identities = 160/402 (39%), Positives = 216/402 (52%), Gaps = 57/402 (14%)

Query: 2   ILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRHD 61
           ILTAKHHDGFCLWP+K T HSV +S W++GKGDVV+E  +A    GI  G+YLSPWDR+ 
Sbjct: 112 ILTAKHHDGFCLWPTKTTGHSVAASPWKDGKGDVVRELRDACDKYGIKFGVYLSPWDRNA 171

Query: 62  SRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGA--KDPRAQNVTYYFSDWFSMVKELQ 119
           S YG    YNE+++ QL ELL  Y +V E+WFDGA  + P  +   Y ++   S ++ LQ
Sbjct: 172 SCYGDSPKYNEFFIEQLTELLTNYGEVHEVWFDGANGEGPNGKKQEYDWTAILSTIRRLQ 231

Query: 120 SSINIFSDAGPDVRWVGDETGTAGDTCWSTINRT----------SLSIGASNITQYLNTG 169
               + +  G DVRWVG+E G   +T WS    T          + ++G    ++ L   
Sbjct: 232 PRA-VTAIMGDDVRWVGNERGLGRETEWSATVLTPGTYARCEEQNKALGVKATSKDLGGR 290

Query: 170 D----PKGTDWLPAECDVSIRPGWFWHKSE--SPKKLSDLLDIYYKSVGRNCVLLLNVPP 223
           D     K   W P+E DVSIRPGWF+H+ E    K L  L DIY+KSVG N VLLLN+PP
Sbjct: 291 DMLVNAKELFWYPSEVDVSIRPGWFYHQQEDNQVKSLKHLTDIYFKSVGYNSVLLLNIPP 350

Query: 224 NTTGLISENDAHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLW 283
           +  G IS+ D +RLKEF      IF    A+NR       +GG +               
Sbjct: 351 DQRGRISDADVNRLKEFADYRKEIF----ADNRV------KGGLKA-------------- 386

Query: 284 SYWTPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERYEI---YVDG-KSIIQ 339
             WT R  D        ++        NV+ ++E I  GQR+E + +     DG K I +
Sbjct: 387 --WTARPGD------TRVYQLKPKSEINVVMLREDISKGQRMEAFTVEALTADGWKEIAK 438

Query: 340 GTTIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLHF 381
           GTT+GYKRL R+    V AR +R++    R    IS +  ++
Sbjct: 439 GTTVGYKRLIRI--PAVEARQLRVKVDACRLAANISEVAAYY 478


>ref|YP_100522.1| hypothetical protein BF3243 [Bacteroides fragilis YCH46]
           gi|52217395|dbj|BAD49988.1| conserved hypothetical
           protein [Bacteroides fragilis YCH46]
          Length = 605

 Score =  251 bits (642), Expect = 2e-65
 Identities = 159/402 (39%), Positives = 215/402 (52%), Gaps = 57/402 (14%)

Query: 2   ILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRHD 61
           ILTAKHHDGFCLWP+K T HSV +S W++GKGDVV+E  +A    GI  G+YLSPWDR+ 
Sbjct: 112 ILTAKHHDGFCLWPTKTTGHSVAASPWKDGKGDVVRELRDACDKYGIKFGVYLSPWDRNA 171

Query: 62  SRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGA--KDPRAQNVTYYFSDWFSMVKELQ 119
           S YG    YNE+++ QL ELL  Y +V E+WFDGA  + P  +   Y ++   S ++ LQ
Sbjct: 172 SCYGDSPKYNEFFIEQLTELLTNYGEVHEVWFDGANGEGPNGKKQEYDWTAILSTIRRLQ 231

Query: 120 SSINIFSDAGPDVRWVGDETGTAGDTCWSTINRT----------SLSIGASNITQYLNTG 169
               + +  G DVRWVG+E G   +T WS    T          + ++G    ++ L   
Sbjct: 232 PRA-VTAIMGDDVRWVGNERGLGRETEWSATVLTPGTYARCEEQNKALGVKATSKDLGGR 290

Query: 170 D----PKGTDWLPAECDVSIRPGWFWHKSE--SPKKLSDLLDIYYKSVGRNCVLLLNVPP 223
           D     K   W P+E DVSIRPGWF+H+ E    K L  L DIY+KSVG N VLLLN+PP
Sbjct: 291 DMLVNAKELFWYPSEVDVSIRPGWFYHQQEDNQVKSLKHLTDIYFKSVGYNSVLLLNIPP 350

Query: 224 NTTGLISENDAHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLW 283
           +  G IS+ D +RLKEF      IF     +NR       +GG +               
Sbjct: 351 DQRGRISDADVNRLKEFADYRKEIF----TDNRV------KGGLKA-------------- 386

Query: 284 SYWTPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERYEI---YVDG-KSIIQ 339
             WT R  D        ++        NV+ ++E I  GQR+E + +     DG K I +
Sbjct: 387 --WTARPGD------TRVYQLKPKSEINVVMLREDISKGQRMEAFTVEALTADGWKEIAK 438

Query: 340 GTTIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLHF 381
           GTT+GYKRL R+    V AR +R++    R    IS +  ++
Sbjct: 439 GTTVGYKRLIRI--PAVEARQLRVKVDACRLAANISEVAAYY 478


>gb|AAO77299.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482] gi|29347602|ref|NP_811105.1| hypothetical
           protein BT2192 [Bacteroides thetaiotaomicron VPI-5482]
          Length = 484

 Score =  251 bits (640), Expect = 4e-65
 Identities = 159/398 (39%), Positives = 219/398 (54%), Gaps = 34/398 (8%)

Query: 2   ILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRHD 61
           ILTAKH DGFCLWPSKYT +SV ++ W+NGKGDVV+EFV+A  + G+  GIYL P DRH+
Sbjct: 94  ILTAKHADGFCLWPSKYTDYSVKNAAWKNGKGDVVREFVDACEEYGLKAGIYLGPHDRHE 153

Query: 62  --SRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGAKDPRAQNVTYYFSDWFSMVKELQ 119
             S       Y EYY  QL EL+  Y  + E W+DGA     +  T  +  W+ +V+E Q
Sbjct: 154 HLSPLYTTERYKEYYAHQLGELMSDYGKIWETWWDGA--GADELTTPVYRHWYKIVREKQ 211

Query: 120 SSINIFSDAG----PDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQY---LNTGDPK 172
               IF         DVRW+G+E G AGD CW+T +    S+   +  QY   LN G   
Sbjct: 212 PDCVIFGTKNSYPFADVRWMGNEAGEAGDPCWATTD----SVAIRDEAQYYKGLNEGMLD 267

Query: 173 GTDWLPAECDVSIRPGWFWHKSESP--KKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLIS 230
           G  ++PAE DVSIRP WF+H  E    K + +L DIY  SVGRN VLLLN PP+  GLI 
Sbjct: 268 GDAYIPAETDVSIRPSWFYHAEEDSRVKSVRELWDIYCTSVGRNSVLLLNFPPDRRGLIH 327

Query: 231 ENDAHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYWTPRE 290
             D+      +  ID  F  N+     VK ++ RG K   + PE MLD++   +Y+  ++
Sbjct: 328 STDSLHAALLKQGIDETFSTNLLRGAKVKATNVRGAK---YSPEKMLDNEKN-TYFAGKD 383

Query: 291 DDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERY--EIYVDGKSII------QGTT 342
            + + D    I+    ++ F+ + I+E I LG R  ++  E  VDGK+ I          
Sbjct: 384 GEVKAD---IIFTLPKTIEFDCLMIEEVIELGHRTTKWSVEYTVDGKNWITIPEATDKQA 440

Query: 343 IGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLH 380
           IG+K + RL    V A+ VR+R    +  P I + G++
Sbjct: 441 IGHKWIVRL--APVKAKQVRLRIQDGKACPAIHTFGVY 476


>emb|CAH06409.1| conserved hypothetical exported protein [Bacteroides fragilis NCTC
           9343] gi|60680223|ref|YP_210367.1| hypothetical protein
           BF0664 [Bacteroides fragilis NCTC 9343]
           gi|53712028|ref|YP_098020.1| hypothetical protein BF0735
           [Bacteroides fragilis YCH46] gi|52214893|dbj|BAD47486.1|
           conserved hypothetical protein [Bacteroides fragilis
           YCH46]
          Length = 483

 Score =  246 bits (628), Expect = 9e-64
 Identities = 156/395 (39%), Positives = 217/395 (54%), Gaps = 28/395 (7%)

Query: 2   ILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRHD 61
           I+TAKH DGFCLWPSKYT + V +S W+NGKGDVV+EFV+A  + GI  GIYL P DRH+
Sbjct: 94  IITAKHADGFCLWPSKYTDYGVKNSAWKNGKGDVVREFVDACEEYGIKAGIYLGPHDRHE 153

Query: 62  --SRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGAKDPRAQNVTYYFSDWFSMVKELQ 119
             S       Y +YY  QL+EL+  Y  V E W+DGA     +  T  ++ W+ +V+E Q
Sbjct: 154 HLSPLYTTEKYKQYYGHQLEELMGDYGKVWETWWDGA--GADELTTPVYTHWYKIVREKQ 211

Query: 120 SSINIFSDAG----PDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNTGDPKGTD 175
               IF         DVRW+G+E+G AGD CWST +   +     N  + LN G   G  
Sbjct: 212 PDCVIFGTKNSYPFADVRWMGNESGKAGDPCWSTTDSVCVRDEWKNY-EGLNEGVKGGDA 270

Query: 176 WLPAECDVSIRPGWFWHKSESP--KKLSDLLDIYYKSVGRNCVLLLNVPPNTTGLISEND 233
           ++PAE DVSIRP WF+H  E    K + +L DIY  SVG N VLLLN PP+  GLI   D
Sbjct: 271 YIPAETDVSIRPSWFYHAEEDSRVKSVKELWDIYCTSVGHNSVLLLNFPPDRRGLIHPTD 330

Query: 234 AHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLWSYWTPREDDK 293
           +      +  +D  F  N+     VK ++ RGGK   F PE + D++   +Y+  R+  K
Sbjct: 331 SLHAALLKQGLDETFGNNLLAKAKVKATNGRGGK---FRPEFLTDNNK-ETYFAGRDGAK 386

Query: 294 EKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERYEIYV--DGKS---IIQGT---TIGY 345
             D    ++       F+ + IQE I LG R  ++ +    DG++   I + T   T+GY
Sbjct: 387 TSD---IVFTLPRQTEFDCLMIQEVIELGHRTTKWSVEYSNDGRNWTPIPEATDKQTVGY 443

Query: 346 KRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLH 380
           K + R   + V A+ VR+R +     P I + G++
Sbjct: 444 KWIVRF--EPVKAKQVRLRILDGFACPAIHTFGVY 476


>gb|AAO76732.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
           VPI-5482] gi|29347035|ref|NP_810538.1| hypothetical
           protein BT1625 [Bacteroides thetaiotaomicron VPI-5482]
          Length = 605

 Score =  244 bits (622), Expect = 4e-63
 Identities = 152/406 (37%), Positives = 215/406 (52%), Gaps = 58/406 (14%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           ++LTAKHHDGFCLWP+  TKHSV SS W+NG+GDVVKE  NA     +  G+YLSPWDR+
Sbjct: 111 VLLTAKHHDGFCLWPTATTKHSVASSPWKNGQGDVVKELRNACDKYDMKFGVYLSPWDRN 170

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGA--KDPRAQNVTYYFSDWFSMVKEL 118
              YG    YNE+++ QL ELL  Y +V E+WFDGA  + P  +   Y +  ++  +++L
Sbjct: 171 AECYGDSPKYNEFFIRQLTELLTNYGEVHEVWFDGANGEGPNGKKQIYDWDAFYKTIQQL 230

Query: 119 QSSINIFSDAGPDVRWVGDETGTAGDTCWSTI----------NRTSLSIGASNITQYLNT 168
           Q    + +  G DVRWVG+E G   +T WS               +  +G  +  + L +
Sbjct: 231 QPKA-VMAIMGDDVRWVGNEKGLGRETEWSATVLTPGIYARSEENNKRLGVFSKAEDLGS 289

Query: 169 GD--PKGTD--WLPAECDVSIRPGWFWHKSESP--KKLSDLLDIYYKSVGRNCVLLLNVP 222
                K T+  W P+E DVSIRPGWF+H  E    K L  L DIY++SVG N VLLLN+P
Sbjct: 290 RAMLEKATELFWYPSEVDVSIRPGWFYHAEEDSKVKSLKHLADIYFQSVGYNSVLLLNIP 349

Query: 223 PNTTGLISENDAHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHL 282
           P+  GLI+E D  RL EF +  + IF  N  E         +G K+     E +  S+ +
Sbjct: 350 PDRRGLINEADVQRLNEFAAYREKIFTNNRVE---------KGRKDW----EAVSGSETV 396

Query: 283 WSYWTPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERYEIYV----DGKSII 338
           +S     E                    NV+ +QE I  GQR+E + +        + + 
Sbjct: 397 YSLKPESE-------------------INVVMLQEDITKGQRVESFTVEALTEQGWQEVA 437

Query: 339 QGTTIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLHF-DP 383
           +GTT+GYKR+ R     V A  +R++  + R    IS +  ++ DP
Sbjct: 438 KGTTVGYKRMVRF--PAVKATQLRVKINECRLTAHISQVAAYYADP 481


>gb|AAQ66752.1| alpha-1,3/4-fucosidase, putative [Porphyromonas gingivalis W83]
           gi|34541374|ref|NP_905853.1| alpha-1,3/4-fucosidase,
           putative [Porphyromonas gingivalis W83]
          Length = 606

 Score =  243 bits (621), Expect = 6e-63
 Identities = 151/393 (38%), Positives = 206/393 (51%), Gaps = 61/393 (15%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +ILTAKHHDGFCLWP+  T+HSV SS W+ GKGDVVKE   A  + G+  GIYLSPWDR+
Sbjct: 113 VILTAKHHDGFCLWPTATTRHSVASSPWREGKGDVVKEVRAACEEYGMKFGIYLSPWDRN 172

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFDGA--KDPRAQNVTYYFSDWFSMVKEL 118
              YG    YN ++++QL ELL  Y +V E+WFDGA  + P  +   Y +  ++  ++ L
Sbjct: 173 AECYGDSHRYNRFFVSQLTELLTHYGEVHEVWFDGANGEGPNGKRQEYDWETFYDTIRRL 232

Query: 119 QSSINIFSDAGPDVRWVGDETGTAGDTCWSTI--------------NRTSLSIGASNITQ 164
           Q    + +  G DVRWVG+E G    T WS                NR  +   + ++  
Sbjct: 233 QPQA-VMAIMGDDVRWVGNERGLGRTTEWSATVLTPGIYSRSKTERNRLGIRENSPDLGS 291

Query: 165 YLNTGDPKGTDWLPAECDVSIRPGWFWHKSESP--KKLSDLLDIYYKSVGRNCVLLLNVP 222
                +     W P+E DVSIRPGWF+H +E    K L  L+DIY++SVG N VLLLNVP
Sbjct: 292 REVLKEAGEIFWYPSEVDVSIRPGWFYHAAEDKKVKSLDKLVDIYFQSVGYNSVLLLNVP 351

Query: 223 PNTTGLISENDAHRLKEFRSAIDTIF-HKNIAEN-RYVKVSSQRGGKEGGFGPENMLDSD 280
           P+  GLI E DA RLKE+   +   F H  + ++ RY  V                    
Sbjct: 352 PDRRGLIHEADALRLKEWADYLGRAFAHDRVVDSARYAVV-------------------- 391

Query: 281 HLWSYWTPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERY--EIYVDG--KS 336
                         + H +E +  +     N++ +QE I  GQR E +  E +VDG  + 
Sbjct: 392 --------------QGHAVEEYALESKTHINILMLQEDITKGQRTEAFTVEAWVDGAWQE 437

Query: 337 IIQGTTIGYKRLHRLDGDVVHARVVRIRFIKAR 369
           I +GTTIGYKRL R+    V    +R+R  + R
Sbjct: 438 IGRGTTIGYKRLLRI--PAVETDRIRVRIEQCR 468


>emb|CAH08896.1| putative lipoprotein [Bacteroides fragilis NCTC 9343]
           gi|60682670|ref|YP_212814.1| putative lipoprotein
           [Bacteroides fragilis NCTC 9343]
          Length = 626

 Score =  243 bits (621), Expect = 6e-63
 Identities = 152/403 (37%), Positives = 211/403 (51%), Gaps = 34/403 (8%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +ILTAKHHDGFCLWP++ T++ + ++ +++GKGD+V E   A    GI   +YLSPWDRH
Sbjct: 93  VILTAKHHDGFCLWPTQLTEYCIRNTPYKDGKGDIVGELAAACKKYGIKFAVYLSPWDRH 152

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFD----------GAKDPRA-QNVTYY-F 108
            + YG    Y +Y+  QL EL+  Y +V E+WFD          GAKD R     TYY +
Sbjct: 153 QANYGTP-EYVDYFHKQLTELMTNYGEVFEVWFDGANGGDGWYGGAKDSRTIDRKTYYNY 211

Query: 109 SDWFSMVKELQSSINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNT 168
              + ++ +LQ    +FSD GP  RWVG+E G AG T WS +    +  G     + L  
Sbjct: 212 PRIYEILDKLQPQAIVFSDGGPGCRWVGNENGFAGATNWSFLRAGEVYPGYPKYRE-LQY 270

Query: 169 GDPKGTDWLPAECDVSIRPGWFWHKSESP--KKLSDLLDIYYKSVGRNCVLLLNVPPNTT 226
           G   G  W+PAECDVSIRPGWF+H  E    K +  L D+YY+SVG N  LLLN P +  
Sbjct: 271 GHADGNQWVPAECDVSIRPGWFYHPEEDDRVKTVEQLTDLYYRSVGHNATLLLNFPVDRD 330

Query: 227 GLISENDAHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLW-SY 285
           GLI   D+     F   +      N+      K S +RGG+           +D  W +Y
Sbjct: 331 GLIHPIDSANAVNFHKNVQKQLAHNLLAGIRPKASDERGGQFSA-----KAATDESWDTY 385

Query: 286 WTPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERY--EIYVDGKSI-----I 338
           W   +     D   +    +   + N + IQE I LGQR++ +  E   DGK +      
Sbjct: 386 WATNDGVTAADIEFDFPKTE---KVNRMMIQEYIPLGQRVKSFIVEYDKDGKWLPVKLNE 442

Query: 339 QGTTIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLHF 381
           + TT+GYKRL R   + V    +RIRF  AR    I++I  ++
Sbjct: 443 ETTTVGYKRLLRF--ETVSTDKLRIRFTDARACLCINNIEAYY 483


>ref|YP_100649.1| hypothetical protein BF3371 [Bacteroides fragilis YCH46]
           gi|52217522|dbj|BAD50115.1| conserved hypothetical
           protein [Bacteroides fragilis YCH46]
          Length = 626

 Score =  243 bits (621), Expect = 6e-63
 Identities = 152/403 (37%), Positives = 211/403 (51%), Gaps = 34/403 (8%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGKGDVVKEFVNAATDKGIDVGIYLSPWDRH 60
           +ILTAKHHDGFCLWP++ T++ + ++ +++GKGD+V E   A    GI   +YLSPWDRH
Sbjct: 93  VILTAKHHDGFCLWPTQLTEYCIRNTPYKDGKGDIVGELAAACKKYGIKFAVYLSPWDRH 152

Query: 61  DSRYGHDLLYNEYYLAQLQELLKKYQDVREIWFD----------GAKDPRA-QNVTYY-F 108
            + YG    Y +Y+  QL EL+  Y +V E+WFD          GAKD R     TYY +
Sbjct: 153 QANYGTP-EYVDYFHKQLTELMTNYGEVFEVWFDGANGGDGWYGGAKDSRTIDRKTYYNY 211

Query: 109 SDWFSMVKELQSSINIFSDAGPDVRWVGDETGTAGDTCWSTINRTSLSIGASNITQYLNT 168
              + ++ +LQ    +FSD GP  RWVG+E G AG T WS +    +  G     + L  
Sbjct: 212 PRIYEILDKLQPQAIVFSDGGPGCRWVGNENGFAGATNWSFLRAGEVYPGYPKYRE-LQY 270

Query: 169 GDPKGTDWLPAECDVSIRPGWFWHKSESP--KKLSDLLDIYYKSVGRNCVLLLNVPPNTT 226
           G   G  W+PAECDVSIRPGWF+H  E    K +  L D+YY+SVG N  LLLN P +  
Sbjct: 271 GHADGNQWVPAECDVSIRPGWFYHPEEDDRVKTVEQLTDLYYRSVGHNATLLLNFPVDRD 330

Query: 227 GLISENDAHRLKEFRSAIDTIFHKNIAENRYVKVSSQRGGKEGGFGPENMLDSDHLW-SY 285
           GLI   D+     F   +      N+      K S +RGG+           +D  W +Y
Sbjct: 331 GLIHPIDSANAVNFHKNVQKQLAHNLLAGIRPKASDERGGQFSA-----KAATDESWDTY 385

Query: 286 WTPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERY--EIYVDGKSI-----I 338
           W   +     D   +    +   + N + IQE I LGQR++ +  E   DGK +      
Sbjct: 386 WATNDGVTAADIEFDFPKTE---KVNRMMIQEYIPLGQRVKSFIVEYDKDGKWLPVKLNE 442

Query: 339 QGTTIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSIGLHF 381
           + TT+GYKRL R   + V    +RIRF  AR    I++I  ++
Sbjct: 443 ETTTVGYKRLLRF--ETVSTDKLRIRFTDARACLCINNIEAYY 483


>gb|EAA55123.1| hypothetical protein MG06780.4 [Magnaporthe grisea 70-15]
           gi|39977791|ref|XP_370283.1| hypothetical protein
           MG06780.4 [Magnaporthe grisea 70-15]
          Length = 519

 Score =  229 bits (584), Expect = 1e-58
 Identities = 149/413 (36%), Positives = 208/413 (50%), Gaps = 44/413 (10%)

Query: 1   MILTAKHHDGFCLWPSKYTKHSVISSKWQNGK------GDVVKEFVNAATDKGIDVGIYL 54
           MILTAKHHDG  LW +  T + + + KW   +       DVV+    +A   G+  G+YL
Sbjct: 107 MILTAKHHDGMALWNTSSTTYKIANGKWAKDREAERLDADVVRMAATSAKKHGLKFGVYL 166

Query: 55  SPWDRH-DSRYGHDLL------------------YNEYYLAQLQELL-KKYQD-----VR 89
           SPWD H D       L                  YNE Y+ QL EL+  K +D     + 
Sbjct: 167 SPWDIHRDPAMPKPTLAGTIFDEAQIFGDGSPGDYNELYVQQLTELVDMKLEDGSNVELF 226

Query: 90  EIWFDGAKDPRAQNVTYYFSDWFSMVKELQSSINIFSDAGPDVRWVGDETGTAGDTCWST 149
           E+W DGA    A   T+ +S +  +++  Q    ++   GPD RWVG+E G +  T W T
Sbjct: 227 EVWLDGASG-SATVQTFDWSRYREVIRTHQPGAVMWGHQGPDARWVGNEDGYSVQTNWHT 285

Query: 150 INRTSLSIGASNITQYLNTGDPKGTDWLPAECDVSIRPGWFWHKSESPKKLSDLLDIYYK 209
           I+RT          + L TG   G  W PAE D  +R GWFWH  E PK   DL+D+Y  
Sbjct: 286 ISRTQDQERYGE--RELETGVRDGLYWTPAEADARMRAGWFWHAEEKPKTAKDLMDMYMG 343

Query: 210 SVGRNCVLLLNVPPNTTGLISENDAHRLKEFRSAIDTIFHKNIAE-NRYVKVSSQRGGKE 268
           SVGR+  LLLNV P+ TG I + D   L EF+   D  F + +      V  SS R G  
Sbjct: 344 SVGRSVNLLLNVGPDNTGRIPQVDVDALMEFKELRDGFFERKLLRPGLGVSASSVRAGDA 403

Query: 269 GGFGPENMLDSDHLWSYWTPREDDKEKDHWIEIWGNDGSLRFNVIRIQEAIGLGQRIERY 328
             FGPEN+LD D   +YWT  +D       +++    G++    + IQE I LGQR+  Y
Sbjct: 404 MQFGPENVLD-DRQDTYWTMEDDQTTGSLEVDV---GGTITIEAVAIQEHIALGQRVGGY 459

Query: 329 --EIYVDG--KSIIQGTTIGYKRLHRLDGDVVHARVVRIRFIKARGVPLISSI 377
             +++ D   K ++ GT++GY R+ RL+  +   R +R+R  +A  VP+I  I
Sbjct: 460 AFDVFSDDAWKEVVSGTSVGYGRIDRLNTTMTGTR-LRLRVTQANAVPMIQGI 511


  Database: nr
    Posted date:  Jul 5, 2005 12:34 AM
  Number of letters in database: 863,360,394
  Number of sequences in database:  2,540,612
  
Lambda     K      H
   0.319    0.138    0.442 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 739,812,355
Number of Sequences: 2540612
Number of extensions: 33050253
Number of successful extensions: 68298
Number of sequences better than 10.0: 124
Number of HSP's better than 10.0 without gapping: 60
Number of HSP's successfully gapped in prelim test: 64
Number of HSP's that attempted gapping in prelim test: 67984
Number of HSP's gapped (non-prelim): 193
length of query: 392
length of database: 863,360,394
effective HSP length: 130
effective length of query: 262
effective length of database: 533,080,834
effective search space: 139667178508
effective search space used: 139667178508
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)


Medicago: description of AC149581.7