BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (341 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_A7FH41 Beta-hexosaminidase n=129 Tax=Gammaproteobacteri... 538 e-151 UniRef50_Q4QLU8 Beta-hexosaminidase n=138 Tax=Gammaproteobacteri... 367 e-100 UniRef50_A1SW90 Beta-hexosaminidase n=3 Tax=Alteromonadales RepI... 352 1e-95 UniRef50_Q7PC47 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase ... 327 3e-88 UniRef50_Q0HJG7 Beta-hexosaminidase n=35 Tax=Gammaproteobacteria... 314 3e-84 UniRef50_Q0VQT2 Beta-hexosaminidase n=3 Tax=Gammaproteobacteria ... 311 2e-83 UniRef50_B4RS37 Beta-hexosaminidase n=2 Tax=Alteromonas macleodi... 304 4e-81 UniRef50_UPI0000E11708 beta-hexosaminidase n=1 Tax=Rhodobacteral... 302 9e-81 UniRef50_A4C9R3 Beta-hexosaminidase n=1 Tax=Pseudoalteromonas tu... 292 1e-77 UniRef50_Q3SKU2 Beta-hexosaminidase n=27 Tax=Proteobacteria RepI... 290 4e-77 UniRef50_Q1QVU0 Glycoside hydrolase, family 3-like protein n=1 T... 288 2e-76 UniRef50_C5BIH5 Glycoside hydrolase family 3 domain protein n=1 ... 282 1e-74 UniRef50_Q31G32 Beta-hexosaminidase n=1 Tax=Thiomicrospira cruno... 275 1e-72 UniRef50_A1AVY9 Beta-N-acetylhexosaminidase n=2 Tax=Gammaproteob... 272 1e-71 UniRef50_B6J7B4 Anhydromuramoyl-peptide exo-beta-N-acetylglucosa... 269 1e-70 UniRef50_C4ZK55 Glycoside hydrolase family 3 domain protein n=6 ... 266 6e-70 UniRef50_UPI0000E87F6B beta-hexosaminidase n=1 Tax=Methylophilal... 265 1e-69 UniRef50_Q4FV07 Glycoside hydrolase family protein n=32 Tax=Pseu... 264 4e-69 UniRef50_Q2SIQ7 Beta-glucosidase-related Glycosidase n=1 Tax=Hah... 263 8e-69 UniRef50_Q5P081 Beta-hexosaminidase n=63 Tax=Proteobacteria RepI... 262 2e-68 UniRef50_B6BUL6 Beta-hexosaminidase n=1 Tax=beta proteobacterium... 261 4e-68 UniRef50_A9M1Z4 Beta-hexosaminidase n=23 Tax=Neisseriaceae RepID... 258 2e-67 UniRef50_C1D8P0 NagZ n=4 Tax=Proteobacteria RepID=C1D8P0_LARHH 258 3e-67 UniRef50_A4BUN0 Glycoside hydrolase, family 3-like protein n=1 T... 257 4e-67 UniRef50_Q2BGB3 Beta-hexosaminidase n=1 Tax=Neptuniibacter caesa... 252 2e-65 UniRef50_Q1N543 Glycoside hydrolase, family 3-like protein n=1 T... 251 3e-65 UniRef50_B3T2B0 Putative glycosyl hydrolase family 3 N terminal ... 249 1e-64 UniRef50_B2FPW9 Beta-hexosaminidase n=18 Tax=Xanthomonadaceae Re... 246 9e-64 UniRef50_D0L0K9 Glycoside hydrolase family 3 domain protein n=1 ... 240 7e-62 UniRef50_C6N603 Beta-hexosaminidase n=2 Tax=Legionella RepID=C6N... 238 3e-61 UniRef50_A6GP56 Beta-hexosaminidase n=1 Tax=Limnobacter sp. MED1... 234 3e-60 UniRef50_C5T2K1 Beta-N-acetylhexosaminidase n=1 Tax=Acidovorax d... 233 7e-60 UniRef50_B5EPA1 Glycoside hydrolase family 3 domain protein n=2 ... 231 4e-59 UniRef50_C7HX27 Beta-N-acetylhexosaminidase n=1 Tax=Thiomonas in... 219 8e-56 UniRef50_C8N6C8 Beta-hexosaminidase n=1 Tax=Cardiobacterium homi... 216 6e-55 UniRef50_A5EWZ9 Beta-hexosaminidase n=1 Tax=Dichelobacter nodosu... 195 2e-48 UniRef50_A0L847 Beta-N-acetylhexosaminidase n=1 Tax=Magnetococcu... 176 1e-42 UniRef50_Q07N54 Beta-N-acetylhexosaminidase n=17 Tax=Alphaproteo... 170 9e-41 UniRef50_B3TAA3 Putative glycosyl hydrolase family 3 N terminal ... 168 2e-40 UniRef50_C6HUU9 Beta-N-acetylhexosaminidase n=1 Tax=Leptospirill... 166 8e-40 UniRef50_B8FZS5 Glycoside hydrolase family 3 domain protein n=2 ... 165 2e-39 UniRef50_B0UI83 Beta-N-acetylhexosaminidase n=3 Tax=Alphaproteob... 164 3e-39 UniRef50_Q0EWG5 Glycoside hydrolase, family 3-like protein n=1 T... 160 5e-38 UniRef50_D2MLA9 Beta-N-acetylglucosaminidase n=1 Tax=Candidatus ... 160 6e-38 UniRef50_C7IJM6 Glycoside hydrolase family 3 domain protein n=1 ... 159 9e-38 UniRef50_B6ISL4 Beta-glucosidase-like protein n=13 Tax=Alphaprot... 159 2e-37 UniRef50_B5Y6T8 Beta-N-Acetylglucosaminidase n=2 Tax=Firmicutes ... 158 2e-37 UniRef50_A8F7W1 Glycoside hydrolase family 3 domain protein n=1 ... 158 3e-37 UniRef50_C5CGK3 Glycoside hydrolase family 3 domain protein n=1 ... 155 2e-36 UniRef50_C1XLF1 Beta-glucosidase-like glycosyl hydrolase n=2 Tax... 154 6e-36 UniRef50_Q1GI78 Beta-N-acetylhexosaminidase n=16 Tax=Rhodobacter... 152 1e-35 UniRef50_Q10Y93 Beta-N-acetylhexosaminidase n=1 Tax=Trichodesmiu... 151 3e-35 UniRef50_Q8ETJ0 Beta-hexosaminidase n=1 Tax=Oceanobacillus iheye... 151 3e-35 UniRef50_C9D3C0 Beta-N-acetylhexosaminidase n=1 Tax=Silicibacter... 151 4e-35 UniRef50_A9BFQ0 Glycoside hydrolase family 3 domain protein n=1 ... 150 5e-35 UniRef50_Q0FTI5 Beta-N-acetylhexosaminidase, putative n=5 Tax=Rh... 150 6e-35 UniRef50_B9Y750 Putative uncharacterized protein n=1 Tax=Holdema... 150 8e-35 UniRef50_Q1CXY4 Glycosyl hyrolase, family 3 n=6 Tax=Cystobacteri... 149 2e-34 UniRef50_Q6MRE1 BglX2 protein n=2 Tax=Bdellovibrio bacteriovorus... 148 2e-34 UniRef50_D0LSI1 Glycoside hydrolase family 3 domain protein n=1 ... 148 3e-34 UniRef50_B2UPP0 Beta-N-acetylhexosaminidase n=1 Tax=Akkermansia ... 147 4e-34 UniRef50_Q67SQ8 Putative beta-N-acetylglucosaminidase n=1 Tax=Sy... 147 8e-34 UniRef50_A2EGF0 Glycosyl hydrolase family 3 N terminal domain co... 146 1e-33 UniRef50_C6XIZ0 Glycoside hydrolase family 3 domain protein n=1 ... 146 1e-33 UniRef50_B7S2D9 Glycosyl hydrolase family 3 N terminal domain pr... 145 2e-33 UniRef50_Q0C0J7 Beta-hexosaminidase n=2 Tax=Hyphomonadaceae RepI... 145 2e-33 UniRef50_A7BX55 Glycosyl hydrolase, family 3 n=4 Tax=Proteobacte... 144 4e-33 UniRef50_C5SMQ9 Beta-N-acetylhexosaminidase n=2 Tax=Caulobactera... 144 4e-33 UniRef50_B0K1P2 Glycoside hydrolase, family 3 domain protein n=9... 144 5e-33 UniRef50_Q3A4Z0 Putative glycosyl hydrolase n=1 Tax=Pelobacter c... 144 6e-33 UniRef50_Q1NMP4 Glycoside hydrolase, family 3-like n=3 Tax=Delta... 144 6e-33 UniRef50_D1BPZ2 Glycoside hydrolase family 3 domain protein n=3 ... 143 7e-33 UniRef50_UPI0001693BCF Beta-glucosidase-related glycosidase n=1 ... 143 9e-33 UniRef50_C6J2I0 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J... 142 1e-32 UniRef50_C9LY46 Beta-hexosaminidase n=1 Tax=Selenomonas sputigen... 142 1e-32 UniRef50_Q8YVY6 All1831 protein n=11 Tax=Nostocaceae RepID=Q8YVY... 142 2e-32 UniRef50_A3I7T1 Beta-hexosaminidase n=2 Tax=Bacillaceae RepID=A3... 142 2e-32 UniRef50_C6J1S3 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J... 142 3e-32 UniRef50_B1ZLT8 Beta-N-acetylhexosaminidase n=61 Tax=Alphaproteo... 142 3e-32 UniRef50_Q2RTG8 Beta-N-acetylhexosaminidase n=9 Tax=Alphaproteob... 141 3e-32 UniRef50_Q8CX63 Beta-N-acetylhexosaminidase (Beta-hexosaminidase... 140 5e-32 UniRef50_A8GPL9 Beta-glucosidase n=17 Tax=Rickettsia RepID=A8GPL... 140 5e-32 UniRef50_A6NU14 Putative uncharacterized protein n=1 Tax=Bactero... 140 5e-32 UniRef50_Q1IQ59 Beta-N-acetylhexosaminidase n=1 Tax=Candidatus K... 140 7e-32 UniRef50_B2IHJ4 Glycoside hydrolase family 3 domain protein n=3 ... 139 1e-31 UniRef50_A8ZS87 Beta-N-acetylhexosaminidase n=3 Tax=Desulfobacte... 139 2e-31 UniRef50_B4BR59 Glycoside hydrolase family 3 domain protein n=2 ... 138 3e-31 UniRef50_D2BER2 Beta-hexosamidase A n=1 Tax=Streptosporangium ro... 138 3e-31 UniRef50_B9E727 Putative uncharacterized protein n=1 Tax=Macroco... 138 3e-31 UniRef50_D0RP10 Beta-hexosaminidase n=1 Tax=alpha proteobacteriu... 138 3e-31 UniRef50_A8MJC3 Glycoside hydrolase family 3 domain protein n=2 ... 138 3e-31 UniRef50_C7MQM3 Beta-glucosidase-like glycosyl hydrolase n=2 Tax... 137 4e-31 UniRef50_Q0BSP6 Anhydromuramoyl-peptide exo-beta-N-acetylglucosa... 137 4e-31 UniRef50_A9F261 Put. Beta-glucosidase n=1 Tax=Sorangium cellulos... 137 5e-31 UniRef50_B5LAN7 HexI n=3 Tax=cellular organisms RepID=B5LAN7_9BURK 137 6e-31 UniRef50_C1E6R7 Glycoside hydrolase family 3 protein n=1 Tax=Mic... 137 6e-31 UniRef50_C7P2G4 Glycoside hydrolase family 3 domain protein n=1 ... 137 7e-31 UniRef50_Q11XV7 B-N-acetylglucosaminidase, glycoside hydrolase f... 137 7e-31 UniRef50_A9NEX4 Glycoside hydrolase, family 3 n=1 Tax=Acholeplas... 137 8e-31 UniRef50_B3QTH0 Beta-N-acetylhexosaminidase n=12 Tax=Chlorobiace... 137 8e-31 UniRef50_Q28S86 Beta-N-acetylhexosaminidase n=14 Tax=Rhodobacter... 136 1e-30 UniRef50_Q9RUP9 Glycosyl hyrolase, family 3 n=2 Tax=Bacteria Rep... 135 2e-30 UniRef50_B3E4C8 Glycoside hydrolase family 3 domain protein n=3 ... 135 3e-30 UniRef50_C4KZ16 Beta-N-acetylhexosaminidase n=2 Tax=Firmicutes R... 135 3e-30 UniRef50_A0LL99 Beta-N-acetylhexosaminidase n=1 Tax=Syntrophobac... 134 3e-30 UniRef50_A8SRF1 Putative uncharacterized protein n=1 Tax=Coproco... 134 4e-30 UniRef50_Q08U63 Beta-glucosidase n=1 Tax=Stigmatella aurantiaca ... 134 6e-30 UniRef50_C3RKZ3 Beta-hexosamidase A n=5 Tax=Bacteria RepID=C3RKZ... 134 7e-30 UniRef50_B0MSB5 Putative uncharacterized protein n=1 Tax=Eubacte... 133 7e-30 UniRef50_A5CZV7 Beta-glucosidase-related glycosidases and D-alan... 133 8e-30 UniRef50_Q67LG3 Putative beta-N-acetylglucosaminidase n=1 Tax=Sy... 133 8e-30 UniRef50_C8RZL5 Beta-N-acetylhexosaminidase n=1 Tax=Rhodobacter ... 133 9e-30 UniRef50_C8PT26 Glycosyl hydrolase, family 3 n=1 Tax=Treponema v... 133 9e-30 UniRef50_C8WFM5 Beta-N-acetylhexosaminidase n=3 Tax=Zymomonas mo... 133 9e-30 UniRef50_B0SE07 Beta-glucosidase-related glycosidase n=2 Tax=Lep... 133 1e-29 UniRef50_C5CH66 Beta-N-acetylhexosaminidase n=1 Tax=Kosmotoga ol... 133 1e-29 UniRef50_Q1AUI8 Glycoside hydrolase, family 3-like protein n=1 T... 132 1e-29 UniRef50_C8PQ37 Glycosyl hydrolase, family 3 n=1 Tax=Treponema v... 132 1e-29 UniRef50_A8VQ50 Choline dehydrogenase n=1 Tax=Bacillus selenitir... 132 2e-29 UniRef50_B1YFC7 Glycoside hydrolase family 3 domain protein n=6 ... 132 2e-29 UniRef50_A6TMT8 Glycoside hydrolase, family 3 domain protein n=1... 132 2e-29 UniRef50_UPI0001C42D84 glycoside hydrolase family 3 domain prote... 131 3e-29 UniRef50_Q982A1 Glycosyl hyrolase, family 3 n=2 Tax=Mesorhizobiu... 131 4e-29 UniRef50_D2AR90 Beta-glucosidase-related glycosidase-like protei... 131 4e-29 UniRef50_A3EUK5 Beta-N-acetylhexosaminidase n=2 Tax=Leptospirill... 131 4e-29 UniRef50_B3DX93 Beta-glucosidase-related glycosidase n=1 Tax=Met... 131 4e-29 UniRef50_B6BGC2 B-N-acetylglucosaminidase, glycoside hydrolase f... 131 5e-29 UniRef50_UPI00016BFC75 Beta-glucosidase-related glycosidase n=1 ... 130 5e-29 UniRef50_A3DC81 Glycoside hydrolase, family 3-like protein n=3 T... 130 6e-29 UniRef50_Q4FM08 Glycosyl hydrolase n=3 Tax=Candidatus Pelagibact... 130 6e-29 UniRef50_C6C0L9 Glycoside hydrolase family 3 domain protein n=1 ... 130 7e-29 UniRef50_C5DA89 Glycoside hydrolase family 3 domain protein n=6 ... 130 1e-28 UniRef50_C1TNN0 Beta-glucosidase-like glycosyl hydrolase n=1 Tax... 129 1e-28 UniRef50_A6Q7E8 Glycosyl hydrolase, family 3 n=1 Tax=Sulfurovum ... 129 1e-28 UniRef50_C1I263 Beta-hexosamidase A n=1 Tax=Clostridium sp. 7_2_... 129 1e-28 UniRef50_Q1K0E5 Glycoside hydrolase, family 3-like n=1 Tax=Desul... 129 1e-28 UniRef50_Q5WDA0 Beta-N-acetylhexosaminidase n=1 Tax=Bacillus cla... 129 2e-28 UniRef50_Q0F8S6 Glycosyl hydrolase, family 3 n=1 Tax=Rhodobacter... 129 2e-28 UniRef50_C1YLC2 Beta-glucosidase-like glycosyl hydrolase n=1 Tax... 129 2e-28 UniRef50_D1AF10 Glycoside hydrolase family 3 domain protein n=1 ... 128 3e-28 UniRef50_D1CIC2 Glycoside hydrolase family 3 domain protein n=1 ... 128 3e-28 UniRef50_A0RP18 Glycosyl hyrolase, family 3 n=2 Tax=Campylobacte... 128 3e-28 UniRef50_Q31RD3 Beta-glucosidase-related glycosidases-like n=5 T... 128 4e-28 UniRef50_C9RS08 Glycoside hydrolase family 3 domain protein n=1 ... 128 4e-28 UniRef50_C6D1D6 Glycoside hydrolase family 3 domain protein n=1 ... 127 4e-28 UniRef50_Q6MA50 Putative beta-N-acetylglucosaminidase n=1 Tax=Ca... 127 4e-28 UniRef50_B6GD75 Putative uncharacterized protein n=1 Tax=Collins... 127 8e-28 UniRef50_UPI0000E0E41F Beta-hexosaminidase A precursor n=1 Tax=G... 127 8e-28 UniRef50_B0P2K0 Putative uncharacterized protein n=2 Tax=Clostri... 126 1e-27 UniRef50_A8L1X4 Glycoside hydrolase family 3 domain protein n=3 ... 126 1e-27 UniRef50_A8ETM1 Glycosyl hydrolase n=2 Tax=Campylobacterales Rep... 126 1e-27 UniRef50_C6CWS0 Beta-N-acetylhexosaminidase n=1 Tax=Paenibacillu... 126 1e-27 UniRef50_Q04TR7 Beta-glucosidase-related glycosidase n=4 Tax=Lep... 126 1e-27 UniRef50_A9KMQ8 Glycoside hydrolase family 3 domain protein n=1 ... 125 2e-27 UniRef50_Q2G9D7 Beta-N-acetylhexosaminidase n=10 Tax=cellular or... 125 2e-27 UniRef50_Q73RR1 Glycosyl hydrolase, family 3 n=1 Tax=Treponema d... 125 2e-27 UniRef50_B5CPR6 Putative uncharacterized protein n=1 Tax=Ruminoc... 125 3e-27 UniRef50_C9NRN7 Beta-hexosaminidase n=1 Tax=Vibrio coralliilytic... 124 3e-27 UniRef50_A3CMS4 Beta-hexosamidase A, putative n=1 Tax=Streptococ... 124 3e-27 UniRef50_Q2JJA4 Glycosyl hydrolase domain protein n=1 Tax=Synech... 124 4e-27 UniRef50_A1JQZ0 Glycosyl hydrolase family protein n=8 Tax=Entero... 124 4e-27 UniRef50_A2DGR7 Glycosyl hydrolase family 3 N terminal domain co... 124 5e-27 UniRef50_Q15N77 Glycoside hydrolase, family 3-like n=1 Tax=Pseud... 124 5e-27 UniRef50_C6WFB0 Glycoside hydrolase family 3 domain protein n=1 ... 124 7e-27 UniRef50_D1R988 Putative uncharacterized protein n=1 Tax=Parachl... 123 8e-27 UniRef50_C8WZJ0 Glycoside hydrolase family 3 domain protein n=1 ... 122 1e-26 UniRef50_Q47VB6 Beta-hexosaminidase n=1 Tax=Colwellia psychreryt... 122 2e-26 UniRef50_UPI0001B54569 sugar hydrolase n=1 Tax=Streptomyces sp. ... 122 2e-26 UniRef50_A9E2K6 B-N-acetylglucosaminidase, glycoside hydrolase f... 122 2e-26 UniRef50_D0KBJ7 Glycoside hydrolase family 3 domain protein n=6 ... 122 2e-26 UniRef50_A1VC58 Glycoside hydrolase, family 3 domain protein n=3... 122 2e-26 UniRef50_A1ZDA7 Glycosyl hydrolase, family 3 n=1 Tax=Microscilla... 122 2e-26 UniRef50_A6G4L8 Putative uncharacterized protein n=1 Tax=Plesioc... 122 2e-26 UniRef50_A2QDM1 Contig An02c0230, complete genome n=9 Tax=Tricho... 122 3e-26 UniRef50_C0ZKK5 Probable beta-hexosaminidase n=1 Tax=Brevibacill... 121 3e-26 UniRef50_B2SGG8 Glycosyl hydrolase family 3 n=17 Tax=Francisella... 121 3e-26 UniRef50_D2PXL5 Glycoside hydrolase family 3 domain protein n=1 ... 121 4e-26 UniRef50_C9KM62 Glycosyl hydrolase domain protein n=1 Tax=Mitsuo... 121 4e-26 UniRef50_A4XA71 Glycoside hydrolase, family 3 domain protein n=4... 120 6e-26 UniRef50_A8F374 Glycoside hydrolase family 3 domain protein n=1 ... 120 7e-26 UniRef50_B4CZU1 Beta-N-acetylhexosaminidase n=2 Tax=Verrucomicro... 120 8e-26 UniRef50_Q2NRJ7 Putative glycosidase n=1 Tax=Sodalis glossinidiu... 119 1e-25 UniRef50_B4S1A2 Beta-hexosaminidase A n=2 Tax=Alteromonas macleo... 119 1e-25 UniRef50_C1F257 Glycosyl hydrolase, family 3 n=1 Tax=Acidobacter... 119 2e-25 UniRef50_Q6AJ70 Related to glycosyl hydrolase n=1 Tax=Desulfotal... 119 2e-25 UniRef50_D2M1Q8 Glycoside hydrolase family 3 domain protein n=1 ... 119 2e-25 UniRef50_C0D0H0 Putative uncharacterized protein n=1 Tax=Clostri... 118 2e-25 UniRef50_P48823 Beta-hexosaminidase A n=3 Tax=Alteromonadales Re... 118 3e-25 UniRef50_D2AXZ5 Beta-N-acetylhexosaminidase n=2 Tax=Streptospora... 118 3e-25 UniRef50_A2E8H3 Glycosyl hydrolase family 3 N terminal domain co... 118 3e-25 UniRef50_Q6L346 Beta-hexosaminidase n=1 Tax=Picrophilus torridus... 118 3e-25 UniRef50_B5RMG9 Beta-glucosidase, putative n=20 Tax=Borrelia Rep... 118 4e-25 UniRef50_Q1IZF6 Beta-N-acetylhexosaminidase n=2 Tax=Deinococcus ... 118 4e-25 UniRef50_C7MG90 Beta-glucosidase-like glycosyl hydrolase n=1 Tax... 117 5e-25 UniRef50_B6Q1F2 Beta-N-acetylglucosaminidase, putative n=1 Tax=P... 117 6e-25 UniRef50_C6WFU8 Glycoside hydrolase family 3 domain protein n=3 ... 117 9e-25 UniRef50_A5IG30 Glycosyl hydrolase n=6 Tax=Legionella RepID=A5IG... 116 1e-24 UniRef50_C6B6L4 Beta-N-acetylhexosaminidase n=11 Tax=Rhizobium R... 116 1e-24 UniRef50_D2PT32 Glycoside hydrolase family 3 domain protein n=26... 116 1e-24 UniRef50_Q30Y91 Beta-N-acetylhexosaminidase n=3 Tax=Desulfovibri... 116 1e-24 UniRef50_B2KAV5 Glycoside hydrolase family 3 domain protein n=1 ... 116 1e-24 UniRef50_B9D1W5 Glycosyl hyrolase, family 3 n=2 Tax=Campylobacte... 115 2e-24 UniRef50_C1MUP1 Glycoside hydrolase family 3 protein n=1 Tax=Mic... 115 2e-24 UniRef50_A4CPY8 Glycosyl hydrolase, family 3 n=1 Tax=Robiginital... 115 2e-24 UniRef50_UPI0001B588EB hydrolase n=1 Tax=Streptomyces sp. SPB78 ... 115 2e-24 UniRef50_UPI00019730D7 beta-glucosidase-like glycosyl hydrolase ... 115 3e-24 UniRef50_B5I9C4 Sugar hydrolase n=1 Tax=Streptomyces sviceus ATC... 115 3e-24 UniRef50_C2BJ14 Beta-N-acetylglucosaminidase n=2 Tax=Corynebacte... 114 4e-24 UniRef50_A7EUL4 Putative uncharacterized protein n=1 Tax=Sclerot... 114 4e-24 UniRef50_Q6MGM9 Beta-hexosamidase A n=1 Tax=Bdellovibrio bacteri... 114 4e-24 UniRef50_B1YER8 Beta-N-acetylhexosaminidase n=1 Tax=Exiguobacter... 114 5e-24 UniRef50_D2PU34 Glycoside hydrolase family 3 domain protein n=1 ... 114 6e-24 UniRef50_Q84IZ5 Beta-N-acetylglucosaminidase n=1 Tax=Clostridium... 114 6e-24 UniRef50_B1WW93 Beta-glucosidase n=12 Tax=Cyanobacteria RepID=B1... 113 8e-24 UniRef50_Q031K2 Beta-glucosidase-related glycosidase n=5 Tax=Lac... 113 9e-24 UniRef50_B9R4T0 Glycosyl hydrolase family 3 N terminal domain pr... 113 1e-23 UniRef50_C0EVT4 Putative uncharacterized protein n=1 Tax=Eubacte... 113 1e-23 UniRef50_A0Q7X6 Glycosyl hydrolase family 3 n=12 Tax=Francisella... 113 1e-23 UniRef50_D2L090 Glycoside hydrolase family 3 domain protein n=1 ... 113 1e-23 UniRef50_B8M4K9 Beta-N-acetylglucosaminidase, putative n=9 Tax=T... 112 2e-23 UniRef50_D2RL81 Glycoside hydrolase family 3 domain protein n=1 ... 112 2e-23 UniRef50_C4XU31 Putative beta-N-acetylhexosaminidase n=1 Tax=Des... 111 3e-23 UniRef50_Q7NG68 Glr3305 protein n=1 Tax=Gloeobacter violaceus Re... 111 3e-23 UniRef50_A5FAE4 Candidate beta-glycosidase; Glycoside hydrolase ... 111 3e-23 UniRef50_Q47M03 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase ... 111 3e-23 UniRef50_Q2SV66 Beta-N-Acetylglucosaminidase n=31 Tax=Burkholder... 111 3e-23 UniRef50_A5CLY4 Putative beta-N-acetylglucosaminidase n=1 Tax=Cl... 111 3e-23 UniRef50_B8DL88 Beta-N-acetylhexosaminidase n=1 Tax=Desulfovibri... 111 4e-23 UniRef50_A7ZD64 Beta-hexosaminidase A (N-acetyl-beta-glucosamini... 110 5e-23 UniRef50_B8HWS2 Beta-N-acetylhexosaminidase n=5 Tax=Cyanobacteri... 110 6e-23 UniRef50_A0JYZ9 Beta-N-acetylhexosaminidase n=1 Tax=Arthrobacter... 110 6e-23 UniRef50_C6W3E7 Beta-lactamase n=1 Tax=Dyadobacter fermentans DS... 110 7e-23 UniRef50_A0M1W2 Family 3 glycosyl hydrolase/beta-lactamase fusio... 109 1e-22 UniRef50_C4Y752 Putative uncharacterized protein n=1 Tax=Clavisp... 109 1e-22 UniRef50_A1SM50 Glycoside hydrolase, family 3 domain protein n=7... 109 2e-22 UniRef50_B0P7U2 Putative uncharacterized protein n=1 Tax=Anaerot... 109 2e-22 UniRef50_A3HXS5 Putative hydrolase/beta lactamase fusion protein... 109 2e-22 UniRef50_UPI000196C637 hypothetical protein CATMIT_02394 n=1 Tax... 108 2e-22 UniRef50_D1CEL0 Glycoside hydrolase family 3 domain protein n=1 ... 108 2e-22 UniRef50_Q1IHT6 Glycoside hydrolase, family 3-like n=1 Tax=Candi... 108 2e-22 UniRef50_UPI0001C379AD glycoside hydrolase family 3 domain prote... 108 3e-22 UniRef50_Q8RD12 Beta-glucosidase-related glycosidases n=2 Tax=Th... 108 3e-22 UniRef50_A6LEE9 Glycoside hydrolase family 3, candidate beta-gly... 108 3e-22 UniRef50_UPI0001621051 predicted protein n=1 Tax=Physcomitrella ... 108 3e-22 UniRef50_D1YED3 Glycosyl hydrolase family 3 N-terminal domain pr... 108 3e-22 UniRef50_B1Y1A2 Glycoside hydrolase family 3 domain protein n=1 ... 108 3e-22 UniRef50_A0LR43 Glycoside hydrolase, family 3 domain protein n=1... 108 4e-22 UniRef50_B2RGN3 Glycosyl hydrolase family 3 n=2 Tax=Porphyromona... 108 4e-22 UniRef50_A1K3B3 Putative beta-hexosaminidase n=1 Tax=Azoarcus sp... 107 4e-22 UniRef50_A7VCD5 Putative uncharacterized protein n=1 Tax=Clostri... 107 5e-22 >UniRef50_A7FH41 Beta-hexosaminidase n=129 Tax=Gammaproteobacteria RepID=NAGZ_YERP3 Length = 343 Score = 538 bits (1386), Expect = e-151, Method: Compositional matrix adjust. Identities = 260/338 (76%), Positives = 289/338 (85%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGPVMLDV YELDAEEREIL HPLVGGLILF+RN+HD QLRELVRQIRAAS RLVVA Sbjct: 1 MGPVMLDVASYELDAEEREILKHPLVGGLILFSRNFHDAEQLRELVRQIRAASHERLVVA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRVQRFR+GFTRLPAAQSFAA++ +LAQEAGWLMA+EM+AMDIDISFAPV Sbjct: 61 VDQEGGRVQRFRDGFTRLPAAQSFAAINDAATAAQLAQEAGWLMAAEMMAMDIDISFAPV 120 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+GH+SAAIGERS+H+DPQ+A +A FI GMH AGMKTTGKHFPGHGAVTADSHKETP Sbjct: 121 LDIGHVSAAIGERSFHSDPQQARIMAECFIRGMHSAGMKTTGKHFPGHGAVTADSHKETP 180 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D RP AEIR DM +F LI+ LDAIMPAHVIY++ D RPASGS YWL+ +LRQELG Sbjct: 181 HDNRPLAEIRTHDMVIFRELIQRKLLDAIMPAHVIYTEADARPASGSAYWLQEILRQELG 240 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F+G+IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR+GAVSVLDNLSPIKA+++ Sbjct: 241 FEGIIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRQGAVSVLDNLSPIKADQL 300 Query: 301 TRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 TRLYH G F RQ LM S+RW+ + L L ERW K Sbjct: 301 TRLYHSGQFDRQTLMASSRWQQANKALTALSERWDAHK 338 >UniRef50_Q4QLU8 Beta-hexosaminidase n=138 Tax=Gammaproteobacteria RepID=NAGZ_HAEI8 Length = 351 Score = 367 bits (941), Expect = e-100, Method: Compositional matrix adjust. Identities = 180/348 (51%), Positives = 240/348 (68%), Gaps = 8/348 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +++D++G EL+ EE E+L+HPLV GLILFTRN+ D Q++EL+R +R + L++ Sbjct: 1 MSTLLIDLKGKELEQEEVELLSHPLVAGLILFTRNFEDREQIQELIRSVRQRVKKPLLIT 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRVQRFR+GFT LP+ Q+F E A+EAGW MA+EMIA+DID+SFAPV Sbjct: 61 VDQEGGRVQRFRDGFTMLPSMQAFQETLSATEQVSFAKEAGWQMAAEMIALDIDLSFAPV 120 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+GH AIG+RS+ +D + A+ +A FIDGMH+AGM +TGKHFPGHG V ADSH ETP Sbjct: 121 LDLGHECRAIGDRSFSSDVKSAVNLAINFIDGMHQAGMASTGKHFPGHGHVLADSHLETP 180 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R + EI D+ F LI +NKLDAIMPAHVIY D +PASGS YWLK +LR++L Sbjct: 181 YDDRAKDEIFGYDLQPFQQLISQNKLDAIMPAHVIYGQCDSQPASGSEYWLKEILRKKLN 240 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE-- 298 F G IFSDDL M+GA +MG++ ER + +L+AGCD++L+CN R+G + VLDNL + E Sbjct: 241 FQGTIFSDDLGMKGAGVMGNFVERSKKALNAGCDLLLLCNEREGVIQVLDNLKLTENEPH 300 Query: 299 ------RVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 R+ L+ + +L+ RWK L + RW + +A Sbjct: 301 FMARQARLKNLFKRSVIEWNDLVLDQRWKLNHRNLADIQSRWLDIQAA 348 >UniRef50_A1SW90 Beta-hexosaminidase n=3 Tax=Alteromonadales RepID=NAGZ_PSYIN Length = 337 Score = 352 bits (903), Expect = 1e-95, Method: Compositional matrix adjust. Identities = 164/292 (56%), Positives = 214/292 (73%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M PV+LDVEGYELD+EE+EILAHPLV G+ILFTRNY+D QL+ LV+ IR + N L++A Sbjct: 1 MRPVILDVEGYELDSEEKEILAHPLVAGIILFTRNYYDIEQLKALVKDIRRYAGNELLIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD EGGRVQRFR+ FTRLP+A S + M+ +LA + W+MASE+IA DID SFAPV Sbjct: 61 VDHEGGRVQRFRDDFTRLPSAGSLIEKNDMKTACELAFSSAWVMASELIACDIDFSFAPV 120 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+ IS I R++ + + + +A +I+GM AGM +TGKHFPGHG+V ADSH P Sbjct: 121 LDLNGISNVIQNRAFSSSITETVTLAEAYINGMKSAGMVSTGKHFPGHGSVEADSHTALP 180 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R + EI KD+ F +LI++ LDA+MP+HV+YS D +PA S YWL VLR LG Sbjct: 181 VDSRSELEIFTKDIKPFENLIKKGALDAVMPSHVVYSQCDLQPAGFSSYWLDDVLRTRLG 240 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 F GV+ SDDLSM GA+ +G++ R ++++ AGCD+IL CN+R GAVS+LDNL Sbjct: 241 FKGVVISDDLSMHGASFVGNHLSRAESAIQAGCDLILACNDRSGAVSILDNL 292 >UniRef50_Q7PC47 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase family 3 n=2 Tax=Gammaproteobacteria RepID=Q7PC47_SACD2 Length = 344 Score = 327 bits (839), Expect = 3e-88, Method: Compositional matrix adjust. Identities = 169/343 (49%), Positives = 227/343 (66%), Gaps = 19/343 (5%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GPV+ D+ G L E+ ++ +PL+GGLILFTRNY P+QL LV+QIR+ R +++A Sbjct: 9 LGPVIADIAGQTLSDEDIALIKNPLIGGLILFTRNYSTPSQLDALVKQIRSV-RADIILA 67 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAAL--SGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFREGFTR+PA Q FA+ + E LA GWLMASE+ A DIDISFA Sbjct: 68 VDHEGGRVQRFREGFTRIPAMQVFASAYKARAELTLALACNTGWLMASELRAYDIDISFA 127 Query: 119 PVLDVG-HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV S+ IG+R++ +DP+ A+A FIDGM +AGM TGKHFPGHG+V ADSH Sbjct: 128 PVLDVDDSFSSIIGDRAFSSDPKAVTALAGAFIDGMQQAGMACTGKHFPGHGSVRADSHL 187 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E P D R I D+ F+ L ++KLDA+MPAH+++ +VD +P S WL+ +LR Sbjct: 188 ELPVDYRSLEAIEQLDLMPFAKL--QSKLDAVMPAHILFPEVDDQPVGFSSVWLQKILRD 245 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ++ +DGVIFSDDL+MEGAA+ GS+ ER ++ AGCD +LVCNNR+ + V+ L+ Sbjct: 246 KMAYDGVIFSDDLTMEGAAVAGSFGERAIKAMSAGCDTLLVCNNREATLEVIQTLA---- 301 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLH--ERWQEEK 338 G++S + R KA + + LH +RWQE K Sbjct: 302 -------DNGNYSTSIRLTRMRGKAGAQPIYDLHNNKRWQETK 337 >UniRef50_Q0HJG7 Beta-hexosaminidase n=35 Tax=Gammaproteobacteria RepID=NAGZ_SHESM Length = 342 Score = 314 bits (805), Expect = 3e-84, Method: Compositional matrix adjust. Identities = 172/340 (50%), Positives = 218/340 (64%), Gaps = 10/340 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +MLD+ ++ E E+L HP VGGLILF+RN+ QL LV+QIR R +++A Sbjct: 1 MSYLMLDLLSLDVSEAEAEMLRHPQVGGLILFSRNFSSREQLIALVQQIRQI-RPEILIA 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSF--AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFREGFT +PA AA M + A E G+LMA E++A DID+SFA Sbjct: 60 VDHEGGRVQRFREGFTLIPAMGDILPAAKGDMVLAKRWACELGFLMAIELLACDIDLSFA 119 Query: 119 PVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 PVLD+ IS IG+RS+ A P + +A+A FI+GM EAGM GKHFPGHG+V ADSH Sbjct: 120 PVLDLNGISQVIGKRSFSAKPDEVIALAQSFIEGMAEAGMGAVGKHFPGHGSVAADSHIA 179 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P D R I +D+ F LI + KL IMPAHVIY VDP PA S YWLK +LR+E Sbjct: 180 QPIDEREAEAIFNQDILPFKELIAKGKLSGIMPAHVIYPKVDPNPAGFSSYWLKQILRKE 239 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS-PIKA 297 LGF+GVIFSDDL M+GAA G Y R QA+LDAGCDMILVCN+ G +S+L+ P A Sbjct: 240 LGFNGVIFSDDLGMKGAAFAGDYLGRAQAALDAGCDMILVCNDNPGVMSLLNGFVWPAAA 299 Query: 298 ER--VTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 + + L + + L +++RW+ QL E+ Q Sbjct: 300 PQHPASLLKPNAAQTAIALENASRWE----NAKQLAEQIQ 335 >UniRef50_Q0VQT2 Beta-hexosaminidase n=3 Tax=Gammaproteobacteria RepID=Q0VQT2_ALCBS Length = 345 Score = 311 bits (797), Expect = 2e-83, Method: Compositional matrix adjust. Identities = 165/292 (56%), Positives = 202/292 (69%), Gaps = 6/292 (2%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +MLD+EG E+ A E+++LAHP GGLILF RNY D QL ELV+Q+RA R +++AVDQ Sbjct: 17 LMLDLEGLEVSASEKDMLAHPGAGGLILFARNYVDRQQLAELVQQVRAV-RPEMLIAVDQ 75 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEA--GWLMASEMIAMDIDISFAPVL 121 EGGRVQRFR+GF RLP + + + G LMASE+ +DIDISFAPVL Sbjct: 76 EGGRVQRFRDGFVRLPPMAALGQRYDQAPAQAVGEATLLGELMASELTELDIDISFAPVL 135 Query: 122 DVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 D+ + S+ IG+RS+H D +A+A FIDGM AGM TGKHFPGHG V ADSH E P Sbjct: 136 DLDYGNSSVIGDRSFHGDADAMIALAGAFIDGMSAAGMAATGKHFPGHGHVGADSHLELP 195 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 DPRP +E+ A DM F +L KLD IMPAHVIYS V+ + A S YWL+T LR+ LG Sbjct: 196 IDPRPLSELEAADMRPFMAL--APKLDGIMPAHVIYSAVEQQTAGFSRYWLQTQLRERLG 253 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 F GVIFSDDLSM GA +G Y +R QA+L+AGCDM+L CN +GA VLD L Sbjct: 254 FRGVIFSDDLSMAGAHGVGGYPQRAQAALEAGCDMVLACNCCEGAEQVLDFL 305 >UniRef50_B4RS37 Beta-hexosaminidase n=2 Tax=Alteromonas macleodii RepID=B4RS37_ALTMD Length = 340 Score = 304 bits (778), Expect = 4e-81, Method: Compositional matrix adjust. Identities = 157/315 (49%), Positives = 205/315 (65%), Gaps = 11/315 (3%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGP+MLD + EL EE+E LAHP+VGG+ILFTRNYHD QL LV+ IR ++N +++A Sbjct: 1 MGPLMLDCQAEELLPEEKEKLAHPVVGGVILFTRNYHDKRQLSALVQDIRRFAKNNVLIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS-GMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 VD EGGRVQRFR+GFT +PA S +E A+ G +A E+ +DID SFAP Sbjct: 61 VDHEGGRVQRFRDGFTAIPAMGDILRQSEHQDESAAYAKACGLALAYELKTLDIDFSFAP 120 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 VLD+ +S IG+R++ + + + +A++ I G+ M GKHFPGHG+V DSH Sbjct: 121 VLDINGVSQVIGDRAFSNNADEVIRLATQLIQGLKTVNMPAIGKHFPGHGSVAPDSHIAL 180 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQEL 239 P D R EIR DM VF +I LD +MPAHVIY+ V +PA S WLKT+L+ ++ Sbjct: 181 PVDERGFDEIRRTDMVVFERVIERALLDGVMPAHVIYNQVCDKPAGFSSTWLKTILKNDI 240 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP----- 294 GF+G +FSDDLSM GA++ GSY ER +A+LDAGCDM+L CNN KGA S+LD L Sbjct: 241 GFNGAVFSDDLSMHGASVAGSYVERAEAALDAGCDMVLACNNPKGAESILDGLEARFSNP 300 Query: 295 -----IKAERVTRLY 304 + ERV LY Sbjct: 301 NNSDVARQERVMALY 315 >UniRef50_UPI0000E11708 beta-hexosaminidase n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=UPI0000E11708 Length = 349 Score = 302 bits (774), Expect = 9e-81, Method: Compositional matrix adjust. Identities = 147/292 (50%), Positives = 194/292 (66%), Gaps = 1/292 (0%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M P+MLDV G+EL +E+E++AHPL GG+ILF RNYHDP QLR L QI+ ++ + +++A Sbjct: 1 MHPIMLDVAGFELTQQEQEVMAHPLTGGVILFARNYHDPEQLRHLTSQIKHSAGHPILIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRVQRF+ GFT +PA ++ +LA G ++A+E I +DISFAPV Sbjct: 61 VDQEGGRVQRFKNGFTAIPAMGDIEHHDHVQ-ARQLAYACGRVIATECIGHGVDISFAPV 119 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+ IS IG RS+ P L++A+ FI G+ GM GKHFPGHG V ADSH P Sbjct: 120 LDINAISDVIGTRSFSNAPSTILSLATEFIHGLKAIGMPAIGKHFPGHGNVKADSHIAMP 179 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R + I DM++F+ +++ +D IMPAHV+Y D+D RPA S WL+ +LRQ G Sbjct: 180 SDERSKDTIFELDMAIFTHFMQQQLVDGIMPAHVVYPDIDSRPAGFSSIWLQQILRQHCG 239 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 F GVIFSDDLSM A+ GS +R + +L AG DM L+CN A+ VLDNL Sbjct: 240 FQGVIFSDDLSMHAASEAGSMLDRVEHALAAGVDMALICNAPDEAIMVLDNL 291 >UniRef50_A4C9R3 Beta-hexosaminidase n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C9R3_9GAMM Length = 338 Score = 292 bits (748), Expect = 1e-77, Method: Compositional matrix adjust. Identities = 152/336 (45%), Positives = 200/336 (59%), Gaps = 6/336 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGP++ + G L EE E L HPLV G+ILFTRNYHD QL L QI+ S L+++ Sbjct: 1 MGPILTGISGLSLLPEEVEYLQHPLVAGVILFTRNYHDNEQLIALTAQIKKISP-MLLIS 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD EGGRVQRFR+GF+ +PA + LA + G+ +A+E+ A +D SF PV Sbjct: 60 VDHEGGRVQRFRDGFSAIPAMFEATSFQKKHNSPILA-DLGFTLAAELRAHGVDFSFTPV 118 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+ IS I R++ P + + A FI G+H AG+K KHFPGHG+V DSH P Sbjct: 119 LDLNGISQVITTRAFAISPIEVIDAAKHFIGGLHSAGVKNVVKHFPGHGSVGPDSHIAMP 178 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R +AEI DMSVF +L+ D +MPAHVIY DP+PA S YWLK VLR +LG Sbjct: 179 VDERSKAEIFNSDMSVFKALVDSGCADGVMPAHVIYPHCDPKPACFSDYWLKQVLRNKLG 238 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD----NLSPIK 296 F G +FSDD+ M+GA MG Y R Q++LDAGCD+ L+CN G VLD NL Sbjct: 239 FKGPVFSDDMGMQGAVQMGDYVTRTQSALDAGCDLALLCNEPHGLYEVLDGLEINLYQNH 298 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 +R K + + EL RW A + ++++ + Sbjct: 299 GQRGLAYLQKKNVTMMELKTDHRWLAAQSVISKMED 334 >UniRef50_Q3SKU2 Beta-hexosaminidase n=27 Tax=Proteobacteria RepID=NAGZ_THIDA Length = 359 Score = 290 bits (743), Expect = 4e-77, Method: Compositional matrix adjust. Identities = 161/330 (48%), Positives = 215/330 (65%), Gaps = 12/330 (3%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+MLDV G EL ++R L+ PLVGG+ILF+RNY DPAQL L +I A L++ Sbjct: 3 LGPLMLDVAGTELCDDDRRRLSDPLVGGVILFSRNYRDPAQLAALCEEIHALKAAPLLIG 62 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAAL--SGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR+GFTRLP +S A+ + +LA+E G+++ASE+ A +D SFA Sbjct: 63 VDHEGGRVQRFRDGFTRLPPMRSLGAIWNEHPQRARELARETGYVLASELRAHGVDFSFA 122 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + S+ IG+R++HADPQ + + GM +AGM GKHFPGHG V ADSH Sbjct: 123 PVLDLDYGASSVIGDRAFHADPQAVFQLGQAVMLGMKDAGMAACGKHFPGHGFVIADSHV 182 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 + P D R +I D+ F L+ E L A+MPAHVIY +VD +PA S WL+ +LRQ Sbjct: 183 DIPVDRRTLGDIARADLVPFR-LMVEAGLAAMMPAHVIYPEVDAQPAGFSRVWLQKILRQ 241 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL----- 292 +LGFDG +FSDDL M GAA G ER +A+LDAGCDM LVCN+ + A VL L Sbjct: 242 QLGFDGAVFSDDLCMAGAAFAGGIVERVRAALDAGCDMALVCNHPELADEVLAKLDVDWP 301 Query: 293 SPIKAERVTRLYHKGSF--SRQELMDSARW 320 +P +A R+ R++ + + L +SAR+ Sbjct: 302 APARA-RLARMHGHPAHAPTMTALRESARY 330 >UniRef50_Q1QVU0 Glycoside hydrolase, family 3-like protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QVU0_CHRSD Length = 340 Score = 288 bits (737), Expect = 2e-76, Method: Compositional matrix adjust. Identities = 164/336 (48%), Positives = 214/336 (63%), Gaps = 8/336 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G VMLD+EG +L EER +L P VGG+ILF RN D Q+R L R+IR R +++A Sbjct: 5 LGTVMLDIEGTQLGDEERRLLERPEVGGVILFARNTRDAEQVRRLTREIREL-RPDMLLA 63 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFA 118 +DQEGGRVQR REG TRLP+ + AA +E EAGWL+ EM A D++FA Sbjct: 64 IDQEGGRVQRLREGVTRLPSMAALAAGYADAPDEVRSRVHEAGWLLGMEMAACGFDVTFA 123 Query: 119 PVLDVG-HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV S AIG+RS+ ADP A+ FI+G+HEAGM GKHFPGHG VT DSH Sbjct: 124 PVLDVDDQRSPAIGDRSFSADPTVVAALGEAFIEGLHEAGMVAVGKHFPGHGGVTLDSHH 183 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D RP + +R D+ F +L KLDA+MPAHV+Y+ D RPA SP WL +LR+ Sbjct: 184 ALPEDNRPLSVLREHDLVPFKAL--SGKLDAMMPAHVVYTAFDTRPAGFSPSWLG-MLRE 240 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 E+ F GV+FSDDLSM GA + G+ A R +A+ AGCDM+LVCN+R A+ ++D + + Sbjct: 241 EMAFKGVVFSDDLSMAGAHVAGTPAARAEAAWSAGCDMVLVCNDRAAALEIVDAAAGRTS 300 Query: 298 ERVTRL-YHKGSFSRQELMDSARWKAISTRLNQLHE 332 +R+ +L Y + + L ARW+ RL L E Sbjct: 301 KRLGKLRYGRARPELETLPALARWRRAHARLEALSE 336 >UniRef50_C5BIH5 Glycoside hydrolase family 3 domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BIH5_TERTT Length = 343 Score = 282 bits (722), Expect = 1e-74, Method: Compositional matrix adjust. Identities = 156/340 (45%), Positives = 202/340 (59%), Gaps = 10/340 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GPV+LDV+G L + +L +P VGG+ILF+RNY + QL L +IR AS N +V+A Sbjct: 6 LGPVVLDVDGTTLSESDVNLLQNPWVGGVILFSRNYTNVMQLCGLTAEIRKASPN-IVIA 64 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAAL--SGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRF FT +PA Q L E L E GWL+A+E+IA +D+SF Sbjct: 65 VDHEGGRVQRFGGEFTPIPAMQQLGNLYVKSPVEALALTHETGWLLAAELIACGLDMSFT 124 Query: 119 PVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV S IG RS+ DP +A++ DG+ EAGM TGKHFPGHG V DSH Sbjct: 125 PVLDVDDCRSQIIGSRSFGPDPAAVIALSEALTDGLIEAGMSATGKHFPGHGGVVEDSHL 184 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E P D R E+ D+ F +L+ KL+AIMPAH+ + + P S +WL+ VLR+ Sbjct: 185 ELPVDNRSFDEVARHDLQPFKALL--PKLEAIMPAHIHFPQIASEPVGFSSFWLQQVLRE 242 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL----S 293 +LGF+G+IFSDDL+MEGA G Y ER +L AGCD +L+CNN GA L L S Sbjct: 243 DLGFNGIIFSDDLTMEGAVTAGDYGERTLKALAAGCDSVLICNNPAGAQEALARLDATFS 302 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + R+ + K F L SAR K + +L L R Sbjct: 303 TVFPSRLECMRAKRCFEWDSLRGSARAKGVRAQLESLVAR 342 >UniRef50_Q31G32 Beta-hexosaminidase n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=NAGZ_THICR Length = 357 Score = 275 bits (704), Expect = 1e-72, Method: Compositional matrix adjust. Identities = 148/344 (43%), Positives = 208/344 (60%), Gaps = 12/344 (3%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G +M+D+E L E+E L PLV G+ILF+RNY QL+ L +I +L++A Sbjct: 11 LGCIMVDLESTTLQPHEKERLLDPLVAGVILFSRNYESIEQLQALTTEIHQLRHPKLLIA 70 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAAL--SGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRF+EGF+ LPA + +EG +LAQ+ GWLMA+E++A+ +D SFA Sbjct: 71 VDHEGGRVQRFKEGFSMLPAMGQLGKCFRANEKEGLELAQQVGWLMATELLAVGVDFSFA 130 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PV+D+ + S IG+R++ +DP + + GM +AGM + KHFPGHG + AD+H Sbjct: 131 PVVDLDYGDSRVIGDRAFDSDPVIVGKLGKALVQGMRDAGMASVAKHFPGHGYIQADTHL 190 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E D R EI KD+ F LI EN LDA+MPAHV Y VD PA S WL+ VLRQ Sbjct: 191 EVAVDHREFQEIAHKDIQPFLKLI-ENGLDAVMPAHVRYPKVDDLPAGFSKVWLQEVLRQ 249 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS---- 293 + FDG I SDD+SM A G R A+L AGCD++LVCN+ A VL ++ Sbjct: 250 QCYFDGAIISDDMSMHAATEFGDAPTRVTAALKAGCDLVLVCNDPVAADEVLSQVTWETG 309 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKA----ISTRLNQLHER 333 P+ R+ RL+ KG +L ++ W++ +S LN ++++ Sbjct: 310 PLSHARLIRLHGKGHLKLSQLHNNPLWQSRASHVSHFLNSVNQQ 353 >UniRef50_A1AVY9 Beta-N-acetylhexosaminidase n=2 Tax=Gammaproteobacteria RepID=A1AVY9_RUTMC Length = 327 Score = 272 bits (695), Expect = 1e-71, Method: Compositional matrix adjust. Identities = 134/293 (45%), Positives = 190/293 (64%), Gaps = 6/293 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGP+M+DV G L +E++ L P +GG+ILF RN+ Q+ L++ IR ++N L++A Sbjct: 2 MGPIMMDVSGLALTTQEKQQLIKPSIGGVILFGRNFESIEQVTLLIKDIRQVNQN-LLIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR GFT LPA AL + A G+++A E++ + +D SFA Sbjct: 61 VDHEGGRVQRFRYGFTHLPAMSKLGALYDKNPDIVIDKAFSCGFVLAYELLDIGVDFSFA 120 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + S IG+R++H++P + +A I GMH+AGMK GKHFPGHG VTADSH Sbjct: 121 PVLDINYDTSQVIGDRAFHSNPDVIVKLAGSLISGMHKAGMKCVGKHFPGHGFVTADSHI 180 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 + P D RP +++ +DM F LI LDA+M AHV+Y+ VD + A S W+K +L+ Sbjct: 181 DLPIDDRPMSDL-LQDMKPFKRLINHG-LDALMYAHVVYTQVDDKIAGFSSKWIKDILQN 238 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 +L F GVIFSDDLSM+ A + R + SLD+GCDM+L+CN+ + ++D Sbjct: 239 QLKFKGVIFSDDLSMQSAHFIKDITNRVKTSLDSGCDMVLICNHPEFVAQIID 291 >UniRef50_B6J7B4 Anhydromuramoyl-peptide exo-beta-N-acetylglucosaminidase n=6 Tax=Coxiella burnetii RepID=B6J7B4_COXB1 Length = 313 Score = 269 bits (688), Expect = 1e-70, Method: Compositional matrix adjust. Identities = 142/325 (43%), Positives = 200/325 (61%), Gaps = 17/325 (5%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MG ++D+EG L +ERE+L HP V G+ILFTRNY +P+QLR L++ I+ L VA Sbjct: 1 MGFAIIDLEGVALSPQERELLQHPQVAGVILFTRNYENPSQLRSLIQTIKKI--RPLFVA 58 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGME--EGGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRF GFT LP+ + F + + K + A + E+ + ID++ Sbjct: 59 VDQEGGRVQRFHRGFTDLPSMRYFGECYQHDPLQARKHLENALKTVVHELKEVGIDVNMT 118 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + IGERS+ DP+ A+ S ID +H M KHFPGHG VTADSH+ Sbjct: 119 PVLDIDQGKNEIIGERSFSGDPRVVEALGSVVIDTLHANRMPAVAKHFPGHGGVTADSHQ 178 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D R I D FS+L ++LDAIMP H+IY D +PA+ S YWL+ +LRQ Sbjct: 179 ALPIDERDPEAILQCDWIPFSAL--SSRLDAIMPTHIIYPAFDEKPATFSRYWLQDMLRQ 236 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 L F GVI SDDL+M+GAA++G Y R +L+AGCD++ +CN+R+G +++LD+L Sbjct: 237 RLNFRGVIISDDLTMQGAAVIGDYPTRAAQALEAGCDLLTICNSREGVIAILDSL----- 291 Query: 298 ERVTRLYHKGSFSRQELMDSARWKA 322 L+++ S+Q L+ +R+ Sbjct: 292 -----LHYRNPESQQRLLSFSRFTT 311 >UniRef50_C4ZK55 Glycoside hydrolase family 3 domain protein n=6 Tax=Betaproteobacteria RepID=C4ZK55_THASP Length = 381 Score = 266 bits (681), Expect = 6e-70, Method: Compositional matrix adjust. Identities = 150/311 (48%), Positives = 193/311 (62%), Gaps = 10/311 (3%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 GPVM+DV G L EERE L PLVGG+ILF RNY QLR L +IR L++AV Sbjct: 10 GPVMIDVAGTALTDEERERLRDPLVGGVILFARNYTGSEQLRALTAEIRGLRDPALIIAV 69 Query: 62 DQEGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFA 118 D EGGRVQRFR +GFTRLP+ +S AL + L A+ G ++A+E+ A +D+SF Sbjct: 70 DHEGGRVQRFRTDGFTRLPSMRSLGALWAQDHLVALDAARATGVVLAAELRAHGVDLSFT 129 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + AIG R++H DPQ A+A GM EAGM GKHFPGHG V ADSH Sbjct: 130 PVLDLDYGCCRAIGNRAFHRDPQVVAALAQALCAGMAEAGMGCVGKHFPGHGFVEADSHH 189 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP----RPASGSPYWLKT 233 + P D R + +D++ + + +L +MPAHV+Y + DP +PA SP+WLK Sbjct: 190 DVPVDERDFDTVWNEDIAPYRHRL-GRQLAGVMPAHVVYPNADPSPEPQPAGFSPFWLKE 248 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 VLR LGF VIFSDDL+MEGA + G R +A+ AGCDM+LVCN A +LD + Sbjct: 249 VLRDRLGFQWVIFSDDLNMEGARVAGDIVGRAKAAYAAGCDMLLVCNRPDLAAELLDRWA 308 Query: 294 P-IKAERVTRL 303 P + A + RL Sbjct: 309 PDLDAGNLARL 319 >UniRef50_UPI0000E87F6B beta-hexosaminidase n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F6B Length = 337 Score = 265 bits (678), Expect = 1e-69, Method: Compositional matrix adjust. Identities = 140/312 (44%), Positives = 197/312 (63%), Gaps = 7/312 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +M+DV LD+ + + ++HPLVGG+ILF+RNY + QL+ LV+ IR N +++A Sbjct: 1 MSCLMIDVASLSLDSNDIKRISHPLVGGVILFSRNYENKKQLKLLVKSIRKIKHN-ILIA 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR+ FT+LPA +L EE ++A+ GWL+A E+ DID SF Sbjct: 60 VDHEGGRVQRFRDDFTQLPAMALLGSLFDKDPEEATRIAKLCGWLIAKELGGCDIDFSFT 119 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV + S+ IG R++H + Q + +A+ +G+ +AGM+ GKHFPGHG + AD+H Sbjct: 120 PVLDVNYESSSVIGNRAFHKEIQPTIHLANSLTEGLGQAGMQGVGKHFPGHGFIKADTHL 179 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E D R I+ DM VF +LI EN L IMP+HVIYS D +P+ S +WL+ LR+ Sbjct: 180 ELGEDERAFEVIKESDMLVFKALI-ENGLKGIMPSHVIYSLCDDKPSGLSHFWLQDQLRK 238 Query: 238 ELGFDGVIFSDDLSMEGAAIM-GSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS-PI 295 LGF+G IFSDD+SM+ A + +R +L AGCDM+LVCN + VL +L P+ Sbjct: 239 NLGFNGAIFSDDMSMQAAVFTEKNVTKRVLKALQAGCDMVLVCNAGEDVDIVLTHLDWPV 298 Query: 296 KAERVTRLYHKG 307 + R+ G Sbjct: 299 SKKSQCRIKAMG 310 >UniRef50_Q4FV07 Glycoside hydrolase family protein n=32 Tax=Pseudomonadales RepID=Q4FV07_PSYA2 Length = 360 Score = 264 bits (674), Expect = 4e-69, Method: Compositional matrix adjust. Identities = 137/283 (48%), Positives = 185/283 (65%), Gaps = 5/283 (1%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +M+D++ L AE+ ++ VGG+ILF RN D AQ+R L IR + + +++ V Sbjct: 3 GVLMIDIDSTALTAEDVSLIRQAQVGGVILFARNVTDAAQVRALCDDIRCHNAD-ILIGV 61 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFAP 119 DQEGGRV R R GFT LPA L + L A + G+LMA+E++A+ ID+SFAP Sbjct: 62 DQEGGRVARLRNGFTPLPAMGRLGELFNQDPNRALECAYDCGYLMAAEVLAVGIDLSFAP 121 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 VLD IS IG+RS+H DPQ +A+AS+F+ GM AGM TTGKHFPGHG++ DSH Sbjct: 122 VLDRDGISQVIGDRSFHHDPQAIIALASQFMRGMKAAGMATTGKHFPGHGSIAPDSHVAE 181 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQEL 239 D R +I DM F+ + LDA+MPAHVI+S VD +PA S WLK ++R++L Sbjct: 182 AIDDRSLDDIMNSDMQPFAQTLP--WLDALMPAHVIFSQVDNKPAGFSKIWLKDIIREQL 239 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 F+GV+FSDDLSM GA G + R A+++AGCD+ LVCN+R Sbjct: 240 NFEGVLFSDDLSMAGAKAAGDVSARVTAAIEAGCDIALVCNDR 282 >UniRef50_Q2SIQ7 Beta-glucosidase-related Glycosidase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SIQ7_HAHCH Length = 346 Score = 263 bits (672), Expect = 8e-69, Method: Compositional matrix adjust. Identities = 144/304 (47%), Positives = 189/304 (62%), Gaps = 9/304 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+++D++G ++ AEE+E+L+ P+VGG+I F RNY DP QL LVRQIRA R L++ Sbjct: 7 LGPLIIDLKGLDVSAEEKELLSRPIVGGVIFFARNYTDPQQLDALVRQIRAI-RPELLLC 65 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEE---GGKLAQEAGWLMASEMIAMDIDISF 117 VDQEGGRVQRFREG T LPA A S E +A E GWL A E+ +D+D SF Sbjct: 66 VDQEGGRVQRFREGRTLLPAPGKLRA-SYQENPAAAKAMATELGWLTAVELRELDLDFSF 124 Query: 118 APVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 APVLDV + + I +R++ P A A+A F GM EAG GKHFPGHG VT D+H Sbjct: 125 APVLDVDYGKNKVIADRAFGDTPTMAAALAQAFWRGMGEAGCAGVGKHFPGHGFVTEDTH 184 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 E+ DPR E+ DM F +I E ++A+MPAHVIY DP PAS S +WL VLR Sbjct: 185 IESARDPRAYEELLGADMRPFIEMI-EAGIEAMMPAHVIYPACDPSPASQSSFWLTQVLR 243 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQA--SLDAGCDMILVCNNRKGAVSVLDNLSP 294 + L + G I SDD+ M+GA + E +L AGCD+++ CNN + + V D L Sbjct: 244 ERLQYQGAIISDDMGMQGADLEAEEDESASVVRALRAGCDLLMACNNPERLLRVADQLES 303 Query: 295 IKAE 298 + E Sbjct: 304 LPGE 307 >UniRef50_Q5P081 Beta-hexosaminidase n=63 Tax=Proteobacteria RepID=NAGZ_AZOSE Length = 356 Score = 262 bits (669), Expect = 2e-68, Method: Compositional matrix adjust. Identities = 142/298 (47%), Positives = 193/298 (64%), Gaps = 5/298 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GPVMLDV G+ L EERE L PLVGG+ILF RN+ D QL+ L +I A L++A Sbjct: 14 LGPVMLDVAGFALTEEERERLLDPLVGGVILFARNFRDSEQLQALTAEIHALRSPALIIA 73 Query: 61 VDQEGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISF 117 VD EGGRVQRFR +GFTR+P+ + L + L A+ AG+++A+E++A +D+SF Sbjct: 74 VDHEGGRVQRFRTDGFTRIPSMRCLGRLWERDHVAALESARCAGYVLAAELLAHGVDLSF 133 Query: 118 APVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLD+ + S +G+R++H DP A+A + GM +AGM GKHFPGHG ADSH Sbjct: 134 TPVLDLDYGCSRVVGDRAFHRDPLVVAALAQSLVSGMADAGMGCVGKHFPGHGYAEADSH 193 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 E P D R I +D++ + + +L +MPAHVIY VDP PA S +WL+ +LR Sbjct: 194 VEIPVDEREFDAIWTEDIAPYRHRL-GRQLAGVMPAHVIYPRVDPNPAGFSRFWLQDILR 252 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 +GF GVIFSDDL+MEGA ++G R +A+ AGCD++LVCN AV +LD +P Sbjct: 253 GRVGFGGVIFSDDLTMEGATVVGDILARARAAFGAGCDVVLVCNRPDLAVDLLDRWAP 310 >UniRef50_B6BUL6 Beta-hexosaminidase n=1 Tax=beta proteobacterium KB13 RepID=B6BUL6_9PROT Length = 339 Score = 261 bits (666), Expect = 4e-68, Method: Compositional matrix adjust. Identities = 130/293 (44%), Positives = 191/293 (65%), Gaps = 7/293 (2%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +++D+EG E+ + + L+HP+V G+ILFTRN+ + Q+++L+ I+ R+ L++ V Sbjct: 4 GSIIIDIEGKEITQRDIQRLSHPVVCGVILFTRNFENKNQIQDLISSIKLV-RDDLLITV 62 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFAP 119 D EGGRVQRFRE F +P+ L E+G +LA+ AGWL+ASE+ + ID++++P Sbjct: 63 DHEGGRVQRFRENFYAMPSMSELGDLYAHDSEQGLRLARYAGWLIASELGELHIDLNYSP 122 Query: 120 VLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 VLD+ + S+ IG RS+ +P+ + +A FIDG H AGM + GKHFPGHG + DSH+E Sbjct: 123 VLDINYGKSSVIGNRSFSDNPKAIIELAKAFIDGQHHAGMTSVGKHFPGHGFIHEDSHEE 182 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 D R AE+ D+ F SL+ ++L A+MP HVIY D +PAS S +WLK LR++ Sbjct: 183 ISIDNRSLAEME-NDIDCFKSLVN-HRLGAVMPGHVIYKKSDNKPASLSSFWLKDFLRKK 240 Query: 239 LGFDGVIFSDDLSMEGAA-IMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 L F G+I SDDLSM+G A + ER +L AGCD++L+CN + LD Sbjct: 241 LNFKGIIISDDLSMKGVADYIPDIYERVSQALTAGCDVVLICNKPEDVDEFLD 293 >UniRef50_A9M1Z4 Beta-hexosaminidase n=23 Tax=Neisseriaceae RepID=NAGZ_NEIM0 Length = 361 Score = 258 bits (660), Expect = 2e-67, Method: Compositional matrix adjust. Identities = 133/283 (46%), Positives = 180/283 (63%), Gaps = 5/283 (1%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G VM D+ + L EE++ L P VGG+ILF RN+ + AQL+EL +I+A L++AV Sbjct: 9 GLVMADIAAFRLTEEEKQRLLDPAVGGVILFRRNFQNIAQLKELTAEIKALRTPELIIAV 68 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAAL---SGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 D EGGRVQRF EGFTRLPA + + G A + G ++A+E+ A ID+SF Sbjct: 69 DHEGGRVQRFIEGFTRLPAMSTLGEIWDKDGASAAETAAGQVGRVLATELSACGIDLSFT 128 Query: 119 PVLDVGHISA-AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + IG RS+H +P+ +A G+ + GMK+ GKHFPGHG V DSH Sbjct: 129 PVLDLDWGNCPVIGNRSFHRNPEAVARLALALQKGLEKGGMKSCGKHFPGHGFVEGDSHL 188 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D R +E+ A D++ F + RE + A+MPAHV+Y VD +PA S WLK +LR+ Sbjct: 189 VLPEDGRSLSELEAADLAPFRIMSREG-MAAVMPAHVVYPQVDTKPAGFSEIWLKQILRR 247 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 ++GF GVIFSDDL+MEGA G ER + S +AGCD++LVCN Sbjct: 248 DIGFKGVIFSDDLTMEGACGAGGIKERARISFEAGCDIVLVCN 290 >UniRef50_C1D8P0 NagZ n=4 Tax=Proteobacteria RepID=C1D8P0_LARHH Length = 362 Score = 258 bits (658), Expect = 3e-67, Method: Compositional matrix adjust. Identities = 150/306 (49%), Positives = 185/306 (60%), Gaps = 5/306 (1%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 GPVM+DV G L ++R L+HP+VG +ILF RN+ QLR L +IR+ L++AV Sbjct: 8 GPVMVDVAGCSLSDDDRRRLSHPMVGAVILFRRNFESVGQLRSLTEEIRSLRSPHLLIAV 67 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAAL--SGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 D EGGRVQRF GFTRLP Q+ AL E LA + G ++A+E+ A +D+SF P Sbjct: 68 DHEGGRVQRFLTGFTRLPPMQALGALWEKSPEAALDLAHDVGEVLATELRASGVDLSFTP 127 Query: 120 VLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 VLD+ H A IG RS+HADP +A G+ GM GKHFPGHGAV DSH Sbjct: 128 VLDIDHGRCAVIGNRSFHADPNIVATLAEALTRGLVAGGMIACGKHFPGHGAVEGDSHHV 187 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P D R A I D+ F+ L + A+MPAHV+Y VDP PA SP+WL+TVLR+ Sbjct: 188 IPVDHRSYASIEEADLIPFARLAAAG-MHAVMPAHVVYDAVDPLPAGFSPFWLQTVLRER 246 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS-PIKA 297 +GF G IFSDDL MEGAA G A R QA+ AGCDM LVCN A +L L P+ Sbjct: 247 MGFGGTIFSDDLCMEGAAGAGGIAARAQAAFAAGCDMALVCNRPDLADELLAQLDMPVTP 306 Query: 298 ERVTRL 303 E RL Sbjct: 307 ELTGRL 312 >UniRef50_A4BUN0 Glycoside hydrolase, family 3-like protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BUN0_9GAMM Length = 337 Score = 257 bits (657), Expect = 4e-67, Method: Compositional matrix adjust. Identities = 147/333 (44%), Positives = 191/333 (57%), Gaps = 9/333 (2%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ + G L EERE+L P +GG+ILF+RNY QL +L I A R L++AVDQE Sbjct: 1 MIGIRGLVLLPEERELLCDPRIGGVILFSRNYESIEQLSQLTEAIHAIRRPSLLIAVDQE 60 Query: 65 GGRVQRFREGFTRLPAAQSFAAL--SGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 GG VQRFR GFT LPA L G A+ GWLMA+E+ A +D+SFAPVLD Sbjct: 61 GGCVQRFRTGFTALPAPACLGRLYDHGPPVALAAAERLGWLMAAELRAAGVDLSFAPVLD 120 Query: 123 VGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 V +S IGER++H + ++ GM +AGM GKHFPGHGAV ADSH E P Sbjct: 121 VERGVSRVIGERAFHRRAGAVADLTLSWVRGMRQAGMAAVGKHFPGHGAVVADSHHELPV 180 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 D RP I + D F LI L+ +M AHV+Y VD RPAS S W+ VLR+ LGF Sbjct: 181 DDRPLDTIASVDFLPFRRLIAVG-LEGVMMAHVLYPQVDARPASFSGVWIAGVLRETLGF 239 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS----PIKA 297 G + SDDL M+ AA +G +R +L AGCD++L+ N+ + A S+++ L P++A Sbjct: 240 TGAVVSDDLGMQAAACIGDLYQRAGVALAAGCDLLLLGNDWEPAASIVERLPSVERPVRA 299 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 R+ L +G L RW T QL Sbjct: 300 ARLAALRGEGRLG-SPLQRDPRWLPAVTLATQL 331 >UniRef50_Q2BGB3 Beta-hexosaminidase n=1 Tax=Neptuniibacter caesariensis RepID=Q2BGB3_9GAMM Length = 342 Score = 252 bits (643), Expect = 2e-65, Method: Compositional matrix adjust. Identities = 130/289 (44%), Positives = 186/289 (64%), Gaps = 4/289 (1%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 + LD++G EL EE+ +L PL+GG+ILF RN + Q+ ELV IRA R L++++DQ Sbjct: 13 LFLDLDGLELTDEEKVLLLDPLIGGVILFARNTQNAKQVAELVASIRAI-RPDLILSLDQ 71 Query: 64 EGGRVQRFREGFTRLPAAQSFAALS-GMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 EGGRVQR ++G T LP ++ + S +EE A++ G+LMA+E+ +DID+SFAPV+D Sbjct: 72 EGGRVQRLKDGVTLLPPVKALSKFSTSLEENRSTAEQLGYLMAAELRQLDIDLSFAPVVD 131 Query: 123 VGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + + IG R++ D ++S +I+GM +AGM T KHFPGHG AD+H Sbjct: 132 IDYGRNTVIGNRAFGNDVSTVTVLSSGYIEGMRDAGMPATVKHFPGHGWANADTHHADAI 191 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 D R + I DM F I N ++A+M AHV+Y D PA S +W+ VLR LGF Sbjct: 192 DEREFSVIEQADMVPFKEAI-TNGVEAMMFAHVLYPQCDASPAGFSDFWIGNVLRDGLGF 250 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 GVIFSDDLSM+ A G YA+R + ++ AGC ++L CN+RKG ++VL+ Sbjct: 251 QGVIFSDDLSMKAAHSAGGYAKRAEHAIRAGCQVLLCCNDRKGTLAVLE 299 >UniRef50_Q1N543 Glycoside hydrolase, family 3-like protein n=1 Tax=Bermanella marisrubri RepID=Q1N543_9GAMM Length = 334 Score = 251 bits (640), Expect = 3e-65, Method: Compositional matrix adjust. Identities = 125/300 (41%), Positives = 191/300 (63%), Gaps = 6/300 (2%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 ++ D+E + L +ER L+H + G+ILFTR++ QL+ L+ +I+A + +L++ VD Sbjct: 5 LITDIETHSLSEDERTWLSHSYLAGIILFTRHFESKTQLKTLISEIKAIN-PKLLITVDH 63 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEG--GKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQRFR+GFTR+PA L + +LA ++ ++ E++ + ID+++APVL Sbjct: 64 EGGRVQRFRDGFTRVPAMGRLGKLYQEDSARANQLAFDSAVVLCCELLDIGIDLTYAPVL 123 Query: 122 DVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 D+ + + +G+R + + + +AS FIDG+ G + KHFPGHG VT DSH P Sbjct: 124 DIDYQRNQVVGDRGFAHTKEAIVDLASHFIDGVKSIGFPSVAKHFPGHGWVTQDSHVSCP 183 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R EI++ D+ VF L+ K+D IMPAHV+Y VD +PA S +W+ +LR ++G Sbjct: 184 VDERSFEEIKSNDLWVFDHLM--PKIDWIMPAHVVYERVDEQPAGFSKHWVTNILRNDMG 241 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F G + SDDLSMEGAA G YA R +A+LDAGC+++L CN+ +L+ LS + ER+ Sbjct: 242 FQGPVVSDDLSMEGAAQTGGYASRAKAALDAGCNVMLACNSSVAPKEILEFLSHSEYERL 301 >UniRef50_B3T2B0 Putative glycosyl hydrolase family 3 N terminal domain protein n=2 Tax=prokaryotic environmental samples RepID=B3T2B0_9ZZZZ Length = 340 Score = 249 bits (636), Expect = 1e-64, Method: Compositional matrix adjust. Identities = 135/341 (39%), Positives = 203/341 (59%), Gaps = 19/341 (5%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+ + VEG L +++IL+H LVGG++LF+RNY + QL+ L++ I++ L+VA Sbjct: 3 LGPIFVSVEGKTLTPGDQKILSHNLVGGVVLFSRNYENKDQLKLLIKSIKSLKSPDLLVA 62 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAAL--SGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRF GF +P+ +S L E G + A ++ E++ + +D SFA Sbjct: 63 VDQEGGRVQRFISGFYPIPSMRSLGNLYDQSCENGMQATCIAAEILGLELLEVGVDFSFA 122 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ H +S IG RS+H+DP+ + +F +G+ +AGM KHFPGHG V DSH+ Sbjct: 123 PVLDIDHGVSEVIGSRSFHSDPKVVDCLGEQFYEGLQKAGMNCVAKHFPGHGGVREDSHQ 182 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 +TP D R I +D+ + +LI+ L IM AHV Y VD + A+ S YWL+ L++ Sbjct: 183 KTPVDNREMKAI-IEDLQPYKTLIKRG-LSGIMTAHVRYPLVDDQIATFSHYWLRKCLQK 240 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 EL F+G+IFSDDL M+G + S ++ S AGCD +L+CN+ + L L K Sbjct: 241 ELHFNGIIFSDDLMMKGTDFITSIPKKALLSFAAGCDFVLICNSPETVNQCLSELHYAKD 300 Query: 298 E------RVTRL--YHKGSFSRQELMDSARWKAISTRLNQL 330 + ++ L +H+G RQ + + I T+LN+L Sbjct: 301 QFANLERKIQYLIPFHQG--QRQ----AKYLQEIKTKLNRL 335 >UniRef50_B2FPW9 Beta-hexosaminidase n=18 Tax=Xanthomonadaceae RepID=NAGZ_STRMK Length = 335 Score = 246 bits (628), Expect = 9e-64, Method: Compositional matrix adjust. Identities = 130/279 (46%), Positives = 176/279 (63%), Gaps = 3/279 (1%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ V G EL A+ER+ L H V G++LF RN+ Q+ +L IRAA+ ++ VDQ Sbjct: 2 LLIGVAGTELTAQERDWLQHDAVAGVVLFKRNFASRQQVTDLSAAIRAAAPRPQLICVDQ 61 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQRFREG++ LP Q AL ++ LA+ WLMASE+ A +D+SFAPV+ Sbjct: 62 EGGRVQRFREGYSDLPPLQDIGALYATDPQQALALAERHAWLMASEVRASGLDLSFAPVV 121 Query: 122 DVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 D+G + AIG R++ DPQ A + ++ GMH GM T KHFPGHG V D+H +T Sbjct: 122 DLGRGNRAIGNRAFSEDPQVVAAFTAAYVRGMHSVGMAATLKHFPGHGTVLEDTHVDTAI 181 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 DPR E+RA+D+ F + I DA+M AHVIY + P PA SP W++ +LR ELGF Sbjct: 182 DPRALDELRAQDLVPFQAGIAAGA-DAVMMAHVIYPQIAPEPAGYSPRWIQDILRGELGF 240 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 GV+FSDD+ M + G R A LDAGCD++LVC+ Sbjct: 241 RGVVFSDDIGMAASHSAGGVPARVHAHLDAGCDVVLVCH 279 >UniRef50_D0L0K9 Glycoside hydrolase family 3 domain protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0K9_HALNC Length = 362 Score = 240 bits (612), Expect = 7e-62, Method: Compositional matrix adjust. Identities = 131/315 (41%), Positives = 185/315 (58%), Gaps = 9/315 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+M+DV G L ER L +P VGG+ILF RN+ QL L +IRA L++A Sbjct: 4 LGPLMVDVAGLTLTDAERTRLMNPAVGGVILFARNFSSTEQLTALTAEIRALRHPHLLIA 63 Query: 61 VDQEGGRVQRFR-EGFTRLPAAQSFAAL--SGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 VDQEGGRVQRFR +GF LP AL + G + A+ GWLMA ++ +D+SF Sbjct: 64 VDQEGGRVQRFRDDGFIHLPPMGRLGALFKHDRQAGLEAAKSLGWLMAVQLRMAGVDLSF 123 Query: 118 APVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 APVLD+ +S IG+R++ DP +A+A +I GM EAGM GKHFPGHG++ DSH Sbjct: 124 APVLDLDRGVSGVIGDRAFSRDPASIIALAVAWISGMKEAGMAAVGKHFPGHGSIAEDSH 183 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 P D R + I D+ F ++I E + A+M AHV+Y +D PA S W++ +LR Sbjct: 184 IAHPVDHRALSTIERDDLRPFVAMI-ERFIPALMMAHVVYPKIDALPAGFSKVWVQDLLR 242 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN----L 292 + + G +F+DDL+M A++ G R A+L AGCDM L+CN+ + +L + Sbjct: 243 KRWQYQGAVFTDDLTMAAASVAGDIGARVDAALAAGCDMALICNHPQWVDQLLTHRKFTA 302 Query: 293 SPIKAERVTRLYHKG 307 P++ R+ L G Sbjct: 303 DPLREVRLAHLMGHG 317 >UniRef50_C6N603 Beta-hexosaminidase n=2 Tax=Legionella RepID=C6N603_9GAMM Length = 358 Score = 238 bits (607), Expect = 3e-61, Method: Compositional matrix adjust. Identities = 129/282 (45%), Positives = 174/282 (61%), Gaps = 8/282 (2%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+D+EG EL A E IL P VG +ILFTRN+ +P QL +L+ I + L +AVD E Sbjct: 6 MVDIEGTELSALEHTILKQPNVGAVILFTRNFTNPEQLAKLICDIDEINP-ELFIAVDHE 64 Query: 65 GGRVQRF-REGFTRLPAAQSF--AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 GG VQRF R+GF+ LPAA+++ A E G QE G +MA E++A +D+S APVL Sbjct: 65 GGNVQRFQRQGFSALPAARAYGRAYDKNHEIGIAYTQEYGEIMAKELLACGVDLSIAPVL 124 Query: 122 DVGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV S I R++H D + +A FI GM+ AGM GKHFPGHG+V +DSH Sbjct: 125 DVHDKSQVIAGLNRAFHKDADAIIDLAGAFIKGMNTAGMPAVGKHFPGHGSVVSDSHIAM 184 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQE 238 P E+ A D+ F LI + L A+MPAHV Y+ +D PA S WL+ +LR + Sbjct: 185 PVSQASMTELAANDLKPFVELINKGLLTALMPAHVTYAAIDKENPAGFSKIWLQDILRTQ 244 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 LGF G++ SD LSM+GA I G+ R + +L AGCDM+++C+ Sbjct: 245 LGFKGLVLSDCLSMKGADI-GNLTTRAEKALAAGCDMLIICH 285 >UniRef50_A6GP56 Beta-hexosaminidase n=1 Tax=Limnobacter sp. MED105 RepID=A6GP56_9BURK Length = 340 Score = 234 bits (598), Expect = 3e-60, Method: Compositional matrix adjust. Identities = 130/285 (45%), Positives = 176/285 (61%), Gaps = 6/285 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP ++ + G L E+ + +P VGGLILFTRN+ AQL +LV IR+ V Sbjct: 2 LGPFIIGLTGLTLSKEDIARIQNPAVGGLILFTRNFESKAQLIDLVESIRSNGGLSKPVF 61 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFA--ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR GFT +P+ + A + ++ LA+EAG++MA+E+ A ID+SFA Sbjct: 62 VDHEGGRVQRFRNGFTAIPSMRQIGQRAETDFDDAVLLAREAGFVMAAELRACGIDMSFA 121 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLD+ G+ S IG+RS+ D + ++S ++GM AGM+ GKHFPGHG V+ DSH Sbjct: 122 PVLDLDWGN-SEIIGDRSFGKDARMVSRLSSALMNGMALAGMQACGKHFPGHGWVSLDSH 180 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 P D R EI D + L + +IMPAHV+YS+VDP PA S +W++ VLR Sbjct: 181 LALPHDNRSLNEILKVDALPYQ-LNSTLNMASIMPAHVVYSEVDPMPACFSSFWIQEVLR 239 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 + FDG + SDDL M GA +G R A+L AGCD +L CNN Sbjct: 240 VKFSFDGAVISDDLDMVGAHGVGDIRARASAALQAGCDAVLACNN 284 >UniRef50_C5T2K1 Beta-N-acetylhexosaminidase n=1 Tax=Acidovorax delafieldii 2AN RepID=C5T2K1_ACIDE Length = 396 Score = 233 bits (594), Expect = 7e-60, Method: Compositional matrix adjust. Identities = 138/342 (40%), Positives = 188/342 (54%), Gaps = 55/342 (16%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 P++LDV G L ++R LAHPL GG+ILF RN+ + AQL +L I+A R+ L++ VD Sbjct: 6 PLILDVAGTALSPDDRRRLAHPLTGGVILFARNWENRAQLLQLTSSIKAV-RDDLLICVD 64 Query: 63 QEGGRVQRFR-EGFTRLPAAQSFAAL-------------SGMEEGGKLAQEAGWLMASEM 108 EGGRVQRFR +GFT LP ++F + SG A AG+++ +E+ Sbjct: 65 HEGGRVQRFRTDGFTHLPPMRAFGEMWLDDGKGARAVPGSGALRATNAATAAGYVLGTEL 124 Query: 109 IAMDIDISFAPVLD----------VGHI------------------------SAAIGERS 134 A +D SF PVLD VG S IG+R+ Sbjct: 125 RACGVDFSFTPVLDLDWPDDAPQAVGAARPGPAAHGAPRAGSTSRGQAATSRSGVIGDRA 184 Query: 135 YHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDM 194 +H+DP+ +A + G+ +AGM GKHFPGHG V ADSH E P D R I A D Sbjct: 185 FHSDPRVVALLAKSLMHGLLQAGMAHCGKHFPGHGHVKADSHTEVPVDRRSFKAILADDA 244 Query: 195 SVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEG 254 + ++ L + L ++MPAHVIY VD RPA S WL+ +LR+++ FDG IFSDDLSM G Sbjct: 245 APYAWL--SSTLTSVMPAHVIYPRVDSRPAGFSRRWLQDILRRQMHFDGAIFSDDLSMAG 302 Query: 255 AAIMGS----YAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 A + Y + A+L+AGCD++L+CN G S +D L Sbjct: 303 ARRIDGEVVRYTDAAIAALEAGCDLVLLCNQSLGGGSAVDEL 344 >UniRef50_B5EPA1 Glycoside hydrolase family 3 domain protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EPA1_ACIF5 Length = 354 Score = 231 bits (588), Expect = 4e-59, Method: Compositional matrix adjust. Identities = 132/293 (45%), Positives = 180/293 (61%), Gaps = 7/293 (2%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 GP+M+D+ G A E +L P VGG+ILF RN D Q+R L ++ A L+V + Sbjct: 13 GPLMVDIAGLRPTAAECAMLRQPAVGGVILFARNCADAGQVRALCAELHALRTTPLLVGI 72 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALS---GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 DQEGGRVQR R G TR P + L+ G+E A++ G L+ +E+ A+ +D+ F Sbjct: 73 DQEGGRVQRLRAGVTRFPPMATLGRLADRDGLERALVAAEDWGLLLGTELRALGVDLDFT 132 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 P +D+ ISA IG+R+ H DP AIA+ GM G++ KHFPGHGAV ADSH Sbjct: 133 PCVDLDRGISAVIGDRAIHRDPVWVGAIAAALWKGMARTGLQGVAKHFPGHGAVAADSHL 192 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLR 236 + P D RP + I+ +D+ F++LI + AIMPAH +Y VD PA SP WL+ VLR Sbjct: 193 DLPLDDRPFSAIQ-EDLQPFAALIAAG-IPAIMPAHCLYPRVDGEHPAGFSPRWLQDVLR 250 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 +E+GF GV+ SDDLSM GA G A R A+L AG D++LVCN+ +GA + + Sbjct: 251 KEMGFTGVVVSDDLSMAGAHGAGGVAARVDAALTAGADLLLVCNDPEGAETAI 303 >UniRef50_C7HX27 Beta-N-acetylhexosaminidase n=1 Tax=Thiomonas intermedia K12 RepID=C7HX27_THIIN Length = 361 Score = 219 bits (559), Expect = 8e-56, Method: Compositional matrix adjust. Identities = 132/298 (44%), Positives = 176/298 (59%), Gaps = 13/298 (4%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 P++LDV G +L A +R+ L +PLVGG++LF RN+ QL L QI+A R L++AVD Sbjct: 8 PLILDVAGLQLGAADRKRLRNPLVGGVVLFARNWESRGQLLSLCAQIKAV-RPDLLIAVD 66 Query: 63 QEGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFAP 119 EGGRVQRFR +GFT LPA + A + G + A G+++ASE+ A +D SF P Sbjct: 67 HEGGRVQRFRTDGFTALPAMRELGAHWMADAPGAMRRAAACGFVLASELRACGVDFSFTP 126 Query: 120 VLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 VLD+ S+ IG+R++HAD + +A + G+ AGM KHFPGHG V AD+H Sbjct: 127 VLDLDWGGSSVIGDRAFHADARVVTVLAQSLMLGLARAGMVHCAKHFPGHGYVRADTHLA 186 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 D R I + D + + L L A+MPAHVIY +VD PA S WL+ VLR+ Sbjct: 187 VARDARTLDAILSADAAPYQWL--APGLRAVMPAHVIYPEVDALPAGFSSVWLRDVLRER 244 Query: 239 LGFDGVIFSDDLSMEGAAIMG-SYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 +GF G I SDDL M A G S ++ L AGCD +LVCN SV+DN P+ Sbjct: 245 IGFTGAILSDDLGMAAARAEGQSLSQAALQGLQAGCDAMLVCNQ-----SVVDNGRPL 297 >UniRef50_C8N6C8 Beta-hexosaminidase n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N6C8_9GAMM Length = 335 Score = 216 bits (551), Expect = 6e-55, Method: Compositional matrix adjust. Identities = 126/284 (44%), Positives = 167/284 (58%), Gaps = 8/284 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M V++ +EG L A+E L P G+ILF+RNY PAQL+ L IRA + ++ Sbjct: 1 MSAVIIGIEGTMLLAQEAAQLRRPFCAGVILFSRNYSSPAQLKRLCATIRAQHPD-AIIC 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEE--GGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRFREGFT LPA + +LA+ G ++A E+ ID SF Sbjct: 60 VDQEGGRVQRFREGFTALPAMGALGRCFDRRPLLACELAEAVGVIIAYELRQFSIDFSFT 119 Query: 119 PVLDV-GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ +S IG+R+ HA+P + +A G+ GM GKH+PGHG V DSH Sbjct: 120 PVLDLHDPVSEVIGDRALHANPAVVVTLADALRRGLGNVGMIAVGKHYPGHGRVVGDSHH 179 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P DPR E R D+ F I E+ ++A+MPAH+++ D RPAS SP +K + + Sbjct: 180 VLPQDPRSTDE-REPDLYPFIRQI-ESGIEALMPAHIVFPQRDHRPASFSPVHIKEL--R 235 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 GFDG I SDDL M GA + AER QA+LDAG D+ L+CN+ Sbjct: 236 ARGFDGAIISDDLDMAGALGIADPAERIQAALDAGADVTLMCND 279 >UniRef50_A5EWZ9 Beta-hexosaminidase n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EWZ9_DICNV Length = 334 Score = 195 bits (495), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 115/290 (39%), Positives = 168/290 (57%), Gaps = 9/290 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 + + +E +L ++ L PL G++LF+RNY +P QLR L+++IR R ++++ VD Sbjct: 3 LFIGIEQTKLCRADKRRLQSPLCSGVVLFSRNYQNPEQLRALIQEIRQL-RPQILIGVDH 61 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEE--GGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQRFREGFT LPAA + + +AQ G ++A E+ A ID SFAPVL Sbjct: 62 EGGRVQRFREGFTLLPAAAQIGRIFDQDPVISWSVAQACGVILAYELRAQSIDFSFAPVL 121 Query: 122 DV-GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 D+ S IG+R++H +P ++ GM GM GKHFPGHG V+ DSH E P Sbjct: 122 DLYDPKSTVIGDRAFHENPAIVSVLSCSMRRGMKTVGMAAVGKHFPGHGRVSGDSHHELP 181 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R AE RA D+ F ++++ ++A+M AH++ D D PAS S ++ + ++ G Sbjct: 182 IDTRSDAERRA-DLVPFIFNVQQD-IEALMMAHILMPD-DTLPASFSAKIIQEL--RDWG 236 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 F G + SDDL M A + + QA LDAG D ++CN+ V+D Sbjct: 237 FKGAVVSDDLDMAAAKQFATPTQCVQAVLDAGADAAMICNHFADMDRVID 286 >UniRef50_A0L847 Beta-N-acetylhexosaminidase n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L847_MAGSM Length = 364 Score = 176 bits (445), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 108/284 (38%), Positives = 158/284 (55%), Gaps = 10/284 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ + G L ER L G++LF+RN P Q+ L+R IR S + V+ +DQ Sbjct: 10 LVVTLAGTRLSEPERAYLQRVEPAGVMLFSRNIEQPEQVVALIRAIRQQSPS-TVIWIDQ 68 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQR R TRLPAA+SFA + ++ +LA++ G L E+ A I I+ APV+ Sbjct: 69 EGGRVQRLRCPLTRLPAAESFAKIYAYAPQQAERLARQCGALAGRELRAFGIGINCAPVV 128 Query: 122 DVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 D+ A IG R++ +P +A ++ G+ EAG+ KHFPGHG DSH+ Sbjct: 129 DIPAPGADPVIGNRAFGDNPLSISRLAGAWLQGLQEAGVMGVAKHFPGHGDAMVDSHRSL 188 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQE 238 P E+ ++ + L+ L IM AH++ S +DP RPA+ S LK LRQ Sbjct: 189 PVVHTEIRELLSRTWVPYVQLL--PLLQGIMTAHLVMSALDPLRPATWSSVVLKDWLRQR 246 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 LG+ G+I SD L M A+ G+ +R ++ AGCD +LVC+ R Sbjct: 247 LGYGGLIISDALEM--GALGGNMEDRAARAVRAGCDQLLVCSGR 288 >UniRef50_Q07N54 Beta-N-acetylhexosaminidase n=17 Tax=Alphaproteobacteria RepID=Q07N54_RHOP5 Length = 342 Score = 170 bits (430), Expect = 9e-41, Method: Compositional matrix adjust. Identities = 109/284 (38%), Positives = 157/284 (55%), Gaps = 16/284 (5%) Query: 8 VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-LVVAVDQEGG 66 + G L AEERE + L GLILF RN +P Q+ +LV + R + V +DQEGG Sbjct: 11 IAGTTLTAEEREFIRSALPWGLILFKRNIDNPKQVTKLVSEFRKEIGDADAPVLIDQEGG 70 Query: 67 RVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWL----MASEMIAMDIDISFAPVL 121 RVQR + PA F++L ++ +L A WL +A+++ + I + P+ Sbjct: 71 RVQRLAPPHWPAYPAGALFSSLYDIDS--QLGLSAAWLSARLIAADLAELGITVDCLPLA 128 Query: 122 DV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV A IG+R+Y D K AIA DG+ + G+ KH PGHG TADSH Sbjct: 129 DVPVAGADAVIGDRAYGTDQAKVAAIAQSVADGLTQGGVLPVLKHIPGHGRATADSHLRL 188 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQE 238 P + E+ D S F +L + L M AHV++S +DP +PA+ S ++ V+R Sbjct: 189 PVVDVSEEELDQTDFSPFQTL---SALPMAMTAHVVFSAIDPAQPATTSATIIERVIRGL 245 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 +GF G++ +DD+SM A+ GS AER +ASL AGCD++L CN + Sbjct: 246 IGFQGLLMTDDVSMN--ALAGSIAERCRASLAAGCDVVLHCNGK 287 >UniRef50_B3TAA3 Putative glycosyl hydrolase family 3 N terminal domain protein n=2 Tax=environmental samples RepID=B3TAA3_9ARCH Length = 392 Score = 168 bits (426), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 111/312 (35%), Positives = 162/312 (51%), Gaps = 16/312 (5%) Query: 9 EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA-----ASRNRLVVAVDQ 63 EG LDA E++ +GGLILF RNY +P QL +L +++ A L ++VDQ Sbjct: 7 EGTTLDARTEELIVQQGIGGLILFERNYKNPDQLLQLTHDLQSLAAGHAEIPPLFISVDQ 66 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGGRV R + FT+ P G + +LA G M E+ A+ I++ +APVLDV Sbjct: 67 EGGRVARLGKPFTQFPPMSCL----GQADSVELAYRFGLGMGKELRAVGINMDYAPVLDV 122 Query: 124 --GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG R+ DP K + + I G ++AG+ GKHFPGHG + DSH P Sbjct: 123 HSNPENPIIGLRALDTDPAKVARLGAELIRGFYDAGVIPVGKHFPGHGDTSQDSHLTLPQ 182 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 R + + ++ F I++ L+ IM AHV+Y D R PA+ S L VLR L Sbjct: 183 VERARESLEKSELIPFIHAIQQG-LEVIMTAHVVYPAWDARQPATFSRSILNDVLRNSLQ 241 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL-SPIKAER 299 F+G+I SDDL M+ AI + AG D+ L+C++ + ++ D++ I+ R Sbjct: 242 FEGLIISDDLDMQ--AIDEKLESIPELGTRAGVDLFLICHDLEKVCTLQDSMIRDIETAR 299 Query: 300 VTRLYHKGSFSR 311 + R K S R Sbjct: 300 IPRTTIKQSLGR 311 >UniRef50_C6HUU9 Beta-N-acetylhexosaminidase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HUU9_9BACT Length = 321 Score = 166 bits (421), Expect = 8e-40, Method: Compositional matrix adjust. Identities = 104/293 (35%), Positives = 162/293 (55%), Gaps = 10/293 (3%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEG 65 + + G +L A +RE L G+++F N + AQ++ L+ +IR+ + ++AVDQEG Sbjct: 10 VSLPGPDLTAADREFLERVDPLGVVIFRENVRNAAQVKSLIEEIRSVPGSPRLLAVDQEG 69 Query: 66 GRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV-- 123 GRV R +G LPA + A E +EAG+ + + + D+ FAPVLDV Sbjct: 70 GRVARLSDGVPALPAMKILGAQPERE-----IEEAGFSLGEALHRLGFDVDFAPVLDVDS 124 Query: 124 GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + IG+R++ +DP++A A F G+ AG+ GKHFPGHG + DSH P Sbjct: 125 NPQNPIIGDRAFSSDPEEAGVKALAFARGLLRAGILPCGKHFPGHGDTSLDSHVALPVVD 184 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFD 242 P + + A+++ F I E ++ +M AHV+Y +D PA+ S + +LR++L + Sbjct: 185 APLSLLEARELVPFRMAI-EREIPLMMTAHVVYPSIDKAVPATLSRAVVTGILREKLNYG 243 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 G++ SDDL M A G E AS+DAGCD +LV ++ A++V D L I Sbjct: 244 GLVLSDDLLMAAVA-RGGVVEAALASIDAGCDGLLVLKSQSQALAVADRLEEI 295 >UniRef50_B8FZS5 Glycoside hydrolase family 3 domain protein n=2 Tax=Desulfitobacterium hafniense RepID=B8FZS5_DESHD Length = 426 Score = 165 bits (418), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 91/286 (31%), Positives = 166/286 (58%), Gaps = 15/286 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR---AASRNRLVVA 60 VM+ V+GYE++A ++++ + VGG +L +N D Q+ L+ ++ +R L +A Sbjct: 78 VMVGVDGYEINANAQQLIENYHVGGFVLLKKNVRDSGQMLNLINTLKETNGVNRIPLFLA 137 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +D+EGGR+ R F ++P++Q A + G L ++ G ++ E+ ++++FAPV Sbjct: 138 LDEEGGRISRMPAEFKKMPSSQQVGA----QNSGALGKKMGEILGREVKGFGMNVNFAPV 193 Query: 121 LDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ + IG+R++ ++P+ + + + G+ E G+ + KHFPGHG + DSH Sbjct: 194 LDIFSNPKNKVIGDRAFGSNPELVSKVGIQTMRGIQEQGIISVVKHFPGHGDTSVDSHVG 253 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQ 237 P +R ++ F+ I N +DAIM AH++ +DP PAS S ++ +LR+ Sbjct: 254 LPRVDYDLERLRNFELRPFAEAI-ANDVDAIMLAHILLPKLDPDYPASFSEVLIRDILRK 312 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSY--AERGQASLDAGCDMILVCNN 281 E+ ++GV+ +DD++M AI+ +Y E S+ AG D++LVC++ Sbjct: 313 EMDYNGVVITDDMTM--GAIVENYNIGEAAVKSILAGSDIVLVCHD 356 >UniRef50_B0UI83 Beta-N-acetylhexosaminidase n=3 Tax=Alphaproteobacteria RepID=B0UI83_METS4 Length = 341 Score = 164 bits (416), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 107/283 (37%), Positives = 150/283 (53%), Gaps = 10/283 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR-AASRNRLVVAVD 62 ++L G L AEE G ILF RN P ++R L +R R + +D Sbjct: 6 LILGCAGRTLSAEEAAFFRAVQPWGFILFKRNIGTPEEVRALTASLRDTVGRADAPILID 65 Query: 63 QEGGRVQRFREGFTR-LPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 QEGGRVQR R PAA F G EEG LA+ LMA ++ + I + APVL Sbjct: 66 QEGGRVQRMGPPHWRAYPAAARFLQAGGPEEGPALARLGARLMAHDLAGVGISVDCAPVL 125 Query: 122 DV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV +A IG+R+Y DP++ A +G+ G+ KH PGHG ADSH Sbjct: 126 DVPVAGANAVIGDRAYADDPREVAAFGRAVAEGLMAGGVLPVIKHVPGHGRANADSHLAL 185 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQE 238 P P+A + A+D + F +L L M AHV+++ +DP +PA+ S ++ V+R Sbjct: 186 PVVTAPRAALEAQDFAPFRAL---ADLPLAMSAHVVFTALDPEQPATTSATVIREVVRGA 242 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 LGFDG++ +DDLSM+ A+ GS+ R +A+ AG D+ L CN Sbjct: 243 LGFDGLLMTDDLSMK--ALSGSFRGRTEAAFRAGIDVALHCNG 283 >UniRef50_Q0EWG5 Glycoside hydrolase, family 3-like protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EWG5_9PROT Length = 294 Score = 160 bits (406), Expect = 5e-38, Method: Compositional matrix adjust. Identities = 100/281 (35%), Positives = 152/281 (54%), Gaps = 9/281 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ +EG EL +ER L G+ILF RN P Q++ L+ ++RA + A+D+ Sbjct: 6 LIIGLEGTELTEQERIWLKEKPPLGVILFARNTESPEQVKALLAEVRACTGQATWAAIDE 65 Query: 64 EGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFAPV 120 EGGRV R F + A + + G + + ++ + + + APV Sbjct: 66 EGGRVNRMPWAPFNQRRHAAEYGVMYERNAGTAVQAVYQDSFVAGEALKELGFTHNCAPV 125 Query: 121 LDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ H IG+R+Y AD + A+A+ + G+H+AG++ GKHFPGHG ADSH Sbjct: 126 LDLFHADGHDIIGKRAYGADVAQVAALATACMRGLHDAGIEAVGKHFPGHGRANADSHVA 185 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P P A I ++ F+ LI E L +M AHVIY+D A+ SP+W + VLR+ Sbjct: 186 VPEVHAPLASI-LEEAEPFARLIAEG-LKHVMTAHVIYTDAVKDVATVSPFWARHVLRER 243 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC 279 +GF G ++SDDL M+G + A+L AGCD++LVC Sbjct: 244 MGFFGSVWSDDLCMKG--VGEDVPSAASAALKAGCDILLVC 282 >UniRef50_D2MLA9 Beta-N-acetylglucosaminidase n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MLA9_9BACT Length = 354 Score = 160 bits (405), Expect = 6e-38, Method: Compositional matrix adjust. Identities = 101/284 (35%), Positives = 150/284 (52%), Gaps = 9/284 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-LVVAVD 62 +M+ +G EL + L GG ILF+RN DP Q+ L ++A + N L++A+D Sbjct: 11 LMVGFQGTELSRAFLDWLQECRPGGAILFSRNLVDPEQVARLTNALQAHAPNSPLLIAID 70 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 +EGGRV R +GFT PAA AA + A+ + A E+ A+ I+++ APVLD Sbjct: 71 EEGGRVSRLPQGFTAFPAASKVAACHSADTAYGTAE----VTAQELRAVGINMNLAPVLD 126 Query: 123 VGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG+R+Y P++ + G+ + G+ GKHFPGHG DSH P Sbjct: 127 VNSNPDNPIIGDRAYGTQPERVCTYGMAVMQGLEDNGIIPCGKHFPGHGDTMTDSHHALP 186 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQEL 239 +A + A ++ F IR L AIM AHV Y +D PA+ S L +LR +L Sbjct: 187 VVEADRARLDAVELEPFRRAIRRG-LPAIMTAHVRYPALDADAPATLSQPILTDLLRNQL 245 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRK 283 GF+GV +DD+ M S E +L AG DM+++C+ ++ Sbjct: 246 GFEGVTLTDDMEMRAILDHQSIGEASVRALQAGADMVVICHQQE 289 >UniRef50_C7IJM6 Glycoside hydrolase family 3 domain protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IJM6_9CLOT Length = 448 Score = 159 bits (403), Expect = 9e-38, Method: Compositional matrix adjust. Identities = 97/298 (32%), Positives = 164/298 (55%), Gaps = 20/298 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELV---RQIRAASRNRLVVA 60 V++ ++GYE D R+++ VGG ILFT N + Q+ EL+ + I A++ L ++ Sbjct: 107 VIVGLDGYEADEHARQMIEEYRVGGFILFTSNIQNSNQMLELLNSLKNINRANKVPLFLS 166 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD+EGGRV R + F ++P+ Q L+ ++ + G ++ E+ + +++ FAPV Sbjct: 167 VDEEGGRVSRLPDEFLKIPSNQIIGKLNN----SSVSHQVGSIIGEELKSFGMNMDFAPV 222 Query: 121 LDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ + IG+R++ P + + + G+ + KHFPGHG + DSH Sbjct: 223 LDINSNPKNPIIGDRAFGTGPNLVSKLGVQTMKGLQSQNIIPVVKHFPGHGDTSTDSHVG 282 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQ 237 P +R ++ F+ I EN +A+M AH++ ++DP PAS S + VLR Sbjct: 283 LPKVDNDLKRLRNFELKPFARAI-ENGAEAVMVAHILLPEIDPDNPASFSKTIINDVLRL 341 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQA---SLDAGCDMILVCNN---RKGAVSVL 289 E+ +DGV+ +DD +M AI+ +Y GQA S+ AG D++LVC++ +K +S L Sbjct: 342 EMNYDGVVITDDFTM--GAIVKNY-NIGQAAVKSILAGGDIVLVCHDFEKQKAVISAL 396 >UniRef50_B6ISL4 Beta-glucosidase-like protein n=13 Tax=Alphaproteobacteria RepID=B6ISL4_RHOCS Length = 349 Score = 159 bits (401), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 108/284 (38%), Positives = 149/284 (52%), Gaps = 11/284 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 V G +L A+ER G ILF RN P Q+R L +R A V +DQ Sbjct: 17 VAFGCAGPDLTAQERAFFREADPFGFILFRRNAETPEQVRRLTGDLRDAVGRDAPVLIDQ 76 Query: 64 EGGRVQRFREG-FTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 EGGRV R R + PAA+ L+ E G + A G L+A + + ID+ APV Sbjct: 77 EGGRVARLRPPHWPEFPAARRIGDLAERDPEAGARAAWLDGRLLAHMLADVGIDVDCAPV 136 Query: 121 LDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 D+ + IG+R++ ADP +A +G+ G+ KH PGHG +ADSH+E Sbjct: 137 ADLPVPGSHDVIGDRAFGADPGLVARLARAQAEGLLAGGVLPVVKHLPGHGRASADSHRE 196 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 P +A + A D + F +L + L M AHV+Y+ VDP PAS S ++TV+R Sbjct: 197 LPVVTAERAVLEATDFAPFRAL---SDLPCGMAAHVVYTAVDPAMPASTSAAVIRTVIRG 253 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 +GF G++FSDDLSM+ A+ G ER +A L GCD+ L CN Sbjct: 254 WIGFQGLLFSDDLSMQ--ALSGGPGERARAVLAGGCDVALHCNG 295 >UniRef50_B5Y6T8 Beta-N-Acetylglucosaminidase n=2 Tax=Firmicutes RepID=B5Y6T8_COPPD Length = 392 Score = 158 bits (400), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 93/274 (33%), Positives = 150/274 (54%), Gaps = 15/274 (5%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASR-NRL--VVAVDQEGGRVQRFREGFTRLPAAQ 82 VGG+ILF+ N D QL L+ I+A + N+L +++ DQEGGR+ R T+ P+ Sbjct: 64 VGGVILFSENIQDDKQLLALLNGIKALNEGNKLPILISTDQEGGRISRLPSRATKFPSN- 122 Query: 83 SFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIGERSYHADPQ 140 + G L+ + G ++ +EM A ++ FAPVLDV + IG+RS+ ++P Sbjct: 123 ---LIIGKRNSATLSYQIGSILGTEMKAYGFNMDFAPVLDVVTNPQNKVIGDRSFGSNPD 179 Query: 141 KALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSL 200 + + G+ +G+ KHFPGHG + DSH+E P +R+ ++ F Sbjct: 180 LVSTLGVATMKGISSSGVIPVVKHFPGHGDTSVDSHQELPVSNATLPHLRSFELLPFEKA 239 Query: 201 IRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIM 258 I +N +M AH+ +++VDP PAS S ++ +LR ELGF GV+ +DDL ME Sbjct: 240 I-QNGASVVMVAHIKFANVDPSGLPASLSEVFISDILRNELGFSGVVITDDLHMEAITKH 298 Query: 259 GSYAERGQASLDAGCDMILVCNN---RKGAVSVL 289 S + ++ AG DM+L+C++ +K A++ L Sbjct: 299 YSVGDAAIKAVQAGADMVLICHSLDEQKQAINAL 332 >UniRef50_A8F7W1 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F7W1_THELT Length = 502 Score = 158 bits (400), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 96/287 (33%), Positives = 152/287 (52%), Gaps = 9/287 (3%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ ++G EL E +L G +ILF+RN P QL +L++++R + +V ++DQE Sbjct: 12 MVGIDGTELTDEAARVLEQVKPGFIILFSRNIESPEQLYQLIKELRKVVKKPVVFSIDQE 71 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG V R R GFT A + AA E L AG ++A EM ++ I + APVLD+ Sbjct: 72 GGIVTRLRNGFTVSCGAMALAATKD-ENNAYL---AGKILALEMKSLGITWNLAPVLDIN 127 Query: 125 H--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + IG RS+ PQK + A F G+ E + GKHFPG G+V D+H + P Sbjct: 128 DNPNNPGIGVRSFGDTPQKVIKFAKAFYKGLSEESVAACGKHFPGKGSVEVDAHLDMPVL 187 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELG 240 + E++ ++ F L++E L++IMP+H+ + P PA+ S L +LR+ L Sbjct: 188 QKSLEELKRWELIPFVELMKEG-LESIMPSHIHLPQLQPDRLPATVSKEILTDLLRKTLN 246 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + G+ +DDL M G S + S +G D++ +C+ + +S Sbjct: 247 YHGIAVADDLLMGGITKAMSVEDAVINSFKSGMDVLSLCHEPQAQIS 293 >UniRef50_C5CGK3 Glycoside hydrolase family 3 domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CGK3_KOSOT Length = 528 Score = 155 bits (392), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 110/339 (32%), Positives = 172/339 (50%), Gaps = 30/339 (8%) Query: 16 EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-RLVVAVDQEGGRVQRFREG 74 ++ ++L V G ++F RN D Q++ELV QI+ + VAVDQEGG V R R+ Sbjct: 44 DQIQLLIDKGVRGFVIFGRNVKDKDQVKELVEQIKTHCDDIPPFVAVDQEGGMVARIRDI 103 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGE 132 F P A +++ E L + G+L + + I++ FAPVLDV + IG Sbjct: 104 FVP-PNAMLLGSVNDTE----LTYKVGYLTGKALRDLGINMDFAPVLDVNSNPFNPIIGV 158 Query: 133 RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAK 192 RS+ +P + F G+ AG+ GKHFPGHG DSH P +P++ Sbjct: 159 RSFWNEPTRVATNGVAFFKGLLSAGVIPVGKHFPGHGDTDTDSHTSLPVINKPESNFIET 218 Query: 193 DMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSM 252 D+S F + ENK+ A+M AH++ D PA+ S + + LR++LGFDGVI SDD+ M Sbjct: 219 DVSPFKIAV-ENKIPALMTAHILLPFWDNVPATISKKII-SYLREKLGFDGVIISDDMLM 276 Query: 253 EGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQ 312 + + S E + SL AG DM ++ + + + V A+ + + G+ + Sbjct: 277 KAVSEGKSVKEAVKQSLLAGVDMFIIWKDLETQLEV--------ADYILQEVKNGNIP-E 327 Query: 313 ELMDSA-------RWKAISTRLN----QLHERWQEEKAG 340 E++D+A R A ++ LN Q+ + W+E G Sbjct: 328 EVIDNAVKRITKLRLLAYASELNELDLQVEKIWEEISDG 366 >UniRef50_C1XLF1 Beta-glucosidase-like glycosyl hydrolase n=2 Tax=Meiothermus RepID=C1XLF1_MEIRU Length = 511 Score = 154 bits (388), Expect = 6e-36, Method: Compositional matrix adjust. Identities = 97/277 (35%), Positives = 151/277 (54%), Gaps = 9/277 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +M+D+ G LDA R L GG+ LF +N +P Q+R+LV +IRA + +A+DQ Sbjct: 11 MMVDIPGPTLDAATRAHLERYRFGGVCLFRKNIQNPEQVRQLVAEIRAVLGPQAWIAIDQ 70 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG VQR E F P+ + A+ + K A+ G + +I++ I+ +FAP +DV Sbjct: 71 EGGAVQRVLE-FALAPSPMALGAVGEV----KTAEAVGAAVGRALISLGINWNFAPSVDV 125 Query: 124 GH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG+RS+ +DP+K ++ + G+ +AG+ KHFPGHG DSH E P Sbjct: 126 NTNPQNPVIGDRSFGSDPKKVARLSLAWARGLEQAGVMAAIKHFPGHGDTFLDSHLELPV 185 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 + E+ ++ F + E ++ +IM AH+ + +D + PA+ S L LR++LG Sbjct: 186 VNKTLEELERTELYPFRKAV-EAQISSIMSAHIRFPALDKQHPATLSKKILTGFLRKKLG 244 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 F G+I +D L M+ + E SL AG DMIL Sbjct: 245 FQGIIVTDALDMKAISKNYPIGEAAVKSLQAGADMIL 281 >UniRef50_Q1GI78 Beta-N-acetylhexosaminidase n=16 Tax=Rhodobacterales RepID=Q1GI78_SILST Length = 353 Score = 152 bits (385), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 106/307 (34%), Positives = 148/307 (48%), Gaps = 29/307 (9%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 G +LD EG L A+E+ + G ILF RN QLR L + R A+ + ++ Sbjct: 3 FGATILDAEGLRLTADEKALFREAAPFGFILFARNIESGDQLRALCDEFREAAGHDCLIT 62 Query: 61 VDQEGGRVQRFREGFTR--LPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +DQEGGRVQR R R PA A S L L+A+E+ ID + A Sbjct: 63 IDQEGGRVQRLRAPLAREFRPALDHVEAASDPVRAMYLRAR---LIAAELRDYGIDSNCA 119 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P+ DV + R Y ++ + AIA +G + G+ KH PGHG T DSH Sbjct: 120 PLADVATAETHPFLRNRCYGSNVESVAAIARACAEGHLDGGVVPVMKHIPGHGRSTMDSH 179 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 + P P +RA+D + F+ L L M AH++Y DPRPA+ SP ++ ++R Sbjct: 180 HDLPHVTAPADLLRAEDFATFAKL---RDLPMAMTAHLVYDAFDPRPATLSPVMME-IIR 235 Query: 237 QELGFDGVIFSDDLSM----------EGAAIMGSY--------AERGQASLDAGCDMILV 278 ELG+DG++ +DD+SM E G Y A + SLDAGCD+ L Sbjct: 236 NELGYDGLVMTDDISMKALWQDAAQDEARTAGGPYPDDEVTWSARLARQSLDAGCDVALF 295 Query: 279 CNNRKGA 285 CN+ A Sbjct: 296 CNSSLAA 302 >UniRef50_Q10Y93 Beta-N-acetylhexosaminidase n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10Y93_TRIEI Length = 308 Score = 151 bits (382), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 91/303 (30%), Positives = 158/303 (52%), Gaps = 11/303 (3%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-LVV 59 M P++ + G L E+++ G ILF RN P Q+++L +R + + + Sbjct: 1 MLPIIFSLAGTILSEAEKQLFTRSQPAGFILFQRNCEHPTQVKQLTDTLRKCINQKDIPI 60 Query: 60 AVDQEGGRVQRFREG-FTRLPAAQSFAA-LSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 +DQEGGRV R + + P+A S+ ++ + + A+ +A ++ + I+++ Sbjct: 61 LIDQEGGRVARLQPPHWWVAPSASSYLQEINDLNDAAHRAKIDAKHIAQDLQNIGINVNC 120 Query: 118 APVLD--VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 +PVLD + IG R+Y P + + +A I+G E G+ KH PGHG DS Sbjct: 121 SPVLDCPIPEADLVIGTRAYSNSPSEIITMAKAVIEGFQEEGVMPVIKHIPGHGRALVDS 180 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTV 234 HK P E+ D F +L + + M AHVIY+ +D A+ SP ++ + Sbjct: 181 HKALPVVETSLKELSECDFLPFKALAKS--VTWAMSAHVIYTAIDSENCATVSPTIIQEI 238 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL-DNLS 293 +R+++GF+G++ +DDL+M+ A+ G + SL AGCD+ L C+ + + L DN++ Sbjct: 239 IRKQIGFEGILVTDDLTMK--ALSGDLVTKVNRSLKAGCDLALYCDGKLEDMEFLADNIT 296 Query: 294 PIK 296 P+K Sbjct: 297 PVK 299 >UniRef50_Q8ETJ0 Beta-hexosaminidase n=1 Tax=Oceanobacillus iheyensis RepID=Q8ETJ0_OCEIH Length = 424 Score = 151 bits (381), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 99/325 (30%), Positives = 161/325 (49%), Gaps = 24/325 (7%) Query: 8 VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR---LVVAVDQE 64 ++G E++ + ++++ + VGG+ILF N AQ L+ I+ + N L++ VD+E Sbjct: 65 IDGTEINTDTKDMIENYHVGGVILFADNIESKAQTVNLMNDIKQVNENNPHPLLLGVDEE 124 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG V R + T LP + S G + +LA E G L+ +M + +++FAPVLDV Sbjct: 125 GGSVTRMPDEITSLPPSGSI----GKAQDEELAFEVGILLGQQMRGLGFNLNFAPVLDVN 180 Query: 125 HI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + IG+RS+ P + + + G+ G+ + KHFPGHG + DSH + P Sbjct: 181 SNPDNPVIGDRSFGDSPDIVTDMGIQSMKGIQSEGIISVVKHFPGHGDTSEDSHLKLPKV 240 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGF 241 + E+ ++ F I + D +M AH++ +D PAS S + +LR+++GF Sbjct: 241 DKSIEELSEVELVPFEKAISKGA-DVVMTAHILLPQIDTEYPASMSEKIISGLLREDMGF 299 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI------ 295 DGV+ SDDL+M E S+ AG DM+L+ ++ VSV L Sbjct: 300 DGVVISDDLTMGAITENIRIEEAAIQSVKAGSDMVLIAHHPDVVVSVHKKLKAAVQNGVI 359 Query: 296 -------KAERVTRLYHKGSFSRQE 313 ER+ +L K S +E Sbjct: 360 SEAKIDESVERIIQLKRKYKLSNEE 384 >UniRef50_C9D3C0 Beta-N-acetylhexosaminidase n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9D3C0_9RHOB Length = 364 Score = 151 bits (381), Expect = 4e-35, Method: Compositional matrix adjust. Identities = 109/314 (34%), Positives = 151/314 (48%), Gaps = 40/314 (12%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 G +LD EG L A+E+ + G ILF RN Q+R L RAA+ + ++ Sbjct: 3 FGATILDAEGLRLTADEKALFREAAPYGFILFARNIESADQVRALCDDFRAAAGHDCLIT 62 Query: 61 VDQEGGRVQRFREGFTR--LPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +DQEGGRVQR R R PA A S L L+A+E+ + ID + A Sbjct: 63 IDQEGGRVQRLRAPLAREWRPALDHVQAASDPMRAMYLRAR---LIAAELHDLGIDSNCA 119 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P+ DV + R Y D AIA +G + G+ KH PGHG T DSH Sbjct: 120 PLADVTGAGTHPFLRNRCYGTDVANVAAIARACAEGHLDGGVVPVMKHIPGHGRSTMDSH 179 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 + P P +RA+D + F+ L L M AH++Y DPRPA+ SP ++ V+R Sbjct: 180 HDLPHVTAPVEVLRAEDFAAFAQL---RDLPMAMTAHLVYDAFDPRPATLSPVMMQ-VIR 235 Query: 237 QELGFDGVIFSDDLSMEG------------AAIMGSYAE-RGQ----------------A 267 QELG+DG++ +DD+SM+ ++ G A RG+ A Sbjct: 236 QELGYDGLVMTDDISMKALWAPELQKPGHRPSLGGDEAPYRGEGLRPDDPDMAAGAIARA 295 Query: 268 SLDAGCDMILVCNN 281 SLDAGCD+ L CN+ Sbjct: 296 SLDAGCDVALFCNS 309 >UniRef50_A9BFQ0 Glycoside hydrolase family 3 domain protein n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BFQ0_PETMO Length = 528 Score = 150 bits (380), Expect = 5e-35, Method: Compositional matrix adjust. Identities = 93/281 (33%), Positives = 148/281 (52%), Gaps = 16/281 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 ++ ++G L+ E ++L L G +I F+RN + QL + + I+ + +DQE Sbjct: 13 LIGIQGTSLNNENMKVLKSTLPGVIIFFSRNIENKYQLSKFIEDIKNFLDYEPLFCIDQE 72 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG V R ++GFT +P+A A + E A + L+A EM+A+ ID + APV+D+ Sbjct: 73 GGTVARLKKGFTVVPSAMGITATNEAEN----AYLSANLLAKEMLAVGIDWNLAPVVDIN 128 Query: 125 H--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + ++ IG RS+ D L A ++ G+H+ G+ + KHFPG G V D H + Sbjct: 129 NNPKNSVIGIRSFSDDKNVVLKFAREYVKGLHDGGVLSCLKHFPGIGNVNTDPHLDL--- 185 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLD--AIMPAHVIYSDVDPR--PASGSPYWLKTVLRQE 238 PQ E+ +KD + + L +D + MP HV S + + PAS S L + R+E Sbjct: 186 --PQGEL-SKDDLLSTELFPFLNIDSPSWMPTHVYLSKIQNKKEPASLSEEILTGLAREE 242 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC 279 L + GV+ +DD M G A S E SL+AG D++ +C Sbjct: 243 LKYKGVLVADDFEMGGVANFYSAQEAVIKSLNAGMDIVSIC 283 >UniRef50_Q0FTI5 Beta-N-acetylhexosaminidase, putative n=5 Tax=Rhodobacterales RepID=Q0FTI5_9RHOB Length = 335 Score = 150 bits (379), Expect = 6e-35, Method: Compositional matrix adjust. Identities = 101/287 (35%), Positives = 143/287 (49%), Gaps = 16/287 (5%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 G +L EG L AEER A G ILF RN AQ+R L ++R A + Sbjct: 6 FGAAILGPEGLRLSAEERGFFARTDPFGFILFARNLDSRAQIRALTAELRDAVGRDAPIF 65 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWL----MASEMIAMDIDIS 116 +DQEGGRVQR R A + L + G A EA L A E+ + ID + Sbjct: 66 IDQEGGRVQRLRPPL----ATEWLPPLDDVARFGNSAAEAMRLRFLITALELRGLGIDGN 121 Query: 117 FAPVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 AP+LDV + + R Y P++ I G+ + G+ KH PGHG D Sbjct: 122 CAPMLDVARAATHPFLRNRCYGETPERVAEIGRAVAQGLLQGGVLPVIKHLPGHGLAQLD 181 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTV 234 SH + P P+ E+ A D + F + L M AH+++ ++ PA+ P ++ + Sbjct: 182 SHLDLPRISAPREELEAIDFAAFRPF---HDLPLGMTAHLVFEGLNTAPATVDPAMIR-L 237 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 +R E+GFDG + +DD+SM+ A+ G+ ERG ASL AGCD +L CN Sbjct: 238 IRDEIGFDGFLMTDDISMQ--ALSGTVPERGAASLAAGCDCVLHCNG 282 >UniRef50_B9Y750 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y750_9FIRM Length = 571 Score = 150 bits (378), Expect = 8e-35, Method: Compositional matrix adjust. Identities = 97/323 (30%), Positives = 164/323 (50%), Gaps = 15/323 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQ----IRAASRNRLVV 59 V++ E E+D + L +G +I++ +N + Q L Q IR A++ + Sbjct: 59 VIVSFETTEVDTKTEAWLRTNKIGNVIVYAKNIENAEQATMLTGQLQSTIRGATQIPAFI 118 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +DQEGG V R REG T P+ + AA G E W MA E+ M +++FAP Sbjct: 119 GIDQEGGMVNRVREGVTIFPSPMAIAA--GRHEN---LYSLAWSMADELSGMGFNMNFAP 173 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV + I RSY DP+ + AS +I G+ E G+ + KHFPGHG + DSH Sbjct: 174 VLDVNSNPDNPVIHLRSYSDDPKAVASFASTWIKGLQEGGVVSVAKHFPGHGDTSEDSHF 233 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVL 235 P + +++ ++ F + IR + + AIM +H+++ ++ PA+ S + +L Sbjct: 234 ALPTVNKTLDQLKETELIPFETAIR-SGVSAIMTSHILFPKIEKEKIPATLSKTIITDLL 292 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 + ELG++G++ SD L M+ AE ++ AG DM+++ + + D +P+ Sbjct: 293 KDELGYNGIVISDSLQMDAIQSHYGMAEAAVQAIQAGVDMLILGDGKVLQPDSEDVQTPV 352 Query: 296 KAERVTRLYHKGSFSRQELMDSA 318 E + ++G+ S + L D+ Sbjct: 353 -IEALIEAVNQGTISAERLDDAV 374 >UniRef50_Q1CXY4 Glycosyl hyrolase, family 3 n=6 Tax=Cystobacterineae RepID=Q1CXY4_MYXXD Length = 369 Score = 149 bits (376), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 104/305 (34%), Positives = 156/305 (51%), Gaps = 15/305 (4%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ G +D+E ++ + G ILF RN A L ++ + +++VDQE Sbjct: 16 MVGFPGTHIDSELAALMDDGIYGA-ILFKRNVGSAADTAALCHALKTRAGRPFILSVDQE 74 Query: 65 GGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 GGRV R R FT LP + G + G L+A E+ A+ D FAPVLDV Sbjct: 75 GGRVARLRGAPFTALPPMREL----GQRGDAAQVERVGRLLAYELRALGFDWDFAPVLDV 130 Query: 124 GHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 A IG+RS+ + ++ + G+ G+ + GKHFPGHG T DSH P Sbjct: 131 DTNPANPVIGDRSFSREAEEVARLGVALARGLEAGGVASCGKHFPGHGDTTTDSHLTLPR 190 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELG 240 P +R+ ++ F + + L ++M AHV++ +DP PA+ S L+ VLR+ELG Sbjct: 191 LPHDLERLRSVELVPFRAFAQAG-LASLMTAHVLFDALDPGVPATMSQRVLQGVLREELG 249 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN---RKGAVSVLDNLSPIKA 297 FDGV+ SDDL M+ A S E AG D+ LVC+N ++ A+ L + +++ Sbjct: 250 FDGVLVSDDLEMKAIAGHYSVEEATVQGTLAGVDLFLVCHNADVQRRAIEAL--VKAVES 307 Query: 298 ERVTR 302 RV+R Sbjct: 308 GRVSR 312 >UniRef50_Q6MRE1 BglX2 protein n=2 Tax=Bdellovibrio bacteriovorus RepID=Q6MRE1_BDEBA Length = 370 Score = 148 bits (374), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 84/286 (29%), Positives = 148/286 (51%), Gaps = 13/286 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA-----ASRNRLV 58 + + V G+ L A+E++ + +GG+ LF RN +P Q+REL +I++ + L Sbjct: 11 MFIGVSGHALTADEKKFIVENNIGGVCLFGRNVAEPKQVRELCAEIQSLRHKQVDKAPLF 70 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 + +D EGGRV R + FT P + L A G EM A+ I++ FA Sbjct: 71 IGIDMEGGRVHRLKAPFTVWPPLRKLGDLDAPTVSFHFANRMGL----EMKAVGINLDFA 126 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P +D+ + IG+RS +DP+ AS + G ++ + T KHFPGHG DSH Sbjct: 127 PCVDIFTNPGNTVIGDRSISSDPEMVAKHASALVRGYIKSDVITCAKHFPGHGNTIVDSH 186 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 ++ P + + + ++ F + +++D +M +H+ + +DP P + S +++ ++ Sbjct: 187 EDLPVENTDLERLESCELIPFKKTFK-SRVDMVMTSHIKFPKIDPEWPVTLSETFVRKMI 245 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 R+E+ + G+I +DDL M+ E +L AG D++L CN+ Sbjct: 246 REEMRYRGLIITDDLGMKAMTKHYGIEEVPVRALKAGVDLLLYCND 291 >UniRef50_D0LSI1 Glycoside hydrolase family 3 domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LSI1_HALO1 Length = 387 Score = 148 bits (373), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 110/296 (37%), Positives = 160/296 (54%), Gaps = 30/296 (10%) Query: 10 GYELDAE----EREILAHPLVGGLILFTRNY-----HDPAQLRELVRQIRAASR---NRL 57 GYE D +REI A LVGG++LF RN H LR + R++ ++ L Sbjct: 18 GYEGDYPPADMQREIAAG-LVGGVLLFKRNLPPEDEHFAPALRAINRRLHQSASGDATPL 76 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALS---GMEEGGKLAQEAGWLMASEMIAMDID 114 +A+DQEGG VQR R+ T P SF AL+ G + LA+ G + E+ A++ID Sbjct: 77 WIAIDQEGGTVQRVRQPGTVWPPMLSFDALAEARGDQYAIALAERVGLALGHELAALEID 136 Query: 115 ISFAPVLDVGHISAA---IGERSYHADP----QKALAIASRFIDGMHEAGMKTTGKHFPG 167 + FAPVLDV H + A IG+R+ +DP Q+ LA+A G+ AG+ + GKHFPG Sbjct: 137 VDFAPVLDV-HTNPANPVIGKRALSSDPEQVAQRGLALAR----GLERAGVVSCGKHFPG 191 Query: 168 HGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASG 226 HG DSH P A + A ++ F+ + + IM AHV+++ +D PA+ Sbjct: 192 HGDTDTDSHLALPRLTHGSARLDAIELLPFARAAAAD-IPMIMTAHVVFAALDDTVPATL 250 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 S + +LR+ LG+ G+I SDDL M A E ++ AGCD++L+C +R Sbjct: 251 SHAVITGLLRRRLGYQGLIVSDDLDMRAIAAHFGIGEAAVMAVRAGCDVLLLCRDR 306 >UniRef50_B2UPP0 Beta-N-acetylhexosaminidase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPP0_AKKM8 Length = 353 Score = 147 bits (372), Expect = 4e-34, Method: Compositional matrix adjust. Identities = 95/285 (33%), Positives = 147/285 (51%), Gaps = 10/285 (3%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M P ++ + G+E+ AEE + G ILF+RN Q+R L +R + V+A Sbjct: 1 MLPALIGISGHEVGAEEEAAIRRLQPAGFILFSRNIDSVEQVRGLTESLRKLCLHHPVIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRV R LP+ S A L + GG + E G + A + + ++++FAPV Sbjct: 61 VDQEGGRVVRTASLGLNLPSPASLARLGSV--GGIV--ELGAVTALALRYLGVNLNFAPV 116 Query: 121 LDVGH---ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 LD+ H + A+ R + + Q ++ + + G+++ GKHFPG G AD H Sbjct: 117 LDICHDPSAANALPGRCWGDNAQDVISRGGVYASNLRRGGVQSCGKHFPGMGRALADPHF 176 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLR 236 P + E+ D+ F +L L +IM AH++ +DP PA+ S ++ +LR Sbjct: 177 SLPVIGLDERELFKTDLLPFLALC--PALSSIMSAHIMLPQIDPDYPATLSERVIRGLLR 234 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 LGF GV+F+DDL M S + SL AGCD+ L+C++ Sbjct: 235 DRLGFRGVVFTDDLCMGAITTQYSPDDAAFLSLKAGCDLPLICHD 279 >UniRef50_Q67SQ8 Putative beta-N-acetylglucosaminidase n=1 Tax=Symbiobacterium thermophilum RepID=Q67SQ8_SYMTH Length = 523 Score = 147 bits (370), Expect = 8e-34, Method: Compositional matrix adjust. Identities = 98/264 (37%), Positives = 142/264 (53%), Gaps = 13/264 (4%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRN-RLVVAVDQEGGRVQRFREGFTRLPAAQSF 84 VGG+ILF RN Q R L +I++ S + L++AVDQEGG V R T P A S Sbjct: 39 VGGVILFARNLVSADQARRLTARIQSCSPDLPLLIAVDQEGGIVLRL--AVTPWPGAMSL 96 Query: 85 AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERSYHADPQKA 142 A + L + G +A E+ AM I+++ APVLDV A IG RS+ +DP + Sbjct: 97 GAAGSPD----LTRRVGAGIARELRAMGINMNLAPVLDVNVNPANPVIGVRSFGSDPARV 152 Query: 143 LAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIR 202 + +I G H+ G+ T KHFPGHG DSH P + A ++ F I Sbjct: 153 AEHGAAYIRGHHDEGVLATAKHFPGHGDTDVDSHLALARVPHGMDRLEAVELVPFRWAIA 212 Query: 203 ENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMG 259 +DAIM AHV++ V+P RPA+ SP L +LR++LGFDG++ +D + M+ A Sbjct: 213 AG-VDAIMTAHVVFPAVEPDPDRPATLSPAVLTGLLREQLGFDGLVITDCMEMKAIADHA 271 Query: 260 SYAERGQASLDAGCDMILVCNNRK 283 AE ++ AG D++L+ + + Sbjct: 272 GTAEGAVQAIAAGADLVLISHTER 295 >UniRef50_A2EGF0 Glycosyl hydrolase family 3 N terminal domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2EGF0_TRIVA Length = 515 Score = 146 bits (369), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 106/333 (31%), Positives = 166/333 (49%), Gaps = 19/333 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL----VVA 60 M ++ G +LDAE ++ +G LILFT N + ++++L + S ++ Sbjct: 21 MFNLPGKQLDAEIFNLIKKYHIGNLILFTINLGNGKEIKKLTSDFQEYSLKEFGFPAFIS 80 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D EGG V+R+ E R P A + + + + L++ +A ++++ I+++ APV Sbjct: 81 SDMEGGNVERYSEDIPRFPGAAAIVSTNSIANCQDLSES----VAKVLLSLGINMNLAPV 136 Query: 121 LDVGHIS--AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 D+ S IG RS+ DPQK I+G +AG T KHFPGHG+ DSH+E Sbjct: 137 CDINDNSRNPVIGARSFGFDPQKISNYIKAHINGHKKAGCLTCAKHFPGHGSTFQDSHRE 196 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P E KD+ F I E +D+IM AH++ + PAS SP + + R+E Sbjct: 197 LPHIIHKYDEFMKKDIPPFKDAI-EAGVDSIMTAHLLSPLDEENPASLSPKVINYI-RKE 254 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LGFDG++ +D L M+G +I G +A L AG + IL+ N +G + + E Sbjct: 255 LGFDGLLVTDSLVMKGVSIQGIDVAVRKAIL-AG-NNILIANTMEGDFAEFYKM----IE 308 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQLH 331 V G Q + DS R + IS++L H Sbjct: 309 GVLNDVKSGVIPEQLIDDSCR-RIISSKLKLKH 340 >UniRef50_C6XIZ0 Glycoside hydrolase family 3 domain protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XIZ0_HIRBI Length = 358 Score = 146 bits (368), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 102/284 (35%), Positives = 152/284 (53%), Gaps = 16/284 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +L V G EL EE LA G+IL R+ Q+R LV I +A +V +DQ Sbjct: 7 CILSVSGPELLDEEAIFLAQHNPWGIILMGRSVKSRKQVRALVDAIWSALGRPCLVFIDQ 66 Query: 64 EGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWL----MASEMIAMDIDISFA 118 EGGRV+R R + PA +++A L ++ +L +EA WL MA+E+ + I A Sbjct: 67 EGGRVRRMRPPEWPEFPAGEAYARL--YQKSPELGKEAAWLGHRLMAAELQPLGIFADCA 124 Query: 119 PVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P+LD+ H A +G+R++ + PQ+ I S I G+ + G+ + KH PGHG DSH Sbjct: 125 PILDLLHHDAHDIVGDRAFGSSPQQVAEIGSAAIKGLMDGGVASVIKHIPGHGRAKQDSH 184 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASG-SPYWLKTVL 235 + P E+ KD + F++L + M AH+ Y DP A+ S ++ V+ Sbjct: 185 EALPVVDASAEELE-KDFAPFAAL---SDAPMAMTAHIAYQAFDPDVAATISTKVIRDVI 240 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC 279 R +GFDG++ DDL M+ A+ G+ R +AS DAGCD+ L C Sbjct: 241 RGRIGFDGLLMGDDLGMK--ALGGTLESRARASFDAGCDIALHC 282 >UniRef50_B7S2D9 Glycosyl hydrolase family 3 N terminal domain protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7S2D9_9GAMM Length = 567 Score = 145 bits (365), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 107/336 (31%), Positives = 165/336 (49%), Gaps = 25/336 (7%) Query: 21 LAHPLVG----GLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF-REGF 75 L H LV G+ILF N AQ ++L +IR ++A+DQEGGRV R R+ Sbjct: 40 LKHGLVAISPCGVILFRENLASIAQCQQLTAEIRECLPANTLIAIDQEGGRVTRLPRQVS 99 Query: 76 TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGER 133 T + AA E KLA+E G A+E+ A+ I+I+F P LDV A IG R Sbjct: 100 TSFSGNMALAACPA-GENEKLAREVGAAQAAELKALGININFVPSLDVNSDPANPVIGVR 158 Query: 134 SYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKD 193 S+ DP ++ S + G+ E G+ KHFPGHG + DSH + PC R + E A D Sbjct: 159 SFGDDPAAVASLGSELLSGLQEGGVAGALKHFPGHGDTSQDSHIDLPCVGRSRDEAFAVD 218 Query: 194 MSVFSSLIRENKLDAIMPAHVIYSDVDPR---------PASGSPYWLKTVLRQELGFDGV 244 ++ FS +I + +M AH+ Y +D PA+ S + +LR+++ FDGV Sbjct: 219 LAPFSRVIAQANPAMVMTAHIQYPALDDSCIAGTDVAVPATLSRAMMTGLLREQMDFDGV 278 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY 304 I +D L M+ + + E AG D+ L+ + + S + L + + V + Sbjct: 279 IITDALDMKAISARMTPTEAVLRCFAAGVDIALMPLLIRSSAS-FNQLQQLVSTAVEAI- 336 Query: 305 HKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 H G +D + +A TR+ L +++ ++ G Sbjct: 337 HSGE------LDESEVRASVTRILALQQKFAPQQPG 366 >UniRef50_Q0C0J7 Beta-hexosaminidase n=2 Tax=Hyphomonadaceae RepID=Q0C0J7_HYPNA Length = 344 Score = 145 bits (365), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 105/304 (34%), Positives = 160/304 (52%), Gaps = 17/304 (5%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +L V G EL E + A G+IL R+ Q+R+L+ I A+ ++ Sbjct: 1 MKACILSVSGPELTPGEAALFAAEDPWGVILMGRSCVSRMQVRKLIADIWNATGRETLIF 60 Query: 61 VDQEGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWL----MASEMIAMDIDI 115 +DQEGGRV R + + P +AAL E L +EA WL +ASE+ A+ I Sbjct: 61 IDQEGGRVARLKAPEWPLFPRGADYAAL--YERDKALGREATWLGHRLIASELAALSIHA 118 Query: 116 SFAPVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 +PV+D+ A IG+R++ +P++A +A ++G+ G+ KH PGHG A Sbjct: 119 DCSPVVDLPVPGAHDVIGDRAFGTEPEQAADLARAALEGLTAGGVAGVIKHIPGHGRSVA 178 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLK 232 DSH E P Q E+ AKD F+ R M AH+ Y DP R A+ S ++ Sbjct: 179 DSHLELPRVTAGQNEL-AKDFDAFA---RVADAPMAMTAHIAYEAYDPGRAATVSKVMIQ 234 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN-RKGAVSVLDN 291 ++R +GFDG++ +DDL M+ A+ GS ER AS+ AGCD++L C+ K A ++L Sbjct: 235 QIIRDRIGFDGLLMTDDLGMK--ALGGSLTERADASIVAGCDVLLHCSGFLKDAGAILAE 292 Query: 292 LSPI 295 ++ + Sbjct: 293 MTEV 296 >UniRef50_A7BX55 Glycosyl hydrolase, family 3 n=4 Tax=Proteobacteria RepID=A7BX55_9GAMM Length = 436 Score = 144 bits (363), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 107/331 (32%), Positives = 165/331 (49%), Gaps = 40/331 (12%) Query: 26 VGGLILFT---------RNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE--G 74 +GG+ILF RN P Q++ LV++++AAS+ L +A+DQEGG+V+R +E G Sbjct: 123 LGGVILFDYDIVLKSSRRNIKSPRQVKSLVKELQAASQIPLFIAIDQEGGKVKRLKEKFG 182 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD--VGHISAAIG- 131 F R +AQ G + L + MA + + I+ +FAPV+D + + IG Sbjct: 183 FPRTVSAQYL----GQKNNVALTYQQASKMAKTLATVGINFNFAPVVDLNINPNNPVIGK 238 Query: 132 -ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH--KETPCDPRPQAE 188 ERS+ A PQ A FI H G+ + KHFPGHG+ T DSH D +AE Sbjct: 239 LERSFSAYPQTVTKHALEFIKAHHAYGVLCSLKHFPGHGSSTRDSHLGMVDVTDTWNEAE 298 Query: 189 IRAKDMSVFSSLIRENKL---DAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGV 244 + + +I +K+ D+IM AH+ DP PA+ S + +LR+ L +DGV Sbjct: 299 LEP-----YQKIINADKMGIVDSIMIAHIFNRRFDPDYPATLSKRIVTGLLRENLEYDGV 353 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY 304 I SDD+ M+ + Q +L+AG D+I++ NN K ++ I + +L Sbjct: 354 IVSDDMQMKAITSHYRFEVAIQKTLEAGIDIIVIGNNLKYEPDIVTRTVGI----IKQLI 409 Query: 305 HKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 G + AR + R+ QL ++Q Sbjct: 410 QAGKITE------ARIEESYQRIQQLKSQFQ 434 >UniRef50_C5SMQ9 Beta-N-acetylhexosaminidase n=2 Tax=Caulobacteraceae RepID=C5SMQ9_9CAUL Length = 363 Score = 144 bits (363), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 97/277 (35%), Positives = 147/277 (53%), Gaps = 12/277 (4%) Query: 10 GYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-RLVVAVDQEGGRV 68 G EL A E+ A G ILF RN P Q+R L+ ++AA+ + +++ VDQEGGRV Sbjct: 19 GLELSAAEQAFFASKKPLGFILFKRNIDTPEQVRALISTLKAAAGHPDVLILVDQEGGRV 78 Query: 69 QRFREG-FTRLPAAQSFAALSGME-EGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI 126 QRF+ + P A ++ A++ E + + LMA ++ A+ +++ APVLDV Sbjct: 79 QRFKAPHWPSYPPAAAYEAVANSPFEQREYVRMGAQLMAHDLRALGVNVDCAPVLDVPQA 138 Query: 127 SA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 +G+R+Y + + +G G+ KH PGHG DSH E P Sbjct: 139 GTHDVVGDRAYGLTAETVAIMGRAAAEGFLAGGVLPVIKHAPGHGRAQVDSHHELPVVKA 198 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDG 243 + E+ D FS + M AHV+Y+ +D R PA+ S +K V+R+E+GFDG Sbjct: 199 RRDELEKVDFLPFSV---NADMPLAMTAHVVYTALDRRNPATTSKKVIK-VIREEVGFDG 254 Query: 244 VIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 ++ SDD+ M A+ G R +ASL AGCD++L C+ Sbjct: 255 LLLSDDIGM--GALKGPVGARARASLRAGCDIVLHCS 289 >UniRef50_B0K1P2 Glycoside hydrolase, family 3 domain protein n=9 Tax=Clostridia RepID=B0K1P2_THEPX Length = 520 Score = 144 bits (363), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 100/318 (31%), Positives = 163/318 (51%), Gaps = 27/318 (8%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRL----VVAVDQEGGRVQRFREGFTRLPAA 81 +G +ILF+RN QL+ L I+ +VA+DQEGG V R + T LP Sbjct: 33 IGNIILFSRNVKGKEQLQNLCNNIQMNIIKHTGIPALVAIDQEGGMVTRIYKDATYLPGN 92 Query: 82 QSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHADP 139 + AA + +E K+ + AG E+ A+ I+I+FAPVLD+ + + IG RSY +P Sbjct: 93 MAIAATNEVENAYKIGEIAG----KELRALGININFAPVLDINNNLSNPVIGVRSYGENP 148 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSS 199 +K +I G+ + + T KHFPGHG + DSH + P + + ++ F Sbjct: 149 EKVAEFGINYIKGLQKENVIATAKHFPGHGDTSVDSHLDLPLVSYNKERLYRIELYPFKK 208 Query: 200 LIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAI 257 I EN ++AIM AH+++S ++ PA+ S L +LR+EL F+G+I +D + M A Sbjct: 209 AI-ENGVEAIMTAHILFSALEENKLPATLSHNILTNILRKELRFNGLIITDCMEMNAIAK 267 Query: 258 MGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS 317 A+ ++ AG D++LV + +K + + + E V R G S + + +S Sbjct: 268 YFGTAKAAAMAIKAGADIVLVSHTKKLQIEAFNEIK----EAVLR----GEISIERIDES 319 Query: 318 ARWKAISTRLNQLHERWQ 335 R+ QL E+++ Sbjct: 320 VE------RIIQLKEKYK 331 >UniRef50_Q3A4Z0 Putative glycosyl hydrolase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A4Z0_PELCD Length = 382 Score = 144 bits (362), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 108/331 (32%), Positives = 168/331 (50%), Gaps = 50/331 (15%) Query: 4 VMLDVEGYELDAEEREILAHPLV--------GGLILFTR---------NYHDPAQLRELV 46 +M+ G +LD HP++ GG++LF N P QLR L Sbjct: 45 LMVGFAGADLDKH------HPILEDIKQRHLGGVVLFNYGPDRNHPAGNIRSPQQLRRLT 98 Query: 47 RQIRAASRNR-LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMA 105 Q++ A+ N L++A+DQEGGR+ R ++ + P S+A+L G L + A Sbjct: 99 AQLQQAAGNTPLLIAIDQEGGRINRLKQAYG-FPPTVSYASL-GRRNDLALTRRQAQNTA 156 Query: 106 SEMIAMDIDISFAPVLD--VGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTT 161 + I+++ +PV+D V + IG +RS+ A+P ++ A I H+ G+ Sbjct: 157 RTLQQAGINLNLSPVVDLNVNPANPIIGRLQRSFSAEPGTVISHAREVIRAHHQQGVLCC 216 Query: 162 GKHFPGHGAVTADSHK-----ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIY 216 KHFPGHG+ TADSHK PR ++ F++LI E DA++ AHV Sbjct: 217 LKHFPGHGSSTADSHKGFVDVTNTWSPR--------ELLPFATLIDEGLADAVLTAHVFN 268 Query: 217 SDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDM 275 S++DPR PA+ S + VLR++LGF GV+ SDDL+M A + + +L+AG D+ Sbjct: 269 SELDPRYPATLSERTVHGVLRKQLGFQGVVISDDLTMGAIADQYRLEDAVEKALNAGVDI 328 Query: 276 ILVCNNRKGAVS----VLDNLSPIKAERVTR 302 +L+ +N S ++ L I + RVTR Sbjct: 329 LLLADNSPDTTSRMIAIMQKL--IDSGRVTR 357 >UniRef50_Q1NMP4 Glycoside hydrolase, family 3-like n=3 Tax=Deltaproteobacteria RepID=Q1NMP4_9DELT Length = 335 Score = 144 bits (362), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 112/340 (32%), Positives = 164/340 (48%), Gaps = 25/340 (7%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR----L 57 G M+ + G ELD RE++ V ILF+RN P QL L ++ AA R Sbjct: 13 GLFMVGLPGSELDDSSRELIERQRVNDFILFSRNVESPEQLTRLCAELAAACRAAGLPPP 72 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++A+DQEGG V R +E FTR P A+ A SG L+ A A+E+ A+ I+I+F Sbjct: 73 LIAIDQEGGTVTRLKEPFTRFPDARELA--SGPNPEAALSAYA-RTCATELRAVGININF 129 Query: 118 APVLDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APVLD+ + R +P + I+ M G+ GKHFPG G D Sbjct: 130 APVLDLCPAGQGFYMERRVLGEEPATVARLGRLLIETMQGQGLAACGKHFPGLGTARLDP 189 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTV 234 H++ P R + E+ A D+ F++ + IM +H +Y +DP RPA+ S L + Sbjct: 190 HQDLPTLERSREELAAYDLPPFAA-AVAVPVALIMTSHTVYPALDPQRPATLSAPILTDL 248 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LR+++G+DGVI +DDL M E A+L AG D +L+C++ L +L Sbjct: 249 LRRQMGYDGVIITDDLEMGAIDHNLPLEEAVTAALTAGADQLLICHDHDKVRRALKHLR- 307 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 R H G+ + A A TR+ L R+ Sbjct: 308 -------RQLHTGALA------PAPVSAALTRIAALRRRF 334 >UniRef50_D1BPZ2 Glycoside hydrolase family 3 domain protein n=3 Tax=Veillonella RepID=D1BPZ2_VEIPT Length = 381 Score = 143 bits (361), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 88/284 (30%), Positives = 151/284 (53%), Gaps = 16/284 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVV 59 +M+ + G L+ + + +L VGG+ILF RN Q++ L+ I A ++ L + Sbjct: 66 LMIGIHGTTLNDDAKFMLNEYRVGGIILFDRNMESKDQVKTLITDINKAGKSAGLTPLFL 125 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +DQEGG V R + ++P A+ + +E+ LA+++G +E+ + +I+FAP Sbjct: 126 GIDQEGGAVARMDDKLIKVPPAEEVGKMP-VEQAVSLAKQSG----AELKDLGFNINFAP 180 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 V D+G RSY P + + A EAG+ + KHFPG G D H +T Sbjct: 181 VADLGLTYG----RSYSTSPDEVVRYAGAVGKAYDEAGLWYSYKHFPGIGKTDVDLHADT 236 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLD--AIMPAHVIYSDVDP-RPASGSPYWLKTVLR 236 P + + ++D VF LI+++K + IM +H +Y +DP PAS S + LR Sbjct: 237 SIVPVSKETLLSEDTKVFIDLIKQSKPNTYTIMVSHAMYPQIDPDHPASLSKAIITDWLR 296 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 +++G++GV+ +DD+ M A ++ + S+ AG D++LVC+ Sbjct: 297 KDIGYNGVVVTDDMDMGALAKHYTFGDMAVQSILAGSDILLVCH 340 >UniRef50_UPI0001693BCF Beta-glucosidase-related glycosidase n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693BCF Length = 536 Score = 143 bits (360), Expect = 9e-33, Method: Compositional matrix adjust. Identities = 105/355 (29%), Positives = 168/355 (47%), Gaps = 27/355 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVV 59 VM E D + ++ +G +I F RN P Q+ L +++ + L++ Sbjct: 17 VMCGFESRVPDEQIETLITKYKLGNVIYFRRNLDTPLQVFRLSSKLQELAEKEVGIPLLI 76 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 ++DQEGG V R EG +P + AL + A ++ ++ A E+ A+ I+++FAP Sbjct: 77 SIDQEGGMVNRTHEGVVGMPGNMTLGAL----RDPQAAYDSAYVSAEELRAIGINMNFAP 132 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 LDV + + IG RSY + + + G + G+ T KHFPGHG ADSH Sbjct: 133 CLDVNNNPDNPVIGVRSYGETAELVSKLGVEAMKGYQDGGVAATIKHFPGHGDTQADSHH 192 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVL 235 P P P +R ++ F I DA+M AHVI+ ++P P++ S + +L Sbjct: 193 ALPLIPHPARRLRELELVPFQKAINAGA-DAVMSAHVIFPALEPEHLPSTLSRQVMTRLL 251 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV----SVLDN 291 R+E+GFDGVI +D L M+ AE +++AG D++LV + V ++L Sbjct: 252 REEMGFDGVITTDCLEMKAIDDHYGVAEGAVKAIEAGVDLVLVSHTLTKQVAAIGAILQA 311 Query: 292 LSPIK---------AERVTRLYHKGSFSRQELMDSARWKAISTRLNQ-LHERWQE 336 L + +R+ RL K R L + + + T +Q L ER E Sbjct: 312 LESGRLTEERIDESVDRILRLKQKLKLERDSLSEERVRRTVGTLKHQKLAERLYE 366 >UniRef50_C6J2I0 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J2I0_9BACL Length = 544 Score = 142 bits (359), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 96/279 (34%), Positives = 144/279 (51%), Gaps = 18/279 (6%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVVAVDQEGGRVQ 69 D + R ++ VGG++ F RN QL EL R ++ R++ L +++DQEGG V Sbjct: 26 DEQIRTLVQQYHVGGVVYFRRNIASVEQLAELSRSLQGLPRSKPELPLFISIDQEGGMVA 85 Query: 70 RF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA 128 R EG +R+P + G L +E L A EM+ + I+ +FAP +DV + A Sbjct: 86 RLDHEGMSRIPGNMAL----GAANDPALTEEVARLAAKEMLQLGINFNFAPCVDVNNNPA 141 Query: 129 --AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 IG RS+ P + + + G G+ T KHFPGHG DSH+ P + Sbjct: 142 NPVIGVRSFGEQPDRVAVHGAAAVRGYQAEGVIATAKHFPGHGDTAVDSHRGLASVPHGK 201 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGV 244 + ++ F I E +DAIM AHVI+ +P PA+ S L +LR+ELG+ G+ Sbjct: 202 ERLYQVELVPFREAIAEG-VDAIMTAHVIFPAFEPEGTPATLSHRVLTGLLREELGYGGL 260 Query: 245 IFSDDLSMEGAAIMGSY--AERGQASLDAGCDMILVCNN 281 I +D L M+ AI G Y AE +++AG D++LV + Sbjct: 261 IVTDCLEMQ--AIAGYYGIAEGAVQAIEAGADLVLVSHT 297 >UniRef50_C9LY46 Beta-hexosaminidase n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LY46_9FIRM Length = 408 Score = 142 bits (359), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 98/302 (32%), Positives = 157/302 (51%), Gaps = 21/302 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR---LVVA 60 +M+ ++G ELDA+ R +L+ +GG+ILF RN Q+R L ++ + + L +A Sbjct: 85 LMIGIQGAELDADSRFMLSEYHIGGVILFDRNMKSQEQVRALNDSLQKNASDAGLPLFLA 144 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +D+EGG V R +E F PAA + + A + A + AM +++FAPV Sbjct: 145 IDEEGGAVARMKEAFLPPPAAAEIGRTGDPQAAYRYAADT----AHGLKAMGFNLNFAPV 200 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 D+G A RSY D A + ++G +AG+ T KHFPG G +D+H++T Sbjct: 201 ADLG----ATYGRSYADDAATATKFVAAALEGHADAGLLATLKHFPGLGRGESDTHEDTV 256 Query: 181 CDPRPQAEIRAKDMSVFSSLI--RENKLDA-------IMPAHVIYSDVDPR-PASGSPYW 230 +A + A D+ F +I R +A +M H +Y +D + PAS SP Sbjct: 257 TVRADRATLEASDLVPFREMITKRNAAKNAKGASGWFVMATHTMYPALDAKNPASLSPAI 316 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 L+ +LR+EL +DGVI +DDL M + + G ++ AG D++LVC++ +V + Sbjct: 317 LQGLLREELSYDGVIVTDDLEMGAISRHYGFDRAGVEAILAGADLVLVCHDYAHETAVYN 376 Query: 291 NL 292 L Sbjct: 377 GL 378 >UniRef50_Q8YVY6 All1831 protein n=11 Tax=Nostocaceae RepID=Q8YVY6_ANASP Length = 365 Score = 142 bits (358), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 96/289 (33%), Positives = 149/289 (51%), Gaps = 21/289 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPA-------QLRELVRQIRA-ASRN 55 +++ + G L E++ L G+I F +N+ D +EL QI+ A R+ Sbjct: 14 LIMGISGTSLSDEDKRALGELKPIGVIFFAKNFLDGVPYAVWLETFQELHSQIQEYAERD 73 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 + +D EGGRV R TR P A + A+E A E+ ++ I++ Sbjct: 74 SMFFTLDHEGGRVVRTPLPITRFPQALLLRSQ---------AREVAKATAIELKSLGINL 124 Query: 116 SFAPVLDV-GHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 S++PV D+ H + IG R++ P A A A + G+ EAG+ KHFPGHG + Sbjct: 125 SWSPVADIYSHPQNPIIGSRAFGNTPDTAAAGAREYYLGLTEAGIVGCAKHFPGHGDTSK 184 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLK 232 DSH E P E++ +++ F +LI E + IM AH+++ +DP PA+ S LK Sbjct: 185 DSHVELPILNLTPEELQNRELIPFKALIEEG-IPLIMTAHILFPKIDPDLPATLSRPILK 243 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 T+LR+ELGF GV+ SDDL M+ + M + + +AGCD+ +V N Sbjct: 244 TILRKELGFQGVVVSDDLDMKAVSDMFMQSGTVARAFNAGCDLFIVSRN 292 >UniRef50_A3I7T1 Beta-hexosaminidase n=2 Tax=Bacillaceae RepID=A3I7T1_9BACI Length = 566 Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 96/334 (28%), Positives = 159/334 (47%), Gaps = 25/334 (7%) Query: 8 VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-RNRL--VVAVDQE 64 V G L E + ++ +GGLIL+ N P Q L+ + A+ NRL + DQE Sbjct: 225 VSGTSLQQETKSLIRDYKIGGLILYANNLETPQQTVSLMNDLMTANGTNRLPLFLGTDQE 284 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG+V R P Q ++ + + E G L+ ++ A ++ FAPV+DV Sbjct: 285 GGKVVRLPGPLKNFPTNQKIGQINQ----PQFSYEIGKLLGEQLKAFGFNLDFAPVMDVN 340 Query: 125 HI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + IG+RS+ P + + + G+ + KHFPGHG + DSH E P Sbjct: 341 SNPNNPIIGDRSFSNKPDIVSQLGIQTMQGLESQQVIPVIKHFPGHGDTSVDSHLELPKV 400 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGF 241 + ++ ++ F + I + +D +M H++ +D + P+S S + +LR +LGF Sbjct: 401 SKSLDDLNKLELIPFKAAI-DKGVDVVMVGHILLPKIDQQFPSSMSKEIITGLLRNQLGF 459 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 DGV+ +DD++M+ S + S+ AG D+IL+ + + +D L KA Sbjct: 460 DGVVMTDDMTMKAITNHFSIGQAAVDSVKAGNDIILIAHEFANVTTAIDAL---KAA--- 513 Query: 302 RLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 G S Q++ DS R R+ QL E++Q Sbjct: 514 --VKNGKISEQQINDSVR------RIIQLKEKYQ 539 >UniRef50_C6J1S3 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J1S3_9BACL Length = 426 Score = 142 bits (357), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 94/297 (31%), Positives = 154/297 (51%), Gaps = 15/297 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR---LVVA 60 V++ ++G E + R+++ VGG I + N D Q L ++ A++N+ L ++ Sbjct: 81 VLVGMDGTEPNDVTRQLIEEYHVGGFIFYKDNIQDTKQALTLFNGLKTANQNKAVPLFLS 140 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD+EGGRV R T+LPAA+ E LA + G ++ E+ +++ FAPV Sbjct: 141 VDEEGGRVSRMPGELTKLPAARKIGNTVSEE----LAGQIGGIIGQELAGFGLNLDFAPV 196 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LDV + IG+R++ P+ + + G+ E G+ KHFPGHG + DSH Sbjct: 197 LDVNSNPDNPVIGDRAFGTKPEIVSRMGIAVMKGIREQGVIPVVKHFPGHGDTSVDSHLG 256 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 P +R ++ F I+E D +M AH++ +DP PAS S + +LR+ Sbjct: 257 LPVVEHDLDRLRKLELVPFKQAIQEGA-DVVMIAHLLMPKLDPEHPASFSKAVIHDLLRE 315 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQAS--LDAGCDMILVCNNRKGAVSVLDNL 292 ELG++G I SDD++M AI+ Y R A + AG +++LV ++ + V+ L Sbjct: 316 ELGYEGTIISDDMTM--GAIVEHYDIRDAAVQFIQAGGNIVLVGHDEEKEKQVIQAL 370 >UniRef50_B1ZLT8 Beta-N-acetylhexosaminidase n=61 Tax=Alphaproteobacteria RepID=B1ZLT8_METPB Length = 351 Score = 142 bits (357), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 97/296 (32%), Positives = 148/296 (50%), Gaps = 12/296 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR-AASRNRLVVAVD 62 V+L G L EE G ILF RN P +R L +R R + +D Sbjct: 20 VILGCSGTALTPEEAAFFRDLRPWGFILFKRNIGTPDAVRALTAALRETVGRADAPILID 79 Query: 63 QEGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 QEGGRVQR + + PA F AL +E A+ LMA ++ + I++ AP+L Sbjct: 80 QEGGRVQRMGPPHWPKYPAGGRFDALG--DEAAATARLGARLMAHDLSEVGINVDCAPML 137 Query: 122 DVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV + IG+R+Y P + + + +G+ G+ KH PGHG T DSH + Sbjct: 138 DVPAAGSDNIIGDRAYGDTPGRVIELGRAVAEGLMAGGVLPVIKHIPGHGRATCDSHLDL 197 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQE 238 P + + D F +L + M AHV+++ +DP RPA+ S ++ V+R E Sbjct: 198 PVVEADRVSLGESDFRPFRAL---ADMPLAMSAHVVFNALDPERPATLSERVMREVVRGE 254 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 +GFDG++ +DDLSM A+ G + +R +A+ AG D++L CN R + + +P Sbjct: 255 IGFDGLLMTDDLSMH--ALHGPFRDRAEAAYRAGIDVVLHCNGRMDEMRAVAEATP 308 >UniRef50_Q2RTG8 Beta-N-acetylhexosaminidase n=9 Tax=Alphaproteobacteria RepID=Q2RTG8_RHORT Length = 339 Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 105/289 (36%), Positives = 147/289 (50%), Gaps = 16/289 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA-SRNRLVVAVD 62 V+L V G L E G ILF RN P Q+R L +RA R+ + +D Sbjct: 9 VILGVSGPALSPREVAFFREIDPFGFILFGRNVESPDQVRALTEALRACVGRDDAPILID 68 Query: 63 QEGGRVQRFREG-FTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFAP 119 QEGGRVQR + ++P A+ F AL G E G + A+ L+A E+ + ID P Sbjct: 69 QEGGRVQRLGPPHWRKIPPAEDFGALYGRDREAGVEAARLNARLIAHELRGLGIDTDCLP 128 Query: 120 VLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV +A +G+R+Y DP A+ + G+ G+ KH PGHG ADSH Sbjct: 129 VLDVRQSAADDIVGDRAYSDDPGVVAALGAAVAQGLLAGGVFPVIKHIPGHGRAHADSHL 188 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDA--IMPAHVIYSDVDP-RPASGSPYWLKTV 234 P P AE+ A D F +L DA M AH++Y+ +D PA+ S + V Sbjct: 189 RLPVVDNPLAELDASDFLPFKALA-----DAPFAMTAHILYTAIDAGAPATLSAEVVSQV 243 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRK 283 +R GF G++ +DDLSM+ A+ G +R + S AGCD++L CN + Sbjct: 244 IRGVCGFTGLLMTDDLSMK--ALEGPMEDRTRRSFAAGCDLVLHCNGDR 290 >UniRef50_Q8CX63 Beta-N-acetylhexosaminidase (Beta-hexosaminidase) n=1 Tax=Oceanobacillus iheyensis RepID=Q8CX63_OCEIH Length = 529 Score = 140 bits (354), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 98/305 (32%), Positives = 156/305 (51%), Gaps = 29/305 (9%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNR-----LVVAVDQEGGRVQRFREGFTRLPA 80 +GG+ILFTRN P ++ L ++ +R+ L++ VDQE G V+R EG T P Sbjct: 32 IGGIILFTRNIGSPDEVLRLTASLQKEARDAGYERPLLICVDQENGTVRRLGEGTTIFPG 91 Query: 81 AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIGERSYHAD 138 + A S EE A + G+ +E+ A+ I+ + APV+DV + IG RSY Sbjct: 92 GMTIGA-SDSEEN---AYQVGYASGTELRALGINWNLAPVVDVNVNPDNPVIGVRSYGES 147 Query: 139 PQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFS 198 P++ +A + GM +AG+ TT KHFPGHG DSH P P + ++ F Sbjct: 148 PERVARLAKAAVKGMQDAGVVTTLKHFPGHGDTKVDSHLALPVIPHSIERLEEIELRPFQ 207 Query: 199 SLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGA 255 + + D IM AH+ + ++P +PA+ S L +LR++LG+ G+I +D + M Sbjct: 208 ASLES---DVIMSAHIYFPALEPEADKPATLSKAVLTGLLREKLGYTGLITTDCMEMNAI 264 Query: 256 A-IMGSYAERG-QASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQE 313 + +G+ E+G A+L AG D+ +V + + V L + V G SR+ Sbjct: 265 SETIGT--EKGVVAALQAGVDLAMVSHTLQRQVGAL--------QEVKAAVESGELSRES 314 Query: 314 LMDSA 318 ++ SA Sbjct: 315 ILTSA 319 >UniRef50_A8GPL9 Beta-glucosidase n=17 Tax=Rickettsia RepID=A8GPL9_RICAH Length = 329 Score = 140 bits (354), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 94/294 (31%), Positives = 145/294 (49%), Gaps = 31/294 (10%) Query: 3 PVMLDVEGYELDAEEREILA--HPLVGGLILFTRNY-------HDPAQLRELVRQIRAAS 53 PV++ + G EL ER + +PL G+ILF RN D L +L+ I+ Sbjct: 9 PVIIGISGPELTDAERALFEKHNPL--GVILFRRNIRKNDNGEQDKETLIKLIADIKEVL 66 Query: 54 RNRLVVAVDQEGGRVQRF-REGFTRLPAAQSFAALSGMEEG----GKLAQEAGWLMASEM 108 ++++DQEGGRV+R F PAAQ+F + ++ GK +E G Sbjct: 67 GANTIISIDQEGGRVKRLIAPTFYDAPAAQTFVDVQECKQNYINIGKELREVG------- 119 Query: 109 IAMDIDISFAPVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 I++ FAPV+D+ H A +G+RS+ +D + + + IDG+ E + KH P Sbjct: 120 ----INLDFAPVVDLIHEGADKIVGDRSFGSDLEVVVPLCLAAIDGLQEEKVVACIKHIP 175 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASG 226 GHG DSH P E+ D VF L + + + M AH+IY +DP + Sbjct: 176 GHGRAKVDSHIGLPIVDTSLKELEDTDFKVFKELAKYDNIKLAMTAHIIYKALDPENPAT 235 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 +K ++ +GF G+I SD + M+ A+ GS A+ + LDAG D++L C Sbjct: 236 LSKKVKAYIKNNIGFKGLIISDAIDMK--ALSGSMADITKGVLDAGVDIVLECT 287 >UniRef50_A6NU14 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NU14_9BACE Length = 435 Score = 140 bits (354), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 99/297 (33%), Positives = 156/297 (52%), Gaps = 22/297 (7%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNR--LVVAVDQEGGRVQRFREGFTRLPAAQS 83 VGGLI F RN +L LV +RA + + + +VDQEGG V+R + + L A + Sbjct: 113 VGGLIFFGRNVGTAEELVSLVNDLRAKNGDYIPMFFSVDQEGGTVERLPDEVSPLEDAYT 172 Query: 84 FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIGERSYHADPQK 141 + E G L Q ++A+E +A ++ FAP LD+ + IG R++ + Sbjct: 173 YGQSGSSEVGYALGQ----VLANECVAFGFNLDFAPSLDIWSNPENTVIGTRAFGTTAEA 228 Query: 142 ALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLI 201 A+ G+ + G+ KHFPGHG DSH P + E+ ++ F S I Sbjct: 229 VEAVGPWAAYGLMDGGVIPVVKHFPGHGDTAVDSHVGLPTVSKTVDELLTSELIPFQSAI 288 Query: 202 --RENK-LDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAI 257 RE + + A+M AH++ + +D PAS S + +LR++LGFDGV+F+DDL+M AI Sbjct: 289 EGREGEGVPAVMVAHILMTAIDQEHPASLSQAVVTGLLREQLGFDGVVFTDDLTM--GAI 346 Query: 258 MGSYA--ERGQASLDAGCDMILVCNNRKGAV----SVLDNLSP--IKAERVTRLYHK 306 +Y E +L+AGCD++LVC+N +VLD ++ + ER+ R ++ Sbjct: 347 TENYGLDEAAVLALEAGCDVLLVCHNEGDLALARQAVLDAVASGRLTEERIDRSVYR 403 >UniRef50_Q1IQ59 Beta-N-acetylhexosaminidase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IQ59_ACIBL Length = 372 Score = 140 bits (353), Expect = 7e-32, Method: Compositional matrix adjust. Identities = 87/282 (30%), Positives = 141/282 (50%), Gaps = 10/282 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ +G E+ A R +LA G I F RN Q +L + +AA L VD Sbjct: 17 MIVGFDGTEMSARVRTLLATIQPAGTIFFKRNVATAEQTWKLNYEAQAAVSTPLFRCVDL 76 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R R+ P+ + AA + + A+ +A+E A+ + FAPV D+ Sbjct: 77 EGGTVDRLRDAVAPAPSLSNVAATGSKKVMRRFART----LAAEARALGFNTDFAPVFDL 132 Query: 124 GHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + + R+ ADP+ + +A F+ G + + GKHFPG GA DSH E P Sbjct: 133 RTVESVKVLAGRTIAADPKHIIELAREFLKGFKDENVLGCGKHFPGLGAGAVDSHYELPT 192 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQEL 239 +P + +D+ + L ++++ M AH +Y + + PAS S +W+ +LR+++ Sbjct: 193 ISKPWKALWEEDLLPYRKL--KDEIAFAMVAHCVYPNATKEKAPASISRFWMTDILRKKI 250 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 GF +I SDD+ M+G S E ++ G D+ LVCNN Sbjct: 251 GFKHLICSDDMEMKGVQKAVSIEEACIQAVRGGADLFLVCNN 292 >UniRef50_B2IHJ4 Glycoside hydrolase family 3 domain protein n=3 Tax=Rhizobiales RepID=B2IHJ4_BEII9 Length = 370 Score = 139 bits (351), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 106/320 (33%), Positives = 160/320 (50%), Gaps = 32/320 (10%) Query: 12 ELDAEEREILAHPLVGGLILFTRNY-HD--PAQLRE----LVRQIRAAS-RNRLVVAVDQ 63 EL ++R +LA G+IL+ N+ HD AQ E L+ I AA R L + +D Sbjct: 27 ELHDQDRALLADLRPAGVILYKSNFRHDLPYAQWLEIHARLIEDIYAACGREHLWIGIDH 86 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGGRV R TR A + G A G M E+ ++ I+++FAPVLD+ Sbjct: 87 EGGRVCRTPAPITRFAYAAHW---------GDCAAAVGQAMGCELASLGINLNFAPVLDI 137 Query: 124 GHISAA---IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 H + A IGER++ P+ +A F+ M + GKHFPGHG + DSH P Sbjct: 138 -HTNPANPVIGERAFATTPEAVTPLALGFLRAMQAENVLGCGKHFPGHGDTSVDSHYGLP 196 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQEL 239 ++R +++ F + I E L IM +H+++ +DP P + S + +LR+EL Sbjct: 197 SQNCSLEDLRQRELKPFIAAI-EAGLSMIMTSHILFPAIDPDFPVTLSRRFCTELLREEL 255 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQAS--LDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 G+ GV+ SDD+ M A+ +AE A+ + AGCD+++VC + V D A Sbjct: 256 GYSGVVVSDDIGM--GAMNQFFAEPTAAARLIAAGCDLLMVCAHWTETERVRD-----FA 308 Query: 298 ERVTRLYHKGSFSRQELMDS 317 E + R KG + +L S Sbjct: 309 ESIVRARDKGEITPDQLTVS 328 >UniRef50_A8ZS87 Beta-N-acetylhexosaminidase n=3 Tax=Desulfobacteraceae RepID=A8ZS87_DESOH Length = 343 Score = 139 bits (349), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 96/288 (33%), Positives = 150/288 (52%), Gaps = 18/288 (6%) Query: 9 EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN----RLVVAVDQE 64 EG L+ + + ++ VGG+ILF N P Q+R+L +A + L VA+DQE Sbjct: 29 EGTTLNDDLKYLIDTLKVGGIILFAVNLEHPDQIRDLCASAQAHAAACRLPPLFVAIDQE 88 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV- 123 GG+V R + FTR S A+ + ASE+ + ++++ APVLDV Sbjct: 89 GGQVARLKPPFTRFEGNPSITT-------DDQARHFARITASELAGIGVNMNMAPVLDVA 141 Query: 124 -GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 G + + R++ P++ + I+ M + G+ GKHFPG G TADSH + P Sbjct: 142 DGVTDSVMAGRAFAGGPREVARLGGVVIEEMQKNGIMAVGKHFPGIGRTTADSHIDQPWL 201 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGF 241 AE+ D+ F + I E + IM +H+ Y+ +DP PAS S KT+LR++LG+ Sbjct: 202 AADPAEMETTDLVPFKTAI-ERDVAGIMLSHIRYTALDPDLPASMSTPIAKTLLREKLGY 260 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN---NRKGAV 286 +G++ +DDL M + + + AG DM+LVC+ +RK AV Sbjct: 261 EGLVMTDDLDMGAIRNHHVMDQVVRCADRAGIDMVLVCHKGPDRKKAV 308 >UniRef50_B4BR59 Glycoside hydrolase family 3 domain protein n=2 Tax=Geobacillus RepID=B4BR59_9BACI Length = 457 Score = 138 bits (348), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 96/335 (28%), Positives = 168/335 (50%), Gaps = 25/335 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLREL---VRQIRAASRNR--LV 58 V++ E D ++++ VGG IL+ RN P Q+ L ++++ ASR + LV Sbjct: 121 VIVGFPSTEPDENVKQMIRQYHVGGAILYDRNMKTPEQVATLTNDLQRVALASRFKIPLV 180 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 ++DQEGG++ R R+ + +P+ Q + ++A G +E+ AM I ++FA Sbjct: 181 FSIDQEGGKIVRMRQYVSPIPSQQQLGQSKDDAKVYQIAHRTG----TELAAMGIQVNFA 236 Query: 119 PVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 PVLD+ +A RS+ DP A + + + G+ +AG+ KHFPG+G D H + Sbjct: 237 PVLDL----SATDSRSFGEDPVLAGRLGQQVVTGLTDAGVTAVLKHFPGNGRSQVDPHHD 292 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDA--IMPAHVIYSDVDPR-PASGSPYWLKTVL 235 + ++ KD+ F +I D +M H+ Y D PAS S ++ +L Sbjct: 293 SSSVEAGMEDLENKDILPFRQIITNINHDRFFVMVTHIKYPVYDKEHPASLSSAIIQQLL 352 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R++LG++G++ +DDL M + +Y E G ++ AG D++LVC+ V+D Sbjct: 353 REKLGYEGIVVTDDLEMGAVSKYFTYRELGYRAIAAGADLLLVCHTFDHQKEVIDG---- 408 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 + G S + + +S R K ++ +L+Q Sbjct: 409 ----IWDAVQTGKLSEERINESVR-KILAYKLSQF 438 >UniRef50_D2BER2 Beta-hexosamidase A n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2BER2_STRRD Length = 427 Score = 138 bits (347), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 101/303 (33%), Positives = 159/303 (52%), Gaps = 20/303 (6%) Query: 27 GGLILF--TRNYHDPAQLRELVRQIRAASRN-RLVVAVDQEGGRVQRFREGFTRLPAAQS 83 GG+ILF N + Q+ L ++ AS L+V DQE GRV R T +P A Sbjct: 100 GGVILFPWAGNVKNVRQVVALTNGLQKASPEIPLLVGADQENGRVSRMAPLVTEMPGA-- 157 Query: 84 FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIGERSYHADPQK 141 ++ G LA++A + +E+ A+ I++ FAPV DV + IG R+Y +DP+K Sbjct: 158 --SVIGSTGDPSLARKAAKVTGTELRALGINLDFAPVADVNINPRNPVIGPRAYGSDPKK 215 Query: 142 ALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLI 201 + + + G H+AG+ +T KHFPGHG DSH P ++ D F++ I Sbjct: 216 VAPMVAAAVQGFHDAGIASTAKHFPGHGDTNVDSHSGLPVIQHSLSQWNKLDAPPFAAAI 275 Query: 202 RENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMG 259 +N +DAIM AHV+ +DP PA+ S L +LR++LGFDGV+ +D L M G + Sbjct: 276 GKN-IDAIMSAHVVMPKLDPSGDPATLSKPILTGLLREKLGFDGVVSTDALDMAG--VRK 332 Query: 260 SYAERGQA---SLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMD 316 Y + GQ ++ AG D++L+ + A + L+ +K+ +++ S R + Sbjct: 333 KYGD-GQVAVRAIQAGVDLLLMPPDFPKAYGAV--LAAVKSGKISTARLDQSVRRLLKLK 389 Query: 317 SAR 319 +AR Sbjct: 390 AAR 392 >UniRef50_B9E727 Putative uncharacterized protein n=1 Tax=Macrococcus caseolyticus JCSC5402 RepID=B9E727_MACCJ Length = 379 Score = 138 bits (347), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 83/257 (32%), Positives = 137/257 (53%), Gaps = 9/257 (3%) Query: 26 VGGLILFTRNYHDPAQLRELVRQI-RAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSF 84 +GG+ILF RN + +QL +L R I A+ + + +DQEGGRV R + +A + Sbjct: 83 IGGVILFKRNIVNESQLTQLNRDIFHNAANIPMFIGIDQEGGRVNRLPASIQNIESAYTI 142 Query: 85 AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIGERSYHADPQKA 142 G+ + A + G + + M ++ FAPVLD+ + IG+RS+ D Sbjct: 143 ----GLSHDKQYAYQQGRYIGENVKRMGFNLDFAPVLDIWSNPANKVIGDRSFGHDAHMV 198 Query: 143 LAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIR 202 ++ F G+ E G+ T+GKHFPGHG +DSH+ P + E+ ++ F I Sbjct: 199 SGMSRSFRQGITETGIITSGKHFPGHGDTVSDSHETLPVSNKSLDELMKFELIPFKQHI- 257 Query: 203 ENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSY 261 ++ +D +M +H+ + D+D + PAS S + +L+++LG+ GVI SDDLSM + S Sbjct: 258 DDGIDMLMVSHISFPDIDNKYPASISKEIVTNILKKKLGYKGVIISDDLSMGAISDRYSL 317 Query: 262 AERGQASLDAGCDMILV 278 E L +G ++L+ Sbjct: 318 NEAVVRGLQSGETIMLI 334 >UniRef50_D0RP10 Beta-hexosaminidase n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RP10_9RICK Length = 306 Score = 138 bits (347), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 83/283 (29%), Positives = 146/283 (51%), Gaps = 12/283 (4%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 PV+ + G L+ +E +++ + + G I+F RN + QL++L++ I++ S N ++ +D Sbjct: 10 PVIFGLHGQFLNNDEIKLIKNNKIYGFIIFERNILNLEQLKKLIKHIKSISPNDPLIMID 69 Query: 63 QEGGRVQRFREGFT-RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMI--AMDIDISFAP 119 EGGRV RF + F+ + A+ F L ++ + + +L + I + ID P Sbjct: 70 HEGGRVNRFNKIFSQKKYTAEYFGKLYLTDKKNFYKETSNFLDFNIGIFKFLGIDTVAYP 129 Query: 120 VLDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 V D+ + IG+RS+ +P+ + FI + G+ KH PGHG DSH Sbjct: 130 VADLKYKKTHKVIGDRSFSKNPKIVNKLCEYFIQKYQDKGINCIAKHAPGHGLAVVDSHF 189 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 + P + + KD F +L + +M AH+ Y +D + A+ S L T++R+ Sbjct: 190 KLPIVEENKENLFKKDFLCFKNL----RSKFLMIAHIKYIFLDFKIATYSSKIL-TLIRK 244 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 ++ F G+I +DD+ M+ A+ + + L AGCD+IL CN Sbjct: 245 KINFKGLIMTDDICMK--ALKENLKTKVTQPLLAGCDIILHCN 285 >UniRef50_A8MJC3 Glycoside hydrolase family 3 domain protein n=2 Tax=Clostridiaceae RepID=A8MJC3_ALKOO Length = 418 Score = 138 bits (347), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 99/352 (28%), Positives = 180/352 (51%), Gaps = 27/352 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA---ASRNRLVVA 60 +++ ++GY+++ +++ + VGG+IL+ RN QL +L ++ A+ L ++ Sbjct: 74 LLVGIDGYDMNDPTADLIQNYRVGGVILYGRNVKGTKQLLDLTNALKTSNTANNIPLFIS 133 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD+EGG V R + +LP A++ G E LA E G L+ + + + +++FAPV Sbjct: 134 VDEEGGAVSRSPKEVKKLPTARAI----GKTEDVDLAYEVGNLLGAMVKSFGYNMNFAPV 189 Query: 121 LDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ A I +RS+ P + + GM G+ KHFPGHG + DSH Sbjct: 190 LDIDSNPANPVIEKRSFGKTPDIVSKMGIAEMKGMASEGVIPVVKHFPGHGDTSVDSHIG 249 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQ 237 P I + + F S I E + +M AH++ +D +PAS S + +LR+ Sbjct: 250 LPIIHHELERIMDFEAAPFKSAIEEGA-EVVMVAHILLKKIDAEKPASMSKVIITDILRE 308 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSY--AERGQASLDAGCDMILVCNNRKGAVSVLDNLSP- 294 LGF GV+ +DD++M AI +Y A ++++AG D+ILV + + A++ + + Sbjct: 309 NLGFGGVVITDDMTM--GAISKNYDMAAAVVSAINAGADIILVGHQYENAIAAFNGIKKG 366 Query: 295 -----IKAERVTRLYH-----KGSFS-RQELMDSARWKAISTRLNQLHERWQ 335 IK +R+ H K ++ + L++S + I+ R++++ ++ Sbjct: 367 IEEGRIKMDRLDESVHRILSLKNKYNIKNTLVESMDVQKINDRIDKVMDKIN 418 >UniRef50_C7MQM3 Beta-glucosidase-like glycosyl hydrolase n=2 Tax=Actinomycetales RepID=C7MQM3_SACVD Length = 618 Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 110/313 (35%), Positives = 161/313 (51%), Gaps = 27/313 (8%) Query: 9 EGYELDAEEREILAHPLVGGLILF----TRNYHDPAQLRELVRQIRAAS-----RNRLVV 59 E Y +D E++ VGG+I F T N P Q+ L ++ A+ R L+V Sbjct: 88 EKYGVDTPA-EVVRKYHVGGVIYFNHSGTDNIETPRQVARLSNGLQRAALQSQPRVPLIV 146 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +VDQEGGRV R E T P+A + A +E A + A+E+ AM I+ +FAP Sbjct: 147 SVDQEGGRVTRIAEHVTEYPSAMALGASRDVEGARTAAAIS----AAELRAMGINQNFAP 202 Query: 120 VLDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEA-----GMKTTGKHFPGHGAVT 172 V DV ++ IG RS+ ADP+ A +DG + + KHFPGHG + Sbjct: 203 VADVNSNPLNPVIGSRSFSADPELAGEFVEAQVDGYQNSRQPRQTVSAAAKHFPGHGDAS 262 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYW 230 ADSH + P R + + RA D+ F + E +DAIM AH+ +D PA+ S Sbjct: 263 ADSHIDLPVIDRSEEDWRANDLPPFERAV-EAGIDAIMTAHISVPSLDDSGDPATLSEPI 321 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS-VL 289 + +LR+ELG+DGV+ +D L M G M +E +L+AG D +L+ + AV+ VL Sbjct: 322 ITGLLREELGYDGVVVTDSLGMAGVRQMYPDSEIPVRALEAGVDQMLMPPDLDAAVNGVL 381 Query: 290 DNLSP--IKAERV 300 D ++ I ER+ Sbjct: 382 DAVASGRITEERI 394 >UniRef50_Q0BSP6 Anhydromuramoyl-peptide exo-beta-N-acetylglucosaminidase n=2 Tax=Acetobacteraceae RepID=Q0BSP6_GRABC Length = 348 Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 108/354 (30%), Positives = 167/354 (47%), Gaps = 22/354 (6%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M ++ + G L + E +L G+ILF RN DPAQL+ L ++R+ + + Sbjct: 1 MKAAIIGLSGPTLTSAEEAMLHTQRPAGVILFRRNISDPAQLKALTARLRSILPSEAALL 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMD-----I 113 VDQEGGRV R R R + AA+ G E K A WL + +I +D Sbjct: 61 VDQEGGRVARLRPPHWR---SHPPAAVIGRLYEADKKAGLRAAWLTGA-LIGLDCMGAGF 116 Query: 114 DISFAPVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 D+ APVLD+ A +G+R+Y DP + ++G+ AG++ KH PGHG Sbjct: 117 DVVCAPVLDLLVPGAHDIVGDRAYGTDPGTVTLLGGAVMEGLLAAGVRPVMKHVPGHGRA 176 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDA-IMPAHVIYSDVDP-RPASGSPY 229 ADSH E P P E+ A+ + R L M AH++Y D RPA+ SP Sbjct: 177 HADSHLELPVVPD---ELPAELEPFTRNAARFAGLPVWAMTAHILYEAWDRLRPATLSPV 233 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + ++RQ +GF+G++ SDD++M+ A+ G + +L AGCD++L C + + Sbjct: 234 VIGQIIRQRIGFNGLLVSDDIAMK--ALSGDPVSLTRQTLAAGCDVVLHCTGVQAETQAV 291 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE--RWQEEKAGH 341 P+ ++ L + + D RL L + R ++ GH Sbjct: 292 LEGCPLLTDQAHTLLAARPVASISIGDGESLAQERERLGLLEDGIRAKDPTEGH 345 >UniRef50_A9F261 Put. Beta-glucosidase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F261_SORC5 Length = 371 Score = 137 bits (345), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 99/278 (35%), Positives = 146/278 (52%), Gaps = 10/278 (3%) Query: 9 EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQI-RAASRNRL-VVAVDQEGG 66 +G E A + LA GG ILF RN D A L + + A +R + +DQEGG Sbjct: 20 DGAEPPARYLKALAEGRRGGAILFRRNVPDVAATARLCQALAEAGGPDRPPFIGIDQEGG 79 Query: 67 RVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI 126 RV R F LP+ +S L+ + L + A +ASE+ A+ I+++FAPVLDV Sbjct: 80 RVTRLPAPFLTLPSMRSLGELADL----PLLRRAARAVASELRAVGINLNFAPVLDVDSN 135 Query: 127 SA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 A IG+R++ DP A F++G+ E G+ GKHFPGHG DSH P Sbjct: 136 PANPIIGDRAFGRDPGAVARGAVAFLEGLQEEGVLACGKHFPGHGDTALDSHLALPTLSH 195 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDG 243 + + ++ F + + ++M AH++ +DP PA+ S +LR E+GF+G Sbjct: 196 ARGRLDEIELPPFRAACGAG-VASLMTAHIVVEALDPGVPATLSRAICTDLLRGEIGFEG 254 Query: 244 VIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 V+FSDDL M A + E A++ AGCD +LVC++ Sbjct: 255 VLFSDDLEMAAIAAHNAVEEAAAAAVWAGCDALLVCHS 292 >UniRef50_B5LAN7 HexI n=3 Tax=cellular organisms RepID=B5LAN7_9BURK Length = 521 Score = 137 bits (345), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 98/295 (33%), Positives = 157/295 (53%), Gaps = 15/295 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +M+ + G LD + + L + + LF +N +P QL +L +RA ++ +DQ Sbjct: 25 IMIRMPGTVLDEDTAQFLRDNHIRAVCLFRQNMVNPEQLAKLTADLRAVMGPNALIGIDQ 84 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R T LPA + AL +++ LA+ G +A + ++ + +FAPVLD+ Sbjct: 85 EGGAVVRA----TWLPAPPAAMALGAVDDVA-LARSVGAAVARGIRSLGFNWNFAPVLDL 139 Query: 124 GHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + A IGERS+ +DPQ+A +A +++G E G+ KHFPGHG DSH++ P Sbjct: 140 NNNPANPVIGERSFGSDPQRATELALAWMEGSLEQGVACCVKHFPGHGDTHVDSHRDLPT 199 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 +P++ + A +++ F + R A+M AH++Y +DP PA+ S L +LR E Sbjct: 200 VAKPRSALDALELAPFKAARRA---PAMMSAHIVYPALDPEYPATLSRKILTDLLRSEWD 256 Query: 241 FDGVIFSDDLSMEGAAIMGSY--AERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 + GVI +D + M AI G Y +L AG DM++ NR L +L+ Sbjct: 257 YRGVIITDGMDMH--AIAGRYGVGNAAVRALAAGADMVMALGNRSTQEETLVSLT 309 >UniRef50_C1E6R7 Glycoside hydrolase family 3 protein n=1 Tax=Micromonas sp. RCC299 RepID=C1E6R7_9CHLO Length = 366 Score = 137 bits (345), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 106/318 (33%), Positives = 157/318 (49%), Gaps = 46/318 (14%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQI-RAASRNRLVVAVD 62 V +GY DA RE++A G +I+F RN P Q+ L + R A+ L+V +D Sbjct: 13 VCFGFDGYTPDAHAREMIAKG-CGAVIIFDRNIKSPTQVATLCADLKRLAAPRPLLVMID 71 Query: 63 QEGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 QEGGRV+RFR F+ +PAA + + SE+ A +D++ APVL Sbjct: 72 QEGGRVRRFRPPQFSDIPAAAA--------------------IGSELRACHVDMTLAPVL 111 Query: 122 DVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV +A IG+RSY +D + A A + FID + G+ KH+PGHG DSH Sbjct: 112 DVNTNPNNAVIGDRSYGSDARVASACGAAFIDALQAGGVAGCAKHWPGHGDTEEDSHDAL 171 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLR 236 P + + +M+ F + I N A++ AH++ ++P RPAS S + T LR Sbjct: 172 PASKHGLDRLESVEMAPFRAAISANVASALV-AHLLVPAMEPEAHRPASCSERCV-THLR 229 Query: 237 QELGF-DGVIFSDDLSM-------------EGAAI-MGSYAERGQASLDAGCDMILVCNN 281 + L F DGV+ +DD+ M EG G+ L AG D+ L C+ Sbjct: 230 KNLRFGDGVVMTDDMEMGALRGIPWPAREGEGGPFESGAVPRAATEGLRAGIDLFLACHT 289 Query: 282 RKGAVSVLDNLSPIKAER 299 R+ + V++ L+ I ER Sbjct: 290 RRLQLDVIEALT-IAIER 306 >UniRef50_C7P2G4 Glycoside hydrolase family 3 domain protein n=1 Tax=Halomicrobium mukohataei DSM 12286 RepID=C7P2G4_HALMD Length = 528 Score = 137 bits (344), Expect = 7e-31, Method: Compositional matrix adjust. Identities = 93/283 (32%), Positives = 142/283 (50%), Gaps = 16/283 (5%) Query: 9 EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR-----AASRNRLVVAVDQ 63 EG E + RE+L G +I F+RN P Q+ EL R+++ A L V DQ Sbjct: 24 EGPEPTEDLRELLTDYRCGNVIYFSRNIDSPEQVAELSRELQTIATEAGPEIPLFVTADQ 83 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R G T P+ S G LA+ G + +E+ ++ ++ PVLDV Sbjct: 84 EGGVVSRTDWG-TEPPSQMSI----GAGRDADLARSVGGAVGAELASIGVNFDLTPVLDV 138 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + IG RS+ +P+ + + DGM G+ GKHFPGHG +ADSH P Sbjct: 139 NNNPDNPVIGVRSFGEEPELVGDLGAAMADGMQSEGVLACGKHFPGHGDTSADSHHSLPV 198 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV---DPRPASGSPYWLKTVLRQE 238 + + A +++ F I + +DAIM HV + + D PA+ S +LR++ Sbjct: 199 VDHDRERLDAVELAPFRRAI-DAGIDAIMTTHVSFPTITGDDELPATVSRDVQTGLLREQ 257 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 LGFDG++ +D + M A E +++AGCD++LVC+ Sbjct: 258 LGFDGLVVTDGMEMNAIADEMGTPEGCVQAVEAGCDLLLVCHT 300 >UniRef50_Q11XV7 B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11XV7_CYTH3 Length = 395 Score = 137 bits (344), Expect = 7e-31, Method: Compositional matrix adjust. Identities = 89/270 (32%), Positives = 146/270 (54%), Gaps = 24/270 (8%) Query: 26 VGGLILFTRNY---HDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE--GFTRLPA 80 VGG+I+F +N + A L++L+ ++A++ L +++D+EGG+V R +E GF +P+ Sbjct: 90 VGGIIMFEKNIAAANSAATLKKLISDLQASASTPLFMSIDEEGGKVHRLKEKYGFVPMPS 149 Query: 81 AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH-----ISAAIGERSY 135 A L+ ++ +A E+ + ++++AP LD+ I A +G RSY Sbjct: 150 AGYLGNLNNLDSTTYYTTN----LAKELKELGFNLNYAPCLDLATNPNNPIIAKVG-RSY 204 Query: 136 HADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMS 195 ADP A+ I+ H G+KT KHFPGHG+ +DSH D Q I ++ Sbjct: 205 SADPDLVSKQAAASINAHHTQGIKTIVKHFPGHGSSMSDSHLGI-VDVTNQWNII--ELM 261 Query: 196 VFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFSDDLSME 253 + +LI+ +DA+M AH+I +D P + S + +LR LGF GV+FSDD+ M Sbjct: 262 PYKTLIQSGTIDAVMTAHIINEHLDQSKLPGTLSKVVVTDLLRNMLGFTGVVFSDDMQM- 320 Query: 254 GAAIMGSYAERG--QASLDAGCDMILVCNN 281 AI +Y + + S+ AG D+++ NN Sbjct: 321 -YAISKNYGQENAIKLSILAGVDVLVFGNN 349 >UniRef50_A9NEX4 Glycoside hydrolase, family 3 n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEX4_ACHLI Length = 523 Score = 137 bits (344), Expect = 8e-31, Method: Compositional matrix adjust. Identities = 89/266 (33%), Positives = 142/266 (53%), Gaps = 18/266 (6%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVVAVDQEGGRVQRFREGFTRLPAA 81 VG ++LF+RN AQL +L + ++ + L + +DQEGG V R E T P A Sbjct: 41 VGNILLFSRNIESAAQLFKLNQDLQKIAMETFGIPLYITIDQEGGMVTRIFEDGTFFPGA 100 Query: 82 QSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHADP 139 + AA + +E K+ G LM E+I + I++ APVLDV + + IG RS+ DP Sbjct: 101 MTIAATNNVEYAYKI----GDLMGKELIDLGINMDLAPVLDVNNNPKNPVIGVRSFSDDP 156 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSS 199 K + I G+ + + T KHFPGHG DSH P +P E+ + F + Sbjct: 157 DKVALFGNANIRGL-QNHVVATAKHFPGHGDTHVDSHLALPKVEKPLNELEKFEFKPFKA 215 Query: 200 LIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAI 257 I + L AIM AH+ ++ + D P + S L +LR+E+GF+G+I +D + M+ AI Sbjct: 216 AINDG-LHAIMSAHINFTHLTEDGLPVTLSKRALTGLLREEMGFEGLIITDGIEMK--AI 272 Query: 258 MGSYA--ERGQASLDAGCDMILVCNN 281 +Y E +++AG +++ +C++ Sbjct: 273 HDNYGTIEATLKTVNAGANLVCICHD 298 >UniRef50_B3QTH0 Beta-N-acetylhexosaminidase n=12 Tax=Chlorobiaceae RepID=B3QTH0_CHLT3 Length = 384 Score = 137 bits (344), Expect = 8e-31, Method: Compositional matrix adjust. Identities = 91/280 (32%), Positives = 147/280 (52%), Gaps = 39/280 (13%) Query: 26 VGGLILF---------TRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE--G 74 +GG+ILF +RN P Q+ L+++++AA+ L++A+DQEGGRV R +E G Sbjct: 74 IGGVILFDYDLPLKQASRNIQSPEQVCALIKELQAAAPIPLLIAIDQEGGRVNRLKEKFG 133 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGE 132 F + +A ++ ++ K A++ A + A+ I+++FAPVLDV + IG+ Sbjct: 134 FPKSVSAAWLGEVNSLDSTQKYAEQT----AKTLAALGINLNFAPVLDVNTNPENPVIGK 189 Query: 133 --RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK------ET--PCD 182 RS+ +P+ + H+ G+ + KHFPGHG+ DSHK ET P + Sbjct: 190 LARSFSENPEIVAEHGLATVKAFHQFGVLSAVKHFPGHGSAWNDSHKGLADVTETWQPLE 249 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGF 241 P F +I+ + D +M AHV + +D PA+ S + +LR+ELGF Sbjct: 250 IEP-----------FKRVIQAGECDMVMTAHVFNAKLDTTFPATLSQNVITGLLRKELGF 298 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 DGV+ SDD+ ME + +L+AG D+++ NN Sbjct: 299 DGVVVSDDMQMEAIRSFYGLETAVRLALNAGVDLLVFANN 338 >UniRef50_Q28S86 Beta-N-acetylhexosaminidase n=14 Tax=Rhodobacterales RepID=Q28S86_JANSC Length = 338 Score = 136 bits (343), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 91/282 (32%), Positives = 142/282 (50%), Gaps = 10/282 (3%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 +L EG L A E+ A G ILF RN + AQ+ L ++RA+ + +DQE Sbjct: 7 ILGCEGLTLSASEKAFFAEARPWGFILFARNIKNAAQVLALTEELRASVGWNAPILIDQE 66 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GGRVQR R R + A+ + ++ +L+ E+ + ID+ P DV Sbjct: 67 GGRVQRLRGPIWR-EWSPPLDAIKAANDPSRVMYLRSYLIGHELREVGIDVQCGPTCDVA 125 Query: 125 HISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + + R Y D A + G+ ++G+ KH PGHG T DSH + P Sbjct: 126 RDTTHPFLKNRLYGYDADSVTRYAEAVVSGLADSGVACVVKHIPGHGLGTVDSHHDLPRV 185 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGF 241 +A + A D VF+ + ++ M AHV++ +DP PA+ SP + ++R ++GF Sbjct: 186 TADRATLDAIDFKVFNGMASQSMG---MTAHVVFEAIDPDLPATLSPTMID-LIRSDIGF 241 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRK 283 +G++ +DDLSM A+ G+ ER A+ AGCD+IL CN + Sbjct: 242 NGLLMTDDLSMN--ALPGTIGERAAAARAAGCDVILHCNGDR 281 >UniRef50_Q9RUP9 Glycosyl hyrolase, family 3 n=2 Tax=Bacteria RepID=Q9RUP9_DEIRA Length = 348 Score = 135 bits (340), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 93/286 (32%), Positives = 150/286 (52%), Gaps = 16/286 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +M+D+ G ELD + L +G + LF +N QLR L +R ++A+D Sbjct: 19 LMIDLPGPELDKDTAAYLREHGIGAVCLFGKNVESAEQLRRLCADLREVMGEHALIAIDH 78 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD- 122 EGG + R E + P+A S G + +L ++ +A ++ ++ I+ +F PVLD Sbjct: 79 EGGAITR-PEFWPFAPSAMSL----GAADDQQLTEDVNAALARQLRSVGINWNFTPVLDI 133 Query: 123 -VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 V + IG+R+Y +D + + G G+ KHFPGHG DSH P Sbjct: 134 NVNPANPVIGDRAYGSDAARVTRHGRAALAGHTREGVAPCAKHFPGHGDTHQDSHLALPR 193 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 + +AE+ A +++ F +L+ E AIM AH++Y +D PA+ SP L +LR+E G Sbjct: 194 VSKSRAELDAGELAPFRALLPETP--AIMTAHIVYDALDAEHPATLSPRILTGLLREEWG 251 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQA---SLDAGCDMILVCNNRK 283 +DGVI +D + M+ AI +Y RG+A +L AG D+++ R+ Sbjct: 252 YDGVIVTDSMGMQ--AIDANYG-RGEAAVRALRAGADLVMALGRRE 294 >UniRef50_B3E4C8 Glycoside hydrolase family 3 domain protein n=3 Tax=Desulfuromonadales RepID=B3E4C8_GEOLS Length = 395 Score = 135 bits (339), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 96/307 (31%), Positives = 154/307 (50%), Gaps = 23/307 (7%) Query: 27 GGLILFT---------RNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTR 77 GG++LF RN PAQL++L ++ AS L++AVDQEGG V R ++ + Sbjct: 82 GGVVLFDNNIDLGVTGRNITAPAQLKQLTTALQQASELPLLIAVDQEGGVVARLKDRYG- 140 Query: 78 LPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG----HISAAIGER 133 PA S A G + L + + +A+ + +++ APV+D+ + A ER Sbjct: 141 FPATVS-AKYLGQQNRLDLTRSSSENLAATLDEYGFNLNLAPVVDLATNPDNPVIAFKER 199 Query: 134 SYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKD 193 S+ ADP A A+ FI H + T KHFPGHG+ DSH R E + Sbjct: 200 SFSADPALVAAHAAEFIKSHHRHHILTCLKHFPGHGSSRDDSHLGLVDVTRYWHE---NE 256 Query: 194 MSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVIFSDDLSM 252 + + LI + D +M AH + +DP PA+ S + +LR++LGFDGV+ SDDL M Sbjct: 257 LQPYRDLISQGLCDMVMTAHTFNTAIDPDHPATLSRATIDGILRKQLGFDGVVVSDDLYM 316 Query: 253 EGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQ 312 SY + +++AG D+++V N++ + ++ P E + L +G R+ Sbjct: 317 GAIIQHYSYETAVEKAINAGVDLLVVANDKLYSPDIM----PRTIELLLNLVQQGRIPRE 372 Query: 313 ELMDSAR 319 + ++R Sbjct: 373 RIEQASR 379 >UniRef50_C4KZ16 Beta-N-acetylhexosaminidase n=2 Tax=Firmicutes RepID=C4KZ16_EXISA Length = 649 Score = 135 bits (339), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 88/272 (32%), Positives = 140/272 (51%), Gaps = 25/272 (9%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIR------AASRNRLVVAVDQEGGRVQRFREGFTRLP 79 +GG+ILF N + Q +LV ++ A++ L V +DQEGG V R G T LP Sbjct: 87 LGGVILFAENVKETEQTTKLVHDLQEVVKQDASNDIPLFVTIDQEGGIVTRLGTG-TNLP 145 Query: 80 AAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHA 137 + A + A +AG ++ SE+ A+ ++++F PVLDV + + IG RS+ + Sbjct: 146 GNMALGATRNRQ----YAYDAGEIIGSELHALGVNVNFGPVLDVNNNPGNPVIGVRSFSS 201 Query: 138 DPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVF 197 DP+ + S G+ + G+ T KHFPGHG DSH P + E+R ++ F Sbjct: 202 DPELVGELGSAMTQGIQDQGVAATAKHFPGHGDTAVDSHYGLPIVDKSLDELRGLELLPF 261 Query: 198 SSLIRENKLDAIMPAHVIYSDVD-----------PRPASGSPYWLKTVLRQELGFDGVIF 246 I E +D IM AH+ ++ P PA+ S + VLR+ELG++G++ Sbjct: 262 KRAITEG-IDMIMTAHIGMPQIEDEVVESERGTFPLPATLSDDVITGVLREELGYEGIVV 320 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 +D L+M+ A + AE + +AG D+ L+ Sbjct: 321 TDALNMQAIADNFTEAEAVIKTFEAGVDIALM 352 >UniRef50_A0LL99 Beta-N-acetylhexosaminidase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LL99_SYNFM Length = 337 Score = 134 bits (338), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 91/262 (34%), Positives = 130/262 (49%), Gaps = 13/262 (4%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRN----RLVVAVDQEGGRVQRFREGFTRLPAA 81 +GG++LF RN P QLR L+ ++++ +R L VA+DQEGG VQR FT+LP+A Sbjct: 35 IGGIVLFRRNIQTPEQLRSLLEEMQSHARQVLGRSLWVAIDQEGGPVQRLVPPFTQLPSA 94 Query: 82 QSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD--VGHISAAIGERSYHADP 139 A G+E + + +A A ++ M I I+ APVLD V S + R DP Sbjct: 95 CDLAQ-QGIEAVAEWSSKA----AMDLRRMGIHINLAPVLDLRVNANSHFMEGRCLGDDP 149 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSS 199 + R+I + AG+ T KHFPG G D H P P E +D+ F Sbjct: 150 LTVAELGCRWIKTLQGAGVSATAKHFPGLGLAELDPHHFAPVIRWPDQEAMQRDLLPFRK 209 Query: 200 LIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIM 258 I E + +M +H +Y +D PA+ SP +LR LGF G + SDD+ M + Sbjct: 210 AI-EAGVHCVMTSHALYPFIDSVWPATLSPAINNDLLRGTLGFRGTLLSDDMDMAAVSEK 268 Query: 259 GSYAERGQASLDAGCDMILVCN 280 S+ E + L A D L+C Sbjct: 269 YSWKEMAEQGLLATIDFFLLCQ 290 >UniRef50_A8SRF1 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SRF1_9FIRM Length = 416 Score = 134 bits (338), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 86/268 (32%), Positives = 140/268 (52%), Gaps = 11/268 (4%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE 73 DA ++ + A+P VGG+I F N+ + Q R+++ ++ S+ L +A D+EGGRV R + Sbjct: 112 DATKQALQAYP-VGGIIYFANNFDNVDQTRDMISNTQSYSKIGLFIATDEEGGRVNRIMD 170 Query: 74 --GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAA 129 G T + + ++G A E +AS+M A+ ++ FAPV DV + Sbjct: 171 TLGTTYIGPMYDYK-----DDGPDAAFENAKTIASDMSALGFNLDFAPVADVWSNPDNTV 225 Query: 130 IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEI 189 IG+R+Y D +A + + G + G+ T KHFPGHG DSH + R +A+I Sbjct: 226 IGDRAYSDDYAQAAELVGSAVKGFQDGGVICTLKHFPGHGDTAEDSHYSSAYVNRTKAQI 285 Query: 190 RAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDD 249 A +M F + I + + +M H+I D+D P++ S +LR ++ FDGV +D Sbjct: 286 MADEMQPFRTGI-DAGAELVMVGHLIVPDIDELPSTLSYKIATEMLRGDMKFDGVAITDS 344 Query: 250 LSMEGAAIMGSYAERGQASLDAGCDMIL 277 + ME A A+ ++ AG DM+L Sbjct: 345 MEMESIADNYGVADSAVMAVQAGMDMLL 372 >UniRef50_Q08U63 Beta-glucosidase n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08U63_STIAU Length = 611 Score = 134 bits (336), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 111/346 (32%), Positives = 165/346 (47%), Gaps = 42/346 (12%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL--VVAV 61 +M+ G LD E ++ VGG+ LF N H Q+ L +R + + VA+ Sbjct: 80 MMVGFRGTVLDEEVESLVRGRRVGGVCLFKHNIHSARQVARLNDGLRRLLADHIPPFVAL 139 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 DQEGG V R + RLP + A E LA AG ++ + +++ APVL Sbjct: 140 DQEGGNVVRVSDQVVRLPGNMALGATRSTE----LAYAAGRAQGEDLRRLGFNMNLAPVL 195 Query: 122 DVG---HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV H + IG RSY + F G EAG+ T KHFPGHG+ + DSH+ Sbjct: 196 DVNLNPH-NPVIGIRSYGDSVSLVSEMGRAFARGQQEAGLVTVAKHFPGHGSTSTDSHEV 254 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLR 236 P + E+ + M F ++++E LD +M AHV + D PA+ SP L+ +LR Sbjct: 255 LPVMRETREEVLTQ-MEPFRAVLQEG-LDGLMTAHVAIPGLTGDSVPATLSPQVLEGLLR 312 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQA---SLDAGCDMILVCNNRKGAVSVLDNLS 293 ++LGFDG++ +D+L ME AI+ Y G+A ++ AG DM+LV Sbjct: 313 RDLGFDGLVLTDELEME--AIVQRYGV-GRAAVLAMKAGADMVLV--------------- 354 Query: 294 PIKAERVTRLY-------HKGSFSRQELMDSARWKAISTRLNQLHE 332 P + E+ T +Y H+G + L + R I+ L E Sbjct: 355 PWRPEKKTEVYEALLDAAHEGELPPERLEQAVRRILIAKLRRGLFE 400 >UniRef50_C3RKZ3 Beta-hexosamidase A n=5 Tax=Bacteria RepID=C3RKZ3_9MOLU Length = 1092 Score = 134 bits (336), Expect = 7e-30, Method: Composition-based stats. Identities = 95/289 (32%), Positives = 149/289 (51%), Gaps = 32/289 (11%) Query: 13 LDAEEREILAHPLVGGLILFTRNYHDPAQLR-ELVRQIRAASRNR--------LVVAVDQ 63 ++ E EI+ +G +ILF N++ A++ EL ++ A+ ++ L++A DQ Sbjct: 79 INDEIHEIIGEYDIGSVILFAANFNSDAKVNVELTDGLQKAAMDKDLGKNSIPLLIATDQ 138 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R G T LP + A E A +AG ++ SE+ A+ ++++FAP DV Sbjct: 139 EGGIVYRLTGG-TALPGNMALGASGNTEN----AVKAGNIIGSELNAVGVNVNFAPDADV 193 Query: 124 GHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG RS+ ++PQ A S +I+G+ + T KHFPGHG V DSH P Sbjct: 194 NNNPNNPVIGLRSFSSNPQLAAKFVSAYIEGVQSNNVATAAKHFPGHGNVATDSHTGLPS 253 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHV---------IYSD-----VDPRPASGS 227 P + E+ ++ F + I + D +M AH+ IYSD + P PA+ S Sbjct: 254 VPATKEELYKTELVPFQAAI-DAGTDMVMTAHIQFPNIVTEEIYSDKKDELMSP-PATLS 311 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMI 276 L +LR E+ FDGVI +D ++M+G A ER ++ AG D++ Sbjct: 312 REILTDLLRDEMKFDGVIVTDSMTMQGVANYFDTNERNLLAVKAGVDIL 360 >UniRef50_B0MSB5 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MSB5_9FIRM Length = 406 Score = 133 bits (335), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 89/273 (32%), Positives = 142/273 (52%), Gaps = 9/273 (3%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF-R 72 DA I +P VGG+ LF+RN QL + ++++S+ L +AVD+EGGRV R Sbjct: 99 DAVISRIEEYP-VGGIALFSRNITSAEQLPMFISDLQSSSKYPLFIAVDEEGGRVARIAN 157 Query: 73 EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAI 130 F + + +S + + K A+E G + S + + ++ FAPV D + I Sbjct: 158 SDFFNVASYKSMEDIGKSGDASK-AEEVGRQIGSYLKELGFNLDFAPVADTNTNPQNIVI 216 Query: 131 GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIR 190 G+RSY +DP + S +DGMH++G+ T KHFPGHG D+H + E++ Sbjct: 217 GDRSYGSDPALVARMVSAQLDGMHDSGIMGTLKHFPGHGDTKDDTHSGYVSIEKTWDELK 276 Query: 191 AKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGVIFSD 248 ++ F + + K D +M +H+ +V D P S S + LR ELG+DGVI +D Sbjct: 277 ECELVPFITALP--KADMVMVSHITAVNVTSDKLPTSMSETMITGKLRNELGYDGVIITD 334 Query: 249 DLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 ++M A + AE ++ AG D++L+ N Sbjct: 335 AMAMGAVADNYTSAEAAVTAVKAGVDIVLMPQN 367 >UniRef50_A5CZV7 Beta-glucosidase-related glycosidases and D-alanyl-D-alanine dipeptidase n=4 Tax=Bacteria RepID=A5CZV7_PELTS Length = 1139 Score = 133 bits (335), Expect = 8e-30, Method: Compositional matrix adjust. Identities = 94/283 (33%), Positives = 143/283 (50%), Gaps = 22/283 (7%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-RLVVAVDQEGGRVQR 70 E++ E + +GG+ILF N Q LV Q++ AS + L +AVDQEGGRV R Sbjct: 591 EINGEIAATIKKYHLGGVILFRENVAGTGQTAGLVAQLQKASGDIPLFIAVDQEGGRVVR 650 Query: 71 FREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISA 128 + G T +P + A E+ +A+ G E+ A+ I++ APV+DV + Sbjct: 651 LQTG-TVMPGNMALGAAGSPEDTYAVAKAIG----EELNALGINVDLAPVVDVNINPDNP 705 Query: 129 AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAE 188 IG RS+ +DPQ + +I+G+ +AG+ T KHFPGHG DSH P P A Sbjct: 706 VIGIRSFGSDPQLVADMGVAYINGLRDAGVIATVKHFPGHGDTDVDSHLGLPSVPHGPAR 765 Query: 189 IRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGSPYWLKTVL 235 ++ ++ F + K+D +M AHV + +D PA+ S L ++ Sbjct: 766 LKTVELKPFQQAF-DQKVDMVMTAHVTFPAIDNTKAISKNDGREISIPATLSHKVLTDLV 824 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 R+E+GFDGVI +D L M+ A E ++ AG D+ L+ Sbjct: 825 RKEMGFDGVIITDALEMKAIADHFGPREAVIGAVKAGADIALM 867 >UniRef50_Q67LG3 Putative beta-N-acetylglucosaminidase n=1 Tax=Symbiobacterium thermophilum RepID=Q67LG3_SYMTH Length = 637 Score = 133 bits (335), Expect = 8e-30, Method: Compositional matrix adjust. Identities = 104/288 (36%), Positives = 152/288 (52%), Gaps = 18/288 (6%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR------LVV 59 + + G EL E R +LA VGG+I F R DP LR L +++AA+ R LV+ Sbjct: 107 VGLPGTELTDEIRGLLAEGKVGGVIFFGRQGSDPDVLRRLSGELQAAAAARDRATPGLVI 166 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 AVD EGG VQRF FT+ P A + A A+ G MA E++A+ ++++ AP Sbjct: 167 AVDHEGGLVQRFGPPFTQWPGAMALGAAGSSAH----AEAMGRAMARELLAVGVNMNLAP 222 Query: 120 VLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 V DV + A IG RS+ P+ +A+ G+ G+ KHFPGHG DSH Sbjct: 223 VADVNNNPANPVIGTRSFGEQPELVARMAAALAAGLQAEGVSAVAKHFPGHGDTAVDSHL 282 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVL 235 + P + + ++ F + + +DA+M AHV++ V D RPAS SP + +L Sbjct: 283 DLPVIDHARERLDRVELVPFRAAVGAG-IDAVMVAHVVFPAVAGDGRPASLSPDVVTGLL 341 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYA-ERGQA-SLDAGCDMILVCNN 281 + ELGF GV +D L AAI +Y E G +++AG D++LV + Sbjct: 342 KGELGFAGVAMTDALDGM-AAITDTYGVEEGLVLAVEAGADVLLVTES 388 >UniRef50_C8RZL5 Beta-N-acetylhexosaminidase n=1 Tax=Rhodobacter sp. SW2 RepID=C8RZL5_9RHOB Length = 344 Score = 133 bits (335), Expect = 9e-30, Method: Compositional matrix adjust. Identities = 103/292 (35%), Positives = 145/292 (49%), Gaps = 19/292 (6%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +L G L A E + G ILF RN P QLR L +RAA V V Sbjct: 5 GATVLGCAGTALTAAESDFFRDADPFGFILFARNVETPEQLRRLTGALRAAVGRDAPVFV 64 Query: 62 DQEGGRVQRFREGFTR--LP----AAQSFAAL---SGMEEGGKLAQEAGWLMASEMIAMD 112 DQEGGRVQR R R LP AQ+ AA G+ ++ L+A+E+ ++ Sbjct: 65 DQEGGRVQRLRAPHWREWLPPMDAVAQTIAAAPAGQGLTAAARMLWLRYRLIAAELRSVG 124 Query: 113 IDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEA-GMKTTGKHFPGHG 169 ID + AP DV + R Y DP + +A+ SR + H A G+ KH PGHG Sbjct: 125 IDGNCAPCADVLTPDTHPFLKNRCYSDDP-RIVALLSRAVAEAHLAGGVLPVVKHLPGHG 183 Query: 170 AVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPY 229 ADSH + PR + + S F++ L M AH++++ D +PA+ S Sbjct: 184 RAVADSHLDL---PRVTVDAETLESSDFAAFRPLRDLPLAMTAHIVFTAYDDQPATCSAP 240 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 +K V+R +GF G++ +DDL+M+ A+ G+ AE+ SL AGCD+ L C Sbjct: 241 MIK-VIRDNIGFAGLLLTDDLNMQ--ALPGTLAEKTTRSLAAGCDIALHCKG 289 >UniRef50_C8PT26 Glycosyl hydrolase, family 3 n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PT26_9SPIO Length = 520 Score = 133 bits (335), Expect = 9e-30, Method: Compositional matrix adjust. Identities = 84/281 (29%), Positives = 144/281 (51%), Gaps = 11/281 (3%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAA----SRNRLVVAVDQEGGRVQRFREGFTRLPAA 81 VG +ILF N AQL+ L I+A + + + +DQEGG V R E + +P A Sbjct: 36 VGNVILFKHNIKSLAQLKALCADIQALITAETGHSAFITIDQEGGAVSRLAEDYYNVPGA 95 Query: 82 QSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHADP 139 + AA +G E A AG + + A ++ APVLDV + IG RSY +P Sbjct: 96 MALAA-TGDERN---AYTAGKITGEILRAHGVNFDLAPVLDVNSNPKNPVIGVRSYGDNP 151 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSS 199 + I G+ G+ + KHFPGHG + DSH P + +AE+ +++ F++ Sbjct: 152 ALVDRYGIQMIRGLTAGGVLSAAKHFPGHGDTSVDSHLTLPTVRKSKAELEQTEIAPFAA 211 Query: 200 LIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIM 258 I+ L A+ +H+++ +D + PA+ S + +L++ELGF+G++ SD + M+ Sbjct: 212 AIKTGLLPAVTISHILFPAIDEQFPATISKKIVTGLLKEELGFNGLVISDCMEMKAMQTF 271 Query: 259 GSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 E A+++AG ++I + + A + ++ ++ AE Sbjct: 272 FGTEESAAAAINAGIELIFISHTIPTACAAIETVTRACAEN 312 >UniRef50_C8WFM5 Beta-N-acetylhexosaminidase n=3 Tax=Zymomonas mobilis RepID=C8WFM5_ZYMMN Length = 344 Score = 133 bits (334), Expect = 9e-30, Method: Compositional matrix adjust. Identities = 102/340 (30%), Positives = 167/340 (49%), Gaps = 24/340 (7%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M PV + + L +E+++ G ILF RN D QL++L +RA S+ + + Sbjct: 1 MKPVFIGIADTRLSDQEKKLFRQYEPAGYILFQRNITDRTQLKKLTADLRANSQQNIPIL 60 Query: 61 VDQEGGRVQRFREG-FTRLPAAQSF------AALSGMEEGGKLAQEAGWLMASEMIAMDI 113 +DQEGGRV R + + PAA F A +S +E + L+ + I Sbjct: 61 IDQEGGRVARMKPPVWPAFPAAARFDELYQKAPVSAIEAVRANSHALALLLQD----VGI 116 Query: 114 DISFAPVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 ++ P+LD+ IG+RS+ +P++ A+ +DG+ + + KH PGHG Sbjct: 117 NVDCMPLLDIRDPEGDNIIGDRSFGHNPEQVAALGRAVLDGLAKGSVCGIIKHIPGHGQA 176 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRP-ASGSPYW 230 DSHK P P EI +D+ F L R K+ M AH+IY D AS S Sbjct: 177 KVDSHKALPVVDSP-IEILERDIRPFQHL-RNAKM--AMTAHIIYQAWDKESCASYSSKI 232 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV-SVL 289 + ++R+++GF G++ SDD+ M A+ G R DAGCD+ L C+ + + ++L Sbjct: 233 IHDIIRKKIGFKGLLMSDDIGMN--ALDGDPVTRALRVFDAGCDVALHCSGKFDDMKAIL 290 Query: 290 DNLSPIKAER---VTRLYHKGSFSRQELMDSARWKAISTR 326 +N+ I+ E + ++ ++ S + +D A KA+ R Sbjct: 291 ENMDEIRPESFAALEKVTNEFSSASDFDLDRAENKALFDR 330 >UniRef50_B0SE07 Beta-glucosidase-related glycosidase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SE07_LEPBA Length = 588 Score = 133 bits (334), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 86/263 (32%), Positives = 142/263 (53%), Gaps = 16/263 (6%) Query: 27 GGLILFTRNYHDPAQLRELVRQI-RAASRNRL---VVAVDQEGGRVQRFREGFTRLPAAQ 82 GG+ILF +N ++ L + R+A N L +++ DQEGGRV R ++G T P A Sbjct: 85 GGIILFGKNLGKKEEILSLTENLQRSAKENGLPPFLISTDQEGGRVFRVQDGITHYPGAM 144 Query: 83 SFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHADPQ 140 + +G +E G++ G++ + E+ ++ ++ FAPVLD+ + ++ I RS+ +D Sbjct: 145 AVGQ-TGKKEWGEIV---GFVTSYELRSLGLNFLFAPVLDINNNPLNPVINTRSFGSDVH 200 Query: 141 KALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSL 200 + +A + G G KHFPGHG T DSH P + E+ ++ F Sbjct: 201 RVSDVAVAYERGARAGGCLPVIKHFPGHGDTTVDSHLGLPIINKSLTELEKLELVPFQES 260 Query: 201 IRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMG 259 IR +A+M AH++Y +DP PA+ S L VLR+ L FDG+I +D +ME AI Sbjct: 261 IR-GGAEAVMSAHIVYPKIDPNFPATLSKTILTDVLRKSLQFDGIIITD--AMEMHAISK 317 Query: 260 SYAER--GQASLDAGCDMILVCN 280 +Y + G ++ AG +++L+ + Sbjct: 318 NYEKDRPGVLTILAGANIVLLTS 340 >UniRef50_C5CH66 Beta-N-acetylhexosaminidase n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CH66_KOSOT Length = 504 Score = 133 bits (334), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 89/294 (30%), Positives = 149/294 (50%), Gaps = 16/294 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ + G +DAE +IL G +ILF+RN +P+Q+++ + + + + ++A+DQE Sbjct: 12 MIGLPGKSMDAETVKILERVNPGSIILFSRNIENPSQVKDFIDECKRVLGYKPLIAIDQE 71 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG V R +GF P A + AA E A A ++A EM A+ ID APV+D+ Sbjct: 72 GGIVTRLTKGFAISPGAMAIAATGNSEN----AFIAASILAKEMRAVGIDWDLAPVVDIN 127 Query: 125 HISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + G RS+ +D + +F G+ E G+ KHFPG G V+ D+H + P Sbjct: 128 NNPNNPGIGVRSFSSDVNTVVEFGRKFTKGLQENGIIACLKHFPGKGRVSVDAHIDMPVL 187 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAI--MPAHVIYSDVD--PRPASGSPYWLKTVLRQE 238 E++ ++ F K++A+ MP+H+ + PA+ S L ++R++ Sbjct: 188 DVSYDELKKFELIPFK------KIEALSWMPSHIYIPALQHTKIPATLSEEILTELVRKK 241 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 L + GV+ +DDL M G E + AG D++ VC+N ++V + L Sbjct: 242 LHYKGVLIADDLLMGGITNYFPVEEATLKAFAAGMDILTVCHNDDIQIAVKNYL 295 >UniRef50_Q1AUI8 Glycoside hydrolase, family 3-like protein n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AUI8_RUBXD Length = 386 Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 92/261 (35%), Positives = 141/261 (54%), Gaps = 14/261 (5%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNR-----LVVAVDQEGGRVQRFREGFTRLPA 80 VGG++LF N Q+R L +++ S L+VAVDQEGG VQ + P+ Sbjct: 88 VGGVLLFGYNMRSLPQVRRLTGELQRLSMATEPAVPLLVAVDQEGGEVQSA-PWVSPQPS 146 Query: 81 AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAIGERSYHADPQ 140 A A E ++A+ G SE++ ++ APV+D G AAIG RSY +P+ Sbjct: 147 AAEIGASGDPEAARRVARRIG----SELLEAGVNTDLAPVVDTG-FGAAIGSRSYGENPE 201 Query: 141 KALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSL 200 + S ++G EAG+ + KHFP HG T+DSH P + ++D+ F++ Sbjct: 202 LVSRMGSAAVEGFREAGVISAAKHFPNHGPATSDSHTGLPIVDHDMDTVLSRDLPPFAAA 261 Query: 201 IRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMG 259 +R + +M H++Y +D PAS SP ++ +LR+ LGFDGV+ +DDL+M GA G Sbjct: 262 VRAG-VPMVMVGHLLYPAIDQENPASMSPQAIR-LLRERLGFDGVVITDDLAMAGARQGG 319 Query: 260 SYAERGQASLDAGCDMILVCN 280 S A ++ AG DM++V + Sbjct: 320 SVARAAVRAVSAGADMLIVSS 340 >UniRef50_C8PQ37 Glycosyl hydrolase, family 3 n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PQ37_9SPIO Length = 561 Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 91/261 (34%), Positives = 140/261 (53%), Gaps = 15/261 (5%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVVAVDQEGGRVQRFREGFTRLPA 80 +G + +F N D +L + V ++ + R L VA DQEGG ++ + + P Sbjct: 89 LGSIKIFGWNTEDSVELAKAVSLLQKKANGTRFRIPLFVATDQEGGWIRHVKGRTSETPG 148 Query: 81 AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIGERSYHAD 138 + A SG A G+ + E+ A+ I+++FAP +D+ H S+ IG RS+ Sbjct: 149 NLAIGA-SGYPID---AYYEGFYIGRELSALGINLNFAPTVDLYTNHESSIIGPRSFGQS 204 Query: 139 PQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFS 198 P+ A +A F+ G ++AG+ TT KHFPGHG DSH + P + +++ F Sbjct: 205 PEAAGILARAFMQGSNKAGVLTTAKHFPGHGDTALDSHGKLPSIAISKETFYNRELVPFQ 264 Query: 199 SLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAA 256 LI + + AIM H+ + V PAS SPY L+TVLR +LGF G+I +DD+ M GA Sbjct: 265 YLI-DADVPAIMTGHLNFPAVSEHGEPASFSPYLLQTVLRGQLGFKGLIITDDMMMHGAI 323 Query: 257 -IMGSYAERGQASLDAGCDMI 276 G+ A+ Q +L+AG D+I Sbjct: 324 RYAGTVAKAVQLALEAGNDII 344 >UniRef50_A8VQ50 Choline dehydrogenase n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8VQ50_9BACI Length = 535 Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 97/310 (31%), Positives = 154/310 (49%), Gaps = 23/310 (7%) Query: 18 REILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-----LVVAVDQEGGRVQRFR 72 +E++ VG +ILF RN P ++R L +++ +R+ L ++VDQE G V+R Sbjct: 30 KEMIETHRVGNIILFGRNLGSPGEIRSLTGKLQKIARDAGHERPLFISVDQENGAVRRLG 89 Query: 73 EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAI 130 EG T P A + A E LA E G+ A E+ A+ I+ + APV DV + + I Sbjct: 90 EGTTVFPGAMTTGATGEPE----LAYECGYATALELKALGINWNLAPVADVNNNRHNPVI 145 Query: 131 GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIR 190 G RS+ DP++A A A+ ++GM +A + T KHFPGHG DSH P P + Sbjct: 146 GVRSFSEDPEQAAAFAAASMNGMQKANIMTAVKHFPGHGDTEVDSHLSLPVIPHDMERLN 205 Query: 191 AKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWLKTVLRQELGFDGVIFS 247 ++ F I E+ D IM +H+ + ++ P + S L +LR ++GF+G+I + Sbjct: 206 HVELVPFRRCI-ESGADVIMSSHIYFPALEKEKNLPVTLSHEVLTNLLRVKMGFEGLITT 264 Query: 248 DDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKG 307 D L M+ + A ++ AG D ++ +S+ +L E V R KG Sbjct: 265 DCLEMDAISESVGTAVGAVKAIHAGVDFVM--------ISMRTDLQKQALEAVKRDIDKG 316 Query: 308 SFSRQELMDS 317 S + +S Sbjct: 317 LISESRIEES 326 >UniRef50_B1YFC7 Glycoside hydrolase family 3 domain protein n=6 Tax=Firmicutes RepID=B1YFC7_EXIS2 Length = 646 Score = 132 bits (331), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 87/272 (31%), Positives = 133/272 (48%), Gaps = 25/272 (9%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNR------LVVAVDQEGGRVQRFREGFTRLP 79 +GG+ILF N + Q +LV +R L V +DQEGG V R G T LP Sbjct: 84 LGGVILFAENVSETEQTTKLVHDLREVVEQDPSNDIPLFVTIDQEGGIVTRLGTG-TNLP 142 Query: 80 AAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHA 137 + G A+ AG ++ SE+ A+ I+++F PVLDV + + IG RS+ + Sbjct: 143 GNMAL----GATRSSAYAEAAGGIIGSELHALGINVNFGPVLDVNNNPGNPVIGVRSFSS 198 Query: 138 DPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVF 197 DP + S G+ E G+ T KHFPGHG DSH P + AE++ ++ F Sbjct: 199 DPNLVGELGSAMTQGIQEQGVAATAKHFPGHGDTAVDSHYGLPVVDKSLAELKQLELIPF 258 Query: 198 SSLIRENKLDAIMPAHVIYSDVD-----------PRPASGSPYWLKTVLRQELGFDGVIF 246 I E +D +M H+ ++ P PA+ S + VLR+ELG+ G++ Sbjct: 259 KRAIDEG-IDMVMTGHIGMPQIEDEVVESKQGTFPLPATLSDDVITGVLREELGYQGLVI 317 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 +D L+M+ A + +E + AG D+ L+ Sbjct: 318 TDALNMQAIADNFTESEAVIKTFKAGVDIALM 349 >UniRef50_A6TMT8 Glycoside hydrolase, family 3 domain protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TMT8_ALKMQ Length = 434 Score = 132 bits (331), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 82/262 (31%), Positives = 139/262 (53%), Gaps = 14/262 (5%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA 85 +GG+ILF N Q ++L+ ++ SR + +AVD+EGGRV R ++LPA Q Sbjct: 105 LGGIILFAENIESIDQTKKLIDDMQKNSRLPMFIAVDEEGGRVTRLNHSRSQLPATQ--- 161 Query: 86 ALSGMEEGGK-----LAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHAD 138 L G E GK L+ E G L+ E++++ +++ APV DV + IG+RS+ + Sbjct: 162 -LPGNEVLGKTKDPTLSYEVGRLLGRELLSLGFNMNLAPVADVNTNAKNPVIGDRSFSDN 220 Query: 139 PQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFS 198 PQ+ +AS G+ + KHFPGHG D+H + +++ + F Sbjct: 221 PQEVGIMASEMARGLQNENVSAVFKHFPGHGDTEFDTHHQAVVINHDLERLQSVEWVPFR 280 Query: 199 SLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAA 256 I E +DAIM AH+ + + PA+ S + +LR+E+G++G++ +D L M + Sbjct: 281 KGI-EVSVDAIMTAHIQMPQITGNDFPATLSSKIITEILREEMGYEGMVITDALEMSAVS 339 Query: 257 IMGSYAERGQASLDAGCDMILV 278 + AE +++AG D++L+ Sbjct: 340 QHYTSAEAAVLAIEAGVDILLM 361 >UniRef50_UPI0001C42D84 glycoside hydrolase family 3 domain protein n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42D84 Length = 699 Score = 131 bits (330), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 88/269 (32%), Positives = 139/269 (51%), Gaps = 22/269 (8%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNR-LVVAVDQEGGRVQRFREGFTRLPAAQSF 84 +GG+ILF N Q +LV +AAS L++ +DQEGG V R + G T +P + Sbjct: 191 LGGVILFRENVVTTEQTTKLVADYQAASEKFGLLMTIDQEGGIVTRLQSG-TDMPGNMAL 249 Query: 85 AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKA 142 A E LA+ G ++ SE+ ++ I+++FAPV+DV + + IG RS+ DP+ Sbjct: 250 GATRSPE----LAENVGRVIGSELHSLGINMNFAPVMDVNNNPDNPVIGVRSFGEDPKLV 305 Query: 143 LAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIR 202 + + G+ +G+ T KHFPGHG DSH P P +R ++ F + Sbjct: 306 ADMGVAYTHGLQSSGVAGTAKHFPGHGDTAVDSHLGLPEVPHDIERLREVELYPFQQAM- 364 Query: 203 ENKLDAIMPAHVIYSDVDPR-------------PASGSPYWLKTVLRQELGFDGVIFSDD 249 + +DAIM AHV + ++D PA+ S L ++R+++GFDGVI +D Sbjct: 365 DAGIDAIMTAHVTFPNIDDTKVISQKTGEEIAIPATLSYKVLTELMREDMGFDGVITTDA 424 Query: 250 LSMEGAAIMGSYAERGQASLDAGCDMILV 278 L+M A + ++ AG D+IL+ Sbjct: 425 LNMLAIADHFGPVDAAVRAVQAGSDIILM 453 >UniRef50_Q982A1 Glycosyl hyrolase, family 3 n=2 Tax=Mesorhizobium RepID=Q982A1_RHILO Length = 394 Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 90/286 (31%), Positives = 152/286 (53%), Gaps = 16/286 (5%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF--REGFTRLPAAQS 83 + G+I RN +R+L + ++ + L+VA+DQEGGR+QR GF +P+ + Sbjct: 80 ITGVIYLGRNISSLEAVRQLNKGLQQYATAPLLVAIDQEGGRIQRLTGEVGFREVPSEAT 139 Query: 84 FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD--VGHISAAIGE--RSYHADP 139 A +E + Q+ +AS++ ++ +++ APV+D V + IG+ RS+ ADP Sbjct: 140 VAKTMSPDEASVVYQD----LASDLSSLGFNLNLAPVVDLNVNPANPIIGKLGRSFSADP 195 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSS 199 QK A A FI+ G+ T KHFPGHG+ T DSHK + ++ K++ + Sbjct: 196 QKVEAYARAFIEAHRAKGVLTALKHFPGHGSSTKDSHKGIADVTKTWSD---KELLPYKG 252 Query: 200 LIRENKLDAIMPAHVIYSDV--DPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAA 256 LI +D +M AHVI + + P PAS S L T+LR + F GV+ SDD+ M+ Sbjct: 253 LIASGDVDMVMAAHVINAKLAFSPDIPASLSRATLTTLLRDNMHFKGVVISDDMQMQAIT 312 Query: 257 IMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 S+ + ++ AG D+++ N++ ++ + ++ I +E R Sbjct: 313 DNVSFEDSVLRAVMAGNDILIFANDKHPDPAIPEKVAAILSEEAQR 358 >UniRef50_D2AR90 Beta-glucosidase-related glycosidase-like protein n=20 Tax=Actinomycetales RepID=D2AR90_STRRD Length = 520 Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 97/298 (32%), Positives = 146/298 (48%), Gaps = 17/298 (5%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF--REGFTRLPAAQS 83 +GG +LF RN PAQ ELV +R +VVAVD+EGG V R R G + P ++ Sbjct: 37 LGGAVLFARNLAGPAQTAELVAGLRR-ENPAVVVAVDEEGGAVTRLEARTG-SSWPGNRA 94 Query: 84 FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERSYHADPQK 141 E ++A+E G L+AS DI + +APV+DV A IG RS+ DP+ Sbjct: 95 LGVADDAERTERVAREIGRLLAS----ADITLDYAPVVDVNANPANPVIGIRSFGPDPEL 150 Query: 142 ALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLI 201 + +I G+ AG+ KHFPGHG DSH P ++ +D+ F + + Sbjct: 151 VSRQTTAWITGLQGAGVAACAKHFPGHGDTVTDSHHALPTVHADLELLQERDLPPFRAAV 210 Query: 202 RENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGS 260 + + A+M H++ +DP PA+ S L +LR+E+GF G++ +D + M A + Sbjct: 211 KAG-VQAVMCGHLLVPALDPGNPATLSRRILTGLLREEMGFGGMLVTDAIEMGAVAALHP 269 Query: 261 YAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA 318 E +L AG D I V + G SV + + R H G + L ++A Sbjct: 270 PGEIAVRALAAGVDAICVGVSSPGGESVY-----ALRDAIVRAVHDGRLPEERLAEAA 322 >UniRef50_A3EUK5 Beta-N-acetylhexosaminidase n=2 Tax=Leptospirillum sp. Group II RepID=A3EUK5_9BACT Length = 342 Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 93/297 (31%), Positives = 152/297 (51%), Gaps = 13/297 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLV-VAVD 62 + + + G L AE+ L GG++LF N +++L+ +IR A + V +A+D Sbjct: 15 LWVSLPGARLGAEDIRWLHQIRPGGVVLFAENGTRAEDVKKLISEIREALFPQDVWIAID 74 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 QEGGRV R +EG RL A++ G + ++ + A + + + IDI FAPVLD Sbjct: 75 QEGGRVARLKEGVPRLSPARTL----GNGDPSEIFRSA-FHLGRALRQTGIDIDFAPVLD 129 Query: 123 V--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG+RS+ +P + ++ F G+ G+ GKH PGHG DSH P Sbjct: 130 VDSNPDNPVIGDRSFSREPWEVARLSMAFSHGLEAGGVIPCGKHVPGHGDTLLDSHLALP 189 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQEL 239 P+A + ++ F I E + +M AHV+Y +D + PA+ S ++ +LR++ Sbjct: 190 TVDAPEAILSGREFLPFEKAIAEG-MPMLMTAHVMYPALDKKWPATLSRKIMQDMLRKKW 248 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKG---AVSVLDNLS 293 F G++ SDDLSM ++ + +L CD +LV +R A+ ++ LS Sbjct: 249 DFRGLLLSDDLSMAAVGTPFPFSVVVERALSVTCDGLLVLKSRDKPLEAIRIIKGLS 305 >UniRef50_B3DX93 Beta-glucosidase-related glycosidase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DX93_METI4 Length = 373 Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 83/284 (29%), Positives = 145/284 (51%), Gaps = 16/284 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ + G L ++ + + G I F+RN +P R L+ Q+R+ + + A+DQ Sbjct: 18 LIIGLPGAHLSQDQIQTIREIQPSGFIFFSRNLENPLSFRNLLEQLRSLVEHEPIFAIDQ 77 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGGRV R + + P A+ + ++ L +E G L + ++ APVLD+ Sbjct: 78 EGGRVSRLKAFGSEPPGAKELGEKADLD----LLREHGELTGKLLRLFGLNFDLAPVLDM 133 Query: 124 GHIS---AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 ++ RS+ DPQ A FI+G+ G+ GKHFPG+ D H E P Sbjct: 134 DTSEREHNSLKGRSFAPDPQDVSRFAKAFIEGLKSQGILCCGKHFPGYSFAQCDPHLELP 193 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQE 238 + + E+ + + F +L +N D+ M AH+ ++DP+ PA+ S + +++RQE Sbjct: 194 QSFKSKKELESFEWIPFRAL--QNSCDSFMIAHIRNFNLDPQGLPATLSSRMI-SIIRQE 250 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERG--QASLDAGCDMILVCN 280 F ++ SDD+ M AI+ Y+ + + SL AG D++L+C+ Sbjct: 251 WQFTQLLVSDDIDM--GAIINHYSLKDTLELSLKAGIDILLLCH 292 >UniRef50_B6BGC2 B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BGC2_9PROT Length = 555 Score = 131 bits (329), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 99/336 (29%), Positives = 168/336 (50%), Gaps = 36/336 (10%) Query: 14 DAEEREILAHPLVGGLILFTRNYHD---------PAQLRELVRQIRAASRNRLVVAVDQE 64 + E E + +GG++LF R Y P QL+EL +++ SR L+++VDQE Sbjct: 41 NTEIVEYIQQYELGGVVLFDRFYKKRSQIKNIDTPEQLQELTAKLQKFSRKPLLISVDQE 100 Query: 65 GGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV-- 120 GG + R + GF R P+A+ + LS + + + ++++ +I+ +FAPV Sbjct: 101 GGNIARLKHYYGFDRFPSAKEVSYLS-LTQAKDIYNRQSKMLSNN----NINCNFAPVVD 155 Query: 121 LDVGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ + I E RS+ D +K A ID + + + KHFPGHG+ DSHK Sbjct: 156 LDINPKNKVISELERSFGKDSEKVTHYAKIVIDSQTKHNVISVLKHFPGHGSSLGDSHKG 215 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQ 237 ++ K++ + +LI + D IM AHV S++D PA+ S +LRQ Sbjct: 216 FVDISNTWSQ---KELVPYQNLINSKEADMIMSAHVFNSNLDALYPATLSYSTNTKLLRQ 272 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ++GF GV+ SDDL M+ + E ++++G D IL+ N+ + V + + I A Sbjct: 273 KMGFTGVLISDDLQMKAITDHYNLKESVTLAINSGVD-ILLFGNQLASNDVKELVEIIYA 331 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + G+ S++ +++S R R+ LH + Sbjct: 332 Q-----VKSGAISKKRIIESNR------RIENLHTK 356 >UniRef50_UPI00016BFC75 Beta-glucosidase-related glycosidase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFC75 Length = 509 Score = 130 bits (328), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 87/268 (32%), Positives = 139/268 (51%), Gaps = 18/268 (6%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIR--AASRN--RLVVAVDQEGGRVQRFREGFTRLPAA 81 VG +ILF RN P Q+ +L++ ++ A N + + +DQE G V R EG P Sbjct: 33 VGNVILFERNCKTPEQILKLIQNLQKLAIKYNGVPMFIGIDQENGMVTRIGEGVCTFPGN 92 Query: 82 QSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAI--GERSYHADP 139 + A + EE ++A+ G ++ A+ I+ + AP LD+ + G RS+ +P Sbjct: 93 MAQANGATEEEIYQVAKYTGEMLE----ALGINYNLAPSLDINNNPNNPIIGIRSFGENP 148 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSS 199 + + I G +AG+ +T KHFPGHG DSH PC P + + ++ F + Sbjct: 149 ETVTNMGLACIKGFQDAGIISTAKHFPGHGDTAIDSHLTLPCIPHSKERLEKVELYPFKA 208 Query: 200 LIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIM 258 I + + +IM AH+I+ +P PA+ SP L +LRQ+LGF +I +D ME AI Sbjct: 209 AI-DAGVKSIMTAHIIFPAYEPVYPATLSPLVLTNLLRQDLGFHNLIITD--CMEMKAIS 265 Query: 259 GSYAERGQAS---LDAGCDMILVCNNRK 283 +Y + QAS + AG D++ + + K Sbjct: 266 DTYTTQ-QASVMAIAAGADIVCISHTEK 292 >UniRef50_A3DC81 Glycoside hydrolase, family 3-like protein n=3 Tax=Clostridium thermocellum RepID=A3DC81_CLOTH Length = 444 Score = 130 bits (327), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 83/260 (31%), Positives = 140/260 (53%), Gaps = 13/260 (5%) Query: 27 GGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF----REGFTRLPAAQ 82 GG+ILF+ N Q ++L+R ++ AS+ + +AVD+EGGR+ R + T++P+AQ Sbjct: 113 GGIILFSENIDTIPQTQKLIRDMQEASKIPMFIAVDEEGGRIARIGNNPKMHSTKIPSAQ 172 Query: 83 SFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQ 140 + G+ + +LA EAG ++ +E+ A+ +++FAPV DV + IG+RS+ +DP Sbjct: 173 TI----GLADDPELAYEAGRILGAELSALGFNMNFAPVADVNTNPDNPVIGDRSFGSDPY 228 Query: 141 KALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSL 200 K + GM E + T KHFPGHG + DSH +R +++ F Sbjct: 229 KVGLMVQAMSKGMQEQNVCTVLKHFPGHGDTSYDSHLGQVVINHDIERLRQIELTPFKMG 288 Query: 201 IRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIM 258 I+ D +M AH+I ++ PA+ S L +LR EL + +I +D + M+ + Sbjct: 289 IKAGA-DGVMTAHIIMPNITGSNLPATLSEEILSGLLRNELKHEKLIITDAMEMKAISNY 347 Query: 259 GSYAERGQASLDAGCDMILV 278 S ++ + AG D+IL+ Sbjct: 348 WSSSKAAVMAFKAGADIILM 367 >UniRef50_Q4FM08 Glycosyl hydrolase n=3 Tax=Candidatus Pelagibacter RepID=Q4FM08_PELUB Length = 316 Score = 130 bits (327), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 85/274 (31%), Positives = 137/274 (50%), Gaps = 13/274 (4%) Query: 13 LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLV-VAVDQEGGRVQRF 71 L +E++ ++ H G+ILF+RN + QL++L QIR ++ + +DQEGGRV R Sbjct: 18 LTSEKKFLIKHK-PWGIILFSRNIKNIEQLKKLTGQIRKIFNDKKYPILIDQEGGRVNRL 76 Query: 72 REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD---IDISFAPVLDVGHI-- 126 ++ F F +++ K E + ++ + I+I+ PVLDV Sbjct: 77 KKFFKADKFTGEFFGNLYLKDKKKFYIEYKKFINKTVLLLKNVGININTLPVLDVRSKGS 136 Query: 127 SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 SA IG+R++ + + I I H + T KH PGHG DSHK+TP Sbjct: 137 SAIIGDRAFSDNAKIVSEIGDICIKEFHNNKIATVIKHIPGHGLAKVDSHKKTPT---IN 193 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIF 246 E+R + F + +N L A M AH+IY +D + + ++R + F ++ Sbjct: 194 KELRYLNNHDFLAFKNKNSLFA-MTAHLIYKKIDKVNTATHSKKIIKLIRNLIKFRNILM 252 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 SDD+SM+ ++ GS A + S AGCD++L CN Sbjct: 253 SDDISMK--SLKGSIAINTKKSFSAGCDLVLHCN 284 >UniRef50_C6C0L9 Glycoside hydrolase family 3 domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6C0L9_DESAD Length = 373 Score = 130 bits (327), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 103/357 (28%), Positives = 169/357 (47%), Gaps = 46/357 (12%) Query: 4 VMLDVEGYELDAEEREI--LAHPLVGGLILFT---------RNYHDPAQLRELVRQIRAA 52 VM+ G EL A + + VGG+ILF+ RN +QL +L +++ Sbjct: 38 VMVGFRGLELSANNPVVGDIRDARVGGVILFSKDCALNSTVRNIASSSQLIKLTSDLQSY 97 Query: 53 SRNRLVVAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 ++ L +++DQEGG ++R GF P+A G + A AG + + Sbjct: 98 AQVPLFISIDQEGGIIKRLTGEMGFPETPSAAEL----GNSGDTQAAFRAGKTIGKSLKE 153 Query: 111 MDIDISFAPVLDVGHISA----AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 I+++FAPV+DV +A A +RS+ DP+ A +IDG+H G+ + KHFP Sbjct: 154 AGINMNFAPVVDVNRNAANPVIAALQRSFSDDPRIVARFADSYIDGLHSEGIISCLKHFP 213 Query: 167 GHGAVTADSHK--ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-P 223 GHG+ DSH D +E+ F LI +NK D +M AH+ + D + P Sbjct: 214 GHGSSRGDSHLGFTDVTDSWDSSELYP-----FQKLISDNKADMVMTAHIFNARWDRKYP 268 Query: 224 ASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRK 283 A+ S + +LR++LGF+G+I +DD+ M+ + + E ++ AG D++L NN Sbjct: 269 ATLSRNVIHGLLRRKLGFEGIIVTDDMQMQAVSGEYGFKEGVYRAVKAGADILLFGNN-- 326 Query: 284 GAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 + L + +L H+G + + R+ Q ++R EK G Sbjct: 327 --LIYEPGLGTKAVSTLKQLVHEGKITER-------------RIRQSYDRIMREKQG 368 >UniRef50_C5DA89 Glycoside hydrolase family 3 domain protein n=6 Tax=Bacillaceae RepID=C5DA89_GEOSW Length = 698 Score = 130 bits (326), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 91/294 (30%), Positives = 148/294 (50%), Gaps = 29/294 (9%) Query: 16 EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR-AASRNRLVVAVDQEGGRVQRFREG 74 E +E++ +GG+ILF N Q +LV + AA + L++++DQEGG V R + G Sbjct: 182 EIKELIQKYHLGGIILFRENVVTTEQTAKLVADYQQAAEKFGLLISIDQEGGIVTRLQSG 241 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGE 132 T +P + A E LA + G + E+ A+ I+++ APVLDV + + IG Sbjct: 242 -TNMPGNMALGATRSKE----LAWKVGHAIGEELHALGINMNLAPVLDVNNNPDNPVIGV 296 Query: 133 RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAK 192 RS+ +P+ + +I G+ ++G+ T KHFPGHG DSH P + ++ Sbjct: 297 RSFGENPELVAELGVSYIKGLQDSGVAATAKHFPGHGDTAVDSHLGLPEVSHDRDRLQKV 356 Query: 193 DMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGSPYWLKTVLRQEL 239 ++ F + E +DA+M AHV + VD PA+ S L ++R+E+ Sbjct: 357 ELYPFQKAM-EAGIDAVMTAHVTFPKVDDTKVISQKDGTAISLPATLSYKVLTELMREEM 415 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL-------VCNNRKGAV 286 GFDGVI +D ++M+ + + ++ AG D++L V N K AV Sbjct: 416 GFDGVIITDAMNMKAISDHFGPVDAAVRAVQAGADIVLMPVGLEEVANGLKKAV 469 >UniRef50_C1TNN0 Beta-glucosidase-like glycosyl hydrolase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TNN0_9BACT Length = 541 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 93/278 (33%), Positives = 138/278 (49%), Gaps = 11/278 (3%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR--NRLVVAVDQ 63 L G LD E R L GGLIL+ +N +Q+R L+ IR R L+VAVDQ Sbjct: 38 LSFSGTSLDEETRFRLEEIKPGGLILYGKNVESLSQVRSLLSSIRERVRLDIPLMVAVDQ 97 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R R P + A + LA+ G++M ++ ++ ID+ +APV+DV Sbjct: 98 EGGTVARIRGHGADFPGNMALGATGDPD----LAERQGFIMGRQLKSLGIDLDYAPVVDV 153 Query: 124 GH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG RS+ D ++ + I G A M + KHFPGHG V DSH P Sbjct: 154 NSNPSNPIIGVRSFGDDVATVSSMGASMIKGFSRARMGCSAKHFPGHGDVDIDSHLGLPV 213 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 R +R + F S + E + A+M AH++ + PA+ SP + ++LR E+G Sbjct: 214 LNRSLESMRYLEFPPFRSAV-EAGVPAVMTAHIVVPSLTGELPATLSPEAI-SLLRDEMG 271 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 F+GV+ SD + M + + +L AG D +L+ Sbjct: 272 FEGVVLSDSMGMRAISNGWGVPDAVVMALRAGVDFVLL 309 >UniRef50_A6Q7E8 Glycosyl hydrolase, family 3 n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6Q7E8_SULNB Length = 361 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 92/277 (33%), Positives = 145/277 (52%), Gaps = 36/277 (12%) Query: 26 VGGLILFTRNYHDPA---------QLRELVRQIRAASRN-RLVVAVDQEGGRVQRFREGF 75 +G +ILF N D + QL++L Q++ S + +L++AVDQEGG+VQR + + Sbjct: 54 IGAVILFDYNPVDKSKPKNIASRKQLKKLTAQLQKCSYDGKLLIAVDQEGGKVQRLKRKY 113 Query: 76 TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV-----GHISAAI 130 + ++ M++ K ++ MA E+ ++ I+ APV+D+ H+ + Sbjct: 114 GFYGNFPKASDVTRMDK--KQIKKTYRKMAKELKSVGINYDLAPVVDLDINPKNHVIHGL 171 Query: 131 GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK----ETPCDPRPQ 186 G RS+ DP+ A AS FID MH G+ T+ KHFPGHG+ D+HK T R + Sbjct: 172 G-RSFGKDPKTVAAYASTFIDAMHHYGVLTSLKHFPGHGSSVGDTHKGFVDVTNLWKRVE 230 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVI 245 E + K D +M AHV +D R PAS S + +LR++LGF+GV+ Sbjct: 231 LE---------PYRLLNRKADTVMVAHVFNKKLDARYPASFSNRTVNGILRKQLGFNGVV 281 Query: 246 FSDDLSMEGAAIMGSYAERG--QASLDAGCDMILVCN 280 +DDL M AI Y + + +++AG D++L N Sbjct: 282 ITDDLQM--GAISKKYGLKNTLKLAINAGDDILLFGN 316 >UniRef50_C1I263 Beta-hexosamidase A n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I263_9CLOT Length = 862 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 84/290 (28%), Positives = 146/290 (50%), Gaps = 31/290 (10%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR------LVVAVDQEG 65 E++ E EI+ +GG+ILF +N Q +L+ ++ + N L+ +DQEG Sbjct: 75 EMNDEVAEIIDEFDLGGVILFAQNVKTTEQTTKLIHDMQQVAINDQDGNLPLLTTIDQEG 134 Query: 66 GRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 G V R G T P + A + + + G ++ E+ ++ I+++FAP +D+ + Sbjct: 135 GIVTRLGTG-TNFPGNMAIGATKNEQS----SYDTGLVIGRELNSLGINVNFAPSVDINN 189 Query: 126 --ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 ++ IG RS+ ++P + + I G+ E G+ KHFPGHG DSH P Sbjct: 190 NPLNPVIGLRSFSSNPDLVAKLGVKMIKGIQENGVSVAAKHFPGHGDTATDSHYGLPKVD 249 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGSPYW 230 + E++ ++ F + I +N +D IM AH+ + ++ PA+ S Sbjct: 250 KSLEELKEAELKPFQAAI-DNGVDMIMTAHIQFPQIEKDTYTSIKDGQEIQIPATLSDDI 308 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSY--AERGQASLDAGCDMILV 278 L +LR+++ FDGVI +D L+M AAI ++ E + + DAG D+ L+ Sbjct: 309 LTGLLREDMNFDGVIITDALNM--AAISENFGEVETVKIAFDAGVDIALM 356 >UniRef50_Q1K0E5 Glycoside hydrolase, family 3-like n=1 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1K0E5_DESAC Length = 411 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 96/315 (30%), Positives = 159/315 (50%), Gaps = 29/315 (9%) Query: 26 VGGLILFT---------RNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFR--EG 74 +GG+ILF RN P+QL++L ++A S L++A+DQEGGR+ R + +G Sbjct: 76 LGGVILFDYDVQLRQSGRNIASPSQLKQLTSDLQALSNLPLLIAIDQEGGRIARLKPAQG 135 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV----GHISAAI 130 F P S L+ + +L E G +A+ + I+++ APV+D+ + A Sbjct: 136 F---PVTLSHRELAEQADV-RLTFEHGKKLATTLATFGINLNLAPVVDLCSNPDNPVIAR 191 Query: 131 GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIR 190 +R + A P + A +I G H+ + T KHFPGHG+ T DSH+ + E Sbjct: 192 LDRCFSATPDQVTDQAGAYIAGHHQTAVLTCLKHFPGHGSSTTDSHQGFTDITQTWNE-- 249 Query: 191 AKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDD 249 +++ + LI+ ++D IM AHV S +D PA+ S + +LR LG++GV+ SDD Sbjct: 250 -EELVPYRELIQHGRVDTIMTAHVFNSHLDANDPATLSRPIITGLLRGVLGYEGVVISDD 308 Query: 250 LSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSF 309 L M+ + + +L+AG DM++ NN ++ I + RL +G Sbjct: 309 LQMKAISAHYGLETAIEKALNAGVDMLVFGNNLSYNEHSVEQAVTI----IQRLIKQGKV 364 Query: 310 SRQELMDSARWKAIS 324 S + +S W+ I+ Sbjct: 365 SEARIDES--WRRIT 377 >UniRef50_Q5WDA0 Beta-N-acetylhexosaminidase n=1 Tax=Bacillus clausii KSM-K16 RepID=Q5WDA0_BACSK Length = 521 Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 98/291 (33%), Positives = 144/291 (49%), Gaps = 17/291 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-----LV 58 ++ G E + ++ + VG +ILF+RN P QL L ++ A++ L Sbjct: 13 LVFGFHGTTATPEIKAMIRNHYVGNIILFSRNIETPEQLLTLTSDLQKAAKEAGHERPLA 72 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 + VDQE G V+R G T LP A L G + A G L A E+ A+ ++ + A Sbjct: 73 ICVDQENGTVRRL-SGVTALPGAM----LLGATHKRRYASIVGQLTAKELRAVGVNWNLA 127 Query: 119 PVLDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PV DV + IG RS+ DP K A + GM G+ T KHFPGHG DSH Sbjct: 128 PVADVNNNPNNPVIGVRSFGEDPSKVGDFAKEMMTGMQAEGVIATLKHFPGHGDTVVDSH 187 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYS--DVDPRPASGSPYWLKTV 234 P P ++ ++ F + I E +AIM AH+ + + +P PA+ S + + Sbjct: 188 HAMPTIPHGLKRLQQVELVPFKACI-EAGAEAIMSAHIHFPALETEPVPATISYKVMTEL 246 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV--CNNRK 283 LR ELG+ GV+ +D L ME A A+ A+L AG DM++V C +R+ Sbjct: 247 LRHELGYSGVLMTDCLEMEAIAGSIGTAQGAVAALLAGADMLVVSHCESRQ 297 >UniRef50_Q0F8S6 Glycosyl hydrolase, family 3 n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0F8S6_9RHOB Length = 330 Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 93/288 (32%), Positives = 147/288 (51%), Gaps = 24/288 (8%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 ++ EG L EER ILF+RN P Q++ L +R + + + +DQ Sbjct: 6 IIFGCEGTYLTKEERFFFKDANPWAFILFSRNLDSPNQIKTLCNDLRDCVGSNVPILIDQ 65 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGW--------LMASEMIAMDIDI 115 EGGRV RL A L +++ K+ E + L+A E+ ++ ID+ Sbjct: 66 EGGRV-------ARLKAPIWLDWLPPLDQMQKVKDEKQYEAMFLRYRLIAHELHSLGIDV 118 Query: 116 SFAPVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 + AP++D+ +I++ I R Y P+ + +G+ G+ KH PGHG ++ Sbjct: 119 NCAPMVDIPNINSHEIITNRCYGKTPKIVSEMGRACAEGLLSGGVLPVLKHIPGHGRGSS 178 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLK 232 DSH E P + + + D + F L + L M AH+IYS D R A+ SP + Sbjct: 179 DSHFELPIVNTSSSSLNSIDFAPFKHL---SDLPMAMTAHIIYSSFDRNRCATVSPI-MN 234 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 ++RQ +GFDG++ +DD+SM+ A+ GS + R ASL AGCD++L CN Sbjct: 235 KIIRQNIGFDGLLMTDDISMK--ALSGSLSSRASASLKAGCDVVLHCN 280 >UniRef50_C1YLC2 Beta-glucosidase-like glycosyl hydrolase n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YLC2_NOCDA Length = 558 Score = 129 bits (323), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 96/274 (35%), Positives = 137/274 (50%), Gaps = 17/274 (6%) Query: 27 GGLILFTRNYHDPAQLRELVRQIR--AASRNR---LVVAVDQEGGRVQRFREGFTRLPAA 81 GGLI F N D Q+ + ++ AA + R L V +DQE G V R G TR P A Sbjct: 90 GGLIYFDANLTDAEQIATMSAGVQDLAADQGRGVPLFVGIDQEQGLVARLPVG-TRFPDA 148 Query: 82 QSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADP 139 + A E LA+ A E+ A+ +++++AP DV + IG RS+ +DP Sbjct: 149 MAVGATRDTE----LAELRASTTAEELTALGVNLNYAPDADVNTDPGNPVIGIRSFGSDP 204 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSS 199 +A D AG+ + KHFPGHG DSH P P+ + A + F + Sbjct: 205 DLVAQMAVAESDAYSGAGVVSVVKHFPGHGDTDVDSHSGLPVIDMPREQWEAGHLPPFRA 264 Query: 200 LIRENKLDAIMPAHVIYSDVD----PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGA 255 I + +DAIM AHV+ +D P PA+ SP + +LR ELG+DGV+ +D L+MEG Sbjct: 265 AI-DADVDAIMTAHVLMPQLDGSEDPEPATISPELIDGILRDELGYDGVVTTDALNMEGV 323 Query: 256 AIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 S E L+AG D +L+ + AVS + Sbjct: 324 RQRHSDGEIAVRVLEAGVDQLLMPPDPAAAVSAI 357 >UniRef50_D1AF10 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AF10_THECD Length = 543 Score = 128 bits (322), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 94/269 (34%), Positives = 138/269 (51%), Gaps = 12/269 (4%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE 73 DAE++ H VGGLI F N P Q L ++ AS+ L++AVD+E G V R Sbjct: 76 DAEQKIKKYH--VGGLIYFPDNARSPQQTARLSNALQRASKIPLLLAVDEEQGLVTRLSY 133 Query: 74 GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIG 131 TR P + A + + A+EA ++ +E+ A+ I+ ++APV DV A IG Sbjct: 134 -VTRFPGNMALGATAKPQT----AREAAKVIGTELRAVGINQNYAPVADVNVNPANPVIG 188 Query: 132 ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRA 191 RS+ +DP + + G +AG+ T KHFPGHG DSH P +AE Sbjct: 189 VRSFGSDPGLVSLMLGGALQGYRDAGVAATVKHFPGHGDTDTDSHTGLPVIRHSRAEWER 248 Query: 192 KDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFSDD 249 D F + I + +DAIM AH++ +D PA+ S L +LR+ELG+ GV+ +D Sbjct: 249 LDAPPFRAAIAQG-VDAIMTAHIVMPGLDDSGDPATLSQAVLTGLLRKELGYRGVVVTDS 307 Query: 250 LSMEGAAIMGSYAERGQASLDAGCDMILV 278 LSM GA + ++ AG D +L+ Sbjct: 308 LSMAGARTRYGAEQAAVRAVQAGADQLLM 336 >UniRef50_D1CIC2 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CIC2_THET1 Length = 543 Score = 128 bits (321), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 98/325 (30%), Positives = 156/325 (48%), Gaps = 28/325 (8%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVVAVDQEGGRVQRFREGFTRLPA 80 VG +ILF+RN D QLREL ++ A+ R L++ +DQE G V+R +G + LP Sbjct: 37 VGNVILFSRNVRDVQQLRELTATLQRAAFDAGHRQPLLIMIDQENGVVRRLGQGASVLPG 96 Query: 81 AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERSYHAD 138 + + A E A++ E+ A+ ++++ APV DV + A IG RS+ D Sbjct: 97 SMALGAAGDPE----CARQVALTTGRELRALGVNMNLAPVADVNNNPANPVIGVRSFGED 152 Query: 139 PQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFS 198 PQ + + ++G EAG+ + KHFPGHG DSH P P +A + + ++ F Sbjct: 153 PQAVGDMVAAAVEGYAEAGVISCLKHFPGHGDTAVDSHLALPVVPHDRARLESVELPPFR 212 Query: 199 SLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAA 256 I E D+IM AH+ + + P++ SP + +LR ELGF G++ +D L M + Sbjct: 213 RGI-EAGTDSIMVAHLHLPAIMANEAPSTVSPEVIDGLLRGELGFGGLVITDCLEMRAVS 271 Query: 257 IMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMD 316 ++ +G D+ LV + V + L E G+ S L + Sbjct: 272 ETIGTERSAVLAIASGADVALVSHRHDRQVGAVRALYAALRE--------GALSEDRLQE 323 Query: 317 SA-RWKAISTRLNQLHERWQEEKAG 340 +A R + RL+ W+E G Sbjct: 324 AADRVMCLKERLS-----WREALGG 343 >UniRef50_A0RP18 Glycosyl hyrolase, family 3 n=2 Tax=Campylobacter fetus RepID=A0RP18_CAMFF Length = 354 Score = 128 bits (321), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 84/299 (28%), Positives = 166/299 (55%), Gaps = 23/299 (7%) Query: 26 VGGLILFTRNYHDPAQL-RELVRQIRAASRNRLVVAVDQEGGRVQRF--REGFTRLPAAQ 82 +GG++L RN +L + + + S+++L++A+D+EGG+V RF ++GF P+A Sbjct: 58 IGGVMLLARNISTKDELIKLTKKLSSSKSKHKLLIAIDEEGGKVSRFINKDGFKHFPSAY 117 Query: 83 SFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAI---GERSYHADP 139 + + + G + ++ MA ++ + I+++FAPV+DV + +++I R++ +P Sbjct: 118 EVGSKMNLNKAGIIYKD----MADQLKNLGINMNFAPVVDVYNPNSSIIGQKNRAFSKNP 173 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE-TPCDPRPQAEIRAKDMSVFS 198 + +A +S FI +A +KT KHFPGHG D+HKE T D E++ + Sbjct: 174 DEVIAYSSEFIKSFDKANIKTVLKHFPGHGNAIKDTHKEKTVVDNYDFDELKP-----YF 228 Query: 199 SLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAI 257 LI+ + IM AHV ++D + PA S + + +L+ F G + SDDL M+G Sbjct: 229 ELIKRGQASFIMVAHVYIPNLDDKNPAVLSKHIVSDILKNRFKFKGAVMSDDLLMKGLGE 288 Query: 258 MGSYAERGQASLDAGCDMILVCN---NRKGAVSVLDN--LSPIKAERVTRLYHKGSFSR 311 + S ++ +++AG D++LV N ++ ++++ L+ + + ++++ K +++R Sbjct: 289 L-SIEQKVVRAINAGVDIVLVSEYFLNNSNSIKIINDAILNAVNSGKISKERIKDAYTR 346 >UniRef50_Q31RD3 Beta-glucosidase-related glycosidases-like n=5 Tax=Cyanobacteria RepID=Q31RD3_SYNE7 Length = 542 Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 81/245 (33%), Positives = 135/245 (55%), Gaps = 7/245 (2%) Query: 47 RQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLM 104 +Q++ SR L++A D E G QRF G + P + +A++ E +LA+ G + Sbjct: 74 QQLQEWSRVPLLIAADIEEGVGQRF-AGAVQFPPPLALSAIAAQDPERAQQLARSFGEIT 132 Query: 105 ASEMIAMDIDISFAPVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTG 162 A+E IA+ ++ AP++D+ + A I R++ DP+ + + FI G + + TT Sbjct: 133 ATEAIAIRLNWVLAPIVDINNNPANPVINVRAFGEDPETVSQLTTAFIAGAQQFPVLTTA 192 Query: 163 KHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP- 221 KHFPGHG DSH E P P ++ + A ++ F I +DA+M AH+ +DP Sbjct: 193 KHFPGHGDTATDSHLELPQIPHDRSRLEAVELEPFRRAIAAG-VDAVMTAHLSIPALDPD 251 Query: 222 RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 PA+ SP L+ +LR+ELGF+G+I +D L M+ A E + + +AG D++L+ + Sbjct: 252 YPATLSPAVLQGLLRRELGFEGLIVTDALVMQAIAAHYGPGEAARLAFEAGADILLMPVD 311 Query: 282 RKGAV 286 + A+ Sbjct: 312 PEAAI 316 >UniRef50_C9RS08 Glycoside hydrolase family 3 domain protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RS08_FIBSS Length = 384 Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 93/281 (33%), Positives = 147/281 (52%), Gaps = 15/281 (5%) Query: 7 DVEGYELDAEEREILA----HPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 ++ Y+L+ + +LA +P +GG+IL+ N D QL + +IR + + L+ A+D Sbjct: 69 ELPNYKLEQVNKTMLAVNKNYP-IGGMILYAHNIVDETQLGNFIAEIRELNGSPLL-AID 126 Query: 63 QEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +EGGRV R E F +P +S AA++ ++ + A +A + + + + DI +APV Sbjct: 127 EEGGRVARIANNENFD-VPKYESMAAIAESDDPEE-AYKAAFTIGTYVKKYGFDIDYAPV 184 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG R++ DP+ A +++G+ AG+ T KHFPGHG V+ D+H Sbjct: 185 ADVNTNPDNIVIGPRAFSDDPETAAKFVVSYLNGLESAGVIGTLKHFPGHGDVSTDTHYG 244 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLR 236 + E+ +M F + I+ IM AH+ V D PA+ S L+ LR Sbjct: 245 YASTDKTWDEMLKCEMIPFKAGIKAGA-QMIMTAHIAAPKVSGDDLPATLSSVILQDKLR 303 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 ELGFDGVI +D + M S AE S+ AG D++L Sbjct: 304 GELGFDGVIVTDAMDMGAITKQFSNAEAAIKSIQAGVDVVL 344 >UniRef50_C6D1D6 Glycoside hydrolase family 3 domain protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D1D6_PAESJ Length = 403 Score = 127 bits (320), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 83/298 (27%), Positives = 151/298 (50%), Gaps = 13/298 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-----LV 58 +++ ++G E+ E +++ VGG+IL++ N +Q +L ++ +++ L+ Sbjct: 59 ILVGMDGTEIQPEISKLIKESHVGGIILYSNNIESASQTIKLANDLKQINQDNGAKLPLL 118 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 ++ DQEGGRV R + PA+++ G + A + G + + A+ ++ FA Sbjct: 119 LSTDQEGGRVSRLPKQIKPFPASRTI----GKTNDPQYAYQVGTALGEAVKAIGLNTDFA 174 Query: 119 PVLDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLD+ + IG R++ + + + + G+ G+ KHFPGHG + DSH Sbjct: 175 PVLDINTNPKNPVIGNRAFGTTAELVSRMGVQEMLGIQSQGVIPVVKHFPGHGDTSVDSH 234 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVL 235 P +R+ + FSS I+E +M AH++ + +DP PAS S ++ L Sbjct: 235 LGLPVVDHDLERLRSIEFVPFSSAIKEGA-PMVMVAHILMTKLDPDTPASMSKKVIQDYL 293 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 R EL F GV+ +DD++M + + S+ AG D+ILV ++ +V+D L+ Sbjct: 294 RGELKFSGVVITDDMTMGAVGKVKAIGPATVQSVLAGADIILVGHDPVQQQTVIDALT 351 >UniRef50_Q6MA50 Putative beta-N-acetylglucosaminidase n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MA50_PARUW Length = 550 Score = 127 bits (320), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 84/281 (29%), Positives = 145/281 (51%), Gaps = 16/281 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAAS---RNR-- 56 +++ G + + E +++ VGG+I + + + P Q++ L ++AA+ +N+ Sbjct: 49 LIVHFRGEDCNKEALQMIYEAHVGGVIYYPWSNGLNSPLQIQHLSAHLQAAAFQKKNKIP 108 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L+++VDQEGG V R +G T P + G + +LA E A+E++AM I+++ Sbjct: 109 LLISVDQEGGVVNRLTKGMTLFPGNYAL----GQTKDPQLAFEYASASANELLAMGINMN 164 Query: 117 FAPVLDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 APV+DV + IG RS+ P + + S+ ++G ++AG+ T KHFPGHG +D Sbjct: 165 LAPVIDVNTQPCNPVIGIRSFSTCPDEIVRFGSKVVEGFNKAGLIATLKHFPGHGDTNSD 224 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTV 234 SH P + E+ ++ F +L + + +M AH++ +D V Sbjct: 225 SHHTLPIVNKTLVELEKVELVPFKALASQTPV--VMTAHLLVPALDALNCVTFSKSAIDV 282 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQ-ASLDAGCD 274 LR+E+G G+I +D L M+G G E+ SL AG D Sbjct: 283 LRKEIGEKGIILTDSLVMQGVLQDGKTVEQAALKSLQAGHD 323 >UniRef50_B6GD75 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6GD75_9ACTN Length = 806 Score = 127 bits (318), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 91/279 (32%), Positives = 138/279 (49%), Gaps = 32/279 (11%) Query: 26 VGGLILFTRNYHDPAQ-LRELVRQIRAASRNR---------LVVAVDQEGGRVQRFREGF 75 +GG+ILF N + Q LR + +AA RN L++ +DQEGG V R G Sbjct: 105 LGGVILFANNVKETEQTLRLAMDMQQAALRNTAQAPAGDIPLLLTIDQEGGIVYRLGSG- 163 Query: 76 TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAA--IGER 133 T LP + A E+ A + G ++ E+ A+ I+++FAP DV + IG R Sbjct: 164 TALPGNMALGATRSEED----ALQCGQVIGRELEALKINVNFAPSFDVNNNPNNPVIGLR 219 Query: 134 SYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKD 193 SY +DP+ + + + GM E G+ T KHFPGHG DSH P + + E+ + Sbjct: 220 SYSSDPELVAQLGTAMMSGMQEHGIATAAKHFPGHGDAGTDSHTGLPRIEKTKEELDNLE 279 Query: 194 MSVFSSLIRENKLDAIMPAHVIYSDV--------DPR------PASGSPYWLKTVLRQEL 239 F S I E +D +M AH+ + V DP PA+ SP ++ +LR E+ Sbjct: 280 FIPFRSAI-EKGVDMVMTAHIQFPKVETAKVPSADPSMGEIELPATLSPVFMTDILRTEM 338 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 GF GV +D L+M+ + Y + + + AG D+ L+ Sbjct: 339 GFTGVSITDALNMDAISKNFGYIDAVKRTFKAGVDIALM 377 >UniRef50_UPI0000E0E41F Beta-hexosaminidase A precursor n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0E41F Length = 599 Score = 127 bits (318), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 83/272 (30%), Positives = 134/272 (49%), Gaps = 21/272 (7%) Query: 26 VGGLILFTRNYHDPAQLREL---VRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQ 82 +GG+ILF+ N Q+++L ++++R SR + +A+DQEGGRV R F+ PA Sbjct: 62 LGGVILFSENVQSVTQIQQLTSDIQKLRPHSRLPMFIAIDQEGGRVARLPTAFS--PAYS 119 Query: 83 SFAALSGMEE--GGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHAD 138 A+ + G A + G + ++ ++ I+++FAP +DV + I RS+ Sbjct: 120 GNMAIGATYDAHGTAYAAKVGEYIGRDLASVGINVNFAPSIDVNANPSNPVINVRSFGES 179 Query: 139 PQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFS 198 + + + M G+ T KHFPGHG DSH P I +D++ F Sbjct: 180 AEMVSELGGALVQHMQSQGVLTALKHFPGHGDTVVDSHTGLPQVNHDLPTIMKQDIAPFK 239 Query: 199 SLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYWLKTVLRQELGFDGVIF 246 +I+E++ +M AH+ Y +D RPA+ SP L +LR LGF G++ Sbjct: 240 HIIKEHEPAFVMTAHIQYPQLDNSTLLAKNGTEQVRPATLSPVILTDLLRGTLGFKGIVI 299 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 +D L M G A S E + +AG D+ L+ Sbjct: 300 TDALDMAGIAQYFSPQEAIIETFNAGADIALM 331 >UniRef50_B0P2K0 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0P2K0_9CLOT Length = 421 Score = 126 bits (317), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 87/297 (29%), Positives = 146/297 (49%), Gaps = 24/297 (8%) Query: 8 VEGYELDAEE-----------REILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 V Y+LD E ++ L GG+I+F +N + P Q +++ +++ AS Sbjct: 84 VSVYQLDEAESKNQTSVTSQMKKTLKKYPAGGVIMFAKNINTPDQTKKMTDELQDASYIP 143 Query: 57 LVVAVDQEGGRVQRF----REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L +AVD+EGG+V R + T P+AQ G K + G E+ + Sbjct: 144 LFMAVDEEGGQVSRVASNPKMKMTVYPSAQEV----GRTYNNKKIAQMGKTQGKELKELG 199 Query: 113 IDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 +++ APV DV + IG+RS+ AD +K I + + M + + T KHFPG G Sbjct: 200 FNMNLAPVADVLTNKNNTEIGDRSFGADSKKVADIITTLVKNMQKQQISATLKHFPGSGQ 259 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSP 228 D+H+ + + +R D F + I+ K DA+M +H++ S+V + P+S S Sbjct: 260 TGGDTHRGSTETDQTINALRDTDFKPFKAGIKA-KADAVMVSHLMLSNVTDEKEPSSLSS 318 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 + +LR EL + GVI +D ++M+ S E ++ AG D+I++ +N K A Sbjct: 319 RVVSDILRDELEYKGVIMTDAMNMKAITDNYSSGEAAVKAIQAGVDLIVMPDNYKEA 375 >UniRef50_A8L1X4 Glycoside hydrolase family 3 domain protein n=3 Tax=Actinomycetales RepID=A8L1X4_FRASN Length = 656 Score = 126 bits (316), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 93/275 (33%), Positives = 140/275 (50%), Gaps = 22/275 (8%) Query: 19 EILAHPLVGGLILF----------TRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRV 68 E +A +GG+I F N +P Q++ L + AA+ L++A DQE G V Sbjct: 159 EAVARRGLGGVIYFDAGGTGPGALPDNIVNPNQVKTLSADLSAAASIPLLIAADQEQGTV 218 Query: 69 QRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA 128 R R+G T LP + A + A++A + +++ A+ I++ FAP DV A Sbjct: 219 LRVRDGVTLLPGQMAQGATGRPTD----ARDAAQITGADLRALGINVDFAPDADVNSDPA 274 Query: 129 --AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 IGERS+ DP + ++G + G+ KHFPGHGA + DSH + P R + Sbjct: 275 NPVIGERSFGDDPTAVGRFTAAAVEGYRQVGVAAAAKHFPGHGATSVDSHADLPTITRDR 334 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVI 245 A + A D+ F + I + +M H+ +DP PA+ S + +LR ELGFDGVI Sbjct: 335 AALTALDLPPFRAAIAAG-VPMVMVGHLNVPALDPAAPATLSKPVVDGLLRHELGFDGVI 393 Query: 246 FSDDLSMEGAAIMGSYAERGQA--SLDAGCDMILV 278 +D L+M AAI G A ++ AG DM+L+ Sbjct: 394 VTDALNM--AAITEHNTPGGAAVRAVQAGVDMLLM 426 >UniRef50_A8ETM1 Glycosyl hydrolase n=2 Tax=Campylobacterales RepID=A8ETM1_ARCB4 Length = 364 Score = 126 bits (316), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 78/272 (28%), Positives = 140/272 (51%), Gaps = 26/272 (9%) Query: 26 VGGLILF---------TRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE--G 74 +GG+ILF +N D QL+ L Q++A S +L++++DQEGG VQR ++ G Sbjct: 59 LGGVILFDKDPNDKTKIKNIKDKNQLKNLNTQLQAISNQKLLISIDQEGGVVQRLKKDSG 118 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV-----GHISAA 129 F +A+ A G A+++ +A ++ + I++ FAPV+D+ + Sbjct: 119 FVDTLSAKDIAT-----NGEDFAKQSYKALAKDLKDVGINLDFAPVVDLSINKNNKVIVT 173 Query: 130 IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEI 189 G RS+ + + +S F+D + + + + KHFPGHG+ ADSH + + Sbjct: 174 RG-RSFGESSSEVIKYSSIFVDELRKQNVISVLKHFPGHGSSLADSHLGFVDITKTWNQ- 231 Query: 190 RAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKT-VLRQELGFDGVIFSD 248 K++ + I+ N +D IM AHV ++D + Y + T +LR++LG+ GV+ +D Sbjct: 232 --KELEPYKYFIKNNDVDMIMTAHVFNENLDKNYPATLSYNINTKLLREDLGYKGVLVTD 289 Query: 249 DLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 DL M + + ++++G +M+L N Sbjct: 290 DLQMSAISKHYDLKQTLTLAINSGVNMLLFAN 321 >UniRef50_C6CWS0 Beta-N-acetylhexosaminidase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CWS0_PAESJ Length = 535 Score = 126 bits (316), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 89/277 (32%), Positives = 135/277 (48%), Gaps = 13/277 (4%) Query: 10 GYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA----ASRNRLVVAVDQEG 65 GYE E ++ +GG I F RN D Q+ +L ++ A R ++VA+DQEG Sbjct: 25 GYEPTEEITSLIEKYGIGGTIYFGRNVRDAKQVHQLSSSLKNIAGRAGRPNMLVAIDQEG 84 Query: 66 GRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 G V R G T +P + A E G+ A+ G E+ + + +++AP LDV + Sbjct: 85 GMVARIVNGVTLMPGNMALGATGSAEGAGETARVCG----EELRLLGVTMNYAPCLDVNN 140 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I RSY Q + + G + + T KHFPGHG + DSH + P P Sbjct: 141 NPDNPVINVRSYGDRAQIVSELGKAALLGYQASKVAATVKHFPGHGDTSVDSHHDLPVLP 200 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGF 241 + + A +++ F + AIM AHV +DP +P++ S L +LR+ELG+ Sbjct: 201 HSRERLDAIELAPFRA-AIAAGTAAIMTAHVCLPALDPSGKPSTLSQPVLTGLLREELGY 259 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 +GVI +D L M+ A ++ AG DMILV Sbjct: 260 NGVIVTDCLEMDAIDRFYGPANGAVQAIQAGADMILV 296 >UniRef50_Q04TR7 Beta-glucosidase-related glycosidase n=4 Tax=Leptospira RepID=Q04TR7_LEPBJ Length = 579 Score = 126 bits (316), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 92/331 (27%), Positives = 163/331 (49%), Gaps = 26/331 (7%) Query: 8 VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLR----ELVRQIRAASRNRLVVAVDQ 63 + G LD + + L GG+ILF N ++ EL ++ +S+ L++++DQ Sbjct: 63 IPGKTLDETAAKEIEEILPGGIILFGTNLGTKEEILKLNLELQKKSMESSKLPLLISIDQ 122 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGGRV R ++G T+ P A + G + A + G++ + ++ + + FAP LD+ Sbjct: 123 EGGRVLRVKDGVTQFPGAMAL----GQTKNADYAYKVGFVTSYQLRKLGFNFVFAPDLDI 178 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + I RS + P+ + G G T KHFPGHG DSH P Sbjct: 179 NNNPDNPVINTRSMGSTPETVSISGIGYEKGARIGGAIPTIKHFPGHGDTNVDSHLGLPK 238 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 + E+ ++ F I++ + +M AH++Y +DP PA+ S L +LR ++G Sbjct: 239 IDKTLEELERMELIPFQESIQQGA-EVVMSAHIVYPKLDPDFPATLSSKILTGILRDKMG 297 Query: 241 FDGVIFSDDLSMEGAAIMGSYAER--GQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 F GV+ +D +ME AI Y +R G ++ AG D++L+ + K ++ Sbjct: 298 FKGVVITD--AMEMNAIDVHYKDRDPGVLAILAGADILLMTSWGKTTRNM--------KN 347 Query: 299 RVTRLYHKGSFSR--QELMDSARWKAISTRL 327 +++ Y KG+F + ++L+ A ++ I +L Sbjct: 348 QISTAYKKGTFQKGEKDLLKEAVYRQILLKL 378 >UniRef50_A9KMQ8 Glycoside hydrolase family 3 domain protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KMQ8_CLOPH Length = 517 Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 93/314 (29%), Positives = 148/314 (47%), Gaps = 24/314 (7%) Query: 10 GYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRLVVAVDQEG 65 G +D+E + + +G ILF N D QL L ++ ++ +R + +DQEG Sbjct: 19 GTTIDSELEDFIRTYKIGNFILFKENIVDANQLSNLCEGLQQLTKKYTGHRAFITIDQEG 78 Query: 66 GRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 G V R E +P A + AA K A AG + ++ + + APV D+ Sbjct: 79 GMVTRLSEDSVNIPGAMAIAA----TRDEKNAYMAGRITGQQLRTLGFNFDLAPVADINS 134 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + IG RSY +P + + G+ + G+ + KHFPGHG DSH P Sbjct: 135 NMDNPVIGVRSYGDEPDQVAKYCVAMMKGLTDGGVLASAKHFPGHGDTNVDSHLGLPKVH 194 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFD 242 + E+ ++ F +LI E + AIM +H+I+ ++ PA+ S + +L+++LGF Sbjct: 195 KSLEEMELCELVSFKALI-EAGIPAIMSSHIIFPALEEELPATMSRKIITGLLKEKLGFK 253 Query: 243 GVIFSDDLSMEGAAIMGSYA--ERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 G++ SD ME +AI Y E + +++AG D+I V + A D L Sbjct: 254 GLVISD--CMEMSAIKKYYGSIEGIKHAIEAGVDLIFVSHTMSVAREASDVL-------- 303 Query: 301 TRLYHKGSFSRQEL 314 T LY KG S E+ Sbjct: 304 TGLYEKGELSMDEM 317 >UniRef50_Q2G9D7 Beta-N-acetylhexosaminidase n=10 Tax=cellular organisms RepID=Q2G9D7_NOVAD Length = 355 Score = 125 bits (314), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 105/318 (33%), Positives = 161/318 (50%), Gaps = 28/318 (8%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA-SRNRLVV 59 M P + + G L +ER G ILF RN PAQLR L ++RA R+R + Sbjct: 18 MLPAIFGLSGLTLTEDERAFFRDADPAGYILFGRNVESPAQLRALTDELRALHGRDRTFI 77 Query: 60 AVDQEGGRVQRFR----------EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMI 109 +DQEGGRV R + E F RL A S +E A+ G +A I Sbjct: 78 CIDQEGGRVARMKPPVWQPYPPGERFDRL---YDIAPASAIEAARANAEALGLDLAEAGI 134 Query: 110 AMDIDISFAPVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPG 167 ++D P+LDV A IG+R+ ++P + A+ +DG+ AG+ KH PG Sbjct: 135 SVDC----LPLLDVRQPGAHDVIGDRALGSEPMRVAALGRATLDGLARAGIAGVVKHMPG 190 Query: 168 HGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASG 226 HG DSHKE P E+ D++ F +L R+ + M AH+ + D PA+ Sbjct: 191 HGRALVDSHKELPTVSASAEELE-MDLAPFRAL-RDATIG--MTAHLRFLAWDDWNPATH 246 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR-KGA 285 SP+ ++ ++R+ +GFDG++ +DDL M+ A+ G+ ER + AGCD+ L C + Sbjct: 247 SPFVIEEIIRKAIGFDGLLLTDDLDMQ--ALGGTVPERAARAQAAGCDIALNCWAKMDDM 304 Query: 286 VSVLDNLSPIKAERVTRL 303 V + ++L+P+ + + RL Sbjct: 305 VGIANSLAPMSDKVMQRL 322 >UniRef50_Q73RR1 Glycosyl hydrolase, family 3 n=1 Tax=Treponema denticola RepID=Q73RR1_TREDE Length = 557 Score = 125 bits (314), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 86/261 (32%), Positives = 139/261 (53%), Gaps = 15/261 (5%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRN-----RLVVAVDQEGGRVQRFREGFTRLPA 80 +G + +F N D +L + + ++ S L VA DQEGG V+ + + P Sbjct: 86 LGSIKVFGWNTGDSRKLAKSISILQKKSLEGRFGIPLFVATDQEGGWVRHVKGLTSETPG 145 Query: 81 AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIGERSYHAD 138 + A SG+ + + +G+ ++ E+ A+ I+++FAP +D+ H S+ IG RS+ Sbjct: 146 NLAIGA-SGIPQD---SYYSGYYISREIRALGINLNFAPTVDLLTDHDSSIIGPRSFGDS 201 Query: 139 PQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFS 198 P + + F+ G +AG+ TT KHFPGHG + DSH P + R +++ F Sbjct: 202 PHAVGILGAAFVRGSRDAGVLTTAKHFPGHGDTSIDSHGRLPKIDISEETFRNRELIPFK 261 Query: 199 SLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAA 256 LI + + AIM H+ +S + P PA+ S Y L +LR ELGF G+I +DD+ M GA Sbjct: 262 YLI-DAGVPAIMTGHLNFSSILPNGEPATFSKYLLTDILRGELGFKGLIITDDMMMHGAM 320 Query: 257 -IMGSYAERGQASLDAGCDMI 276 G A+ + +L+AG D+I Sbjct: 321 NFAGGIAKAVKMALEAGNDII 341 >UniRef50_B5CPR6 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CPR6_9FIRM Length = 408 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 81/277 (29%), Positives = 142/277 (51%), Gaps = 15/277 (5%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVVAVDQEGGRVQ 69 D +R + +P VGG+I F N D QL +++ ++ S + + ++VD+EGG V Sbjct: 98 DTTKRCLEQYP-VGGMIYFAGNIEDRQQLSDMLHTTQSYSTEQSGIPIFLSVDEEGGTVA 156 Query: 70 RFR-EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHI 126 R G + + + ++ E ++ Q G +A + ++ +APV DV Sbjct: 157 RVAGSGISNVSVVEDMCEVTDEERAYEIGQTIGNYLAE----LGFNLDYAPVGDVLTNPE 212 Query: 127 SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 + + +RS+ D K A++ + +DG+HE + KHFPGHGA D+H+ + Sbjct: 213 NTVVKKRSFGNDSAKVAALSQKVLDGLHEENVYGVIKHFPGHGATAGDTHEGYAYTDKSL 272 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGV 244 E+ +++ F + I EN + IM H+ +V D P+S S + VLR+++GFDG+ Sbjct: 273 EELEQQELVPFQNAI-ENHIAIIMAGHISVPNVTGDNTPSSLSKIMITDVLREKMGFDGI 331 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 + +D L+M + S AE ++AG DM+L+ N Sbjct: 332 VITDALNMGAISQHYSSAEVSVKVIEAGGDMLLMPEN 368 >UniRef50_C9NRN7 Beta-hexosaminidase n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NRN7_9VIBR Length = 616 Score = 124 bits (312), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 90/288 (31%), Positives = 139/288 (48%), Gaps = 21/288 (7%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVVAVDQEGGR 67 E++ R I+ + VGG+ILF+ N D Q EL+ ++ A L V +DQEGG Sbjct: 20 EINESVRHIIGNDKVGGVILFSSNLVDINQTGELIWNMQNAVEEDQEVGLFVTIDQEGGN 79 Query: 68 VQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--G 124 V R R T LP + A + +LA E+G ++A E+ ++ +++FAPV+DV Sbjct: 80 VVRLPRAQATNLPGNMALGAAYLNDNNPELAYESGRILAKEIKSVGFNVNFAPVVDVQSN 139 Query: 125 HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 ++ I RSY DPQ + + GM + + T KH+PGHG +DSH P + Sbjct: 140 PLNPVINVRSYGEDPQLVGLLGGQSAKGMADENVIGTFKHYPGHGDTESDSHYGLPIVNK 199 Query: 185 PQAEIRAKDMSVFSSLIRENKL-DAIMPAHVIYSDVDPR-------------PASGSPYW 230 E A D++ + I + D IM AH+ Y +D PA+ S Sbjct: 200 SIEEAYAIDLAPYKEAIDQGLAPDMIMTAHIQYPALDDSEVVGSRSGETIVVPATLSHKI 259 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 +LR+EL + G+ +D L M+G A A+ + AG D+ L+ Sbjct: 260 QTELLREELNYQGLTITDALDMQGIAAHFEEADAVIKTFQAGVDIALM 307 >UniRef50_A3CMS4 Beta-hexosamidase A, putative n=1 Tax=Streptococcus sanguinis SK36 RepID=A3CMS4_STRSV Length = 930 Score = 124 bits (312), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 86/290 (29%), Positives = 145/290 (50%), Gaps = 29/290 (10%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR-AASRNR-------LVVAVDQ 63 EL+ E E + GG+ILF N AQ L + ++ AA N+ L++A+DQ Sbjct: 171 ELNKEVAEAVDKYDFGGVILFAENVKGTAQTLALTQALQQAAISNQANNGKLPLLLAIDQ 230 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R G T LP + A E LA +AG ++ E+ A+ ++++FAPV D Sbjct: 231 EGGIVYRLGSG-TALPGNMAVGATRDPE----LANQAGQIIGRELSALGLNVNFAPVFDT 285 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + IG RS+ +DP+ + + ++G+ + + KHFPGHG DSH P Sbjct: 286 NNNPQNPVIGLRSFSSDPKVVARLGTAMMEGIQKYNVAVAAKHFPGHGDTAVDSHTGLPL 345 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGSP 228 + AE+ A ++ F + I +N D +M AH+ Y ++ PA+ S Sbjct: 346 VDKSYAELEALELLPFKAAIDKNA-DMLMTAHIQYPQIEKDTVISKQTGETIYIPATLSD 404 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 + ++R++ G+ GV+ SD + M+ A +E ++ +G D++L+ Sbjct: 405 DIITGIVRKKFGYKGVVVSDAMGMDAIAKNFGESEAAIMAIKSGVDVVLM 454 >UniRef50_Q2JJA4 Glycosyl hydrolase domain protein n=1 Tax=Synechococcus sp. JA-2-3B'a(2-13) RepID=Q2JJA4_SYNJB Length = 511 Score = 124 bits (312), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 87/278 (31%), Positives = 145/278 (52%), Gaps = 10/278 (3%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFT--RNYHDPAQLRELVRQIRAASRNRLVVAVD 62 ++ EG E + E++ VGGLI ++ N Q+ +L+ +A + L +A+D Sbjct: 17 LVYFEGAEFSSALEEMIRDYHVGGLIFYSIAGNIQTLPQVAQLIAAAQAQALVPLWIAID 76 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 QEGG V R EG T P+A + AA + + +A+ G +E+ ++ +++FAPV+D Sbjct: 77 QEGGPVVRLTEGVTVFPSAMAVAAAGSLAKAEAMARAVG----TELKSLGFNLNFAPVVD 132 Query: 123 VGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG R++ +DP++ G +AG+ T KHFPGHG DSH P Sbjct: 133 VNSNPHNPIIGIRAFGSDPKQVTDYGLALWRGYRQAGILCTPKHFPGHGDTDIDSHLGLP 192 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQEL 239 RP ++ ++ F +LI+ +A+M AHV+ + RPA+ +P + LRQ L Sbjct: 193 RVDRPLEALQTTELRPFQALIQAGA-EAVMTAHVVMPALGSVRPATLAPEVVSGWLRQRL 251 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 GF G++ +D L+M A E + + AG D+++ Sbjct: 252 GFQGLVLTDSLTMGAIARTYGIPEAAELAFRAGADVLV 289 >UniRef50_A1JQZ0 Glycosyl hydrolase family protein n=8 Tax=Enterobacteriaceae RepID=A1JQZ0_YERE8 Length = 625 Score = 124 bits (311), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 100/322 (31%), Positives = 157/322 (48%), Gaps = 33/322 (10%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNR-LVVAVDQEGGRVQRFREGFTRLPAAQSF 84 +G +ILF N D Q+ EL+ ++ A N L ++ DQEGG V R R G T +P + Sbjct: 91 LGSIILFRENLIDTPQIVELINNLQNARSNLPLFISTDQEGGYVTRLRVG-TEMPGNMAL 149 Query: 85 AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAIGERSYHADPQKALA 144 A K AQ+AG + E+ ++ + ++ + IG RSY DP Sbjct: 150 GATGS----SKFAQQAGSIHGYELSSLGFNFGPVVDVNNNQNNPVIGVRSYSNDPVLVAE 205 Query: 145 IASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIREN 204 +A +I G+H+ + T+ KHFPGHG VT+D+H P +A + ++ F ++ Sbjct: 206 LARSYISGIHKYNVLTSLKHFPGHGNVTSDTHFSLPAVNIDKAAWQQVELKPFVEVMPVT 265 Query: 205 KLDAIMPAHVIYSDVDP------------RPASGSPYWLKTVLRQELGFDGVIFSDDLSM 252 DAIM AHV+ +D PA+ S L +LR +L FDG+I +D +M Sbjct: 266 --DAIMTAHVVVPALDNSMLTNIKGEKVGTPATLSKPILTDILRNQLKFDGLILTD--AM 321 Query: 253 EGAAIMGSYAERG---QASLDAGCDMILVCNNRKGAVSV--LDNLSPIKAERVTRLYHKG 307 + AI ++ ER + ++ AG D++L+ K S+ LD L T K Sbjct: 322 DMGAITSNF-ERNWSIKQAIMAGNDIVLMPMEIKNRASIEQLDALYDYLKTEAT----KD 376 Query: 308 SFSRQELMDSARWKAISTRLNQ 329 +Q + DSA+ + I T+LN+ Sbjct: 377 PELKQRIEDSAQ-RVIYTKLNK 397 >UniRef50_A2DGR7 Glycosyl hydrolase family 3 N terminal domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2DGR7_TRIVA Length = 553 Score = 124 bits (311), Expect = 5e-27, Method: Compositional matrix adjust. Identities = 89/332 (26%), Positives = 155/332 (46%), Gaps = 18/332 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL----VVA 60 ++D+ G L ++ IL +G IL RN + + ++ S ++ Sbjct: 59 LMDLSGLTLTDSDKAILKKYKLGNFILMGRNLGTIEETAAYSKSLQEYSMKEFGIPAFIS 118 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +DQE G + R + T P A S AA ++ +A+ LMA+E+ A ++++ AP Sbjct: 119 IDQEAGSILRMYKNCTVFPGAMSLAATDSLD----IAKNISKLMATELRAAGVNMNHAPT 174 Query: 121 LDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG+RS+ +P+ +I G HEAG+ TT KHFPGHGA + DSHK Sbjct: 175 SDVNSNPKNPIIGDRSFGDNPENVSKFVDYYIQGHHEAGILTTSKHFPGHGATSVDSHKA 234 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P ++ D+ F ++ +D+IM H+I + +D A R + Sbjct: 235 LPSIDHNMTQLLEIDIPPFQQAVKSG-VDSIMVGHLI-TPIDNESACTQSKKCVDFARNQ 292 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 L +DG++ +D + M+ A E ++ +G D+I C G + D + A Sbjct: 293 LKYDGILITDSMQMK-AIKQEDLNESIATAIKSGIDLICDCG---GDIVGTDPYYSVIAY 348 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 V ++ + + ++++D A + I T+L L Sbjct: 349 IVEQV--QKNIIPEKIIDQAALRIIKTKLAML 378 >UniRef50_Q15N77 Glycoside hydrolase, family 3-like n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15N77_PSEA6 Length = 633 Score = 124 bits (311), Expect = 5e-27, Method: Compositional matrix adjust. Identities = 89/282 (31%), Positives = 136/282 (48%), Gaps = 24/282 (8%) Query: 18 REILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVAVDQEGGRVQRFRE 73 +E++A VGG+ILF N D Q+ L R ++AAS + L++++DQEGGRV R + Sbjct: 71 KEMIADTGVGGIILFANNLVDTEQMLRLNRDLQAASASGGHSPLLISIDQEGGRVVRIPQ 130 Query: 74 GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIG 131 + + + E G A +G ++A E+ + +++FAP +DV + I Sbjct: 131 NISTAFSGNMAIGATYAEHGTHFATLSGDVIAKELAVLGFNVNFAPSVDVNVNPENPVIN 190 Query: 132 ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRA 191 RS+ DPQ + + M + G+ KHFPGHG + DSH P +I+A Sbjct: 191 VRSFGEDPQVVAELGLAQMQAMQQNGIIAALKHFPGHGDTSTDSHSGLPLVEHDLHKIKA 250 Query: 192 KDMSVFSSLIRENKLDAIMPAHVIYSDVDPR------------PASGSPYWLKTVLRQEL 239 D++ F I IM AH+ Y +D PA+ S L +LR ++ Sbjct: 251 VDLAPFQYAIDNGDPGMIMTAHIQYPALDSSTFTATDGSKTILPATMSRAILTDLLRDQM 310 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLD---AGCDMILV 278 GF GVI +D L M A + + E QA L AG D+ L+ Sbjct: 311 GFRGVIITDALDM---AAIAHFYEPTQAVLQTFKAGTDIALM 349 >UniRef50_C6WFB0 Glycoside hydrolase family 3 domain protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WFB0_ACTMD Length = 580 Score = 124 bits (310), Expect = 7e-27, Method: Compositional matrix adjust. Identities = 90/265 (33%), Positives = 134/265 (50%), Gaps = 18/265 (6%) Query: 27 GGLILFTRNYHD----PAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQ 82 GG+I F + +D PAQ+ L ++ ASR L ++ DQE G V R T+ P + Sbjct: 89 GGVIYFNNSSYDNIDTPAQIAALSNGLQQASRVPLTISTDQEMGIVTRIGPPLTQFPGSM 148 Query: 83 SFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERSYHADPQ 140 + A G +E + A+ A + A E+ A+ I +FAP DV A IG RS+ +DP Sbjct: 149 ALGA--GRDE--RAAERAASVTARELRALGITQNFAPDADVNSNPANPVIGVRSFSSDPD 204 Query: 141 KALAIASRFIDGMHEAGMKTTG-----KHFPGHGAVTADSHKETPCDPRPQAEIRAKDMS 195 A + + + G + + T KHFPGHG DSH P R + R D Sbjct: 205 LAATMVAAQVRGYQQRHLSRTAVSAAAKHFPGHGDTADDSHTSLPTVDRTVEQWREVDAK 264 Query: 196 VFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGVIFSDDLSME 253 F + I +D+IM AH+ +DP PA+ S + +LR+ELG+DGV+ +D L+M Sbjct: 265 PFRAAIAAG-VDSIMTAHIRMPAIDPSGEPATLSKPVVTGLLREELGYDGVVITDSLAMA 323 Query: 254 GAAIMGSYAERGQASLDAGCDMILV 278 G + + AE +L AG D +L+ Sbjct: 324 GVRQLHTDAEIPVLALKAGVDQLLM 348 >UniRef50_D1R988 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R988_9CHLA Length = 544 Score = 123 bits (309), Expect = 8e-27, Method: Compositional matrix adjust. Identities = 74/262 (28%), Positives = 143/262 (54%), Gaps = 16/262 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAASRNR----- 56 +M+ G E + + ++++ +GG+I + D Q++ L R +++ ++ R Sbjct: 39 LMVHFYGTEANGDAKKLIQDFHIGGIIYYEWANGLTDAQQVQILSRGLQSLAQERAHPIP 98 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L +A+DQEGG V R + GFT+ P AL +++ G A+ + E+ ++ I+++ Sbjct: 99 LFLAIDQEGGVVNRLKNGFTQFPGNY---ALGQIKKAGD-AEACAYATGLELKSVGINVN 154 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 APV+DV + IG R++ + P+ ++ ++G ++ + + KHFPGHG V D Sbjct: 155 LAPVVDVNCNEKNPIIGIRAFGSSPKDVASLGKEMMEGFRKSSILSVLKHFPGHGDVEVD 214 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPA-SGSPYWLKT 233 SH+ PC + + E+ ++ F L+ + +A+M +H+ +D + + S + Sbjct: 215 SHQALPCLNKTRGELDQVELYPFHQLLPLS--EAVMTSHLFVPALDAKTCITFSCDAVTQ 272 Query: 234 VLRQELGFDGVIFSDDLSMEGA 255 VLR+E+GF+GVI +D L+M+G Sbjct: 273 VLRKEMGFEGVILTDSLAMQGC 294 >UniRef50_C8WZJ0 Glycoside hydrolase family 3 domain protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8WZJ0_DESRD Length = 379 Score = 122 bits (307), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 104/355 (29%), Positives = 166/355 (46%), Gaps = 49/355 (13%) Query: 4 VMLDVEGYELDAEER--EILAHPLVGGLILF---------TRNYHDPAQLRELVRQIRAA 52 +M+ G +A+ E + +GG++L+ RN AQL+ L ++ Sbjct: 41 LMIGFRGTTFEAKSPLGEAITEGNLGGVVLYGRDVALNKPMRNIRSAAQLQALTADLQDH 100 Query: 53 SRNRLVVAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 +R L +AVD+EGG+V R R GF A ++ G + + + Sbjct: 101 ARIPLFIAVDEEGGQVSRLAPRFGFPETVTAATW----GRRNDPAWTRRGARAIGQRLRN 156 Query: 111 MDIDISFAPVLDVGHI--SAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 + I+ APV+D+ + AIG ERS+ A+P A+ FI G+H+AG+ KHFP Sbjct: 157 LGCTINLAPVVDLNTNPDNPAIGKLERSFGANPDTVTRQAAAFIHGLHDAGILACIKHFP 216 Query: 167 GHGAVTADSHK-----ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP 221 GHG+ DSH T P K++ + L+ DA+M AHV ++ +DP Sbjct: 217 GHGSAYNDSHLGLTDISTTWSP--------KELEPYKRLVDRGLADAVMTAHVFHAGLDP 268 Query: 222 R-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQA--SLDAGCDMILV 278 + PA+ S + +LR+E+G++GV+ SDDL M AI S++ R L+A D+ L Sbjct: 269 KVPATLSAEIIPDILRREIGYEGVVISDDLQM--GAIRQSFSLRQTVRRCLEADVDIFLF 326 Query: 279 CNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 NN + P RV R+ Q ++ +R + R+ +L ER Sbjct: 327 GNNLE--------YEPFVWRRVQRIVR--DLVDQNIVSRSRIERSYERIQRLKER 371 >UniRef50_Q47VB6 Beta-hexosaminidase n=1 Tax=Colwellia psychrerythraea 34H RepID=Q47VB6_COLP3 Length = 637 Score = 122 bits (307), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 89/285 (31%), Positives = 138/285 (48%), Gaps = 25/285 (8%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----NRLVVAVDQEGG 66 EL E ++++ VGG++LF N + AQ+ +L I+AA+ N L++++DQEGG Sbjct: 69 ELPKELAQMISTSNVGGIVLFAENVQEIAQVVKLTHDIQAAALKSEQGNPLIISIDQEGG 128 Query: 67 RVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 RV RF + GF A S E G A + ++A E+ A+ I+ ++APV+DV Sbjct: 129 RVARFAKMTGF----AGNMAIGASYPEHGTHFATQVNAVIAKELKALGINNNYAPVVDVN 184 Query: 125 HI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + I RS+ + + + + + G+ T KHFPGHG DSH P Sbjct: 185 TNVDNPVINTRSFGESAVQVAELGASAVTAIQAQGVMATLKHFPGHGDTHVDSHLGLPRV 244 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYW 230 +A I D++ F+ I K IM AH+ + +D RPA+ S Sbjct: 245 DHDRATIDKVDLAPFAWAIDHAKPAMIMTAHIQFPALDNSLFISKSGDKLIRPATMSRKI 304 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDM 275 L +LR+E+ FDG+I +D L M G + E + AG D+ Sbjct: 305 LTNLLREEMEFDGIIATDALDMAGVTHFFTDIEAVVETFVAGADL 349 >UniRef50_UPI0001B54569 sugar hydrolase n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B54569 Length = 483 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 96/302 (31%), Positives = 147/302 (48%), Gaps = 22/302 (7%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREG-FTRLPAAQSF 84 +GG+ILF RN D Q+ L +R+ R+ +V+ +D+EGG V R G + +P + Sbjct: 35 LGGVILFGRNVVDDEQVAALTSALRS-ERDGVVIGIDEEGGDVTRLDVGKGSFVPGPLAL 93 Query: 85 AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHIS--AAIGERSYHADPQKA 142 A E +A G +A A + ++ AP D+ + IG R++ +DP A Sbjct: 94 GAAGDPELTASVAAALGERLA----ACGVTVNLAPCADLTLAAEDPGIGVRAFGSDPVAA 149 Query: 143 LAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIR 202 + F+ GM + G+ KHFPGHGA T DSH P PR AE+R ++ F++ I Sbjct: 150 SPHVAAFVTGMQKYGVAACAKHFPGHGAATEDSHLSLPVLPRTAAELREVELVPFAAAI- 208 Query: 203 ENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYA 262 + + +M H++ + PA+ + L VLR ELGF G + +D L M AI G+Y Sbjct: 209 DAGVRTVMSGHLVVREWGEEPATFNRTALVDVLRGELGFTGTVITDALEM--GAISGAYG 266 Query: 263 ER---GQASLDA---GCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMD 316 G+AS+ A G D + + A LD AE + +G S + L D Sbjct: 267 RHDGIGEASVRALLGGADALCLGGAAFDA-ETLDR----TAEAIVAAVTEGRLSVERLAD 321 Query: 317 SA 318 +A Sbjct: 322 AA 323 >UniRef50_A9E2K6 B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein n=1 Tax=Kordia algicida OT-1 RepID=A9E2K6_9FLAO Length = 366 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 74/265 (27%), Positives = 144/265 (54%), Gaps = 17/265 (6%) Query: 26 VGGLILFTRN---YHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE--GFTRLPA 80 +GG++++ +N + L+ L+ +++ ++ L+V++D+EGG+V R ++ GF + + Sbjct: 62 LGGVLIYEKNITKLNSVENLKRLISKLKKHAKLPLLVSIDEEGGKVNRLKQKYGFPKTVS 121 Query: 81 AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV-GHISAAIGE--RSYHA 137 A+ ++ ++ + ++A ++ + I++++APVLDV + IG+ R++ Sbjct: 122 AKYLGEVNDIDSTKYYSD----IIAHNLLQLGINVNYAPVLDVHNDNNPPIGKNHRAFSK 177 Query: 138 DPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVF 197 +PQ + A + + +KT KHFPGHG DSH + + +++ + Sbjct: 178 NPQDIINHARQVVLSHRYFNVKTVVKHFPGHGNSRQDSHYNVTDVSKYWQQ---QEVFPY 234 Query: 198 SSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGA 255 L+ E +DAIM AH++ +D PA+ S + LR +LGFDGVIFSDD+ M+ Sbjct: 235 IKLLYEGNVDAIMTAHIVNEKLDDSKLPATLSKKIMTDYLRGDLGFDGVIFSDDMQMKAI 294 Query: 256 AIMGSYAERGQASLDAGCDMILVCN 280 A + + Q ++ AG D+++ N Sbjct: 295 ADQFGFEKSIQMAIHAGVDVLMFSN 319 >UniRef50_D0KBJ7 Glycoside hydrolase family 3 domain protein n=6 Tax=Gammaproteobacteria RepID=D0KBJ7_PECWW Length = 598 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 92/276 (33%), Positives = 132/276 (47%), Gaps = 27/276 (9%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRN---RLVVAVDQEGGRVQRFREG-FTRLPAA 81 +GG+ILF N D Q+++L A RL ++ D EGG V R G + P Sbjct: 53 IGGVILFANNLKDKLQIKKLTAWYTAMETCGGVRLFISTDNEGGNVFRLPRGDYASFPGN 112 Query: 82 QSFAALSGMEEGG---KLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYH 136 ALS EGG ++A E G LMA +M+ I+ +FAPV+DV + I R++ Sbjct: 113 M---ALSAAIEGGADIQIASEQGRLMAQDMLFTHINTNFAPVVDVNTNPFNPVINVRAFS 169 Query: 137 ADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSV 196 D +A + GM + G+ T KHFPGHG+ + DSH P R + + A D+ Sbjct: 170 DDADTVYRVAEKMTAGMKQQGLITAYKHFPGHGSTSTDSHTGLPRVDRSRKDAFAIDIMP 229 Query: 197 FSSLIRENKL-DAIMPAHVIYSDVDPR------------PASGSPYWLKT-VLRQELGFD 242 + I + D +M AH+ Y +D PA+ S Y ++T +LR ELG+ Sbjct: 230 YKRAIDSQTVPDMVMTAHIQYPALDNSQVTTRNGEKITVPATMS-YAIQTDILRGELGYA 288 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 GV SD L M A S E + AG D+ L+ Sbjct: 289 GVTISDALDMGAIADHFSQEEAVRQVFTAGVDIALM 324 >UniRef50_A1VC58 Glycoside hydrolase, family 3 domain protein n=3 Tax=Desulfovibrio vulgaris RepID=A1VC58_DESVV Length = 481 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 98/315 (31%), Positives = 163/315 (51%), Gaps = 32/315 (10%) Query: 26 VGGLILFTRNYHDPA----------QLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE-- 73 VGG+ILF R+ DP+ QL L ++AA+ L +AVDQEGGRV+R + Sbjct: 88 VGGVILFDRDM-DPSVRVRNIVSKEQLSRLTGALQAAAPVPLFIAVDQEGGRVRRLKPEY 146 Query: 74 GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG----HISAA 129 GF P+A S S E+ ++A +A+EM + +++ F PV+D+ + A Sbjct: 147 GFFAYPSAASLGKGSP-EDTRRMAST----LATEMAEVGLNVDFGPVVDLAVNPSNPVIA 201 Query: 130 IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEI 189 ERSY +DP + + A+ F++G+ G+ + KHFPGHG+ DSH + Sbjct: 202 RLERSYGSDPCRVASHAAAFVNGLAWRGVVASLKHFPGHGSSLQDSHLGV---TDISSTW 258 Query: 190 RAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELGFDGVIFSD 248 R +++ ++ L+R++ +M H+ + +D PA+ S + +LR++LG+ GV+ +D Sbjct: 259 RREELGPYALLLRDDWAGMVMVGHLYNNRIDAAHPATLSQRTIDGLLRRDLGWKGVVVTD 318 Query: 249 DLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGS 308 DL M S E + +++AG D++L NN + + L+ + RL +G Sbjct: 319 DLQMGAITARYSLDETVRLAVEAGADILLFGNN----LVWDEGLAEKVHATLVRLVREGK 374 Query: 309 FSRQELMDSARWKAI 323 S Q L S W+ I Sbjct: 375 VSEQRLRQS--WERI 387 >UniRef50_A1ZDA7 Glycosyl hydrolase, family 3 n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZDA7_9SPHI Length = 383 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 93/320 (29%), Positives = 155/320 (48%), Gaps = 32/320 (10%) Query: 17 EREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAASRNRLVVAVDQEGGR 67 +R+I H L GG+IL++R N +QL+ L+ ++ S L+VAVD+EGG+ Sbjct: 65 KRDIRKHHL-GGVILYSRDLLMGAKPRNIVSKSQLKTLITDLKKLSDVPLLVAVDEEGGK 123 Query: 68 VQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV--LDV 123 V R + GF Q ++ + K A +A ++ AM +++FAPV L+V Sbjct: 124 VSRLNAKFGFATTQTPQEIGEINDLVATEKWATH----IAQKVKAMGFNVNFAPVTDLNV 179 Query: 124 GHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH---KE 178 + IG+ RS+ D + A +F + + G+ KHFPGHG+ DSH + Sbjct: 180 NPKAPIIGKLGRSFSGDVNTLVQHALKFTEVHQKHGIVCAIKHFPGHGSAIMDSHLGFTD 239 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 +P ++ F +I + +M AHV +D + PA+ S + LR+ Sbjct: 240 VTTTWKPI------ELKPFKQMIEQGFDGMVMTAHVFNKKLDAKYPATLSKKIMNDYLRK 293 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 + G+ G+I SDD+ M A E + S++AG D++L NN G + N+ P Sbjct: 294 KWGWQGIIISDDMQMNAIAKNFGIEEALEKSINAGVDIVLFSNN--GRIFYNKNIVPEAI 351 Query: 298 ERVTRLYHKGSFSRQELMDS 317 + +L +G SR+ + +S Sbjct: 352 NIIKKLIKQGKISRKRIDES 371 >UniRef50_A6G4L8 Putative uncharacterized protein n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G4L8_9DELT Length = 279 Score = 122 bits (305), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 79/221 (35%), Positives = 118/221 (53%), Gaps = 10/221 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA--ASRNRLVVAV 61 + + G E+ + RE + VGG+ILF RN DPAQ+R +V ++ A + L VA+ Sbjct: 11 LFVGFHGLEIPPDLRERVREGRVGGVILFKRNIGDPAQVRRVVAELHALAPADAPLTVAL 70 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 DQEGGRVQR R+ +T P ++ A E+ + A+ +A E++ D+ FAPV+ Sbjct: 71 DQEGGRVQRLRDPWTEWPPMRTLGA-RAPEDTARFARA----LALELLDCGFDLDFAPVV 125 Query: 122 DV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV + IG+RS+ DP A + FI M AG+ KHFPGHG DSH Sbjct: 126 DVDSNPDNPVIGDRSFARDPATVGAHGAAFIRAMQAAGVAACAKHFPGHGDTDLDSHLAL 185 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD 220 P + ++ FS++I E ++ IM AHV++ +D Sbjct: 186 PRLDLDLERLERVELPPFSAVI-EAEVATIMTAHVLFPKLD 225 >UniRef50_A2QDM1 Contig An02c0230, complete genome n=9 Tax=Trichocomaceae RepID=A2QDM1_ASPNC Length = 859 Score = 122 bits (305), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 85/268 (31%), Positives = 142/268 (52%), Gaps = 20/268 (7%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASR-----NRLVVAVDQEGGRVQRFREGFT-RLP 79 +G ++LF RN D AQL+ L ++ +R + L++ +DQE G V R +LP Sbjct: 36 IGAIVLFKRNVSDAAQLQALTHALQEEARLAGHEHPLLIGIDQENGLVTRVSPPVAPQLP 95 Query: 80 AAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHA 137 + A + E LA + G + A I++++APV D+ ++ IG RS+ Sbjct: 96 GPMALGAANSPE----LAYQVGAATGEILSASGINMNYAPVCDINSEPLNPVIGVRSFGD 151 Query: 138 DPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVF 197 +P+ AS G+ E + T KHFPGHG DSH P P+ + ++ + ++ F Sbjct: 152 NPEFVARFASASAQGLREHKVVPTVKHFPGHGDTAVDSHYGLPVIPKTREQLESCELVPF 211 Query: 198 SSLIRENKLDAIMPAHVIYSDV-DPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGA 255 E ++AIM AH+ + D R PA+ SP ++ +LR+++G+ G++ +D L M+G Sbjct: 212 RQAANEG-IEAIMTAHISLPAIGDGRLPATLSPDVMR-ILREDMGYSGMVITDCLEMDG- 268 Query: 256 AIMGSY-AERGQA-SLDAGCDMILVCNN 281 I +Y E+G +L AGCD I++C+ Sbjct: 269 -IRATYGTEQGAVLALAAGCDSIMICHT 295 >UniRef50_C0ZKK5 Probable beta-hexosaminidase n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZKK5_BREBN Length = 537 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 93/331 (28%), Positives = 157/331 (47%), Gaps = 33/331 (9%) Query: 16 EEREI-LAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-----LVVAVDQEGGRVQ 69 EE E+ + +G +ILF+RN P + + V+++++ + L+V DQE G V Sbjct: 27 EEMEVWIKQKQIGSVILFSRNIGTPEETYQRVQKLQSWAYESWHPYPLLVGTDQENGVVS 86 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 R ++G TR P A + A +++ ++ + G E+ A I ++FAP +DV + + Sbjct: 87 RLQKGATRFPGAMALGATGNLDQAKRIYEATG----EELRAAGISVNFAPTVDVNNNPDN 142 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 IG RS+ P++ I G+ +G+ KHFPGHG + DSH+ P + Sbjct: 143 PVIGVRSFGEAPEQVARFGEAAIQGLLASGVIPAIKHFPGHGDTSIDSHEAVPVLGHDRT 202 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGVI 245 + ++ F I + +M H+ +D PA+ S + +LR+ LGFDGV Sbjct: 203 RLDQVELIPFQRSIAAG-VPMVMVGHISLPSMDESGSPATYSKEIVTGILRESLGFDGVA 261 Query: 246 FSDDLSMEGAAIMGS--YAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRL 303 +D ME AAI G+ AE S+ AG D++LV + + D R+ Sbjct: 262 ITD--CMEMAAIAGTIGVAEAAVRSVQAGIDLVLVSHTHEVQQKTYD--------RLVAA 311 Query: 304 YHKGSFSRQELMDSARWKAISTRLNQLHERW 334 H G S + + ++ R+ QL +R+ Sbjct: 312 IHSGELSEERVNEAV------NRVLQLKKRF 336 >UniRef50_B2SGG8 Glycosyl hydrolase family 3 n=17 Tax=Francisella RepID=B2SGG8_FRATM Length = 555 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 80/268 (29%), Positives = 131/268 (48%), Gaps = 22/268 (8%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA 85 +GG ILF N + Q+ L+R ++A + + A DQEGGRV R ++G T + A Sbjct: 49 LGGFILFRENIQNNEQVISLLRDLQANTNTPIFFATDQEGGRVNRLQQG-TSGCGNMALA 107 Query: 86 ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIGERSYHADPQKAL 143 A +A+ ++ E+ ++ I+I+FAP +DV + IG RSY +P + Sbjct: 108 ATDNPHNAYTMAK----IIGDELYSLGININFAPAVDVNSNKNNPIIGVRSYSDNPDIVI 163 Query: 144 AIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE 203 A I+G H+A + KHFPGHG DSH + E++ ++ FS L R+ Sbjct: 164 DYAKNAINGYHDAKIIDCIKHFPGHGDTATDSHLGNVNLDKTLKELQTTELLPFSKLARD 223 Query: 204 NKLDAIMPAHVIYSDVDPR-------------PASGSPYWLKTVLRQELGFDGVIFSDDL 250 + IM AH+ +D PA+ S + +L+Q++ FDG++ SD + Sbjct: 224 CSM--IMTAHISVPALDDTQYQSVSTSENIYVPATLSYKIITKLLKQQMKFDGLVVSDAM 281 Query: 251 SMEGAAIMGSYAERGQASLDAGCDMILV 278 M A E + ++ AG D++L+ Sbjct: 282 DMHAIAKHFGTIEASKLAILAGIDILLM 309 >UniRef50_D2PXL5 Glycoside hydrolase family 3 domain protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PXL5_9ACTO Length = 487 Score = 121 bits (304), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 90/264 (34%), Positives = 133/264 (50%), Gaps = 23/264 (8%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA 85 +G +ILFTRN D Q+ L +RA R L++A+D EGG F+ L A + Sbjct: 36 LGAVILFTRNVTDAEQVAALTTALRA-ERPDLLIAIDHEGGE-------FSHLAPANPWP 87 Query: 86 ALS----GMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERSYHADP 139 S G + L + + A+ + A+ ID+ AP +DV A I RS+ Sbjct: 88 LSSPRSLGELDDTDLTRASARDAAATLAALGIDVMLAPSVDVNSNPANPIISTRSFG--- 144 Query: 140 QKALAIASR----FIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMS 195 + SR F+ G+H+AG+ KHFPGHG V DSH P R ++ A +++ Sbjct: 145 -TTAGVVSRHGVAFVQGVHDAGIAACPKHFPGHGDVAVDSHLALPRVDRSVEQVTAVELA 203 Query: 196 VFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGA 255 F + I E D +M AH+I+S D +PA+ S L+ +LR ELGF GVI +D L M+ Sbjct: 204 PFKATI-EAGADLVMSAHIIFSAYDDQPATLSHRILQGLLRDELGFTGVITTDALEMQAI 262 Query: 256 AIMGSYAERGQASLDAGCDMILVC 279 S + AS+ AG D+ ++ Sbjct: 263 TATRSIEDAAVASIRAGADLAMIA 286 >UniRef50_C9KM62 Glycosyl hydrolase domain protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KM62_9FIRM Length = 381 Score = 121 bits (303), Expect = 4e-26, Method: Compositional matrix adjust. Identities = 84/282 (29%), Positives = 144/282 (51%), Gaps = 14/282 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR--LVVAV 61 VM+ V+G E+ + +L +GG+ILF RN Q ++L+ ++ + + L + V Sbjct: 66 VMIGVQGTEVTDDSLYMLHQYHMGGVILFDRNMESADQTKKLIADLQEQADQKVPLFIGV 125 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 D+EGG+V R + T P+ Q G A+E+ A ++ + +++FAPV Sbjct: 126 DEEGGQVVRGKSFLTPPPSEQEI----GRSGEVTRAEESARQTAEKLKKLGFNVNFAPVA 181 Query: 122 DVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 DVG + RS+ DP++ + G + + KHFPG G T DSH++ Sbjct: 182 DVGDYA-----RSFGPDPEQTAKFVAAAAQGYEQEHVMYALKHFPGIGRGTVDSHEDISS 236 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDA--IMPAHVIYSDVDP-RPASGSPYWLKTVLRQE 238 + E+ +D+ F ++I E + + ++ +H+ Y +D PAS S LR E Sbjct: 237 ITATKDELMKRDLVPFRTVIDERQPEDYFVLVSHLRYPALDAENPASLSKAIQTDFLRGE 296 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 LG+ G+I +DD+ M A S+ E G +++AG D++LVC+ Sbjct: 297 LGYRGLIITDDVEMGALAKHYSFRELGVKAVEAGSDIVLVCH 338 >UniRef50_A4XA71 Glycoside hydrolase, family 3 domain protein n=4 Tax=Micromonosporaceae RepID=A4XA71_SALTO Length = 575 Score = 120 bits (302), Expect = 6e-26, Method: Compositional matrix adjust. Identities = 97/282 (34%), Positives = 139/282 (49%), Gaps = 29/282 (10%) Query: 19 EILAHPLVGGLILF-------------TRNYHDPAQLRELVRQIRAASRNR------LVV 59 +++A +GGLIL T N +P Q+R L +R+A+ + ++ Sbjct: 107 QMIAKYRLGGLILVGFSADDPTSGNQETTNVDNPEQVRALTTGLRSAAADLATGAAPFLI 166 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMA-SEMIAMDIDISFA 118 DQE G V R +G T LP+A + A +G E E+ W A +E+ AM +++ FA Sbjct: 167 GTDQEYGVVTRITDGVTMLPSALATGA-AGKPE----LTESAWRAAGTELAAMGVNVDFA 221 Query: 119 PVLDV-GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PV DV S IG RSY ADP S + G+ G+ T KHFPGHG DSH+ Sbjct: 222 PVADVLVTPSTVIGSRSYGADPSMVAEQVSGVVRGLQATGVAATLKHFPGHGHSATDSHE 281 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLR 236 P P+P+ + + F++ A+M H+ VDP PA+ S L VLR Sbjct: 282 ALPVLPQPRTVLELEAWRPFAA-GIGAGALAVMSGHLDVRAVDPGTPATFSHTLLTEVLR 340 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 ELGF GV+ +D ++M A S E +L AG D+IL+ Sbjct: 341 GELGFQGVVITDGMNMAPAKRW-SPGEAAVRALKAGNDLILM 381 >UniRef50_A8F374 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F374_THELT Length = 456 Score = 120 bits (301), Expect = 7e-26, Method: Compositional matrix adjust. Identities = 78/245 (31%), Positives = 133/245 (54%), Gaps = 13/245 (5%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE 73 ++++ +++ + +GG++L+ + DP++ E++ + +L+V+ D EGG++ E Sbjct: 14 NSDDLKVVLNYDIGGVLLYPSVFEDPSEFAEIMCNLEG---KKLLVSTDHEGGQL----E 66 Query: 74 GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIG 131 +P QS L + + L M + I++ FAPVLD+ + IG Sbjct: 67 ILPFIP--QSPGNLLFGQTNPDFVFKYCNLSGKFMKELGINMIFAPVLDLKFPETNPVIG 124 Query: 132 ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRA 191 RSY +DP+K I+G AG+ + KHFPGHG DSH+E + E+ Sbjct: 125 YRSYGSDPKKVSEFGLSAINGYKAAGLLSCAKHFPGHGRTRQDSHQEIAVVDIDREEL-D 183 Query: 192 KDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLS 251 KD+ F + +N +D+IM AHVIYS +D +PAS S ++ ++LR EL +DG+I SD + Sbjct: 184 KDLYPFRIAV-QNGVDSIMMAHVIYSKIDEKPASISRRFIDSMLRNELQYDGLIISDAVE 242 Query: 252 MEGAA 256 M+ + Sbjct: 243 MKSLS 247 >UniRef50_B4CZU1 Beta-N-acetylhexosaminidase n=2 Tax=Verrucomicrobia RepID=B4CZU1_9BACT Length = 366 Score = 120 bits (301), Expect = 8e-26, Method: Compositional matrix adjust. Identities = 87/288 (30%), Positives = 138/288 (47%), Gaps = 16/288 (5%) Query: 1 MGPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVV 59 +G ++L V G ELDA GG ILF RN QLR+L+ +R S+ ++ Sbjct: 6 LGQLLLTGVPGPELDAATAAQFRRLQPGGFILFGRNIQTANQLRKLIDDLRDLSQVEPMI 65 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +DQEGGRV R + P A + E L + G + A + ++ P Sbjct: 66 TIDQEGGRVSRLKLVGNEPPNANQLRDKNDPE----LIRRHGTITAKLLRIFGFNLDLCP 121 Query: 120 VLDVGHISAA---IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 VLD+ A + R Y + + + A F + G+ + GKHFPG+ D+H Sbjct: 122 VLDISFDDEADNSLRGRCYGNNVPQVIRNAGAFNSALQAGGILSCGKHFPGYSRAPLDAH 181 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTV 234 E P P + E+ +++VF +K+D++M H Y ++ P+S S + + Sbjct: 182 HELPEIPLSREELETHELAVFKHF--ADKVDSMMIGHGFYPALESSRTPSSLSHRVITDL 239 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYA--ERGQASLDAGCDMILVCN 280 LR+E GF G+I +DDL M AI+ Y E + ++ AG DM ++C+ Sbjct: 240 LRKEQGFKGLIMTDDLDM--GAILNHYGLEETIRLAITAGNDMAMICH 285 >UniRef50_Q2NRJ7 Putative glycosidase n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRJ7_SODGM Length = 650 Score = 119 bits (299), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 93/287 (32%), Positives = 137/287 (47%), Gaps = 25/287 (8%) Query: 13 LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR---LVVAVDQEGGRVQ 69 L+A IL +GG+ILF N Q+R L Q+ A + L++A+D EGG V Sbjct: 95 LNANIEHILHTYHIGGVILFANNLKKTDQIRRLTAQLAAVPQTHGIGLMIAIDNEGGNVF 154 Query: 70 RFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA 128 R G +T AL L G +MA E+ ++ I+++FAPV+DV A Sbjct: 155 RLPRGEYTAFAGNMPLGALDQGHSESVLPYLQGKIMAQELRSLGINVNFAPVVDVNSNQA 214 Query: 129 --AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 I R++ P + +A + G A + T KHFPGHG DSH + P R + Sbjct: 215 NPVINVRAFSDSPARVDKLARELVKG--SAQIVTCYKHFPGHGDTHTDSHMDLPRVDRNR 272 Query: 187 AEIRAKDMSVFSSLIRENKL-DAIMPAHVIYSDVDPR------------PASGSPYWLKT 233 E A D++ + + I EN+ D IM AH+ Y +D PA+ S Sbjct: 273 EEAYAIDLAPYRAAIAENEAPDMIMTAHIQYPALDDSRITRLDGEDIVVPATLSRRIQHD 332 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERG--QASLDAGCDMILV 278 +LRQELG+ GV +D L M+ AI ++ R + +AG D+ L+ Sbjct: 333 LLRQELGYRGVTITDALDMK--AISANFDPRDVVKRVFNAGIDIALM 377 >UniRef50_B4S1A2 Beta-hexosaminidase A n=2 Tax=Alteromonas macleodii RepID=B4S1A2_ALTMD Length = 673 Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 84/293 (28%), Positives = 135/293 (46%), Gaps = 25/293 (8%) Query: 26 VGGLILFTRNYHDPAQLREL----VRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAA 81 +GG+ILF+ N + QL L + + + L +A+DQEGGRV R A Sbjct: 105 IGGVILFSENIENTEQLITLNYTMQQHMLKVGKQPLFIAIDQEGGRVARLPSDMLAPFAG 164 Query: 82 QSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHADP 139 + + G AQ + ++ + I+ +FAP +D+ + I RS+ P Sbjct: 165 NMAIGATFHQHGSTFAQRVARHIGKTLLPLGINTNFAPSVDINSEPKNPVINVRSFSEHP 224 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSS 199 ++ A+ F++ M G+ + KHFPGHG DSH P +A+ D+ FS+ Sbjct: 225 EQVAALGQTFVEAMQGTGVISAIKHFPGHGDTHVDSHSGLPLVSHTRAQAMTGDILPFSN 284 Query: 200 LIRENKLDA-IMPAHVIYSDVDPR------------PASGSPYWLKTVLRQELGFDGVIF 246 +IR A +M AH+ Y +D PA+ S L +LR++LG+ G+I Sbjct: 285 IIRSATPPAMVMSAHIQYPSLDDTKITDKNGKRQIVPATLSKKILTNLLRKQLGYKGLIV 344 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILV---CNNR---KGAVSVLDNLS 293 +D L M G S+ E + AG D+ L+ N+ V +LDN++ Sbjct: 345 TDALDMAGITQFFSHEEALVRAFSAGADIALMPFTIRNKTDINAFVKLLDNVA 397 >UniRef50_C1F257 Glycosyl hydrolase, family 3 n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F257_ACIC5 Length = 361 Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 94/328 (28%), Positives = 152/328 (46%), Gaps = 12/328 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +M+ + G L ER L G+ILF RN Q L+R+ + VD Sbjct: 16 MMVGLSGTALSELERAWLKLIRPTGIILFRRNIETIPQTVALLREASRIIDRPTLRCVDM 75 Query: 64 EGGRVQRFREGFTRLPA-AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 EGG V R R+ T +P+ A+ FA ++ G L+ + + APV+D Sbjct: 76 EGGMVDRLRDVLTPMPSPARVFAT-----RDPQMFYRHGKLIGKAAHMAGFNATLAPVVD 130 Query: 123 VGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 + ++A + R A P + A F++G+ + KHFPG G DSH P Sbjct: 131 LALPNSAGVMKTRVVSALPHEVRQYALTFLEGLETEEVFGCAKHFPGLGGGIVDSHLTMP 190 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQE 238 R A++ +D++ F SL R ++ M AH Y V + PAS SP+W++ +LR+ Sbjct: 191 VIDRKWADLWNEDLAPFRSLAR--RVPITMVAHAAYPLVTRNHTPASLSPHWIQKILRKR 248 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 +GF G++ SDD+ M G S E ++ AG D++ +C+N ++ + L + Sbjct: 249 VGFRGLVLSDDMEMGGVLSHCSMGEAVIHAIAAGTDVVEICHNPALILTAYEALIREAEQ 308 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTR 326 RV + +R+ + RW I T+ Sbjct: 309 RVAFRRQLTAAARRVEEWARRWSLIQTQ 336 >UniRef50_Q6AJ70 Related to glycosyl hydrolase n=1 Tax=Desulfotalea psychrophila RepID=Q6AJ70_DESPS Length = 377 Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 84/278 (30%), Positives = 146/278 (52%), Gaps = 35/278 (12%) Query: 26 VGGLILFTR---------NYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE--G 74 +GG+ILF + N P Q+ +L +++ S L++AVDQEGG V RFR G Sbjct: 66 LGGVILFDKCLATGSRKNNITSPEQMAKLTSDLQSCSAEPLLIAVDQEGGMVSRFRSAYG 125 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV--LDVGHISAAIG- 131 F+ P+A +E+ + A++ ++ ++ + APV LD+ ++ IG Sbjct: 126 FSTSPSALMLGQQDDLEKCREAAEQTSQMLKDA----GVNFNLAPVVDLDLTALNPIIGK 181 Query: 132 -ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIR 190 +RS+ + Q + A ++ ++G+ + KHFPGHG+ ADSH R +I Sbjct: 182 YQRSFGDNAQTVIECALSWMSAHRKSGILSCLKHFPGHGSSLADSH-------RGFVDIT 234 Query: 191 A----KDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVI 245 +++ + + + ++DAIM H+ + ++DP PAS S + ++LRQE+ + G+I Sbjct: 235 ESWDREELLPYKKIHQRTEIDAIMMGHLFHKNLDPLYPASLSKRIIDSLLRQEMNYQGLI 294 Query: 246 FSDDLSMEGAAIMGSYA-ERGQA-SLDAGCDMILVCNN 281 SDD+ M AI Y E+ ++ AG DM+++ NN Sbjct: 295 ISDDMQM--GAITKYYGLEKACCRAICAGVDMVIIGNN 330 >UniRef50_D2M1Q8 Glycoside hydrolase family 3 domain protein n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2M1Q8_BACS4 Length = 518 Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 86/293 (29%), Positives = 152/293 (51%), Gaps = 22/293 (7%) Query: 9 EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR-----AASRNRLVVAVDQ 63 EG ++ + +E++ VG +ILF+RN P ++ L +++ A ++ L++ +DQ Sbjct: 17 EGKQVTEQMKELIHEYRVGAIILFSRNIGSPEEVLTLTTELQKEAKIARHKHPLLICIDQ 76 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 E G V+R +G T P A + G + A + G E+ A+ I+ + APVLDV Sbjct: 77 ENGIVRRLGDGTTVFPGAMAL----GATQKPNHAYDIGVATGRELKALGINWNLAPVLDV 132 Query: 124 GH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + IG RS+ + + GM A + TT KHFPGHG DSH P Sbjct: 133 NNNPKNPVIGVRSFGESAELVSQFGIAAMKGMQAAEVVTTLKHFPGHGDTDVDSHVALPV 192 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPR-PASGSPYWLKTVLRQE 238 + +++ F + + N D +M AH+ + ++ DP+ PA+ S + +LR++ Sbjct: 193 ITHDLNRLEEIELAPFKACMDANA-DVVMSAHIHFPEIEKDPQVPATLSKAIITGLLREK 251 Query: 239 LGFDGVIFSDDLSMEGAA-IMGSYAERGQ-ASLDAGCDMIL---VCNNRKGAV 286 L +DGV+ +D + M+ + +G+ ERG A++ AG D+++ + N + GA+ Sbjct: 252 LQYDGVVTTDCMEMDAISETIGT--ERGAVAAIVAGVDLVMISHIANRQIGAL 302 >UniRef50_C0D0H0 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D0H0_9CLOT Length = 447 Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 81/245 (33%), Positives = 125/245 (51%), Gaps = 13/245 (5%) Query: 18 REILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN----RLVVAVDQEGGRVQRF-- 71 R+ L VGGLI F N P Q+R+L R L+++VD+EGG+V R Sbjct: 138 RKALESRPVGGLIYFRNNLQTPEQVRDLTENTARIGREVCSLPLILSVDEEGGKVTRIGG 197 Query: 72 REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAA 129 R GF +P + + + GK A E G + S + ++ FAP DV + Sbjct: 198 RAGFD-VPVFSNMWEIGAQNDPGK-AYEVGSAIGSYLSGYGFNLDFAPDADVLTNPDNQV 255 Query: 130 IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEI 189 IG RS+ +DP+ ++AS+ G+ + G++ KHFPGHGA D+H + ++ Sbjct: 256 IGNRSFGSDPELVWSMASQVAKGLKDQGIQPVYKHFPGHGATLGDTHDGFSYTEKTLDQL 315 Query: 190 RAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGVIFS 247 ++ F++ ++ N IM AH+ V D PAS S + VLR++LGFDGV+ + Sbjct: 316 LEAELVPFANAVK-NGESCIMAAHIAAPGVTGDETPASLSHALITGVLREKLGFDGVVVT 374 Query: 248 DDLSM 252 D L+M Sbjct: 375 DALNM 379 >UniRef50_P48823 Beta-hexosaminidase A n=3 Tax=Alteromonadales RepID=HEXA_PSEO7 Length = 598 Score = 118 bits (296), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 83/286 (29%), Positives = 137/286 (47%), Gaps = 21/286 (7%) Query: 13 LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-----LVVAVDQEGGR 67 L E E+++ +GG ILF N + AQ+ L +++A++ L +A+DQEGGR Sbjct: 51 LPPELSELISRYDIGGAILFAENVQNTAQIISLTNALQSAAQQSKSQLPLFIAIDQEGGR 110 Query: 68 VQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI 126 V R RE T S A + ++G A + + E+ ++ I+++FAP +DV Sbjct: 111 VARINREQATSFTGNMSIGA-TYPKQGDIYATKVASAIGKELNSLGINVNFAPTVDVNSN 169 Query: 127 --SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + I RS+ +P + + AG+ + KHFPGHG DSH P Sbjct: 170 PNNPVINVRSFSENPTVVTKLGLAQVKAFEAAGVLSALKHFPGHGDTHVDSHTGLPRVDH 229 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYWLK 232 + +I +D+ F+ +I+ + IM AH+ Y +D RPA+ S + Sbjct: 230 DRDKINQQDLLPFAEIIKASPPGMIMTAHIQYPALDNSKVVNSQGESMIRPATMSYQIMT 289 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV 278 +LR ELG+ GV +D L M G + + + + +AG D+ L+ Sbjct: 290 QLLRHELGYQGVTVTDALDMAGISDFFNPVDATIETFNAGVDIALM 335 >UniRef50_D2AXZ5 Beta-N-acetylhexosaminidase n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2AXZ5_STRRD Length = 504 Score = 118 bits (296), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 89/269 (33%), Positives = 135/269 (50%), Gaps = 20/269 (7%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFR--EGFTRLPAAQS 83 +GG+ L++RN D AQ+R L +RA + + +V+A D+E G V R G +R P + Sbjct: 43 LGGVALYSRNLVDSAQVRALTAALRAENPD-VVIATDEEAGDVTRLEALTGSSR-PGNLA 100 Query: 84 FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQK 141 A+ E +A+ G +M I +++APV+DV + +G R++ ADP Sbjct: 101 LGAVDDPELTELVARGVG----RDMADAGITLNYAPVVDVNSDLDNPVVGTRAFGADPWL 156 Query: 142 ALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLI 201 A ++ G+ AG+ KHFPGHGA DSH + P E+ A + + + I Sbjct: 157 VARHARAWVTGLQSAGVAACAKHFPGHGATAVDSHNDLPTVSYSAEELAATTLPPYRTAI 216 Query: 202 RENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAA---- 256 E + A+M H++ DP PA+ S L +LR+ELGFDG+I +D + M A Sbjct: 217 -EAGVRAVMTGHLLVPAYDPDFPATMSSRILVGLLREELGFDGLIVTDGVEMPSVAERYG 275 Query: 257 IMGSYAERGQASLDAGCDMILVCNNRKGA 285 + G+ A DA C V RKGA Sbjct: 276 VTGAAVRALAAGADAVC----VGGERKGA 300 >UniRef50_A2E8H3 Glycosyl hydrolase family 3 N terminal domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2E8H3_TRIVA Length = 474 Score = 118 bits (296), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 78/252 (30%), Positives = 124/252 (49%), Gaps = 13/252 (5%) Query: 34 RNYHDPAQLRELVRQIRAASRNRL----VVAVDQEGGRVQRFREGFTRLPAAQSFAALSG 89 RN + Q+R L I+ S N+ ++++DQE G R T P++ + AA + Sbjct: 3 RNLRNQEQIRNLTATIQNYSYNQYNLPSLISIDQESGSNIRMFRNSTVFPSSMTIAATNN 62 Query: 90 MEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIAS 147 +E A ++ +L A E+ + I+++FAPV DV + +G+RS+ +P Sbjct: 63 LEN----AYKSSYLQALELNDVGINLNFAPVSDVNSNIHNPIVGDRSFGDNPNVVSKYVQ 118 Query: 148 RFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLD 207 +I G +AG+ KHFPGHG + DSH E P R ++ D+ F + I +D Sbjct: 119 NYIKGHKKAGVLCCSKHFPGHGETSQDSHYELPRINRTFESLKEIDLPPFKASINYG-VD 177 Query: 208 AIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQA 267 IM H+I + P S S + LR ELG++GV +D + M+ A Y E Sbjct: 178 MIMLGHLITPLDETMPCSTSSKCI-NYLRNELGYEGVTITDSMVMK-AIKQEKYNETILE 235 Query: 268 SLDAGCDMILVC 279 ++ AG D+I C Sbjct: 236 AILAGADLICSC 247 >UniRef50_Q6L346 Beta-hexosaminidase n=1 Tax=Picrophilus torridus RepID=Q6L346_PICTO Length = 454 Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 80/255 (31%), Positives = 134/255 (52%), Gaps = 18/255 (7%) Query: 29 LILFTRNYHDPAQLRELVRQIRA-----ASRNRLVVAVDQEGGRVQRFREGFTRLPAAQS 83 +IL N+ + +L L+ +I NR V ++DQEGG V R + LP+ Sbjct: 31 IILMRGNFSNKRELASLISKINKFYINDLKLNRPVFSIDQEGGNVIRISD-IDYLPSNYC 89 Query: 84 FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIGERSYHADPQK 141 G +++ AG + SE+ ++ I + APVLD+ + I ERS+ + +K Sbjct: 90 L----GFINKADISRMAGLITGSELRSIGILWNHAPVLDLLESPKNPVILERSFGKNSKK 145 Query: 142 ALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLI 201 + +I+G+ AG+ T KHFPGHG+V DSH P D R EI+ M S++ Sbjct: 146 VSKLGKSYINGLQSAGVAATAKHFPGHGSVIEDSHLTMPVDNRNINEIKTSMMPFVSAI- 204 Query: 202 RENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSY 261 + + +IM +HV+YS +D PAS S + +LR++L F GV +D + M+ A+ ++ Sbjct: 205 -NSGVKSIMLSHVLYSSIDDVPASLSK-RINDILRRDLNFTGVSVTDSIDMK--ALSSNF 260 Query: 262 AERGQASLDAGCDMI 276 + R + ++ + D+I Sbjct: 261 SIR-EIAMRSNADII 274 >UniRef50_B5RMG9 Beta-glucosidase, putative n=20 Tax=Borrelia RepID=B5RMG9_BORDL Length = 551 Score = 118 bits (295), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 76/240 (31%), Positives = 120/240 (50%), Gaps = 15/240 (6%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAAS-RNR----LVVAVDQEGGRVQRFREGFTRLPA 80 +GG+ +F N + L E + + + S +NR L +A DQEGG Q + ++ Sbjct: 70 LGGIKIFGWNAKNLYNLIESINKAQTISQKNRFKIPLFIATDQEGGWTQHIKLKTSKTIG 129 Query: 81 AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIGERSYHAD 138 A + A G+ +A+E+ + I+I+FAP+ D+ + IG R+Y + Sbjct: 130 NLGITATLSPND----AYFTGYHIANELKQLGININFAPITDIYSNEENFIIGPRTYSNN 185 Query: 139 PQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFS 198 PQ A F G +AG+ +T KHFPGHG DSH + P EI+ ++ + Sbjct: 186 PQIVSLFALAFYKGQKQAGIISTAKHFPGHGNTVIDSHIKMPIINSDLKEIKTNELLPYK 245 Query: 199 SLIRENKLDAIMPAHVIY---SDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGA 255 LI+EN + IM H+ Y ++ PAS S +K +LRQ+L +D +I +DDL M Sbjct: 246 ILIQEN-IPMIMSGHLAYPMLTNGKEIPASSSTKIIKEILRQDLKYDNLIITDDLLMNAV 304 >UniRef50_Q1IZF6 Beta-N-acetylhexosaminidase n=2 Tax=Deinococcus RepID=Q1IZF6_DEIGD Length = 478 Score = 118 bits (295), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 91/286 (31%), Positives = 145/286 (50%), Gaps = 16/286 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+D+ G LDA+ L + + LF +N + AQLR+L +R ++A+D E Sbjct: 1 MVDIPGKALDADTASHLRRHGIRAVCLFGKNVQNAAQLRQLCADLREVMGEAALIALDHE 60 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG + R T P A S +L G + L ++ +A ++ ++ ++ +FAPVLDV Sbjct: 61 GGAILRP----TFWPFAPSAMSL-GAADNVALTEDVNAALARQLRSVGVNWNFAPVLDVN 115 Query: 125 HISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 A IGER+Y AD + + + + G + KHFPGHG DSH P Sbjct: 116 VNPANPVIGERAYGADVDRVIRHGTAALLGHTRERVAGCVKHFPGHGDTHLDSHYALPRV 175 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGF 241 + +AE+ A + + F + + A+M AH++Y +DP PA+ S L +LR E G+ Sbjct: 176 SKSRAELDAVEFAPFRACLPLAP--AVMTAHIVYEALDPCSPATLSRTVLTELLRGEWGY 233 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQA---SLDAGCDMILVCNNRKG 284 GV +D + M+ AI Y RG+A +L AG D+++ R+ Sbjct: 234 QGVTVTDSMGMQ--AIDAHYG-RGEAAVLALRAGADLVMALGRREA 276 >UniRef50_C7MG90 Beta-glucosidase-like glycosyl hydrolase n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MG90_BRAFD Length = 519 Score = 117 bits (293), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 107/330 (32%), Positives = 156/330 (47%), Gaps = 32/330 (9%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G +M G EL E+L L LILF N DP L R I A + + VVA Sbjct: 11 LGVMMAPFTGTELPDWVGEVLDAGLAS-LILFGHNTPDPQTTARLARAIHARAEH-CVVA 68 Query: 61 VDQEGGRVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +D+EGG V R + + LP A + + +E L + G + + A D+D+ AP Sbjct: 69 IDEEGGDVTRVQAATGSSLPTAWALGEVDDIE----LTRHVGSALGDVLAACDVDLDLAP 124 Query: 120 VLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV A IG R++ DP + A A F G+ EAG+ T KHFPGHGA DSH Sbjct: 125 VLDVSTDPANPVIGTRAFGDDPDRVAAHARAFATGLIEAGLGTCAKHFPGHGATATDSHT 184 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 PR + R + + LD +M AHV+ PAS +P W + +L + Sbjct: 185 AL---PRIDLDARRFEREHLAPWRIAPWLDCVMTAHVLVPAFGEGPASLAP-WSRPLLDR 240 Query: 238 ELG--FDGVIFSDDLSMEGAAIMGSYAERGQA--SLDAGCDMILVCNN-RKGAVSVL--- 289 +G F G++ +D L M AA+ + + A +++AG D++ + + R+ +L Sbjct: 241 AVGGSFHGLVITDALDM--AAVTAAPGDGAAAVRAIEAGADLLCLGTSLRRDDQQMLREA 298 Query: 290 -DNLSPIKAERVTRLYHKGSFSRQELMDSA 318 D LS AE G +R+ L D A Sbjct: 299 HDALSAAVAE--------GRLTRETLRDRA 320 >UniRef50_B6Q1F2 Beta-N-acetylglucosaminidase, putative n=1 Tax=Penicillium marneffei ATCC 18224 RepID=B6Q1F2_PENMQ Length = 877 Score = 117 bits (293), Expect = 6e-25, Method: Compositional matrix adjust. Identities = 93/341 (27%), Positives = 158/341 (46%), Gaps = 38/341 (11%) Query: 10 GYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR-----AASRNRLVVAVDQE 64 G+ E R ++ +G +ILF RN D QL+ L ++ A + L + +DQE Sbjct: 23 GFTSSPEIRTLIRDYGLGAVILFRRNIKDATQLQALTLALQQEAKDAGHEHPLFIGIDQE 82 Query: 65 GGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 G V R + + P + A + K+ + G +++ I++++APV DV Sbjct: 83 HGLVTRITPQTAAQQPGQMTLGATQSADNAYKIGKCTGEMLS----FFGINMNYAPVCDV 138 Query: 124 GH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 ++ IG RS+ +P ++S G+ E G+ T KHFPGHG DSH P Sbjct: 139 NSEPLNPVIGVRSFGDNPNFVAKMSSATARGLRETGVVPTVKHFPGHGDTAVDSHHGLPV 198 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV------DPRPASGSPYWLKTVL 235 + ++E+ ++ F + E ++A+M AH+ + + PA+ SP L +L Sbjct: 199 INKSRSELERCELIPFRRSVAEG-IEAVMTAHIALPRISSGPESNRLPATLSPEALG-IL 256 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSY--AERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 R ++ +DGVI +D L M G I G+Y + SL AG D +++C+ V+ + Sbjct: 257 RNDMKYDGVIVTDCLEMNG--IRGTYGTVDGTLMSLKAGSDSVMICHTYAVQVTSI---- 310 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 ERV + G + ++ R R+ L ER+ Sbjct: 311 ----ERVVQAVKSGELPESRICEALR------RITSLKERF 341 >UniRef50_C6WFU8 Glycoside hydrolase family 3 domain protein n=3 Tax=Actinomycetales RepID=C6WFU8_ACTMD Length = 403 Score = 117 bits (292), Expect = 9e-25, Method: Compositional matrix adjust. Identities = 92/265 (34%), Positives = 138/265 (52%), Gaps = 19/265 (7%) Query: 49 IRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEM 108 + AAS L V+VD EGGRVQR +P+A++ E+ +LA++ G M + Sbjct: 135 VHAASTLPLTVSVDDEGGRVQRIDALDGDIPSARTMTRTLSTEQVRELARKRGEAMKARG 194 Query: 109 IAMDIDISFAPVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 + D+ APVLD+ +A IG+RS+ DP A++ A F +G+ +AG+ + KHFP Sbjct: 195 VNTDL----APVLDLTSQAANTVIGDRSFSVDPATAVSYAEAFAEGLRQAGVVSVVKHFP 250 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPAS 225 GHG + DSH + P P A++RA D++ + L R + +M H+ D+ PAS Sbjct: 251 GHGNTSGDSHLGSVTAP-PLAQLRAHDLAPYRELPRFGEDVQVMVGHIAVPDLTGGLPAS 309 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDL-SMEGAAIMGSYAERGQASLDAGCDMIL-VCNNRK 283 SP + +LR E FDG++ +DDL +M + + +L AG D+ L R Sbjct: 310 LSPAAYE-LLRGEFAFDGLVMTDDLGAMRAVTDLADLPDAVLRALVAGADVALWSSGGRV 368 Query: 284 GAVSVLDNLSP------IKAERVTR 302 G VLD L + AERV R Sbjct: 369 G--EVLDRLQAAVASGELSAERVDR 391 >UniRef50_A5IG30 Glycosyl hydrolase n=6 Tax=Legionella RepID=A5IG30_LEGPC Length = 358 Score = 116 bits (291), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 99/332 (29%), Positives = 167/332 (50%), Gaps = 42/332 (12%) Query: 21 LAHPLVGGLILFTR---------NYHDPAQLRELVRQI-RAASRNR-------LVVAVDQ 63 L++ +GG++LF + N + AQ+R+L+RQ+ R AS L A+D Sbjct: 31 LSNDGLGGVLLFDKDLSTGIYGKNLRNQAQIRQLIRQLNRYASMISGGNDNLPLFTAIDY 90 Query: 64 EGGRVQRFR--EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGG V R EG A+ A LS + ++ Q MA + ++ +++FAPV+ Sbjct: 91 EGGVVDRLSKIEGCPPTMKARELAKLSSDDFDEEVTQ-----MAMTLQSLGFNLNFAPVV 145 Query: 122 DVGHISAA--IGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 D+ + A IG RS+ ++P+ + +A +F+D G+ KHFPGHG+ T D+H Sbjct: 146 DLNLNAHAGIIGNLGRSFSSNPEVTIRLAKQFVDVFSRHGIACAYKHFPGHGSATGDTH- 204 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRE-NKLDAIMPAHVIYSDVDPR--PASGSPYWLKTV 234 E D + ++ + SL+++ + +M AHVI +D + PA+ S L + Sbjct: 205 EGFVD--VTETFQTDELIPYRSLLKDLSNPVMVMTAHVINKHLDSKGLPATLSYDVLTLL 262 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKG-----AVSVL 289 LRQ +G+DGVI SDDL M + S E +++AG DM++ N + V+ Sbjct: 263 LRQSIGYDGVIISDDLQMHAISNHYSLEEALCLTINAGADMVIFANQLGTITAPEVIDVI 322 Query: 290 DNL---SPIKAERVTRLYHKGSFSRQELMDSA 318 + L I ++R+ Y + +Q+++ SA Sbjct: 323 EKLVIDKQIPSQRIDEAYRRIVRLKQQIICSA 354 >UniRef50_C6B6L4 Beta-N-acetylhexosaminidase n=11 Tax=Rhizobium RepID=C6B6L4_RHILS Length = 340 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 85/258 (32%), Positives = 124/258 (48%), Gaps = 15/258 (5%) Query: 28 GLILFTRNYHDPAQLRELVRQIR-AASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAA 86 GL + RN +P Q R L+ + R A R+ V DQEGGRVQ G P +SF Sbjct: 32 GLFVGRRNQREPEQTRRLIDRFREAVGRDDAPVFTDQEGGRVQHLDAG--PWPLFRSFGQ 89 Query: 87 LSGM-----EEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHADP 139 + + + G K + + M + M + + +PVLD+ SA IG RS+ P Sbjct: 90 FAELARRDFDLGKKALRLSSQAMGAMMTELGLSSGCSPVLDLVFETTSAVIGARSFGPGP 149 Query: 140 QKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSS 199 A+ +DG+ E G KH PGHG T DSHKE P + + A D F + Sbjct: 150 DFVAALGREVVDGLLETGNMPVIKHIPGHGRATLDSHKERPVVDASRETLTATDFKPFVA 209 Query: 200 LIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMG 259 L M AHV+YS D + + V+R ++G++GV+ SD + M+ ++ G Sbjct: 210 L---KDTPWAMVAHVVYSAYDAELPASVSRVMHDVIRNDMGYEGVLISDCIFMQ--SLSG 264 Query: 260 SYAERGQASLDAGCDMIL 277 + ER + LDAG D+ L Sbjct: 265 TLPERVKQVLDAGYDIAL 282 >UniRef50_D2PT32 Glycoside hydrolase family 3 domain protein n=26 Tax=Actinomycetales RepID=D2PT32_9ACTO Length = 599 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 93/309 (30%), Positives = 155/309 (50%), Gaps = 22/309 (7%) Query: 7 DVEGYELDAEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAAS-------RNRL 57 +V Y + A E++A +GG+I F T + +P Q+ L ++ AS L Sbjct: 73 NVTAYGV-ATPAEVVAKYKLGGVIYFAWTDSVANPPQIAALSNGLQQASLAVGDKVSVPL 131 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 +++ DQE G V R T+ P A + G A+EA + +E+ A+ ++ ++ Sbjct: 132 LISTDQEHGVVFRVGPPATQFPGAMAL----GAGRSTADAREAAAIAGAELRAVGVNQAY 187 Query: 118 APVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAG-MKTTGKHFPGHGAVTAD 174 APV DV ++ IG RS+ +DP + + + G+ G + + KHFPGHG D Sbjct: 188 APVADVNVNALNPVIGVRSFSSDPTLVADLTAAQVTGLEGGGGIASCAKHFPGHGDTVDD 247 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLK 232 SH P + + D F + I E +D+IM AH++ +DP PA+ S L Sbjct: 248 SHTSLPTINHTREQWNTIDAPPFEAAI-EAGIDSIMTAHIVVPSLDPSGEPATLSKPILT 306 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR++LG+DGV+ +D L+M G AE +L+AGCD +L+ A + + L Sbjct: 307 GILREQLGYDGVVITDSLAMAGVRQKYGDAEVAVRALEAGCDQLLMSPAMDVAYNAV--L 364 Query: 293 SPIKAERVT 301 + +++ R+T Sbjct: 365 AAVRSGRIT 373 >UniRef50_Q30Y91 Beta-N-acetylhexosaminidase n=3 Tax=Desulfovibrionales RepID=Q30Y91_DESDG Length = 398 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 106/326 (32%), Positives = 159/326 (48%), Gaps = 40/326 (12%) Query: 14 DAEEREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 D+ R++ H +GG+ILF R N QL L ++ A++ L VAVDQE Sbjct: 63 DSVVRDV-THYNLGGVILFDRDVALGSDRRNIRSVRQLAALTAALQKAAQTPLFVAVDQE 121 Query: 65 GGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 GGRV R R GF LP+AQ A + AG ++ A+ ++++FAPV+D Sbjct: 122 GGRVARLSPRHGFAALPSAQETGAAATTAASADAGSRAGAMLH----AVGVNLNFAPVVD 177 Query: 123 VG--HISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 V + IG RS+ AD + A F+ GM G+ KHFPGHG+ ADSH Sbjct: 178 VNVRPDNPVIGRAGRSFSADSEAVARHAQAFLQGMASHGVLGCVKHFPGHGSSAADSHLG 237 Query: 179 TPCDPRPQAEIRA----KDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKT 233 A+I A ++ + L+ + IM H+ S +DP PA+ S + Sbjct: 238 V-------ADISASWNENELVPYRRLLDGGHVQMIMTGHLFNSRLDPDHPATLSRAVITG 290 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYA-ERGQA-SLDAGCDMILVCNNRKGAVSVLDN 291 +LR LGF GV+ SDD+ M+ AI Y E+ A +L+AG D++L NN ++ + Sbjct: 291 LLRGRLGFGGVVVSDDMQMK--AITDRYGLEQAVALALNAGVDILLFGNN----LTYDAD 344 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDS 317 + P + + L +G R + +S Sbjct: 345 IVPEVVDMIESLVERGVVPRSRIEES 370 >UniRef50_B2KAV5 Glycoside hydrolase family 3 domain protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KAV5_ELUMP Length = 541 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 88/264 (33%), Positives = 136/264 (51%), Gaps = 17/264 (6%) Query: 25 LVGGLILFTRNYHDPAQLRELVRQI-----RAASRNRLVVAVDQEGGRVQR-FREGFTRL 78 LVGG+++ NY Q EL ++ ++ + L++++D EGG V GF L Sbjct: 55 LVGGVLVQWGNY-SLEQTTELAAKLQSWAAKSPHKIPLLISIDYEGGTVYTPVTLGFEYL 113 Query: 79 PAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAA---IGERSY 135 P AA + E ++ AG E+ I I+F+PV+DV +I+ IG RS+ Sbjct: 114 PPNMMIAAANDEEAAARIFYLAGL----ELRKAGIHINFSPVVDV-NINPGNPIIGVRSF 168 Query: 136 HADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMS 195 + P+ + + + G+ A + + KHFPGHG DSH P + E++ ++ Sbjct: 169 GSSPELVGRMGAAVVSGLSAANVMSVAKHFPGHGNTVLDSHYSLPVLNITKKEMQDVHLA 228 Query: 196 VFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEG 254 F I E + IM AH+IY + DP+ PA+ S L +LR E+ F GVI SD L M+G Sbjct: 229 PFKKAI-EAGVPGIMTAHIIYKNYDPKNPATYSKRILNDLLRTEMKFKGVIISDALDMKG 287 Query: 255 AAIMGSYAERGQASLDAGCDMILV 278 A + G+ A +L+AG DM L+ Sbjct: 288 ATLDGNIALSAAKTLEAGSDMALL 311 >UniRef50_B9D1W5 Glycosyl hyrolase, family 3 n=2 Tax=Campylobacter RepID=B9D1W5_WOLRE Length = 363 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 84/265 (31%), Positives = 137/265 (51%), Gaps = 18/265 (6%) Query: 27 GGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSF 84 GG++L +N D +L L + A + + +A+D+EGG++ RF++ GF +AQ Sbjct: 69 GGVMLLGKNISDKKKLTALTSAFKEAQKG-IFIAIDEEGGQITRFKDKSGFETFISAQKA 127 Query: 85 AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH-ISAAIGER--SYHADPQK 141 A + G+L + MA ++ + I+++FAPV DV + S IG R ++ AD + Sbjct: 128 AKTLDLAAAGELYAK----MAQQLKDVGINVNFAPVADVLNPKSTIIGSRGRAFSADIDE 183 Query: 142 ALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK-ETPCDPRPQAEIRAKDMSVFSSL 200 AS F+ G+ KHFPGHG V ADSH + + E++ + Sbjct: 184 VSLYASEFMKASQARGVIAAMKHFPGHGNVEADSHTAKVVIENFDYTELKP-----YFDA 238 Query: 201 IRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMG 259 IR+N+ IM H+ + D PAS SP + ++LR EL FDGV+ SDD+ M G Sbjct: 239 IRKNEAKMIMVGHIYLAQRDALLPASLSPALIDSLLRGELKFDGVVISDDMLMGGLKDF- 297 Query: 260 SYAERGQASLDAGCDMILVCNNRKG 284 + ++ ++AG D++L + + G Sbjct: 298 TLQQKVINFINAGGDVMLFSDYKIG 322 >UniRef50_C1MUP1 Glycoside hydrolase family 3 protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MUP1_9CHLO Length = 387 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 97/308 (31%), Positives = 159/308 (51%), Gaps = 25/308 (8%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQI-RAASRNRLVVAVD 62 + +G E++ R ++ G ILF RN DPAQL + ++ R AS +L++ +D Sbjct: 18 ICFGFDGVEVNDHARRMIGKG-CGACILFARNVRDPAQLSSMCGELKRLASPRKLLIMID 76 Query: 63 QEGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 QEGGRV R + FT LP+A+ A+ + A+ ++ASE+ A+++D++ APV+ Sbjct: 77 QEGGRVARLKPPHFTALPSAR---AVGSAPDAANAARVVAEVVASELRAVNVDMTLAPVV 133 Query: 122 DV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE- 178 DV ++ IG+R++ AD A+ + FI + G+ KHFPGHG DSH + Sbjct: 134 DVHTNPLNTVIGDRAFAADADAVAALGAAFIRALQAKGVAACAKHFPGHGDTVEDSHDDG 193 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD----------PRPASGSP 228 P + ++ F + I + + +++ AHV ++ P PAS S Sbjct: 194 LPTLTHDMDRLETVELKPFRAAI-DAAVSSVLVAHVSVPAMEGIDPGKRFPAPVPASTSR 252 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDA---GCDMILVCNNRKGA 285 ++ LR+ L F GV+ +DDL M GA GS + G A ++A G DM L C++ + Sbjct: 253 ACVR-YLRRTLNFSGVVVTDDLEM-GALSRGSGIDLGAAVVEAARSGVDMFLACHDERIQ 310 Query: 286 VSVLDNLS 293 L+ L+ Sbjct: 311 TEALEALA 318 >UniRef50_A4CPY8 Glycosyl hydrolase, family 3 n=1 Tax=Robiginitalea biformata HTCC2501 RepID=A4CPY8_9FLAO Length = 534 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 91/279 (32%), Positives = 132/279 (47%), Gaps = 30/279 (10%) Query: 26 VGGLILF------TRNY-------HDP---AQLRELVRQIRAASRNRLVVAVDQEGGRVQ 69 +GGL F N+ H+P L+EL+R+ + A+R L++A+D E G Sbjct: 49 IGGLCFFHSPASAATNFEGKKEIPHNPDSLGTLKELIRRYQRAARYPLLIAMDAEWGLAM 108 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 R G P A ALS + L + G MA++ I + APV DV + Sbjct: 109 RVENG-EPYPYALCLGALSDRD---PLLYQVGLRMAADCRDAGIHWNLAPVADVNCNPGN 164 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 IG R++ P+ AS F G+ + G+ GKHFPGHG DSH P + A Sbjct: 165 PVIGYRAFGTTPEGVARRASLFYRGLKDGGILGCGKHFPGHGDTAVDSHLALPAIDKGAA 224 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVI 245 E+ + + F L+ E +DAIM H+ +DP PAS SP + +RQ LGFDG + Sbjct: 225 ELESTEFVPFRRLVAEG-VDAIMTGHLAVPALDPEGLPASLSPAVIGQGIRQNLGFDGAV 283 Query: 246 FSDDLSMEGAAIMGSYAERGQA---SLDAGCDMILVCNN 281 SD L+M A+ +Y G+ + AG DM+ + Sbjct: 284 ISDALNMH--AVSKNYPGPGEVAWRAFAAGNDMLCFAED 320 >UniRef50_UPI0001B588EB hydrolase n=1 Tax=Streptomyces sp. SPB78 RepID=UPI0001B588EB Length = 518 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 89/260 (34%), Positives = 131/260 (50%), Gaps = 16/260 (6%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF--REGFTRLPAAQS 83 + ++LF+RN P Q+ L ++RA LV+A+D+E G V R R G TR P + Sbjct: 36 LASVVLFSRNIETPEQVAALTTRLRA-ENPELVIAIDEESGDVTRIESRTGSTR-PGNHA 93 Query: 84 FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQK 141 L +AQ+ G + + +++D +AP DV + IG RS+ ADP Sbjct: 94 LGYLDDTATTHAVAQDLGHTLRAAGVSLD----YAPSADVNSNPDNPIIGVRSFGADPDS 149 Query: 142 ALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLI 201 S +I G+ AG+ KHFPGHG V DSH P +I + + F + I Sbjct: 150 VARHTSAWIRGLQSAGVAACAKHFPGHGDVAVDSHHGLPVYEASDRDIARQALPPFRAAI 209 Query: 202 RENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGS 260 AIM AH++ +DP RPA+ S L T+LR+ELG++G+I +D + M A+ Sbjct: 210 AAGAR-AIMSAHLLVPALDPHRPATLSRRILTTLLREELGYEGLIVTDAIEM--GAVADE 266 Query: 261 YAERGQA--SLDAGCDMILV 278 Y G A ++ AG D I V Sbjct: 267 YGIEGAAVRAITAGADAICV 286 >UniRef50_UPI00019730D7 beta-glucosidase-like glycosyl hydrolase n=1 Tax=Clostridium sp. M62/1 RepID=UPI00019730D7 Length = 505 Score = 115 bits (287), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 83/272 (30%), Positives = 137/272 (50%), Gaps = 17/272 (6%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVVAVDQEGGRVQRF--REGFTRLP 79 VGG++ F++N D AQ ++++ + ++ R +++AVD+EGG+V R REGF Sbjct: 198 VGGIVYFSQNLRDEAQTKDMLTRTNQYAKERTGLPMLLAVDEEGGQVARISGREGF---- 253 Query: 80 AAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIGERSY 135 A +F +S G + + A E G + + + + ++ FAPV DV + + RS+ Sbjct: 254 ALDAFPDMSDIGAQGDSEKAYEVGDAIGAYLAELGFNMDFAPVADVLTNPDNTVVKRRSF 313 Query: 136 HADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMS 195 +D + + R +G+ + G+ KHFPGHG DSH+ R E+ ++ Sbjct: 314 GSDAKTVANMVVRETEGLKKHGISPVLKHFPGHGGTAEDSHEGAAVIHRTVDELLENELI 373 Query: 196 VFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGVIFSDDLSME 253 F + I E D +M H+ + D RPA S + VLR L FDG+I +D ++M Sbjct: 374 PFYTGI-EAGADFVMVGHISDPEAVGDDRPAVFSKTLVTEVLRDTLKFDGIIITDAMNMG 432 Query: 254 GAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S + +L+AG DMIL+ + K A Sbjct: 433 AITESCSSGDAAVKALEAGVDMILMPEDFKAA 464 >UniRef50_B5I9C4 Sugar hydrolase n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5I9C4_9ACTO Length = 500 Score = 115 bits (287), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 85/246 (34%), Positives = 125/246 (50%), Gaps = 23/246 (9%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF-REGFTRLPAAQSF 84 +GG+ILFTRN D Q+R L +RA R L+VA+D EGG + G +P + + Sbjct: 47 LGGVILFTRNVRDAEQVRRLTDTLRAL-RPDLLVAIDNEGGGIGHLVAAGAPEVPGSYAL 105 Query: 85 AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERSYHADPQKA 142 + ++ + A +A + + + S+APV DV + + R++ D Sbjct: 106 GVVDDLDLTARCADA----LAGHLATLGVTASYAPVADVQRQAGNPIVRTRAFGED---- 157 Query: 143 LAIASR----FIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFS 198 A+ SR +I G+ + KHFPGHG DSH E DPRP E+ D++ F Sbjct: 158 TALVSRHLGAWITATEARGIASCAKHFPGHGGTLTDSHHELAVDPRPYDEL---DLAPFR 214 Query: 199 SLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAI 257 + I + +M AHV++ +D RPA+ S L +LRQEL FDGV+ SD L M+ AI Sbjct: 215 AAIAAG-VPMLMSAHVVFPALDANRPATLSRRILGDLLRQELEFDGVLVSDALEMK--AI 271 Query: 258 MGSYAE 263 Y E Sbjct: 272 SDLYGE 277 >UniRef50_C2BJ14 Beta-N-acetylglucosaminidase n=2 Tax=Corynebacterium RepID=C2BJ14_9CORY Length = 406 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 92/261 (35%), Positives = 131/261 (50%), Gaps = 19/261 (7%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRN---RLVVAVDQEGGRVQRFREGFTRLPAAQ 82 VGG+ F ++ DP L+E R I A + V++D EGGRVQRF E PA Q Sbjct: 107 VGGI--FIPSWADPNLLQEEGRDINALRQEVGRDFEVSIDFEGGRVQRFSEILGEYPAPQ 164 Query: 83 SFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV-GHISAAIGERSYHADPQK 141 AA +E +LA E G + + I++ FAPVLDV G+ +G+RS+ DP + Sbjct: 165 QMAAERSPQEVEQLAHEIG----TSLKVHGINVDFAPVLDVDGNGLEVVGDRSFSTDPVQ 220 Query: 142 ALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLI 201 A + F G+ AG+K KHFPGHG + D+H P P E+ + F + + Sbjct: 221 AGEYGAAFARGLDSAGVKAVFKHFPGHGRASGDTHLAEAVTP-PLEELEGHEFIPFKTAL 279 Query: 202 RENKLDAIMPAHVIYSDV--DPRPASGSP--YWLKTVLRQELGFDGVIFSDDL-SMEGAA 256 + A+M H+ + PAS P Y L R+ L +DG I++DD+ M+ A Sbjct: 280 PQAPNAALMMGHLAVPGLGDGQTPASILPEAYGLA---RETLQYDGPIYTDDIGGMKAIA 336 Query: 257 IMGSYAERGQASLDAGCDMIL 277 A+ ASL+AG DM L Sbjct: 337 DSLPLADAVVASLNAGADMPL 357 >UniRef50_A7EUL4 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EUL4_SCLS1 Length = 553 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 95/344 (27%), Positives = 157/344 (45%), Gaps = 33/344 (9%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-----L 57 P ++ G+ + + + ++ VG +ILF RN D AQL +L ++ + + L Sbjct: 33 PFIVGFHGHVTNEDIKTLITVHRVGTIILFQRNIADAAQLFQLAASLQEIAEDSGHAQSL 92 Query: 58 VVAVDQEGGRVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 +A DQE G V R T LP + + A + +A A + A I+++ Sbjct: 93 FIATDQENGLVTRVNPPIATELPGSMALGAANDPSNAFSVA----VCTAKILKAYGINMN 148 Query: 117 FAPVLDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 +APV DV + IG RS+ DP S ++G+ + G+ + KHFPGHG D Sbjct: 149 YAPVADVNSEPKNFVIGVRSFSDDPGTVGRFVSSQVEGLQQNGILSCVKHFPGHGDTAVD 208 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--------PASG 226 SH P + + E+ A ++ F ++ +DAIM AH+ ++ PAS Sbjct: 209 SHHGLPVITKSKEELEACELVPFWRAVKAG-VDAIMTAHISLPQLNSNSNVATNQLPASL 267 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 SP ++ +LRQE+ +DG+I SD L M+G + +Y A + + C+ K V Sbjct: 268 SPDAIR-ILRQEMNYDGLIVSDCLEMDG--VRATYGTEKGAVMALKELTVSWCHTMKAQV 324 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 + E V G S QE+++S+ + I + L Sbjct: 325 GAI--------EAVIAAVKSGEIS-QEMIESSVNRVIRLKTKYL 359 >UniRef50_Q6MGM9 Beta-hexosamidase A n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MGM9_BDEBA Length = 558 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 81/261 (31%), Positives = 135/261 (51%), Gaps = 13/261 (4%) Query: 27 GGLILFTRNYHDPAQLRELVRQIRAAS--RNRL--VVAVDQEGGRVQRFREGFTRLPAAQ 82 G +LF RN Q+REL + +S ++L ++A+DQEGG V R PA Sbjct: 79 GSFLLFKRNMISAPQIRELNLALYKSSYKHSKLPPLIAIDQEGGSVSRL----PITPAPP 134 Query: 83 SFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIGERSYHADPQ 140 + AL G + LA+E G+ + + +++ APVLDV + ++ IG RS+ +DP+ Sbjct: 135 NALAL-GQTQSPLLAEEMGYQTGLFLREVGFNMNLAPVLDVVDPYSTSFIGVRSFGSDPE 193 Query: 141 KALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSL 200 I + G+ +A + T KHFPG G++ D H + E++ +D+ F+ Sbjct: 194 LVRDIGVAYSKGLLKARVIPTAKHFPGTGSLNQDPHLTIVKNSASLEEMKKRDLIPFTGY 253 Query: 201 IRENKLDAIMPAHVIYS--DVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIM 258 + A+M +H+IY D D PAS S KT+LR EL + G++ +DDL M+G+ + Sbjct: 254 SKIGANIAVMMSHLIYPGLDEDLEPASFSTKISKTLLRDELKYQGLVMTDDLQMQGSKQV 313 Query: 259 GSYAERGQASLDAGCDMILVC 279 +L +G D++++ Sbjct: 314 LRPEAAALRALQSGSDIVMLT 334 >UniRef50_B1YER8 Beta-N-acetylhexosaminidase n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YER8_EXIS2 Length = 412 Score = 114 bits (285), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 76/219 (34%), Positives = 109/219 (49%), Gaps = 8/219 (3%) Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 D+EGGRV R + +L + SF G + L +E G + S ++ FAPVL Sbjct: 130 DEEGGRVSRVPDAQLKLSPSLSF----GQKNDPALMKEVGQTLGSISRFYGFNMDFAPVL 185 Query: 122 DVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV A IG+R+ ADPQ+ + + GM + + KHFPGHG T DSH Sbjct: 186 DVNSNPANPIIGDRALSADPQQVARLGVPLMQGMKKEDIVPVVKHFPGHGDTTVDSHVGL 245 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQE 238 P +AE+ ++ F I E + +M AH+++ +D RP+S S + VLR E Sbjct: 246 PRVDATKAELEQTELVPFKRAIEEGT-EMVMVAHILFPALDKNRPSSLSKPVMTGVLRDE 304 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 L FDGVI +DDL M A+ +L+ G DM + Sbjct: 305 LKFDGVIITDDLVMGAITEQYGLAQAASLALENGADMAM 343 >UniRef50_D2PU34 Glycoside hydrolase family 3 domain protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PU34_9ACTO Length = 606 Score = 114 bits (285), Expect = 6e-24, Method: Compositional matrix adjust. Identities = 89/278 (32%), Positives = 132/278 (47%), Gaps = 21/278 (7%) Query: 15 AEEREILAHPLVGGLILFTR--NYHDPAQLRELVRQIRAAS-----RNRLVVAVDQEGGR 67 A E++A GG I F N +P QL Q++ + R + +A DQE G Sbjct: 89 ATPAEVVAKYKPGGWIYFNARDNVQNPTQLATYSNQLQTVATSTGIRVPMTIATDQEQGV 148 Query: 68 VQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--H 125 V R T+ + A + A+ A + E+ AM I +APV DV Sbjct: 149 VVRIGPPATQFGGNMAHGAARSTAD----ARTAAGITGRELKAMGIRQDYAPVADVNVNA 204 Query: 126 ISAAIGERSYHADPQKALAIASRFIDGMH-EAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 ++ IG RS+ +DPQ + + G +AG+ T KHFPGHG DSH P Sbjct: 205 LNPVIGVRSFSSDPQLVSDLTVAQVQGYQKDAGIIATAKHFPGHGDTVDDSHTSLPTINH 264 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFD 242 + + D F + I+ +D+IM AH++ +DP PA+ S + VLR+ELGF Sbjct: 265 TREQWNTIDAPPFKAAIKAG-IDSIMTAHIVIPSLDPSGDPATLSKPIMTGVLRKELGFK 323 Query: 243 GVIFSDDLSMEGAAIMGSY--AERGQASLDAGCDMILV 278 G+I +D L M AA+ Y AE +++AG D +L+ Sbjct: 324 GLIITDALEM--AAVRAKYGDAEVAVRAIEAGVDQLLL 359 >UniRef50_Q84IZ5 Beta-N-acetylglucosaminidase n=1 Tax=Clostridium paraputrificum RepID=Q84IZ5_9CLOT Length = 413 Score = 114 bits (284), Expect = 6e-24, Method: Compositional matrix adjust. Identities = 80/282 (28%), Positives = 129/282 (45%), Gaps = 15/282 (5%) Query: 8 VEGYELDAEEREILAHPLVGGLILFT-RNYHDPAQLREL--VRQIRAASRNR----LVVA 60 V G +D + + GG+IL+ RN+ + + V I+ A+R L + Sbjct: 58 VNGTNVDDKVVNLFEDQHAGGIILYGHRNFWGSSLDNNVKYVNSIKKANRQNSDIPLFIG 117 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D+EGG + + + R P+ G LA G A ++ + I+ F V Sbjct: 118 FDEEGGSMSQLPQELMRTPSKGEL----GNTNDSSLATGIGAGTAKKLKLLGINTDFGTV 173 Query: 121 LDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ + IG RSY + +K + + G+ T KHFPGHG DSH Sbjct: 174 LDINTNKNNPIIGVRSYGSTKEKVTEFGINELKAIQNEGVIPTVKHFPGHGDTEVDSHLG 233 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 P +++ ++ F + I N +D +M AH++ +D PA+ S L +LR Sbjct: 234 LPSLNHDLNRLKSTELVPFQTAIN-NGVDMVMTAHIMLPQIDKEYPATMSKKILTDLLRD 292 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC 279 E+G+ GVI +DDL M+ + E S++AG D++LVC Sbjct: 293 EMGYKGVIITDDLEMQAISKNWDLGEAAIKSVEAGADILLVC 334 >UniRef50_B1WW93 Beta-glucosidase n=12 Tax=Cyanobacteria RepID=B1WW93_CYAA5 Length = 539 Score = 113 bits (283), Expect = 8e-24, Method: Compositional matrix adjust. Identities = 84/289 (29%), Positives = 143/289 (49%), Gaps = 10/289 (3%) Query: 53 SRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA--ALSGMEEGGKLAQEAGWLMASEMIA 110 S+ L +A D E G QRF G T P + A S + K A + G + A E + Sbjct: 79 SKTPLFIAADIEEGVGQRF-SGATWFPPPMALGEIAKSDLPLARKYAYQMGEITAKESVK 137 Query: 111 MDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGH 168 + I+ +P++DV + + I RS+ +P+ +A FI+G TT KHFPGH Sbjct: 138 IGINWILSPIVDVNNNPDNPVINIRSFGDNPKVVSELAKAFIEGAKNHPALTTAKHFPGH 197 Query: 169 GAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGS 227 G + DSH P + ++ F + I + K+D++M AH++ + D PA+ S Sbjct: 198 GDTSNDSHLNLPVISHDNERLEGLELVPFQAAI-DAKVDSVMTAHLLINAWDKDNPATLS 256 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 L L+QELGF+G+I +D L M G A E +++AG D++L+ ++ + A++ Sbjct: 257 KTILTNKLKQELGFNGLIVTDALIMGGVAKFADSEEIAIKAVEAGADILLMPDDPEMAIN 316 Query: 288 VLDN---LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + N + ER+ K ++Q+L + + + +L+Q +R Sbjct: 317 AVYNAVETGRLTTERIDESLQKIWQAKQKLFNHQKDNSSVYQLSQKEDR 365 >UniRef50_Q031K2 Beta-glucosidase-related glycosidase n=5 Tax=Lactococcus lactis RepID=Q031K2_LACLS Length = 403 Score = 113 bits (283), Expect = 9e-24, Method: Compositional matrix adjust. Identities = 87/274 (31%), Positives = 138/274 (50%), Gaps = 39/274 (14%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIR--AASRNRLVVAVDQEGGRVQRFR-EGFTRLPAAQ 82 VG +IL R+ +++ + +++ A + RL+++ DQEGG VQ + GF+ +P Sbjct: 101 VGSIILTGRSNLSIPEMKVITDKLQGLAPANRRLLISCDQEGGNVQVLQGTGFSEMP--- 157 Query: 83 SFAALSGMEEGG----KLAQEA-GWLMASEMIAMDIDISFAPVLD------VGHISAAIG 131 +G+ +G KL +EA W SE+ I+ APV D +A IG Sbjct: 158 -----NGLIQGSWAPEKLQEEAKNW--GSELSTAGINFDLAPVADQVLSADFAPQNAPIG 210 Query: 132 --ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH-----KETPCDPR 184 R Y D ++ A F GM AG+ TT KHFPG GAVT ++ +++ +P Sbjct: 211 YWSRQYAYDKANIVSHADAFSKGMKSAGVLTTAKHFPGLGAVTGNTDISAGVRDSQTNP- 269 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDG 243 +++ + +F LI + +IM A IY +DP P + SP + +LR +LGFDG Sbjct: 270 -----KSESVQIFKELISSG-VPSIMTATAIYDQIDPELPGAFSPKMVDGLLRNQLGFDG 323 Query: 244 VIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 ++ +DDLS S +R +L AG D++L Sbjct: 324 LVITDDLSNAVQVQAWSPGQRAVLALAAGNDLVL 357 >UniRef50_B9R4T0 Glycosyl hydrolase family 3 N terminal domain protein n=1 Tax=Labrenzia alexandrii DFL-11 RepID=B9R4T0_9RHOB Length = 412 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 87/267 (32%), Positives = 134/267 (50%), Gaps = 15/267 (5%) Query: 21 LAHPLVGGLILFTRNYHDPAQLRELVRQIR-AASRNRLVVAVDQEGGRVQRFREGFTRLP 79 LA +GG++L RN P QL L + S + +++AVDQEGG+VQR + Sbjct: 63 LASGRIGGVMLLGRNIQTPDQLTTLTHHLADQNSGSSVLIAVDQEGGQVQRLAPLGDPMT 122 Query: 80 --AAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV----GHISAAIGER 133 AA+ A +G K+ A+++ A +I+++F PV+DV G+ +R Sbjct: 123 WSAAKDLPAATGNCAPRKVRAYYAR-RAAQLGAYNINVNFGPVVDVDINPGNPIIGKKKR 181 Query: 134 SYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKD 193 S+ ADP A+ FI AG+ T KHFPGHG+ T DSH P + R ++ Sbjct: 182 SFSADPTTVTNCAAGFIAAHQAAGIATALKHFPGHGSSTTDSHTSLPVISQ---SWRERE 238 Query: 194 MSVFSSLIRENKLDAIMPAHVI---YSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDL 250 + + L R++++D +M H++ +SD PAS S + R G D VI +DDL Sbjct: 239 LEPYRVLKRQDRVDMVMMGHLVHPRFSDTPETPASLSKKGIAEA-RALAGKDVVIITDDL 297 Query: 251 SMEGAAIMGSYAERGQASLDAGCDMIL 277 M+ A S E ++ AG D++L Sbjct: 298 EMDAVAGRYSIEEAAVQAIAAGNDVVL 324 >UniRef50_C0EVT4 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EVT4_9FIRM Length = 409 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 80/264 (30%), Positives = 135/264 (51%), Gaps = 15/264 (5%) Query: 27 GGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAA 86 GG+I F+ N D Q++ ++ I+ ++ L ++VD+EGG V R ++ F A Sbjct: 100 GGVIFFSYNLKDREQVKSMIADIQKSNDIPLFISVDEEGGSVARIAN--SKNMQTTKFPA 157 Query: 87 LSGMEEGG--KLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKA 142 +S + + G K A E G + E+ + ++ FAPV DV + IG RS+ +D + Sbjct: 158 MSEIGKSGDSKKACEVGETIGKEIRELGFNLDFAPVADVNTNAENTEIGNRSFSSDAKTV 217 Query: 143 LAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIR 202 + S+ + G+ G+ +T KHFPG G D+HK +R + F S I Sbjct: 218 AGMVSQEVKGLQSQGVSSTLKHFPGQGQCGEDTHKGYVDLNATIDRLREVEFLPFESGIA 277 Query: 203 ENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGS 260 D +M +HV S V + PAS + + +LR+EL F+ VI +D ++M+ ++ Sbjct: 278 AGA-DMVMMSHVAVSQVTGKETPASLTKLMVTDILREELQFNNVIITDAMNMK---VITK 333 Query: 261 YAERGQA---SLDAGCDMILVCNN 281 + + QA ++ AG DMIL+ +N Sbjct: 334 FYDADQAAVMAVQAGNDMILMPDN 357 >UniRef50_A0Q7X6 Glycosyl hydrolase family 3 n=12 Tax=Francisella RepID=A0Q7X6_FRATN Length = 378 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 87/303 (28%), Positives = 152/303 (50%), Gaps = 34/303 (11%) Query: 16 EEREILAH---PLVGGLILFTR----NYHDPAQLRELVRQIRAASRNR----------LV 58 E+ EI+ H +GG+ILF R N P QL+ L ++ ++ L+ Sbjct: 44 EKSEIVQHIKDNNIGGVILFDRKDKDNIKSPEQLKTLNSDLQKYTKIYNDLYMRPQVPLI 103 Query: 59 VAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 +A+D EGG+V R + GF A+ F+ L ++E +A+E MA M + I+++ Sbjct: 104 IAIDYEGGKVNRLKTKYGFPETFTAKKFSELD-IQEQAHVARE----MAKTMYKVGINLN 158 Query: 117 FAPVLDVGHISA-AIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 AP +D+ + + IG ERS+ D + A+ F + + + KHFPGHG+ T Sbjct: 159 LAPCVDIHNDNCPVIGKYERSFSTDVNTIVDCANLFCKATEKHKIGSVLKHFPGHGSSTN 218 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLK 232 DSH E K++ + L+ ENK +M AHV+ ++D PA+ S +K Sbjct: 219 DSHLGAADITNTWTE---KELIPYIRLLPENKHCMVMTAHVMNRNIDETNPATLSNKIIK 275 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR ++ FDGVI SD + M+ E + +++AG ++ + + + +++DN+ Sbjct: 276 -ILRTDIKFDGVIISDAMEMKAVTEYYGELESLKLAINAGVNIFIFSD--ANSDTIIDNI 332 Query: 293 SPI 295 + + Sbjct: 333 AKL 335 >UniRef50_D2L090 Glycoside hydrolase family 3 domain protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L090_9DELT Length = 569 Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 93/268 (34%), Positives = 129/268 (48%), Gaps = 18/268 (6%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTR--NYHDPAQLRELVRQIRAASRNR-----L 57 M+ EG ++ E ++ +GG+IL+ N PAQ+ L ++A + + L Sbjct: 49 MVWFEGPDVSPETAALIRDRRLGGVILYANAGNIETPAQVARLTAALQATAADTPQGLGL 108 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 +V VDQEGG V R R GFT P+ + AA + L ++A E++A+ I+ F Sbjct: 109 LVGVDQEGGPVARLRRGFTVFPSQMAQAAAGRAD----LVRQAAAATGRELLAVGINTDF 164 Query: 118 APVLDVGHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APV DV A IG RS+ A+P + G G T KHFPGHG DS Sbjct: 165 APVADVNVNPANPVIGIRSFGANPADVARFTAAATAGYASVGAICTPKHFPGHGDTAVDS 224 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKT 233 H P A + D F + A+M AHV+ D++P PA+ S L Sbjct: 225 HVGLPRVDHDAATLSRVDFPPFRAAFAAGAP-AVMTAHVLTPDLEPDNLPATLSRRMLSG 283 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSY 261 VLR LGFDGVIF+D L M A+ +Y Sbjct: 284 VLRGRLGFDGVIFTDSLGM--GAVAATY 309 >UniRef50_B8M4K9 Beta-N-acetylglucosaminidase, putative n=9 Tax=Trichocomaceae RepID=B8M4K9_TALSN Length = 2237 Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 90/338 (26%), Positives = 149/338 (44%), Gaps = 32/338 (9%) Query: 10 GYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR-----AASRNRLVVAVDQE 64 G+ E + ++ +G +ILF RN D QL+ L ++ A + L + +DQE Sbjct: 1385 GFTPSPEIKTLIRDYGLGAVILFRRNIKDAVQLKTLTLALQQEAKEAGHEHPLFIGIDQE 1444 Query: 65 GGRVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 G V R + P + A E ++ + G L++ I++++AP D+ Sbjct: 1445 NGLVTRIMPPLAVQQPGQMTIGATQSAENAYEVGKCTGELLS----FFGINMNYAPDCDI 1500 Query: 124 GH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 ++ IG RS DP + G+ E+G+ T KHFPGHG DSH P Sbjct: 1501 NSEPLNPVIGVRSAGDDPNFVARFSCATAKGLRESGVVPTAKHFPGHGDTAVDSHHGLPV 1560 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPAS-GSPYWLKT----VLR 236 + ++E+ ++ F + E ++ +M AH+ V P S G P L +LR Sbjct: 1561 ISKSRSELERCELIPFRRAVAEG-IETVMTAHIALPKVTTGPDSEGLPATLSPEALGILR 1619 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 ++ +DGVI +D L M+G E SL AG D I++C+ V + Sbjct: 1620 NDMKYDGVIVTDCLEMDGIRATYGTVEGSLMSLIAGSDSIMICHTYAVQVKSI------- 1672 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 ERV + G+ S + +S R+ +L +R+ Sbjct: 1673 -ERVVQAVQLGALSESRIDESLH------RIGKLKQRF 1703 >UniRef50_D2RL81 Glycoside hydrolase family 3 domain protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RL81_ACIFE Length = 369 Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 95/329 (28%), Positives = 159/329 (48%), Gaps = 25/329 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-LVVAVD 62 +++ +EG E A ++ L GG+ILF RN DP Q++ L +++ +R L++ VD Sbjct: 57 MLVGMEGTEPSAAFQKELQENHYGGVILFDRNMKDPNQVKALNSRLQDLARPEPLLIGVD 116 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWL--MASEMIAMDIDISFAPV 120 +EGG+V R ++ +LPA S A+ G AQ W + + A+ +++ APV Sbjct: 117 EEGGQVARMKD---QLPAPPSQEAIG---RTGDPAQAEAWAAKTGTRLKALGFNLNLAPV 170 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 DVG ERSY + P A + G + + KHFPG G D+HK Sbjct: 171 ADVGSGM----ERSYSSHPATAAKFVEGALAGYQKVHLLACLKHFPGLGKGKEDTHKGQV 226 Query: 181 CDPRPQAEIRAKDMSVFSSLIRE--NKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQ 237 + + +D F + + +M HV Y +DP PAS SP + +LR Sbjct: 227 IVSADRETLLQEDGLPFRLALEKYPEPWFGVMVTHVRYPALDPDYPASLSPKIMGDLLRG 286 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 + F G++ +DDL+M A E ++ AG D+++VC+ + V L+ KA Sbjct: 287 QFHFSGLVLTDDLTMGAIASQYPLEEAAVQAVRAGADLVMVCHGPETIRRVHGALA--KA 344 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTR 326 R +G+ R+++ D+A ++ + T+ Sbjct: 345 IR------EGTLPREQV-DAALYRILRTK 366 >UniRef50_C4XU31 Putative beta-N-acetylhexosaminidase n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XU31_DESMR Length = 568 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 107/321 (33%), Positives = 149/321 (46%), Gaps = 27/321 (8%) Query: 26 VGGLILF--TRNYHDPAQLREL---VRQIRAASRNRL--VVAVDQEGGRVQRFREGFTRL 78 +GG+IL+ N P QL L ++ + AA+R+ L ++ VDQEG V R R+GFT Sbjct: 69 LGGVILYGVPGNIESPGQLAALNAGLQAVSAATRHGLGLLIGVDQEGAPVARLRKGFTLF 128 Query: 79 PAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERSYH 136 P+ + AA +G + +LA A A E+ A+ I+++FAPV DV A IG RS+ Sbjct: 129 PSQMAQAA-TGRADLARLAASA---TARELRAVGINVNFAPVADVNVNPANPVIGIRSFG 184 Query: 137 ADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSV 196 + P G AGM T KHFPGHG T DSH P A + D Sbjct: 185 SAPADVARFTGAATAGYGAAGMICTPKHFPGHGDTTIDSHVGLPRSDHDAAALDKLDFPP 244 Query: 197 FSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWLKTVLRQELGFDGVIFSDDLSME 253 A+M AH++ +D PA+ S L+ LR +GFDGVIF+D L M Sbjct: 245 -FRAAFAAGAPAVMTAHMVVPALDGEPDLPATLSRRVLEGTLRGRMGFDGVIFTDSLGMG 303 Query: 254 GAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQE 313 A E +L AG D++LV + SP ER+ + R Sbjct: 304 AVADTYGIPEASVRALAAGADVLLVGAD--------AGRSP--GERLAAMDAVAEAVRSG 353 Query: 314 LMDSARWKAISTRLNQLHERW 334 + AR A R+ +L ER+ Sbjct: 354 RVPMARLNAAVGRVLRLKERY 374 >UniRef50_Q7NG68 Glr3305 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NG68_GLOVI Length = 568 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 102/328 (31%), Positives = 156/328 (47%), Gaps = 21/328 (6%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G VM+ + E E R ++ VGGLI++ L E + ++A S L+V+ Sbjct: 46 IGQVMMPM--LESPEEGRVLVERFGVGGLIIYRTGAR---ALAEQLNALQALSAVPLLVS 100 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D E G V + +G T LP A G LA+ AG + A E A+ + + FAPV Sbjct: 101 ADFERG-VGAYIDGATDLP----IAMALGAAGDAALARTAGEITAREARALGVHVVFAPV 155 Query: 121 LDVGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LDV + RS+ P + + +I G+ G T KHFPGHG V+AD+H+E Sbjct: 156 LDVNNNPRNPIINVRSFGQSPDLVSRLGAAYIAGLEAQGAMATAKHFPGHGNVSADTHRE 215 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLR 236 P + ++ F S++ +M AH+ +DP PA+ S ++ LR Sbjct: 216 LAALPGNLVALEQVELKPFRSVLAGRAPHGVMAAHLWVQAIDPEPVPATCSSRVIEGFLR 275 Query: 237 QELGFDGVIFSDDLSMEGAAI---MGSYAERGQASLDAGCDMILVCN--NRKGAVSVLDN 291 ++LGF G++++D L M GA I G YA +L+AGCD++++ N KG S L Sbjct: 276 RQLGFGGLVYTDSLGM-GAVIEYAKGDYARAQLLALEAGCDVLVIPTGLNAKGEHSPLVG 334 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSAR 319 + I + + G S L S R Sbjct: 335 VE-IGTKAIKTALRAGRLSEARLDRSVR 361 >UniRef50_A5FAE4 Candidate beta-glycosidase; Glycoside hydrolase family 3 n=3 Tax=Flavobacteriaceae RepID=A5FAE4_FLAJ1 Length = 997 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 86/283 (30%), Positives = 137/283 (48%), Gaps = 17/283 (6%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA 85 VG +I F P + +L ++ ++ L V +D E G R + R P + Sbjct: 107 VGSVIFFQGG---PVRQAKLTNLYQSKAKVPLFVGIDAEWGLAMRLDSTY-RYPWNMTLG 162 Query: 86 ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHADPQKAL 143 A+ + KL ++ G MA+E + + +FAPVLD+ + IG RS+ D Sbjct: 163 AIQDL----KLIEKVGRNMANENKRIGVHFNFAPVLDINTNPKNPIIGNRSFGEDKVNVS 218 Query: 144 AIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE 203 A ++G+ G+ TGKHFPGHG DSH P + + ++ + + E Sbjct: 219 EKAIALMNGIQGNGVFCTGKHFPGHGDTATDSHHALPTVNFSKERLDLVELYPYKRMFDE 278 Query: 204 NKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGS 260 L ++M AH+ ++P P+S S + +L++ELGFDG+IF+D L+M+GAA Sbjct: 279 G-LVSVMVAHLNIPSLEPIPNLPSSASYNVVTNLLQKELGFDGLIFTDGLAMKGAANFKG 337 Query: 261 YAERGQASLDAGCDMILVCNNRKGAVSVLD---NLSPIKAERV 300 + A L AG D++L N AV L+ N + I ER+ Sbjct: 338 PGDLEIAVLLAGNDILLCPENVPVAVQKLEEAYNNNIITEERL 380 >UniRef50_Q47M03 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase family 3 n=1 Tax=Thermobifida fusca YX RepID=Q47M03_THEFY Length = 552 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 93/289 (32%), Positives = 143/289 (49%), Gaps = 21/289 (7%) Query: 23 HPLVGGLILFTRNYHDPAQLRELVR--QIRAASRNR---LVVAVDQEGGRVQRFREGFTR 77 HP GG I F N P Q+ + Q RA L + VD+E G V R G R Sbjct: 82 HP--GGFIYFPENLETPEQVAAMSNGLQERATGTGAGIPLFLGVDEEQGLVSRLPFG-AR 138 Query: 78 LPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA---AIGERS 134 P A + A E LA+E A ++ A+ I++ +APV DV +++A IG RS Sbjct: 139 FPDAMALGAARDPE----LARELAAATAEQLAAVGINLDYAPVADV-NVNADNPVIGIRS 193 Query: 135 YHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDM 194 + +DP+ +A+ + + G+ KHFPGHG DSH P + + E D+ Sbjct: 194 FSSDPELVGELAAAEVAAFQDGGVVAVAKHFPGHGDTDVDSHTGLPVIDKSRDEWERVDL 253 Query: 195 SVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLSM 252 F L+ + +D +M AHV+ + PA+ SP+ + VLR+ELG+DGV+ +D L+M Sbjct: 254 PPFQRLV-DAGVDMVMTAHVLMPQLGDAEEPATLSPHLITGVLREELGYDGVVTTDALNM 312 Query: 253 EGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 EG E ++ AG D +L+ + A S + LS ++ R++ Sbjct: 313 EGVRQTHDDGEVAVRAVLAGADQLLMPPDVDLAYSAV--LSAVEEGRIS 359 >UniRef50_Q2SV66 Beta-N-Acetylglucosaminidase n=31 Tax=Burkholderiaceae RepID=Q2SV66_BURTA Length = 699 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 99/338 (29%), Positives = 151/338 (44%), Gaps = 23/338 (6%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA----SRNRLVVAVDQEGGR 67 E A R+ L VGG+ILF+ N Q R L+ IRAA S L++ +D+EGG Sbjct: 89 EFPAAARDALRDSGVGGVILFSNNLTGIEQTRRLIDGIRAAPAADSPVGLLIGIDEEGGN 148 Query: 68 VQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH- 125 V R R T + A +LA + G ++A+E+ A+ +++FAP +DV Sbjct: 149 VFRLPRVEATAFSGNMALGAAYEATRDDRLAYDMGRVLAAEIAAVGFNVNFAPDVDVNSN 208 Query: 126 -ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 ++ I R++ DP + R + GM + T KHFPGHG DSH P + Sbjct: 209 PLNPVINVRAFGDDPATIGLLGRRMVQGMASERVIGTFKHFPGHGDTDTDSHYGLPVVIK 268 Query: 185 PQAEIRAKDMSVFSSLIRENKL-DAIMPAHVIYSDVDPR------------PASGSPYWL 231 +A+ A D++ + I + D IM AH+ Y +D PA+ S Sbjct: 269 SRADAYAIDLAPYRQAIESGEAPDMIMTAHIQYPSLDDTRVTTRTGEQMIAPATMSRRIQ 328 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR E G+ GV +D L M+G A A+ A D+ L+ + A Sbjct: 329 HDILRGEFGYRGVTITDALDMKGIADFFDEADAVIKVFQADVDIALMPVEFRTAADA-GR 387 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 L+ + +RV G R E DS+ + + T+L Sbjct: 388 LAAL-VDRVAAAVDSGRIDRAEF-DSSVRRIVLTKLRN 423 >UniRef50_A5CLY4 Putative beta-N-acetylglucosaminidase n=1 Tax=Clavibacter michiganensis subsp. michiganensis NCPPB 382 RepID=A5CLY4_CLAM3 Length = 607 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 92/276 (33%), Positives = 133/276 (48%), Gaps = 19/276 (6%) Query: 15 AEEREILAHPLVGGLILFT---RNYHDPAQLR----ELVRQIRAASRNRLVVAVDQE-GG 66 A E++A GG+I F N DP Q+ +L R + A R L+V+ DQE GG Sbjct: 94 ATPAEVVARHRPGGIIYFAGEKDNLVDPRQIARLSGDLQRASQGAGRIPLLVSTDQEQGG 153 Query: 67 RVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI 126 V R T LP A+ + + E++AM I+ +FAPV+DV Sbjct: 154 YV--VRAPCTLLPGQMGLAS---SPSSSADVRAVATITRDELLAMGINQAFAPVVDVASN 208 Query: 127 SA--AIGERSYHADPQKALAIASRFIDGMHE-AGMKTTGKHFPGHGAVTADSHKETPCDP 183 A I RS+ DP + +A I + G+ KHFPGHGA DSH P + Sbjct: 209 PANPIIHVRSFGDDPDRVALLAQSQIGTFQKRQGIVAAAKHFPGHGATDTDSHTGLPVNN 268 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGF 241 R ++E+ D+ F++ I+ + IM AH++ +D PA+ S L VLRQ+LG+ Sbjct: 269 RSRSELEDNDLPPFAAAIKSG-VGVIMSAHIVVPALDSSGDPATLSYPILTGVLRQKLGY 327 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 DGV+ +D L M G S A ++ AG D +L Sbjct: 328 DGVVITDALDMAGVRQRYSDARVPVLAIKAGVDQLL 363 >UniRef50_B8DL88 Beta-N-acetylhexosaminidase n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DL88_DESVM Length = 586 Score = 111 bits (277), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 99/298 (33%), Positives = 149/298 (50%), Gaps = 64/298 (21%) Query: 26 VGGLILFTRNYHDPAQLRELV--RQIRAASRNR------------LVVAVDQEGGRVQRF 71 VGG+ILF R+ + R ++ +Q+RA S L VAVDQEGGRVQR Sbjct: 227 VGGVILFDRDVTTRSPERNIISPKQVRALSAALQAAAGASPAGLPLFVAVDQEGGRVQRL 286 Query: 72 R--EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHIS 127 + GFT+ P A+ + E +L A M E+ + ++++FAPV+DV S Sbjct: 287 KPERGFTQYPPARQLGQGTTGET--RLRAVA---MGRELADVGVNLNFAPVVDVDVNPDS 341 Query: 128 AAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRP 185 AIG R+Y +DP+ A A+ FI+GM +AG+ + KHFPGHG+ ADSH Sbjct: 342 PAIGRLGRAYGSDPRAVAAHAAAFINGMAQAGVVSCLKHFPGHGSARADSHLGV---TDV 398 Query: 186 QAEIRAKDMSVFSSLIR--------------------ENKLDAIMPAHVIYSDVDPRPAS 225 A R +++ + +L+R +N+LDA PA + + +D Sbjct: 399 SATRRPEELWPYRTLLRPAGNAWADGWGGMVMVGHLYDNRLDAAHPATLSRATID----- 453 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYA--ERGQASLDAGCDMILVCNN 281 +LR+++G+ GV+ +DDL M AI Y E ++DAG D++L NN Sbjct: 454 -------GLLRRDMGWRGVVITDDLQM--GAITDRYPLEEVVFRAVDAGADILLFGNN 502 >UniRef50_A7ZD64 Beta-hexosaminidase A (N-acetyl-beta-glucosaminidase) (Beta-N-acetylhexosaminidase) (Chitobiase) n=2 Tax=Campylobacter RepID=A7ZD64_CAMC1 Length = 351 Score = 110 bits (276), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 82/283 (28%), Positives = 142/283 (50%), Gaps = 20/283 (7%) Query: 14 DAEEREILA---HPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQR 70 DA R +L+ + GG++L RN AQL+ ++ I+ S ++ +A+D+EGG V R Sbjct: 43 DAAFRAMLSDAGYERFGGVMLLGRNVTSKAQLKASIKAIKEKSP-KIFIAIDEEGGNVSR 101 Query: 71 FREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAI 130 ++ P ++ S ++ K A + MA + I+++FAPV+D+ ++ I Sbjct: 102 MKDKSFDGPYPSAYEVASTLDI--KSAYDLYSKMAINLKECGINLNFAPVVDLHDENSPI 159 Query: 131 ---GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 +R++ K + A F+D E G+ TT KHFPGHG+ DSHK + + Sbjct: 160 IAAKQRAFSEYASKVVIYADAFMDAFKEQGILTTLKHFPGHGSSKEDSHKN-----KSEV 214 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV-DPRPASGSPYWLKTVLRQELGFDGVIF 246 + + + I + IM H+ + D PA+ S + +LR EL F+GV+ Sbjct: 215 TLSKDALLPYKDAISTGRAQIIMVGHLFVKGIDDDNPATLSKKIITDLLRNELKFNGVVI 274 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILV----CNNRKGA 285 SDD+ M+G + A++ ++AG D++L NN++ A Sbjct: 275 SDDMLMKGVG-DEALAQKVVKFINAGGDILLFSEFKINNQRTA 316 >UniRef50_B8HWS2 Beta-N-acetylhexosaminidase n=5 Tax=Cyanobacteria RepID=B8HWS2_CYAP4 Length = 552 Score = 110 bits (276), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 82/271 (30%), Positives = 136/271 (50%), Gaps = 9/271 (3%) Query: 46 VRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEG--GKLAQEAGWL 103 +Q+++ + L++A D E G QRF G T P + AA++ A + G Sbjct: 72 TQQLQSWATIPLLLAADIEEGVGQRF-SGATWFPPPMALAAIARQNPALAADYAHQMGAH 130 Query: 104 MASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTT 161 A+E +A+ ++ APV+DV + + I R++ + + + + FI+G + T Sbjct: 131 TAAEALAIGLNWVLAPVVDVNNNPDNPVINVRAFGENTGEVSQLTTAFIEGAQTHPVLTA 190 Query: 162 GKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP 221 KHFPGHG DSH E P P P ++A ++ F + I +D++M AH++ +DP Sbjct: 191 AKHFPGHGDTATDSHLELPLIPHPLERLQAVELPPFQAAIAAG-VDSVMSAHLLIPALDP 249 Query: 222 -RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 PA+ S L LRQ LGF G+I +D L M A E +L+AG D++L+ Sbjct: 250 DYPATLSQRILTGELRQNLGFAGLIVTDALVMGAIANRYGLNEAPLMALEAGADILLMPL 309 Query: 281 NRKGAVSVLDNLSPIKAERVTRLYHKGSFSR 311 + GA+ + + +++ R+T K S R Sbjct: 310 DPAGAIQAI--CAAVESGRITVDRIKTSVER 338 >UniRef50_A0JYZ9 Beta-N-acetylhexosaminidase n=1 Tax=Arthrobacter sp. FB24 RepID=A0JYZ9_ARTS2 Length = 553 Score = 110 bits (276), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 83/242 (34%), Positives = 117/242 (48%), Gaps = 22/242 (9%) Query: 26 VGGLILFTRNYH-------DPAQLRELVRQIRAASRN-----RLVVAVDQEGGRVQRFRE 73 +GG I+ N D A + + ++R A++ +V VDQEGG V R R Sbjct: 109 LGGAIIMGDNVPLSADGTVDTAAMAAGISRLRNAAKADGRTWPALVGVDQEGGVVARLRA 168 Query: 74 GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIG 131 T PA SF A + LA +AG +A+E+ + FAP DV G IG Sbjct: 169 PLTEWPAPMSFGAAGNV----GLATDAGKALAAELAGLGFTADFAPDTDVTAGPQDPTIG 224 Query: 132 ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRA 191 R+ DP A ++ F GM AG+ + KHFPGHG+V DSH+ P A++RA Sbjct: 225 ARAMSGDPDAAASLGVGFAQGMLAAGILPSAKHFPGHGSVAVDSHENLPVQKATVAQLRA 284 Query: 192 KDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDL 250 KD F + I + L IM H+ ++P PAS S T+ + +GF GV +D L Sbjct: 285 KDWKPFQAAI-DAGLPMIMTGHISVPALEPGVPASLSKQSYATL--RGMGFKGVAVTDAL 341 Query: 251 SM 252 +M Sbjct: 342 NM 343 >UniRef50_C6W3E7 Beta-lactamase n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W3E7_DYAFD Length = 992 Score = 110 bits (275), Expect = 7e-23, Method: Compositional matrix adjust. Identities = 80/259 (30%), Positives = 129/259 (49%), Gaps = 19/259 (7%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA 85 VGG++ F P L ++ S+ L+VA+D E G R + R P + Sbjct: 91 VGGIVFFQGG---PMPQARLTNYYQSISKVPLLVAMDAEFGLAMRI-DSTVRYPYQMTLG 146 Query: 86 ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKAL 143 A+ G + L E G +A + + + I+FAPV DV + + I RS+ + K Sbjct: 147 AIQGNND---LIYEMGAQLAKQARRLGMHINFAPVADVNNNPDNPVISFRSFGENKYKVA 203 Query: 144 AIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE 203 A ++ GM + G+ T+ KHFPGHG DSH + P P I + ++ F +LI + Sbjct: 204 EKAVAYMRGMQDNGLLTSAKHFPGHGDTGTDSHYDLPLIPHSAQRIDSLELYPFRALI-D 262 Query: 204 NKLDAIMPAHVIYSDVDPR---PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGS 260 N L +M AH+ +D P++ S + +LR +LGF+G+I+SD ++M+G Sbjct: 263 NGLSGMMIAHLSIPALDKTPNLPSTLSKPIISNLLRHQLGFEGLIYSDAMNMKGVT---K 319 Query: 261 YAERGQAS---LDAGCDMI 276 Y G+A L+AG D++ Sbjct: 320 YFPNGKADAMGLEAGMDLL 338 >UniRef50_A0M1W2 Family 3 glycosyl hydrolase/beta-lactamase fusion protein n=16 Tax=Bacteroidetes RepID=A0M1W2_GRAFK Length = 978 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 81/262 (30%), Positives = 128/262 (48%), Gaps = 20/262 (7%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA 85 +GGLI F++ P +L + + S L+V +D E G R F LP + Sbjct: 86 IGGLI-FSKG--GPVMQAQLTNEFQEMSEIPLLVGMDAEWGLAMRLDSTFA-LPWNMTLG 141 Query: 86 ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKAL 143 A+ + KL +EAG ++ + + +FAPV+D+ + IG RS+ D Sbjct: 142 AI----QDNKLVEEAGAAISRHTKRLGVHFNFAPVVDINTNPDNPIIGNRSFGEDKINVT 197 Query: 144 AIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE 203 A F++GMH G ++ KHFPGHG DSHK P I ++ + LI + Sbjct: 198 EKALAFMNGMHREGGLSSAKHFPGHGDTDTDSHKTLPTINFTSERIDEVELYPYRKLIPK 257 Query: 204 NKLDAIMPAHV---IYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGS 260 L ++M AH+ D +PAS S + +L++ +GF+G+IF+D L M+G + Sbjct: 258 G-LSSVMVAHLNVPALESKDGKPASLSQKIITDILKKNMGFNGLIFTDALDMKGVS---R 313 Query: 261 YAERGQASLD---AGCDMILVC 279 E G+ LD AG D++L+ Sbjct: 314 NKEPGEVDLDAFMAGNDVLLMS 335 >UniRef50_C4Y752 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y752_CLAL4 Length = 985 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 84/297 (28%), Positives = 135/297 (45%), Gaps = 23/297 (7%) Query: 9 EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR------AASRNRLVVAVD 62 + YEL + R V +IL RN Q+ L+R ++ A + L++A+D Sbjct: 22 QAYELIVKHR-------VSTMILSRRNAESAEQMAHLIRDLQRIALEHAHYKYPLLIAID 74 Query: 63 QEGGRVQRFREG--FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +EGG + + T+ P A + AA E L E A E+ + + PV Sbjct: 75 EEGGMLNTLFDSRDLTQFPGAMALAATGDTE----LVYEVARATARELRHIGFSVILGPV 130 Query: 121 LDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LDV +A + RS+ + + G+ + G+ T GKHFPG G T DS E Sbjct: 131 LDVVTKMSAQLVSVRSFGSSVEDVARYGRACARGLRDGGLITVGKHFPGIGNATVDSLLE 190 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRP--ASGSPYWLKTVLR 236 P ++R ++ F+ LI+E LD I A + P A SP L +LR Sbjct: 191 VPIMAESLDQLRGSNVVPFAELIQEGLLDGISAAGCGVPAIAPDEVHACLSPVLLTQLLR 250 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +EL FDG + S+ L M+ + ++ AGCD+++ C++ + V +D+L+ Sbjct: 251 EELHFDGFVISECLEMDALYHSIGLGQGAVLAVCAGCDLVMCCHDFERQVEAIDSLA 307 >UniRef50_A1SM50 Glycoside hydrolase, family 3 domain protein n=7 Tax=Actinobacteria (class) RepID=A1SM50_NOCSJ Length = 631 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 101/308 (32%), Positives = 147/308 (47%), Gaps = 29/308 (9%) Query: 27 GGLILFT-RNYHD----PAQLRELVRQIRAASRN-----RLVVAVDQEGGR-VQRF--RE 73 GG+I FT RN D P Q+ L ++AA+R L ++VDQEGG V RF Sbjct: 113 GGVIYFTTRNGDDNIGTPEQVARLSNGLQAAARALPGGIPLQISVDQEGGALVARFGAAS 172 Query: 74 GFTRLPAAQSFAALSGMEEGGKL-AQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAI 130 G T+LP + A + G A+ + ++ +E+ A+ + +APV DV + I Sbjct: 173 GATQLPGNMALGAGALGTGGSAADARRSATVIGAELAAVGVTQDYAPVADVNVNPNNPVI 232 Query: 131 GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIR 190 G RS +DP + + + G H G+ KHFPGHG DSH P ++++ Sbjct: 233 GIRSIGSDPALVSDLVAAQVRGFHRGGVSAVAKHFPGHGDTGVDSHFGLPEVTHTRSQLE 292 Query: 191 AKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDD 249 D+ F + I +D IM AHV+ +DP PA+ S L +LR+ELGFDG+I +D Sbjct: 293 EIDLPPFRAAIAAG-VDTIMTAHVVLPAIDPGVPATMSRKILTGLLRRELGFDGLIVTDA 351 Query: 250 LSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSF 309 L M GA +L AG D +L+ A + +KA R G Sbjct: 352 LDMGGATATYPPDVAPVRALLAGADQLLIPPEMDTAYRAV-----LKAVR------SGQI 400 Query: 310 SRQELMDS 317 SR+ L +S Sbjct: 401 SRERLNES 408 >UniRef50_B0P7U2 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7U2_9FIRM Length = 555 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 92/311 (29%), Positives = 149/311 (47%), Gaps = 43/311 (13%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA-SRNR---LVVA 60 + + G LD + L +G +ILF+ N P Q EL +++ ++N ++A Sbjct: 12 LTSIPGDTLDEQTIAFLNETNIGNIILFSNNTPSPIQTAELTAALQSCITQNTGVPAIIA 71 Query: 61 VDQEGGRVQRF--------------REGFTRL---------PAAQSFAALSGMEEGGKLA 97 VDQEGG + R + G L P A + AA G + Sbjct: 72 VDQEGGEISRIIGSAYEYAPDENGIKYGEIPLGAGIETFLYPCAMAIAASGGAPTAQEPV 131 Query: 98 QE------AGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERSYHADPQKALAIASRF 149 E AG + E+ A+ I+ + APV D+ A IG RSY D ++ A A+ F Sbjct: 132 PEFNAVRTAGLYIGQELRAVGINCNLAPVADINSNPANPVIGTRSYGDDAERVAADAAAF 191 Query: 150 IDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAI 209 G+ E+G+ KH+PGHG + DSH P + E+R+ ++ F++ ++ + A+ Sbjct: 192 ATGLQESGVLAVAKHYPGHGDTSIDSHYGLPSVNKTLEELRSLELIPFNA-VQNADIGAV 250 Query: 210 MPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGS--YAERG 265 M AH+++ +D PA+ S L VLR+ GFDG+I +D + M AIM + + E Sbjct: 251 MAAHIVFPQIDSTGLPATLSQPVLTGVLRKG-GFDGLILTDSMRM--GAIMQNFGFGEAC 307 Query: 266 QASLDAGCDMI 276 +++AG D+I Sbjct: 308 VTAINAGADLI 318 >UniRef50_A3HXS5 Putative hydrolase/beta lactamase fusion protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HXS5_9SPHI Length = 984 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 80/271 (29%), Positives = 126/271 (46%), Gaps = 14/271 (5%) Query: 16 EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGF 75 E E++ + +GGLI F P + L +A S+ L +A+D E G R + Sbjct: 77 EISELVKNENLGGLIFFQGG---PDRQARLTNYYQAQSKTPLFIAMDAEWGISMRL-DSI 132 Query: 76 TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGER 133 P A + A+ + L + G MA + + + I+FAPV+DV + IG R Sbjct: 133 PDFPKAMTLGAVQDTD----LIYQMGTEMARQFKELGMHINFAPVVDVNSNPDNPVIGYR 188 Query: 134 SYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKD 193 ++ + + A ++ G+ + G+ KHFPGHG DSH P P+ I D Sbjct: 189 AFGENKKLVAQRAVSYMKGLQDHGVIANAKHFPGHGDTEVDSHYSLPVIKNPEQRIWDVD 248 Query: 194 MSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTV---LRQELGFDGVIFSDDL 250 + + L REN L ++M AH+ +D R + K V L+ + F G+IF+D L Sbjct: 249 LYPYQELFREN-LMSVMVAHLNIPSLDSRADMATSLSKKVVTDLLQNRMNFQGLIFTDAL 307 Query: 251 SMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 +M G A S E +L AG D++L + Sbjct: 308 NMRGVANTNSPGEVDLKALLAGNDILLYSQD 338 >UniRef50_UPI000196C637 hypothetical protein CATMIT_02394 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196C637 Length = 790 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 98/358 (27%), Positives = 173/358 (48%), Gaps = 41/358 (11%) Query: 6 LDVEGYE---LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQI-RAASRNR---LV 58 L G+E ++ + R+I+ G LI F +N + Q L + +AA+++ L+ Sbjct: 61 LQTTGHEFTKMNDQVRKIVEDYDFGALIYFAQNIQETEQSYNLSMAMQKAATKDGGIPLL 120 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 ++ DQEGG V R G T LP + A E A++AG ++ SE+ ++ I+ + A Sbjct: 121 ISADQEGGSVYRLGSG-TALPGNMALGATHNTE----YAKKAGQIIGSELSSIGINTNLA 175 Query: 119 PVLDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PV+DV + + IG RSY + +AS I G+ + + KHFPGHG DSH Sbjct: 176 PVVDVNNNANNPVIGLRSYSDNADVVGNMASATIAGLKDYNVIGCAKHFPGHGDTATDSH 235 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-------------PRP 223 P + +AE+ ++ + I + ++ IM AH++Y +D +P Sbjct: 236 YGLPIVNKSKAELLNNELKPYKIAINQG-IEMIMTAHILYPQLDNTTVVSEKTGNEEKKP 294 Query: 224 ASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRK 283 A+ S + +L+ E+GF+GV+ +D ++M+ A ++ + +++AG D+I + Sbjct: 295 ATMSKAIITDLLKGEMGFNGVVTTDAMNMKAIADTFGESQAVKLAIEAGADLICMP---- 350 Query: 284 GAVSVLDNLSPIKA-----ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 +VL N +K + V KG S L D R + ++ + N+ W+E Sbjct: 351 ---TVLYNQEDVKKLDTIIDYVEDAVKKGEISESRLDDGCR-RILTVKENRGILDWKE 404 >UniRef50_D1CEL0 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CEL0_THET1 Length = 596 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 82/277 (29%), Positives = 129/277 (46%), Gaps = 16/277 (5%) Query: 26 VGGLILFTR--NYHDPAQLRELVRQIRAASRNRLV-----VAVDQEGGRVQRFREGFTRL 78 VG +I FT N + Q+ L I+ +S + V ++ DQEGG + R R Sbjct: 97 VGSIIYFTWTGNIKNAQQVARLSNGIQRSSMTQRVPVPTLISTDQEGGIISRLPSQIVRS 156 Query: 79 PAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIGERSYH 136 P + G A A +++ E+ A+ I+ + AP DV + IG RS+ Sbjct: 157 PGNMAL----GAARDKTYAFRAAYIVGKELRALGINQNLAPDADVNINPFNPVIGVRSFG 212 Query: 137 ADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSV 196 + + + G + T KHFPGHG DSH P Q ++ D+ Sbjct: 213 DRSWLVARLTAAQVRGYQSVDVAATAKHFPGHGDTNIDSHTGLPVIRHTQKQLEDIDLPP 272 Query: 197 FSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEG 254 F + I ++ +D IM AH++ +DP RPA+ S L +LRQ+LGF GVI +D L M G Sbjct: 273 FRAAI-DSGVDVIMSAHIVVPSLDPSGRPATLSKPILTGLLRQKLGFKGVIMTDSLEMAG 331 Query: 255 AAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 M + ++ AG D++L+ + A++ + N Sbjct: 332 VRQMFPDSRVPVEAIKAGADLLLMPPDLNLAINSVVN 368 >UniRef50_Q1IHT6 Glycoside hydrolase, family 3-like n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IHT6_ACIBL Length = 624 Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 84/285 (29%), Positives = 142/285 (49%), Gaps = 24/285 (8%) Query: 18 REILAHPLVGGLILFTRNYHD-------PAQLRELVRQIRAASRNRLVVAVDQEGGRVQR 70 R+ +A +GG + T N+ D P + ++ +++ S L++A D E G R Sbjct: 78 RDTMARYHLGGFGV-TVNFEDGFLFKTEPYEAAMMINELQQGSEIPLIIAADFERGLSMR 136 Query: 71 FREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA-- 128 +E T P A +F G A+E G + A E A+ ++ ++ P DV A Sbjct: 137 LKEA-TDFPHAMAF----GATNNPAYAKEFGRITALESRAIGVEWNWFPDADVNSNPANP 191 Query: 129 AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAE 188 I RS+ DPQ ++ +I+G H G+ TT KHFPGHG D+H T +P Sbjct: 192 IINTRSFGEDPQAVASMVKAYIEGAHAEGLLTTVKHFPGHGDTDTDTHLATARINQPLEH 251 Query: 189 IRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLRQELGFDGVI 245 I+ ++ F + I + +D++M H++ +DP R A+ SP + L+++LGF G++ Sbjct: 252 IQNVELVPFKAAI-DAGVDSVMIGHLVVPALDPDTNRVATISPKIVNGTLKKDLGFQGLV 310 Query: 246 FSDDLSMEGAAIMGSYAERGQ-----ASLDAGCDMILVCNNRKGA 285 +D + M G A + + G A++ AG DM+L+ ++ GA Sbjct: 311 VTDAMEMNGLAKLFGFGPEGSARAAVAAVKAGDDMLLLPSDLDGA 355 >UniRef50_UPI0001C379AD glycoside hydrolase family 3 domain protein n=2 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C379AD Length = 452 Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 83/282 (29%), Positives = 133/282 (47%), Gaps = 14/282 (4%) Query: 18 REILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVVAVDQEGGRVQRFRE 73 R+ L VGGLI F+ N Q+ +++R + + L VD+EGG V R E Sbjct: 150 RDGLEEYPVGGLIFFSDNMESREQVTDMIRTTKDYAAGLGCVPLFFGVDEEGGVVSRCAE 209 Query: 74 GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHISAAIG 131 +L + S ++G A +A ++ D+ FAPV D + IG Sbjct: 210 ---KLGTSAFEPMYSYKDKGPVTAYANARKLAGDIGQFGFDLDFAPVADTWSNPDNKVIG 266 Query: 132 ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRA 191 R+Y D ++ + + G E G+ T KHFPGHG DSH R E+R+ Sbjct: 267 TRAYSDDFEETAMLVGAAVRGFREGGVHCTLKHFPGHGDTYEDSHDGFAVSNRTLDELRS 326 Query: 192 KDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLS 251 + + F I+ D +M H+ +D PAS S + LR LG+DG++ +D L+ Sbjct: 327 NEYTAFDRGIKAGA-DFVMVGHITVPAIDNLPASLSSKVINDELRGTLGYDGIVITDALA 385 Query: 252 MEGAAIMGSYAERGQA--SLDAGCDMILVCNNRKGAVSVLDN 291 M AI +Y+ A +++AG DM+L N + +V+ +++ Sbjct: 386 M--GAIANAYSSSVCAVMAVEAGNDMLLSPKNLRESVAGIED 425 >UniRef50_Q8RD12 Beta-glucosidase-related glycosidases n=2 Tax=Thermoanaerobacteraceae RepID=Q8RD12_THETN Length = 591 Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 83/295 (28%), Positives = 143/295 (48%), Gaps = 27/295 (9%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAAS-RNRLVVAVDQEGGRVQRFREGFTRLPAAQSF 84 +GG+ILF+ N Q+ +L I++ S + L ++ DQEGG V R +G T+ P + Sbjct: 84 LGGVILFSENIQSIEQIVKLTNDIQSLSLQIPLFISTDQEGGLVYRLSQG-TKFPGNMAL 142 Query: 85 AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIGERSYHADPQKA 142 A + LA + G ++ SE+ A+ I+ APV+DV + IG RS+ +P Sbjct: 143 GATFSED----LAYKVGKVIGSELKAVGINWDLAPVMDVNVNPYNPVIGIRSFGGNPDVV 198 Query: 143 LAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIR 202 + F+ G+ E + + KHFPGHG V DSH + + + ++ F I+ Sbjct: 199 AKLGVAFMKGLQENKIASCVKHFPGHGDVDVDSHLGLGTVNKNRETLEKIELKPFEEAIQ 258 Query: 203 ENKLDAIMPAHVIYSDVDPR----------------PASGSPYWLKTVLRQELGFDGVIF 246 DA+M AH+ + +D PA+ S L ++R+ G++GVI Sbjct: 259 AGT-DALMTAHLSFPALDDTKVIAKTSEGVQKEITIPATLSHKILTDLVRKTWGYEGVII 317 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS--VLDNLSPIKAER 299 +D ++M+ + + ++ AG D+IL+ + GA + VL+ I +E+ Sbjct: 318 TDAMNMKAISDNFGPVDAVVRAVKAGADIILMPVDLNGAFNELVLETKKGIISEK 372 >UniRef50_A6LEE9 Glycoside hydrolase family 3, candidate beta-glycosidase n=6 Tax=Bacteroidales RepID=A6LEE9_PARD8 Length = 999 Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 99/326 (30%), Positives = 153/326 (46%), Gaps = 25/326 (7%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA 85 +GG ILF + DP E+ +++ ASR ++V++D E G R G TR P Sbjct: 80 IGG-ILFHKG--DPVTQAEVTNRLQKASRIPMLVSLDGEWGLSMRL-SGTTRFPKNMMLG 135 Query: 86 ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKAL 143 A+ E L +E G + + M I I+FAP +DV + IG RS+ +P+ Sbjct: 136 AI----EDNALIEEYGKEVGRQCREMGIHINFAPDMDVNSNVDNPVIGLRSFGENPEAVS 191 Query: 144 AIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE 203 + G+ G+ + KHFPGHG + DSH+ P +A + + ++ F I + Sbjct: 192 EKGIAYARGLENTGILSVSKHFPGHGDTSEDSHETLPVVRHNRARLDSVELLPFKRYIYD 251 Query: 204 NKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSY 261 IM H+ +D +PAS S + +L++ELGF G+ F+D L+M+GA+ + Sbjct: 252 G-FGGIMTGHLYVPALDKSHKPASFSKAVVTDLLQKELGFQGLCFTDALAMKGASTKKTD 310 Query: 262 AERGQASLDAGCDMILVCN---NRKGAVS------VLDNLSPIKAERVTRLYHKGSFSRQ 312 +A L AG D++L N AV VLD L I+A+ + L +K + Sbjct: 311 NPSVKALL-AGNDILLAPAAPINDFTAVKEAIEEGVLD-LEAIEAKCLKILRYK-YIAGL 367 Query: 313 ELMDSARWKAISTRLNQLHERWQEEK 338 K +S RLN H W K Sbjct: 368 NAYKPVETKGLSKRLNSPHAAWMAAK 393 >UniRef50_UPI0001621051 predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=UPI0001621051 Length = 433 Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 83/263 (31%), Positives = 131/263 (49%), Gaps = 21/263 (7%) Query: 50 RAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMI 109 R A L++ VDQEGG VQRF FT +P A+ S E G +A ++A E+ Sbjct: 108 REAGNRPLLIMVDQEGGYVQRFGPPFTVIPHARIVGDTSDPEAAGAIAS----VIAKELR 163 Query: 110 AMDIDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPG 167 A+++D++ APVLDV + IG+RS+ P + + +I G+ G+ KH+PG Sbjct: 164 AVNVDMNLAPVLDVDTNPENTVIGKRSFATTPARVANLGFAYIRGLQCEGVAACAKHYPG 223 Query: 168 HGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAI-----MPAHVIYSDVDPR 222 HG T DSH + P P + +MS F+ + + + +P ++ + Sbjct: 224 HGDTTLDSHVDLPLLPHSLTRLVEIEMSPFAKAVDADVAAVMVAHIDVPCFSGRTESARK 283 Query: 223 PASGSPYWLKTV--LRQELGFDGVIFSDDLSMEGAAIMGSY--AERGQASLDAGCDMILV 278 PA+ + T+ LR L F+GVI SD ME AI+ + E ++ AG DM+LV Sbjct: 284 PAT---MCVDTIAYLRDALLFEGVIISD--CMETGAIVKEHRIEEAAVQAVLAGIDMVLV 338 Query: 279 CNNRKGAVSVLDNL-SPIKAERV 300 + ++V+D L + ER+ Sbjct: 339 SHTLSRQIAVVDALVQAVLTERI 361 >UniRef50_D1YED3 Glycosyl hydrolase family 3 N-terminal domain protein n=3 Tax=Propionibacterium acnes RepID=D1YED3_PROAC Length = 629 Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 90/302 (29%), Positives = 144/302 (47%), Gaps = 26/302 (8%) Query: 26 VGGLILFTRNYH-----DPAQLRELVRQIRAASRNR----LVVAVDQEGGRVQRFREGFT 76 +GG++ F + + +PA + L ++ A+R L +AVDQEGG V R T Sbjct: 84 LGGVLYFNWSGNISTPANPAAVAHLSNGLQEAARTSSTAPLAIAVDQEGGIVARITSPAT 143 Query: 77 RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERS 134 P + A++ + A+ G ++ E+ A+ I++ FAP +DV A IG RS Sbjct: 144 EFPGNMALGAVND----PRAARAQGAVLGRELAALGINVDFAPDVDVNTNPANPVIGVRS 199 Query: 135 YHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDM 194 DP + + I G+ G+ T KHFPGHG DSH P + + + + Sbjct: 200 MGDDPVRVGVLGVEQIRGIQSRGVAATAKHFPGHGDTQIDSHLGLPVVEYGRTTLE-RHL 258 Query: 195 SVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSME 253 F I ++D IM AH+I +DP PA+ S L +LR+++G+ G+I +D L ME Sbjct: 259 VPFRMAI-SAEVDMIMTAHIIVKTLDPTMPATLSKKVLTGLLREQMGYRGIITTDALDME 317 Query: 254 GA--AIM-----GSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHK 306 GA A+M G+Y R +A+ DA + + + + PI+ + + Sbjct: 318 GAQLAVMTEDEKGTYT-RLKAAADAEGNDPTAADEGHASAEFNAFMKPIRGRVAVQAFEA 376 Query: 307 GS 308 GS Sbjct: 377 GS 378 >UniRef50_B1Y1A2 Glycoside hydrolase family 3 domain protein n=1 Tax=Leptothrix cholodnii SP-6 RepID=B1Y1A2_LEPCP Length = 554 Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 85/276 (30%), Positives = 134/276 (48%), Gaps = 21/276 (7%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE 73 D E E++ +G + + N+ DP R ++ R ASR R +V+ D EGG Sbjct: 45 DEEAIELVERHQLGAVTVI--NFSDPDACRSVIDAARKASRIRPLVSADLEGGVTSGHLT 102 Query: 74 GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIG 131 T P AA + ++ G EA ++++E+ ++ ++ +F+PVLDV SA +G Sbjct: 103 --TTFPNQLGCAAAACLDTYG----EALRMLSAELASLGVNWTFSPVLDVNQKFQSAIVG 156 Query: 132 ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRA 191 RSY ++P +A I + G+ TT KH+PG G D H T +P E A Sbjct: 157 TRSYGSNPDLIAQLAGLHIQVFQDHGIATTAKHWPGEGFDDRDQHLVTTINPMTVEEWTA 216 Query: 192 KDMSVFSSLIRENKLDAIMPAHVIYSDV----------DPRPASGSPYWLKTVLRQELGF 241 +++ I L ++M AH+ + D RPAS S + +T+LR++L F Sbjct: 217 VFGQLYARAIDAGVL-SVMSAHIAFPDYLRRAGLSSIELYRPASISRHLNQTLLREQLQF 275 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 +G+I SD M G GS + ++ GCDMIL Sbjct: 276 NGLIVSDATLMGGLESWGSRRQWLPEVIENGCDMIL 311 >UniRef50_A0LR43 Glycoside hydrolase, family 3 domain protein n=1 Tax=Acidothermus cellulolyticus 11B RepID=A0LR43_ACIC1 Length = 475 Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 75/236 (31%), Positives = 122/236 (51%), Gaps = 12/236 (5%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQR--FREGFTRLPAAQS 83 + G+ ++ N D + + + + + + LV A+D+EGG V R +R G + P + Sbjct: 33 LAGVAIYGHNLVDDGSVARIAQAVHNTAPDALV-ALDEEGGDVTRLEYRTG-SSYPGNLA 90 Query: 84 FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQK 141 + +E L +A++++A ++ + AP +DV + IG RS+ ADP K Sbjct: 91 LGVVDDLE----LTARVAAAIAADLVAAGVNYNLAPAVDVNSDPRNPVIGVRSFGADPDK 146 Query: 142 ALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLI 201 A + FI M G+ T KHFPGHGA ADSH P + R +D+ F + I Sbjct: 147 VAAHGATFITAMQSRGIATAAKHFPGHGATVADSHHTLPVIDVDEPTFRRRDLPPFVAAI 206 Query: 202 RENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAA 256 + +IM +HV+++ +D PA+ SP L+ +LR ELG+ GV+ +D L M A Sbjct: 207 QAGAA-SIMTSHVVFTALDADLPATLSPRLLRGLLRSELGYSGVVVTDALDMRAVA 261 >UniRef50_B2RGN3 Glycosyl hydrolase family 3 n=2 Tax=Porphyromonas gingivalis RepID=B2RGN3_PORG3 Length = 1003 Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 80/257 (31%), Positives = 129/257 (50%), Gaps = 15/257 (5%) Query: 26 VGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA 85 +GG ILF + ++ + R+++ A+ L++A+D E G R ++ R P Sbjct: 98 IGG-ILFQKG--TLSEQYTMTRRLQEAAGTPLLIALDGEWGLHMRLKDA-PRFPRNMGL- 152 Query: 86 ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIGERSYHADPQKAL 143 G ++ +L G +A + M I I+FAPVLDV + + IG RS+ +P++ Sbjct: 153 ---GHQKDNQLLYNYGREVARQCRLMGIHINFAPVLDVNNNPKNPVIGTRSFGDNPRRVA 209 Query: 144 AIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE 203 + G+ + G+ KHFPGHG T DSHK P + E+ ++ F R Sbjct: 210 ERGIAYAQGLEDGGVMAVAKHFPGHGNTTEDSHKTLPTVFASREELENTELFPFKEFFRA 269 Query: 204 NKLDAIMPAHVIYSDVDPR---PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGS 260 L +M AH+ ++ + P+S S +LRQE+GF G+IF+D L+M+G GS Sbjct: 270 G-LSGVMTAHLNVPALEAKKNTPSSLSHAICTDLLRQEMGFKGLIFTDGLAMQGVQTAGS 328 Query: 261 YAERGQASLDAGCDMIL 277 +A L AG D++L Sbjct: 329 QPISVRAIL-AGNDILL 344 >UniRef50_A1K3B3 Putative beta-hexosaminidase n=1 Tax=Azoarcus sp. BH72 RepID=A1K3B3_AZOSB Length = 451 Score = 107 bits (268), Expect = 4e-22, Method: Compositional matrix adjust. Identities = 92/289 (31%), Positives = 147/289 (50%), Gaps = 23/289 (7%) Query: 10 GYELDAEEREILA-HPLVGGLILFTRNYHDPA--QLRELVRQIRAA-SRNRL---VVAVD 62 GY DA+E LA L+GG+ L N + +R + +++A +RN L +VA D Sbjct: 112 GYR-DADEVAALAERGLIGGIYLGRHNVRGRSVEAVRAEIDRLQALRARNALPPLIVAAD 170 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKL-AQEAGWLMASEMIAMDIDISFAPVL 121 QEGG V PA S L+G + G ++ A++ G L + + A+ + ++F PV+ Sbjct: 171 QEGGSVAHMSPPLAPQPALASV--LAGDDAGLEMRARDYGRLQGAGLAALGVTLNFGPVV 228 Query: 122 DV--GH------ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 D+ H IGER+ DP + +A + +G+ +AG++ T KHFPG G V Sbjct: 229 DLRPAHGGPLFDTHTRIGERAIDGDPARVARVARAYAEGLADAGVQATLKHFPGLGGVMR 288 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLK 232 D+H P A + A D F + AIM HV+ ++D RPAS SP ++ Sbjct: 289 DTHHFAARLDTPGAALAAHDWQAFRGAAPASA--AIMVGHVVLPELDADRPASLSPAVVQ 346 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 +LR+ G+ G++ +DDL+M GA +L+AG D++L+ + Sbjct: 347 GLLRRGWGYQGLLVTDDLNM-GAVYRRGTCRAAVEALEAGIDLVLISYD 394 >UniRef50_A7VCD5 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VCD5_9CLOT Length = 434 Score = 107 bits (268), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 82/277 (29%), Positives = 130/277 (46%), Gaps = 3/277 (1%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF 71 +LD E + L VGG++L ++N QL++L+++++ S L +AVD+EGG V R Sbjct: 124 KLDKEFKSALKKYPVGGIMLKSKNIQTKKQLKKLLKEMQDNSDITLFLAVDEEGGTVSRV 183 Query: 72 REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAA 129 + + + GG +A E + + ++ FAPV DV + Sbjct: 184 MQKLGNIDGGDIGSMYDYRNLGGDIAYENARQIGDNLSYYGFNLDFAPVADVWSNPGNTV 243 Query: 130 IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEI 189 IG+R+Y +A + + G H G+ T KHFPGHG DSH + ++ Sbjct: 244 IGKRAYSNIYSEAAELIPYAVRGFHAGGVLCTLKHFPGHGNTKEDSHYGNAYVYETKDDL 303 Query: 190 RAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDD 249 + F + I + D +M H+I +D+ P+ S + +LR ELGF VI +D Sbjct: 304 ENNEFQSFRAGI-DAGADFVMVGHMIVTDISNIPSDLSKEIVTDILRTELGFSRVIITDS 362 Query: 250 LSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 L ME S + ++ AG DMIL N + AV Sbjct: 363 LQMESITDTYSAGDAAIYAVAAGNDMILEPENLEQAV 399 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A7FH41 Beta-hexosaminidase n=129 Tax=Gammaproteobacteri... 437 e-121 UniRef50_A1SW90 Beta-hexosaminidase n=3 Tax=Alteromonadales RepI... 383 e-105 UniRef50_Q4QLU8 Beta-hexosaminidase n=138 Tax=Gammaproteobacteri... 374 e-102 UniRef50_Q8RD12 Beta-glucosidase-related glycosidases n=2 Tax=Th... 369 e-101 UniRef50_C6J2I0 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J... 369 e-101 UniRef50_UPI0001693BCF Beta-glucosidase-related glycosidase n=1 ... 366 e-100 UniRef50_A3I7T1 Beta-hexosaminidase n=2 Tax=Bacillaceae RepID=A3... 364 3e-99 UniRef50_C5DA89 Glycoside hydrolase family 3 domain protein n=6 ... 362 9e-99 UniRef50_Q8ETJ0 Beta-hexosaminidase n=1 Tax=Oceanobacillus iheye... 362 1e-98 UniRef50_D2MLA9 Beta-N-acetylglucosaminidase n=1 Tax=Candidatus ... 362 1e-98 UniRef50_Q3SKU2 Beta-hexosaminidase n=27 Tax=Proteobacteria RepI... 362 1e-98 UniRef50_Q8CX63 Beta-N-acetylhexosaminidase (Beta-hexosaminidase... 361 2e-98 UniRef50_A9NEX4 Glycoside hydrolase, family 3 n=1 Tax=Acholeplas... 361 3e-98 UniRef50_B0K1P2 Glycoside hydrolase, family 3 domain protein n=9... 360 5e-98 UniRef50_Q7PC47 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase ... 359 7e-98 UniRef50_C7IJM6 Glycoside hydrolase family 3 domain protein n=1 ... 357 3e-97 UniRef50_A5CZV7 Beta-glucosidase-related glycosidases and D-alan... 357 5e-97 UniRef50_Q31G32 Beta-hexosaminidase n=1 Tax=Thiomicrospira cruno... 356 5e-97 UniRef50_B9Y750 Putative uncharacterized protein n=1 Tax=Holdema... 356 8e-97 UniRef50_C4KZ16 Beta-N-acetylhexosaminidase n=2 Tax=Firmicutes R... 354 3e-96 UniRef50_Q0HJG7 Beta-hexosaminidase n=35 Tax=Gammaproteobacteria... 353 4e-96 UniRef50_D2M1Q8 Glycoside hydrolase family 3 domain protein n=1 ... 353 5e-96 UniRef50_Q1QVU0 Glycoside hydrolase, family 3-like protein n=1 T... 353 6e-96 UniRef50_C1XLF1 Beta-glucosidase-like glycosyl hydrolase n=2 Tax... 353 7e-96 UniRef50_P40406 Uncharacterized lipoprotein ybbD n=5 Tax=Bacillu... 352 7e-96 UniRef50_B5Y6T8 Beta-N-Acetylglucosaminidase n=2 Tax=Firmicutes ... 352 8e-96 UniRef50_A4C9R3 Beta-hexosaminidase n=1 Tax=Pseudoalteromonas tu... 352 1e-95 UniRef50_UPI0001C42D84 glycoside hydrolase family 3 domain prote... 351 2e-95 UniRef50_C6J1S3 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J... 349 7e-95 UniRef50_A6TMT8 Glycoside hydrolase, family 3 domain protein n=1... 349 8e-95 UniRef50_C7P2G4 Glycoside hydrolase family 3 domain protein n=1 ... 349 9e-95 UniRef50_C5BIH5 Glycoside hydrolase family 3 domain protein n=1 ... 349 1e-94 UniRef50_C6D1D6 Glycoside hydrolase family 3 domain protein n=1 ... 349 1e-94 UniRef50_UPI0000E11708 beta-hexosaminidase n=1 Tax=Rhodobacteral... 347 3e-94 UniRef50_A2QDM1 Contig An02c0230, complete genome n=9 Tax=Tricho... 347 4e-94 UniRef50_B8FZS5 Glycoside hydrolase family 3 domain protein n=2 ... 347 5e-94 UniRef50_D1CIC2 Glycoside hydrolase family 3 domain protein n=1 ... 346 9e-94 UniRef50_A9KMQ8 Glycoside hydrolase family 3 domain protein n=1 ... 345 1e-93 UniRef50_B1YFC7 Glycoside hydrolase family 3 domain protein n=6 ... 345 2e-93 UniRef50_C1I263 Beta-hexosamidase A n=1 Tax=Clostridium sp. 7_2_... 345 2e-93 UniRef50_B6Q1F2 Beta-N-acetylglucosaminidase, putative n=1 Tax=P... 344 2e-93 UniRef50_B6J7B4 Anhydromuramoyl-peptide exo-beta-N-acetylglucosa... 343 5e-93 UniRef50_A4BUN0 Glycoside hydrolase, family 3-like protein n=1 T... 342 1e-92 UniRef50_C8PT26 Glycosyl hydrolase, family 3 n=1 Tax=Treponema v... 341 2e-92 UniRef50_B4BR59 Glycoside hydrolase family 3 domain protein n=2 ... 341 2e-92 UniRef50_Q67SQ8 Putative beta-N-acetylglucosaminidase n=1 Tax=Sy... 341 3e-92 UniRef50_C0ZKK5 Probable beta-hexosaminidase n=1 Tax=Brevibacill... 341 3e-92 UniRef50_A8VQ50 Choline dehydrogenase n=1 Tax=Bacillus selenitir... 339 6e-92 UniRef50_Q4FV07 Glycoside hydrolase family protein n=32 Tax=Pseu... 339 8e-92 UniRef50_Q04TR7 Beta-glucosidase-related glycosidase n=4 Tax=Lep... 339 1e-91 UniRef50_A3DC81 Glycoside hydrolase, family 3-like protein n=3 T... 338 1e-91 UniRef50_B3TAA3 Putative glycosyl hydrolase family 3 N terminal ... 338 2e-91 UniRef50_Q8ERK3 Beta-N-acetylglucosaminidase n=2 Tax=Bacillaceae... 338 2e-91 UniRef50_A9M1Z4 Beta-hexosaminidase n=23 Tax=Neisseriaceae RepID... 338 2e-91 UniRef50_B8M4K9 Beta-N-acetylglucosaminidase, putative n=9 Tax=T... 337 3e-91 UniRef50_Q6MRE1 BglX2 protein n=2 Tax=Bdellovibrio bacteriovorus... 337 3e-91 UniRef50_A8F7W1 Glycoside hydrolase family 3 domain protein n=1 ... 337 3e-91 UniRef50_B2FPW9 Beta-hexosaminidase n=18 Tax=Xanthomonadaceae Re... 336 5e-91 UniRef50_A1AVY9 Beta-N-acetylhexosaminidase n=2 Tax=Gammaproteob... 336 8e-91 UniRef50_C1TNN0 Beta-glucosidase-like glycosyl hydrolase n=1 Tax... 336 8e-91 UniRef50_A8MJC3 Glycoside hydrolase family 3 domain protein n=2 ... 335 1e-90 UniRef50_B4RS37 Beta-hexosaminidase n=2 Tax=Alteromonas macleodi... 335 2e-90 UniRef50_Q1CXY4 Glycosyl hyrolase, family 3 n=6 Tax=Cystobacteri... 334 2e-90 UniRef50_C7MQM3 Beta-glucosidase-like glycosyl hydrolase n=2 Tax... 333 5e-90 UniRef50_Q2BGB3 Beta-hexosaminidase n=1 Tax=Neptuniibacter caesa... 332 1e-89 UniRef50_D1CEL0 Glycoside hydrolase family 3 domain protein n=1 ... 332 1e-89 UniRef50_B5EPA1 Glycoside hydrolase family 3 domain protein n=2 ... 332 1e-89 UniRef50_A3CMS4 Beta-hexosamidase A, putative n=1 Tax=Streptococ... 332 1e-89 UniRef50_C9NRN7 Beta-hexosaminidase n=1 Tax=Vibrio coralliilytic... 332 2e-89 UniRef50_C4ZK55 Glycoside hydrolase family 3 domain protein n=6 ... 331 2e-89 UniRef50_D0L0K9 Glycoside hydrolase family 3 domain protein n=1 ... 331 2e-89 UniRef50_Q15N77 Glycoside hydrolase, family 3-like n=1 Tax=Pseud... 331 2e-89 UniRef50_C6CWS0 Beta-N-acetylhexosaminidase n=1 Tax=Paenibacillu... 331 3e-89 UniRef50_B3QTH0 Beta-N-acetylhexosaminidase n=12 Tax=Chlorobiace... 330 3e-89 UniRef50_UPI00016BFC75 Beta-glucosidase-related glycosidase n=1 ... 330 5e-89 UniRef50_Q2SIQ7 Beta-glucosidase-related Glycosidase n=1 Tax=Hah... 330 5e-89 UniRef50_Q9RUP9 Glycosyl hyrolase, family 3 n=2 Tax=Bacteria Rep... 329 1e-88 UniRef50_UPI0000E0E41F Beta-hexosaminidase A precursor n=1 Tax=G... 329 1e-88 UniRef50_Q08U63 Beta-glucosidase n=1 Tax=Stigmatella aurantiaca ... 329 1e-88 UniRef50_C1D8P0 NagZ n=4 Tax=Proteobacteria RepID=C1D8P0_LARHH 328 2e-88 UniRef50_B6BUL6 Beta-hexosaminidase n=1 Tax=beta proteobacterium... 327 4e-88 UniRef50_C5CGK3 Glycoside hydrolase family 3 domain protein n=1 ... 327 4e-88 UniRef50_D1AF10 Glycoside hydrolase family 3 domain protein n=1 ... 327 5e-88 UniRef50_C6WFB0 Glycoside hydrolase family 3 domain protein n=1 ... 326 7e-88 UniRef50_D2BER2 Beta-hexosamidase A n=1 Tax=Streptosporangium ro... 325 1e-87 UniRef50_B3T2B0 Putative glycosyl hydrolase family 3 N terminal ... 325 1e-87 UniRef50_A7EUL4 Putative uncharacterized protein n=1 Tax=Sclerot... 325 2e-87 UniRef50_Q0VQT2 Beta-hexosaminidase n=3 Tax=Gammaproteobacteria ... 325 2e-87 UniRef50_B3E4C8 Glycoside hydrolase family 3 domain protein n=3 ... 324 2e-87 UniRef50_A6GP56 Beta-hexosaminidase n=1 Tax=Limnobacter sp. MED1... 324 3e-87 UniRef50_C6C0L9 Glycoside hydrolase family 3 domain protein n=1 ... 324 3e-87 UniRef50_B7S2D9 Glycosyl hydrolase family 3 N terminal domain pr... 324 4e-87 UniRef50_Q84IZ5 Beta-N-acetylglucosaminidase n=1 Tax=Clostridium... 322 8e-87 UniRef50_P48823 Beta-hexosaminidase A n=3 Tax=Alteromonadales Re... 322 9e-87 UniRef50_C0EVT4 Putative uncharacterized protein n=1 Tax=Eubacte... 322 1e-86 UniRef50_B0P2K0 Putative uncharacterized protein n=2 Tax=Clostri... 321 2e-86 UniRef50_D2PU34 Glycoside hydrolase family 3 domain protein n=1 ... 321 2e-86 UniRef50_Q1N543 Glycoside hydrolase, family 3-like protein n=1 T... 321 2e-86 UniRef50_C6N603 Beta-hexosaminidase n=2 Tax=Legionella RepID=C6N... 320 3e-86 UniRef50_Q1K0E5 Glycoside hydrolase, family 3-like n=1 Tax=Desul... 319 1e-85 UniRef50_A7BX55 Glycosyl hydrolase, family 3 n=4 Tax=Proteobacte... 319 1e-85 UniRef50_A8L1X4 Glycoside hydrolase family 3 domain protein n=3 ... 318 2e-85 UniRef50_Q47M03 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase ... 318 2e-85 UniRef50_B1WW93 Beta-glucosidase n=12 Tax=Cyanobacteria RepID=B1... 318 2e-85 UniRef50_Q47VB6 Beta-hexosaminidase n=1 Tax=Colwellia psychreryt... 317 3e-85 UniRef50_D2PT32 Glycoside hydrolase family 3 domain protein n=26... 317 3e-85 UniRef50_D1BPZ2 Glycoside hydrolase family 3 domain protein n=3 ... 317 5e-85 UniRef50_C9LY46 Beta-hexosaminidase n=1 Tax=Selenomonas sputigen... 317 5e-85 UniRef50_Q2JJA4 Glycosyl hydrolase domain protein n=1 Tax=Synech... 316 7e-85 UniRef50_A6NU14 Putative uncharacterized protein n=1 Tax=Bactero... 316 7e-85 UniRef50_UPI0000E87F6B beta-hexosaminidase n=1 Tax=Methylophilal... 315 1e-84 UniRef50_A2EGF0 Glycosyl hydrolase family 3 N terminal domain co... 315 1e-84 UniRef50_Q5P081 Beta-hexosaminidase n=63 Tax=Proteobacteria RepI... 315 2e-84 UniRef50_B0SE07 Beta-glucosidase-related glycosidase n=2 Tax=Lep... 315 2e-84 UniRef50_C9KM62 Glycosyl hydrolase domain protein n=1 Tax=Mitsuo... 314 2e-84 UniRef50_Q5WDA0 Beta-N-acetylhexosaminidase n=1 Tax=Bacillus cla... 314 4e-84 UniRef50_Q67LG3 Putative beta-N-acetylglucosaminidase n=1 Tax=Sy... 313 7e-84 UniRef50_Q11Z59 B-glycosidase, glycoside hydrolase family 3 prot... 312 9e-84 UniRef50_UPI000196C637 hypothetical protein CATMIT_02394 n=1 Tax... 312 9e-84 UniRef50_B4S1A2 Beta-hexosaminidase A n=2 Tax=Alteromonas macleo... 312 1e-83 UniRef50_B0MSB5 Putative uncharacterized protein n=1 Tax=Eubacte... 312 1e-83 UniRef50_B2SGG8 Glycosyl hydrolase family 3 n=17 Tax=Francisella... 311 2e-83 UniRef50_C1YLC2 Beta-glucosidase-like glycosyl hydrolase n=1 Tax... 311 2e-83 UniRef50_A1SM50 Glycoside hydrolase, family 3 domain protein n=7... 310 4e-83 UniRef50_C3RKZ3 Beta-hexosamidase A n=5 Tax=Bacteria RepID=C3RKZ... 309 7e-83 UniRef50_A0M1W2 Family 3 glycosyl hydrolase/beta-lactamase fusio... 309 7e-83 UniRef50_C2M7P2 Family 3 glycosyl hydrolase/beta-lactamase fusio... 309 7e-83 UniRef50_Q2NRJ7 Putative glycosidase n=1 Tax=Sodalis glossinidiu... 309 1e-82 UniRef50_B9E727 Putative uncharacterized protein n=1 Tax=Macroco... 308 2e-82 UniRef50_Q2SV66 Beta-N-Acetylglucosaminidase n=31 Tax=Burkholder... 308 2e-82 UniRef50_C6HUU9 Beta-N-acetylhexosaminidase n=1 Tax=Leptospirill... 308 2e-82 UniRef50_C5CH66 Beta-N-acetylhexosaminidase n=1 Tax=Kosmotoga ol... 307 3e-82 UniRef50_D1R988 Putative uncharacterized protein n=1 Tax=Parachl... 307 4e-82 UniRef50_D2RL81 Glycoside hydrolase family 3 domain protein n=1 ... 307 4e-82 UniRef50_A0L847 Beta-N-acetylhexosaminidase n=1 Tax=Magnetococcu... 307 5e-82 UniRef50_A1C3U3 Beta-N-acetylglucosaminidase, putative n=7 Tax=L... 306 6e-82 UniRef50_B3QRM6 Glycoside hydrolase family 3 domain protein n=1 ... 306 7e-82 UniRef50_C8WZJ0 Glycoside hydrolase family 3 domain protein n=1 ... 306 7e-82 UniRef50_Q73RR1 Glycosyl hydrolase, family 3 n=1 Tax=Treponema d... 306 9e-82 UniRef50_B5LAN7 HexI n=3 Tax=cellular organisms RepID=B5LAN7_9BURK 306 9e-82 UniRef50_C4JZX3 Predicted protein n=4 Tax=Onygenales RepID=C4JZX... 305 1e-81 UniRef50_A3HXS5 Putative hydrolase/beta lactamase fusion protein... 305 1e-81 UniRef50_B3QV76 Glycoside hydrolase family 3 domain protein n=1 ... 305 1e-81 UniRef50_D0KBJ7 Glycoside hydrolase family 3 domain protein n=6 ... 305 2e-81 UniRef50_B3EU27 Putative uncharacterized protein n=1 Tax=Candida... 305 2e-81 UniRef50_A8SRF1 Putative uncharacterized protein n=1 Tax=Coproco... 305 2e-81 UniRef50_A9F261 Put. Beta-glucosidase n=1 Tax=Sorangium cellulos... 304 2e-81 UniRef50_A1ZJ01 Beta-N-acetylglucosaminidase n=1 Tax=Microscilla... 304 2e-81 UniRef50_B6GD75 Putative uncharacterized protein n=1 Tax=Collins... 304 2e-81 UniRef50_Q3A4Z0 Putative glycosyl hydrolase n=1 Tax=Pelobacter c... 304 2e-81 UniRef50_A5FAE4 Candidate beta-glycosidase; Glycoside hydrolase ... 304 3e-81 UniRef50_Q31RD3 Beta-glucosidase-related glycosidases-like n=5 T... 304 5e-81 UniRef50_A1ZDA7 Glycosyl hydrolase, family 3 n=1 Tax=Microscilla... 303 5e-81 UniRef50_Q30Y91 Beta-N-acetylhexosaminidase n=3 Tax=Desulfovibri... 302 1e-80 UniRef50_A1VC58 Glycoside hydrolase, family 3 domain protein n=3... 302 1e-80 UniRef50_Q1IZF6 Beta-N-acetylhexosaminidase n=2 Tax=Deinococcus ... 302 1e-80 UniRef50_C5T2K1 Beta-N-acetylhexosaminidase n=1 Tax=Acidovorax d... 302 2e-80 UniRef50_A9BFQ0 Glycoside hydrolase family 3 domain protein n=1 ... 302 2e-80 UniRef50_B2UPP0 Beta-N-acetylhexosaminidase n=1 Tax=Akkermansia ... 301 2e-80 UniRef50_A2DGR7 Glycosyl hydrolase family 3 N terminal domain co... 301 2e-80 UniRef50_C4Y752 Putative uncharacterized protein n=1 Tax=Clavisp... 301 3e-80 UniRef50_A6LEE9 Glycoside hydrolase family 3, candidate beta-gly... 300 3e-80 UniRef50_C6W3E7 Beta-lactamase n=1 Tax=Dyadobacter fermentans DS... 300 4e-80 UniRef50_B2RGN3 Glycosyl hydrolase family 3 n=2 Tax=Porphyromona... 300 5e-80 UniRef50_A1RBY1 Putative beta-N-Acetylglucosaminidase n=1 Tax=Ar... 300 6e-80 UniRef50_A9BCX1 Beta-glucosidase-related glycosidase n=2 Tax=Cya... 300 6e-80 UniRef50_C9RS08 Glycoside hydrolase family 3 domain protein n=1 ... 300 7e-80 UniRef50_Q6MA50 Putative beta-N-acetylglucosaminidase n=1 Tax=Ca... 299 7e-80 UniRef50_Q6AJ70 Related to glycosyl hydrolase n=1 Tax=Desulfotal... 299 1e-79 UniRef50_B2IHJ4 Glycoside hydrolase family 3 domain protein n=3 ... 299 1e-79 UniRef50_Q7X3A0 Putative glycosylhydrolase n=1 Tax=Candidatus Fr... 298 2e-79 UniRef50_C7PJZ6 Beta-lactamase n=1 Tax=Chitinophaga pinensis DSM... 298 2e-79 UniRef50_C8PQ37 Glycosyl hydrolase, family 3 n=1 Tax=Treponema v... 298 2e-79 UniRef50_C6W0V4 Glycoside hydrolase family 3 domain protein n=2 ... 297 3e-79 UniRef50_Q8YVY6 All1831 protein n=11 Tax=Nostocaceae RepID=Q8YVY... 297 3e-79 UniRef50_D0LSI1 Glycoside hydrolase family 3 domain protein n=1 ... 297 3e-79 UniRef50_A1JQZ0 Glycosyl hydrolase family protein n=8 Tax=Entero... 297 4e-79 UniRef50_A6RET6 Putative uncharacterized protein n=1 Tax=Ajellom... 297 4e-79 UniRef50_C6X002 Glycoside hydrolase n=2 Tax=Flavobacteriaceae Re... 297 4e-79 UniRef50_B4CZU1 Beta-N-acetylhexosaminidase n=2 Tax=Verrucomicro... 297 5e-79 UniRef50_A5CLY4 Putative beta-N-acetylglucosaminidase n=1 Tax=Cl... 296 7e-79 UniRef50_D2AR90 Beta-glucosidase-related glycosidase-like protei... 296 7e-79 UniRef50_Q1NMP4 Glycoside hydrolase, family 3-like n=3 Tax=Delta... 296 7e-79 UniRef50_Q10Y93 Beta-N-acetylhexosaminidase n=1 Tax=Trichodesmiu... 296 8e-79 UniRef50_B2AR40 Predicted CDS Pa_4_8720 n=3 Tax=Leotiomyceta Rep... 295 2e-78 UniRef50_B5CPR6 Putative uncharacterized protein n=1 Tax=Ruminoc... 295 2e-78 UniRef50_B3EKH1 Glycoside hydrolase family 3 domain protein n=3 ... 294 4e-78 UniRef50_C6XX68 Glycoside hydrolase family 3 domain protein n=4 ... 293 6e-78 UniRef50_Q3B6L6 Beta-N-acetylglucosaminidase n=2 Tax=Chlorobiace... 293 7e-78 UniRef50_B8HWS2 Beta-N-acetylhexosaminidase n=5 Tax=Cyanobacteri... 292 9e-78 UniRef50_C7HX27 Beta-N-acetylhexosaminidase n=1 Tax=Thiomonas in... 292 1e-77 UniRef50_D2PXL5 Glycoside hydrolase family 3 domain protein n=1 ... 292 2e-77 UniRef50_A0LL99 Beta-N-acetylhexosaminidase n=1 Tax=Syntrophobac... 292 2e-77 UniRef50_Q2S5L3 Beta-N-acetylglucosaminidase n=2 Tax=Rhodotherma... 291 2e-77 UniRef50_B9WKL4 Beta-n-acetylglucosaminidase, putative n=4 Tax=S... 291 3e-77 UniRef50_B8NYN9 Beta-N-acetylglucosaminidase, putative n=1 Tax=A... 290 4e-77 UniRef50_UPI0001C379AD glycoside hydrolase family 3 domain prote... 290 5e-77 UniRef50_Q6MGM9 Beta-hexosamidase A n=1 Tax=Bdellovibrio bacteri... 290 6e-77 UniRef50_Q6MFP1 Related to beta-N-acetylglucosaminidase n=9 Tax=... 290 6e-77 UniRef50_D2AY03 Beta-N-acetylhexosaminidase n=2 Tax=Streptospora... 289 8e-77 UniRef50_A2P4H6 Beta-N-Acetylglucosaminidase n=9 Tax=Vibrio RepI... 289 8e-77 UniRef50_Q3AQ16 Beta-N-acetylglucosaminidase n=3 Tax=Chlorobium/... 288 1e-76 UniRef50_A3EUK5 Beta-N-acetylhexosaminidase n=2 Tax=Leptospirill... 288 1e-76 UniRef50_B3DX93 Beta-glucosidase-related glycosidase n=1 Tax=Met... 288 2e-76 UniRef50_Q11XV7 B-N-acetylglucosaminidase, glycoside hydrolase f... 288 2e-76 UniRef50_A5DD90 Putative uncharacterized protein n=2 Tax=Pichia ... 288 2e-76 UniRef50_A5EWZ9 Beta-hexosaminidase n=1 Tax=Dichelobacter nodosu... 288 2e-76 UniRef50_C8N6C8 Beta-hexosaminidase n=1 Tax=Cardiobacterium homi... 288 2e-76 UniRef50_C0CJ14 Putative uncharacterized protein n=1 Tax=Blautia... 288 2e-76 UniRef50_C5EIW3 Glycoside hydrolase n=2 Tax=Clostridiales RepID=... 287 3e-76 UniRef50_B0P7U2 Putative uncharacterized protein n=1 Tax=Anaerot... 287 3e-76 UniRef50_A4CPY8 Glycosyl hydrolase, family 3 n=1 Tax=Robiginital... 287 4e-76 UniRef50_Q6BNW2 DEHA2E18546p n=3 Tax=Saccharomycetaceae RepID=Q6... 287 4e-76 UniRef50_C0BMZ8 Glycoside hydrolase family 3 domain protein n=1 ... 287 6e-76 UniRef50_D1YED3 Glycosyl hydrolase family 3 N-terminal domain pr... 286 7e-76 UniRef50_UPI00019730D7 beta-glucosidase-like glycosyl hydrolase ... 286 7e-76 UniRef50_D1SAP6 Glycoside hydrolase family 3 domain protein n=4 ... 286 7e-76 UniRef50_A9WJ64 Glycoside hydrolase family 3 domain protein n=2 ... 286 8e-76 UniRef50_UPI0001B54569 sugar hydrolase n=1 Tax=Streptomyces sp. ... 286 9e-76 UniRef50_A6Q7E8 Glycosyl hydrolase, family 3 n=1 Tax=Sulfurovum ... 286 9e-76 UniRef50_C4XU31 Putative beta-N-acetylhexosaminidase n=1 Tax=Des... 286 1e-75 UniRef50_C3J950 Glycosyl hydrolase, family 3 n=2 Tax=Bacteria Re... 285 1e-75 UniRef50_Q1AUI8 Glycoside hydrolase, family 3-like protein n=1 T... 285 1e-75 UniRef50_D2QKL5 Glycoside hydrolase family 3 domain protein n=1 ... 285 1e-75 UniRef50_Q1IQ59 Beta-N-acetylhexosaminidase n=1 Tax=Candidatus K... 285 1e-75 UniRef50_Q5LHV9 Possible beta-N-acetylglucosaminidase/beta-lacta... 284 3e-75 UniRef50_UPI0001C334F9 beta-glucosidase-like glycosyl hydrolase ... 283 4e-75 UniRef50_C5SMQ9 Beta-N-acetylhexosaminidase n=2 Tax=Caulobactera... 283 4e-75 UniRef50_A4SCK6 Glycoside hydrolase, family 3 domain protein n=3... 283 5e-75 UniRef50_B2KAV5 Glycoside hydrolase family 3 domain protein n=1 ... 283 7e-75 UniRef50_Q5ZW95 Glycosyl hydrolase family 3 n=6 Tax=Legionella R... 283 8e-75 UniRef50_C7QCS9 Glycoside hydrolase family 3 domain protein n=6 ... 282 1e-74 UniRef50_B1YER8 Beta-N-acetylhexosaminidase n=1 Tax=Exiguobacter... 282 1e-74 UniRef50_A8ETM1 Glycosyl hydrolase n=2 Tax=Campylobacterales Rep... 282 2e-74 UniRef50_B6BGC2 B-N-acetylglucosaminidase, glycoside hydrolase f... 281 2e-74 UniRef50_Q982A1 Glycosyl hyrolase, family 3 n=2 Tax=Mesorhizobiu... 281 2e-74 UniRef50_B1ZLT8 Beta-N-acetylhexosaminidase n=61 Tax=Alphaproteo... 281 3e-74 UniRef50_C1A778 Beta-N-acetylhexosaminidase n=1 Tax=Gemmatimonas... 280 5e-74 UniRef50_A2E8H3 Glycosyl hydrolase family 3 N terminal domain co... 280 5e-74 UniRef50_Q1IHT6 Glycoside hydrolase, family 3-like n=1 Tax=Candi... 280 6e-74 UniRef50_C1F257 Glycosyl hydrolase, family 3 n=1 Tax=Acidobacter... 279 8e-74 UniRef50_C0BK44 Glycoside hydrolase family 3 domain protein n=1 ... 279 9e-74 UniRef50_A4C3R7 Beta-glucosidase-related glycosidase n=1 Tax=Pse... 279 1e-73 UniRef50_A7ZD64 Beta-hexosaminidase A (N-acetyl-beta-glucosamini... 278 2e-73 UniRef50_A9WLT0 Predicted O-Glycosyl hydrolase, family 3 n=1 Tax... 278 2e-73 UniRef50_A3HZ06 Possible beta-N-acetylglucosaminidase/beta-lacta... 278 2e-73 UniRef50_Q0BSP6 Anhydromuramoyl-peptide exo-beta-N-acetylglucosa... 278 3e-73 UniRef50_C8WFM5 Beta-N-acetylhexosaminidase n=3 Tax=Zymomonas mo... 278 3e-73 UniRef50_B3C6M6 Putative uncharacterized protein n=2 Tax=Bactero... 277 4e-73 UniRef50_UPI0001B588EB hydrolase n=1 Tax=Streptomyces sp. SPB78 ... 277 4e-73 Sequences not found previously or not previously below threshold: >UniRef50_A7FH41 Beta-hexosaminidase n=129 Tax=Gammaproteobacteria RepID=NAGZ_YERP3 Length = 343 Score = 437 bits (1125), Expect = e-121, Method: Composition-based stats. Identities = 260/339 (76%), Positives = 289/339 (85%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGPVMLDV YELDAEEREIL HPLVGGLILF+RN+HD QLRELVRQIRAAS RLVVA Sbjct: 1 MGPVMLDVASYELDAEEREILKHPLVGGLILFSRNFHDAEQLRELVRQIRAASHERLVVA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRVQRFR+GFTRLPAAQSFAA++ +LAQEAGWLMA+EM+AMDIDISFAPV Sbjct: 61 VDQEGGRVQRFRDGFTRLPAAQSFAAINDAATAAQLAQEAGWLMAAEMMAMDIDISFAPV 120 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+GH+SAAIGERS+H+DPQ+A +A FI GMH AGMKTTGKHFPGHGAVTADSHKETP Sbjct: 121 LDIGHVSAAIGERSFHSDPQQARIMAECFIRGMHSAGMKTTGKHFPGHGAVTADSHKETP 180 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D RP AEIR DM +F LI+ LDAIMPAHVIY++ D RPASGS YWL+ +LRQELG Sbjct: 181 HDNRPLAEIRTHDMVIFRELIQRKLLDAIMPAHVIYTEADARPASGSAYWLQEILRQELG 240 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F+G+IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR+GAVSVLDNLSPIKA+++ Sbjct: 241 FEGIIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRQGAVSVLDNLSPIKADQL 300 Query: 301 TRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 TRLYH G F RQ LM S+RW+ + L L ERW K Sbjct: 301 TRLYHSGQFDRQTLMASSRWQQANKALTALSERWDAHKQ 339 >UniRef50_A1SW90 Beta-hexosaminidase n=3 Tax=Alteromonadales RepID=NAGZ_PSYIN Length = 337 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 171/338 (50%), Positives = 230/338 (68%), Gaps = 5/338 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M PV+LDVEGYELD+EE+EILAHPLV G+ILFTRNY+D QL+ LV+ IR + N L++A Sbjct: 1 MRPVILDVEGYELDSEEKEILAHPLVAGIILFTRNYYDIEQLKALVKDIRRYAGNELLIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD EGGRVQRFR+ FTRLP+A S + M+ +LA + W+MASE+IA DID SFAPV Sbjct: 61 VDHEGGRVQRFRDDFTRLPSAGSLIEKNDMKTACELAFSSAWVMASELIACDIDFSFAPV 120 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+ IS I R++ + + + +A +I+GM AGM +TGKHFPGHG+V ADSH P Sbjct: 121 LDLNGISNVIQNRAFSSSITETVTLAEAYINGMKSAGMVSTGKHFPGHGSVEADSHTALP 180 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R + EI KD+ F +LI++ LDA+MP+HV+YS D +PA S YWL VLR LG Sbjct: 181 VDSRSELEIFTKDIKPFENLIKKGALDAVMPSHVVYSQCDLQPAGFSSYWLDDVLRTRLG 240 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F GV+ SDDLSM GA+ +G++ R ++++ AGCD+IL CN+R GAVS+LDNL +K Sbjct: 241 FKGVVISDDLSMHGASFVGNHLSRAESAIQAGCDLILACNDRSGAVSILDNLK-VKPTAQ 299 Query: 301 TRLYHKGSFSRQE----LMDSARWKAISTRLNQLHERW 334 ++ ++ + L + W L QL E++ Sbjct: 300 YHAVNQLRSTKNKFILPLNKNPIWIKNKQMLMQLSEQF 337 >UniRef50_Q4QLU8 Beta-hexosaminidase n=138 Tax=Gammaproteobacteria RepID=NAGZ_HAEI8 Length = 351 Score = 374 bits (962), Expect = e-102, Method: Composition-based stats. Identities = 180/348 (51%), Positives = 240/348 (68%), Gaps = 8/348 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +++D++G EL+ EE E+L+HPLV GLILFTRN+ D Q++EL+R +R + L++ Sbjct: 1 MSTLLIDLKGKELEQEEVELLSHPLVAGLILFTRNFEDREQIQELIRSVRQRVKKPLLIT 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRVQRFR+GFT LP+ Q+F E A+EAGW MA+EMIA+DID+SFAPV Sbjct: 61 VDQEGGRVQRFRDGFTMLPSMQAFQETLSATEQVSFAKEAGWQMAAEMIALDIDLSFAPV 120 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+GH AIG+RS+ +D + A+ +A FIDGMH+AGM +TGKHFPGHG V ADSH ETP Sbjct: 121 LDLGHECRAIGDRSFSSDVKSAVNLAINFIDGMHQAGMASTGKHFPGHGHVLADSHLETP 180 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R + EI D+ F LI +NKLDAIMPAHVIY D +PASGS YWLK +LR++L Sbjct: 181 YDDRAKDEIFGYDLQPFQQLISQNKLDAIMPAHVIYGQCDSQPASGSEYWLKEILRKKLN 240 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE-- 298 F G IFSDDL M+GA +MG++ ER + +L+AGCD++L+CN R+G + VLDNL + E Sbjct: 241 FQGTIFSDDLGMKGAGVMGNFVERSKKALNAGCDLLLLCNEREGVIQVLDNLKLTENEPH 300 Query: 299 ------RVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 R+ L+ + +L+ RWK L + RW + +A Sbjct: 301 FMARQARLKNLFKRSVIEWNDLVLDQRWKLNHRNLADIQSRWLDIQAA 348 >UniRef50_Q8RD12 Beta-glucosidase-related glycosidases n=2 Tax=Thermoanaerobacteraceae RepID=Q8RD12_THETN Length = 591 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 88/354 (24%), Positives = 163/354 (46%), Gaps = 38/354 (10%) Query: 1 MGPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-RNRLV 58 +G +++ ++G + + +++ +GG+ILF+ N Q+ +L I++ S + L Sbjct: 58 IGQMLMPAIDGTKATPQLEKMIKDYHLGGVILFSENIQSIEQIVKLTNDIQSLSLQIPLF 117 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 ++ DQEGG V R +G T+ P + A + LA + G ++ SE+ A+ I+ A Sbjct: 118 ISTDQEGGLVYRLSQG-TKFPGNMALGATFSED----LAYKVGKVIGSELKAVGINWDLA 172 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PV+DV + IG RS+ +P + F+ G+ E + + KHFPGHG V DSH Sbjct: 173 PVMDVNVNPYNPVIGIRSFGGNPDVVAKLGVAFMKGLQENKIASCVKHFPGHGDVDVDSH 232 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------- 222 + + + ++ F I+ DA+M AH+ + +D Sbjct: 233 LGLGTVNKNRETLEKIELKPFEEAIQAGT-DALMTAHLSFPALDDTKVIAKTSEGVQKEI 291 Query: 223 --PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 PA+ S L ++R+ G++GVI +D ++M+ + + ++ AG D+IL+ Sbjct: 292 TIPATLSHKILTDLVRKTWGYEGVIITDAMNMKAISDNFGPVDAVVRAVKAGADIILMPV 351 Query: 281 NRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHER 333 + GA + + KG S + + DS R + +L + + Sbjct: 352 DLNGAFN-----------ELVLETKKGIISEKRIDDSVRRILKLKYKLGLMETK 394 >UniRef50_C6J2I0 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J2I0_9BACL Length = 544 Score = 369 bits (948), Expect = e-101, Method: Composition-based stats. Identities = 102/337 (30%), Positives = 158/337 (46%), Gaps = 24/337 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN---- 55 +G + + D + R ++ VGG++ F RN QL EL R ++ R+ Sbjct: 12 IGQLFICGFHSLVPDEQIRTLVQQYHVGGVVYFRRNIASVEQLAELSRSLQGLPRSKPEL 71 Query: 56 RLVVAVDQEGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L +++DQEGG V R EG +R+P + A + L +E L A EM+ + I+ Sbjct: 72 PLFISIDQEGGMVARLDHEGMSRIPGNMALGAANDP----ALTEEVARLAAKEMLQLGIN 127 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 +FAP +DV + + IG RS+ P + + + G G+ T KHFPGHG Sbjct: 128 FNFAPCVDVNNNPANPVIGVRSFGEQPDRVAVHGAAAVRGYQAEGVIATAKHFPGHGDTA 187 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYW 230 DSH+ P + + ++ F I E +DAIM AHVI+ +P PA+ S Sbjct: 188 VDSHRGLASVPHGKERLYQVELVPFREAIAEG-VDAIMTAHVIFPAFEPEGTPATLSHRV 246 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 L +LR+ELG+ G+I +D L M+ A AE +++AG D++LV + + + Sbjct: 247 LTGLLREELGYGGLIVTDCLEMQAIAGYYGIAEGAVQAIEAGADLVLVSHTLAEQRAAI- 305 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTR 326 ERV G S L S R A+ + Sbjct: 306 -------ERVAEAVRSGRISEARLDRSLERILALKAK 335 >UniRef50_UPI0001693BCF Beta-glucosidase-related glycosidase n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693BCF Length = 536 Score = 366 bits (939), Expect = e-100, Method: Composition-based stats. Identities = 97/344 (28%), Positives = 163/344 (47%), Gaps = 22/344 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVV 59 VM E D + ++ +G +I F RN P Q+ L +++ + L++ Sbjct: 17 VMCGFESRVPDEQIETLITKYKLGNVIYFRRNLDTPLQVFRLSSKLQELAEKEVGIPLLI 76 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 ++DQEGG V R EG +P + AL + A ++ ++ A E+ A+ I+++FAP Sbjct: 77 SIDQEGGMVNRTHEGVVGMPGNMTLGAL----RDPQAAYDSAYVSAEELRAIGINMNFAP 132 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 LDV + + IG RSY + + + G + G+ T KHFPGHG ADSH Sbjct: 133 CLDVNNNPDNPVIGVRSYGETAELVSKLGVEAMKGYQDGGVAATIKHFPGHGDTQADSHH 192 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVL 235 P P P +R ++ F I DA+M AHVI+ ++P P++ S + +L Sbjct: 193 ALPLIPHPARRLRELELVPFQKAINAG-ADAVMSAHVIFPALEPEHLPSTLSRQVMTRLL 251 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+E+GFDGVI +D L M+ AE +++AG D++LV + V+ + Sbjct: 252 REEMGFDGVITTDCLEMKAIDDHYGVAEGAVKAIEAGVDLVLVSHTLTKQVAAI------ 305 Query: 296 KAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEK 338 + + G + + + +S R + +L + EE+ Sbjct: 306 --GAILQALESGRLTEERIDESVDRILRLKQKLKLERDSLSEER 347 >UniRef50_A3I7T1 Beta-hexosaminidase n=2 Tax=Bacillaceae RepID=A3I7T1_9BACI Length = 566 Score = 364 bits (935), Expect = 3e-99, Method: Composition-based stats. Identities = 95/342 (27%), Positives = 159/342 (46%), Gaps = 26/342 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS---RNR 56 +G +M V G L E + ++ +GGLIL+ N P Q L+ + A+ R Sbjct: 217 LGQMMFAGVSGTSLQQETKSLIRDYKIGGLILYANNLETPQQTVSLMNDLMTANGTNRLP 276 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L + DQEGG+V R P Q + + + E G L+ ++ A ++ Sbjct: 277 LFLGTDQEGGKVVRLPGPLKNFPTNQKIGQI----NQPQFSYEIGKLLGEQLKAFGFNLD 332 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+DV + IG+RS+ P + + + G+ + KHFPGHG + D Sbjct: 333 FAPVMDVNSNPNNPIIGDRSFSNKPDIVSQLGIQTMQGLESQQVIPVIKHFPGHGDTSVD 392 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH E P + ++ ++ F + I + +D +M H++ +D + P+S S + Sbjct: 393 SHLELPKVSKSLDDLNKLELIPFKAAIDKG-VDVVMVGHILLPKIDQQFPSSMSKEIITG 451 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR +LGFDGV+ +DD++M+ S + S+ AG D+IL+ + + +D L Sbjct: 452 LLRNQLGFDGVVMTDDMTMKAITNHFSIGQAAVDSVKAGNDIILIAHEFANVTTAIDALK 511 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 G S Q++ DS R R+ QL E++Q Sbjct: 512 A--------AVKNGKISEQQINDSVR------RIIQLKEKYQ 539 >UniRef50_C5DA89 Glycoside hydrolase family 3 domain protein n=6 Tax=Bacillaceae RepID=C5DA89_GEOSW Length = 698 Score = 362 bits (930), Expect = 9e-99, Method: Composition-based stats. Identities = 88/335 (26%), Positives = 164/335 (48%), Gaps = 34/335 (10%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-RNRLVVAVDQEGGRVQ 69 ++ E +E++ +GG+ILF N Q +LV + A+ + L++++DQEGG V Sbjct: 177 TDMLPEIKELIQKYHLGGIILFRENVVTTEQTAKLVADYQQAAEKFGLLISIDQEGGIVT 236 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 R + G T +P + A E LA + G + E+ A+ I+++ APVLDV + + Sbjct: 237 RLQSG-TNMPGNMALGATRSKE----LAWKVGHAIGEELHALGINMNLAPVLDVNNNPDN 291 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 IG RS+ +P+ + +I G+ ++G+ T KHFPGHG DSH P + Sbjct: 292 PVIGVRSFGENPELVAELGVSYIKGLQDSGVAATAKHFPGHGDTAVDSHLGLPEVSHDRD 351 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGSPYWLKTV 234 ++ ++ F + E +DA+M AHV + VD PA+ S L + Sbjct: 352 RLQKVELYPFQKAM-EAGIDAVMTAHVTFPKVDDTKVISQKDGTAISLPATLSYKVLTEL 410 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 +R+E+GFDGVI +D ++M+ + + ++ AG D++L+ + Sbjct: 411 MREEMGFDGVIITDAMNMKAISDHFGPVDAAVRAVQAGADIVLMPVGLEEV--------- 461 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 A + + G S++ + +++ + ++ ++ + Sbjct: 462 --ANGLKKAVQNGGISQERI-NASVKRILTLKVKR 493 >UniRef50_Q8ETJ0 Beta-hexosaminidase n=1 Tax=Oceanobacillus iheyensis RepID=Q8ETJ0_OCEIH Length = 424 Score = 362 bits (930), Expect = 1e-98, Method: Composition-based stats. Identities = 98/342 (28%), Positives = 170/342 (49%), Gaps = 26/342 (7%) Query: 1 MGPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS---RNR 56 +G +++ ++G E++ + ++++ + VGG+ILF N AQ L+ I+ + + Sbjct: 57 IGQLIVGGIDGTEINTDTKDMIENYHVGGVILFADNIESKAQTVNLMNDIKQVNENNPHP 116 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++ VD+EGG V R + T LP + S E LA E G L+ +M + +++ Sbjct: 117 LLLGVDEEGGSVTRMPDEITSLPPSGSIGKAQDEE----LAFEVGILLGQQMRGLGFNLN 172 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLDV + IG+RS+ P + + + G+ G+ + KHFPGHG + D Sbjct: 173 FAPVLDVNSNPDNPVIGDRSFGDSPDIVTDMGIQSMKGIQSEGIISVVKHFPGHGDTSED 232 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH + P + E+ ++ F I + D +M AH++ +D PAS S + Sbjct: 233 SHLKLPKVDKSIEELSEVELVPFEKAISKG-ADVVMTAHILLPQIDTEYPASMSEKIISG 291 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR+++GFDGV+ SDDL+M E S+ AG DM+L+ ++ VSV Sbjct: 292 LLREDMGFDGVVISDDLTMGAITENIRIEEAAIQSVKAGSDMVLIAHHPDVVVSV----- 346 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 +++ G S ++ +S R+ QL +++ Sbjct: 347 ---HKKLKAAVQNGVISEAKIDESV------ERIIQLKRKYK 379 >UniRef50_D2MLA9 Beta-N-acetylglucosaminidase n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MLA9_9BACT Length = 354 Score = 362 bits (930), Expect = 1e-98, Method: Composition-based stats. Identities = 108/328 (32%), Positives = 165/328 (50%), Gaps = 18/328 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-RLVVAVD 62 +M+ +G EL + L GG ILF+RN DP Q+ L ++A + N L++A+D Sbjct: 11 LMVGFQGTELSRAFLDWLQECRPGGAILFSRNLVDPEQVARLTNALQAHAPNSPLLIAID 70 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 +EGGRV R +GFT PAA AA + A + A E+ A+ I+++ APVLD Sbjct: 71 EEGGRVSRLPQGFTAFPAASKVAACHSADT----AYGTAEVTAQELRAVGINMNLAPVLD 126 Query: 123 VGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG+R+Y P++ + G+ + G+ GKHFPGHG DSH P Sbjct: 127 VNSNPDNPIIGDRAYGTQPERVCTYGMAVMQGLEDNGIIPCGKHFPGHGDTMTDSHHALP 186 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQEL 239 +A + A ++ F IR L AIM AHV Y +D PA+ S L +LR +L Sbjct: 187 VVEADRARLDAVELEPFRRAIRRG-LPAIMTAHVRYPALDADAPATLSQPILTDLLRNQL 245 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 GF+GV +DD+ M S E +L AG DM+++C+ ++ + Sbjct: 246 GFEGVTLTDDMEMRAILDHQSIGEASVRALQAGADMVVICHQQERQQEAVSA-------- 297 Query: 300 VTRLYHKGSFSRQELMDS-ARWKAISTR 326 + + +G S ++L S AR +A+ + Sbjct: 298 IEQALERGELSWEKLTASVARIRALKKQ 325 >UniRef50_Q3SKU2 Beta-hexosaminidase n=27 Tax=Proteobacteria RepID=NAGZ_THIDA Length = 359 Score = 362 bits (930), Expect = 1e-98, Method: Composition-based stats. Identities = 153/295 (51%), Positives = 198/295 (67%), Gaps = 4/295 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+MLDV G EL ++R L+ PLVGG+ILF+RNY DPAQL L +I A L++ Sbjct: 3 LGPLMLDVAGTELCDDDRRRLSDPLVGGVILFSRNYRDPAQLAALCEEIHALKAAPLLIG 62 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR+GFTRLP +S A+ + +LA+E G+++ASE+ A +D SFA Sbjct: 63 VDHEGGRVQRFRDGFTRLPPMRSLGAIWNEHPQRARELARETGYVLASELRAHGVDFSFA 122 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + S+ IG+R++HADPQ + + GM +AGM GKHFPGHG V ADSH Sbjct: 123 PVLDLDYGASSVIGDRAFHADPQAVFQLGQAVMLGMKDAGMAACGKHFPGHGFVIADSHV 182 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 + P D R +I D+ F L+ E L A+MPAHVIY +VD +PA S WL+ +LRQ Sbjct: 183 DIPVDRRTLGDIARADLVPFR-LMVEAGLAAMMPAHVIYPEVDAQPAGFSRVWLQKILRQ 241 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LGFDG +FSDDL M GAA G ER +A+LDAGCDM LVCN+ + A VL L Sbjct: 242 QLGFDGAVFSDDLCMAGAAFAGGIVERVRAALDAGCDMALVCNHPELADEVLAKL 296 >UniRef50_Q8CX63 Beta-N-acetylhexosaminidase (Beta-hexosaminidase) n=1 Tax=Oceanobacillus iheyensis RepID=Q8CX63_OCEIH Length = 529 Score = 361 bits (927), Expect = 2e-98, Method: Composition-based stats. Identities = 95/331 (28%), Positives = 161/331 (48%), Gaps = 26/331 (7%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-----RLVV 59 ++ +G + ++++ +GG+ILFTRN P ++ L ++ +R+ L++ Sbjct: 11 VIGFDGKVVPKSIKKLIHDYHIGGIILFTRNIGSPDEVLRLTASLQKEARDAGYERPLLI 70 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 VDQE G V+R EG T P + A E A + G+ +E+ A+ I+ + AP Sbjct: 71 CVDQENGTVRRLGEGTTIFPGGMTIGASDSEEN----AYQVGYASGTELRALGINWNLAP 126 Query: 120 VLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 V+DV + IG RSY P++ +A + GM +AG+ TT KHFPGHG DSH Sbjct: 127 VVDVNVNPDNPVIGVRSYGESPERVARLAKAAVKGMQDAGVVTTLKHFPGHGDTKVDSHL 186 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTV 234 P P + ++ F + + D IM AH+ + ++P +PA+ S L + Sbjct: 187 ALPVIPHSIERLEEIELRPFQASLES---DVIMSAHIYFPALEPEADKPATLSKAVLTGL 243 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LR++LG+ G+I +D + M + + A+L AG D+ +V + + V L Sbjct: 244 LREKLGYTGLITTDCMEMNAISETIGTEKGVVAALQAGVDLAMVSHTLQRQVGAL----- 298 Query: 295 IKAERVTRLYHKGSFSRQELMDSA-RWKAIS 324 + V G SR+ ++ SA R + + Sbjct: 299 ---QEVKAAVESGELSRESILTSAERIRRVK 326 >UniRef50_A9NEX4 Glycoside hydrolase, family 3 n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEX4_ACHLI Length = 523 Score = 361 bits (926), Expect = 3e-98, Method: Composition-based stats. Identities = 96/338 (28%), Positives = 161/338 (47%), Gaps = 24/338 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----N 55 +G +++ G + E ++ VG ++LF+RN AQL +L + ++ + Sbjct: 15 IGQLIVAGFHGTTVGPEIENLIREYKVGNILLFSRNIESAAQLFKLNQDLQKIAMETFGI 74 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L + +DQEGG V R E T P A + AA + +E A + G LM E+I + I++ Sbjct: 75 PLYITIDQEGGMVTRIFEDGTFFPGAMTIAATNNVE----YAYKIGDLMGKELIDLGINM 130 Query: 116 SFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 APVLDV + + IG RS+ DP K + I G+ + T KHFPGHG Sbjct: 131 DLAPVLDVNNNPKNPVIGVRSFSDDPDKVALFGNANIRGLQ-NHVVATAKHFPGHGDTHV 189 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWL 231 DSH P +P E+ + F + I + L AIM AH+ ++ + D P + S L Sbjct: 190 DSHLALPKVEKPLNELEKFEFKPFKAAINDG-LHAIMSAHINFTHLTEDGLPVTLSKRAL 248 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR+E+GF+G+I +D + M+ E +++AG +++ +C++ + Sbjct: 249 TGLLREEMGFEGLIITDGIEMKAIHDNYGTIEATLKTVNAGANLVCICHDLPYQIGA--- 305 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLN 328 ++R + G + L + AR +LN Sbjct: 306 -----SDRFNKALETGELTMDTLNERVARILTYKEKLN 338 >UniRef50_B0K1P2 Glycoside hydrolase, family 3 domain protein n=9 Tax=Clostridia RepID=B0K1P2_THEPX Length = 520 Score = 360 bits (924), Expect = 5e-98, Method: Composition-based stats. Identities = 98/340 (28%), Positives = 169/340 (49%), Gaps = 27/340 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR----AASRNRLVV 59 +M+ D ++++ +G +ILF+RN QL+ L I+ + +V Sbjct: 11 LMVGFPSPTYDDHVEKLVSEYKIGNIILFSRNVKGKEQLQNLCNNIQMNIIKHTGIPALV 70 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A+DQEGG V R + T LP + AA + +E A + G + E+ A+ I+I+FAP Sbjct: 71 AIDQEGGMVTRIYKDATYLPGNMAIAATNEVEN----AYKIGEIAGKELRALGININFAP 126 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLD+ + + IG RSY +P+K +I G+ + + T KHFPGHG + DSH Sbjct: 127 VLDINNNLSNPVIGVRSYGENPEKVAEFGINYIKGLQKENVIATAKHFPGHGDTSVDSHL 186 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVL 235 + P + + ++ F I EN ++AIM AH+++S + + PA+ S L +L Sbjct: 187 DLPLVSYNKERLYRIELYPFKKAI-ENGVEAIMTAHILFSALEENKLPATLSHNILTNIL 245 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+EL F+G+I +D + M A A+ ++ AG D++LV + +K + + Sbjct: 246 RKELRFNGLIITDCMEMNAIAKYFGTAKAAAMAIKAGADIVLVSHTKKLQIEAFNE---- 301 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 + +G S + + +S R+ QL E+++ Sbjct: 302 ----IKEAVLRGEISIERIDESV------ERIIQLKEKYK 331 >UniRef50_Q7PC47 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase family 3 n=2 Tax=Gammaproteobacteria RepID=Q7PC47_SACD2 Length = 344 Score = 359 bits (923), Expect = 7e-98, Method: Composition-based stats. Identities = 169/345 (48%), Positives = 226/345 (65%), Gaps = 19/345 (5%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GPV+ D+ G L E+ ++ +PL+GGLILFTRNY P+QL LV+QIR+ R +++A Sbjct: 9 LGPVIADIAGQTLSDEDIALIKNPLIGGLILFTRNYSTPSQLDALVKQIRSV-RADIILA 67 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFREGFTR+PA Q FA+ E LA GWLMASE+ A DIDISFA Sbjct: 68 VDHEGGRVQRFREGFTRIPAMQVFASAYKARAELTLALACNTGWLMASELRAYDIDISFA 127 Query: 119 PVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV S+ IG+R++ +DP+ A+A FIDGM +AGM TGKHFPGHG+V ADSH Sbjct: 128 PVLDVDDSFSSIIGDRAFSSDPKAVTALAGAFIDGMQQAGMACTGKHFPGHGSVRADSHL 187 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E P D R I D+ F+ L ++KLDA+MPAH+++ +VD +P S WL+ +LR Sbjct: 188 ELPVDYRSLEAIEQLDLMPFAKL--QSKLDAVMPAHILFPEVDDQPVGFSSVWLQKILRD 245 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ++ +DGVIFSDDL+MEGAA+ GS+ ER ++ AGCD +LVCNNR+ + V+ L+ Sbjct: 246 KMAYDGVIFSDDLTMEGAAVAGSFGERAIKAMSAGCDTLLVCNNREATLEVIQTLA---- 301 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLH--ERWQEEKAG 340 G++S + R KA + + LH +RWQE K Sbjct: 302 -------DNGNYSTSIRLTRMRGKAGAQPIYDLHNNKRWQETKEA 339 >UniRef50_C7IJM6 Glycoside hydrolase family 3 domain protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IJM6_9CLOT Length = 448 Score = 357 bits (917), Expect = 3e-97, Method: Composition-based stats. Identities = 91/329 (27%), Positives = 165/329 (50%), Gaps = 20/329 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR---AASRNRLVVA 60 V++ ++GYE D R+++ VGG ILFT N + Q+ EL+ ++ A++ L ++ Sbjct: 107 VIVGLDGYEADEHARQMIEEYRVGGFILFTSNIQNSNQMLELLNSLKNINRANKVPLFLS 166 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD+EGGRV R + F ++P+ Q L+ ++ + G ++ E+ + +++ FAPV Sbjct: 167 VDEEGGRVSRLPDEFLKIPSNQIIGKLNN----SSVSHQVGSIIGEELKSFGMNMDFAPV 222 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ + IG+R++ P + + + G+ + KHFPGHG + DSH Sbjct: 223 LDINSNPKNPIIGDRAFGTGPNLVSKLGVQTMKGLQSQNIIPVVKHFPGHGDTSTDSHVG 282 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQ 237 P +R ++ F+ I EN +A+M AH++ ++DP PAS S + VLR Sbjct: 283 LPKVDNDLKRLRNFELKPFARAI-ENGAEAVMVAHILLPEIDPDNPASFSKTIINDVLRL 341 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 E+ +DGV+ +DD +M + + S+ AG D++LVC++ + +V+ L Sbjct: 342 EMNYDGVVITDDFTMGAIVKNYNIGQAAVKSILAGGDIVLVCHDFEKQKAVISALKD--- 398 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTR 326 G + D + + + + Sbjct: 399 -----AAQSGQLPMDRI-DRSVIRILRLK 421 >UniRef50_A5CZV7 Beta-glucosidase-related glycosidases and D-alanyl-D-alanine dipeptidase n=4 Tax=Bacteria RepID=A5CZV7_PELTS Length = 1139 Score = 357 bits (916), Expect = 5e-97, Method: Composition-based stats. Identities = 98/348 (28%), Positives = 158/348 (45%), Gaps = 40/348 (11%) Query: 4 VMLDV------EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-NR 56 +M+ + E++ E + +GG+ILF N Q LV Q++ AS Sbjct: 577 LMVSFRQWKSKDVTEINGEIAATIKKYHLGGVILFRENVAGTGQTAGLVAQLQKASGDIP 636 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L +AVDQEGGRV R + G T +P + A E+ + E+ A+ I++ Sbjct: 637 LFIAVDQEGGRVVRLQTG-TVMPGNMALGAAGSPED----TYAVAKAIGEELNALGINVD 691 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 APV+DV + IG RS+ +DPQ + +I+G+ +AG+ T KHFPGHG D Sbjct: 692 LAPVVDVNINPDNPVIGIRSFGSDPQLVADMGVAYINGLRDAGVIATVKHFPGHGDTDVD 751 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR------------ 222 SH P P A ++ ++ F + K+D +M AHV + +D Sbjct: 752 SHLGLPSVPHGPARLKTVELKPFQQAFDQ-KVDMVMTAHVTFPAIDNTKAISKNDGREIS 810 Query: 223 -PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 PA+ S L ++R+E+GFDGVI +D L M+ A E ++ AG D+ L+ + Sbjct: 811 IPATLSHKVLTDLVRKEMGFDGVIITDALEMKAIADHFGPREAVIGAVKAGADIALMPAD 870 Query: 282 RKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 A + L G + +S + + I ++++ Sbjct: 871 LDQAYNGL-----------LAAVRSGEIPESRIDESVK-RLIRLKISK 906 >UniRef50_Q31G32 Beta-hexosaminidase n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=NAGZ_THICR Length = 357 Score = 356 bits (915), Expect = 5e-97, Method: Composition-based stats. Identities = 145/328 (44%), Positives = 198/328 (60%), Gaps = 8/328 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G +M+D+E L E+E L PLV G+ILF+RNY QL+ L +I +L++A Sbjct: 11 LGCIMVDLESTTLQPHEKERLLDPLVAGVILFSRNYESIEQLQALTTEIHQLRHPKLLIA 70 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRF+EGF+ LPA +EG +LAQ+ GWLMA+E++A+ +D SFA Sbjct: 71 VDHEGGRVQRFKEGFSMLPAMGQLGKCFRANEKEGLELAQQVGWLMATELLAVGVDFSFA 130 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PV+D+ + S IG+R++ +DP + + GM +AGM + KHFPGHG + AD+H Sbjct: 131 PVVDLDYGDSRVIGDRAFDSDPVIVGKLGKALVQGMRDAGMASVAKHFPGHGYIQADTHL 190 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E D R EI KD+ F LI EN LDA+MPAHV Y VD PA S WL+ VLRQ Sbjct: 191 EVAVDHREFQEIAHKDIQPFLKLI-ENGLDAVMPAHVRYPKVDDLPAGFSKVWLQEVLRQ 249 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS---- 293 + FDG I SDD+SM A G R A+L AGCD++LVCN+ A VL ++ Sbjct: 250 QCYFDGAIISDDMSMHAATEFGDAPTRVTAALKAGCDLVLVCNDPVAADEVLSQVTWETG 309 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWK 321 P+ R+ RL+ KG +L ++ W+ Sbjct: 310 PLSHARLIRLHGKGHLKLSQLHNNPLWQ 337 >UniRef50_B9Y750 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y750_9FIRM Length = 571 Score = 356 bits (914), Expect = 8e-97, Method: Composition-based stats. Identities = 93/323 (28%), Positives = 164/323 (50%), Gaps = 15/323 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA----ASRNRLVV 59 V++ E E+D + L +G +I++ +N + Q L Q+++ A++ + Sbjct: 59 VIVSFETTEVDTKTEAWLRTNKIGNVIVYAKNIENAEQATMLTGQLQSTIRGATQIPAFI 118 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +DQEGG V R REG T P+ + AA + W MA E+ M +++FAP Sbjct: 119 GIDQEGGMVNRVREGVTIFPSPMAIAA-----GRHENLYSLAWSMADELSGMGFNMNFAP 173 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV + I RSY DP+ + AS +I G+ E G+ + KHFPGHG + DSH Sbjct: 174 VLDVNSNPDNPVIHLRSYSDDPKAVASFASTWIKGLQEGGVVSVAKHFPGHGDTSEDSHF 233 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVL 235 P + +++ ++ F + IR + + AIM +H+++ ++ PA+ S + +L Sbjct: 234 ALPTVNKTLDQLKETELIPFETAIR-SGVSAIMTSHILFPKIEKEKIPATLSKTIITDLL 292 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 + ELG++G++ SD L M+ AE ++ AG DM+++ + + D +P+ Sbjct: 293 KDELGYNGIVISDSLQMDAIQSHYGMAEAAVQAIQAGVDMLILGDGKVLQPDSEDVQTPV 352 Query: 296 KAERVTRLYHKGSFSRQELMDSA 318 E + ++G+ S + L D+ Sbjct: 353 -IEALIEAVNQGTISAERLDDAV 374 >UniRef50_C4KZ16 Beta-N-acetylhexosaminidase n=2 Tax=Firmicutes RepID=C4KZ16_EXISA Length = 649 Score = 354 bits (909), Expect = 3e-96, Method: Composition-based stats. Identities = 98/355 (27%), Positives = 163/355 (45%), Gaps = 35/355 (9%) Query: 1 MGPVML-------DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G +++ L E ++ +GG+ILF N + Q +LV ++ Sbjct: 55 IGQMIMPDFRLWNGANHTSLAPEVARVIDRFDLGGVILFAENVKETEQTTKLVHDLQEVV 114 Query: 54 R------NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASE 107 + L V +DQEGG V R G T LP + A + A +AG ++ SE Sbjct: 115 KQDASNDIPLFVTIDQEGGIVTRLGTG-TNLPGNMALGAT----RNRQYAYDAGEIIGSE 169 Query: 108 MIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHF 165 + A+ ++++F PVLDV + + IG RS+ +DP+ + S G+ + G+ T KHF Sbjct: 170 LHALGVNVNFGPVLDVNNNPGNPVIGVRSFSSDPELVGELGSAMTQGIQDQGVAATAKHF 229 Query: 166 PGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD----- 220 PGHG DSH P + E+R ++ F I E +D IM AH+ ++ Sbjct: 230 PGHGDTAVDSHYGLPIVDKSLDELRGLELLPFKRAITEG-IDMIMTAHIGMPQIEDEVVE 288 Query: 221 ------PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCD 274 P PA+ S + VLR+ELG++G++ +D L+M+ A + AE + +AG D Sbjct: 289 SERGTFPLPATLSDDVITGVLREELGYEGIVVTDALNMQAIADNFTEAEAVIKTFEAGVD 348 Query: 275 MILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + L+ + V+ L I E V G S + +S + + + + Sbjct: 349 IALMPTILRSEADVIK-LEAIFEE-VIAAVKDGRLSEATIDESVE-RILKLKAER 400 >UniRef50_Q0HJG7 Beta-hexosaminidase n=35 Tax=Gammaproteobacteria RepID=NAGZ_SHESM Length = 342 Score = 353 bits (907), Expect = 4e-96, Method: Composition-based stats. Identities = 170/345 (49%), Positives = 217/345 (62%), Gaps = 10/345 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +MLD+ ++ E E+L HP VGGLILF+RN+ QL LV+QIR R +++A Sbjct: 1 MSYLMLDLLSLDVSEAEAEMLRHPQVGGLILFSRNFSSREQLIALVQQIRQI-RPEILIA 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSF--AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFREGFT +PA AA M + A E G+LMA E++A DID+SFA Sbjct: 60 VDHEGGRVQRFREGFTLIPAMGDILPAAKGDMVLAKRWACELGFLMAIELLACDIDLSFA 119 Query: 119 PVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 PVLD+ IS IG+RS+ A P + +A+A FI+GM EAGM GKHFPGHG+V ADSH Sbjct: 120 PVLDLNGISQVIGKRSFSAKPDEVIALAQSFIEGMAEAGMGAVGKHFPGHGSVAADSHIA 179 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P D R I +D+ F LI + KL IMPAHVIY VDP PA S YWLK +LR+E Sbjct: 180 QPIDEREAEAIFNQDILPFKELIAKGKLSGIMPAHVIYPKVDPNPAGFSSYWLKQILRKE 239 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL---SPI 295 LGF+GVIFSDDL M+GAA G Y R QA+LDAGCDMILVCN+ G +S+L+ + Sbjct: 240 LGFNGVIFSDDLGMKGAAFAGDYLGRAQAALDAGCDMILVCNDNPGVMSLLNGFVWPAAA 299 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 + L + + L +++RW+ QL E+ Q + Sbjct: 300 PQHPASLLKPNAAQTAIALENASRWENA----KQLAEQIQLAQQA 340 >UniRef50_D2M1Q8 Glycoside hydrolase family 3 domain protein n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2M1Q8_BACS4 Length = 518 Score = 353 bits (907), Expect = 5e-96, Method: Composition-based stats. Identities = 82/345 (23%), Positives = 156/345 (45%), Gaps = 30/345 (8%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN---- 55 +G + + EG ++ + +E++ VG +ILF+RN P ++ L +++ ++ Sbjct: 8 IGQMFVVGFEGKQVTEQMKELIHEYRVGAIILFSRNIGSPEEVLTLTTELQKEAKIARHK 67 Query: 56 -RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L++ +DQE G V+R +G T P A + A A + G E+ A+ I+ Sbjct: 68 HPLLICIDQENGIVRRLGDGTTVFPGAMALGATQKPNH----AYDIGVATGRELKALGIN 123 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 + APVLDV + + IG RS+ + + GM A + TT KHFPGHG Sbjct: 124 WNLAPVLDVNNNPKNPVIGVRSFGESAELVSQFGIAAMKGMQAAEVVTTLKHFPGHGDTD 183 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPY 229 DSH P + +++ F + + + D +M AH+ + +++ PA+ S Sbjct: 184 VDSHVALPVITHDLNRLEEIELAPFKACM-DANADVVMSAHIHFPEIEKDPQVPATLSKA 242 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + +LR++L +DGV+ +D + M+ + A++ AG D++++ + + L Sbjct: 243 IITGLLREKLQYDGVVTTDCMEMDAISETIGTERGAVAAIVAGVDLVMISHIANRQIGAL 302 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 +S G + + R+ +L E++ Sbjct: 303 KEVS--------TAVSNGELDMSLIEQA------FERVMKLKEKY 333 >UniRef50_Q1QVU0 Glycoside hydrolase, family 3-like protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QVU0_CHRSD Length = 340 Score = 353 bits (906), Expect = 6e-96, Method: Composition-based stats. Identities = 163/336 (48%), Positives = 214/336 (63%), Gaps = 8/336 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G VMLD+EG +L EER +L P VGG+ILF RN D Q+R L R+IR R +++A Sbjct: 5 LGTVMLDIEGTQLGDEERRLLERPEVGGVILFARNTRDAEQVRRLTREIREL-RPDMLLA 63 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +DQEGGRVQR REG TRLP+ + AA +E EAGWL+ EM A D++FA Sbjct: 64 IDQEGGRVQRLREGVTRLPSMAALAAGYADAPDEVRSRVHEAGWLLGMEMAACGFDVTFA 123 Query: 119 PVLDVG-HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV S AIG+RS+ ADP A+ FI+G+HEAGM GKHFPGHG VT DSH Sbjct: 124 PVLDVDDQRSPAIGDRSFSADPTVVAALGEAFIEGLHEAGMVAVGKHFPGHGGVTLDSHH 183 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D RP + +R D+ F +L KLDA+MPAHV+Y+ D RPA SP WL +LR+ Sbjct: 184 ALPEDNRPLSVLREHDLVPFKAL--SGKLDAMMPAHVVYTAFDTRPAGFSPSWL-GMLRE 240 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 E+ F GV+FSDDLSM GA + G+ A R +A+ AGCDM+LVCN+R A+ ++D + + Sbjct: 241 EMAFKGVVFSDDLSMAGAHVAGTPAARAEAAWSAGCDMVLVCNDRAAALEIVDAAAGRTS 300 Query: 298 ERVTRLYHKGSFSR-QELMDSARWKAISTRLNQLHE 332 +R+ +L + + + L ARW+ RL L E Sbjct: 301 KRLGKLRYGRARPELETLPALARWRRAHARLEALSE 336 >UniRef50_C1XLF1 Beta-glucosidase-like glycosyl hydrolase n=2 Tax=Meiothermus RepID=C1XLF1_MEIRU Length = 511 Score = 353 bits (906), Expect = 7e-96, Method: Composition-based stats. Identities = 103/340 (30%), Positives = 169/340 (49%), Gaps = 18/340 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +M+D+ G LDA R L GG+ LF +N +P Q+R+LV +IRA + +A+DQ Sbjct: 11 MMVDIPGPTLDAATRAHLERYRFGGVCLFRKNIQNPEQVRQLVAEIRAVLGPQAWIAIDQ 70 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG VQR E P+ + A+ + K A+ G + +I++ I+ +FAP +DV Sbjct: 71 EGGAVQRVLEFALA-PSPMALGAVGEV----KTAEAVGAAVGRALISLGINWNFAPSVDV 125 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG+RS+ +DP+K ++ + G+ +AG+ KHFPGHG DSH E P Sbjct: 126 NTNPQNPVIGDRSFGSDPKKVARLSLAWARGLEQAGVMAAIKHFPGHGDTFLDSHLELPV 185 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 + E+ ++ F + E ++ +IM AH+ + +D + PA+ S L LR++LG Sbjct: 186 VNKTLEELERTELYPFRKAV-EAQISSIMSAHIRFPALDKQHPATLSKKILTGFLRKKLG 244 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F G+I +D L M+ + E SL AG DMIL + ++ +A + Sbjct: 245 FQGIIVTDALDMKAISKNYPIGEAAVKSLQAGADMILSLGQPEVHIA--------QATAI 296 Query: 301 TRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEKA 339 + GS S ++ S R + R + + + Sbjct: 297 QQALENGSLSEEQARQSQLRLLEAALRFPGQAQPYPPAQQ 336 >UniRef50_P40406 Uncharacterized lipoprotein ybbD n=5 Tax=Bacillus RepID=YBBD_BACSU Length = 642 Score = 352 bits (905), Expect = 7e-96, Method: Composition-based stats. Identities = 85/335 (25%), Positives = 148/335 (44%), Gaps = 25/335 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-RNRLVVAVDQEGGRVQ 69 +++ E ++ GG+ILF N Q +L + AS + L++++DQEGG V Sbjct: 71 TKMNDEVASLVKKYQFGGIILFAENVKTTKQTVQLTDDYQKASPKIPLMLSIDQEGGIVT 130 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 R EG T P + A A + G ++ E+ A+ I+ F+PV+D+ + + Sbjct: 131 RLGEG-TNFPGNMALGAARSRIN----AYQTGSIIGKELSALGINTDFSPVVDINNNPDN 185 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 IG RS+ ++ + + + G+ + + KHFPGHG DSH P Q Sbjct: 186 PVIGVRSFSSNRELTSRLGLYTMKGLQRQDIASALKHFPGHGDTDVDSHYGLPLVSHGQE 245 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGSPYWLKTV 234 +R ++ F I + D +M AHV + D PA+ S + + Sbjct: 246 RLREVELYPFQKAI-DAGADMVMTAHVQFPAFDDTTYKSKLDGSDILVPATLSKKVMTGL 304 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LRQE+GF+GVI +D L+M+ A E ++ AG D+ L+ + + Sbjct: 305 LRQEMGFNGVIVTDALNMKAIADHFGQEEAVVMAVKAGVDIALMPASVTSLKE--EQKFA 362 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + G Q++ +S + IS ++ + Sbjct: 363 RVIQALKEAVKNGDIPEQQINNSVE-RIISLKIKR 396 >UniRef50_B5Y6T8 Beta-N-Acetylglucosaminidase n=2 Tax=Firmicutes RepID=B5Y6T8_COPPD Length = 392 Score = 352 bits (905), Expect = 8e-96, Method: Composition-based stats. Identities = 97/340 (28%), Positives = 168/340 (49%), Gaps = 21/340 (6%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS---RNR 56 +G + ++ G + A+ ++ VGG+ILF+ N D QL L+ I+A + + Sbjct: 38 IGQMFIVSFTGTNVSADLITLVHDYHVGGVILFSENIQDDKQLLALLNGIKALNEGNKLP 97 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 ++++ DQEGGR+ R T+ P+ + L+ + G ++ +EM A ++ Sbjct: 98 ILISTDQEGGRISRLPSRATKFPSNLIIGKRNSAT----LSYQIGSILGTEMKAYGFNMD 153 Query: 117 FAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLDV + IG+RS+ ++P + + G+ +G+ KHFPGHG + D Sbjct: 154 FAPVLDVVTNPQNKVIGDRSFGSNPDLVSTLGVATMKGISSSGVIPVVKHFPGHGDTSVD 213 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLK 232 SH+E P +R+ ++ F I +N +M AH+ +++VDP PAS S ++ Sbjct: 214 SHQELPVSNATLPHLRSFELLPFEKAI-QNGASVVMVAHIKFANVDPSGLPASLSEVFIS 272 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR ELGF GV+ +DDL ME S + ++ AG DM+L+C++ ++ L Sbjct: 273 DILRNELGFSGVVITDDLHMEAITKHYSVGDAAIKAVQAGADMVLICHSLDEQKQAINAL 332 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 G S + + +S + A+ R L + Sbjct: 333 --------VHAVKTGQISEERINESIKRIAMLKRHYDLTD 364 >UniRef50_A4C9R3 Beta-hexosaminidase n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C9R3_9GAMM Length = 338 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. Identities = 152/336 (45%), Positives = 200/336 (59%), Gaps = 6/336 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGP++ + G L EE E L HPLV G+ILFTRNYHD QL L QI+ S L+++ Sbjct: 1 MGPILTGISGLSLLPEEVEYLQHPLVAGVILFTRNYHDNEQLIALTAQIKKIS-PMLLIS 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD EGGRVQRFR+GF+ +PA + LA + G+ +A+E+ A +D SF PV Sbjct: 60 VDHEGGRVQRFRDGFSAIPAMFEATSFQKKHNSPILA-DLGFTLAAELRAHGVDFSFTPV 118 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+ IS I R++ P + + A FI G+H AG+K KHFPGHG+V DSH P Sbjct: 119 LDLNGISQVITTRAFAISPIEVIDAAKHFIGGLHSAGVKNVVKHFPGHGSVGPDSHIAMP 178 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R +AEI DMSVF +L+ D +MPAHVIY DP+PA S YWLK VLR +LG Sbjct: 179 VDERSKAEIFNSDMSVFKALVDSGCADGVMPAHVIYPHCDPKPACFSDYWLKQVLRNKLG 238 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD----NLSPIK 296 F G +FSDD+ M+GA MG Y R Q++LDAGCD+ L+CN G VLD NL Sbjct: 239 FKGPVFSDDMGMQGAVQMGDYVTRTQSALDAGCDLALLCNEPHGLYEVLDGLEINLYQNH 298 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 +R K + + EL RW A + ++++ + Sbjct: 299 GQRGLAYLQKKNVTMMELKTDHRWLAAQSVISKMED 334 >UniRef50_UPI0001C42D84 glycoside hydrolase family 3 domain protein n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42D84 Length = 699 Score = 351 bits (901), Expect = 2e-95, Method: Composition-based stats. Identities = 95/334 (28%), Positives = 163/334 (48%), Gaps = 34/334 (10%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-RNRLVVAVDQEGGRVQR 70 E+ E +++ +GG+ILF N Q +LV +AAS + L++ +DQEGG V R Sbjct: 177 EMLPEIEQLVKDYHLGGVILFRENVVTTEQTTKLVADYQAASEKFGLLMTIDQEGGIVTR 236 Query: 71 FREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SA 128 + G T +P + A E LA+ G ++ SE+ ++ I+++FAPV+DV + + Sbjct: 237 LQSG-TDMPGNMALGATRSPE----LAENVGRVIGSELHSLGINMNFAPVMDVNNNPDNP 291 Query: 129 AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAE 188 IG RS+ DP+ + + G+ +G+ T KHFPGHG DSH P P Sbjct: 292 VIGVRSFGEDPKLVADMGVAYTHGLQSSGVAGTAKHFPGHGDTAVDSHLGLPEVPHDIER 351 Query: 189 IRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGSPYWLKTVL 235 +R ++ F + + +DAIM AHV + ++D PA+ S L ++ Sbjct: 352 LREVELYPFQQAM-DAGIDAIMTAHVTFPNIDDTKVISQKTGEEIAIPATLSYKVLTELM 410 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+++GFDGVI +D L+M A + ++ AG D+IL+ LS + Sbjct: 411 REDMGFDGVITTDALNMLAIADHFGPVDAAVRAVQAGSDIILMP----------VGLSEV 460 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + G + + + +S + ++ +L + Sbjct: 461 RNG-LLEAVEAGKLTVERIEESVE-RILALKLKR 492 >UniRef50_C6J1S3 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J1S3_9BACL Length = 426 Score = 349 bits (897), Expect = 7e-95, Method: Composition-based stats. Identities = 95/343 (27%), Positives = 161/343 (46%), Gaps = 22/343 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN---RLVVA 60 V++ ++G E + R+++ VGG I + N D Q L ++ A++N L ++ Sbjct: 81 VLVGMDGTEPNDVTRQLIEEYHVGGFIFYKDNIQDTKQALTLFNGLKTANQNKAVPLFLS 140 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD+EGGRV R T+LPAA+ E LA + G ++ E+ +++ FAPV Sbjct: 141 VDEEGGRVSRMPGELTKLPAARKIGNTVSEE----LAGQIGGIIGQELAGFGLNLDFAPV 196 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LDV + IG+R++ P+ + + G+ E G+ KHFPGHG + DSH Sbjct: 197 LDVNSNPDNPVIGDRAFGTKPEIVSRMGIAVMKGIREQGVIPVVKHFPGHGDTSVDSHLG 256 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 P +R ++ F I+E D +M AH++ +DP PAS S + +LR+ Sbjct: 257 LPVVEHDLDRLRKLELVPFKQAIQEG-ADVVMIAHLLMPKLDPEHPASFSKAVIHDLLRE 315 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ELG++G I SDD++M + + AG +++LV ++ + V+ L Sbjct: 316 ELGYEGTIISDDMTMGAIVEHYDIRDAAVQFIQAGGNIVLVGHDEEKEKQVIQALRD--- 372 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 G+ S + L + +L Q ++ + G Sbjct: 373 -----AVSSGTISAETLDERVYN---VLKLKQKYKLTNDPAQG 407 >UniRef50_A6TMT8 Glycoside hydrolase, family 3 domain protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TMT8_ALKMQ Length = 434 Score = 349 bits (896), Expect = 8e-95, Method: Composition-based stats. Identities = 85/322 (26%), Positives = 157/322 (48%), Gaps = 18/322 (5%) Query: 13 LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFR 72 ++ +E + +GG+ILF N Q ++L+ ++ SR + +AVD+EGGRV R Sbjct: 92 INESVKEQIKQYHLGGIILFAENIESIDQTKKLIDDMQKNSRLPMFIAVDEEGGRVTRLN 151 Query: 73 EGFTRLPAAQSFA-ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAA 129 ++LPA Q + G + L+ E G L+ E++++ +++ APV DV + Sbjct: 152 HSRSQLPATQLPGNEVLGKTKDPTLSYEVGRLLGRELLSLGFNMNLAPVADVNTNAKNPV 211 Query: 130 IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEI 189 IG+RS+ +PQ+ +AS G+ + KHFPGHG D+H + + Sbjct: 212 IGDRSFSDNPQEVGIMASEMARGLQNENVSAVFKHFPGHGDTEFDTHHQAVVINHDLERL 271 Query: 190 RAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFDGVIFS 247 ++ + F I E +DAIM AH+ + PA+ S + +LR+E+G++G++ + Sbjct: 272 QSVEWVPFRKGI-EVSVDAIMTAHIQMPQITGNDFPATLSSKIITEILREEMGYEGMVIT 330 Query: 248 DDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKG 307 D L M + + AE +++AG D++L+ + + A V G Sbjct: 331 DALEMSAVSQHYTSAEAAVLAIEAGVDILLMPRSLEEAY-----------GGVLEAVSLG 379 Query: 308 SFSRQELMDSARWKAISTRLNQ 329 + + + +S R + + +L + Sbjct: 380 LITEERIEESVR-RILQVKLKR 400 >UniRef50_C7P2G4 Glycoside hydrolase family 3 domain protein n=1 Tax=Halomicrobium mukohataei DSM 12286 RepID=C7P2G4_HALMD Length = 528 Score = 349 bits (896), Expect = 9e-95, Method: Composition-based stats. Identities = 95/332 (28%), Positives = 154/332 (46%), Gaps = 25/332 (7%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVV 59 ++ EG E + RE+L G +I F+RN P Q+ EL R+++ + L V Sbjct: 20 VVGFEGPEPTEDLRELLTDYRCGNVIYFSRNIDSPEQVAELSRELQTIATEAGPEIPLFV 79 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 DQEGG V R + T P+ S A + LA+ G + +E+ ++ ++ P Sbjct: 80 TADQEGGVVSR-TDWGTEPPSQMSIGAGRDAD----LARSVGGAVGAELASIGVNFDLTP 134 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV + + IG RS+ +P+ + + DGM G+ GKHFPGHG +ADSH Sbjct: 135 VLDVNNNPDNPVIGVRSFGEEPELVGDLGAAMADGMQSEGVLACGKHFPGHGDTSADSHH 194 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV---DPRPASGSPYWLKTV 234 P + + A +++ F I + +DAIM HV + + D PA+ S + Sbjct: 195 SLPVVDHDRERLDAVELAPFRRAI-DAGIDAIMTTHVSFPTITGDDELPATVSRDVQTGL 253 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LR++LGFDG++ +D + M A E +++AGCD++LVC+ Sbjct: 254 LREQLGFDGLVVTDGMEMNAIADEMGTPEGCVQAVEAGCDLLLVCHTPAVQKD------- 306 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 E V G + D+ + + + Sbjct: 307 -SVEAVIDAVESGRIDESRIDDAVE-RVLEYK 336 >UniRef50_C5BIH5 Glycoside hydrolase family 3 domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BIH5_TERTT Length = 343 Score = 349 bits (896), Expect = 1e-94, Method: Composition-based stats. Identities = 155/340 (45%), Positives = 201/340 (59%), Gaps = 10/340 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GPV+LDV+G L + +L +P VGG+ILF+RNY + QL L +IR AS N +V+A Sbjct: 6 LGPVVLDVDGTTLSESDVNLLQNPWVGGVILFSRNYTNVMQLCGLTAEIRKASPN-IVIA 64 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRF FT +PA Q L E L E GWL+A+E+IA +D+SF Sbjct: 65 VDHEGGRVQRFGGEFTPIPAMQQLGNLYVKSPVEALALTHETGWLLAAELIACGLDMSFT 124 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV S IG RS+ DP +A++ DG+ EAGM TGKHFPGHG V DSH Sbjct: 125 PVLDVDDCRSQIIGSRSFGPDPAAVIALSEALTDGLIEAGMSATGKHFPGHGGVVEDSHL 184 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E P D R E+ D+ F +L+ KL+AIMPAH+ + + P S +WL+ VLR+ Sbjct: 185 ELPVDNRSFDEVARHDLQPFKALLP--KLEAIMPAHIHFPQIASEPVGFSSFWLQQVLRE 242 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP--- 294 +LGF+G+IFSDDL+MEGA G Y ER +L AGCD +L+CNN GA L L Sbjct: 243 DLGFNGIIFSDDLTMEGAVTAGDYGERTLKALAAGCDSVLICNNPAGAQEALARLDATFS 302 Query: 295 -IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + R+ + K F L SAR K + +L L R Sbjct: 303 TVFPSRLECMRAKRCFEWDSLRGSARAKGVRAQLESLVAR 342 >UniRef50_C6D1D6 Glycoside hydrolase family 3 domain protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D1D6_PAESJ Length = 403 Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats. Identities = 88/335 (26%), Positives = 162/335 (48%), Gaps = 23/335 (6%) Query: 1 MGPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA-----SR 54 +G ++ + ++G E+ E +++ VGG+IL++ N +Q +L ++ ++ Sbjct: 55 IGQMILVGMDGTEIQPEISKLIKESHVGGIILYSNNIESASQTIKLANDLKQINQDNGAK 114 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L+++ DQEGGRV R + PA+++ + + A + G + + A+ ++ Sbjct: 115 LPLLLSTDQEGGRVSRLPKQIKPFPASRTIGKTNDP----QYAYQVGTALGEAVKAIGLN 170 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 FAPVLD+ + IG R++ + + + + G+ G+ KHFPGHG + Sbjct: 171 TDFAPVLDINTNPKNPVIGNRAFGTTAELVSRMGVQEMLGIQSQGVIPVVKHFPGHGDTS 230 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWL 231 DSH P +R+ + FSS I+E +M AH++ + +DP PAS S + Sbjct: 231 VDSHLGLPVVDHDLERLRSIEFVPFSSAIKEG-APMVMVAHILMTKLDPDTPASMSKKVI 289 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 + LR EL F GV+ +DD++M + + S+ AG D+ILV ++ +V+D Sbjct: 290 QDYLRGELKFSGVVITDDMTMGAVGKVKAIGPATVQSVLAGADIILVGHDPVQQQTVIDA 349 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 L+ G S L D++ ++ I + Sbjct: 350 LTA--------AAQSGEISPSVL-DASVYRIIKLK 375 >UniRef50_UPI0000E11708 beta-hexosaminidase n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=UPI0000E11708 Length = 349 Score = 347 bits (891), Expect = 3e-94, Method: Composition-based stats. Identities = 147/292 (50%), Positives = 193/292 (66%), Gaps = 1/292 (0%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M P+MLDV G+EL +E+E++AHPL GG+ILF RNYHDP QLR L QI+ ++ + +++A Sbjct: 1 MHPIMLDVAGFELTQQEQEVMAHPLTGGVILFARNYHDPEQLRHLTSQIKHSAGHPILIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRVQRF+ GFT +PA + +LA G ++A+E I +DISFAPV Sbjct: 61 VDQEGGRVQRFKNGFTAIPAMGDI-EHHDHVQARQLAYACGRVIATECIGHGVDISFAPV 119 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+ IS IG RS+ P L++A+ FI G+ GM GKHFPGHG V ADSH P Sbjct: 120 LDINAISDVIGTRSFSNAPSTILSLATEFIHGLKAIGMPAIGKHFPGHGNVKADSHIAMP 179 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R + I DM++F+ +++ +D IMPAHV+Y D+D RPA S WL+ +LRQ G Sbjct: 180 SDERSKDTIFELDMAIFTHFMQQQLVDGIMPAHVVYPDIDSRPAGFSSIWLQQILRQHCG 239 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 F GVIFSDDLSM A+ GS +R + +L AG DM L+CN A+ VLDNL Sbjct: 240 FQGVIFSDDLSMHAASEAGSMLDRVEHALAAGVDMALICNAPDEAIMVLDNL 291 >UniRef50_A2QDM1 Contig An02c0230, complete genome n=9 Tax=Trichocomaceae RepID=A2QDM1_ASPNC Length = 859 Score = 347 bits (890), Expect = 4e-94, Method: Composition-based stats. Identities = 92/345 (26%), Positives = 164/345 (47%), Gaps = 31/345 (8%) Query: 1 MGPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----- 54 +G + + G E + ++ +G ++LF RN D AQL+ L ++ +R Sbjct: 10 IGQLFAVGFHGLTPSPEIKTLIHDYGIGAIVLFKRNVSDAAQLQALTHALQEEARLAGHE 69 Query: 55 NRLVVAVDQEGGRVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 + L++ +DQE G V R +LP + A + E LA + G + A I Sbjct: 70 HPLLIGIDQENGLVTRVSPPVAPQLPGPMALGAANSPE----LAYQVGAATGEILSASGI 125 Query: 114 DISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 ++++APV D+ ++ IG RS+ +P+ AS G+ E + T KHFPGHG Sbjct: 126 NMNYAPVCDINSEPLNPVIGVRSFGDNPEFVARFASASAQGLREHKVVPTVKHFPGHGDT 185 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV-DPR-PASGSPY 229 DSH P P+ + ++ + ++ F E ++AIM AH+ + D R PA+ SP Sbjct: 186 AVDSHYGLPVIPKTREQLESCELVPFRQAANEG-IEAIMTAHISLPAIGDGRLPATLSPD 244 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 ++ +LR+++G+ G++ +D L M+G + +L AGCD I++C+ V+ + Sbjct: 245 VMR-ILREDMGYSGMVITDCLEMDGIRATYGTEQGAVLALAAGCDSIMICHTYSVQVASI 303 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 +V + G L ++ R R+++L R+ Sbjct: 304 L--------QVCQAAESGKIPSARLDEAIR------RVSELKSRF 334 >UniRef50_B8FZS5 Glycoside hydrolase family 3 domain protein n=2 Tax=Desulfitobacterium hafniense RepID=B8FZS5_DESHD Length = 426 Score = 347 bits (890), Expect = 5e-94, Method: Composition-based stats. Identities = 95/337 (28%), Positives = 177/337 (52%), Gaps = 25/337 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA---ASRNRLVVA 60 VM+ V+GYE++A ++++ + VGG +L +N D Q+ L+ ++ +R L +A Sbjct: 78 VMVGVDGYEINANAQQLIENYHVGGFVLLKKNVRDSGQMLNLINTLKETNGVNRIPLFLA 137 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +D+EGGR+ R F ++P++Q A + G L ++ G ++ E+ ++++FAPV Sbjct: 138 LDEEGGRISRMPAEFKKMPSSQQVGA----QNSGALGKKMGEILGREVKGFGMNVNFAPV 193 Query: 121 LDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ + IG+R++ ++P+ + + + G+ E G+ + KHFPGHG + DSH Sbjct: 194 LDIFSNPKNKVIGDRAFGSNPELVSKVGIQTMRGIQEQGIISVVKHFPGHGDTSVDSHVG 253 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 P +R ++ F+ I N +DAIM AH++ +DP PAS S ++ +LR+ Sbjct: 254 LPRVDYDLERLRNFELRPFAEAI-ANDVDAIMLAHILLPKLDPDYPASFSEVLIRDILRK 312 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 E+ ++GV+ +DD++M + E S+ AG D++LVC++ +VL Sbjct: 313 EMDYNGVVITDDMTMGAIVENYNIGEAAVKSILAGSDIVLVCHDFAKEEAVLKE------ 366 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 + G + +S R+ +L E++ Sbjct: 367 --ILHAAETGKIPVDRIDESVY------RVLKLKEKY 395 >UniRef50_D1CIC2 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CIC2_THET1 Length = 543 Score = 346 bits (887), Expect = 9e-94, Method: Composition-based stats. Identities = 98/345 (28%), Positives = 163/345 (47%), Gaps = 28/345 (8%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVVA 60 + G+ E +++ VG +ILF+RN D QLREL ++ A+ R L++ Sbjct: 17 VGFPGHAPTPEVLQLIQRYRVGNVILFSRNVRDVQQLRELTATLQRAAFDAGHRQPLLIM 76 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +DQE G V+R +G + LP + + A E A++ E+ A+ ++++ APV Sbjct: 77 IDQENGVVRRLGQGASVLPGSMALGAAGDPE----CARQVALTTGRELRALGVNMNLAPV 132 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + + IG RS+ DPQ + + ++G EAG+ + KHFPGHG DSH Sbjct: 133 ADVNNNPANPVIGVRSFGEDPQAVGDMVAAAVEGYAEAGVISCLKHFPGHGDTAVDSHLA 192 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLR 236 P P +A + + ++ F I E D+IM AH+ + + P++ SP + +LR Sbjct: 193 LPVVPHDRARLESVELPPFRRGI-EAGTDSIMVAHLHLPAIMANEAPSTVSPEVIDGLLR 251 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 ELGF G++ +D L M + ++ +G D+ LV + V + L Sbjct: 252 GELGFGGLVITDCLEMRAVSETIGTERSAVLAIASGADVALVSHRHDRQVGAVRALYA-- 309 Query: 297 AERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEKAG 340 +G+ S L ++A R + RL+ W+E G Sbjct: 310 ------ALREGALSEDRLQEAADRVMCLKERLS-----WREALGG 343 >UniRef50_A9KMQ8 Glycoside hydrolase family 3 domain protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KMQ8_CLOPH Length = 517 Score = 345 bits (886), Expect = 1e-93, Method: Composition-based stats. Identities = 89/330 (26%), Positives = 152/330 (46%), Gaps = 21/330 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRLVV 59 ++ G +D+E + + +G ILF N D QL L ++ ++ +R + Sbjct: 13 IVAGFPGTTIDSELEDFIRTYKIGNFILFKENIVDANQLSNLCEGLQQLTKKYTGHRAFI 72 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +DQEGG V R E +P A + AA + A AG + ++ + + AP Sbjct: 73 TIDQEGGMVTRLSEDSVNIPGAMAIAATRDEKN----AYMAGRITGQQLRTLGFNFDLAP 128 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 V D+ + IG RSY +P + + G+ + G+ + KHFPGHG DSH Sbjct: 129 VADINSNMDNPVIGVRSYGDEPDQVAKYCVAMMKGLTDGGVLASAKHFPGHGDTNVDSHL 188 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 P + E+ ++ F +LI E + AIM +H+I+ ++ PA+ S + +L+ Sbjct: 189 GLPKVHKSLEEMELCELVSFKALI-EAGIPAIMSSHIIFPALEEELPATMSRKIITGLLK 247 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 ++LGF G++ SD + M E + +++AG D+I V + A D L+ Sbjct: 248 EKLGFKGLVISDCMEMSAIKKYYGSIEGIKHAIEAGVDLIFVSHTMSVAREASDVLTG-- 305 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTR 326 LY KG S E+ D++ K + + Sbjct: 306 ------LYEKGELSMDEM-DASIDKIMYYK 328 >UniRef50_B1YFC7 Glycoside hydrolase family 3 domain protein n=6 Tax=Firmicutes RepID=B1YFC7_EXIS2 Length = 646 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 98/355 (27%), Positives = 161/355 (45%), Gaps = 35/355 (9%) Query: 1 MGPVML-------DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G +++ +L E I+ +GG+ILF N + Q +LV +R Sbjct: 52 IGQMIMPDFRLWDGANHTKLAPEVGRIIDRFDLGGVILFAENVSETEQTTKLVHDLREVV 111 Query: 54 R------NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASE 107 L V +DQEGG V R G T LP + A A+ AG ++ SE Sbjct: 112 EQDPSNDIPLFVTIDQEGGIVTRLGTG-TNLPGNMALGAT----RSSAYAEAAGGIIGSE 166 Query: 108 MIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHF 165 + A+ I+++F PVLDV + + IG RS+ +DP + S G+ E G+ T KHF Sbjct: 167 LHALGINVNFGPVLDVNNNPGNPVIGVRSFSSDPNLVGELGSAMTQGIQEQGVAATAKHF 226 Query: 166 PGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD----- 220 PGHG DSH P + AE++ ++ F I E +D +M H+ ++ Sbjct: 227 PGHGDTAVDSHYGLPVVDKSLAELKQLELIPFKRAIDEG-IDMVMTGHIGMPQIEDEVVE 285 Query: 221 ------PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCD 274 P PA+ S + VLR+ELG+ G++ +D L+M+ A + +E + AG D Sbjct: 286 SKQGTFPLPATLSDDVITGVLREELGYQGLVITDALNMQAIADNFTESEAVIKTFKAGVD 345 Query: 275 MILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + L+ + A V L I + V + G ++ D + + ++ + + Sbjct: 346 IALMPTILRSASDV-TKLETIFDD-VIKAVESGDLKEADI-DRSVERILTLKATR 397 >UniRef50_C1I263 Beta-hexosamidase A n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I263_9CLOT Length = 862 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 89/340 (26%), Positives = 161/340 (47%), Gaps = 30/340 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS------RNRLVVAVDQE 64 E++ E EI+ +GG+ILF +N Q +L+ ++ + L+ +DQE Sbjct: 74 TEMNDEVAEIIDEFDLGGVILFAQNVKTTEQTTKLIHDMQQVAINDQDGNLPLLTTIDQE 133 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG V R G T P + A + + G ++ E+ ++ I+++FAP +D+ Sbjct: 134 GGIVTRLGTG-TNFPGNMAIGATKNEQSS----YDTGLVIGRELNSLGINVNFAPSVDIN 188 Query: 125 HI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + + IG RS+ ++P + + I G+ E G+ KHFPGHG DSH P Sbjct: 189 NNPLNPVIGLRSFSSNPDLVAKLGVKMIKGIQENGVSVAAKHFPGHGDTATDSHYGLPKV 248 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGSPY 229 + E++ ++ F + I +N +D IM AH+ + ++ PA+ S Sbjct: 249 DKSLEELKEAELKPFQAAI-DNGVDMIMTAHIQFPQIEKDTYTSIKDGQEIQIPATLSDD 307 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 L +LR+++ FDGVI +D L+M + E + + DAG D+ L+ + + + + Sbjct: 308 ILTGLLREDMNFDGVIITDALNMAAISENFGEVETVKIAFDAGVDIALMPTILR-SKADV 366 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 L I E V KG S + + +SA + + ++++ Sbjct: 367 PKLRAI-VEGVKEAVAKGEISEERINESAA-RVVKLKIDR 404 >UniRef50_B6Q1F2 Beta-N-acetylglucosaminidase, putative n=1 Tax=Penicillium marneffei ATCC 18224 RepID=B6Q1F2_PENMQ Length = 877 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 90/348 (25%), Positives = 155/348 (44%), Gaps = 35/348 (10%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN----- 55 G + + G+ E R ++ +G +ILF RN D QL+ L ++ +++ Sbjct: 14 GQLFAVGFHGFTSSPEIRTLIRDYGLGAVILFRRNIKDATQLQALTLALQQEAKDAGHEH 73 Query: 56 RLVVAVDQEGGRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L + +DQE G V R + + P + A + A + G + I+ Sbjct: 74 PLFIGIDQEHGLVTRITPQTAAQQPGQMTLGATQSADN----AYKIGKCTGEMLSFFGIN 129 Query: 115 ISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 +++APV DV ++ IG RS+ +P ++S G+ E G+ T KHFPGHG Sbjct: 130 MNYAPVCDVNSEPLNPVIGVRSFGDNPNFVAKMSSATARGLRETGVVPTVKHFPGHGDTA 189 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV------DPRPASG 226 DSH P + ++E+ ++ F + E ++A+M AH+ + + PA+ Sbjct: 190 VDSHHGLPVINKSRSELERCELIPFRRSVAEG-IEAVMTAHIALPRISSGPESNRLPATL 248 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 SP L +LR ++ +DGVI +D L M G + SL AG D +++C+ V Sbjct: 249 SPEAL-GILRNDMKYDGVIVTDCLEMNGIRGTYGTVDGTLMSLKAGSDSVMICHTYAVQV 307 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 + + ERV + G + ++ R R+ L ER+ Sbjct: 308 TSI--------ERVVQAVKSGELPESRICEALR------RITSLKERF 341 >UniRef50_B6J7B4 Anhydromuramoyl-peptide exo-beta-N-acetylglucosaminidase n=6 Tax=Coxiella burnetii RepID=B6J7B4_COXB1 Length = 313 Score = 343 bits (881), Expect = 5e-93, Method: Composition-based stats. Identities = 142/323 (43%), Positives = 199/323 (61%), Gaps = 17/323 (5%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MG ++D+EG L +ERE+L HP V G+ILFTRNY +P+QLR L++ I+ L VA Sbjct: 1 MGFAIIDLEGVALSPQERELLQHPQVAGVILFTRNYENPSQLRSLIQTIKKI--RPLFVA 58 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRF GFT LP+ + F + K + A + E+ + ID++ Sbjct: 59 VDQEGGRVQRFHRGFTDLPSMRYFGECYQHDPLQARKHLENALKTVVHELKEVGIDVNMT 118 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + IGERS+ DP+ A+ S ID +H M KHFPGHG VTADSH+ Sbjct: 119 PVLDIDQGKNEIIGERSFSGDPRVVEALGSVVIDTLHANRMPAVAKHFPGHGGVTADSHQ 178 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D R I D FS+L ++LDAIMP H+IY D +PA+ S YWL+ +LRQ Sbjct: 179 ALPIDERDPEAILQCDWIPFSAL--SSRLDAIMPTHIIYPAFDEKPATFSRYWLQDMLRQ 236 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 L F GVI SDDL+M+GAA++G Y R +L+AGCD++ +CN+R+G +++LD+L Sbjct: 237 RLNFRGVIISDDLTMQGAAVIGDYPTRAAQALEAGCDLLTICNSREGVIAILDSL----- 291 Query: 298 ERVTRLYHKGSFSRQELMDSARW 320 L+++ S+Q L+ +R+ Sbjct: 292 -----LHYRNPESQQRLLSFSRF 309 >UniRef50_A4BUN0 Glycoside hydrolase, family 3-like protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BUN0_9GAMM Length = 337 Score = 342 bits (878), Expect = 1e-92, Method: Composition-based stats. Identities = 147/333 (44%), Positives = 191/333 (57%), Gaps = 9/333 (2%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ + G L EERE+L P +GG+ILF+RNY QL +L I A R L++AVDQE Sbjct: 1 MIGIRGLVLLPEERELLCDPRIGGVILFSRNYESIEQLSQLTEAIHAIRRPSLLIAVDQE 60 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLA--QEAGWLMASEMIAMDIDISFAPVLD 122 GG VQRFR GFT LPA L LA + GWLMA+E+ A +D+SFAPVLD Sbjct: 61 GGCVQRFRTGFTALPAPACLGRLYDHGPPVALAAAERLGWLMAAELRAAGVDLSFAPVLD 120 Query: 123 VGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 V +S IGER++H + ++ GM +AGM GKHFPGHGAV ADSH E P Sbjct: 121 VERGVSRVIGERAFHRRAGAVADLTLSWVRGMRQAGMAAVGKHFPGHGAVVADSHHELPV 180 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 D RP I + D F LI L+ +M AHV+Y VD RPAS S W+ VLR+ LGF Sbjct: 181 DDRPLDTIASVDFLPFRRLIAVG-LEGVMMAHVLYPQVDARPASFSGVWIAGVLRETLGF 239 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL----SPIKA 297 G + SDDL M+ AA +G +R +L AGCD++L+ N+ + A S+++ L P++A Sbjct: 240 TGAVVSDDLGMQAAACIGDLYQRAGVALAAGCDLLLLGNDWEPAASIVERLPSVERPVRA 299 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 R+ L +G L RW T QL Sbjct: 300 ARLAALRGEGRLG-SPLQRDPRWLPAVTLATQL 331 >UniRef50_C8PT26 Glycosyl hydrolase, family 3 n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PT26_9SPIO Length = 520 Score = 341 bits (876), Expect = 2e-92, Method: Composition-based stats. Identities = 88/346 (25%), Positives = 160/346 (46%), Gaps = 26/346 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA----SRN 55 +G + G E+ + ++ VG +ILF N AQL+ L I+A + + Sbjct: 10 IGQMFACGFPGTEMPEDFIALVKKYKVGNVILFKHNIKSLAQLKALCADIQALITAETGH 69 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 + +DQEGG V R E + +P A + AA A AG + + A ++ Sbjct: 70 SAFITIDQEGGAVSRLAEDYYNVPGAMALAATGDERN----AYTAGKITGEILRAHGVNF 125 Query: 116 SFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 APVLDV + IG RSY +P + I G+ G+ + KHFPGHG + Sbjct: 126 DLAPVLDVNSNPKNPVIGVRSYGDNPALVDRYGIQMIRGLTAGGVLSAAKHFPGHGDTSV 185 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLK 232 DSH P + +AE+ +++ F++ I+ L A+ +H+++ +D + PA+ S + Sbjct: 186 DSHLTLPTVRKSKAELEQTEIAPFAAAIKTGLLPAVTISHILFPAIDEQFPATISKKIVT 245 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +L++ELGF+G++ SD + M+ E A+++AG ++I + + A + ++ + Sbjct: 246 GLLKEELGFNGLVISDCMEMKAMQTFFGTEESAAAAINAGIELIFISHTIPTACAAIETV 305 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 + AE + +AR ++ L ++ + Sbjct: 306 TRACAEN--------RIPLATVEQAAR------KIIALKKQLTPAQ 337 >UniRef50_B4BR59 Glycoside hydrolase family 3 domain protein n=2 Tax=Geobacillus RepID=B4BR59_9BACI Length = 457 Score = 341 bits (876), Expect = 2e-92, Method: Composition-based stats. Identities = 91/335 (27%), Positives = 162/335 (48%), Gaps = 25/335 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLV 58 V++ E D ++++ VGG IL+ RN P Q+ L ++ + + LV Sbjct: 121 VIVGFPSTEPDENVKQMIRQYHVGGAILYDRNMKTPEQVATLTNDLQRVALASRFKIPLV 180 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 ++DQEGG++ R R+ + +P+ Q + ++A +E+ AM I ++FA Sbjct: 181 FSIDQEGGKIVRMRQYVSPIPSQQQLGQSKDDAKVYQIAHR----TGTELAAMGIQVNFA 236 Query: 119 PVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 PVLD+ + RS+ DP A + + + G+ +AG+ KHFPG+G D H + Sbjct: 237 PVLDLSATDS----RSFGEDPVLAGRLGQQVVTGLTDAGVTAVLKHFPGNGRSQVDPHHD 292 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLD--AIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 + ++ KD+ F +I D +M H+ Y D PAS S ++ +L Sbjct: 293 SSSVEAGMEDLENKDILPFRQIITNINHDRFFVMVTHIKYPVYDKEHPASLSSAIIQQLL 352 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R++LG++G++ +DDL M + +Y E G ++ AG D++LVC+ V+D Sbjct: 353 REKLGYEGIVVTDDLEMGAVSKYFTYRELGYRAIAAGADLLLVCHTFDHQKEVIDG---- 408 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 + G S + + +S R K ++ +L+Q Sbjct: 409 ----IWDAVQTGKLSEERINESVR-KILAYKLSQF 438 >UniRef50_Q67SQ8 Putative beta-N-acetylglucosaminidase n=1 Tax=Symbiobacterium thermophilum RepID=Q67SQ8_SYMTH Length = 523 Score = 341 bits (875), Expect = 3e-92, Method: Composition-based stats. Identities = 99/289 (34%), Positives = 148/289 (51%), Gaps = 13/289 (4%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-RLVVAVDQ 63 M+ G + ++ VGG+ILF RN Q R L +I++ S + L++AVDQ Sbjct: 18 MIGFRGTDPGDVPEALIRTEGVGGVILFARNLVSADQARRLTARIQSCSPDLPLLIAVDQ 77 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R T P A S A + L + G +A E+ AM I+++ APVLDV Sbjct: 78 EGGIVLRL--AVTPWPGAMSLGAAGSPD----LTRRVGAGIARELRAMGINMNLAPVLDV 131 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG RS+ +DP + + +I G H+ G+ T KHFPGHG DSH Sbjct: 132 NVNPANPVIGVRSFGSDPARVAEHGAAYIRGHHDEGVLATAKHFPGHGDTDVDSHLALAR 191 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLRQE 238 P + A ++ F I +DAIM AHV++ V+P RPA+ SP L +LR++ Sbjct: 192 VPHGMDRLEAVELVPFRWAI-AAGVDAIMTAHVVFPAVEPDPDRPATLSPAVLTGLLREQ 250 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 LGFDG++ +D + M+ A AE ++ AG D++L+ + + Sbjct: 251 LGFDGLVITDCMEMKAIADHAGTAEGAVQAIAAGADLVLISHTERLQRE 299 >UniRef50_C0ZKK5 Probable beta-hexosaminidase n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZKK5_BREBN Length = 537 Score = 341 bits (874), Expect = 3e-92, Method: Composition-based stats. Identities = 88/340 (25%), Positives = 153/340 (45%), Gaps = 28/340 (8%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-----RLV 58 +M+ ++ E + +G +ILF+RN P + + V+++++ + L+ Sbjct: 16 LMIGYPTLDIPEEMEVWIKQKQIGSVILFSRNIGTPEETYQRVQKLQSWAYESWHPYPLL 75 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 V DQE G V R ++G TR P A + A +++ A+ E+ A I ++FA Sbjct: 76 VGTDQENGVVSRLQKGATRFPGAMALGATGNLDQ----AKRIYEATGEELRAAGISVNFA 131 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P +DV + + IG RS+ P++ I G+ +G+ KHFPGHG + DSH Sbjct: 132 PTVDVNNNPDNPVIGVRSFGEAPEQVARFGEAAIQGLLASGVIPAIKHFPGHGDTSIDSH 191 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTV 234 + P + + ++ F I + +M H+ +D PA+ S + + Sbjct: 192 EAVPVLGHDRTRLDQVELIPFQRSI-AAGVPMVMVGHISLPSMDESGSPATYSKEIVTGI 250 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LR+ LGFDGV +D + M A AE S+ AG D++LV + + D Sbjct: 251 LRESLGFDGVAITDCMEMAAIAGTIGVAEAAVRSVQAGIDLVLVSHTHEVQQKTYD---- 306 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 R+ H G S + + ++ R+ QL +R+ Sbjct: 307 ----RLVAAIHSGELSEERVNEAV------NRVLQLKKRF 336 >UniRef50_A8VQ50 Choline dehydrogenase n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8VQ50_9BACI Length = 535 Score = 339 bits (871), Expect = 6e-92, Method: Composition-based stats. Identities = 101/348 (29%), Positives = 163/348 (46%), Gaps = 33/348 (9%) Query: 1 MGPVML-DVEGY---ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN- 55 +G +M+ E+ +E++ VG +ILF RN P ++R L +++ +R+ Sbjct: 9 LGQMMVFGFSASKPDEVSEGIKEMIETHRVGNIILFGRNLGSPGEIRSLTGKLQKIARDA 68 Query: 56 ----RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAM 111 L ++VDQE G V+R EG T P A + A E LA E G+ A E+ A+ Sbjct: 69 GHERPLFISVDQENGAVRRLGEGTTVFPGAMTTGATGEPE----LAYECGYATALELKAL 124 Query: 112 DIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHG 169 I+ + APV DV + + IG RS+ DP++A A A+ ++GM +A + T KHFPGHG Sbjct: 125 GINWNLAPVADVNNNRHNPVIGVRSFSEDPEQAAAFAAASMNGMQKANIMTAVKHFPGHG 184 Query: 170 AVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASG 226 DSH P P + ++ F I E+ D IM +H+ + ++ P + Sbjct: 185 DTEVDSHLSLPVIPHDMERLNHVELVPFRRCI-ESGADVIMSSHIYFPALEKEKNLPVTL 243 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 S L +LR ++GF+G+I +D L M+ + A ++ AG D +++ Sbjct: 244 SHEVLTNLLRVKMGFEGLITTDCLEMDAISESVGTAVGAVKAIHAGVDFVMISMRTDLQK 303 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 L E V R KG S + +S R+ +R+ Sbjct: 304 QAL--------EAVKRDIDKGLISESRIEES------YQRIIAAKDRY 337 >UniRef50_Q4FV07 Glycoside hydrolase family protein n=32 Tax=Pseudomonadales RepID=Q4FV07_PSYA2 Length = 360 Score = 339 bits (871), Expect = 8e-92, Method: Composition-based stats. Identities = 137/283 (48%), Positives = 183/283 (64%), Gaps = 5/283 (1%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +M+D++ L AE+ ++ VGG+ILF RN D AQ+R L IR +++ V Sbjct: 3 GVLMIDIDSTALTAEDVSLIRQAQVGGVILFARNVTDAAQVRALCDDIR-CHNADILIGV 61 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFAP 119 DQEGGRV R R GFT LPA L + L A + G+LMA+E++A+ ID+SFAP Sbjct: 62 DQEGGRVARLRNGFTPLPAMGRLGELFNQDPNRALECAYDCGYLMAAEVLAVGIDLSFAP 121 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 VLD IS IG+RS+H DPQ +A+AS+F+ GM AGM TTGKHFPGHG++ DSH Sbjct: 122 VLDRDGISQVIGDRSFHHDPQAIIALASQFMRGMKAAGMATTGKHFPGHGSIAPDSHVAE 181 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQEL 239 D R +I DM F+ + LDA+MPAHVI+S VD +PA S WLK ++R++L Sbjct: 182 AIDDRSLDDIMNSDMQPFAQTLP--WLDALMPAHVIFSQVDNKPAGFSKIWLKDIIREQL 239 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 F+GV+FSDDLSM GA G + R A+++AGCD+ LVCN+R Sbjct: 240 NFEGVLFSDDLSMAGAKAAGDVSARVTAAIEAGCDIALVCNDR 282 >UniRef50_Q04TR7 Beta-glucosidase-related glycosidase n=4 Tax=Leptospira RepID=Q04TR7_LEPBJ Length = 579 Score = 339 bits (869), Expect = 1e-91, Method: Composition-based stats. Identities = 87/337 (25%), Positives = 163/337 (48%), Gaps = 23/337 (6%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA----ASRNR 56 G V+ + + G LD + + L GG+ILF N ++ +L +++ +S+ Sbjct: 56 GQVIHVAIPGKTLDETAAKEIEEILPGGIILFGTNLGTKEEILKLNLELQKKSMESSKLP 115 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++++DQEGGRV R ++G T+ P A + + A + G++ + ++ + + Sbjct: 116 LLISIDQEGGRVLRVKDGVTQFPGAMALGQTKNAD----YAYKVGFVTSYQLRKLGFNFV 171 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAP LD+ + + I RS + P+ + G G T KHFPGHG D Sbjct: 172 FAPDLDINNNPDNPVINTRSMGSTPETVSISGIGYEKGARIGGAIPTIKHFPGHGDTNVD 231 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH P + E+ ++ F I++ + +M AH++Y +DP PA+ S L Sbjct: 232 SHLGLPKIDKTLEELERMELIPFQESIQQG-AEVVMSAHIVYPKLDPDFPATLSSKILTG 290 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR ++GF GV+ +D + M + + G ++ AG D++L+ + K ++ + Sbjct: 291 ILRDKMGFKGVVITDAMEMNAIDVHYKDRDPGVLAILAGADILLMTSWGKTTRNMKN--- 347 Query: 294 PIKAERVTRLYHKGSFS--RQELMDSARWKAISTRLN 328 +++ Y KG+F ++L+ A ++ I +L Sbjct: 348 -----QISTAYKKGTFQKGEKDLLKEAVYRQILLKLK 379 >UniRef50_A3DC81 Glycoside hydrolase, family 3-like protein n=3 Tax=Clostridium thermocellum RepID=A3DC81_CLOTH Length = 444 Score = 338 bits (868), Expect = 1e-91, Method: Composition-based stats. Identities = 88/325 (27%), Positives = 161/325 (49%), Gaps = 25/325 (7%) Query: 13 LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF- 71 LD + + + GG+ILF+ N Q ++L+R ++ AS+ + +AVD+EGGR+ R Sbjct: 99 LDNSTKLKIQNFNPGGIILFSENIDTIPQTQKLIRDMQEASKIPMFIAVDEEGGRIARIG 158 Query: 72 ---REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI-- 126 + T++P+AQ+ + + +LA EAG ++ +E+ A+ +++FAPV DV Sbjct: 159 NNPKMHSTKIPSAQTIG----LADDPELAYEAGRILGAELSALGFNMNFAPVADVNTNPD 214 Query: 127 SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 + IG+RS+ +DP K + GM E + T KHFPGHG + DSH Sbjct: 215 NPVIGDRSFGSDPYKVGLMVQAMSKGMQEQNVCTVLKHFPGHGDTSYDSHLGQVVINHDI 274 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGV 244 +R +++ F I+ D +M AH+I ++ PA+ S L +LR EL + + Sbjct: 275 ERLRQIELTPFKMGIKAG-ADGVMTAHIIMPNITGSNLPATLSEEILSGLLRNELKHEKL 333 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY 304 I +D + M+ + S ++ + AG D+IL+ + + A + + + Sbjct: 334 IITDAMEMKAISNYWSSSKAAVMAFKAGADIILMPESFEEAYN-----------GILKAV 382 Query: 305 HKGSFSRQELMDSARWKAISTRLNQ 329 G + + L S + + ++ + + Sbjct: 383 KDGEITEERLNQSLQ-RILALKFER 406 >UniRef50_B3TAA3 Putative glycosyl hydrolase family 3 N terminal domain protein n=2 Tax=environmental samples RepID=B3TAA3_9ARCH Length = 392 Score = 338 bits (868), Expect = 2e-91, Method: Composition-based stats. Identities = 111/318 (34%), Positives = 164/318 (51%), Gaps = 18/318 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----NRLV 58 ++ EG LDA E++ +GGLILF RNY +P QL +L +++ + L Sbjct: 2 IISGFEGTTLDARTEELIVQQGIGGLILFERNYKNPDQLLQLTHDLQSLAAGHAEIPPLF 61 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 ++VDQEGGRV R + FT+ P +E LA G M E+ A+ I++ +A Sbjct: 62 ISVDQEGGRVARLGKPFTQFPPMSCLGQADSVE----LAYRFGLGMGKELRAVGINMDYA 117 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLDV + IG R+ DP K + + I G ++AG+ GKHFPGHG + DSH Sbjct: 118 PVLDVHSNPENPIIGLRALDTDPAKVARLGAELIRGFYDAGVIPVGKHFPGHGDTSQDSH 177 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 P R + + ++ F I++ L+ IM AHV+Y D R PA+ S L VL Sbjct: 178 LTLPQVERARESLEKSELIPFIHAIQQG-LEVIMTAHVVYPAWDARQPATFSRSILNDVL 236 Query: 236 RQELGFDGVIFSDDLSMEGA-AIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL-S 293 R L F+G+I SDDL M+ + S E G AG D+ L+C++ + ++ D++ Sbjct: 237 RNSLQFEGLIISDDLDMQAIDEKLESIPELGTR---AGVDLFLICHDLEKVCTLQDSMIR 293 Query: 294 PIKAERVTRLYHKGSFSR 311 I+ R+ R K S R Sbjct: 294 DIETARIPRTTIKQSLGR 311 >UniRef50_Q8ERK3 Beta-N-acetylglucosaminidase n=2 Tax=Bacillaceae RepID=Q8ERK3_OCEIH Length = 668 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 88/325 (27%), Positives = 146/325 (44%), Gaps = 32/325 (9%) Query: 18 REILAHPLVGGLILFTR--NYHDP---AQLRELVRQIRAA-----SRNRLVVAVDQEGGR 67 +E++ + +GG+I F N P Q+ +L ++ S L V+ DQEGG Sbjct: 77 QEVIENYHIGGIIYFNWSQNIGTPLEKQQVHDLSNGLQDIALEQDSEIPLFVSTDQEGGI 136 Query: 68 VQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI- 126 VQR T P + A + ++ A +G ++ E+ A+ I+++FAP +DV Sbjct: 137 VQRVISPGTVFPGNMAIGATNSID----FASRSGEILGKELDALGINMNFAPTVDVNLNP 192 Query: 127 -SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRP 185 + IG RS+ DP + + I + + T KHFPGHG DSH P Sbjct: 193 ANPVIGVRSFGEDPDMVSQLGAAQITAYEDQDVVATAKHFPGHGDTDVDSHYGLPVIDHD 252 Query: 186 QAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGFDG 243 + D+ F + I + + AIM H++ +D PA+ S L +LR+++GF+G Sbjct: 253 LETLHEVDLPPFQAAI-DAGVGAIMTGHIVVPALDDSGMPATLSYPILTELLREDMGFEG 311 Query: 244 VIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRL 303 +I +D L M G A + S AG D++L + P+ E V Sbjct: 312 LIITDSLGMSG-ANVVPPERVAVESFLAGADILLNPPD-----------VPVAYEGVMDA 359 Query: 304 YHKGSFSRQELMDSARWKAISTRLN 328 G S + L +S ++ +S ++ Sbjct: 360 VESGEISEERLDES-VYRILSAKME 383 >UniRef50_A9M1Z4 Beta-hexosaminidase n=23 Tax=Neisseriaceae RepID=NAGZ_NEIM0 Length = 361 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 132/294 (44%), Positives = 183/294 (62%), Gaps = 5/294 (1%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G VM D+ + L EE++ L P VGG+ILF RN+ + AQL+EL +I+A L++AV Sbjct: 9 GLVMADIAAFRLTEEEKQRLLDPAVGGVILFRRNFQNIAQLKELTAEIKALRTPELIIAV 68 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKL---AQEAGWLMASEMIAMDIDISFA 118 D EGGRVQRF EGFTRLPA + + + A + G ++A+E+ A ID+SF Sbjct: 69 DHEGGRVQRFIEGFTRLPAMSTLGEIWDKDGASAAETAAGQVGRVLATELSACGIDLSFT 128 Query: 119 PVLDVGHIS-AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + IG RS+H +P+ +A G+ + GMK+ GKHFPGHG V DSH Sbjct: 129 PVLDLDWGNCPVIGNRSFHRNPEAVARLALALQKGLEKGGMKSCGKHFPGHGFVEGDSHL 188 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D R +E+ A D++ F + RE + A+MPAHV+Y VD +PA S WLK +LR+ Sbjct: 189 VLPEDGRSLSELEAADLAPFRIMSREG-MAAVMPAHVVYPQVDTKPAGFSEIWLKQILRR 247 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 ++GF GVIFSDDL+MEGA G ER + S +AGCD++LVCN + ++ Sbjct: 248 DIGFKGVIFSDDLTMEGACGAGGIKERARISFEAGCDIVLVCNRPDLVDELRED 301 >UniRef50_B8M4K9 Beta-N-acetylglucosaminidase, putative n=9 Tax=Trichocomaceae RepID=B8M4K9_TALSN Length = 2237 Score = 337 bits (866), Expect = 3e-91, Method: Composition-based stats. Identities = 91/348 (26%), Positives = 151/348 (43%), Gaps = 35/348 (10%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN----- 55 G + + G+ E + ++ +G +ILF RN D QL+ L ++ ++ Sbjct: 1376 GQLFAVGFHGFTPSPEIKTLIRDYGLGAVILFRRNIKDAVQLKTLTLALQQEAKEAGHEH 1435 Query: 56 RLVVAVDQEGGRVQRFREGFT-RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L + +DQE G V R + P + A E A E G + I+ Sbjct: 1436 PLFIGIDQENGLVTRIMPPLAVQQPGQMTIGATQSAEN----AYEVGKCTGELLSFFGIN 1491 Query: 115 ISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 +++AP D+ ++ IG RS DP + G+ E+G+ T KHFPGHG Sbjct: 1492 MNYAPDCDINSEPLNPVIGVRSAGDDPNFVARFSCATAKGLRESGVVPTAKHFPGHGDTA 1551 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV------DPRPASG 226 DSH P + ++E+ ++ F + E ++ +M AH+ V + PA+ Sbjct: 1552 VDSHHGLPVISKSRSELERCELIPFRRAVAEG-IETVMTAHIALPKVTTGPDSEGLPATL 1610 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 SP L +LR ++ +DGVI +D L M+G E SL AG D I++C+ V Sbjct: 1611 SPEAL-GILRNDMKYDGVIVTDCLEMDGIRATYGTVEGSLMSLIAGSDSIMICHTYAVQV 1669 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 + ERV + G+ S + +S R+ +L +R+ Sbjct: 1670 KSI--------ERVVQAVQLGALSESRIDES------LHRIGKLKQRF 1703 >UniRef50_Q6MRE1 BglX2 protein n=2 Tax=Bdellovibrio bacteriovorus RepID=Q6MRE1_BDEBA Length = 370 Score = 337 bits (865), Expect = 3e-91, Method: Composition-based stats. Identities = 88/336 (26%), Positives = 166/336 (49%), Gaps = 22/336 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLV 58 + + V G+ L A+E++ + +GG+ LF RN +P Q+REL +I++ + L Sbjct: 11 MFIGVSGHALTADEKKFIVENNIGGVCLFGRNVAEPKQVRELCAEIQSLRHKQVDKAPLF 70 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 + +D EGGRV R + FT P + L A G EM A+ I++ FA Sbjct: 71 IGIDMEGGRVHRLKAPFTVWPPLRKLGDLDAPTVSFHFANRMGL----EMKAVGINLDFA 126 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P +D+ + IG+RS +DP+ AS + G ++ + T KHFPGHG DSH Sbjct: 127 PCVDIFTNPGNTVIGDRSISSDPEMVAKHASALVRGYIKSDVITCAKHFPGHGNTIVDSH 186 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 ++ P + + + ++ F + +++D +M +H+ + +DP P + S +++ ++ Sbjct: 187 EDLPVENTDLERLESCELIPFKKTFK-SRVDMVMTSHIKFPKIDPEWPVTLSETFVRKMI 245 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+E+ + G+I +DDL M+ E +L AG D++L CN+ + D Sbjct: 246 REEMRYRGLIITDDLGMKAMTKHYGIEEVPVRALKAGVDLLLYCNDPEVPPQAYDA---- 301 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLH 331 + +GS +++L +++ + + + ++ Sbjct: 302 ----ILGALAQGSLKKEDL-EASYHRIMDFKKVKIQ 332 >UniRef50_A8F7W1 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F7W1_THELT Length = 502 Score = 337 bits (865), Expect = 3e-91, Method: Composition-based stats. Identities = 99/335 (29%), Positives = 162/335 (48%), Gaps = 23/335 (6%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ ++G EL E +L G +ILF+RN P QL +L++++R + +V ++DQE Sbjct: 12 MVGIDGTELTDEAARVLEQVKPGFIILFSRNIESPEQLYQLIKELRKVVKKPVVFSIDQE 71 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV- 123 GG V R R GFT A + AA A AG ++A EM ++ I + APVLD+ Sbjct: 72 GGIVTRLRNGFTVSCGAMALAATKDENN----AYLAGKILALEMKSLGITWNLAPVLDIN 127 Query: 124 -GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + IG RS+ PQK + A F G+ E + GKHFPG G+V D+H + P Sbjct: 128 DNPNNPGIGVRSFGDTPQKVIKFAKAFYKGLSEESVAACGKHFPGKGSVEVDAHLDMPVL 187 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELG 240 + E++ ++ F L++E L++IMP+H+ + D PA+ S L +LR+ L Sbjct: 188 QKSLEELKRWELIPFVELMKEG-LESIMPSHIHLPQLQPDRLPATVSKEILTDLLRKTLN 246 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 + G+ +DDL M G S + S +G D++ +C+ + +S L + Sbjct: 247 YHGIAVADDLLMGGITKAMSVEDAVINSFKSGMDVLSLCHEPQAQISAKKAL-------L 299 Query: 301 TRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 ++ + R R+ E++ Sbjct: 300 EQIKKNPELEK-------RLSESLGRIKNFKEKFY 327 >UniRef50_B2FPW9 Beta-hexosaminidase n=18 Tax=Xanthomonadaceae RepID=NAGZ_STRMK Length = 335 Score = 336 bits (863), Expect = 5e-91, Method: Composition-based stats. Identities = 139/328 (42%), Positives = 192/328 (58%), Gaps = 6/328 (1%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ V G EL A+ER+ L H V G++LF RN+ Q+ +L IRAA+ ++ VDQ Sbjct: 2 LLIGVAGTELTAQERDWLQHDAVAGVVLFKRNFASRQQVTDLSAAIRAAAPRPQLICVDQ 61 Query: 64 EGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQRFREG++ LP Q AL ++ LA+ WLMASE+ A +D+SFAPV+ Sbjct: 62 EGGRVQRFREGYSDLPPLQDIGALYATDPQQALALAERHAWLMASEVRASGLDLSFAPVV 121 Query: 122 DVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 D+G + AIG R++ DPQ A + ++ GMH GM T KHFPGHG V D+H +T Sbjct: 122 DLGRGNRAIGNRAFSEDPQVVAAFTAAYVRGMHSVGMAATLKHFPGHGTVLEDTHVDTAI 181 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 DPR E+RA+D+ F + I DA+M AHVIY + P PA SP W++ +LR ELGF Sbjct: 182 DPRALDELRAQDLVPFQAGI-AAGADAVMMAHVIYPQIAPEPAGYSPRWIQDILRGELGF 240 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS--PIKAER 299 GV+FSDD+ M + G R A LDAGCD++LVC+ + L + + Sbjct: 241 RGVVFSDDIGMAASHSAGGVPARVHAHLDAGCDVVLVCH-PELVDEALHAVQGRSLNTAA 299 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTRL 327 + L +G+ L+ AR TRL Sbjct: 300 LLGLIGRGALGWDGLLADARHGDTQTRL 327 >UniRef50_A1AVY9 Beta-N-acetylhexosaminidase n=2 Tax=Gammaproteobacteria RepID=A1AVY9_RUTMC Length = 327 Score = 336 bits (862), Expect = 8e-91, Method: Composition-based stats. Identities = 134/294 (45%), Positives = 190/294 (64%), Gaps = 6/294 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGP+M+DV G L +E++ L P +GG+ILF RN+ Q+ L++ IR ++N L++A Sbjct: 2 MGPIMMDVSGLALTTQEKQQLIKPSIGGVILFGRNFESIEQVTLLIKDIRQVNQN-LLIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR GFT LPA AL + A G+++A E++ + +D SFA Sbjct: 61 VDHEGGRVQRFRYGFTHLPAMSKLGALYDKNPDIVIDKAFSCGFVLAYELLDIGVDFSFA 120 Query: 119 PVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + S IG+R++H++P + +A I GMH+AGMK GKHFPGHG VTADSH Sbjct: 121 PVLDINYDTSQVIGDRAFHSNPDVIVKLAGSLISGMHKAGMKCVGKHFPGHGFVTADSHI 180 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 + P D RP +++ +DM F LI LDA+M AHV+Y+ VD + A S W+K +L+ Sbjct: 181 DLPIDDRPMSDL-LQDMKPFKRLINHG-LDALMYAHVVYTQVDDKIAGFSSKWIKDILQN 238 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L F GVIFSDDLSM+ A + R + SLD+GCDM+L+CN+ + ++D Sbjct: 239 QLKFKGVIFSDDLSMQSAHFIKDITNRVKTSLDSGCDMVLICNHPEFVAQIIDK 292 >UniRef50_C1TNN0 Beta-glucosidase-like glycosyl hydrolase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TNN0_9BACT Length = 541 Score = 336 bits (862), Expect = 8e-91, Method: Composition-based stats. Identities = 102/330 (30%), Positives = 154/330 (46%), Gaps = 20/330 (6%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR--NRLV 58 G V+ L G LD E R L GGLIL+ +N +Q+R L+ IR R L+ Sbjct: 33 GQVLALSFSGTSLDEETRFRLEEIKPGGLILYGKNVESLSQVRSLLSSIRERVRLDIPLM 92 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VAVDQEGG V R R P + A + LA+ G++M ++ ++ ID+ +A Sbjct: 93 VAVDQEGGTVARIRGHGADFPGNMALGATGDPD----LAERQGFIMGRQLKSLGIDLDYA 148 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PV+DV + IG RS+ D ++ + I G A M + KHFPGHG V DSH Sbjct: 149 PVVDVNSNPSNPIIGVRSFGDDVATVSSMGASMIKGFSRARMGCSAKHFPGHGDVDIDSH 208 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 P R +R + F S + E + A+M AH++ + PA+ SP + ++L Sbjct: 209 LGLPVLNRSLESMRYLEFPPFRSAV-EAGVPAVMTAHIVVPSLTGELPATLSPEAI-SLL 266 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R E+GF+GV+ SD + M + + +L AG D +L+ + + Sbjct: 267 RDEMGFEGVVLSDSMGMRAISNGWGVPDAVVMALRAGVDFVLLGADPAFPPEGHREVR-- 324 Query: 296 KAERVTRLYHKGSFSRQELMDSA----RWK 321 +R+ G + L D+ RWK Sbjct: 325 --DRIVEAVRSGELDKARLDDAVERILRWK 352 >UniRef50_A8MJC3 Glycoside hydrolase family 3 domain protein n=2 Tax=Clostridiaceae RepID=A8MJC3_ALKOO Length = 418 Score = 335 bits (861), Expect = 1e-90, Method: Composition-based stats. Identities = 89/332 (26%), Positives = 164/332 (49%), Gaps = 20/332 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA---ASRNRLVVA 60 +++ ++GY+++ +++ + VGG+IL+ RN QL +L ++ A+ L ++ Sbjct: 74 LLVGIDGYDMNDPTADLIQNYRVGGVILYGRNVKGTKQLLDLTNALKTSNTANNIPLFIS 133 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD+EGG V R + +LP A++ ++ LA E G L+ + + + +++FAPV Sbjct: 134 VDEEGGAVSRSPKEVKKLPTARAIGKTEDVD----LAYEVGNLLGAMVKSFGYNMNFAPV 189 Query: 121 LDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ + I +RS+ P + + GM G+ KHFPGHG + DSH Sbjct: 190 LDIDSNPANPVIEKRSFGKTPDIVSKMGIAEMKGMASEGVIPVVKHFPGHGDTSVDSHIG 249 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQ 237 P I + + F S I E + +M AH++ +D +PAS S + +LR+ Sbjct: 250 LPIIHHELERIMDFEAAPFKSAIEEG-AEVVMVAHILLKKIDAEKPASMSKVIITDILRE 308 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 LGF GV+ +DD++M + A ++++AG D+ILV + + A++ + Sbjct: 309 NLGFGGVVITDDMTMGAISKNYDMAAAVVSAINAGADIILVGHQYENAIAAFNG------ 362 Query: 298 ERVTRLYHKGSFSRQELMDSA-RWKAISTRLN 328 + + +G L +S R ++ + N Sbjct: 363 --IKKGIEEGRIKMDRLDESVHRILSLKNKYN 392 >UniRef50_B4RS37 Beta-hexosaminidase n=2 Tax=Alteromonas macleodii RepID=B4RS37_ALTMD Length = 340 Score = 335 bits (859), Expect = 2e-90, Method: Composition-based stats. Identities = 162/344 (47%), Positives = 217/344 (63%), Gaps = 15/344 (4%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGP+MLD + EL EE+E LAHP+VGG+ILFTRNYHD QL LV+ IR ++N +++A Sbjct: 1 MGPLMLDCQAEELLPEEKEKLAHPVVGGVILFTRNYHDKRQLSALVQDIRRFAKNNVLIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQS-FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 VD EGGRVQRFR+GFT +PA +E A+ G +A E+ +DID SFAP Sbjct: 61 VDHEGGRVQRFRDGFTAIPAMGDILRQSEHQDESAAYAKACGLALAYELKTLDIDFSFAP 120 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 VLD+ +S IG+R++ + + + +A++ I G+ M GKHFPGHG+V DSH Sbjct: 121 VLDINGVSQVIGDRAFSNNADEVIRLATQLIQGLKTVNMPAIGKHFPGHGSVAPDSHIAL 180 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQEL 239 P D R EIR DM VF +I LD +MPAHVIY+ V +PA S WLKT+L+ ++ Sbjct: 181 PVDERGFDEIRRTDMVVFERVIERALLDGVMPAHVIYNQVCDKPAGFSSTWLKTILKNDI 240 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP----- 294 GF+G +FSDDLSM GA++ GSY ER +A+LDAGCDM+L CNN KGA S+LD L Sbjct: 241 GFNGAVFSDDLSMHGASVAGSYVERAEAALDAGCDMVLACNNPKGAESILDGLEARFSNP 300 Query: 295 -----IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + ERV LY G L D ++++ + + +L ER Sbjct: 301 NNSDVARQERVMALY--GRVMPAGLYD--KYESAAALIARLSER 340 >UniRef50_Q1CXY4 Glycosyl hyrolase, family 3 n=6 Tax=Cystobacterineae RepID=Q1CXY4_MYXXD Length = 369 Score = 334 bits (858), Expect = 2e-90, Method: Composition-based stats. Identities = 106/339 (31%), Positives = 159/339 (46%), Gaps = 24/339 (7%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ G +D+E ++ + G ILF RN A L ++ + +++VDQE Sbjct: 16 MVGFPGTHIDSELAALMDD-GIYGAILFKRNVGSAADTAALCHALKTRAGRPFILSVDQE 74 Query: 65 GGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 GGRV R R FT LP + + + G L+A E+ A+ D FAPVLDV Sbjct: 75 GGRVARLRGAPFTALPPMRELGQRGDAAQ----VERVGRLLAYELRALGFDWDFAPVLDV 130 Query: 124 --GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG+RS+ + ++ + G+ G+ + GKHFPGHG T DSH P Sbjct: 131 DTNPANPVIGDRSFSREAEEVARLGVALARGLEAGGVASCGKHFPGHGDTTTDSHLTLPR 190 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELG 240 P +R+ ++ F + L ++M AHV++ +DP PA+ S L+ VLR+ELG Sbjct: 191 LPHDLERLRSVELVPFR-AFAQAGLASLMTAHVLFDALDPGVPATMSQRVLQGVLREELG 249 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 FDGV+ SDDL M+ A S E AG D+ LVC+N ++ L Sbjct: 250 FDGVLVSDDLEMKAIAGHYSVEEATVQGTLAGVDLFLVCHNADVQRRAIEAL-------- 301 Query: 301 TRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 + G SR+ + + R RL+ L R+ Sbjct: 302 VKAVESGRVSRERIAQAHR------RLDALSARFAHPAE 334 >UniRef50_C7MQM3 Beta-glucosidase-like glycosyl hydrolase n=2 Tax=Actinomycetales RepID=C7MQM3_SACVD Length = 618 Score = 333 bits (855), Expect = 5e-90, Method: Composition-based stats. Identities = 104/327 (31%), Positives = 156/327 (47%), Gaps = 35/327 (10%) Query: 18 REILAHPLVGGLILF----TRNYHDPAQLRELVRQIRAAS-----RNRLVVAVDQEGGRV 68 E++ VGG+I F T N P Q+ L ++ A+ R L+V+VDQEGGRV Sbjct: 96 AEVVRKYHVGGVIYFNHSGTDNIETPRQVARLSNGLQRAALQSQPRVPLIVSVDQEGGRV 155 Query: 69 QRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI-- 126 R E T P+A + A +E A + A+E+ AM I+ +FAPV DV Sbjct: 156 TRIAEHVTEYPSAMALGASRDVEGART----AAAISAAELRAMGINQNFAPVADVNSNPL 211 Query: 127 SAAIGERSYHADPQKALAIASRFIDGMH-----EAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG RS+ ADP+ A +DG + KHFPGHG +ADSH + P Sbjct: 212 NPVIGSRSFSADPELAGEFVEAQVDGYQNSRQPRQTVSAAAKHFPGHGDASADSHIDLPV 271 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQEL 239 R + + RA D+ F + E +DAIM AH+ +D PA+ S + +LR+EL Sbjct: 272 IDRSEEDWRANDLPPFERAV-EAGIDAIMTAHISVPSLDDSGDPATLSEPIITGLLREEL 330 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 G+DGV+ +D L M G M +E +L+AG D +L+ + AV+ Sbjct: 331 GYDGVVVTDSLGMAGVRQMYPDSEIPVRALEAGVDQMLMPPDLDAAVN-----------G 379 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTR 326 V G + + + +S + + + Sbjct: 380 VLDAVASGRITEERIDES-VLRILKLK 405 >UniRef50_Q2BGB3 Beta-hexosaminidase n=1 Tax=Neptuniibacter caesariensis RepID=Q2BGB3_9GAMM Length = 342 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 135/330 (40%), Positives = 198/330 (60%), Gaps = 11/330 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 + LD++G EL EE+ +L PL+GG+ILF RN + Q+ ELV IRA R L++++DQ Sbjct: 13 LFLDLDGLELTDEEKVLLLDPLIGGVILFARNTQNAKQVAELVASIRAI-RPDLILSLDQ 71 Query: 64 EGGRVQRFREGFTRLPAAQSFAALS-GMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 EGGRVQR ++G T LP ++ + S +EE A++ G+LMA+E+ +DID+SFAPV+D Sbjct: 72 EGGRVQRLKDGVTLLPPVKALSKFSTSLEENRSTAEQLGYLMAAELRQLDIDLSFAPVVD 131 Query: 123 VGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + + IG R++ D ++S +I+GM +AGM T KHFPGHG AD+H Sbjct: 132 IDYGRNTVIGNRAFGNDVSTVTVLSSGYIEGMRDAGMPATVKHFPGHGWANADTHHADAI 191 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 D R + I DM F I N ++A+M AHV+Y D PA S +W+ VLR LGF Sbjct: 192 DEREFSVIEQADMVPFKEAIT-NGVEAMMFAHVLYPQCDASPAGFSDFWIGNVLRDGLGF 250 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 GVIFSDDLSM+ A G YA+R + ++ AGC ++L CN+RKG ++VL+ + E + Sbjct: 251 QGVIFSDDLSMKAAHSAGGYAKRAEHAIRAGCQVLLCCNDRKGTLAVLEYMRSACVEPYS 310 Query: 302 RLY----HKGSFSRQELMDSARWKAISTRL 327 L L + + +++RL Sbjct: 311 ALSSLKGKAIKVDEVRLEQA---RVLASRL 337 >UniRef50_D1CEL0 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CEL0_THET1 Length = 596 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 88/326 (26%), Positives = 139/326 (42%), Gaps = 28/326 (8%) Query: 15 AEEREILAHPLVGGLILFT--RNYHDPAQLRELVRQIRAASRN-----RLVVAVDQEGGR 67 +I+ VG +I FT N + Q+ L I+ +S +++ DQEGG Sbjct: 86 DNASQIIQKYHVGSIIYFTWTGNIKNAQQVARLSNGIQRSSMTQRVPVPTLISTDQEGGI 145 Query: 68 VQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI- 126 + R R P + A A A +++ E+ A+ I+ + AP DV Sbjct: 146 ISRLPSQIVRSPGNMALGAARDKT----YAFRAAYIVGKELRALGINQNLAPDADVNINP 201 Query: 127 -SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRP 185 + IG RS+ + + + G + T KHFPGHG DSH P Sbjct: 202 FNPVIGVRSFGDRSWLVARLTAAQVRGYQSVDVAATAKHFPGHGDTNIDSHTGLPVIRHT 261 Query: 186 QAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDG 243 Q ++ D+ F + I ++ +D IM AH++ +DP RPA+ S L +LRQ+LGF G Sbjct: 262 QKQLEDIDLPPFRAAI-DSGVDVIMSAHIVVPSLDPSGRPATLSKPILTGLLRQKLGFKG 320 Query: 244 VIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRL 303 VI +D L M G M + ++ AG D++L+ + A++ V Sbjct: 321 VIMTDSLEMAGVRQMFPDSRVPVEAIKAGADLLLMPPDLNLAIN-----------SVVNA 369 Query: 304 YHKGSFSRQELMDSA-RWKAISTRLN 328 +G S + S R + RL Sbjct: 370 VERGEISVSRINASVLRILELKARLG 395 >UniRef50_B5EPA1 Glycoside hydrolase family 3 domain protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EPA1_ACIF5 Length = 354 Score = 332 bits (851), Expect = 1e-89, Method: Composition-based stats. Identities = 140/344 (40%), Positives = 194/344 (56%), Gaps = 11/344 (3%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 GP+M+D+ G A E +L P VGG+ILF RN D Q+R L ++ A L+V + Sbjct: 13 GPLMVDIAGLRPTAAECAMLRQPAVGGVILFARNCADAGQVRALCAELHALRTTPLLVGI 72 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAAL---SGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 DQEGGRVQR R G TR P + L G+E A++ G L+ +E+ A+ +D+ F Sbjct: 73 DQEGGRVQRLRAGVTRFPPMATLGRLADRDGLERALVAAEDWGLLLGTELRALGVDLDFT 132 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 P +D+ ISA IG+R+ H DP AIA+ GM G++ KHFPGHGAV ADSH Sbjct: 133 PCVDLDRGISAVIGDRAIHRDPVWVGAIAAALWKGMARTGLQGVAKHFPGHGAVAADSHL 192 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 + P D RP + I+ D+ F++LI + AIMPAH +Y VD PA SP WL+ VLR Sbjct: 193 DLPLDDRPFSAIQE-DLQPFAALI-AAGIPAIMPAHCLYPRVDGEHPAGFSPRWLQDVLR 250 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 +E+GF GV+ SDDLSM GA G A R A+L AG D++LVCN+ +GA + + ++ Sbjct: 251 KEMGFTGVVVSDDLSMAGAHGAGGVAARVDAALTAGADLLLVCNDPEGAETAIAHVRSRD 310 Query: 297 A----ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 A +R+ RL + + D R L L + + Sbjct: 311 ALPTGDRLERLAGNAATAALSAADRVRCLQQIEELRALAPGYTD 354 >UniRef50_A3CMS4 Beta-hexosamidase A, putative n=1 Tax=Streptococcus sanguinis SK36 RepID=A3CMS4_STRSV Length = 930 Score = 332 bits (851), Expect = 1e-89, Method: Composition-based stats. Identities = 94/346 (27%), Positives = 163/346 (47%), Gaps = 37/346 (10%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS--------RNRLVVAVD 62 EL+ E E + GG+ILF N AQ L + ++ A+ + L++A+D Sbjct: 170 TELNKEVAEAVDKYDFGGVILFAENVKGTAQTLALTQALQQAAISNQANNGKLPLLLAID 229 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 QEGG V R G T LP + A E LA +AG ++ E+ A+ ++++FAPV D Sbjct: 230 QEGGIVYRLGSG-TALPGNMAVGATRDPE----LANQAGQIIGRELSALGLNVNFAPVFD 284 Query: 123 VGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 + + IG RS+ +DP+ + + ++G+ + + KHFPGHG DSH P Sbjct: 285 TNNNPQNPVIGLRSFSSDPKVVARLGTAMMEGIQKYNVAVAAKHFPGHGDTAVDSHTGLP 344 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGS 227 + AE+ A ++ F + I +N D +M AH+ Y ++ PA+ S Sbjct: 345 LVDKSYAELEALELLPFKAAIDKN-ADMLMTAHIQYPQIEKDTVISKQTGETIYIPATLS 403 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + ++R++ G+ GV+ SD + M+ A +E ++ +G D++L+ + + S Sbjct: 404 DDIITGIVRKKFGYKGVVVSDAMGMDAIAKNFGESEAAIMAIKSGVDVVLMPTVLR-SKS 462 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 L + I + + +G L +S TR+ L E+ Sbjct: 463 DLAKIDKIIND-IELAVQRGDIPVSRLDESV------TRILNLKEK 501 >UniRef50_C9NRN7 Beta-hexosaminidase n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NRN7_9VIBR Length = 616 Score = 332 bits (851), Expect = 2e-89, Method: Composition-based stats. Identities = 97/340 (28%), Positives = 155/340 (45%), Gaps = 24/340 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVVAVDQEGG 66 E++ R I+ + VGG+ILF+ N D Q EL+ ++ A L V +DQEGG Sbjct: 19 TEINESVRHIIGNDKVGGVILFSSNLVDINQTGELIWNMQNAVEEDQEVGLFVTIDQEGG 78 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV-- 123 V R T LP + A + +LA E+G ++A E+ ++ +++FAPV+DV Sbjct: 79 NVVRLPRAQATNLPGNMALGAAYLNDNNPELAYESGRILAKEIKSVGFNVNFAPVVDVQS 138 Query: 124 GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 ++ I RSY DPQ + + GM + + T KH+PGHG +DSH P Sbjct: 139 NPLNPVINVRSYGEDPQLVGLLGGQSAKGMADENVIGTFKHYPGHGDTESDSHYGLPIVN 198 Query: 184 RPQAEIRAKDMSVFSSLIREN-KLDAIMPAHVIYSDVDP-------------RPASGSPY 229 + E A D++ + I + D IM AH+ Y +D PA+ S Sbjct: 199 KSIEEAYAIDLAPYKEAIDQGLAPDMIMTAHIQYPALDDSEVVGSRSGETIVVPATLSHK 258 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 +LR+EL + G+ +D L M+G A A+ + AG D+ L+ + Sbjct: 259 IQTELLREELNYQGLTITDALDMQGIAAHFEEADAVIKTFQAGVDIALMPTLFRTTSG-- 316 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + V G S Q + +S + I+T+L + Sbjct: 317 EGQLGELIDEVVAAVEAGELSEQSIDESVE-RIIATKLKR 355 >UniRef50_C4ZK55 Glycoside hydrolase family 3 domain protein n=6 Tax=Betaproteobacteria RepID=C4ZK55_THASP Length = 381 Score = 331 bits (850), Expect = 2e-89, Method: Composition-based stats. Identities = 150/311 (48%), Positives = 193/311 (62%), Gaps = 10/311 (3%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 GPVM+DV G L EERE L PLVGG+ILF RNY QLR L +IR L++AV Sbjct: 10 GPVMIDVAGTALTDEERERLRDPLVGGVILFARNYTGSEQLRALTAEIRGLRDPALIIAV 69 Query: 62 DQEGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFA 118 D EGGRVQRFR +GFTRLP+ +S AL + L A+ G ++A+E+ A +D+SF Sbjct: 70 DHEGGRVQRFRTDGFTRLPSMRSLGALWAQDHLVALDAARATGVVLAAELRAHGVDLSFT 129 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + AIG R++H DPQ A+A GM EAGM GKHFPGHG V ADSH Sbjct: 130 PVLDLDYGCCRAIGNRAFHRDPQVVAALAQALCAGMAEAGMGCVGKHFPGHGFVEADSHH 189 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD----PRPASGSPYWLKT 233 + P D R + +D++ + + +L +MPAHV+Y + D P+PA SP+WLK Sbjct: 190 DVPVDERDFDTVWNEDIAPYRHRLGR-QLAGVMPAHVVYPNADPSPEPQPAGFSPFWLKE 248 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 VLR LGF VIFSDDL+MEGA + G R +A+ AGCDM+LVCN A +LD + Sbjct: 249 VLRDRLGFQWVIFSDDLNMEGARVAGDIVGRAKAAYAAGCDMLLVCNRPDLAAELLDRWA 308 Query: 294 P-IKAERVTRL 303 P + A + RL Sbjct: 309 PDLDAGNLARL 319 >UniRef50_D0L0K9 Glycoside hydrolase family 3 domain protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0K9_HALNC Length = 362 Score = 331 bits (850), Expect = 2e-89, Method: Composition-based stats. Identities = 131/318 (41%), Positives = 184/318 (57%), Gaps = 9/318 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+M+DV G L ER L +P VGG+ILF RN+ QL L +IRA L++A Sbjct: 4 LGPLMVDVAGLTLTDAERTRLMNPAVGGVILFARNFSSTEQLTALTAEIRALRHPHLLIA 63 Query: 61 VDQEGGRVQRFRE-GFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISF 117 VDQEGGRVQRFR+ GF LP AL + G + A+ GWLMA ++ +D+SF Sbjct: 64 VDQEGGRVQRFRDDGFIHLPPMGRLGALFKHDRQAGLEAAKSLGWLMAVQLRMAGVDLSF 123 Query: 118 APVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 APVLD+ +S IG+R++ DP +A+A +I GM EAGM GKHFPGHG++ DSH Sbjct: 124 APVLDLDRGVSGVIGDRAFSRDPASIIALAVAWISGMKEAGMAAVGKHFPGHGSIAEDSH 183 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 P D R + I D+ F ++I E + A+M AHV+Y +D PA S W++ +LR Sbjct: 184 IAHPVDHRALSTIERDDLRPFVAMI-ERFIPALMMAHVVYPKIDALPAGFSKVWVQDLLR 242 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD----NL 292 + + G +F+DDL+M A++ G R A+L AGCDM L+CN+ + +L Sbjct: 243 KRWQYQGAVFTDDLTMAAASVAGDIGARVDAALAAGCDMALICNHPQWVDQLLTHRKFTA 302 Query: 293 SPIKAERVTRLYHKGSFS 310 P++ R+ L G Sbjct: 303 DPLREVRLAHLMGHGPVP 320 >UniRef50_Q15N77 Glycoside hydrolase, family 3-like n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15N77_PSEA6 Length = 633 Score = 331 bits (850), Expect = 2e-89, Method: Composition-based stats. Identities = 96/334 (28%), Positives = 151/334 (45%), Gaps = 23/334 (6%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVAVDQEGG 66 L + +E++A VGG+ILF N D Q+ L R ++AAS + L++++DQEGG Sbjct: 64 TTLPDDLKEMIADTGVGGIILFANNLVDTEQMLRLNRDLQAASASGGHSPLLISIDQEGG 123 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 RV R + T + A E G A +G ++A E+ + +++FAP +DV Sbjct: 124 RVVRIPQNISTAFSGNMAIGATY-AEHGTHFATLSGDVIAKELAVLGFNVNFAPSVDVNV 182 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I RS+ DPQ + + M + G+ KHFPGHG + DSH P Sbjct: 183 NPENPVINVRSFGEDPQVVAELGLAQMQAMQQNGIIAALKHFPGHGDTSTDSHSGLPLVE 242 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYWL 231 +I+A D++ F I IM AH+ Y +D PA+ S L Sbjct: 243 HDLHKIKAVDLAPFQYAIDNGDPGMIMTAHIQYPALDSSTFTATDGSKTILPATMSRAIL 302 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR ++GF GVI +D L M A + + AG D+ L+ + A + Sbjct: 303 TDLLRDQMGFRGVIITDALDMAAIAHFYEPTQAVLQTFKAGTDIALMPLAIRTAQD-IPK 361 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDS-ARWKAIS 324 L + A+ + G + E+ S AR + + Sbjct: 362 LKKMIAD-LAYAVQIGDVTLAEIETSVARIQTLK 394 >UniRef50_C6CWS0 Beta-N-acetylhexosaminidase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CWS0_PAESJ Length = 535 Score = 331 bits (849), Expect = 3e-89, Method: Composition-based stats. Identities = 96/347 (27%), Positives = 156/347 (44%), Gaps = 28/347 (8%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNR 56 G + + GYE E ++ +GG I F RN D Q+ +L ++ + R Sbjct: 16 GQLFVFGFHGYEPTEEITSLIEKYGIGGTIYFGRNVRDAKQVHQLSSSLKNIAGRAGRPN 75 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 ++VA+DQEGG V R G T +P + A E G E + E+ + + ++ Sbjct: 76 MLVAIDQEGGMVARIVNGVTLMPGNMALGATGSAEGAG----ETARVCGEELRLLGVTMN 131 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 +AP LDV + + I RSY Q + + G + + T KHFPGHG + D Sbjct: 132 YAPCLDVNNNPDNPVINVRSYGDRAQIVSELGKAALLGYQASKVAATVKHFPGHGDTSVD 191 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLK 232 SH + P P + + A +++ F + I M AHV +DP +P++ S L Sbjct: 192 SHHDLPVLPHSRERLDAIELAPFRAAIAAGTAAI-MTAHVCLPALDPSGKPSTLSQPVLT 250 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR+ELG++GVI +D L M+ A ++ AG DMILV + + V+ L+ + Sbjct: 251 GLLREELGYNGVIVTDCLEMDAIDRFYGPANGAVQAIQAGADMILVSHTYEKQVAALEAV 310 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 G S + + +S R+ QL + ++ E Sbjct: 311 VAAVE--------NGELSEERINES------LARVMQLKQDYEIEMP 343 >UniRef50_B3QTH0 Beta-N-acetylhexosaminidase n=12 Tax=Chlorobiaceae RepID=B3QTH0_CHLT3 Length = 384 Score = 330 bits (848), Expect = 3e-89, Method: Composition-based stats. Identities = 97/346 (28%), Positives = 170/346 (49%), Gaps = 32/346 (9%) Query: 4 VMLDVEGYELDAEEREI--LAHPLVGGLILF---------TRNYHDPAQLRELVRQIRAA 52 +M+ G + + I + +GG+ILF +RN P Q+ L+++++AA Sbjct: 50 LMIGFRGLHVADSSKVIVDIRQQRIGGVILFDYDLPLKQASRNIQSPEQVCALIKELQAA 109 Query: 53 SRNRLVVAVDQEGGRVQRFREGFTRLP---AAQSFAALSGMEEGGKLAQEAGWLMASEMI 109 + L++A+DQEGGRV R +E F P +A ++ ++ K A++ A + Sbjct: 110 APIPLLIAIDQEGGRVNRLKEKFG-FPKSVSAAWLGEVNSLDSTQKYAEQ----TAKTLA 164 Query: 110 AMDIDISFAPVLDVGHI--SAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHF 165 A+ I+++FAPVLDV + IG+ RS+ +P+ + H+ G+ + KHF Sbjct: 165 ALGINLNFAPVLDVNTNPENPVIGKLARSFSENPEIVAEHGLATVKAFHQFGVLSAVKHF 224 Query: 166 PGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PA 224 PGHG+ DSHK + ++ F +I+ + D +M AHV + +D PA Sbjct: 225 PGHGSAWNDSHKGLADVT---ETWQPLEIEPFKRVIQAGECDMVMTAHVFNAKLDTTFPA 281 Query: 225 SGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKG 284 + S + +LR+ELGFDGV+ SDD+ ME + +L+AG D+++ NN Sbjct: 282 TLSQNVITGLLRKELGFDGVVVSDDMQMEAIRSFYGLETAVRLALNAGVDLLVFANNSVF 341 Query: 285 AVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQ 329 + + + + + +G SR+ + S R + +RL+ Sbjct: 342 EPDIAER----AHQMIRAMVLQGKVSRERIDASYQRLMKLKSRLSN 383 >UniRef50_UPI00016BFC75 Beta-glucosidase-related glycosidase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFC75 Length = 509 Score = 330 bits (846), Expect = 5e-89, Method: Composition-based stats. Identities = 83/318 (26%), Positives = 152/318 (47%), Gaps = 13/318 (4%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVA 60 M+ + ++++ ++ VG +ILF RN P Q+ +L++ ++ + + + Sbjct: 12 MVGFDDTKINSHIENLIKDFNVGNVILFERNCKTPEQILKLIQNLQKLAIKYNGVPMFIG 71 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +DQE G V R EG P + A + EE + + A+ I+ + AP Sbjct: 72 IDQENGMVTRIGEGVCTFPGNMAQANGATEEE----IYQVAKYTGEMLEALGINYNLAPS 127 Query: 121 LDVGHISAAI--GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ + G RS+ +P+ + I G +AG+ +T KHFPGHG DSH Sbjct: 128 LDINNNPNNPIIGIRSFGENPETVTNMGLACIKGFQDAGIISTAKHFPGHGDTAIDSHLT 187 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 PC P + + ++ F + I + + +IM AH+I+ +P PA+ SP L +LRQ Sbjct: 188 LPCIPHSKERLEKVELYPFKAAI-DAGVKSIMTAHIIFPAYEPVYPATLSPLVLTNLLRQ 246 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 +LGF +I +D + M+ + + + ++ AG D++ + + K L+ + I+A Sbjct: 247 DLGFHNLIITDCMEMKAISDTYTTQQASVMAIAAGADIVCISHTEKLQREALEE-TLIQA 305 Query: 298 ERVTRLYHKGSFSRQELM 315 E+ L K + ++ Sbjct: 306 EQNMXLQAKIDTAADRIL 323 >UniRef50_Q2SIQ7 Beta-glucosidase-related Glycosidase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SIQ7_HAHCH Length = 346 Score = 330 bits (846), Expect = 5e-89, Method: Composition-based stats. Identities = 142/305 (46%), Positives = 188/305 (61%), Gaps = 7/305 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+++D++G ++ AEE+E+L+ P+VGG+I F RNY DP QL LVRQIRA R L++ Sbjct: 7 LGPLIIDLKGLDVSAEEKELLSRPIVGGVIFFARNYTDPQQLDALVRQIRAI-RPELLLC 65 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRFREG T LPA A +A E GWL A E+ +D+D SFA Sbjct: 66 VDQEGGRVQRFREGRTLLPAPGKLRASYQENPAAAKAMATELGWLTAVELRELDLDFSFA 125 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV + + I +R++ P A A+A F GM EAG GKHFPGHG VT D+H Sbjct: 126 PVLDVDYGKNKVIADRAFGDTPTMAAALAQAFWRGMGEAGCAGVGKHFPGHGFVTEDTHI 185 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E+ DPR E+ DM F +I E ++A+MPAHVIY DP PAS S +WL VLR+ Sbjct: 186 ESARDPRAYEELLGADMRPFIEMI-EAGIEAMMPAHVIYPACDPSPASQSSFWLTQVLRE 244 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERG--QASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 L + G I SDD+ M+GA + E +L AGCD+++ CNN + + V D L + Sbjct: 245 RLQYQGAIISDDMGMQGADLEAEEDESASVVRALRAGCDLLMACNNPERLLRVADQLESL 304 Query: 296 KAERV 300 E + Sbjct: 305 PGELL 309 >UniRef50_Q9RUP9 Glycosyl hyrolase, family 3 n=2 Tax=Bacteria RepID=Q9RUP9_DEIRA Length = 348 Score = 329 bits (844), Expect = 1e-88, Method: Composition-based stats. Identities = 90/299 (30%), Positives = 147/299 (49%), Gaps = 10/299 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +M+D+ G ELD + L +G + LF +N QLR L +R ++A+D Sbjct: 19 LMIDLPGPELDKDTAAYLREHGIGAVCLFGKNVESAEQLRRLCADLREVMGEHALIAIDH 78 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD- 122 EGG + R E + P+A S A +L ++ +A ++ ++ I+ +F PVLD Sbjct: 79 EGGAITR-PEFWPFAPSAMSLGAADDQ----QLTEDVNAALARQLRSVGINWNFTPVLDI 133 Query: 123 -VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 V + IG+R+Y +D + + G G+ KHFPGHG DSH P Sbjct: 134 NVNPANPVIGDRAYGSDAARVTRHGRAALAGHTREGVAPCAKHFPGHGDTHQDSHLALPR 193 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 + +AE+ A +++ F +L+ E AIM AH++Y +D PA+ SP L +LR+E G Sbjct: 194 VSKSRAELDAGELAPFRALLPET--PAIMTAHIVYDALDAEHPATLSPRILTGLLREEWG 251 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 +DGVI +D + M+ E +L AG D+++ R+ + L ++ E Sbjct: 252 YDGVIVTDSMGMQAIDANYGRGEAAVRALRAGADLVMALGRREVQQATLAAVAEYVPEN 310 >UniRef50_UPI0000E0E41F Beta-hexosaminidase A precursor n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0E41F Length = 599 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 89/334 (26%), Positives = 151/334 (45%), Gaps = 23/334 (6%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA---SRNRLVVAVDQEGGR 67 L E ++ +GG+ILF+ N Q+++L I+ SR + +A+DQEGGR Sbjct: 47 TALPNELASLIKKHQLGGVILFSENVQSVTQIQQLTSDIQKLRPHSRLPMFIAIDQEGGR 106 Query: 68 VQRFREGFTR-LPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI 126 V R F+ + A G A + G + ++ ++ I+++FAP +DV Sbjct: 107 VARLPTAFSPAYSGNMAIGATYDA-HGTAYAAKVGEYIGRDLASVGINVNFAPSIDVNAN 165 Query: 127 --SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + I RS+ + + + M G+ T KHFPGHG DSH P Sbjct: 166 PSNPVINVRSFGESAEMVSELGGALVQHMQSQGVLTALKHFPGHGDTVVDSHTGLPQVNH 225 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYWLK 232 I +D++ F +I+E++ +M AH+ Y +D RPA+ SP L Sbjct: 226 DLPTIMKQDIAPFKHIIKEHEPAFVMTAHIQYPQLDNSTLLAKNGTEQVRPATLSPVILT 285 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR LGF G++ +D L M G A S E + +AG D+ L+ + + + Sbjct: 286 DLLRGTLGFKGIVITDALDMAGIAQYFSPQEAIIETFNAGADIALMPYTIRN-LEDIQAF 344 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 S + E +L G F+ + ++ + ++T+ Sbjct: 345 SSLMDELERQLEKSGDFTA---LFASYTRIMNTK 375 >UniRef50_Q08U63 Beta-glucosidase n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08U63_STIAU Length = 611 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 105/350 (30%), Positives = 156/350 (44%), Gaps = 29/350 (8%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA--ASRNRLVVAV 61 +M+ G LD E ++ VGG+ LF N H Q+ L +R A VA+ Sbjct: 80 MMVGFRGTVLDEEVESLVRGRRVGGVCLFKHNIHSARQVARLNDGLRRLLADHIPPFVAL 139 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 DQEGG V R + RLP + A E LA AG ++ + +++ APVL Sbjct: 140 DQEGGNVVRVSDQVVRLPGNMALGATRSTE----LAYAAGRAQGEDLRRLGFNMNLAPVL 195 Query: 122 DVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV + IG RSY + F G EAG+ T KHFPGHG+ + DSH+ Sbjct: 196 DVNLNPHNPVIGIRSYGDSVSLVSEMGRAFARGQQEAGLVTVAKHFPGHGSTSTDSHEVL 255 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQ 237 P + E+ M F ++++E LD +M AHV + D PA+ SP L+ +LR+ Sbjct: 256 PVMRETREEV-LTQMEPFRAVLQEG-LDGLMTAHVAIPGLTGDSVPATLSPQVLEGLLRR 313 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 +LGFDG++ +D+L ME ++ AG DM+LV + V + L Sbjct: 314 DLGFDGLVLTDELEMEAIVQRYGVGRAAVLAMKAGADMVLVPWRPEKKTEVYEAL----- 368 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQ--------LHERWQEEKA 339 H+G + L + R + + +L + L ER Sbjct: 369 ---LDAAHEGELPPERLEQAVR-RILIAKLRRGLFEAPPLLEERLAAPPP 414 >UniRef50_C1D8P0 NagZ n=4 Tax=Proteobacteria RepID=C1D8P0_LARHH Length = 362 Score = 328 bits (841), Expect = 2e-88, Method: Composition-based stats. Identities = 149/316 (47%), Positives = 187/316 (59%), Gaps = 8/316 (2%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 GPVM+DV G L ++R L+HP+VG +ILF RN+ QLR L +IR+ L++AV Sbjct: 8 GPVMVDVAGCSLSDDDRRRLSHPMVGAVILFRRNFESVGQLRSLTEEIRSLRSPHLLIAV 67 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 D EGGRVQRF GFTRLP Q+ AL E LA + G ++A+E+ A +D+SF P Sbjct: 68 DHEGGRVQRFLTGFTRLPPMQALGALWEKSPEAALDLAHDVGEVLATELRASGVDLSFTP 127 Query: 120 VLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 VLD+ H A IG RS+HADP +A G+ GM GKHFPGHGAV DSH Sbjct: 128 VLDIDHGRCAVIGNRSFHADPNIVATLAEALTRGLVAGGMIACGKHFPGHGAVEGDSHHV 187 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P D R A I D+ F+ L + A+MPAHV+Y VDP PA SP+WL+TVLR+ Sbjct: 188 IPVDHRSYASIEEADLIPFARL-AAAGMHAVMPAHVVYDAVDPLPAGFSPFWLQTVLRER 246 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL----SP 294 +GF G IFSDDL MEGAA G A R QA+ AGCDM LVCN A +L L +P Sbjct: 247 MGFGGTIFSDDLCMEGAAGAGGIAARAQAAFAAGCDMALVCNRPDLADELLAQLDMPVTP 306 Query: 295 IKAERVTRLYHKGSFS 310 R+ + G + Sbjct: 307 ELTGRLAAMAGHGEVA 322 >UniRef50_B6BUL6 Beta-hexosaminidase n=1 Tax=beta proteobacterium KB13 RepID=B6BUL6_9PROT Length = 339 Score = 327 bits (839), Expect = 4e-88, Method: Composition-based stats. Identities = 130/294 (44%), Positives = 188/294 (63%), Gaps = 7/294 (2%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +++D+EG E+ + + L+HP+V G+ILFTRN+ + Q+++L+ I R+ L++ V Sbjct: 4 GSIIIDIEGKEITQRDIQRLSHPVVCGVILFTRNFENKNQIQDLISSI-KLVRDDLLITV 62 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 D EGGRVQRFRE F +P+ L E+G +LA+ AGWL+ASE+ + ID++++P Sbjct: 63 DHEGGRVQRFRENFYAMPSMSELGDLYAHDSEQGLRLARYAGWLIASELGELHIDLNYSP 122 Query: 120 VLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 VLD+ + S+ IG RS+ +P+ + +A FIDG H AGM + GKHFPGHG + DSH+E Sbjct: 123 VLDINYGKSSVIGNRSFSDNPKAIIELAKAFIDGQHHAGMTSVGKHFPGHGFIHEDSHEE 182 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 D R AE+ D+ F SL+ +L A+MP HVIY D +PAS S +WLK LR++ Sbjct: 183 ISIDNRSLAEME-NDIDCFKSLVNH-RLGAVMPGHVIYKKSDNKPASLSSFWLKDFLRKK 240 Query: 239 LGFDGVIFSDDLSMEGAAIMG-SYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 L F G+I SDDLSM+G A ER +L AGCD++L+CN + LD Sbjct: 241 LNFKGIIISDDLSMKGVADYIPDIYERVSQALTAGCDVVLICNKPEDVDEFLDR 294 >UniRef50_C5CGK3 Glycoside hydrolase family 3 domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CGK3_KOSOT Length = 528 Score = 327 bits (839), Expect = 4e-88, Method: Composition-based stats. Identities = 107/349 (30%), Positives = 171/349 (48%), Gaps = 29/349 (8%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-NRLVVAVDQ 63 + +G + + + ++ V G ++F RN D Q++ELV QI+ VAVDQ Sbjct: 34 LFGFQGISYNDQIQLLIDK-GVRGFVIFGRNVKDKDQVKELVEQIKTHCDDIPPFVAVDQ 92 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R R+ F P A +++ E L + G+L + + I++ FAPVLDV Sbjct: 93 EGGMVARIRDIFVP-PNAMLLGSVNDTE----LTYKVGYLTGKALRDLGINMDFAPVLDV 147 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG RS+ +P + F G+ AG+ GKHFPGHG DSH P Sbjct: 148 NSNPFNPIIGVRSFWNEPTRVATNGVAFFKGLLSAGVIPVGKHFPGHGDTDTDSHTSLPV 207 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 +P++ D+S F + ENK+ A+M AH++ D PA+ S + + LR++LGF Sbjct: 208 INKPESNFIETDVSPFKIAV-ENKIPALMTAHILLPFWDNVPATISKKII-SYLREKLGF 265 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 DGVI SDD+ M+ + S E + SL AG DM ++ + + + V A+ + Sbjct: 266 DGVIISDDMLMKAVSEGKSVKEAVKQSLLAGVDMFIIWKDLETQLEV--------ADYIL 317 Query: 302 RLYHKGSFSRQELMDSA------RWKAISTRLN----QLHERWQEEKAG 340 + G+ + + ++ R A ++ LN Q+ + W+E G Sbjct: 318 QEVKNGNIPEEVIDNAVKRITKLRLLAYASELNELDLQVEKIWEEISDG 366 >UniRef50_D1AF10 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AF10_THECD Length = 543 Score = 327 bits (838), Expect = 5e-88, Method: Composition-based stats. Identities = 100/322 (31%), Positives = 160/322 (49%), Gaps = 14/322 (4%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G + + DAE++ + VGGLI F N P Q L ++ AS+ L++AV Sbjct: 64 GQLFVPTFASRQDAEQK--IKKYHVGGLIYFPDNARSPQQTARLSNALQRASKIPLLLAV 121 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 D+E G V R TR P + A + + A+EA ++ +E+ A+ I+ ++APV Sbjct: 122 DEEQGLVTRL-SYVTRFPGNMALGATA----KPQTAREAAKVIGTELRAVGINQNYAPVA 176 Query: 122 DVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV + IG RS+ +DP + + G +AG+ T KHFPGHG DSH Sbjct: 177 DVNVNPANPVIGVRSFGSDPGLVSLMLGGALQGYRDAGVAATVKHFPGHGDTDTDSHTGL 236 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQ 237 P +AE D F + I + +DAIM AH++ +D PA+ S L +LR+ Sbjct: 237 PVIRHSRAEWERLDAPPFRAAIAQG-VDAIMTAHIVMPGLDDSGDPATLSQAVLTGLLRK 295 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ELG+ GV+ +D LSM GA + ++ AG D +L+ + GA + L+ ++ Sbjct: 296 ELGYRGVVVTDSLSMAGARTRYGAEQAAVRAVQAGADQLLMPPDLAGAHAA--VLAAVRD 353 Query: 298 ERVTRLYHKGSFSRQELMDSAR 319 R+++ + S +R + + R Sbjct: 354 GRISQRRLEESVTRILRLKAER 375 >UniRef50_C6WFB0 Glycoside hydrolase family 3 domain protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WFB0_ACTMD Length = 580 Score = 326 bits (836), Expect = 7e-88, Method: Composition-based stats. Identities = 91/325 (28%), Positives = 144/325 (44%), Gaps = 30/325 (9%) Query: 18 REILAHPLVGGLILFTR----NYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE 73 +++ GG+I F N PAQ+ L ++ ASR L ++ DQE G V R Sbjct: 80 AQVVTRYRPGGVIYFNNSSYDNIDTPAQIAALSNGLQQASRVPLTISTDQEMGIVTRIGP 139 Query: 74 GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIG 131 T+ P + + A + A+ A + A E+ A+ I +FAP DV + IG Sbjct: 140 PLTQFPGSMALGAG----RDERAAERAASVTARELRALGITQNFAPDADVNSNPANPVIG 195 Query: 132 ERSYHADPQKALAIASRFIDGMHEAG-----MKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 RS+ +DP A + + + G + + KHFPGHG DSH P R Sbjct: 196 VRSFSSDPDLAATMVAAQVRGYQQRHLSRTAVSAAAKHFPGHGDTADDSHTSLPTVDRTV 255 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGV 244 + R D F + I +D+IM AH+ +DP PA+ S + +LR+ELG+DGV Sbjct: 256 EQWREVDAKPFRAAI-AAGVDSIMTAHIRMPAIDPSGEPATLSKPVVTGLLREELGYDGV 314 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY 304 + +D L+M G + + AE +L AG D +L+ A+ V Sbjct: 315 VITDSLAMAGVRQLHTDAEIPVLALKAGVDQLLMPVKLGEAIDA-----------VVAAV 363 Query: 305 HKGSFSRQELMDSARWKAISTRLNQ 329 G S + + D + + + + + Sbjct: 364 RAGELSERRI-DQSVLRVLRMKFLR 387 >UniRef50_D2BER2 Beta-hexosamidase A n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2BER2_STRRD Length = 427 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 98/319 (30%), Positives = 151/319 (47%), Gaps = 24/319 (7%) Query: 18 REILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAASR-NRLVVAVDQEGGRVQRFREG 74 + +A GG+ILF N + Q+ L ++ AS L+V DQE GRV R Sbjct: 91 AKAVARYRPGGVILFPWAGNVKNVRQVVALTNGLQKASPEIPLLVGADQENGRVSRMAPL 150 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIGE 132 T +P A + LA++A + +E+ A+ I++ FAPV DV + IG Sbjct: 151 VTEMPGASVIGSTGDP----SLARKAAKVTGTELRALGINLDFAPVADVNINPRNPVIGP 206 Query: 133 RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAK 192 R+Y +DP+K + + + G H+AG+ +T KHFPGHG DSH P ++ Sbjct: 207 RAYGSDPKKVAPMVAAAVQGFHDAGIASTAKHFPGHGDTNVDSHSGLPVIQHSLSQWNKL 266 Query: 193 DMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGVIFSDDL 250 D F++ I +N +DAIM AHV+ +DP PA+ S L +LR++LGFDGV+ +D L Sbjct: 267 DAPPFAAAIGKN-IDAIMSAHVVMPKLDPSGDPATLSKPILTGLLREKLGFDGVVSTDAL 325 Query: 251 SMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFS 310 M G + ++ AG D++L+ + A V G S Sbjct: 326 DMAGVRKKYGDGQVAVRAIQAGVDLLLMPPDFPKAYGA-----------VLAAVKSGKIS 374 Query: 311 RQELMDSARWKAISTRLNQ 329 L S R + + + + Sbjct: 375 TARLDQSVR-RLLKLKAAR 392 >UniRef50_B3T2B0 Putative glycosyl hydrolase family 3 N terminal domain protein n=2 Tax=prokaryotic environmental samples RepID=B3T2B0_9ZZZZ Length = 340 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 132/335 (39%), Positives = 196/335 (58%), Gaps = 7/335 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+ + VEG L +++IL+H LVGG++LF+RNY + QL+ L++ I++ L+VA Sbjct: 3 LGPIFVSVEGKTLTPGDQKILSHNLVGGVVLFSRNYENKDQLKLLIKSIKSLKSPDLLVA 62 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRF GF +P+ +S L E G + A ++ E++ + +D SFA Sbjct: 63 VDQEGGRVQRFISGFYPIPSMRSLGNLYDQSCENGMQATCIAAEILGLELLEVGVDFSFA 122 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ H +S IG RS+H+DP+ + +F +G+ +AGM KHFPGHG V DSH+ Sbjct: 123 PVLDIDHGVSEVIGSRSFHSDPKVVDCLGEQFYEGLQKAGMNCVAKHFPGHGGVREDSHQ 182 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 +TP D R I D+ + +LI+ L IM AHV Y VD + A+ S YWL+ L++ Sbjct: 183 KTPVDNREMKAIIE-DLQPYKTLIKRG-LSGIMTAHVRYPLVDDQIATFSHYWLRKCLQK 240 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 EL F+G+IFSDDL M+G + S ++ S AGCD +L+CN+ + L L K Sbjct: 241 ELHFNGIIFSDDLMMKGTDFITSIPKKALLSFAAGCDFVLICNSPETVNQCLSELHYAKD 300 Query: 298 E--RVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 + + R Q + + I T+LN+L Sbjct: 301 QFANLERKIQYLIPFHQGQRQAKYLQEIKTKLNRL 335 >UniRef50_A7EUL4 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EUL4_SCLS1 Length = 553 Score = 325 bits (833), Expect = 2e-87, Method: Composition-based stats. Identities = 95/349 (27%), Positives = 154/349 (44%), Gaps = 39/349 (11%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-----L 57 P ++ G+ + + + ++ VG +ILF RN D AQL +L ++ + + L Sbjct: 33 PFIVGFHGHVTNEDIKTLITVHRVGTIILFQRNIADAAQLFQLAASLQEIAEDSGHAQSL 92 Query: 58 VVAVDQEGGRVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 +A DQE G V R T LP + + A + A A + A I+++ Sbjct: 93 FIATDQENGLVTRVNPPIATELPGSMALGAANDPSN----AFSVAVCTAKILKAYGINMN 148 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 +APV DV + IG RS+ DP S ++G+ + G+ + KHFPGHG D Sbjct: 149 YAPVADVNSEPKNFVIGVRSFSDDPGTVGRFVSSQVEGLQQNGILSCVKHFPGHGDTAVD 208 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--------PASG 226 SH P + + E+ A ++ F ++ +DAIM AH+ ++ PAS Sbjct: 209 SHHGLPVITKSKEELEACELVPFWRAVKAG-VDAIMTAHISLPQLNSNSNVATNQLPASL 267 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL---VCNNRK 283 SP ++ +LRQE+ +DG+I SD L M+G + +L L C+ K Sbjct: 268 SPDAIR-ILRQEMNYDGLIVSDCLEMDGVRATYGTEKGAVMALK-----ELTVSWCHTMK 321 Query: 284 GAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 V + E V G S QE+++S+ + I + L Sbjct: 322 AQVGAI--------EAVIAAVKSGEIS-QEMIESSVNRVIRLKTKYLST 361 >UniRef50_Q0VQT2 Beta-hexosaminidase n=3 Tax=Gammaproteobacteria RepID=Q0VQT2_ALCBS Length = 345 Score = 325 bits (833), Expect = 2e-87, Method: Composition-based stats. Identities = 165/292 (56%), Positives = 202/292 (69%), Gaps = 6/292 (2%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +MLD+EG E+ A E+++LAHP GGLILF RNY D QL ELV+Q+RA R +++AVDQ Sbjct: 17 LMLDLEGLEVSASEKDMLAHPGAGGLILFARNYVDRQQLAELVQQVRAV-RPEMLIAVDQ 75 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEA--GWLMASEMIAMDIDISFAPVL 121 EGGRVQRFR+GF RLP + + + G LMASE+ +DIDISFAPVL Sbjct: 76 EGGRVQRFRDGFVRLPPMAALGQRYDQAPAQAVGEATLLGELMASELTELDIDISFAPVL 135 Query: 122 DVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 D+ + S+ IG+RS+H D +A+A FIDGM AGM TGKHFPGHG V ADSH E P Sbjct: 136 DLDYGNSSVIGDRSFHGDADAMIALAGAFIDGMSAAGMAATGKHFPGHGHVGADSHLELP 195 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 DPRP +E+ A DM F +L KLD IMPAHVIYS V+ + A S YWL+T LR+ LG Sbjct: 196 IDPRPLSELEAADMRPFMAL--APKLDGIMPAHVIYSAVEQQTAGFSRYWLQTQLRERLG 253 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 F GVIFSDDLSM GA +G Y +R QA+L+AGCDM+L CN +GA VLD L Sbjct: 254 FRGVIFSDDLSMAGAHGVGGYPQRAQAALEAGCDMVLACNCCEGAEQVLDFL 305 >UniRef50_B3E4C8 Glycoside hydrolase family 3 domain protein n=3 Tax=Desulfuromonadales RepID=B3E4C8_GEOLS Length = 395 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 100/345 (28%), Positives = 167/345 (48%), Gaps = 26/345 (7%) Query: 4 VMLDVEGYELDAE--EREILAHPLVGGLILF---------TRNYHDPAQLRELVRQIRAA 52 ++L G L+ + + + GG++LF RN PAQL++L ++ A Sbjct: 57 LLLGFPGETLEPDSPIDQAIRRYGAGGVVLFDNNIDLGVTGRNITAPAQLKQLTTALQQA 116 Query: 53 SRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 S L++AVDQEGG V R ++ + PA S A G + L + + +A+ + Sbjct: 117 SELPLLIAVDQEGGVVARLKDRYG-FPATVS-AKYLGQQNRLDLTRSSSENLAATLDEYG 174 Query: 113 IDISFAPVLDV--GHISAAI--GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGH 168 +++ APV+D+ + I ERS+ ADP A A+ FI H + T KHFPGH Sbjct: 175 FNLNLAPVVDLATNPDNPVIAFKERSFSADPALVAAHAAEFIKSHHRHHILTCLKHFPGH 234 Query: 169 GAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGS 227 G+ DSH R ++ + LI + D +M AH + +DP PA+ S Sbjct: 235 GSSRDDSHLGLVDVTRY---WHENELQPYRDLISQGLCDMVMTAHTFNTAIDPDHPATLS 291 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + +LR++LGFDGV+ SDDL M SY + +++AG D+++V N++ + Sbjct: 292 RATIDGILRKQLGFDGVVVSDDLYMGAIIQHYSYETAVEKAINAGVDLLVVANDKLYSPD 351 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 ++ P E + L +G R+ + ++R + I+ + L Sbjct: 352 IM----PRTIELLLNLVQQGRIPRERIEQASR-RIIAMKQRLLKP 391 >UniRef50_A6GP56 Beta-hexosaminidase n=1 Tax=Limnobacter sp. MED105 RepID=A6GP56_9BURK Length = 340 Score = 324 bits (831), Expect = 3e-87, Method: Composition-based stats. Identities = 131/302 (43%), Positives = 180/302 (59%), Gaps = 4/302 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP ++ + G L E+ + +P VGGLILFTRN+ AQL +LV IR+ V Sbjct: 2 LGPFIIGLTGLTLSKEDIARIQNPAVGGLILFTRNFESKAQLIDLVESIRSNGGLSKPVF 61 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFA--ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR GFT +P+ + A + ++ LA+EAG++MA+E+ A ID+SFA Sbjct: 62 VDHEGGRVQRFRNGFTAIPSMRQIGQRAETDFDDAVLLAREAGFVMAAELRACGIDMSFA 121 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ S IG+RS+ D + ++S ++GM AGM+ GKHFPGHG V+ DSH Sbjct: 122 PVLDLDWGNSEIIGDRSFGKDARMVSRLSSALMNGMALAGMQACGKHFPGHGWVSLDSHL 181 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D R EI D + L + +IMPAHV+YS+VDP PA S +W++ VLR Sbjct: 182 ALPHDNRSLNEILKVDALPYQ-LNSTLNMASIMPAHVVYSEVDPMPACFSSFWIQEVLRV 240 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 + FDG + SDDL M GA +G R A+L AGCD +L CNN ++++ P Sbjct: 241 KFSFDGAVISDDLDMVGAHGVGDIRARASAALQAGCDAVLACNNFDDIDTLVNKPVPEVH 300 Query: 298 ER 299 E Sbjct: 301 EN 302 >UniRef50_C6C0L9 Glycoside hydrolase family 3 domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6C0L9_DESAD Length = 373 Score = 324 bits (831), Expect = 3e-87, Method: Composition-based stats. Identities = 98/355 (27%), Positives = 164/355 (46%), Gaps = 42/355 (11%) Query: 4 VMLDVEGYELDAEEREI--LAHPLVGGLILFT---------RNYHDPAQLRELVRQIRAA 52 VM+ G EL A + + VGG+ILF+ RN +QL +L +++ Sbjct: 38 VMVGFRGLELSANNPVVGDIRDARVGGVILFSKDCALNSTVRNIASSSQLIKLTSDLQSY 97 Query: 53 SRNRLVVAVDQEGGRVQRFR--EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 ++ L +++DQEGG ++R GF P+A + A AG + + Sbjct: 98 AQVPLFISIDQEGGIIKRLTGEMGFPETPSAAELGNSGDTQA----AFRAGKTIGKSLKE 153 Query: 111 MDIDISFAPVLDVGHI--SAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 I+++FAPV+DV + I +RS+ DP+ A +IDG+H G+ + KHFP Sbjct: 154 AGINMNFAPVVDVNRNAANPVIAALQRSFSDDPRIVARFADSYIDGLHSEGIISCLKHFP 213 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PAS 225 GHG+ DSH + ++ F LI +NK D +M AH+ + D + PA+ Sbjct: 214 GHGSSRGDSHLGFTDVTDS---WDSSELYPFQKLISDNKADMVMTAHIFNARWDRKYPAT 270 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S + +LR++LGF+G+I +DD+ M+ + + E ++ AG D++L NN Sbjct: 271 LSRNVIHGLLRRKLGFEGIIVTDDMQMQAVSGEYGFKEGVYRAVKAGADILLFGNNLIYE 330 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 L + +L H+G + + + S ++R EK G Sbjct: 331 P----GLGTKAVSTLKQLVHEGKITERRIRQS-------------YDRIMREKQG 368 >UniRef50_B7S2D9 Glycosyl hydrolase family 3 N terminal domain protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7S2D9_9GAMM Length = 567 Score = 324 bits (830), Expect = 4e-87, Method: Composition-based stats. Identities = 104/341 (30%), Positives = 164/341 (48%), Gaps = 21/341 (6%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF 71 +L + + L G+ILF N AQ ++L +IR ++A+DQEGGRV R Sbjct: 35 QLPPQLKHGLVAISPCGVILFRENLASIAQCQQLTAEIRECLPANTLIAIDQEGGRVTRL 94 Query: 72 -REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SA 128 R+ T + AA E KLA+E G A+E+ A+ I+I+F P LDV + Sbjct: 95 PRQVSTSFSGNMALAACPAGEN-EKLAREVGAAQAAELKALGININFVPSLDVNSDPANP 153 Query: 129 AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAE 188 IG RS+ DP ++ S + G+ E G+ KHFPGHG + DSH + PC R + E Sbjct: 154 VIGVRSFGDDPAAVASLGSELLSGLQEGGVAGALKHFPGHGDTSQDSHIDLPCVGRSRDE 213 Query: 189 IRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---------RPASGSPYWLKTVLRQEL 239 A D++ FS +I + +M AH+ Y +D PA+ S + +LR+++ Sbjct: 214 AFAVDLAPFSRVIAQANPAMVMTAHIQYPALDDSCIAGTDVAVPATLSRAMMTGLLREQM 273 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 FDGVI +D L M+ + + E AG D+ L+ + + S + L + + Sbjct: 274 DFDGVIITDALDMKAISARMTPTEAVLRCFAAGVDIALMPLLIRSSASF-NQLQQLVSTA 332 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 V H G D + +A TR+ L +++ ++ G Sbjct: 333 V-EAIHSGEL------DESEVRASVTRILALQQKFAPQQPG 366 >UniRef50_Q84IZ5 Beta-N-acetylglucosaminidase n=1 Tax=Clostridium paraputrificum RepID=Q84IZ5_9CLOT Length = 413 Score = 322 bits (827), Expect = 8e-87, Method: Composition-based stats. Identities = 87/350 (24%), Positives = 150/350 (42%), Gaps = 26/350 (7%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFT-RNY--HDPAQLRELVRQIRAASR--- 54 G +M V G +D + + GG+IL+ RN+ + V I+ A+R Sbjct: 51 GQMMFYGVNGTNVDDKVVNLFEDQHAGGIILYGHRNFWGSSLDNNVKYVNSIKKANRQNS 110 Query: 55 -NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L + D+EGG + + + R P+ + LA G A ++ + I Sbjct: 111 DIPLFIGFDEEGGSMSQLPQELMRTPSKGELGNTND----SSLATGIGAGTAKKLKLLGI 166 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 + F VLD+ + IG RSY + +K + + G+ T KHFPGHG Sbjct: 167 NTDFGTVLDINTNKNNPIIGVRSYGSTKEKVTEFGINELKAIQNEGVIPTVKHFPGHGDT 226 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYW 230 DSH P +++ ++ F + I N +D +M AH++ +D PA+ S Sbjct: 227 EVDSHLGLPSLNHDLNRLKSTELVPFQTAIN-NGVDMVMTAHIMLPQIDKEYPATMSKKI 285 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 L +LR E+G+ GVI +DDL M+ + E S++AG D++LVC+ + V + Sbjct: 286 LTDLLRDEMGYKGVIITDDLEMQAISKNWDLGEAAIKSVEAGADILLVCHTIENQQKVYN 345 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLN-QLHERWQEEKA 339 V + + G + +S R + + + +L ++ Sbjct: 346 A--------VVQGVNDGKIDENRIDESVR-RILRLKYQYKLSDKANNPTQ 386 >UniRef50_P48823 Beta-hexosaminidase A n=3 Tax=Alteromonadales RepID=HEXA_PSEO7 Length = 598 Score = 322 bits (827), Expect = 9e-87, Method: Composition-based stats. Identities = 93/343 (27%), Positives = 154/343 (44%), Gaps = 30/343 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA-----SRNRLVVAVDQEG 65 L E E+++ +GG ILF N + AQ+ L +++A S+ L +A+DQEG Sbjct: 49 TTLPPELSELISRYDIGGAILFAENVQNTAQIISLTNALQSAAQQSKSQLPLFIAIDQEG 108 Query: 66 GRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GRV R RE T S A ++G A + + E+ ++ I+++FAP +DV Sbjct: 109 GRVARINREQATSFTGNMSIGATY-PKQGDIYATKVASAIGKELNSLGINVNFAPTVDVN 167 Query: 125 --HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + I RS+ +P + + AG+ + KHFPGHG DSH P Sbjct: 168 SNPNNPVINVRSFSENPTVVTKLGLAQVKAFEAAGVLSALKHFPGHGDTHVDSHTGLPRV 227 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYW 230 + +I +D+ F+ +I+ + IM AH+ Y +D RPA+ S Sbjct: 228 DHDRDKINQQDLLPFAEIIKASPPGMIMTAHIQYPALDNSKVVNSQGESMIRPATMSYQI 287 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC---NNRKGAVS 287 + +LR ELG+ GV +D L M G + + + + +AG D+ L+ NR Sbjct: 288 MTQLLRHELGYQGVTVTDALDMAGISDFFNPVDATIETFNAGVDIALMPIAIRNRADIKR 347 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQ 329 ++ + ++++L S AR + T+L Q Sbjct: 348 FEQYMAQ-----LADALETNKLNQEQLSSSMARIAKLKTKLPQ 385 >UniRef50_C0EVT4 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EVT4_9FIRM Length = 409 Score = 322 bits (825), Expect = 1e-86, Method: Composition-based stats. Identities = 85/327 (25%), Positives = 149/327 (45%), Gaps = 25/327 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQR 70 E+ + ++ L GG+I F+ N D Q++ ++ I+ ++ L ++VD+EGG V R Sbjct: 84 TEVTKKMQQKLEEYKPGGVIFFSYNLKDREQVKSMIADIQKSNDIPLFISVDEEGGSVAR 143 Query: 71 FRE----GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI 126 T+ PA ++ A E G + E+ + ++ FAPV DV Sbjct: 144 IANSKNMQTTKFPAMSEIGKSGDSKK----ACEVGETIGKEIRELGFNLDFAPVADVNTN 199 Query: 127 --SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + IG RS+ +D + + S+ + G+ G+ +T KHFPG G D+HK Sbjct: 200 AENTEIGNRSFSSDAKTVAGMVSQEVKGLQSQGVSSTLKHFPGQGQCGEDTHKGYVDLNA 259 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELGFD 242 +R + F S I D +M +HV S V + PAS + + +LR+EL F+ Sbjct: 260 TIDRLREVEFLPFESGI-AAGADMVMMSHVAVSQVTGKETPASLTKLMVTDILREELQFN 318 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 VI +D ++M+ + ++ AG DMIL+ +N + A E V Sbjct: 319 NVIITDAMNMKVITKFYDADQAAVMAVQAGNDMILMPDNFEQAF-----------EGVLE 367 Query: 303 LYHKGSFSRQELMDSARWKAISTRLNQ 329 G+ S ++ + A + +S ++ + Sbjct: 368 AVKDGTISESKINE-AVSRILSVKIRR 393 >UniRef50_B0P2K0 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0P2K0_9CLOT Length = 421 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 86/327 (26%), Positives = 155/327 (47%), Gaps = 25/327 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQR 70 + ++ ++ L GG+I+F +N + P Q +++ +++ AS L +AVD+EGG+V R Sbjct: 98 TSVTSQMKKTLKKYPAGGVIMFAKNINTPDQTKKMTDELQDASYIPLFMAVDEEGGQVSR 157 Query: 71 F----REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--G 124 + T P+AQ ++ + G E+ + +++ APV DV Sbjct: 158 VASNPKMKMTVYPSAQEVGRTYNNKK----IAQMGKTQGKELKELGFNMNLAPVADVLTN 213 Query: 125 HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + IG+RS+ AD +K I + + M + + T KHFPG G D+H+ + + Sbjct: 214 KNNTEIGDRSFGADSKKVADIITTLVKNMQKQQISATLKHFPGSGQTGGDTHRGSTETDQ 273 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFD 242 +R D F + I+ K DA+M +H++ S+V + P+S S + +LR EL + Sbjct: 274 TINALRDTDFKPFKAGIK-AKADAVMVSHLMLSNVTDEKEPSSLSSRVVSDILRDELEYK 332 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 GVI +D ++M+ S E ++ AG D+I++ +N K A + + Sbjct: 333 GVIMTDAMNMKAITDNYSSGEAAVKAIQAGVDLIVMPDNYKEAYKA-----------IKK 381 Query: 303 LYHKGSFSRQELMDSARWKAISTRLNQ 329 G + S R + I T+L + Sbjct: 382 ALKSGKIKESRIDKSVR-RIIYTKLKR 407 >UniRef50_D2PU34 Glycoside hydrolase family 3 domain protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PU34_9ACTO Length = 606 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. Identities = 90/315 (28%), Positives = 134/315 (42%), Gaps = 28/315 (8%) Query: 15 AEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAAS-----RNRLVVAVDQEGGR 67 A E++A GG I F N +P QL Q++ + R + +A DQE G Sbjct: 89 ATPAEVVAKYKPGGWIYFNARDNVQNPTQLATYSNQLQTVATSTGIRVPMTIATDQEQGV 148 Query: 68 VQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI- 126 V R T+ + A + A+ A + E+ AM I +APV DV Sbjct: 149 VVRIGPPATQFGGNMAHGAARSTAD----ARTAAGITGRELKAMGIRQDYAPVADVNVNA 204 Query: 127 -SAAIGERSYHADPQKALAIASRFIDGMH-EAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + IG RS+ +DPQ + + G +AG+ T KHFPGHG DSH P Sbjct: 205 LNPVIGVRSFSSDPQLVSDLTVAQVQGYQKDAGIIATAKHFPGHGDTVDDSHTSLPTINH 264 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFD 242 + + D F + I+ +D+IM AH++ +DP PA+ S + VLR+ELGF Sbjct: 265 TREQWNTIDAPPFKAAIKAG-IDSIMTAHIVIPSLDPSGDPATLSKPIMTGVLRKELGFK 323 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 G+I +D L M AE +++AG D +L+ V Sbjct: 324 GLIITDALEMAAVRAKYGDAEVAVRAIEAGVDQLLLPPAPDVQFRA-----------VVD 372 Query: 303 LYHKGSFSRQELMDS 317 G S + + +S Sbjct: 373 AVKSGRISERRIDES 387 >UniRef50_Q1N543 Glycoside hydrolase, family 3-like protein n=1 Tax=Bermanella marisrubri RepID=Q1N543_9GAMM Length = 334 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. Identities = 129/328 (39%), Positives = 199/328 (60%), Gaps = 7/328 (2%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 ++ D+E + L +ER L+H + G+ILFTR++ QL+ L+ +I+A + L++ VD Sbjct: 4 CLITDIETHSLSEDERTWLSHSYLAGIILFTRHFESKTQLKTLISEIKAINPK-LLITVD 62 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 EGGRVQRFR+GFTR+PA L +LA ++ ++ E++ + ID+++APV Sbjct: 63 HEGGRVQRFRDGFTRVPAMGRLGKLYQEDSARANQLAFDSAVVLCCELLDIGIDLTYAPV 122 Query: 121 LDVG-HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 LD+ + +G+R + + + +AS FIDG+ G + KHFPGHG VT DSH Sbjct: 123 LDIDYQRNQVVGDRGFAHTKEAIVDLASHFIDGVKSIGFPSVAKHFPGHGWVTQDSHVSC 182 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQEL 239 P D R EI++ D+ VF L+ K+D IMPAHV+Y VD +PA S +W+ +LR ++ Sbjct: 183 PVDERSFEEIKSNDLWVFDHLMP--KIDWIMPAHVVYERVDEQPAGFSKHWVTNILRNDM 240 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 GF G + SDDLSMEGAA G YA R +A+LDAGC+++L CN+ +L+ LS + ER Sbjct: 241 GFQGPVVSDDLSMEGAAQTGGYASRAKAALDAGCNVMLACNSSVAPKEILEFLSHSEYER 300 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTRL 327 + L S ++ ++D ++ R+ Sbjct: 301 LD-LSEYQSKPQKMILDMPVYQQALARI 327 >UniRef50_C6N603 Beta-hexosaminidase n=2 Tax=Legionella RepID=C6N603_9GAMM Length = 358 Score = 320 bits (822), Expect = 3e-86, Method: Composition-based stats. Identities = 130/327 (39%), Positives = 190/327 (58%), Gaps = 10/327 (3%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 M+D+EG EL A E IL P VG +ILFTRN+ +P QL +L+ I + L +AV Sbjct: 3 SQFMVDIEGTELSALEHTILKQPNVGAVILFTRNFTNPEQLAKLICDIDEINPE-LFIAV 61 Query: 62 DQEGGRVQRF-REGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFA 118 D EGG VQRF R+GF+ LPAA+++ E G QE G +MA E++A +D+S A Sbjct: 62 DHEGGNVQRFQRQGFSALPAARAYGRAYDKNHEIGIAYTQEYGEIMAKELLACGVDLSIA 121 Query: 119 PVLDVGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLDV S I R++H D + +A FI GM+ AGM GKHFPGHG+V +DSH Sbjct: 122 PVLDVHDKSQVIAGLNRAFHKDADAIIDLAGAFIKGMNTAGMPAVGKHFPGHGSVVSDSH 181 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 P E+ A D+ F LI + L A+MPAHV Y+ +D PA S WL+ +L Sbjct: 182 IAMPVSQASMTELAANDLKPFVELINKGLLTALMPAHVTYAAIDKENPAGFSKIWLQDIL 241 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R +LGF G++ SD LSM+G A +G+ R + +L AGCDM+++C+ + + L+ + + Sbjct: 242 RTQLGFKGLVLSDCLSMKG-ADIGNLTTRAEKALAAGCDMLIICHQPRELL--LELIQTL 298 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKA 322 E+ + + + +++ ++ K Sbjct: 299 NIEQTAESKERIAAFKNQMLHFSKNKV 325 >UniRef50_Q1K0E5 Glycoside hydrolase, family 3-like n=1 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1K0E5_DESAC Length = 411 Score = 319 bits (818), Expect = 1e-85, Method: Composition-based stats. Identities = 98/347 (28%), Positives = 165/347 (47%), Gaps = 26/347 (7%) Query: 4 VMLDVEGYELDAE--EREILAHPLVGGLILF---------TRNYHDPAQLRELVRQIRAA 52 +++ G L+ + +GG+ILF RN P+QL++L ++A Sbjct: 52 LLVGFRGQTLEQAPTLVNDIQKRHLGGVILFDYDVQLRQSGRNIASPSQLKQLTSDLQAL 111 Query: 53 SRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 S L++A+DQEGGR+ R + P S L+ + +L E G +A+ + Sbjct: 112 SNLPLLIAIDQEGGRIARLKPAQG-FPVTLSHRELA-EQADVRLTFEHGKKLATTLATFG 169 Query: 113 IDISFAPVLDV--GHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGH 168 I+++ APV+D+ + I +R + A P + A +I G H+ + T KHFPGH Sbjct: 170 INLNLAPVVDLCSNPDNPVIARLDRCFSATPDQVTDQAGAYIAGHHQTAVLTCLKHFPGH 229 Query: 169 GAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGS 227 G+ T DSH+ + E +++ + LI+ ++D IM AHV S +D PA+ S Sbjct: 230 GSSTTDSHQGFTDITQTWNE---EELVPYRELIQHGRVDTIMTAHVFNSHLDANDPATLS 286 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + +LR LG++GV+ SDDL M+ + + +L+AG DM++ NN Sbjct: 287 RPIITGLLRGVLGYEGVVISDDLQMKAISAHYGLETAIEKALNAGVDMLVFGNNLSYNEH 346 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHER 333 ++ I + RL +G S + +S R + RL +R Sbjct: 347 SVEQAVTI----IQRLIKQGKVSEARIDESWRRITMLKRRLTVKEQR 389 >UniRef50_A7BX55 Glycosyl hydrolase, family 3 n=4 Tax=Proteobacteria RepID=A7BX55_9GAMM Length = 436 Score = 319 bits (817), Expect = 1e-85, Method: Composition-based stats. Identities = 102/355 (28%), Positives = 168/355 (47%), Gaps = 40/355 (11%) Query: 4 VMLDVEGYELDA----EEREILAHPLVGGLILF---------TRNYHDPAQLRELVRQIR 50 +++ G + + + + +GG+ILF RN P Q++ LV++++ Sbjct: 97 LLVGFRGMSIRERGVKQIIQNIQQFHLGGVILFDYDIVLKSSRRNIKSPRQVKSLVKELQ 156 Query: 51 AASRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEM 108 AAS+ L +A+DQEGG+V+R +E GF R +AQ + + L + MA + Sbjct: 157 AASQIPLFIAIDQEGGKVKRLKEKFGFPRTVSAQYLGQKNNV----ALTYQQASKMAKTL 212 Query: 109 IAMDIDISFAPVLD--VGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKH 164 + I+ +FAPV+D + + IG ERS+ A PQ A FI H G+ + KH Sbjct: 213 ATVGINFNFAPVVDLNINPNNPVIGKLERSFSAYPQTVTKHALEFIKAHHAYGVLCSLKH 272 Query: 165 FPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE---NKLDAIMPAHVIYSDVDP 221 FPGHG+ T DSH ++ + +I +D+IM AH+ DP Sbjct: 273 FPGHGSSTRDSHLGMVDVT---DTWNEAELEPYQKIINADKMGIVDSIMIAHIFNRRFDP 329 Query: 222 R-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 PA+ S + +LR+ L +DGVI SDD+ M+ + Q +L+AG D+I++ N Sbjct: 330 DYPATLSKRIVTGLLRENLEYDGVIVSDDMQMKAITSHYRFEVAIQKTLEAGIDIIVIGN 389 Query: 281 NRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 N K ++ I + +L G + + +S R+ QL ++Q Sbjct: 390 NLKYEPDIVTRTVGI----IKQLIQAGKITEARIEES------YQRIQQLKSQFQ 434 >UniRef50_A8L1X4 Glycoside hydrolase family 3 domain protein n=3 Tax=Actinomycetales RepID=A8L1X4_FRASN Length = 656 Score = 318 bits (816), Expect = 2e-85, Method: Composition-based stats. Identities = 92/331 (27%), Positives = 151/331 (45%), Gaps = 30/331 (9%) Query: 15 AEEREILAHPLVGGLILFT----------RNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 E +A +GG+I F N +P Q++ L + AA+ L++A DQE Sbjct: 155 DTAAEAVARRGLGGVIYFDAGGTGPGALPDNIVNPNQVKTLSADLSAAASIPLLIAADQE 214 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 G V R R+G T LP + A + A++A + +++ A+ I++ FAP DV Sbjct: 215 QGTVLRVRDGVTLLPGQMAQGATGRPTD----ARDAAQITGADLRALGINVDFAPDADVN 270 Query: 125 HI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + IGERS+ DP + ++G + G+ KHFPGHGA + DSH + P Sbjct: 271 SDPANPVIGERSFGDDPTAVGRFTAAAVEGYRQVGVAAAAKHFPGHGATSVDSHADLPTI 330 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGF 241 R +A + A D+ F + I + +M H+ +DP PA+ S + +LR ELGF Sbjct: 331 TRDRAALTALDLPPFRAAI-AAGVPMVMVGHLNVPALDPAAPATLSKPVVDGLLRHELGF 389 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 DGVI +D L+M + ++ AG DM+L+ + A+ V Sbjct: 390 DGVIVTDALNMAAITEHNTPGGAAVRAVQAGVDMLLMPPDLTQALDA-----------VV 438 Query: 302 RLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 G+ + + D++ + + + H Sbjct: 439 SAVRSGAIVPERI-DASVRRILRMKWRLAHT 468 >UniRef50_Q47M03 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase family 3 n=1 Tax=Thermobifida fusca YX RepID=Q47M03_THEFY Length = 552 Score = 318 bits (815), Expect = 2e-85, Method: Composition-based stats. Identities = 93/326 (28%), Positives = 147/326 (45%), Gaps = 26/326 (7%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNR 56 G +++ V E E++ GG I F N P Q+ + ++ + Sbjct: 59 GQLLVPVLSGTTAEENAEVIERYHPGGFIYFPENLETPEQVAAMSNGLQERATGTGAGIP 118 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L + VD+E G V R G R P A + A E LA+E A ++ A+ I++ Sbjct: 119 LFLGVDEEQGLVSRLPFG-ARFPDAMALGAARDPE----LARELAAATAEQLAAVGINLD 173 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 +APV DV + IG RS+ +DP+ +A+ + + G+ KHFPGHG D Sbjct: 174 YAPVADVNVNADNPVIGIRSFSSDPELVGELAAAEVAAFQDGGVVAVAKHFPGHGDTDVD 233 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLK 232 SH P + + E D+ F L+ + +D +M AHV+ + PA+ SP+ + Sbjct: 234 SHTGLPVIDKSRDEWERVDLPPFQRLV-DAGVDMVMTAHVLMPQLGDAEEPATLSPHLIT 292 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 VLR+ELG+DGV+ +D L+MEG E ++ AG D +L+ + A S Sbjct: 293 GVLREELGYDGVVTTDALNMEGVRQTHDDGEVAVRAVLAGADQLLMPPDVDLAYSA---- 348 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSA 318 V +G S + L +S Sbjct: 349 -------VLSAVEEGRISEERLDESV 367 >UniRef50_B1WW93 Beta-glucosidase n=12 Tax=Cyanobacteria RepID=B1WW93_CYAA5 Length = 539 Score = 318 bits (815), Expect = 2e-85, Method: Composition-based stats. Identities = 93/328 (28%), Positives = 160/328 (48%), Gaps = 13/328 (3%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE 73 +A+ L +GG+IL A+L +Q++ S+ L +A D E G QRF Sbjct: 43 NAQLSHWLETLNLGGVILLGG---SAAELLLRTQQLQQWSKTPLFIAADIEEGVGQRF-S 98 Query: 74 GFTRLPAAQSFA--ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAA 129 G T P + A S + K A + G + A E + + I+ +P++DV + + Sbjct: 99 GATWFPPPMALGEIAKSDLPLARKYAYQMGEITAKESVKIGINWILSPIVDVNNNPDNPV 158 Query: 130 IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEI 189 I RS+ +P+ +A FI+G TT KHFPGHG + DSH P + Sbjct: 159 INIRSFGDNPKVVSELAKAFIEGAKNHPALTTAKHFPGHGDTSNDSHLNLPVISHDNERL 218 Query: 190 RAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELGFDGVIFSD 248 ++ F + I + K+D++M AH++ + D PA+ S L L+QELGF+G+I +D Sbjct: 219 EGLELVPFQAAI-DAKVDSVMTAHLLINAWDKDNPATLSKTILTNKLKQELGFNGLIVTD 277 Query: 249 DLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP---IKAERVTRLYH 305 L M G A E +++AG D++L+ ++ + A++ + N + ER+ Sbjct: 278 ALIMGGVAKFADSEEIAIKAVEAGADILLMPDDPEMAINAVYNAVETGRLTTERIDESLQ 337 Query: 306 KGSFSRQELMDSARWKAISTRLNQLHER 333 K ++Q+L + + + +L+Q +R Sbjct: 338 KIWQAKQKLFNHQKDNSSVYQLSQKEDR 365 >UniRef50_Q47VB6 Beta-hexosaminidase n=1 Tax=Colwellia psychrerythraea 34H RepID=Q47VB6_COLP3 Length = 637 Score = 317 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 94/336 (27%), Positives = 158/336 (47%), Gaps = 27/336 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVVAVDQEG 65 EL E ++++ VGG++LF N + AQ+ +L I+AA+ N L++++DQEG Sbjct: 68 TELPKELAQMISTSNVGGIVLFAENVQEIAQVVKLTHDIQAAALKSEQGNPLIISIDQEG 127 Query: 66 GRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 GRV RF + T + A S E G A + ++A E+ A+ I+ ++APV+DV Sbjct: 128 GRVARFAK-MTGFAGNMAIGA-SYPEHGTHFATQVNAVIAKELKALGINNNYAPVVDVNT 185 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I RS+ + + + + + G+ T KHFPGHG DSH P Sbjct: 186 NVDNPVINTRSFGESAVQVAELGASAVTAIQAQGVMATLKHFPGHGDTHVDSHLGLPRVD 245 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYWL 231 +A I D++ F+ I K IM AH+ + +D RPA+ S L Sbjct: 246 HDRATIDKVDLAPFAWAIDHAKPAMIMTAHIQFPALDNSLFISKSGDKLIRPATMSRKIL 305 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV------CNNRKGA 285 +LR+E+ FDG+I +D L M G + E + AG D+ ++ N+ Sbjct: 306 TNLLREEMEFDGIIATDALDMAGVTHFFTDIEAVVETFVAGADLAVMPFKVRTPNDIDNF 365 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWK 321 + + ++ A R+ ++ + Q ++ R+K Sbjct: 366 YNFIKAVAKALAIRIEHNEYQLTEFEQSIVRLNRYK 401 >UniRef50_D2PT32 Glycoside hydrolase family 3 domain protein n=26 Tax=Actinomycetales RepID=D2PT32_9ACTO Length = 599 Score = 317 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 89/326 (27%), Positives = 150/326 (46%), Gaps = 31/326 (9%) Query: 15 AEEREILAHPLVGGLILFTR--NYHDPAQLRELVRQIRAAS-------RNRLVVAVDQEG 65 A E++A +GG+I F + +P Q+ L ++ AS L+++ DQE Sbjct: 80 ATPAEVVAKYKLGGVIYFAWTDSVANPPQIAALSNGLQQASLAVGDKVSVPLLISTDQEH 139 Query: 66 GRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 G V R T+ P A + A + + A + +E+ A+ ++ ++APV DV Sbjct: 140 GVVFRVGPPATQFPGAMALGAGRSTADAREAA----AIAGAELRAVGVNQAYAPVADVNV 195 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAG-MKTTGKHFPGHGAVTADSHKETPCD 182 + IG RS+ +DP + + + G+ G + + KHFPGHG DSH P Sbjct: 196 NALNPVIGVRSFSSDPTLVADLTAAQVTGLEGGGGIASCAKHFPGHGDTVDDSHTSLPTI 255 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELG 240 + + D F + I E +D+IM AH++ +DP PA+ S L +LR++LG Sbjct: 256 NHTREQWNTIDAPPFEAAI-EAGIDSIMTAHIVVPSLDPSGEPATLSKPILTGILREQLG 314 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 +DGV+ +D L+M G AE +L+AGCD +L+ A + V Sbjct: 315 YDGVVITDSLAMAGVRQKYGDAEVAVRALEAGCDQLLMSPAMDVAYNA-----------V 363 Query: 301 TRLYHKGSFSRQELMDSARWKAISTR 326 G + + L D + ++ + + Sbjct: 364 LAAVRSGRITEKRL-DQSVYRVLRLK 388 >UniRef50_D1BPZ2 Glycoside hydrolase family 3 domain protein n=3 Tax=Veillonella RepID=D1BPZ2_VEIPT Length = 381 Score = 317 bits (812), Expect = 5e-85, Method: Composition-based stats. Identities = 93/334 (27%), Positives = 169/334 (50%), Gaps = 25/334 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRLVV 59 +M+ + G L+ + + +L VGG+ILF RN Q++ L+ I A + L + Sbjct: 66 LMIGIHGTTLNDDAKFMLNEYRVGGIILFDRNMESKDQVKTLITDINKAGKSAGLTPLFL 125 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +DQEGG V R + ++P A+ + +E+ LA+++G +E+ + +I+FAP Sbjct: 126 GIDQEGGAVARMDDKLIKVPPAEEVGKMP-VEQAVSLAKQSG----AELKDLGFNINFAP 180 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 V D+G RSY P + + A EAG+ + KHFPG G D H +T Sbjct: 181 VADLGLT----YGRSYSTSPDEVVRYAGAVGKAYDEAGLWYSYKHFPGIGKTDVDLHADT 236 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDA--IMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 P + + ++D VF LI+++K + IM +H +Y +DP PAS S + LR Sbjct: 237 SIVPVSKETLLSEDTKVFIDLIKQSKPNTYTIMVSHAMYPQIDPDHPASLSKAIITDWLR 296 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 +++G++GV+ +DD+ M A ++ + S+ AG D++LVC+ + + L Sbjct: 297 KDIGYNGVVVTDDMDMGALAKHYTFGDMAVQSILAGSDILLVCHEYEHMQEAYNGL---- 352 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 + G S++ L +S + + + +++++ Sbjct: 353 ----MKAVKDGRISKERLDESVK-RILLMKMSKI 381 >UniRef50_C9LY46 Beta-hexosaminidase n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LY46_9FIRM Length = 408 Score = 317 bits (812), Expect = 5e-85, Method: Composition-based stats. Identities = 101/341 (29%), Positives = 166/341 (48%), Gaps = 30/341 (8%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN---RLVVA 60 +M+ ++G ELDA+ R +L+ +GG+ILF RN Q+R L ++ + + L +A Sbjct: 85 LMIGIQGAELDADSRFMLSEYHIGGVILFDRNMKSQEQVRALNDSLQKNASDAGLPLFLA 144 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +D+EGG V R +E F PAA + A A + AM +++FAPV Sbjct: 145 IDEEGGAVARMKEAFLPPPAAAEIGRTGDPQA----AYRYAADTAHGLKAMGFNLNFAPV 200 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 D+G A RSY D A + ++G +AG+ T KHFPG G +D+H++T Sbjct: 201 ADLG----ATYGRSYADDAATATKFVAAALEGHADAGLLATLKHFPGLGRGESDTHEDTV 256 Query: 181 CDPRPQAEIRAKDMSVFSSLI---------RENKLDAIMPAHVIYSDVDPR-PASGSPYW 230 +A + A D+ F +I + +M H +Y +D + PAS SP Sbjct: 257 TVRADRATLEASDLVPFREMITKRNAAKNAKGASGWFVMATHTMYPALDAKNPASLSPAI 316 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 L+ +LR+EL +DGVI +DDL M + + G ++ AG D++LVC++ +V + Sbjct: 317 LQGLLREELSYDGVIVTDDLEMGAISRHYGFDRAGVEAILAGADLVLVCHDYAHETAVYN 376 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLH 331 L + G S+ L D++ + + ++ L Sbjct: 377 GL--------LKAVKSGEISKDRL-DASVRRIVKAKIEYLE 408 >UniRef50_Q2JJA4 Glycosyl hydrolase domain protein n=1 Tax=Synechococcus sp. JA-2-3B'a(2-13) RepID=Q2JJA4_SYNJB Length = 511 Score = 316 bits (811), Expect = 7e-85, Method: Composition-based stats. Identities = 94/335 (28%), Positives = 161/335 (48%), Gaps = 20/335 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 ++ EG E + E++ VGGLI + N Q+ +L+ +A + L +A+D Sbjct: 17 LVYFEGAEFSSALEEMIRDYHVGGLIFYSIAGNIQTLPQVAQLIAAAQAQALVPLWIAID 76 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 QEGG V R EG T P+A + AA + + A+ + +E+ ++ +++FAPV+D Sbjct: 77 QEGGPVVRLTEGVTVFPSAMAVAAAGSLAK----AEAMARAVGTELKSLGFNLNFAPVVD 132 Query: 123 VGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG R++ +DP++ G +AG+ T KHFPGHG DSH P Sbjct: 133 VNSNPHNPIIGIRAFGSDPKQVTDYGLALWRGYRQAGILCTPKHFPGHGDTDIDSHLGLP 192 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQEL 239 RP ++ ++ F +LI + +A+M AHV+ + RPA+ +P + LRQ L Sbjct: 193 RVDRPLEALQTTELRPFQALI-QAGAEAVMTAHVVMPALGSVRPATLAPEVVSGWLRQRL 251 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 GF G++ +D L+M A E + + AG D+++ + + ++ Sbjct: 252 GFQGLVLTDSLTMGAIARTYGIPEAAELAFRAGADVLVFGADPGFSPAIQKEAYA----H 307 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 + R + G S ++L S R+ L R+ Sbjct: 308 LLREFRAGRLSVEQLDRSV------ERILTLKRRY 336 >UniRef50_A6NU14 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NU14_9BACE Length = 435 Score = 316 bits (810), Expect = 7e-85, Method: Composition-based stats. Identities = 97/334 (29%), Positives = 159/334 (47%), Gaps = 22/334 (6%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR--NRLV 58 G + + G E ++ VGGLI F RN +L LV +RA + + Sbjct: 88 GQLFIAGFYGTEDGDYVDSLIRDYKVGGLIFFGRNVGTAEELVSLVNDLRAKNGDYIPMF 147 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +VDQEGG V+R + + L A ++ E G L G ++A+E +A ++ FA Sbjct: 148 FSVDQEGGTVERLPDEVSPLEDAYTYGQSGSSEVGYAL----GQVLANECVAFGFNLDFA 203 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P LD+ + IG R++ + A+ G+ + G+ KHFPGHG DSH Sbjct: 204 PSLDIWSNPENTVIGTRAFGTTAEAVEAVGPWAAYGLMDGGVIPVVKHFPGHGDTAVDSH 263 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLI--REN-KLDAIMPAHVIYSDVD-PRPASGSPYWLK 232 P + E+ ++ F S I RE + A+M AH++ + +D PAS S + Sbjct: 264 VGLPTVSKTVDELLTSELIPFQSAIEGREGEGVPAVMVAHILMTAIDQEHPASLSQAVVT 323 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR++LGFDGV+F+DDL+M E +L+AGCD++LVC+N + Sbjct: 324 GLLREQLGFDGVVFTDDLTMGAITENYGLDEAAVLALEAGCDVLLVCHN--------EGD 375 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + V G + + + D + ++ +S + Sbjct: 376 LALARQAVLDAVASGRLTEERI-DRSVYRILSLK 408 >UniRef50_UPI0000E87F6B beta-hexosaminidase n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F6B Length = 337 Score = 315 bits (809), Expect = 1e-84, Method: Composition-based stats. Identities = 138/339 (40%), Positives = 207/339 (61%), Gaps = 6/339 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +M+DV LD+ + + ++HPLVGG+ILF+RNY + QL+ LV+ IR ++ +++A Sbjct: 1 MSCLMIDVASLSLDSNDIKRISHPLVGGVILFSRNYENKKQLKLLVKSIRKI-KHNILIA 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR+ FT+LPA +L EE ++A+ GWL+A E+ DID SF Sbjct: 60 VDHEGGRVQRFRDDFTQLPAMALLGSLFDKDPEEATRIAKLCGWLIAKELGGCDIDFSFT 119 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV + S+ IG R++H + Q + +A+ +G+ +AGM+ GKHFPGHG + AD+H Sbjct: 120 PVLDVNYESSSVIGNRAFHKEIQPTIHLANSLTEGLGQAGMQGVGKHFPGHGFIKADTHL 179 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E D R I+ DM VF +LI EN L IMP+HVIYS D +P+ S +WL+ LR+ Sbjct: 180 ELGEDERAFEVIKESDMLVFKALI-ENGLKGIMPSHVIYSLCDDKPSGLSHFWLQDQLRK 238 Query: 238 ELGFDGVIFSDDLSMEGAAIM-GSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 LGF+G IFSDD+SM+ A + +R +L AGCDM+LVCN + VL +L Sbjct: 239 NLGFNGAIFSDDMSMQAAVFTEKNVTKRVLKALQAGCDMVLVCNAGEDVDIVLTHLDWPV 298 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 +++ ++++ + + +T +L ++ Sbjct: 299 SKKSQCRIKAMGLNKKKNQQTQYMRLTATEAGKLILKFN 337 >UniRef50_A2EGF0 Glycosyl hydrolase family 3 N terminal domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2EGF0_TRIVA Length = 515 Score = 315 bits (808), Expect = 1e-84, Method: Composition-based stats. Identities = 101/333 (30%), Positives = 165/333 (49%), Gaps = 19/333 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVA 60 M ++ G +LDAE ++ +G LILFT N + ++++L + S ++ Sbjct: 21 MFNLPGKQLDAEIFNLIKKYHIGNLILFTINLGNGKEIKKLTSDFQEYSLKEFGFPAFIS 80 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D EGG V+R+ E R P A + + + + L++ + ++++ I+++ APV Sbjct: 81 SDMEGGNVERYSEDIPRFPGAAAIVSTNSIANCQDLSESVAKV----LLSLGINMNLAPV 136 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 D+ + IG RS+ DPQK I+G +AG T KHFPGHG+ DSH+E Sbjct: 137 CDINDNSRNPVIGARSFGFDPQKISNYIKAHINGHKKAGCLTCAKHFPGHGSTFQDSHRE 196 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P E KD+ F I E +D+IM AH++ + PAS SP + + R+E Sbjct: 197 LPHIIHKYDEFMKKDIPPFKDAI-EAGVDSIMTAHLLSPLDEENPASLSPKVINYI-RKE 254 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LGFDG++ +D L M+G +I G + ++ AG ++ L+ N +G + + E Sbjct: 255 LGFDGLLVTDSLVMKGVSIQG-IDVAVRKAILAGNNI-LIANTMEGDFAEFYKM----IE 308 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQLH 331 V G Q + DS R + IS++L H Sbjct: 309 GVLNDVKSGVIPEQLIDDSCR-RIISSKLKLKH 340 >UniRef50_Q5P081 Beta-hexosaminidase n=63 Tax=Proteobacteria RepID=NAGZ_AZOSE Length = 356 Score = 315 bits (808), Expect = 2e-84, Method: Composition-based stats. Identities = 142/298 (47%), Positives = 193/298 (64%), Gaps = 5/298 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GPVMLDV G+ L EERE L PLVGG+ILF RN+ D QL+ L +I A L++A Sbjct: 14 LGPVMLDVAGFALTEEERERLLDPLVGGVILFARNFRDSEQLQALTAEIHALRSPALIIA 73 Query: 61 VDQEGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISF 117 VD EGGRVQRFR +GFTR+P+ + L + L A+ AG+++A+E++A +D+SF Sbjct: 74 VDHEGGRVQRFRTDGFTRIPSMRCLGRLWERDHVAALESARCAGYVLAAELLAHGVDLSF 133 Query: 118 APVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLD+ + S +G+R++H DP A+A + GM +AGM GKHFPGHG ADSH Sbjct: 134 TPVLDLDYGCSRVVGDRAFHRDPLVVAALAQSLVSGMADAGMGCVGKHFPGHGYAEADSH 193 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 E P D R I +D++ + + +L +MPAHVIY VDP PA S +WL+ +LR Sbjct: 194 VEIPVDEREFDAIWTEDIAPYRHRLGR-QLAGVMPAHVIYPRVDPNPAGFSRFWLQDILR 252 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 +GF GVIFSDDL+MEGA ++G R +A+ AGCD++LVCN AV +LD +P Sbjct: 253 GRVGFGGVIFSDDLTMEGATVVGDILARARAAFGAGCDVVLVCNRPDLAVDLLDRWAP 310 >UniRef50_B0SE07 Beta-glucosidase-related glycosidase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SE07_LEPBA Length = 588 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 86/330 (26%), Positives = 150/330 (45%), Gaps = 13/330 (3%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN----RLVVAV 61 + + +D + + GG+ILF +N ++ L ++ +++ +++ Sbjct: 64 IAIPSKTVDQIALDEIKQTKPGGIILFGKNLGKKEEILSLTENLQRSAKENGLPPFLIST 123 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 DQEGGRV R ++G T P A + E G + G++ + E+ ++ ++ FAPVL Sbjct: 124 DQEGGRVFRVQDGITHYPGAMAVGQTGKKEWG----EIVGFVTSYELRSLGLNFLFAPVL 179 Query: 122 DVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 D+ + + I RS+ +D + +A + G G KHFPGHG T DSH Sbjct: 180 DINNNPLNPVINTRSFGSDVHRVSDVAVAYERGARAGGCLPVIKHFPGHGDTTVDSHLGL 239 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQE 238 P + E+ ++ F IR +A+M AH++Y +DP PA+ S L VLR+ Sbjct: 240 PIINKSLTELEKLELVPFQESIR-GGAEAVMSAHIVYPKIDPNFPATLSKTILTDVLRKS 298 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 L FDG+I +D + M + G ++ AG +++L+ + + A L Sbjct: 299 LQFDGIIITDAMEMHAISKNYEKDRPGVLTILAGANIVLLTSWGETAKKFKAQLMDAYKN 358 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLN 328 K + +L+ A K I +L Sbjct: 359 GEFEFVDK-EGKKHDLLKEAVQKQIRKKLE 387 >UniRef50_C9KM62 Glycosyl hydrolase domain protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KM62_9FIRM Length = 381 Score = 314 bits (806), Expect = 2e-84, Method: Composition-based stats. Identities = 90/329 (27%), Positives = 157/329 (47%), Gaps = 23/329 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS--RNRLVVAV 61 VM+ V+G E+ + +L +GG+ILF RN Q ++L+ ++ + + L + V Sbjct: 66 VMIGVQGTEVTDDSLYMLHQYHMGGVILFDRNMESADQTKKLIADLQEQADQKVPLFIGV 125 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 D+EGG+V R + T P+ Q + A+E+ A ++ + +++FAPV Sbjct: 126 DEEGGQVVRGKSFLTPPPSEQEIGRSGEVTR----AEESARQTAEKLKKLGFNVNFAPVA 181 Query: 122 DVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 DVG RS+ DP++ + G + + KHFPG G T DSH++ Sbjct: 182 DVGD-----YARSFGPDPEQTAKFVAAAAQGYEQEHVMYALKHFPGIGRGTVDSHEDISS 236 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLD--AIMPAHVIYSDVDPR-PASGSPYWLKTVLRQE 238 + E+ +D+ F ++I E + + ++ +H+ Y +D PAS S LR E Sbjct: 237 ITATKDELMKRDLVPFRTVIDERQPEDYFVLVSHLRYPALDAENPASLSKAIQTDFLRGE 296 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LG+ G+I +DD+ M A S+ E G +++AG D++LVC+ V L Sbjct: 297 LGYRGLIITDDVEMGALAKHYSFRELGVKAVEAGSDIVLVCHEYPHETDVYLGL------ 350 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRL 327 G+ S + + +S R + + +L Sbjct: 351 --LDAVQDGTISMERVNESVR-RIVKAKL 376 >UniRef50_Q5WDA0 Beta-N-acetylhexosaminidase n=1 Tax=Bacillus clausii KSM-K16 RepID=Q5WDA0_BACSK Length = 521 Score = 314 bits (804), Expect = 4e-84, Method: Composition-based stats. Identities = 105/344 (30%), Positives = 157/344 (45%), Gaps = 30/344 (8%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN---- 55 +G +++ G E + ++ + VG +ILF+RN P QL L ++ A++ Sbjct: 9 LGQLLVFGFHGTTATPEIKAMIRNHYVGNIILFSRNIETPEQLLTLTSDLQKAAKEAGHE 68 Query: 56 -RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L + VDQE G V+R G T LP A A + A G L A E+ A+ ++ Sbjct: 69 RPLAICVDQENGTVRRL-SGVTALPGAMLLGAT----HKRRYASIVGQLTAKELRAVGVN 123 Query: 115 ISFAPVLDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 + APV DV + IG RS+ DP K A + GM G+ T KHFPGHG Sbjct: 124 WNLAPVADVNNNPNNPVIGVRSFGEDPSKVGDFAKEMMTGMQAEGVIATLKHFPGHGDTV 183 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYW 230 DSH P P ++ ++ F + I E +AIM AH+ + + +P PA+ S Sbjct: 184 VDSHHAMPTIPHGLKRLQQVELVPFKACI-EAGAEAIMSAHIHFPALETEPVPATISYKV 242 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 + +LR ELG+ GV+ +D L ME A A+ A+L AG DM++V + + Sbjct: 243 MTELLRHELGYSGVLMTDCLEMEAIAGSIGTAQGAVAALLAGADMLVVSHCESRQRPTVA 302 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 L AE G F D R R++ L ++ Sbjct: 303 ALEAAFAE--------GRF------DQKRLDMSFQRIDALKAKY 332 >UniRef50_Q67LG3 Putative beta-N-acetylglucosaminidase n=1 Tax=Symbiobacterium thermophilum RepID=Q67LG3_SYMTH Length = 637 Score = 313 bits (802), Expect = 7e-84, Method: Composition-based stats. Identities = 106/343 (30%), Positives = 164/343 (47%), Gaps = 26/343 (7%) Query: 1 MGPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----- 54 +G + + + G EL E R +LA VGG+I F R DP LR L +++AA+ Sbjct: 101 LGQLFWVGLPGTELTDEIRGLLAEGKVGGVIFFGRQGSDPDVLRRLSGELQAAAAARDRA 160 Query: 55 -NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 LV+AVD EGG VQRF FT+ P A + A A+ G MA E++A+ + Sbjct: 161 TPGLVIAVDHEGGLVQRFGPPFTQWPGAMALGAAGSSAH----AEAMGRAMARELLAVGV 216 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 +++ APV DV + + IG RS+ P+ +A+ G+ G+ KHFPGHG Sbjct: 217 NMNLAPVADVNNNPANPVIGTRSFGEQPELVARMAAALAAGLQAEGVSAVAKHFPGHGDT 276 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPY 229 DSH + P + + ++ F + + +DA+M AHV++ V D RPAS SP Sbjct: 277 AVDSHLDLPVIDHARERLDRVELVPFRAAVGAG-IDAVMVAHVVFPAVAGDGRPASLSPD 335 Query: 230 WLKTVLRQELGFDGVIFSDDLS-MEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSV 288 + +L+ ELGF GV +D L M E +++AG D++LV + + Sbjct: 336 VVTGLLKGELGFAGVAMTDALDGMAAITDTYGVEEGLVLAVEAGADVLLVTESFGRQQA- 394 Query: 289 LDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQL 330 R+ + +G + D+A R A+ + L Sbjct: 395 -------LYRRLLQAVEEGRIPEGRVNDAAGRVLALKEKRGLL 430 >UniRef50_Q11Z59 B-glycosidase, glycoside hydrolase family 3 protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11Z59_CYTH3 Length = 990 Score = 312 bits (801), Expect = 9e-84, Method: Composition-based stats. Identities = 76/315 (24%), Positives = 141/315 (44%), Gaps = 25/315 (7%) Query: 16 EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGF 75 E +++ + +GG+I F P + +L + ++ S L VA+D E G R Sbjct: 73 EIEQLIKNNGIGGVIFFQGG---PVRQAQLTNRYQSVSNVPLFVAMDAEWGLGMRLDSTM 129 Query: 76 TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGER 133 P + A+ + G + + + + I+FAPV+DV + IG R Sbjct: 130 -NFPKQMTLGAI----QDNAAIFNMGVEIGKQCKRLGVHINFAPVVDVNSNANNPVIGVR 184 Query: 134 SYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKD 193 S+ D A ++ GM + KHFPGHG DSH P + E+ + Sbjct: 185 SFGEDKINVSQKAIAYMKGMQSVHVMANAKHFPGHGNTDTDSHFSLPVVNKNVQELNDTE 244 Query: 194 MSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLS 251 + F LI ++ + +I+ AH+ +D +PA+ S + +LR ++GF G+IF+D L+ Sbjct: 245 LYPFRQLI-DSGVGSIIVAHMNVPSLDNTNKPATLSKPIVTDLLRNDMGFRGLIFTDALN 303 Query: 252 MEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSR 311 M+G + + E +L AG D++L N P+ +++ + + ++ Sbjct: 304 MKGVSNLYKPGEVDVKALLAGNDILLYAEN-----------VPLAIKKIVKAINDKDITK 352 Query: 312 QELMDSARWKAISTR 326 +E+ + K + + Sbjct: 353 EEIH-ARVKKILLAK 366 >UniRef50_UPI000196C637 hypothetical protein CATMIT_02394 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196C637 Length = 790 Score = 312 bits (801), Expect = 9e-84, Method: Composition-based stats. Identities = 93/345 (26%), Positives = 164/345 (47%), Gaps = 28/345 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVAVDQEGG 66 +++ + R+I+ G LI F +N + Q L ++ A+ L+++ DQEGG Sbjct: 69 TKMNDQVRKIVEDYDFGALIYFAQNIQETEQSYNLSMAMQKAATKDGGIPLLISADQEGG 128 Query: 67 RVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI 126 V R G T LP + A E A++AG ++ SE+ ++ I+ + APV+DV + Sbjct: 129 SVYRLGSG-TALPGNMALGATHNTE----YAKKAGQIIGSELSSIGINTNLAPVVDVNNN 183 Query: 127 SAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + IG RSY + +AS I G+ + + KHFPGHG DSH P + Sbjct: 184 ANNPVIGLRSYSDNADVVGNMASATIAGLKDYNVIGCAKHFPGHGDTATDSHYGLPIVNK 243 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-------------RPASGSPYWL 231 +AE+ ++ + I + ++ IM AH++Y +D +PA+ S + Sbjct: 244 SKAELLNNELKPYKIAINQG-IEMIMTAHILYPQLDNTTVVSEKTGNEEKKPATMSKAII 302 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L+ E+GF+GV+ +D ++M+ A ++ + +++AG D+I + V Sbjct: 303 TDLLKGEMGFNGVVTTDAMNMKAIADTFGESQAVKLAIEAGADLICMPTVLYNQEDV-KK 361 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 L I + V KG S L D R + ++ + N+ W+E Sbjct: 362 LDTI-IDYVEDAVKKGEISESRLDDGCR-RILTVKENRGILDWKE 404 >UniRef50_B4S1A2 Beta-hexosaminidase A n=2 Tax=Alteromonas macleodii RepID=B4S1A2_ALTMD Length = 673 Score = 312 bits (801), Expect = 1e-83, Method: Composition-based stats. Identities = 85/339 (25%), Positives = 146/339 (43%), Gaps = 26/339 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR----AASRNRLVVAVDQEGG 66 ++ A +L+ +GG+ILF+ N + QL L ++ + L +A+DQEGG Sbjct: 90 TKIPASLLALLSGHNIGGVILFSENIENTEQLITLNYTMQQHMLKVGKQPLFIAIDQEGG 149 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG- 124 RV R + A + G AQ + ++ + I+ +FAP +D+ Sbjct: 150 RVARLPSDMLAPFAGNMAIGATF-HQHGSTFAQRVARHIGKTLLPLGINTNFAPSVDINS 208 Query: 125 -HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I RS+ P++ A+ F++ M G+ + KHFPGHG DSH P Sbjct: 209 EPKNPVINVRSFSEHPEQVAALGQTFVEAMQGTGVISAIKHFPGHGDTHVDSHSGLPLVS 268 Query: 184 RPQAEIRAKDMSVFSSLIRENKLD-AIMPAHVIYSDVDPR------------PASGSPYW 230 +A+ D+ FS++IR +M AH+ Y +D PA+ S Sbjct: 269 HTRAQAMTGDILPFSNIIRSATPPAMVMSAHIQYPSLDDTKITDKNGKRQIVPATLSKKI 328 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC---NNRKGAVS 287 L +LR++LG+ G+I +D L M G S+ E + AG D+ L+ N+ + Sbjct: 329 LTNLLRKQLGYKGLIVTDALDMAGITQFFSHEEALVRAFSAGADIALMPFTIRNKTDINA 388 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + L + + + S + S+ + I + Sbjct: 389 FVKLLDNVAVKLADSALD--TSSESAFLVSSYNRIIEAK 425 >UniRef50_B0MSB5 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MSB5_9FIRM Length = 406 Score = 312 bits (800), Expect = 1e-83, Method: Composition-based stats. Identities = 88/323 (27%), Positives = 147/323 (45%), Gaps = 26/323 (8%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF 71 +D + VGG+ LF+RN QL + ++++S+ L +AVD+EGGRV R Sbjct: 96 RVDDAVISRIEEYPVGGIALFSRNITSAEQLPMFISDLQSSSKYPLFIAVDEEGGRVARI 155 Query: 72 RE----GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI- 126 + + + A+E G + S + + ++ FAPV D Sbjct: 156 ANSDFFNVASYKSMEDIGKSGDASK----AEEVGRQIGSYLKELGFNLDFAPVADTNTNP 211 Query: 127 -SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRP 185 + IG+RSY +DP + S +DGMH++G+ T KHFPGHG D+H + Sbjct: 212 QNIVIGDRSYGSDPALVARMVSAQLDGMHDSGIMGTLKHFPGHGDTKDDTHSGYVSIEKT 271 Query: 186 QAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDG 243 E++ ++ F + + + D +M +H+ +V D P S S + LR ELG+DG Sbjct: 272 WDELKECELVPFITALPKA--DMVMVSHITAVNVTSDKLPTSMSETMITGKLRNELGYDG 329 Query: 244 VIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRL 303 VI +D ++M A + AE ++ AG D++L+ N A + V Sbjct: 330 VIITDAMAMGAVADNYTSAEAAVTAVKAGVDIVLMPQNLDEAFN-----------GVMNA 378 Query: 304 YHKGSFSRQELMDSARWKAISTR 326 G S + L +S + + + Sbjct: 379 VTDGEISMERLDES-VLRILKMK 400 >UniRef50_B2SGG8 Glycosyl hydrolase family 3 n=17 Tax=Francisella RepID=B2SGG8_FRATM Length = 555 Score = 311 bits (798), Expect = 2e-83, Method: Composition-based stats. Identities = 81/287 (28%), Positives = 133/287 (46%), Gaps = 22/287 (7%) Query: 8 VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGR 67 + + + +I +GG ILF N + Q+ L+R ++A + + A DQEGGR Sbjct: 31 IPFTKTNDIVNKIFKDYNLGGFILFRENIQNNEQVISLLRDLQANTNTPIFFATDQEGGR 90 Query: 68 VQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI- 126 V R ++G T + AA A ++ E+ ++ I+I+FAP +DV Sbjct: 91 VNRLQQG-TSGCGNMALAATDNPHN----AYTMAKIIGDELYSLGININFAPAVDVNSNK 145 Query: 127 -SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRP 185 + IG RSY +P + A I+G H+A + KHFPGHG DSH + Sbjct: 146 NNPIIGVRSYSDNPDIVIDYAKNAINGYHDAKIIDCIKHFPGHGDTATDSHLGNVNLDKT 205 Query: 186 QAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGSPYWLK 232 E++ ++ FS L R+ IM AH+ +D PA+ S + Sbjct: 206 LKELQTTELLPFSKLARD--CSMIMTAHISVPALDDTQYQSVSTSENIYVPATLSYKIIT 263 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC 279 +L+Q++ FDG++ SD + M A E + ++ AG D++L+ Sbjct: 264 KLLKQQMKFDGLVVSDAMDMHAIAKHFGTIEASKLAILAGIDILLMP 310 >UniRef50_C1YLC2 Beta-glucosidase-like glycosyl hydrolase n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YLC2_NOCDA Length = 558 Score = 311 bits (797), Expect = 2e-83, Method: Composition-based stats. Identities = 102/353 (28%), Positives = 158/353 (44%), Gaps = 31/353 (8%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----R 54 +G +++ +G AE + GGLI F N D Q+ + ++ + Sbjct: 64 IGQLLVLTAQGTSA-AENAAQIEAYRPGGLIYFDANLTDAEQIATMSAGVQDLAADQGRG 122 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L V +DQE G V R G TR P A + A E LA+ A E+ A+ ++ Sbjct: 123 VPLFVGIDQEQGLVARLPVG-TRFPDAMAVGATRDTE----LAELRASTTAEELTALGVN 177 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 +++AP DV + IG RS+ +DP +A D AG+ + KHFPGHG Sbjct: 178 LNYAPDADVNTDPGNPVIGIRSFGSDPDLVAQMAVAESDAYSGAGVVSVVKHFPGHGDTD 237 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV----DPRPASGSP 228 DSH P P+ + A + F + I + +DAIM AHV+ + DP PA+ SP Sbjct: 238 VDSHSGLPVIDMPREQWEAGHLPPFRAAI-DADVDAIMTAHVLMPQLDGSEDPEPATISP 296 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSV 288 + +LR ELG+DGV+ +D L+MEG S E L+AG D +L+ + AVS Sbjct: 297 ELIDGILRDELGYDGVVTTDALNMEGVRQRHSDGEIAVRVLEAGVDQLLMPPDPAAAVSA 356 Query: 289 LDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEKAG 340 + +G + + + +S R A+ + L + + Sbjct: 357 -----------IREAVEQGRLTEERIDESVLRVLALKEKRGILEAEPVDAQGA 398 >UniRef50_A1SM50 Glycoside hydrolase, family 3 domain protein n=7 Tax=Actinobacteria (class) RepID=A1SM50_NOCSJ Length = 631 Score = 310 bits (796), Expect = 4e-83, Method: Composition-based stats. Identities = 98/326 (30%), Positives = 149/326 (45%), Gaps = 30/326 (9%) Query: 18 REILAHPLVGGLILFT-----RNYHDPAQLRELVRQIRAASR-----NRLVVAVDQEGG- 66 + +A GG+I FT N P Q+ L ++AA+R L ++VDQEGG Sbjct: 104 AQAIAKYQPGGVIYFTTRNGDDNIGTPEQVARLSNGLQAAARALPGGIPLQISVDQEGGA 163 Query: 67 RVQRFR--EGFTRLPAAQSFAALSGMEEGGKL-AQEAGWLMASEMIAMDIDISFAPVLDV 123 V RF G T+LP + A + G A+ + ++ +E+ A+ + +APV DV Sbjct: 164 LVARFGAASGATQLPGNMALGAGALGTGGSAADARRSATVIGAELAAVGVTQDYAPVADV 223 Query: 124 G--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG RS +DP + + + G H G+ KHFPGHG DSH P Sbjct: 224 NVNPNNPVIGIRSIGSDPALVSDLVAAQVRGFHRGGVSAVAKHFPGHGDTGVDSHFGLPE 283 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELG 240 ++++ D+ F + I +D IM AHV+ +DP PA+ S L +LR+ELG Sbjct: 284 VTHTRSQLEEIDLPPFRAAI-AAGVDTIMTAHVVLPAIDPGVPATMSRKILTGLLRRELG 342 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 FDG+I +D L M GA +L AG D +L+ A V Sbjct: 343 FDGLIVTDALDMGGATATYPPDVAPVRALLAGADQLLIPPEMDTAYRA-----------V 391 Query: 301 TRLYHKGSFSRQELMDSARWKAISTR 326 + G SR+ L +S ++ + + Sbjct: 392 LKAVRSGQISRERLNES-VYRILLHK 416 >UniRef50_C3RKZ3 Beta-hexosamidase A n=5 Tax=Bacteria RepID=C3RKZ3_9MOLU Length = 1092 Score = 309 bits (793), Expect = 7e-83, Method: Composition-based stats. Identities = 102/348 (29%), Positives = 168/348 (48%), Gaps = 37/348 (10%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLR-ELVRQIRAASR--------NRLVVAVD 62 ++ E EI+ +G +ILF N++ A++ EL ++ A+ L++A D Sbjct: 78 TINDEIHEIIGEYDIGSVILFAANFNSDAKVNVELTDGLQKAAMDKDLGKNSIPLLIATD 137 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 QEGG V R G T LP + A E A +AG ++ SE+ A+ ++++FAP D Sbjct: 138 QEGGIVYRLT-GGTALPGNMALGASGNTEN----AVKAGNIIGSELNAVGVNVNFAPDAD 192 Query: 123 VGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG RS+ ++PQ A S +I+G+ + T KHFPGHG V DSH P Sbjct: 193 VNNNPNNPVIGLRSFSSNPQLAAKFVSAYIEGVQSNNVATAAKHFPGHGNVATDSHTGLP 252 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV-------DPR------PASGS 227 P + E+ ++ F + I + D +M AH+ + ++ D + PA+ S Sbjct: 253 SVPATKEELYKTELVPFQAAI-DAGTDMVMTAHIQFPNIVTEEIYSDKKDELMSPPATLS 311 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC----NNRK 283 L +LR E+ FDGVI +D ++M+G A ER ++ AG D++ + ++ K Sbjct: 312 REILTDLLRDEMKFDGVIVTDSMTMQGVANYFDTNERNLLAVKAGVDILDIPFTDISSMK 371 Query: 284 GAVSVLDNLSPIKAERVTRL--YHKGSFSRQELMDSARWKAISTRLNQ 329 S L L + T+ Y+ S +EL S + + + N+ Sbjct: 372 DMESKLIPLINAFVDAYTKEDGYNGIKLSLEELDKSVE-RILVCKYNR 418 >UniRef50_A0M1W2 Family 3 glycosyl hydrolase/beta-lactamase fusion protein n=16 Tax=Bacteroidetes RepID=A0M1W2_GRAFK Length = 978 Score = 309 bits (793), Expect = 7e-83, Method: Composition-based stats. Identities = 84/329 (25%), Positives = 144/329 (43%), Gaps = 26/329 (7%) Query: 15 AEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREG 74 R+ + +GGLI P +L + + S L+V +D E G R Sbjct: 75 DSIRKQIKENHIGGLIFSKGG---PVMQAQLTNEFQEMSEIPLLVGMDAEWGLAMRLDST 131 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGE 132 F LP + A+ + KL +EAG ++ + + +FAPV+D+ + IG Sbjct: 132 F-ALPWNMTLGAI----QDNKLVEEAGAAISRHTKRLGVHFNFAPVVDINTNPDNPIIGN 186 Query: 133 RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAK 192 RS+ D A F++GMH G ++ KHFPGHG DSHK P I Sbjct: 187 RSFGEDKINVTEKALAFMNGMHREGGLSSAKHFPGHGDTDTDSHKTLPTINFTSERIDEV 246 Query: 193 DMSVFSSLIRENKLDAIMPAHVIYSDV---DPRPASGSPYWLKTVLRQELGFDGVIFSDD 249 ++ + LI + L ++M AH+ + D +PAS S + +L++ +GF+G+IF+D Sbjct: 247 ELYPYRKLIPKG-LSSVMVAHLNVPALESKDGKPASLSQKIITDILKKNMGFNGLIFTDA 305 Query: 250 LSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSF 309 L M+G + E + AG D++L+ + A ++ + + G Sbjct: 306 LDMKGVSRNKEPGEVDLDAFMAGNDVLLMSEDIGKA-----------SKSIIEAVNSGLV 354 Query: 310 SRQELMDSARWKAISTRLNQLHERWQEEK 338 S + L S + K + + +++ Sbjct: 355 SEERLELSVK-KILYAKYKVGLNKFKPIT 382 >UniRef50_C2M7P2 Family 3 glycosyl hydrolase/beta-lactamase fusion protein n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M7P2_CAPGI Length = 769 Score = 309 bits (793), Expect = 7e-83, Method: Composition-based stats. Identities = 80/311 (25%), Positives = 138/311 (44%), Gaps = 16/311 (5%) Query: 17 EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFT 76 +E++ +GG+I P +L + + A++ L++A+D E G R + Sbjct: 67 IKELVKKYHIGGIIFSKG---SPVAQAKLTNEFQEAAQVPLMIAMDAEWGLAMRLDSVY- 122 Query: 77 RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERS 134 P + AL L ++AG +A + + I FAP +D+ + IG RS Sbjct: 123 AFPWNMTLGALKD----NALVEQAGKRIAQHCKRLGVHIDFAPDIDINTNPQNPIIGNRS 178 Query: 135 YHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDM 194 + F GM G+ GKHFPGHG DSHK P + + + + Sbjct: 179 FGESKFNVAQKGVAFTKGMQSVGVLACGKHFPGHGDTEKDSHKTLPTVSFSRERLESTEF 238 Query: 195 SVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELGFDGVIFSDDLSME 253 F LI + ++M H+ +D +PAS S + +++Q+LGF G+IFSD L M+ Sbjct: 239 FPFQKLIGAG-VASLMVGHLNVPALDEGKPASISYKIVSGLIKQQLGFQGLIFSDALGMK 297 Query: 254 GAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQE 313 G A ++ + AG D++L+ ++ + L N RL H S ++ Sbjct: 298 GVADYAESSQVDLQAFLAGNDVLLMSSDPIKGIETLKNAYSAGTINEYRLAH----SVKK 353 Query: 314 LMDSARWKAIS 324 ++ + W ++ Sbjct: 354 ILKAKYWAGLN 364 >UniRef50_Q2NRJ7 Putative glycosidase n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRJ7_SODGM Length = 650 Score = 309 bits (792), Expect = 1e-82, Method: Composition-based stats. Identities = 94/336 (27%), Positives = 150/336 (44%), Gaps = 24/336 (7%) Query: 13 LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA---SRNRLVVAVDQEGGRVQ 69 L+A IL +GG+ILF N Q+R L Q+ A L++A+D EGG V Sbjct: 95 LNANIEHILHTYHIGGVILFANNLKKTDQIRRLTAQLAAVPQTHGIGLMIAIDNEGGNVF 154 Query: 70 RFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI-- 126 R G +T AL L G +MA E+ ++ I+++FAPV+DV Sbjct: 155 RLPRGEYTAFAGNMPLGALDQGHSESVLPYLQGKIMAQELRSLGINVNFAPVVDVNSNQA 214 Query: 127 SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 + I R++ P + +A + G A + T KHFPGHG DSH + P R + Sbjct: 215 NPVINVRAFSDSPARVDKLARELVKG--SAQIVTCYKHFPGHGDTHTDSHMDLPRVDRNR 272 Query: 187 AEIRAKDMSVFSSLIREN-KLDAIMPAHVIYSDVDP------------RPASGSPYWLKT 233 E A D++ + + I EN D IM AH+ Y +D PA+ S Sbjct: 273 EEAYAIDLAPYRAAIAENEAPDMIMTAHIQYPALDDSRITRLDGEDIVVPATLSRRIQHD 332 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LRQELG+ GV +D L M+ + + + +AG D+ L+ + + + Sbjct: 333 LLRQELGYRGVTITDALDMKAISANFDPRDVVKRVFNAGIDIALMPVSIQAPEE--EEKL 390 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + V G + +E+ D++ + +S + + Sbjct: 391 RELIDYVATQVEHGEINGEEI-DASVARILSLKRAK 425 >UniRef50_B9E727 Putative uncharacterized protein n=1 Tax=Macrococcus caseolyticus JCSC5402 RepID=B9E727_MACCJ Length = 379 Score = 308 bits (790), Expect = 2e-82, Method: Composition-based stats. Identities = 93/330 (28%), Positives = 161/330 (48%), Gaps = 19/330 (5%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQI-RAASRNRLVVAV 61 ++L +G + +A + A +GG+ILF RN + +QL +L R I A+ + + + Sbjct: 61 QLVLGFDGTDYNAALKRYSAR-NIGGVILFKRNIVNESQLTQLNRDIFHNAANIPMFIGI 119 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 DQEGGRV R A G+ + A + G + + M ++ FAPVL Sbjct: 120 DQEGGRVNRLPASI----QNIESAYTIGLSHDKQYAYQQGRYIGENVKRMGFNLDFAPVL 175 Query: 122 DV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 D+ + IG+RS+ D ++ F G+ E G+ T+GKHFPGHG +DSH+ Sbjct: 176 DIWSNPANKVIGDRSFGHDAHMVSGMSRSFRQGITETGIITSGKHFPGHGDTVSDSHETL 235 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQE 238 P + E+ ++ F I + +D +M +H+ + D+D + PAS S + +L+++ Sbjct: 236 PVSNKSLDELMKFELIPFKQHIDDG-IDMLMVSHISFPDIDNKYPASISKEIVTNILKKK 294 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LG+ GVI SDDLSM + S E L +G ++L+ ++ +++ + Sbjct: 295 LGYKGVIISDDLSMGAISDRYSLNEAVVRGLQSGETIMLIGSDAVDVDTLI--------Q 346 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLN 328 V G+ + +++D K I +L Sbjct: 347 YVRSNVENGNIDK-KIIDENNEKIIRLKLK 375 >UniRef50_Q2SV66 Beta-N-Acetylglucosaminidase n=31 Tax=Burkholderiaceae RepID=Q2SV66_BURTA Length = 699 Score = 308 bits (790), Expect = 2e-82, Method: Composition-based stats. Identities = 97/339 (28%), Positives = 145/339 (42%), Gaps = 23/339 (6%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA----SRNRLVVAVDQEGG 66 E A R+ L VGG+ILF+ N Q R L+ IRAA S L++ +D+EGG Sbjct: 88 TEFPAAARDALRDSGVGGVILFSNNLTGIEQTRRLIDGIRAAPAADSPVGLLIGIDEEGG 147 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 V R T + A +LA + G ++A+E+ A+ +++FAP +DV Sbjct: 148 NVFRLPRVEATAFSGNMALGAAYEATRDDRLAYDMGRVLAAEIAAVGFNVNFAPDVDVNS 207 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I R++ DP + R + GM + T KHFPGHG DSH P Sbjct: 208 NPLNPVINVRAFGDDPATIGLLGRRMVQGMASERVIGTFKHFPGHGDTDTDSHYGLPVVI 267 Query: 184 RPQAEIRAKDMSVFSSLIREN-KLDAIMPAHVIYSDVDPR------------PASGSPYW 230 + +A+ A D++ + I D IM AH+ Y +D PA+ S Sbjct: 268 KSRADAYAIDLAPYRQAIESGEAPDMIMTAHIQYPSLDDTRVTTRTGEQMIAPATMSRRI 327 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 +LR E G+ GV +D L M+G A A+ A D+ L+ + A Sbjct: 328 QHDILRGEFGYRGVTITDALDMKGIADFFDEADAVIKVFQADVDIALMPVEFRTAADA-- 385 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 +RV G R E DS+ + + T+L Sbjct: 386 GRLAALVDRVAAAVDSGRIDRAEF-DSSVRRIVLTKLRN 423 >UniRef50_C6HUU9 Beta-N-acetylhexosaminidase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HUU9_9BACT Length = 321 Score = 308 bits (790), Expect = 2e-82, Method: Composition-based stats. Identities = 106/303 (34%), Positives = 165/303 (54%), Gaps = 10/303 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 + + + G +L A +RE L G+++F N + AQ++ L+ +IR+ + ++AVDQ Sbjct: 8 LWVSLPGPDLTAADREFLERVDPLGVVIFRENVRNAAQVKSLIEEIRSVPGSPRLLAVDQ 67 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGGRV R +G LPA + A E +EAG+ + + + D+ FAPVLDV Sbjct: 68 EGGRVARLSDGVPALPAMKILGAQPERE-----IEEAGFSLGEALHRLGFDVDFAPVLDV 122 Query: 124 --GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG+R++ +DP++A A F G+ AG+ GKHFPGHG + DSH P Sbjct: 123 DSNPQNPIIGDRAFSSDPEEAGVKALAFARGLLRAGILPCGKHFPGHGDTSLDSHVALPV 182 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELG 240 P + + A+++ F I E ++ +M AHV+Y +D PA+ S + +LR++L Sbjct: 183 VDAPLSLLEARELVPFRMAI-EREIPLMMTAHVVYPSIDKAVPATLSRAVVTGILREKLN 241 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 + G++ SDDL M A G E AS+DAGCD +LV ++ A++V D L I Sbjct: 242 YGGLVLSDDLLMAAVA-RGGVVEAALASIDAGCDGLLVLKSQSQALAVADRLEEILRGEP 300 Query: 301 TRL 303 RL Sbjct: 301 GRL 303 >UniRef50_C5CH66 Beta-N-acetylhexosaminidase n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CH66_KOSOT Length = 504 Score = 307 bits (788), Expect = 3e-82, Method: Composition-based stats. Identities = 91/328 (27%), Positives = 158/328 (48%), Gaps = 18/328 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ + G +DAE +IL G +ILF+RN +P+Q+++ + + + + ++A+DQE Sbjct: 12 MIGLPGKSMDAETVKILERVNPGSIILFSRNIENPSQVKDFIDECKRVLGYKPLIAIDQE 71 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD-- 122 GG V R +GF P A + AA E A A ++A EM A+ ID APV+D Sbjct: 72 GGIVTRLTKGFAISPGAMAIAATGNSEN----AFIAASILAKEMRAVGIDWDLAPVVDIN 127 Query: 123 VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + IG RS+ +D + +F G+ E G+ KHFPG G V+ D+H + P Sbjct: 128 NNPNNPGIGVRSFSSDVNTVVEFGRKFTKGLQENGIIACLKHFPGKGRVSVDAHIDMPVL 187 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELG 240 E++ ++ F + + + MP+H+ + PA+ S L ++R++L Sbjct: 188 DVSYDELKKFELIPFKKI----EALSWMPSHIYIPALQHTKIPATLSEEILTELVRKKLH 243 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL------SP 294 + GV+ +DDL M G E + AG D++ VC+N ++V + L P Sbjct: 244 YKGVLIADDLLMGGITNYFPVEEATLKAFAAGMDILTVCHNDDIQIAVKNYLVKQIEKEP 303 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKA 322 +R+ K ++ + + + + Sbjct: 304 KLQKRLEESLSKIEAMKKLINKAPKIEI 331 >UniRef50_D1R988 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R988_9CHLA Length = 544 Score = 307 bits (787), Expect = 4e-82, Method: Composition-based stats. Identities = 85/340 (25%), Positives = 165/340 (48%), Gaps = 23/340 (6%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRA-----ASRNR 56 +M+ G E + + ++++ +GG+I + D Q++ L R +++ A Sbjct: 39 LMVHFYGTEANGDAKKLIQDFHIGGIIYYEWANGLTDAQQVQILSRGLQSLAQERAHPIP 98 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L +A+DQEGG V R + GFT+ P + + + A+ + E+ ++ I+++ Sbjct: 99 LFLAIDQEGGVVNRLKNGFTQFPGNYALGQIKKAGD----AEACAYATGLELKSVGINVN 154 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 APV+DV + IG R++ + P+ ++ ++G ++ + + KHFPGHG V D Sbjct: 155 LAPVVDVNCNEKNPIIGIRAFGSSPKDVASLGKEMMEGFRKSSILSVLKHFPGHGDVEVD 214 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPA-SGSPYWLKT 233 SH+ PC + + E+ ++ F L+ + +A+M +H+ +D + + S + Sbjct: 215 SHQALPCLNKTRGELDQVELYPFHQLLPLS--EAVMTSHLFVPALDAKTCITFSCDAVTQ 272 Query: 234 VLRQELGFDGVIFSDDLSMEG-AAIMGSYAERGQASLDAGCDMILVCNNRKGAVS----- 287 VLR+E+GF+GVI +D L+M+G + S AG D++L+ + Sbjct: 273 VLRKEMGFEGVILTDSLAMQGCLGQSQGVVDAAIQSFKAGHDVLLLGGKQLLGEGGGFEI 332 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRL 327 +D ++ I + + L G S Q L S R + ++ Sbjct: 333 TIDEIAKI-HQTLVFLVQTGEISEQRLDASVRRILAAKKI 371 >UniRef50_D2RL81 Glycoside hydrolase family 3 domain protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RL81_ACIFE Length = 369 Score = 307 bits (787), Expect = 4e-82, Method: Composition-based stats. Identities = 87/330 (26%), Positives = 154/330 (46%), Gaps = 22/330 (6%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-NRLVV 59 G +M + +EG E A ++ L GG+ILF RN DP Q++ L +++ +R L++ Sbjct: 54 GQLMLVGMEGTEPSAAFQKELQENHYGGVILFDRNMKDPNQVKALNSRLQDLARPEPLLI 113 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 VD+EGG+V R ++ P+ ++ + A+ + + A+ +++ AP Sbjct: 114 GVDEEGGQVARMKDQLPAPPSQEAIGRTGDPAQ----AEAWAAKTGTRLKALGFNLNLAP 169 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 V DVG ERSY + P A + G + + KHFPG G D+HK Sbjct: 170 VADVGSGM----ERSYSSHPATAAKFVEGALAGYQKVHLLACLKHFPGLGKGKEDTHKGQ 225 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLD--AIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 + + +D F + + +M HV Y +DP PAS SP + +LR Sbjct: 226 VIVSADRETLLQEDGLPFRLALEKYPEPWFGVMVTHVRYPALDPDYPASLSPKIMGDLLR 285 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 + F G++ +DDL+M A E ++ AG D+++VC+ + V L+ Sbjct: 286 GQFHFSGLVLTDDLTMGAIASQYPLEEAAVQAVRAGADLVMVCHGPETIRRVHGALA--- 342 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTR 326 + +G+ R++ +D+A ++ + T+ Sbjct: 343 -----KAIREGTLPREQ-VDAALYRILRTK 366 >UniRef50_A0L847 Beta-N-acetylhexosaminidase n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L847_MAGSM Length = 364 Score = 307 bits (786), Expect = 5e-82, Method: Composition-based stats. Identities = 113/338 (33%), Positives = 171/338 (50%), Gaps = 20/338 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ + G L ER L G++LF+RN P Q+ L+R IR S + V+ +DQ Sbjct: 10 LVVTLAGTRLSEPERAYLQRVEPAGVMLFSRNIEQPEQVVALIRAIRQQSPS-TVIWIDQ 68 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQR R TRLPAA+SFA + ++ +LA++ G L E+ A I I+ APV+ Sbjct: 69 EGGRVQRLRCPLTRLPAAESFAKIYAYAPQQAERLARQCGALAGRELRAFGIGINCAPVV 128 Query: 122 DVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 D+ IG R++ +P +A ++ G+ EAG+ KHFPGHG DSH+ Sbjct: 129 DIPAPGADPVIGNRAFGDNPLSISRLAGAWLQGLQEAGVMGVAKHFPGHGDAMVDSHRSL 188 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQE 238 P E+ ++ + L+ L IM AH++ S +DP RPA+ S LK LRQ Sbjct: 189 PVVHTEIRELLSRTWVPYVQLLP--LLQGIMTAHLVMSALDPLRPATWSSVVLKDWLRQR 246 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR-KGAVSVLDNLSPIKA 297 LG+ G+I SD L M G+ +R ++ AGCD +LVC+ R L+ + Sbjct: 247 LGYGGLIISDALEMGAL--GGNMEDRAARAVRAGCDQLLVCSGRLNHMQQALNGVQQALG 304 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 E+ +++ D+ R +A+ WQ Sbjct: 305 EK---------SPQEQRQDAQRIEAVLQNYRISPGDWQ 333 >UniRef50_A1C3U3 Beta-N-acetylglucosaminidase, putative n=7 Tax=Leotiomyceta RepID=A1C3U3_ASPCL Length = 931 Score = 306 bits (785), Expect = 6e-82, Method: Composition-based stats. Identities = 84/339 (24%), Positives = 148/339 (43%), Gaps = 27/339 (7%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN---- 55 MG + M+ +G ++ E R ++ + +G ++L +N + LV +++ +R+ Sbjct: 19 MGQLFMMGFDGTSVNPEIRSLIENYHLGSVLLTAKNLKSAEEATRLVLELQTIARDAGHR 78 Query: 56 -RLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L++A+DQE G V + + P+A AA LA + + A E+ A+ Sbjct: 79 VPLLIALDQENGGVNSLYDEIYIRQFPSAMGIAATGSK----ALAHDVAFATAQELKAVG 134 Query: 113 IDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 ++ PVLDV +G R+ DPQ+ F+ G EAG+ T GKHFP +G Sbjct: 135 VNWILGPVLDVLTNVRKQPLGVRTSGDDPQEVSQYGVEFMKGYQEAGLVTCGKHFPSYGN 194 Query: 171 VT-ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSD--VDPRPASGS 227 + S + P ++ + F + I LD++M V S V+ A S Sbjct: 195 LEFLGSQTDVPIITESLEQLSLTALVPFRNAIIHG-LDSMMVGGVSMSSAGVNVMHACLS 253 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + +LR++L F+GV+ S+ L ME + +AGCD+IL+C + Sbjct: 254 EQVVDDLLRKDLKFNGVVVSECLEMEALTHNIGVGGGTVMAKNAGCDIILLCRSFPVQQE 313 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 ++ L G R + S R + ++ + Sbjct: 314 AINGLKVG--------VENGILGRARIEQSLR-RVLAMK 343 >UniRef50_B3QRM6 Glycoside hydrolase family 3 domain protein n=1 Tax=Chlorobaculum parvum NCIB 8327 RepID=B3QRM6_CHLP8 Length = 570 Score = 306 bits (785), Expect = 7e-82, Method: Composition-based stats. Identities = 82/313 (26%), Positives = 141/313 (45%), Gaps = 26/313 (8%) Query: 19 EILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRL 78 + +GG++ + A+L + + S L+V+ D E G R +G T Sbjct: 73 RLATEGKIGGIMFLKGDLESAARLA---NRFQNLSAVPLLVSADMERGLAMRL-DGATEF 128 Query: 79 PAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD--VGHISAAIGERSYH 136 PAA + +A E LA+ G ++A E A+ I ++AP +D + + I RS+ Sbjct: 129 PAAMAVSAAGDPE----LARSMGEIIAREARAVGIQQNYAPTVDLNINPANPVINTRSFG 184 Query: 137 ADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSV 196 +++ FIDGM G+ T KHFPGHG VT DSH P + + ++ Sbjct: 185 DRIPLVNTMSAAFIDGMQSNGLVATAKHFPGHGDVTVDSHLALPVLEGDRKRLERYELQP 244 Query: 197 FSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEG 254 F S I + + ++M H+ + PAS S + +LR++LGF+G+I +D L+M+ Sbjct: 245 FRSAISQG-VMSVMVGHLAVPKLTGSLEPASLSRKIVTGLLRKDLGFNGLIVTDALNMKA 303 Query: 255 AAIMG-SYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQE 313 G + E +++AG D++L + + E V G S + Sbjct: 304 LQSGGLTPEEVAVKAVEAGNDILLYPEDPERTF-----------EAVCAAVESGKISERR 352 Query: 314 LMDSARWKAISTR 326 + S R + + + Sbjct: 353 IDQSVR-RILLVK 364 >UniRef50_C8WZJ0 Glycoside hydrolase family 3 domain protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8WZJ0_DESRD Length = 379 Score = 306 bits (785), Expect = 7e-82, Method: Composition-based stats. Identities = 99/346 (28%), Positives = 160/346 (46%), Gaps = 31/346 (8%) Query: 4 VMLDVEGYELDAEER--EILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAA 52 +M+ G +A+ E + +GG++L+ R N AQL+ L ++ Sbjct: 41 LMIGFRGTTFEAKSPLGEAITEGNLGGVVLYGRDVALNKPMRNIRSAAQLQALTADLQDH 100 Query: 53 SRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 +R L +AVD+EGG+V R F P + AA G + + + + Sbjct: 101 ARIPLFIAVDEEGGQVSRLAPRFG-FPETVT-AATWGRRNDPAWTRRGARAIGQRLRNLG 158 Query: 113 IDISFAPVLDVGHI--SAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGH 168 I+ APV+D+ + AIG ERS+ A+P A+ FI G+H+AG+ KHFPGH Sbjct: 159 CTINLAPVVDLNTNPDNPAIGKLERSFGANPDTVTRQAAAFIHGLHDAGILACIKHFPGH 218 Query: 169 GAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGS 227 G+ DSH K++ + L+ DA+M AHV ++ +DP+ PA+ S Sbjct: 219 GSAYNDSHLGLTDISTT---WSPKELEPYKRLVDRGLADAVMTAHVFHAGLDPKVPATLS 275 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + +LR+E+G++GV+ SDDL M S + + L+A D+ L NN + Sbjct: 276 AEIIPDILRREIGYEGVVISDDLQMGAIRQSFSLRQTVRRCLEADVDIFLFGNNLE---- 331 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 P RV R+ Q ++ +R + R+ +L ER Sbjct: 332 ----YEPFVWRRVQRIVRD--LVDQNIVSRSRIERSYERIQRLKER 371 >UniRef50_Q73RR1 Glycosyl hydrolase, family 3 n=1 Tax=Treponema denticola RepID=Q73RR1_TREDE Length = 557 Score = 306 bits (784), Expect = 9e-82, Method: Composition-based stats. Identities = 89/339 (26%), Positives = 156/339 (46%), Gaps = 24/339 (7%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVV 59 M G + + +G + +F N D +L + + ++ S L V Sbjct: 65 MFGWAGQDPGDLLLSWIQDSGLGSIKVFGWNTGDSRKLAKSISILQKKSLEGRFGIPLFV 124 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A DQEGG V+ + + P + A ++ +G+ ++ E+ A+ I+++FAP Sbjct: 125 ATDQEGGWVRHVKGLTSETPGNLAIGASGIPQDS----YYSGYYISREIRALGINLNFAP 180 Query: 120 VLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 +D+ H S+ IG RS+ P + + F+ G +AG+ TT KHFPGHG + DSH Sbjct: 181 TVDLLTDHDSSIIGPRSFGDSPHAVGILGAAFVRGSRDAGVLTTAKHFPGHGDTSIDSHG 240 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVL 235 P + R +++ F LI + + AIM H+ +S + + PA+ S Y L +L Sbjct: 241 RLPKIDISEETFRNRELIPFKYLI-DAGVPAIMTGHLNFSSILPNGEPATFSKYLLTDIL 299 Query: 236 RQELGFDGVIFSDDLSMEGAAI-MGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 R ELGF G+I +DD+ M GA G A+ + +L+AG D+I + + + Sbjct: 300 RGELGFKGLIITDDMMMHGAMNFAGGIAKAVKMALEAGNDIIESSTTPRHYQAFWKDN-- 357 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 R + ++ + D+A ++ + +L Sbjct: 358 ------IRAMKEEPQFKERVKDAA-FRILLEKLKYFKSD 389 >UniRef50_B5LAN7 HexI n=3 Tax=cellular organisms RepID=B5LAN7_9BURK Length = 521 Score = 306 bits (784), Expect = 9e-82, Method: Composition-based stats. Identities = 93/297 (31%), Positives = 152/297 (51%), Gaps = 11/297 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +M+ + G LD + + L + + LF +N +P QL +L +RA ++ +DQ Sbjct: 25 IMIRMPGTVLDEDTAQFLRDNHIRAVCLFRQNMVNPEQLAKLTADLRAVMGPNALIGIDQ 84 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R PAA + A+ + LA+ G +A + ++ + +FAPVLD+ Sbjct: 85 EGGAVVR-ATWLPAPPAAMALGAVDDV----ALARSVGAAVARGIRSLGFNWNFAPVLDL 139 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + IGERS+ +DPQ+A +A +++G E G+ KHFPGHG DSH++ P Sbjct: 140 NNNPANPVIGERSFGSDPQRATELALAWMEGSLEQGVACCVKHFPGHGDTHVDSHRDLPT 199 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 +P++ + A +++ F + R A+M AH++Y +DP PA+ S L +LR E Sbjct: 200 VAKPRSALDALELAPFKAARRA---PAMMSAHIVYPALDPEYPATLSRKILTDLLRSEWD 256 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 + GVI +D + M A +L AG DM++ NR L +L+ A Sbjct: 257 YRGVIITDGMDMHAIAGRYGVGNAAVRALAAGADMVMALGNRSTQEETLVSLTMALA 313 >UniRef50_C4JZX3 Predicted protein n=4 Tax=Onygenales RepID=C4JZX3_UNCRE Length = 852 Score = 305 bits (783), Expect = 1e-81, Method: Composition-based stats. Identities = 82/334 (24%), Positives = 143/334 (42%), Gaps = 24/334 (7%) Query: 1 MGPV-MLDVEGYELDAEEREILAH-PLVGGLILFTRNYHDPAQLRELVRQIRAASRN--- 55 +G + ++ G E + ++ G +ILF RN QL L+ ++ +R Sbjct: 10 LGQLFIVGFRGPTATEEIKSLIRAPYYCGSIILFQRNIKSAEQLIRLIHDLQRTAREAGH 69 Query: 56 --RLVVAVDQEGGRVQRFREGFT-RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L +AVDQE G V R + +LP + + A +++ A + + Sbjct: 70 TRPLFIAVDQENGVVTRIKPPIAAQLPGSMAIGASGDLDD----ASRVSGATGELLRGLG 125 Query: 113 IDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 +++++AP+ DV + IG RS D I SR G+ E + KHFPGHG Sbjct: 126 VNMNYAPLCDVNSEPANPVIGVRSPGDDGTFVGRITSRIARGLRENNIVPCVKHFPGHGD 185 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSP 228 DSH P + + ++ ++ F + E ++A+M +HVI S D PAS + Sbjct: 186 SKVDSHHGVPVVNKTKEKLEECELIPFRRAVAEG-IEAVMTSHVIMSAFDDSIYPASVNR 244 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSV 288 + LR+ L FDG+I +D L M+ + +L AG D ++ + V Sbjct: 245 NVVN-FLRRGLQFDGLIVTDCLEMDAIRVQFGTERGAVLALAAGVDCPMISHTYSLQVRA 303 Query: 289 LDNL------SPIKAERVTRLYHKGSFSRQELMD 316 L+ P +++R + + + +D Sbjct: 304 LEEAFRWCRNHPTMPRQISRSVSRVFALKDKFLD 337 >UniRef50_A3HXS5 Putative hydrolase/beta lactamase fusion protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HXS5_9SPHI Length = 984 Score = 305 bits (782), Expect = 1e-81, Method: Composition-based stats. Identities = 86/316 (27%), Positives = 138/316 (43%), Gaps = 26/316 (8%) Query: 16 EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGF 75 E E++ + +GGLI F P + L +A S+ L +A+D E G R Sbjct: 77 EISELVKNENLGGLIFFQGG---PDRQARLTNYYQAQSKTPLFIAMDAEWGISMRLDS-I 132 Query: 76 TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGER 133 P A + A+ + L + G MA + + + I+FAPV+DV + IG R Sbjct: 133 PDFPKAMTLGAVQDTD----LIYQMGTEMARQFKELGMHINFAPVVDVNSNPDNPVIGYR 188 Query: 134 SYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKD 193 ++ + + A ++ G+ + G+ KHFPGHG DSH P P+ I D Sbjct: 189 AFGENKKLVAQRAVSYMKGLQDHGVIANAKHFPGHGDTEVDSHYSLPVIKNPEQRIWDVD 248 Query: 194 MSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWLKTVLRQELGFDGVIFSDDL 250 + + L REN L ++M AH+ +D R S S + +L+ + F G+IF+D L Sbjct: 249 LYPYQELFREN-LMSVMVAHLNIPSLDSRADMATSLSKKVVTDLLQNRMNFQGLIFTDAL 307 Query: 251 SMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFS 310 +M G A S E +L AG D++L + A ++ + G S Sbjct: 308 NMRGVANTNSPGEVDLKALLAGNDILLYSQDVPKAKAM-----------IKSAVEDGVIS 356 Query: 311 RQELMDSARWKAISTR 326 E+ D K + + Sbjct: 357 ESEI-DRRVKKVLKAK 371 >UniRef50_B3QV76 Glycoside hydrolase family 3 domain protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QV76_CHLT3 Length = 639 Score = 305 bits (782), Expect = 1e-81, Method: Composition-based stats. Identities = 81/323 (25%), Positives = 143/323 (44%), Gaps = 25/323 (7%) Query: 15 AEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREG 74 E E++ + LVGG++ D +L L + ++ ++ L+++ D E G R + Sbjct: 101 QELSELVKNDLVGGIMFSRG---DVYELAMLANRYQSLAKYPLLISADMEWGVAMRVKR- 156 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGE 132 T P AA A E G +A E A+ I ++AP +D+ + + I Sbjct: 157 TTEFPNNMGVAATWNP----MYAYEMGRAIAEEARALGIHQNYAPAVDLNNNPENPVINT 212 Query: 133 RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAK 192 R++ + Q A+A F+ G A + T KHFPGHG DSH++ P P + + + Sbjct: 213 RAFSENVQLTNAMAKAFVMGSQSARVIATAKHFPGHGDTEIDSHRDLPVLPFTRNRLDSL 272 Query: 193 DMSVFSSLIRENKLDAIMPAHVIYSDV---DPRPASGSPYWLKTVLRQELGFDGVIFSDD 249 + F I N + ++M H+ ++ + PA+ SP + +LR+E FDG+I +D Sbjct: 273 EFRPFKYAIT-NGVMSVMVGHLALPNLVSSEQVPATLSPGIISRILRKEFAFDGLIVTDA 331 Query: 250 LSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSF 309 ++M G S E ++ AG D++L+ + + E + + K Sbjct: 332 MTMYGIRKNYSVGEAAVKAVLAGNDILLMPPD-----------VAVAHEAIVKAVEKADI 380 Query: 310 SRQELMDSARWKAISTRLNQLHE 332 + +S R I +LH Sbjct: 381 PLSHIDESVRRILIVKEWLRLHR 403 >UniRef50_D0KBJ7 Glycoside hydrolase family 3 domain protein n=6 Tax=Gammaproteobacteria RepID=D0KBJ7_PECWW Length = 598 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 96/343 (27%), Positives = 149/343 (43%), Gaps = 22/343 (6%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA---ASRNRLVVAVDQEGGR 67 D +++ +GG+ILF N D Q+++L A RL ++ D EGG Sbjct: 38 TIPDDAIGKLITDHHIGGVILFANNLKDKLQIKKLTAWYTAMETCGGVRLFISTDNEGGN 97 Query: 68 VQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI 126 V R G + P + +A ++A E G LMA +M+ I+ +FAPV+DV Sbjct: 98 VFRLPRGDYASFPGNMALSAAIEGGADIQIASEQGRLMAQDMLFTHINTNFAPVVDVNTN 157 Query: 127 --SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + I R++ D +A + GM + G+ T KHFPGHG+ + DSH P R Sbjct: 158 PFNPVINVRAFSDDADTVYRVAEKMTAGMKQQGLITAYKHFPGHGSTSTDSHTGLPRVDR 217 Query: 185 PQAEIRAKDMSVFSSLIR-ENKLDAIMPAHVIYSDVDP------------RPASGSPYWL 231 + + A D+ + I + D +M AH+ Y +D PA+ S Sbjct: 218 SRKDAFAIDIMPYKRAIDSQTVPDMVMTAHIQYPALDNSQVTTRNGEKITVPATMSYAIQ 277 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR ELG+ GV SD L M A S E + AG D+ L+ + + Sbjct: 278 TDILRGELGYAGVTISDALDMGAIADHFSQEEAVRQVFTAGVDIALMPISISSPDQI--K 335 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHER 333 L P + + + KG+ E+ S R + R + L E+ Sbjct: 336 LLPDLIQHIVDMVKKGNIREAEIDASVERILTLKARYHLLCEK 378 >UniRef50_B3EU27 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3EU27_AMOA5 Length = 1007 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 71/280 (25%), Positives = 133/280 (47%), Gaps = 14/280 (5%) Query: 17 EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFT 76 ++ +GGLI F DP +L Q + ++ L++A+D E G R + Sbjct: 66 IENLIQRYNIGGLIFFQG---DPISQAKLTNQYQLKAKTPLLLAIDAEWGLGMRLTNTIS 122 Query: 77 RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERS 134 P + A+ + +L + G +A ++ + I ++FAPV+D+ + + IG R+ Sbjct: 123 -YPRQMTLGAI----QDHQLIYDMGAEIARQLKLLGIHVNFAPVIDINNNPDNPGIGNRA 177 Query: 135 YHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDM 194 + ++ +I G+ + G+ KHFPG G + D H E P P + + ++ Sbjct: 178 FGDGKGSVISKGLAYIQGLQDNGILAVAKHFPGIGDASKDPHHELPTIPYDITRLESIEL 237 Query: 195 SVFSSLIRENKLDAIMPAHVIYSDVDPRP---ASGSPYWLKTVLRQELGFDGVIFSDDLS 251 F IR N + IM +H+ + P AS S + + +L+ +LGF G+IF+D L+ Sbjct: 238 YPFRKAIRAN-VGGIMVSHIYLPAYEKTPNRAASLSSHIVTQLLKNKLGFKGLIFTDALN 296 Query: 252 MEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 M+ + E +L AG D++L + A++++ + Sbjct: 297 MKAVSKYYQPGEVDLLALQAGNDILLFPEDVPKAIALIKS 336 >UniRef50_A8SRF1 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SRF1_9FIRM Length = 416 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 87/330 (26%), Positives = 151/330 (45%), Gaps = 24/330 (7%) Query: 7 DVEGYE----LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 + G + ++ L VGG+I F N+ + Q R+++ ++ S+ L +A D Sbjct: 100 GLTGTDDVMVAGDATKQALQAYPVGGIIYFANNFDNVDQTRDMISNTQSYSKIGLFIATD 159 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSG-MEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 +EGGRV R + + ++G A E +AS+M A+ ++ FAPV Sbjct: 160 EEGGRVNRIMDTL----GTTYIGPMYDYKDDGPDAAFENAKTIASDMSALGFNLDFAPVA 215 Query: 122 DV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV + IG+R+Y D +A + + G + G+ T KHFPGHG DSH + Sbjct: 216 DVWSNPDNTVIGDRAYSDDYAQAAELVGSAVKGFQDGGVICTLKHFPGHGDTAEDSHYSS 275 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQEL 239 R +A+I A +M F + I + + +M H+I D+D P++ S +LR ++ Sbjct: 276 AYVNRTKAQIMADEMQPFRTGI-DAGAELVMVGHLIVPDIDELPSTLSYKIATEMLRGDM 334 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 FDGV +D + ME A A+ ++ AG DM+L + AV+ Sbjct: 335 KFDGVAITDSMEMESIADNYGVADSAVMAVQAGMDMLLQPADLAVAVNA----------- 383 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + G + + +S R + ++ + + Sbjct: 384 IVTAVQNGDITEPRIDESVR-RILTLKAER 412 >UniRef50_A9F261 Put. Beta-glucosidase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F261_SORC5 Length = 371 Score = 304 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 108/331 (32%), Positives = 163/331 (49%), Gaps = 18/331 (5%) Query: 2 GPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN--RLV 58 G +++ +G E A + LA GG ILF RN D A L + + A Sbjct: 12 GQLIVGGFDGAEPPARYLKALAEGRRGGAILFRRNVPDVAATARLCQALAEAGGPDRPPF 71 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 + +DQEGGRV R F LP+ +S L+ + L + A +ASE+ A+ I+++FA Sbjct: 72 IGIDQEGGRVTRLPAPFLTLPSMRSLGELADLP----LLRRAARAVASELRAVGINLNFA 127 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLDV + IG+R++ DP A F++G+ E G+ GKHFPGHG DSH Sbjct: 128 PVLDVDSNPANPIIGDRAFGRDPGAVARGAVAFLEGLQEEGVLACGKHFPGHGDTALDSH 187 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVL 235 P + + ++ F + + ++M AH++ +DP PA+ S +L Sbjct: 188 LALPTLSHARGRLDEIELPPFRAACGAG-VASLMTAHIVVEALDPGVPATLSRAICTDLL 246 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R E+GF+GV+FSDDL M A + E A++ AGCD +LVC++ V + L Sbjct: 247 RGEIGFEGVLFSDDLEMAAIAAHNAVEEAAAAAVWAGCDALLVCHSEDLQDRVHEALVR- 305 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTR 326 KAER R + + +AR + R Sbjct: 306 KAEREPRFRERCA------EAAARCLGLRRR 330 >UniRef50_A1ZJ01 Beta-N-acetylglucosaminidase n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZJ01_9SPHI Length = 994 Score = 304 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 76/320 (23%), Positives = 142/320 (44%), Gaps = 20/320 (6%) Query: 1 MGPVMLDVEGYELDAEER----EILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + L + +++ VGG+I H QL + ++ Sbjct: 60 LGQLFMVAANTTLIDTHKVSVAKLIRQQHVGGVIFMRGYAH--KQLT-WTNYFQKQAKTP 116 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L+VA+D E G R + + P + A+ + L + G +A +M + I+ Sbjct: 117 LMVAMDAEWGLGMRLKNMMS-FPFQMTLGAI----QEDTLIYQMGAQIAQQMKRLGTHIN 171 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+DV + IG RS+ + + ++ G+ + G++ KHFPGHG D Sbjct: 172 FAPVVDVNINPKNPVIGYRSFSENRETVARKGVAYMKGLQDNGIQANAKHFPGHGDTGTD 231 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWL 231 SH P + + ++ F +LI+ + ++M AH+ D RP + S + Sbjct: 232 SHFALPVIKHDTNRLDSIELYPFKALIKHG-VKSMMVAHLNLPAYDSAQNRPTTLSKNVV 290 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L+++LG+ G+IF+D L+M+ E +L AG D++L N K A+ V+ Sbjct: 291 TKLLKEKLGYQGLIFTDALNMKAVTNFYKPGEVDLQALIAGNDVLLAPENVKKAIEVIQ- 349 Query: 292 LSPIKAERVTRLYHKGSFSR 311 I +++++ + Sbjct: 350 -KGILQKKISQTAIDQKVKK 368 >UniRef50_B6GD75 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6GD75_9ACTN Length = 806 Score = 304 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 96/357 (26%), Positives = 158/357 (44%), Gaps = 38/357 (10%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----------RNRLVVA 60 L+ E ++ +GG+ILF N + Q L ++ A+ L++ Sbjct: 90 TVLNEEIASVIDSFDLGGVILFANNVKETEQTLRLAMDMQQAALRNTAQAPAGDIPLLLT 149 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +DQEGG V R G T LP + A E+ A + G ++ E+ A+ I+++FAP Sbjct: 150 IDQEGGIVYRLGSG-TALPGNMALGATRSEED----ALQCGQVIGRELEALKINVNFAPS 204 Query: 121 LDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG RSY +DP+ + + + GM E G+ T KHFPGHG DSH Sbjct: 205 FDVNNNPNNPVIGLRSYSSDPELVAQLGTAMMSGMQEHGIATAAKHFPGHGDAGTDSHTG 264 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--------------PRPA 224 P + + E+ + F S I E +D +M AH+ + V+ PA Sbjct: 265 LPRIEKTKEELDNLEFIPFRSAI-EKGVDMVMTAHIQFPKVETAKVPSADPSMGEIELPA 323 Query: 225 SGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKG 284 + SP ++ +LR E+GF GV +D L+M+ + Y + + + AG D+ L+ + Sbjct: 324 TLSPVFMTDILRTEMGFTGVSITDALNMDAISKNFGYIDAVKRTFKAGVDIALMPVTLRS 383 Query: 285 AVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAGH 341 V P E V L + + L S R + + + + +++ H Sbjct: 384 QQDV-----PKLKELVDALEQDAELTDERLNTSVR-RILELKKRRGILAYKDTAGSH 434 >UniRef50_Q3A4Z0 Putative glycosyl hydrolase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A4Z0_PELCD Length = 382 Score = 304 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 106/352 (30%), Positives = 174/352 (49%), Gaps = 31/352 (8%) Query: 4 VMLDVEGYELDAE--EREILAHPLVGGLILF---------TRNYHDPAQLRELVRQIRAA 52 +M+ G +LD E + +GG++LF N P QLR L Q++ A Sbjct: 45 LMVGFAGADLDKHHPILEDIKQRHLGGVVLFNYGPDRNHPAGNIRSPQQLRRLTAQLQQA 104 Query: 53 SRN-RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAM 111 + N L++A+DQEGGR+ R ++ + P S+A+L G L + A + Sbjct: 105 AGNTPLLIAIDQEGGRINRLKQAYG-FPPTVSYASL-GRRNDLALTRRQAQNTARTLQQA 162 Query: 112 DIDISFAPVLD--VGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPG 167 I+++ +PV+D V + IG RS+ A+P ++ A I H+ G+ KHFPG Sbjct: 163 GINLNLSPVVDLNVNPANPIIGRLQRSFSAEPGTVISHAREVIRAHHQQGVLCCLKHFPG 222 Query: 168 HGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASG 226 HG+ TADSHK +++ F++LI E DA++ AHV S++DPR PA+ Sbjct: 223 HGSSTADSHKGFVDVT---NTWSPRELLPFATLIDEGLADAVLTAHVFNSELDPRYPATL 279 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 S + VLR++LGF GV+ SDDL+M A + + +L+AG D++L+ +N Sbjct: 280 SERTVHGVLRKQLGFQGVVISDDLTMGAIADQYRLEDAVEKALNAGVDILLLADNSPDTT 339 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 S + + + +L G +R+ ++ + K I + L Q + Sbjct: 340 SRMIAI-------MQKLIDSGRVTRKRIVQA--LKRIDDLKSHLRSNIQRQP 382 >UniRef50_A5FAE4 Candidate beta-glycosidase; Glycoside hydrolase family 3 n=3 Tax=Flavobacteriaceae RepID=A5FAE4_FLAJ1 Length = 997 Score = 304 bits (779), Expect = 3e-81, Method: Composition-based stats. Identities = 83/327 (25%), Positives = 148/327 (45%), Gaps = 26/327 (7%) Query: 16 EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGF 75 + +++ VG +I F P + +L ++ ++ L V +D E G R + Sbjct: 97 QVSKLIEEYKVGSVIFFQGG---PVRQAKLTNLYQSKAKVPLFVGIDAEWGLAMRLDSTY 153 Query: 76 TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGER 133 R P + A+ + KL ++ G MA+E + + +FAPVLD+ + IG R Sbjct: 154 -RYPWNMTLGAIQDL----KLIEKVGRNMANENKRIGVHFNFAPVLDINTNPKNPIIGNR 208 Query: 134 SYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKD 193 S+ D A ++G+ G+ TGKHFPGHG DSH P + + + Sbjct: 209 SFGEDKVNVSEKAIALMNGIQGNGVFCTGKHFPGHGDTATDSHHALPTVNFSKERLDLVE 268 Query: 194 MSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLRQELGFDGVIFSDDL 250 + + + E + ++M AH+ ++P P+S S + +L++ELGFDG+IF+D L Sbjct: 269 LYPYKRMFDEGLV-SVMVAHLNIPSLEPIPNLPSSASYNVVTNLLQKELGFDGLIFTDGL 327 Query: 251 SMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFS 310 +M+GAA + A L AG D++L N P+ +++ Y+ + Sbjct: 328 AMKGAANFKGPGDLEIAVLLAGNDILLCPEN-----------VPVAVQKLEEAYNNNIIT 376 Query: 311 RQELMDSARWKAISTRLNQLHERWQEE 337 + L S + K + + ++ Sbjct: 377 EERLAHSVK-KILHYKYKAGLNHYKPI 402 >UniRef50_Q31RD3 Beta-glucosidase-related glycosidases-like n=5 Tax=Cyanobacteria RepID=Q31RD3_SYNE7 Length = 542 Score = 304 bits (778), Expect = 5e-81, Method: Composition-based stats. Identities = 86/296 (29%), Positives = 150/296 (50%), Gaps = 19/296 (6%) Query: 38 DPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGK 95 + A++ +Q++ SR L++A D E G QRF G + P + +A++ E + Sbjct: 65 NAAEVALRCQQLQEWSRVPLLIAADIEEGVGQRF-AGAVQFPPPLALSAIAAQDPERAQQ 123 Query: 96 LAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGM 153 LA+ G + A+E IA+ ++ AP++D+ + + I R++ DP+ + + FI G Sbjct: 124 LARSFGEITATEAIAIRLNWVLAPIVDINNNPANPVINVRAFGEDPETVSQLTTAFIAGA 183 Query: 154 HEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAH 213 + + TT KHFPGHG DSH E P P ++ + A ++ F I +DA+M AH Sbjct: 184 QQFPVLTTAKHFPGHGDTATDSHLELPQIPHDRSRLEAVELEPFRRAI-AAGVDAVMTAH 242 Query: 214 VIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAG 272 + +DP PA+ SP L+ +LR+ELGF+G+I +D L M+ A E + + +AG Sbjct: 243 LSIPALDPDYPATLSPAVLQGLLRRELGFEGLIVTDALVMQAIAAHYGPGEAARLAFEAG 302 Query: 273 CDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRL 327 D++L+ + + A+ + + +G + L S AR ++ Sbjct: 303 ADILLMPVDPEAAIQ-----------EIAKAIREGQITYDRLEQSLARIWRAKQKV 347 >UniRef50_A1ZDA7 Glycosyl hydrolase, family 3 n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZDA7_9SPHI Length = 383 Score = 303 bits (777), Expect = 5e-81, Method: Composition-based stats. Identities = 89/347 (25%), Positives = 160/347 (46%), Gaps = 33/347 (9%) Query: 4 VMLDVEGYELD--AEEREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAA 52 +M+ G + + + + +GG+IL++R N +QL+ L+ ++ Sbjct: 49 LMVGFRGTSIKGKSHIKRDIRKHHLGGVILYSRDLLMGAKPRNIVSKSQLKTLITDLKKL 108 Query: 53 SRNRLVVAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 S L+VAVD+EGG+V R + GF Q ++ + ++ +A ++ A Sbjct: 109 SDVPLLVAVDEEGGKVSRLNAKFGFATTQTPQEIGEINDL----VATEKWATHIAQKVKA 164 Query: 111 MDIDISFAPV--LDVGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 M +++FAPV L+V + IG RS+ D + A +F + + G+ KHFP Sbjct: 165 MGFNVNFAPVTDLNVNPKAPIIGKLGRSFSGDVNTLVQHALKFTEVHQKHGIVCAIKHFP 224 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PAS 225 GHG+ DSH + ++ F +I + +M AHV +D + PA+ Sbjct: 225 GHGSAIMDSHLGFTDVTTT---WKPIELKPFKQMIEQGFDGMVMTAHVFNKKLDAKYPAT 281 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S + LR++ G+ G+I SDD+ M A E + S++AG D++L NN G Sbjct: 282 LSKKIMNDYLRKKWGWQGIIISDDMQMNAIAKNFGIEEALEKSINAGVDIVLFSNN--GR 339 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 + N+ P + +L +G SR+ + +S R+ ++ + Sbjct: 340 IFYNKNIVPEAINIIKKLIKQGKISRKRIDES------YQRIKKMKQ 380 >UniRef50_Q30Y91 Beta-N-acetylhexosaminidase n=3 Tax=Desulfovibrionales RepID=Q30Y91_DESDG Length = 398 Score = 302 bits (775), Expect = 1e-80, Method: Composition-based stats. Identities = 103/355 (29%), Positives = 157/355 (44%), Gaps = 30/355 (8%) Query: 4 VMLDVEGYELD--AEEREILAHPLVGGLILF---------TRNYHDPAQLRELVRQIRAA 52 +++ G L + H +GG+ILF RN QL L ++ A Sbjct: 50 LLVGFRGESLSGADSVVRDVTHYNLGGVILFDRDVALGSDRRNIRSVRQLAALTAALQKA 109 Query: 53 SRNRLVVAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 ++ L VAVDQEGGRV R R GF LP+AQ A + AG + + A Sbjct: 110 AQTPLFVAVDQEGGRVARLSPRHGFAALPSAQETGAAATTAASAD----AGSRAGAMLHA 165 Query: 111 MDIDISFAPVLDVG--HISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 + ++++FAPV+DV + IG RS+ AD + A F+ GM G+ KHFP Sbjct: 166 VGVNLNFAPVVDVNVRPDNPVIGRAGRSFSADSEAVARHAQAFLQGMASHGVLGCVKHFP 225 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PAS 225 GHG+ ADSH E ++ + L+ + IM H+ S +DP PA+ Sbjct: 226 GHGSSAADSHLGVADISASWNE---NELVPYRRLLDGGHVQMIMTGHLFNSRLDPDHPAT 282 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S + +LR LGF GV+ SDD+ M+ + +L+AG D++L NN Sbjct: 283 LSRAVITGLLRGRLGFGGVVVSDDMQMKAITDRYGLEQAVALALNAGVDILLFGNNLTYD 342 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHERWQEEKA 339 ++ P + + L +G R + +S AR + L R + + Sbjct: 343 ADIV----PEVVDMIESLVERGVVPRSRIEESFARVLRLKESAGLLPARVRTVQE 393 >UniRef50_A1VC58 Glycoside hydrolase, family 3 domain protein n=3 Tax=Desulfovibrio vulgaris RepID=A1VC58_DESVV Length = 481 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 101/344 (29%), Positives = 164/344 (47%), Gaps = 31/344 (9%) Query: 4 VMLDVEGYELDAE--EREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAA 52 +ML G E +A VGG+ILF R N QL L ++AA Sbjct: 64 LMLGFRGAEPADAGPILRSIAAGHVGGVILFDRDMDPSVRVRNIVSKEQLSRLTGALQAA 123 Query: 53 SRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 + L +AVDQEGGRV+R + GF P+A S S + + +A+EM Sbjct: 124 APVPLFIAVDQEGGRVRRLKPEYGFFAYPSAASLGKGSPED-----TRRMASTLATEMAE 178 Query: 111 MDIDISFAPVLD--VGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 + +++ F PV+D V + I ERSY +DP + + A+ F++G+ G+ + KHFP Sbjct: 179 VGLNVDFGPVVDLAVNPSNPVIARLERSYGSDPCRVASHAAAFVNGLAWRGVVASLKHFP 238 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPAS 225 GHG+ DSH + R +++ ++ L+R++ +M H+ + +D PA+ Sbjct: 239 GHGSSLQDSHLGVTDIS---STWRREELGPYALLLRDDWAGMVMVGHLYNNRIDAAHPAT 295 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S + +LR++LG+ GV+ +DDL M S E + +++AG D++L NN Sbjct: 296 LSQRTIDGLLRRDLGWKGVVVTDDLQMGAITARYSLDETVRLAVEAGADILLFGNNLVWD 355 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLN 328 + L+ + RL +G S Q L S R + + L Sbjct: 356 ----EGLAEKVHATLVRLVREGKVSEQRLRQSWERIMRLKSVLT 395 >UniRef50_Q1IZF6 Beta-N-acetylhexosaminidase n=2 Tax=Deinococcus RepID=Q1IZF6_DEIGD Length = 478 Score = 302 bits (773), Expect = 1e-80, Method: Composition-based stats. Identities = 86/310 (27%), Positives = 145/310 (46%), Gaps = 10/310 (3%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+D+ G LDA+ L + + LF +N + AQLR+L +R ++A+D E Sbjct: 1 MVDIPGKALDADTASHLRRHGIRAVCLFGKNVQNAAQLRQLCADLREVMGEAALIALDHE 60 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG + R + P+A S A + L ++ +A ++ ++ ++ +FAPVLDV Sbjct: 61 GGAILR-PTFWPFAPSAMSLGAADNV----ALTEDVNAALARQLRSVGVNWNFAPVLDVN 115 Query: 125 HI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + IGER+Y AD + + + + G + KHFPGHG DSH P Sbjct: 116 VNPANPVIGERAYGADVDRVIRHGTAALLGHTRERVAGCVKHFPGHGDTHLDSHYALPRV 175 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGF 241 + +AE+ A + + F + + A+M AH++Y +DP PA+ S L +LR E G+ Sbjct: 176 SKSRAELDAVEFAPFRACLP--LAPAVMTAHIVYEALDPCSPATLSRTVLTELLRGEWGY 233 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 GV +D + M+ E +L AG D+++ R+ L ++ + Sbjct: 234 QGVTVTDSMGMQAIDAHYGRGEAAVLALRAGADLVMALGRREAQDVTLVAIARALNSELD 293 Query: 302 RLYHKGSFSR 311 S +R Sbjct: 294 AREVAASLAR 303 >UniRef50_C5T2K1 Beta-N-acetylhexosaminidase n=1 Tax=Acidovorax delafieldii 2AN RepID=C5T2K1_ACIDE Length = 396 Score = 302 bits (773), Expect = 2e-80, Method: Composition-based stats. Identities = 141/389 (36%), Positives = 201/389 (51%), Gaps = 59/389 (15%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 P++LDV G L ++R LAHPL GG+ILF RN+ + AQL +L I+A R+ L++ VD Sbjct: 6 PLILDVAGTALSPDDRRRLAHPLTGGVILFARNWENRAQLLQLTSSIKAV-RDDLLICVD 64 Query: 63 QEGGRVQRFR-EGFTRLPAAQSFAALS-------------GMEEGGKLAQEAGWLMASEM 108 EGGRVQRFR +GFT LP ++F + G A AG+++ +E+ Sbjct: 65 HEGGRVQRFRTDGFTHLPPMRAFGEMWLDDGKGARAVPGSGALRATNAATAAGYVLGTEL 124 Query: 109 IAMDIDISFAPVLDVG----------------------------------HISAAIGERS 134 A +D SF PVLD+ S IG+R+ Sbjct: 125 RACGVDFSFTPVLDLDWPDDAPQAVGAARPGPAAHGAPRAGSTSRGQAATSRSGVIGDRA 184 Query: 135 YHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDM 194 +H+DP+ +A + G+ +AGM GKHFPGHG V ADSH E P D R I A D Sbjct: 185 FHSDPRVVALLAKSLMHGLLQAGMAHCGKHFPGHGHVKADSHTEVPVDRRSFKAILADDA 244 Query: 195 SVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEG 254 + ++ L + L ++MPAHVIY VD RPA S WL+ +LR+++ FDG IFSDDLSM G Sbjct: 245 APYAWL--SSTLTSVMPAHVIYPRVDSRPAGFSRRWLQDILRRQMHFDGAIFSDDLSMAG 302 Query: 255 AAIMGS----YAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFS 310 A + Y + A+L+AGCD++L+CN G S +D L + +T +G + Sbjct: 303 ARRIDGEVVRYTDAAIAALEAGCDLVLLCNQSLGGGSAVDEL----LDGLTEAQARGRWQ 358 Query: 311 RQELMDSARWKAISTRLNQLHERWQEEKA 339 ++ R + L Q + + A Sbjct: 359 SSVDSEARRLALLPQTLPQPWDELMYQPA 387 >UniRef50_A9BFQ0 Glycoside hydrolase family 3 domain protein n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BFQ0_PETMO Length = 528 Score = 302 bits (773), Expect = 2e-80, Method: Composition-based stats. Identities = 94/334 (28%), Positives = 165/334 (49%), Gaps = 21/334 (6%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 ++ ++G L+ E ++L L G +I F+RN + QL + + I+ + +DQE Sbjct: 13 LIGIQGTSLNNENMKVLKSTLPGVIIFFSRNIENKYQLSKFIEDIKNFLDYEPLFCIDQE 72 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG V R ++GFT +P+A A + E A + L+A EM+A+ ID + APV+D+ Sbjct: 73 GGTVARLKKGFTVVPSAMGITATNEAEN----AYLSANLLAKEMLAVGIDWNLAPVVDIN 128 Query: 125 HI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + ++ IG RS+ D L A ++ G+H+ G+ + KHFPG G V D H + P Sbjct: 129 NNPKNSVIGIRSFSDDKNVVLKFAREYVKGLHDGGVLSCLKHFPGIGNVNTDPHLDLPQG 188 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQELG 240 + ++ + ++ F ++ + MP HV S + + PAS S L + R+EL Sbjct: 189 ELSKDDLLSTELFPFLNI----DSPSWMPTHVYLSKIQNKKEPASLSEEILTGLAREELK 244 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 + GV+ +DD M G A S E SL+AG D++ +C++ + + + V Sbjct: 245 YKGVLVADDFEMGGVANFYSAQEAVIKSLNAGMDIVSICHSFEKQSAAKNA--------V 296 Query: 301 TRLYHKGSFSRQELMDSARWKAISTRLNQ-LHER 333 R Y ++++ S +++ L E+ Sbjct: 297 LREYKNNENFKKKINSSLERIQSLMKISADLREK 330 >UniRef50_B2UPP0 Beta-N-acetylhexosaminidase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPP0_AKKM8 Length = 353 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 103/322 (31%), Positives = 161/322 (50%), Gaps = 15/322 (4%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M P ++ + G+E+ AEE + G ILF+RN Q+R L +R + V+A Sbjct: 1 MLPALIGISGHEVGAEEEAAIRRLQPAGFILFSRNIDSVEQVRGLTESLRKLCLHHPVIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRV R LP+ S A L + +L G + A + + ++++FAPV Sbjct: 61 VDQEGGRVVRTASLGLNLPSPASLARLGSVGGIVEL----GAVTALALRYLGVNLNFAPV 116 Query: 121 LDVGHISA---AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 LD+ H + A+ R + + Q ++ + + G+++ GKHFPG G AD H Sbjct: 117 LDICHDPSAANALPGRCWGDNAQDVISRGGVYASNLRRGGVQSCGKHFPGMGRALADPHF 176 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 P + E+ D+ F +L L +IM AH++ +DP PA+ S ++ +LR Sbjct: 177 SLPVIGLDERELFKTDLLPFLALCPA--LSSIMSAHIMLPQIDPDYPATLSERVIRGLLR 234 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 LGF GV+F+DDL M S + SL AGCD+ L+C++ + LD L+ + Sbjct: 235 DRLGFRGVVFTDDLCMGAITTQYSPDDAAFLSLKAGCDLPLICHDP---LPWLDGLAS-R 290 Query: 297 AERVTRLYHKGSFSR-QELMDS 317 E + SF R ++L DS Sbjct: 291 QESLNAYDRWDSFKRVEKLSDS 312 >UniRef50_A2DGR7 Glycosyl hydrolase family 3 N terminal domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2DGR7_TRIVA Length = 553 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 86/332 (25%), Positives = 151/332 (45%), Gaps = 18/332 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRLVVA 60 ++D+ G L ++ IL +G IL RN + + ++ S ++ Sbjct: 59 LMDLSGLTLTDSDKAILKKYKLGNFILMGRNLGTIEETAAYSKSLQEYSMKEFGIPAFIS 118 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +DQE G + R + T P A S AA ++ +A+ LMA+E+ A ++++ AP Sbjct: 119 IDQEAGSILRMYKNCTVFPGAMSLAATDSLD----IAKNISKLMATELRAAGVNMNHAPT 174 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG+RS+ +P+ +I G HEAG+ TT KHFPGHGA + DSHK Sbjct: 175 SDVNSNPKNPIIGDRSFGDNPENVSKFVDYYIQGHHEAGILTTSKHFPGHGATSVDSHKA 234 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P ++ D+ F ++ + +D+IM H+I +D A R + Sbjct: 235 LPSIDHNMTQLLEIDIPPFQQAVK-SGVDSIMVGHLITP-IDNESACTQSKKCVDFARNQ 292 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 L +DG++ +D + M+ E ++ +G D+I C ++ E Sbjct: 293 LKYDGILITDSMQMKAIKQE-DLNESIATAIKSGIDLICDCGGDIVGTDPYYSVIAYIVE 351 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 +V + + ++++D A + I T+L L Sbjct: 352 QVQK-----NIIPEKIIDQAALRIIKTKLAML 378 >UniRef50_C4Y752 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y752_CLAL4 Length = 985 Score = 301 bits (771), Expect = 3e-80, Method: Composition-based stats. Identities = 88/352 (25%), Positives = 151/352 (42%), Gaps = 18/352 (5%) Query: 1 MGPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS------ 53 +G ++ G + E++ V +IL RN Q+ L+R ++ + Sbjct: 6 LGQLVCGGFLGTTATPQAYELIVKHRVSTMILSRRNAESAEQMAHLIRDLQRIALEHAHY 65 Query: 54 RNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAM 111 + L++A+D+EGG + + T+ P A + AA E L E A E+ + Sbjct: 66 KYPLLIAIDEEGGMLNTLFDSRDLTQFPGAMALAATGDTE----LVYEVARATARELRHI 121 Query: 112 DIDISFAPVLDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHG 169 + PVLDV +A + RS+ + + G+ + G+ T GKHFPG G Sbjct: 122 GFSVILGPVLDVVTKMSAQLVSVRSFGSSVEDVARYGRACARGLRDGGLITVGKHFPGIG 181 Query: 170 AVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGS 227 T DS E P ++R ++ F+ LI+E LD I A + D A S Sbjct: 182 NATVDSLLEVPIMAESLDQLRGSNVVPFAELIQEGLLDGISAAGCGVPAIAPDEVHACLS 241 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 P L +LR+EL FDG + S+ L M+ + ++ AGCD+++ C++ + V Sbjct: 242 PVLLTQLLREELHFDGFVISECLEMDALYHSIGLGQGAVLAVCAGCDLVMCCHDFERQVE 301 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 +D+L+ A + + S Q + R A + ++E + Sbjct: 302 AIDSLAKALANGMLDERI-IAASMQRIETVQRRVASWAEIFSATGPFREAQP 352 >UniRef50_A6LEE9 Glycoside hydrolase family 3, candidate beta-glycosidase n=6 Tax=Bacteroidales RepID=A6LEE9_PARD8 Length = 999 Score = 300 bits (770), Expect = 3e-80, Method: Composition-based stats. Identities = 96/354 (27%), Positives = 159/354 (44%), Gaps = 27/354 (7%) Query: 1 MGPVMLDVEGYELDAEEREILAHP----LVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + + + D + L +GG ILF + DP E+ +++ ASR Sbjct: 51 IGQLFMVIANPKSDNRNMQRLMRYVNDIKIGG-ILFHK--GDPVTQAEVTNRLQKASRIP 107 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 ++V++D E G R G TR P A+ E L +E G + + M I I+ Sbjct: 108 MLVSLDGEWGLSMRL-SGTTRFPKNMMLGAI----EDNALIEEYGKEVGRQCREMGIHIN 162 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAP +DV + IG RS+ +P+ + G+ G+ + KHFPGHG + D Sbjct: 163 FAPDMDVNSNVDNPVIGLRSFGENPEAVSEKGIAYARGLENTGILSVSKHFPGHGDTSED 222 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLK 232 SH+ P +A + + ++ F I + IM H+ +D +PAS S + Sbjct: 223 SHETLPVVRHNRARLDSVELLPFKRYIYDG-FGGIMTGHLYVPALDKSHKPASFSKAVVT 281 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC----NNRKGAVSV 288 +L++ELGF G+ F+D L+M+GA+ + +L AG D++L N+ Sbjct: 282 DLLQKELGFQGLCFTDALAMKGASTKKTDNPSV-KALLAGNDILLAPAAPINDFTAVKEA 340 Query: 289 LDN----LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 ++ L I+A+ + L +K + K +S RLN H W K Sbjct: 341 IEEGVLDLEAIEAKCLKILRYK-YIAGLNAYKPVETKGLSKRLNSPHAAWMAAK 393 >UniRef50_C6W3E7 Beta-lactamase n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W3E7_DYAFD Length = 992 Score = 300 bits (770), Expect = 4e-80, Method: Composition-based stats. Identities = 81/315 (25%), Positives = 143/315 (45%), Gaps = 25/315 (7%) Query: 17 EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFT 76 +++ VGG++ F P L ++ S+ L+VA+D E G R Sbjct: 82 VEKLIRENKVGGIVFFQGG---PMPQARLTNYYQSISKVPLLVAMDAEFGLAMRIDS-TV 137 Query: 77 RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERS 134 R P + A+ G L E G +A + + + I+FAPV DV + + I RS Sbjct: 138 RYPYQMTLGAIQG---NNDLIYEMGAQLAKQARRLGMHINFAPVADVNNNPDNPVISFRS 194 Query: 135 YHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDM 194 + + K A ++ GM + G+ T+ KHFPGHG DSH + P P I + ++ Sbjct: 195 FGENKYKVAEKAVAYMRGMQDNGLLTSAKHFPGHGDTGTDSHYDLPLIPHSAQRIDSLEL 254 Query: 195 SVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWLKTVLRQELGFDGVIFSDDLS 251 F +LI +N L +M AH+ +D P++ S + +LR +LGF+G+I+SD ++ Sbjct: 255 YPFRALI-DNGLSGMMIAHLSIPALDKTPNLPSTLSKPIISNLLRHQLGFEGLIYSDAMN 313 Query: 252 MEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSR 311 M+G + L+AG D++ + +++ + + +G ++ Sbjct: 314 MKGVTKYFPNGKADAMGLEAGMDLLEFTEDVNKSIA-----------EIKKSIAEGKITQ 362 Query: 312 QELMDSARWKAISTR 326 E+ R K + + Sbjct: 363 AEIDRRCR-KVLEAK 376 >UniRef50_B2RGN3 Glycosyl hydrolase family 3 n=2 Tax=Porphyromonas gingivalis RepID=B2RGN3_PORG3 Length = 1003 Score = 300 bits (768), Expect = 5e-80, Method: Composition-based stats. Identities = 86/327 (26%), Positives = 145/327 (44%), Gaps = 30/327 (9%) Query: 2 GPVMLDVEGYELDAE----EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL 57 G +++ V L E +++ +GG ILF + + + R+++ A+ L Sbjct: 70 GQLLMPVVYPSLQEEKVKQAEQLVRTCHIGG-ILFQKGTLS--EQYTMTRRLQEAAGTPL 126 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++A+D E G R ++ R P ++ +L G +A + M I I+F Sbjct: 127 LIALDGEWGLHMRLKDA-PRFPRNMGLG----HQKDNQLLYNYGREVARQCRLMGIHINF 181 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APVLDV + + IG RS+ +P++ + G+ + G+ KHFPGHG T DS Sbjct: 182 APVLDVNNNPKNPVIGTRSFGDNPRRVAERGIAYAQGLEDGGVMAVAKHFPGHGNTTEDS 241 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWLK 232 HK P + E+ ++ F R L +M AH+ ++ P+S S Sbjct: 242 HKTLPTVFASREELENTELFPFKEFFRAG-LSGVMTAHLNVPALEAKKNTPSSLSHAICT 300 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LRQE+GF G+IF+D L+M+G GS ++ AG D++L + S Sbjct: 301 DLLRQEMGFKGLIFTDGLAMQGVQTAGSQP-ISVRAILAGNDILLGPVDPVKTFS----- 354 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSAR 319 V + S++ L + R Sbjct: 355 ------EVLAAVEDRTISKELLDEKCR 375 >UniRef50_A1RBY1 Putative beta-N-Acetylglucosaminidase n=1 Tax=Arthrobacter aurescens TC1 RepID=A1RBY1_ARTAT Length = 681 Score = 300 bits (768), Expect = 6e-80, Method: Composition-based stats. Identities = 88/323 (27%), Positives = 143/323 (44%), Gaps = 29/323 (8%) Query: 18 REILAHPLVGGLILFT--RNYHDPAQLRELVRQIRAAS------RNRLVVAVDQEGGRVQ 69 E++ +GG++ F N ++P Q+ L ++ + L V +DQEGG V Sbjct: 95 AEVVEKYNLGGVLYFAWSGNTNNPQQVAGLSNGLQQTALREDGNGIPLAVTIDQEGGLVA 154 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 R T LP + A + LA+ G ++ SEM AM I++ FAPVLD+ + Sbjct: 155 RIGPPATVLPGNMALGATADAN----LAKAQGEILGSEMRAMGINVDFAPVLDLNSNPDN 210 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 IG RS DP A+ I+G+ + KHFPGHG + DSH P +A Sbjct: 211 PVIGIRSMGEDPALVSALGVAQIEGIQAHNVGAAAKHFPGHGDTSVDSHYGLPTVTYDRA 270 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIF 246 + + F + I + +D +M AH+I +D P + S L +LR E+GF G++ Sbjct: 271 TLNE-HLKPFKAAI-DGGVDMVMTAHIIVEAIDAEMPGTLSHKVLTGLLRDEMGFKGLVT 328 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHK 306 +D L M A E ++ AG D++L + + + V Sbjct: 329 TDALDMAAMAAEWPQEEIAVKAIQAGSDILLNSPDVDASFA-----------GVRAAVES 377 Query: 307 GSFSRQELMDSARWKAISTRLNQ 329 G + L +S R + + ++ + Sbjct: 378 GEITETRLDESVR-RILEWKVKR 399 >UniRef50_A9BCX1 Beta-glucosidase-related glycosidase n=2 Tax=Cyanobacteria RepID=A9BCX1_PROM4 Length = 543 Score = 300 bits (768), Expect = 6e-80, Method: Composition-based stats. Identities = 84/326 (25%), Positives = 141/326 (43%), Gaps = 23/326 (7%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF 71 EL + VGG+IL+ + ++ Q+R + + + D E G QRF Sbjct: 36 ELTNHRLQQFLEEGVGGVILYGGSI---EEITNRCAQLRMWAGKPIFLCADVEEGVGQRF 92 Query: 72 REGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 + G +P + + E LA+ G L+ E + ++ APV DV + Sbjct: 93 QGGTWLIP-PMALGRIYLKEPEYAISLAEHYGALIGYESVICGLNWVLAPVCDVNSNPLN 151 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 I R++ +PQ +A F G+ G+ KHFPGHG DSH E P + Sbjct: 152 PVINMRAWSDNPQTVADLACAFHRGLTSQGVLGCAKHFPGHGDTKVDSHLELPVLDNDLS 211 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELGFDGVIF 246 + ++ F +LI++ + +IM AH+I + VD P + S L +LR+++ F+G+I Sbjct: 212 RLAEIELPPFQALIQQG-VSSIMSAHLILNRVDCNYPVTFSKRILTDLLRKKMCFEGMIV 270 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHK 306 +D L M + S + +AG D+IL+ N A+ + Sbjct: 271 TDALVMRAISKTFSSGSAAVMAFEAGADLILMPQNPSEAIDA-----------IVESLIS 319 Query: 307 GSFSRQELMDSARWKAISTRLNQLHE 332 G L DS + + ++ L QL+ Sbjct: 320 GRLPISRLEDSLQRRQLA--LAQLNS 343 >UniRef50_C9RS08 Glycoside hydrolase family 3 domain protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RS08_FIBSS Length = 384 Score = 300 bits (768), Expect = 7e-80, Method: Composition-based stats. Identities = 94/330 (28%), Positives = 156/330 (47%), Gaps = 23/330 (6%) Query: 8 VEGYELDAEEREILA---HPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 + Y+L+ + +LA + +GG+IL+ N D QL + +IR + + L+ A+D+E Sbjct: 70 LPNYKLEQVNKTMLAVNKNYPIGGMILYAHNIVDETQLGNFIAEIRELNGSPLL-AIDEE 128 Query: 65 GGRVQRFREGFT-RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 GGRV R +P +S AA++ ++ + A +A + + + + DI +APV DV Sbjct: 129 GGRVARIANNENFDVPKYESMAAIAESDDPEE-AYKAAFTIGTYVKKYGFDIDYAPVADV 187 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG R++ DP+ A +++G+ AG+ T KHFPGHG V+ D+H Sbjct: 188 NTNPDNIVIGPRAFSDDPETAAKFVVSYLNGLESAGVIGTLKHFPGHGDVSTDTHYGYAS 247 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQEL 239 + E+ +M F + I+ IM AH+ V D PA+ S L+ LR EL Sbjct: 248 TDKTWDEMLKCEMIPFKAGIKAG-AQMIMTAHIAAPKVSGDDLPATLSSVILQDKLRGEL 306 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 GFDGVI +D + M S AE S+ AG D++L Sbjct: 307 GFDGVIVTDAMDMGAITKQFSNAEAAIKSIQAGVDVVLCSREFTQVFDA----------- 355 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 V + KG + +S + + + + ++ Sbjct: 356 VVKAVKKGEIKESRIDESVK-RILKLKKSK 384 >UniRef50_Q6MA50 Putative beta-N-acetylglucosaminidase n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MA50_PARUW Length = 550 Score = 299 bits (767), Expect = 7e-80, Method: Composition-based stats. Identities = 94/342 (27%), Positives = 165/342 (48%), Gaps = 20/342 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAAS-----RNR 56 +++ G + + E +++ VGG+I + + + P Q++ L ++AA+ + Sbjct: 49 LIVHFRGEDCNKEALQMIYEAHVGGVIYYPWSNGLNSPLQIQHLSAHLQAAAFQKKNKIP 108 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L+++VDQEGG V R +G T P + +LA E A+E++AM I+++ Sbjct: 109 LLISVDQEGGVVNRLTKGMTLFPGNYALGQTKDP----QLAFEYASASANELLAMGINMN 164 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 APV+DV + IG RS+ P + + S+ ++G ++AG+ T KHFPGHG +D Sbjct: 165 LAPVIDVNTQPCNPVIGIRSFSTCPDEIVRFGSKVVEGFNKAGLIATLKHFPGHGDTNSD 224 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPA-SGSPYWLKT 233 SH P + E+ ++ F +L ++ +M AH++ +D + S + Sbjct: 225 SHHTLPIVNKTLVELEKVELVPFKAL--ASQTPVVMTAHLLVPALDALNCVTFSKSAI-D 281 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMG-SYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 VLR+E+G G+I +D L M+G G + + SL AG D++L+ R S L Sbjct: 282 VLRKEIGEKGIILTDSLVMQGVLQDGKTVEQAALKSLQAGHDLLLLGGKRLLNESSEFEL 341 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 ++ + RQ L+ AR R+ L E + Sbjct: 342 EAKDVLKIHQFLVNAV--RQNLISEARIDISVKRILALKESY 381 >UniRef50_Q6AJ70 Related to glycosyl hydrolase n=1 Tax=Desulfotalea psychrophila RepID=Q6AJ70_DESPS Length = 377 Score = 299 bits (766), Expect = 1e-79, Method: Composition-based stats. Identities = 86/347 (24%), Positives = 163/347 (46%), Gaps = 31/347 (8%) Query: 4 VMLDVEGYELDA--EEREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAA 52 +M G E+ + +GG+ILF + N P Q+ +L +++ Sbjct: 42 LMTGFIGEEISQCSSTVNDIKEHGLGGVILFDKCLATGSRKNNITSPEQMAKLTSDLQSC 101 Query: 53 SRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 S L++AVDQEGG V RFR GF+ P+A +E+ + A++ + + Sbjct: 102 SAEPLLIAVDQEGGMVSRFRSAYGFSTSPSALMLGQQDDLEKCREAAEQ----TSQMLKD 157 Query: 111 MDIDISFAPVLDVGHI--SAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 ++ + APV+D+ + IG +RS+ + Q + A ++ ++G+ + KHFP Sbjct: 158 AGVNFNLAPVVDLDLTALNPIIGKYQRSFGDNAQTVIECALSWMSAHRKSGILSCLKHFP 217 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PAS 225 GHG+ ADSH+ +++ + + + ++DAIM H+ + ++DP PAS Sbjct: 218 GHGSSLADSHRGFVDITES---WDREELLPYKKIHQRTEIDAIMMGHLFHKNLDPLYPAS 274 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S + ++LRQE+ + G+I SDD+ M + ++ AG DM+++ NN Sbjct: 275 LSKRIIDSLLRQEMNYQGLIISDDMQMGAITKYYGLEKACCRAICAGVDMVIIGNNISSD 334 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 ++ + + + +G S + + W+ + N+LH+ Sbjct: 335 PKIVTKI----TDSLKNSIDQGLLSEARVDQA--WQRVQGLKNKLHK 375 >UniRef50_B2IHJ4 Glycoside hydrolase family 3 domain protein n=3 Tax=Rhizobiales RepID=B2IHJ4_BEII9 Length = 370 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 98/327 (29%), Positives = 154/327 (47%), Gaps = 27/327 (8%) Query: 5 MLDV-EGYELDAEEREILAHPLVGGLILFTRNYHDP---AQL----RELVRQIRAAS-RN 55 ++ + EL ++R +LA G+IL+ N+ AQ L+ I AA R Sbjct: 19 LVGLRPTPELHDQDRALLADLRPAGVILYKSNFRHDLPYAQWLEIHARLIEDIYAACGRE 78 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L + +D EGGRV R TR ++AA G A G M E+ ++ I++ Sbjct: 79 HLWIGIDHEGGRVCRTPAPITRF----AYAAHWG-----DCAAAVGQAMGCELASLGINL 129 Query: 116 SFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 +FAPVLD+ + IGER++ P+ +A F+ M + GKHFPGHG + Sbjct: 130 NFAPVLDIHTNPANPVIGERAFATTPEAVTPLALGFLRAMQAENVLGCGKHFPGHGDTSV 189 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLK 232 DSH P ++R +++ F + I E L IM +H+++ +DP P + S + Sbjct: 190 DSHYGLPSQNCSLEDLRQRELKPFIAAI-EAGLSMIMTSHILFPAIDPDFPVTLSRRFCT 248 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR+ELG+ GV+ SDD+ M + + AGCD+++VC + V D Sbjct: 249 ELLREELGYSGVVVSDDIGMGAMNQFFAEPTAAARLIAAGCDLLMVCAHWTETERVRD-- 306 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSAR 319 AE + R KG + +L S Sbjct: 307 ---FAESIVRARDKGEITPDQLTVSKV 330 >UniRef50_Q7X3A0 Putative glycosylhydrolase n=1 Tax=Candidatus Fritschea bemisiae RepID=Q7X3A0_9CHLA Length = 389 Score = 298 bits (763), Expect = 2e-79, Method: Composition-based stats. Identities = 83/332 (25%), Positives = 143/332 (43%), Gaps = 17/332 (5%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE 73 ++L +G LIL + H Q+ + ++ S+ L+ D E G R +E Sbjct: 47 SEMIEKMLDVYHIGALIL--KQAHPKEQI-PFLNALQKRSKLPLLCTGDAEWGLGMRMQE 103 Query: 74 GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH--ISAAIG 131 + P + + L +G + + + I ++FAPV+D+ + + IG Sbjct: 104 TLS-FPRNEIMGKVKDQ----SLLFSSGKEIGRQCRIVGIHLNFAPVVDINNHPNNTVIG 158 Query: 132 ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRA 191 RS+ +DP+K AS I GM + + KHFPGHG DSHK P + + Sbjct: 159 NRSFGSDPEKVSRCASLIIQGMRAGKVLSCLKHFPGHGDTDVDSHKGLPTILHSRKHLEQ 218 Query: 192 KDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDL 250 + F + DAIM H++ +D PAS S + +L++E GF G+I +D L Sbjct: 219 VEFLPFKENLESA--DAIMTGHLMLPVLDKNFPASLSKLIVTDLLQKEWGFKGLIITDAL 276 Query: 251 SMEGAAIMGSYAERGQASLDAGCDMILV-CNNRKGAVSVLDNLSPIKAERVTRLYHKGSF 309 +M+ S E + AG D++L + ++L+N P+ + + G Sbjct: 277 NMKALTQNYSVEEIALKAFLAGHDLLLYGSHRYDDVKNLLENAIPLAYKSIQNGIINGEI 336 Query: 310 SRQELMDSARWKAIS--TRLNQLHERWQEEKA 339 +L+D+ K + RL +R+ Sbjct: 337 D-SDLLDARVLKILQVKERLGLHKDRFNPNPE 367 >UniRef50_C7PJZ6 Beta-lactamase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJZ6_CHIPD Length = 1012 Score = 298 bits (763), Expect = 2e-79, Method: Composition-based stats. Identities = 81/311 (26%), Positives = 139/311 (44%), Gaps = 26/311 (8%) Query: 21 LAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPA 80 + + VGGLI F P + L ++ S+ L++A+D E G R T LP Sbjct: 99 IQNNKVGGLIFFQGG---PVRQANLTNYYQSISKTPLMIAIDGEWGLGMRLDS-VTSLPR 154 Query: 81 AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAIG--ERSYHAD 138 A+ LA E G +A + + I + +AP +DV + +RS+ D Sbjct: 155 NLMLGAMQDST----LAYEYGKQLAMQCRRLGIHVDYAPDMDVNNNPNNPVINDRSFGQD 210 Query: 139 PQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFS 198 + + + I GM + + KHFPGHG DSH + P + +A++ + ++ F Sbjct: 211 KFQVARMGVQVIKGMQDQNVMAVAKHFPGHGDTDVDSHLDMPVIRKSRAQLDSLELYPFR 270 Query: 199 SLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGA 255 I E + +IM AH+ +D P S S + L+ ELG+ G++ +D L M+G Sbjct: 271 KAI-EAGVQSIMMAHLYIPALDSTPNTPTSISHKAITDYLKGELGYKGIVITDALEMKGI 329 Query: 256 AIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELM 315 A + E SL AG DM+++ + G+V + R +G + +E + Sbjct: 330 AKFYTGGEEAARSLLAGNDMMMLPSTAAGSVDA-----------IKRAIRRGDITWEE-V 377 Query: 316 DSARWKAISTR 326 ++ K + + Sbjct: 378 NARVKKVLMAK 388 >UniRef50_C8PQ37 Glycosyl hydrolase, family 3 n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PQ37_9SPIO Length = 561 Score = 298 bits (763), Expect = 2e-79, Method: Composition-based stats. Identities = 99/338 (29%), Positives = 159/338 (47%), Gaps = 24/338 (7%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVV 59 M G + +G + +F N D +L + V ++ + R L V Sbjct: 68 MFGWAGQAPTDLLMNWITERSLGSIKIFGWNTEDSVELAKAVSLLQKKANGTRFRIPLFV 127 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A DQEGG ++ + + P + A + A G+ + E+ A+ I+++FAP Sbjct: 128 ATDQEGGWIRHVKGRTSETPGNLAIGASGYPID----AYYEGFYIGRELSALGINLNFAP 183 Query: 120 VLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 +D+ H S+ IG RS+ P+ A +A F+ G ++AG+ TT KHFPGHG DSH Sbjct: 184 TVDLYTNHESSIIGPRSFGQSPEAAGILARAFMQGSNKAGVLTTAKHFPGHGDTALDSHG 243 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVL 235 + P + +++ F LI + + AIM H+ + V PAS SPY L+TVL Sbjct: 244 KLPSIAISKETFYNRELVPFQYLI-DADVPAIMTGHLNFPAVSEHGEPASFSPYLLQTVL 302 Query: 236 RQELGFDGVIFSDDLSM-EGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 R +LGF G+I +DD+ M G+ A+ Q +L+AG D+I + + Sbjct: 303 RGQLGFKGLIITDDMMMHGAIRYAGTVAKAVQLALEAGNDIIESSTTPRLYDAFW----- 357 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 E + R+ +F R + D+AR + I T+L Sbjct: 358 --TENLQRMQTNEAF-RLRVKDAAR-RIIETKLRYFKS 391 >UniRef50_C6W0V4 Glycoside hydrolase family 3 domain protein n=2 Tax=Flexibacteraceae RepID=C6W0V4_DYAFD Length = 1018 Score = 297 bits (762), Expect = 3e-79, Method: Composition-based stats. Identities = 81/330 (24%), Positives = 147/330 (44%), Gaps = 23/330 (6%) Query: 19 EILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRL 78 +++ +GGLI F AQ L + +A S L + +D E G R + Sbjct: 105 KLINDYQIGGLIFFQGGPVSQAQ---LTNRYQAHSNIPLFIGIDGEWGLGMRLDSTIS-F 160 Query: 79 PAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYH 136 P A+ + +L G +A + + I I+FAPV DV + IG RS+ Sbjct: 161 PKQMVLGAI----QDDRLIYRMGDDIARQCKRLGIHINFAPVSDVNSNPANPVIGIRSFG 216 Query: 137 ADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSV 196 D + + ++ G+ + T KHFPGHG ADSH P ++ + D+ Sbjct: 217 EDKENVARKSIAYMKGLQHNRIIATAKHFPGHGDTDADSHFTLPVLNHSLEQLNS-DLYP 275 Query: 197 FSSLIRENKLDAIMPAHVIYSDVDPRP---ASGSPYWLKTVLRQELGFDGVIFSDDLSME 253 + +I ++ L ++ AH+ +D P +S S + +LR++LGF G++F+D ++M Sbjct: 276 YREMIADS-LMGVLNAHLYIPALDNTPNQASSLSDKVVNGLLRKDLGFRGLVFTDAMNMR 334 Query: 254 GAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQE 313 G G A+ +L AG D++L + VS + + I ++ ++ + Sbjct: 335 GVLKNGKPADVNLKALVAGNDVLLYPESIAETVSRIKD--AINSKLISEKIIDDKV--KR 390 Query: 314 LMDSARWKAIST----RLNQLHERWQEEKA 339 ++ + W ++ +N L+ EK+ Sbjct: 391 ILQAKYWAGLNQYKPIDINNLYNDLNSEKS 420 >UniRef50_Q8YVY6 All1831 protein n=11 Tax=Nostocaceae RepID=Q8YVY6_ANASP Length = 365 Score = 297 bits (762), Expect = 3e-79, Method: Composition-based stats. Identities = 100/330 (30%), Positives = 164/330 (49%), Gaps = 25/330 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHD-------PAQLRELVRQIRAAS-RN 55 +++ + G L E++ L G+I F +N+ D +EL QI+ + R+ Sbjct: 14 LIMGISGTSLSDEDKRALGELKPIGVIFFAKNFLDGVPYAVWLETFQELHSQIQEYAERD 73 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 + +D EGGRV R TR P A A+E A E+ ++ I++ Sbjct: 74 SMFFTLDHEGGRVVRTPLPITRFPQALLL---------RSQAREVAKATAIELKSLGINL 124 Query: 116 SFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 S++PV D+ + IG R++ P A A A + G+ EAG+ KHFPGHG + Sbjct: 125 SWSPVADIYSHPQNPIIGSRAFGNTPDTAAAGAREYYLGLTEAGIVGCAKHFPGHGDTSK 184 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLK 232 DSH E P E++ +++ F +LI E + IM AH+++ +DP PA+ S LK Sbjct: 185 DSHVELPILNLTPEELQNRELIPFKALIEEG-IPLIMTAHILFPKIDPDLPATLSRPILK 243 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 T+LR+ELGF GV+ SDDL M+ + M + + +AGCD+ +V N + ++ Sbjct: 244 TILRKELGFQGVVVSDDLDMKAVSDMFMQSGTVARAFNAGCDLFIVSRNIHAS--SIERT 301 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKA 322 I + L GS + + +++SA+ + Sbjct: 302 YKIAEDFTDCLTD-GSLA-ESVVESAKERI 329 >UniRef50_D0LSI1 Glycoside hydrolase family 3 domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LSI1_HALO1 Length = 387 Score = 297 bits (761), Expect = 3e-79, Method: Composition-based stats. Identities = 99/296 (33%), Positives = 152/296 (51%), Gaps = 15/296 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNY-----HDPAQLRELVRQI-RAASRN-- 55 + EG A+ + +A LVGG++LF RN H LR + R++ ++AS + Sbjct: 15 LWFGYEGDYPPADMQREIAAGLVGGVLLFKRNLPPEDEHFAPALRAINRRLHQSASGDAT 74 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQ---SFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L +A+DQEGG VQR R+ T P + A G + LA+ G + E+ A++ Sbjct: 75 PLWIAIDQEGGTVQRVRQPGTVWPPMLSFDALAEARGDQYAIALAERVGLALGHELAALE 134 Query: 113 IDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 ID+ FAPVLDV + IG+R+ +DP++ G+ AG+ + GKHFPGHG Sbjct: 135 IDVDFAPVLDVHTNPANPVIGKRALSSDPEQVAQRGLALARGLERAGVVSCGKHFPGHGD 194 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPY 229 DSH P A + A ++ F+ + IM AHV+++ +D PA+ S Sbjct: 195 TDTDSHLALPRLTHGSARLDAIELLPFARA-AAADIPMIMTAHVVFAALDDTVPATLSHA 253 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 + +LR+ LG+ G+I SDDL M A E ++ AGCD++L+C +R Sbjct: 254 VITGLLRRRLGYQGLIVSDDLDMRAIAAHFGIGEAAVMAVRAGCDVLLLCRDRAHQ 309 >UniRef50_A1JQZ0 Glycosyl hydrolase family protein n=8 Tax=Enterobacteriaceae RepID=A1JQZ0_YERE8 Length = 625 Score = 297 bits (761), Expect = 4e-79, Method: Composition-based stats. Identities = 95/340 (27%), Positives = 157/340 (46%), Gaps = 26/340 (7%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR-AASRNRLVVAVDQEGGRVQR 70 E++ +++ +G +ILF N D Q+ EL+ ++ A S L ++ DQEGG V R Sbjct: 77 EINDSVSKMIRDYHLGSIILFRENLIDTPQIVELINNLQNARSNLPLFISTDQEGGYVTR 136 Query: 71 FREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAI 130 R G T +P + A K AQ+AG + E+ ++ + ++ + I Sbjct: 137 LRVG-TEMPGNMALGATGS----SKFAQQAGSIHGYELSSLGFNFGPVVDVNNNQNNPVI 191 Query: 131 GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIR 190 G RSY DP +A +I G+H+ + T+ KHFPGHG VT+D+H P +A + Sbjct: 192 GVRSYSNDPVLVAELARSYISGIHKYNVLTSLKHFPGHGNVTSDTHFSLPAVNIDKAAWQ 251 Query: 191 AKDMSVFSSLIRENKLDAIMPAHVIYSDVD------------PRPASGSPYWLKTVLRQE 238 ++ F ++ DAIM AHV+ +D PA+ S L +LR + Sbjct: 252 QVELKPFVEVMP--VTDAIMTAHVVVPALDNSMLTNIKGEKVGTPATLSKPILTDILRNQ 309 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 L FDG+I +D + M + ++ AG D++L+ K + ++ L + Sbjct: 310 LKFDGLILTDAMDMGAITSNFERNWSIKQAIMAGNDIVLMPMEIKN-RASIEQLDALYDY 368 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 T +Q + DSA+ + I T+LN+ R E Sbjct: 369 LKTEATKDPEL-KQRIEDSAQ-RVIYTKLNK---RISPEP 403 >UniRef50_A6RET6 Putative uncharacterized protein n=1 Tax=Ajellomyces capsulatus NAm1 RepID=A6RET6_AJECN Length = 876 Score = 297 bits (761), Expect = 4e-79, Method: Composition-based stats. Identities = 80/333 (24%), Positives = 144/333 (43%), Gaps = 26/333 (7%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-----RLVVA 60 + +G +++E R ++ +G +IL +N + +LV +++ +R+ L++ Sbjct: 1 MGFDGTSVNSEIRSLIEDYHLGSVILTAKNLKSAEETTKLVYELQTIARDAGHPVPLLIC 60 Query: 61 VDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +DQE G V + + P+A AA LA+E A E+ A+ I+ Sbjct: 61 LDQENGGVNTLFDEIYIRQFPSAMGMAATGSK----SLAREVCKATAQELSAVGINWILG 116 Query: 119 PVLD--VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT-ADS 175 PVLD + +G RS DPQ+ A F+ G EAG+ T GKHFP +G + S Sbjct: 117 PVLDALTNARNQPLGVRSVGDDPQEVSAYGIEFMKGYQEAGLVTCGKHFPSYGNLDYMGS 176 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKT 233 + P ++ + + + I +D++M + A S + Sbjct: 177 QSDVPIITDTIEQLSVSALVPYRNAITHG-IDSMMVGGCSMASAGMTVMHACLSEQVVDE 235 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR++L FDGV+ S+ L ME + ++ AGCD+IL+C + ++ L Sbjct: 236 LLRKDLKFDGVVVSECLEMEALSYNIGVGGGTVMAMKAGCDLILLCRSFSVQQEAINGLK 295 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 G S+ + +S R + + + Sbjct: 296 LG--------VENGIISKSRIRESLR-RVLDMK 319 >UniRef50_C6X002 Glycoside hydrolase n=2 Tax=Flavobacteriaceae RepID=C6X002_FLAB3 Length = 523 Score = 297 bits (760), Expect = 4e-79, Method: Composition-based stats. Identities = 78/335 (23%), Positives = 146/335 (43%), Gaps = 26/335 (7%) Query: 10 GYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQ 69 G + R+ + +GGLIL D A+ LV + + S+ +++ +D E G Q Sbjct: 11 GEDFIDNVRKTVVKEQIGGLILMQ---DDAAREINLVNEFQQKSKVPMMIGMDAEWGVYQ 67 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 R + + P A + A+ L E +A + M ++ FAPV+DV + Sbjct: 68 RIAAAY-KFPWAMTLGAIQDKN----LITEMAAKIAEDCQRMGVNWDFAPVVDVNTNPQN 122 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 IG RS+ ++ + A + +G+ + + KHFPGHG DSH + P P Sbjct: 123 PIIGNRSFGSEVSNVVGSAMAYSNGLQQNNVLAAIKHFPGHGDTDTDSHLDLPVVSHPLK 182 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWLKTVLRQELGFDGV 244 + A +++ F +L+ + + +M AH+ ++ PAS S + +L+++ G+ G+ Sbjct: 183 RLEAVELAPFKALMDKG-IGGVMVAHLYIPALEKEKGVPASISKNIITGLLKEKYGYKGL 241 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY 304 I +D L+M A E + AG D++L + + + Sbjct: 242 IITDALNMGAVAKRYKPGELDALAFKAGNDIMLFSEGVEEGKRL-----------IQLAI 290 Query: 305 HKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 KG S+ + +S + K + T+ E+++ Sbjct: 291 DKGEISQNRVEESVK-KILLTKYFLGLEKYKPRNP 324 >UniRef50_B4CZU1 Beta-N-acetylhexosaminidase n=2 Tax=Verrucomicrobia RepID=B4CZU1_9BACT Length = 366 Score = 297 bits (760), Expect = 5e-79, Method: Composition-based stats. Identities = 87/312 (27%), Positives = 139/312 (44%), Gaps = 12/312 (3%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVV 59 +G ++L V G ELDA GG ILF RN QLR+L+ +R S+ ++ Sbjct: 6 LGQLLLTGVPGPELDAATAAQFRRLQPGGFILFGRNIQTANQLRKLIDDLRDLSQVEPMI 65 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +DQEGGRV R + P A + E L + G + A + ++ P Sbjct: 66 TIDQEGGRVSRLKLVGNEPPNANQLRDKNDPE----LIRRHGTITAKLLRIFGFNLDLCP 121 Query: 120 VLDVGHISAA---IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 VLD+ A + R Y + + + A F + G+ + GKHFPG+ D+H Sbjct: 122 VLDISFDDEADNSLRGRCYGNNVPQVIRNAGAFNSALQAGGILSCGKHFPGYSRAPLDAH 181 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTV 234 E P P + E+ +++VF +K+D++M H Y ++ P+S S + + Sbjct: 182 HELPEIPLSREELETHELAVFKHF--ADKVDSMMIGHGFYPALESSRTPSSLSHRVITDL 239 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LR+E GF G+I +DDL M E + ++ AG DM ++C+ L L Sbjct: 240 LRKEQGFKGLIMTDDLDMGAILNHYGLEETIRLAITAGNDMAMICHRVAVVEEALGYLER 299 Query: 295 IKAERVTRLYHK 306 + + R Sbjct: 300 VPVADLDRALEN 311 >UniRef50_A5CLY4 Putative beta-N-acetylglucosaminidase n=1 Tax=Clavibacter michiganensis subsp. michiganensis NCPPB 382 RepID=A5CLY4_CLAM3 Length = 607 Score = 296 bits (759), Expect = 7e-79, Method: Composition-based stats. Identities = 99/333 (29%), Positives = 151/333 (45%), Gaps = 36/333 (10%) Query: 15 AEEREILAHPLVGGLILFTR---NYHDPAQLRELVRQIRAAS----RNRLVVAVDQE-GG 66 A E++A GG+I F N DP Q+ L ++ AS R L+V+ DQE GG Sbjct: 94 ATPAEVVARHRPGGIIYFAGEKDNLVDPRQIARLSGDLQRASQGAGRIPLLVSTDQEQGG 153 Query: 67 RVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--G 124 V R T LP A+ + + E++AM I+ +FAPV+DV Sbjct: 154 YVVR--APCTLLPGQMGLASSPSSSAD---VRAVATITRDELLAMGINQAFAPVVDVASN 208 Query: 125 HISAAIGERSYHADPQKALAIASRFIDGMHE-AGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I RS+ DP + +A I + G+ KHFPGHGA DSH P + Sbjct: 209 PANPIIHVRSFGDDPDRVALLAQSQIGTFQKRQGIVAAAKHFPGHGATDTDSHTGLPVNN 268 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGF 241 R ++E+ D+ F++ I+ + + IM AH++ +D PA+ S L VLRQ+LG+ Sbjct: 269 RSRSELEDNDLPPFAAAIK-SGVGVIMSAHIVVPALDSSGDPATLSYPILTGVLRQKLGY 327 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 DGV+ +D L M G S A ++ AG D +L + A + V Sbjct: 328 DGVVITDALDMAGVRQRYSDARVPVLAIKAGVDQLLTPPDFFAA-----------RDGVL 376 Query: 302 RLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 G + + +S TR+ +L +R+ Sbjct: 377 AAIRAGELTEARIDESV------TRILRLKQRF 403 >UniRef50_D2AR90 Beta-glucosidase-related glycosidase-like protein n=20 Tax=Actinomycetales RepID=D2AR90_STRRD Length = 520 Score = 296 bits (759), Expect = 7e-79, Method: Composition-based stats. Identities = 96/316 (30%), Positives = 150/316 (47%), Gaps = 16/316 (5%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGG 66 +G R L+ +GG +LF RN PAQ ELV +R + +VVAVD+EGG Sbjct: 19 GFDGTAPPDWLRRALSE-GLGGAVLFARNLAGPAQTAELVAGLRRENPA-VVVAVDEEGG 76 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 V R + P ++ E ++A+E G L+A + DI + +APV+DV Sbjct: 77 AVTRLEARTGSSWPGNRALGVADDAERTERVAREIGRLLA----SADITLDYAPVVDVNA 132 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + IG RS+ DP+ + +I G+ AG+ KHFPGHG DSH P Sbjct: 133 NPANPVIGIRSFGPDPELVSRQTTAWITGLQGAGVAACAKHFPGHGDTVTDSHHALPTVH 192 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELGFD 242 ++ +D+ F + ++ + A+M H++ +DP PA+ S L +LR+E+GF Sbjct: 193 ADLELLQERDLPPFRAAVKAG-VQAVMCGHLLVPALDPGNPATLSRRILTGLLREEMGFG 251 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 G++ +D + M A + E +L AG D I V + G SV + + R Sbjct: 252 GMLVTDAIEMGAVAALHPPGEIAVRALAAGVDAICVGVSSPGGESVY-----ALRDAIVR 306 Query: 303 LYHKGSFSRQELMDSA 318 H G + L ++A Sbjct: 307 AVHDGRLPEERLAEAA 322 >UniRef50_Q1NMP4 Glycoside hydrolase, family 3-like n=3 Tax=Deltaproteobacteria RepID=Q1NMP4_9DELT Length = 335 Score = 296 bits (759), Expect = 7e-79, Method: Composition-based stats. Identities = 108/340 (31%), Positives = 158/340 (46%), Gaps = 25/340 (7%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRL 57 G M+ + G ELD RE++ V ILF+RN P QL L ++ AA R Sbjct: 13 GLFMVGLPGSELDDSSRELIERQRVNDFILFSRNVESPEQLTRLCAELAAACRAAGLPPP 72 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++A+DQEGG V R +E FTR P A+ A+ E A+E+ A+ I+I+F Sbjct: 73 LIAIDQEGGTVTRLKEPFTRFPDARELASGPNPEAALS---AYARTCATELRAVGININF 129 Query: 118 APVLDVGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APVLD+ R +P + I+ M G+ GKHFPG G D Sbjct: 130 APVLDLCPAGQGFYMERRVLGEEPATVARLGRLLIETMQGQGLAACGKHFPGLGTARLDP 189 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTV 234 H++ P R + E+ A D+ + + IM +H +Y +DP RPA+ S L + Sbjct: 190 HQDLPTLERSREELAAYDLPP-FAAAVAVPVALIMTSHTVYPALDPQRPATLSAPILTDL 248 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LR+++G+DGVI +DDL M E A+L AG D +L+C++ L +L Sbjct: 249 LRRQMGYDGVIITDDLEMGAIDHNLPLEEAVTAALTAGADQLLICHDHDKVRRALKHLR- 307 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 R H G+ + A A TR+ L R+ Sbjct: 308 -------RQLHTGALAP------APVSAALTRIAALRRRF 334 >UniRef50_Q10Y93 Beta-N-acetylhexosaminidase n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q10Y93_TRIEI Length = 308 Score = 296 bits (758), Expect = 8e-79, Method: Composition-based stats. Identities = 90/303 (29%), Positives = 156/303 (51%), Gaps = 11/303 (3%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA-ASRNRLVV 59 M P++ + G L E+++ G ILF RN P Q+++L +R ++ + + Sbjct: 1 MLPIIFSLAGTILSEAEKQLFTRSQPAGFILFQRNCEHPTQVKQLTDTLRKCINQKDIPI 60 Query: 60 AVDQEGGRVQRF-REGFTRLPAAQS-FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 +DQEGGRV R + P+A S ++ + + A+ +A ++ + I+++ Sbjct: 61 LIDQEGGRVARLQPPHWWVAPSASSYLQEINDLNDAAHRAKIDAKHIAQDLQNIGINVNC 120 Query: 118 APVLD--VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 +PVLD + IG R+Y P + + +A I+G E G+ KH PGHG DS Sbjct: 121 SPVLDCPIPEADLVIGTRAYSNSPSEIITMAKAVIEGFQEEGVMPVIKHIPGHGRALVDS 180 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRP-ASGSPYWLKTV 234 HK P E+ D F +L + + M AHVIY+ +D A+ SP ++ + Sbjct: 181 HKALPVVETSLKELSECDFLPFKALAK--SVTWAMSAHVIYTAIDSENCATVSPTIIQEI 238 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL-DNLS 293 +R+++GF+G++ +DDL+M+ + G + SL AGCD+ L C+ + + L DN++ Sbjct: 239 IRKQIGFEGILVTDDLTMKALS--GDLVTKVNRSLKAGCDLALYCDGKLEDMEFLADNIT 296 Query: 294 PIK 296 P+K Sbjct: 297 PVK 299 >UniRef50_B2AR40 Predicted CDS Pa_4_8720 n=3 Tax=Leotiomyceta RepID=B2AR40_PODAN Length = 898 Score = 295 bits (755), Expect = 2e-78, Method: Composition-based stats. Identities = 78/335 (23%), Positives = 145/335 (43%), Gaps = 25/335 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-----RLV 58 V++ +G E+ + R ++ +G ++L +N Q +LV++++ + L+ Sbjct: 43 VIMGWDGTEVTPQIRHLIEEHHLGSILLTAKNLKSAHQTAKLVQELQTIAHQAGHLQPLL 102 Query: 59 VAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 +A+DQE G V + + P+A AA + LA + A+E+ A+ +++ Sbjct: 103 IALDQENGGVNSLYDEDYICQFPSAMGQAAAGNAD----LAYKVAKATATEVSAVGVNLI 158 Query: 117 FAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT-A 173 PVLDV +G R+ DPQ+ ++G +AG+ T GKHFP +G + Sbjct: 159 LGPVLDVLTNARYQPLGVRAVGDDPQEVSQYGIAAMNGYKDAGVATCGKHFPSYGNLDFL 218 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAH--VIYSDVDPRPASGSPYWL 231 S + P + E+ + F + I KLDA+ + ++ A S + Sbjct: 219 GSSLDVPIITQTLEELSLSALVPFRNAIATGKLDAMFVGGCGITNPSMNVNHACLSDQVV 278 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR ELGF GV S+ L ME +++AGCD++L+C + + Sbjct: 279 DDLLRNELGFTGVAISECLEMEALRSEIGVKTGTVMAVEAGCDLVLLCRAYDVQLEAVAG 338 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 L +++ + S R + + + Sbjct: 339 LKLG--------VENELLTKERIYTSLR-RVLKMK 364 >UniRef50_B5CPR6 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CPR6_9FIRM Length = 408 Score = 295 bits (755), Expect = 2e-78, Method: Composition-based stats. Identities = 86/336 (25%), Positives = 156/336 (46%), Gaps = 31/336 (9%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVAVDQEGG 66 + L VGG+I F N D QL +++ ++ S + ++VD+EGG Sbjct: 94 TAAGDTTKRCLEQYPVGGMIYFAGNIEDRQQLSDMLHTTQSYSTEQSGIPIFLSVDEEGG 153 Query: 67 RVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV-- 123 V R G + + + ++ E A E G + + + + ++ +APV DV Sbjct: 154 TVARVAGSGISNVSVVEDMCEVTDEER----AYEIGQTIGNYLAELGFNLDYAPVGDVLT 209 Query: 124 GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + + +RS+ D K A++ + +DG+HE + KHFPGHGA D+H+ Sbjct: 210 NPENTVVKKRSFGNDSAKVAALSQKVLDGLHEENVYGVIKHFPGHGATAGDTHEGYAYTD 269 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGF 241 + E+ +++ F + I EN + IM H+ +V D P+S S + VLR+++GF Sbjct: 270 KSLEELEQQELVPFQNAI-ENHIAIIMAGHISVPNVTGDNTPSSLSKIMITDVLREKMGF 328 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 DG++ +D L+M + S AE ++AG DM+L+ N + A + + Sbjct: 329 DGIVITDALNMGAISQHYSSAEVSVKVIEAGGDMLLMPENFQEAY-----------QGIL 377 Query: 302 RLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEE 337 G+ + + + +S R R+ ++ E+ E Sbjct: 378 EAVQNGTLTEERIDESVR------RILKVKEKLTAE 407 >UniRef50_B3EKH1 Glycoside hydrolase family 3 domain protein n=3 Tax=Chlorobiaceae RepID=B3EKH1_CHLPB Length = 583 Score = 294 bits (753), Expect = 4e-78, Method: Composition-based stats. Identities = 77/317 (24%), Positives = 144/317 (45%), Gaps = 24/317 (7%) Query: 20 ILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLP 79 +++ VGG++ N +D A L + + + L+++ D E G R +G T Sbjct: 88 LVSQGKVGGIMFLKGNTNDAAVLA---NRFQFIAPRPLLISADMEKGLAMRL-DGATEFA 143 Query: 80 AAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHA 137 + + +A + L + ++A E A+ I S+ P +D+ + I RSY Sbjct: 144 PSMALSATGRPD----LVFKMAGVIAQEAKALGIYHSYGPSVDLNTNPLNPVINTRSYGD 199 Query: 138 DPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVF 197 + + +++ FIDG+ G+ T KHFPGHG VT DSH P + + ++ F Sbjct: 200 NIPLTIEMSNAFIDGLQSNGIIATAKHFPGHGDVTVDSHINLPVLNADKKRLERVELKPF 259 Query: 198 SSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGA 255 + I ++ + +IM H+ PA+ S + +LR+ELGFDG+I +D L+M+ Sbjct: 260 IAAI-DHGIMSIMIGHLAIPAYTGSMTPATLSWRIVTKLLRKELGFDGLIITDALNMKAL 318 Query: 256 AIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELM 315 + + +++AG D++L + + + L +G S +++ Sbjct: 319 YQSYTLEDISLRAVEAGNDLLLFSPDPERTHTTL-----------LNAVRRGKLSEKQIN 367 Query: 316 DSARWKAISTRLNQLHE 332 S R ++ R L + Sbjct: 368 KSVRRILLAKRWLGLDK 384 >UniRef50_C6XX68 Glycoside hydrolase family 3 domain protein n=4 Tax=Sphingobacteriaceae RepID=C6XX68_PEDHD Length = 568 Score = 293 bits (751), Expect = 6e-78, Method: Composition-based stats. Identities = 73/322 (22%), Positives = 146/322 (45%), Gaps = 18/322 (5%) Query: 10 GYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQ 69 G + +++ VGGL+ F P + L ++ +R L++ D E G Sbjct: 64 GKAFEDSIGKVIKKERVGGLVFFQGG---PGRQAILTNTYQSLARVPLLITSDGEWGLGM 120 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 R + P + A+ + L + G +A + + + ++ AP +DV + + Sbjct: 121 RLDSTIS-YPYQMALGAVQNKD----LLYKMGLEVARDYKRIGMHMNLAPDVDVNNNPKN 175 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 I RS+ + A+ ++ GM + G+ + KHFPGHG DSH + P +A Sbjct: 176 PVINFRSFGENKYNVATKAAAYMKGMQDGGLLVSIKHFPGHGDTDVDSHYDLPQLNFTRA 235 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWLKTVLRQELGFDGV 244 + + ++ F LIRE +M AH+ +D P++ S + +L++ELGF G+ Sbjct: 236 RLDSLEIYPFRELIREGAA-GVMIAHMNIPALDNTPNMPSTLSKPIVTGLLKEELGFKGI 294 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY 304 + SD + M+G E + AG D++ + N A+ ++ ++A+R++ Sbjct: 295 VISDAMGMKGVVKNFKDGEADVMGIIAGNDILELSENSARAIKLVR--KAVRADRIS--M 350 Query: 305 HKGSFSRQELMDSARWKAISTR 326 + S ++++ + W ++ + Sbjct: 351 EQIDASVKKILTAKYWAGLNVK 372 >UniRef50_Q3B6L6 Beta-N-acetylglucosaminidase n=2 Tax=Chlorobiaceae RepID=Q3B6L6_PELLD Length = 585 Score = 293 bits (750), Expect = 7e-78, Method: Composition-based stats. Identities = 88/333 (26%), Positives = 148/333 (44%), Gaps = 23/333 (6%) Query: 4 VMLDVEG--YELDAEER----EILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL 57 +M D G E D R +++ VGG++ + A L ++ + N L Sbjct: 66 LMADSPGAYTEKDDPARMKLDRLVSEGKVGGIMFLKGSMFSAAMLA---NHFQSIAPNPL 122 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 +++ D E G R EG TR P+A + +A E GK G +A E A + +F Sbjct: 123 LISADMERGLAMRL-EGATRFPSAMAVSAAGSSELSGK----MGEAIALEAKAAGLQWNF 177 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 AP D+ + I RSY + A +A I+GM G+ T KHFPGHG VT DS Sbjct: 178 APSADLNSNPLNPVINTRSYGDTSEGASRMADALIEGMQSKGLLATAKHFPGHGDVTVDS 237 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKT 233 H P + ++ F S IR + A+M H+ + PAS S + Sbjct: 238 HFALPVLEADSLRLEEYELKPFRSAIRCG-VMAVMTGHLAVPKLTGTMDPASISAPIVTG 296 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR+ LGF G+I +D L+M+ + E ++ AG D++L + + + + ++ Sbjct: 297 LLRKRLGFKGLIVTDALNMKALYDGHNVEEISVRAVAAGNDVLLFSPDPERSFHAI--VN 354 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 ++ ++ S + ++ + RW + + Sbjct: 355 AVRDSVISEARIDESV--RRILQAKRWAGLRPK 385 >UniRef50_B8HWS2 Beta-N-acetylhexosaminidase n=5 Tax=Cyanobacteria RepID=B8HWS2_CYAP4 Length = 552 Score = 292 bits (749), Expect = 9e-78, Method: Composition-based stats. Identities = 83/279 (29%), Positives = 139/279 (49%), Gaps = 9/279 (3%) Query: 38 DPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGK 95 A++ +Q+++ + L++A D E G QRF G T P + AA++ Sbjct: 64 TAAEIGLRTQQLQSWATIPLLLAADIEEGVGQRF-SGATWFPPPMALAAIARQNPALAAD 122 Query: 96 LAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGM 153 A + G A+E +A+ ++ APV+DV + + I R++ + + + + FI+G Sbjct: 123 YAHQMGAHTAAEALAIGLNWVLAPVVDVNNNPDNPVINVRAFGENTGEVSQLTTAFIEGA 182 Query: 154 HEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAH 213 + T KHFPGHG DSH E P P P ++A ++ F + I +D++M AH Sbjct: 183 QTHPVLTAAKHFPGHGDTATDSHLELPLIPHPLERLQAVELPPFQAAI-AAGVDSVMSAH 241 Query: 214 VIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAG 272 ++ +DP PA+ S L LRQ LGF G+I +D L M A E +L+AG Sbjct: 242 LLIPALDPDYPATLSQRILTGELRQNLGFAGLIVTDALVMGAIANRYGLNEAPLMALEAG 301 Query: 273 CDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSR 311 D++L+ + GA+ + + +++ R+T K S R Sbjct: 302 ADILLMPLDPAGAIQAI--CAAVESGRITVDRIKTSVER 338 >UniRef50_C7HX27 Beta-N-acetylhexosaminidase n=1 Tax=Thiomonas intermedia K12 RepID=C7HX27_THIIN Length = 361 Score = 292 bits (747), Expect = 1e-77, Method: Composition-based stats. Identities = 135/332 (40%), Positives = 186/332 (56%), Gaps = 15/332 (4%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 P++LDV G +L A +R+ L +PLVGG++LF RN+ QL L QI+A R L++AV Sbjct: 7 SPLILDVAGLQLGAADRKRLRNPLVGGVVLFARNWESRGQLLSLCAQIKAV-RPDLLIAV 65 Query: 62 DQEGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFA 118 D EGGRVQRFR +GFT LPA + A + G + A G+++ASE+ A +D SF Sbjct: 66 DHEGGRVQRFRTDGFTALPAMRELGAHWMADAPGAMRRAAACGFVLASELRACGVDFSFT 125 Query: 119 PVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ S+ IG+R++HAD + +A + G+ AGM KHFPGHG V AD+H Sbjct: 126 PVLDLDWGGSSVIGDRAFHADARVVTVLAQSLMLGLARAGMVHCAKHFPGHGYVRADTHL 185 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 D R I + D + + L L A+MPAHVIY +VD PA S WL+ VLR+ Sbjct: 186 AVARDARTLDAILSADAAPYQWL--APGLRAVMPAHVIYPEVDALPAGFSSVWLRDVLRE 243 Query: 238 ELGFDGVIFSDDLSMEGA-AIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 +GF G I SDDL M A A S ++ L AGCD +LVCN SV+DN P+ Sbjct: 244 RIGFTGAILSDDLGMAAARAEGQSLSQAALQGLQAGCDAMLVCN-----QSVVDNGRPLD 298 Query: 297 A--ERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + +G ++ R + T+ Sbjct: 299 VLLRELDQALERGQWTPDVHSAQRRLDLLPTQ 330 >UniRef50_D2PXL5 Glycoside hydrolase family 3 domain protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PXL5_9ACTO Length = 487 Score = 292 bits (747), Expect = 2e-77, Method: Composition-based stats. Identities = 94/332 (28%), Positives = 153/332 (46%), Gaps = 19/332 (5%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +++ G + R +A +G +ILFTRN D Q+ L +RA R L++A+ Sbjct: 13 GALIVGFTGTTAPDDLRRAVA-AGLGAVILFTRNVTDAEQVAALTTALRA-ERPDLLIAI 70 Query: 62 DQEGGRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D EGG L + +S L + L + + A+ + A+ ID+ AP Sbjct: 71 DHEGGEFSHLAPANPWPLSSPRSLGELDDTD----LTRASARDAAATLAALGIDVMLAPS 126 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 +DV + I RS+ F+ G+H+AG+ KHFPGHG V DSH Sbjct: 127 VDVNSNPANPIISTRSFGTTAGVVSRHGVAFVQGVHDAGIAACPKHFPGHGDVAVDSHLA 186 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P R ++ A +++ F + I E D +M AH+I+S D +PA+ S L+ +LR E Sbjct: 187 LPRVDRSVEQVTAVELAPFKATI-EAGADLVMSAHIIFSAYDDQPATLSHRILQGLLRDE 245 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LGF GVI +D L M+ S + AS+ AG D+ + +++ + Sbjct: 246 LGFTGVITTDALEMQAITATRSIEDAAVASIRAGADLAM--------IAIGEADPRALTA 297 Query: 299 RVTRLYHKGSFSRQELMDSA-RWKAISTRLNQ 329 + G+ L ++A R + ++ +L Sbjct: 298 HLVDAVSNGTLDAGRLAEAAGRVRELAGKLAS 329 >UniRef50_A0LL99 Beta-N-acetylhexosaminidase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LL99_SYNFM Length = 337 Score = 292 bits (747), Expect = 2e-77, Method: Composition-based stats. Identities = 97/325 (29%), Positives = 150/325 (46%), Gaps = 15/325 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRLVV 59 +++ G + + ++ +GG++LF RN P QLR L+ ++++ +R L V Sbjct: 13 LLVGFRGTTFQEQLKSLIDEFGIGGIVLFRRNIQTPEQLRSLLEEMQSHARQVLGRSLWV 72 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A+DQEGG VQR FT+LP+A A ++G + E A ++ M I I+ AP Sbjct: 73 AIDQEGGPVQRLVPPFTQLPSACDLA-----QQGIEAVAEWSSKAAMDLRRMGIHINLAP 127 Query: 120 VLD--VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLD V S + R DP + R+I + AG+ T KHFPG G D H Sbjct: 128 VLDLRVNANSHFMEGRCLGDDPLTVAELGCRWIKTLQGAGVSATAKHFPGLGLAELDPHH 187 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 P P E +D+ F I E + +M +H +Y +D PA+ SP +LR Sbjct: 188 FAPVIRWPDQEAMQRDLLPFRKAI-EAGVHCVMTSHALYPFIDSVWPATLSPAINNDLLR 246 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 LGF G + SDD+ M + S+ E + L A D L+C + + L Sbjct: 247 GTLGFRGTLLSDDMDMAAVSEKYSWKEMAEQGLLATIDFFLLCQRTENIEQLQGALCAAI 306 Query: 297 AE--RVTRLYHKGSFSRQELMDSAR 319 A R+ ++ + + + L D R Sbjct: 307 AGSSRIEAMHRESAKRIEWLYDRHR 331 >UniRef50_Q2S5L3 Beta-N-acetylglucosaminidase n=2 Tax=Rhodothermaceae RepID=Q2S5L3_SALRD Length = 979 Score = 291 bits (746), Expect = 2e-77, Method: Composition-based stats. Identities = 83/321 (25%), Positives = 128/321 (39%), Gaps = 24/321 (7%) Query: 16 EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGF 75 E ++ GGLI P + ++A + L+VA D E G R + Sbjct: 92 ETVALVEEFGAGGLIF---GPGTPMTQIAMANDLQAKAERPLLVAQDTEWGVGMRI-DAA 147 Query: 76 TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAIG--ER 133 T P A + A A G+ A E A+ + +APV DV + R Sbjct: 148 TSFPPAMAIGATRDASH----AYRVGYATAREARALGVHQLYAPVADVNNNPKNPIINVR 203 Query: 134 SYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKD 193 S+ P +AS F G G T KHFPGHG DSH P ++ + + + Sbjct: 204 SFGESPSLVGTMASAFTRGAQRGGTLATVKHFPGHGDTDVDSHINLPVLRFDRSRLDSLE 263 Query: 194 MSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGVIFSDDLS 251 + F + + ++M H+ ++ D PA+ S +LR+ELGFDG++ +D L+ Sbjct: 264 LPPFRQAF-DAGVRSVMTGHLALPEIAADSVPATLSRPLTHGLLREELGFDGLVVTDALN 322 Query: 252 MEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSR 311 M+ E L+AG D++L+ N A V + G Sbjct: 323 MQAVTRTFGVGETAVRVLEAGADLVLMSTNPHAAHQA-----------VRQAVTSGRIDT 371 Query: 312 QELMDSARWKAISTRLNQLHE 332 E+ DS R + +LHE Sbjct: 372 TEINDSVRRLLRVKQDLRLHE 392 >UniRef50_B9WKL4 Beta-n-acetylglucosaminidase, putative n=4 Tax=Saccharomycetales RepID=B9WKL4_CANDC Length = 960 Score = 291 bits (745), Expect = 3e-77, Method: Composition-based stats. Identities = 84/343 (24%), Positives = 148/343 (43%), Gaps = 25/343 (7%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN------RLVVA 60 +G + + ++ V +IL +N + Q+ +L++ ++ + ++ A Sbjct: 13 GFQGTTVTPQAYHLIVEHHVSAMILSRKNAINVEQMSKLIKDLQYIAMTEGRYTYPIMFA 72 Query: 61 VDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +D+EGG + + T+ P A + AA +L E +A+E+ + I Sbjct: 73 IDEEGGMMNSLFDPDFLTQYPGAMALAATGDP----QLVYEISRAIATEIKKIGFSIILG 128 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLDV +G RS+ + + G+ + G+ T GKHFPG G DS Sbjct: 129 PVLDVVTKLSHQLVGVRSFGTTIEDVVKYGRMCARGLQDGGLFTVGKHFPGIGNAAVDSL 188 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTV 234 E P +++ ++ F LI E LD I A ++ D A SP + + Sbjct: 189 LELPMMGDSLEQLKHFNVVPFGELINEGLLDGISAAGCGVPNISPDETHACLSPIVINQL 248 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LRQEL FDG + S+ L ME + +++AGCD+++VC++ L Sbjct: 249 LRQELRFDGFVISECLEMEALYHSVGLGQGVILAINAGCDLVMVCHDM--------ALQN 300 Query: 295 IKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHERWQE 336 + + + G+ + L+ S R K + TRL + + E Sbjct: 301 EATDSIKKAVVNGNLDEETLIKSLKRIKKLQTRLPSWAQIFPE 343 >UniRef50_B8NYN9 Beta-N-acetylglucosaminidase, putative n=1 Tax=Aspergillus flavus NRRL3357 RepID=B8NYN9_ASPFN Length = 733 Score = 290 bits (744), Expect = 4e-77, Method: Composition-based stats. Identities = 81/326 (24%), Positives = 135/326 (41%), Gaps = 25/326 (7%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-----RLVVA 60 + +G +D + R ++ + +G ++L +N LV +++ +R+ L++A Sbjct: 1 MGFDGTSVDPQIRSLIENYHLGSILLTAKNLKSAEDATRLVLELQTIARDAGHPVPLLIA 60 Query: 61 VDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +DQE G V + + P+A AA LA + A E+ A+ I+ Sbjct: 61 LDQENGGVNSLYDEIYIRQFPSAMGIAATGSK----ALAHDVAVATAQELKAVGINWILG 116 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT-ADS 175 PVLDV S +G R+ DPQ+ F+ G EAG+ T GKHFP +G + S Sbjct: 117 PVLDVLTNVRSQPLGVRTTGDDPQEVSQYGVEFMKGYKEAGLVTCGKHFPSYGNLEFLGS 176 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSD--VDPRPASGSPYWLKT 233 + P ++ + F + I +D++M V S A S + Sbjct: 177 QSDVPIITESLEQLSLSALVPFRNAILHG-IDSMMVGGVSMSSAGFHVMHACLSEQVVDD 235 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR+++ FDGV+ S+ L ME + AGCD+IL+C + + L Sbjct: 236 LLRKKMKFDGVVVSECLEMEALTHNIGVGGGTVMAKKAGCDIILLCRSFPVQQEAITGLK 295 Query: 294 PIKAERVTRLYHKGSFSRQELMDSAR 319 G R + S R Sbjct: 296 LG--------VENGIIGRARIEQSLR 313 >UniRef50_UPI0001C379AD glycoside hydrolase family 3 domain protein n=2 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C379AD Length = 452 Score = 290 bits (743), Expect = 5e-77, Method: Composition-based stats. Identities = 85/324 (26%), Positives = 140/324 (43%), Gaps = 26/324 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRLVVAVDQEGG 66 R+ L VGGLI F+ N Q+ +++R + + L VD+EGG Sbjct: 143 TSATQATRDGLEEYPVGGLIFFSDNMESREQVTDMIRTTKDYAAGLGCVPLFFGVDEEGG 202 Query: 67 RVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV- 123 V R E G + S+ ++G A +A ++ D+ FAPV D Sbjct: 203 VVSRCAEKLGTSAFEPMYSY-----KDKGPVTAYANARKLAGDIGQFGFDLDFAPVADTW 257 Query: 124 -GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + IG R+Y D ++ + + G E G+ T KHFPGHG DSH Sbjct: 258 SNPDNKVIGTRAYSDDFEETAMLVGAAVRGFREGGVHCTLKHFPGHGDTYEDSHDGFAVS 317 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFD 242 R E+R+ + + F I+ D +M H+ +D PAS S + LR LG+D Sbjct: 318 NRTLDELRSNEYTAFDRGIKAG-ADFVMVGHITVPAIDNLPASLSSKVINDELRGTLGYD 376 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 G++ +D L+M A S + +++AG DM+L N + +V+ +++ Sbjct: 377 GIVITDALAMGAIANAYSSSVCAVMAVEAGNDMLLSPKNLRESVAGIED----------- 425 Query: 303 LYHKGSFSRQELMDSARWKAISTR 326 G S + + +S R + + + Sbjct: 426 AVANGVISEERIDESVR-RILKVK 448 >UniRef50_Q6MGM9 Beta-hexosamidase A n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MGM9_BDEBA Length = 558 Score = 290 bits (742), Expect = 6e-77, Method: Composition-based stats. Identities = 81/330 (24%), Positives = 145/330 (43%), Gaps = 22/330 (6%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA----ASRNRLVVA 60 ++ ++ + +A G +LF RN Q+REL + S+ ++A Sbjct: 57 IVGFPHKDVAPDLESFIAKYKPGSFLLFKRNMISAPQIRELNLALYKSSYKHSKLPPLIA 116 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +DQEGG V R P A + + LA+E G+ + + +++ APV Sbjct: 117 IDQEGGSVSRLP-ITPAPPNALALGQT----QSPLLAEEMGYQTGLFLREVGFNMNLAPV 171 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LDV ++ IG RS+ +DP+ I + G+ +A + T KHFPG G++ D H Sbjct: 172 LDVVDPYSTSFIGVRSFGSDPELVRDIGVAYSKGLLKARVIPTAKHFPGTGSLNQDPHLT 231 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLR 236 + E++ +D+ F+ + A+M +H+IY +D PAS S KT+LR Sbjct: 232 IVKNSASLEEMKKRDLIPFTGYSKIGANIAVMMSHLIYPGLDEDLEPASFSTKISKTLLR 291 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 EL + G++ +DDL M+G+ + +L +G D++++ + D Sbjct: 292 DELKYQGLVMTDDLQMQGSKQVLRPEAAALRALQSGSDIVMLTWSFADQGKAFD------ 345 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTR 326 V G + + R + + + Sbjct: 346 --YVLAALKSGELPSAHVDEKLR-RILRVK 372 >UniRef50_Q6MFP1 Related to beta-N-acetylglucosaminidase n=9 Tax=Sordariomycetes RepID=Q6MFP1_NEUCR Length = 1085 Score = 290 bits (742), Expect = 6e-77, Method: Composition-based stats. Identities = 78/335 (23%), Positives = 144/335 (42%), Gaps = 25/335 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLV 58 +++ +G E+ + RE++ +G ++L +N Q +LV++++ + L+ Sbjct: 20 LIMGWDGTEVTPQIRELITDHHLGSILLTAKNLKSAHQTAKLVQELQTIAHNAGHPYPLL 79 Query: 59 VAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 +A+DQE G V E + P++ AA + L+ + A+E+ A +++ Sbjct: 80 IALDQENGGVNSLFDEEHICQFPSSMGQAAAGSPD----LSYQLAKATATEVSACGVNLI 135 Query: 117 FAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT-A 173 PVLDV +G R+ DPQ+ + G +AG+ T GKHFP +G + Sbjct: 136 LGPVLDVLTNARYQPLGVRATSDDPQEVSQYGIAAMKGYKDAGVATCGKHFPSYGNLDFL 195 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAH--VIYSDVDPRPASGSPYWL 231 S + P + E+ + F + I KLDA+ + + A S + Sbjct: 196 GSSLDIPIITQTLEELALSALVPFRNAIATGKLDAMFVGGCGITNPSMKVAHACLSEQVV 255 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR+ELGF GV S+ L ME ++ AGCD++L+C + + Sbjct: 256 DELLREELGFTGVAISECLEMEALRTEMGVRTGTIMAVQAGCDLVLLCRAYDVQLEAISG 315 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 L V +++ + S R + + + Sbjct: 316 LKLGLENEV--------LTKERVYTSLR-RVLKMK 341 >UniRef50_D2AY03 Beta-N-acetylhexosaminidase n=2 Tax=Streptosporangineae RepID=D2AY03_STRRD Length = 484 Score = 289 bits (741), Expect = 8e-77, Method: Composition-based stats. Identities = 93/344 (27%), Positives = 145/344 (42%), Gaps = 24/344 (6%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +++ G L + +GG+ LF N DPAQL L ++R A V+++ Sbjct: 19 GTLLVAFRGTTAPEWVLRDLEN-GLGGVTLFGFNVADPAQLSGLTARLREAGEP--VISL 75 Query: 62 DQEGGRVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D+EGG V R + P + A+ +E L + + ++ +++ AP Sbjct: 76 DEEGGDVTRLDYHVGSPYPGNAALGAVDDVE----LTRRVYRSIGDDLARCGVNLDMAPS 131 Query: 121 LDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG RS+ D + G+ AG+ KHFPGHGA DSH E Sbjct: 132 ADVNTADDNPVIGTRSFGPDTALVARHTVAAVHGLQAAGVAACVKHFPGHGATRQDSHLE 191 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQ 237 P +R +++ F + I + +IM AHV + PA+ S L +LR Sbjct: 192 IPLVDASLELLRERELVPFRAAI-DAGTRSIMTAHVRVPALTGTTPATLSAAALTVLLRG 250 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ELG+DGVI SD L M+ A SL AG D++ + D++ I Sbjct: 251 ELGYDGVIVSDALDMKAVTDTYGLAGGSVLSLAAGTDLLCLGPLPTE-----DDVRRIIT 305 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAGH 341 E + G AR +A + R+ +L + + G Sbjct: 306 E-IVAAVGDGRLPL------ARLEAAAERVARLRAWFGTSQTGQ 342 >UniRef50_A2P4H6 Beta-N-Acetylglucosaminidase n=9 Tax=Vibrio RepID=A2P4H6_VIBCH Length = 535 Score = 289 bits (741), Expect = 8e-77, Method: Composition-based stats. Identities = 83/333 (24%), Positives = 144/333 (43%), Gaps = 15/333 (4%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA----SRNRLVVA 60 +L G A R ++A +GG L N AQ + L R ++ R L++ Sbjct: 25 ILAFPGKSAQAAHR-MIAEYNLGGCYLSQDNAETFAQAQALNRGLQQILEQHQRLPLLLG 83 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEG E T P + + + + + EM + P Sbjct: 84 VDQEGAWGVLVGESTTG-PGNLALG----VSDQAVGTERMYQVFGEEMRGAGFNCILGPC 138 Query: 121 LDVGHIS--AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV I RS+ P+K ++ + G+++ G+ KHFPGHG D+H++ Sbjct: 139 CDVNLNPASPIIDTRSFGDQPEKVSLHSAAAVRGLYQGGIAACAKHFPGHGDTHGDTHRD 198 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 P + AE++ D++ F + I + +D +M +H++Y +D PA+ S L+ VLRQ Sbjct: 199 IPRVDKSYAELKLNDLAPFQAAI-DAGVDLVMTSHILYPQLDANAPATLSATILQQVLRQ 257 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ++GFDGVI SD ++M + + ++ AG MI++ + Sbjct: 258 DMGFDGVIISDSMNMGAIVNHYTPVDAAVKAMQAGITMIMLSEEHYDHSEAYLDKQLAMI 317 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 V +G + + ++D A K + +L +L Sbjct: 318 HGVIDAVEQGVLA-ESVIDQALEKVVRLKLEKL 349 >UniRef50_Q3AQ16 Beta-N-acetylglucosaminidase n=3 Tax=Chlorobium/Pelodictyon group RepID=Q3AQ16_CHLCH Length = 592 Score = 288 bits (739), Expect = 1e-76, Method: Composition-based stats. Identities = 80/336 (23%), Positives = 146/336 (43%), Gaps = 32/336 (9%) Query: 2 GPVML-DVEGYELDAEEREIL------AHPLVGGLILFTRNYHDPAQLRELVRQIRAASR 54 G +++ D+E +++L +GG+I D +V ++A + Sbjct: 76 GQMIIADIEPTAFSPRNKKVLLLSRLAQEGKIGGVIFMKG---DAKSTGAVVNHLQALAP 132 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L+ + D E G R G T P + A + KLA++ +A E + + Sbjct: 133 LPLLFSSDMERGVAMRI-SGTTEFPPNMALGATADP----KLAEDMATAIAQEATLLGMH 187 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 ++AP +D+ + I R++ + +A+ I G+ G+ T KHFPGHG VT Sbjct: 188 HNYAPTVDLNSNPRNPVINTRAFGDTIPLTIVMANAIIKGLQSHGVLATAKHFPGHGNVT 247 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYW 230 DSH P + ++ A ++ F + I + + IM H+ + + PA+ SP Sbjct: 248 VDSHVALPVLQATREQLEAYELIPFRAAIEQG-VATIMVGHLAVPALTGNMEPATISPAI 306 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 + T+LRQELGF G+I +D L+M+ + A ++ AG D++L + + S Sbjct: 307 VTTLLRQELGFKGLIITDALNMKALYNGSNVATLSVRAVQAGNDLLLFSPDPEATHSA-- 364 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 V + G +++ S R + + + Sbjct: 365 ---------VVQAVEAGQIPLEQINASVR-RILQAK 390 >UniRef50_A3EUK5 Beta-N-acetylhexosaminidase n=2 Tax=Leptospirillum sp. Group II RepID=A3EUK5_9BACT Length = 342 Score = 288 bits (739), Expect = 1e-76, Method: Composition-based stats. Identities = 91/314 (28%), Positives = 152/314 (48%), Gaps = 12/314 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-RNRLVVAVD 62 + + + G L AE+ L GG++LF N +++L+ +IR A + +A+D Sbjct: 15 LWVSLPGARLGAEDIRWLHQIRPGGVVLFAENGTRAEDVKKLISEIREALFPQDVWIAID 74 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 QEGGRV R +EG RL A++ E + + + + IDI FAPVLD Sbjct: 75 QEGGRVARLKEGVPRLSPARTLGNGDPSE-----IFRSAFHLGRALRQTGIDIDFAPVLD 129 Query: 123 V--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG+RS+ +P + ++ F G+ G+ GKH PGHG DSH P Sbjct: 130 VDSNPDNPVIGDRSFSREPWEVARLSMAFSHGLEAGGVIPCGKHVPGHGDTLLDSHLALP 189 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQEL 239 P+A + ++ F I E + +M AHV+Y +D + PA+ S ++ +LR++ Sbjct: 190 TVDAPEAILSGREFLPFEKAIAEG-MPMLMTAHVMYPALDKKWPATLSRKIMQDMLRKKW 248 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 F G++ SDDLSM ++ + +L CD +LV +R + + + + + Sbjct: 249 DFRGLLLSDDLSMAAVGTPFPFSVVVERALSVTCDGLLVLKSRDKPLEAIRIIKGLSEKN 308 Query: 300 --VTRLYHKGSFSR 311 + R + S +R Sbjct: 309 PGLWRARLEKSIAR 322 >UniRef50_B3DX93 Beta-glucosidase-related glycosidase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DX93_METI4 Length = 373 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 85/316 (26%), Positives = 146/316 (46%), Gaps = 12/316 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ + G L ++ + + G I F+RN +P R L+ Q+R+ + + A+DQ Sbjct: 18 LIIGLPGAHLSQDQIQTIREIQPSGFIFFSRNLENPLSFRNLLEQLRSLVEHEPIFAIDQ 77 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGGRV R + + P A+ + ++ L +E G L + ++ APVLD+ Sbjct: 78 EGGRVSRLKAFGSEPPGAKELGEKADLD----LLREHGELTGKLLRLFGLNFDLAPVLDM 133 Query: 124 G---HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 ++ RS+ DPQ A FI+G+ G+ GKHFPG+ D H E P Sbjct: 134 DTSEREHNSLKGRSFAPDPQDVSRFAKAFIEGLKSQGILCCGKHFPGYSFAQCDPHLELP 193 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQE 238 + + E+ + + F +L +N D+ M AH+ ++DP+ PA+ S + +++RQE Sbjct: 194 QSFKSKKELESFEWIPFRAL--QNSCDSFMIAHIRNFNLDPQGLPATLSSRMI-SIIRQE 250 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 F ++ SDD+ M S + + SL AG D++L+C+ LS Sbjct: 251 WQFTQLLVSDDIDMGAIINHYSLKDTLELSLKAGIDILLLCHRFHLVREAAAILSSFPES 310 Query: 299 RVTRLYHKGSFSRQEL 314 + R L Sbjct: 311 IKEEAAQRIENFRNRL 326 >UniRef50_Q11XV7 B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11XV7_CYTH3 Length = 395 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 87/338 (25%), Positives = 160/338 (47%), Gaps = 25/338 (7%) Query: 4 VMLDVEGYELDAEEREI---LAHPLVGGLILFTRNY---HDPAQLRELVRQIRAASRNRL 57 +M+ + + A + + VGG+I+F +N + A L++L+ ++A++ L Sbjct: 65 IMIGINDRKSIAATDSVLIEIKSGKVGGIIMFEKNIAAANSAATLKKLISDLQASASTPL 124 Query: 58 VVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 +++D+EGG+V R +E GF +P+A L+ ++ +A E+ + ++ Sbjct: 125 FMSIDEEGGKVHRLKEKYGFVPMPSAGYLGNLNNLDSTTYYTTN----LAKELKELGFNL 180 Query: 116 SFAPVLDV--GHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 ++AP LD+ + I RSY ADP A+ I+ H G+KT KHFPGHG+ Sbjct: 181 NYAPCLDLATNPNNPIIAKVGRSYSADPDLVSKQAAASINAHHTQGIKTIVKHFPGHGSS 240 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPY 229 +DSH + ++ + +LI+ +DA+M AH+I +D P + S Sbjct: 241 MSDSHLGIVDVT---NQWNIIELMPYKTLIQSGTIDAVMTAHIINEHLDQSKLPGTLSKV 297 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + +LR LGF GV+FSDD+ M + + S+ AG D+++ NN + + Sbjct: 298 VVTDLLRNMLGFTGVVFSDDMQMYAISKNYGQENAIKLSILAGVDVLVFGNNVSASDRIK 357 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTR 326 + + +L G + ++ R A+ + Sbjct: 358 ---ASEIHAIIKKLVLSGDIPESRINEAYERILALKNK 392 >UniRef50_A5DD90 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DD90_PICGU Length = 990 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 85/343 (24%), Positives = 146/343 (42%), Gaps = 27/343 (7%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS------RNRLVVA 60 EG + + ++ + + L +N Q+ +L++ ++ + + L+ A Sbjct: 10 GFEGATVTPQAYHLIVDLHISTMFLSKKNAVSVQQMAKLIQDLQYIAFTRGNYKYPLMFA 69 Query: 61 VDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +D+EGG + E T+ P A AA E L + +A+E+ + + Sbjct: 70 IDEEGGMMNSLFDPENLTQFPCAMGLAATGNTE----LVYKVSKALATELKHIGFSMILG 125 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLDV +G RS+ + G+ + G+ T GKHFPG G T DS Sbjct: 126 PVLDVVTKLSHQLVGVRSFGTTLESVTRYGMACARGLQDGGLITMGKHFPGIGNATVDSL 185 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTV 234 E P IR + F+ LI+E LD I A + D A SP L + Sbjct: 186 LELPMVTDSIDSIRRLNCVPFAELIKEGVLDGISAAGAGVPTISPDEVHACLSPVVLNQL 245 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LRQEL FDGV+ S+ L M+ + ++ AGCD+++VC++ NL Sbjct: 246 LRQELKFDGVVISECLEMDALHHSVGLGQGVILAVFAGCDLVMVCHDP--------NLQN 297 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEE 337 A+ + + G+ +++++ + +L + +W E Sbjct: 298 EAADSLEKAVANGNLDED-IVNASLERI--EKLQKKLPKWSEI 337 >UniRef50_A5EWZ9 Beta-hexosaminidase n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EWZ9_DICNV Length = 334 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 114/302 (37%), Positives = 171/302 (56%), Gaps = 9/302 (2%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 + + +E +L ++ L PL G++LF+RNY +P QLR L+++IR R ++++ VD Sbjct: 3 LFIGIEQTKLCRADKRRLQSPLCSGVVLFSRNYQNPEQLRALIQEIRQL-RPQILIGVDH 61 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEE--GGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQRFREGFT LPAA + + +AQ G ++A E+ A ID SFAPVL Sbjct: 62 EGGRVQRFREGFTLLPAAAQIGRIFDQDPVISWSVAQACGVILAYELRAQSIDFSFAPVL 121 Query: 122 DV-GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 D+ S IG+R++H +P ++ GM GM GKHFPGHG V+ DSH E P Sbjct: 122 DLYDPKSTVIGDRAFHENPAIVSVLSCSMRRGMKTVGMAAVGKHFPGHGRVSGDSHHELP 181 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R AE R D+ F ++++ ++A+M AH++ D D PAS S ++ + ++ G Sbjct: 182 IDTRSDAE-RRADLVPFIFNVQQD-IEALMMAHILMPD-DTLPASFSAKIIQEL--RDWG 236 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F G + SDDL M A + + QA LDAG D ++CN+ V+D ++ Sbjct: 237 FKGAVVSDDLDMAAAKQFATPTQCVQAVLDAGADAAMICNHFADMDRVIDEGVTVRDSAA 296 Query: 301 TR 302 ++ Sbjct: 297 SQ 298 >UniRef50_C8N6C8 Beta-hexosaminidase n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8N6C8_9GAMM Length = 335 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 129/292 (44%), Positives = 167/292 (57%), Gaps = 8/292 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M V++ +EG L A+E L P G+ILF+RNY PAQL+ L IRA ++ Sbjct: 1 MSAVIIGIEGTMLLAQEAAQLRRPFCAGVILFSRNYSSPAQLKRLCATIRAQH-PDAIIC 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEE--GGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRFREGFT LPA + +LA+ G ++A E+ ID SF Sbjct: 60 VDQEGGRVQRFREGFTALPAMGALGRCFDRRPLLACELAEAVGVIIAYELRQFSIDFSFT 119 Query: 119 PVLDVG-HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ +S IG+R+ HA+P + +A G+ GM GKH+PGHG V DSH Sbjct: 120 PVLDLHDPVSEVIGDRALHANPAVVVTLADALRRGLGNVGMIAVGKHYPGHGRVVGDSHH 179 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P DPR E R D+ F I E+ ++A+MPAH+++ D RPAS SP +K LR Sbjct: 180 VLPQDPRSTDE-REPDLYPFIRQI-ESGIEALMPAHIVFPQRDHRPASFSPVHIKE-LRA 236 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 GFDG I SDDL M GA + AER QA+LDAG D+ L+CN+ L Sbjct: 237 R-GFDGAIISDDLDMAGALGIADPAERIQAALDAGADVTLMCNDFSAIRQAL 287 >UniRef50_C0CJ14 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CJ14_9FIRM Length = 441 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 79/332 (23%), Positives = 142/332 (42%), Gaps = 31/332 (9%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL----VVAVDQEGG 66 E+ E VGG++ + QL + R R + L +AVD+EGG Sbjct: 125 LEVGELTCEAFRQYPVGGIVYMENHILSEEQLTAMNRSFRELGQETLGVSPFLAVDEEGG 184 Query: 67 RVQRFREGFTRLP----AAQS-FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 V R P A ++ A + G + + + + ++ FAPV Sbjct: 185 TVTRIAGNQ-AFPVENVGNMENVGAGRDSDQ----AYQVGQTLGAYLKSYGFNMDFAPVA 239 Query: 122 DV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV + + ER++ +DP ++ + G+ ++ KHFPGHGA DSH Sbjct: 240 DVLVNSENTVVKERAFGSDPNLVSSMVEAEVRGLKGEKIEAVLKHFPGHGATAEDSHNGY 299 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQ 237 R E+R ++ F + I + + +M H+ + ++ PAS S + + +L+ Sbjct: 300 AYAQRSLEELRETELVPFQAGI-DAGAEFVMVGHICFPLIEDGQMPASLSSWAVTELLKG 358 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 E+GF+GV +D ++M A S AE ++ AG DM+L+ + + A + Sbjct: 359 EMGFEGVAVTDAMNMGAIAENYSSAEAAVQAIQAGIDMVLMPADFEAAYN---------- 408 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 V + S++ L D+ R + ++ +L Sbjct: 409 -GVLQAVSSQEISQERLHDALR-RILTVKLEM 438 >UniRef50_C5EIW3 Glycoside hydrolase n=2 Tax=Clostridiales RepID=C5EIW3_9FIRM Length = 446 Score = 287 bits (736), Expect = 3e-76, Method: Composition-based stats. Identities = 77/328 (23%), Positives = 140/328 (42%), Gaps = 29/328 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRLVVAVDQEGG 66 + RE L VGGLI F+ N D QL+ + + + L +++D+EGG Sbjct: 129 TQAGEVTREALRQYPVGGLIYFSNNLEDAGQLKTMTGDTQKMAMEEIGIPLFLSIDEEGG 188 Query: 67 RVQRFREGF----TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 + R + ++P+ A + A AG + + + +++ FAP D Sbjct: 189 DIARIGKHDGFRVAQVPSMAVIGASGDRVQ----AYRAGSTIGAYLHEYGLNLDFAPDAD 244 Query: 123 V--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + + RS+ +PQ ++ +++G+ E G+ KHFPGHGA DSH+ Sbjct: 245 VLTNPDNTVVKTRSFGRNPQLVTEMSLAYLEGLWEQGVYGAPKHFPGHGATKGDSHEGFA 304 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQE 238 + E+ ++ F +++ N++ IM H+ + P S S ++ LR Sbjct: 305 YTDKTWEELEEAELKPFYAMVG-NQVPFIMAGHISLPAITGGDLPCSLSGQVIQGYLRDR 363 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 +G++G+I +D L+M + +L AG DM+L+ + P E Sbjct: 364 MGYEGIIITDALNMGAIEDHYPSGQAAVMALQAGVDMLLMPADF-----------PSAYE 412 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTR 326 V G L D++ + +S + Sbjct: 413 AVVEGAGNGEIPLDRL-DASVLRILSLK 439 >UniRef50_B0P7U2 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7U2_9FIRM Length = 555 Score = 287 bits (736), Expect = 3e-76, Method: Composition-based stats. Identities = 96/371 (25%), Positives = 156/371 (42%), Gaps = 48/371 (12%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA----SRNR 56 G + L + G LD + L +G +ILF+ N P Q EL +++ + Sbjct: 8 GQLFLTSIPGDTLDEQTIAFLNETNIGNIILFSNNTPSPIQTAELTAALQSCITQNTGVP 67 Query: 57 LVVAVDQEGGRVQRFREGFTRL-----------------------PAAQSFAALSGMEEG 93 ++AVDQEGG + R P A + AA G Sbjct: 68 AIIAVDQEGGEISRIIGSAYEYAPDENGIKYGEIPLGAGIETFLYPCAMAIAASGGAPTA 127 Query: 94 GK------LAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAI 145 + + AG + E+ A+ I+ + APV D+ + IG RSY D ++ A Sbjct: 128 QEPVPEFNAVRTAGLYIGQELRAVGINCNLAPVADINSNPANPVIGTRSYGDDAERVAAD 187 Query: 146 ASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENK 205 A+ F G+ E+G+ KH+PGHG + DSH P + E+R+ ++ F + ++ Sbjct: 188 AAAFATGLQESGVLAVAKHYPGHGDTSIDSHYGLPSVNKTLEELRSLELIPF-NAVQNAD 246 Query: 206 LDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAE 263 + A+M AH+++ +D PA+ S L VLR+ GFDG+I +D + M + E Sbjct: 247 IGAVMAAHIVFPQIDSTGLPATLSQPVLTGVLRKG-GFDGLILTDSMRMGAIMQNFGFGE 305 Query: 264 RGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAI 323 +++AG D+I + L + V GS S + L D Sbjct: 306 ACVTAINAGADLITTGTGGDNIQGL--ALQRAAYDAVLAAVKSGSISPETLDDRV----- 358 Query: 324 STRLNQLHERW 334 R+ ER+ Sbjct: 359 -GRILLYKERF 368 >UniRef50_A4CPY8 Glycosyl hydrolase, family 3 n=1 Tax=Robiginitalea biformata HTCC2501 RepID=A4CPY8_9FLAO Length = 534 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 89/319 (27%), Positives = 139/319 (43%), Gaps = 28/319 (8%) Query: 19 EILAHPLVGGLILF----------------TRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 E++ +GGL F N L+EL+R+ + A+R L++A+D Sbjct: 42 ELIREFHIGGLCFFHSPASAATNFEGKKEIPHNPDSLGTLKELIRRYQRAARYPLLIAMD 101 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 E G R G P A ALS + L + G MA++ I + APV D Sbjct: 102 AEWGLAMRVENGE-PYPYALCLGALSDRDP---LLYQVGLRMAADCRDAGIHWNLAPVAD 157 Query: 123 VGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG R++ P+ AS F G+ + G+ GKHFPGHG DSH P Sbjct: 158 VNCNPGNPVIGYRAFGTTPEGVARRASLFYRGLKDGGILGCGKHFPGHGDTAVDSHLALP 217 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQE 238 + AE+ + + F L+ E +DAIM H+ +DP PAS SP + +RQ Sbjct: 218 AIDKGAAELESTEFVPFRRLVAEG-VDAIMTGHLAVPALDPEGLPASLSPAVIGQGIRQN 276 Query: 239 LGFDGVIFSDDLSMEGAAIMG-SYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 LGFDG + SD L+M + E + AG DM+ + A + + Sbjct: 277 LGFDGAVISDALNMHAVSKNYPGPGEVAWRAFAAGNDMLCFAEDIPEAHRRI--CREGET 334 Query: 298 ERVTRLYHKGSFSRQELMD 316 +++ + + ++ + + Sbjct: 335 DQIEASFRRVWQLKERVRE 353 >UniRef50_Q6BNW2 DEHA2E18546p n=3 Tax=Saccharomycetaceae RepID=Q6BNW2_DEBHA Length = 1044 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 89/362 (24%), Positives = 157/362 (43%), Gaps = 34/362 (9%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS------RNRLVVA 60 +G + A+ ++ V +IL +N Q+ +L+R ++ + + L+ A Sbjct: 18 GFQGTNVTAQAYHLIVQHHVSAMILSKKNAVSVKQMTKLIRDLQYIAYTQANYKYPLLFA 77 Query: 61 VDQEGGRVQRFREGF--TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +D+EGG + + + P A + AA + L + +A E+ + I Sbjct: 78 IDEEGGMMNSLFDPNFLNQFPGAMAMAATGDAD----LVYKISKAIAIELKHIGFSIILG 133 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLDV +G RS+ + G+ + G+ T GKHFPG G + DS Sbjct: 134 PVLDVVTKLSHQLVGVRSFGTTVEDVTKYGKACARGLKDGGLITVGKHFPGIGNASVDSL 193 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTV 234 E P +I+ + F+ LI+E LD I A + D A SP + + Sbjct: 194 LELPMMADSLDQIKHFNSVPFAELIKEKLLDGISAAGCGVPTISPDETHACLSPIVINQL 253 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LRQ+LGFDG++ S+ L M+ + +L AGCD+++VC+++ ++++ Sbjct: 254 LRQDLGFDGMVISECLEMDALYHSIGLGQGVILALYAGCDLVMVCHDKTLQDEAVESIDK 313 Query: 295 IKA-------------ERVTRLYHKGSFSRQEL----MDSARWKAISTRLNQLHERWQEE 337 A R+ +L K S EL DSA+ + ++ HE W++ Sbjct: 314 ALANGNLDDEIISSCLRRIEKLQKKLP-SWNELFPRGEDSAKEDELPLFKDKYHEMWKQH 372 Query: 338 KA 339 ++ Sbjct: 373 QS 374 >UniRef50_C0BMZ8 Glycoside hydrolase family 3 domain protein n=1 Tax=Flavobacteria bacterium MS024-3C RepID=C0BMZ8_9BACT Length = 558 Score = 287 bits (734), Expect = 6e-76, Method: Composition-based stats. Identities = 78/347 (22%), Positives = 156/347 (44%), Gaps = 30/347 (8%) Query: 1 MGPVMLDVEGYELDAEEREI----LAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + + + +E ++ L GG+I + P + +L ++ S Sbjct: 51 IGQLFVVMVQSKAPEKELKVFEEELKRHQPGGVIF---SLGTPYKQAQLTNHYQSFSTVP 107 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++++D E G R + P + A+ + L G M + + + ++ Sbjct: 108 LMISMDAEWGVAMRLDS-VPKYPWNMTLGAIQDL----SLLSTIGRHMGMQAKRLGVHMN 162 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+DV + IG RS+ +D ++ A A GM+ AG+ T KHFPGHG + D Sbjct: 163 FAPVVDVNINPNNPIIGNRSFGSDVRRVSARALALSQGMNAAGVFTCAKHFPGHGDTSVD 222 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWL 231 SHK P P + + + ++ + + + + + +M AH+ + P S S Sbjct: 223 SHKALPVLPFTRERLDSIELYPYKTPLLQG-VQGVMVAHLDVPVLTGEPGLPVSLSHKVT 281 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L++EL F G++F+D L+M+GA+ + +L AG D++++ N+ +++ L Sbjct: 282 TGLLQKELNFKGLVFTDALNMKGASDFSPDGDVSLKALLAGNDVLVIPNDLGLSIAHL-- 339 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 + KG + L S + K ++ + + ++ + Sbjct: 340 ---------SEALKKGVLTEARLAYSVK-KILAAKYDLGLNQYSPVQ 376 >UniRef50_D1YED3 Glycosyl hydrolase family 3 N-terminal domain protein n=3 Tax=Propionibacterium acnes RepID=D1YED3_PROAC Length = 629 Score = 286 bits (733), Expect = 7e-76, Method: Composition-based stats. Identities = 90/365 (24%), Positives = 151/365 (41%), Gaps = 71/365 (19%) Query: 18 REILAHPLVGGLILF--TRNYH---DPAQLRELVRQIRAASRN----RLVVAVDQEGGRV 68 + + +GG++ F + N +PA + L ++ A+R L +AVDQEGG V Sbjct: 76 AQAVQKYHLGGVLYFNWSGNISTPANPAAVAHLSNGLQEAARTSSTAPLAIAVDQEGGIV 135 Query: 69 QRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI-- 126 R T P + A++ + A+ G ++ E+ A+ I++ FAP +DV Sbjct: 136 ARITSPATEFPGNMALGAVNDP----RAARAQGAVLGRELAALGINVDFAPDVDVNTNPA 191 Query: 127 SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 + IG RS DP + + I G+ G+ T KHFPGHG DSH P + Sbjct: 192 NPVIGVRSMGDDPVRVGVLGVEQIRGIQSRGVAATAKHFPGHGDTQIDSHLGLPVVEYGR 251 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVI 245 + + F I ++D IM AH+I +DP PA+ S L +LR+++G+ G+I Sbjct: 252 TTLER-HLVPFRMAIS-AEVDMIMTAHIIVKTLDPTMPATLSKKVLTGLLREQMGYRGII 309 Query: 246 FSDDLSMEGAAIM-------GSY----------------------------------AER 264 +D L MEGA + G+Y Sbjct: 310 TTDALDMEGAQLAVMTEDEKGTYTRLKAAADAEGNDPTAADEGHASAEFNAFMKPIRGRV 369 Query: 265 GQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAIS 324 + +AG D++L ++ V+ + R G S + L +S + ++ Sbjct: 370 AVQAFEAGSDILLNPHDADAVVTAM-----------RRAIADGRVSERRLDES-VLRILA 417 Query: 325 TRLNQ 329 + + Sbjct: 418 WKRRR 422 >UniRef50_UPI00019730D7 beta-glucosidase-like glycosyl hydrolase n=1 Tax=Clostridium sp. M62/1 RepID=UPI00019730D7 Length = 505 Score = 286 bits (733), Expect = 7e-76, Method: Composition-based stats. Identities = 85/331 (25%), Positives = 147/331 (44%), Gaps = 29/331 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN----RLVVAVDQEGG 66 Y + + VGG++ F++N D AQ ++++ + ++ +++AVD+EGG Sbjct: 183 YAAGEATKAAIEQYPVGGIVYFSQNLRDEAQTKDMLTRTNQYAKERTGLPMLLAVDEEGG 242 Query: 67 RVQRF--REGFT--RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 +V R REGF P A E+ A E G + + + + ++ FAPV D Sbjct: 243 QVARISGREGFALDAFPDMSDIGAQGDSEK----AYEVGDAIGAYLAELGFNMDFAPVAD 298 Query: 123 V--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + + RS+ +D + + R +G+ + G+ KHFPGHG DSH+ Sbjct: 299 VLTNPDNTVVKRRSFGSDAKTVANMVVRETEGLKKHGISPVLKHFPGHGGTAEDSHEGAA 358 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQE 238 R E+ ++ F + I E D +M H+ + D RPA S + VLR Sbjct: 359 VIHRTVDELLENELIPFYTGI-EAGADFVMVGHISDPEAVGDDRPAVFSKTLVTEVLRDT 417 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 L FDG+I +D ++M S + +L+AG DMIL+ + K A + Sbjct: 418 LKFDGIIITDAMNMGAITESCSSGDAAVKALEAGVDMILMPEDFKAA-----------CQ 466 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 V G+ + +S + ++ + + Sbjct: 467 GVEEAVEAGTIGEDRIEES-VLRILTLKAEK 496 >UniRef50_D1SAP6 Glycoside hydrolase family 3 domain protein n=4 Tax=Actinomycetales RepID=D1SAP6_9ACTO Length = 499 Score = 286 bits (733), Expect = 7e-76, Method: Composition-based stats. Identities = 82/317 (25%), Positives = 135/317 (42%), Gaps = 16/317 (5%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGG 66 G L +G ++LF RN D Q+ L +RA R ++VA+D+E G Sbjct: 23 GFVGTTPPPWVCRWLGEG-LGSVVLFARNVVDHEQVATLTANLRA-ERPDVIVAIDEEAG 80 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 V R + P + A+ L +E + +E+ A+ + + +AP DV Sbjct: 81 DVTRIESARGSSRPGNYALGAVDDP----ALTEEVARDLGAELAAVGVTLDYAPDADVNS 136 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + IG RS+ ADP + ++ G+ G+ KHFPGHG DSH + P Sbjct: 137 NPANPVIGVRSFGADPDMVARHTAAWVRGLQAGGVAACAKHFPGHGDTRVDSHHDLPRIG 196 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFD 242 A + A +++ F + + M H++ +DP PA+ SP L +LR+E+GF Sbjct: 197 GDLARLDAVELAPFRAAVAAGVQAV-MTGHLLVPALDPELPATLSPRVLGGLLREEMGFG 255 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 GV+ +D + M A + +L AG D I V R + + + Sbjct: 256 GVVVTDAVEMRAVADRYGFTGAAVRALVAGADAICVGGERATEADARE-----LRDAIVA 310 Query: 303 LYHKGSFSRQELMDSAR 319 G + L ++A+ Sbjct: 311 AVISGELPEERLAEAAK 327 >UniRef50_A9WJ64 Glycoside hydrolase family 3 domain protein n=2 Tax=Chloroflexus RepID=A9WJ64_CHLAA Length = 515 Score = 286 bits (733), Expect = 8e-76, Method: Composition-based stats. Identities = 87/319 (27%), Positives = 150/319 (47%), Gaps = 23/319 (7%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTR-NYHDPAQLRELVRQIRAASRN----RL 57 +++ G E+ + +++ + V G+ L+ N PA++R L ++ + L Sbjct: 7 QLLIWFAGDEVPSHLPDLIFNHRVAGVTLYRHLNGASPARIRALTAHLQQLAAASGYGPL 66 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++ DQEGG++Q G T P A + A + LA G + +E+ A+ +++ + Sbjct: 67 LIGADQEGGQLQALP-GLTAFPGAMALGAT----QSPALAFRVGSAIGAELAALGVNLVY 121 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APV DV + IG R++ DP+ +A+ I G+ + G+ T KHFPGHG DS Sbjct: 122 APVCDVNTNPQNPVIGIRAFGDDPKIVGVLAAAMIAGLQQHGVAATAKHFPGHGDTFGDS 181 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKT 233 H ++ + FS+ I E + A+M H+ + RPA+ +P + Sbjct: 182 HYGLVRVNHDYERLQHVEWLPFSAAI-EAGVQAVMVGHLAVPALQAADRPATRAPQVIAH 240 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV--CNNRKGAVSVLDN 291 LR LGF GV SD L M+ A YA ++ AG D++L+ N+ + + L Sbjct: 241 -LRHHLGFRGVTISDALDMKAFAPDDGYARALSEAVHAGIDLLLLGAGNDLEQTIKFLST 299 Query: 292 L-----SPIKAERVTRLYH 305 + S I ++R+ L Sbjct: 300 IDRNTKSTISSQRIAALRQ 318 >UniRef50_UPI0001B54569 sugar hydrolase n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B54569 Length = 483 Score = 286 bits (732), Expect = 9e-76, Method: Composition-based stats. Identities = 92/319 (28%), Positives = 139/319 (43%), Gaps = 19/319 (5%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGG 66 G R LA +GG+ILF RN D Q+ L +R+ R+ +V+ +D+EGG Sbjct: 17 GFAGTTAPDWVRRRLAE-GLGGVILFGRNVVDDEQVAALTSALRS-ERDGVVIGIDEEGG 74 Query: 67 RVQRFREGFTRL-PAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV-- 123 V R G P + A E L + + A + ++ AP D+ Sbjct: 75 DVTRLDVGKGSFVPGPLALGAAGDPE----LTASVAAALGERLAACGVTVNLAPCADLTL 130 Query: 124 GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 IG R++ +DP A + F+ GM + G+ KHFPGHGA T DSH P P Sbjct: 131 AAEDPGIGVRAFGSDPVAASPHVAAFVTGMQKYGVAACAKHFPGHGAATEDSHLSLPVLP 190 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDG 243 R AE+R ++ F++ I + + +M H++ + PA+ + L VLR ELGF G Sbjct: 191 RTAAELREVELVPFAAAI-DAGVRTVMSGHLVVREWGEEPATFNRTALVDVLRGELGFTG 249 Query: 244 VIFSDDLSMEGAAIMG----SYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 + +D L M + E +L G D + + A LD AE Sbjct: 250 TVITDALEMGAISGAYGRHDGIGEASVRALLGGADALCLGGAAFDA-ETLDR----TAEA 304 Query: 300 VTRLYHKGSFSRQELMDSA 318 + +G S + L D+A Sbjct: 305 IVAAVTEGRLSVERLADAA 323 >UniRef50_A6Q7E8 Glycosyl hydrolase, family 3 n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6Q7E8_SULNB Length = 361 Score = 286 bits (732), Expect = 9e-76, Method: Composition-based stats. Identities = 97/345 (28%), Positives = 162/345 (46%), Gaps = 38/345 (11%) Query: 4 VMLDVEGYELDAEER--EILAHPLVGGLILFT---------RNYHDPAQLRELVRQIRAA 52 +M+ G R + + +G +ILF +N QL++L Q++ Sbjct: 30 LMVGFHGTSAPKNSRICQDIRKYHIGAVILFDYNPVDKSKPKNIASRKQLKKLTAQLQKC 89 Query: 53 SRN-RLVVAVDQEGGRVQRFRE---GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEM 108 S + +L++AVDQEGG+VQR + + P A + K ++ MA E+ Sbjct: 90 SYDGKLLIAVDQEGGKVQRLKRKYGFYGNFPKASDVTRM-----DKKQIKKTYRKMAKEL 144 Query: 109 IAMDIDISFAPV--LDVGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKH 164 ++ I+ APV LD+ + I RS+ DP+ A AS FID MH G+ T+ KH Sbjct: 145 KSVGINYDLAPVVDLDINPKNHVIHGLGRSFGKDPKTVAAYASTFIDAMHHYGVLTSLKH 204 Query: 165 FPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-P 223 FPGHG+ D+HK + ++ + L R+ D +M AHV +D R P Sbjct: 205 FPGHGSSVGDTHKGFVDVT---NLWKRVELEPYRLLNRKA--DTVMVAHVFNKKLDARYP 259 Query: 224 ASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN--N 281 AS S + +LR++LGF+GV+ +DDL M + + +++AG D++L N + Sbjct: 260 ASFSNRTVNGILRKQLGFNGVVITDDLQMGAISKKYGLKNTLKLAINAGDDILLFGNQLD 319 Query: 282 RKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + VS + E + L +G + + + D A + + + Sbjct: 320 PRKTVS-----TKKLVETIKSLLKRGEVNPKSI-DYAYIRIQNLK 358 >UniRef50_C4XU31 Putative beta-N-acetylhexosaminidase n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XU31_DESMR Length = 568 Score = 286 bits (732), Expect = 1e-75, Method: Composition-based stats. Identities = 104/342 (30%), Positives = 150/342 (43%), Gaps = 27/342 (7%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFT--RNYHDPAQLRELVRQIRAAS-----RNRL 57 M+ G + + ++ +GG+IL+ N P QL L ++A S L Sbjct: 48 MVWFAGPAVSDDIAGLIRQRHLGGVILYGVPGNIESPGQLAALNAGLQAVSAATRHGLGL 107 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++ VDQEG V R R+GFT P+ + AA + LA+ A A E+ A+ I+++F Sbjct: 108 LIGVDQEGAPVARLRKGFTLFPSQMAQAATGRAD----LARLAASATARELRAVGINVNF 163 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APV DV + IG RS+ + P G AGM T KHFPGHG T DS Sbjct: 164 APVADVNVNPANPVIGIRSFGSAPADVARFTGAATAGYGAAGMICTPKHFPGHGDTTIDS 223 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWLK 232 H P A + D A+M AH++ +D PA+ S L+ Sbjct: 224 HVGLPRSDHDAAALDKLDF-PPFRAAFAAGAPAVMTAHMVVPALDGEPDLPATLSRRVLE 282 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 LR +GFDGVIF+D L M A E +L AG D++LV + + + L Sbjct: 283 GTLRGRMGFDGVIFTDSLGMGAVADTYGIPEASVRALAAGADVLLVGADAGRSPG--ERL 340 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 + + + V G AR A R+ +L ER+ Sbjct: 341 AAM--DAVAEAVRSGRVPM------ARLNAAVGRVLRLKERY 374 >UniRef50_C3J950 Glycosyl hydrolase, family 3 n=2 Tax=Bacteria RepID=C3J950_9PORP Length = 1001 Score = 285 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 84/340 (24%), Positives = 150/340 (44%), Gaps = 26/340 (7%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 P++ G A+ ++A GG++ +D Q + + + S+ L++ D Sbjct: 66 PIIYPSSGASQMADAARVVARCHAGGILFQKGEVYD--QWL-MTQHLNKTSKLPLLITAD 122 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 E G R R P ++ A + + L E G MA + + I ++FAPV+D Sbjct: 123 AEWGLAMRL-SNTIRYPRNRALANVRDL----GLITEWGRSMAEQCKTVGIHVNFAPVVD 177 Query: 123 VGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + + IG RS+ Q + + + G+ E + + KHFPGHG DSHK P Sbjct: 178 VNNNPKNPVIGTRSFGEKTQDVIDYSLAYSRGLEEGRVLSVAKHFPGHGNTALDSHKTLP 237 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQE 238 ++E+ +++ F++ R L IM H+ +D PAS S +L+QE Sbjct: 238 TIKGSRSELEKTELAPFAAYFRAG-LGGIMVGHLNVPALDKSGTPASLSYAITTQLLQQE 296 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 +GF G+IF+D + M+G G+ +L AG D++L + A + Sbjct: 297 MGFSGLIFTDGMQMQGMQQRGATP-ISVRALLAGNDLLLGPTDPVKAHA----------- 344 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 + +G SR+++ R K + + + + Q+ + Sbjct: 345 EILAAVQQGVLSRKQIEQHCR-KVLLYKWALMGQSLQQPQ 383 >UniRef50_Q1AUI8 Glycoside hydrolase, family 3-like protein n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AUI8_RUBXD Length = 386 Score = 285 bits (730), Expect = 1e-75, Method: Composition-based stats. Identities = 101/335 (30%), Positives = 159/335 (47%), Gaps = 22/335 (6%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----NRLVV 59 ++ + G D +++ VGG++LF N Q+R L +++ S L+V Sbjct: 67 VVSLGGTRSDYYIEKMVRERNVGGVLLFGYNMRSLPQVRRLTGELQRLSMATEPAVPLLV 126 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 AVDQEGG VQ + P+A A E A+ + SE++ ++ AP Sbjct: 127 AVDQEGGEVQS-APWVSPQPSAAEIGASGDPEA----ARRVARRIGSELLEAGVNTDLAP 181 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 V+D G AAIG RSY +P+ + S ++G EAG+ + KHFP HG T+DSH Sbjct: 182 VVDTGF-GAAIGSRSYGENPELVSRMGSAAVEGFREAGVISAAKHFPNHGPATSDSHTGL 240 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQE 238 P + ++D+ F++ +R + +M H++Y +D PAS SP ++ +LR+ Sbjct: 241 PIVDHDMDTVLSRDLPPFAAAVRAG-VPMVMVGHLLYPAIDQENPASMSPQAIR-LLRER 298 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LGFDGV+ +DDL+M GA GS A ++ AG DM++V + D L Sbjct: 299 LGFDGVVITDDLAMAGARQGGSVARAAVRAVSAGADMLIVSSPPPQQAEAYDAL------ 352 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 R G +++ S R L+ R Sbjct: 353 --VRAVRSGGIPERQVRASVRRIVEMKEDYNLYAR 385 >UniRef50_D2QKL5 Glycoside hydrolase family 3 domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QKL5_9SPHI Length = 1002 Score = 285 bits (730), Expect = 1e-75, Method: Composition-based stats. Identities = 77/347 (22%), Positives = 153/347 (44%), Gaps = 29/347 (8%) Query: 1 MGPVMLDVEGYE-----LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN 55 + +++ V GY + ++ +GG+++F P + +L +++A S Sbjct: 53 IAQLIM-VAGYSNRKPAYEDSLIRLVQTNKLGGVVMFQGG---PVRQAQLTNRLQAGSPV 108 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L++A+D E G R R P + A+ G L + G +A + + + + Sbjct: 109 PLLIAMDAEWGIAMRLDS-TVRYPYQMTLGAMQGAASDS-LIYQMGANLAKQARRLGMHV 166 Query: 116 SFAPVLDVGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 +FAP +DV + RS+ D A ++ GM + + T+ KHFPGHG Sbjct: 167 NFAPSVDVNNNPNNPVINFRSFGEDKYAVARKALAYMRGMQDNQLLTSIKHFPGHGDTGT 226 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYW 230 DSH + P + + ++ + ++ F LIR +M AH+ +D RP++ SP Sbjct: 227 DSHYDLPLIAKSRTQLDSLELYPFRELIRAG-ATGVMIAHLNIPALDTTRNRPSTLSPAI 285 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 + +L+ ELGF G++FSD ++M+ + + L+AG D++ + A++ Sbjct: 286 VTNLLKNELGFKGLVFSDAMNMKAVTKFYPSGKADELGLEAGMDVLEFTEDVPAALA--- 342 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEE 337 +V + G ++ + D+ K + + + ++ Sbjct: 343 --------QVKQAIVDGRITQASI-DARCLKVLQAKAWAGLDMYKPI 380 >UniRef50_Q1IQ59 Beta-N-acetylhexosaminidase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IQ59_ACIBL Length = 372 Score = 285 bits (730), Expect = 1e-75, Method: Composition-based stats. Identities = 93/329 (28%), Positives = 150/329 (45%), Gaps = 10/329 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ +G E+ A R +LA G I F RN Q +L + +AA L VD Sbjct: 17 MIVGFDGTEMSARVRTLLATIQPAGTIFFKRNVATAEQTWKLNYEAQAAVSTPLFRCVDL 76 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R R+ P+ + AA K+ + +A+E A+ + FAPV D+ Sbjct: 77 EGGTVDRLRDAVAPAPSLSNVAATGSK----KVMRRFARTLAAEARALGFNTDFAPVFDL 132 Query: 124 GHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + + R+ ADP+ + +A F+ G + + GKHFPG GA DSH E P Sbjct: 133 RTVESVKVLAGRTIAADPKHIIELAREFLKGFKDENVLGCGKHFPGLGAGAVDSHYELPT 192 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQEL 239 +P + +D+ + L E + M AH +Y + + PAS S +W+ +LR+++ Sbjct: 193 ISKPWKALWEEDLLPYRKLKDE--IAFAMVAHCVYPNATKEKAPASISRFWMTDILRKKI 250 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 GF +I SDD+ M+G S E ++ G D+ LVCNN + Sbjct: 251 GFKHLICSDDMEMKGVQKAVSIEEACIQAVRGGADLFLVCNNESLVWRCFHAVLREAERD 310 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTRLN 328 + + SR+ R +A+ + N Sbjct: 311 KSFAKQIAAASRRVFEFKKRSRAVRAKFN 339 >UniRef50_Q5LHV9 Possible beta-N-acetylglucosaminidase/beta-lactamase fusion protein n=36 Tax=Bacteroides RepID=Q5LHV9_BACFN Length = 996 Score = 284 bits (727), Expect = 3e-75, Method: Composition-based stats. Identities = 78/334 (23%), Positives = 138/334 (41%), Gaps = 33/334 (9%) Query: 2 GPVMLDVEGYELDAEEREIL----AHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL 57 G + + D + +++ VG + D L + S+ L Sbjct: 52 GQLFVYTLAPRADKDTEKLVGKLTRKFKVGAFLYSEGTVED---QANLTNYAQRQSKIPL 108 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++ D E G R E P + +S L + G +A E+ + ++F Sbjct: 109 MITFDGEWGLAMRL-ENTPVFPRNAALGCISDNT----LIEAYGQEVARELREIGAHVNF 163 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 AP DV + I RS+ +P+ + G+ G+ + KHFPGHG DS Sbjct: 164 APDADVNTNPENPVIHVRSFGENPKTVAEKVIAYGRGLETGGILSVSKHFPGHGDTDVDS 223 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLK 232 H+ P +A + + ++ F I + L +M H+ ++P P+S S + Sbjct: 224 HQALPAVYYNRARLDSVELYPFKEAI-QAGLGGVMVGHLQVPALEPDRITPSSLSHSIVT 282 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR ELGF+G++F+D L+M+G A ++ +L AG DM+LV N + A Sbjct: 283 DLLRGELGFNGLVFTDALAMKGVAAE---SDVTVKALKAGNDMVLVQQNVEKAQ------ 333 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 E V + G + +E+ R + ++ + Sbjct: 334 -----ESVVQAIKDGRLTMEEIDAKCR-RILAYK 361 >UniRef50_UPI0001C334F9 beta-glucosidase-like glycosyl hydrolase n=1 Tax=cyanobacterium UCYN-A RepID=UPI0001C334F9 Length = 521 Score = 283 bits (726), Expect = 4e-75, Method: Composition-based stats. Identities = 81/309 (26%), Positives = 135/309 (43%), Gaps = 21/309 (6%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE 73 + + L +GG+IL ++ RQ++ S L +A D E G QRF Sbjct: 24 NDKLFSWLNTLNIGGVILLGG---SAGEITLRTRQLQKLSNIPLFIAADIEEGVGQRF-S 79 Query: 74 GFTRLPAAQSFA--ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAIG 131 G T P + A ++ KLA G ++A E +++ I+ AP++D+ + Sbjct: 80 GGTHFPPPMALGEIAKHDLKTATKLAYSMGEMIAEEALSIGINWMLAPIVDINNNPNNPV 139 Query: 132 E--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEI 189 RS+ P +A FI G + TT KHFPGHG + DSH E P + Sbjct: 140 INIRSFGETPDVVSQLAKAFIKGAKNHPILTTAKHFPGHGNTSNDSHIELPVINHDSKRL 199 Query: 190 RAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELGFDGVIFSD 248 ++ F I K+D+IM AH++ + D PA+ S L LR++L F G+I +D Sbjct: 200 NNLELVPFKEAISL-KIDSIMSAHILINAWDRENPATLSKKILTKQLREKLNFKGLIITD 258 Query: 249 DLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGS 308 L M G + S E ++ AG +++L+ ++ + + + + G Sbjct: 259 ALMMGGISKYFSPKEIAIRAIKAGANVLLMPDDPENTIKAIYD-----------SVISGE 307 Query: 309 FSRQELMDS 317 + + S Sbjct: 308 LEIETIDQS 316 >UniRef50_C5SMQ9 Beta-N-acetylhexosaminidase n=2 Tax=Caulobacteraceae RepID=C5SMQ9_9CAUL Length = 363 Score = 283 bits (726), Expect = 4e-75, Method: Composition-based stats. Identities = 103/348 (29%), Positives = 166/348 (47%), Gaps = 19/348 (5%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-LVVA 60 + G EL A E+ A G ILF RN P Q+R L+ ++AA+ + +++ Sbjct: 11 SAAIYGCAGLELSAAEQAFFASKKPLGFILFKRNIDTPEQVRALISTLKAAAGHPDVLIL 70 Query: 61 VDQEGGRVQRFR-EGFTRLPAAQSFAALSGME-EGGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRF+ + P A ++ A++ E + + LMA ++ A+ +++ A Sbjct: 71 VDQEGGRVQRFKAPHWPSYPPAAAYEAVANSPFEQREYVRMGAQLMAHDLRALGVNVDCA 130 Query: 119 PVLDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLDV +G+R+Y + + +G G+ KH PGHG DSH Sbjct: 131 PVLDVPQAGTHDVVGDRAYGLTAETVAIMGRAAAEGFLAGGVLPVIKHAPGHGRAQVDSH 190 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 E P + E+ D FS + M AHV+Y+ +D R PA+ S +K V+ Sbjct: 191 HELPVVKARRDELEKVDFLPFSV---NADMPLAMTAHVVYTALDRRNPATTSKKVIK-VI 246 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+E+GFDG++ SDD+ M G R +ASL AGCD++L C+ + + P Sbjct: 247 REEVGFDGLLLSDDIGMGAL--KGPVGARARASLRAGCDIVLHCSGDLEEMKAVSVFVPT 304 Query: 296 KAERVTR------LYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQE 336 +V + L K + ++ R + + R + E +++ Sbjct: 305 IRGKVQQRLERALLRLKHKIEPLNIEEARGRLQVLLHRPVTVREEYED 352 >UniRef50_A4SCK6 Glycoside hydrolase, family 3 domain protein n=3 Tax=cellular organisms RepID=A4SCK6_PROVI Length = 595 Score = 283 bits (725), Expect = 5e-75, Method: Composition-based stats. Identities = 84/336 (25%), Positives = 152/336 (45%), Gaps = 24/336 (7%) Query: 1 MGPVML-----DVEGYELDAEER--EILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G +++ E +E+ ++ +GG++ + L +++ + Sbjct: 62 IGQMLIADSPGGFETDNTPQQEKLESLVRSGTIGGVMFLKGGMF---KTAMLANHLQSIA 118 Query: 54 RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L+++ D E G R EG T P A + AA + +LA G +A+E A+ I Sbjct: 119 PKPLLLSADMENGAAMRL-EGATEFPPAMAIAAT----QRPRLAAAMGAAIAAEAKAVGI 173 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 ++APV D+ + I RS+ P+ A+ + +I+G+ G+ T KHFPGHG V Sbjct: 174 RQNYAPVGDLNSNQMNPVINTRSFGDRPETAVRMTGAYIEGLQRGGLIATIKHFPGHGDV 233 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPY 229 DSH P P+ ++ ++ F + I + + ++M H+ + PA+ SP Sbjct: 234 HVDSHLALPVLSAPKEHLQQYELEPFRAAIGQG-VMSVMVGHLAVPAITGTMTPATLSPE 292 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + +LRQELGF G+I +D L+M+ + ++ AG D++L + L Sbjct: 293 IITGLLRQELGFKGLIITDALNMKALYNGSNIPGISVRAVLAGNDLLLFSPDPALTHRSL 352 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAIST 325 L +K+ +T S R +M + RW + Sbjct: 353 --LEAVKSGLITEERINQSVLR--IMQAKRWLKLQQ 384 >UniRef50_B2KAV5 Glycoside hydrolase family 3 domain protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KAV5_ELUMP Length = 541 Score = 283 bits (724), Expect = 7e-75, Method: Composition-based stats. Identities = 93/351 (26%), Positives = 163/351 (46%), Gaps = 34/351 (9%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RN 55 +G ++ + + + + LVGG+++ NY Q EL ++++ + + Sbjct: 31 LGQTLVAFVDTDNAHKYQSAIEKGLVGGVLVQWGNY-SLEQTTELAAKLQSWAAKSPHKI 89 Query: 56 RLVVAVDQEGGRV-QRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L++++D EGG V GF LP AA + E ++ AG E+ I Sbjct: 90 PLLISIDYEGGTVYTPVTLGFEYLPPNMMIAAANDEEAAARIFYLAGL----ELRKAGIH 145 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 I+F+PV+DV + IG RS+ + P+ + + + G+ A + + KHFPGHG Sbjct: 146 INFSPVVDVNINPGNPIIGVRSFGSSPELVGRMGAAVVSGLSAANVMSVAKHFPGHGNTV 205 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWL 231 DSH P + E++ ++ F I E + IM AH+IY + DP+ PA+ S L Sbjct: 206 LDSHYSLPVLNITKKEMQDVHLAPFKKAI-EAGVPGIMTAHIIYKNYDPKNPATYSKRIL 264 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR E+ F GVI SD L M+GA + G+ A +L+AG DM L+ + Sbjct: 265 NDLLRTEMKFKGVIISDALDMKGATLDGNIALSAAKTLEAGSDMALLGRFLNADKTFNKI 324 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER---WQEEKA 339 + E S++ + ++++ ++ L ++ + E+K Sbjct: 325 YGYVGTE----------LSQKRIEEASK------KILDLKKQMGLFDEQKE 359 >UniRef50_Q5ZW95 Glycosyl hydrolase family 3 n=6 Tax=Legionella RepID=Q5ZW95_LEGPH Length = 395 Score = 283 bits (724), Expect = 8e-75, Method: Composition-based stats. Identities = 85/366 (23%), Positives = 163/366 (44%), Gaps = 56/366 (15%) Query: 4 VMLDVEGYELDAE--EREILAHPLVGGLILFT---------RNYHDPAQLRELVRQIRAA 52 ++L +G +D + + + +GG+ILF +N P Q+ L Q++ Sbjct: 51 LILGFQGKVVDEKSPIAQAIEKNNIGGVILFDFNQQSQTFDKNIESPEQVNRLNNQLQTL 110 Query: 53 SRN----------RLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEA 100 ++ L+++VD EGG V R GF +P+A+ ++S + A + Sbjct: 111 TQKANRKYHRDNLPLLISVDYEGGNVNRLHPRYGFPEMPSAKDIGSMS-----LERADAS 165 Query: 101 GWLMASEMIAMDIDISFAPVLDVG--HISAAIG--ERSYHADPQKALAIASRFIDGMHEA 156 LMA+ + + +++F P LDV + IG +RS+ + P+ A + + + Sbjct: 166 AELMANTLKSTGFNLNFFPELDVNINPDNPIIGKKDRSFSSIPEIVTQYAQLYTEQFIKH 225 Query: 157 GMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE-NKLDAIMPAHVI 215 + + KHFPGHG+ +DSH +E +++ F L+ + N D +M AH++ Sbjct: 226 QIHCSYKHFPGHGSSLSDSHLGFVDISDTWSE---QELIPFLQLLSQPNHCDMVMIAHIV 282 Query: 216 YSDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGC 273 +D PAS S + +LR++L FDGV+ +DD+ M+ S++AG Sbjct: 283 NRKLDMTGLPASLSYQIINGLLRRDLKFDGVVITDDMQMKAITNYYGLETAVTLSINAGA 342 Query: 274 DMILVCNN----RKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 DM++ N + + ++D + + G S Q + ++ R+ + Sbjct: 343 DMLIFGNQLVEKFQDSTEIIDM--------IEQKVRSGEISEQRINEA------YQRIVK 388 Query: 330 LHERWQ 335 + + Sbjct: 389 MKRSFN 394 >UniRef50_C7QCS9 Glycoside hydrolase family 3 domain protein n=6 Tax=Actinomycetales RepID=C7QCS9_CATAD Length = 594 Score = 282 bits (723), Expect = 1e-74, Method: Composition-based stats. Identities = 84/318 (26%), Positives = 139/318 (43%), Gaps = 18/318 (5%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYH-DPAQLRELVRQIRAASRNRLVVAVDQEGGRVQR 70 + R L +GG+ + RN P + EL +RA + ++VAVD+EGG V R Sbjct: 24 RVPDWVRRALTEDGLGGVAYYGRNIAATPERTGELSAALRAEN-PHVLVAVDEEGGDVTR 82 Query: 71 FR-EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 + P + + E L + + + I++++AP D+ + Sbjct: 83 LDVATGSAFPGNYALGYVDDAE----LTETVARQIGRSLWRAGINLNYAPDADLNTNPDN 138 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 IG RS+ +DP A ++ ++ G+ AG+ KHFPGHG DSH P Sbjct: 139 PVIGTRSFGSDPVVAARHSAAYVRGLQAAGVAGCAKHFPGHGDTAVDSHHGLPLVQHSDK 198 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIF 246 E K ++ F + I E + ++M AH++ DP P++ S L +LR ELGF G++ Sbjct: 199 EW-EKHLAPFRAAI-EAGVKSVMTAHILAPRYDPEFPSTMSRKVLVDLLRGELGFTGLVV 256 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHK 306 +D + M A AE +L AG D + V + L + + + Sbjct: 257 TDGIEMAAIADTFGIAEGTVLALAAGVDAVCVGGGLRDEGDYL-----MLRDALVDAVRA 311 Query: 307 GSFSRQELMDSA-RWKAI 323 G + L D+A R + + Sbjct: 312 GRLPAERLHDAATRVREL 329 >UniRef50_B1YER8 Beta-N-acetylhexosaminidase n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YER8_EXIS2 Length = 412 Score = 282 bits (722), Expect = 1e-74, Method: Composition-based stats. Identities = 94/339 (27%), Positives = 153/339 (45%), Gaps = 10/339 (2%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA--SRNRLVVAVDQ 63 + V G L +R +L +GG++L N QL+ I A + + + D+ Sbjct: 72 IGVTGTALTQADRTLLQDHALGGVLLLGGNITSTDQLKTFTAAIAEAQPADAKAFLGFDE 131 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGGRV R + +L + SF + L +E G + S ++ FAPVLDV Sbjct: 132 EGGRVSRVPDAQLKLSPSLSFGQ----KNDPALMKEVGQTLGSISRFYGFNMDFAPVLDV 187 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG+R+ ADPQ+ + + GM + + KHFPGHG T DSH P Sbjct: 188 NSNPANPIIGDRALSADPQQVARLGVPLMQGMKKEDIVPVVKHFPGHGDTTVDSHVGLPR 247 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELG 240 +AE+ ++ F I E + +M AH+++ +D RP+S S + VLR EL Sbjct: 248 VDATKAELEQTELVPFKRAIEEGT-EMVMVAHILFPALDKNRPSSLSKPVMTGVLRDELK 306 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 FDGVI +DDL M A+ +L+ G DM + + ++ IK +++ Sbjct: 307 FDGVIITDDLVMGAITEQYGLAQAASLALENGADMAMFSAPGAYTSVHKEIMARIKQDKL 366 Query: 301 TRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 +R R + + + R+++ ++ KA Sbjct: 367 SRADLDQKVIRVLRLKQTYDLSQTKRVDEYQRLIEQVKA 405 >UniRef50_A8ETM1 Glycosyl hydrolase n=2 Tax=Campylobacterales RepID=A8ETM1_ARCB4 Length = 364 Score = 282 bits (721), Expect = 2e-74, Method: Composition-based stats. Identities = 87/347 (25%), Positives = 168/347 (48%), Gaps = 34/347 (9%) Query: 4 VMLDVEGYELD--AEEREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAA 52 V+L G +D E + + +GG+ILF + N D QL+ L Q++A Sbjct: 36 VILGFNGTSVDKNDEIYKNIQS-GLGGVILFDKDPNDKTKIKNIKDKNQLKNLNTQLQAI 94 Query: 53 SRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 S +L++++DQEGG VQR ++ + + +A G A+++ +A ++ + Sbjct: 95 SNQKLLISIDQEGGVVQRLKKDSGFV---DTLSAKDIATNGEDFAKQSYKALAKDLKDVG 151 Query: 113 IDISFAPVLD--VGHISAAI--GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGH 168 I++ FAPV+D + + I RS+ + + +S F+D + + + + KHFPGH Sbjct: 152 INLDFAPVVDLSINKNNKVIVTRGRSFGESSSEVIKYSSIFVDELRKQNVISVLKHFPGH 211 Query: 169 GAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGS 227 G+ ADSH + K++ + I+ N +D IM AHV ++D PA+ S Sbjct: 212 GSSLADSHLGFVDITKT---WNQKELEPYKYFIKNNDVDMIMTAHVFNENLDKNYPATLS 268 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 +LR++LG+ GV+ +DDL M + + ++++G +M+L N V+ Sbjct: 269 YNINTKLLREDLGYKGVLVTDDLQMSAISKHYDLKQTLTLAINSGVNMLLFANQLAKPVT 328 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 + + + + + S Q+++DS + ++++L ++ Sbjct: 329 LKEIVDTVYLQ-----IQNKQISLQKIIDS------NNKIDKLLKKI 364 >UniRef50_B6BGC2 B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BGC2_9PROT Length = 555 Score = 281 bits (720), Expect = 2e-74, Method: Composition-based stats. Identities = 99/348 (28%), Positives = 166/348 (47%), Gaps = 38/348 (10%) Query: 4 VMLDVEGYELD--AEEREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAA 52 +++ E ++ E E + +GG++LF R N P QL+EL +++ Sbjct: 29 LVVGFENQSVNNNTEIVEYIQQYELGGVVLFDRFYKKRSQIKNIDTPEQLQELTAKLQKF 88 Query: 53 SRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 SR L+++VDQEGG + R + GF R P+A+ + LS + A++ + + Sbjct: 89 SRKPLLISVDQEGGNIARLKHYYGFDRFPSAKEVSYLSLTQ-----AKDIYNRQSKMLSN 143 Query: 111 MDIDISFAPV--LDVGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 +I+ +FAPV LD+ + I E RS+ D +K A ID + + + KHFP Sbjct: 144 NNINCNFAPVVDLDINPKNKVISELERSFGKDSEKVTHYAKIVIDSQTKHNVISVLKHFP 203 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PAS 225 GHG+ DSHK K++ + +LI + D IM AHV S++D PA+ Sbjct: 204 GHGSSLGDSHKGFVDIS---NTWSQKELVPYQNLINSKEADMIMSAHVFNSNLDALYPAT 260 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S +LRQ++GF GV+ SDDL M+ + E ++++G D++L N Sbjct: 261 LSYSTNTKLLRQKMGFTGVLISDDLQMKAITDHYNLKESVTLAINSGVDILLFGN----- 315 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + N E + G+ S++ +++S R R+ LH + Sbjct: 316 -QLASNDVKELVEIIYAQVKSGAISKKRIIESNR------RIENLHTK 356 >UniRef50_Q982A1 Glycosyl hyrolase, family 3 n=2 Tax=Mesorhizobium RepID=Q982A1_RHILO Length = 394 Score = 281 bits (720), Expect = 2e-74, Method: Composition-based stats. Identities = 93/312 (29%), Positives = 157/312 (50%), Gaps = 20/312 (6%) Query: 4 VMLDVEGYELDAE-EREILAHPLVG---GLILFTRNYHDPAQLRELVRQIRAASRNRLVV 59 V++ G D + + ++ G G+I RN +R+L + ++ + L+V Sbjct: 54 VLVGFVGDSPDDDGYKRVVKQAEAGEITGVIYLGRNISSLEAVRQLNKGLQQYATAPLLV 113 Query: 60 AVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 A+DQEGGR+QR GF +P+ + A +E + Q+ +AS++ ++ +++ Sbjct: 114 AIDQEGGRIQRLTGEVGFREVPSEATVAKTMSPDEASVVYQD----LASDLSSLGFNLNL 169 Query: 118 APVLD--VGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 APV+D V + IG RS+ ADPQK A A FI+ G+ T KHFPGHG+ T Sbjct: 170 APVVDLNVNPANPIIGKLGRSFSADPQKVEAYARAFIEAHRAKGVLTALKHFPGHGSSTK 229 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYW 230 DSHK + K++ + LI +D +M AHVI + + PAS S Sbjct: 230 DSHKGIADVTKT---WSDKELLPYKGLIASGDVDMVMAAHVINAKLAFSPDIPASLSRAT 286 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 L T+LR + F GV+ SDD+ M+ S+ + ++ AG D+++ N++ ++ + Sbjct: 287 LTTLLRDNMHFKGVVISDDMQMQAITDNVSFEDSVLRAVMAGNDILIFANDKHPDPAIPE 346 Query: 291 NLSPIKAERVTR 302 ++ I +E R Sbjct: 347 KVAAILSEEAQR 358 >UniRef50_B1ZLT8 Beta-N-acetylhexosaminidase n=61 Tax=Alphaproteobacteria RepID=B1ZLT8_METPB Length = 351 Score = 281 bits (719), Expect = 3e-74, Method: Composition-based stats. Identities = 96/297 (32%), Positives = 144/297 (48%), Gaps = 12/297 (4%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA-ASRNRLVVAV 61 V+L G L EE G ILF RN P +R L +R R + + Sbjct: 19 AVILGCSGTALTPEEAAFFRDLRPWGFILFKRNIGTPDAVRALTAALRETVGRADAPILI 78 Query: 62 DQEGGRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 DQEGGRVQR + + PA F AL E A+ LMA ++ + I++ AP+ Sbjct: 79 DQEGGRVQRMGPPHWPKYPAGGRFDALGD--EAAATARLGARLMAHDLSEVGINVDCAPM 136 Query: 121 LDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LDV IG+R+Y P + + + +G+ G+ KH PGHG T DSH + Sbjct: 137 LDVPAAGSDNIIGDRAYGDTPGRVIELGRAVAEGLMAGGVLPVIKHIPGHGRATCDSHLD 196 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQ 237 P + + D F +L + M AHV+++ +DP RPA+ S ++ V+R Sbjct: 197 LPVVEADRVSLGESDFRPFRAL---ADMPLAMSAHVVFNALDPERPATLSERVMREVVRG 253 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 E+GFDG++ +DDLSM G + +R +A+ AG D++L CN R + + +P Sbjct: 254 EIGFDGLLMTDDLSMHAL--HGPFRDRAEAAYRAGIDVVLHCNGRMDEMRAVAEATP 308 >UniRef50_C1A778 Beta-N-acetylhexosaminidase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A778_GEMAT Length = 658 Score = 280 bits (717), Expect = 5e-74, Method: Composition-based stats. Identities = 81/318 (25%), Positives = 137/318 (43%), Gaps = 34/318 (10%) Query: 17 EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGF- 75 + +GG+I+ + PA + V ++ +++ L+V D E G R R G+ Sbjct: 119 IERQVRELELGGIII---SVGGPADIALKVNALQRSAKLPLLVGADLETGAAFRARGGWF 175 Query: 76 ----------TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 T P A LA E G + A E AM I ++FAPVLDV + Sbjct: 176 LPNAIELGGATSFPYQMGIGAT----RDTMLAYEMGRVTAVEGRAMGIHMAFAPVLDVNN 231 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I RS+ DP + + + G+ E GM TGKHFPGHG +SH E Sbjct: 232 NPKNPVISGRSFGEDPALVSRMGAALVRGIQENGMLATGKHFPGHGDTDQNSHLELSRVN 291 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRP--ASGSPYWLKTVLRQELGF 241 +A + + ++ F + I + + IM H +D A+ S + +L++++ F Sbjct: 292 VSRARLDSVELRPFQAAI-DAGVRGIMTFHGDLPALDTTHTAATLSRAVMTDLLKRQMRF 350 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 +G++ +D L M G + E +L+AG D++L+ + + ++ + Sbjct: 351 NGLLITDALDMNGVLGNLTMQEATLRALEAGNDVLLMPTDARASIQAM-----------V 399 Query: 302 RLYHKGSFSRQELMDSAR 319 G S L +S R Sbjct: 400 DAVASGRVSEARLDESVR 417 >UniRef50_A2E8H3 Glycosyl hydrolase family 3 N terminal domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2E8H3_TRIVA Length = 474 Score = 280 bits (717), Expect = 5e-74, Method: Composition-based stats. Identities = 83/303 (27%), Positives = 138/303 (45%), Gaps = 20/303 (6%) Query: 32 FTRNYHDPAQLRELVRQIRAASRN----RLVVAVDQEGGRVQRFREGFTRLPAAQSFAAL 87 RN + Q+R L I+ S N ++++DQE G R T P++ + AA Sbjct: 1 MGRNLRNQEQIRNLTATIQNYSYNQYNLPSLISIDQESGSNIRMFRNSTVFPSSMTIAAT 60 Query: 88 SGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAI 145 + +E A ++ +L A E+ + I+++FAPV DV + +G+RS+ +P Sbjct: 61 NNLEN----AYKSSYLQALELNDVGINLNFAPVSDVNSNIHNPIVGDRSFGDNPNVVSKY 116 Query: 146 ASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENK 205 +I G +AG+ KHFPGHG + DSH E P R ++ D+ F + I Sbjct: 117 VQNYIKGHKKAGVLCCSKHFPGHGETSQDSHYELPRINRTFESLKEIDLPPFKASINYG- 175 Query: 206 LDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERG 265 +D IM H+I + P S S + LR ELG++GV +D + M+ Y E Sbjct: 176 VDMIMLGHLITPLDETMPCSTSSKCIN-YLRNELGYEGVTITDSMVMKAIKQEK-YNETI 233 Query: 266 QASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAIST 325 ++ AG D+I C V + + + KG + + +S + + I Sbjct: 234 LEAILAGADLICSCG------KVFEPFQYYTIDYLMECVEKGLLPMKRVEESLK-RIIKL 286 Query: 326 RLN 328 +L+ Sbjct: 287 KLS 289 >UniRef50_Q1IHT6 Glycoside hydrolase, family 3-like n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IHT6_ACIBL Length = 624 Score = 280 bits (716), Expect = 6e-74, Method: Composition-based stats. Identities = 87/326 (26%), Positives = 154/326 (47%), Gaps = 36/326 (11%) Query: 18 REILAHPLVGGLILFTRNYHD-------PAQLRELVRQIRAASRNRLVVAVDQEGGRVQR 70 R+ +A +GG + N+ D P + ++ +++ S L++A D E G R Sbjct: 78 RDTMARYHLGGFGVTV-NFEDGFLFKTEPYEAAMMINELQQGSEIPLIIAADFERGLSMR 136 Query: 71 FREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SA 128 +E T P A +F A + A+E G + A E A+ ++ ++ P DV + Sbjct: 137 LKEA-TDFPHAMAFGATNNP----AYAKEFGRITALESRAIGVEWNWFPDADVNSNPANP 191 Query: 129 AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAE 188 I RS+ DPQ ++ +I+G H G+ TT KHFPGHG D+H T +P Sbjct: 192 IINTRSFGEDPQAVASMVKAYIEGAHAEGLLTTVKHFPGHGDTDTDTHLATARINQPLEH 251 Query: 189 IRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLRQELGFDGVI 245 I+ ++ F + I + +D++M H++ +DP R A+ SP + L+++LGF G++ Sbjct: 252 IQNVELVPFKAAI-DAGVDSVMIGHLVVPALDPDTNRVATISPKIVNGTLKKDLGFQGLV 310 Query: 246 FSDDLSMEGAAIMGSYAERG-----QASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 +D + M G A + + G A++ AG DM+L+ ++ GA E + Sbjct: 311 VTDAMEMNGLAKLFGFGPEGSARAAVAAVKAGDDMLLLPSDLDGAY-----------EGL 359 Query: 301 TRLYHKGSFSRQELMDSARWKAISTR 326 + +G + +S R K + + Sbjct: 360 IKAVKRGEIPESRIDESVR-KILRMK 384 >UniRef50_C1F257 Glycosyl hydrolase, family 3 n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F257_ACIC5 Length = 361 Score = 279 bits (715), Expect = 8e-74, Method: Composition-based stats. Identities = 91/327 (27%), Positives = 148/327 (45%), Gaps = 10/327 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +M+ + G L ER L G+ILF RN Q L+R+ + VD Sbjct: 16 MMVGLSGTALSELERAWLKLIRPTGIILFRRNIETIPQTVALLREASRIIDRPTLRCVDM 75 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R R+ T +P+ A ++ G L+ + + APV+D+ Sbjct: 76 EGGMVDRLRDVLTPMPSPARVFAT----RDPQMFYRHGKLIGKAAHMAGFNATLAPVVDL 131 Query: 124 GHISA--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 ++ + R A P + A F++G+ + KHFPG G DSH P Sbjct: 132 ALPNSAGVMKTRVVSALPHEVRQYALTFLEGLETEEVFGCAKHFPGLGGGIVDSHLTMPV 191 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQEL 239 R A++ +D++ F SL R ++ M AH Y V + PAS SP+W++ +LR+ + Sbjct: 192 IDRKWADLWNEDLAPFRSLAR--RVPITMVAHAAYPLVTRNHTPASLSPHWIQKILRKRV 249 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 GF G++ SDD+ M G S E ++ AG D++ +C+N ++ + L +R Sbjct: 250 GFRGLVLSDDMEMGGVLSHCSMGEAVIHAIAAGTDVVEICHNPALILTAYEALIREAEQR 309 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTR 326 V + +R+ + RW I T+ Sbjct: 310 VAFRRQLTAAARRVEEWARRWSLIQTQ 336 >UniRef50_C0BK44 Glycoside hydrolase family 3 domain protein n=1 Tax=Flavobacteria bacterium MS024-2A RepID=C0BK44_9BACT Length = 969 Score = 279 bits (715), Expect = 9e-74, Method: Composition-based stats. Identities = 85/334 (25%), Positives = 152/334 (45%), Gaps = 29/334 (8%) Query: 1 MGPVMLDVEGYELDA----EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + + E D+ + ++ + VGGL+ + P + + ++ ++ Sbjct: 52 IGQLFMPMVFSERDSAHYNQTLNLVKNEKVGGLVF---SLGGPVGQSQWLNSFQSHAKTP 108 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L+VA+D E G R P + A+ L + G M + + I S Sbjct: 109 LLVAMDAEWGVAMRLDS-VKAFPWPMTLGAVKD----SVLIRAIGKRMGEQEKRLGIQYS 163 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 F+PVLD+ + IG RSY + ++ A + G H+AG+ T+GKHFPGHG + D Sbjct: 164 FSPVLDINTNANNPIIGNRSYGSSNERVSRQALAIMQGHHDAGVLTSGKHFPGHGDTSQD 223 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLK 232 SHK P I +++ + +LI E L ++M AH+ + + P S S ++ Sbjct: 224 SHKTLPTVNFSADRINTIELAPYKALI-EKGLSSVMVAHLNVPSITKEGLPTSLSSDVIQ 282 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +L+++LGF G+I +D L+M+G + + AG D++L+ NN ++ + N Sbjct: 283 GLLKRDLGFKGLIVTDALNMKGVSEYSKVDNIDLTAFLAGHDLLLISNNIYEGINAIKN- 341 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 Y KG + L S + K + + Sbjct: 342 ----------AYRKGKVTEARLAYSVK-KILRAK 364 >UniRef50_A4C3R7 Beta-glucosidase-related glycosidase n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C3R7_9GAMM Length = 538 Score = 279 bits (714), Expect = 1e-73, Method: Composition-based stats. Identities = 70/345 (20%), Positives = 141/345 (40%), Gaps = 18/345 (5%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA-----ASRN 55 G + + G++ + + ++ +GG + N D A +L ++ A Sbjct: 21 GQLFVLAFAGHDYEYAQ-HLIQTHHIGGFYITDDNAADLASAAKLANTLQQLAALRACDA 79 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L++ VDQEG + P + + + L + + A +M ++ + Sbjct: 80 PLILGVDQEGAWGI-LTQQTDLGPGNLALGKANDL----ALTYDMYRVFAQQMQSIGYNT 134 Query: 116 SFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 +P DV + IG+R++ + + + + G+ KHFPGHG Sbjct: 135 LLSPCADVNADPKNPIIGQRAFGETAEHVASHVAAAVKGILSTDNFACAKHFPGHGDTHT 194 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLK 232 DSH+ P + E+ +D+ F I E + IM +H+ Y +D PA+ S L Sbjct: 195 DSHQALPVVNKSLNEMMQQDLLPFKRAI-EAGVSMIMTSHINYPKIDAHYPATLSATILT 253 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC-NNRKGAVSVLDN 291 T+L+++LGF G+I +D ++M S + +L AG +I++ + + + + Sbjct: 254 TLLKEQLGFKGLIITDSMNMWAMRKNFSPVDAAVQALKAGAHLIMLSEEHYENSTTAYKE 313 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 + V +G + L+D ++ + + L + Sbjct: 314 IQAQTIAGVIDAVQQGELG-ETLIDDILKHVLAYKYSYLKADAKA 357 >UniRef50_A7ZD64 Beta-hexosaminidase A (N-acetyl-beta-glucosaminidase) (Beta-N-acetylhexosaminidase) (Chitobiase) n=2 Tax=Campylobacter RepID=A7ZD64_CAMC1 Length = 351 Score = 278 bits (713), Expect = 2e-73, Method: Composition-based stats. Identities = 88/335 (26%), Positives = 156/335 (46%), Gaps = 26/335 (7%) Query: 4 VMLDVEGYEL-DAEEREILAHP---LVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVV 59 +M+ G DA R +L+ GG++L RN AQL+ ++ I+ S + + Sbjct: 32 IMVGFNGASTKDAAFRAMLSDAGYERFGGVMLLGRNVTSKAQLKASIKAIKEKSPK-IFI 90 Query: 60 AVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 A+D+EGG V R ++ P+A A+ ++ A + MA + I+++F Sbjct: 91 AIDEEGGNVSRMKDKSFDGPYPSAYEVASTLDIKS----AYDLYSKMAINLKECGINLNF 146 Query: 118 APVLDVGHISAAI---GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 APV+D+ ++ I +R++ K + A F+D E G+ TT KHFPGHG+ D Sbjct: 147 APVVDLHDENSPIIAAKQRAFSEYASKVVIYADAFMDAFKEQGILTTLKHFPGHGSSKED 206 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKT 233 SHK + + + + + I + IM H+ +D PA+ S + Sbjct: 207 SHK-----NKSEVTLSKDALLPYKDAISTGRAQIIMVGHLFVKGIDDDNPATLSKKIITD 261 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR EL F+GV+ SDD+ M+G A++ ++AG D++L + D ++ Sbjct: 262 LLRNELKFNGVVISDDMLMKGVGDEA-LAQKVVKFINAGGDILLFSEFKINNQRTADLIT 320 Query: 294 PIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRL 327 I + ++ S++ + S R A+ +L Sbjct: 321 QI----IIDAVNEKKISKERIDASYKRIMALKAKL 351 >UniRef50_A9WLT0 Predicted O-Glycosyl hydrolase, family 3 n=1 Tax=Renibacterium salmoninarum ATCC 33209 RepID=A9WLT0_RENSM Length = 486 Score = 278 bits (713), Expect = 2e-73, Method: Composition-based stats. Identities = 91/339 (26%), Positives = 156/339 (46%), Gaps = 17/339 (5%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G + G + + L+ + G++ F +N QL++L QIR + + ++ V Sbjct: 12 GVIWPGFLGSSVPDWLNDALSD-GLAGVVYFRQNIESAKQLKDLSAQIRTLNPS-AIIGV 69 Query: 62 DQEGGRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D+EGG V R E ++LP AA+ G + + + AG + I+++ APV Sbjct: 70 DEEGGNVTRLQTEHGSQLPG----AAVLGRRDELAVTEAAGTAIGRLCRQSGINLAIAPV 125 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG RS+ AD + + G+ + G+ KHFPGHG DSH Sbjct: 126 ADVNTNPANPVIGVRSFGADTGLVARHTAAMVRGIQKTGIAACAKHFPGHGDTHTDSHLG 185 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P A++R + + F + E ++A+M AH++ +D+ PA+ + L+ +LR E Sbjct: 186 LPRLEFDVAQLREQHLPPFQAA-SEAGVEAVMTAHIVLADLGEEPATTNSSTLQ-MLR-E 242 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC---NNRKGAVSVLDNLSPI 295 GFDGVI +D L M ++ AG D++ V N R A D + Sbjct: 243 SGFDGVIITDALDMAAIRETVGSGAGAVKAILAGTDLLCVGNPANPRASADPEPDRTDYL 302 Query: 296 KA-ERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHE 332 + + + G+ R+ L + R ++++TR+ E Sbjct: 303 EVRDALLAAVQSGALPRERLKAAYQRNQSLATRIALATE 341 >UniRef50_A3HZ06 Possible beta-N-acetylglucosaminidase/beta-lactamase fusion protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ06_9SPHI Length = 520 Score = 278 bits (712), Expect = 2e-73, Method: Composition-based stats. Identities = 87/341 (25%), Positives = 150/341 (43%), Gaps = 29/341 (8%) Query: 19 EILAHPLVGGLILF------TRNYH----------DPAQLRELVRQIRAASRNRLVVAVD 62 E++ VGGL F N+ +L EL+ + + AS+ L++++D Sbjct: 33 ELIRLEEVGGLTFFHSRISAAANFEKRQEVLDVEGTLEKLVELINRFQQASKTPLIISID 92 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 E G R E P A S +S EG K +E + M EM + I ++ AP D Sbjct: 93 GEFGLAMRI-ENTPIYPYAISLGGISL--EGEKQVEEVAYKMGLEMKTVGIHLNLAPCAD 149 Query: 123 VGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG RSY +D +A G++ AG+ KHFPGHG DSH P Sbjct: 150 VNTNPLNPVIGYRSYGSDTNHVARLALAAYKGLNRAGIGACLKHFPGHGDTQVDSHLNLP 209 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLRQ 237 + + E+ +++ F I ++ ++ IM H+ + AS S + +L++ Sbjct: 210 VVNKSKKELVEEELYPFQYGI-DHDVEMIMVGHLAVPALSNGAEISASISREIITDLLKK 268 Query: 238 ELGFDGVIFSDDLSMEGAA-IMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 E+GF G++ SD L+M+ + + + + + AG D++ N K + + L Sbjct: 269 EMGFQGIVISDALNMKAVSRLFDTPGQLEWEAFHAGNDILCFSENVKEGIDFI--LEKAT 326 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTRLN-QLHERWQE 336 + + K +++L R A + N + HE++ E Sbjct: 327 EAEINASFEKLQDLKEKLGVFERRPAEVPKFNWESHEKFLE 367 >UniRef50_Q0BSP6 Anhydromuramoyl-peptide exo-beta-N-acetylglucosaminidase n=2 Tax=Acetobacteraceae RepID=Q0BSP6_GRABC Length = 348 Score = 278 bits (711), Expect = 3e-73, Method: Composition-based stats. Identities = 103/351 (29%), Positives = 161/351 (45%), Gaps = 16/351 (4%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M ++ + G L + E +L G+ILF RN DPAQL+ L ++R+ + + Sbjct: 1 MKAAIIGLSGPTLTSAEEAMLHTQRPAGVILFRRNISDPAQLKALTARLRSILPSEAALL 60 Query: 61 VDQEGGRVQRFREGFTRL-PAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISF 117 VDQEGGRV R R R P A L ++ L A G L+ + + D+ Sbjct: 61 VDQEGGRVARLRPPHWRSHPPAAVIGRLYEADKKAGLRAAWLTGALIGLDCMGAGFDVVC 120 Query: 118 APVLD--VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APVLD V +G+R+Y DP + ++G+ AG++ KH PGHG ADS Sbjct: 121 APVLDLLVPGAHDIVGDRAYGTDPGTVTLLGGAVMEGLLAAGVRPVMKHVPGHGRAHADS 180 Query: 176 HKETPCDPRPQAEIRAKDMSVF-SSLIRENKLDA-IMPAHVIYSDVDP-RPASGSPYWLK 232 H E P P ++ F + R L M AH++Y D RPA+ SP + Sbjct: 181 HLELPVVPDEL----PAELEPFTRNAARFAGLPVWAMTAHILYEAWDRLRPATLSPVVIG 236 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 ++RQ +GF+G++ SDD++M+ + G + +L AGCD++L C + + Sbjct: 237 QIIRQRIGFNGLLVSDDIAMKALS--GDPVSLTRQTLAAGCDVVLHCTGVQAETQAVLEG 294 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE--RWQEEKAGH 341 P+ ++ L + + D RL L + R ++ GH Sbjct: 295 CPLLTDQAHTLLAARPVASISIGDGESLAQERERLGLLEDGIRAKDPTEGH 345 >UniRef50_C8WFM5 Beta-N-acetylhexosaminidase n=3 Tax=Zymomonas mobilis RepID=C8WFM5_ZYMMN Length = 344 Score = 278 bits (711), Expect = 3e-73, Method: Composition-based stats. Identities = 96/336 (28%), Positives = 156/336 (46%), Gaps = 16/336 (4%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M PV + + L +E+++ G ILF RN D QL++L +RA S+ + + Sbjct: 1 MKPVFIGIADTRLSDQEKKLFRQYEPAGYILFQRNITDRTQLKKLTADLRANSQQNIPIL 60 Query: 61 VDQEGGRVQRFREG-FTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISF 117 +DQEGGRV R + + PAA F L + + +A + + I++ Sbjct: 61 IDQEGGRVARMKPPVWPAFPAAARFDELYQKAPVSAIEAVRANSHALALLLQDVGINVDC 120 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 P+LD+ IG+RS+ +P++ A+ +DG+ + + KH PGHG DS Sbjct: 121 MPLLDIRDPEGDNIIGDRSFGHNPEQVAALGRAVLDGLAKGSVCGIIKHIPGHGQAKVDS 180 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRP-ASGSPYWLKTV 234 HK P P EI +D+ F L M AH+IY D AS S + + Sbjct: 181 HKALPVVDSPI-EILERDIRPFQHL---RNAKMAMTAHIIYQAWDKESCASYSSKIIHDI 236 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC----NNRKGAVSVLD 290 +R+++GF G++ SDD+ M G R DAGCD+ L C ++ K + +D Sbjct: 237 IRKKIGFKGLLMSDDIGMNAL--DGDPVTRALRVFDAGCDVALHCSGKFDDMKAILENMD 294 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + P + ++ ++ S + +D A KA+ R Sbjct: 295 EIRPESFAALEKVTNEFSSASDFDLDRAENKALFDR 330 >UniRef50_B3C6M6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B3C6M6_9BACE Length = 1051 Score = 277 bits (709), Expect = 4e-73, Method: Composition-based stats. Identities = 84/352 (23%), Positives = 141/352 (40%), Gaps = 33/352 (9%) Query: 1 MGPVMLDVEGYELDAEEREILAH----PLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + + D R++L VGGL LF+ L + + + Sbjct: 68 IGQLFIYTIAPQQDKANRDLLRKVVDGYKVGGL-LFSGGLM--ENQAILTNEAQKMADVP 124 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++ D E G R R G P + + L E G MA + + + ++ Sbjct: 125 LMITFDGEWGLSMRLR-GTPVFPRNMILGCI----QNDSLLYEYGREMARQCRELGVQVN 179 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV DV + I RS+ P + G+ E G+ + KHFPGHG D Sbjct: 180 FAPVADVNINPKNPVINTRSFGESPANVADKVIAYARGLEEGGVLSVSKHFPGHGDTDVD 239 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWL 231 SH P +A + + ++ F I + L +M H+ ++ P+S S + Sbjct: 240 SHHSLPKLTFSRARLDSVELYPFKKAI-QAGLSGMMVGHLEVPILESKRGVPSSLSRSVV 298 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL-- 289 +L QE+ F G++F+D L+M+G + + +L AG D++LV K V + Sbjct: 299 HDLLTQEMKFQGLVFTDALAMKGVSANNT--SICLQALKAGHDLLLVPRRIKEEVDAILD 356 Query: 290 --------DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + K +V + S++ R +S R+N H R Sbjct: 357 AVKNGELTEAEIEAKCRKVLTYKYALGLSKKPF---VRLSGLSNRINTAHTR 405 >UniRef50_UPI0001B588EB hydrolase n=1 Tax=Streptomyces sp. SPB78 RepID=UPI0001B588EB Length = 518 Score = 277 bits (709), Expect = 4e-73, Method: Composition-based stats. Identities = 87/315 (27%), Positives = 137/315 (43%), Gaps = 18/315 (5%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G + G + + ++LF+RN P Q+ L ++RA + LV+A Sbjct: 12 LGVLQPGFVGTTAPEWLLAAIRD-GLASVVLFSRNIETPEQVAALTTRLRAENPE-LVIA 69 Query: 61 VDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +D+E G V R R G TR P + L +AQ+ + + A + + +A Sbjct: 70 IDEESGDVTRIESRTGSTR-PGNHALGYLDDTATTHAVAQD----LGHTLRAAGVSLDYA 124 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P DV + IG RS+ ADP S +I G+ AG+ KHFPGHG V DSH Sbjct: 125 PSADVNSNPDNPIIGVRSFGADPDSVARHTSAWIRGLQSAGVAACAKHFPGHGDVAVDSH 184 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVL 235 P +I + + F + I M AH++ +DP RPA+ S L T+L Sbjct: 185 HGLPVYEASDRDIARQALPPFRAAIAAGARAI-MSAHLLVPALDPHRPATLSRRILTTLL 243 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+ELG++G+I +D + M A ++ AG D I V + + ++ Sbjct: 244 REELGYEGLIVTDAIEMGAVADEYGIEGAAVRAITAGADAICVGGE-----NASEGITHS 298 Query: 296 KAERVTRLYHKGSFS 310 A+ + G Sbjct: 299 LADALVEAVLGGDLP 313 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q8RD12 Beta-glucosidase-related glycosidases n=2 Tax=Th... 412 e-113 UniRef50_UPI0001693BCF Beta-glucosidase-related glycosidase n=1 ... 409 e-113 UniRef50_B0K1P2 Glycoside hydrolase, family 3 domain protein n=9... 402 e-111 UniRef50_C5DA89 Glycoside hydrolase family 3 domain protein n=6 ... 402 e-110 UniRef50_C6J2I0 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J... 402 e-110 UniRef50_Q8CX63 Beta-N-acetylhexosaminidase (Beta-hexosaminidase... 396 e-109 UniRef50_A5CZV7 Beta-glucosidase-related glycosidases and D-alan... 395 e-108 UniRef50_D2M1Q8 Glycoside hydrolase family 3 domain protein n=1 ... 393 e-108 UniRef50_C4KZ16 Beta-N-acetylhexosaminidase n=2 Tax=Firmicutes R... 390 e-107 UniRef50_A9NEX4 Glycoside hydrolase, family 3 n=1 Tax=Acholeplas... 390 e-107 UniRef50_Q8ETJ0 Beta-hexosaminidase n=1 Tax=Oceanobacillus iheye... 389 e-107 UniRef50_UPI0001C42D84 glycoside hydrolase family 3 domain prote... 388 e-106 UniRef50_A6TMT8 Glycoside hydrolase, family 3 domain protein n=1... 388 e-106 UniRef50_P40406 Uncharacterized lipoprotein ybbD n=5 Tax=Bacillu... 387 e-106 UniRef50_B9Y750 Putative uncharacterized protein n=1 Tax=Holdema... 387 e-106 UniRef50_A3I7T1 Beta-hexosaminidase n=2 Tax=Bacillaceae RepID=A3... 385 e-106 UniRef50_D2MLA9 Beta-N-acetylglucosaminidase n=1 Tax=Candidatus ... 385 e-106 UniRef50_A7FH41 Beta-hexosaminidase n=129 Tax=Gammaproteobacteri... 382 e-105 UniRef50_C7IJM6 Glycoside hydrolase family 3 domain protein n=1 ... 382 e-104 UniRef50_C7P2G4 Glycoside hydrolase family 3 domain protein n=1 ... 380 e-104 UniRef50_B5Y6T8 Beta-N-Acetylglucosaminidase n=2 Tax=Firmicutes ... 378 e-103 UniRef50_C0ZKK5 Probable beta-hexosaminidase n=1 Tax=Brevibacill... 377 e-103 UniRef50_B1YFC7 Glycoside hydrolase family 3 domain protein n=6 ... 377 e-103 UniRef50_Q8ERK3 Beta-N-acetylglucosaminidase n=2 Tax=Bacillaceae... 377 e-103 UniRef50_C6J1S3 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J... 377 e-103 UniRef50_A2QDM1 Contig An02c0230, complete genome n=9 Tax=Tricho... 376 e-103 UniRef50_A3DC81 Glycoside hydrolase, family 3-like protein n=3 T... 375 e-102 UniRef50_D1CIC2 Glycoside hydrolase family 3 domain protein n=1 ... 375 e-102 UniRef50_C1I263 Beta-hexosamidase A n=1 Tax=Clostridium sp. 7_2_... 375 e-102 UniRef50_C1XLF1 Beta-glucosidase-like glycosyl hydrolase n=2 Tax... 375 e-102 UniRef50_C6D1D6 Glycoside hydrolase family 3 domain protein n=1 ... 373 e-102 UniRef50_B6Q1F2 Beta-N-acetylglucosaminidase, putative n=1 Tax=P... 369 e-101 UniRef50_C1TNN0 Beta-glucosidase-like glycosyl hydrolase n=1 Tax... 369 e-101 UniRef50_A8VQ50 Choline dehydrogenase n=1 Tax=Bacillus selenitir... 369 e-101 UniRef50_B8FZS5 Glycoside hydrolase family 3 domain protein n=2 ... 369 e-101 UniRef50_B4BR59 Glycoside hydrolase family 3 domain protein n=2 ... 368 e-100 UniRef50_C7MQM3 Beta-glucosidase-like glycosyl hydrolase n=2 Tax... 367 e-100 UniRef50_A8MJC3 Glycoside hydrolase family 3 domain protein n=2 ... 367 e-100 UniRef50_A9KMQ8 Glycoside hydrolase family 3 domain protein n=1 ... 365 1e-99 UniRef50_C8PT26 Glycosyl hydrolase, family 3 n=1 Tax=Treponema v... 363 4e-99 UniRef50_Q1CXY4 Glycosyl hyrolase, family 3 n=6 Tax=Cystobacteri... 363 6e-99 UniRef50_B8M4K9 Beta-N-acetylglucosaminidase, putative n=9 Tax=T... 363 6e-99 UniRef50_A3CMS4 Beta-hexosamidase A, putative n=1 Tax=Streptococ... 363 6e-99 UniRef50_D1CEL0 Glycoside hydrolase family 3 domain protein n=1 ... 362 9e-99 UniRef50_C6CWS0 Beta-N-acetylhexosaminidase n=1 Tax=Paenibacillu... 362 2e-98 UniRef50_Q15N77 Glycoside hydrolase, family 3-like n=1 Tax=Pseud... 361 2e-98 UniRef50_D1AF10 Glycoside hydrolase family 3 domain protein n=1 ... 361 3e-98 UniRef50_Q67SQ8 Putative beta-N-acetylglucosaminidase n=1 Tax=Sy... 360 3e-98 UniRef50_Q04TR7 Beta-glucosidase-related glycosidase n=4 Tax=Lep... 360 4e-98 UniRef50_Q6MRE1 BglX2 protein n=2 Tax=Bdellovibrio bacteriovorus... 359 9e-98 UniRef50_C9NRN7 Beta-hexosaminidase n=1 Tax=Vibrio coralliilytic... 359 1e-97 UniRef50_Q84IZ5 Beta-N-acetylglucosaminidase n=1 Tax=Clostridium... 358 1e-97 UniRef50_B3QTH0 Beta-N-acetylhexosaminidase n=12 Tax=Chlorobiace... 358 2e-97 UniRef50_C6WFB0 Glycoside hydrolase family 3 domain protein n=1 ... 357 3e-97 UniRef50_D2BER2 Beta-hexosamidase A n=1 Tax=Streptosporangium ro... 357 5e-97 UniRef50_UPI00016BFC75 Beta-glucosidase-related glycosidase n=1 ... 356 6e-97 UniRef50_Q3SKU2 Beta-hexosaminidase n=27 Tax=Proteobacteria RepI... 356 8e-97 UniRef50_C6C0L9 Glycoside hydrolase family 3 domain protein n=1 ... 356 9e-97 UniRef50_C0EVT4 Putative uncharacterized protein n=1 Tax=Eubacte... 355 1e-96 UniRef50_A1SW90 Beta-hexosaminidase n=3 Tax=Alteromonadales RepI... 354 3e-96 UniRef50_A8F7W1 Glycoside hydrolase family 3 domain protein n=1 ... 354 3e-96 UniRef50_C5CGK3 Glycoside hydrolase family 3 domain protein n=1 ... 353 7e-96 UniRef50_D2PU34 Glycoside hydrolase family 3 domain protein n=1 ... 352 9e-96 UniRef50_C1YLC2 Beta-glucosidase-like glycosyl hydrolase n=1 Tax... 352 1e-95 UniRef50_Q08U63 Beta-glucosidase n=1 Tax=Stigmatella aurantiaca ... 352 1e-95 UniRef50_A8L1X4 Glycoside hydrolase family 3 domain protein n=3 ... 352 1e-95 UniRef50_P48823 Beta-hexosaminidase A n=3 Tax=Alteromonadales Re... 351 2e-95 UniRef50_Q47M03 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase ... 350 4e-95 UniRef50_A6NU14 Putative uncharacterized protein n=1 Tax=Bactero... 349 7e-95 UniRef50_UPI0000E0E41F Beta-hexosaminidase A precursor n=1 Tax=G... 349 7e-95 UniRef50_B3E4C8 Glycoside hydrolase family 3 domain protein n=3 ... 349 9e-95 UniRef50_B3TAA3 Putative glycosyl hydrolase family 3 N terminal ... 349 1e-94 UniRef50_Q1K0E5 Glycoside hydrolase, family 3-like n=1 Tax=Desul... 348 1e-94 UniRef50_Q31G32 Beta-hexosaminidase n=1 Tax=Thiomicrospira cruno... 347 3e-94 UniRef50_D2PT32 Glycoside hydrolase family 3 domain protein n=26... 347 5e-94 UniRef50_Q2JJA4 Glycosyl hydrolase domain protein n=1 Tax=Synech... 346 8e-94 UniRef50_Q4QLU8 Beta-hexosaminidase n=138 Tax=Gammaproteobacteri... 346 9e-94 UniRef50_B3QRM6 Glycoside hydrolase family 3 domain protein n=1 ... 345 1e-93 UniRef50_Q9RUP9 Glycosyl hyrolase, family 3 n=2 Tax=Bacteria Rep... 345 2e-93 UniRef50_A7BX55 Glycosyl hydrolase, family 3 n=4 Tax=Proteobacte... 344 3e-93 UniRef50_B0P2K0 Putative uncharacterized protein n=2 Tax=Clostri... 344 3e-93 UniRef50_B0MSB5 Putative uncharacterized protein n=1 Tax=Eubacte... 344 3e-93 UniRef50_A7EUL4 Putative uncharacterized protein n=1 Tax=Sclerot... 342 1e-92 UniRef50_Q7PC47 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase ... 342 1e-92 UniRef50_A1SM50 Glycoside hydrolase, family 3 domain protein n=7... 342 1e-92 UniRef50_Q1QVU0 Glycoside hydrolase, family 3-like protein n=1 T... 341 2e-92 UniRef50_C9LY46 Beta-hexosaminidase n=1 Tax=Selenomonas sputigen... 341 2e-92 UniRef50_B0SE07 Beta-glucosidase-related glycosidase n=2 Tax=Lep... 341 3e-92 UniRef50_C9KM62 Glycosyl hydrolase domain protein n=1 Tax=Mitsuo... 340 3e-92 UniRef50_A9M1Z4 Beta-hexosaminidase n=23 Tax=Neisseriaceae RepID... 340 4e-92 UniRef50_A0M1W2 Family 3 glycosyl hydrolase/beta-lactamase fusio... 340 5e-92 UniRef50_D1R988 Putative uncharacterized protein n=1 Tax=Parachl... 340 6e-92 UniRef50_UPI000196C637 hypothetical protein CATMIT_02394 n=1 Tax... 340 6e-92 UniRef50_Q67LG3 Putative beta-N-acetylglucosaminidase n=1 Tax=Sy... 340 6e-92 UniRef50_C2M7P2 Family 3 glycosyl hydrolase/beta-lactamase fusio... 339 7e-92 UniRef50_Q11Z59 B-glycosidase, glycoside hydrolase family 3 prot... 339 8e-92 UniRef50_B1WW93 Beta-glucosidase n=12 Tax=Cyanobacteria RepID=B1... 339 9e-92 UniRef50_B7S2D9 Glycosyl hydrolase family 3 N terminal domain pr... 339 9e-92 UniRef50_Q0HJG7 Beta-hexosaminidase n=35 Tax=Gammaproteobacteria... 338 2e-91 UniRef50_A1RBY1 Putative beta-N-Acetylglucosaminidase n=1 Tax=Ar... 338 2e-91 UniRef50_Q5WDA0 Beta-N-acetylhexosaminidase n=1 Tax=Bacillus cla... 338 2e-91 UniRef50_B3QV76 Glycoside hydrolase family 3 domain protein n=1 ... 337 4e-91 UniRef50_A8SRF1 Putative uncharacterized protein n=1 Tax=Coproco... 336 7e-91 UniRef50_A1ZDA7 Glycosyl hydrolase, family 3 n=1 Tax=Microscilla... 336 8e-91 UniRef50_Q47VB6 Beta-hexosaminidase n=1 Tax=Colwellia psychreryt... 335 1e-90 UniRef50_B2SGG8 Glycosyl hydrolase family 3 n=17 Tax=Francisella... 335 1e-90 UniRef50_A4BUN0 Glycoside hydrolase, family 3-like protein n=1 T... 335 2e-90 UniRef50_Q30Y91 Beta-N-acetylhexosaminidase n=3 Tax=Desulfovibri... 334 3e-90 UniRef50_Q4FV07 Glycoside hydrolase family protein n=32 Tax=Pseu... 333 4e-90 UniRef50_Q73RR1 Glycosyl hydrolase, family 3 n=1 Tax=Treponema d... 333 4e-90 UniRef50_D0KBJ7 Glycoside hydrolase family 3 domain protein n=6 ... 333 4e-90 UniRef50_UPI0000E11708 beta-hexosaminidase n=1 Tax=Rhodobacteral... 333 5e-90 UniRef50_Q2NRJ7 Putative glycosidase n=1 Tax=Sodalis glossinidiu... 333 6e-90 UniRef50_C3RKZ3 Beta-hexosamidase A n=5 Tax=Bacteria RepID=C3RKZ... 333 6e-90 UniRef50_A9BCX1 Beta-glucosidase-related glycosidase n=2 Tax=Cya... 332 9e-90 UniRef50_C5BIH5 Glycoside hydrolase family 3 domain protein n=1 ... 332 1e-89 UniRef50_A3HXS5 Putative hydrolase/beta lactamase fusion protein... 332 1e-89 UniRef50_A1AVY9 Beta-N-acetylhexosaminidase n=2 Tax=Gammaproteob... 332 1e-89 UniRef50_D1BPZ2 Glycoside hydrolase family 3 domain protein n=3 ... 331 2e-89 UniRef50_C6W0V4 Glycoside hydrolase family 3 domain protein n=2 ... 331 3e-89 UniRef50_A1ZJ01 Beta-N-acetylglucosaminidase n=1 Tax=Microscilla... 330 3e-89 UniRef50_C9RS08 Glycoside hydrolase family 3 domain protein n=1 ... 330 4e-89 UniRef50_B6J7B4 Anhydromuramoyl-peptide exo-beta-N-acetylglucosa... 330 4e-89 UniRef50_Q2SV66 Beta-N-Acetylglucosaminidase n=31 Tax=Burkholder... 330 5e-89 UniRef50_B4S1A2 Beta-hexosaminidase A n=2 Tax=Alteromonas macleo... 330 6e-89 UniRef50_B2FPW9 Beta-hexosaminidase n=18 Tax=Xanthomonadaceae Re... 329 9e-89 UniRef50_Q31RD3 Beta-glucosidase-related glycosidases-like n=5 T... 329 1e-88 UniRef50_B3EKH1 Glycoside hydrolase family 3 domain protein n=3 ... 328 1e-88 UniRef50_A1C3U3 Beta-N-acetylglucosaminidase, putative n=7 Tax=L... 328 1e-88 UniRef50_C8WZJ0 Glycoside hydrolase family 3 domain protein n=1 ... 328 1e-88 UniRef50_Q6MA50 Putative beta-N-acetylglucosaminidase n=1 Tax=Ca... 328 1e-88 UniRef50_Q6AJ70 Related to glycosyl hydrolase n=1 Tax=Desulfotal... 328 1e-88 UniRef50_D2RL81 Glycoside hydrolase family 3 domain protein n=1 ... 328 2e-88 UniRef50_A2EGF0 Glycosyl hydrolase family 3 N terminal domain co... 328 2e-88 UniRef50_Q7X3A0 Putative glycosylhydrolase n=1 Tax=Candidatus Fr... 328 2e-88 UniRef50_B5EPA1 Glycoside hydrolase family 3 domain protein n=2 ... 327 3e-88 UniRef50_A5FAE4 Candidate beta-glycosidase; Glycoside hydrolase ... 326 5e-88 UniRef50_B6GD75 Putative uncharacterized protein n=1 Tax=Collins... 326 5e-88 UniRef50_C1D8P0 NagZ n=4 Tax=Proteobacteria RepID=C1D8P0_LARHH 326 6e-88 UniRef50_A4C9R3 Beta-hexosaminidase n=1 Tax=Pseudoalteromonas tu... 326 6e-88 UniRef50_B2RGN3 Glycosyl hydrolase family 3 n=2 Tax=Porphyromona... 326 7e-88 UniRef50_A1VC58 Glycoside hydrolase, family 3 domain protein n=3... 326 8e-88 UniRef50_Q2BGB3 Beta-hexosaminidase n=1 Tax=Neptuniibacter caesa... 326 8e-88 UniRef50_B9E727 Putative uncharacterized protein n=1 Tax=Macroco... 326 9e-88 UniRef50_B3EU27 Putative uncharacterized protein n=1 Tax=Candida... 325 1e-87 UniRef50_A5CLY4 Putative beta-N-acetylglucosaminidase n=1 Tax=Cl... 325 1e-87 UniRef50_C6W3E7 Beta-lactamase n=1 Tax=Dyadobacter fermentans DS... 325 1e-87 UniRef50_D0L0K9 Glycoside hydrolase family 3 domain protein n=1 ... 325 1e-87 UniRef50_A2DGR7 Glycosyl hydrolase family 3 N terminal domain co... 325 2e-87 UniRef50_A9F261 Put. Beta-glucosidase n=1 Tax=Sorangium cellulos... 325 2e-87 UniRef50_A2P4H6 Beta-N-Acetylglucosaminidase n=9 Tax=Vibrio RepI... 324 2e-87 UniRef50_Q3B6L6 Beta-N-acetylglucosaminidase n=2 Tax=Chlorobiace... 324 2e-87 UniRef50_A6GP56 Beta-hexosaminidase n=1 Tax=Limnobacter sp. MED1... 323 5e-87 UniRef50_C4JZX3 Predicted protein n=4 Tax=Onygenales RepID=C4JZX... 323 6e-87 UniRef50_B5CPR6 Putative uncharacterized protein n=1 Tax=Ruminoc... 323 7e-87 UniRef50_C6X002 Glycoside hydrolase n=2 Tax=Flavobacteriaceae Re... 323 7e-87 UniRef50_C4ZK55 Glycoside hydrolase family 3 domain protein n=6 ... 322 1e-86 UniRef50_A6LEE9 Glycoside hydrolase family 3, candidate beta-gly... 322 1e-86 UniRef50_C4XU31 Putative beta-N-acetylhexosaminidase n=1 Tax=Des... 322 1e-86 UniRef50_D2AR90 Beta-glucosidase-related glycosidase-like protei... 321 1e-86 UniRef50_Q3AQ16 Beta-N-acetylglucosaminidase n=3 Tax=Chlorobium/... 321 3e-86 UniRef50_D2AY03 Beta-N-acetylhexosaminidase n=2 Tax=Streptospora... 320 4e-86 UniRef50_Q3A4Z0 Putative glycosyl hydrolase n=1 Tax=Pelobacter c... 320 5e-86 UniRef50_B8HWS2 Beta-N-acetylhexosaminidase n=5 Tax=Cyanobacteri... 320 5e-86 UniRef50_B3T2B0 Putative glycosyl hydrolase family 3 N terminal ... 320 5e-86 UniRef50_Q2SIQ7 Beta-glucosidase-related Glycosidase n=1 Tax=Hah... 320 6e-86 UniRef50_C4Y752 Putative uncharacterized protein n=1 Tax=Clavisp... 320 6e-86 UniRef50_C5CH66 Beta-N-acetylhexosaminidase n=1 Tax=Kosmotoga ol... 319 8e-86 UniRef50_Q11XV7 B-N-acetylglucosaminidase, glycoside hydrolase f... 319 8e-86 UniRef50_Q5ZW95 Glycosyl hydrolase family 3 n=6 Tax=Legionella R... 319 9e-86 UniRef50_C6N603 Beta-hexosaminidase n=2 Tax=Legionella RepID=C6N... 319 1e-85 UniRef50_Q5LHV9 Possible beta-N-acetylglucosaminidase/beta-lacta... 319 1e-85 UniRef50_A4SCK6 Glycoside hydrolase, family 3 domain protein n=3... 318 1e-85 UniRef50_C0BMZ8 Glycoside hydrolase family 3 domain protein n=1 ... 318 2e-85 UniRef50_B5LAN7 HexI n=3 Tax=cellular organisms RepID=B5LAN7_9BURK 318 2e-85 UniRef50_C7PJZ6 Beta-lactamase n=1 Tax=Chitinophaga pinensis DSM... 318 3e-85 UniRef50_D2PXL5 Glycoside hydrolase family 3 domain protein n=1 ... 317 3e-85 UniRef50_B0P7U2 Putative uncharacterized protein n=1 Tax=Anaerot... 317 3e-85 UniRef50_B6BUL6 Beta-hexosaminidase n=1 Tax=beta proteobacterium... 317 3e-85 UniRef50_C1A778 Beta-N-acetylhexosaminidase n=1 Tax=Gemmatimonas... 317 3e-85 UniRef50_C0CJ14 Putative uncharacterized protein n=1 Tax=Blautia... 317 4e-85 UniRef50_Q1IZF6 Beta-N-acetylhexosaminidase n=2 Tax=Deinococcus ... 317 4e-85 UniRef50_D0LSI1 Glycoside hydrolase family 3 domain protein n=1 ... 316 5e-85 UniRef50_C6XX68 Glycoside hydrolase family 3 domain protein n=4 ... 316 5e-85 UniRef50_B2AR40 Predicted CDS Pa_4_8720 n=3 Tax=Leotiomyceta Rep... 316 5e-85 UniRef50_A6RET6 Putative uncharacterized protein n=1 Tax=Ajellom... 316 5e-85 UniRef50_Q6MGM9 Beta-hexosamidase A n=1 Tax=Bdellovibrio bacteri... 316 5e-85 UniRef50_A9BFQ0 Glycoside hydrolase family 3 domain protein n=1 ... 316 6e-85 UniRef50_B2KAV5 Glycoside hydrolase family 3 domain protein n=1 ... 316 6e-85 UniRef50_Q8YVY6 All1831 protein n=11 Tax=Nostocaceae RepID=Q8YVY... 316 8e-85 UniRef50_A1JQZ0 Glycosyl hydrolase family protein n=8 Tax=Entero... 316 8e-85 UniRef50_B4RS37 Beta-hexosaminidase n=2 Tax=Alteromonas macleodi... 316 1e-84 UniRef50_D1SAP6 Glycoside hydrolase family 3 domain protein n=4 ... 315 2e-84 UniRef50_A6Q7E8 Glycosyl hydrolase, family 3 n=1 Tax=Sulfurovum ... 315 2e-84 UniRef50_A4C3R7 Beta-glucosidase-related glycosidase n=1 Tax=Pse... 315 2e-84 UniRef50_D1YED3 Glycosyl hydrolase family 3 N-terminal domain pr... 314 3e-84 UniRef50_UPI00019730D7 beta-glucosidase-like glycosyl hydrolase ... 314 3e-84 UniRef50_C8PQ37 Glycosyl hydrolase, family 3 n=1 Tax=Treponema v... 313 4e-84 UniRef50_UPI0001B54569 sugar hydrolase n=1 Tax=Streptomyces sp. ... 313 5e-84 UniRef50_UPI0000E87F6B beta-hexosaminidase n=1 Tax=Methylophilal... 313 5e-84 UniRef50_C5EIW3 Glycoside hydrolase n=2 Tax=Clostridiales RepID=... 313 5e-84 UniRef50_B3C6M6 Putative uncharacterized protein n=2 Tax=Bactero... 313 7e-84 UniRef50_Q0VQT2 Beta-hexosaminidase n=3 Tax=Gammaproteobacteria ... 313 7e-84 UniRef50_UPI0001C379AD glycoside hydrolase family 3 domain prote... 311 2e-83 UniRef50_B4CZU1 Beta-N-acetylhexosaminidase n=2 Tax=Verrucomicro... 311 2e-83 UniRef50_B9WKL4 Beta-n-acetylglucosaminidase, putative n=4 Tax=S... 311 2e-83 UniRef50_Q1AUI8 Glycoside hydrolase, family 3-like protein n=1 T... 310 4e-83 UniRef50_C7QCS9 Glycoside hydrolase family 3 domain protein n=6 ... 310 4e-83 UniRef50_Q6MFP1 Related to beta-N-acetylglucosaminidase n=9 Tax=... 310 4e-83 UniRef50_Q2S5L3 Beta-N-acetylglucosaminidase n=2 Tax=Rhodotherma... 310 5e-83 UniRef50_UPI0001C334F9 beta-glucosidase-like glycosyl hydrolase ... 310 5e-83 UniRef50_A9E2K6 B-N-acetylglucosaminidase, glycoside hydrolase f... 310 7e-83 UniRef50_C6HUU9 Beta-N-acetylhexosaminidase n=1 Tax=Leptospirill... 309 9e-83 UniRef50_A5DD90 Putative uncharacterized protein n=2 Tax=Pichia ... 309 9e-83 UniRef50_A9WLT0 Predicted O-Glycosyl hydrolase, family 3 n=1 Tax... 309 1e-82 UniRef50_D2QKL5 Glycoside hydrolase family 3 domain protein n=1 ... 308 2e-82 UniRef50_A4CPY8 Glycosyl hydrolase, family 3 n=1 Tax=Robiginital... 308 3e-82 UniRef50_B8NYN9 Beta-N-acetylglucosaminidase, putative n=1 Tax=A... 308 3e-82 UniRef50_B2IHJ4 Glycoside hydrolase family 3 domain protein n=3 ... 306 6e-82 UniRef50_C3J950 Glycosyl hydrolase, family 3 n=2 Tax=Bacteria Re... 306 7e-82 UniRef50_Q1IHT6 Glycoside hydrolase, family 3-like n=1 Tax=Candi... 306 8e-82 UniRef50_Q1N543 Glycoside hydrolase, family 3-like protein n=1 T... 306 9e-82 UniRef50_C0BK44 Glycoside hydrolase family 3 domain protein n=1 ... 306 1e-81 UniRef50_A0L847 Beta-N-acetylhexosaminidase n=1 Tax=Magnetococcu... 306 1e-81 UniRef50_A2E8H3 Glycosyl hydrolase family 3 N terminal domain co... 305 2e-81 UniRef50_Q5P081 Beta-hexosaminidase n=63 Tax=Proteobacteria RepI... 304 3e-81 UniRef50_A9WJ64 Glycoside hydrolase family 3 domain protein n=2 ... 304 3e-81 UniRef50_B1YER8 Beta-N-acetylhexosaminidase n=1 Tax=Exiguobacter... 303 4e-81 UniRef50_B6BGC2 B-N-acetylglucosaminidase, glycoside hydrolase f... 303 4e-81 UniRef50_Q1NMP4 Glycoside hydrolase, family 3-like n=3 Tax=Delta... 303 5e-81 UniRef50_C7G6M3 Glycosyl hydrolase domain protein n=2 Tax=Rosebu... 303 7e-81 UniRef50_A8LWX2 Glycoside hydrolase family 3 domain protein n=6 ... 303 9e-81 UniRef50_A0LL99 Beta-N-acetylhexosaminidase n=1 Tax=Syntrophobac... 302 1e-80 UniRef50_D2AXZ5 Beta-N-acetylhexosaminidase n=2 Tax=Streptospora... 301 2e-80 UniRef50_B2UPP0 Beta-N-acetylhexosaminidase n=1 Tax=Akkermansia ... 301 2e-80 UniRef50_A3EUK5 Beta-N-acetylhexosaminidase n=2 Tax=Leptospirill... 301 2e-80 UniRef50_Q982A1 Glycosyl hyrolase, family 3 n=2 Tax=Mesorhizobiu... 300 5e-80 UniRef50_Q6BNW2 DEHA2E18546p n=3 Tax=Saccharomycetaceae RepID=Q6... 300 5e-80 UniRef50_C0D0H0 Putative uncharacterized protein n=1 Tax=Clostri... 300 5e-80 UniRef50_C2MBZ7 Beta-hexosaminidase n=1 Tax=Porphyromonas uenoni... 300 7e-80 UniRef50_UPI0001B588EB hydrolase n=1 Tax=Streptomyces sp. SPB78 ... 299 1e-79 UniRef50_D2L090 Glycoside hydrolase family 3 domain protein n=1 ... 298 3e-79 UniRef50_C7TNJ8 Beta-N-acetylhexosaminidase (GH3) n=3 Tax=Lactob... 297 3e-79 UniRef50_B3DX93 Beta-glucosidase-related glycosidase n=1 Tax=Met... 297 4e-79 UniRef50_A7ZD64 Beta-hexosaminidase A (N-acetyl-beta-glucosamini... 296 5e-79 UniRef50_Q1IQ59 Beta-N-acetylhexosaminidase n=1 Tax=Candidatus K... 296 6e-79 UniRef50_A0Q7X6 Glycosyl hydrolase family 3 n=12 Tax=Francisella... 296 6e-79 UniRef50_C7HX27 Beta-N-acetylhexosaminidase n=1 Tax=Thiomonas in... 296 7e-79 UniRef50_A3HZ06 Possible beta-N-acetylglucosaminidase/beta-lacta... 295 1e-78 UniRef50_A8ETM1 Glycosyl hydrolase n=2 Tax=Campylobacterales Rep... 295 1e-78 Sequences not found previously or not previously below threshold: >UniRef50_Q8RD12 Beta-glucosidase-related glycosidases n=2 Tax=Thermoanaerobacteraceae RepID=Q8RD12_THETN Length = 591 Score = 412 bits (1059), Expect = e-113, Method: Composition-based stats. Identities = 89/354 (25%), Positives = 164/354 (46%), Gaps = 38/354 (10%) Query: 1 MGPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-RNRLV 58 +G +++ ++G + + +++ +GG+ILF+ N Q+ +L I++ S + L Sbjct: 58 IGQMLMPAIDGTKATPQLEKMIKDYHLGGVILFSENIQSIEQIVKLTNDIQSLSLQIPLF 117 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 ++ DQEGG V R +G T+ P + A + LA + G ++ SE+ A+ I+ A Sbjct: 118 ISTDQEGGLVYRLSQG-TKFPGNMALGATFSED----LAYKVGKVIGSELKAVGINWDLA 172 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PV+DV + IG RS+ +P + F+ G+ E + + KHFPGHG V DSH Sbjct: 173 PVMDVNVNPYNPVIGIRSFGGNPDVVAKLGVAFMKGLQENKIASCVKHFPGHGDVDVDSH 232 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------- 222 + + + ++ F I + DA+M AH+ + +D Sbjct: 233 LGLGTVNKNRETLEKIELKPFEEAI-QAGTDALMTAHLSFPALDDTKVIAKTSEGVQKEI 291 Query: 223 --PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 PA+ S L ++R+ G++GVI +D ++M+ + + ++ AG D+IL+ Sbjct: 292 TIPATLSHKILTDLVRKTWGYEGVIITDAMNMKAISDNFGPVDAVVRAVKAGADIILMPV 351 Query: 281 NRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHER 333 + GA + L + KG S + + DS R + +L + + Sbjct: 352 DLNGAFNELVLET-----------KKGIISEKRIDDSVRRILKLKYKLGLMETK 394 >UniRef50_UPI0001693BCF Beta-glucosidase-related glycosidase n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001693BCF Length = 536 Score = 409 bits (1053), Expect = e-113, Method: Composition-based stats. Identities = 97/348 (27%), Positives = 164/348 (47%), Gaps = 23/348 (6%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RN 55 +G + M E D + ++ +G +I F RN P Q+ L +++ + Sbjct: 13 IGQMVMCGFESRVPDEQIETLITKYKLGNVIYFRRNLDTPLQVFRLSSKLQELAEKEVGI 72 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L++++DQEGG V R EG +P + AL + A ++ ++ A E+ A+ I++ Sbjct: 73 PLLISIDQEGGMVNRTHEGVVGMPGNMTLGALRDP----QAAYDSAYVSAEELRAIGINM 128 Query: 116 SFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 +FAP LDV + IG RSY + + + G + G+ T KHFPGHG A Sbjct: 129 NFAPCLDVNNNPDNPVIGVRSYGETAELVSKLGVEAMKGYQDGGVAATIKHFPGHGDTQA 188 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWL 231 DSH P P P +R ++ F I DA+M AHVI+ ++P P++ S + Sbjct: 189 DSHHALPLIPHPARRLRELELVPFQKAIN-AGADAVMSAHVIFPALEPEHLPSTLSRQVM 247 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR+E+GFDGVI +D L M+ AE +++AG D++LV + V+ + Sbjct: 248 TRLLREEMGFDGVITTDCLEMKAIDDHYGVAEGAVKAIEAGVDLVLVSHTLTKQVAAI-- 305 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEK 338 + + G + + + +S R + +L + EE+ Sbjct: 306 ------GAILQALESGRLTEERIDESVDRILRLKQKLKLERDSLSEER 347 >UniRef50_B0K1P2 Glycoside hydrolase, family 3 domain protein n=9 Tax=Clostridia RepID=B0K1P2_THEPX Length = 520 Score = 402 bits (1035), Expect = e-111, Method: Composition-based stats. Identities = 97/347 (27%), Positives = 169/347 (48%), Gaps = 23/347 (6%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR----AASRN 55 +G + M+ D ++++ +G +ILF+RN QL+ L I+ + Sbjct: 7 IGQMLMVGFPSPTYDDHVEKLVSEYKIGNIILFSRNVKGKEQLQNLCNNIQMNIIKHTGI 66 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 +VA+DQEGG V R + T LP + AA + +E A + G + E+ A+ I+I Sbjct: 67 PALVAIDQEGGMVTRIYKDATYLPGNMAIAATNEVEN----AYKIGEIAGKELRALGINI 122 Query: 116 SFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 +FAPVLD+ + + IG RSY +P+K +I G+ + + T KHFPGHG + Sbjct: 123 NFAPVLDINNNLSNPVIGVRSYGENPEKVAEFGINYIKGLQKENVIATAKHFPGHGDTSV 182 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWL 231 DSH + P + + ++ F I EN ++AIM AH+++S + + PA+ S L Sbjct: 183 DSHLDLPLVSYNKERLYRIELYPFKKAI-ENGVEAIMTAHILFSALEENKLPATLSHNIL 241 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR+EL F+G+I +D + M A A+ ++ AG D++LV + +K + + Sbjct: 242 TNILRKELRFNGLIITDCMEMNAIAKYFGTAKAAAMAIKAGADIVLVSHTKKLQIEAFNE 301 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEE 337 + +G S + + +S R + + + +E Sbjct: 302 --------IKEAVLRGEISIERIDESVERIIQLKEKYKLFENPYPDE 340 >UniRef50_C5DA89 Glycoside hydrolase family 3 domain protein n=6 Tax=Bacillaceae RepID=C5DA89_GEOSW Length = 698 Score = 402 bits (1034), Expect = e-110, Method: Composition-based stats. Identities = 92/363 (25%), Positives = 166/363 (45%), Gaps = 41/363 (11%) Query: 1 MGPVMLD----VEG---YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G +++ G ++ E +E++ +GG+ILF N Q +LV + A+ Sbjct: 160 IGQMLMPDFRNFNGKPVTDMLPEIKELIQKYHLGGIILFRENVVTTEQTAKLVADYQQAA 219 Query: 54 -RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 + L++++DQEGG V R + G T +P + A E LA + G + E+ A+ Sbjct: 220 EKFGLLISIDQEGGIVTRLQSG-TNMPGNMALGATRSKE----LAWKVGHAIGEELHALG 274 Query: 113 IDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 I+++ APVLDV + IG RS+ +P+ + +I G+ ++G+ T KHFPGHG Sbjct: 275 INMNLAPVLDVNNNPDNPVIGVRSFGENPELVAELGVSYIKGLQDSGVAATAKHFPGHGD 334 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------- 222 DSH P + ++ ++ F + E +DA+M AHV + VD Sbjct: 335 TAVDSHLGLPEVSHDRDRLQKVELYPFQKAM-EAGIDAVMTAHVTFPKVDDTKVISQKDG 393 Query: 223 -----PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 PA+ S L ++R+E+GFDGVI +D ++M+ + + ++ AG D++L Sbjct: 394 TAISLPATLSYKVLTELMREEMGFDGVIITDAMNMKAISDHFGPVDAAVRAVQAGADIVL 453 Query: 278 VCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQE 336 + + A + + G S++ + S R + + + Sbjct: 454 MPVGLEE-----------VANGLKKAVQNGGISQERINASVKRILTLKVKRGIFKQETPP 502 Query: 337 EKA 339 + Sbjct: 503 DMQ 505 >UniRef50_C6J2I0 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J2I0_9BACL Length = 544 Score = 402 bits (1033), Expect = e-110, Method: Composition-based stats. Identities = 101/337 (29%), Positives = 156/337 (46%), Gaps = 24/337 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----N 55 +G + + D + R ++ VGG++ F RN QL EL R ++ R Sbjct: 12 IGQLFICGFHSLVPDEQIRTLVQQYHVGGVVYFRRNIASVEQLAELSRSLQGLPRSKPEL 71 Query: 56 RLVVAVDQEGGRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L +++DQEGG V R EG +R+P + A + L +E L A EM+ + I+ Sbjct: 72 PLFISIDQEGGMVARLDHEGMSRIPGNMALGAANDP----ALTEEVARLAAKEMLQLGIN 127 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 +FAP +DV + + IG RS+ P + + + G G+ T KHFPGHG Sbjct: 128 FNFAPCVDVNNNPANPVIGVRSFGEQPDRVAVHGAAAVRGYQAEGVIATAKHFPGHGDTA 187 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYW 230 DSH+ P + + ++ F I E +DAIM AHVI+ + PA+ S Sbjct: 188 VDSHRGLASVPHGKERLYQVELVPFREAIAEG-VDAIMTAHVIFPAFEPEGTPATLSHRV 246 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 L +LR+ELG+ G+I +D L M+ A AE +++AG D++LV + + + Sbjct: 247 LTGLLREELGYGGLIVTDCLEMQAIAGYYGIAEGAVQAIEAGADLVLVSHTLAEQRAAI- 305 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTR 326 ERV G S L S R A+ + Sbjct: 306 -------ERVAEAVRSGRISEARLDRSLERILALKAK 335 >UniRef50_Q8CX63 Beta-N-acetylhexosaminidase (Beta-hexosaminidase) n=1 Tax=Oceanobacillus iheyensis RepID=Q8CX63_OCEIH Length = 529 Score = 396 bits (1018), Expect = e-109, Method: Composition-based stats. Identities = 96/336 (28%), Positives = 161/336 (47%), Gaps = 27/336 (8%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----- 54 +G + + +G + ++++ +GG+ILFTRN P ++ L ++ +R Sbjct: 6 IGELFVIGFDGKVVPKSIKKLIHDYHIGGIILFTRNIGSPDEVLRLTASLQKEARDAGYE 65 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L++ VDQE G V+R EG T P + A E A + G+ +E+ A+ I+ Sbjct: 66 RPLLICVDQENGTVRRLGEGTTIFPGGMTIGASDSEEN----AYQVGYASGTELRALGIN 121 Query: 115 ISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 + APV+DV + IG RSY P++ +A + GM +AG+ TT KHFPGHG Sbjct: 122 WNLAPVVDVNVNPDNPVIGVRSYGESPERVARLAKAAVKGMQDAGVVTTLKHFPGHGDTK 181 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPY 229 DSH P P + ++ F + + D IM AH+ + ++P PA+ S Sbjct: 182 VDSHLALPVIPHSIERLEEIELRPFQASLES---DVIMSAHIYFPALEPEADKPATLSKA 238 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 L +LR++LG+ G+I +D + M + + A+L AG D+ +V + + V L Sbjct: 239 VLTGLLREKLGYTGLITTDCMEMNAISETIGTEKGVVAALQAGVDLAMVSHTLQRQVGAL 298 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAIS 324 + V G SR+ ++ SA R + + Sbjct: 299 --------QEVKAAVESGELSRESILTSAERIRRVK 326 >UniRef50_A5CZV7 Beta-glucosidase-related glycosidases and D-alanyl-D-alanine dipeptidase n=4 Tax=Bacteria RepID=A5CZV7_PELTS Length = 1139 Score = 395 bits (1017), Expect = e-108, Method: Composition-based stats. Identities = 98/352 (27%), Positives = 160/352 (45%), Gaps = 41/352 (11%) Query: 1 MGPV-MLDV------EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G + M+ + E++ E + +GG+ILF N Q LV Q++ AS Sbjct: 573 IGQMLMVSFRQWKSKDVTEINGEIAATIKKYHLGGVILFRENVAGTGQTAGLVAQLQKAS 632 Query: 54 R-NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L +AVDQEGGRV R + G T +P + A E+ + E+ A+ Sbjct: 633 GDIPLFIAVDQEGGRVVRLQTG-TVMPGNMALGAAGSPED----TYAVAKAIGEELNALG 687 Query: 113 IDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 I++ APV+DV + IG RS+ +DPQ + +I+G+ +AG+ T KHFPGHG Sbjct: 688 INVDLAPVVDVNINPDNPVIGIRSFGSDPQLVADMGVAYINGLRDAGVIATVKHFPGHGD 747 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------- 222 DSH P P A ++ ++ F + K+D +M AHV + +D Sbjct: 748 TDVDSHLGLPSVPHGPARLKTVELKPFQQAFDQ-KVDMVMTAHVTFPAIDNTKAISKNDG 806 Query: 223 -----PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 PA+ S L ++R+E+GFDGVI +D L M+ A E ++ AG D+ L Sbjct: 807 REISIPATLSHKVLTDLVRKEMGFDGVIITDALEMKAIADHFGPREAVIGAVKAGADIAL 866 Query: 278 VCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + A + + G + +S + + I ++++ Sbjct: 867 MPADLDQAYN-----------GLLAAVRSGEIPESRIDESVK-RLIRLKISK 906 >UniRef50_D2M1Q8 Glycoside hydrolase family 3 domain protein n=1 Tax=Bacillus cellulosilyticus DSM 2522 RepID=D2M1Q8_BACS4 Length = 518 Score = 393 bits (1011), Expect = e-108, Method: Composition-based stats. Identities = 80/344 (23%), Positives = 154/344 (44%), Gaps = 25/344 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN---- 55 +G + + EG ++ + +E++ VG +ILF+RN P ++ L +++ ++ Sbjct: 8 IGQMFVVGFEGKQVTEQMKELIHEYRVGAIILFSRNIGSPEEVLTLTTELQKEAKIARHK 67 Query: 56 -RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L++ +DQE G V+R +G T P A + A A + G E+ A+ I+ Sbjct: 68 HPLLICIDQENGIVRRLGDGTTVFPGAMALGATQKPNH----AYDIGVATGRELKALGIN 123 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 + APVLDV + + IG RS+ + + GM A + TT KHFPGHG Sbjct: 124 WNLAPVLDVNNNPKNPVIGVRSFGESAELVSQFGIAAMKGMQAAEVVTTLKHFPGHGDTD 183 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPY 229 DSH P + +++ F + + + D +M AH+ + +++ PA+ S Sbjct: 184 VDSHVALPVITHDLNRLEEIELAPFKACM-DANADVVMSAHIHFPEIEKDPQVPATLSKA 242 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + +LR++L +DGV+ +D + M+ + A++ AG D++++ + + L Sbjct: 243 IITGLLREKLQYDGVVTTDCMEMDAISETIGTERGAVAAIVAGVDLVMISHIANRQIGAL 302 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHE 332 + V+ G + + R + + + Sbjct: 303 --------KEVSTAVSNGELDMSLIEQAFERVMKLKEKYLSWED 338 >UniRef50_C4KZ16 Beta-N-acetylhexosaminidase n=2 Tax=Firmicutes RepID=C4KZ16_EXISA Length = 649 Score = 390 bits (1002), Expect = e-107, Method: Composition-based stats. Identities = 96/355 (27%), Positives = 161/355 (45%), Gaps = 35/355 (9%) Query: 1 MGPVML-------DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G +++ L E ++ +GG+ILF N + Q +LV ++ Sbjct: 55 IGQMIMPDFRLWNGANHTSLAPEVARVIDRFDLGGVILFAENVKETEQTTKLVHDLQEVV 114 Query: 54 R------NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASE 107 + L V +DQEGG V R G T LP + A + A +AG ++ SE Sbjct: 115 KQDASNDIPLFVTIDQEGGIVTRLGTG-TNLPGNMALGATRN----RQYAYDAGEIIGSE 169 Query: 108 MIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHF 165 + A+ ++++F PVLDV + + IG RS+ +DP+ + S G+ + G+ T KHF Sbjct: 170 LHALGVNVNFGPVLDVNNNPGNPVIGVRSFSSDPELVGELGSAMTQGIQDQGVAATAKHF 229 Query: 166 PGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD----- 220 PGHG DSH P + E+R ++ F I E +D IM AH+ ++ Sbjct: 230 PGHGDTAVDSHYGLPIVDKSLDELRGLELLPFKRAITEG-IDMIMTAHIGMPQIEDEVVE 288 Query: 221 ------PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCD 274 P PA+ S + VLR+ELG++G++ +D L+M+ A + AE + +AG D Sbjct: 289 SERGTFPLPATLSDDVITGVLREELGYEGIVVTDALNMQAIADNFTEAEAVIKTFEAGVD 348 Query: 275 MILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + L+ + V+ E V G S + +S + + + + Sbjct: 349 IALMPTILRSEADVIK--LEAIFEEVIAAVKDGRLSEATIDESV-ERILKLKAER 400 >UniRef50_A9NEX4 Glycoside hydrolase, family 3 n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NEX4_ACHLI Length = 523 Score = 390 bits (1002), Expect = e-107, Method: Composition-based stats. Identities = 94/351 (26%), Positives = 165/351 (47%), Gaps = 27/351 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RN 55 +G +++ G + E ++ VG ++LF+RN AQL +L + ++ + Sbjct: 15 IGQLIVAGFHGTTVGPEIENLIREYKVGNILLFSRNIESAAQLFKLNQDLQKIAMETFGI 74 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L + +DQEGG V R E T P A + AA + +E A + G LM E+I + I++ Sbjct: 75 PLYITIDQEGGMVTRIFEDGTFFPGAMTIAATNNVE----YAYKIGDLMGKELIDLGINM 130 Query: 116 SFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 APVLDV + + IG RS+ DP K + I G+ + T KHFPGHG Sbjct: 131 DLAPVLDVNNNPKNPVIGVRSFSDDPDKVALFGNANIRGLQ-NHVVATAKHFPGHGDTHV 189 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWL 231 DSH P +P E+ + F + I + L AIM AH+ ++ + D P + S L Sbjct: 190 DSHLALPKVEKPLNELEKFEFKPFKAAINDG-LHAIMSAHINFTHLTEDGLPVTLSKRAL 248 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR+E+GF+G+I +D + M+ E +++AG +++ +C++ + Sbjct: 249 TGLLREEMGFEGLIITDGIEMKAIHDNYGTIEATLKTVNAGANLVCICHDLPYQIGA--- 305 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ---LHERWQEEKA 339 ++R + G + L + + ++ + L +++ K Sbjct: 306 -----SDRFNKALETGELTMDTLNERV-ARILTYKEKLNVDLTRTYEDVKE 350 >UniRef50_Q8ETJ0 Beta-hexosaminidase n=1 Tax=Oceanobacillus iheyensis RepID=Q8ETJ0_OCEIH Length = 424 Score = 389 bits (1000), Expect = e-107, Method: Composition-based stats. Identities = 96/337 (28%), Positives = 166/337 (49%), Gaps = 21/337 (6%) Query: 1 MGPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS---RNR 56 +G +++ ++G E++ + ++++ + VGG+ILF N AQ L+ I+ + + Sbjct: 57 IGQLIVGGIDGTEINTDTKDMIENYHVGGVILFADNIESKAQTVNLMNDIKQVNENNPHP 116 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++ VD+EGG V R + T LP + S E LA E G L+ +M + +++ Sbjct: 117 LLLGVDEEGGSVTRMPDEITSLPPSGSIGKAQDEE----LAFEVGILLGQQMRGLGFNLN 172 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLDV + IG+RS+ P + + + G+ G+ + KHFPGHG + D Sbjct: 173 FAPVLDVNSNPDNPVIGDRSFGDSPDIVTDMGIQSMKGIQSEGIISVVKHFPGHGDTSED 232 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH + P + E+ ++ F I + D +M AH++ +D PAS S + Sbjct: 233 SHLKLPKVDKSIEELSEVELVPFEKAISKG-ADVVMTAHILLPQIDTEYPASMSEKIISG 291 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR+++GFDGV+ SDDL+M E S+ AG DM+L+ ++ VSV Sbjct: 292 LLREDMGFDGVVISDDLTMGAITENIRIEEAAIQSVKAGSDMVLIAHHPDVVVSV----- 346 Query: 294 PIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQ 329 +++ G S ++ +S R + + Sbjct: 347 ---HKKLKAAVQNGVISEAKIDESVERIIQLKRKYKL 380 >UniRef50_UPI0001C42D84 glycoside hydrolase family 3 domain protein n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42D84 Length = 699 Score = 388 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 94/352 (26%), Positives = 164/352 (46%), Gaps = 41/352 (11%) Query: 1 MGPVML-------DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G +++ E+ E +++ +GG+ILF N Q +LV +AAS Sbjct: 159 IGQLLMPDFRTYNGANVEEMLPEIEQLVKDYHLGGVILFRENVVTTEQTTKLVADYQAAS 218 Query: 54 -RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 + L++ +DQEGG V R + G T +P + A E LA+ G ++ SE+ ++ Sbjct: 219 EKFGLLMTIDQEGGIVTRLQSG-TDMPGNMALGATRSPE----LAENVGRVIGSELHSLG 273 Query: 113 IDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 I+++FAPV+DV + IG RS+ DP+ + + G+ +G+ T KHFPGHG Sbjct: 274 INMNFAPVMDVNNNPDNPVIGVRSFGEDPKLVADMGVAYTHGLQSSGVAGTAKHFPGHGD 333 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------- 222 DSH P P +R ++ F + + +DAIM AHV + ++D Sbjct: 334 TAVDSHLGLPEVPHDIERLREVELYPFQQAM-DAGIDAIMTAHVTFPNIDDTKVISQKTG 392 Query: 223 -----PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL 277 PA+ S L ++R+++GFDGVI +D L+M A + ++ AG D+IL Sbjct: 393 EEIAIPATLSYKVLTELMREDMGFDGVITTDALNMLAIADHFGPVDAAVRAVQAGSDIIL 452 Query: 278 VCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + + G + + + +S + ++ +L + Sbjct: 453 MPVGLSEVRN-----------GLLEAVEAGKLTVERIEESV-ERILALKLKR 492 >UniRef50_A6TMT8 Glycoside hydrolase, family 3 domain protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TMT8_ALKMQ Length = 434 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 84/346 (24%), Positives = 159/346 (45%), Gaps = 34/346 (9%) Query: 1 MGPVML--------DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA 52 +G + + +++ +E + +GG+ILF N Q ++L+ ++ Sbjct: 72 IGQLFMPAFRSDHEGRPLKKINESVKEQIKQYHLGGIILFAENIESIDQTKKLIDDMQKN 131 Query: 53 SRNRLVVAVDQEGGRVQRFRE-----GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASE 107 SR + +AVD+EGGRV R T+LP + L+ E G L+ E Sbjct: 132 SRLPMFIAVDEEGGRVTRLNHSRSQLPATQLPGNEVLGKTKDPT----LSYEVGRLLGRE 187 Query: 108 MIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHF 165 ++++ +++ APV DV + IG+RS+ +PQ+ +AS G+ + KHF Sbjct: 188 LLSLGFNMNLAPVADVNTNAKNPVIGDRSFSDNPQEVGIMASEMARGLQNENVSAVFKHF 247 Query: 166 PGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRP 223 PGHG D+H + +++ + F I E +DAIM AH+ + + P Sbjct: 248 PGHGDTEFDTHHQAVVINHDLERLQSVEWVPFRKGI-EVSVDAIMTAHIQMPQITGNDFP 306 Query: 224 ASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRK 283 A+ S + +LR+E+G++G++ +D L M + + AE +++AG D++L+ + + Sbjct: 307 ATLSSKIITEILREEMGYEGMVITDALEMSAVSQHYTSAEAAVLAIEAGVDILLMPRSLE 366 Query: 284 GAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 A V G + + + +S R + + +L + Sbjct: 367 EAY-----------GGVLEAVSLGLITEERIEESVR-RILQVKLKR 400 >UniRef50_P40406 Uncharacterized lipoprotein ybbD n=5 Tax=Bacillus RepID=YBBD_BACSU Length = 642 Score = 387 bits (996), Expect = e-106, Method: Composition-based stats. Identities = 87/358 (24%), Positives = 153/358 (42%), Gaps = 38/358 (10%) Query: 1 MGPVML-DVEG------------YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVR 47 +G +++ D +++ E ++ GG+ILF N Q +L Sbjct: 48 LGQMLMPDFRNWQKEGESSPQALTKMNDEVASLVKKYQFGGIILFAENVKTTKQTVQLTD 107 Query: 48 QIRAAS-RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMAS 106 + AS + L++++DQEGG V R EG T P + A A + G ++ Sbjct: 108 DYQKASPKIPLMLSIDQEGGIVTRLGEG-TNFPGNMALGAARSRIN----AYQTGSIIGK 162 Query: 107 EMIAMDIDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKH 164 E+ A+ I+ F+PV+D+ + IG RS+ ++ + + + G+ + + KH Sbjct: 163 ELSALGINTDFSPVVDINNNPDNPVIGVRSFSSNRELTSRLGLYTMKGLQRQDIASALKH 222 Query: 165 FPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--- 221 FPGHG DSH P Q +R ++ F I + D +M AHV + D Sbjct: 223 FPGHGDTDVDSHYGLPLVSHGQERLREVELYPFQKAI-DAGADMVMTAHVQFPAFDDTTY 281 Query: 222 ----------RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDA 271 PA+ S + +LRQE+GF+GVI +D L+M+ A E ++ A Sbjct: 282 KSKLDGSDILVPATLSKKVMTGLLRQEMGFNGVIVTDALNMKAIADHFGQEEAVVMAVKA 341 Query: 272 GCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 G D+ L+ + + + + G Q++ +S + IS ++ + Sbjct: 342 GVDIALMPASVTSLKE--EQKFARVIQALKEAVKNGDIPEQQINNSV-ERIISLKIKR 396 >UniRef50_B9Y750 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y750_9FIRM Length = 571 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 93/342 (27%), Positives = 168/342 (49%), Gaps = 19/342 (5%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA----ASRNR 56 G + ++ E E+D + L +G +I++ +N + Q L Q+++ A++ Sbjct: 56 GQLVIVSFETTEVDTKTEAWLRTNKIGNVIVYAKNIENAEQATMLTGQLQSTIRGATQIP 115 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 + +DQEGG V R REG T P+ + AA + W MA E+ M +++ Sbjct: 116 AFIGIDQEGGMVNRVREGVTIFPSPMAIAA-----GRHENLYSLAWSMADELSGMGFNMN 170 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLDV + I RSY DP+ + AS +I G+ E G+ + KHFPGHG + D Sbjct: 171 FAPVLDVNSNPDNPVIHLRSYSDDPKAVASFASTWIKGLQEGGVVSVAKHFPGHGDTSED 230 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLK 232 SH P + +++ ++ F + IR + + AIM +H+++ ++ PA+ S + Sbjct: 231 SHFALPTVNKTLDQLKETELIPFETAIR-SGVSAIMTSHILFPKIEKEKIPATLSKTIIT 289 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +L+ ELG++G++ SD L M+ AE ++ AG DM+++ + + D Sbjct: 290 DLLKDELGYNGIVISDSLQMDAIQSHYGMAEAAVQAIQAGVDMLILGDGKVLQPDSEDVQ 349 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTR--LNQLHE 332 +P+ E + ++G+ S + L D+ + + + Sbjct: 350 TPV-IEALIEAVNQGTISAERLDDAV-LSILRIKNDYGLFED 389 >UniRef50_A3I7T1 Beta-hexosaminidase n=2 Tax=Bacillaceae RepID=A3I7T1_9BACI Length = 566 Score = 385 bits (991), Expect = e-106, Method: Composition-based stats. Identities = 94/342 (27%), Positives = 159/342 (46%), Gaps = 26/342 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS---RNR 56 +G +M V G L E + ++ +GGLIL+ N P Q L+ + A+ R Sbjct: 217 LGQMMFAGVSGTSLQQETKSLIRDYKIGGLILYANNLETPQQTVSLMNDLMTANGTNRLP 276 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L + DQEGG+V R P Q ++ + E G L+ ++ A ++ Sbjct: 277 LFLGTDQEGGKVVRLPGPLKNFPTNQKIGQINQPQFS----YEIGKLLGEQLKAFGFNLD 332 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+DV + IG+RS+ P + + + G+ + KHFPGHG + D Sbjct: 333 FAPVMDVNSNPNNPIIGDRSFSNKPDIVSQLGIQTMQGLESQQVIPVIKHFPGHGDTSVD 392 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH E P + ++ ++ F + I + +D +M H++ +D + P+S S + Sbjct: 393 SHLELPKVSKSLDDLNKLELIPFKAAIDKG-VDVVMVGHILLPKIDQQFPSSMSKEIITG 451 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR +LGFDGV+ +DD++M+ S + S+ AG D+IL+ + + +D Sbjct: 452 LLRNQLGFDGVVMTDDMTMKAITNHFSIGQAAVDSVKAGNDIILIAHEFANVTTAID--- 508 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 + G S Q++ DS R R+ QL E++Q Sbjct: 509 -----ALKAAVKNGKISEQQINDSVR------RIIQLKEKYQ 539 >UniRef50_D2MLA9 Beta-N-acetylglucosaminidase n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MLA9_9BACT Length = 354 Score = 385 bits (991), Expect = e-106, Method: Composition-based stats. Identities = 108/341 (31%), Positives = 167/341 (48%), Gaps = 19/341 (5%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-NRLVV 59 G + M+ +G EL + L GG ILF+RN DP Q+ L ++A + + L++ Sbjct: 8 GQLLMVGFQGTELSRAFLDWLQECRPGGAILFSRNLVDPEQVARLTNALQAHAPNSPLLI 67 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A+D+EGGRV R +GFT PAA AA + A + A E+ A+ I+++ AP Sbjct: 68 AIDEEGGRVSRLPQGFTAFPAASKVAACHSADT----AYGTAEVTAQELRAVGINMNLAP 123 Query: 120 VLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV + IG+R+Y P++ + G+ + G+ GKHFPGHG DSH Sbjct: 124 VLDVNSNPDNPIIGDRAYGTQPERVCTYGMAVMQGLEDNGIIPCGKHFPGHGDTMTDSHH 183 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLR 236 P +A + A ++ F IR L AIM AHV Y +D PA+ S L +LR Sbjct: 184 ALPVVEADRARLDAVELEPFRRAIRRG-LPAIMTAHVRYPALDADAPATLSQPILTDLLR 242 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 +LGF+GV +DD+ M S E +L AG DM+++C+ ++ + Sbjct: 243 NQLGFEGVTLTDDMEMRAILDHQSIGEASVRALQAGADMVVICHQQERQQEAVS------ 296 Query: 297 AERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQE 336 + + +G S ++L S R +A+ + + E Sbjct: 297 --AIEQALERGELSWEKLTASVARIRALKKQRLEPFEPVDP 335 >UniRef50_A7FH41 Beta-hexosaminidase n=129 Tax=Gammaproteobacteria RepID=NAGZ_YERP3 Length = 343 Score = 382 bits (983), Expect = e-105, Method: Composition-based stats. Identities = 260/339 (76%), Positives = 289/339 (85%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGPVMLDV YELDAEEREIL HPLVGGLILF+RN+HD QLRELVRQIRAAS RLVVA Sbjct: 1 MGPVMLDVASYELDAEEREILKHPLVGGLILFSRNFHDAEQLRELVRQIRAASHERLVVA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRVQRFR+GFTRLPAAQSFAA++ +LAQEAGWLMA+EM+AMDIDISFAPV Sbjct: 61 VDQEGGRVQRFRDGFTRLPAAQSFAAINDAATAAQLAQEAGWLMAAEMMAMDIDISFAPV 120 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+GH+SAAIGERS+H+DPQ+A +A FI GMH AGMKTTGKHFPGHGAVTADSHKETP Sbjct: 121 LDIGHVSAAIGERSFHSDPQQARIMAECFIRGMHSAGMKTTGKHFPGHGAVTADSHKETP 180 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D RP AEIR DM +F LI+ LDAIMPAHVIY++ D RPASGS YWL+ +LRQELG Sbjct: 181 HDNRPLAEIRTHDMVIFRELIQRKLLDAIMPAHVIYTEADARPASGSAYWLQEILRQELG 240 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F+G+IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR+GAVSVLDNLSPIKA+++ Sbjct: 241 FEGIIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRQGAVSVLDNLSPIKADQL 300 Query: 301 TRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 TRLYH G F RQ LM S+RW+ + L L ERW K Sbjct: 301 TRLYHSGQFDRQTLMASSRWQQANKALTALSERWDAHKQ 339 >UniRef50_C7IJM6 Glycoside hydrolase family 3 domain protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IJM6_9CLOT Length = 448 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. Identities = 90/333 (27%), Positives = 164/333 (49%), Gaps = 21/333 (6%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR---AASRNR 56 +G + ++ ++GYE D R+++ VGG ILFT N + Q+ EL+ ++ A++ Sbjct: 103 IGQLVIVGLDGYEADEHARQMIEEYRVGGFILFTSNIQNSNQMLELLNSLKNINRANKVP 162 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L ++VD+EGGRV R + F ++P+ Q L+ + G ++ E+ + +++ Sbjct: 163 LFLSVDEEGGRVSRLPDEFLKIPSNQIIGKLNNSSVS----HQVGSIIGEELKSFGMNMD 218 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLD+ + IG+R++ P + + + G+ + KHFPGHG + D Sbjct: 219 FAPVLDINSNPKNPIIGDRAFGTGPNLVSKLGVQTMKGLQSQNIIPVVKHFPGHGDTSTD 278 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH P +R ++ F+ I EN +A+M AH++ ++DP PAS S + Sbjct: 279 SHVGLPKVDNDLKRLRNFELKPFARAI-ENGAEAVMVAHILLPEIDPDNPASFSKTIIND 337 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 VLR E+ +DGV+ +DD +M + + S+ AG D++LVC++ + +V+ Sbjct: 338 VLRLEMNYDGVVITDDFTMGAIVKNYNIGQAAVKSILAGGDIVLVCHDFEKQKAVIS--- 394 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + G + S + + + Sbjct: 395 -----ALKDAAQSGQLPMDRIDRSV-IRILRLK 421 >UniRef50_C7P2G4 Glycoside hydrolase family 3 domain protein n=1 Tax=Halomicrobium mukohataei DSM 12286 RepID=C7P2G4_HALMD Length = 528 Score = 380 bits (978), Expect = e-104, Method: Composition-based stats. Identities = 97/336 (28%), Positives = 155/336 (46%), Gaps = 26/336 (7%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RN 55 G + + EG E + RE+L G +I F+RN P Q+ EL R+++ + Sbjct: 16 GQLFVVGFEGPEPTEDLRELLTDYRCGNVIYFSRNIDSPEQVAELSRELQTIATEAGPEI 75 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L V DQEGG V R G T P+ S A + LA+ G + +E+ ++ ++ Sbjct: 76 PLFVTADQEGGVVSRTDWG-TEPPSQMSIGAGRDAD----LARSVGGAVGAELASIGVNF 130 Query: 116 SFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 PVLDV + IG RS+ +P+ + + DGM G+ GKHFPGHG +A Sbjct: 131 DLTPVLDVNNNPDNPVIGVRSFGEEPELVGDLGAAMADGMQSEGVLACGKHFPGHGDTSA 190 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV---DPRPASGSPYW 230 DSH P + + A +++ F I + +DAIM HV + + D PA+ S Sbjct: 191 DSHHSLPVVDHDRERLDAVELAPFRRAI-DAGIDAIMTTHVSFPTITGDDELPATVSRDV 249 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 +LR++LGFDG++ +D + M A E +++AGCD++LVC+ + Sbjct: 250 QTGLLREQLGFDGLVVTDGMEMNAIADEMGTPEGCVQAVEAGCDLLLVCHTPAVQKDSV- 308 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 E V G + D+ + + + Sbjct: 309 -------EAVIDAVESGRIDESRIDDAV-ERVLEYK 336 >UniRef50_B5Y6T8 Beta-N-Acetylglucosaminidase n=2 Tax=Firmicutes RepID=B5Y6T8_COPPD Length = 392 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 92/334 (27%), Positives = 164/334 (49%), Gaps = 22/334 (6%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS---RNR 56 +G + ++ G + A+ ++ VGG+ILF+ N D QL L+ I+A + + Sbjct: 38 IGQMFIVSFTGTNVSADLITLVHDYHVGGVILFSENIQDDKQLLALLNGIKALNEGNKLP 97 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 ++++ DQEGGR+ R T+ P+ + L+ + G ++ +EM A ++ Sbjct: 98 ILISTDQEGGRISRLPSRATKFPSNLIIGKRNSAT----LSYQIGSILGTEMKAYGFNMD 153 Query: 117 FAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLDV + IG+RS+ ++P + + G+ +G+ KHFPGHG + D Sbjct: 154 FAPVLDVVTNPQNKVIGDRSFGSNPDLVSTLGVATMKGISSSGVIPVVKHFPGHGDTSVD 213 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLK 232 SH+E P +R+ ++ F I +N +M AH+ +++VD PAS S ++ Sbjct: 214 SHQELPVSNATLPHLRSFELLPFEKAI-QNGASVVMVAHIKFANVDPSGLPASLSEVFIS 272 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR ELGF GV+ +DDL ME S + ++ AG DM+L+C++ ++ Sbjct: 273 DILRNELGFSGVVITDDLHMEAITKHYSVGDAAIKAVQAGADMVLICHSLDEQKQAIN-- 330 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + G S + + +S + + + Sbjct: 331 ------ALVHAVKTGQISEERINESIK-RIAMLK 357 >UniRef50_C0ZKK5 Probable beta-hexosaminidase n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZKK5_BREBN Length = 537 Score = 377 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 87/342 (25%), Positives = 150/342 (43%), Gaps = 24/342 (7%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RN 55 G + M+ ++ E + +G +ILF+RN P + + V+++++ + Sbjct: 13 GQLLMIGYPTLDIPEEMEVWIKQKQIGSVILFSRNIGTPEETYQRVQKLQSWAYESWHPY 72 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L+V DQE G V R ++G TR P A + A +++ A+ E+ A I + Sbjct: 73 PLLVGTDQENGVVSRLQKGATRFPGAMALGATGNLDQ----AKRIYEATGEELRAAGISV 128 Query: 116 SFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 +FAP +DV + IG RS+ P++ I G+ +G+ KHFPGHG + Sbjct: 129 NFAPTVDVNNNPDNPVIGVRSFGEAPEQVARFGEAAIQGLLASGVIPAIKHFPGHGDTSI 188 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWL 231 DSH+ P + + ++ F I + +M H+ +D PA+ S + Sbjct: 189 DSHEAVPVLGHDRTRLDQVELIPFQRSI-AAGVPMVMVGHISLPSMDESGSPATYSKEIV 247 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR+ LGFDGV +D + M A AE S+ AG D++LV + + D Sbjct: 248 TGILRESLGFDGVAITDCMEMAAIAGTIGVAEAAVRSVQAGIDLVLVSHTHEVQQKTYD- 306 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHE 332 R+ H G S + + ++ R + R + Sbjct: 307 -------RLVAAIHSGELSEERVNEAVNRVLQLKKRFLSWEQ 341 >UniRef50_B1YFC7 Glycoside hydrolase family 3 domain protein n=6 Tax=Firmicutes RepID=B1YFC7_EXIS2 Length = 646 Score = 377 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 96/355 (27%), Positives = 159/355 (44%), Gaps = 35/355 (9%) Query: 1 MGPVML-------DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA- 52 +G +++ +L E I+ +GG+ILF N + Q +LV +R Sbjct: 52 IGQMIMPDFRLWDGANHTKLAPEVGRIIDRFDLGGVILFAENVSETEQTTKLVHDLREVV 111 Query: 53 -----SRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASE 107 + L V +DQEGG V R G T LP + A A+ AG ++ SE Sbjct: 112 EQDPSNDIPLFVTIDQEGGIVTRLGTG-TNLPGNMALGAT----RSSAYAEAAGGIIGSE 166 Query: 108 MIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHF 165 + A+ I+++F PVLDV + + IG RS+ +DP + S G+ E G+ T KHF Sbjct: 167 LHALGINVNFGPVLDVNNNPGNPVIGVRSFSSDPNLVGELGSAMTQGIQEQGVAATAKHF 226 Query: 166 PGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD----- 220 PGHG DSH P + AE++ ++ F I E +D +M H+ ++ Sbjct: 227 PGHGDTAVDSHYGLPVVDKSLAELKQLELIPFKRAIDEG-IDMVMTGHIGMPQIEDEVVE 285 Query: 221 ------PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCD 274 P PA+ S + VLR+ELG+ G++ +D L+M+ A + +E + AG D Sbjct: 286 SKQGTFPLPATLSDDVITGVLREELGYQGLVITDALNMQAIADNFTESEAVIKTFKAGVD 345 Query: 275 MILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + L+ + A V + V + G ++ S + ++ + + Sbjct: 346 IALMPTILRSASDVTK--LETIFDDVIKAVESGDLKEADIDRSV-ERILTLKATR 397 >UniRef50_Q8ERK3 Beta-N-acetylglucosaminidase n=2 Tax=Bacillaceae RepID=Q8ERK3_OCEIH Length = 668 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. Identities = 92/366 (25%), Positives = 153/366 (41%), Gaps = 48/366 (13%) Query: 2 GPV-MLDVEGYELDA---------------EEREILAHPLVGGLILF--TRNYHDP---A 40 G + ++ V G +E++ + +GG+I F ++N P Sbjct: 45 GQLFIVHVYGQTPTDPNYESTNLTNNRGGKNFQEVIENYHIGGIIYFNWSQNIGTPLEKQ 104 Query: 41 QLRELVRQIRAA-----SRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGK 95 Q+ +L ++ S L V+ DQEGG VQR T P + A + ++ Sbjct: 105 QVHDLSNGLQDIALEQDSEIPLFVSTDQEGGIVQRVISPGTVFPGNMAIGATNSID---- 160 Query: 96 LAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGM 153 A +G ++ E+ A+ I+++FAP +DV + IG RS+ DP + + I Sbjct: 161 FASRSGEILGKELDALGINMNFAPTVDVNLNPANPVIGVRSFGEDPDMVSQLGAAQITAY 220 Query: 154 HEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAH 213 + + T KHFPGHG DSH P + D+ F + I + + AIM H Sbjct: 221 EDQDVVATAKHFPGHGDTDVDSHYGLPVIDHDLETLHEVDLPPFQAAI-DAGVGAIMTGH 279 Query: 214 VIYSDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDA 271 ++ +D PA+ S L +LR+++GF+G+I +D L M G A + S A Sbjct: 280 IVVPALDDSGMPATLSYPILTELLREDMGFEGLIITDSLGMSG-ANVVPPERVAVESFLA 338 Query: 272 GCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQL 330 G D++L + P+ E V G S + L +S R + Sbjct: 339 GADILLNPPD-----------VPVAYEGVMDAVESGEISEERLDESVYRILSAKMEYGLF 387 Query: 331 HERWQE 336 + + E Sbjct: 388 EDPYTE 393 >UniRef50_C6J1S3 Glycoside hydrolase n=2 Tax=Bacillales RepID=C6J1S3_9BACL Length = 426 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. Identities = 91/349 (26%), Positives = 161/349 (46%), Gaps = 21/349 (6%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR---NR 56 +G + ++ ++G E + R+++ VGG I + N D Q L ++ A++ Sbjct: 77 IGQLVLVGMDGTEPNDVTRQLIEEYHVGGFIFYKDNIQDTKQALTLFNGLKTANQNKAVP 136 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L ++VD+EGGRV R T+LPAA+ E LA + G ++ E+ +++ Sbjct: 137 LFLSVDEEGGRVSRMPGELTKLPAARKIGNTVSEE----LAGQIGGIIGQELAGFGLNLD 192 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLDV + IG+R++ P+ + + G+ E G+ KHFPGHG + D Sbjct: 193 FAPVLDVNSNPDNPVIGDRAFGTKPEIVSRMGIAVMKGIREQGVIPVVKHFPGHGDTSVD 252 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH P +R ++ F I+E D +M AH++ +DP PAS S + Sbjct: 253 SHLGLPVVEHDLDRLRKLELVPFKQAIQEG-ADVVMIAHLLMPKLDPEHPASFSKAVIHD 311 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR+ELG++G I SDD++M + + AG +++LV ++ + V+ Sbjct: 312 LLREELGYEGTIISDDMTMGAIVEHYDIRDAAVQFIQAGGNIVLVGHDEEKEKQVI---- 367 Query: 294 PIKAERVTRLYHKGSFSRQELMDSAR-WKAISTRLNQLHERWQEEKAGH 341 + + G+ S + L + + + ++ Q Sbjct: 368 ----QALRDAVSSGTISAETLDERVYNVLKLKQKYKLTNDPAQGPNTKQ 412 >UniRef50_A2QDM1 Contig An02c0230, complete genome n=9 Tax=Trichocomaceae RepID=A2QDM1_ASPNC Length = 859 Score = 376 bits (966), Expect = e-103, Method: Composition-based stats. Identities = 90/348 (25%), Positives = 159/348 (45%), Gaps = 26/348 (7%) Query: 1 MGPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----- 54 +G + + G E + ++ +G ++LF RN D AQL+ L ++ +R Sbjct: 10 IGQLFAVGFHGLTPSPEIKTLIHDYGIGAIVLFKRNVSDAAQLQALTHALQEEARLAGHE 69 Query: 55 NRLVVAVDQEGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 + L++ +DQE G V R +LP + A + E LA + G + A I Sbjct: 70 HPLLIGIDQENGLVTRVSPPVAPQLPGPMALGAANSPE----LAYQVGAATGEILSASGI 125 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 ++++APV D+ + IG RS+ +P+ AS G+ E + T KHFPGHG Sbjct: 126 NMNYAPVCDINSEPLNPVIGVRSFGDNPEFVARFASASAQGLREHKVVPTVKHFPGHGDT 185 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV-DPR-PASGSPY 229 DSH P P+ + ++ + ++ F E ++AIM AH+ + D R PA+ SP Sbjct: 186 AVDSHYGLPVIPKTREQLESCELVPFRQAANEG-IEAIMTAHISLPAIGDGRLPATLSPD 244 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 ++ +LR+++G+ G++ +D L M+G + +L AGCD I++C+ V+ + Sbjct: 245 VMR-ILREDMGYSGMVITDCLEMDGIRATYGTEQGAVLALAAGCDSIMICHTYSVQVASI 303 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHERWQE 336 V + G L ++ R + +R + Sbjct: 304 LQ--------VCQAAESGKIPSARLDEAIRRVSELKSRFLSWETALRP 343 >UniRef50_A3DC81 Glycoside hydrolase, family 3-like protein n=3 Tax=Clostridium thermocellum RepID=A3DC81_CLOTH Length = 444 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 90/345 (26%), Positives = 161/345 (46%), Gaps = 33/345 (9%) Query: 1 MGPVML--------DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA 52 +G + + LD + + + GG+ILF+ N Q ++L+R ++ A Sbjct: 79 IGQIFIVAFRKGKSSRPLKVLDNSTKLKIQNFNPGGIILFSENIDTIPQTQKLIRDMQEA 138 Query: 53 SRNRLVVAVDQEGGRVQRFRE----GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEM 108 S+ + +AVD+EGGR+ R T++P+AQ+ E LA EAG ++ +E+ Sbjct: 139 SKIPMFIAVDEEGGRIARIGNNPKMHSTKIPSAQTIGLADDPE----LAYEAGRILGAEL 194 Query: 109 IAMDIDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 A+ +++FAPV DV + IG+RS+ +DP K + GM E + T KHFP Sbjct: 195 SALGFNMNFAPVADVNTNPDNPVIGDRSFGSDPYKVGLMVQAMSKGMQEQNVCTVLKHFP 254 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPA 224 GHG + DSH +R +++ F I+ D +M AH+I ++ PA Sbjct: 255 GHGDTSYDSHLGQVVINHDIERLRQIELTPFKMGIK-AGADGVMTAHIIMPNITGSNLPA 313 Query: 225 SGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKG 284 + S L +LR EL + +I +D + M+ + S ++ + AG D+IL+ + + Sbjct: 314 TLSEEILSGLLRNELKHEKLIITDAMEMKAISNYWSSSKAAVMAFKAGADIILMPESFEE 373 Query: 285 AVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 A + + + G + + L S + ++ + + Sbjct: 374 AYN-----------GILKAVKDGEITEERLNQSL-QRILALKFER 406 >UniRef50_D1CIC2 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CIC2_THET1 Length = 543 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 96/338 (28%), Positives = 162/338 (47%), Gaps = 24/338 (7%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RN 55 G V+ + G+ E +++ VG +ILF+RN D QLREL ++ A+ R Sbjct: 12 GQVVAVGFPGHAPTPEVLQLIQRYRVGNVILFSRNVRDVQQLRELTATLQRAAFDAGHRQ 71 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L++ +DQE G V+R +G + LP + + A E A++ E+ A+ +++ Sbjct: 72 PLLIMIDQENGVVRRLGQGASVLPGSMALGAAGDPE----CARQVALTTGRELRALGVNM 127 Query: 116 SFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 + APV DV + + IG RS+ DPQ + + ++G EAG+ + KHFPGHG Sbjct: 128 NLAPVADVNNNPANPVIGVRSFGEDPQAVGDMVAAAVEGYAEAGVISCLKHFPGHGDTAV 187 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWL 231 DSH P P +A + + ++ F I E D+IM AH+ + + P++ SP + Sbjct: 188 DSHLALPVVPHDRARLESVELPPFRRGI-EAGTDSIMVAHLHLPAIMANEAPSTVSPEVI 246 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR ELGF G++ +D L M + ++ +G D+ LV + V + Sbjct: 247 DGLLRGELGFGGLVITDCLEMRAVSETIGTERSAVLAIASGADVALVSHRHDRQVGAV-- 304 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLN 328 + +G+ S L ++A R + RL+ Sbjct: 305 ------RALYAALREGALSEDRLQEAADRVMCLKERLS 336 >UniRef50_C1I263 Beta-hexosamidase A n=1 Tax=Clostridium sp. 7_2_43FAA RepID=C1I263_9CLOT Length = 862 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 89/340 (26%), Positives = 157/340 (46%), Gaps = 30/340 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS------RNRLVVAVDQE 64 E++ E EI+ +GG+ILF +N Q +L+ ++ + L+ +DQE Sbjct: 74 TEMNDEVAEIIDEFDLGGVILFAQNVKTTEQTTKLIHDMQQVAINDQDGNLPLLTTIDQE 133 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG V R G T P + A + + G ++ E+ ++ I+++FAP +D+ Sbjct: 134 GGIVTRLGTG-TNFPGNMAIGATKNEQSS----YDTGLVIGRELNSLGINVNFAPSVDIN 188 Query: 125 HI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + + IG RS+ ++P + + I G+ E G+ KHFPGHG DSH P Sbjct: 189 NNPLNPVIGLRSFSSNPDLVAKLGVKMIKGIQENGVSVAAKHFPGHGDTATDSHYGLPKV 248 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV---------DP----RPASGSPY 229 + E++ ++ F + I +N +D IM AH+ + + D PA+ S Sbjct: 249 DKSLEELKEAELKPFQAAI-DNGVDMIMTAHIQFPQIEKDTYTSIKDGQEIQIPATLSDD 307 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 L +LR+++ FDGVI +D L+M + E + + DAG D+ L+ + V Sbjct: 308 ILTGLLREDMNFDGVIITDALNMAAISENFGEVETVKIAFDAGVDIALMPTILRSKADVP 367 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 E V KG S + + +SA + + ++++ Sbjct: 368 K--LRAIVEGVKEAVAKGEISEERINESA-ARVVKLKIDR 404 >UniRef50_C1XLF1 Beta-glucosidase-like glycosyl hydrolase n=2 Tax=Meiothermus RepID=C1XLF1_MEIRU Length = 511 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 102/340 (30%), Positives = 169/340 (49%), Gaps = 18/340 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +M+D+ G LDA R L GG+ LF +N +P Q+R+LV +IRA + +A+DQ Sbjct: 11 MMVDIPGPTLDAATRAHLERYRFGGVCLFRKNIQNPEQVRQLVAEIRAVLGPQAWIAIDQ 70 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG VQR E P+ + A+ ++ A+ G + +I++ I+ +FAP +DV Sbjct: 71 EGGAVQRVLEFALA-PSPMALGAVGEVKT----AEAVGAAVGRALISLGINWNFAPSVDV 125 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG+RS+ +DP+K ++ + G+ +AG+ KHFPGHG DSH E P Sbjct: 126 NTNPQNPVIGDRSFGSDPKKVARLSLAWARGLEQAGVMAAIKHFPGHGDTFLDSHLELPV 185 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELG 240 + E+ ++ F + E ++ +IM AH+ + +D + PA+ S L LR++LG Sbjct: 186 VNKTLEELERTELYPFRKAV-EAQISSIMSAHIRFPALDKQHPATLSKKILTGFLRKKLG 244 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F G+I +D L M+ + E SL AG DMIL + ++ +A + Sbjct: 245 FQGIIVTDALDMKAISKNYPIGEAAVKSLQAGADMILSLGQPEVHIA--------QATAI 296 Query: 301 TRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEKA 339 + GS S ++ S R + R + + + Sbjct: 297 QQALENGSLSEEQARQSQLRLLEAALRFPGQAQPYPPAQQ 336 >UniRef50_C6D1D6 Glycoside hydrolase family 3 domain protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D1D6_PAESJ Length = 403 Score = 373 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 88/335 (26%), Positives = 159/335 (47%), Gaps = 23/335 (6%) Query: 1 MGPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA-----SR 54 +G ++ + ++G E+ E +++ VGG+IL++ N +Q +L ++ ++ Sbjct: 55 IGQMILVGMDGTEIQPEISKLIKESHVGGIILYSNNIESASQTIKLANDLKQINQDNGAK 114 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L+++ DQEGGRV R + PA+++ + + A + G + + A+ ++ Sbjct: 115 LPLLLSTDQEGGRVSRLPKQIKPFPASRTIGKTNDP----QYAYQVGTALGEAVKAIGLN 170 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 FAPVLD+ + IG R++ + + + + G+ G+ KHFPGHG + Sbjct: 171 TDFAPVLDINTNPKNPVIGNRAFGTTAELVSRMGVQEMLGIQSQGVIPVVKHFPGHGDTS 230 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWL 231 DSH P +R+ + FSS I+E +M AH++ + +DP PAS S + Sbjct: 231 VDSHLGLPVVDHDLERLRSIEFVPFSSAIKEG-APMVMVAHILMTKLDPDTPASMSKKVI 289 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 + LR EL F GV+ +DD++M + + S+ AG D+ILV ++ +V+D Sbjct: 290 QDYLRGELKFSGVVITDDMTMGAVGKVKAIGPATVQSVLAGADIILVGHDPVQQQTVID- 348 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 +T G S L S + I + Sbjct: 349 -------ALTAAAQSGEISPSVLDASVY-RIIKLK 375 >UniRef50_B6Q1F2 Beta-N-acetylglucosaminidase, putative n=1 Tax=Penicillium marneffei ATCC 18224 RepID=B6Q1F2_PENMQ Length = 877 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 90/348 (25%), Positives = 154/348 (44%), Gaps = 35/348 (10%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----N 55 G + + G+ E R ++ +G +ILF RN D QL+ L ++ ++ + Sbjct: 14 GQLFAVGFHGFTSSPEIRTLIRDYGLGAVILFRRNIKDATQLQALTLALQQEAKDAGHEH 73 Query: 56 RLVVAVDQEGGRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L + +DQE G V R + + P + A + A + G + I+ Sbjct: 74 PLFIGIDQEHGLVTRITPQTAAQQPGQMTLGATQSADN----AYKIGKCTGEMLSFFGIN 129 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 +++APV DV + IG RS+ +P ++S G+ E G+ T KHFPGHG Sbjct: 130 MNYAPVCDVNSEPLNPVIGVRSFGDNPNFVAKMSSATARGLRETGVVPTVKHFPGHGDTA 189 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV------DPRPASG 226 DSH P + ++E+ ++ F + E ++A+M AH+ + + PA+ Sbjct: 190 VDSHHGLPVINKSRSELERCELIPFRRSVAEG-IEAVMTAHIALPRISSGPESNRLPATL 248 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 SP L +LR ++ +DGVI +D L M G + SL AG D +++C+ V Sbjct: 249 SPEAL-GILRNDMKYDGVIVTDCLEMNGIRGTYGTVDGTLMSLKAGSDSVMICHTYAVQV 307 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 + + ERV + G + ++ R R+ L ER+ Sbjct: 308 TSI--------ERVVQAVKSGELPESRICEALR------RITSLKERF 341 >UniRef50_C1TNN0 Beta-glucosidase-like glycosyl hydrolase n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TNN0_9BACT Length = 541 Score = 369 bits (948), Expect = e-101, Method: Composition-based stats. Identities = 99/331 (29%), Positives = 154/331 (46%), Gaps = 17/331 (5%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR--NRLV 58 G V+ L G LD E R L GGLIL+ +N +Q+R L+ IR R L+ Sbjct: 33 GQVLALSFSGTSLDEETRFRLEEIKPGGLILYGKNVESLSQVRSLLSSIRERVRLDIPLM 92 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VAVDQEGG V R R P + A + LA+ G++M ++ ++ ID+ +A Sbjct: 93 VAVDQEGGTVARIRGHGADFPGNMALGATGDPD----LAERQGFIMGRQLKSLGIDLDYA 148 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PV+DV + IG RS+ D ++ + I G A M + KHFPGHG V DSH Sbjct: 149 PVVDVNSNPSNPIIGVRSFGDDVATVSSMGASMIKGFSRARMGCSAKHFPGHGDVDIDSH 208 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 P R +R + F S + E + A+M AH++ + PA+ SP + ++L Sbjct: 209 LGLPVLNRSLESMRYLEFPPFRSAV-EAGVPAVMTAHIVVPSLTGELPATLSPEAI-SLL 266 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R E+GF+GV+ SD + M + + +L AG D +L+ + + Sbjct: 267 RDEMGFEGVVLSDSMGMRAISNGWGVPDAVVMALRAGVDFVLLGADPAFPPEGHREVR-- 324 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTR 326 +R+ G + L D+ + + + Sbjct: 325 --DRIVEAVRSGELDKARLDDAV-ERILRWK 352 >UniRef50_A8VQ50 Choline dehydrogenase n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8VQ50_9BACI Length = 535 Score = 369 bits (948), Expect = e-101, Method: Composition-based stats. Identities = 101/344 (29%), Positives = 162/344 (47%), Gaps = 28/344 (8%) Query: 1 MGPVML-DVEGY---ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-- 54 +G +M+ E+ +E++ VG +ILF RN P ++R L +++ +R Sbjct: 9 LGQMMVFGFSASKPDEVSEGIKEMIETHRVGNIILFGRNLGSPGEIRSLTGKLQKIARDA 68 Query: 55 ---NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAM 111 L ++VDQE G V+R EG T P A + A E LA E G+ A E+ A+ Sbjct: 69 GHERPLFISVDQENGAVRRLGEGTTVFPGAMTTGATGEPE----LAYECGYATALELKAL 124 Query: 112 DIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHG 169 I+ + APV DV + + IG RS+ DP++A A A+ ++GM +A + T KHFPGHG Sbjct: 125 GINWNLAPVADVNNNRHNPVIGVRSFSEDPEQAAAFAAASMNGMQKANIMTAVKHFPGHG 184 Query: 170 AVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASG 226 DSH P P + ++ F I E+ D IM +H+ + ++ P + Sbjct: 185 DTEVDSHLSLPVIPHDMERLNHVELVPFRRCI-ESGADVIMSSHIYFPALEKEKNLPVTL 243 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 S L +LR ++GF+G+I +D L M+ + A ++ AG D +++ Sbjct: 244 SHEVLTNLLRVKMGFEGLITTDCLEMDAISESVGTAVGAVKAIHAGVDFVMISMRTDLQK 303 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 L E V R KG S + +S + I+ + L Sbjct: 304 QAL--------EAVKRDIDKGLISESRIEES-YQRIIAAKDRYL 338 >UniRef50_B8FZS5 Glycoside hydrolase family 3 domain protein n=2 Tax=Desulfitobacterium hafniense RepID=B8FZS5_DESHD Length = 426 Score = 369 bits (948), Expect = e-101, Method: Composition-based stats. Identities = 93/340 (27%), Positives = 176/340 (51%), Gaps = 21/340 (6%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA---ASRNRL 57 G + M+ V+GYE++A ++++ + VGG +L +N D Q+ L+ ++ +R L Sbjct: 75 GQLVMVGVDGYEINANAQQLIENYHVGGFVLLKKNVRDSGQMLNLINTLKETNGVNRIPL 134 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 +A+D+EGGR+ R F ++P++Q A + G L ++ G ++ E+ ++++F Sbjct: 135 FLALDEEGGRISRMPAEFKKMPSSQQVGA----QNSGALGKKMGEILGREVKGFGMNVNF 190 Query: 118 APVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APVLD+ + IG+R++ ++P+ + + + G+ E G+ + KHFPGHG + DS Sbjct: 191 APVLDIFSNPKNKVIGDRAFGSNPELVSKVGIQTMRGIQEQGIISVVKHFPGHGDTSVDS 250 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTV 234 H P +R ++ F+ I N +DAIM AH++ +DP PAS S ++ + Sbjct: 251 HVGLPRVDYDLERLRNFELRPFAEAI-ANDVDAIMLAHILLPKLDPDYPASFSEVLIRDI 309 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LR+E+ ++GV+ +DD++M + E S+ AG D++LVC++ +VL Sbjct: 310 LRKEMDYNGVVITDDMTMGAIVENYNIGEAAVKSILAGSDIVLVCHDFAKEEAVL----- 364 Query: 295 IKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHER 333 + + G + +S R + + + Sbjct: 365 ---KEILHAAETGKIPVDRIDESVYRVLKLKEKYALADRQ 401 >UniRef50_B4BR59 Glycoside hydrolase family 3 domain protein n=2 Tax=Geobacillus RepID=B4BR59_9BACI Length = 457 Score = 368 bits (947), Expect = e-100, Method: Composition-based stats. Identities = 90/339 (26%), Positives = 162/339 (47%), Gaps = 26/339 (7%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----R 54 +G + ++ E D ++++ VGG IL+ RN P Q+ L ++ + + Sbjct: 117 IGQLVIVGFPSTEPDENVKQMIRQYHVGGAILYDRNMKTPEQVATLTNDLQRVALASRFK 176 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 LV ++DQEGG++ R R+ + +P+ Q + + +E+ AM I Sbjct: 177 IPLVFSIDQEGGKIVRMRQYVSPIPSQQQLGQSKDDAK----VYQIAHRTGTELAAMGIQ 232 Query: 115 ISFAPVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 ++FAPVLD+ + RS+ DP A + + + G+ +AG+ KHFPG+G D Sbjct: 233 VNFAPVLDLSATDS----RSFGEDPVLAGRLGQQVVTGLTDAGVTAVLKHFPGNGRSQVD 288 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLD--AIMPAHVIYSDVDPR-PASGSPYWL 231 H ++ ++ KD+ F +I D +M H+ Y D PAS S + Sbjct: 289 PHHDSSSVEAGMEDLENKDILPFRQIITNINHDRFFVMVTHIKYPVYDKEHPASLSSAII 348 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 + +LR++LG++G++ +DDL M + +Y E G ++ AG D++LVC+ V+D Sbjct: 349 QQLLREKLGYEGIVVTDDLEMGAVSKYFTYRELGYRAIAAGADLLLVCHTFDHQKEVIDG 408 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 + G S + + +S R K ++ +L+Q Sbjct: 409 --------IWDAVQTGKLSEERINESVR-KILAYKLSQF 438 >UniRef50_C7MQM3 Beta-glucosidase-like glycosyl hydrolase n=2 Tax=Actinomycetales RepID=C7MQM3_SACVD Length = 618 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. Identities = 108/369 (29%), Positives = 165/369 (44%), Gaps = 50/369 (13%) Query: 2 GPVML-DVEGYELDAEE------------REILAHPLVGGLILFTR----NYHDPAQLRE 44 G + + DV G D E++ VGG+I F N P Q+ Sbjct: 67 GQLFVADVWGQSADEAHPGNQEKYGVDTPAEVVRKYHVGGVIYFNHSGTDNIETPRQVAR 126 Query: 45 LVRQIRAAS-----RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQE 99 L ++ A+ R L+V+VDQEGGRV R E T P+A + A +E A Sbjct: 127 LSNGLQRAALQSQPRVPLIVSVDQEGGRVTRIAEHVTEYPSAMALGASRDVEGARTAAAI 186 Query: 100 AGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAG 157 + E+ AM I+ +FAPV DV + IG RS+ ADP+ A +DG + Sbjct: 187 SAA----ELRAMGINQNFAPVADVNSNPLNPVIGSRSFSADPELAGEFVEAQVDGYQNSR 242 Query: 158 -----MKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPA 212 + KHFPGHG +ADSH + P R + + RA D+ F + E +DAIM A Sbjct: 243 QPRQTVSAAAKHFPGHGDASADSHIDLPVIDRSEEDWRANDLPPFERAV-EAGIDAIMTA 301 Query: 213 HVIYSDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLD 270 H+ +D PA+ S + +LR+ELG+DGV+ +D L M G M +E +L+ Sbjct: 302 HISVPSLDDSGDPATLSEPIITGLLREELGYDGVVVTDSLGMAGVRQMYPDSEIPVRALE 361 Query: 271 AGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ- 329 AG D +L+ + AV+ V G + + + +S + + + + Sbjct: 362 AGVDQMLMPPDLDAAVN-----------GVLDAVASGRITEERIDESV-LRILKLKYERG 409 Query: 330 -LHERWQEE 337 + + E Sbjct: 410 IVSSPFVPE 418 >UniRef50_A8MJC3 Glycoside hydrolase family 3 domain protein n=2 Tax=Clostridiaceae RepID=A8MJC3_ALKOO Length = 418 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. Identities = 90/336 (26%), Positives = 166/336 (49%), Gaps = 21/336 (6%) Query: 1 MGPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA---ASRNR 56 +G ++ + ++GY+++ +++ + VGG+IL+ RN QL +L ++ A+ Sbjct: 70 IGQLLLVGIDGYDMNDPTADLIQNYRVGGVILYGRNVKGTKQLLDLTNALKTSNTANNIP 129 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L ++VD+EGG V R + +LP A++ ++ LA E G L+ + + + +++ Sbjct: 130 LFISVDEEGGAVSRSPKEVKKLPTARAIGKTEDVD----LAYEVGNLLGAMVKSFGYNMN 185 Query: 117 FAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLD+ + I +RS+ P + + GM G+ KHFPGHG + D Sbjct: 186 FAPVLDIDSNPANPVIEKRSFGKTPDIVSKMGIAEMKGMASEGVIPVVKHFPGHGDTSVD 245 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKT 233 SH P I + + F S I E + +M AH++ +D +PAS S + Sbjct: 246 SHIGLPIIHHELERIMDFEAAPFKSAIEEG-AEVVMVAHILLKKIDAEKPASMSKVIITD 304 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR+ LGF GV+ +DD++M + A ++++AG D+ILV + + A++ + Sbjct: 305 ILRENLGFGGVVITDDMTMGAISKNYDMAAAVVSAINAGADIILVGHQYENAIAAFNG-- 362 Query: 294 PIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLN 328 + + +G L +S R ++ + N Sbjct: 363 ------IKKGIEEGRIKMDRLDESVHRILSLKNKYN 392 >UniRef50_A9KMQ8 Glycoside hydrolase family 3 domain protein n=1 Tax=Clostridium phytofermentans ISDg RepID=A9KMQ8_CLOPH Length = 517 Score = 365 bits (938), Expect = 1e-99, Method: Composition-based stats. Identities = 89/324 (27%), Positives = 147/324 (45%), Gaps = 21/324 (6%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR----AASRNR 56 G ++ G +D+E + + +G ILF N D QL L ++ + +R Sbjct: 10 GQRIVAGFPGTTIDSELEDFIRTYKIGNFILFKENIVDANQLSNLCEGLQQLTKKYTGHR 69 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 + +DQEGG V R E +P A + AA + A AG + ++ + + Sbjct: 70 AFITIDQEGGMVTRLSEDSVNIPGAMAIAATRDEKN----AYMAGRITGQQLRTLGFNFD 125 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 APV D+ + IG RSY +P + + G+ + G+ + KHFPGHG D Sbjct: 126 LAPVADINSNMDNPVIGVRSYGDEPDQVAKYCVAMMKGLTDGGVLASAKHFPGHGDTNVD 185 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH P + E+ ++ F +LI E + AIM +H+I+ ++ PA+ S + Sbjct: 186 SHLGLPKVHKSLEEMELCELVSFKALI-EAGIPAIMSSHIIFPALEEELPATMSRKIITG 244 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +L+++LGF G++ SD + M E + +++AG D+I V + A D L+ Sbjct: 245 LLKEKLGFKGLVISDCMEMSAIKKYYGSIEGIKHAIEAGVDLIFVSHTMSVAREASDVLT 304 Query: 294 PIKAERVTRLYHKGSFSRQELMDS 317 LY KG S E+ S Sbjct: 305 G--------LYEKGELSMDEMDAS 320 >UniRef50_C8PT26 Glycosyl hydrolase, family 3 n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PT26_9SPIO Length = 520 Score = 363 bits (934), Expect = 4e-99, Method: Composition-based stats. Identities = 90/346 (26%), Positives = 160/346 (46%), Gaps = 26/346 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA----SRN 55 +G + G E+ + ++ VG +ILF N AQL+ L I+A + + Sbjct: 10 IGQMFACGFPGTEMPEDFIALVKKYKVGNVILFKHNIKSLAQLKALCADIQALITAETGH 69 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 + +DQEGG V R E + +P A + AA A AG + + A ++ Sbjct: 70 SAFITIDQEGGAVSRLAEDYYNVPGAMALAATGDERN----AYTAGKITGEILRAHGVNF 125 Query: 116 SFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 APVLDV + IG RSY +P + I G+ G+ + KHFPGHG + Sbjct: 126 DLAPVLDVNSNPKNPVIGVRSYGDNPALVDRYGIQMIRGLTAGGVLSAAKHFPGHGDTSV 185 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLK 232 DSH P + +AE+ +++ F++ I+ L A+ +H+++ +D + PA+ S + Sbjct: 186 DSHLTLPTVRKSKAELEQTEIAPFAAAIKTGLLPAVTISHILFPAIDEQFPATISKKIVT 245 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +L++ELGF+G++ SD + M+ E A+++AG ++I + + A + + Sbjct: 246 GLLKEELGFNGLVISDCMEMKAMQTFFGTEESAAAAINAGIELIFISHTIPTACAAI--- 302 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 E VTR + + +AR ++ L ++ + Sbjct: 303 -----ETVTRACAENRIPLATVEQAAR------KIIALKKQLTPAQ 337 >UniRef50_Q1CXY4 Glycosyl hyrolase, family 3 n=6 Tax=Cystobacterineae RepID=Q1CXY4_MYXXD Length = 369 Score = 363 bits (933), Expect = 6e-99, Method: Composition-based stats. Identities = 105/339 (30%), Positives = 158/339 (46%), Gaps = 24/339 (7%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ G +D+E ++ + G ILF RN A L ++ + +++VDQE Sbjct: 16 MVGFPGTHIDSELAALMDDG-IYGAILFKRNVGSAADTAALCHALKTRAGRPFILSVDQE 74 Query: 65 GGRVQRFRE-GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 GGRV R R FT LP + + + G L+A E+ A+ D FAPVLDV Sbjct: 75 GGRVARLRGAPFTALPPMRELGQRGDAAQ----VERVGRLLAYELRALGFDWDFAPVLDV 130 Query: 124 --GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG+RS+ + ++ + G+ G+ + GKHFPGHG T DSH P Sbjct: 131 DTNPANPVIGDRSFSREAEEVARLGVALARGLEAGGVASCGKHFPGHGDTTTDSHLTLPR 190 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELG 240 P +R+ ++ F + L ++M AHV++ +D PA+ S L+ VLR+ELG Sbjct: 191 LPHDLERLRSVELVPFR-AFAQAGLASLMTAHVLFDALDPGVPATMSQRVLQGVLREELG 249 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 FDGV+ SDDL M+ A S E AG D+ LVC+N + E + Sbjct: 250 FDGVLVSDDLEMKAIAGHYSVEEATVQGTLAGVDLFLVCHNADVQRRAI--------EAL 301 Query: 301 TRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 + G SR+ + + R RL+ L R+ Sbjct: 302 VKAVESGRVSRERIAQAHR------RLDALSARFAHPAE 334 >UniRef50_B8M4K9 Beta-N-acetylglucosaminidase, putative n=9 Tax=Trichocomaceae RepID=B8M4K9_TALSN Length = 2237 Score = 363 bits (932), Expect = 6e-99, Method: Composition-based stats. Identities = 90/347 (25%), Positives = 148/347 (42%), Gaps = 30/347 (8%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----N 55 G + + G+ E + ++ +G +ILF RN D QL+ L ++ ++ + Sbjct: 1376 GQLFAVGFHGFTPSPEIKTLIRDYGLGAVILFRRNIKDAVQLKTLTLALQQEAKEAGHEH 1435 Query: 56 RLVVAVDQEGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L + +DQE G V R + P + A E A E G + I+ Sbjct: 1436 PLFIGIDQENGLVTRIMPPLAVQQPGQMTIGATQSAEN----AYEVGKCTGELLSFFGIN 1491 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 +++AP D+ + IG RS DP + G+ E+G+ T KHFPGHG Sbjct: 1492 MNYAPDCDINSEPLNPVIGVRSAGDDPNFVARFSCATAKGLRESGVVPTAKHFPGHGDTA 1551 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV------DPRPASG 226 DSH P + ++E+ ++ F + E ++ +M AH+ V + PA+ Sbjct: 1552 VDSHHGLPVISKSRSELERCELIPFRRAVAEG-IETVMTAHIALPKVTTGPDSEGLPATL 1610 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 SP L +LR ++ +DGVI +D L M+G E SL AG D I++C+ V Sbjct: 1611 SPEAL-GILRNDMKYDGVIVTDCLEMDGIRATYGTVEGSLMSLIAGSDSIMICHTYAVQV 1669 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHE 332 + ERV + G+ S + +S R + R + Sbjct: 1670 KSI--------ERVVQAVQLGALSESRIDESLHRIGKLKQRFLSWED 1708 >UniRef50_A3CMS4 Beta-hexosamidase A, putative n=1 Tax=Streptococcus sanguinis SK36 RepID=A3CMS4_STRSV Length = 930 Score = 363 bits (932), Expect = 6e-99, Method: Composition-based stats. Identities = 89/348 (25%), Positives = 157/348 (45%), Gaps = 32/348 (9%) Query: 8 VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS--------RNRLVV 59 + EL+ E E + GG+ILF N AQ L + ++ A+ + L++ Sbjct: 167 FDMTELNKEVAEAVDKYDFGGVILFAENVKGTAQTLALTQALQQAAISNQANNGKLPLLL 226 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A+DQEGG V R G T LP + A E LA +AG ++ E+ A+ ++++FAP Sbjct: 227 AIDQEGGIVYRLGSG-TALPGNMAVGATRDPE----LANQAGQIIGRELSALGLNVNFAP 281 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 V D + + IG RS+ +DP+ + + ++G+ + + KHFPGHG DSH Sbjct: 282 VFDTNNNPQNPVIGLRSFSSDPKVVARLGTAMMEGIQKYNVAVAAKHFPGHGDTAVDSHT 341 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-------------RPA 224 P + AE+ A ++ F + I +N D +M AH+ Y ++ PA Sbjct: 342 GLPLVDKSYAELEALELLPFKAAIDKN-ADMLMTAHIQYPQIEKDTVISKQTGETIYIPA 400 Query: 225 SGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKG 284 + S + ++R++ G+ GV+ SD + M+ A +E ++ +G D++L+ + Sbjct: 401 TLSDDIITGIVRKKFGYKGVVVSDAMGMDAIAKNFGESEAAIMAIKSGVDVVLMPTVLRS 460 Query: 285 AVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLH 331 + + +G L +S R + + L Sbjct: 461 KSDL--AKIDKIINDIELAVQRGDIPVSRLDESVTRILNLKEKRGILD 506 >UniRef50_D1CEL0 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CEL0_THET1 Length = 596 Score = 362 bits (931), Expect = 9e-99, Method: Composition-based stats. Identities = 87/335 (25%), Positives = 140/335 (41%), Gaps = 28/335 (8%) Query: 15 AEEREILAHPLVGGLILFT--RNYHDPAQLRELVRQIRAAS-----RNRLVVAVDQEGGR 67 +I+ VG +I FT N + Q+ L I+ +S +++ DQEGG Sbjct: 86 DNASQIIQKYHVGSIIYFTWTGNIKNAQQVARLSNGIQRSSMTQRVPVPTLISTDQEGGI 145 Query: 68 VQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI- 126 + R R P + A A A +++ E+ A+ I+ + AP DV Sbjct: 146 ISRLPSQIVRSPGNMALGAARD----KTYAFRAAYIVGKELRALGINQNLAPDADVNINP 201 Query: 127 -SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRP 185 + IG RS+ + + + G + T KHFPGHG DSH P Sbjct: 202 FNPVIGVRSFGDRSWLVARLTAAQVRGYQSVDVAATAKHFPGHGDTNIDSHTGLPVIRHT 261 Query: 186 QAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGFDG 243 Q ++ D+ F + I ++ +D IM AH++ +D RPA+ S L +LRQ+LGF G Sbjct: 262 QKQLEDIDLPPFRAAI-DSGVDVIMSAHIVVPSLDPSGRPATLSKPILTGLLRQKLGFKG 320 Query: 244 VIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRL 303 VI +D L M G M + ++ AG D++L+ + A++ V Sbjct: 321 VIMTDSLEMAGVRQMFPDSRVPVEAIKAGADLLLMPPDLNLAIN-----------SVVNA 369 Query: 304 YHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEE 337 +G S + S R + RL + + Sbjct: 370 VERGEISVSRINASVLRILELKARLGLPANPYVDA 404 >UniRef50_C6CWS0 Beta-N-acetylhexosaminidase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CWS0_PAESJ Length = 535 Score = 362 bits (929), Expect = 2e-98, Method: Composition-based stats. Identities = 93/334 (27%), Positives = 149/334 (44%), Gaps = 23/334 (6%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNR 56 G + + GYE E ++ +GG I F RN D Q+ +L ++ + R Sbjct: 16 GQLFVFGFHGYEPTEEITSLIEKYGIGGTIYFGRNVRDAKQVHQLSSSLKNIAGRAGRPN 75 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 ++VA+DQEGG V R G T +P + A E G E + E+ + + ++ Sbjct: 76 MLVAIDQEGGMVARIVNGVTLMPGNMALGATGSAEGAG----ETARVCGEELRLLGVTMN 131 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 +AP LDV + I RSY Q + + G + + T KHFPGHG + D Sbjct: 132 YAPCLDVNNNPDNPVINVRSYGDRAQIVSELGKAALLGYQASKVAATVKHFPGHGDTSVD 191 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLK 232 SH + P P + + A +++ F + I M AHV +D +P++ S L Sbjct: 192 SHHDLPVLPHSRERLDAIELAPFRAAIAAGTAAI-MTAHVCLPALDPSGKPSTLSQPVLT 250 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR+ELG++GVI +D L M+ A ++ AG DMILV + + V+ L+ + Sbjct: 251 GLLREELGYNGVIVTDCLEMDAIDRFYGPANGAVQAIQAGADMILVSHTYEKQVAALEAV 310 Query: 293 SPIKAERVTRLYHKGSFSRQELMDS-ARWKAIST 325 G S + + +S AR + Sbjct: 311 VAAVE--------NGELSEERINESLARVMQLKQ 336 >UniRef50_Q15N77 Glycoside hydrolase, family 3-like n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15N77_PSEA6 Length = 633 Score = 361 bits (927), Expect = 2e-98, Method: Composition-based stats. Identities = 94/350 (26%), Positives = 147/350 (42%), Gaps = 28/350 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVAVDQEGG 66 L + +E++A VGG+ILF N D Q+ L R ++AAS + L++++DQEGG Sbjct: 64 TTLPDDLKEMIADTGVGGIILFANNLVDTEQMLRLNRDLQAASASGGHSPLLISIDQEGG 123 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG- 124 RV R + T + A E G A +G ++A E+ + +++FAP +DV Sbjct: 124 RVVRIPQNISTAFSGNMAIGATY-AEHGTHFATLSGDVIAKELAVLGFNVNFAPSVDVNV 182 Query: 125 -HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I RS+ DPQ + + M + G+ KHFPGHG + DSH P Sbjct: 183 NPENPVINVRSFGEDPQVVAELGLAQMQAMQQNGIIAALKHFPGHGDTSTDSHSGLPLVE 242 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYWL 231 +I+A D++ F I IM AH+ Y +D PA+ S L Sbjct: 243 HDLHKIKAVDLAPFQYAIDNGDPGMIMTAHIQYPALDSSTFTATDGSKTILPATMSRAIL 302 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR ++GF GVI +D L M A + + AG D+ L+ + A + Sbjct: 303 TDLLRDQMGFRGVIITDALDMAAIAHFYEPTQAVLQTFKAGTDIALMPLAIRTAQDIPK- 361 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAGH 341 + G + E+ S R+ L + Sbjct: 362 -LKKMIADLAYAVQIGDVTLAEIETSV------ARIQTLKHNYIPSNTAQ 404 >UniRef50_D1AF10 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermomonospora curvata DSM 43183 RepID=D1AF10_THECD Length = 543 Score = 361 bits (927), Expect = 3e-98, Method: Composition-based stats. Identities = 102/346 (29%), Positives = 160/346 (46%), Gaps = 25/346 (7%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G + + DAE++ + VGGLI F N P Q L ++ AS+ L++AV Sbjct: 64 GQLFVPTFASRQDAEQK--IKKYHVGGLIYFPDNARSPQQTARLSNALQRASKIPLLLAV 121 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 D+E G V R TR P + A + + A+EA ++ +E+ A+ I+ ++APV Sbjct: 122 DEEQGLVTRL-SYVTRFPGNMALGATAKPQT----AREAAKVIGTELRAVGINQNYAPVA 176 Query: 122 DVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV + IG RS+ +DP + + G +AG+ T KHFPGHG DSH Sbjct: 177 DVNVNPANPVIGVRSFGSDPGLVSLMLGGALQGYRDAGVAATVKHFPGHGDTDTDSHTGL 236 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQ 237 P +AE D F + I + +DAIM AH++ +D PA+ S L +LR+ Sbjct: 237 PVIRHSRAEWERLDAPPFRAAIAQG-VDAIMTAHIVMPGLDDSGDPATLSQAVLTGLLRK 295 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ELG+ GV+ +D LSM GA + ++ AG D +L+ + GA + + Sbjct: 296 ELGYRGVVVTDSLSMAGARTRYGAEQAAVRAVQAGADQLLMPPDLAGAHAAVLAAV---- 351 Query: 298 ERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHE-RWQEEKAGH 341 G S++ L +S R + + + +AG Sbjct: 352 -------RDGRISQRRLEESVTRILRLKAERGLFGDVQVDPARAGQ 390 >UniRef50_Q67SQ8 Putative beta-N-acetylglucosaminidase n=1 Tax=Symbiobacterium thermophilum RepID=Q67SQ8_SYMTH Length = 523 Score = 360 bits (926), Expect = 3e-98, Method: Composition-based stats. Identities = 101/317 (31%), Positives = 152/317 (47%), Gaps = 21/317 (6%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-NRLVVAVDQ 63 M+ G + ++ VGG+ILF RN Q R L +I++ S L++AVDQ Sbjct: 18 MIGFRGTDPGDVPEALIRTEGVGGVILFARNLVSADQARRLTARIQSCSPDLPLLIAVDQ 77 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGG V R T P A S A + L + G +A E+ AM I+++ APVLDV Sbjct: 78 EGGIVLRLA--VTPWPGAMSLGAAGSPD----LTRRVGAGIARELRAMGINMNLAPVLDV 131 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG RS+ +DP + + +I G H+ G+ T KHFPGHG DSH Sbjct: 132 NVNPANPVIGVRSFGSDPARVAEHGAAYIRGHHDEGVLATAKHFPGHGDTDVDSHLALAR 191 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKTVLRQE 238 P + A ++ F I +DAIM AHV++ V+P RPA+ SP L +LR++ Sbjct: 192 VPHGMDRLEAVELVPFRWAI-AAGVDAIMTAHVVFPAVEPDPDRPATLSPAVLTGLLREQ 250 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LGFDG++ +D + M+ A AE ++ AG D++L+ + + + + A Sbjct: 251 LGFDGLVITDCMEMKAIADHAGTAEGAVQAIAAGADLVLISHTERLQREAIAAVRAAIA- 309 Query: 299 RVTRLYHKGSFSRQELM 315 G + Sbjct: 310 -------SGRIPAWRVE 319 >UniRef50_Q04TR7 Beta-glucosidase-related glycosidase n=4 Tax=Leptospira RepID=Q04TR7_LEPBJ Length = 579 Score = 360 bits (925), Expect = 4e-98, Method: Composition-based stats. Identities = 87/337 (25%), Positives = 160/337 (47%), Gaps = 23/337 (6%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNR 56 G V+ + + G LD + + L GG+ILF N ++ +L +++ S + Sbjct: 56 GQVIHVAIPGKTLDETAAKEIEEILPGGIILFGTNLGTKEEILKLNLELQKKSMESSKLP 115 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++++DQEGGRV R ++G T+ P A + + A + G++ + ++ + + Sbjct: 116 LLISIDQEGGRVLRVKDGVTQFPGAMALGQTKNAD----YAYKVGFVTSYQLRKLGFNFV 171 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAP LD+ + I RS + P+ + G G T KHFPGHG D Sbjct: 172 FAPDLDINNNPDNPVINTRSMGSTPETVSISGIGYEKGARIGGAIPTIKHFPGHGDTNVD 231 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH P + E+ ++ F I++ + +M AH++Y +DP PA+ S L Sbjct: 232 SHLGLPKIDKTLEELERMELIPFQESIQQG-AEVVMSAHIVYPKLDPDFPATLSSKILTG 290 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR ++GF GV+ +D + M + + G ++ AG D++L+ + K ++ + Sbjct: 291 ILRDKMGFKGVVITDAMEMNAIDVHYKDRDPGVLAILAGADILLMTSWGKTTRNMKNQ-- 348 Query: 294 PIKAERVTRLYHKGSFS--RQELMDSARWKAISTRLN 328 ++ Y KG+F ++L+ A ++ I +L Sbjct: 349 ------ISTAYKKGTFQKGEKDLLKEAVYRQILLKLK 379 >UniRef50_Q6MRE1 BglX2 protein n=2 Tax=Bdellovibrio bacteriovorus RepID=Q6MRE1_BDEBA Length = 370 Score = 359 bits (922), Expect = 9e-98, Method: Composition-based stats. Identities = 89/342 (26%), Positives = 170/342 (49%), Gaps = 23/342 (6%) Query: 1 MGP-VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----R 54 +G + + V G+ L A+E++ + +GG+ LF RN +P Q+REL +I++ + Sbjct: 7 IGQHMFIGVSGHALTADEKKFIVENNIGGVCLFGRNVAEPKQVRELCAEIQSLRHKQVDK 66 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L + +D EGGRV R + FT P + L A G EM A+ I+ Sbjct: 67 APLFIGIDMEGGRVHRLKAPFTVWPPLRKLGDLDAPTVSFHFANRMG----LEMKAVGIN 122 Query: 115 ISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 + FAP +D+ + IG+RS +DP+ AS + G ++ + T KHFPGHG Sbjct: 123 LDFAPCVDIFTNPGNTVIGDRSISSDPEMVAKHASALVRGYIKSDVITCAKHFPGHGNTI 182 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWL 231 DSH++ P + + + ++ F + +++D +M +H+ + +DP P + S ++ Sbjct: 183 VDSHEDLPVENTDLERLESCELIPFKKTFK-SRVDMVMTSHIKFPKIDPEWPVTLSETFV 241 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 + ++R+E+ + G+I +DDL M+ E +L AG D++L CN+ + Sbjct: 242 RKMIREEMRYRGLIITDDLGMKAMTKHYGIEEVPVRALKAGVDLLLYCNDPE-------- 293 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + P + + +GS +++L +++ + + + ++ Sbjct: 294 VPPQAYDAILGALAQGSLKKEDL-EASYHRIMDFKKVKIQNP 334 >UniRef50_C9NRN7 Beta-hexosaminidase n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NRN7_9VIBR Length = 616 Score = 359 bits (922), Expect = 1e-97, Method: Composition-based stats. Identities = 98/340 (28%), Positives = 156/340 (45%), Gaps = 24/340 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR----LVVAVDQEGG 66 E++ R I+ + VGG+ILF+ N D Q EL+ ++ A L V +DQEGG Sbjct: 19 TEINESVRHIIGNDKVGGVILFSSNLVDINQTGELIWNMQNAVEEDQEVGLFVTIDQEGG 78 Query: 67 RVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV-- 123 V R R T LP + A + +LA E+G ++A E+ ++ +++FAPV+DV Sbjct: 79 NVVRLPRAQATNLPGNMALGAAYLNDNNPELAYESGRILAKEIKSVGFNVNFAPVVDVQS 138 Query: 124 GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 ++ I RSY DPQ + + GM + + T KH+PGHG +DSH P Sbjct: 139 NPLNPVINVRSYGEDPQLVGLLGGQSAKGMADENVIGTFKHYPGHGDTESDSHYGLPIVN 198 Query: 184 RPQAEIRAKDMSVFSSLIREN-KLDAIMPAHVIYSDVDP-------------RPASGSPY 229 + E A D++ + I + D IM AH+ Y +D PA+ S Sbjct: 199 KSIEEAYAIDLAPYKEAIDQGLAPDMIMTAHIQYPALDDSEVVGSRSGETIVVPATLSHK 258 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 +LR+EL + G+ +D L M+G A A+ + AG D+ L+ + Sbjct: 259 IQTELLREELNYQGLTITDALDMQGIAAHFEEADAVIKTFQAGVDIALMPTLFRTTSG-- 316 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + V G S Q + +S + I+T+L + Sbjct: 317 EGQLGELIDEVVAAVEAGELSEQSIDESV-ERIIATKLKR 355 >UniRef50_Q84IZ5 Beta-N-acetylglucosaminidase n=1 Tax=Clostridium paraputrificum RepID=Q84IZ5_9CLOT Length = 413 Score = 358 bits (921), Expect = 1e-97, Method: Composition-based stats. Identities = 87/350 (24%), Positives = 150/350 (42%), Gaps = 26/350 (7%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFT-RNY--HDPAQLRELVRQIRAASR--- 54 G +M V G +D + + GG+IL+ RN+ + V I+ A+R Sbjct: 51 GQMMFYGVNGTNVDDKVVNLFEDQHAGGIILYGHRNFWGSSLDNNVKYVNSIKKANRQNS 110 Query: 55 -NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L + D+EGG + + + R P+ + LA G A ++ + I Sbjct: 111 DIPLFIGFDEEGGSMSQLPQELMRTPSKGELGNTND----SSLATGIGAGTAKKLKLLGI 166 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 + F VLD+ + IG RSY + +K + + G+ T KHFPGHG Sbjct: 167 NTDFGTVLDINTNKNNPIIGVRSYGSTKEKVTEFGINELKAIQNEGVIPTVKHFPGHGDT 226 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYW 230 DSH P +++ ++ F + I N +D +M AH++ +D PA+ S Sbjct: 227 EVDSHLGLPSLNHDLNRLKSTELVPFQTAIN-NGVDMVMTAHIMLPQIDKEYPATMSKKI 285 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 L +LR E+G+ GVI +DDL M+ + E S++AG D++LVC+ + V + Sbjct: 286 LTDLLRDEMGYKGVIITDDLEMQAISKNWDLGEAAIKSVEAGADILLVCHTIENQQKVYN 345 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLN-QLHERWQEEKA 339 V + + G + +S R + + + +L ++ Sbjct: 346 --------AVVQGVNDGKIDENRIDESVR-RILRLKYQYKLSDKANNPTQ 386 >UniRef50_B3QTH0 Beta-N-acetylhexosaminidase n=12 Tax=Chlorobiaceae RepID=B3QTH0_CHLT3 Length = 384 Score = 358 bits (919), Expect = 2e-97, Method: Composition-based stats. Identities = 98/349 (28%), Positives = 171/349 (48%), Gaps = 33/349 (9%) Query: 1 MGPV-MLDVEGYELDAEEREI--LAHPLVGGLILF---------TRNYHDPAQLRELVRQ 48 +G + M+ G + + I + +GG+ILF +RN P Q+ L+++ Sbjct: 46 IGQLLMIGFRGLHVADSSKVIVDIRQQRIGGVILFDYDLPLKQASRNIQSPEQVCALIKE 105 Query: 49 IRAASRNRLVVAVDQEGGRVQRFREGFTRLP---AAQSFAALSGMEEGGKLAQEAGWLMA 105 ++AA+ L++A+DQEGGRV R +E F P +A ++ ++ K A++ A Sbjct: 106 LQAAAPIPLLIAIDQEGGRVNRLKEKFG-FPKSVSAAWLGEVNSLDSTQKYAEQ----TA 160 Query: 106 SEMIAMDIDISFAPVLDVGHI--SAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTT 161 + A+ I+++FAPVLDV + IG RS+ +P+ + H+ G+ + Sbjct: 161 KTLAALGINLNFAPVLDVNTNPENPVIGKLARSFSENPEIVAEHGLATVKAFHQFGVLSA 220 Query: 162 GKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP 221 KHFPGHG+ DSHK + ++ F +I+ + D +M AHV + +D Sbjct: 221 VKHFPGHGSAWNDSHKGLADVT---ETWQPLEIEPFKRVIQAGECDMVMTAHVFNAKLDT 277 Query: 222 R-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN 280 PA+ S + +LR+ELGFDGV+ SDD+ ME + +L+AG D+++ N Sbjct: 278 TFPATLSQNVITGLLRKELGFDGVVVSDDMQMEAIRSFYGLETAVRLALNAGVDLLVFAN 337 Query: 281 NRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLN 328 N + + + + + +G SR+ + S R + +RL+ Sbjct: 338 NSVFEPDIAER----AHQMIRAMVLQGKVSRERIDASYQRLMKLKSRLS 382 >UniRef50_C6WFB0 Glycoside hydrolase family 3 domain protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WFB0_ACTMD Length = 580 Score = 357 bits (918), Expect = 3e-97, Method: Composition-based stats. Identities = 93/354 (26%), Positives = 147/354 (41%), Gaps = 43/354 (12%) Query: 2 GPVMLD-VEGYELD------------AEEREILAHPLVGGLILFTR----NYHDPAQLRE 44 G + + V G D +++ GG+I F N PAQ+ Sbjct: 51 GQLFVTYVHGQSADEVHPGNLRDFGVDTPAQVVTRYRPGGVIYFNNSSYDNIDTPAQIAA 110 Query: 45 LVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLM 104 L ++ ASR L ++ DQE G V R T+ P + + A + A+ A + Sbjct: 111 LSNGLQQASRVPLTISTDQEMGIVTRIGPPLTQFPGSMALGAGRD----ERAAERAASVT 166 Query: 105 ASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAG----- 157 A E+ A+ I +FAP DV + IG RS+ +DP A + + + G + Sbjct: 167 ARELRALGITQNFAPDADVNSNPANPVIGVRSFSSDPDLAATMVAAQVRGYQQRHLSRTA 226 Query: 158 MKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYS 217 + KHFPGHG DSH P R + R D F + I +D+IM AH+ Sbjct: 227 VSAAAKHFPGHGDTADDSHTSLPTVDRTVEQWREVDAKPFRAAI-AAGVDSIMTAHIRMP 285 Query: 218 DVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDM 275 +D PA+ S + +LR+ELG+DGV+ +D L+M G + + AE +L AG D Sbjct: 286 AIDPSGEPATLSKPVVTGLLREELGYDGVVITDSLAMAGVRQLHTDAEIPVLALKAGVDQ 345 Query: 276 ILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 +L+ + V G S + + S + + + + Sbjct: 346 LLMP-----------VKLGEAIDAVVAAVRAGELSERRIDQSV-LRVLRMKFLR 387 >UniRef50_D2BER2 Beta-hexosamidase A n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2BER2_STRRD Length = 427 Score = 357 bits (916), Expect = 5e-97, Method: Composition-based stats. Identities = 97/319 (30%), Positives = 150/319 (47%), Gaps = 24/319 (7%) Query: 18 REILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAASR-NRLVVAVDQEGGRVQRFREG 74 + +A GG+ILF N + Q+ L ++ AS L+V DQE GRV R Sbjct: 91 AKAVARYRPGGVILFPWAGNVKNVRQVVALTNGLQKASPEIPLLVGADQENGRVSRMAPL 150 Query: 75 FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIGE 132 T +P A + LA++A + +E+ A+ I++ FAPV DV + IG Sbjct: 151 VTEMPGASVIGSTGDP----SLARKAAKVTGTELRALGINLDFAPVADVNINPRNPVIGP 206 Query: 133 RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAK 192 R+Y +DP+K + + + G H+AG+ +T KHFPGHG DSH P ++ Sbjct: 207 RAYGSDPKKVAPMVAAAVQGFHDAGIASTAKHFPGHGDTNVDSHSGLPVIQHSLSQWNKL 266 Query: 193 DMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDL 250 D F++ I +N +DAIM AHV+ +D PA+ S L +LR++LGFDGV+ +D L Sbjct: 267 DAPPFAAAIGKN-IDAIMSAHVVMPKLDPSGDPATLSKPILTGLLREKLGFDGVVSTDAL 325 Query: 251 SMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFS 310 M G + ++ AG D++L+ + P V G S Sbjct: 326 DMAGVRKKYGDGQVAVRAIQAGVDLLLMPPDF-----------PKAYGAVLAAVKSGKIS 374 Query: 311 RQELMDSARWKAISTRLNQ 329 L S R + + + + Sbjct: 375 TARLDQSVR-RLLKLKAAR 392 >UniRef50_UPI00016BFC75 Beta-glucosidase-related glycosidase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFC75 Length = 509 Score = 356 bits (915), Expect = 6e-97, Method: Composition-based stats. Identities = 82/341 (24%), Positives = 157/341 (46%), Gaps = 22/341 (6%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNR 56 G + M+ + ++++ ++ VG +ILF RN P Q+ +L++ ++ + Sbjct: 8 GQLFMVGFDDTKINSHIENLIKDFNVGNVILFERNCKTPEQILKLIQNLQKLAIKYNGVP 67 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 + + +DQE G V R EG P + A + EE + + A+ I+ + Sbjct: 68 MFIGIDQENGMVTRIGEGVCTFPGNMAQANGATEEE----IYQVAKYTGEMLEALGINYN 123 Query: 117 FAPVLDVGHISAAI--GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 AP LD+ + G RS+ +P+ + I G +AG+ +T KHFPGHG D Sbjct: 124 LAPSLDINNNPNNPIIGIRSFGENPETVTNMGLACIKGFQDAGIISTAKHFPGHGDTAID 183 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH PC P + + ++ F + I + + +IM AH+I+ +P PA+ SP L Sbjct: 184 SHLTLPCIPHSKERLEKVELYPFKAAI-DAGVKSIMTAHIIFPAYEPVYPATLSPLVLTN 242 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LRQ+LGF +I +D + M+ + + + ++ AG D++ + + K L+ Sbjct: 243 LLRQDLGFHNLIITDCMEMKAISDTYTTQQASVMAIAAGADIVCISHTEKLQREALE--- 299 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 E + + + ++ +A + +S + + + Sbjct: 300 ----ETLIQAEQNMXL-QAKIDTAA-DRILSIKKVLNVDEY 334 >UniRef50_Q3SKU2 Beta-hexosaminidase n=27 Tax=Proteobacteria RepID=NAGZ_THIDA Length = 359 Score = 356 bits (914), Expect = 8e-97, Method: Composition-based stats. Identities = 153/296 (51%), Positives = 198/296 (66%), Gaps = 4/296 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+MLDV G EL ++R L+ PLVGG+ILF+RNY DPAQL L +I A L++ Sbjct: 3 LGPLMLDVAGTELCDDDRRRLSDPLVGGVILFSRNYRDPAQLAALCEEIHALKAAPLLIG 62 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR+GFTRLP +S A+ + +LA+E G+++ASE+ A +D SFA Sbjct: 63 VDHEGGRVQRFRDGFTRLPPMRSLGAIWNEHPQRARELARETGYVLASELRAHGVDFSFA 122 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + S+ IG+R++HADPQ + + GM +AGM GKHFPGHG V ADSH Sbjct: 123 PVLDLDYGASSVIGDRAFHADPQAVFQLGQAVMLGMKDAGMAACGKHFPGHGFVIADSHV 182 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 + P D R +I D+ F L+ E L A+MPAHVIY +VD +PA S WL+ +LRQ Sbjct: 183 DIPVDRRTLGDIARADLVPFR-LMVEAGLAAMMPAHVIYPEVDAQPAGFSRVWLQKILRQ 241 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LGFDG +FSDDL M GAA G ER +A+LDAGCDM LVCN+ + A VL L Sbjct: 242 QLGFDGAVFSDDLCMAGAAFAGGIVERVRAALDAGCDMALVCNHPELADEVLAKLD 297 >UniRef50_C6C0L9 Glycoside hydrolase family 3 domain protein n=1 Tax=Desulfovibrio salexigens DSM 2638 RepID=C6C0L9_DESAD Length = 373 Score = 356 bits (914), Expect = 9e-97, Method: Composition-based stats. Identities = 94/348 (27%), Positives = 161/348 (46%), Gaps = 31/348 (8%) Query: 1 MGPV-MLDVEGYELDAE--EREILAHPLVGGLILFT---------RNYHDPAQLRELVRQ 48 +G + M+ G EL A + VGG+ILF+ RN +QL +L Sbjct: 34 IGQMVMVGFRGLELSANNPVVGDIRDARVGGVILFSKDCALNSTVRNIASSSQLIKLTSD 93 Query: 49 IRAASRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMAS 106 +++ ++ L +++DQEGG ++R GF P+A + A AG + Sbjct: 94 LQSYAQVPLFISIDQEGGIIKRLTGEMGFPETPSAAELGNSGDT----QAAFRAGKTIGK 149 Query: 107 EMIAMDIDISFAPVLDVGHI--SAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTG 162 + I+++FAPV+DV + I RS+ DP+ A +IDG+H G+ + Sbjct: 150 SLKEAGINMNFAPVVDVNRNAANPVIAALQRSFSDDPRIVARFADSYIDGLHSEGIISCL 209 Query: 163 KHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR 222 KHFPGHG+ DSH + ++ F LI +NK D +M AH+ + D + Sbjct: 210 KHFPGHGSSRGDSHLGFTDVTDS---WDSSELYPFQKLISDNKADMVMTAHIFNARWDRK 266 Query: 223 -PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 PA+ S + +LR++LGF+G+I +DD+ M+ + + E ++ AG D++L NN Sbjct: 267 YPATLSRNVIHGLLRRKLGFEGIIVTDDMQMQAVSGEYGFKEGVYRAVKAGADILLFGNN 326 Query: 282 RKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 L + +L H+G + + + S + + + Sbjct: 327 LIYEP----GLGTKAVSTLKQLVHEGKITERRIRQS-YDRIMREKQGM 369 >UniRef50_C0EVT4 Putative uncharacterized protein n=1 Tax=Eubacterium hallii DSM 3353 RepID=C0EVT4_9FIRM Length = 409 Score = 355 bits (912), Expect = 1e-96, Method: Composition-based stats. Identities = 86/344 (25%), Positives = 154/344 (44%), Gaps = 32/344 (9%) Query: 1 MGPVML------DVEG-YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G + + D + E+ + ++ L GG+I F+ N D Q++ ++ I+ ++ Sbjct: 67 IGQMFIVCTDSLDFDAETEVTKKMQQKLEEYKPGGVIFFSYNLKDREQVKSMIADIQKSN 126 Query: 54 RNRLVVAVDQEGGRVQRFREG----FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMI 109 L ++VD+EGG V R T+ PA ++ A E G + E+ Sbjct: 127 DIPLFISVDEEGGSVARIANSKNMQTTKFPAMSEIGKSGDSKK----ACEVGETIGKEIR 182 Query: 110 AMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPG 167 + ++ FAPV DV + IG RS+ +D + + S+ + G+ G+ +T KHFPG Sbjct: 183 ELGFNLDFAPVADVNTNAENTEIGNRSFSSDAKTVAGMVSQEVKGLQSQGVSSTLKHFPG 242 Query: 168 HGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPAS 225 G D+HK +R + F S I D +M +HV S V PAS Sbjct: 243 QGQCGEDTHKGYVDLNATIDRLREVEFLPFESGI-AAGADMVMMSHVAVSQVTGKETPAS 301 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 + + +LR+EL F+ VI +D ++M+ + ++ AG DMIL+ +N + A Sbjct: 302 LTKLMVTDILREELQFNNVIITDAMNMKVITKFYDADQAAVMAVQAGNDMILMPDNFEQA 361 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 E V G+ S ++ ++ + +S ++ + Sbjct: 362 F-----------EGVLEAVKDGTISESKINEAV-SRILSVKIRR 393 >UniRef50_A1SW90 Beta-hexosaminidase n=3 Tax=Alteromonadales RepID=NAGZ_PSYIN Length = 337 Score = 354 bits (910), Expect = 3e-96, Method: Composition-based stats. Identities = 164/293 (55%), Positives = 214/293 (73%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M PV+LDVEGYELD+EE+EILAHPLV G+ILFTRNY+D QL+ LV+ IR + N L++A Sbjct: 1 MRPVILDVEGYELDSEEKEILAHPLVAGIILFTRNYYDIEQLKALVKDIRRYAGNELLIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD EGGRVQRFR+ FTRLP+A S + M+ +LA + W+MASE+IA DID SFAPV Sbjct: 61 VDHEGGRVQRFRDDFTRLPSAGSLIEKNDMKTACELAFSSAWVMASELIACDIDFSFAPV 120 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+ IS I R++ + + + +A +I+GM AGM +TGKHFPGHG+V ADSH P Sbjct: 121 LDLNGISNVIQNRAFSSSITETVTLAEAYINGMKSAGMVSTGKHFPGHGSVEADSHTALP 180 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R + EI KD+ F +LI++ LDA+MP+HV+YS D +PA S YWL VLR LG Sbjct: 181 VDSRSELEIFTKDIKPFENLIKKGALDAVMPSHVVYSQCDLQPAGFSSYWLDDVLRTRLG 240 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 F GV+ SDDLSM GA+ +G++ R ++++ AGCD+IL CN+R GAVS+LDNL Sbjct: 241 FKGVVISDDLSMHGASFVGNHLSRAESAIQAGCDLILACNDRSGAVSILDNLK 293 >UniRef50_A8F7W1 Glycoside hydrolase family 3 domain protein n=1 Tax=Thermotoga lettingae TMO RepID=A8F7W1_THELT Length = 502 Score = 354 bits (909), Expect = 3e-96, Method: Composition-based stats. Identities = 100/334 (29%), Positives = 163/334 (48%), Gaps = 23/334 (6%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ ++G EL E +L G +ILF+RN P QL +L++++R + +V ++DQE Sbjct: 12 MVGIDGTELTDEAARVLEQVKPGFIILFSRNIESPEQLYQLIKELRKVVKKPVVFSIDQE 71 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG V R R GFT A + AA A AG ++A EM ++ I + APVLD+ Sbjct: 72 GGIVTRLRNGFTVSCGAMALAATKDENN----AYLAGKILALEMKSLGITWNLAPVLDIN 127 Query: 125 --HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + IG RS+ PQK + A F G+ E + GKHFPG G+V D+H + P Sbjct: 128 DNPNNPGIGVRSFGDTPQKVIKFAKAFYKGLSEESVAACGKHFPGKGSVEVDAHLDMPVL 187 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELG 240 + E++ ++ F L++E L++IMP+H+ + D PA+ S L +LR+ L Sbjct: 188 QKSLEELKRWELIPFVELMKEG-LESIMPSHIHLPQLQPDRLPATVSKEILTDLLRKTLN 246 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 + G+ +DDL M G S + S +G D++ +C+ + +S + + Sbjct: 247 YHGIAVADDLLMGGITKAMSVEDAVINSFKSGMDVLSLCHEPQAQISA--------KKAL 298 Query: 301 TRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 K + L +S R+ E++ Sbjct: 299 LEQIKKNPELEKRLSES------LGRIKNFKEKF 326 >UniRef50_C5CGK3 Glycoside hydrolase family 3 domain protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CGK3_KOSOT Length = 528 Score = 353 bits (906), Expect = 7e-96, Method: Composition-based stats. Identities = 103/339 (30%), Positives = 164/339 (48%), Gaps = 21/339 (6%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-NRLVV 59 G + +G + + + ++ V G ++F RN D Q++ELV QI+ V Sbjct: 30 GQLFLFGFQGISYNDQIQLLIDKG-VRGFVIFGRNVKDKDQVKELVEQIKTHCDDIPPFV 88 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 AVDQEGG V R R+ F P A +++ E L + G+L + + I++ FAP Sbjct: 89 AVDQEGGMVARIRDIFVP-PNAMLLGSVNDTE----LTYKVGYLTGKALRDLGINMDFAP 143 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV + IG RS+ +P + F G+ AG+ GKHFPGHG DSH Sbjct: 144 VLDVNSNPFNPIIGVRSFWNEPTRVATNGVAFFKGLLSAGVIPVGKHFPGHGDTDTDSHT 203 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P +P++ D+S F + ENK+ A+M AH++ D PA+ S + + LR+ Sbjct: 204 SLPVINKPESNFIETDVSPFKIAV-ENKIPALMTAHILLPFWDNVPATISKKII-SYLRE 261 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 +LGFDGVI SDD+ M+ + S E + SL AG DM ++ + + + V A Sbjct: 262 KLGFDGVIISDDMLMKAVSEGKSVKEAVKQSLLAGVDMFIIWKDLETQLEV--------A 313 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 + + + G+ + + ++ + + RL E Sbjct: 314 DYILQEVKNGNIPEEVIDNAVK-RITKLRLLAYASELNE 351 >UniRef50_D2PU34 Glycoside hydrolase family 3 domain protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PU34_9ACTO Length = 606 Score = 352 bits (905), Expect = 9e-96, Method: Composition-based stats. Identities = 95/365 (26%), Positives = 147/365 (40%), Gaps = 44/365 (12%) Query: 1 MGPVMLDVEGYELDAEE--------------REILAHPLVGGLILF--TRNYHDPAQLRE 44 +G + + GY DA + E++A GG I F N +P QL Sbjct: 62 IGQMFVLF-GYGPDAHKPDQRNTNLYGVATPAEVVAKYKPGGWIYFNARDNVQNPTQLAT 120 Query: 45 LVRQIRAAS-----RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQE 99 Q++ + R + +A DQE G V R T+ + A + A+ Sbjct: 121 YSNQLQTVATSTGIRVPMTIATDQEQGVVVRIGPPATQFGGNMAHGAARSTAD----ART 176 Query: 100 AGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMH-EA 156 A + E+ AM I +APV DV + IG RS+ +DPQ + + G +A Sbjct: 177 AAGITGRELKAMGIRQDYAPVADVNVNALNPVIGVRSFSSDPQLVSDLTVAQVQGYQKDA 236 Query: 157 GMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIY 216 G+ T KHFPGHG DSH P + + D F + I+ +D+IM AH++ Sbjct: 237 GIIATAKHFPGHGDTVDDSHTSLPTINHTREQWNTIDAPPFKAAIK-AGIDSIMTAHIVI 295 Query: 217 SDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCD 274 +D PA+ S + VLR+ELGF G+I +D L M AE +++AG D Sbjct: 296 PSLDPSGDPATLSKPIMTGVLRKELGFKGLIITDALEMAAVRAKYGDAEVAVRAIEAGVD 355 Query: 275 MILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHER 333 +L+ V G S + + +S R + + L Sbjct: 356 QLLLPPAPDVQFRA-----------VVDAVKSGRISERRIDESLMRILLLKLKKGVLFHP 404 Query: 334 WQEEK 338 + + Sbjct: 405 FVDPA 409 >UniRef50_C1YLC2 Beta-glucosidase-like glycosyl hydrolase n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YLC2_NOCDA Length = 558 Score = 352 bits (904), Expect = 1e-95, Method: Composition-based stats. Identities = 100/352 (28%), Positives = 155/352 (44%), Gaps = 29/352 (8%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RN 55 +G +++ AE + GGLI F N D Q+ + ++ + Sbjct: 64 IGQLLVLTAQGTSAAENAAQIEAYRPGGLIYFDANLTDAEQIATMSAGVQDLAADQGRGV 123 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L V +DQE G V R G TR P A + A E LA+ A E+ A+ +++ Sbjct: 124 PLFVGIDQEQGLVARLPVG-TRFPDAMAVGATRDTE----LAELRASTTAEELTALGVNL 178 Query: 116 SFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 ++AP DV + IG RS+ +DP +A D AG+ + KHFPGHG Sbjct: 179 NYAPDADVNTDPGNPVIGIRSFGSDPDLVAQMAVAESDAYSGAGVVSVVKHFPGHGDTDV 238 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP----RPASGSPY 229 DSH P P+ + A + F + I + +DAIM AHV+ +D PA+ SP Sbjct: 239 DSHSGLPVIDMPREQWEAGHLPPFRAAI-DADVDAIMTAHVLMPQLDGSEDPEPATISPE 297 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + +LR ELG+DGV+ +D L+MEG S E L+AG D +L+ + AVS Sbjct: 298 LIDGILRDELGYDGVVTTDALNMEGVRQRHSDGEIAVRVLEAGVDQLLMPPDPAAAVSA- 356 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEKAG 340 + +G + + + +S R A+ + L + + Sbjct: 357 ----------IREAVEQGRLTEERIDESVLRVLALKEKRGILEAEPVDAQGA 398 >UniRef50_Q08U63 Beta-glucosidase n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08U63_STIAU Length = 611 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. Identities = 105/354 (29%), Positives = 157/354 (44%), Gaps = 30/354 (8%) Query: 1 MGP-VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS--RNRL 57 +G +M+ G LD E ++ VGG+ LF N H Q+ L +R Sbjct: 76 IGQVMMVGFRGTVLDEEVESLVRGRRVGGVCLFKHNIHSARQVARLNDGLRRLLADHIPP 135 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 VA+DQEGG V R + RLP + A E LA AG ++ + +++ Sbjct: 136 FVALDQEGGNVVRVSDQVVRLPGNMALGATRSTE----LAYAAGRAQGEDLRRLGFNMNL 191 Query: 118 APVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APVLDV + IG RSY + F G EAG+ T KHFPGHG+ + DS Sbjct: 192 APVLDVNLNPHNPVIGIRSYGDSVSLVSEMGRAFARGQQEAGLVTVAKHFPGHGSTSTDS 251 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKT 233 H+ P + E+ M F ++++E LD +M AHV + D PA+ SP L+ Sbjct: 252 HEVLPVMRETREEVL-TQMEPFRAVLQEG-LDGLMTAHVAIPGLTGDSVPATLSPQVLEG 309 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR++LGFDG++ +D+L ME ++ AG DM+LV + V Sbjct: 310 LLRRDLGFDGLVLTDELEMEAIVQRYGVGRAAVLAMKAGADMVLVPWRPEKKTEVY---- 365 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ--------LHERWQEEKA 339 E + H+G + L + R + + +L + L ER Sbjct: 366 ----EALLDAAHEGELPPERLEQAVR-RILIAKLRRGLFEAPPLLEERLAAPPP 414 >UniRef50_A8L1X4 Glycoside hydrolase family 3 domain protein n=3 Tax=Actinomycetales RepID=A8L1X4_FRASN Length = 656 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. Identities = 96/360 (26%), Positives = 157/360 (43%), Gaps = 47/360 (13%) Query: 2 GPVMLDV----EGYELDAEE-------------REILAHPLVGGLILFT----------R 34 G +M+ G + E +A +GG+I F Sbjct: 125 GQMMMGYVFGTAGADRSPAVVTANRRTSGVDTAAEAVARRGLGGVIYFDAGGTGPGALPD 184 Query: 35 NYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGG 94 N +P Q++ L + AA+ L++A DQE G V R R+G T LP + A + Sbjct: 185 NIVNPNQVKTLSADLSAAASIPLLIAADQEQGTVLRVRDGVTLLPGQMAQGATGRPTD-- 242 Query: 95 KLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDG 152 A++A + +++ A+ I++ FAP DV + IGERS+ DP + ++G Sbjct: 243 --ARDAAQITGADLRALGINVDFAPDADVNSDPANPVIGERSFGDDPTAVGRFTAAAVEG 300 Query: 153 MHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPA 212 + G+ KHFPGHGA + DSH + P R +A + A D+ F + I + +M Sbjct: 301 YRQVGVAAAAKHFPGHGATSVDSHADLPTITRDRAALTALDLPPFRAAI-AAGVPMVMVG 359 Query: 213 HVIYSDVDP-RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDA 271 H+ +DP PA+ S + +LR ELGFDGVI +D L+M + ++ A Sbjct: 360 HLNVPALDPAAPATLSKPVVDGLLRHELGFDGVIVTDALNMAAITEHNTPGGAAVRAVQA 419 Query: 272 GCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLH 331 G DM+L+ + A+ + V G+ + + S R + + + H Sbjct: 420 GVDMLLMPPDLTQAL-----------DAVVSAVRSGAIVPERIDASVR-RILRMKWRLAH 467 >UniRef50_P48823 Beta-hexosaminidase A n=3 Tax=Alteromonadales RepID=HEXA_PSEO7 Length = 598 Score = 351 bits (901), Expect = 2e-95, Method: Composition-based stats. Identities = 90/340 (26%), Positives = 152/340 (44%), Gaps = 24/340 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----NRLVVAVDQEG 65 L E E+++ +GG ILF N + AQ+ L +++A++ L +A+DQEG Sbjct: 49 TTLPPELSELISRYDIGGAILFAENVQNTAQIISLTNALQSAAQQSKSQLPLFIAIDQEG 108 Query: 66 GRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GRV R RE T S A ++G A + + E+ ++ I+++FAP +DV Sbjct: 109 GRVARINREQATSFTGNMSIGATY-PKQGDIYATKVASAIGKELNSLGINVNFAPTVDVN 167 Query: 125 --HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + I RS+ +P + + AG+ + KHFPGHG DSH P Sbjct: 168 SNPNNPVINVRSFSENPTVVTKLGLAQVKAFEAAGVLSALKHFPGHGDTHVDSHTGLPRV 227 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYW 230 + +I +D+ F+ +I+ + IM AH+ Y +D RPA+ S Sbjct: 228 DHDRDKINQQDLLPFAEIIKASPPGMIMTAHIQYPALDNSKVVNSQGESMIRPATMSYQI 287 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 + +LR ELG+ GV +D L M G + + + + +AG D+ L+ + Sbjct: 288 MTQLLRHELGYQGVTVTDALDMAGISDFFNPVDATIETFNAGVDIALMPIAIRN--RADI 345 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQ 329 ++ ++++L S AR + T+L Q Sbjct: 346 KRFEQYMAQLADALETNKLNQEQLSSSMARIAKLKTKLPQ 385 >UniRef50_Q47M03 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase family 3 n=1 Tax=Thermobifida fusca YX RepID=Q47M03_THEFY Length = 552 Score = 350 bits (900), Expect = 4e-95, Method: Composition-based stats. Identities = 93/326 (28%), Positives = 147/326 (45%), Gaps = 26/326 (7%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNR 56 G +++ V E E++ GG I F N P Q+ + ++ + Sbjct: 59 GQLLVPVLSGTTAEENAEVIERYHPGGFIYFPENLETPEQVAAMSNGLQERATGTGAGIP 118 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L + VD+E G V R G R P A + A E LA+E A ++ A+ I++ Sbjct: 119 LFLGVDEEQGLVSRLPFGA-RFPDAMALGAARDPE----LARELAAATAEQLAAVGINLD 173 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 +APV DV + IG RS+ +DP+ +A+ + + G+ KHFPGHG D Sbjct: 174 YAPVADVNVNADNPVIGIRSFSSDPELVGELAAAEVAAFQDGGVVAVAKHFPGHGDTDVD 233 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLK 232 SH P + + E D+ F L+ + +D +M AHV+ + PA+ SP+ + Sbjct: 234 SHTGLPVIDKSRDEWERVDLPPFQRLV-DAGVDMVMTAHVLMPQLGDAEEPATLSPHLIT 292 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 VLR+ELG+DGV+ +D L+MEG E ++ AG D +L+ + A S Sbjct: 293 GVLREELGYDGVVTTDALNMEGVRQTHDDGEVAVRAVLAGADQLLMPPDVDLAYSA---- 348 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSA 318 V +G S + L +S Sbjct: 349 -------VLSAVEEGRISEERLDESV 367 >UniRef50_A6NU14 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NU14_9BACE Length = 435 Score = 349 bits (897), Expect = 7e-95, Method: Composition-based stats. Identities = 97/334 (29%), Positives = 157/334 (47%), Gaps = 22/334 (6%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR--NRLV 58 G + + G E ++ VGGLI F RN +L LV +RA + + Sbjct: 88 GQLFIAGFYGTEDGDYVDSLIRDYKVGGLIFFGRNVGTAEELVSLVNDLRAKNGDYIPMF 147 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +VDQEGG V+R + + L A ++ E G L G ++A+E +A ++ FA Sbjct: 148 FSVDQEGGTVERLPDEVSPLEDAYTYGQSGSSEVGYAL----GQVLANECVAFGFNLDFA 203 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P LD+ + IG R++ + A+ G+ + G+ KHFPGHG DSH Sbjct: 204 PSLDIWSNPENTVIGTRAFGTTAEAVEAVGPWAAYGLMDGGVIPVVKHFPGHGDTAVDSH 263 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLI--REN-KLDAIMPAHVIYSDVD-PRPASGSPYWLK 232 P + E+ ++ F S I RE + A+M AH++ + +D PAS S + Sbjct: 264 VGLPTVSKTVDELLTSELIPFQSAIEGREGEGVPAVMVAHILMTAIDQEHPASLSQAVVT 323 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR++LGFDGV+F+DDL+M E +L+AGCD++LVC+N + Sbjct: 324 GLLREQLGFDGVVFTDDLTMGAITENYGLDEAAVLALEAGCDVLLVCHN--------EGD 375 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + V G + + + S + +S + Sbjct: 376 LALARQAVLDAVASGRLTEERIDRSVY-RILSLK 408 >UniRef50_UPI0000E0E41F Beta-hexosaminidase A precursor n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0E41F Length = 599 Score = 349 bits (897), Expect = 7e-95, Method: Composition-based stats. Identities = 90/343 (26%), Positives = 153/343 (44%), Gaps = 23/343 (6%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA---ASRNRLVVAVDQEGGR 67 L E ++ +GG+ILF+ N Q+++L I+ SR + +A+DQEGGR Sbjct: 47 TALPNELASLIKKHQLGGVILFSENVQSVTQIQQLTSDIQKLRPHSRLPMFIAIDQEGGR 106 Query: 68 VQRFREGFTR-LPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG-- 124 V R F+ + A G A + G + ++ ++ I+++FAP +DV Sbjct: 107 VARLPTAFSPAYSGNMAIGATYDA-HGTAYAAKVGEYIGRDLASVGINVNFAPSIDVNAN 165 Query: 125 HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + I RS+ + + + M G+ T KHFPGHG DSH P Sbjct: 166 PSNPVINVRSFGESAEMVSELGGALVQHMQSQGVLTALKHFPGHGDTVVDSHTGLPQVNH 225 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYWLK 232 I +D++ F +I+E++ +M AH+ Y +D RPA+ SP L Sbjct: 226 DLPTIMKQDIAPFKHIIKEHEPAFVMTAHIQYPQLDNSTLLAKNGTEQVRPATLSPVILT 285 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR LGF G++ +D L M G A S E + +AG D+ L+ + + + Sbjct: 286 DLLRGTLGFKGIVITDALDMAGIAQYFSPQEAIIETFNAGADIALMPYTIRN-LEDIQAF 344 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 S + E +L G F+ + ++ + ++T+ + Q Sbjct: 345 SSLMDELERQLEKSGDFTA---LFASYTRIMNTKQKLVSSPRQ 384 >UniRef50_B3E4C8 Glycoside hydrolase family 3 domain protein n=3 Tax=Desulfuromonadales RepID=B3E4C8_GEOLS Length = 395 Score = 349 bits (896), Expect = 9e-95, Method: Composition-based stats. Identities = 97/351 (27%), Positives = 166/351 (47%), Gaps = 31/351 (8%) Query: 1 MGPVML-DVEGYELDAE--EREILAHPLVGGLILF---------TRNYHDPAQLRELVRQ 48 + ++L G L+ + + + GG++LF RN PAQL++L Sbjct: 53 IAQMLLLGFPGETLEPDSPIDQAIRRYGAGGVVLFDNNIDLGVTGRNITAPAQLKQLTTA 112 Query: 49 IRAASRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMAS 106 ++ AS L++AVDQEGG V R ++ GF +A+ + L + + +A+ Sbjct: 113 LQQASELPLLIAVDQEGGVVARLKDRYGFPATVSAKYLGQ----QNRLDLTRSSSENLAA 168 Query: 107 EMIAMDIDISFAPVLDV--GHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTG 162 + +++ APV+D+ + I ERS+ ADP A A+ FI H + T Sbjct: 169 TLDEYGFNLNLAPVVDLATNPDNPVIAFKERSFSADPALVAAHAAEFIKSHHRHHILTCL 228 Query: 163 KHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR 222 KHFPGHG+ DSH ++ + LI + D +M AH + +DP Sbjct: 229 KHFPGHGSSRDDSHLGLVDVT---RYWHENELQPYRDLISQGLCDMVMTAHTFNTAIDPD 285 Query: 223 -PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 PA+ S + +LR++LGFDGV+ SDDL M SY + +++AG D+++V N+ Sbjct: 286 HPATLSRATIDGILRKQLGFDGVVVSDDLYMGAIIQHYSYETAVEKAINAGVDLLVVAND 345 Query: 282 RKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 + + ++ P E + L +G R+ + ++R + I+ + L Sbjct: 346 KLYSPDIM----PRTIELLLNLVQQGRIPRERIEQASR-RIIAMKQRLLKP 391 >UniRef50_B3TAA3 Putative glycosyl hydrolase family 3 N terminal domain protein n=2 Tax=environmental samples RepID=B3TAA3_9ARCH Length = 392 Score = 349 bits (896), Expect = 1e-94, Method: Composition-based stats. Identities = 107/335 (31%), Positives = 158/335 (47%), Gaps = 26/335 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----NRLV 58 ++ EG LDA E++ +GGLILF RNY +P QL +L +++ + L Sbjct: 2 IISGFEGTTLDARTEELIVQQGIGGLILFERNYKNPDQLLQLTHDLQSLAAGHAEIPPLF 61 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 ++VDQEGGRV R + FT+ P +E LA G M E+ A+ I++ +A Sbjct: 62 ISVDQEGGRVARLGKPFTQFPPMSCLGQADSVE----LAYRFGLGMGKELRAVGINMDYA 117 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLDV + IG R+ DP K + + I G ++AG+ GKHFPGHG + DSH Sbjct: 118 PVLDVHSNPENPIIGLRALDTDPAKVARLGAELIRGFYDAGVIPVGKHFPGHGDTSQDSH 177 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 P R + + ++ F I++ L+ IM AHV+Y D R PA+ S L VL Sbjct: 178 LTLPQVERARESLEKSELIPFIHAIQQG-LEVIMTAHVVYPAWDARQPATFSRSILNDVL 236 Query: 236 RQELGFDGVIFSDDLSMEGAAIM-GSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 R L F+G+I SDDL M+ S E G AG D+ L+C++ + Sbjct: 237 RNSLQFEGLIISDDLDMQAIDEKLESIPELGTR---AGVDLFLICHDLEKVC-------- 285 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + R R + S + + Sbjct: 286 TLQDSMIRDIETARIPRTTIKQSL-GRIFKVKKKL 319 >UniRef50_Q1K0E5 Glycoside hydrolase, family 3-like n=1 Tax=Desulfuromonas acetoxidans DSM 684 RepID=Q1K0E5_DESAC Length = 411 Score = 348 bits (894), Expect = 1e-94, Method: Composition-based stats. Identities = 98/353 (27%), Positives = 168/353 (47%), Gaps = 31/353 (8%) Query: 1 MGPVM-LDVEGYELDAE--EREILAHPLVGGLILF---------TRNYHDPAQLRELVRQ 48 +G ++ + G L+ + +GG+ILF RN P+QL++L Sbjct: 48 IGQLLLVGFRGQTLEQAPTLVNDIQKRHLGGVILFDYDVQLRQSGRNIASPSQLKQLTSD 107 Query: 49 IRAASRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMAS 106 ++A S L++A+DQEGGR+ R + GF + + A + + +L E G +A+ Sbjct: 108 LQALSNLPLLIAIDQEGGRIARLKPAQGFPVTLSHRELAEQADV----RLTFEHGKKLAT 163 Query: 107 EMIAMDIDISFAPVLDV--GHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTG 162 + I+++ APV+D+ + I +R + A P + A +I G H+ + T Sbjct: 164 TLATFGINLNLAPVVDLCSNPDNPVIARLDRCFSATPDQVTDQAGAYIAGHHQTAVLTCL 223 Query: 163 KHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-P 221 KHFPGHG+ T DSH+ + +++ + LI+ ++D IM AHV S +D Sbjct: 224 KHFPGHGSSTTDSHQGFTDITQT---WNEEELVPYRELIQHGRVDTIMTAHVFNSHLDAN 280 Query: 222 RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 PA+ S + +LR LG++GV+ SDDL M+ + + +L+AG DM++ NN Sbjct: 281 DPATLSRPIITGLLRGVLGYEGVVISDDLQMKAISAHYGLETAIEKALNAGVDMLVFGNN 340 Query: 282 RKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHER 333 ++ I + RL +G S + +S R + RL +R Sbjct: 341 LSYNEHSVEQAVTI----IQRLIKQGKVSEARIDESWRRITMLKRRLTVKEQR 389 >UniRef50_Q31G32 Beta-hexosaminidase n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=NAGZ_THICR Length = 357 Score = 347 bits (892), Expect = 3e-94, Method: Composition-based stats. Identities = 144/321 (44%), Positives = 194/321 (60%), Gaps = 8/321 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G +M+D+E L E+E L PLV G+ILF+RNY QL+ L +I +L++A Sbjct: 11 LGCIMVDLESTTLQPHEKERLLDPLVAGVILFSRNYESIEQLQALTTEIHQLRHPKLLIA 70 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRF+EGF+ LPA +EG +LAQ+ GWLMA+E++A+ +D SFA Sbjct: 71 VDHEGGRVQRFKEGFSMLPAMGQLGKCFRANEKEGLELAQQVGWLMATELLAVGVDFSFA 130 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PV+D+ + S IG+R++ +DP + + GM +AGM + KHFPGHG + AD+H Sbjct: 131 PVVDLDYGDSRVIGDRAFDSDPVIVGKLGKALVQGMRDAGMASVAKHFPGHGYIQADTHL 190 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E D R EI KD+ F LI EN LDA+MPAHV Y VD PA S WL+ VLRQ Sbjct: 191 EVAVDHREFQEIAHKDIQPFLKLI-ENGLDAVMPAHVRYPKVDDLPAGFSKVWLQEVLRQ 249 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS---- 293 + FDG I SDD+SM A G R A+L AGCD++LVCN+ A VL ++ Sbjct: 250 QCYFDGAIISDDMSMHAATEFGDAPTRVTAALKAGCDLVLVCNDPVAADEVLSQVTWETG 309 Query: 294 PIKAERVTRLYHKGSFSRQEL 314 P+ R+ RL+ KG +L Sbjct: 310 PLSHARLIRLHGKGHLKLSQL 330 >UniRef50_D2PT32 Glycoside hydrolase family 3 domain protein n=26 Tax=Actinomycetales RepID=D2PT32_9ACTO Length = 599 Score = 347 bits (890), Expect = 5e-94, Method: Composition-based stats. Identities = 87/323 (26%), Positives = 146/323 (45%), Gaps = 31/323 (9%) Query: 18 REILAHPLVGGLILFTR--NYHDPAQLRELVRQIRAAS-------RNRLVVAVDQEGGRV 68 E++A +GG+I F + +P Q+ L ++ AS L+++ DQE G V Sbjct: 83 AEVVAKYKLGGVIYFAWTDSVANPPQIAALSNGLQQASLAVGDKVSVPLLISTDQEHGVV 142 Query: 69 QRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI-- 126 R T+ P A + A + + A + +E+ A+ ++ ++APV DV Sbjct: 143 FRVGPPATQFPGAMALGAGRSTADAREAA----AIAGAELRAVGVNQAYAPVADVNVNAL 198 Query: 127 SAAIGERSYHADPQKALAIASRFIDGMHEAG-MKTTGKHFPGHGAVTADSHKETPCDPRP 185 + IG RS+ +DP + + + G+ G + + KHFPGHG DSH P Sbjct: 199 NPVIGVRSFSSDPTLVADLTAAQVTGLEGGGGIASCAKHFPGHGDTVDDSHTSLPTINHT 258 Query: 186 QAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGFDG 243 + + D F + I E +D+IM AH++ +D PA+ S L +LR++LG+DG Sbjct: 259 REQWNTIDAPPFEAAI-EAGIDSIMTAHIVVPSLDPSGEPATLSKPILTGILREQLGYDG 317 Query: 244 VIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRL 303 V+ +D L+M G AE +L+AGCD +L+ A + V Sbjct: 318 VVITDSLAMAGVRQKYGDAEVAVRALEAGCDQLLMSPAMDVAYNA-----------VLAA 366 Query: 304 YHKGSFSRQELMDSARWKAISTR 326 G + + L S + + + Sbjct: 367 VRSGRITEKRLDQSVY-RVLRLK 388 >UniRef50_Q2JJA4 Glycosyl hydrolase domain protein n=1 Tax=Synechococcus sp. JA-2-3B'a(2-13) RepID=Q2JJA4_SYNJB Length = 511 Score = 346 bits (888), Expect = 8e-94, Method: Composition-based stats. Identities = 93/335 (27%), Positives = 159/335 (47%), Gaps = 16/335 (4%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAASRNRLV 58 G + + EG E + E++ VGGLI + N Q+ +L+ +A + L Sbjct: 13 GQLFLVYFEGAEFSSALEEMIRDYHVGGLIFYSIAGNIQTLPQVAQLIAAAQAQALVPLW 72 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +A+DQEGG V R EG T P+A + AA + + A+ + +E+ ++ +++FA Sbjct: 73 IAIDQEGGPVVRLTEGVTVFPSAMAVAAAGSLAK----AEAMARAVGTELKSLGFNLNFA 128 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PV+DV + IG R++ +DP++ G +AG+ T KHFPGHG DSH Sbjct: 129 PVVDVNSNPHNPIIGIRAFGSDPKQVTDYGLALWRGYRQAGILCTPKHFPGHGDTDIDSH 188 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 P RP ++ ++ F +LI + +A+M AHV+ + PA+ +P + L Sbjct: 189 LGLPRVDRPLEALQTTELRPFQALI-QAGAEAVMTAHVVMPALGSVRPATLAPEVVSGWL 247 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 RQ LGF G++ +D L+M A E + + AG D+++ + + ++ Sbjct: 248 RQRLGFQGLVLTDSLTMGAIARTYGIPEAAELAFRAGADVLVFGADPGFSPAI----QKE 303 Query: 296 KAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQ 329 + R + G S ++L S R + R Sbjct: 304 AYAHLLREFRAGRLSVEQLDRSVERILTLKRRYGL 338 >UniRef50_Q4QLU8 Beta-hexosaminidase n=138 Tax=Gammaproteobacteria RepID=NAGZ_HAEI8 Length = 351 Score = 346 bits (888), Expect = 9e-94, Method: Composition-based stats. Identities = 180/348 (51%), Positives = 240/348 (68%), Gaps = 8/348 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +++D++G EL+ EE E+L+HPLV GLILFTRN+ D Q++EL+R +R + L++ Sbjct: 1 MSTLLIDLKGKELEQEEVELLSHPLVAGLILFTRNFEDREQIQELIRSVRQRVKKPLLIT 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRVQRFR+GFT LP+ Q+F E A+EAGW MA+EMIA+DID+SFAPV Sbjct: 61 VDQEGGRVQRFRDGFTMLPSMQAFQETLSATEQVSFAKEAGWQMAAEMIALDIDLSFAPV 120 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+GH AIG+RS+ +D + A+ +A FIDGMH+AGM +TGKHFPGHG V ADSH ETP Sbjct: 121 LDLGHECRAIGDRSFSSDVKSAVNLAINFIDGMHQAGMASTGKHFPGHGHVLADSHLETP 180 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R + EI D+ F LI +NKLDAIMPAHVIY D +PASGS YWLK +LR++L Sbjct: 181 YDDRAKDEIFGYDLQPFQQLISQNKLDAIMPAHVIYGQCDSQPASGSEYWLKEILRKKLN 240 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE-- 298 F G IFSDDL M+GA +MG++ ER + +L+AGCD++L+CN R+G + VLDNL + E Sbjct: 241 FQGTIFSDDLGMKGAGVMGNFVERSKKALNAGCDLLLLCNEREGVIQVLDNLKLTENEPH 300 Query: 299 ------RVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 R+ L+ + +L+ RWK L + RW + +A Sbjct: 301 FMARQARLKNLFKRSVIEWNDLVLDQRWKLNHRNLADIQSRWLDIQAA 348 >UniRef50_B3QRM6 Glycoside hydrolase family 3 domain protein n=1 Tax=Chlorobaculum parvum NCIB 8327 RepID=B3QRM6_CHLP8 Length = 570 Score = 345 bits (887), Expect = 1e-93, Method: Composition-based stats. Identities = 83/354 (23%), Positives = 148/354 (41%), Gaps = 35/354 (9%) Query: 2 GPVMLD-----VEGYELDAE--EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR 54 G +++ G + +GG++ + L + + S Sbjct: 49 GQMIVAKVYARYTGDNDPEYRLISRLATEGKIGGIMFLKGDLESA---ARLANRFQNLSA 105 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L+V+ D E G R +G T PAA + +A E LA+ G ++A E A+ I Sbjct: 106 VPLLVSADMERGLAMRL-DGATEFPAAMAVSAAGDPE----LARSMGEIIAREARAVGIQ 160 Query: 115 ISFAPVLD--VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 ++AP +D + + I RS+ +++ FIDGM G+ T KHFPGHG VT Sbjct: 161 QNYAPTVDLNINPANPVINTRSFGDRIPLVNTMSAAFIDGMQSNGLVATAKHFPGHGDVT 220 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYW 230 DSH P + + ++ F S I + + ++M H+ + PAS S Sbjct: 221 VDSHLALPVLEGDRKRLERYELQPFRSAISQG-VMSVMVGHLAVPKLTGSLEPASLSRKI 279 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMG-SYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + +LR++LGF+G+I +D L+M+ G + E +++AG D++L + + Sbjct: 280 VTGLLRKDLGFNGLIVTDALNMKALQSGGLTPEEVAVKAVEAGNDILLYPEDPERTFEA- 338 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR--LNQLHERWQEEKAGH 341 V G S + + S R + + + + ++ + H Sbjct: 339 ----------VCAAVESGKISERRIDQSVR-RILLVKHWVGLDRQKLVDPSRLH 381 >UniRef50_Q9RUP9 Glycosyl hyrolase, family 3 n=2 Tax=Bacteria RepID=Q9RUP9_DEIRA Length = 348 Score = 345 bits (885), Expect = 2e-93, Method: Composition-based stats. Identities = 91/302 (30%), Positives = 148/302 (49%), Gaps = 11/302 (3%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 G + M+D+ G ELD + L +G + LF +N QLR L +R ++A Sbjct: 16 GQLLMIDLPGPELDKDTAAYLREHGIGAVCLFGKNVESAEQLRRLCADLREVMGEHALIA 75 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +D EGG + R E + P+A S A +L ++ +A ++ ++ I+ +F PV Sbjct: 76 IDHEGGAITR-PEFWPFAPSAMSLGAADD----QQLTEDVNAALARQLRSVGINWNFTPV 130 Query: 121 LD--VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD V + IG+R+Y +D + + G G+ KHFPGHG DSH Sbjct: 131 LDINVNPANPVIGDRAYGSDAARVTRHGRAALAGHTREGVAPCAKHFPGHGDTHQDSHLA 190 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 P + +AE+ A +++ F +L+ E AIM AH++Y +D PA+ SP L +LR+ Sbjct: 191 LPRVSKSRAELDAGELAPFRALLPET--PAIMTAHIVYDALDAEHPATLSPRILTGLLRE 248 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 E G+DGVI +D + M+ E +L AG D+++ R+ + L ++ Sbjct: 249 EWGYDGVIVTDSMGMQAIDANYGRGEAAVRALRAGADLVMALGRREVQQATLAAVAEYVP 308 Query: 298 ER 299 E Sbjct: 309 EN 310 >UniRef50_A7BX55 Glycosyl hydrolase, family 3 n=4 Tax=Proteobacteria RepID=A7BX55_9GAMM Length = 436 Score = 344 bits (883), Expect = 3e-93, Method: Composition-based stats. Identities = 100/353 (28%), Positives = 168/353 (47%), Gaps = 36/353 (10%) Query: 1 MGPVM-LDVEGYELDA----EEREILAHPLVGGLILF---------TRNYHDPAQLRELV 46 +G ++ + G + + + + +GG+ILF RN P Q++ LV Sbjct: 93 IGQMLLVGFRGMSIRERGVKQIIQNIQQFHLGGVILFDYDIVLKSSRRNIKSPRQVKSLV 152 Query: 47 RQIRAASRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLM 104 ++++AAS+ L +A+DQEGG+V+R +E GF R +AQ + + L + M Sbjct: 153 KELQAASQIPLFIAIDQEGGKVKRLKEKFGFPRTVSAQYLGQKNNV----ALTYQQASKM 208 Query: 105 ASEMIAMDIDISFAPVLD--VGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKT 160 A + + I+ +FAPV+D + + IG ERS+ A PQ A FI H G+ Sbjct: 209 AKTLATVGINFNFAPVVDLNINPNNPVIGKLERSFSAYPQTVTKHALEFIKAHHAYGVLC 268 Query: 161 TGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE---NKLDAIMPAHVIYS 217 + KHFPGHG+ T DSH ++ + +I +D+IM AH+ Sbjct: 269 SLKHFPGHGSSTRDSHLGMVDVT---DTWNEAELEPYQKIINADKMGIVDSIMIAHIFNR 325 Query: 218 DVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMI 276 DP PA+ S + +LR+ L +DGVI SDD+ M+ + Q +L+AG D+I Sbjct: 326 RFDPDYPATLSKRIVTGLLRENLEYDGVIVSDDMQMKAITSHYRFEVAIQKTLEAGIDII 385 Query: 277 LVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLN 328 ++ NN K ++ I + +L G + + +S R + + ++ Sbjct: 386 VIGNNLKYEPDIVTRTVGI----IKQLIQAGKITEARIEESYQRIQQLKSQFQ 434 >UniRef50_B0P2K0 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0P2K0_9CLOT Length = 421 Score = 344 bits (883), Expect = 3e-93, Method: Composition-based stats. Identities = 89/349 (25%), Positives = 160/349 (45%), Gaps = 35/349 (10%) Query: 1 MGPV-MLDV---------EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR 50 +G + M+ V + ++ ++ L GG+I+F +N + P Q +++ +++ Sbjct: 78 IGQLFMVSVYQLDEAESKNQTSVTSQMKKTLKKYPAGGVIMFAKNINTPDQTKKMTDELQ 137 Query: 51 AASRNRLVVAVDQEGGRVQRFREG----FTRLPAAQSFAALSGMEEGGKLAQEAGWLMAS 106 AS L +AVD+EGG+V R T P+AQ ++ + G Sbjct: 138 DASYIPLFMAVDEEGGQVSRVASNPKMKMTVYPSAQEVGRTYNNKK----IAQMGKTQGK 193 Query: 107 EMIAMDIDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKH 164 E+ + +++ APV DV + IG+RS+ AD +K I + + M + + T KH Sbjct: 194 ELKELGFNMNLAPVADVLTNKNNTEIGDRSFGADSKKVADIITTLVKNMQKQQISATLKH 253 Query: 165 FPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPR 222 FPG G D+H+ + + +R D F + I+ K DA+M +H++ S+V + Sbjct: 254 FPGSGQTGGDTHRGSTETDQTINALRDTDFKPFKAGIK-AKADAVMVSHLMLSNVTDEKE 312 Query: 223 PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 P+S S + +LR EL + GVI +D ++M+ S E ++ AG D+I++ +N Sbjct: 313 PSSLSSRVVSDILRDELEYKGVIMTDAMNMKAITDNYSSGEAAVKAIQAGVDLIVMPDNY 372 Query: 283 KGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLH 331 K A + + G + S R + I T+L + Sbjct: 373 KEAYKA-----------IKKALKSGKIKESRIDKSVR-RIIYTKLKRGE 409 >UniRef50_B0MSB5 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MSB5_9FIRM Length = 406 Score = 344 bits (883), Expect = 3e-93, Method: Composition-based stats. Identities = 88/323 (27%), Positives = 146/323 (45%), Gaps = 26/323 (8%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRF 71 +D + VGG+ LF+RN QL + ++++S+ L +AVD+EGGRV R Sbjct: 96 RVDDAVISRIEEYPVGGIALFSRNITSAEQLPMFISDLQSSSKYPLFIAVDEEGGRVARI 155 Query: 72 RE----GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHIS 127 + + + A+E G + S + + ++ FAPV D Sbjct: 156 ANSDFFNVASYKSMEDIGKSGDASK----AEEVGRQIGSYLKELGFNLDFAPVADTNTNP 211 Query: 128 A--AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRP 185 IG+RSY +DP + S +DGMH++G+ T KHFPGHG D+H + Sbjct: 212 QNIVIGDRSYGSDPALVARMVSAQLDGMHDSGIMGTLKHFPGHGDTKDDTHSGYVSIEKT 271 Query: 186 QAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDG 243 E++ ++ F + + + D +M +H+ +V D P S S + LR ELG+DG Sbjct: 272 WDELKECELVPFITALPKA--DMVMVSHITAVNVTSDKLPTSMSETMITGKLRNELGYDG 329 Query: 244 VIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRL 303 VI +D ++M A + AE ++ AG D++L+ N A + V Sbjct: 330 VIITDAMAMGAVADNYTSAEAAVTAVKAGVDIVLMPQNLDEAFN-----------GVMNA 378 Query: 304 YHKGSFSRQELMDSARWKAISTR 326 G S + L +S + + + Sbjct: 379 VTDGEISMERLDESV-LRILKMK 400 >UniRef50_A7EUL4 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EUL4_SCLS1 Length = 553 Score = 342 bits (879), Expect = 1e-92, Method: Composition-based stats. Identities = 93/347 (26%), Positives = 153/347 (44%), Gaps = 39/347 (11%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR-----L 57 P ++ G+ + + + ++ VG +ILF RN D AQL +L ++ + + L Sbjct: 33 PFIVGFHGHVTNEDIKTLITVHRVGTIILFQRNIADAAQLFQLAASLQEIAEDSGHAQSL 92 Query: 58 VVAVDQEGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 +A DQE G V R T LP + + A + A A + A I+++ Sbjct: 93 FIATDQENGLVTRVNPPIATELPGSMALGAANDPSN----AFSVAVCTAKILKAYGINMN 148 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 +APV DV + IG RS+ DP S ++G+ + G+ + KHFPGHG D Sbjct: 149 YAPVADVNSEPKNFVIGVRSFSDDPGTVGRFVSSQVEGLQQNGILSCVKHFPGHGDTAVD 208 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--------RPASG 226 SH P + + E+ A ++ F ++ +DAIM AH+ ++ PAS Sbjct: 209 SHHGLPVITKSKEELEACELVPFWRAVK-AGVDAIMTAHISLPQLNSNSNVATNQLPASL 267 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMIL---VCNNRK 283 SP ++ +LRQE+ +DG+I SD L M+G + +L L C+ K Sbjct: 268 SPDAIR-ILRQEMNYDGLIVSDCLEMDGVRATYGTEKGAVMALK-----ELTVSWCHTMK 321 Query: 284 GAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 V + E V G S++ + +S+ + I + L Sbjct: 322 AQVGAI--------EAVIAAVKSGEISQEMI-ESSVNRVIRLKTKYL 359 >UniRef50_Q7PC47 Beta-N-acetylhexosaminidase. Glycosyl Hydrolase family 3 n=2 Tax=Gammaproteobacteria RepID=Q7PC47_SACD2 Length = 344 Score = 342 bits (878), Expect = 1e-92, Method: Composition-based stats. Identities = 157/296 (53%), Positives = 208/296 (70%), Gaps = 6/296 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GPV+ D+ G L E+ ++ +PL+GGLILFTRNY P+QL LV+QIR+ R +++A Sbjct: 9 LGPVIADIAGQTLSDEDIALIKNPLIGGLILFTRNYSTPSQLDALVKQIRSV-RADIILA 67 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFREGFTR+PA Q FA+ E LA GWLMASE+ A DIDISFA Sbjct: 68 VDHEGGRVQRFREGFTRIPAMQVFASAYKARAELTLALACNTGWLMASELRAYDIDISFA 127 Query: 119 PVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV S+ IG+R++ +DP+ A+A FIDGM +AGM TGKHFPGHG+V ADSH Sbjct: 128 PVLDVDDSFSSIIGDRAFSSDPKAVTALAGAFIDGMQQAGMACTGKHFPGHGSVRADSHL 187 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E P D R I D+ F+ L ++KLDA+MPAH+++ +VD +P S WL+ +LR Sbjct: 188 ELPVDYRSLEAIEQLDLMPFAKL--QSKLDAVMPAHILFPEVDDQPVGFSSVWLQKILRD 245 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 ++ +DGVIFSDDL+MEGAA+ GS+ ER ++ AGCD +LVCNNR+ + V+ L+ Sbjct: 246 KMAYDGVIFSDDLTMEGAAVAGSFGERAIKAMSAGCDTLLVCNNREATLEVIQTLA 301 >UniRef50_A1SM50 Glycoside hydrolase, family 3 domain protein n=7 Tax=Actinobacteria (class) RepID=A1SM50_NOCSJ Length = 631 Score = 342 bits (878), Expect = 1e-92, Method: Composition-based stats. Identities = 97/345 (28%), Positives = 150/345 (43%), Gaps = 32/345 (9%) Query: 15 AEEREILAHPLVGGLILF-----TRNYHDPAQLRELVRQIRAASR-----NRLVVAVDQE 64 + +A GG+I F N P Q+ L ++AA+R L ++VDQE Sbjct: 101 DTPAQAIAKYQPGGVIYFTTRNGDDNIGTPEQVARLSNGLQAAARALPGGIPLQISVDQE 160 Query: 65 GG-RVQRFR--EGFTRLPAAQSFAALSGMEEGGKL-AQEAGWLMASEMIAMDIDISFAPV 120 GG V RF G T+LP + A + G A+ + ++ +E+ A+ + +APV Sbjct: 161 GGALVARFGAASGATQLPGNMALGAGALGTGGSAADARRSATVIGAELAAVGVTQDYAPV 220 Query: 121 LDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG RS +DP + + + G H G+ KHFPGHG DSH Sbjct: 221 ADVNVNPNNPVIGIRSIGSDPALVSDLVAAQVRGFHRGGVSAVAKHFPGHGDTGVDSHFG 280 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQ 237 P ++++ D+ F + I +D IM AHV+ +D PA+ S L +LR+ Sbjct: 281 LPEVTHTRSQLEEIDLPPFRAAI-AAGVDTIMTAHVVLPAIDPGVPATMSRKILTGLLRR 339 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ELGFDG+I +D L M GA +L AG D +L+ A Sbjct: 340 ELGFDGLIVTDALDMGGATATYPPDVAPVRALLAGADQLLIPPEMDTAYRA--------- 390 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRL--NQLHERWQEEKAG 340 V + G SR+ L +S + + + + + + A Sbjct: 391 --VLKAVRSGQISRERLNESVY-RILLHKYERGLFGDPYVDRAAA 432 >UniRef50_Q1QVU0 Glycoside hydrolase, family 3-like protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QVU0_CHRSD Length = 340 Score = 341 bits (877), Expect = 2e-92, Method: Composition-based stats. Identities = 162/336 (48%), Positives = 213/336 (63%), Gaps = 8/336 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G VMLD+EG +L EER +L P VGG+ILF RN D Q+R L R+IR R +++A Sbjct: 5 LGTVMLDIEGTQLGDEERRLLERPEVGGVILFARNTRDAEQVRRLTREIREL-RPDMLLA 63 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +DQEGGRVQR REG TRLP+ + AA +E EAGWL+ EM A D++FA Sbjct: 64 IDQEGGRVQRLREGVTRLPSMAALAAGYADAPDEVRSRVHEAGWLLGMEMAACGFDVTFA 123 Query: 119 PVLDVGHIS-AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV AIG+RS+ ADP A+ FI+G+HEAGM GKHFPGHG VT DSH Sbjct: 124 PVLDVDDQRSPAIGDRSFSADPTVVAALGEAFIEGLHEAGMVAVGKHFPGHGGVTLDSHH 183 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D RP + +R D+ F +L KLDA+MPAHV+Y+ D RPA SP WL +LR+ Sbjct: 184 ALPEDNRPLSVLREHDLVPFKAL--SGKLDAMMPAHVVYTAFDTRPAGFSPSWL-GMLRE 240 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 E+ F GV+FSDDLSM GA + G+ A R +A+ AGCDM+LVCN+R A+ ++D + + Sbjct: 241 EMAFKGVVFSDDLSMAGAHVAGTPAARAEAAWSAGCDMVLVCNDRAAALEIVDAAAGRTS 300 Query: 298 ERVTRLYHKGSFSR-QELMDSARWKAISTRLNQLHE 332 +R+ +L + + + L ARW+ RL L E Sbjct: 301 KRLGKLRYGRARPELETLPALARWRRAHARLEALSE 336 >UniRef50_C9LY46 Beta-hexosaminidase n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LY46_9FIRM Length = 408 Score = 341 bits (877), Expect = 2e-92, Method: Composition-based stats. Identities = 101/344 (29%), Positives = 164/344 (47%), Gaps = 31/344 (9%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS---RNRL 57 G + M+ ++G ELDA+ R +L+ +GG+ILF RN Q+R L ++ + L Sbjct: 82 GQLLMIGIQGAELDADSRFMLSEYHIGGVILFDRNMKSQEQVRALNDSLQKNASDAGLPL 141 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 +A+D+EGG V R +E F PAA + A A + AM +++F Sbjct: 142 FLAIDEEGGAVARMKEAFLPPPAAAEIGRTGDP----QAAYRYAADTAHGLKAMGFNLNF 197 Query: 118 APVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 APV D+G RSY D A + ++G +AG+ T KHFPG G +D+H+ Sbjct: 198 APVADLGAT----YGRSYADDAATATKFVAAALEGHADAGLLATLKHFPGLGRGESDTHE 253 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKL---------DAIMPAHVIYSDVDPR-PASGS 227 +T +A + A D+ F +I + +M H +Y +D + PAS S Sbjct: 254 DTVTVRADRATLEASDLVPFREMITKRNAAKNAKGASGWFVMATHTMYPALDAKNPASLS 313 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 P L+ +LR+EL +DGVI +DDL M + + G ++ AG D++LVC++ + Sbjct: 314 PAILQGLLREELSYDGVIVTDDLEMGAISRHYGFDRAGVEAILAGADLVLVCHDYAHETA 373 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLH 331 V + + + G S+ L S R + + ++ L Sbjct: 374 VYNG--------LLKAVKSGEISKDRLDASVR-RIVKAKIEYLE 408 >UniRef50_B0SE07 Beta-glucosidase-related glycosidase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SE07_LEPBA Length = 588 Score = 341 bits (875), Expect = 3e-92, Method: Composition-based stats. Identities = 87/337 (25%), Positives = 153/337 (45%), Gaps = 16/337 (4%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NR 56 G + + + +D + + GG+ILF +N ++ L ++ +++ Sbjct: 59 GQTIHIAIPSKTVDQIALDEIKQTKPGGIILFGKNLGKKEEILSLTENLQRSAKENGLPP 118 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 +++ DQEGGRV R ++G T P A + E G + G++ + E+ ++ ++ Sbjct: 119 FLISTDQEGGRVFRVQDGITHYPGAMAVGQTGKKEWG----EIVGFVTSYELRSLGLNFL 174 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLD+ + + I RS+ +D + +A + G G KHFPGHG T D Sbjct: 175 FAPVLDINNNPLNPVINTRSFGSDVHRVSDVAVAYERGARAGGCLPVIKHFPGHGDTTVD 234 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SH P + E+ ++ F IR +A+M AH++Y +DP PA+ S L Sbjct: 235 SHLGLPIINKSLTELEKLELVPFQESIR-GGAEAVMSAHIVYPKIDPNFPATLSKTILTD 293 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 VLR+ L FDG+I +D + M + G ++ AG +++L+ + + A L Sbjct: 294 VLRKSLQFDGIIITDAMEMHAISKNYEKDRPGVLTILAGANIVLLTSWGETAKKFKAQLM 353 Query: 294 PIKAERVTRLY-HKGSFSRQELMDSARWKAISTRLNQ 329 +G + +L+ A K I +L Sbjct: 354 DAYKNGEFEFVDKEGK--KHDLLKEAVQKQIRKKLEL 388 >UniRef50_C9KM62 Glycosyl hydrolase domain protein n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KM62_9FIRM Length = 381 Score = 340 bits (874), Expect = 3e-92, Method: Composition-based stats. Identities = 89/332 (26%), Positives = 158/332 (47%), Gaps = 24/332 (7%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS--RNRLV 58 G + M+ V+G E+ + +L +GG+ILF RN Q ++L+ ++ + + L Sbjct: 63 GQMVMIGVQGTEVTDDSLYMLHQYHMGGVILFDRNMESADQTKKLIADLQEQADQKVPLF 122 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 + VD+EGG+V R + T P+ Q + A+E+ A ++ + +++FA Sbjct: 123 IGVDEEGGQVVRGKSFLTPPPSEQEIGRSGEVTR----AEESARQTAEKLKKLGFNVNFA 178 Query: 119 PVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 PV DVG RS+ DP++ + G + + KHFPG G T DSH++ Sbjct: 179 PVADVGD-----YARSFGPDPEQTAKFVAAAAQGYEQEHVMYALKHFPGIGRGTVDSHED 233 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLD--AIMPAHVIYSDVDPR-PASGSPYWLKTVL 235 + E+ +D+ F ++I E + + ++ +H+ Y +D PAS S L Sbjct: 234 ISSITATKDELMKRDLVPFRTVIDERQPEDYFVLVSHLRYPALDAENPASLSKAIQTDFL 293 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R ELG+ G+I +DD+ M A S+ E G +++AG D++LVC+ V Sbjct: 294 RGELGYRGLIITDDVEMGALAKHYSFRELGVKAVEAGSDIVLVCHEYPHETDVYLG---- 349 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTRL 327 + G+ S + + +S R + + +L Sbjct: 350 ----LLDAVQDGTISMERVNESVR-RIVKAKL 376 >UniRef50_A9M1Z4 Beta-hexosaminidase n=23 Tax=Neisseriaceae RepID=NAGZ_NEIM0 Length = 361 Score = 340 bits (873), Expect = 4e-92, Method: Composition-based stats. Identities = 132/305 (43%), Positives = 185/305 (60%), Gaps = 5/305 (1%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G VM D+ + L EE++ L P VGG+ILF RN+ + AQL+EL +I+A L++AV Sbjct: 9 GLVMADIAAFRLTEEEKQRLLDPAVGGVILFRRNFQNIAQLKELTAEIKALRTPELIIAV 68 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKL---AQEAGWLMASEMIAMDIDISFA 118 D EGGRVQRF EGFTRLPA + + + A + G ++A+E+ A ID+SF Sbjct: 69 DHEGGRVQRFIEGFTRLPAMSTLGEIWDKDGASAAETAAGQVGRVLATELSACGIDLSFT 128 Query: 119 PVLDVGHIS-AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + IG RS+H +P+ +A G+ + GMK+ GKHFPGHG V DSH Sbjct: 129 PVLDLDWGNCPVIGNRSFHRNPEAVARLALALQKGLEKGGMKSCGKHFPGHGFVEGDSHL 188 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D R +E+ A D++ F + RE + A+MPAHV+Y VD +PA S WLK +LR+ Sbjct: 189 VLPEDGRSLSELEAADLAPFRIMSREG-MAAVMPAHVVYPQVDTKPAGFSEIWLKQILRR 247 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ++GF GVIFSDDL+MEGA G ER + S +AGCD++LVCN + ++ Sbjct: 248 DIGFKGVIFSDDLTMEGACGAGGIKERARISFEAGCDIVLVCNRPDLVDELREDFRIPDN 307 Query: 298 ERVTR 302 + + Sbjct: 308 PALAQ 312 >UniRef50_A0M1W2 Family 3 glycosyl hydrolase/beta-lactamase fusion protein n=16 Tax=Bacteroidetes RepID=A0M1W2_GRAFK Length = 978 Score = 340 bits (872), Expect = 5e-92, Method: Composition-based stats. Identities = 85/345 (24%), Positives = 147/345 (42%), Gaps = 29/345 (8%) Query: 2 GPVM---LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLV 58 G + + + R+ + +GGLI P +L + + S L+ Sbjct: 59 GQLFMASIWSQNENEADSIRKQIKENHIGGLIFSKG---GPVMQAQLTNEFQEMSEIPLL 115 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 V +D E G R F LP + A+ KL +EAG ++ + + +FA Sbjct: 116 VGMDAEWGLAMRLDSTF-ALPWNMTLGAIQD----NKLVEEAGAAISRHTKRLGVHFNFA 170 Query: 119 PVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PV+D+ + IG RS+ D A F++GMH G ++ KHFPGHG DSH Sbjct: 171 PVVDINTNPDNPIIGNRSFGEDKINVTEKALAFMNGMHREGGLSSAKHFPGHGDTDTDSH 230 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV---DPRPASGSPYWLKT 233 K P I ++ + LI + L ++M AH+ + D +PAS S + Sbjct: 231 KTLPTINFTSERIDEVELYPYRKLIPKG-LSSVMVAHLNVPALESKDGKPASLSQKIITD 289 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +L++ +GF+G+IF+D L M+G + E + AG D++L+ + A Sbjct: 290 ILKKNMGFNGLIFTDALDMKGVSRNKEPGEVDLDAFMAGNDVLLMSEDIGKA-------- 341 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 ++ + + G S + L S + K + + +++ Sbjct: 342 ---SKSIIEAVNSGLVSEERLELSVK-KILYAKYKVGLNKFKPIT 382 >UniRef50_D1R988 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R988_9CHLA Length = 544 Score = 340 bits (872), Expect = 6e-92, Method: Composition-based stats. Identities = 83/341 (24%), Positives = 161/341 (47%), Gaps = 23/341 (6%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAAS----- 53 G + M+ G E + + ++++ +GG+I + D Q++ L R +++ + Sbjct: 36 GQLLMVHFYGTEANGDAKKLIQDFHIGGIIYYEWANGLTDAQQVQILSRGLQSLAQERAH 95 Query: 54 RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L +A+DQEGG V R + GFT+ P + + + A+ + E+ ++ I Sbjct: 96 PIPLFLAIDQEGGVVNRLKNGFTQFPGNYALGQI----KKAGDAEACAYATGLELKSVGI 151 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 +++ APV+DV + IG R++ + P+ ++ ++G ++ + + KHFPGHG V Sbjct: 152 NVNLAPVVDVNCNEKNPIIGIRAFGSSPKDVASLGKEMMEGFRKSSILSVLKHFPGHGDV 211 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPA-SGSPYW 230 DSH+ PC + + E+ ++ F L+ + +A+M +H+ +D + + S Sbjct: 212 EVDSHQALPCLNKTRGELDQVELYPFHQLLPLS--EAVMTSHLFVPALDAKTCITFSCDA 269 Query: 231 LKTVLRQELGFDGVIFSDDLSMEG-AAIMGSYAERGQASLDAGCDM----ILVCNNRKGA 285 + VLR+E+GF+GVI +D L+M+G + S AG D+ G Sbjct: 270 VTQVLRKEMGFEGVILTDSLAMQGCLGQSQGVVDAAIQSFKAGHDVLLLGGKQLLGEGGG 329 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + + + L G S Q L S R + ++ + Sbjct: 330 FEITIDEIAKIHQTLVFLVQTGEISEQRLDASVR-RILAAK 369 >UniRef50_UPI000196C637 hypothetical protein CATMIT_02394 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196C637 Length = 790 Score = 340 bits (872), Expect = 6e-92, Method: Composition-based stats. Identities = 92/349 (26%), Positives = 162/349 (46%), Gaps = 28/349 (8%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVAVD 62 E +++ + R+I+ G LI F +N + Q L ++ A+ L+++ D Sbjct: 65 GHEFTKMNDQVRKIVEDYDFGALIYFAQNIQETEQSYNLSMAMQKAATKDGGIPLLISAD 124 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 QEGG V R G T LP + A E A++AG ++ SE+ ++ I+ + APV+D Sbjct: 125 QEGGSVYRLGSG-TALPGNMALGATHNTE----YAKKAGQIIGSELSSIGINTNLAPVVD 179 Query: 123 VGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + + IG RSY + +AS I G+ + + KHFPGHG DSH P Sbjct: 180 VNNNANNPVIGLRSYSDNADVVGNMASATIAGLKDYNVIGCAKHFPGHGDTATDSHYGLP 239 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-------------PASGS 227 + +AE+ ++ + I + ++ IM AH++Y +D PA+ S Sbjct: 240 IVNKSKAELLNNELKPYKIAINQG-IEMIMTAHILYPQLDNTTVVSEKTGNEEKKPATMS 298 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + +L+ E+GF+GV+ +D ++M+ A ++ + +++AG D+I + Sbjct: 299 KAIITDLLKGEMGFNGVVTTDAMNMKAIADTFGESQAVKLAIEAGADLICMPTVLYNQED 358 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 V + V KG S L D R + ++ + N+ W+E Sbjct: 359 VKK--LDTIIDYVEDAVKKGEISESRLDDGCR-RILTVKENRGILDWKE 404 >UniRef50_Q67LG3 Putative beta-N-acetylglucosaminidase n=1 Tax=Symbiobacterium thermophilum RepID=Q67LG3_SYMTH Length = 637 Score = 340 bits (872), Expect = 6e-92, Method: Composition-based stats. Identities = 106/343 (30%), Positives = 165/343 (48%), Gaps = 26/343 (7%) Query: 1 MGPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----- 54 +G + + + G EL E R +LA VGG+I F R DP LR L +++AA+ Sbjct: 101 LGQLFWVGLPGTELTDEIRGLLAEGKVGGVIFFGRQGSDPDVLRRLSGELQAAAAARDRA 160 Query: 55 -NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 LV+AVD EGG VQRF FT+ P A + A A+ G MA E++A+ + Sbjct: 161 TPGLVIAVDHEGGLVQRFGPPFTQWPGAMALGAAGSSAH----AEAMGRAMARELLAVGV 216 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 +++ APV DV + + IG RS+ P+ +A+ G+ G+ KHFPGHG Sbjct: 217 NMNLAPVADVNNNPANPVIGTRSFGEQPELVARMAAALAAGLQAEGVSAVAKHFPGHGDT 276 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPY 229 DSH + P + + ++ F + + +DA+M AHV++ V D RPAS SP Sbjct: 277 AVDSHLDLPVIDHARERLDRVELVPFRAAVG-AGIDAVMVAHVVFPAVAGDGRPASLSPD 335 Query: 230 WLKTVLRQELGFDGVIFSDDLS-MEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSV 288 + +L+ ELGF GV +D L M E +++AG D++LV + ++ Sbjct: 336 VVTGLLKGELGFAGVAMTDALDGMAAITDTYGVEEGLVLAVEAGADVLLVTESFGRQQAL 395 Query: 289 LDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQL 330 R+ + +G + D+A R A+ + L Sbjct: 396 Y--------RRLLQAVEEGRIPEGRVNDAAGRVLALKEKRGLL 430 >UniRef50_C2M7P2 Family 3 glycosyl hydrolase/beta-lactamase fusion protein n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M7P2_CAPGI Length = 769 Score = 339 bits (871), Expect = 7e-92, Method: Composition-based stats. Identities = 80/343 (23%), Positives = 144/343 (41%), Gaps = 28/343 (8%) Query: 1 MGPVML--DVEGYELDAE--EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + + + +E++ +GG+I P +L + + A++ Sbjct: 47 IGQLFMVQAFSSHTAKQQKPIKELVKKYHIGGIIFSKG---SPVAQAKLTNEFQEAAQVP 103 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++A+D E G R P + AL L ++AG +A + + I Sbjct: 104 LMIAMDAEWGLAMRLDS-VYAFPWNMTLGALKD----NALVEQAGKRIAQHCKRLGVHID 158 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAP +D+ + IG RS+ F GM G+ GKHFPGHG D Sbjct: 159 FAPDIDINTNPQNPIIGNRSFGESKFNVAQKGVAFTKGMQSVGVLACGKHFPGHGDTEKD 218 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKT 233 SHK P + + + + F LI + ++M H+ +D +PAS S + Sbjct: 219 SHKTLPTVSFSRERLESTEFFPFQKLIG-AGVASLMVGHLNVPALDEGKPASISYKIVSG 277 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +++Q+LGF G+IFSD L M+G A ++ + AG D++L+ ++ + Sbjct: 278 LIKQQLGFQGLIFSDALGMKGVADYAESSQVDLQAFLAGNDVLLMSSDPIKGI------- 330 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 E + Y G+ + L S + K + + +++ Sbjct: 331 ----ETLKNAYSAGTINEYRLAHSVK-KILKAKYWAGLNKYKP 368 >UniRef50_Q11Z59 B-glycosidase, glycoside hydrolase family 3 protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11Z59_CYTH3 Length = 990 Score = 339 bits (871), Expect = 8e-92, Method: Composition-based stats. Identities = 77/346 (22%), Positives = 147/346 (42%), Gaps = 29/346 (8%) Query: 1 MGPVMLDVEGYELDA----EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + E +++ + +GG+I F P + +L + ++ S Sbjct: 54 LGQLFMVAAYSNKPEAHALEIEQLIKNNGIGGVIFFQG---GPVRQAQLTNRYQSVSNVP 110 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L VA+D E G R P + A+ G + + + + I+ Sbjct: 111 LFVAMDAEWGLGMRLDS-TMNFPKQMTLGAIQD----NAAIFNMGVEIGKQCKRLGVHIN 165 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+DV + IG RS+ D A ++ GM + KHFPGHG D Sbjct: 166 FAPVVDVNSNANNPVIGVRSFGEDKINVSQKAIAYMKGMQSVHVMANAKHFPGHGNTDTD 225 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLK 232 SH P + E+ ++ F LI ++ + +I+ AH+ +D PA+ S + Sbjct: 226 SHFSLPVVNKNVQELNDTELYPFRQLI-DSGVGSIIVAHMNVPSLDNTNKPATLSKPIVT 284 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR ++GF G+IF+D L+M+G + + E +L AG D++L N Sbjct: 285 DLLRNDMGFRGLIFTDALNMKGVSNLYKPGEVDVKALLAGNDILLYAEN----------- 333 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 P+ +++ + + +++E+ + K + + +++ + Sbjct: 334 VPLAIKKIVKAINDKDITKEEIH-ARVKKILLAKYWAGLNHFKKIE 378 >UniRef50_B1WW93 Beta-glucosidase n=12 Tax=Cyanobacteria RepID=B1WW93_CYAA5 Length = 539 Score = 339 bits (871), Expect = 9e-92, Method: Composition-based stats. Identities = 89/356 (25%), Positives = 156/356 (43%), Gaps = 40/356 (11%) Query: 1 MGPVML----DV---------EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVR 47 +G +++ +A+ L +GG+IL A+L + Sbjct: 17 IGQMIVVRASGYLFDHQICYPVWEASNAQLSHWLETLNLGGVILLGG---SAAELLLRTQ 73 Query: 48 QIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFA--ALSGMEEGGKLAQEAGWLMA 105 Q++ S+ L +A D E G QRF G T P + A S + K A + G + A Sbjct: 74 QLQQWSKTPLFIAADIEEGVGQRF-SGATWFPPPMALGEIAKSDLPLARKYAYQMGEITA 132 Query: 106 SEMIAMDIDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGK 163 E + + I+ +P++DV + I RS+ +P+ +A FI+G TT K Sbjct: 133 KESVKIGINWILSPIVDVNNNPDNPVINIRSFGDNPKVVSELAKAFIEGAKNHPALTTAK 192 Query: 164 HFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PR 222 HFPGHG + DSH P + ++ F + I + K+D++M AH++ + D Sbjct: 193 HFPGHGDTSNDSHLNLPVISHDNERLEGLELVPFQAAI-DAKVDSVMTAHLLINAWDKDN 251 Query: 223 PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 PA+ S L L+QELGF+G+I +D L M G A E +++AG D++L+ ++ Sbjct: 252 PATLSKTILTNKLKQELGFNGLIVTDALIMGGVAKFADSEEIAIKAVEAGADILLMPDDP 311 Query: 283 KGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 + A++ V G + + + +S ++ Q ++ + Sbjct: 312 EMAINA-----------VYNAVETGRLTTERIDES------LQKIWQAKQKLFNHQ 350 >UniRef50_B7S2D9 Glycosyl hydrolase family 3 N terminal domain protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7S2D9_9GAMM Length = 567 Score = 339 bits (870), Expect = 9e-92, Method: Composition-based stats. Identities = 104/345 (30%), Positives = 158/345 (45%), Gaps = 21/345 (6%) Query: 8 VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGR 67 V +L + + L G+ILF N AQ ++L +IR ++A+DQEGGR Sbjct: 31 VPVQQLPPQLKHGLVAISPCGVILFRENLASIAQCQQLTAEIRECLPANTLIAIDQEGGR 90 Query: 68 VQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI 126 V R R+ T + AA E KLA+E G A+E+ A+ I+I+F P LDV Sbjct: 91 VTRLPRQVSTSFSGNMALAACPAGEN-EKLAREVGAAQAAELKALGININFVPSLDVNSD 149 Query: 127 --SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + IG RS+ DP ++ S + G+ E G+ KHFPGHG + DSH + PC R Sbjct: 150 PANPVIGVRSFGDDPAAVASLGSELLSGLQEGGVAGALKHFPGHGDTSQDSHIDLPCVGR 209 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---------RPASGSPYWLKTVL 235 + E A D++ FS +I + +M AH+ Y +D PA+ S + +L Sbjct: 210 SRDEAFAVDLAPFSRVIAQANPAMVMTAHIQYPALDDSCIAGTDVAVPATLSRAMMTGLL 269 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+++ FDGVI +D L M+ + + E AG D+ L+ S N Sbjct: 270 REQMDFDGVIITDALDMKAISARMTPTEAVLRCFAAGVDIALMP--LLIRSSASFNQLQQ 327 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 H G E+ S TR+ L +++ ++ G Sbjct: 328 LVSTAVEAIHSGELDESEVRASV------TRILALQQKFAPQQPG 366 >UniRef50_Q0HJG7 Beta-hexosaminidase n=35 Tax=Gammaproteobacteria RepID=NAGZ_SHESM Length = 342 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 162/293 (55%), Positives = 199/293 (67%), Gaps = 3/293 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +MLD+ ++ E E+L HP VGGLILF+RN+ QL LV+QIR R +++A Sbjct: 1 MSYLMLDLLSLDVSEAEAEMLRHPQVGGLILFSRNFSSREQLIALVQQIRQI-RPEILIA 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSF--AALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFREGFT +PA AA M + A E G+LMA E++A DID+SFA Sbjct: 60 VDHEGGRVQRFREGFTLIPAMGDILPAAKGDMVLAKRWACELGFLMAIELLACDIDLSFA 119 Query: 119 PVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 PVLD+ IS IG+RS+ A P + +A+A FI+GM EAGM GKHFPGHG+V ADSH Sbjct: 120 PVLDLNGISQVIGKRSFSAKPDEVIALAQSFIEGMAEAGMGAVGKHFPGHGSVAADSHIA 179 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P D R I +D+ F LI + KL IMPAHVIY VDP PA S YWLK +LR+E Sbjct: 180 QPIDEREAEAIFNQDILPFKELIAKGKLSGIMPAHVIYPKVDPNPAGFSSYWLKQILRKE 239 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 LGF+GVIFSDDL M+GAA G Y R QA+LDAGCDMILVCN+ G +S+L+ Sbjct: 240 LGFNGVIFSDDLGMKGAAFAGDYLGRAQAALDAGCDMILVCNDNPGVMSLLNG 292 >UniRef50_A1RBY1 Putative beta-N-Acetylglucosaminidase n=1 Tax=Arthrobacter aurescens TC1 RepID=A1RBY1_ARTAT Length = 681 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 89/335 (26%), Positives = 144/335 (42%), Gaps = 31/335 (9%) Query: 18 REILAHPLVGGLILFT--RNYHDPAQLRELVRQIRAAS------RNRLVVAVDQEGGRVQ 69 E++ +GG++ F N ++P Q+ L ++ + L V +DQEGG V Sbjct: 95 AEVVEKYNLGGVLYFAWSGNTNNPQQVAGLSNGLQQTALREDGNGIPLAVTIDQEGGLVA 154 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HIS 127 R T LP + A + LA+ G ++ SEM AM I++ FAPVLD+ + Sbjct: 155 RIGPPATVLPGNMALGATADAN----LAKAQGEILGSEMRAMGINVDFAPVLDLNSNPDN 210 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 IG RS DP A+ I+G+ + KHFPGHG + DSH P +A Sbjct: 211 PVIGIRSMGEDPALVSALGVAQIEGIQAHNVGAAAKHFPGHGDTSVDSHYGLPTVTYDRA 270 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIF 246 + + F + I + +D +M AH+I +D P + S L +LR E+GF G++ Sbjct: 271 TLNE-HLKPFKAAI-DGGVDMVMTAHIIVEAIDAEMPGTLSHKVLTGLLRDEMGFKGLVT 328 Query: 247 SDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHK 306 +D L M A E ++ AG D++L + + + V Sbjct: 329 TDALDMAAMAAEWPQEEIAVKAIQAGSDILLNSPDVDASFA-----------GVRAAVES 377 Query: 307 GSFSRQELMDSARWKAI--STRLNQLHERWQEEKA 339 G + L +S R + + + + + A Sbjct: 378 GEITETRLDESVR-RILEWKVKRGVFEQPMADPAA 411 >UniRef50_Q5WDA0 Beta-N-acetylhexosaminidase n=1 Tax=Bacillus clausii KSM-K16 RepID=Q5WDA0_BACSK Length = 521 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 102/343 (29%), Positives = 157/343 (45%), Gaps = 25/343 (7%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----- 54 +G +++ G E + ++ + VG +ILF+RN P QL L ++ A++ Sbjct: 9 LGQLLVFGFHGTTATPEIKAMIRNHYVGNIILFSRNIETPEQLLTLTSDLQKAAKEAGHE 68 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L + VDQE G V+R G T LP A A + A G L A E+ A+ ++ Sbjct: 69 RPLAICVDQENGTVRRL-SGVTALPGAMLLGAT----HKRRYASIVGQLTAKELRAVGVN 123 Query: 115 ISFAPVLDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 + APV DV + IG RS+ DP K A + GM G+ T KHFPGHG Sbjct: 124 WNLAPVADVNNNPNNPVIGVRSFGEDPSKVGDFAKEMMTGMQAEGVIATLKHFPGHGDTV 183 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYW 230 DSH P P ++ ++ F + I E +AIM AH+ + + +P PA+ S Sbjct: 184 VDSHHAMPTIPHGLKRLQQVELVPFKACI-EAGAEAIMSAHIHFPALETEPVPATISYKV 242 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 + +LR ELG+ GV+ +D L ME A A+ A+L AG DM++V + + Sbjct: 243 MTELLRHELGYSGVLMTDCLEMEAIAGSIGTAQGAVAALLAGADMLVVSHCESRQRPTVA 302 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHE 332 + + +G F ++ L S R A+ + Sbjct: 303 --------ALEAAFAEGRFDQKRLDMSFQRIDALKAKYLSWDR 337 >UniRef50_B3QV76 Glycoside hydrolase family 3 domain protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QV76_CHLT3 Length = 639 Score = 337 bits (865), Expect = 4e-91, Method: Composition-based stats. Identities = 82/343 (23%), Positives = 147/343 (42%), Gaps = 32/343 (9%) Query: 2 GPVMLDVEGYELD-------AEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR 54 G +++ E E++ + LVGG++ D +L L + ++ ++ Sbjct: 81 GQMLVASSSSYYQSTDDKSYQELSELVKNDLVGGIMFSRG---DVYELAMLANRYQSLAK 137 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L+++ D E G R + T P AA A E G +A E A+ I Sbjct: 138 YPLLISADMEWGVAMRVKR-TTEFPNNMGVAATWNP----MYAYEMGRAIAEEARALGIH 192 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 ++AP +D+ + + I R++ + Q A+A F+ G A + T KHFPGHG Sbjct: 193 QNYAPAVDLNNNPENPVINTRAFSENVQLTNAMAKAFVMGSQSARVIATAKHFPGHGDTE 252 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV---DPRPASGSPY 229 DSH++ P P + + + + F I N + ++M H+ ++ + PA+ SP Sbjct: 253 IDSHRDLPVLPFTRNRLDSLEFRPFKYAIT-NGVMSVMVGHLALPNLVSSEQVPATLSPG 311 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + +LR+E FDG+I +D ++M G S E ++ AG D++L+ + Sbjct: 312 IISRILRKEFAFDGLIVTDAMTMYGIRKNYSVGEAAVKAVLAGNDILLMPPD-------- 363 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 + E + + K + +S R I +LH Sbjct: 364 ---VAVAHEAIVKAVEKADIPLSHIDESVRRILIVKEWLRLHR 403 >UniRef50_A8SRF1 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SRF1_9FIRM Length = 416 Score = 336 bits (863), Expect = 7e-91, Method: Composition-based stats. Identities = 89/338 (26%), Positives = 153/338 (45%), Gaps = 28/338 (8%) Query: 3 PVML----DVEGYE----LDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR 54 +M + G + ++ L VGG+I F N+ + Q R+++ ++ S+ Sbjct: 92 QMMFVTPEGLTGTDDVMVAGDATKQALQAYPVGGIIYFANNFDNVDQTRDMISNTQSYSK 151 Query: 55 NRLVVAVDQEGGRVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L +A D+EGGRV R + T ++G A E +AS+M A+ Sbjct: 152 IGLFIATDEEGGRVNRIMDTLGTTYIGPMY----DYKDDGPDAAFENAKTIASDMSALGF 207 Query: 114 DISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 ++ FAPV DV + IG+R+Y D +A + + G + G+ T KHFPGHG Sbjct: 208 NLDFAPVADVWSNPDNTVIGDRAYSDDYAQAAELVGSAVKGFQDGGVICTLKHFPGHGDT 267 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWL 231 DSH + R +A+I A +M F + I + + +M H+I D+D P++ S Sbjct: 268 AEDSHYSSAYVNRTKAQIMADEMQPFRTGI-DAGAELVMVGHLIVPDIDELPSTLSYKIA 326 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR ++ FDGV +D + ME A A+ ++ AG DM+L + AV+ Sbjct: 327 TEMLRGDMKFDGVAITDSMEMESIADNYGVADSAVMAVQAGMDMLLQPADLAVAVNA--- 383 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + G + + +S R + ++ + + Sbjct: 384 --------IVTAVQNGDITEPRIDESVR-RILTLKAER 412 >UniRef50_A1ZDA7 Glycosyl hydrolase, family 3 n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZDA7_9SPHI Length = 383 Score = 336 bits (862), Expect = 8e-91, Method: Composition-based stats. Identities = 87/348 (25%), Positives = 157/348 (45%), Gaps = 29/348 (8%) Query: 1 MGPV-MLDVEGYELD--AEEREILAHPLVGGLILFTR---------NYHDPAQLRELVRQ 48 +G + M+ G + + + + +GG+IL++R N +QL+ L+ Sbjct: 45 IGQMLMVGFRGTSIKGKSHIKRDIRKHHLGGVILYSRDLLMGAKPRNIVSKSQLKTLITD 104 Query: 49 IRAASRNRLVVAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMAS 106 ++ S L+VAVD+EGG+V R + GF Q ++ + ++ +A Sbjct: 105 LKKLSDVPLLVAVDEEGGKVSRLNAKFGFATTQTPQEIGEINDL----VATEKWATHIAQ 160 Query: 107 EMIAMDIDISFAPVLDVGHIS--AAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTG 162 ++ AM +++FAPV D+ IG RS+ D + A +F + + G+ Sbjct: 161 KVKAMGFNVNFAPVTDLNVNPKAPIIGKLGRSFSGDVNTLVQHALKFTEVHQKHGIVCAI 220 Query: 163 KHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR 222 KHFPGHG+ DSH + ++ F +I + +M AHV +D + Sbjct: 221 KHFPGHGSAIMDSHLGFTDVTTT---WKPIELKPFKQMIEQGFDGMVMTAHVFNKKLDAK 277 Query: 223 -PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNN 281 PA+ S + LR++ G+ G+I SDD+ M A E + S++AG D++L NN Sbjct: 278 YPATLSKKIMNDYLRKKWGWQGIIISDDMQMNAIAKNFGIEEALEKSINAGVDIVLFSNN 337 Query: 282 RKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + N+ P + +L +G SR+ + +S + + Sbjct: 338 GRIFYN--KNIVPEAINIIKKLIKQGKISRKRIDES-YQRIKKMKQGL 382 >UniRef50_Q47VB6 Beta-hexosaminidase n=1 Tax=Colwellia psychrerythraea 34H RepID=Q47VB6_COLP3 Length = 637 Score = 335 bits (861), Expect = 1e-90, Method: Composition-based stats. Identities = 90/316 (28%), Positives = 148/316 (46%), Gaps = 27/316 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVVAVDQEG 65 EL E ++++ VGG++LF N + AQ+ +L I+AA+ N L++++DQEG Sbjct: 68 TELPKELAQMISTSNVGGIVLFAENVQEIAQVVKLTHDIQAAALKSEQGNPLIISIDQEG 127 Query: 66 GRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 GRV RF + T + A E G A + ++A E+ A+ I+ ++APV+DV Sbjct: 128 GRVARFAK-MTGFAGNMAIGASY-PEHGTHFATQVNAVIAKELKALGINNNYAPVVDVNT 185 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I RS+ + + + + + G+ T KHFPGHG DSH P Sbjct: 186 NVDNPVINTRSFGESAVQVAELGASAVTAIQAQGVMATLKHFPGHGDTHVDSHLGLPRVD 245 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP------------RPASGSPYWL 231 +A I D++ F+ I K IM AH+ + +D RPA+ S L Sbjct: 246 HDRATIDKVDLAPFAWAIDHAKPAMIMTAHIQFPALDNSLFISKSGDKLIRPATMSRKIL 305 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV------CNNRKGA 285 +LR+E+ FDG+I +D L M G + E + AG D+ ++ N+ Sbjct: 306 TNLLREEMEFDGIIATDALDMAGVTHFFTDIEAVVETFVAGADLAVMPFKVRTPNDIDNF 365 Query: 286 VSVLDNLSPIKAERVT 301 + + ++ A R+ Sbjct: 366 YNFIKAVAKALAIRIE 381 >UniRef50_B2SGG8 Glycosyl hydrolase family 3 n=17 Tax=Francisella RepID=B2SGG8_FRATM Length = 555 Score = 335 bits (860), Expect = 1e-90, Method: Composition-based stats. Identities = 84/307 (27%), Positives = 138/307 (44%), Gaps = 35/307 (11%) Query: 1 MGPV-MLDV------------EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVR 47 +G + M+D + + +I +GG ILF N + Q+ L+R Sbjct: 11 LGQLIMMDFRYWGEDSNNQRIPFTKTNDIVNKIFKDYNLGGFILFRENIQNNEQVISLLR 70 Query: 48 QIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASE 107 ++A + + A DQEGGRV R ++G T + AA A ++ E Sbjct: 71 DLQANTNTPIFFATDQEGGRVNRLQQG-TSGCGNMALAATDNPHN----AYTMAKIIGDE 125 Query: 108 MIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHF 165 + ++ I+I+FAP +DV + IG RSY +P + A I+G H+A + KHF Sbjct: 126 LYSLGININFAPAVDVNSNKNNPIIGVRSYSDNPDIVIDYAKNAINGYHDAKIIDCIKHF 185 Query: 166 PGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---- 221 PGHG DSH + E++ ++ FS L R+ IM AH+ +D Sbjct: 186 PGHGDTATDSHLGNVNLDKTLKELQTTELLPFSKLARD--CSMIMTAHISVPALDDTQYQ 243 Query: 222 ---------RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAG 272 PA+ S + +L+Q++ FDG++ SD + M A E + ++ AG Sbjct: 244 SVSTSENIYVPATLSYKIITKLLKQQMKFDGLVVSDAMDMHAIAKHFGTIEASKLAILAG 303 Query: 273 CDMILVC 279 D++L+ Sbjct: 304 IDILLMP 310 >UniRef50_A4BUN0 Glycoside hydrolase, family 3-like protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BUN0_9GAMM Length = 337 Score = 335 bits (860), Expect = 2e-90, Method: Composition-based stats. Identities = 140/312 (44%), Positives = 183/312 (58%), Gaps = 8/312 (2%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ + G L EERE+L P +GG+ILF+RNY QL +L I A R L++AVDQE Sbjct: 1 MIGIRGLVLLPEERELLCDPRIGGVILFSRNYESIEQLSQLTEAIHAIRRPSLLIAVDQE 60 Query: 65 GGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 GG VQRFR GFT LPA L A+ GWLMA+E+ A +D+SFAPVLD Sbjct: 61 GGCVQRFRTGFTALPAPACLGRLYDHGPPVALAAAERLGWLMAAELRAAGVDLSFAPVLD 120 Query: 123 VGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 V S IGER++H + ++ GM +AGM GKHFPGHGAV ADSH E P Sbjct: 121 VERGVSRVIGERAFHRRAGAVADLTLSWVRGMRQAGMAAVGKHFPGHGAVVADSHHELPV 180 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 D RP I + D F LI L+ +M AHV+Y VD RPAS S W+ VLR+ LGF Sbjct: 181 DDRPLDTIASVDFLPFRRLI-AVGLEGVMMAHVLYPQVDARPASFSGVWIAGVLRETLGF 239 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL----SPIKA 297 G + SDDL M+ AA +G +R +L AGCD++L+ N+ + A S+++ L P++A Sbjct: 240 TGAVVSDDLGMQAAACIGDLYQRAGVALAAGCDLLLLGNDWEPAASIVERLPSVERPVRA 299 Query: 298 ERVTRLYHKGSF 309 R+ L +G Sbjct: 300 ARLAALRGEGRL 311 >UniRef50_Q30Y91 Beta-N-acetylhexosaminidase n=3 Tax=Desulfovibrionales RepID=Q30Y91_DESDG Length = 398 Score = 334 bits (857), Expect = 3e-90, Method: Composition-based stats. Identities = 103/358 (28%), Positives = 159/358 (44%), Gaps = 31/358 (8%) Query: 2 GPVM-LDVEGYELD--AEEREILAHPLVGGLILF---------TRNYHDPAQLRELVRQI 49 G ++ + G L + H +GG+ILF RN QL L + Sbjct: 47 GQLLLVGFRGESLSGADSVVRDVTHYNLGGVILFDRDVALGSDRRNIRSVRQLAALTAAL 106 Query: 50 RAASRNRLVVAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASE 107 + A++ L VAVDQEGGRV R R GF LP+AQ +G + +AG + Sbjct: 107 QKAAQTPLFVAVDQEGGRVARLSPRHGFAALPSAQE----TGAAATTAASADAGSRAGAM 162 Query: 108 MIAMDIDISFAPVLDVG--HISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGK 163 + A+ ++++FAPV+DV + IG RS+ AD + A F+ GM G+ K Sbjct: 163 LHAVGVNLNFAPVVDVNVRPDNPVIGRAGRSFSADSEAVARHAQAFLQGMASHGVLGCVK 222 Query: 164 HFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR- 222 HFPGHG+ ADSH ++ + L+ + IM H+ S +DP Sbjct: 223 HFPGHGSSAADSHLGVADISAS---WNENELVPYRRLLDGGHVQMIMTGHLFNSRLDPDH 279 Query: 223 PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 PA+ S + +LR LGF GV+ SDD+ M+ + +L+AG D++L NN Sbjct: 280 PATLSRAVITGLLRGRLGFGGVVVSDDMQMKAITDRYGLEQAVALALNAGVDILLFGNNL 339 Query: 283 KGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHERWQEEKA 339 ++ P + + L +G R + +S AR + L R + + Sbjct: 340 TYDADIV----PEVVDMIESLVERGVVPRSRIEESFARVLRLKESAGLLPARVRTVQE 393 >UniRef50_Q4FV07 Glycoside hydrolase family protein n=32 Tax=Pseudomonadales RepID=Q4FV07_PSYA2 Length = 360 Score = 333 bits (856), Expect = 4e-90, Method: Composition-based stats. Identities = 137/283 (48%), Positives = 183/283 (64%), Gaps = 5/283 (1%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +M+D++ L AE+ ++ VGG+ILF RN D AQ+R L IR +++ V Sbjct: 3 GVLMIDIDSTALTAEDVSLIRQAQVGGVILFARNVTDAAQVRALCDDIR-CHNADILIGV 61 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFAP 119 DQEGGRV R R GFT LPA L + L A + G+LMA+E++A+ ID+SFAP Sbjct: 62 DQEGGRVARLRNGFTPLPAMGRLGELFNQDPNRALECAYDCGYLMAAEVLAVGIDLSFAP 121 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 VLD IS IG+RS+H DPQ +A+AS+F+ GM AGM TTGKHFPGHG++ DSH Sbjct: 122 VLDRDGISQVIGDRSFHHDPQAIIALASQFMRGMKAAGMATTGKHFPGHGSIAPDSHVAE 181 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQEL 239 D R +I DM F+ + LDA+MPAHVI+S VD +PA S WLK ++R++L Sbjct: 182 AIDDRSLDDIMNSDMQPFAQTLP--WLDALMPAHVIFSQVDNKPAGFSKIWLKDIIREQL 239 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 F+GV+FSDDLSM GA G + R A+++AGCD+ LVCN+R Sbjct: 240 NFEGVLFSDDLSMAGAKAAGDVSARVTAAIEAGCDIALVCNDR 282 >UniRef50_Q73RR1 Glycosyl hydrolase, family 3 n=1 Tax=Treponema denticola RepID=Q73RR1_TREDE Length = 557 Score = 333 bits (856), Expect = 4e-90, Method: Composition-based stats. Identities = 87/346 (25%), Positives = 156/346 (45%), Gaps = 25/346 (7%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVV 59 M G + + +G + +F N D +L + + ++ S L V Sbjct: 65 MFGWAGQDPGDLLLSWIQDSGLGSIKVFGWNTGDSRKLAKSISILQKKSLEGRFGIPLFV 124 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A DQEGG V+ + + P + A ++ +G+ ++ E+ A+ I+++FAP Sbjct: 125 ATDQEGGWVRHVKGLTSETPGNLAIGASGIPQDS----YYSGYYISREIRALGINLNFAP 180 Query: 120 VLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 +D+ H S+ IG RS+ P + + F+ G +AG+ TT KHFPGHG + DSH Sbjct: 181 TVDLLTDHDSSIIGPRSFGDSPHAVGILGAAFVRGSRDAGVLTTAKHFPGHGDTSIDSHG 240 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVL 235 P + R +++ F LI + + AIM H+ +S + + PA+ S Y L +L Sbjct: 241 RLPKIDISEETFRNRELIPFKYLI-DAGVPAIMTGHLNFSSILPNGEPATFSKYLLTDIL 299 Query: 236 RQELGFDGVIFSDDLSM-EGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 R ELGF G+I +DD+ M G A+ + +L+AG D+I + + + Sbjct: 300 RGELGFKGLIITDDMMMHGAMNFAGGIAKAVKMALEAGNDIIESSTTPRHYQAFWKDN-- 357 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLH-ERWQEEKA 339 R + ++ + D+A ++ + +L + + Sbjct: 358 ------IRAMKEEPQFKERVKDAA-FRILLEKLKYFKSDNYVPILP 396 >UniRef50_D0KBJ7 Glycoside hydrolase family 3 domain protein n=6 Tax=Gammaproteobacteria RepID=D0KBJ7_PECWW Length = 598 Score = 333 bits (856), Expect = 4e-90, Method: Composition-based stats. Identities = 99/365 (27%), Positives = 156/365 (42%), Gaps = 34/365 (9%) Query: 1 MGP-VMLDV-----------EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVR- 47 +G +MLD + D +++ +GG+ILF N D Q+++L Sbjct: 16 IGQKLMLDFRYWNSTGKSNQDMTIPDDAIGKLITDHHIGGVILFANNLKDKLQIKKLTAW 75 Query: 48 --QIRAASRNRLVVAVDQEGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLM 104 + RL ++ D EGG V R G + P + +A ++A E G LM Sbjct: 76 YTAMETCGGVRLFISTDNEGGNVFRLPRGDYASFPGNMALSAAIEGGADIQIASEQGRLM 135 Query: 105 ASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTG 162 A +M+ I+ +FAPV+DV + I R++ D +A + GM + G+ T Sbjct: 136 AQDMLFTHINTNFAPVVDVNTNPFNPVINVRAFSDDADTVYRVAEKMTAGMKQQGLITAY 195 Query: 163 KHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIR-ENKLDAIMPAHVIYSDVDP 221 KHFPGHG+ + DSH P R + + A D+ + I + D +M AH+ Y +D Sbjct: 196 KHFPGHGSTSTDSHTGLPRVDRSRKDAFAIDIMPYKRAIDSQTVPDMVMTAHIQYPALDN 255 Query: 222 ------------RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASL 269 PA+ S +LR ELG+ GV SD L M A S E + Sbjct: 256 SQVTTRNGEKITVPATMSYAIQTDILRGELGYAGVTISDALDMGAIADHFSQEEAVRQVF 315 Query: 270 DAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLN 328 AG D+ L+ + + L P + + + KG+ E+ S R + R + Sbjct: 316 TAGVDIALMPISISSPDQI--KLLPDLIQHIVDMVKKGNIREAEIDASVERILTLKARYH 373 Query: 329 QLHER 333 L E+ Sbjct: 374 LLCEK 378 >UniRef50_UPI0000E11708 beta-hexosaminidase n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=UPI0000E11708 Length = 349 Score = 333 bits (856), Expect = 5e-90, Method: Composition-based stats. Identities = 147/292 (50%), Positives = 193/292 (66%), Gaps = 1/292 (0%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M P+MLDV G+EL +E+E++AHPL GG+ILF RNYHDP QLR L QI+ ++ + +++A Sbjct: 1 MHPIMLDVAGFELTQQEQEVMAHPLTGGVILFARNYHDPEQLRHLTSQIKHSAGHPILIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRVQRF+ GFT +PA + +LA G ++A+E I +DISFAPV Sbjct: 61 VDQEGGRVQRFKNGFTAIPAMGDI-EHHDHVQARQLAYACGRVIATECIGHGVDISFAPV 119 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+ IS IG RS+ P L++A+ FI G+ GM GKHFPGHG V ADSH P Sbjct: 120 LDINAISDVIGTRSFSNAPSTILSLATEFIHGLKAIGMPAIGKHFPGHGNVKADSHIAMP 179 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R + I DM++F+ +++ +D IMPAHV+Y D+D RPA S WL+ +LRQ G Sbjct: 180 SDERSKDTIFELDMAIFTHFMQQQLVDGIMPAHVVYPDIDSRPAGFSSIWLQQILRQHCG 239 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 F GVIFSDDLSM A+ GS +R + +L AG DM L+CN A+ VLDNL Sbjct: 240 FQGVIFSDDLSMHAASEAGSMLDRVEHALAAGVDMALICNAPDEAIMVLDNL 291 >UniRef50_Q2NRJ7 Putative glycosidase n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRJ7_SODGM Length = 650 Score = 333 bits (855), Expect = 6e-90, Method: Composition-based stats. Identities = 94/334 (28%), Positives = 150/334 (44%), Gaps = 24/334 (7%) Query: 12 ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA---SRNRLVVAVDQEGGRV 68 L+A IL +GG+ILF N Q+R L Q+ A L++A+D EGG V Sbjct: 94 ALNANIEHILHTYHIGGVILFANNLKKTDQIRRLTAQLAAVPQTHGIGLMIAIDNEGGNV 153 Query: 69 QRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI- 126 R G +T AL L G +MA E+ ++ I+++FAPV+DV Sbjct: 154 FRLPRGEYTAFAGNMPLGALDQGHSESVLPYLQGKIMAQELRSLGINVNFAPVVDVNSNQ 213 Query: 127 -SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRP 185 + I R++ P + +A + G A + T KHFPGHG DSH + P R Sbjct: 214 ANPVINVRAFSDSPARVDKLARELVKG--SAQIVTCYKHFPGHGDTHTDSHMDLPRVDRN 271 Query: 186 QAEIRAKDMSVFSSLIRENKL-DAIMPAHVIYSDVDP------------RPASGSPYWLK 232 + E A D++ + + I EN+ D IM AH+ Y +D PA+ S Sbjct: 272 REEAYAIDLAPYRAAIAENEAPDMIMTAHIQYPALDDSRITRLDGEDIVVPATLSRRIQH 331 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LRQELG+ GV +D L M+ + + + +AG D+ L+ + + + Sbjct: 332 DLLRQELGYRGVTITDALDMKAISANFDPRDVVKRVFNAGIDIALMPVSIQAPEE--EEK 389 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + V G + +E+ D++ + +S + Sbjct: 390 LRELIDYVATQVEHGEINGEEI-DASVARILSLK 422 >UniRef50_C3RKZ3 Beta-hexosamidase A n=5 Tax=Bacteria RepID=C3RKZ3_9MOLU Length = 1092 Score = 333 bits (854), Expect = 6e-90, Method: Composition-based stats. Identities = 102/373 (27%), Positives = 171/373 (45%), Gaps = 51/373 (13%) Query: 1 MGP-VMLDV-------EGY------ELDAEEREILAHPLVGGLILFTRNYHDPAQL-REL 45 +G +ML +G ++ E EI+ +G +ILF N++ A++ EL Sbjct: 53 IGQKLMLSFRSGWTMRDGTKISSVQTINDEIHEIIGEYDIGSVILFAANFNSDAKVNVEL 112 Query: 46 VRQIRAASR--------NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLA 97 ++ A+ L++A DQEGG V R G T LP + A E A Sbjct: 113 TDGLQKAAMDKDLGKNSIPLLIATDQEGGIVYRLTGG-TALPGNMALGASGNTEN----A 167 Query: 98 QEAGWLMASEMIAMDIDISFAPVLDVGHISAA--IGERSYHADPQKALAIASRFIDGMHE 155 +AG ++ SE+ A+ ++++FAP DV + IG RS+ ++PQ A S +I+G+ Sbjct: 168 VKAGNIIGSELNAVGVNVNFAPDADVNNNPNNPVIGLRSFSSNPQLAAKFVSAYIEGVQS 227 Query: 156 AGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVI 215 + T KHFPGHG V DSH P P + E+ ++ F + I + D +M AH+ Sbjct: 228 NNVATAAKHFPGHGNVATDSHTGLPSVPATKEELYKTELVPFQAAI-DAGTDMVMTAHIQ 286 Query: 216 YSDV-------DPR------PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYA 262 + ++ D + PA+ S L +LR E+ FDGVI +D ++M+G A Sbjct: 287 FPNIVTEEIYSDKKDELMSPPATLSREILTDLLRDEMKFDGVIVTDSMTMQGVANYFDTN 346 Query: 263 ERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY--HKG----SFSRQELMD 316 ER ++ AG D++ + ++ +++ Y G S +EL Sbjct: 347 ERNLLAVKAGVDILDIPFTDISSMKDMESKLIPLINAFVDAYTKEDGYNGIKLSLEELDK 406 Query: 317 SARWKAISTRLNQ 329 S + + + N+ Sbjct: 407 SV-ERILVCKYNR 418 >UniRef50_A9BCX1 Beta-glucosidase-related glycosidase n=2 Tax=Cyanobacteria RepID=A9BCX1_PROM4 Length = 543 Score = 332 bits (853), Expect = 9e-90, Method: Composition-based stats. Identities = 81/323 (25%), Positives = 139/323 (43%), Gaps = 21/323 (6%) Query: 8 VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGR 67 +EL + VGG+IL+ + ++ Q+R + + + D E G Sbjct: 32 YPNWELTNHRLQQFLEEGVGGVILYGGSI---EEITNRCAQLRMWAGKPIFLCADVEEGV 88 Query: 68 VQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 QRF+ G +P + + E LA+ G L+ E + ++ APV DV Sbjct: 89 GQRFQGGTWLIP-PMALGRIYLKEPEYAISLAEHYGALIGYESVICGLNWVLAPVCDVNS 147 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I R++ +PQ +A F G+ G+ KHFPGHG DSH E P Sbjct: 148 NPLNPVINMRAWSDNPQTVADLACAFHRGLTSQGVLGCAKHFPGHGDTKVDSHLELPVLD 207 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELGFD 242 + + ++ F +LI++ + +IM AH+I + VD P + S L +LR+++ F+ Sbjct: 208 NDLSRLAEIELPPFQALIQQG-VSSIMSAHLILNRVDCNYPVTFSKRILTDLLRKKMCFE 266 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 G+I +D L M + S + +AG D+IL+ N A+ + + Sbjct: 267 GMIVTDALVMRAISKTFSSGSAAVMAFEAGADLILMPQNPSEAI-----------DAIVE 315 Query: 303 LYHKGSFSRQELMDSARWKAIST 325 G L DS + + ++ Sbjct: 316 SLISGRLPISRLEDSLQRRQLAL 338 >UniRef50_C5BIH5 Glycoside hydrolase family 3 domain protein n=1 Tax=Teredinibacter turnerae T7901 RepID=C5BIH5_TERTT Length = 343 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 154/340 (45%), Positives = 200/340 (58%), Gaps = 10/340 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GPV+LDV+G L + +L +P VGG+ILF+RNY + QL L +IR AS +V+A Sbjct: 6 LGPVVLDVDGTTLSESDVNLLQNPWVGGVILFSRNYTNVMQLCGLTAEIRKAS-PNIVIA 64 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRF FT +PA Q L E L E GWL+A+E+IA +D+SF Sbjct: 65 VDHEGGRVQRFGGEFTPIPAMQQLGNLYVKSPVEALALTHETGWLLAAELIACGLDMSFT 124 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV S IG RS+ DP +A++ DG+ EAGM TGKHFPGHG V DSH Sbjct: 125 PVLDVDDCRSQIIGSRSFGPDPAAVIALSEALTDGLIEAGMSATGKHFPGHGGVVEDSHL 184 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E P D R E+ D+ F +L+ KL+AIMPAH+ + + P S +WL+ VLR+ Sbjct: 185 ELPVDNRSFDEVARHDLQPFKALLP--KLEAIMPAHIHFPQIASEPVGFSSFWLQQVLRE 242 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP--- 294 +LGF+G+IFSDDL+MEGA G Y ER +L AGCD +L+CNN GA L L Sbjct: 243 DLGFNGIIFSDDLTMEGAVTAGDYGERTLKALAAGCDSVLICNNPAGAQEALARLDATFS 302 Query: 295 -IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + R+ + K F L SAR K + +L L R Sbjct: 303 TVFPSRLECMRAKRCFEWDSLRGSARAKGVRAQLESLVAR 342 >UniRef50_A3HXS5 Putative hydrolase/beta lactamase fusion protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HXS5_9SPHI Length = 984 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 86/346 (24%), Positives = 143/346 (41%), Gaps = 30/346 (8%) Query: 1 MGPVMLDVEGYELDA----EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + D E E++ + +GGLI F P + L +A S+ Sbjct: 58 LGQLFMVAAYSNKDQRHVNEISELVKNENLGGLIFFQG---GPDRQARLTNYYQAQSKTP 114 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L +A+D E G R P A + A+ + L + G MA + + + I+ Sbjct: 115 LFIAMDAEWGISMRLDS-IPDFPKAMTLGAVQDTD----LIYQMGTEMARQFKELGMHIN 169 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+DV + IG R++ + + A ++ G+ + G+ KHFPGHG D Sbjct: 170 FAPVVDVNSNPDNPVIGYRAFGENKKLVAQRAVSYMKGLQDHGVIANAKHFPGHGDTEVD 229 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWL 231 SH P P+ I D+ + L REN L ++M AH+ +D S S + Sbjct: 230 SHYSLPVIKNPEQRIWDVDLYPYQELFREN-LMSVMVAHLNIPSLDSRADMATSLSKKVV 288 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L+ + F G+IF+D L+M G A S E +L AG D++L + A ++ Sbjct: 289 TDLLQNRMNFQGLIFTDALNMRGVANTNSPGEVDLKALLAGNDILLYSQDVPKAKAM--- 345 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEE 337 + G S E+ + K + + + Sbjct: 346 --------IKSAVEDGVISESEIDRRVK-KVLKAKYWAGLNDYHPI 382 >UniRef50_A1AVY9 Beta-N-acetylhexosaminidase n=2 Tax=Gammaproteobacteria RepID=A1AVY9_RUTMC Length = 327 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 134/294 (45%), Positives = 189/294 (64%), Gaps = 6/294 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGP+M+DV G L +E++ L P +GG+ILF RN+ Q+ L++ IR ++N L++A Sbjct: 2 MGPIMMDVSGLALTTQEKQQLIKPSIGGVILFGRNFESIEQVTLLIKDIRQVNQN-LLIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR GFT LPA AL + A G+++A E++ + +D SFA Sbjct: 61 VDHEGGRVQRFRYGFTHLPAMSKLGALYDKNPDIVIDKAFSCGFVLAYELLDIGVDFSFA 120 Query: 119 PVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + S IG+R++H++P + +A I GMH+AGMK GKHFPGHG VTADSH Sbjct: 121 PVLDINYDTSQVIGDRAFHSNPDVIVKLAGSLISGMHKAGMKCVGKHFPGHGFVTADSHI 180 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 + P D RP +++ DM F LI LDA+M AHV+Y+ VD + A S W+K +L+ Sbjct: 181 DLPIDDRPMSDLLQ-DMKPFKRLINHG-LDALMYAHVVYTQVDDKIAGFSSKWIKDILQN 238 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L F GVIFSDDLSM+ A + R + SLD+GCDM+L+CN+ + ++D Sbjct: 239 QLKFKGVIFSDDLSMQSAHFIKDITNRVKTSLDSGCDMVLICNHPEFVAQIIDK 292 >UniRef50_D1BPZ2 Glycoside hydrolase family 3 domain protein n=3 Tax=Veillonella RepID=D1BPZ2_VEIPT Length = 381 Score = 331 bits (850), Expect = 2e-89, Method: Composition-based stats. Identities = 93/336 (27%), Positives = 169/336 (50%), Gaps = 26/336 (7%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NR 56 G + M+ + G L+ + + +L VGG+ILF RN Q++ L+ I A + Sbjct: 63 GQLLMIGIHGTTLNDDAKFMLNEYRVGGIILFDRNMESKDQVKTLITDINKAGKSAGLTP 122 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L + +DQEGG V R + ++P A+ + +E+ LA+++G +E+ + +I+ Sbjct: 123 LFLGIDQEGGAVARMDDKLIKVPPAEEVGKMP-VEQAVSLAKQSG----AELKDLGFNIN 177 Query: 117 FAPVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 FAPV D+G RSY P + + A EAG+ + KHFPG G D H Sbjct: 178 FAPVADLGLT----YGRSYSTSPDEVVRYAGAVGKAYDEAGLWYSYKHFPGIGKTDVDLH 233 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDA--IMPAHVIYSDVDPR-PASGSPYWLKT 233 +T P + + ++D VF LI+++K + IM +H +Y +DP PAS S + Sbjct: 234 ADTSIVPVSKETLLSEDTKVFIDLIKQSKPNTYTIMVSHAMYPQIDPDHPASLSKAIITD 293 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 LR+++G++GV+ +DD+ M A ++ + S+ AG D++LVC+ + + Sbjct: 294 WLRKDIGYNGVVVTDDMDMGALAKHYTFGDMAVQSILAGSDILLVCHEYEHMQEAYNG-- 351 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + G S++ L +S + + + ++++ Sbjct: 352 ------LMKAVKDGRISKERLDESVK-RILLMKMSK 380 >UniRef50_C6W0V4 Glycoside hydrolase family 3 domain protein n=2 Tax=Flexibacteraceae RepID=C6W0V4_DYAFD Length = 1018 Score = 331 bits (849), Expect = 3e-89, Method: Composition-based stats. Identities = 78/346 (22%), Positives = 141/346 (40%), Gaps = 31/346 (8%) Query: 1 MGPVMLDVEGYELDAE----EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + D +++ +GGLI F AQ L + +A S Sbjct: 83 IGQLFMVATFSNRDESHYRYIDKLINDYQIGGLIFFQGGPVSQAQ---LTNRYQAHSNIP 139 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L + +D E G R + P A+ +L G +A + + I I+ Sbjct: 140 LFIGIDGEWGLGMRLDSTIS-FPKQMVLGAIQD----DRLIYRMGDDIARQCKRLGIHIN 194 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV DV + IG RS+ D + + ++ G+ + T KHFPGHG AD Sbjct: 195 FAPVSDVNSNPANPVIGIRSFGEDKENVARKSIAYMKGLQHNRIIATAKHFPGHGDTDAD 254 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRP---ASGSPYWL 231 SH P ++ D+ + +I ++ L ++ AH+ +D P +S S + Sbjct: 255 SHFTLPVLNHSLEQL-NSDLYPYREMIADS-LMGVLNAHLYIPALDNTPNQASSLSDKVV 312 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR++LGF G++F+D ++M G G A+ +L AG D++L + Sbjct: 313 NGLLRKDLGFRGLVFTDAMNMRGVLKNGKPADVNLKALVAGNDVLLYPESI--------- 363 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEE 337 R+ + S + + D + + + + +++ Sbjct: 364 --AETVSRIKDAINSKLISEKIIDDKVK-RILQAKYWAGLNQYKPI 406 >UniRef50_A1ZJ01 Beta-N-acetylglucosaminidase n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZJ01_9SPHI Length = 994 Score = 330 bits (848), Expect = 3e-89, Method: Composition-based stats. Identities = 77/342 (22%), Positives = 145/342 (42%), Gaps = 30/342 (8%) Query: 1 MGPVMLDVEGYELDA----EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + L +++ VGG+I H QL + ++ Sbjct: 60 LGQLFMVAANTTLIDTHKVSVAKLIRQQHVGGVIFMRGYAH--KQLT-WTNYFQKQAKTP 116 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L+VA+D E G R + P + A+ + L + G +A +M + I+ Sbjct: 117 LMVAMDAEWGLGMRLKN-MMSFPFQMTLGAI----QEDTLIYQMGAQIAQQMKRLGTHIN 171 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+DV + IG RS+ + + ++ G+ + G++ KHFPGHG D Sbjct: 172 FAPVVDVNINPKNPVIGYRSFSENRETVARKGVAYMKGLQDNGIQANAKHFPGHGDTGTD 231 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWL 231 SH P + + ++ F +LI+ + ++M AH+ D RP + S + Sbjct: 232 SHFALPVIKHDTNRLDSIELYPFKALIKHG-VKSMMVAHLNLPAYDSAQNRPTTLSKNVV 290 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L+++LG+ G+IF+D L+M+ E +L AG D++L N K A+ V Sbjct: 291 TKLLKEKLGYQGLIFTDALNMKAVTNFYKPGEVDLQALIAGNDVLLAPENVKKAIEV--- 347 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + + + S+ + + K ++ + + + Sbjct: 348 --------IQKGILQKKISQTAIDQKVK-KVLAFKYSLKLNK 380 >UniRef50_C9RS08 Glycoside hydrolase family 3 domain protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RS08_FIBSS Length = 384 Score = 330 bits (847), Expect = 4e-89, Method: Composition-based stats. Identities = 89/316 (28%), Positives = 142/316 (44%), Gaps = 26/316 (8%) Query: 22 AHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREG----FTR 77 + +GG+IL+ N D QL + +IR + + L+ A+D+EGGRV R + Sbjct: 87 KNYPIGGMILYAHNIVDETQLGNFIAEIRELNGSPLL-AIDEEGGRVARIANNENFDVPK 145 Query: 78 LPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISA--AIGERSY 135 + + A EE A +A + + + + DI +APV DV IG R++ Sbjct: 146 YESMAAIAESDDPEE----AYKAAFTIGTYVKKYGFDIDYAPVADVNTNPDNIVIGPRAF 201 Query: 136 HADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMS 195 DP+ A +++G+ AG+ T KHFPGHG V+ D+H + E+ +M Sbjct: 202 SDDPETAAKFVVSYLNGLESAGVIGTLKHFPGHGDVSTDTHYGYASTDKTWDEMLKCEMI 261 Query: 196 VFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFDGVIFSDDLSME 253 F + I+ IM AH+ V D PA+ S L+ LR ELGFDGVI +D + M Sbjct: 262 PFKAGIK-AGAQMIMTAHIAAPKVSGDDLPATLSSVILQDKLRGELGFDGVIVTDAMDMG 320 Query: 254 GAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQE 313 S AE S+ AG D++L + V + KG Sbjct: 321 AITKQFSNAEAAIKSIQAGVDVVLCSREFTQVF-----------DAVVKAVKKGEIKESR 369 Query: 314 LMDSARWKAISTRLNQ 329 + +S + + + + ++ Sbjct: 370 IDESVK-RILKLKKSK 384 >UniRef50_B6J7B4 Anhydromuramoyl-peptide exo-beta-N-acetylglucosaminidase n=6 Tax=Coxiella burnetii RepID=B6J7B4_COXB1 Length = 313 Score = 330 bits (847), Expect = 4e-89, Method: Composition-based stats. Identities = 137/295 (46%), Positives = 187/295 (63%), Gaps = 7/295 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MG ++D+EG L +ERE+L HP V G+ILFTRNY +P+QLR L++ I+ L VA Sbjct: 1 MGFAIIDLEGVALSPQERELLQHPQVAGVILFTRNYENPSQLRSLIQTIKKI--RPLFVA 58 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRF GFT LP+ + F + K + A + E+ + ID++ Sbjct: 59 VDQEGGRVQRFHRGFTDLPSMRYFGECYQHDPLQARKHLENALKTVVHELKEVGIDVNMT 118 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + IGERS+ DP+ A+ S ID +H M KHFPGHG VTADSH+ Sbjct: 119 PVLDIDQGKNEIIGERSFSGDPRVVEALGSVVIDTLHANRMPAVAKHFPGHGGVTADSHQ 178 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D R I D FS+L ++LDAIMP H+IY D +PA+ S YWL+ +LRQ Sbjct: 179 ALPIDERDPEAILQCDWIPFSAL--SSRLDAIMPTHIIYPAFDEKPATFSRYWLQDMLRQ 236 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 L F GVI SDDL+M+GAA++G Y R +L+AGCD++ +CN+R+G +++LD+L Sbjct: 237 RLNFRGVIISDDLTMQGAAVIGDYPTRAAQALEAGCDLLTICNSREGVIAILDSL 291 >UniRef50_Q2SV66 Beta-N-Acetylglucosaminidase n=31 Tax=Burkholderiaceae RepID=Q2SV66_BURTA Length = 699 Score = 330 bits (847), Expect = 5e-89, Method: Composition-based stats. Identities = 97/338 (28%), Positives = 145/338 (42%), Gaps = 23/338 (6%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA----SRNRLVVAVDQEGG 66 E A R+ L VGG+ILF+ N Q R L+ IRAA S L++ +D+EGG Sbjct: 88 TEFPAAARDALRDSGVGGVILFSNNLTGIEQTRRLIDGIRAAPAADSPVGLLIGIDEEGG 147 Query: 67 RVQRFRE-GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 V R T + A +LA + G ++A+E+ A+ +++FAP +DV Sbjct: 148 NVFRLPRVEATAFSGNMALGAAYEATRDDRLAYDMGRVLAAEIAAVGFNVNFAPDVDVNS 207 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I R++ DP + R + GM + T KHFPGHG DSH P Sbjct: 208 NPLNPVINVRAFGDDPATIGLLGRRMVQGMASERVIGTFKHFPGHGDTDTDSHYGLPVVI 267 Query: 184 RPQAEIRAKDMSVFSSLIRENKL-DAIMPAHVIYSDVDPR------------PASGSPYW 230 + +A+ A D++ + I + D IM AH+ Y +D PA+ S Sbjct: 268 KSRADAYAIDLAPYRQAIESGEAPDMIMTAHIQYPSLDDTRVTTRTGEQMIAPATMSRRI 327 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 +LR E G+ GV +D L M+G A A+ A D+ L+ + A Sbjct: 328 QHDILRGEFGYRGVTITDALDMKGIADFFDEADAVIKVFQADVDIALMPVEFRTAADA-- 385 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLN 328 +RV G R E S R + + T+L Sbjct: 386 GRLAALVDRVAAAVDSGRIDRAEFDSSVR-RIVLTKLR 422 >UniRef50_B4S1A2 Beta-hexosaminidase A n=2 Tax=Alteromonas macleodii RepID=B4S1A2_ALTMD Length = 673 Score = 330 bits (846), Expect = 6e-89, Method: Composition-based stats. Identities = 85/339 (25%), Positives = 146/339 (43%), Gaps = 26/339 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIR----AASRNRLVVAVDQEGG 66 ++ A +L+ +GG+ILF+ N + QL L ++ + L +A+DQEGG Sbjct: 90 TKIPASLLALLSGHNIGGVILFSENIENTEQLITLNYTMQQHMLKVGKQPLFIAIDQEGG 149 Query: 67 RVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG- 124 RV R + A + G AQ + ++ + I+ +FAP +D+ Sbjct: 150 RVARLPSDMLAPFAGNMAIGATF-HQHGSTFAQRVARHIGKTLLPLGINTNFAPSVDINS 208 Query: 125 -HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + I RS+ P++ A+ F++ M G+ + KHFPGHG DSH P Sbjct: 209 EPKNPVINVRSFSEHPEQVAALGQTFVEAMQGTGVISAIKHFPGHGDTHVDSHSGLPLVS 268 Query: 184 RPQAEIRAKDMSVFSSLIRENKLD-AIMPAHVIYSDVDP------------RPASGSPYW 230 +A+ D+ FS++IR +M AH+ Y +D PA+ S Sbjct: 269 HTRAQAMTGDILPFSNIIRSATPPAMVMSAHIQYPSLDDTKITDKNGKRQIVPATLSKKI 328 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC---NNRKGAVS 287 L +LR++LG+ G+I +D L M G S+ E + AG D+ L+ N+ + Sbjct: 329 LTNLLRKQLGYKGLIVTDALDMAGITQFFSHEEALVRAFSAGADIALMPFTIRNKTDINA 388 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + L + + + S + S+ + I + Sbjct: 389 FVKLLDNVAVKLADSALD--TSSESAFLVSSYNRIIEAK 425 >UniRef50_B2FPW9 Beta-hexosaminidase n=18 Tax=Xanthomonadaceae RepID=NAGZ_STRMK Length = 335 Score = 329 bits (845), Expect = 9e-89, Method: Composition-based stats. Identities = 139/328 (42%), Positives = 191/328 (58%), Gaps = 6/328 (1%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ V G EL A+ER+ L H V G++LF RN+ Q+ +L IRAA+ ++ VDQ Sbjct: 2 LLIGVAGTELTAQERDWLQHDAVAGVVLFKRNFASRQQVTDLSAAIRAAAPRPQLICVDQ 61 Query: 64 EGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQRFREG++ LP Q AL ++ LA+ WLMASE+ A +D+SFAPV+ Sbjct: 62 EGGRVQRFREGYSDLPPLQDIGALYATDPQQALALAERHAWLMASEVRASGLDLSFAPVV 121 Query: 122 DVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 D+G + AIG R++ DPQ A + ++ GMH GM T KHFPGHG V D+H +T Sbjct: 122 DLGRGNRAIGNRAFSEDPQVVAAFTAAYVRGMHSVGMAATLKHFPGHGTVLEDTHVDTAI 181 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 DPR E+RA+D+ F + I DA+M AHVIY + P PA SP W++ +LR ELGF Sbjct: 182 DPRALDELRAQDLVPFQAGI-AAGADAVMMAHVIYPQIAPEPAGYSPRWIQDILRGELGF 240 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK--AER 299 GV+FSDD+ M + G R A LDAGCD++LVC+ + L + Sbjct: 241 RGVVFSDDIGMAASHSAGGVPARVHAHLDAGCDVVLVCH-PELVDEALHAVQGRSLNTAA 299 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTRL 327 + L +G+ L+ AR TRL Sbjct: 300 LLGLIGRGALGWDGLLADARHGDTQTRL 327 >UniRef50_Q31RD3 Beta-glucosidase-related glycosidases-like n=5 Tax=Cyanobacteria RepID=Q31RD3_SYNE7 Length = 542 Score = 329 bits (844), Expect = 1e-88, Method: Composition-based stats. Identities = 86/296 (29%), Positives = 149/296 (50%), Gaps = 19/296 (6%) Query: 38 DPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAA--LSGMEEGGK 95 + A++ +Q++ SR L++A D E G QRF G + P + +A E + Sbjct: 65 NAAEVALRCQQLQEWSRVPLLIAADIEEGVGQRF-AGAVQFPPPLALSAIAAQDPERAQQ 123 Query: 96 LAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGM 153 LA+ G + A+E IA+ ++ AP++D+ + + I R++ DP+ + + FI G Sbjct: 124 LARSFGEITATEAIAIRLNWVLAPIVDINNNPANPVINVRAFGEDPETVSQLTTAFIAGA 183 Query: 154 HEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAH 213 + + TT KHFPGHG DSH E P P ++ + A ++ F I +DA+M AH Sbjct: 184 QQFPVLTTAKHFPGHGDTATDSHLELPQIPHDRSRLEAVELEPFRRAI-AAGVDAVMTAH 242 Query: 214 VIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAG 272 + +DP PA+ SP L+ +LR+ELGF+G+I +D L M+ A E + + +AG Sbjct: 243 LSIPALDPDYPATLSPAVLQGLLRRELGFEGLIVTDALVMQAIAAHYGPGEAARLAFEAG 302 Query: 273 CDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRL 327 D++L+ + + A+ + + + +G + L S AR ++ Sbjct: 303 ADILLMPVDPEAAI-----------QEIAKAIREGQITYDRLEQSLARIWRAKQKV 347 >UniRef50_B3EKH1 Glycoside hydrolase family 3 domain protein n=3 Tax=Chlorobiaceae RepID=B3EKH1_CHLPB Length = 583 Score = 328 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 78/353 (22%), Positives = 152/353 (43%), Gaps = 32/353 (9%) Query: 1 MGPVMLD-----VEGYELD--AEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G +++ + + +++ VGG++ N +D L + + + Sbjct: 62 IGQMLIAHSPAKFRSTDDSYYKKLSLLVSQGKVGGIMFLKGNTNDA---AVLANRFQFIA 118 Query: 54 RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L+++ D E G R +G T + + +A + L + ++A E A+ I Sbjct: 119 PRPLLISADMEKGLAMRL-DGATEFAPSMALSATGRPD----LVFKMAGVIAQEAKALGI 173 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 S+ P +D+ + I RSY + + +++ FIDG+ G+ T KHFPGHG V Sbjct: 174 YHSYGPSVDLNTNPLNPVINTRSYGDNIPLTIEMSNAFIDGLQSNGIIATAKHFPGHGDV 233 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPY 229 T DSH P + + ++ F + I ++ + +IM H+ PA+ S Sbjct: 234 TVDSHINLPVLNADKKRLERVELKPFIAAI-DHGIMSIMIGHLAIPAYTGSMTPATLSWR 292 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + +LR+ELGFDG+I +D L+M+ + + +++AG D++L + + + Sbjct: 293 IVTKLLRKELGFDGLIITDALNMKALYQSYTLEDISLRAVEAGNDLLLFSPDPERTHTT- 351 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR-LNQLHERWQEEKAGH 341 + +G S +++ S R ++ R L R + H Sbjct: 352 ----------LLNAVRRGKLSEKQINKSVRRILLAKRWLGLDKNRLVNLNSIH 394 >UniRef50_A1C3U3 Beta-N-acetylglucosaminidase, putative n=7 Tax=Leotiomyceta RepID=A1C3U3_ASPCL Length = 931 Score = 328 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 84/352 (23%), Positives = 148/352 (42%), Gaps = 29/352 (8%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----- 54 MG + M+ +G ++ E R ++ + +G ++L +N + LV +++ +R Sbjct: 19 MGQLFMMGFDGTSVNPEIRSLIENYHLGSVLLTAKNLKSAEEATRLVLELQTIARDAGHR 78 Query: 55 NRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L++A+DQE G V + + P+A AA LA + + A E+ A+ Sbjct: 79 VPLLIALDQENGGVNSLYDEIYIRQFPSAMGIAATGS----KALAHDVAFATAQELKAVG 134 Query: 113 IDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 ++ PVLDV +G R+ DPQ+ F+ G EAG+ T GKHFP +G Sbjct: 135 VNWILGPVLDVLTNVRKQPLGVRTSGDDPQEVSQYGVEFMKGYQEAGLVTCGKHFPSYGN 194 Query: 171 VT-ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSD--VDPRPASGS 227 + S + P ++ + F + I LD++M V S V+ A S Sbjct: 195 LEFLGSQTDVPIITESLEQLSLTALVPFRNAIIHG-LDSMMVGGVSMSSAGVNVMHACLS 253 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + +LR++L F+GV+ S+ L ME + +AGCD+IL+C + Sbjct: 254 EQVVDDLLRKDLKFNGVVVSECLEMEALTHNIGVGGGTVMAKNAGCDIILLCRSFPVQQE 313 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLN--QLHERWQEE 337 ++ L G R + S R + ++ + + Sbjct: 314 AINGLKVG--------VENGILGRARIEQSLR-RVLAMKARCTSWEQALNPP 356 >UniRef50_C8WZJ0 Glycoside hydrolase family 3 domain protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8WZJ0_DESRD Length = 379 Score = 328 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 100/350 (28%), Positives = 160/350 (45%), Gaps = 27/350 (7%) Query: 2 GPV-MLDVEGYELDAE--EREILAHPLVGGLILFTR---------NYHDPAQLRELVRQI 49 G + M+ G +A+ E + +GG++L+ R N AQL+ L + Sbjct: 38 GQMLMIGFRGTTFEAKSPLGEAITEGNLGGVVLYGRDVALNKPMRNIRSAAQLQALTADL 97 Query: 50 RAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMI 109 + +R L +AVD+EGG+V R F P + AA G + + + Sbjct: 98 QDHARIPLFIAVDEEGGQVSRLAPRFG-FPETVT-AATWGRRNDPAWTRRGARAIGQRLR 155 Query: 110 AMDIDISFAPVLDVG--HISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHF 165 + I+ APV+D+ + AIG ERS+ A+P A+ FI G+H+AG+ KHF Sbjct: 156 NLGCTINLAPVVDLNTNPDNPAIGKLERSFGANPDTVTRQAAAFIHGLHDAGILACIKHF 215 Query: 166 PGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPA 224 PGHG+ DSH K++ + L+ DA+M AHV ++ +DP PA Sbjct: 216 PGHGSAYNDSHLGLTDISTT---WSPKELEPYKRLVDRGLADAVMTAHVFHAGLDPKVPA 272 Query: 225 SGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKG 284 + S + +LR+E+G++GV+ SDDL M S + + L+A D+ L NN + Sbjct: 273 TLSAEIIPDILRREIGYEGVVISDDLQMGAIRQSFSLRQTVRRCLEADVDIFLFGNNLEY 332 Query: 285 AVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHER 333 V + I V L + SR + S R + + R++ R Sbjct: 333 EPFVWRRVQRI----VRDLVDQNIVSRSRIERSYERIQRLKERMDLFQGR 378 >UniRef50_Q6MA50 Putative beta-N-acetylglucosaminidase n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6MA50_PARUW Length = 550 Score = 328 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 85/342 (24%), Positives = 158/342 (46%), Gaps = 24/342 (7%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILF--TRNYHDPAQLRELVRQIRAAS---- 53 +G + ++ G + + E +++ VGG+I + + + P Q++ L ++AA+ Sbjct: 45 IGQLLIVHFRGEDCNKEALQMIYEAHVGGVIYYPWSNGLNSPLQIQHLSAHLQAAAFQKK 104 Query: 54 -RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 + L+++VDQEGG V R +G T P + +LA E A+E++AM Sbjct: 105 NKIPLLISVDQEGGVVNRLTKGMTLFPGNYALGQTKDP----QLAFEYASASANELLAMG 160 Query: 113 IDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 I+++ APV+DV + IG RS+ P + + S+ ++G ++AG+ T KHFPGHG Sbjct: 161 INMNLAPVIDVNTQPCNPVIGIRSFSTCPDEIVRFGSKVVEGFNKAGLIATLKHFPGHGD 220 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPA-SGSPY 229 +DSH P + E+ ++ F +L + + +M AH++ +D + S Sbjct: 221 TNSDSHHTLPIVNKTLVELEKVELVPFKALASQTPV--VMTAHLLVPALDALNCVTFSKS 278 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEG-AAIMGSYAERGQASLDAGCD----MILVCNNRKG 284 + VLR+E+G G+I +D L M+G + + SL AG D N Sbjct: 279 AI-DVLRKEIGEKGIILTDSLVMQGVLQDGKTVEQAALKSLQAGHDLLLLGGKRLLNESS 337 Query: 285 AVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + + + S + S + + ++ + Sbjct: 338 EFELEAKDVLKIHQFLVNAVRQNLISEARIDISVK-RILALK 378 >UniRef50_Q6AJ70 Related to glycosyl hydrolase n=1 Tax=Desulfotalea psychrophila RepID=Q6AJ70_DESPS Length = 377 Score = 328 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 84/347 (24%), Positives = 163/347 (46%), Gaps = 30/347 (8%) Query: 4 VMLDVEGYELDA--EEREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAA 52 +M G E+ + +GG+ILF + N P Q+ +L +++ Sbjct: 42 LMTGFIGEEISQCSSTVNDIKEHGLGGVILFDKCLATGSRKNNITSPEQMAKLTSDLQSC 101 Query: 53 SRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 S L++AVDQEGG V RFR GF+ P+A +E+ + A++ + + Sbjct: 102 SAEPLLIAVDQEGGMVSRFRSAYGFSTSPSALMLGQQDDLEKCREAAEQ----TSQMLKD 157 Query: 111 MDIDISFAPVLDVGHI--SAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 ++ + APV+D+ + IG +RS+ + Q + A ++ ++G+ + KHFP Sbjct: 158 AGVNFNLAPVVDLDLTALNPIIGKYQRSFGDNAQTVIECALSWMSAHRKSGILSCLKHFP 217 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PAS 225 GHG+ ADSH+ +++ + + + ++DAIM H+ + ++DP PAS Sbjct: 218 GHGSSLADSHRGFVDITES---WDREELLPYKKIHQRTEIDAIMMGHLFHKNLDPLYPAS 274 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S + ++LRQE+ + G+I SDD+ M + ++ AG DM+++ NN Sbjct: 275 LSKRIIDSLLRQEMNYQGLIISDDMQMGAITKYYGLEKACCRAICAGVDMVIIGNNISSD 334 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLH 331 ++ ++ + + +G S + + R + + +L++ Sbjct: 335 PKIVTKIT----DSLKNSIDQGLLSEARVDQAWQRVQGLKNKLHKGE 377 >UniRef50_D2RL81 Glycoside hydrolase family 3 domain protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RL81_ACIFE Length = 369 Score = 328 bits (843), Expect = 2e-88, Method: Composition-based stats. Identities = 86/330 (26%), Positives = 153/330 (46%), Gaps = 22/330 (6%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN-RLVV 59 G +M + +EG E A ++ L GG+ILF RN DP Q++ L +++ +R L++ Sbjct: 54 GQLMLVGMEGTEPSAAFQKELQENHYGGVILFDRNMKDPNQVKALNSRLQDLARPEPLLI 113 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 VD+EGG+V R ++ P+ ++ + A+ + + A+ +++ AP Sbjct: 114 GVDEEGGQVARMKDQLPAPPSQEAIGRTGDPAQ----AEAWAAKTGTRLKALGFNLNLAP 169 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 V DVG ERSY + P A + G + + KHFPG G D+HK Sbjct: 170 VADVGSG----MERSYSSHPATAAKFVEGALAGYQKVHLLACLKHFPGLGKGKEDTHKGQ 225 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLD--AIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 + + +D F + + +M HV Y +DP PAS SP + +LR Sbjct: 226 VIVSADRETLLQEDGLPFRLALEKYPEPWFGVMVTHVRYPALDPDYPASLSPKIMGDLLR 285 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 + F G++ +DDL+M A E ++ AG D+++VC+ + V Sbjct: 286 GQFHFSGLVLTDDLTMGAIASQYPLEEAAVQAVRAGADLVMVCHGPETIRRV-------- 337 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + +G+ R+++ D+A ++ + T+ Sbjct: 338 HGALAKAIREGTLPREQV-DAALYRILRTK 366 >UniRef50_A2EGF0 Glycosyl hydrolase family 3 N terminal domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2EGF0_TRIVA Length = 515 Score = 328 bits (843), Expect = 2e-88, Method: Composition-based stats. Identities = 98/347 (28%), Positives = 165/347 (47%), Gaps = 22/347 (6%) Query: 3 PV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRL 57 + M ++ G +LDAE ++ +G LILFT N + ++++L + S Sbjct: 18 QMFMFNLPGKQLDAEIFNLIKKYHIGNLILFTINLGNGKEIKKLTSDFQEYSLKEFGFPA 77 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++ D EGG V+R+ E R P A + + + + L++ + ++++ I+++ Sbjct: 78 FISSDMEGGNVERYSEDIPRFPGAAAIVSTNSIANCQDLSESVAKV----LLSLGINMNL 133 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APV D+ + IG RS+ DPQK I+G +AG T KHFPGHG+ DS Sbjct: 134 APVCDINDNSRNPVIGARSFGFDPQKISNYIKAHINGHKKAGCLTCAKHFPGHGSTFQDS 193 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVL 235 H+E P E KD+ F I E +D+IM AH++ + PAS SP + + Sbjct: 194 HRELPHIIHKYDEFMKKDIPPFKDAI-EAGVDSIMTAHLLSPLDEENPASLSPKVINYI- 251 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+ELGFDG++ +D L M+G +I G + ++ AG ++ L+ N +G + + Sbjct: 252 RKELGFDGLLVTDSLVMKGVSIQG-IDVAVRKAILAGNNI-LIANTMEGDFAEFYKM--- 306 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKA---ISTRLNQLHERWQEEKA 339 E V G Q + DS R + + ++ E + Sbjct: 307 -IEGVLNDVKSGVIPEQLIDDSCRRIISSKLKLKHALETNKYDENEE 352 >UniRef50_Q7X3A0 Putative glycosylhydrolase n=1 Tax=Candidatus Fritschea bemisiae RepID=Q7X3A0_9CHLA Length = 389 Score = 328 bits (842), Expect = 2e-88, Method: Composition-based stats. Identities = 86/346 (24%), Positives = 149/346 (43%), Gaps = 18/346 (5%) Query: 1 MGPVMLDVEGYELDAE-EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVV 59 +G + + + +E ++L +G LIL + H Q+ L ++ S+ L+ Sbjct: 33 IGQLFIVPACPKFSSEMIEKMLDVYHIGALIL--KQAHPKEQIPFL-NALQKRSKLPLLC 89 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 D E G R +E + P + + L +G + + + I ++FAP Sbjct: 90 TGDAEWGLGMRMQETLS-FPRNEIMGKVKD----QSLLFSSGKEIGRQCRIVGIHLNFAP 144 Query: 120 VLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 V+D+ + IG RS+ +DP+K AS I GM + + KHFPGHG DSHK Sbjct: 145 VVDINNHPNNTVIGNRSFGSDPEKVSRCASLIIQGMRAGKVLSCLKHFPGHGDTDVDSHK 204 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 P + + + F + DAIM H++ +D PAS S + +L+ Sbjct: 205 GLPTILHSRKHLEQVEFLPFKENLESA--DAIMTGHLMLPVLDKNFPASLSKLIVTDLLQ 262 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV-CNNRKGAVSVLDNLSPI 295 +E GF G+I +D L+M+ S E + AG D++L + ++L+N P+ Sbjct: 263 KEWGFKGLIITDALNMKALTQNYSVEEIALKAFLAGHDLLLYGSHRYDDVKNLLENAIPL 322 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAI--STRLNQLHERWQEEKA 339 + + G +L+D+ K + RL +R+ Sbjct: 323 AYKSIQNGIINGEID-SDLLDARVLKILQVKERLGLHKDRFNPNPE 367 >UniRef50_B5EPA1 Glycoside hydrolase family 3 domain protein n=2 Tax=Acidithiobacillus ferrooxidans RepID=B5EPA1_ACIF5 Length = 354 Score = 327 bits (840), Expect = 3e-88, Method: Composition-based stats. Identities = 139/344 (40%), Positives = 194/344 (56%), Gaps = 11/344 (3%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 GP+M+D+ G A E +L P VGG+ILF RN D Q+R L ++ A L+V + Sbjct: 13 GPLMVDIAGLRPTAAECAMLRQPAVGGVILFARNCADAGQVRALCAELHALRTTPLLVGI 72 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSG---MEEGGKLAQEAGWLMASEMIAMDIDISFA 118 DQEGGRVQR R G TR P + L+ +E A++ G L+ +E+ A+ +D+ F Sbjct: 73 DQEGGRVQRLRAGVTRFPPMATLGRLADRDGLERALVAAEDWGLLLGTELRALGVDLDFT 132 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 P +D+ ISA IG+R+ H DP AIA+ GM G++ KHFPGHGAV ADSH Sbjct: 133 PCVDLDRGISAVIGDRAIHRDPVWVGAIAAALWKGMARTGLQGVAKHFPGHGAVAADSHL 192 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 + P D RP + I+ D+ F++LI + AIMPAH +Y VD PA SP WL+ VLR Sbjct: 193 DLPLDDRPFSAIQE-DLQPFAALI-AAGIPAIMPAHCLYPRVDGEHPAGFSPRWLQDVLR 250 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 +E+GF GV+ SDDLSM GA G A R A+L AG D++LVCN+ +GA + + ++ Sbjct: 251 KEMGFTGVVVSDDLSMAGAHGAGGVAARVDAALTAGADLLLVCNDPEGAETAIAHVRSRD 310 Query: 297 A----ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 A +R+ RL + + D R L L + + Sbjct: 311 ALPTGDRLERLAGNAATAALSAADRVRCLQQIEELRALAPGYTD 354 >UniRef50_A5FAE4 Candidate beta-glycosidase; Glycoside hydrolase family 3 n=3 Tax=Flavobacteriaceae RepID=A5FAE4_FLAJ1 Length = 997 Score = 326 bits (838), Expect = 5e-88, Method: Composition-based stats. Identities = 85/345 (24%), Positives = 152/345 (44%), Gaps = 30/345 (8%) Query: 2 GPVMLDVEGYELDA----EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL 57 G + + D+ + +++ VG +I F P + +L ++ ++ L Sbjct: 79 GQLFMVSAYSNKDSIHVNQVSKLIEEYKVGSVIFFQG---GPVRQAKLTNLYQSKAKVPL 135 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 V +D E G R R P + A+ + KL ++ G MA+E + + +F Sbjct: 136 FVGIDAEWGLAMRLDS-TYRYPWNMTLGAIQDL----KLIEKVGRNMANENKRIGVHFNF 190 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APVLD+ + IG RS+ D A ++G+ G+ TGKHFPGHG DS Sbjct: 191 APVLDINTNPKNPIIGNRSFGEDKVNVSEKAIALMNGIQGNGVFCTGKHFPGHGDTATDS 250 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWLK 232 H P + + ++ + + E + ++M AH+ ++P P+S S + Sbjct: 251 HHALPTVNFSKERLDLVELYPYKRMFDEGLV-SVMVAHLNIPSLEPIPNLPSSASYNVVT 309 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +L++ELGFDG+IF+D L+M+GAA + A L AG D++L N Sbjct: 310 NLLQKELGFDGLIFTDGLAMKGAANFKGPGDLEIAVLLAGNDILLCPEN----------- 358 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEE 337 P+ +++ Y+ + + L S + K + + ++ Sbjct: 359 VPVAVQKLEEAYNNNIITEERLAHSVK-KILHYKYKAGLNHYKPI 402 >UniRef50_B6GD75 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6GD75_9ACTN Length = 806 Score = 326 bits (838), Expect = 5e-88, Method: Composition-based stats. Identities = 95/357 (26%), Positives = 157/357 (43%), Gaps = 38/357 (10%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----------RNRLVVA 60 L+ E ++ +GG+ILF N + Q L ++ A+ L++ Sbjct: 90 TVLNEEIASVIDSFDLGGVILFANNVKETEQTLRLAMDMQQAALRNTAQAPAGDIPLLLT 149 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +DQEGG V R G T LP + A E+ + G ++ E+ A+ I+++FAP Sbjct: 150 IDQEGGIVYRLGSG-TALPGNMALGATRSEEDAL----QCGQVIGRELEALKINVNFAPS 204 Query: 121 LDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG RSY +DP+ + + + GM E G+ T KHFPGHG DSH Sbjct: 205 FDVNNNPNNPVIGLRSYSSDPELVAQLGTAMMSGMQEHGIATAAKHFPGHGDAGTDSHTG 264 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--------------PRPA 224 P + + E+ + F S I E +D +M AH+ + V+ PA Sbjct: 265 LPRIEKTKEELDNLEFIPFRSAI-EKGVDMVMTAHIQFPKVETAKVPSADPSMGEIELPA 323 Query: 225 SGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKG 284 + SP ++ +LR E+GF GV +D L+M+ + Y + + + AG D+ L+ + Sbjct: 324 TLSPVFMTDILRTEMGFTGVSITDALNMDAISKNFGYIDAVKRTFKAGVDIALMPVTLRS 383 Query: 285 AVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAGH 341 V P E V L + + L S R + + + + +++ H Sbjct: 384 QQDV-----PKLKELVDALEQDAELTDERLNTSVR-RILELKKRRGILAYKDTAGSH 434 >UniRef50_C1D8P0 NagZ n=4 Tax=Proteobacteria RepID=C1D8P0_LARHH Length = 362 Score = 326 bits (837), Expect = 6e-88, Method: Composition-based stats. Identities = 148/316 (46%), Positives = 186/316 (58%), Gaps = 8/316 (2%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 GPVM+DV G L ++R L+HP+VG +ILF RN+ QLR L +IR+ L++AV Sbjct: 8 GPVMVDVAGCSLSDDDRRRLSHPMVGAVILFRRNFESVGQLRSLTEEIRSLRSPHLLIAV 67 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 D EGGRVQRF GFTRLP Q+ AL E LA + G ++A+E+ A +D+SF P Sbjct: 68 DHEGGRVQRFLTGFTRLPPMQALGALWEKSPEAALDLAHDVGEVLATELRASGVDLSFTP 127 Query: 120 VLDVGHISA-AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 VLD+ H IG RS+HADP +A G+ GM GKHFPGHGAV DSH Sbjct: 128 VLDIDHGRCAVIGNRSFHADPNIVATLAEALTRGLVAGGMIACGKHFPGHGAVEGDSHHV 187 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P D R A I D+ F+ L + A+MPAHV+Y VDP PA SP+WL+TVLR+ Sbjct: 188 IPVDHRSYASIEEADLIPFARL-AAAGMHAVMPAHVVYDAVDPLPAGFSPFWLQTVLRER 246 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL----SP 294 +GF G IFSDDL MEGAA G A R QA+ AGCDM LVCN A +L L +P Sbjct: 247 MGFGGTIFSDDLCMEGAAGAGGIAARAQAAFAAGCDMALVCNRPDLADELLAQLDMPVTP 306 Query: 295 IKAERVTRLYHKGSFS 310 R+ + G + Sbjct: 307 ELTGRLAAMAGHGEVA 322 >UniRef50_A4C9R3 Beta-hexosaminidase n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C9R3_9GAMM Length = 338 Score = 326 bits (837), Expect = 6e-88, Method: Composition-based stats. Identities = 151/336 (44%), Positives = 199/336 (59%), Gaps = 6/336 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGP++ + G L EE E L HPLV G+ILFTRNYHD QL L QI+ S L+++ Sbjct: 1 MGPILTGISGLSLLPEEVEYLQHPLVAGVILFTRNYHDNEQLIALTAQIKKIS-PMLLIS 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD EGGRVQRFR+GF+ +PA + LA + G+ +A+E+ A +D SF PV Sbjct: 60 VDHEGGRVQRFRDGFSAIPAMFEATSFQKKHNSPILA-DLGFTLAAELRAHGVDFSFTPV 118 Query: 121 LDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 LD+ IS I R++ P + + A FI G+H AG+K KHFPGHG+V DSH P Sbjct: 119 LDLNGISQVITTRAFAISPIEVIDAAKHFIGGLHSAGVKNVVKHFPGHGSVGPDSHIAMP 178 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 D R +AEI DMSVF +L+ D +MPAHVIY DP+PA S YWLK VLR +LG Sbjct: 179 VDERSKAEIFNSDMSVFKALVDSGCADGVMPAHVIYPHCDPKPACFSDYWLKQVLRNKLG 238 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA--- 297 F G +FSDD+ M+GA MG Y R Q++LDAGCD+ L+CN G VLD L Sbjct: 239 FKGPVFSDDMGMQGAVQMGDYVTRTQSALDAGCDLALLCNEPHGLYEVLDGLEINLYQNH 298 Query: 298 -ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 +R K + + EL RW A + ++++ + Sbjct: 299 GQRGLAYLQKKNVTMMELKTDHRWLAAQSVISKMED 334 >UniRef50_B2RGN3 Glycosyl hydrolase family 3 n=2 Tax=Porphyromonas gingivalis RepID=B2RGN3_PORG3 Length = 1003 Score = 326 bits (837), Expect = 7e-88, Method: Composition-based stats. Identities = 87/337 (25%), Positives = 147/337 (43%), Gaps = 31/337 (9%) Query: 2 GPVMLDVEGYELDAE----EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL 57 G +++ V L E +++ +GG ILF + + + R+++ A+ L Sbjct: 70 GQLLMPVVYPSLQEEKVKQAEQLVRTCHIGG-ILFQKGTLS--EQYTMTRRLQEAAGTPL 126 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++A+D E G R ++ R P +L G +A + M I I+F Sbjct: 127 LIALDGEWGLHMRLKD-APRFPRNMGLGHQKD----NQLLYNYGREVARQCRLMGIHINF 181 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APVLDV + + IG RS+ +P++ + G+ + G+ KHFPGHG T DS Sbjct: 182 APVLDVNNNPKNPVIGTRSFGDNPRRVAERGIAYAQGLEDGGVMAVAKHFPGHGNTTEDS 241 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWLK 232 HK P + E+ ++ F R L +M AH+ ++ P+S S Sbjct: 242 HKTLPTVFASREELENTELFPFKEFFR-AGLSGVMTAHLNVPALEAKKNTPSSLSHAICT 300 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LRQE+GF G+IF+D L+M+G GS ++ AG D++L + S Sbjct: 301 DLLRQEMGFKGLIFTDGLAMQGVQTAGSQP-ISVRAILAGNDILLGPVDPVKTFS----- 354 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 V + S++ L + R K ++ + Sbjct: 355 ------EVLAAVEDRTISKELLDEKCR-KILAFKYAL 384 >UniRef50_A1VC58 Glycoside hydrolase, family 3 domain protein n=3 Tax=Desulfovibrio vulgaris RepID=A1VC58_DESVV Length = 481 Score = 326 bits (836), Expect = 8e-88, Method: Composition-based stats. Identities = 101/343 (29%), Positives = 162/343 (47%), Gaps = 32/343 (9%) Query: 2 GPV-MLDVEGYELDAE--EREILAHPLVGGLILFTR---------NYHDPAQLRELVRQI 49 G + ML G E +A VGG+ILF R N QL L + Sbjct: 61 GQMLMLGFRGAEPADAGPILRSIAAGHVGGVILFDRDMDPSVRVRNIVSKEQLSRLTGAL 120 Query: 50 RAASRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASE 107 +AA+ L +AVDQEGGRV+R + GF P+A S E+ + +A+E Sbjct: 121 QAAAPVPLFIAVDQEGGRVRRLKPEYGFFAYPSAASLGK-GSPED----TRRMASTLATE 175 Query: 108 MIAMDIDISFAPVLD--VGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGK 163 M + +++ F PV+D V + I ERSY +DP + + A+ F++G+ G+ + K Sbjct: 176 MAEVGLNVDFGPVVDLAVNPSNPVIARLERSYGSDPCRVASHAAAFVNGLAWRGVVASLK 235 Query: 164 HFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-R 222 HFPGHG+ DSH R +++ ++ L+R++ +M H+ + +D Sbjct: 236 HFPGHGSSLQDSHLGVTDISST---WRREELGPYALLLRDDWAGMVMVGHLYNNRIDAAH 292 Query: 223 PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 PA+ S + +LR++LG+ GV+ +DDL M S E + +++AG D++L NN Sbjct: 293 PATLSQRTIDGLLRRDLGWKGVVVTDDLQMGAITARYSLDETVRLAVEAGADILLFGNNL 352 Query: 283 KGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAIS 324 + L+ + RL +G S Q L S R + Sbjct: 353 VWD----EGLAEKVHATLVRLVREGKVSEQRLRQSWERIMRLK 391 >UniRef50_Q2BGB3 Beta-hexosaminidase n=1 Tax=Neptuniibacter caesariensis RepID=Q2BGB3_9GAMM Length = 342 Score = 326 bits (836), Expect = 8e-88, Method: Composition-based stats. Identities = 134/330 (40%), Positives = 198/330 (60%), Gaps = 11/330 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 + LD++G EL EE+ +L PL+GG+ILF RN + Q+ ELV IRA R L++++DQ Sbjct: 13 LFLDLDGLELTDEEKVLLLDPLIGGVILFARNTQNAKQVAELVASIRAI-RPDLILSLDQ 71 Query: 64 EGGRVQRFREGFTRLPAAQSFAA-LSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 EGGRVQR ++G T LP ++ + + +EE A++ G+LMA+E+ +DID+SFAPV+D Sbjct: 72 EGGRVQRLKDGVTLLPPVKALSKFSTSLEENRSTAEQLGYLMAAELRQLDIDLSFAPVVD 131 Query: 123 VGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + + + IG R++ D ++S +I+GM +AGM T KHFPGHG AD+H Sbjct: 132 IDYGRNTVIGNRAFGNDVSTVTVLSSGYIEGMRDAGMPATVKHFPGHGWANADTHHADAI 191 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGF 241 D R + I DM F I N ++A+M AHV+Y D PA S +W+ VLR LGF Sbjct: 192 DEREFSVIEQADMVPFKEAIT-NGVEAMMFAHVLYPQCDASPAGFSDFWIGNVLRDGLGF 250 Query: 242 DGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVT 301 GVIFSDDLSM+ A G YA+R + ++ AGC ++L CN+RKG ++VL+ + E + Sbjct: 251 QGVIFSDDLSMKAAHSAGGYAKRAEHAIRAGCQVLLCCNDRKGTLAVLEYMRSACVEPYS 310 Query: 302 RLY----HKGSFSRQELMDSARWKAISTRL 327 L L + + +++RL Sbjct: 311 ALSSLKGKAIKVDEVRLEQA---RVLASRL 337 >UniRef50_B9E727 Putative uncharacterized protein n=1 Tax=Macrococcus caseolyticus JCSC5402 RepID=B9E727_MACCJ Length = 379 Score = 326 bits (836), Expect = 9e-88, Method: Composition-based stats. Identities = 88/333 (26%), Positives = 162/333 (48%), Gaps = 19/333 (5%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQI-RAASRNRLVVAV 61 ++L +G + +A + A +GG+ILF RN + +QL +L R I A+ + + + Sbjct: 61 QLVLGFDGTDYNAALKRYSAR-NIGGVILFKRNIVNESQLTQLNRDIFHNAANIPMFIGI 119 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 DQEGGRV R + +A + + + A + G + + M ++ FAPVL Sbjct: 120 DQEGGRVNRLPASIQNIESAYTIG----LSHDKQYAYQQGRYIGENVKRMGFNLDFAPVL 175 Query: 122 DV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 D+ + IG+RS+ D ++ F G+ E G+ T+GKHFPGHG +DSH+ Sbjct: 176 DIWSNPANKVIGDRSFGHDAHMVSGMSRSFRQGITETGIITSGKHFPGHGDTVSDSHETL 235 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQE 238 P + E+ ++ F I + +D +M +H+ + D+D + PAS S + +L+++ Sbjct: 236 PVSNKSLDELMKFELIPFKQHIDDG-IDMLMVSHISFPDIDNKYPASISKEIVTNILKKK 294 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LG+ GVI SDDLSM + S E L +G ++L+ ++ +++ + Sbjct: 295 LGYKGVIISDDLSMGAISDRYSLNEAVVRGLQSGETIMLIGSDAVDVDTLIQYVRSN--- 351 Query: 299 RVTRLYHKGSFSRQELMDSA-RWKAISTRLNQL 330 G+ ++ + ++ + + + L Sbjct: 352 -----VENGNIDKKIIDENNEKIIRLKLKYGIL 379 >UniRef50_B3EU27 Putative uncharacterized protein n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3EU27_AMOA5 Length = 1007 Score = 325 bits (835), Expect = 1e-87, Method: Composition-based stats. Identities = 73/319 (22%), Positives = 139/319 (43%), Gaps = 29/319 (9%) Query: 1 MGPVML----DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + +G + ++ +GGLI F DP +L Q + ++ Sbjct: 46 IGQLFMVAAYSNQGEKHHEFIENLIQRYNIGGLIFFQG---DPISQAKLTNQYQLKAKTP 102 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++A+D E G R + P + A+ +L + G +A ++ + I ++ Sbjct: 103 LLLAIDAEWGLGMRLTNTIS-YPRQMTLGAIQD----HQLIYDMGAEIARQLKLLGIHVN 157 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+D+ + IG R++ ++ +I G+ + G+ KHFPG G + D Sbjct: 158 FAPVIDINNNPDNPGIGNRAFGDGKGSVISKGLAYIQGLQDNGILAVAKHFPGIGDASKD 217 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRP---ASGSPYWL 231 H E P P + + ++ F IR + IM +H+ + P AS S + + Sbjct: 218 PHHELPTIPYDITRLESIELYPFRKAIR-ANVGGIMVSHIYLPAYEKTPNRAASLSSHIV 276 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L+ +LGF G+IF+D L+M+ + E +L AG D++L + A+++ Sbjct: 277 TQLLKNKLGFKGLIFTDALNMKAVSKYYQPGEVDLLALQAGNDILLFPEDVPKAIAL--- 333 Query: 292 LSPIKAERVTRLYHKGSFS 310 + +G + Sbjct: 334 --------IKSAIEQGKLA 344 >UniRef50_A5CLY4 Putative beta-N-acetylglucosaminidase n=1 Tax=Clavibacter michiganensis subsp. michiganensis NCPPB 382 RepID=A5CLY4_CLAM3 Length = 607 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 97/327 (29%), Positives = 146/327 (44%), Gaps = 31/327 (9%) Query: 18 REILAHPLVGGLILFTR---NYHDPAQLRELVRQIRAAS----RNRLVVAVDQE-GGRVQ 69 E++A GG+I F N DP Q+ L ++ AS R L+V+ DQE GG V Sbjct: 97 AEVVARHRPGGIIYFAGEKDNLVDPRQIARLSGDLQRASQGAGRIPLLVSTDQEQGGYVV 156 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--GHIS 127 R T LP A+ + + E++AM I+ +FAPV+DV + Sbjct: 157 R--APCTLLPGQMGLASSPSSSAD---VRAVATITRDELLAMGINQAFAPVVDVASNPAN 211 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHE-AGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 I RS+ DP + +A I + G+ KHFPGHGA DSH P + R + Sbjct: 212 PIIHVRSFGDDPDRVALLAQSQIGTFQKRQGIVAAAKHFPGHGATDTDSHTGLPVNNRSR 271 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGFDGV 244 +E+ D+ F++ I+ + + IM AH++ +D PA+ S L VLRQ+LG+DGV Sbjct: 272 SELEDNDLPPFAAAIK-SGVGVIMSAHIVVPALDSSGDPATLSYPILTGVLRQKLGYDGV 330 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY 304 + +D L M G S A ++ AG D +L + A + V Sbjct: 331 VITDALDMAGVRQRYSDARVPVLAIKAGVDQLLTPPDFFAA-----------RDGVLAAI 379 Query: 305 HKGSFSRQELMDSA-RWKAISTRLNQL 330 G + + +S R + R L Sbjct: 380 RAGELTEARIDESVTRILRLKQRFGLL 406 >UniRef50_C6W3E7 Beta-lactamase n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W3E7_DYAFD Length = 992 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 80/331 (24%), Positives = 144/331 (43%), Gaps = 25/331 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQR 70 +++ VGG++ F P L ++ S+ L+VA+D E G R Sbjct: 76 TSDPTVVEKLIRENKVGGIVFFQG---GPMPQARLTNYYQSISKVPLLVAMDAEFGLAMR 132 Query: 71 FREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG--HISA 128 R P + A+ + L E G +A + + + I+FAPV DV + Sbjct: 133 IDS-TVRYPYQMTLGAI---QGNNDLIYEMGAQLAKQARRLGMHINFAPVADVNNNPDNP 188 Query: 129 AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAE 188 I RS+ + K A ++ GM + G+ T+ KHFPGHG DSH + P P Sbjct: 189 VISFRSFGENKYKVAEKAVAYMRGMQDNGLLTSAKHFPGHGDTGTDSHYDLPLIPHSAQR 248 Query: 189 IRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWLKTVLRQELGFDGVI 245 I + ++ F +LI +N L +M AH+ +D P++ S + +LR +LGF+G+I Sbjct: 249 IDSLELYPFRALI-DNGLSGMMIAHLSIPALDKTPNLPSTLSKPIISNLLRHQLGFEGLI 307 Query: 246 FSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYH 305 +SD ++M+G + L+AG D++ + +++ + + Sbjct: 308 YSDAMNMKGVTKYFPNGKADAMGLEAGMDLLEFTEDVNKSIA-----------EIKKSIA 356 Query: 306 KGSFSRQELMDSARWKAISTRLNQLHERWQE 336 +G ++ E+ R K + + ++ Sbjct: 357 EGKITQAEIDRRCR-KVLEAKAWAGLNHYKP 386 >UniRef50_D0L0K9 Glycoside hydrolase family 3 domain protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L0K9_HALNC Length = 362 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 136/350 (38%), Positives = 194/350 (55%), Gaps = 11/350 (3%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+M+DV G L ER L +P VGG+ILF RN+ QL L +IRA L++A Sbjct: 4 LGPLMVDVAGLTLTDAERTRLMNPAVGGVILFARNFSSTEQLTALTAEIRALRHPHLLIA 63 Query: 61 VDQEGGRVQRFRE-GFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISF 117 VDQEGGRVQRFR+ GF LP AL + G + A+ GWLMA ++ +D+SF Sbjct: 64 VDQEGGRVQRFRDDGFIHLPPMGRLGALFKHDRQAGLEAAKSLGWLMAVQLRMAGVDLSF 123 Query: 118 APVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 APVLD+ S IG+R++ DP +A+A +I GM EAGM GKHFPGHG++ DSH Sbjct: 124 APVLDLDRGVSGVIGDRAFSRDPASIIALAVAWISGMKEAGMAAVGKHFPGHGSIAEDSH 183 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 P D R + I D+ F ++I E + A+M AHV+Y +D PA S W++ +LR Sbjct: 184 IAHPVDHRALSTIERDDLRPFVAMI-ERFIPALMMAHVVYPKIDALPAGFSKVWVQDLLR 242 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD----NL 292 + + G +F+DDL+M A++ G R A+L AGCDM L+CN+ + +L Sbjct: 243 KRWQYQGAVFTDDLTMAAASVAGDIGARVDAALAAGCDMALICNHPQWVDQLLTHRKFTA 302 Query: 293 SPIKAERVTRLYHKGSF-SRQELMDSARWKAISTRLNQLHER-WQEEKAG 340 P++ R+ L G S + R + +L + +E+A Sbjct: 303 DPLREVRLAHLMGHGPVPSPNRFLHDHRTATAKAWVERLQNASFGDERAA 352 >UniRef50_A2DGR7 Glycosyl hydrolase family 3 N terminal domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2DGR7_TRIVA Length = 553 Score = 325 bits (833), Expect = 2e-87, Method: Composition-based stats. Identities = 84/346 (24%), Positives = 150/346 (43%), Gaps = 21/346 (6%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNR 56 G + +D+ G L ++ IL +G IL RN + + ++ S Sbjct: 55 GQLFLMDLSGLTLTDSDKAILKKYKLGNFILMGRNLGTIEETAAYSKSLQEYSMKEFGIP 114 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 +++DQE G + R + T P A S AA ++ +A+ LMA+E+ A ++++ Sbjct: 115 AFISIDQEAGSILRMYKNCTVFPGAMSLAATDSLD----IAKNISKLMATELRAAGVNMN 170 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 AP DV + IG+RS+ +P+ +I G HEAG+ TT KHFPGHGA + D Sbjct: 171 HAPTSDVNSNPKNPIIGDRSFGDNPENVSKFVDYYIQGHHEAGILTTSKHFPGHGATSVD 230 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTV 234 SHK P ++ D+ F ++ + +D+IM H+I + + S + Sbjct: 231 SHKALPSIDHNMTQLLEIDIPPFQQAVK-SGVDSIMVGHLITPIDNESACTQSKKCV-DF 288 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 R +L +DG++ +D + M+ E ++ +G D+I C ++ Sbjct: 289 ARNQLKYDGILITDSMQMKAIKQE-DLNESIATAIKSGIDLICDCGGDIVGTDPYYSVIA 347 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAG 340 + K + + +A + I T+L L + + Sbjct: 348 ----YIVEQVQKNIIPEKIIDQAA-LRIIKTKLAML--PATDPSSA 386 >UniRef50_A9F261 Put. Beta-glucosidase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9F261_SORC5 Length = 371 Score = 325 bits (833), Expect = 2e-87, Method: Composition-based stats. Identities = 104/331 (31%), Positives = 160/331 (48%), Gaps = 18/331 (5%) Query: 2 GPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN--RLV 58 G +++ +G E A + LA GG ILF RN D A L + + A Sbjct: 12 GQLIVGGFDGAEPPARYLKALAEGRRGGAILFRRNVPDVAATARLCQALAEAGGPDRPPF 71 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 + +DQEGGRV R F LP+ +S L+ + L + A +ASE+ A+ I+++FA Sbjct: 72 IGIDQEGGRVTRLPAPFLTLPSMRSLGELADL----PLLRRAARAVASELRAVGINLNFA 127 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLDV + IG+R++ DP A F++G+ E G+ GKHFPGHG DSH Sbjct: 128 PVLDVDSNPANPIIGDRAFGRDPGAVARGAVAFLEGLQEEGVLACGKHFPGHGDTALDSH 187 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVL 235 P + + ++ F + + ++M AH++ +D PA+ S +L Sbjct: 188 LALPTLSHARGRLDEIELPPFRAACG-AGVASLMTAHIVVEALDPGVPATLSRAICTDLL 246 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R E+GF+GV+FSDDL M A + E A++ AGCD +LVC++ V + L Sbjct: 247 RGEIGFEGVLFSDDLEMAAIAAHNAVEEAAAAAVWAGCDALLVCHSEDLQDRVHEAL--- 303 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTR 326 V + + F + +AR + R Sbjct: 304 ----VRKAEREPRFRERCAEAAARCLGLRRR 330 >UniRef50_A2P4H6 Beta-N-Acetylglucosaminidase n=9 Tax=Vibrio RepID=A2P4H6_VIBCH Length = 535 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 81/337 (24%), Positives = 142/337 (42%), Gaps = 16/337 (4%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNR 56 G + + G A R ++A +GG L N AQ + L R ++ R Sbjct: 21 GQLFILAFPGKSAQAAHR-MIAEYNLGGCYLSQDNAETFAQAQALNRGLQQILEQHQRLP 79 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++ VDQEG E T P + + + + + EM + Sbjct: 80 LLLGVDQEGAWGVLVGESTTG-PGNLALG----VSDQAVGTERMYQVFGEEMRGAGFNCI 134 Query: 117 FAPVLDVGHIS--AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 P DV I RS+ P+K ++ + G+++ G+ KHFPGHG D Sbjct: 135 LGPCCDVNLNPASPIIDTRSFGDQPEKVSLHSAAAVRGLYQGGIAACAKHFPGHGDTHGD 194 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKT 233 +H++ P + AE++ D++ F + I + +D +M +H++Y +D PA+ S L+ Sbjct: 195 THRDIPRVDKSYAELKLNDLAPFQAAI-DAGVDLVMTSHILYPQLDANAPATLSATILQQ 253 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 VLRQ++GFDGVI SD ++M + + ++ AG MI++ + Sbjct: 254 VLRQDMGFDGVIISDSMNMGAIVNHYTPVDAAVKAMQAGITMIMLSEEHYDHSEAYLDKQ 313 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 V +G + + + K + +L +L Sbjct: 314 LAMIHGVIDAVEQGVLAESVIDQAL-EKVVRLKLEKL 349 >UniRef50_Q3B6L6 Beta-N-acetylglucosaminidase n=2 Tax=Chlorobiaceae RepID=Q3B6L6_PELLD Length = 585 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 88/336 (26%), Positives = 142/336 (42%), Gaps = 32/336 (9%) Query: 2 GPV-MLDVEG--YELDAEER----EILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR 54 G + M D G E D R +++ VGG++ + A L ++ + Sbjct: 63 GQMLMADSPGAYTEKDDPARMKLDRLVSEGKVGGIMFLKGSMFSAAMLA---NHFQSIAP 119 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 N L+++ D E G R EG TR P+A + +A E G + G +A E A + Sbjct: 120 NPLLISADMERGLAMRL-EGATRFPSAMAVSAAGSSELSG----KMGEAIALEAKAAGLQ 174 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 +FAP D+ + I RSY + A +A I+GM G+ T KHFPGHG VT Sbjct: 175 WNFAPSADLNSNPLNPVINTRSYGDTSEGASRMADALIEGMQSKGLLATAKHFPGHGDVT 234 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYW 230 DSH P + ++ F S IR + A+M H+ + PAS S Sbjct: 235 VDSHFALPVLEADSLRLEEYELKPFRSAIRCG-VMAVMTGHLAVPKLTGTMDPASISAPI 293 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 + +LR+ LGF G+I +D L+M+ + E ++ AG D++L + + + Sbjct: 294 VTGLLRKRLGFKGLIVTDALNMKALYDGHNVEEISVRAVAAGNDVLLFSPDPERSFHA-- 351 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + S + +S R + + + Sbjct: 352 ---------IVNAVRDSVISEARIDESVR-RILQAK 377 >UniRef50_A6GP56 Beta-hexosaminidase n=1 Tax=Limnobacter sp. MED105 RepID=A6GP56_9BURK Length = 340 Score = 323 bits (829), Expect = 5e-87, Method: Composition-based stats. Identities = 130/302 (43%), Positives = 181/302 (59%), Gaps = 4/302 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP ++ + G L E+ + +P VGGLILFTRN+ AQL +LV IR+ V Sbjct: 2 LGPFIIGLTGLTLSKEDIARIQNPAVGGLILFTRNFESKAQLIDLVESIRSNGGLSKPVF 61 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFA--ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR GFT +P+ + A + ++ LA+EAG++MA+E+ A ID+SFA Sbjct: 62 VDHEGGRVQRFRNGFTAIPSMRQIGQRAETDFDDAVLLAREAGFVMAAELRACGIDMSFA 121 Query: 119 PVLDVGHISA-AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ ++ IG+RS+ D + ++S ++GM AGM+ GKHFPGHG V+ DSH Sbjct: 122 PVLDLDWGNSEIIGDRSFGKDARMVSRLSSALMNGMALAGMQACGKHFPGHGWVSLDSHL 181 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 P D R EI D + L + +IMPAHV+YS+VDP PA S +W++ VLR Sbjct: 182 ALPHDNRSLNEILKVDALPYQ-LNSTLNMASIMPAHVVYSEVDPMPACFSSFWIQEVLRV 240 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 + FDG + SDDL M GA +G R A+L AGCD +L CNN ++++ P Sbjct: 241 KFSFDGAVISDDLDMVGAHGVGDIRARASAALQAGCDAVLACNNFDDIDTLVNKPVPEVH 300 Query: 298 ER 299 E Sbjct: 301 EN 302 >UniRef50_C4JZX3 Predicted protein n=4 Tax=Onygenales RepID=C4JZX3_UNCRE Length = 852 Score = 323 bits (829), Expect = 6e-87, Method: Composition-based stats. Identities = 84/352 (23%), Positives = 143/352 (40%), Gaps = 27/352 (7%) Query: 1 MGPV-MLDVEGYELDAEEREILAH-PLVGGLILFTRNYHDPAQLRELVRQIRAASR---- 54 +G + ++ G E + ++ G +ILF RN QL L+ ++ +R Sbjct: 10 LGQLFIVGFRGPTATEEIKSLIRAPYYCGSIILFQRNIKSAEQLIRLIHDLQRTAREAGH 69 Query: 55 -NRLVVAVDQEGGRVQRFREG-FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L +AVDQE G V R + +LP + + A +++ A + + Sbjct: 70 TRPLFIAVDQENGVVTRIKPPIAAQLPGSMAIGASGDLDD----ASRVSGATGELLRGLG 125 Query: 113 IDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 +++++AP+ DV + IG RS D I SR G+ E + KHFPGHG Sbjct: 126 VNMNYAPLCDVNSEPANPVIGVRSPGDDGTFVGRITSRIARGLRENNIVPCVKHFPGHGD 185 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSP 228 DSH P + + ++ ++ F + E ++A+M +HVI S D PAS + Sbjct: 186 SKVDSHHGVPVVNKTKEKLEECELIPFRRAVAEG-IEAVMTSHVIMSAFDDSIYPASVNR 244 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSV 288 + LR+ L FDG+I +D L M+ + +L AG D ++ + V Sbjct: 245 NVVN-FLRRGLQFDGLIVTDCLEMDAIRVQFGTERGAVLALAAGVDCPMISHTYSLQVRA 303 Query: 289 LDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEKA 339 L+ +++ S R A+ + Q A Sbjct: 304 LEEAFRWC--------RNHPTMPRQISRSVSRVFALKDKFLDWDSTLQPRPA 347 >UniRef50_B5CPR6 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CPR6_9FIRM Length = 408 Score = 323 bits (828), Expect = 7e-87, Method: Composition-based stats. Identities = 82/324 (25%), Positives = 148/324 (45%), Gaps = 24/324 (7%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVAVDQEGG 66 + L VGG+I F N D QL +++ ++ S + ++VD+EGG Sbjct: 94 TAAGDTTKRCLEQYPVGGMIYFAGNIEDRQQLSDMLHTTQSYSTEQSGIPIFLSVDEEGG 153 Query: 67 RVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV--G 124 V R + S + A E G + + + + ++ +APV DV Sbjct: 154 TVARVAGSGI---SNVSVVEDMCEVTDEERAYEIGQTIGNYLAELGFNLDYAPVGDVLTN 210 Query: 125 HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPR 184 + + +RS+ D K A++ + +DG+HE + KHFPGHGA D+H+ + Sbjct: 211 PENTVVKKRSFGNDSAKVAALSQKVLDGLHEENVYGVIKHFPGHGATAGDTHEGYAYTDK 270 Query: 185 PQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQELGFD 242 E+ +++ F + I EN + IM H+ +V D P+S S + VLR+++GFD Sbjct: 271 SLEELEQQELVPFQNAI-ENHIAIIMAGHISVPNVTGDNTPSSLSKIMITDVLREKMGFD 329 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 G++ +D L+M + S AE ++AG DM+L+ N + A + + Sbjct: 330 GIVITDALNMGAISQHYSSAEVSVKVIEAGGDMLLMPENFQEAY-----------QGILE 378 Query: 303 LYHKGSFSRQELMDSARWKAISTR 326 G+ + + + +S R + + + Sbjct: 379 AVQNGTLTEERIDESVR-RILKVK 401 >UniRef50_C6X002 Glycoside hydrolase n=2 Tax=Flavobacteriaceae RepID=C6X002_FLAB3 Length = 523 Score = 323 bits (828), Expect = 7e-87, Method: Composition-based stats. Identities = 78/335 (23%), Positives = 145/335 (43%), Gaps = 26/335 (7%) Query: 10 GYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQ 69 G + R+ + +GGLIL D A+ LV + + S+ +++ +D E G Q Sbjct: 11 GEDFIDNVRKTVVKEQIGGLILMQ---DDAAREINLVNEFQQKSKVPMMIGMDAEWGVYQ 67 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 R + P A + A+ L E +A + M ++ FAPV+DV + Sbjct: 68 RIAA-AYKFPWAMTLGAIQD----KNLITEMAAKIAEDCQRMGVNWDFAPVVDVNTNPQN 122 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 IG RS+ ++ + A + +G+ + + KHFPGHG DSH + P P Sbjct: 123 PIIGNRSFGSEVSNVVGSAMAYSNGLQQNNVLAAIKHFPGHGDTDTDSHLDLPVVSHPLK 182 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWLKTVLRQELGFDGV 244 + A +++ F +L+ + + +M AH+ ++ PAS S + +L+++ G+ G+ Sbjct: 183 RLEAVELAPFKALMDKG-IGGVMVAHLYIPALEKEKGVPASISKNIITGLLKEKYGYKGL 241 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY 304 I +D L+M A E + AG D++L + ++ Sbjct: 242 IITDALNMGAVAKRYKPGELDALAFKAGNDIMLFSEGVEEGKRLIQL-----------AI 290 Query: 305 HKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 KG S+ + +S + K + T+ E+++ Sbjct: 291 DKGEISQNRVEESVK-KILLTKYFLGLEKYKPRNP 324 >UniRef50_C4ZK55 Glycoside hydrolase family 3 domain protein n=6 Tax=Betaproteobacteria RepID=C4ZK55_THASP Length = 381 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 147/301 (48%), Positives = 188/301 (62%), Gaps = 9/301 (2%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 GPVM+DV G L EERE L PLVGG+ILF RNY QLR L +IR L++AV Sbjct: 10 GPVMIDVAGTALTDEERERLRDPLVGGVILFARNYTGSEQLRALTAEIRGLRDPALIIAV 69 Query: 62 DQEGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISFA 118 D EGGRVQRFR +GFTRLP+ +S AL + L A+ G ++A+E+ A +D+SF Sbjct: 70 DHEGGRVQRFRTDGFTRLPSMRSLGALWAQDHLVALDAARATGVVLAAELRAHGVDLSFT 129 Query: 119 PVLDVGHISA-AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ + AIG R++H DPQ A+A GM EAGM GKHFPGHG V ADSH Sbjct: 130 PVLDLDYGCCRAIGNRAFHRDPQVVAALAQALCAGMAEAGMGCVGKHFPGHGFVEADSHH 189 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD----PRPASGSPYWLKT 233 + P D R + +D++ + + +L +MPAHV+Y + D P+PA SP+WLK Sbjct: 190 DVPVDERDFDTVWNEDIAPYRHRLGR-QLAGVMPAHVVYPNADPSPEPQPAGFSPFWLKE 248 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 VLR LGF VIFSDDL+MEGA + G R +A+ AGCDM+LVCN A +LD + Sbjct: 249 VLRDRLGFQWVIFSDDLNMEGARVAGDIVGRAKAAYAAGCDMLLVCNRPDLAAELLDRWA 308 Query: 294 P 294 P Sbjct: 309 P 309 >UniRef50_A6LEE9 Glycoside hydrolase family 3, candidate beta-glycosidase n=6 Tax=Bacteroidales RepID=A6LEE9_PARD8 Length = 999 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 81/344 (23%), Positives = 146/344 (42%), Gaps = 30/344 (8%) Query: 1 MGPVMLDVEGYELDAEEREILAHP----LVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + + + D + L +GG++ DP E+ +++ ASR Sbjct: 51 IGQLFMVIANPKSDNRNMQRLMRYVNDIKIGGILFHKG---DPVTQAEVTNRLQKASRIP 107 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 ++V++D E G R G TR P A+ L +E G + + M I I+ Sbjct: 108 MLVSLDGEWGLSMRL-SGTTRFPKNMMLGAIED----NALIEEYGKEVGRQCREMGIHIN 162 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAP +DV + IG RS+ +P+ + G+ G+ + KHFPGHG + D Sbjct: 163 FAPDMDVNSNVDNPVIGLRSFGENPEAVSEKGIAYARGLENTGILSVSKHFPGHGDTSED 222 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLK 232 SH+ P +A + + ++ F I + IM H+ +D +PAS S + Sbjct: 223 SHETLPVVRHNRARLDSVELLPFKRYIYDGF-GGIMTGHLYVPALDKSHKPASFSKAVVT 281 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +L++ELGF G+ F+D L+M+GA+ + +L AG D++L + Sbjct: 282 DLLQKELGFQGLCFTDALAMKGASTKKT-DNPSVKALLAGNDILLAPAAPINDFTA---- 336 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 V +G + + ++ K + + ++ Sbjct: 337 -------VKEAIEEGVLDLEAI-EAKCLKILRYKYIAGLNAYKP 372 >UniRef50_C4XU31 Putative beta-N-acetylhexosaminidase n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XU31_DESMR Length = 568 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 102/344 (29%), Positives = 144/344 (41%), Gaps = 23/344 (6%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFT--RNYHDPAQLRELVRQIRAAS----- 53 G V M+ G + + ++ +GG+IL+ N P QL L ++A S Sbjct: 44 GQVFMVWFAGPAVSDDIAGLIRQRHLGGVILYGVPGNIESPGQLAALNAGLQAVSAATRH 103 Query: 54 RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L++ VDQEG V R R+GFT P+ + AA + LA+ A A E+ A+ I Sbjct: 104 GLGLLIGVDQEGAPVARLRKGFTLFPSQMAQAATGRAD----LARLAASATARELRAVGI 159 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 +++FAPV DV + IG RS+ + P G AGM T KHFPGHG Sbjct: 160 NVNFAPVADVNVNPANPVIGIRSFGSAPADVARFTGAATAGYGAAGMICTPKHFPGHGDT 219 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSP 228 T DSH P A + D A+M AH++ +D PA+ S Sbjct: 220 TIDSHVGLPRSDHDAAALDKLDF-PPFRAAFAAGAPAVMTAHMVVPALDGEPDLPATLSR 278 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSV 288 L+ LR +GFDGVIF+D L M A E +L AG D++LV + + Sbjct: 279 RVLEGTLRGRMGFDGVIFTDSLGMGAVADTYGIPEASVRALAAGADVLLVGADAGRSP-- 336 Query: 289 LDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLH 331 + V G L + R + R L Sbjct: 337 --GERLAAMDAVAEAVRSGRVPMARLNAAVGRVLRLKERYGLLD 378 >UniRef50_D2AR90 Beta-glucosidase-related glycosidase-like protein n=20 Tax=Actinomycetales RepID=D2AR90_STRRD Length = 520 Score = 321 bits (825), Expect = 1e-86, Method: Composition-based stats. Identities = 97/323 (30%), Positives = 153/323 (47%), Gaps = 17/323 (5%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGG 66 +G R L+ +GG +LF RN PAQ ELV +R + +VVAVD+EGG Sbjct: 19 GFDGTAPPDWLRRALSEG-LGGAVLFARNLAGPAQTAELVAGLRRENPA-VVVAVDEEGG 76 Query: 67 RVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 V R + P ++ E ++A+E G L+A + DI + +APV+DV Sbjct: 77 AVTRLEARTGSSWPGNRALGVADDAERTERVAREIGRLLA----SADITLDYAPVVDVNA 132 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + IG RS+ DP+ + +I G+ AG+ KHFPGHG DSH P Sbjct: 133 NPANPVIGIRSFGPDPELVSRQTTAWITGLQGAGVAACAKHFPGHGDTVTDSHHALPTVH 192 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLRQELGFD 242 ++ +D+ F + ++ + A+M H++ +D PA+ S L +LR+E+GF Sbjct: 193 ADLELLQERDLPPFRAAVK-AGVQAVMCGHLLVPALDPGNPATLSRRILTGLLREEMGFG 251 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 G++ +D + M A + E +L AG D I V + G SV + + R Sbjct: 252 GMLVTDAIEMGAVAALHPPGEIAVRALAAGVDAICVGVSSPGGESVY-----ALRDAIVR 306 Query: 303 LYHKGSFSRQELMDSA-RWKAIS 324 H G + L ++A R A++ Sbjct: 307 AVHDGRLPEERLAEAAGRVLALA 329 >UniRef50_Q3AQ16 Beta-N-acetylglucosaminidase n=3 Tax=Chlorobium/Pelodictyon group RepID=Q3AQ16_CHLCH Length = 592 Score = 321 bits (823), Expect = 3e-86, Method: Composition-based stats. Identities = 82/346 (23%), Positives = 149/346 (43%), Gaps = 32/346 (9%) Query: 2 GPVML-DVEGYELDAEEREIL------AHPLVGGLILFTRNYHDPAQLRELVRQIRAASR 54 G +++ D+E +++L +GG+I D +V ++A + Sbjct: 76 GQMIIADIEPTAFSPRNKKVLLLSRLAQEGKIGGVIFMKG---DAKSTGAVVNHLQALAP 132 Query: 55 NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L+ + D E G R G T P + A + KLA++ +A E + + Sbjct: 133 LPLLFSSDMERGVAMRI-SGTTEFPPNMALGATADP----KLAEDMATAIAQEATLLGMH 187 Query: 115 ISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 ++AP +D+ + I R++ + +A+ I G+ G+ T KHFPGHG VT Sbjct: 188 HNYAPTVDLNSNPRNPVINTRAFGDTIPLTIVMANAIIKGLQSHGVLATAKHFPGHGNVT 247 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYW 230 DSH P + ++ A ++ F + I + + IM H+ + PA+ SP Sbjct: 248 VDSHVALPVLQATREQLEAYELIPFRAAIEQG-VATIMVGHLAVPALTGNMEPATISPAI 306 Query: 231 LKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLD 290 + T+LRQELGF G+I +D L+M+ + A ++ AG D++L + + S Sbjct: 307 VTTLLRQELGFKGLIITDALNMKALYNGSNVATLSVRAVQAGNDLLLFSPDPEATHSA-- 364 Query: 291 NLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 V + G +++ S R + + + E+ +E Sbjct: 365 ---------VVQAVEAGQIPLEQINASVR-RILQAKQWLKLEKHRE 400 >UniRef50_D2AY03 Beta-N-acetylhexosaminidase n=2 Tax=Streptosporangineae RepID=D2AY03_STRRD Length = 484 Score = 320 bits (822), Expect = 4e-86, Method: Composition-based stats. Identities = 89/344 (25%), Positives = 138/344 (40%), Gaps = 24/344 (6%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +++ G L + +GG+ LF N DPAQL L ++R A V+++ Sbjct: 19 GTLLVAFRGTTAPEWVLRDLENG-LGGVTLFGFNVADPAQLSGLTARLREAGEP--VISL 75 Query: 62 DQEGGRVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D+EGG V R + P + A+ +E L + + ++ +++ AP Sbjct: 76 DEEGGDVTRLDYHVGSPYPGNAALGAVDDVE----LTRRVYRSIGDDLARCGVNLDMAPS 131 Query: 121 LDVGH--ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG RS+ D + G+ AG+ KHFPGHGA DSH E Sbjct: 132 ADVNTADDNPVIGTRSFGPDTALVARHTVAAVHGLQAAGVAACVKHFPGHGATRQDSHLE 191 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQ 237 P +R +++ F + I + +IM AHV + PA+ S L +LR Sbjct: 192 IPLVDASLELLRERELVPFRAAI-DAGTRSIMTAHVRVPALTGTTPATLSAAALTVLLRG 250 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 ELG+DGVI SD L M+ A SL AG D++ + Sbjct: 251 ELGYDGVIVSDALDMKAVTDTYGLAGGSVLSLAAGTDLLCLGPLPTEDD------VRRII 304 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKAGH 341 + G L +A R+ +L + + G Sbjct: 305 TEIVAAVGDGRLPLARLEAAA------ERVARLRAWFGTSQTGQ 342 >UniRef50_Q3A4Z0 Putative glycosyl hydrolase n=1 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A4Z0_PELCD Length = 382 Score = 320 bits (821), Expect = 5e-86, Method: Composition-based stats. Identities = 101/338 (29%), Positives = 169/338 (50%), Gaps = 30/338 (8%) Query: 1 MGPV-MLDVEGYELDAE--EREILAHPLVGGLILF---------TRNYHDPAQLRELVRQ 48 +G + M+ G +LD E + +GG++LF N P QLR L Q Sbjct: 41 LGQLLMVGFAGADLDKHHPILEDIKQRHLGGVVLFNYGPDRNHPAGNIRSPQQLRRLTAQ 100 Query: 49 IRAASR-NRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASE 107 ++ A+ L++A+DQEGGR+ R ++ P S+A+L L + A Sbjct: 101 LQQAAGNTPLLIAIDQEGGRINRLKQ-AYGFPPTVSYASLG-RRNDLALTRRQAQNTART 158 Query: 108 MIAMDIDISFAPVLD--VGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGK 163 + I+++ +PV+D V + IG +RS+ A+P ++ A I H+ G+ K Sbjct: 159 LQQAGINLNLSPVVDLNVNPANPIIGRLQRSFSAEPGTVISHAREVIRAHHQQGVLCCLK 218 Query: 164 HFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR- 222 HFPGHG+ TADSHK +++ F++LI E DA++ AHV S++DPR Sbjct: 219 HFPGHGSSTADSHKGFVDVT---NTWSPRELLPFATLIDEGLADAVLTAHVFNSELDPRY 275 Query: 223 PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 PA+ S + VLR++LGF GV+ SDDL+M A + + +L+AG D++L+ +N Sbjct: 276 PATLSERTVHGVLRKQLGFQGVVISDDLTMGAIADQYRLEDAVEKALNAGVDILLLADNS 335 Query: 283 KGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARW 320 S + + + +L G +R+ ++ + + Sbjct: 336 PDTTSRMIAI-------MQKLIDSGRVTRKRIVQALKR 366 >UniRef50_B8HWS2 Beta-N-acetylhexosaminidase n=5 Tax=Cyanobacteria RepID=B8HWS2_CYAP4 Length = 552 Score = 320 bits (821), Expect = 5e-86, Method: Composition-based stats. Identities = 81/296 (27%), Positives = 134/296 (45%), Gaps = 19/296 (6%) Query: 38 DPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAAL--SGMEEGGK 95 A++ +Q+++ + L++A D E G QRF G T P + AA+ Sbjct: 64 TAAEIGLRTQQLQSWATIPLLLAADIEEGVGQRF-SGATWFPPPMALAAIARQNPALAAD 122 Query: 96 LAQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGM 153 A + G A+E +A+ ++ APV+DV + I R++ + + + + FI+G Sbjct: 123 YAHQMGAHTAAEALAIGLNWVLAPVVDVNNNPDNPVINVRAFGENTGEVSQLTTAFIEGA 182 Query: 154 HEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAH 213 + T KHFPGHG DSH E P P P ++A ++ F + I +D++M AH Sbjct: 183 QTHPVLTAAKHFPGHGDTATDSHLELPLIPHPLERLQAVELPPFQAAI-AAGVDSVMSAH 241 Query: 214 VIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAG 272 ++ +DP PA+ S L LRQ LGF G+I +D L M A E +L+AG Sbjct: 242 LLIPALDPDYPATLSQRILTGELRQNLGFAGLIVTDALVMGAIANRYGLNEAPLMALEAG 301 Query: 273 CDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRL 327 D++L+ + GA+ + G + + S R ++ Sbjct: 302 ADILLMPLDPAGAIQA-----------ICAAVESGRITVDRIKTSVERIWRAKQKV 346 >UniRef50_B3T2B0 Putative glycosyl hydrolase family 3 N terminal domain protein n=2 Tax=prokaryotic environmental samples RepID=B3T2B0_9ZZZZ Length = 340 Score = 320 bits (821), Expect = 5e-86, Method: Composition-based stats. Identities = 132/335 (39%), Positives = 195/335 (58%), Gaps = 7/335 (2%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+ + VEG L +++IL+H LVGG++LF+RNY + QL+ L++ I++ L+VA Sbjct: 3 LGPIFVSVEGKTLTPGDQKILSHNLVGGVVLFSRNYENKDQLKLLIKSIKSLKSPDLLVA 62 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGM--EEGGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRF GF +P+ +S L E G + A ++ E++ + +D SFA Sbjct: 63 VDQEGGRVQRFISGFYPIPSMRSLGNLYDQSCENGMQATCIAAEILGLELLEVGVDFSFA 122 Query: 119 PVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLD+ H S IG RS+H+DP+ + +F +G+ +AGM KHFPGHG V DSH+ Sbjct: 123 PVLDIDHGVSEVIGSRSFHSDPKVVDCLGEQFYEGLQKAGMNCVAKHFPGHGGVREDSHQ 182 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 +TP D R I D+ + +LI+ L IM AHV Y VD + A+ S YWL+ L++ Sbjct: 183 KTPVDNREMKAIIE-DLQPYKTLIKRG-LSGIMTAHVRYPLVDDQIATFSHYWLRKCLQK 240 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 EL F+G+IFSDDL M+G + S ++ S AGCD +L+CN+ + L L K Sbjct: 241 ELHFNGIIFSDDLMMKGTDFITSIPKKALLSFAAGCDFVLICNSPETVNQCLSELHYAKD 300 Query: 298 E--RVTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 + + R Q + + I T+LN+L Sbjct: 301 QFANLERKIQYLIPFHQGQRQAKYLQEIKTKLNRL 335 >UniRef50_Q2SIQ7 Beta-glucosidase-related Glycosidase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SIQ7_HAHCH Length = 346 Score = 320 bits (820), Expect = 6e-86, Method: Composition-based stats. Identities = 145/345 (42%), Positives = 197/345 (57%), Gaps = 21/345 (6%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GP+++D++G ++ AEE+E+L+ P+VGG+I F RNY DP QL LVRQIRA R L++ Sbjct: 7 LGPLIIDLKGLDVSAEEKELLSRPIVGGVIFFARNYTDPQQLDALVRQIRAI-RPELLLC 65 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VDQEGGRVQRFREG T LPA A +A E GWL A E+ +D+D SFA Sbjct: 66 VDQEGGRVQRFREGRTLLPAPGKLRASYQENPAAAKAMATELGWLTAVELRELDLDFSFA 125 Query: 119 PVLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV + + I +R++ P A A+A F GM EAG GKHFPGHG VT D+H Sbjct: 126 PVLDVDYGKNKVIADRAFGDTPTMAAALAQAFWRGMGEAGCAGVGKHFPGHGFVTEDTHI 185 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E+ DPR E+ DM F +I E ++A+MPAHVIY DP PAS S +WL VLR+ Sbjct: 186 ESARDPRAYEELLGADMRPFIEMI-EAGIEAMMPAHVIYPACDPSPASQSSFWLTQVLRE 244 Query: 238 ELGFDGVIFSDDLSMEG--AAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 L + G I SDD+ M+G + +L AGCD+++ CNN + + V D L + Sbjct: 245 RLQYQGAIISDDMGMQGADLEAEEDESASVVRALRAGCDLLMACNNPERLLRVADQLESL 304 Query: 296 KAERV-----------TRLYHKGSFSRQELMDSARWKAISTRLNQ 329 E + R +G+ L + + ++ L Q Sbjct: 305 PGELLFPEDSLRRRHSMRRRPQGAVDPARLAQA---RELAQSLRQ 346 >UniRef50_C4Y752 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y752_CLAL4 Length = 985 Score = 320 bits (820), Expect = 6e-86, Method: Composition-based stats. Identities = 88/352 (25%), Positives = 151/352 (42%), Gaps = 18/352 (5%) Query: 1 MGPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS------ 53 +G ++ G + E++ V +IL RN Q+ L+R ++ + Sbjct: 6 LGQLVCGGFLGTTATPQAYELIVKHRVSTMILSRRNAESAEQMAHLIRDLQRIALEHAHY 65 Query: 54 RNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAM 111 + L++A+D+EGG + + T+ P A + AA E L E A E+ + Sbjct: 66 KYPLLIAIDEEGGMLNTLFDSRDLTQFPGAMALAATGDTE----LVYEVARATARELRHI 121 Query: 112 DIDISFAPVLDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHG 169 + PVLDV +A + RS+ + + G+ + G+ T GKHFPG G Sbjct: 122 GFSVILGPVLDVVTKMSAQLVSVRSFGSSVEDVARYGRACARGLRDGGLITVGKHFPGIG 181 Query: 170 AVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGS 227 T DS E P ++R ++ F+ LI+E LD I A + D A S Sbjct: 182 NATVDSLLEVPIMAESLDQLRGSNVVPFAELIQEGLLDGISAAGCGVPAIAPDEVHACLS 241 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 P L +LR+EL FDG + S+ L M+ + ++ AGCD+++ C++ + V Sbjct: 242 PVLLTQLLREELHFDGFVISECLEMDALYHSIGLGQGAVLAVCAGCDLVMCCHDFERQVE 301 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 +D+L+ A + + S Q + R A + ++E + Sbjct: 302 AIDSLAKALANGMLDE-RIIAASMQRIETVQRRVASWAEIFSATGPFREAQP 352 >UniRef50_C5CH66 Beta-N-acetylhexosaminidase n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CH66_KOSOT Length = 504 Score = 319 bits (819), Expect = 8e-86, Method: Composition-based stats. Identities = 90/328 (27%), Positives = 158/328 (48%), Gaps = 18/328 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+ + G +DAE +IL G +ILF+RN +P+Q+++ + + + + ++A+DQE Sbjct: 12 MIGLPGKSMDAETVKILERVNPGSIILFSRNIENPSQVKDFIDECKRVLGYKPLIAIDQE 71 Query: 65 GGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 GG V R +GF P A + AA E A A ++A EM A+ ID APV+D+ Sbjct: 72 GGIVTRLTKGFAISPGAMAIAATGNSEN----AFIAASILAKEMRAVGIDWDLAPVVDIN 127 Query: 125 HISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + G RS+ +D + +F G+ E G+ KHFPG G V+ D+H + P Sbjct: 128 NNPNNPGIGVRSFSSDVNTVVEFGRKFTKGLQENGIIACLKHFPGKGRVSVDAHIDMPVL 187 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELG 240 E++ ++ F + + + MP+H+ + PA+ S L ++R++L Sbjct: 188 DVSYDELKKFELIPFKKI----EALSWMPSHIYIPALQHTKIPATLSEEILTELVRKKLH 243 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL------SP 294 + GV+ +DDL M G E + AG D++ VC+N ++V + L P Sbjct: 244 YKGVLIADDLLMGGITNYFPVEEATLKAFAAGMDILTVCHNDDIQIAVKNYLVKQIEKEP 303 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKA 322 +R+ K ++ + + + + Sbjct: 304 KLQKRLEESLSKIEAMKKLINKAPKIEI 331 >UniRef50_Q11XV7 B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11XV7_CYTH3 Length = 395 Score = 319 bits (819), Expect = 8e-86, Method: Composition-based stats. Identities = 85/341 (24%), Positives = 158/341 (46%), Gaps = 26/341 (7%) Query: 1 MGPV-MLDVE---GYELDAEEREILAHPLVGGLILFTRNYH---DPAQLRELVRQIRAAS 53 +G + M+ + + VGG+I+F +N A L++L+ ++A++ Sbjct: 61 IGQMIMIGINDRKSIAATDSVLIEIKSGKVGGIIMFEKNIAAANSAATLKKLISDLQASA 120 Query: 54 RNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAM 111 L +++D+EGG+V R +E GF +P+A L+ ++ +A E+ + Sbjct: 121 STPLFMSIDEEGGKVHRLKEKYGFVPMPSAGYLGNLNNLDSTTYYTTN----LAKELKEL 176 Query: 112 DIDISFAPVLDV--GHISAAI--GERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPG 167 ++++AP LD+ + I RSY ADP A+ I+ H G+KT KHFPG Sbjct: 177 GFNLNYAPCLDLATNPNNPIIAKVGRSYSADPDLVSKQAAASINAHHTQGIKTIVKHFPG 236 Query: 168 HGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPAS 225 HG+ +DSH + ++ + +LI+ +DA+M AH+I +D P + Sbjct: 237 HGSSMSDSHLGIVDVT---NQWNIIELMPYKTLIQSGTIDAVMTAHIINEHLDQSKLPGT 293 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S + +LR LGF GV+FSDD+ M + + S+ AG D+++ NN + Sbjct: 294 LSKVVVTDLLRNMLGFTGVVFSDDMQMYAISKNYGQENAIKLSILAGVDVLVFGNNVSAS 353 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + + +L G + ++ + ++ + Sbjct: 354 DRIK---ASEIHAIIKKLVLSGDIPESRINEA-YERILALK 390 >UniRef50_Q5ZW95 Glycosyl hydrolase family 3 n=6 Tax=Legionella RepID=Q5ZW95_LEGPH Length = 395 Score = 319 bits (818), Expect = 9e-86, Method: Composition-based stats. Identities = 83/357 (23%), Positives = 157/357 (43%), Gaps = 44/357 (12%) Query: 1 MGPV-MLDVEGYELDAE--EREILAHPLVGGLILFT---------RNYHDPAQLRELVRQ 48 +G + +L +G +D + + + +GG+ILF +N P Q+ L Q Sbjct: 47 IGQMLILGFQGKVVDEKSPIAQAIEKNNIGGVILFDFNQQSQTFDKNIESPEQVNRLNNQ 106 Query: 49 IRAASR----------NRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKL 96 ++ ++ L+++VD EGG V R GF +P+A+ ++ + Sbjct: 107 LQTLTQKANRKYHRDNLPLLISVDYEGGNVNRLHPRYGFPEMPSAKDIGSM-----SLER 161 Query: 97 AQEAGWLMASEMIAMDIDISFAPVLDVG--HISAAIG--ERSYHADPQKALAIASRFIDG 152 A + LMA+ + + +++F P LDV + IG +RS+ + P+ A + + Sbjct: 162 ADASAELMANTLKSTGFNLNFFPELDVNINPDNPIIGKKDRSFSSIPEIVTQYAQLYTEQ 221 Query: 153 MHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRE-NKLDAIMP 211 + + + KHFPGHG+ +DSH +++ F L+ + N D +M Sbjct: 222 FIKHQIHCSYKHFPGHGSSLSDSHLGFVDIS---DTWSEQELIPFLQLLSQPNHCDMVMI 278 Query: 212 AHVIYSDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASL 269 AH++ +D PAS S + +LR++L FDGV+ +DD+ M+ S+ Sbjct: 279 AHIVNRKLDMTGLPASLSYQIINGLLRRDLKFDGVVITDDMQMKAITNYYGLETAVTLSI 338 Query: 270 DAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 +AG DM++ N S + + + G S Q + ++ + + + Sbjct: 339 NAGADMLIFGNQLVEKFQ----DSTEIIDMIEQKVRSGEISEQRINEA-YQRIVKMK 390 >UniRef50_C6N603 Beta-hexosaminidase n=2 Tax=Legionella RepID=C6N603_9GAMM Length = 358 Score = 319 bits (818), Expect = 1e-85, Method: Composition-based stats. Identities = 125/288 (43%), Positives = 173/288 (60%), Gaps = 8/288 (2%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 M+D+EG EL A E IL P VG +ILFTRN+ +P QL +L+ I + L +AVD Sbjct: 4 QFMVDIEGTELSALEHTILKQPNVGAVILFTRNFTNPEQLAKLICDIDEINPE-LFIAVD 62 Query: 63 QEGGRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQ--EAGWLMASEMIAMDIDISFAP 119 EGG VQRF R+GF+ LPAA+++ +A E G +MA E++A +D+S AP Sbjct: 63 HEGGNVQRFQRQGFSALPAARAYGRAYDKNHEIGIAYTQEYGEIMAKELLACGVDLSIAP 122 Query: 120 VLDVGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV S I R++H D + +A FI GM+ AGM GKHFPGHG+V +DSH Sbjct: 123 VLDVHDKSQVIAGLNRAFHKDADAIIDLAGAFIKGMNTAGMPAVGKHFPGHGSVVSDSHI 182 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 P E+ A D+ F LI + L A+MPAHV Y+ +D PA S WL+ +LR Sbjct: 183 AMPVSQASMTELAANDLKPFVELINKGLLTALMPAHVTYAAIDKENPAGFSKIWLQDILR 242 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKG 284 +LGF G++ SD LSM+G A +G+ R + +L AGCDM+++C+ + Sbjct: 243 TQLGFKGLVLSDCLSMKG-ADIGNLTTRAEKALAAGCDMLIICHQPRE 289 >UniRef50_Q5LHV9 Possible beta-N-acetylglucosaminidase/beta-lactamase fusion protein n=36 Tax=Bacteroides RepID=Q5LHV9_BACFN Length = 996 Score = 319 bits (818), Expect = 1e-85, Method: Composition-based stats. Identities = 79/341 (23%), Positives = 139/341 (40%), Gaps = 33/341 (9%) Query: 2 GPVMLDVEGYELDAEEREIL----AHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRL 57 G + + D + +++ VG + D L + S+ L Sbjct: 52 GQLFVYTLAPRADKDTEKLVGKLTRKFKVGAFLYSEGTVED---QANLTNYAQRQSKIPL 108 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++ D E G R E P + +S L + G +A E+ + ++F Sbjct: 109 MITFDGEWGLAMRL-ENTPVFPRNAALGCISD----NTLIEAYGQEVARELREIGAHVNF 163 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 AP DV + I RS+ +P+ + G+ G+ + KHFPGHG DS Sbjct: 164 APDADVNTNPENPVIHVRSFGENPKTVAEKVIAYGRGLETGGILSVSKHFPGHGDTDVDS 223 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLK 232 H+ P +A + + ++ F I + L +M H+ ++P P+S S + Sbjct: 224 HQALPAVYYNRARLDSVELYPFKEAI-QAGLGGVMVGHLQVPALEPDRITPSSLSHSIVT 282 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR ELGF+G++F+D L+M+G A ++ +L AG DM+LV N + A Sbjct: 283 DLLRGELGFNGLVFTDALAMKGVAAE---SDVTVKALKAGNDMVLVQQNVEKAQ------ 333 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 E V + G + +E+ R + ++ + R Sbjct: 334 -----ESVVQAIKDGRLTMEEIDAKCR-RILAYKYRLGLSR 368 >UniRef50_A4SCK6 Glycoside hydrolase, family 3 domain protein n=3 Tax=cellular organisms RepID=A4SCK6_PROVI Length = 595 Score = 318 bits (817), Expect = 1e-85, Method: Composition-based stats. Identities = 77/337 (22%), Positives = 147/337 (43%), Gaps = 32/337 (9%) Query: 1 MGPVML-----DVEGYELDAEER--EILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 +G +++ E +E+ ++ +GG++ + L +++ + Sbjct: 62 IGQMLIADSPGGFETDNTPQQEKLESLVRSGTIGGVMFLKGGMF---KTAMLANHLQSIA 118 Query: 54 RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L+++ D E G R EG T P A + AA + +LA G +A+E A+ I Sbjct: 119 PKPLLLSADMENGAAMRL-EGATEFPPAMAIAAT----QRPRLAAAMGAAIAAEAKAVGI 173 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 ++APV D+ + I RS+ P+ A+ + +I+G+ G+ T KHFPGHG V Sbjct: 174 RQNYAPVGDLNSNQMNPVINTRSFGDRPETAVRMTGAYIEGLQRGGLIATIKHFPGHGDV 233 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPY 229 DSH P P+ ++ ++ F + I + + ++M H+ + PA+ SP Sbjct: 234 HVDSHLALPVLSAPKEHLQQYELEPFRAAIGQG-VMSVMVGHLAVPAITGTMTPATLSPE 292 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 + +LRQELGF G+I +D L+M+ + ++ AG D++L + Sbjct: 293 IITGLLRQELGFKGLIITDALNMKALYNGSNIPGISVRAVLAGNDLLLFSPDP------- 345 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + G + + + S + + + Sbjct: 346 ----ALTHRSLLEAVKSGLITEERINQSV-LRIMQAK 377 >UniRef50_C0BMZ8 Glycoside hydrolase family 3 domain protein n=1 Tax=Flavobacteria bacterium MS024-3C RepID=C0BMZ8_9BACT Length = 558 Score = 318 bits (815), Expect = 2e-85, Method: Composition-based stats. Identities = 77/347 (22%), Positives = 154/347 (44%), Gaps = 30/347 (8%) Query: 1 MGPVMLDVEGYELDAEEREI----LAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + + + +E ++ L GG+I + P + +L ++ S Sbjct: 51 IGQLFVVMVQSKAPEKELKVFEEELKRHQPGGVIF---SLGTPYKQAQLTNHYQSFSTVP 107 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++++D E G R + P + A+ + L G M + + + ++ Sbjct: 108 LMISMDAEWGVAMRLDS-VPKYPWNMTLGAIQDL----SLLSTIGRHMGMQAKRLGVHMN 162 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+DV + IG RS+ +D ++ A A GM+ AG+ T KHFPGHG + D Sbjct: 163 FAPVVDVNINPNNPIIGNRSFGSDVRRVSARALALSQGMNAAGVFTCAKHFPGHGDTSVD 222 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWL 231 SHK P P + + + ++ + + + + + +M AH+ + P S S Sbjct: 223 SHKALPVLPFTRERLDSIELYPYKTPLLQG-VQGVMVAHLDVPVLTGEPGLPVSLSHKVT 281 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L++EL F G++F+D L+M+GA+ + +L AG D++++ N+ Sbjct: 282 TGLLQKELNFKGLVFTDALNMKGASDFSPDGDVSLKALLAGNDVLVIPNDL--------- 332 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 + ++ KG + L S + K ++ + + ++ + Sbjct: 333 --GLSIAHLSEALKKGVLTEARLAYSVK-KILAAKYDLGLNQYSPVQ 376 >UniRef50_B5LAN7 HexI n=3 Tax=cellular organisms RepID=B5LAN7_9BURK Length = 521 Score = 318 bits (815), Expect = 2e-85, Method: Composition-based stats. Identities = 95/326 (29%), Positives = 159/326 (48%), Gaps = 18/326 (5%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 G + M+ + G LD + + L + + LF +N +P QL +L +RA ++ Sbjct: 22 GQLIMIRMPGTVLDEDTAQFLRDNHIRAVCLFRQNMVNPEQLAKLTADLRAVMGPNALIG 81 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 +DQEGG V R PAA + A+ + LA+ G +A + ++ + +FAPV Sbjct: 82 IDQEGGAVVR-ATWLPAPPAAMALGAVDDV----ALARSVGAAVARGIRSLGFNWNFAPV 136 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 LD+ + + IGERS+ +DPQ+A +A +++G E G+ KHFPGHG DSH++ Sbjct: 137 LDLNNNPANPVIGERSFGSDPQRATELALAWMEGSLEQGVACCVKHFPGHGDTHVDSHRD 196 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQ 237 P +P++ + A +++ F + + A+M AH++Y +DP PA+ S L +LR Sbjct: 197 LPTVAKPRSALDALELAPFKAA---RRAPAMMSAHIVYPALDPEYPATLSRKILTDLLRS 253 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 E + GVI +D + M A +L AG DM++ NR L +L+ Sbjct: 254 EWDYRGVIITDGMDMHAIAGRYGVGNAAVRALAAGADMVMALGNRSTQEETLVSLTM--- 310 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAI 323 + G + +AR + Sbjct: 311 -ALATAEEHGDLGSATV--AARLARL 333 >UniRef50_C7PJZ6 Beta-lactamase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PJZ6_CHIPD Length = 1012 Score = 318 bits (815), Expect = 3e-85, Method: Composition-based stats. Identities = 81/342 (23%), Positives = 146/342 (42%), Gaps = 30/342 (8%) Query: 1 MGPVMLDVEGYELDAE----EREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 + +++ L A+ + + + VGGLI F P + L ++ S+ Sbjct: 75 IAQLIMIRAHSNLGADHIKKVTKDIQNNKVGGLIFFQG---GPVRQANLTNYYQSISKTP 131 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++A+D E G R T LP A+ LA E G +A + + I + Sbjct: 132 LMIAIDGEWGLGMRLDS-VTSLPRNLMLGAMQD----STLAYEYGKQLAMQCRRLGIHVD 186 Query: 117 FAPVLDVGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 +AP +DV + +RS+ D + + + I GM + + KHFPGHG D Sbjct: 187 YAPDMDVNNNPNNPVINDRSFGQDKFQVARMGVQVIKGMQDQNVMAVAKHFPGHGDTDVD 246 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWL 231 SH + P + +A++ + ++ F I E + +IM AH+ +D P S S + Sbjct: 247 SHLDMPVIRKSRAQLDSLELYPFRKAI-EAGVQSIMMAHLYIPALDSTPNTPTSISHKAI 305 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 L+ ELG+ G++ +D L M+G A + E SL AG DM+++ + Sbjct: 306 TDYLKGELGYKGIVITDALEMKGIAKFYTGGEEAARSLLAGNDMMMLPST---------- 355 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + + + R +G + +E+ + K + + + Sbjct: 356 -AAGSVDAIKRAIRRGDITWEEVN-ARVKKVLMAKYKLGLNK 395 >UniRef50_D2PXL5 Glycoside hydrolase family 3 domain protein n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PXL5_9ACTO Length = 487 Score = 317 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 94/330 (28%), Positives = 153/330 (46%), Gaps = 19/330 (5%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +++ G + R +A +G +ILFTRN D Q+ L +RA R L++A+ Sbjct: 13 GALIVGFTGTTAPDDLRRAVAAG-LGAVILFTRNVTDAEQVAALTTALRAE-RPDLLIAI 70 Query: 62 DQEGGRVQRFRE-GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D EGG L + +S L + L + + A+ + A+ ID+ AP Sbjct: 71 DHEGGEFSHLAPANPWPLSSPRSLGELDDTD----LTRASARDAAATLAALGIDVMLAPS 126 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 +DV + I RS+ F+ G+H+AG+ KHFPGHG V DSH Sbjct: 127 VDVNSNPANPIISTRSFGTTAGVVSRHGVAFVQGVHDAGIAACPKHFPGHGDVAVDSHLA 186 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P R ++ A +++ F + I E D +M AH+I+S D +PA+ S L+ +LR E Sbjct: 187 LPRVDRSVEQVTAVELAPFKATI-EAGADLVMSAHIIFSAYDDQPATLSHRILQGLLRDE 245 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LGF GVI +D L M+ S + AS+ AG D+ ++ ++ + Sbjct: 246 LGFTGVITTDALEMQAITATRSIEDAAVASIRAGADLAMI--------AIGEADPRALTA 297 Query: 299 RVTRLYHKGSFSRQELMDSA-RWKAISTRL 327 + G+ L ++A R + ++ +L Sbjct: 298 HLVDAVSNGTLDAGRLAEAAGRVRELAGKL 327 >UniRef50_B0P7U2 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P7U2_9FIRM Length = 555 Score = 317 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 95/377 (25%), Positives = 155/377 (41%), Gaps = 43/377 (11%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA----SRNR 56 G + L + G LD + L +G +ILF+ N P Q EL +++ + Sbjct: 8 GQLFLTSIPGDTLDEQTIAFLNETNIGNIILFSNNTPSPIQTAELTAALQSCITQNTGVP 67 Query: 57 LVVAVDQEGGRVQRFREGFTRL-----------------------PAAQSFAALSG---- 89 ++AVDQEGG + R P A + AA G Sbjct: 68 AIIAVDQEGGEISRIIGSAYEYAPDENGIKYGEIPLGAGIETFLYPCAMAIAASGGAPTA 127 Query: 90 --MEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAI 145 + AG + E+ A+ I+ + APV D+ + IG RSY D ++ A Sbjct: 128 QEPVPEFNAVRTAGLYIGQELRAVGINCNLAPVADINSNPANPVIGTRSYGDDAERVAAD 187 Query: 146 ASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENK 205 A+ F G+ E+G+ KH+PGHG + DSH P + E+R+ ++ F + ++ Sbjct: 188 AAAFATGLQESGVLAVAKHYPGHGDTSIDSHYGLPSVNKTLEELRSLELIPF-NAVQNAD 246 Query: 206 LDAIMPAHVIYSDVD--PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAE 263 + A+M AH+++ +D PA+ S L VLR+ GFDG+I +D + M + E Sbjct: 247 IGAVMAAHIVFPQIDSTGLPATLSQPVLTGVLRKG-GFDGLILTDSMRMGAIMQNFGFGE 305 Query: 264 RGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKA 322 +++AG D+I + L + V GS S + L D R Sbjct: 306 ACVTAINAGADLITTGTGGDNIQGL--ALQRAAYDAVLAAVKSGSISPETLDDRVGRILL 363 Query: 323 ISTRLNQLHERWQEEKA 339 R + + + Sbjct: 364 YKERFGIIDGQAAPQSG 380 >UniRef50_B6BUL6 Beta-hexosaminidase n=1 Tax=beta proteobacterium KB13 RepID=B6BUL6_9PROT Length = 339 Score = 317 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 130/294 (44%), Positives = 188/294 (63%), Gaps = 7/294 (2%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G +++D+EG E+ + + L+HP+V G+ILFTRN+ + Q+++L+ I R+ L++ V Sbjct: 4 GSIIIDIEGKEITQRDIQRLSHPVVCGVILFTRNFENKNQIQDLISSI-KLVRDDLLITV 62 Query: 62 DQEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 D EGGRVQRFRE F +P+ L E+G +LA+ AGWL+ASE+ + ID++++P Sbjct: 63 DHEGGRVQRFRENFYAMPSMSELGDLYAHDSEQGLRLARYAGWLIASELGELHIDLNYSP 122 Query: 120 VLDVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 VLD+ + S+ IG RS+ +P+ + +A FIDG H AGM + GKHFPGHG + DSH+E Sbjct: 123 VLDINYGKSSVIGNRSFSDNPKAIIELAKAFIDGQHHAGMTSVGKHFPGHGFIHEDSHEE 182 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 D R AE+ D+ F SL+ +L A+MP HVIY D +PAS S +WLK LR++ Sbjct: 183 ISIDNRSLAEMEN-DIDCFKSLVNH-RLGAVMPGHVIYKKSDNKPASLSSFWLKDFLRKK 240 Query: 239 LGFDGVIFSDDLSMEGAAIMG-SYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 L F G+I SDDLSM+G A ER +L AGCD++L+CN + LD Sbjct: 241 LNFKGIIISDDLSMKGVADYIPDIYERVSQALTAGCDVVLICNKPEDVDEFLDR 294 >UniRef50_C1A778 Beta-N-acetylhexosaminidase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A778_GEMAT Length = 658 Score = 317 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 83/336 (24%), Positives = 139/336 (41%), Gaps = 35/336 (10%) Query: 16 EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGF 75 + +GG+I+ + PA + V ++ +++ L+V D E G R R G+ Sbjct: 118 SIERQVRELELGGIII---SVGGPADIALKVNALQRSAKLPLLVGADLETGAAFRARGGW 174 Query: 76 -----------TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG 124 T P A LA E G + A E AM I ++FAPVLDV Sbjct: 175 FLPNAIELGGATSFPYQMGIGATRDT----MLAYEMGRVTAVEGRAMGIHMAFAPVLDVN 230 Query: 125 HI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCD 182 + + I RS+ DP + + + G+ E GM TGKHFPGHG +SH E Sbjct: 231 NNPKNPVISGRSFGEDPALVSRMGAALVRGIQENGMLATGKHFPGHGDTDQNSHLELSRV 290 Query: 183 PRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRP--ASGSPYWLKTVLRQELG 240 +A + + ++ F + I + + IM H +D A+ S + +L++++ Sbjct: 291 NVSRARLDSVELRPFQAAI-DAGVRGIMTFHGDLPALDTTHTAATLSRAVMTDLLKRQMR 349 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F+G++ +D L M G + E +L+AG D++L+ + + ++ + Sbjct: 350 FNGLLITDALDMNGVLGNLTMQEATLRALEAGNDVLLMPTDARASIQAM----------- 398 Query: 301 TRLYHKGSFSRQELMDSAR-WKAISTRLNQLHERWQ 335 G S L +S R ER Sbjct: 399 VDAVASGRVSEARLDESVRKLLMAKHEFGLHRERLV 434 >UniRef50_C0CJ14 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CJ14_9FIRM Length = 441 Score = 317 bits (813), Expect = 4e-85, Method: Composition-based stats. Identities = 77/332 (23%), Positives = 140/332 (42%), Gaps = 31/332 (9%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQI----RAASRNRLVVAVDQEGG 66 E+ E VGG++ + QL + R + +AVD+EGG Sbjct: 125 LEVGELTCEAFRQYPVGGIVYMENHILSEEQLTAMNRSFRELGQETLGVSPFLAVDEEGG 184 Query: 67 RVQRFREGFTRLP----AAQS-FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 V R P A ++ A + G + + + + ++ FAPV Sbjct: 185 TVTRIAGNQ-AFPVENVGNMENVGAGRDSDQ----AYQVGQTLGAYLKSYGFNMDFAPVA 239 Query: 122 DV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 DV + + ER++ +DP ++ + G+ ++ KHFPGHGA DSH Sbjct: 240 DVLVNSENTVVKERAFGSDPNLVSSMVEAEVRGLKGEKIEAVLKHFPGHGATAEDSHNGY 299 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQ 237 R E+R ++ F + I + + +M H+ + ++ PAS S + + +L+ Sbjct: 300 AYAQRSLEELRETELVPFQAGI-DAGAEFVMVGHICFPLIEDGQMPASLSSWAVTELLKG 358 Query: 238 ELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 E+GF+GV +D ++M A S AE ++ AG DM+L+ + + A + Sbjct: 359 EMGFEGVAVTDAMNMGAIAENYSSAEAAVQAIQAGIDMVLMPADFEAAYN---------- 408 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 V + S++ L D+ R + ++ +L Sbjct: 409 -GVLQAVSSQEISQERLHDALR-RILTVKLEM 438 >UniRef50_Q1IZF6 Beta-N-acetylhexosaminidase n=2 Tax=Deinococcus RepID=Q1IZF6_DEIGD Length = 478 Score = 317 bits (813), Expect = 4e-85, Method: Composition-based stats. Identities = 87/314 (27%), Positives = 146/314 (46%), Gaps = 14/314 (4%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQE 64 M+D+ G LDA+ L + + LF +N + AQLR+L +R ++A+D E Sbjct: 1 MVDIPGKALDADTASHLRRHGIRAVCLFGKNVQNAAQLRQLCADLREVMGEAALIALDHE 60 Query: 65 GGRVQRFREGFTRL-PAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 GG + R F P+A S A + L ++ +A ++ ++ ++ +FAPVLDV Sbjct: 61 GGAI--LRPTFWPFAPSAMSLGAADNV----ALTEDVNAALARQLRSVGVNWNFAPVLDV 114 Query: 124 GHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IGER+Y AD + + + + G + KHFPGHG DSH P Sbjct: 115 NVNPANPVIGERAYGADVDRVIRHGTAALLGHTRERVAGCVKHFPGHGDTHLDSHYALPR 174 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELG 240 + +AE+ A + + F + + A+M AH++Y +DP PA+ S L +LR E G Sbjct: 175 VSKSRAELDAVEFAPFRACLP--LAPAVMTAHIVYEALDPCSPATLSRTVLTELLRGEWG 232 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 + GV +D + M+ E +L AG D+++ R+ L ++ + Sbjct: 233 YQGVTVTDSMGMQAIDAHYGRGEAAVLALRAGADLVMALGRREAQDVTLVAIARALNSEL 292 Query: 301 TRLYHKGSFSRQEL 314 + + S L Sbjct: 293 DA--REVAASLARL 304 >UniRef50_D0LSI1 Glycoside hydrolase family 3 domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LSI1_HALO1 Length = 387 Score = 316 bits (812), Expect = 5e-85, Method: Composition-based stats. Identities = 99/299 (33%), Positives = 153/299 (51%), Gaps = 16/299 (5%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQ-----LRELVRQI-RAASR 54 G ++ EG A+ + +A LVGG++LF RN + LR + R++ ++AS Sbjct: 12 GQLLWFGYEGDYPPADMQREIAAGLVGGVLLFKRNLPPEDEHFAPALRAINRRLHQSASG 71 Query: 55 --NRLVVAVDQEGGRVQRFREGFTRLPAAQ---SFAALSGMEEGGKLAQEAGWLMASEMI 109 L +A+DQEGG VQR R+ T P + A G + LA+ G + E+ Sbjct: 72 DATPLWIAIDQEGGTVQRVRQPGTVWPPMLSFDALAEARGDQYAIALAERVGLALGHELA 131 Query: 110 AMDIDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPG 167 A++ID+ FAPVLDV + IG+R+ +DP++ G+ AG+ + GKHFPG Sbjct: 132 ALEIDVDFAPVLDVHTNPANPVIGKRALSSDPEQVAQRGLALARGLERAGVVSCGKHFPG 191 Query: 168 HGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASG 226 HG DSH P A + A ++ F+ + IM AHV+++ +D PA+ Sbjct: 192 HGDTDTDSHLALPRLTHGSARLDAIELLPFARA-AAADIPMIMTAHVVFAALDDTVPATL 250 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S + +LR+ LG+ G+I SDDL M A E ++ AGCD++L+C +R Sbjct: 251 SHAVITGLLRRRLGYQGLIVSDDLDMRAIAAHFGIGEAAVMAVRAGCDVLLLCRDRAHQ 309 >UniRef50_C6XX68 Glycoside hydrolase family 3 domain protein n=4 Tax=Sphingobacteriaceae RepID=C6XX68_PEDHD Length = 568 Score = 316 bits (812), Expect = 5e-85, Method: Composition-based stats. Identities = 73/325 (22%), Positives = 138/325 (42%), Gaps = 26/325 (8%) Query: 10 GYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQ 69 G + +++ VGGL+ F P + L ++ +R L++ D E G Sbjct: 64 GKAFEDSIGKVIKKERVGGLVFFQG---GPGRQAILTNTYQSLARVPLLITSDGEWGLGM 120 Query: 70 RFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--S 127 R + P + A+ + L + G +A + + + ++ AP +DV + + Sbjct: 121 RLDSTIS-YPYQMALGAVQNKD----LLYKMGLEVARDYKRIGMHMNLAPDVDVNNNPKN 175 Query: 128 AAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQA 187 I RS+ + A+ ++ GM + G+ + KHFPGHG DSH + P +A Sbjct: 176 PVINFRSFGENKYNVATKAAAYMKGMQDGGLLVSIKHFPGHGDTDVDSHYDLPQLNFTRA 235 Query: 188 EIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWLKTVLRQELGFDGV 244 + + ++ F LIRE +M AH+ +D P++ S + +L++ELGF G+ Sbjct: 236 RLDSLEIYPFRELIREG-AAGVMIAHMNIPALDNTPNMPSTLSKPIVTGLLKEELGFKGI 294 Query: 245 IFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLY 304 + SD + M+G E + AG D++ + N S + V + Sbjct: 295 VISDAMGMKGVVKNFKDGEADVMGIIAGNDILELSEN-----------SARAIKLVRKAV 343 Query: 305 HKGSFSRQELMDSARWKAISTRLNQ 329 S +++ S + K ++ + Sbjct: 344 RADRISMEQIDASVK-KILTAKYWA 367 >UniRef50_B2AR40 Predicted CDS Pa_4_8720 n=3 Tax=Leotiomyceta RepID=B2AR40_PODAN Length = 898 Score = 316 bits (812), Expect = 5e-85, Method: Composition-based stats. Identities = 78/339 (23%), Positives = 147/339 (43%), Gaps = 26/339 (7%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN---- 55 +G + ++ +G E+ + R ++ +G ++L +N Q +LV++++ + Sbjct: 39 IGQMVIMGWDGTEVTPQIRHLIEEHHLGSILLTAKNLKSAHQTAKLVQELQTIAHQAGHL 98 Query: 56 -RLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L++A+DQE G V + + P+A AA + LA + A+E+ A+ Sbjct: 99 QPLLIALDQENGGVNSLYDEDYICQFPSAMGQAAAGNAD----LAYKVAKATATEVSAVG 154 Query: 113 IDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 +++ PVLDV +G R+ DPQ+ ++G +AG+ T GKHFP +G Sbjct: 155 VNLILGPVLDVLTNARYQPLGVRAVGDDPQEVSQYGIAAMNGYKDAGVATCGKHFPSYGN 214 Query: 171 VT-ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAH--VIYSDVDPRPASGS 227 + S + P + E+ + F + I KLDA+ + ++ A S Sbjct: 215 LDFLGSSLDVPIITQTLEELSLSALVPFRNAIATGKLDAMFVGGCGITNPSMNVNHACLS 274 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + +LR ELGF GV S+ L ME +++AGCD++L+C + Sbjct: 275 DQVVDDLLRNELGFTGVAISECLEMEALRSEIGVKTGTVMAVEAGCDLVLLCRAYDVQLE 334 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + L +++ + S R + + + Sbjct: 335 AVAGLKLG--------VENELLTKERIYTSLR-RVLKMK 364 >UniRef50_A6RET6 Putative uncharacterized protein n=1 Tax=Ajellomyces capsulatus NAm1 RepID=A6RET6_AJECN Length = 876 Score = 316 bits (812), Expect = 5e-85, Method: Composition-based stats. Identities = 80/333 (24%), Positives = 144/333 (43%), Gaps = 26/333 (7%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----NRLVVA 60 + +G +++E R ++ +G +IL +N + +LV +++ +R L++ Sbjct: 1 MGFDGTSVNSEIRSLIEDYHLGSVILTAKNLKSAEETTKLVYELQTIARDAGHPVPLLIC 60 Query: 61 VDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +DQE G V + + P+A AA LA+E A E+ A+ I+ Sbjct: 61 LDQENGGVNTLFDEIYIRQFPSAMGMAATGS----KSLAREVCKATAQELSAVGINWILG 116 Query: 119 PVLD--VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT-ADS 175 PVLD + +G RS DPQ+ A F+ G EAG+ T GKHFP +G + S Sbjct: 117 PVLDALTNARNQPLGVRSVGDDPQEVSAYGIEFMKGYQEAGLVTCGKHFPSYGNLDYMGS 176 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSD--VDPRPASGSPYWLKT 233 + P ++ + + + I +D++M + + A S + Sbjct: 177 QSDVPIITDTIEQLSVSALVPYRNAITHG-IDSMMVGGCSMASAGMTVMHACLSEQVVDE 235 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR++L FDGV+ S+ L ME + ++ AGCD+IL+C + ++ L Sbjct: 236 LLRKDLKFDGVVVSECLEMEALSYNIGVGGGTVMAMKAGCDLILLCRSFSVQQEAINGLK 295 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 G S+ + +S R + + + Sbjct: 296 LG--------VENGIISKSRIRESLR-RVLDMK 319 >UniRef50_Q6MGM9 Beta-hexosamidase A n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MGM9_BDEBA Length = 558 Score = 316 bits (812), Expect = 5e-85, Method: Composition-based stats. Identities = 82/334 (24%), Positives = 146/334 (43%), Gaps = 23/334 (6%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRA----ASRNR 56 G + ++ ++ + +A G +LF RN Q+REL + S+ Sbjct: 53 GQLFIVGFPHKDVAPDLESFIAKYKPGSFLLFKRNMISAPQIRELNLALYKSSYKHSKLP 112 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 ++A+DQEGG V R P A + LA+E G+ + + +++ Sbjct: 113 PLIAIDQEGGSVSRL-PITPAPPNALALGQTQSP----LLAEEMGYQTGLFLREVGFNMN 167 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 APVLDV ++ IG RS+ +DP+ I + G+ +A + T KHFPG G++ D Sbjct: 168 LAPVLDVVDPYSTSFIGVRSFGSDPELVRDIGVAYSKGLLKARVIPTAKHFPGTGSLNQD 227 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLK 232 H + E++ +D+ F+ + A+M +H+IY +D PAS S K Sbjct: 228 PHLTIVKNSASLEEMKKRDLIPFTGYSKIGANIAVMMSHLIYPGLDEDLEPASFSTKISK 287 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 T+LR EL + G++ +DDL M+G+ + +L +G D++++ + D Sbjct: 288 TLLRDELKYQGLVMTDDLQMQGSKQVLRPEAAALRALQSGSDIVMLTWSFADQGKAFD-- 345 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 V G + + R + + + Sbjct: 346 ------YVLAALKSGELPSAHVDEKLR-RILRVK 372 >UniRef50_A9BFQ0 Glycoside hydrolase family 3 domain protein n=1 Tax=Petrotoga mobilis SJ95 RepID=A9BFQ0_PETMO Length = 528 Score = 316 bits (812), Expect = 6e-85, Method: Composition-based stats. Identities = 93/334 (27%), Positives = 163/334 (48%), Gaps = 21/334 (6%) Query: 1 MGPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVV 59 +G + + ++G L+ E ++L L G +I F+RN + QL + + I+ + Sbjct: 8 LGKLFLIGIQGTSLNNENMKVLKSTLPGVIIFFSRNIENKYQLSKFIEDIKNFLDYEPLF 67 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +DQEGG V R ++GFT +P+A A + E A + L+A EM+A+ ID + AP Sbjct: 68 CIDQEGGTVARLKKGFTVVPSAMGITATNEAEN----AYLSANLLAKEMLAVGIDWNLAP 123 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 V+D+ + ++ IG RS+ D L A ++ G+H+ G+ + KHFPG G V D H Sbjct: 124 VVDINNNPKNSVIGIRSFSDDKNVVLKFAREYVKGLHDGGVLSCLKHFPGIGNVNTDPHL 183 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVL 235 + P + ++ + ++ F ++ + MP HV S + PAS S L + Sbjct: 184 DLPQGELSKDDLLSTELFPFLNI----DSPSWMPTHVYLSKIQNKKEPASLSEEILTGLA 239 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+EL + GV+ +DD M G A S E SL+AG D++ +C++ + + + Sbjct: 240 REELKYKGVLVADDFEMGGVANFYSAQEAVIKSLNAGMDIVSICHSFEKQSAAKN----- 294 Query: 296 KAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 V R Y ++++ S +++ Sbjct: 295 ---AVLREYKNNENFKKKINSSLERIQSLMKISA 325 >UniRef50_B2KAV5 Glycoside hydrolase family 3 domain protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KAV5_ELUMP Length = 541 Score = 316 bits (812), Expect = 6e-85, Method: Composition-based stats. Identities = 90/346 (26%), Positives = 159/346 (45%), Gaps = 26/346 (7%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RN 55 +G ++ + + + + LVGG+++ NY Q EL ++++ + + Sbjct: 31 LGQTLVAFVDTDNAHKYQSAIEKGLVGGVLVQWGNY-SLEQTTELAAKLQSWAAKSPHKI 89 Query: 56 RLVVAVDQEGGRV-QRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDID 114 L++++D EGG V GF LP AA + E ++ L E+ I Sbjct: 90 PLLISIDYEGGTVYTPVTLGFEYLPPNMMIAAANDEEAAARIFY----LAGLELRKAGIH 145 Query: 115 ISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT 172 I+F+PV+DV + IG RS+ + P+ + + + G+ A + + KHFPGHG Sbjct: 146 INFSPVVDVNINPGNPIIGVRSFGSSPELVGRMGAAVVSGLSAANVMSVAKHFPGHGNTV 205 Query: 173 ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWL 231 DSH P + E++ ++ F I E + IM AH+IY + DP+ PA+ S L Sbjct: 206 LDSHYSLPVLNITKKEMQDVHLAPFKKAI-EAGVPGIMTAHIIYKNYDPKNPATYSKRIL 264 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR E+ F GVI SD L M+GA + G+ A +L+AG DM L+ + Sbjct: 265 NDLLRTEMKFKGVIISDALDMKGATLDGNIALSAAKTLEAGSDMALLGRFLNADKTFNKI 324 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQE 336 + E S++ + +++ + + ++ E+ + Sbjct: 325 YGYVGTE----------LSQKRIEEASKKILDLKKQMGLFDEQKEP 360 >UniRef50_Q8YVY6 All1831 protein n=11 Tax=Nostocaceae RepID=Q8YVY6_ANASP Length = 365 Score = 316 bits (810), Expect = 8e-85, Method: Composition-based stats. Identities = 100/333 (30%), Positives = 160/333 (48%), Gaps = 25/333 (7%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHD-------PAQLRELVRQIRAASRNR 56 +++ + G L E++ L G+I F +N+ D +EL QI+ + Sbjct: 14 LIMGISGTSLSDEDKRALGELKPIGVIFFAKNFLDGVPYAVWLETFQELHSQIQEYAERD 73 Query: 57 -LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 + +D EGGRV R TR P A A+E A E+ ++ I++ Sbjct: 74 SMFFTLDHEGGRVVRTPLPITRFPQALLL---------RSQAREVAKATAIELKSLGINL 124 Query: 116 SFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 S++PV D+ + IG R++ P A A A + G+ EAG+ KHFPGHG + Sbjct: 125 SWSPVADIYSHPQNPIIGSRAFGNTPDTAAAGAREYYLGLTEAGIVGCAKHFPGHGDTSK 184 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLK 232 DSH E P E++ +++ F +LI E + IM AH+++ +DP PA+ S LK Sbjct: 185 DSHVELPILNLTPEELQNRELIPFKALIEEG-IPLIMTAHILFPKIDPDLPATLSRPILK 243 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 T+LR+ELGF GV+ SDDL M+ + M + + +AGCD+ +V N + Sbjct: 244 TILRKELGFQGVVVSDDLDMKAVSDMFMQSGTVARAFNAGCDLFIVSRNIHASS---IER 300 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAIST 325 + AE T GS + +++SA+ + Sbjct: 301 TYKIAEDFTDCLTDGSLAES-VVESAKERIERL 332 >UniRef50_A1JQZ0 Glycosyl hydrolase family protein n=8 Tax=Enterobacteriaceae RepID=A1JQZ0_YERE8 Length = 625 Score = 316 bits (810), Expect = 8e-85, Method: Composition-based stats. Identities = 98/364 (26%), Positives = 164/364 (45%), Gaps = 39/364 (10%) Query: 1 MGP-VMLDVEGY------------ELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVR 47 +G +M+D+ + E++ +++ +G +ILF N D Q+ EL+ Sbjct: 53 LGQILMVDIRSWSHSDNSDKTAFIEINDSVSKMIRDYHLGSIILFRENLIDTPQIVELIN 112 Query: 48 QIR-AASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMAS 106 ++ A S L ++ DQEGG V R R G T +P + A K AQ+AG + Sbjct: 113 NLQNARSNLPLFISTDQEGGYVTRLRVG-TEMPGNMALGATGS----SKFAQQAGSIHGY 167 Query: 107 EMIAMDIDISFAPVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 E+ ++ + ++ + IG RSY DP +A +I G+H+ + T+ KHFP Sbjct: 168 ELSSLGFNFGPVVDVNNNQNNPVIGVRSYSNDPVLVAELARSYISGIHKYNVLTSLKHFP 227 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD------ 220 GHG VT+D+H P +A + ++ F ++ DAIM AHV+ +D Sbjct: 228 GHGNVTSDTHFSLPAVNIDKAAWQQVELKPFVEVMP--VTDAIMTAHVVVPALDNSMLTN 285 Query: 221 ------PRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCD 274 PA+ S L +LR +L FDG+I +D + M + ++ AG D Sbjct: 286 IKGEKVGTPATLSKPILTDILRNQLKFDGLILTDAMDMGAITSNFERNWSIKQAIMAGND 345 Query: 275 MILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERW 334 ++L+ K + ++ L + T +Q + DSA + I T+LN+ R Sbjct: 346 IVLMPMEIKN-RASIEQLDALYDYLKTEATKDPEL-KQRIEDSA-QRVIYTKLNK---RI 399 Query: 335 QEEK 338 E Sbjct: 400 SPEP 403 >UniRef50_B4RS37 Beta-hexosaminidase n=2 Tax=Alteromonas macleodii RepID=B4RS37_ALTMD Length = 340 Score = 316 bits (810), Expect = 1e-84, Method: Composition-based stats. Identities = 151/309 (48%), Positives = 200/309 (64%), Gaps = 1/309 (0%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 MGP+MLD + EL EE+E LAHP+VGG+ILFTRNYHD QL LV+ IR ++N +++A Sbjct: 1 MGPLMLDCQAEELLPEEKEKLAHPVVGGVILFTRNYHDKRQLSALVQDIRRFAKNNVLIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQS-FAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 VD EGGRVQRFR+GFT +PA +E A+ G +A E+ +DID SFAP Sbjct: 61 VDHEGGRVQRFRDGFTAIPAMGDILRQSEHQDESAAYAKACGLALAYELKTLDIDFSFAP 120 Query: 120 VLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 VLD+ +S IG+R++ + + + +A++ I G+ M GKHFPGHG+V DSH Sbjct: 121 VLDINGVSQVIGDRAFSNNADEVIRLATQLIQGLKTVNMPAIGKHFPGHGSVAPDSHIAL 180 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQEL 239 P D R EIR DM VF +I LD +MPAHVIY+ V +PA S WLKT+L+ ++ Sbjct: 181 PVDERGFDEIRRTDMVVFERVIERALLDGVMPAHVIYNQVCDKPAGFSSTWLKTILKNDI 240 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 GF+G +FSDDLSM GA++ GSY ER +A+LDAGCDM+L CNN KGA S+LD L + Sbjct: 241 GFNGAVFSDDLSMHGASVAGSYVERAEAALDAGCDMVLACNNPKGAESILDGLEARFSNP 300 Query: 300 VTRLYHKGS 308 + Sbjct: 301 NNSDVARQE 309 >UniRef50_D1SAP6 Glycoside hydrolase family 3 domain protein n=4 Tax=Actinomycetales RepID=D1SAP6_9ACTO Length = 499 Score = 315 bits (808), Expect = 2e-84, Method: Composition-based stats. Identities = 82/318 (25%), Positives = 134/318 (42%), Gaps = 16/318 (5%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGG 66 G L +G ++LF RN D Q+ L +RA R ++VA+D+E G Sbjct: 23 GFVGTTPPPWVCRWLGEG-LGSVVLFARNVVDHEQVATLTANLRAE-RPDVIVAIDEEAG 80 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 V R + P + A+ L +E + +E+ A+ + + +AP DV Sbjct: 81 DVTRIESARGSSRPGNYALGAVDDP----ALTEEVARDLGAELAAVGVTLDYAPDADVNS 136 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + IG RS+ ADP + ++ G+ G+ KHFPGHG DSH + P Sbjct: 137 NPANPVIGVRSFGADPDMVARHTAAWVRGLQAGGVAACAKHFPGHGDTRVDSHHDLPRIG 196 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFD 242 A + A +++ F + + M H++ +DP PA+ SP L +LR+E+GF Sbjct: 197 GDLARLDAVELAPFRAAVAAGVQAV-MTGHLLVPALDPELPATLSPRVLGGLLREEMGFG 255 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 GV+ +D + M A + +L AG D I V R + + Sbjct: 256 GVVVTDAVEMRAVADRYGFTGAAVRALVAGADAICVGGERATEADA-----RELRDAIVA 310 Query: 303 LYHKGSFSRQELMDSARW 320 G + L ++A+ Sbjct: 311 AVISGELPEERLAEAAKR 328 >UniRef50_A6Q7E8 Glycosyl hydrolase, family 3 n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6Q7E8_SULNB Length = 361 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 96/349 (27%), Positives = 162/349 (46%), Gaps = 35/349 (10%) Query: 1 MGPV-MLDVEGYELDAEER--EILAHPLVGGLILFT---------RNYHDPAQLRELVRQ 48 +G + M+ G R + + +G +ILF +N QL++L Q Sbjct: 26 IGQMLMVGFHGTSAPKNSRICQDIRKYHIGAVILFDYNPVDKSKPKNIASRKQLKKLTAQ 85 Query: 49 IRAASRNR-LVVAVDQEGGRVQRFR---EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLM 104 ++ S + L++AVDQEGG+VQR + + P A + K ++ M Sbjct: 86 LQKCSYDGKLLIAVDQEGGKVQRLKRKYGFYGNFPKASDVTRM-----DKKQIKKTYRKM 140 Query: 105 ASEMIAMDIDISFAPVLD--VGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKT 160 A E+ ++ I+ APV+D + + I RS+ DP+ A AS FID MH G+ T Sbjct: 141 AKELKSVGINYDLAPVVDLDINPKNHVIHGLGRSFGKDPKTVAAYASTFIDAMHHYGVLT 200 Query: 161 TGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD 220 + KHFPGHG+ D+HK + ++ + L R+ D +M AHV +D Sbjct: 201 SLKHFPGHGSSVGDTHKGFVDVT---NLWKRVELEPYRLLNRKA--DTVMVAHVFNKKLD 255 Query: 221 PR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC 279 R PAS S + +LR++LGF+GV+ +DDL M + + +++AG D++L Sbjct: 256 ARYPASFSNRTVNGILRKQLGFNGVVITDDLQMGAISKKYGLKNTLKLAINAGDDILLFG 315 Query: 280 NNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRL 327 N +V + E + L +G + + + + R + + +L Sbjct: 316 NQLDPRKTV---STKKLVETIKSLLKRGEVNPKSIDYAYIRIQNLKRKL 361 >UniRef50_A4C3R7 Beta-glucosidase-related glycosidase n=1 Tax=Pseudoalteromonas tunicata D2 RepID=A4C3R7_9GAMM Length = 538 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 68/344 (19%), Positives = 139/344 (40%), Gaps = 18/344 (5%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN----- 55 G + + G++ + ++ +GG + N D A +L ++ + Sbjct: 21 GQLFVLAFAGHDY-EYAQHLIQTHHIGGFYITDDNAADLASAAKLANTLQQLAALRACDA 79 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L++ VDQEG + P + + + L + + A +M ++ + Sbjct: 80 PLILGVDQEGAWGI-LTQQTDLGPGNLALGKANDL----ALTYDMYRVFAQQMQSIGYNT 134 Query: 116 SFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 +P DV + IG+R++ + + + + G+ KHFPGHG Sbjct: 135 LLSPCADVNADPKNPIIGQRAFGETAEHVASHVAAAVKGILSTDNFACAKHFPGHGDTHT 194 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLK 232 DSH+ P + E+ +D+ F I E + IM +H+ Y +D PA+ S L Sbjct: 195 DSHQALPVVNKSLNEMMQQDLLPFKRAI-EAGVSMIMTSHINYPKIDAHYPATLSATILT 253 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC-NNRKGAVSVLDN 291 T+L+++LGF G+I +D ++M S + +L AG +I++ + + + + Sbjct: 254 TLLKEQLGFKGLIITDSMNMWAMRKNFSPVDAAVQALKAGAHLIMLSEEHYENSTTAYKE 313 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 + V +G + D + ++ + + L + Sbjct: 314 IQAQTIAGVIDAVQQGELGETLIDDILKH-VLAYKYSYLKADAK 356 >UniRef50_D1YED3 Glycosyl hydrolase family 3 N-terminal domain protein n=3 Tax=Propionibacterium acnes RepID=D1YED3_PROAC Length = 629 Score = 314 bits (806), Expect = 3e-84, Method: Composition-based stats. Identities = 88/365 (24%), Positives = 150/365 (41%), Gaps = 71/365 (19%) Query: 18 REILAHPLVGGLILF--TRNYH---DPAQLRELVRQIRAASRN----RLVVAVDQEGGRV 68 + + +GG++ F + N +PA + L ++ A+R L +AVDQEGG V Sbjct: 76 AQAVQKYHLGGVLYFNWSGNISTPANPAAVAHLSNGLQEAARTSSTAPLAIAVDQEGGIV 135 Query: 69 QRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI-- 126 R T P + A++ + A+ G ++ E+ A+ I++ FAP +DV Sbjct: 136 ARITSPATEFPGNMALGAVNDP----RAARAQGAVLGRELAALGINVDFAPDVDVNTNPA 191 Query: 127 SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQ 186 + IG RS DP + + I G+ G+ T KHFPGHG DSH P + Sbjct: 192 NPVIGVRSMGDDPVRVGVLGVEQIRGIQSRGVAATAKHFPGHGDTQIDSHLGLPVVEYGR 251 Query: 187 AEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVI 245 + + + F I ++D IM AH+I +DP PA+ S L +LR+++G+ G+I Sbjct: 252 TTL-ERHLVPFRMAIS-AEVDMIMTAHIIVKTLDPTMPATLSKKVLTGLLREQMGYRGII 309 Query: 246 FSDDLSMEGAAIMGSYAE-----------------------------------------R 264 +D L MEGA + + Sbjct: 310 TTDALDMEGAQLAVMTEDEKGTYTRLKAAADAEGNDPTAADEGHASAEFNAFMKPIRGRV 369 Query: 265 GQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAIS 324 + +AG D++L ++ V+ + R G S + L +S + ++ Sbjct: 370 AVQAFEAGSDILLNPHDADAVVTAMR-----------RAIADGRVSERRLDESV-LRILA 417 Query: 325 TRLNQ 329 + + Sbjct: 418 WKRRR 422 >UniRef50_UPI00019730D7 beta-glucosidase-like glycosyl hydrolase n=1 Tax=Clostridium sp. M62/1 RepID=UPI00019730D7 Length = 505 Score = 314 bits (805), Expect = 3e-84, Method: Composition-based stats. Identities = 85/331 (25%), Positives = 146/331 (44%), Gaps = 29/331 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLREL----VRQIRAASRNRLVVAVDQEGG 66 Y + + VGG++ F++N D AQ +++ + + + +++AVD+EGG Sbjct: 183 YAAGEATKAAIEQYPVGGIVYFSQNLRDEAQTKDMLTRTNQYAKERTGLPMLLAVDEEGG 242 Query: 67 RVQRF--REGFT--RLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 +V R REGF P A E+ A E G + + + + ++ FAPV D Sbjct: 243 QVARISGREGFALDAFPDMSDIGAQGDSEK----AYEVGDAIGAYLAELGFNMDFAPVAD 298 Query: 123 V--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + + RS+ +D + + R +G+ + G+ KHFPGHG DSH+ Sbjct: 299 VLTNPDNTVVKRRSFGSDAKTVANMVVRETEGLKKHGISPVLKHFPGHGGTAEDSHEGAA 358 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQE 238 R E+ ++ F + I E D +M H+ + D RPA S + VLR Sbjct: 359 VIHRTVDELLENELIPFYTGI-EAGADFVMVGHISDPEAVGDDRPAVFSKTLVTEVLRDT 417 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 L FDG+I +D ++M S + +L+AG DMIL+ + K A + Sbjct: 418 LKFDGIIITDAMNMGAITESCSSGDAAVKALEAGVDMILMPEDFKAA-----------CQ 466 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 V G+ + +S + ++ + + Sbjct: 467 GVEEAVEAGTIGEDRIEESV-LRILTLKAEK 496 >UniRef50_C8PQ37 Glycosyl hydrolase, family 3 n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PQ37_9SPIO Length = 561 Score = 313 bits (804), Expect = 4e-84, Method: Composition-based stats. Identities = 96/337 (28%), Positives = 154/337 (45%), Gaps = 24/337 (7%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----RNRLVV 59 M G + +G + +F N D +L + V ++ + R L V Sbjct: 68 MFGWAGQAPTDLLMNWITERSLGSIKIFGWNTEDSVELAKAVSLLQKKANGTRFRIPLFV 127 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A DQEGG ++ + + P + A + A G+ + E+ A+ I+++FAP Sbjct: 128 ATDQEGGWIRHVKGRTSETPGNLAIGASGYPID----AYYEGFYIGRELSALGINLNFAP 183 Query: 120 VLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 +D+ H S+ IG RS+ P+ A +A F+ G ++AG+ TT KHFPGHG DSH Sbjct: 184 TVDLYTNHESSIIGPRSFGQSPEAAGILARAFMQGSNKAGVLTTAKHFPGHGDTALDSHG 243 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKTVL 235 + P + +++ F LI + + AIM H+ + V PAS SPY L+TVL Sbjct: 244 KLPSIAISKETFYNRELVPFQYLI-DADVPAIMTGHLNFPAVSEHGEPASFSPYLLQTVL 302 Query: 236 RQELGFDGVIFSDDLSM-EGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 R +LGF G+I +DD+ M G+ A+ Q +L+AG D+I + + Sbjct: 303 RGQLGFKGLIITDDMMMHGAIRYAGTVAKAVQLALEAGNDIIESSTTPRLYDAFWTEN-- 360 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLH 331 + R + D+AR + I T+L Sbjct: 361 ------LQRMQTNEAFRLRVKDAAR-RIIETKLRYFK 390 >UniRef50_UPI0001B54569 sugar hydrolase n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B54569 Length = 483 Score = 313 bits (804), Expect = 5e-84, Method: Composition-based stats. Identities = 89/322 (27%), Positives = 139/322 (43%), Gaps = 19/322 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 ++ G R LA +GG+ILF RN D Q+ L +R+ R+ +V+ +D+ Sbjct: 14 LLPGFAGTTAPDWVRRRLAEG-LGGVILFGRNVVDDEQVAALTSALRSE-RDGVVIGIDE 71 Query: 64 EGGRVQRFREGFTRL-PAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 EGG V R G P + A E L + + A + ++ AP D Sbjct: 72 EGGDVTRLDVGKGSFVPGPLALGAAGDPE----LTASVAAALGERLAACGVTVNLAPCAD 127 Query: 123 V--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 + IG R++ +DP A + F+ GM + G+ KHFPGHGA T DSH P Sbjct: 128 LTLAAEDPGIGVRAFGSDPVAASPHVAAFVTGMQKYGVAACAKHFPGHGAATEDSHLSLP 187 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 PR AE+R ++ F++ I + + +M H++ + PA+ + L VLR ELG Sbjct: 188 VLPRTAAELREVELVPFAAAI-DAGVRTVMSGHLVVREWGEEPATFNRTALVDVLRGELG 246 Query: 241 FDGVIFSDDLSMEGAAIMG----SYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 F G + +D L M + E +L G D + + + Sbjct: 247 FTGTVITDALEMGAISGAYGRHDGIGEASVRALLGGADALCLGG-----AAFDAETLDRT 301 Query: 297 AERVTRLYHKGSFSRQELMDSA 318 AE + +G S + L D+A Sbjct: 302 AEAIVAAVTEGRLSVERLADAA 323 >UniRef50_UPI0000E87F6B beta-hexosaminidase n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F6B Length = 337 Score = 313 bits (804), Expect = 5e-84, Method: Composition-based stats. Identities = 138/339 (40%), Positives = 207/339 (61%), Gaps = 6/339 (1%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M +M+DV LD+ + + ++HPLVGG+ILF+RNY + QL+ LV+ IR ++ +++A Sbjct: 1 MSCLMIDVASLSLDSNDIKRISHPLVGGVILFSRNYENKKQLKLLVKSIRKI-KHNILIA 59 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFA 118 VD EGGRVQRFR+ FT+LPA +L EE ++A+ GWL+A E+ DID SF Sbjct: 60 VDHEGGRVQRFRDDFTQLPAMALLGSLFDKDPEEATRIAKLCGWLIAKELGGCDIDFSFT 119 Query: 119 PVLDVGHISA-AIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 PVLDV + S+ IG R++H + Q + +A+ +G+ +AGM+ GKHFPGHG + AD+H Sbjct: 120 PVLDVNYESSSVIGNRAFHKEIQPTIHLANSLTEGLGQAGMQGVGKHFPGHGFIKADTHL 179 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQ 237 E D R I+ DM VF +LI EN L IMP+HVIYS D +P+ S +WL+ LR+ Sbjct: 180 ELGEDERAFEVIKESDMLVFKALI-ENGLKGIMPSHVIYSLCDDKPSGLSHFWLQDQLRK 238 Query: 238 ELGFDGVIFSDDLSMEGAA-IMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 LGF+G IFSDD+SM+ A + +R +L AGCDM+LVCN + VL +L Sbjct: 239 NLGFNGAIFSDDMSMQAAVFTEKNVTKRVLKALQAGCDMVLVCNAGEDVDIVLTHLDWPV 298 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQ 335 +++ ++++ + + +T +L ++ Sbjct: 299 SKKSQCRIKAMGLNKKKNQQTQYMRLTATEAGKLILKFN 337 >UniRef50_C5EIW3 Glycoside hydrolase n=2 Tax=Clostridiales RepID=C5EIW3_9FIRM Length = 446 Score = 313 bits (804), Expect = 5e-84, Method: Composition-based stats. Identities = 79/328 (24%), Positives = 142/328 (43%), Gaps = 29/328 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVAVDQEGG 66 + RE L VGGLI F+ N D QL+ + + + L +++D+EGG Sbjct: 129 TQAGEVTREALRQYPVGGLIYFSNNLEDAGQLKTMTGDTQKMAMEEIGIPLFLSIDEEGG 188 Query: 67 RVQRF--REGF--TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 + R +GF ++P+ A + A AG + + + +++ FAP D Sbjct: 189 DIARIGKHDGFRVAQVPSMAVIGASGDRVQ----AYRAGSTIGAYLHEYGLNLDFAPDAD 244 Query: 123 V--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + + RS+ +PQ ++ +++G+ E G+ KHFPGHGA DSH+ Sbjct: 245 VLTNPDNTVVKTRSFGRNPQLVTEMSLAYLEGLWEQGVYGAPKHFPGHGATKGDSHEGFA 304 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKTVLRQE 238 + E+ ++ F +++ N++ IM H+ + P S S ++ LR Sbjct: 305 YTDKTWEELEEAELKPFYAMVG-NQVPFIMAGHISLPAITGGDLPCSLSGQVIQGYLRDR 363 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 +G++G+I +D L+M + +L AG DM+L+ + P E Sbjct: 364 MGYEGIIITDALNMGAIEDHYPSGQAAVMALQAGVDMLLMPADF-----------PSAYE 412 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTR 326 V G L D++ + +S + Sbjct: 413 AVVEGAGNGEIPLDRL-DASVLRILSLK 439 >UniRef50_B3C6M6 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B3C6M6_9BACE Length = 1051 Score = 313 bits (803), Expect = 7e-84, Method: Composition-based stats. Identities = 77/347 (22%), Positives = 135/347 (38%), Gaps = 34/347 (9%) Query: 1 MGPVMLDVEGYELDAEEREILAH----PLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + + D R++L VGGL+ + L + + + Sbjct: 68 IGQLFIYTIAPQQDKANRDLLRKVVDGYKVGGLLFSGGLMEN---QAILTNEAQKMADVP 124 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++ D E G R R G P + L E G MA + + + ++ Sbjct: 125 LMITFDGEWGLSMRLR-GTPVFPRNMILGCIQN----DSLLYEYGREMARQCRELGVQVN 179 Query: 117 FAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV DV + I RS+ P + G+ E G+ + KHFPGHG D Sbjct: 180 FAPVADVNINPKNPVINTRSFGESPANVADKVIAYARGLEEGGVLSVSKHFPGHGDTDVD 239 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWL 231 SH P +A + + ++ F I + L +M H+ ++ P+S S + Sbjct: 240 SHHSLPKLTFSRARLDSVELYPFKKAI-QAGLSGMMVGHLEVPILESKRGVPSSLSRSVV 298 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L QE+ F G++F+D L+M+G + + +L AG D++LV K Sbjct: 299 HDLLTQEMKFQGLVFTDALAMKGVSANNTS--ICLQALKAGHDLLLVPRRIKEE------ 350 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR--LNQLHERWQE 336 + + G + E+ R K ++ + L + + Sbjct: 351 -----VDAILDAVKNGELTEAEIEAKCR-KVLTYKYALGLSKKPFVR 391 >UniRef50_Q0VQT2 Beta-hexosaminidase n=3 Tax=Gammaproteobacteria RepID=Q0VQT2_ALCBS Length = 345 Score = 313 bits (803), Expect = 7e-84, Method: Composition-based stats. Identities = 169/320 (52%), Positives = 205/320 (64%), Gaps = 14/320 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +MLD+EG E+ A E+++LAHP GGLILF RNY D QL ELV+Q+RA R +++AVDQ Sbjct: 17 LMLDLEGLEVSASEKDMLAHPGAGGLILFARNYVDRQQLAELVQQVRAV-RPEMLIAVDQ 75 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSG--MEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQRFR+GF RLP + + A G LMASE+ +DIDISFAPVL Sbjct: 76 EGGRVQRFRDGFVRLPPMAALGQRYDQAPAQAVGEATLLGELMASELTELDIDISFAPVL 135 Query: 122 DVGH-ISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 D+ + S+ IG+RS+H D +A+A FIDGM AGM TGKHFPGHG V ADSH E P Sbjct: 136 DLDYGNSSVIGDRSFHGDADAMIALAGAFIDGMSAAGMAATGKHFPGHGHVGADSHLELP 195 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG 240 DPRP +E+ A DM F +L KLD IMPAHVIYS V+ + A S YWL+T LR+ LG Sbjct: 196 IDPRPLSELEAADMRPFMAL--APKLDGIMPAHVIYSAVEQQTAGFSRYWLQTQLRERLG 253 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERV 300 F GVIFSDDLSM GA +G Y +R QA+L+AGCDM+L CN +GA VLD L Sbjct: 254 FRGVIFSDDLSMAGAHGVGGYPQRAQAALEAGCDMVLACNCCEGAEQVLDFLGTQNF--- 310 Query: 301 TRLYHKGSFSRQELMDSARW 320 G L R Sbjct: 311 -----SGQVPAAGLRARPRL 325 >UniRef50_UPI0001C379AD glycoside hydrolase family 3 domain protein n=2 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C379AD Length = 452 Score = 311 bits (799), Expect = 2e-83, Method: Composition-based stats. Identities = 85/340 (25%), Positives = 141/340 (41%), Gaps = 34/340 (10%) Query: 3 PVML--------DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR 54 + + + R+ L VGGLI F+ N Q+ +++R + + Sbjct: 127 QMFIVRPDELTNGTDLTSATQATRDGLEEYPVGGLIFFSDNMESREQVTDMIRTTKDYAA 186 Query: 55 ----NRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEM 108 L VD+EGG V R E G + S+ ++G A +A ++ Sbjct: 187 GLGCVPLFFGVDEEGGVVSRCAEKLGTSAFEPMYSY-----KDKGPVTAYANARKLAGDI 241 Query: 109 IAMDIDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 D+ FAPV D + IG R+Y D ++ + + G E G+ T KHFP Sbjct: 242 GQFGFDLDFAPVADTWSNPDNKVIGTRAYSDDFEETAMLVGAAVRGFREGGVHCTLKHFP 301 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASG 226 GHG DSH R E+R+ + + F I+ D +M H+ +D PAS Sbjct: 302 GHGDTYEDSHDGFAVSNRTLDELRSNEYTAFDRGIK-AGADFVMVGHITVPAIDNLPASL 360 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 S + LR LG+DG++ +D L+M A S + +++AG DM+L N + +V Sbjct: 361 SSKVINDELRGTLGYDGIVITDALAMGAIANAYSSSVCAVMAVEAGNDMLLSPKNLRESV 420 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + G S + + +S R + + + Sbjct: 421 A-----------GIEDAVANGVISEERIDESVR-RILKVK 448 >UniRef50_B4CZU1 Beta-N-acetylhexosaminidase n=2 Tax=Verrucomicrobia RepID=B4CZU1_9BACT Length = 366 Score = 311 bits (799), Expect = 2e-83, Method: Composition-based stats. Identities = 87/313 (27%), Positives = 140/313 (44%), Gaps = 14/313 (4%) Query: 1 MGPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVV 59 +G ++L V G ELDA GG ILF RN QLR+L+ +R S+ ++ Sbjct: 6 LGQLLLTGVPGPELDAATAAQFRRLQPGGFILFGRNIQTANQLRKLIDDLRDLSQVEPMI 65 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 +DQEGGRV R + P A + E L + G + A + ++ P Sbjct: 66 TIDQEGGRVSRLKLVGNEPPNANQLRDKNDPE----LIRRHGTITAKLLRIFGFNLDLCP 121 Query: 120 VLDVGHI----SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 VLD+ ++ G R Y + + + A F + G+ + GKHFPG+ D+ Sbjct: 122 VLDISFDDEADNSLRG-RCYGNNVPQVIRNAGAFNSALQAGGILSCGKHFPGYSRAPLDA 180 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASGSPYWLKT 233 H E P P + E+ +++VF +K+D++M H Y ++ P+S S + Sbjct: 181 HHELPEIPLSREELETHELAVFKH--FADKVDSMMIGHGFYPALESSRTPSSLSHRVITD 238 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR+E GF G+I +DDL M E + ++ AG DM ++C+ L L Sbjct: 239 LLRKEQGFKGLIMTDDLDMGAILNHYGLEETIRLAITAGNDMAMICHRVAVVEEALGYLE 298 Query: 294 PIKAERVTRLYHK 306 + + R Sbjct: 299 RVPVADLDRALEN 311 >UniRef50_B9WKL4 Beta-n-acetylglucosaminidase, putative n=4 Tax=Saccharomycetales RepID=B9WKL4_CANDC Length = 960 Score = 311 bits (798), Expect = 2e-83, Method: Composition-based stats. Identities = 84/349 (24%), Positives = 150/349 (42%), Gaps = 26/349 (7%) Query: 2 GPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRN----- 55 G ++ +G + + ++ V +IL +N + Q+ +L++ ++ + Sbjct: 7 GQLLCGGFQGTTVTPQAYHLIVEHHVSAMILSRKNAINVEQMSKLIKDLQYIAMTEGRYT 66 Query: 56 -RLVVAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 ++ A+D+EGG + + T+ P A + AA +L E +A+E+ + Sbjct: 67 YPIMFAIDEEGGMMNSLFDPDFLTQYPGAMALAATGDP----QLVYEISRAIATEIKKIG 122 Query: 113 IDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 I PVLDV +G RS+ + + G+ + G+ T GKHFPG G Sbjct: 123 FSIILGPVLDVVTKLSHQLVGVRSFGTTIEDVVKYGRMCARGLQDGGLFTVGKHFPGIGN 182 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSP 228 DS E P +++ ++ F LI E LD I A ++ D A SP Sbjct: 183 AAVDSLLELPMMGDSLEQLKHFNVVPFGELINEGLLDGISAAGCGVPNISPDETHACLSP 242 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSV 288 + +LRQEL FDG + S+ L ME + +++AGCD+++VC++ Sbjct: 243 IVINQLLRQELRFDGFVISECLEMEALYHSVGLGQGVILAINAGCDLVMVCHDMALQNEA 302 Query: 289 LDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHERWQE 336 + + + G+ + L+ S R K + TRL + + E Sbjct: 303 --------TDSIKKAVVNGNLDEETLIKSLKRIKKLQTRLPSWAQIFPE 343 >UniRef50_Q1AUI8 Glycoside hydrolase, family 3-like protein n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AUI8_RUBXD Length = 386 Score = 310 bits (796), Expect = 4e-83, Method: Composition-based stats. Identities = 99/332 (29%), Positives = 160/332 (48%), Gaps = 24/332 (7%) Query: 2 GPVML-DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----N 55 G + + + G D +++ VGG++LF N Q+R L +++ S Sbjct: 63 GQMFVVSLGGTRSDYYIEKMVRERNVGGVLLFGYNMRSLPQVRRLTGELQRLSMATEPAV 122 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L+VAVDQEGG VQ + P+A A E A+ + SE++ ++ Sbjct: 123 PLLVAVDQEGGEVQS-APWVSPQPSAAEIGASGDPEA----ARRVARRIGSELLEAGVNT 177 Query: 116 SFAPVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APV+D G AAIG RSY +P+ + S ++G EAG+ + KHFP HG T+DS Sbjct: 178 DLAPVVDTGFG-AAIGSRSYGENPELVSRMGSAAVEGFREAGVISAAKHFPNHGPATSDS 236 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTV 234 H P + ++D+ F++ +R + +M H++Y +D PAS SP ++ + Sbjct: 237 HTGLPIVDHDMDTVLSRDLPPFAAAVR-AGVPMVMVGHLLYPAIDQENPASMSPQAIR-L 294 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LR+ LGFDGV+ +DDL+M GA GS A ++ AG DM++V + D Sbjct: 295 LRERLGFDGVVITDDLAMAGARQGGSVARAAVRAVSAGADMLIVSSPPPQQAEAYD---- 350 Query: 295 IKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 + R G +++ S R + + + Sbjct: 351 ----ALVRAVRSGGIPERQVRASVR-RIVEMK 377 >UniRef50_C7QCS9 Glycoside hydrolase family 3 domain protein n=6 Tax=Actinomycetales RepID=C7QCS9_CATAD Length = 594 Score = 310 bits (796), Expect = 4e-83, Method: Composition-based stats. Identities = 83/325 (25%), Positives = 142/325 (43%), Gaps = 18/325 (5%) Query: 5 MLDVEGYELDAEEREILAHPLVGGLILFTRNYH-DPAQLRELVRQIRAASRNRLVVAVDQ 63 ++ + + R L +GG+ + RN P + EL +RA + + ++VAVD+ Sbjct: 17 FVNDDSGRVPDWVRRALTEDGLGGVAYYGRNIAATPERTGELSAALRAENPH-VLVAVDE 75 Query: 64 EGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 EGG V R + P + + E L + + + I++++AP D Sbjct: 76 EGGDVTRLDVATGSAFPGNYALGYVDDAE----LTETVARQIGRSLWRAGINLNYAPDAD 131 Query: 123 VG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 + + IG RS+ +DP A ++ ++ G+ AG+ KHFPGHG DSH P Sbjct: 132 LNTNPDNPVIGTRSFGSDPVVAARHSAAYVRGLQAAGVAGCAKHFPGHGDTAVDSHHGLP 191 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQEL 239 E ++ F + I E + ++M AH++ DP P++ S L +LR EL Sbjct: 192 LVQHSDKEWEK-HLAPFRAAI-EAGVKSVMTAHILAPRYDPEFPSTMSRKVLVDLLRGEL 249 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 GF G++ +D + M A AE +L AG D + V + L + + Sbjct: 250 GFTGLVVTDGIEMAAIADTFGIAEGTVLALAAGVDAVCVGGGLRDEGDYL-----MLRDA 304 Query: 300 VTRLYHKGSFSRQELMDSA-RWKAI 323 + G + L D+A R + + Sbjct: 305 LVDAVRAGRLPAERLHDAATRVREL 329 >UniRef50_Q6MFP1 Related to beta-N-acetylglucosaminidase n=9 Tax=Sordariomycetes RepID=Q6MFP1_NEUCR Length = 1085 Score = 310 bits (795), Expect = 4e-83, Method: Composition-based stats. Identities = 79/341 (23%), Positives = 146/341 (42%), Gaps = 26/341 (7%) Query: 1 MGPV-MLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-----R 54 +G + ++ +G E+ + RE++ +G ++L +N Q +LV++++ + Sbjct: 16 IGQMLIMGWDGTEVTPQIRELITDHHLGSILLTAKNLKSAHQTAKLVQELQTIAHNAGHP 75 Query: 55 NRLVVAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L++A+DQE G V E + P++ AA + L+ + A+E+ A Sbjct: 76 YPLLIALDQENGGVNSLFDEEHICQFPSSMGQAAAGSPD----LSYQLAKATATEVSACG 131 Query: 113 IDISFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 +++ PVLDV +G R+ DPQ+ + G +AG+ T GKHFP +G Sbjct: 132 VNLILGPVLDVLTNARYQPLGVRATSDDPQEVSQYGIAAMKGYKDAGVATCGKHFPSYGN 191 Query: 171 VT-ADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAH--VIYSDVDPRPASGS 227 + S + P + E+ + F + I KLDA+ + + A S Sbjct: 192 LDFLGSSLDIPIITQTLEELALSALVPFRNAIATGKLDAMFVGGCGITNPSMKVAHACLS 251 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 + +LR+ELGF GV S+ L ME ++ AGCD++L+C + Sbjct: 252 EQVVDELLREELGFTGVAISECLEMEALRTEMGVRTGTIMAVQAGCDLVLLCRAYDVQLE 311 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLN 328 + L V +++ + S R + + + Sbjct: 312 AISGLKLGLENEV--------LTKERVYTSLR-RVLKMKRG 343 >UniRef50_Q2S5L3 Beta-N-acetylglucosaminidase n=2 Tax=Rhodothermaceae RepID=Q2S5L3_SALRD Length = 979 Score = 310 bits (795), Expect = 5e-83, Method: Composition-based stats. Identities = 82/337 (24%), Positives = 134/337 (39%), Gaps = 32/337 (9%) Query: 1 MGPVMLD-VEGYELDA------EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS 53 + + + ++G +A E ++ GGLI P + ++A + Sbjct: 70 IAQLFVVRIDGEFQNANTPSYRETVALVEEFGAGGLIFGPG---TPMTQIAMANDLQAKA 126 Query: 54 RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L+VA D E G R + T P A + A A G+ A E A+ + Sbjct: 127 ERPLLVAQDTEWGVGMRI-DAATSFPPAMAIGATRDASH----AYRVGYATAREARALGV 181 Query: 114 DISFAPVLDVGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 +APV DV + RS+ P +AS F G G T KHFPGHG Sbjct: 182 HQLYAPVADVNNNPKNPIINVRSFGESPSLVGTMASAFTRGAQRGGTLATVKHFPGHGDT 241 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPY 229 DSH P ++ + + ++ F + + ++M H+ ++ D PA+ S Sbjct: 242 DVDSHINLPVLRFDRSRLDSLELPPFRQAF-DAGVRSVMTGHLALPEIAADSVPATLSRP 300 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 +LR+ELGFDG++ +D L+M+ E L+AG D++L+ N A Sbjct: 301 LTHGLLREELGFDGLVVTDALNMQAVTRTFGVGETAVRVLEAGADLVLMSTNPHAAHQA- 359 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 V + G E+ DS R + + + Sbjct: 360 ----------VRQAVTSGRIDTTEINDSVR-RLLRVK 385 >UniRef50_UPI0001C334F9 beta-glucosidase-like glycosyl hydrolase n=1 Tax=cyanobacterium UCYN-A RepID=UPI0001C334F9 Length = 521 Score = 310 bits (795), Expect = 5e-83, Method: Composition-based stats. Identities = 81/309 (26%), Positives = 134/309 (43%), Gaps = 21/309 (6%) Query: 14 DAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFRE 73 + + L +GG+IL ++ RQ++ S L +A D E G QRF Sbjct: 24 NDKLFSWLNTLNIGGVILLGG---SAGEITLRTRQLQKLSNIPLFIAADIEEGVGQRFSG 80 Query: 74 GFTRLPAAQSFA--ALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAIG 131 G T P + A ++ KLA G ++A E +++ I+ AP++D+ + Sbjct: 81 G-THFPPPMALGEIAKHDLKTATKLAYSMGEMIAEEALSIGINWMLAPIVDINNNPNNPV 139 Query: 132 E--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEI 189 RS+ P +A FI G + TT KHFPGHG + DSH E P + Sbjct: 140 INIRSFGETPDVVSQLAKAFIKGAKNHPILTTAKHFPGHGNTSNDSHIELPVINHDSKRL 199 Query: 190 RAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFDGVIFSD 248 ++ F I K+D+IM AH++ + D PA+ S L LR++L F G+I +D Sbjct: 200 NNLELVPFKEAISL-KIDSIMSAHILINAWDRENPATLSKKILTKQLREKLNFKGLIITD 258 Query: 249 DLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGS 308 L M G + S E ++ AG +++L+ ++ + + + G Sbjct: 259 ALMMGGISKYFSPKEIAIRAIKAGANVLLMPDDPENTIKA-----------IYDSVISGE 307 Query: 309 FSRQELMDS 317 + + S Sbjct: 308 LEIETIDQS 316 >UniRef50_A9E2K6 B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein n=1 Tax=Kordia algicida OT-1 RepID=A9E2K6_9FLAO Length = 366 Score = 310 bits (794), Expect = 7e-83, Method: Composition-based stats. Identities = 84/343 (24%), Positives = 171/343 (49%), Gaps = 23/343 (6%) Query: 1 MGPV-MLDV-EGYEL--DAEEREILAHPLVGGLILFTRNYH---DPAQLRELVRQIRAAS 53 +G + ML + + E+ D + + +GG++++ +N L+ L+ +++ + Sbjct: 33 IGQMLMLGIGKDTEIRSDNKILADIKAQKLGGVLIYEKNITKLNSVENLKRLISKLKKHA 92 Query: 54 RNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAM 111 + L+V++D+EGG+V R ++ GF + +A+ ++ ++ + ++A ++ + Sbjct: 93 KLPLLVSIDEEGGKVNRLKQKYGFPKTVSAKYLGEVNDIDS----TKYYSDIIAHNLLQL 148 Query: 112 DIDISFAPVLDV-GHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGH 168 I++++APVLDV + IG+ R++ +PQ + A + + +KT KHFPGH Sbjct: 149 GINVNYAPVLDVHNDNNPPIGKNHRAFSKNPQDIINHARQVVLSHRYFNVKTVVKHFPGH 208 Query: 169 GAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP--RPASG 226 G DSH + + +++ + L+ E +DAIM AH++ +D PA+ Sbjct: 209 GNSRQDSHYNVTDVSK---YWQQQEVFPYIKLLYEGNVDAIMTAHIVNEKLDDSKLPATL 265 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 S + LR +LGFDGVIFSDD+ M+ A + + Q ++ AG D+++ N+ Sbjct: 266 SKKIMTDYLRGDLGFDGVIFSDDMQMKAIADQFGFEKSIQMAIHAGVDVLMFSNHIPMKG 325 Query: 287 SVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + L + V ++ G S + + +S + + + Sbjct: 326 RDM-ILPQDIIDIVKKMIADGEISEKRIDES-YQRILKFKKGL 366 >UniRef50_C6HUU9 Beta-N-acetylhexosaminidase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HUU9_9BACT Length = 321 Score = 309 bits (793), Expect = 9e-83, Method: Composition-based stats. Identities = 104/299 (34%), Positives = 163/299 (54%), Gaps = 10/299 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 + + + G +L A +RE L G+++F N + AQ++ L+ +IR+ + ++AVDQ Sbjct: 8 LWVSLPGPDLTAADREFLERVDPLGVVIFRENVRNAAQVKSLIEEIRSVPGSPRLLAVDQ 67 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGGRV R +G LPA + A E +EAG+ + + + D+ FAPVLDV Sbjct: 68 EGGRVARLSDGVPALPAMKILGAQPERE-----IEEAGFSLGEALHRLGFDVDFAPVLDV 122 Query: 124 --GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPC 181 + IG+R++ +DP++A A F G+ AG+ GKHFPGHG + DSH P Sbjct: 123 DSNPQNPIIGDRAFSSDPEEAGVKALAFARGLLRAGILPCGKHFPGHGDTSLDSHVALPV 182 Query: 182 DPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQELG 240 P + + A+++ F I E ++ +M AHV+Y +D PA+ S + +LR++L Sbjct: 183 VDAPLSLLEARELVPFRMAI-EREIPLMMTAHVVYPSIDKAVPATLSRAVVTGILREKLN 241 Query: 241 FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 + G++ SDDL M A G E AS+DAGCD +LV ++ A++V D L I Sbjct: 242 YGGLVLSDDLLMAAVA-RGGVVEAALASIDAGCDGLLVLKSQSQALAVADRLEEILRGE 299 >UniRef50_A5DD90 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DD90_PICGU Length = 990 Score = 309 bits (793), Expect = 9e-83, Method: Composition-based stats. Identities = 85/355 (23%), Positives = 148/355 (41%), Gaps = 27/355 (7%) Query: 2 GPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS------R 54 G ++ EG + + ++ + + L +N Q+ +L++ ++ + + Sbjct: 4 GQLLCGGFEGATVTPQAYHLIVDLHISTMFLSKKNAVSVQQMAKLIQDLQYIAFTRGNYK 63 Query: 55 NRLVVAVDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L+ A+D+EGG + E T+ P A AA E L + +A+E+ + Sbjct: 64 YPLMFAIDEEGGMMNSLFDPENLTQFPCAMGLAATGNTE----LVYKVSKALATELKHIG 119 Query: 113 IDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 + PVLDV +G RS+ + G+ + G+ T GKHFPG G Sbjct: 120 FSMILGPVLDVVTKLSHQLVGVRSFGTTLESVTRYGMACARGLQDGGLITMGKHFPGIGN 179 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSP 228 T DS E P IR + F+ LI+E LD I A + D A SP Sbjct: 180 ATVDSLLELPMVTDSIDSIRRLNCVPFAELIKEGVLDGISAAGAGVPTISPDEVHACLSP 239 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSV 288 L +LRQEL FDGV+ S+ L M+ + ++ AGCD+++VC++ Sbjct: 240 VVLNQLLRQELKFDGVVISECLEMDALHHSVGLGQGVILAVFAGCDLVMVCHDPNLQNEA 299 Query: 289 LDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQE-EKAGH 341 A+ + + G+ + S R + + +L + E + + ++ Sbjct: 300 --------ADSLEKAVANGNLDEDIVNASLERIEKLQKKLPKWSEIFPQGSQSAQ 346 >UniRef50_A9WLT0 Predicted O-Glycosyl hydrolase, family 3 n=1 Tax=Renibacterium salmoninarum ATCC 33209 RepID=A9WLT0_RENSM Length = 486 Score = 309 bits (792), Expect = 1e-82, Method: Composition-based stats. Identities = 86/339 (25%), Positives = 148/339 (43%), Gaps = 17/339 (5%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAV 61 G + G + + L+ + G++ F +N QL++L QIR + + ++ V Sbjct: 12 GVIWPGFLGSSVPDWLNDALSDG-LAGVVYFRQNIESAKQLKDLSAQIRTLNPS-AIIGV 69 Query: 62 DQEGGRVQRF-REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 D+EGG V R E ++LP A + + + AG + I+++ APV Sbjct: 70 DEEGGNVTRLQTEHGSQLPGAAVLGR----RDELAVTEAAGTAIGRLCRQSGINLAIAPV 125 Query: 121 LDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 DV + IG RS+ AD + + G+ + G+ KHFPGHG DSH Sbjct: 126 ADVNTNPANPVIGVRSFGADTGLVARHTAAMVRGIQKTGIAACAKHFPGHGDTHTDSHLG 185 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 P A++R + + F + E ++A+M AH++ +D+ PA+ + L+ +LR E Sbjct: 186 LPRLEFDVAQLREQHLPPFQAA-SEAGVEAVMTAHIVLADLGEEPATTNSSTLQ-MLR-E 242 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCN----NRKGAVSVLDNLSP 294 GFDGVI +D L M ++ AG D++ V N Sbjct: 243 SGFDGVIITDALDMAAIRETVGSGAGAVKAILAGTDLLCVGNPANPRASADPEPDRTDYL 302 Query: 295 IKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHE 332 + + G+ R+ L + R ++++TR+ E Sbjct: 303 EVRDALLAAVQSGALPRERLKAAYQRNQSLATRIALATE 341 >UniRef50_D2QKL5 Glycoside hydrolase family 3 domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QKL5_9SPHI Length = 1002 Score = 308 bits (790), Expect = 2e-82, Method: Composition-based stats. Identities = 74/346 (21%), Positives = 148/346 (42%), Gaps = 27/346 (7%) Query: 1 MGPVML--DVEG--YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 + +++ + ++ +GG+++F P + +L +++A S Sbjct: 53 IAQLIMVAGYSNRKPAYEDSLIRLVQTNKLGGVVMFQG---GPVRQAQLTNRLQAGSPVP 109 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++A+D E G R R P + A+ G L + G +A + + + ++ Sbjct: 110 LLIAMDAEWGIAMRLDS-TVRYPYQMTLGAMQGAASD-SLIYQMGANLAKQARRLGMHVN 167 Query: 117 FAPVLDVGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAP +DV + RS+ D A ++ GM + + T+ KHFPGHG D Sbjct: 168 FAPSVDVNNNPNNPVINFRSFGEDKYAVARKALAYMRGMQDNQLLTSIKHFPGHGDTGTD 227 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASGSPYWL 231 SH + P + + ++ + ++ F LIR +M AH+ +D RP++ SP + Sbjct: 228 SHYDLPLIAKSRTQLDSLELYPFRELIR-AGATGVMIAHLNIPALDTTRNRPSTLSPAIV 286 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +L+ ELGF G++FSD ++M+ + + L+AG D++ + Sbjct: 287 TNLLKNELGFKGLVFSDAMNMKAVTKFYPSGKADELGLEAGMDVLEFTED---------- 336 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEE 337 P +V + G ++ + D+ K + + + ++ Sbjct: 337 -VPAALAQVKQAIVDGRITQASI-DARCLKVLQAKAWAGLDMYKPI 380 >UniRef50_A4CPY8 Glycosyl hydrolase, family 3 n=1 Tax=Robiginitalea biformata HTCC2501 RepID=A4CPY8_9FLAO Length = 534 Score = 308 bits (789), Expect = 3e-82, Method: Composition-based stats. Identities = 91/340 (26%), Positives = 143/340 (42%), Gaps = 32/340 (9%) Query: 2 GPVMLDVEGYELDAE----EREILAHPLVGGLILF----------------TRNYHDPAQ 41 G + + E E++ +GGL F N Sbjct: 21 GQLFMPAAFINDTEEAIVRLEELIREFHIGGLCFFHSPASAATNFEGKKEIPHNPDSLGT 80 Query: 42 LRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAG 101 L+EL+R+ + A+R L++A+D E G R G P A ALS + L + G Sbjct: 81 LKELIRRYQRAARYPLLIAMDAEWGLAMRVENGE-PYPYALCLGALSDRDP---LLYQVG 136 Query: 102 WLMASEMIAMDIDISFAPVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMK 159 MA++ I + APV DV + IG R++ P+ AS F G+ + G+ Sbjct: 137 LRMAADCRDAGIHWNLAPVADVNCNPGNPVIGYRAFGTTPEGVARRASLFYRGLKDGGIL 196 Query: 160 TTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV 219 GKHFPGHG DSH P + AE+ + + F L+ E +DAIM H+ + Sbjct: 197 GCGKHFPGHGDTAVDSHLALPAIDKGAAELESTEFVPFRRLVAEG-VDAIMTGHLAVPAL 255 Query: 220 DPR--PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMG-SYAERGQASLDAGCDMI 276 DP PAS SP + +RQ LGFDG + SD L+M + E + AG DM+ Sbjct: 256 DPEGLPASLSPAVIGQGIRQNLGFDGAVISDALNMHAVSKNYPGPGEVAWRAFAAGNDML 315 Query: 277 LVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMD 316 + A + + +++ + + ++ + + Sbjct: 316 CFAEDIPEAHRRI--CREGETDQIEASFRRVWQLKERVRE 353 >UniRef50_B8NYN9 Beta-N-acetylglucosaminidase, putative n=1 Tax=Aspergillus flavus NRRL3357 RepID=B8NYN9_ASPFN Length = 733 Score = 308 bits (789), Expect = 3e-82, Method: Composition-based stats. Identities = 81/327 (24%), Positives = 134/327 (40%), Gaps = 25/327 (7%) Query: 6 LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR-----NRLVVA 60 + +G +D + R ++ + +G ++L +N LV +++ +R L++A Sbjct: 1 MGFDGTSVDPQIRSLIENYHLGSILLTAKNLKSAEDATRLVLELQTIARDAGHPVPLLIA 60 Query: 61 VDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +DQE G V + + P+A AA LA + A E+ A+ I+ Sbjct: 61 LDQENGGVNSLYDEIYIRQFPSAMGIAATGS----KALAHDVAVATAQELKAVGINWILG 116 Query: 119 PVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVT-ADS 175 PVLDV S +G R+ DPQ+ F+ G EAG+ T GKHFP +G + S Sbjct: 117 PVLDVLTNVRSQPLGVRTTGDDPQEVSQYGVEFMKGYKEAGLVTCGKHFPSYGNLEFLGS 176 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSD--VDPRPASGSPYWLKT 233 + P ++ + F + I +D++M V S A S + Sbjct: 177 QSDVPIITESLEQLSLSALVPFRNAILHG-IDSMMVGGVSMSSAGFHVMHACLSEQVVDD 235 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR+++ FDGV+ S+ L ME + AGCD+IL+C + + L Sbjct: 236 LLRKKMKFDGVVVSECLEMEALTHNIGVGGGTVMAKKAGCDIILLCRSFPVQQEAITGLK 295 Query: 294 PIKAERVTRLYHKGSFSRQELMDSARW 320 G R + S R Sbjct: 296 LG--------VENGIIGRARIEQSLRR 314 >UniRef50_B2IHJ4 Glycoside hydrolase family 3 domain protein n=3 Tax=Rhizobiales RepID=B2IHJ4_BEII9 Length = 370 Score = 306 bits (786), Expect = 6e-82, Method: Composition-based stats. Identities = 98/328 (29%), Positives = 154/328 (46%), Gaps = 27/328 (8%) Query: 5 MLDV-EGYELDAEEREILAHPLVGGLILFTRNYHDP---AQL----RELVRQIRAAS-RN 55 ++ + EL ++R +LA G+IL+ N+ AQ L+ I AA R Sbjct: 19 LVGLRPTPELHDQDRALLADLRPAGVILYKSNFRHDLPYAQWLEIHARLIEDIYAACGRE 78 Query: 56 RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDI 115 L + +D EGGRV R TR ++AA G A G M E+ ++ I++ Sbjct: 79 HLWIGIDHEGGRVCRTPAPITRF----AYAAHW-----GDCAAAVGQAMGCELASLGINL 129 Query: 116 SFAPVLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTA 173 +FAPVLD+ + IGER++ P+ +A F+ M + GKHFPGHG + Sbjct: 130 NFAPVLDIHTNPANPVIGERAFATTPEAVTPLALGFLRAMQAENVLGCGKHFPGHGDTSV 189 Query: 174 DSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLK 232 DSH P ++R +++ F + I E L IM +H+++ +DP P + S + Sbjct: 190 DSHYGLPSQNCSLEDLRQRELKPFIAAI-EAGLSMIMTSHILFPAIDPDFPVTLSRRFCT 248 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +LR+ELG+ GV+ SDD+ M + + AGCD+++VC + V D Sbjct: 249 ELLREELGYSGVVVSDDIGMGAMNQFFAEPTAAARLIAAGCDLLMVCAHWTETERVRD-- 306 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARW 320 AE + R KG + +L S Sbjct: 307 ---FAESIVRARDKGEITPDQLTVSKVR 331 >UniRef50_C3J950 Glycosyl hydrolase, family 3 n=2 Tax=Bacteria RepID=C3J950_9PORP Length = 1001 Score = 306 bits (785), Expect = 7e-82, Method: Composition-based stats. Identities = 83/346 (23%), Positives = 151/346 (43%), Gaps = 30/346 (8%) Query: 1 MGPVMLDV----EGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G +++ + G A+ ++A GG++ +D Q + + + S+ Sbjct: 60 IGQLIIPIIYPSSGASQMADAARVVARCHAGGILFQKGEVYD--QWL-MTQHLNKTSKLP 116 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++ D E G R R P ++ A + + L E G MA + + I ++ Sbjct: 117 LLITADAEWGLAMRLSNTI-RYPRNRALANVRDL----GLITEWGRSMAEQCKTVGIHVN 171 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPV+DV + + IG RS+ Q + + + G+ E + + KHFPGHG D Sbjct: 172 FAPVVDVNNNPKNPVIGTRSFGEKTQDVIDYSLAYSRGLEEGRVLSVAKHFPGHGNTALD 231 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLK 232 SHK P ++E+ +++ F + L IM H+ +D PAS S Sbjct: 232 SHKTLPTIKGSRSELEKTELAPF-AAYFRAGLGGIMVGHLNVPALDKSGTPASLSYAITT 290 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +L+QE+GF G+IF+D + M+G G+ +L AG D++L + A + Sbjct: 291 QLLQQEMGFSGLIFTDGMQMQGMQQRGATP-ISVRALLAGNDLLLGPTDPVKAHA----- 344 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEK 338 + +G SR+++ R K + + + + Q+ + Sbjct: 345 ------EILAAVQQGVLSRKQIEQHCR-KVLLYKWALMGQSLQQPQ 383 >UniRef50_Q1IHT6 Glycoside hydrolase, family 3-like n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IHT6_ACIBL Length = 624 Score = 306 bits (785), Expect = 8e-82, Method: Composition-based stats. Identities = 87/351 (24%), Positives = 157/351 (44%), Gaps = 47/351 (13%) Query: 2 GPVML-----DVEGYELDA--EEREILAHPLVGGL---------ILFTRNYHDPAQLREL 45 G + + + + R+ +A +GG LF +P + + Sbjct: 55 GQMFMIWSKAQFVNVQSPDFLKLRDTMARYHLGGFGVTVNFEDGFLFK---TEPYEAAMM 111 Query: 46 VRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMA 105 + +++ S L++A D E G R +E T P A +F A + A+E G + A Sbjct: 112 INELQQGSEIPLIIAADFERGLSMRLKE-ATDFPHAMAFGATNNP----AYAKEFGRITA 166 Query: 106 SEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGK 163 E A+ ++ ++ P DV + I RS+ DPQ ++ +I+G H G+ TT K Sbjct: 167 LESRAIGVEWNWFPDADVNSNPANPIINTRSFGEDPQAVASMVKAYIEGAHAEGLLTTVK 226 Query: 164 HFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-- 221 HFPGHG D+H T +P I+ ++ F + I + +D++M H++ +DP Sbjct: 227 HFPGHGDTDTDTHLATARINQPLEHIQNVELVPFKAAI-DAGVDSVMIGHLVVPALDPDT 285 Query: 222 -RPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYA-----ERGQASLDAGCDM 275 R A+ SP + L+++LGF G++ +D + M G A + + A++ AG DM Sbjct: 286 NRVATISPKIVNGTLKKDLGFQGLVVTDAMEMNGLAKLFGFGPEGSARAAVAAVKAGDDM 345 Query: 276 ILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTR 326 +L+ ++ GA E + + +G + +S R K + + Sbjct: 346 LLLPSDLDGAY-----------EGLIKAVKRGEIPESRIDESVR-KILRMK 384 >UniRef50_Q1N543 Glycoside hydrolase, family 3-like protein n=1 Tax=Bermanella marisrubri RepID=Q1N543_9GAMM Length = 334 Score = 306 bits (784), Expect = 9e-82, Method: Composition-based stats. Identities = 129/331 (38%), Positives = 202/331 (61%), Gaps = 7/331 (2%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 ++ D+E + L +ER L+H + G+ILFTR++ QL+ L+ +I+A + +L++ VD Sbjct: 4 CLITDIETHSLSEDERTWLSHSYLAGIILFTRHFESKTQLKTLISEIKAIN-PKLLITVD 62 Query: 63 QEGGRVQRFREGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 EGGRVQRFR+GFTR+PA L +LA ++ ++ E++ + ID+++APV Sbjct: 63 HEGGRVQRFRDGFTRVPAMGRLGKLYQEDSARANQLAFDSAVVLCCELLDIGIDLTYAPV 122 Query: 121 LDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 LD+ + + +G+R + + + +AS FIDG+ G + KHFPGHG VT DSH Sbjct: 123 LDIDYQRNQVVGDRGFAHTKEAIVDLASHFIDGVKSIGFPSVAKHFPGHGWVTQDSHVSC 182 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQEL 239 P D R EI++ D+ VF L+ K+D IMPAHV+Y VD +PA S +W+ +LR ++ Sbjct: 183 PVDERSFEEIKSNDLWVFDHLMP--KIDWIMPAHVVYERVDEQPAGFSKHWVTNILRNDM 240 Query: 240 GFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAER 299 GF G + SDDLSMEGAA G YA R +A+LDAGC+++L CN+ +L+ LS + ER Sbjct: 241 GFQGPVVSDDLSMEGAAQTGGYASRAKAALDAGCNVMLACNSSVAPKEILEFLSHSEYER 300 Query: 300 VTRLYHKGSFSRQELMDSARWKAISTRLNQL 330 + L S ++ ++D ++ R+ + Sbjct: 301 L-DLSEYQSKPQKMILDMPVYQQALARIMEF 330 >UniRef50_C0BK44 Glycoside hydrolase family 3 domain protein n=1 Tax=Flavobacteria bacterium MS024-2A RepID=C0BK44_9BACT Length = 969 Score = 306 bits (784), Expect = 1e-81, Method: Composition-based stats. Identities = 84/344 (24%), Positives = 153/344 (44%), Gaps = 29/344 (8%) Query: 1 MGPVMLDVEGYELDA----EEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G + + + E D+ + ++ + VGGL+ + P + + ++ ++ Sbjct: 52 IGQLFMPMVFSERDSAHYNQTLNLVKNEKVGGLVF---SLGGPVGQSQWLNSFQSHAKTP 108 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L+VA+D E G R P + A+ L + G M + + I S Sbjct: 109 LLVAMDAEWGVAMRLDS-VKAFPWPMTLGAVKD----SVLIRAIGKRMGEQEKRLGIQYS 163 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 F+PVLD+ + IG RSY + ++ A + G H+AG+ T+GKHFPGHG + D Sbjct: 164 FSPVLDINTNANNPIIGNRSYGSSNERVSRQALAIMQGHHDAGVLTSGKHFPGHGDTSQD 223 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLK 232 SHK P I +++ + +LI E L ++M AH+ + + P S S ++ Sbjct: 224 SHKTLPTVNFSADRINTIELAPYKALI-EKGLSSVMVAHLNVPSITKEGLPTSLSSDVIQ 282 Query: 233 TVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +L+++LGF G+I +D L+M+G + + AG D++L+ NN ++ Sbjct: 283 GLLKRDLGFKGLIVTDALNMKGVSEYSKVDNIDLTAFLAGHDLLLISNNIYEGINA---- 338 Query: 293 SPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 + Y KG + L S + K + + ++ Sbjct: 339 -------IKNAYRKGKVTEARLAYSVK-KILRAKYKVGLAHYKP 374 >UniRef50_A0L847 Beta-N-acetylhexosaminidase n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L847_MAGSM Length = 364 Score = 306 bits (784), Expect = 1e-81, Method: Composition-based stats. Identities = 111/322 (34%), Positives = 163/322 (50%), Gaps = 18/322 (5%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ + G L ER L G++LF+RN P Q+ L+R IR S + V+ +DQ Sbjct: 10 LVVTLAGTRLSEPERAYLQRVEPAGVMLFSRNIEQPEQVVALIRAIRQQSPS-TVIWIDQ 68 Query: 64 EGGRVQRFREGFTRLPAAQSFAA--LSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVL 121 EGGRVQR R TRLPAA+SFA ++ +LA++ G L E+ A I I+ APV+ Sbjct: 69 EGGRVQRLRCPLTRLPAAESFAKIYAYAPQQAERLARQCGALAGRELRAFGIGINCAPVV 128 Query: 122 DVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKET 179 D+ IG R++ +P +A ++ G+ EAG+ KHFPGHG DSH+ Sbjct: 129 DIPAPGADPVIGNRAFGDNPLSISRLAGAWLQGLQEAGVMGVAKHFPGHGDAMVDSHRSL 188 Query: 180 PCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVLRQE 238 P E+ ++ + L+ L IM AH++ S +DP RPA+ S LK LRQ Sbjct: 189 PVVHTEIRELLSRTWVPYVQLLP--LLQGIMTAHLVMSALDPLRPATWSSVVLKDWLRQR 246 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LG+ G+I SD L M G+ +R ++ AGCD +LVC+ R + Sbjct: 247 LGYGGLIISDALEMGAL--GGNMEDRAARAVRAGCDQLLVCSGRLNHMQ-------QALN 297 Query: 299 RVTRLYHKGSFSRQELMDSARW 320 V + + S Q D+ R Sbjct: 298 GVQQALGEKSPQEQR-QDAQRI 318 >UniRef50_A2E8H3 Glycosyl hydrolase family 3 N terminal domain containing protein n=1 Tax=Trichomonas vaginalis RepID=A2E8H3_TRIVA Length = 474 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 82/304 (26%), Positives = 137/304 (45%), Gaps = 20/304 (6%) Query: 32 FTRNYHDPAQLRELVRQIRAAS----RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAAL 87 RN + Q+R L I+ S ++++DQE G R T P++ + AA Sbjct: 1 MGRNLRNQEQIRNLTATIQNYSYNQYNLPSLISIDQESGSNIRMFRNSTVFPSSMTIAAT 60 Query: 88 SGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHI--SAAIGERSYHADPQKALAI 145 + +E A ++ +L A E+ + I+++FAPV DV + +G+RS+ +P Sbjct: 61 NNLEN----AYKSSYLQALELNDVGINLNFAPVSDVNSNIHNPIVGDRSFGDNPNVVSKY 116 Query: 146 ASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENK 205 +I G +AG+ KHFPGHG + DSH E P R ++ D+ F + I Sbjct: 117 VQNYIKGHKKAGVLCCSKHFPGHGETSQDSHYELPRINRTFESLKEIDLPPFKASINYG- 175 Query: 206 LDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERG 265 +D IM H+I + P S S + LR ELG++GV +D + M+ Y E Sbjct: 176 VDMIMLGHLITPLDETMPCSTSSKCIN-YLRNELGYEGVTITDSMVMKAIKQEK-YNETI 233 Query: 266 QASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAIST 325 ++ AG D+I C V + + + KG + + +S + + I Sbjct: 234 LEAILAGADLICSCG------KVFEPFQYYTIDYLMECVEKGLLPMKRVEESLK-RIIKL 286 Query: 326 RLNQ 329 +L+ Sbjct: 287 KLSL 290 >UniRef50_Q5P081 Beta-hexosaminidase n=63 Tax=Proteobacteria RepID=NAGZ_AZOSE Length = 356 Score = 304 bits (780), Expect = 3e-81, Method: Composition-based stats. Identities = 146/325 (44%), Positives = 201/325 (61%), Gaps = 16/325 (4%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +GPVMLDV G+ L EERE L PLVGG+ILF RN+ D QL+ L +I A L++A Sbjct: 14 LGPVMLDVAGFALTEEERERLLDPLVGGVILFARNFRDSEQLQALTAEIHALRSPALIIA 73 Query: 61 VDQEGGRVQRFR-EGFTRLPAAQSFAALSGMEEGGKL--AQEAGWLMASEMIAMDIDISF 117 VD EGGRVQRFR +GFTR+P+ + L + L A+ AG+++A+E++A +D+SF Sbjct: 74 VDHEGGRVQRFRTDGFTRIPSMRCLGRLWERDHVAALESARCAGYVLAAELLAHGVDLSF 133 Query: 118 APVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 PVLD+ + S +G+R++H DP A+A + GM +AGM GKHFPGHG ADSH Sbjct: 134 TPVLDLDYGCSRVVGDRAFHRDPLVVAALAQSLVSGMADAGMGCVGKHFPGHGYAEADSH 193 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLR 236 E P D R I +D++ + + +L +MPAHVIY VDP PA S +WL+ +LR Sbjct: 194 VEIPVDEREFDAIWTEDIAPYRHRLGR-QLAGVMPAHVIYPRVDPNPAGFSRFWLQDILR 252 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 +GF GVIFSDDL+MEGA ++G R +A+ AGCD++LVCN AV +LD +P Sbjct: 253 GRVGFGGVIFSDDLTMEGATVVGDILARARAAFGAGCDVVLVCNRPDLAVDLLDRWAPDI 312 Query: 297 AERVTRLYHKGSFSRQELMDSARWK 321 A SR + ++ R + Sbjct: 313 APE----------SRARI-EALRSR 326 >UniRef50_A9WJ64 Glycoside hydrolase family 3 domain protein n=2 Tax=Chloroflexus RepID=A9WJ64_CHLAA Length = 515 Score = 304 bits (779), Expect = 3e-81, Method: Composition-based stats. Identities = 83/309 (26%), Positives = 143/309 (46%), Gaps = 18/309 (5%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTR-NYHDPAQLRELVRQIRAASRN----RL 57 +++ G E+ + +++ + V G+ L+ N PA++R L ++ + L Sbjct: 7 QLLIWFAGDEVPSHLPDLIFNHRVAGVTLYRHLNGASPARIRALTAHLQQLAAASGYGPL 66 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++ DQEGG++Q G T P A + A LA G + +E+ A+ +++ + Sbjct: 67 LIGADQEGGQLQAL-PGLTAFPGAMALGATQSP----ALAFRVGSAIGAELAALGVNLVY 121 Query: 118 APVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APV DV + IG R++ DP+ +A+ I G+ + G+ T KHFPGHG DS Sbjct: 122 APVCDVNTNPQNPVIGIRAFGDDPKIVGVLAAAMIAGLQQHGVAATAKHFPGHGDTFGDS 181 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD--PRPASGSPYWLKT 233 H ++ + FS+ I E + A+M H+ + RPA+ +P + Sbjct: 182 HYGLVRVNHDYERLQHVEWLPFSAAI-EAGVQAVMVGHLAVPALQAADRPATRAPQVIAH 240 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILV--CNNRKGAVSVLDN 291 LR LGF GV SD L M+ A YA ++ AG D++L+ N+ + + L Sbjct: 241 -LRHHLGFRGVTISDALDMKAFAPDDGYARALSEAVHAGIDLLLLGAGNDLEQTIKFLST 299 Query: 292 LSPIKAERV 300 + + Sbjct: 300 IDRNTKSTI 308 >UniRef50_B1YER8 Beta-N-acetylhexosaminidase n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YER8_EXIS2 Length = 412 Score = 303 bits (778), Expect = 4e-81, Method: Composition-based stats. Identities = 90/330 (27%), Positives = 143/330 (43%), Gaps = 21/330 (6%) Query: 3 PVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAA--SRNRLVV 59 ++ + V G L +R +L +GG++L N QL+ I A + + + Sbjct: 68 QLLYIGVTGTALTQADRTLLQDHALGGVLLLGGNITSTDQLKTFTAAIAEAQPADAKAFL 127 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 D+EGGRV R + +L + SF + L +E G + S ++ FAP Sbjct: 128 GFDEEGGRVSRVPDAQLKLSPSLSFGQKNDP----ALMKEVGQTLGSISRFYGFNMDFAP 183 Query: 120 VLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV + IG+R+ ADPQ+ + + GM + + KHFPGHG T DSH Sbjct: 184 VLDVNSNPANPIIGDRALSADPQQVARLGVPLMQGMKKEDIVPVVKHFPGHGDTTVDSHV 243 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD-PRPASGSPYWLKTVLR 236 P +AE+ ++ F I E + +M AH+++ +D RP+S S + VLR Sbjct: 244 GLPRVDATKAELEQTELVPFKRAIEEGT-EMVMVAHILFPALDKNRPSSLSKPVMTGVLR 302 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 EL FDGVI +DDL M A+ +L+ G DM + Sbjct: 303 DELKFDGVIITDDLVMGAITEQYGLAQAASLALENGADMAMFS---------APGAYTSV 353 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + + SR +L + + + Sbjct: 354 HKEIMARIKQDKLSRADLDQ-KVIRVLRLK 382 >UniRef50_B6BGC2 B-N-acetylglucosaminidase, glycoside hydrolase family 3 protein n=1 Tax=Campylobacterales bacterium GD 1 RepID=B6BGC2_9PROT Length = 555 Score = 303 bits (778), Expect = 4e-81, Method: Composition-based stats. Identities = 98/348 (28%), Positives = 166/348 (47%), Gaps = 38/348 (10%) Query: 4 VMLDVEGYELDA--EEREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAA 52 +++ E ++ E E + +GG++LF R N P QL+EL +++ Sbjct: 29 LVVGFENQSVNNNTEIVEYIQQYELGGVVLFDRFYKKRSQIKNIDTPEQLQELTAKLQKF 88 Query: 53 SRNRLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIA 110 SR L+++VDQEGG + R + GF R P+A+ + LS + A++ + + Sbjct: 89 SRKPLLISVDQEGGNIARLKHYYGFDRFPSAKEVSYLSLTQ-----AKDIYNRQSKMLSN 143 Query: 111 MDIDISFAPVLD--VGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFP 166 +I+ +FAPV+D + + I E RS+ D +K A ID + + + KHFP Sbjct: 144 NNINCNFAPVVDLDINPKNKVISELERSFGKDSEKVTHYAKIVIDSQTKHNVISVLKHFP 203 Query: 167 GHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PAS 225 GHG+ DSHK K++ + +LI + D IM AHV S++D PA+ Sbjct: 204 GHGSSLGDSHKGFVDIS---NTWSQKELVPYQNLINSKEADMIMSAHVFNSNLDALYPAT 260 Query: 226 GSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGA 285 S +LRQ++GF GV+ SDDL M+ + E ++++G D++L N Sbjct: 261 LSYSTNTKLLRQKMGFTGVLISDDLQMKAITDHYNLKESVTLAINSGVDILLFGN----- 315 Query: 286 VSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHER 333 + N E + G+ S++ +++S R R+ LH + Sbjct: 316 -QLASNDVKELVEIIYAQVKSGAISKKRIIESNR------RIENLHTK 356 >UniRef50_Q1NMP4 Glycoside hydrolase, family 3-like n=3 Tax=Deltaproteobacteria RepID=Q1NMP4_9DELT Length = 335 Score = 303 bits (778), Expect = 5e-81, Method: Composition-based stats. Identities = 104/334 (31%), Positives = 156/334 (46%), Gaps = 20/334 (5%) Query: 2 GPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRL 57 G M+ + G ELD RE++ V ILF+RN P QL L ++ AA R Sbjct: 13 GLFMVGLPGSELDDSSRELIERQRVNDFILFSRNVESPEQLTRLCAELAAACRAAGLPPP 72 Query: 58 VVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 ++A+DQEGG V R +E FTR P A+ A+ E A+E+ A+ I+I+F Sbjct: 73 LIAIDQEGGTVTRLKEPFTRFPDARELASGPNPEAALS---AYARTCATELRAVGININF 129 Query: 118 APVLDVGHISAAIGE--RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADS 175 APVLD+ R +P + I+ M G+ GKHFPG G D Sbjct: 130 APVLDLCPAGQGFYMERRVLGEEPATVARLGRLLIETMQGQGLAACGKHFPGLGTARLDP 189 Query: 176 HKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTV 234 H++ P R + E+ A D+ + + IM +H +Y +DP RPA+ S L + Sbjct: 190 HQDLPTLERSREELAAYDLPP-FAAAVAVPVALIMTSHTVYPALDPQRPATLSAPILTDL 248 Query: 235 LRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSP 294 LR+++G+DGVI +DDL M E A+L AG D +L+C++ L Sbjct: 249 LRRQMGYDGVIITDDLEMGAIDHNLPLEEAVTAALTAGADQLLICHDHDKVRRAL----- 303 Query: 295 IKAERVTRLYHKGSFSRQELMDSA-RWKAISTRL 327 + + R H G+ + + + R A+ R Sbjct: 304 ---KHLRRQLHTGALAPAPVSAALTRIAALRRRF 334 >UniRef50_C7G6M3 Glycosyl hydrolase domain protein n=2 Tax=Roseburia RepID=C7G6M3_9FIRM Length = 430 Score = 303 bits (776), Expect = 7e-81, Method: Composition-based stats. Identities = 73/331 (22%), Positives = 140/331 (42%), Gaps = 29/331 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS----RNRLVVAVDQEGG 66 + VGGLI N Q E++ ++ + ++VD+EGG Sbjct: 116 TAAGDTTKTAYESRPVGGLIYMADNLLSTEQTTEMLTNMQNIAMERTGLPAFLSVDEEGG 175 Query: 67 RVQRFRE----GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 V R G T + A ++ A +AG + + + + ++ +APV D Sbjct: 176 TVARVAANEAFGVTNVGNMSDIGAAGDAQK----AYDAGVTIGTYLKQLGFNVDYAPVAD 231 Query: 123 V--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + AIG RS+ +D + ++ ++G+ G+ KHFPG G V+ DSH Sbjct: 232 VLTNPGNTAIGTRSFGSDASVVADMVTKELEGLSSQGVFGAVKHFPGQGGVSGDSHDGAV 291 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQE 238 + E+ ++ F + EN + +M H+ V D PAS S + VLR++ Sbjct: 292 TLDKSLEELMQTELVPFQKAV-ENGVSFVMVGHISVPQVVGDNTPASLSQMMVSNVLREQ 350 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LG+ G++ +D ++M + + +++ G DMIL+ + + A + Sbjct: 351 LGYQGIVITDAMNMGAITGTYTADQAAVMAVNVGVDMILMPQDYETAYN----------- 399 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + G+ + + + +S + + +L Sbjct: 400 GLLQAVQDGTITEERIDESV-ERIVKVKLQM 429 >UniRef50_A8LWX2 Glycoside hydrolase family 3 domain protein n=6 Tax=Actinomycetales RepID=A8LWX2_SALAI Length = 499 Score = 303 bits (776), Expect = 9e-81, Method: Composition-based stats. Identities = 85/337 (25%), Positives = 145/337 (43%), Gaps = 16/337 (4%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGG 66 G R L +G ++LF RN D Q+ L +RA R ++VA+D+E G Sbjct: 19 GFVGTTAPTWVRRWLGDG-LGAVVLFARNVVDSDQVAALTATLRAE-RPDVIVAIDEEAG 76 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVG- 124 V R G + P + + L + + +E+ A+ I +++AP DV Sbjct: 77 DVTRIESGLGSSRPGNLALGVVDDP----ALTEAVASDLGAELAALGITLNYAPDADVNS 132 Query: 125 -HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + IG RS+ ADP + + ++ G+ +G+ KHFPGHG DSH + P Sbjct: 133 DPDNPVIGVRSFGADPARVARHTAAWVRGLQASGVAACAKHFPGHGDTRIDSHHDLPRIV 192 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFD 242 + + A +++ F + + + A+M H++ +DP PAS S L +LR ELGF Sbjct: 193 GDRTRLDAVELAPFRAALS-AGVQAVMSGHLLVPVLDPDLPASLSRRILTGLLRDELGFA 251 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 GV+ +D + M A +A +L AG D I V R + + + Sbjct: 252 GVVVTDAVEMRAVADRYGFAGAAVRALAAGADAICVGGERADEDAA-----QQLRDAIVA 306 Query: 303 LYHKGSFSRQELMDSARWKAISTRLNQLHERWQEEKA 339 G + L+++A+ ++ + + Sbjct: 307 AVVAGELPEERLVEAAKRVSLLASWTAASRGARPARQ 343 >UniRef50_A0LL99 Beta-N-acetylhexosaminidase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LL99_SYNFM Length = 337 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 97/326 (29%), Positives = 149/326 (45%), Gaps = 15/326 (4%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASR----NRLVV 59 +++ G + + ++ +GG++LF RN P QLR L+ ++++ +R L V Sbjct: 13 LLVGFRGTTFQEQLKSLIDEFGIGGIVLFRRNIQTPEQLRSLLEEMQSHARQVLGRSLWV 72 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A+DQEGG VQR FT+LP+A A +G + E A ++ M I I+ AP Sbjct: 73 AIDQEGGPVQRLVPPFTQLPSACDLAQ-----QGIEAVAEWSSKAAMDLRRMGIHINLAP 127 Query: 120 VLD--VGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLD V S + R DP + R+I + AG+ T KHFPG G D H Sbjct: 128 VLDLRVNANSHFMEGRCLGDDPLTVAELGCRWIKTLQGAGVSATAKHFPGLGLAELDPHH 187 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 P P E +D+ F I E + +M +H +Y +D PA+ SP +LR Sbjct: 188 FAPVIRWPDQEAMQRDLLPFRKAI-EAGVHCVMTSHALYPFIDSVWPATLSPAINNDLLR 246 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 LGF G + SDD+ M + S+ E + L A D L+C + + L Sbjct: 247 GTLGFRGTLLSDDMDMAAVSEKYSWKEMAEQGLLATIDFFLLCQRTENIEQLQGALCAAI 306 Query: 297 AE--RVTRLYHKGSFSRQELMDSARW 320 A R+ ++ + + + L D R Sbjct: 307 AGSSRIEAMHRESAKRIEWLYDRHRM 332 >UniRef50_D2AXZ5 Beta-N-acetylhexosaminidase n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2AXZ5_STRRD Length = 504 Score = 301 bits (773), Expect = 2e-80, Method: Composition-based stats. Identities = 93/324 (28%), Positives = 148/324 (45%), Gaps = 17/324 (5%) Query: 7 DVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGG 66 G E+ R L +GG+ L++RN D AQ+R L +RA + +V+A D+E G Sbjct: 25 GFVGVEVPDWLRRRLGEG-LGGVALYSRNLVDSAQVRALTAALRAEN-PDVVIATDEEAG 82 Query: 67 RVQRFREGF-TRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGH 125 V R + P + A+ E L + + +M I +++APV+DV Sbjct: 83 DVTRLEALTGSSRPGNLALGAVDDPE----LTELVARGVGRDMADAGITLNYAPVVDVNS 138 Query: 126 I--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETPCDP 183 + +G R++ ADP A ++ G+ AG+ KHFPGHGA DSH + P Sbjct: 139 DLDNPVVGTRAFGADPWLVARHARAWVTGLQSAGVAACAKHFPGHGATAVDSHNDLPTVS 198 Query: 184 RPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLRQELGFD 242 E+ A + + + I E + A+M H++ DP PA+ S L +LR+ELGFD Sbjct: 199 YSAEELAATTLPPYRTAI-EAGVRAVMTGHLLVPAYDPDFPATMSSRILVGLLREELGFD 257 Query: 243 GVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTR 302 G+I +D + M A +L AG D + V RKGA + + Sbjct: 258 GLIVTDGVEMPSVAERYGVTGAAVRALAAGADAVCVGGERKGA-----GVVGFMRNAIMA 312 Query: 303 LYHKGSFSRQELMDSA-RWKAIST 325 G+ + L ++A R + ++ Sbjct: 313 AVIDGTLPEERLAEAASRVRRLAA 336 >UniRef50_B2UPP0 Beta-N-acetylhexosaminidase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UPP0_AKKM8 Length = 353 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 93/286 (32%), Positives = 144/286 (50%), Gaps = 10/286 (3%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 M P ++ + G+E+ AEE + G ILF+RN Q+R L +R + V+A Sbjct: 1 MLPALIGISGHEVGAEEEAAIRRLQPAGFILFSRNIDSVEQVRGLTESLRKLCLHHPVIA 60 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VDQEGGRV R LP+ S A L + +L G + A + + ++++FAPV Sbjct: 61 VDQEGGRVVRTASLGLNLPSPASLARLGSVGGIVEL----GAVTALALRYLGVNLNFAPV 116 Query: 121 LDVGHISAAIG---ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 LD+ H +A R + + Q ++ + + G+++ GKHFPG G AD H Sbjct: 117 LDICHDPSAANALPGRCWGDNAQDVISRGGVYASNLRRGGVQSCGKHFPGMGRALADPHF 176 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 P + E+ D+ F +L L +IM AH++ +DP PA+ S ++ +LR Sbjct: 177 SLPVIGLDERELFKTDLLPFLALCPA--LSSIMSAHIMLPQIDPDYPATLSERVIRGLLR 234 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNR 282 LGF GV+F+DDL M S + SL AGCD+ L+C++ Sbjct: 235 DRLGFRGVVFTDDLCMGAITTQYSPDDAAFLSLKAGCDLPLICHDP 280 >UniRef50_A3EUK5 Beta-N-acetylhexosaminidase n=2 Tax=Leptospirillum sp. Group II RepID=A3EUK5_9BACT Length = 342 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 93/340 (27%), Positives = 154/340 (45%), Gaps = 18/340 (5%) Query: 2 GP-VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS-RNRLVV 59 G + + + G L AE+ L GG++LF N +++L+ +IR A + + Sbjct: 12 GQWLWVSLPGARLGAEDIRWLHQIRPGGVVLFAENGTRAEDVKKLISEIREALFPQDVWI 71 Query: 60 AVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 A+DQEGGRV R +EG RL A++ + + + + IDI FAP Sbjct: 72 AIDQEGGRVARLKEGVPRLSPARTLG-----NGDPSEIFRSAFHLGRALRQTGIDIDFAP 126 Query: 120 VLDV--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHK 177 VLDV + IG+RS+ +P + ++ F G+ G+ GKH PGHG DSH Sbjct: 127 VLDVDSNPDNPVIGDRSFSREPWEVARLSMAFSHGLEAGGVIPCGKHVPGHGDTLLDSHL 186 Query: 178 ETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKTVLR 236 P P+A + ++ F I E + +M AHV+Y +D + PA+ S ++ +LR Sbjct: 187 ALPTVDAPEAILSGREFLPFEKAIAEG-MPMLMTAHVMYPALDKKWPATLSRKIMQDMLR 245 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 ++ F G++ SDDLSM ++ + +L CD +LV +R + + + + Sbjct: 246 KKWDFRGLLLSDDLSMAAVGTPFPFSVVVERALSVTCDGLLVLKSRDKPLEAIRIIKGLS 305 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHERWQE 336 R L S +A+ + + E Sbjct: 306 E-------KNPGLWRARLEKSIARRAVLGKRIAFCRKAAE 338 >UniRef50_Q982A1 Glycosyl hyrolase, family 3 n=2 Tax=Mesorhizobium RepID=Q982A1_RHILO Length = 394 Score = 300 bits (769), Expect = 5e-80, Method: Composition-based stats. Identities = 92/316 (29%), Positives = 156/316 (49%), Gaps = 21/316 (6%) Query: 1 MGPV-MLDVEGYELDAE-EREILAHPLVG---GLILFTRNYHDPAQLRELVRQIRAASRN 55 +G + ++ G D + + ++ G G+I RN +R+L + ++ + Sbjct: 50 IGQMVLVGFVGDSPDDDGYKRVVKQAEAGEITGVIYLGRNISSLEAVRQLNKGLQQYATA 109 Query: 56 RLVVAVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L+VA+DQEGGR+QR GF +P+ + A +E +AS++ ++ Sbjct: 110 PLLVAIDQEGGRIQRLTGEVGFREVPSEATVAKTMSPDEASV----VYQDLASDLSSLGF 165 Query: 114 DISFAPVLD--VGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHG 169 +++ APV+D V + IG RS+ ADPQK A A FI+ G+ T KHFPGHG Sbjct: 166 NLNLAPVVDLNVNPANPIIGKLGRSFSADPQKVEAYARAFIEAHRAKGVLTALKHFPGHG 225 Query: 170 AVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD---PRPASG 226 + T DSHK + K++ + LI +D +M AHVI + + PAS Sbjct: 226 SSTKDSHKGIADVTKT---WSDKELLPYKGLIASGDVDMVMAAHVINAKLAFSPDIPASL 282 Query: 227 SPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAV 286 S L T+LR + F GV+ SDD+ M+ S+ + ++ AG D+++ N++ Sbjct: 283 SRATLTTLLRDNMHFKGVVISDDMQMQAITDNVSFEDSVLRAVMAGNDILIFANDKHPDP 342 Query: 287 SVLDNLSPIKAERVTR 302 ++ + ++ I +E R Sbjct: 343 AIPEKVAAILSEEAQR 358 >UniRef50_Q6BNW2 DEHA2E18546p n=3 Tax=Saccharomycetaceae RepID=Q6BNW2_DEBHA Length = 1044 Score = 300 bits (769), Expect = 5e-80, Method: Composition-based stats. Identities = 82/348 (23%), Positives = 150/348 (43%), Gaps = 28/348 (8%) Query: 2 GPVMLD-VEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAAS------R 54 G ++ +G + A+ ++ V +IL +N Q+ +L+R ++ + + Sbjct: 12 GQLLCGGFQGTNVTAQAYHLIVQHHVSAMILSKKNAVSVKQMTKLIRDLQYIAYTQANYK 71 Query: 55 NRLVVAVDQEGGRVQRFREG--FTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 L+ A+D+EGG + + + P A + AA + L + +A E+ + Sbjct: 72 YPLLFAIDEEGGMMNSLFDPNFLNQFPGAMAMAATGDAD----LVYKISKAIAIELKHIG 127 Query: 113 IDISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGA 170 I PVLDV +G RS+ + G+ + G+ T GKHFPG G Sbjct: 128 FSIILGPVLDVVTKLSHQLVGVRSFGTTVEDVTKYGKACARGLKDGGLITVGKHFPGIGN 187 Query: 171 VTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSP 228 + DS E P +I+ + F+ LI+E LD I A + D A SP Sbjct: 188 ASVDSLLELPMMADSLDQIKHFNSVPFAELIKEKLLDGISAAGCGVPTISPDETHACLSP 247 Query: 229 YWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSV 288 + +LRQ+LGFDG++ S+ L M+ + +L AGCD+++VC+++ Sbjct: 248 IVINQLLRQDLGFDGMVISECLEMDALYHSIGLGQGVILALYAGCDLVMVCHDKTLQDEA 307 Query: 289 LDNLSPIKAERVTRLYHKGSFSRQELMDSARWKA--ISTRLNQLHERW 334 + E + + G+ + ++ S + + +L +E + Sbjct: 308 V--------ESIDKALANGNLDDE-IISSCLRRIEKLQKKLPSWNELF 346 >UniRef50_C0D0H0 Putative uncharacterized protein n=1 Tax=Clostridium asparagiforme DSM 15981 RepID=C0D0H0_9CLOT Length = 447 Score = 300 bits (769), Expect = 5e-80, Method: Composition-based stats. Identities = 90/329 (27%), Positives = 147/329 (44%), Gaps = 29/329 (8%) Query: 11 YELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQI----RAASRNRLVVAVDQEGG 66 + R+ L VGGLI F N P Q+R+L R L+++VD+EGG Sbjct: 131 TQAGDSTRKALESRPVGGLIYFRNNLQTPEQVRDLTENTARIGREVCSLPLILSVDEEGG 190 Query: 67 RVQRFRE----GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLD 122 +V R A + + A E G + S + ++ FAP D Sbjct: 191 KVTRIGGRAGFDVPVFSNMWEIGAQNDPGK----AYEVGSAIGSYLSGYGFNLDFAPDAD 246 Query: 123 V--GHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 V + IG RS+ +DP+ ++AS+ G+ + G++ KHFPGHGA D+H Sbjct: 247 VLTNPDNQVIGNRSFGSDPELVWSMASQVAKGLKDQGIQPVYKHFPGHGATLGDTHDGFS 306 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPYWLKTVLRQE 238 + ++ ++ F++ ++ + IM AH+ V D PAS S + VLR++ Sbjct: 307 YTEKTLDQLLEAELVPFANAVKNGE-SCIMAAHIAAPGVTGDETPASLSHALITGVLREK 365 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 LGFDGV+ +D L+M + S E A+++AG D++L+ + A Sbjct: 366 LGFDGVVVTDALNMGAIQNLYSSGEAAVAAVEAGADLLLMPADFNAAYEA---------- 415 Query: 299 RVTRLYHKGSFSRQELMDSARWKAISTRL 327 V G+ +R+ L S R + +L Sbjct: 416 -VLAAVEDGTITRERLDLSVR-RIGRMKL 442 >UniRef50_C2MBZ7 Beta-hexosaminidase n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2MBZ7_9PORP Length = 436 Score = 300 bits (768), Expect = 7e-80, Method: Composition-based stats. Identities = 89/338 (26%), Positives = 142/338 (42%), Gaps = 31/338 (9%) Query: 1 MGPVMLDVEGYELDAEERE----ILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNR 56 +G +++ + D+ R+ ++ GG ILF + +P L ++ SR Sbjct: 82 IGQLIIPIWEPRYDSASRDRYLRLIDQIHPGG-ILFRK--GEPYDQYRLTHLLQERSRIP 138 Query: 57 LVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDIS 116 L++ D E G R +G R P + A EE G MA + M I +S Sbjct: 139 LLITADAEWGLAMRL-QGTIRYPRMKLLADHCTPEE----MYTLGQHMAEQCRVMGIHVS 193 Query: 117 FAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 FAPVLDV + + IG RS A P ++ G+ + G+ + KHFPGHG D Sbjct: 194 FAPVLDVNNNPKNPVIGTRSLGATPDIVISHGLALARGLEDGGVLSVAKHFPGHGNTDKD 253 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR---PASGSPYWL 231 SHK P E+ +++ F + I E L IM AH+ ++P P S S + Sbjct: 254 SHKTLPTVSGSLEELERVELAPFRAYI-EAGLGGIMVAHLSVPALEPDVTCPTSLSHAVV 312 Query: 232 KTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDN 291 +LR++LGF G+IF+D L M G A++ AG + + Sbjct: 313 TGLLREQLGFKGLIFTDGLEMRGVQRAEGLP-ISVAAILAGN-----------DLLLGPL 360 Query: 292 LSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQ 329 + + Y G+ S + + D + + +L Sbjct: 361 DPLASYKELLSAYRAGTLSAETIQDRC-LRVLCYKLLL 397 >UniRef50_UPI0001B588EB hydrolase n=1 Tax=Streptomyces sp. SPB78 RepID=UPI0001B588EB Length = 518 Score = 299 bits (767), Expect = 1e-79, Method: Composition-based stats. Identities = 85/315 (26%), Positives = 133/315 (42%), Gaps = 18/315 (5%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 +G + G + + ++LF+RN P Q+ L ++RA + LV+A Sbjct: 12 LGVLQPGFVGTTAPEWLLAAIRDG-LASVVLFSRNIETPEQVAALTTRLRAENPE-LVIA 69 Query: 61 VDQEGGRVQRF--REGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +D+E G V R R G TR P + L + + A + + +A Sbjct: 70 IDEESGDVTRIESRTGSTR-PGNHALGYLDDT----ATTHAVAQDLGHTLRAAGVSLDYA 124 Query: 119 PVLDVG--HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P DV + IG RS+ ADP S +I G+ AG+ KHFPGHG V DSH Sbjct: 125 PSADVNSNPDNPIIGVRSFGADPDSVARHTSAWIRGLQSAGVAACAKHFPGHGDVAVDSH 184 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP-RPASGSPYWLKTVL 235 P +I + + F + I M AH++ +DP RPA+ S L T+L Sbjct: 185 HGLPVYEASDRDIARQALPPFRAAIAAGARAI-MSAHLLVPALDPHRPATLSRRILTTLL 243 Query: 236 RQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPI 295 R+ELG++G+I +D + M A ++ AG D I V + + ++ Sbjct: 244 REELGYEGLIVTDAIEMGAVADEYGIEGAAVRAITAGADAICVGGE-----NASEGITHS 298 Query: 296 KAERVTRLYHKGSFS 310 A+ + G Sbjct: 299 LADALVEAVLGGDLP 313 >UniRef50_D2L090 Glycoside hydrolase family 3 domain protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L090_9DELT Length = 569 Score = 298 bits (763), Expect = 3e-79, Method: Composition-based stats. Identities = 108/351 (30%), Positives = 149/351 (42%), Gaps = 22/351 (6%) Query: 2 GPV-MLDVEGYELDAEEREILAHPLVGGLILFT--RNYHDPAQLRELVRQIRAAS----- 53 G V M+ EG ++ E ++ +GG+IL+ N PAQ+ L ++A + Sbjct: 45 GQVFMVWFEGPDVSPETAALIRDRRLGGVILYANAGNIETPAQVARLTAALQATAADTPQ 104 Query: 54 RNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDI 113 L+V VDQEGG V R R GFT P+ + AA + L ++A E++A+ I Sbjct: 105 GLGLLVGVDQEGGPVARLRRGFTVFPSQMAQAAAGRAD----LVRQAAAATGRELLAVGI 160 Query: 114 DISFAPVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAV 171 + FAPV DV + IG RS+ A+P + G G T KHFPGHG Sbjct: 161 NTDFAPVADVNVNPANPVIGIRSFGANPADVARFTAAATAGYASVGAICTPKHFPGHGDT 220 Query: 172 TADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDV--DPRPASGSPY 229 DSH P A + D A+M AHV+ D+ D PA+ S Sbjct: 221 AVDSHVGLPRVDHDAATLSRVDF-PPFRAAFAAGAPAVMTAHVLTPDLEPDNLPATLSRR 279 Query: 230 WLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVL 289 L VLR LGFDGVIF+D L M A AE +L AG D++LV + Sbjct: 280 MLSGVLRGRLGFDGVIFTDSLGMGAVAATYGTAEAAVMALAAGADVLLVGADAGRP---- 335 Query: 290 DNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEKA 339 + +RV G L + R + R L + A Sbjct: 336 ASERLEAMDRVLLAVRTGRVPVNRLDAAVLRVLRLKERFGLLRAAALDTPA 386 >UniRef50_C7TNJ8 Beta-N-acetylhexosaminidase (GH3) n=3 Tax=Lactobacillus RepID=C7TNJ8_LACRL Length = 641 Score = 297 bits (762), Expect = 3e-79, Method: Composition-based stats. Identities = 86/361 (23%), Positives = 154/361 (42%), Gaps = 39/361 (10%) Query: 1 MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPA---------QLRELVRQIRA 51 +G + + + + R +A +GGLI++ ++ + V+ + Sbjct: 56 IGQMFVS-RTLQDTDKARSDIAKYNLGGLIVYGADFTSVEGKTATEAQNNFKMKVQSFQN 114 Query: 52 ASRNRLVVAVDQEGGRVQRFREGFTR-----LPAAQSFAALSGMEEGGKLAQEAGWLMAS 106 ++ L++ VDQEGG V R + P+ Q+ A GM K A E G + Sbjct: 115 SASLPLLIGVDQEGGTVSRLSQNPLIANGRSFPSPQTAYANGGMANVTKEASEVGTI--- 171 Query: 107 EMIAMDIDISFAPVLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHF 165 + + I+ ++APV D S+ I R++ D + I AG+ T KHF Sbjct: 172 -LKNLGINWNYAPVADSTPDTSSFIYGRTFGQDYLATANYITNVIPAWQNAGVAATLKHF 230 Query: 166 PGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PA 224 PG+G+ D+H + + +A+ +D+ F S I D++M AH++ VDP PA Sbjct: 231 PGYGSAI-DTHTDFAVVTKSKADFEKEDLLPFKSGIT-AGADSVMIAHIVMQAVDPVYPA 288 Query: 225 SGSPYWLKTVLRQELGFDGVIFSDDLSMEGA---AIMGSYAERGQASLDAGCDMILVCNN 281 S S + +LR ELG++G+I +D L M A +++AG D I+ N+ Sbjct: 289 SLSRKVVTDLLRNELGYNGLIITDALEMGAIKQFAQEHDQVPVDVLAVEAGNDCIMN-ND 347 Query: 282 RKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSA-RWKAISTRLNQLHERWQEEKAG 340 + A+ ++ G+ E+ + R + +L L + ++K Sbjct: 348 YETAIP-----------QIHAAVTSGTIKESEINEHVFRILDLKRKLGLLTKAQLQQKKV 396 Query: 341 H 341 Sbjct: 397 Q 397 >UniRef50_B3DX93 Beta-glucosidase-related glycosidase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DX93_METI4 Length = 373 Score = 297 bits (762), Expect = 4e-79, Method: Composition-based stats. Identities = 85/316 (26%), Positives = 146/316 (46%), Gaps = 12/316 (3%) Query: 4 VMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQ 63 +++ + G L ++ + + G I F+RN +P R L+ Q+R+ + + A+DQ Sbjct: 18 LIIGLPGAHLSQDQIQTIREIQPSGFIFFSRNLENPLSFRNLLEQLRSLVEHEPIFAIDQ 77 Query: 64 EGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDV 123 EGGRV R + + P A+ + ++ L +E G L + ++ APVLD+ Sbjct: 78 EGGRVSRLKAFGSEPPGAKELGEKADLD----LLREHGELTGKLLRLFGLNFDLAPVLDM 133 Query: 124 G---HISAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKETP 180 ++ RS+ DPQ A FI+G+ G+ GKHFPG+ D H E P Sbjct: 134 DTSEREHNSLKGRSFAPDPQDVSRFAKAFIEGLKSQGILCCGKHFPGYSFAQCDPHLELP 193 Query: 181 CDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLRQE 238 + + E+ + + F +L +N D+ M AH+ ++DP+ PA+ S + +++RQE Sbjct: 194 QSFKSKKELESFEWIPFRAL--QNSCDSFMIAHIRNFNLDPQGLPATLSSRMI-SIIRQE 250 Query: 239 LGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAE 298 F ++ SDD+ M S + + SL AG D++L+C+ LS Sbjct: 251 WQFTQLLVSDDIDMGAIINHYSLKDTLELSLKAGIDILLLCHRFHLVREAAAILSSFPES 310 Query: 299 RVTRLYHKGSFSRQEL 314 + R L Sbjct: 311 IKEEAAQRIENFRNRL 326 >UniRef50_A7ZD64 Beta-hexosaminidase A (N-acetyl-beta-glucosaminidase) (Beta-N-acetylhexosaminidase) (Chitobiase) n=2 Tax=Campylobacter RepID=A7ZD64_CAMC1 Length = 351 Score = 296 bits (760), Expect = 5e-79, Method: Composition-based stats. Identities = 88/335 (26%), Positives = 156/335 (46%), Gaps = 26/335 (7%) Query: 4 VMLDVEGYEL-DAEEREILAHP---LVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVV 59 +M+ G DA R +L+ GG++L RN AQL+ ++ I+ S ++ + Sbjct: 32 IMVGFNGASTKDAAFRAMLSDAGYERFGGVMLLGRNVTSKAQLKASIKAIKEKS-PKIFI 90 Query: 60 AVDQEGGRVQRFRE--GFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISF 117 A+D+EGG V R ++ P+A A+ ++ A + MA + I+++F Sbjct: 91 AIDEEGGNVSRMKDKSFDGPYPSAYEVASTLDIKS----AYDLYSKMAINLKECGINLNF 146 Query: 118 APVLDVGHISAAIGE---RSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTAD 174 APV+D+ ++ I R++ K + A F+D E G+ TT KHFPGHG+ D Sbjct: 147 APVVDLHDENSPIIAAKQRAFSEYASKVVIYADAFMDAFKEQGILTTLKHFPGHGSSKED 206 Query: 175 SHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGSPYWLKT 233 SHK + + + + + I + IM H+ +D PA+ S + Sbjct: 207 SHK-----NKSEVTLSKDALLPYKDAISTGRAQIIMVGHLFVKGIDDDNPATLSKKIITD 261 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLS 293 +LR EL F+GV+ SDD+ M+G A++ ++AG D++L + D ++ Sbjct: 262 LLRNELKFNGVVISDDMLMKGVGDEA-LAQKVVKFINAGGDILLFSEFKINNQRTADLIT 320 Query: 294 PIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRL 327 I + ++ S++ + S R A+ +L Sbjct: 321 QI----IIDAVNEKKISKERIDASYKRIMALKAKL 351 >UniRef50_Q1IQ59 Beta-N-acetylhexosaminidase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IQ59_ACIBL Length = 372 Score = 296 bits (760), Expect = 6e-79, Method: Composition-based stats. Identities = 95/332 (28%), Positives = 150/332 (45%), Gaps = 11/332 (3%) Query: 2 GPVM-LDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVA 60 G +M + +G E+ A R +LA G I F RN Q +L + +AA L Sbjct: 14 GQLMIVGFDGTEMSARVRTLLATIQPAGTIFFKRNVATAEQTWKLNYEAQAAVSTPLFRC 73 Query: 61 VDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPV 120 VD EGG V R R+ P+ + AA K+ + +A+E A+ + FAPV Sbjct: 74 VDLEGGTVDRLRDAVAPAPSLSNVAATGS----KKVMRRFARTLAAEARALGFNTDFAPV 129 Query: 121 LDVGHISAA--IGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 D+ + + + R+ ADP+ + +A F+ G + + GKHFPG GA DSH E Sbjct: 130 FDLRTVESVKVLAGRTIAADPKHIIELAREFLKGFKDENVLGCGKHFPGLGAGAVDSHYE 189 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR--PASGSPYWLKTVLR 236 P +P + +D+ + L E + M AH +Y + PAS S +W+ +LR Sbjct: 190 LPTISKPWKALWEEDLLPYRKLKDE--IAFAMVAHCVYPNATKEKAPASISRFWMTDILR 247 Query: 237 QELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIK 296 +++GF +I SDD+ M+G S E ++ G D+ LVCNN + Sbjct: 248 KKIGFKHLICSDDMEMKGVQKAVSIEEACIQAVRGGADLFLVCNNESLVWRCFHAVLREA 307 Query: 297 AERVTRLYHKGSFSRQELMDSARWKAISTRLN 328 + + SR+ R +A+ + N Sbjct: 308 ERDKSFAKQIAAASRRVFEFKKRSRAVRAKFN 339 >UniRef50_A0Q7X6 Glycosyl hydrolase family 3 n=12 Tax=Francisella RepID=A0Q7X6_FRATN Length = 378 Score = 296 bits (760), Expect = 6e-79, Method: Composition-based stats. Identities = 88/353 (24%), Positives = 156/353 (44%), Gaps = 39/353 (11%) Query: 1 MGPV-MLDVEGYELDA--EEREILAHPLVGGLILFTR----NYHDPAQLRELVRQIRAAS 53 +G + ++ G +D E + + +GG+ILF R N P QL+ L ++ + Sbjct: 29 IGQMLIVGFSGQAIDEKSEIVQHIKDNNIGGVILFDRKDKDNIKSPEQLKTLNSDLQKYT 88 Query: 54 RN----------RLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWL 103 + L++A+D EGG+V R + T+ ++F A E + Sbjct: 89 KIYNDLYMRPQVPLIIAIDYEGGKVNRLK---TKYGFPETFTAKKFSELDIQEQAHVARE 145 Query: 104 MASEMIAMDIDISFAPVLDVGHIS-AAIG--ERSYHADPQKALAIASRFIDGMHEAGMKT 160 MA M + I+++ AP +D+ + + IG ERS+ D + A+ F + + + Sbjct: 146 MAKTMYKVGINLNLAPCVDIHNDNCPVIGKYERSFSTDVNTIVDCANLFCKATEKHKIGS 205 Query: 161 TGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVD 220 KHFPGHG+ T DSH K++ + L+ ENK +M AHV+ ++D Sbjct: 206 VLKHFPGHGSSTNDSHLGAADIT---NTWTEKELIPYIRLLPENKHCMVMTAHVMNRNID 262 Query: 221 PR-PASGSPYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVC 279 PA+ S +K +LR ++ FDGVI SD + M+ E + +++AG ++ + Sbjct: 263 ETNPATLSNKIIK-ILRTDIKFDGVIISDAMEMKAVTEYYGELESLKLAINAGVNIFIFS 321 Query: 280 NNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARWKAISTRLNQLHE 332 D S + + +L G + + S ++ + N L E Sbjct: 322 ----------DANSDTIIDNIAKLVESGEVAEATIKQS-YENIVAYKQNYLTE 363 >UniRef50_C7HX27 Beta-N-acetylhexosaminidase n=1 Tax=Thiomonas intermedia K12 RepID=C7HX27_THIIN Length = 361 Score = 296 bits (759), Expect = 7e-79, Method: Composition-based stats. Identities = 129/329 (39%), Positives = 178/329 (54%), Gaps = 11/329 (3%) Query: 3 PVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVD 62 P++LDV G +L A +R+ L +PLVGG++LF RN+ QL L QI+A R L++AVD Sbjct: 8 PLILDVAGLQLGAADRKRLRNPLVGGVVLFARNWESRGQLLSLCAQIKAV-RPDLLIAVD 66 Query: 63 QEGGRVQRFR-EGFTRLPAAQSFAALS--GMEEGGKLAQEAGWLMASEMIAMDIDISFAP 119 EGGRVQRFR +GFT LPA + A + A G+++ASE+ A +D SF P Sbjct: 67 HEGGRVQRFRTDGFTALPAMRELGAHWMADAPGAMRRAAACGFVLASELRACGVDFSFTP 126 Query: 120 VLDVGHI-SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSHKE 178 VLD+ S+ IG+R++HAD + +A + G+ AGM KHFPGHG V AD+H Sbjct: 127 VLDLDWGGSSVIGDRAFHADARVVTVLAQSLMLGLARAGMVHCAKHFPGHGYVRADTHLA 186 Query: 179 TPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQE 238 D R I + D + + L L A+MPAHVIY +VD PA S WL+ VLR+ Sbjct: 187 VARDARTLDAILSADAAPYQWL--APGLRAVMPAHVIYPEVDALPAGFSSVWLRDVLRER 244 Query: 239 LGFDGVIFSDDLSMEGAAIMG-SYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKA 297 +GF G I SDDL M A G S ++ L AGCD +LVCN + Sbjct: 245 IGFTGAILSDDLGMAAARAEGQSLSQAALQGLQAGCDAMLVCNQSVVDNG---RPLDVLL 301 Query: 298 ERVTRLYHKGSFSRQELMDSARWKAISTR 326 + + +G ++ R + T+ Sbjct: 302 RELDQALERGQWTPDVHSAQRRLDLLPTQ 330 >UniRef50_A3HZ06 Possible beta-N-acetylglucosaminidase/beta-lactamase fusion protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ06_9SPHI Length = 520 Score = 295 bits (757), Expect = 1e-78, Method: Composition-based stats. Identities = 83/347 (23%), Positives = 142/347 (40%), Gaps = 43/347 (12%) Query: 15 AEEREILAHPLVGGLILF------TRNYH----------DPAQLRELVRQIRAASRNRLV 58 E++ VGGL F N+ +L EL+ + + AS+ L+ Sbjct: 29 QSLEELIRLEEVGGLTFFHSRISAAANFEKRQEVLDVEGTLEKLVELINRFQQASKTPLI 88 Query: 59 VAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFA 118 +++D E G R E P A S +S EG K +E + M EM + I ++ A Sbjct: 89 ISIDGEFGLAMRI-ENTPIYPYAISLGGIS--LEGEKQVEEVAYKMGLEMKTVGIHLNLA 145 Query: 119 PVLDVGHI--SAAIGERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGHGAVTADSH 176 P DV + IG RSY +D +A G++ AG+ KHFPGHG DSH Sbjct: 146 PCADVNTNPLNPVIGYRSYGSDTNHVARLALAAYKGLNRAGIGACLKHFPGHGDTQVDSH 205 Query: 177 KETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDP---RPASGSPYWLKT 233 P + + E+ +++ F I ++ ++ IM H+ + AS S + Sbjct: 206 LNLPVVNKSKKELVEEELYPFQYGI-DHDVEMIMVGHLAVPALSNGAEISASISREIITD 264 Query: 234 VLRQELGFDGVIFSDDLSMEGAAIMGS-YAERGQASLDAGCDMILVCNNRKGAVSVLDNL 292 +L++E+GF G++ SD L+M+ + + + + AG D++ N K + + Sbjct: 265 LLKKEMGFQGIVISDALNMKAVSRLFDTPGQLEWEAFHAGNDILCFSENVKEGIDFILE- 323 Query: 293 SPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRLNQLHERWQEEK 338 + E+ S + + + +L R E Sbjct: 324 ---------------KATEAEINASFEKLQDLKEKLGVFERRPAEVP 355 >UniRef50_A8ETM1 Glycosyl hydrolase n=2 Tax=Campylobacterales RepID=A8ETM1_ARCB4 Length = 364 Score = 295 bits (757), Expect = 1e-78, Method: Composition-based stats. Identities = 86/341 (25%), Positives = 164/341 (48%), Gaps = 29/341 (8%) Query: 4 VMLDVEGYELD--AEEREILAHPLVGGLILFTR---------NYHDPAQLRELVRQIRAA 52 V+L G +D E + + +GG+ILF + N D QL+ L Q++A Sbjct: 36 VILGFNGTSVDKNDEIYKNIQSG-LGGVILFDKDPNDKTKIKNIKDKNQLKNLNTQLQAI 94 Query: 53 SRNRLVVAVDQEGGRVQRFREGFTRLPAAQSFAALSGMEEGGKLAQEAGWLMASEMIAMD 112 S +L++++DQEGG VQR ++ + + +A G A+++ +A ++ + Sbjct: 95 SNQKLLISIDQEGGVVQRLKKDSGFV---DTLSAKDIATNGEDFAKQSYKALAKDLKDVG 151 Query: 113 IDISFAPVLD--VGHISAAIG--ERSYHADPQKALAIASRFIDGMHEAGMKTTGKHFPGH 168 I++ FAPV+D + + I RS+ + + +S F+D + + + + KHFPGH Sbjct: 152 INLDFAPVVDLSINKNNKVIVTRGRSFGESSSEVIKYSSIFVDELRKQNVISVLKHFPGH 211 Query: 169 GAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPR-PASGS 227 G+ ADSH + K++ + I+ N +D IM AHV ++D PA+ S Sbjct: 212 GSSLADSHLGFVDITKT---WNQKELEPYKYFIKNNDVDMIMTAHVFNENLDKNYPATLS 268 Query: 228 PYWLKTVLRQELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVS 287 +LR++LG+ GV+ +DDL M + + ++++G +M+L N V+ Sbjct: 269 YNINTKLLREDLGYKGVLVTDDLQMSAISKHYDLKQTLTLAINSGVNMLLFANQLAKPVT 328 Query: 288 VLDNLSPIKAERVTRLYHKGSFSRQELMDS-ARWKAISTRL 327 + + + + + S Q+++DS + + ++ Sbjct: 329 LKEIVDTVYLQ-----IQNKQISLQKIIDSNNKIDKLLKKI 364 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.181 0.577 Lambda K H 0.267 0.0556 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,598,905,644 Number of Sequences: 3077464 Number of extensions: 145253721 Number of successful extensions: 398298 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 2373 Number of HSP's successfully gapped in prelim test: 552 Number of HSP's that attempted gapping in prelim test: 383186 Number of HSP's gapped (non-prelim): 3242 length of query: 341 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 212 effective length of database: 643,403,500 effective search space: 136401542000 effective search space used: 136401542000 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 94 (40.4 bits)